Try SpeechPulse with our 30-day free trial version.
- Windows 10 64-bit (version 1809 or later) or Windows 11 64-bit
How to add language models
SpeechPulse installer only includes the "English (small)" language model. For better accuracy and multi-language support, you can download additional language models listed below.
These models run only on the CPU.
On a CPU-only machine, these models run faster than CPU/GPU models. So if you want fast speech recognition on a PC without an Nvidia GPU, you should download one of these models.
- English (tiny)
- English (base)
- English (small)
- English (medium)
- Multi (tiny)
- Multi (base)
- Multi (small)
- Multi (medium)
- Multi (large) - Part 1
Multi (large) - Part 2
You need to place the downloaded models into the "models" directory inside the SpeechPulse installation directory (e.g.: "C:\Program Files\SpeechPulse\models"). Next time you run SpeechPulse, it will automatically detect the new models.
CPU and GPU models
These models can run on both the CPU and the GPU. They give the fastest results with CUDA-enabled Nvidia GPUs.
On CPU, they consume less RAM compared to CPU-only models. However, these models are slightly slower on CPU compared to CPU-only models.
You also need one of these models to generate subtitles with SpeechPulse.
- English (tiny) L
- English (base) L
- English (small) L
- English (medium) L
- Multi (tiny) L
- Multi (base) L
- Multi (small) L
- Multi (medium) L
- Multi (large) L
You need to unzip the downloaded models into the “models” directory inside the SpeechPulse installation directory (e.g.: “C:\Program Files\SpeechPulse\models”). Next time you run SpeechPulse, it will automatically detect the new models.
* To run these models on the GPU, you need an Nvidia GPU with CUDA support. You also need to download CUDA libraries from here. Then unzip the cuda_libs.zip to the SpeechPulse installation directory (e.g.: "C:\Program Files\SpeechPulse").
You can use CPU-only models and CPU/GPU models in the same SpeechPulse installation.
How to get better accuracy
1) Try to reduce the background noise
SpeechPulse has decent accuracy even with moderate background noise. However, for the best accuracy and lowest latency, you should try to lower the background noise as much as possible.
2) Try to speak in complete sentences
Short phrases can confuse the AI language models, causing poor accuracy. So try to speak in complete sentences as much as possible.
3) Use a larger language model
Larger language models have better accuracy. They also work well under background noise. However, larger models require more RAM and have higher latencies.
∗SpeechPulse is 100% safe and does not contain any malware. If your computer displays any warnings, please report it as a "false positive" (incorrect detection) to your virus guard manufacturer.