SpeechPulse Updates

7/8/2025 - Version 10.8.8 (macOS)

A new hotkey to cycle through languages.

7/7/2025 - Version 10.8.8 (Windows)

A new hotkey to cycle through languages.

6/25/2025 - Version 10.8.7 (macOS)

Now supports the Whisper large V2 MLX model, which you can download using the built-in model downloader (SpeechPulse macOS version already supports the Whisper large V3 MLX model).

6/23/2025 - Version 10.8.7 (Windows)

Now supports the Whisper large V3 model, which you can download using the built-in model downloader.

6/15/2025 - Version 10.8.6 (macOS)

Added support for training in Portuguese, Polish, Russian, and Hungarian languages.

6/15/2025 - Version 10.8.6 (Windows)

Fixed an issue that caused poor training accuracy for the Hungarian language.

6/12/2025 - Version 10.8.5 (Windows)

Added support for training in Portuguese, Polish, Russian, and Hungarian languages.

6/9/2025 - Version 10.8.4 (macOS)

A new option to limit the set of languages shown in the language selector for easier language switching.
A new option to remove linebreaks from the output text in the Live mode. This will prevent sending incomplete messages when dictating to messaging apps.

6/7/2025 - Version 10.8.4 (Windows)

A new option to limit the set of languages shown in the language selector for easier language switching.
A new option to remove linebreaks from the output text in the Live mode. This will prevent sending incomplete messages when dictating to messaging apps.

5/27/2025 - Version 10.8.3 (macOS)

A new hotkey to cycle through AI templates.
LLM API model selection window now supports searching for model names.

5/27/2025 - Version 10.8.3 (Windows)

A new hotkey to cycle through AI templates.
LLM API model selection window now supports searching for model names.

5/24/2025 - Version 10.8.2 (macOS)

File mode now supports saving transcriptions to the source file locations.
Several improvements for better accuracy and faster transcription when using Whisper MLX models.
Voice command guide now includes a print button to save the commands list for easy reference.
German language now supports switching the punctuation mode using voice commands.
Misc. improvements.

5/21/2025 - Version 10.8.0 (Windows)

File mode now supports saving transcriptions to the source file locations.
Voice command guide now includes a print button to save the commands list for easy reference.
German language now supports switching the punctuation mode using voice commands.
Misc. improvements.

5/15/2025 - Version 10.8.1 (macOS)

Several improvements for better accuracy, faster transcription, and higher reliability when using Whisper MLX models.

5/13/2025 - Version 10.8.0 (macOS)

SpeechPulse now supports Whisper MLX models optimized for Apple silicon Macs.
Trained word detection now runs on Apple silicon GPUs, allowing for faster transcription while a training profile is active.

5/6/2025 - Version 10.7.8 (Windows)

Fixed an issue that caused SpeechPulse to crash when the transcription is empty and a training profile is active.
Misc. improvements.

5/4/2025 - Version 10.7.7 (macOS)

The training feature now allows you to alter the pronunciation of words to match your dictation.
You can now train words that contain numbers. For example, you can train the word "GQw-45mT" by adding the pronunciation "gqw forty five mt".
Several performance improvements to speed up the detection of trained words when a training profile is active.
Added support for the gpt-4o-transcribe and gpt-4o-mini-transcribe models. You can find the API settings for these models from here.
System audio mode now includes two collapsible sections (Mic and System) to display the full transcription for better readability.
Listening and processing indicators now follow the mouse cursor more smoothly.
SpeechPulse now supports manual punctuation for the German language (in addition to English).
Supported manual punctuation commands are listed in the voice command guide (Settings->Voice command guide).
Manual punctuation will be available for other languages in the future.
Reduced the CPU usage in the push-to-talk mode (when idle).
Fixed an issue that caused SpeechPulse to hang when it can't read the microphone input. Now keeps the UI alive even when the microphone is not responding.
Various UI improvements.
Misc. improvements.

4/30/2025 - Version 10.7.7 (Windows)

The training feature now allows you to alter the pronunciation of words to match your dictation.
You can now train words that contain numbers. For example, you can train the word "GQw-45mT" by adding the pronunciation "gqw forty five mt".
Misc. improvements.

4/28/2025 - Version 10.7.6 (Windows)

Fixed an issue that caused poor training accuracy.

4/28/2025 - Version 10.7.5 (Windows)

Several performance improvements to speed up the detection of trained words when a training profile is active.
Misc. improvements.

4/26/2025 - Version 10.7.4 (Windows)

Fixed an issue that caused SpeechPulse to hang when it can't read the microphone input. Now keeps the UI alive even when the microphone is not responding.
Misc. improvements.

4/23/2025 - Version 10.7.3 (Windows)

Fixed an issue that caused the listening and processing indicators to shift to an incorrect position on high-dpi screens.
Listening and processing indicators now follow the mouse cursor more smoothly.

4/22/2025 - Version 10.7.2 (Windows)

Added support for the gpt-4o-transcribe and gpt-4o-mini-transcribe models. You can find the API settings for these models from here.

4/20/2025 - Version 10.7.1 (Windows)

System audio mode now includes two collapsible sections (Mic and System) to display the full transcription for better readability.

4/18/2025 - Version 10.7.0 (Windows)

SpeechPulse now supports manual punctuation for the German language (in addition to English).
Supported manual punctuation commands are listed in the voice command guide (Settings->Voice command guide).
Manual punctuation will be available for other languages in the future.

4/16/2025 - Version 10.6.3 (Windows)

Fixed an issue that caused SpeechPulse to crash for some specific audio segments under certain (rare) conditions.

4/14/2025 - Version 10.6.2 (Windows)

Fixed an issue that caused SpeechPulse to randomly crash with some specific audio files and speech segments.
A new option to reduce the CPU usage when running on an NVIDIA GPU.

4/11/2025 - Version 10.6.0 (macOS)

System audio mode now supports text selection, copying, and finding operations while the transcription is running.
System audio mode now supports pause and resume operations.
SpeechPulse now remembers the last used mode (from Live mode, File mode, and System audio mode) and automatically loads into that mode.
System audio mode now skips recording to WAV files while pausing.

4/11/2025 - Version 10.6.1 (Windows)

System audio mode now skips recording to WAV files while pausing.

4/10/2025 - Version 10.6.0 (Windows)

System audio mode now supports text selection, copying, and finding operations while the transcription is running.
System audio mode now supports pause and resume operations.
SpeechPulse now remembers the last used mode (from Live mode, File mode, and System audio mode) and automatically loads into that mode.
A new option to disable the "SpeechPulse Minimized" notification.

4/8/2025 - Version 10.5.4 (macOS)

UI improvements.

4/8/2025 - Version 10.5.4 (Windows)

Reduced CPU usage when running on an NVIDIA GPU.

4/6/2025 - Version 10.5.3 (Windows)

UI improvements.

4/3/2025 - Version 10.5.2 (macOS)

You can now back up all of your SpeechPulse settings and data, including app settings, training profiles, training data, trained models, dictation history, and saved transcriptions/diarizations, using the new backup option.
The backup operation saves all settings and data to a single archive file on your local disk drive.
You can later restore SpeechPulse (on the same computer or on a different one) using the new restore option.
SpeechPulse doesn't support backup/restore between Windows and macOS (both computers should run Windows or both should run macOS).
A new option to increase the paste mode clipboard wait time to prevent any text insertion issues.

4/3/2025 - Version 10.5.2 (Windows)

Added a disk synchronization step before the backup operation. This will avoid any settings mismatch in a rare case of OS caching.

4/1/2025 - Version 10.5.1 (Windows)

Added support for the NVIDIA GPUs with Compute Capability 5.x or lower.

3/27/2025 - Version 10.5.0 (Windows)

You can now back up all of your SpeechPulse settings and data, including app settings, training profiles, training data, trained models, dictation history, and saved transcriptions/diarizations, using the new backup option.
The backup operation saves all settings and data to a single archive file on your local disk drive.
You can later restore SpeechPulse (on the same computer or on a different one) using the new restore option.
SpeechPulse doesn't support backup/restore between Windows and macOS (both computers should run Windows or both should run macOS).

3/23/2025 - Version 10.4.2 (Windows)

Added support for the new RTX 50 series NVIDIA GPUs.
GPU execution requires newer CUDA libraries, which you can download using the built-in library downloader.
A new option to enable a faster and higher accuracy computation mode on NVIDIA GPUs with Compute Capability 7.0 or higher and 6 GB or larger VRAM.
A new option to increase the paste mode clipboard wait time to prevent any text insertion issues.

3/16/2025 - Version 10.4.0 (macOS)

Now supports training for the French, Italian, Spanish, and Dutch languages (in addition to English and German).
Added voice commands for non-English languages.
For example, now you can say "Neue Zeile" to add a new line and "Neuer Absatz" to add a new paragraph while dictating in German.
Supported voice commands are listed for each language in the voice command guide (Settings->Voice command guide).
Several improvements to prevent missing punctuation marks in non-English languages.
Fixed an issue that caused the Text inserter to insert multiple times in the real-time processing mode.
Fixed an issue that caused the "New line" and "New paragraph" commands to fail when dictated after a pause.
Misc. improvements.

3/11/2025 - Version 10.4.0 (Windows)

Added voice commands for more languages.
Fixed an issue that caused SpeechPulse to fail with conflicting CUDA libraries.
Misc. improvements.

3/6/2025 - Version 10.3.0 (Windows)

Now supports voice commands for the German, French, Italian, Spanish, and Dutch languages (in addition to English).
For example, now you can say "Neue Zeile" to add a new line and "Neuer Absatz" to add a new paragraph while dictating in German.
Supported voice commands are listed for each language in the voice command guide (Settings->Voice command guide).
Voice commands will be available for other languages in the future.

3/5/2025 - Version 10.2.0 (Windows)

Now supports training for the French, Italian, Spanish, and Dutch languages (in addition to English and German).
Several improvements to prevent missing punctuation marks in non-English languages.
Fixed an issue that caused the Text inserter to insert multiple times in the real-time processing mode.
Fixed an issue that caused the "New line" and "New paragraph" commands to fail when dictated after a pause.
Misc. improvements.

2/27/2025 - Version 10.1.0 (macOS)

Now supports training for the German language.
Misc. improvements.

2/25/2025 - Version 10.1.0 (Windows)

Now supports training for the German language.
Misc. improvements.

2/22/2025 - Version 10.0.6 (macOS)

* Training on the SpeechPulse macOS version is currently running on the CPU. We will try to add GPU training in the future for faster training.

A new feature to support training new words to SpeechPulse.

2/18/2025 - Version 10.0.6 (Windows)

* The first training run in this new version will take slightly longer, depending on the number of words in the speech profile.

Several improvements for significantly better training accuracy.
Fixed an issue that caused training to get stuck at a suboptimal solution.

2/16/2025 - Version 10.0.5 (Windows)

Fixed an issue that caused the UI to slow down.
Fixed an issue that caused poor training accuracy.

2/15/2025 - Version 10.0.4 (Windows)

Reduced the disk space required for training data.
Misc. improvements for training.

2/12/2025 - Version 10.0.3 (Windows)

Several improvements to make the training faster and more accurate.

2/11/2025 - Version 10.0.2 (Windows)

Several improvements for better training accuracy.

2/11/2025 - Version 10.0.1 (Windows)

Several improvements to the training feature.

2/10/2025 - Version 10.0.0 (Windows)

A new feature to support training new words to SpeechPulse.

1/19/2025 - Version 9.4.1 (Windows and macOS)

Transcription and diarization editors now export the edited subtitle files.
Misc. improvements.

1/8/2025 - Version 9.4.0 (macOS)

A new dynamic AI template to read the AI instructions from your dictation.
For example, you can say, "Correct grammar and spelling in the last dictated text," to correct any grammar/spelling issues in your last dictated text.
Similarly, you can say, "Correct grammar and spelling in the clipboard text," to correct any grammar/spelling issues in the text you have already copied to the clipboard (e.g., using CTRL+C).

1/7/2025 - Version 9.4.0 (Windows)

A new dynamic AI template to read the AI instructions from your dictation.
For example, you can say, "Correct grammar and spelling in the last dictated text," to correct any grammar/spelling issues in your last dictated text.
Similarly, you can say, "Correct grammar and spelling in the clipboard text," to correct any grammar/spelling issues in the text you have already copied to the clipboard (e.g., using CTRL+C).

1/5/2025 - Version 9.3.2 (macOS)

UI improvements.
Several performance improvements for the transcription window.
Fixed an issue that causes higher memory usage in the transcription and diarization windows.
Fixed an issue that causes higher memory usage in the system audio mode.

1/5/2025 - Version 9.3.2 (Windows)

Several performance improvements for the transcription window.
Fixed an issue that causes higher memory usage in the system audio mode.

1/2/2025 - Version 9.3.1 (Windows)

Fixed an issue that can cause missing speech segments around silent parts of the audio (Live mode, File mode).
Fixed an issue that causes higher memory usage in the transcription and diarization windows.
Misc. improvements.

12/30/2024 - Version 9.3.0 (Windows)

File mode now supports adding new files, removing pending files, and reordering the file list using drag and drop while the transcription is running.
UI improvements.

12/27/2024 - Version 9.2.1 (Windows)

UI improvements (System audio mode).
A new hotkey to enable real-time processing.
Misc. improvements.

12/27/2024 - Version 9.2.1 (macOS)

UI improvements (System audio mode).
A new hotkey to enable real-time processing.
Misc. improvements.

12/25/2024 - Version 9.2.0 (macOS)

A new speaker diarization implementation for the file mode (in addition to the original implementation).
The new diarization implementation supports setting active speakers per file for better accuracy. It is also significantly faster than the original implementation.
Real-time processing is now more accurate and responsive (live mode and system audio mode).
Fixed an issue of missing speech segments in the real-time live mode and system audio mode.
Misc. improvements.

12/20/2024 - Version 9.2.0 (Windows)

A new speaker diarization implementation for the File mode (in addition to the original implementation).
The new diarization implementation supports setting active speakers per file for better accuracy.
The new diarization implementation is significantly faster than the original implementation.
You can switch between the two diarization implementations using the File mode UI.

12/18/2024 - Version 9.1.2 (macOS)

Automatic speaker diarization in the system audio mode can segment the transcription for each individual speaker.
Several improvements to the automatic speaker diarization (File mode and system audio mode).
Fixed an issue that caused the push-to-talk auto mic off function to fail.
UI improvements.

12/18/2024 - Version 9.1.2 (Windows)

UI improvements.

12/17/2024 - Version 9.1.1 (Windows)

Several improvements to the automatic speaker diarization (File mode and system audio mode).
UI improvements.

12/16/2024 - Version 9.1.0 (Windows)

Several improvements to the automatic speaker diarization (File mode and system audio mode).
Fixed an issue that caused slightly lower accuracy and higher delay in the live dictation mode.

12/12/2024 - Version 9.0.1 (Windows)

Fixed an issue that caused SpeechPulse to crash with some non-English languages.

12/09/2024 - Version 9.0.0 (Windows)

Automatic speaker diarization in the system audio mode can segment the transcription for each individual speaker.

12/03/2024 - Version 8.3.0 (macOS)

Diarization and transcription editors now use the left mouse click to set the audio player position.
Diarization and transcription editors now support play/pause using the Tab key.
The diarization editor now uses the Enter key to split segments.
Pressing the Backspace key at the beginning of a diarization segment will combine it with the previous segment.
UI improvements.

12/03/2024 - Version 8.3.0 (Windows)

Diarization and transcription editors now use the left mouse click to set the audio player position.
Diarization and transcription editors now support play/pause using the Tab key.
The diarization editor now uses the Enter key to split segments.
Pressing the Backspace key at the beginning of a diarization segment will combine it with the previous segment.
UI improvements.

11/30/2024 - Version 8.2.3 (Windows)

A new option to select a non-default output device as the system audio source.

11/30/2024 - Version 8.2.2 (macOS)

Several improvements/fixes for the system audio mode.

11/30/2024 - Version 8.2.2 (Windows)

Several improvements/fixes for the system audio mode.

11/29/2024 - Version 8.2.1 (macOS)

Diarization and transcription editors now support the "Find and Replace" function.
Now supports manually setting speaker counts for diarization.
The Minimize/Maximize hotkey is now remapped to the "Minimize to tray/Maximize from tray" functionality.
New options to remove silent/noisy audio segments and reduce text hallucinations in the system audio mode.
Fixed an issue that caused incorrect spacing in the system audio mode.

11/28/2024 - Version 8.2.1 (Windows)

Fixed an issue that caused incorrect spacing in the system audio mode.

11/28/2024 - Version 8.2.0 (Windows)

Diarization and transcription editors now support the "Find and Replace" function.
Now supports manually setting speaker counts for diarization.
The Minimize/Maximize hotkey is now remapped to the "Minimize to tray/Maximize from tray" functionality.
New options to remove silent/noisy audio segments and reduce text hallucinations in the system audio mode.

11/24/2024 - Version 8.1.4 (macOS)

A new option to enable high-accuracy timestamps for the system audio mode.
A new option to enable high-accuracy timestamps for the Real-time Live Mode.

11/23/2024 - Version 8.1.3 (macOS)

A new option to combine microphone and system transcriptions into a single editor in the system audio mode.
A new option to enable tagging/highlighting microphone and system transcriptions in the combined system audio mode.
A new option to set a silence timeout for the system audio mode. SpeechPulse will consider a speech segment is complete after this duration of silence.
A new option to automatically restart the system transcription in case the system audio stream is stopped.
Misc. improvements.

11/20/2024 - Version 8.1.3 (Windows)

Several improvements for the system audio mode.
Misc. improvements.

11/19/2024 - Version 8.1.2 (Windows)

A new option to enable tagging/highlighting microphone and system transcriptions in the combined system audio mode.
A new option to set a silence timeout for the system audio mode. SpeechPulse will consider a speech segment is complete after this duration of silence.
Disabled the mouse wheel scrolling of dropdown controls to prevent accidental selection.
Misc. improvements.

11/16/2024 - Version 8.1.1 (Windows)

A new option to combine microphone and system transcriptions into a single editor in the system audio mode.
Several performance improvements (reduced CPU and RAM usage).

11/13/2024 - Version 8.1.0 (Windows)

Timestamps in system audio mode are now more accurate.
System audio mode now displays a processing message if any audio remains to be transcribed.

11/10/2024 - Version 8.0.8 (macOS)

Several performance improvements for transcription and diarization UIs.

11/10/2024 - Version 8.0.8 (Windows)

Several performance improvements for transcription and diarization UIs.
Fixed an issue that caused paragraph segmentation to fail.

11/09/2024 - Version 8.0.7 (macOS)

System audio mode supports real-time transcription of mic and system audio to an internal editor (no mouse focus required).
Record mic and system audio to WAV files.
Automatic paragraph segmentation and improved sentence segmentation.
The diarization edit speaker names window now suggests existing speaker names when you type.
Fixed an issue that caused diarization (via API) to fail with long audio segments.
Supports translation to English via Whisper APIs (in addition to via offline models).
Added support for some missing audio file formats, including Opus.
A new hotkey to add text replacements (mappings) without opening the settings window.
Fixed an issue that caused high memory usage on transcription and diarization windows.
Transcription and diarization windows are now more responsive (reduced CPU and RAM usage).
Misc. improvements/fixes.

11/08/2024 - Version 8.0.7 (Windows)

Fixed an issue that caused high memory usage on transcription and diarization windows.
Transcription and diarization windows are now more responsive (reduced CPU and RAM usage).

11/03/2024 - Version 8.0.6 (Windows)

Several improvements to prevent missing punctuation marks in Live Mode, File Mode, and system audio mode.

11/02/2024 - Version 8.0.5 (Windows)

Fixed an issue that caused SpeechPulse to reset model/device settings under certain conditions.

10/30/2024 - Version 8.0.4 (Windows)

Updated the Multi (turbo) model with a newer, more accurate version. You can delete the older model and download the new one using the built-in model downloader.
Fixed an issue that caused SpeechPulse to hang in the system audio mode on some PCs/laptops.
Paragraph segmentation is now skippable (to reduce transcription duration for lengthy audio files).
Several improvements to prevent missing punctuation marks in lengthy audio files (English language).

10/29/2024 - Version 8.0.3 (Windows)

Reduce text hallucinations in the Live Mode and File Mode.
Misc. improvements.

10/28/2024 - Version 8.0.2 (Windows)

Reduce text hallucinations in the system audio mode.
System audio mode now supports timestamps.

10/26/2024 - Version 8.0.1 (Windows)

Several performance/accuracy improvements for the system audio mode.

10/25/2024 - Version 8.0.0 (Windows)

System audio mode supports real-time transcription of mic and system audio to an internal editor (no mouse focus required).
Record mic and system audio to WAV files.
Automatic paragraph segmentation and improved sentence segmentation.
The diarization edit speaker names window now suggests existing speaker names when you type.
Fixed an issue that caused diarization (via API) to fail with long audio segments.
Supports translation to English via Whisper APIs (in addition to via offline models).
Added support for some missing audio file formats, including Opus.
A new hotkey to add text replacements (mappings) without opening the settings window.
Misc. improvements/fixes.

10/14/2024 - Version 7.1.0 (macOS)

Now supports real-time transcription in live mode (Experimental - Currently has a high delay when inserting/editing the dictated text due to input buffering on macOS).
A new option to edit/delete the saved diarization speakers.
Fixed an issue that caused SpeechPulse to crash when the transcription contains invalid characters.

10/5/2024 - Version 7.1.0 (Windows)

SpeechPulse (Windows) now supports the Whisper V3 (turbo) model. You can download it using the built-in model downloader.
This model has comparable accuracy to the Whisper (large) model and runs significantly faster.
Real-time processing mode now only updates the changed portion of the dictation, making dictation less distracting.

10/4/2024 - Version 6.3.3 (macOS)

SpeechPulse (macOS) now comes with the Whisper V3 (turbo) model as the default model. This model has comparable accuracy to the Whisper (large) model and runs significantly faster.

10/1/2024 - Version 7.0.1 (Windows)

Several performance improvements for the real-time processing.

9/30/2024 - Version 7.0.0 (Windows)

* Please uninstall any older SpeechPulse versions and delete the installation folder (e.g., C:\Program Files\SpeechPulse) before installing this new version. During the uninstall procedure, you have the option to keep the downloaded models and CUDA libraries—there is no need to delete them.

Now supports real-time transcription in live mode.
Real-time processing significantly improves the accuracy of both Auto and Manual punctuation modes.
A new option to edit/delete the saved diarization speakers.

9/18/2024 - Version 6.3.2 (macOS)

Fixed an issue that caused SpeechPulse (macOS) to crash when pressing the caps lock key.

9/17/2024 - Version 6.3.1 (macOS)

Reduce text hallucinations/repetitions in the Live mode.
Now automatically re-transcribes any incorrectly transcribed portions of the text in the File mode (e.g., text repetitions/hallucinations).
A new transcription editor similar to the diarization editor.
Both transcription and diarization editors now support re-transcribing selected portions of the text. You can select sentences, paragraphs, or any block of text and re-transcribe using the context menu.
SpeechPulse uses different internal settings for each re-transcription, making it possible to correct any text repetitions/hallucinations.
You can also restore a previous transcription using the context menu.
Misc. improvements.

9/17/2024 - Version 6.3.1 (Windows)

Several performance improvements for the transcription editor to support lengthy audio files.

9/14/2024 - Version 6.3.0 (Windows)

A new transcription editor similar to the diarization editor.
Both transcription and diarization editors now support re-transcribing selected portions of the text. You can select sentences, paragraphs, or any block of text and re-transcribe using the context menu.
Misc. improvements.

9/11/2024 - Version 6.2.0 (Windows)

Reduce text hallucinations/repetitions in the Live mode.
Now automatically re-transcribes any incorrectly transcribed portions of the text in the File mode (e.g., text repetitions/hallucinations).
Also supports manually re-transcribing audio segments in the diarization window. You can use this feature to re-transcribe any incorrectly transcribed segments.
SpeechPulse uses different internal settings for each re-transcription, making it possible to correct any text repetitions/hallucinations.
You can also restore a previous transcription for a segment using the context menu.

9/9/2024 - Version 6.1.5 (Windows)

A new option to play a notification sound after inserting text into the text edit area (in Live Mode).
A new option to change the diarization editor font family (via "Settings->Options->General settings").
Misc. improvements.

9/9/2024 - Version 6.1.5 (macOS)

Misc. improvements.

9/8/2024 - Version 6.1.4 (macOS)

A new option to play a notification sound after inserting text into the text edit area (in Live Mode).
Fixed an issue that caused the audio slider in the diarization window to get stuck with 0 duration.

9/7/2024 - Version 6.1.3 (macOS)

Better accuracy for word-level timestamps.
Word highlighting in the diarization window now has better sync with the audio.
Changed the app name from "speechpulse" to "SpeechPulse" (please remove the previous version from the Application folder before installing this new one).

9/6/2024 - Version 6.1.2 (macOS)

A new option to change the diarization editor font family (via "Settings->Options->General settings").
Fixed an issue that caused SpeechPulse to crash during app exit after running speaker diarization.
Misc. improvements.

9/5/2024 - Version 6.1.1 (macOS)

Now highlights the current word during playback in the diarization window.
Supports playback from an arbitrary position in the dirization window. Simply right-click on a word and select Play to start playback from the current word.

9/5/2024 - Version 6.1.1 (Windows)

Several improvements for word highlighting in the diarization window.

9/4/2024 - Version 6.1.0 (Windows)

Now highlights the current word during playback in the diarization window.
Supports playback from an arbitrary position in the dirization window. Simply right-click on a word and select Play to start playback from the current word.
Now supports the .caf (Core Audio Format) file format in File Mode.

9/2/2024 - Version 6.0.0 (macOS)

SpeechPulse now supports a new light theme, allowing you to choose between dark and light modes.
Now supports the .caf (Core Audio Format) file format in File Mode.

9/1/2024 - Version 6.0.0 (Windows)

SpeechPulse now supports a new light theme, allowing you to choose between dark and light modes.

Dark Mode Light Mode

8/29/2024 - Version 5.3.8 (macOS)

Remove any preceding spaces in the Whisper speech API responses.

8/29/2024 - Version 5.3.8 (Windows)

Custom vocabularies and custom prompts now work with Whisper speech APIs.
Remove any preceding spaces in the Whisper speech API responses.

8/28/2024 - Version 5.3.7 (Windows and macOS)

Saved diarizations now use half the file size to save audio files.

8/27/2024 - Version 5.3.6 (Windows)

Improved the speaker diarization implementation to prevent misaligned diarization boundaries.

8/26/2024 - Version 5.3.6 (macOS)

Improved the speaker diarization implementation to prevent misaligned diarization boundaries.

8/25/2024 - Version 5.3.5 (macOS)

File mode and speaker diarization on macOS now use available memory more efficiently. This prevents a possible slowdown on base model M series Macs with 8 GB RAM.

8/24/2024 - Version 5.3.4 (macOS)

A new context menu option to independently edit speaker names for each cell in the diarization window.
Fixed a decoding issue that caused speaker diarization to fail on some specific video files.
Misc. improvements for diarization.

8/24/2024 - Version 5.3.4 (Windows)

A new context menu option to independently edit speaker names for each cell in the diarization window.
Misc. improvements for diarization.

8/21/2024 - Version 5.3.3 (Windows and macOS)

Batch file transcription now ignores any invalid audio/video files and continues to process all valid files.
Auto speaker name detection now saves speaker names in the diarization window of previously saved diarizations.

8/16/2024 - Version 5.3.2 (Windows)

Fixed an issue that caused SpeechPulse to crash/hang on some Windows PCs/laptops.

8/15/2024 - Version 5.3.1 (macOS)

Now you can save the speaker diarization output for future editing.
A new option to change the font size of the speaker diarization editor.
Misc. improvements.

8/14/2024 - Version 5.3.1 (Windows)

Now you can save the speaker diarization output for future editing.
A new option to change the font size of the speaker diarization editor.
Misc. improvements.

8/10/2024 - Version 5.3.0 (macOS)

Now supports custom vocabularies in the Auto punctuation mode.
Speaker diarization now supports automatic speaker tagging. You only need to enter each speaker's name once. SpeechPulse will automatically add speaker names for future transcriptions.
Fixed an issue that caused incorrect diarization and incorrect subtitles with the OpenAI Whisper API.
Fixed an issue of missing punctuation in the Auto punctuation mode.
Misc. improvements.

8/9/2024 - Version 5.3.0 (Windows)

Now supports custom vocabularies in the Auto punctuation mode.
Speaker diarization now supports automatic speaker tagging. You only need to enter each speaker's name once. SpeechPulse will automatically add speaker names for future transcriptions.
Fixed an issue that caused incorrect diarization and incorrect subtitles with the OpenAI Whisper API.
Fixed an issue of missing punctuation in the Auto punctuation mode.
Misc. improvements.

8/7/2024 - Version 5.2.2 (Windows)

Fixed an issue that caused the edit cursor to follow the mouse in the AI template and diarization windows.
UI improvements.

8/1/2024 - Version 5.2.3 (macOS)

UI improvements.

7/30/2024 - Version 5.2.2 (macOS)

Fixed an issue that caused the edit cursor to follow the mouse in the AI template and diarization windows.

7/22/2024 - Version 5.2.1 (macOS)

History file list now supports navigation with arrow keys.
A new button to open the history folder in Finder.
Re-added custom prompts.
Misc. improvements.

7/16/2024 - Version 5.2.0 (Windows)

History file list now supports navigation with arrow keys.
A new button to open the history folder in file explorer.
Re-added custom prompts.

7/14/2024 - Version 5.1.0 (macOS)

Notify the user when new SpeechPulse versions are available.

7/13/2024 - Version 5.1.0 (Windows)

Notify the user when new SpeechPulse versions are available.
Fixed an issue that caused SpeechPulse to reset model settings.

7/12/2024 - Version 5.0.0 (macOS)

A new UI with a modern look.
Misc. improvements.

7/10/2024 - Version 5.0.0 (Windows)

A new UI with a modern look.
Misc. improvements.

Live Mode New live mode UI File Mode New file mode UI History View New history mode UI Speaker Diarization View New Speaker Diarization UI

7/2/2024 - Version 4.5.6 (Windows)

Fixed an issue that caused SpeechPulse to crash silently in some specific CPUs and Windows versions.

6/27/2024 - Version 4.5.3 (Windows and macOS)

SpeechPulse can now process the currently copied text in the clipboard using AI language models. First, use CTRL+C to copy the text to the clipboard. Then, start clipboard processing using the clipboard processing hotkey (configurable via "Settings->Options->Hotkeys").
Fixed an issue that caused incorrect formatting of phone numbers.

6/25/2024 - Version 4.5.2 (Windows)

Fixed an issue that caused UI artifacts when moving SpeechPulse UI between two monitors with different DPI scaling.

6/22/2024 - Version 4.5.1 (Windows and macOS)

Fixed an issue that caused significant text hallucinations and poor accuracy in the Auto punctuation mode.
The automatic microphone on/off feature in the push-to-talk mode is now optional. It can be enabled/disabled via "Settings->Options->General settings".

6/20/2024 - Version 4.5.0 (Windows)

SpeechPulse (Windows) now comes with a built-in model/library downloader.
Model downloader can be used to download speech models, language models, and CUDA GPU libraries.

6/16/2024 - Version 4.4.0 (macOS)

SpeechPulse (macOS) now supports OpenAI compatible Whisper speech APIs.
Fixed an issue that prevented AI template editing.
UI improvements.

6/14/2024 - Version 4.4.0 (Windows)

SpeechPulse (Windows) now supports OpenAI compatible Whisper speech APIs.
UI improvements.

6/11/2024 - Version 4.3.1 (macOS)

SpeechPulse (macOS) now supports OpenAI compatible external AI language (LLM) APIs. For example, you can connect to your Ollama server using SpeechPulse.
Improved internal implementation for the AI language (LLM) feature. Now delivers more precise results for your AI templates.
Push-to-talk mode now automatically turns on/off the microphone.
A new option to keep the speech model loaded in RAM for faster transcription.
Misc. improvements.

6/09/2024 - Version 4.3.1 (Windows)

Push-to-talk mode now automatically turns on/off the microphone.
Better error handling when connecting to LLM APIs.
Fixed an issue that caused the auto-microphone-off feature to stop working.
UI improvements.

6/08/2024 - Version 4.3.0 (Windows)

SpeechPulse (Windows) now supports OpenAI compatible external AI language (LLM) APIs. For example, you can connect to your Ollama server using SpeechPulse.
Improved internal implementation for the AI language (LLM) feature. Now delivers more precise results for your AI templates.
Fixed an issue that caused missing words near the diarization boundaries.
Misc. improvements.

6/03/2024 - Version 4.2.0 (macOS)

SpeechPulse (macOS) now supports automatic speaker diarization.
File mode now supports generating all output formats in a single pass.
A new option to limit the number of words per subtitle line.
UI improvements.

5/26/2024 - Version 4.1.4 (macOS)

You can configure how many days of previous recordings to keep or disable history recordings via "Settings->Options->General settings". (this option was missing in the previous version)
A new indicator to inform users when the push-to-talk mode is active.

5/26/2024 - Version 4.2.0 (Windows)

SpeechPulse (Windows) now supports automatic speaker diarization.
Supports both CPU and GPU execution for diarization.
File mode now supports generating all output formats in a single pass.
A new option to limit the number of words per subtitle line.
A new indicator to inform users when the push-to-talk mode is active.
UI improvements.
Removed the support for onnx models due to their limitations.

5/16/2024 - Version 4.1.3 (macOS)

SpeechPulse now supports recording your dictations as WAV files inside the "Documents/SpeechPulse" folder.
You can configure how many days of previous recordings to keep or disable history recordings via "Settings->Options->General settings".
The new history window allows you to process your previous dictation recordings using different speech models, language models, AI Templates, or any other different settings.
Now supports importing other compatible offline language models. For example, you can run LLAMA 3 using SpeechPulse. Just place the GGUF language model file inside the SpeechPulse models directory. SpeechPulse will automatically detect the model on startup. For AI language models on macOS, it's recommended to have at least 16GB of RAM.
Fixed an issue that caused SpeechPulse to skip AI formatting when dictated to the built-in editor.
UI changes for better usability on high DPI screens.

5/14/2024 - Version 4.0.1 (macOS)

Fixed an issue that caused SpeechPulse to output text in all lowercase letters in the manual punctuation mode.

5/14/2024 - Version 4.1.3 (Windows)

Fixed an issue that caused SpeechPulse to output text in all lowercase letters in the manual punctuation mode.

5/12/2024 - Version 4.1.2 (Windows)

Fixed an issue that caused SpeechPulse to crash with the latest NVIDIA drivers. Now supports all the recent driver versions.
UI improvements.

5/10/2024 - Version 4.1.0 (Windows)

SpeechPulse now supports recording your dictations as WAV files inside the "Documents/SpeechPulse" folder.
You can configure how many days of previous recordings to keep or disable history recordings via "Settings->Options->General settings".
The new history window allows you to process your previous dictation recordings using different speech models, language models, AI Templates, or any other different settings.
Now supports importing other compatible offline language models. For example, you can run LLAMA 3 using SpeechPulse. Just place the GGUF language model file inside the SpeechPulse models directory. SpeechPulse will automatically detect the model on startup.
Fixed an issue that caused SpeechPulse to skip AI formatting when dictated to MS Word, WordPad, and the built-in editor.
UI changes for better usability on high DPI screens.

5/05/2024 - Version 4.0.0 (macOS)

SpeechPulse (macOS) now supports offline AI language models for text formatting (currently only for the English language). You can use them to enhance your dictated text in real time.
Common use cases include grammar, spelling, and punctuation correction, summarizing text, formatting text for Email, chat, notes, etc.
AI language models support prompting to get the desired output. SpeechPulse has an "AI Templates" feature where you can customize the prompts for your specific use cases.
File mode also supports AI language models for text processing.
A new option to customize the deactivation delay in the Automatic speech input mode.
A new option to increase the context length of the AI language models (longer context lengths require more RAM).
Use memory more efficiently.

5/03/2024 - Version 4.0.1 (Windows)

A new option to increase the context length of the AI language models (longer context lengths require more RAM/VRAM).

5/02/2024

Replaced the original English (standard) AI language model with a slightly better one.

5/01/2024 - Version 4.0.0 (Windows)

SpeechPulse (Windows) now supports offline AI language models for text formatting (currently only for the English language). You can use them to enhance your dictated text in real time.
Common use cases include grammar, spelling, and punctuation correction, summarizing text, formatting text for Email, chat, notes, etc.
AI language models support prompting to get the desired output. SpeechPulse has an "AI Templates" feature where you can customize the prompts for your specific use cases.
AI language models support both CPU and GPU execution. However, a GPU is recommended for faster live transcription.
File mode also supports AI language models for text processing.
A new option to customize the deactivation delay in the Automatic speech input mode.

4/25/2024 - Version 3.7.2

Fixed an issue that caused SpeechPulse to ignore audio files with capital letters in the file extension.

4/25/2024 - Version 3.7.1

A new option to enable/disable automatic number formatting in the manual punctuation mode.
Better formatting for numbers that include the words million and billion.
Fixed an issue that caused incorrect date formatting.

4/24/2024 - Version 3.7.0 (macOS)

* You may have to modify your existing custom mappings with this update (especially the mappings that include the period symbol).

Manual punctuation mode now supports automatically formatting numbers, dates, currency values, etc.
Fixed an issue that caused missing punctuation marks in the manual punctuation mode.
Fixed an issue where SpeechPulse would insert a letter when the push-to-talk hotkey was released.

4/23/2024 - Version 3.7.0 (Windows)

* You may have to modify your existing custom mappings with this update (especially the mappings that include the period symbol).

Manual punctuation mode now supports automatically formatting numbers, dates, currency values, etc.
Fixed an issue that caused missing punctuation marks in the manual punctuation mode.
A new option to press the ENTER key after inserting text into the text edit area.
Allow file mode even if no microphone is connected.
UI improvements.

4/16/2024 - Version 3.6.9 (macOS)

A new option to press the ENTER key after inserting text into the text edit area.
Allow file mode even if no microphone is connected.
UI improvements.

4/14/2024 - Version 3.6.7 (macOS)

Fixed a crash that occurred in the activation window.
UI improvements.

4/10/2024 - Version 3.6.8 (Windows)

Fixed a crash that occurred when switching models or quitting the program on a GPU.
Misc. improvements.

4/9/2024 - Version 3.6.7 (Windows)

Fixed an issue that prevented graceful termination.
Misc. improvements.

4/8/2024 - Version 3.6.6 (macOS)

Fixed an issue that caused SpeechPulse to return a previous transcription result for failed transcriptions.
Preserve punctuation mode when switching languages.
Export/import custom mappings to easily transfer mappings from one installation to another.
A new controls overview window to explain different punctuation, spacing, and speech input modes.
Fixed an issue that caused missing words after a hyphen in the manual punctuation mode.
Capitalization correction now supports phrases with hyphens (e.g. Wi-Fi).
Fixed an issue that prevented graceful termination.
Misc. improvements.

4/7/2024 - Version 3.6.6 (Windows)

Preserve punctuation mode when switching languages.
Export/import custom mappings to easily transfer mappings from one installation to another.
A new controls overview window to explain different punctuation, spacing, and speech input modes.
Fixed an issue that prevented graceful termination.
Faster model load times.
Misc. improvements.

4/5/2024 - Version 3.6.5 (Windows)

Fixed an issue that caused missing words after a hyphen in the manual punctuation mode.
Capitalization correction now supports phrases with hyphens (e.g. Wi-Fi).
UI improvements.

4/1/2024 - Version 3.6.4

Fixed an issue that caused incorrect mappings.

3/29/2024 - Version 3.6.3

Fixed an issue caused by incompatible character encodings on different systems.

3/29/2024 - Version 3.6.2 (macOS)

Fixed an issue that caused incorrect transcription for longer audio segments.
File mode now supports manual punctuation (For English full-text transcription).
Misc. improvements.

3/28/2024 - Version 3.6.2 (Windows)

File mode now supports manual punctuation (For English full-text transcription).
Faster initial load times.
Misc. improvements.

3/25/2024 - Version 3.6.1 (Windows)

On Windows, prevent hotkeys from making a notification sound.

3/24/2024 - Version 3.6.0

Allows you to add a list of proper names for capitalization correction.
Prevents subword replacements in the Mappings feature (non-RegEx mode).

3/21/2024 - Version 3.5.4 (Windows)

Fixed an issue that caused incorrect capitalization for the first-person pronoun "I".

3/20/2024 - Version 3.5.3

A new option to automatically reduce system output volume during push-to-talk.
Improved capitalization in the manual punctuation mode.

3/15/2024 - Version 3.5.2 (macOS)

Stop hotkeys and auto spacing from making a notification sound on macOS.

3/14/2024 - Version 3.5.1

Fixed the issue of missing period punctuation marks in the manual punctuation mode.
Reduce text hallucinations.

3/2/2024 - Version 3.5.0

Now supports manual spacing.
A new option to load SpeechPulse in the background and automatically minimize to the system tray.
A taskbar notification to indicate if the microphone is not working (need to enable via Settings).
Two new hotkeys for push-to-talk with auto punctuation and manual punctuation.

2/26/2024 - Version 3.4.2

Reduce text hallucinations.

2/21/2024 - Version 3.4.1

A history page to display previously dictated text of the current session.
Disabled Speech Profiles in the push-to-talk mode.
UI improvements.

2/15/2024 - Version 3.4.0

Push-to-talk speech input with a new hotkey.

2/12/2024 - Version 3.3.7

New options to enable/disable voice activity detection.
Now supports replacing text selections with dictation.

2/06/2024 - Version 3.3.6

Automatically transfer focus from the SpeechPulse window to the last active text edit.
Displays a "No text edit in focus!" message if there's no text edit in focus.

2/04/2024 - Version 3.3.5

Disabled automatic turn-off in file mode.
UI improvements.

2/03/2024 - Version 3.3.4

A new option to automatically stop listening when the user starts typing.
Taskbar notifications to indicate start/stop listening.
Notification sounds to indicate start/stop listening.
Now remember the file mode output format between sessions.
A new option to make the processing indicator white when dictating to a dark background.

2/01/2024 - Version 3.3.3

Now supports customizable subtitle widths in file mode.
Supports timestamps in live mode (With the Auto punctuation).
Direct text insertion in Microsoft WordPad.
Uses a faster implementation for type mode.

1/27/2024 - Version 3.3.2

Fixed an issue that prevented text generation when dictating to Microsoft Word.
The mappings feature now supports case-insensitive replacement and wildcards (regular expressions).
Better capitalization in the manual punctuation mode.

1/23/2024 - Version 3.3.1

If you are in a noisy environment and SpeechPulse generates random text, you can use the Speech Profiles feature to add a new speech profile that only detects your own voice.
Reduce text hallucinations.

1/19/2024 - Version 3.3.0

Now supports direct text insertion, editing, and formatting in Microsoft Word.
A new button to minimize SpeechPulse to the system tray.

1/14/2024 - Version 3.2.0

Improved capitalization in the manual punctuation mode.
Improved editor with voice commands for text formatting.
Keyboard press commands to press keys and hotkeys with your voice. (e.g. "Press Enter", "Press Control Z")
Displays a processing indicator and a dictated text label next to the cursor for better user experience.
Better and more reliable voice command implementation.
A new hotkey to switch punctuation modes.

12/13/2023 - Version 3.1.0

Now remembers language model, device, microphone, and file mode output folder settings between sessions.

12/10/2023 - Version 3.0.0

With the manual punctuation mode, you can dictate common punctuation marks like commas, periods, question marks, exclamation marks, colons, semicolons, etc.
Supports "new line" and "new paragraph" commands within a continuous speech segment.

12/02/2023 - Version 2.5.0

Added language hotkeys.
New voice command "Transfer text" for SpeechPulse editor.
UI improvements.

11/29/2023 - Version 2.4.0

Now comes with a built-in text editor.

11/14/2023 - Version 2.3.0

New Custom Voice Hotkeys feature can trigger custom keyboard shortcuts with voice commands.
Text Inserter can insert custom text snippets with voice commands.
Misc. improvements.

11/11/2023 - Version 2.2.0

New Custom Mappings feature can replace SpeechPulse's text output with your own words/phrases. For example, you can replace the phrase "speech pulse" with "SpeechPulse" using custom mappings.

11/08/2023 - Version 2.1.1

Trial version now supports all language models.
Changed the hotkey implementation.

11/06/2023 - Version 2.1.0

Added voice commands.
Lower latency in Live Mode.
More robust against background noise.

11/01/2023 - Version 2.0.0

Added hotkeys.

SpeechPulse Updates

Quick Links

Legal

Contact Us