SpeechPulse Updates
12/20/2024 - Version 9.2.0 (Windows)
- A new speaker diarization implementation for the File mode (in addition to the original implementation).
- The new diarization implementation supports setting active speakers per file for better accuracy.
- The new diarization implementation is significantly faster than the original implementation.
- You can switch between the two diarization implementations using the File mode UI.
12/18/2024 - Version 9.1.2 (macOS)
- Automatic speaker diarization in the system audio mode can segment the transcription for each individual speaker.
- Several improvements to the automatic speaker diarization (File mode and system audio mode).
- Fixed an issue that caused the push-to-talk auto mic off function to fail.
- UI improvements.
12/18/2024 - Version 9.1.2 (Windows)
- UI improvements.
12/17/2024 - Version 9.1.1 (Windows)
- Several improvements to the automatic speaker diarization (File mode and system audio mode).
- UI improvements.
12/16/2024 - Version 9.1.0 (Windows)
- Several improvements to the automatic speaker diarization (File mode and system audio mode).
- Fixed an issue that caused slightly lower accuracy and higher delay in the live dictation mode.
12/12/2024 - Version 9.0.1 (Windows)
- Fixed an issue that caused SpeechPulse to crash with some non-English languages.
12/09/2024 - Version 9.0.0 (Windows)
- Automatic speaker diarization in the system audio mode can segment the transcription for each individual speaker.
12/03/2024 - Version 8.3.0 (macOS)
- Diarization and transcription editors now use the left mouse click to set the audio player position.
- Diarization and transcription editors now support play/pause using the Tab key.
- The diarization editor now uses the Enter key to split segments.
- Pressing the Backspace key at the beginning of a diarization segment will combine it with the previous segment.
- UI improvements.
12/03/2024 - Version 8.3.0 (Windows)
- Diarization and transcription editors now use the left mouse click to set the audio player position.
- Diarization and transcription editors now support play/pause using the Tab key.
- The diarization editor now uses the Enter key to split segments.
- Pressing the Backspace key at the beginning of a diarization segment will combine it with the previous segment.
- UI improvements.
11/30/2024 - Version 8.2.3 (Windows)
- A new option to select a non-default output device as the system audio source.
11/30/2024 - Version 8.2.2 (macOS)
- Several improvements/fixes for the system audio mode.
11/30/2024 - Version 8.2.2 (Windows)
- Several improvements/fixes for the system audio mode.
11/29/2024 - Version 8.2.1 (macOS)
- Diarization and transcription editors now support the "Find and Replace" function.
- Now supports manually setting speaker counts for diarization.
- The Minimize/Maximize hotkey is now remapped to the "Minimize to tray/Maximize from tray" functionality.
- New options to remove silent/noisy audio segments and reduce text hallucinations in the system audio mode.
- Fixed an issue that caused incorrect spacing in the system audio mode.
11/28/2024 - Version 8.2.1 (Windows)
- Fixed an issue that caused incorrect spacing in the system audio mode.
11/28/2024 - Version 8.2.0 (Windows)
- Diarization and transcription editors now support the "Find and Replace" function.
- Now supports manually setting speaker counts for diarization.
- The Minimize/Maximize hotkey is now remapped to the "Minimize to tray/Maximize from tray" functionality.
- New options to remove silent/noisy audio segments and reduce text hallucinations in the system audio mode.
11/24/2024 - Version 8.1.4 (macOS)
- A new option to enable high-accuracy timestamps for the system audio mode.
- A new option to enable high-accuracy timestamps for the Real-time Live Mode.
11/23/2024 - Version 8.1.3 (macOS)
- A new option to combine microphone and system transcriptions into a single editor in the system audio mode.
- A new option to enable tagging/highlighting microphone and system transcriptions in the combined system audio mode.
- A new option to set a silence timeout for the system audio mode. SpeechPulse will consider a speech segment is complete after this duration of silence.
- A new option to automatically restart the system transcription in case the system audio stream is stopped.
- Misc. improvements.
11/20/2024 - Version 8.1.3 (Windows)
- Several improvements for the system audio mode.
- Misc. improvements.
11/19/2024 - Version 8.1.2 (Windows)
- A new option to enable tagging/highlighting microphone and system transcriptions in the combined system audio mode.
- A new option to set a silence timeout for the system audio mode. SpeechPulse will consider a speech segment is complete after this duration of silence.
- Disabled the mouse wheel scrolling of dropdown controls to prevent accidental selection.
- Misc. improvements.
11/16/2024 - Version 8.1.1 (Windows)
- A new option to combine microphone and system transcriptions into a single editor in the system audio mode.
- Several performance improvements (reduced CPU and RAM usage).
11/13/2024 - Version 8.1.0 (Windows)
- Timestamps in system audio mode are now more accurate.
- System audio mode now displays a processing message if any audio remains to be transcribed.
11/10/2024 - Version 8.0.8 (macOS)
- Several performance improvements for transcription and diarization UIs.
11/10/2024 - Version 8.0.8 (Windows)
- Several performance improvements for transcription and diarization UIs.
- Fixed an issue that caused paragraph segmentation to fail.
11/09/2024 - Version 8.0.7 (macOS)
- System audio mode supports real-time transcription of mic and system audio to an internal editor (no mouse focus required).
- Record mic and system audio to WAV files.
- Automatic paragraph segmentation and improved sentence segmentation.
- The diarization edit speaker names window now suggests existing speaker names when you type.
- Fixed an issue that caused diarization (via API) to fail with long audio segments.
- Supports translation to English via Whisper APIs (in addition to via offline models).
- Added support for some missing audio file formats, including Opus.
- A new hotkey to add text replacements (mappings) without opening the settings window.
- Fixed an issue that caused high memory usage on transcription and diarization windows.
- Transcription and diarization windows are now more responsive (reduced CPU and RAM usage).
- Misc. improvements/fixes.
11/08/2024 - Version 8.0.7 (Windows)
- Fixed an issue that caused high memory usage on transcription and diarization windows.
- Transcription and diarization windows are now more responsive (reduced CPU and RAM usage).
11/03/2024 - Version 8.0.6 (Windows)
- Several improvements to prevent missing punctuation marks in Live Mode, File Mode, and system audio mode.
11/02/2024 - Version 8.0.5 (Windows)
- Fixed an issue that caused SpeechPulse to reset model/device settings under certain conditions.
10/30/2024 - Version 8.0.4 (Windows)
- Updated the Multi (turbo) model with a newer, more accurate version. You can delete the older model and download the new one using the built-in model downloader.
- Fixed an issue that caused SpeechPulse to hang in the system audio mode on some PCs/laptops.
- Paragraph segmentation is now skippable (to reduce transcription duration for lengthy audio files).
- Several improvements to prevent missing punctuation marks in lengthy audio files (English language).
10/29/2024 - Version 8.0.3 (Windows)
- Reduce text hallucinations in the Live Mode and File Mode.
- Misc. improvements.
10/28/2024 - Version 8.0.2 (Windows)
- Reduce text hallucinations in the system audio mode.
- System audio mode now supports timestamps.
10/26/2024 - Version 8.0.1 (Windows)
- Several performance/accuracy improvements for the system audio mode.
10/25/2024 - Version 8.0.0 (Windows)
- System audio mode supports real-time transcription of mic and system audio to an internal editor (no mouse focus required).
- Record mic and system audio to WAV files.
- Automatic paragraph segmentation and improved sentence segmentation.
- The diarization edit speaker names window now suggests existing speaker names when you type.
- Fixed an issue that caused diarization (via API) to fail with long audio segments.
- Supports translation to English via Whisper APIs (in addition to via offline models).
- Added support for some missing audio file formats, including Opus.
- A new hotkey to add text replacements (mappings) without opening the settings window.
- Misc. improvements/fixes.
10/14/2024 - Version 7.1.0 (macOS)
- Now supports real-time transcription in live mode (Experimental - Currently has a high delay when inserting/editing the dictated text due to input buffering on macOS).
- A new option to edit/delete the saved diarization speakers.
- Fixed an issue that caused SpeechPulse to crash when the transcription contains invalid characters.
10/5/2024 - Version 7.1.0 (Windows)
- SpeechPulse (Windows) now supports the Whisper V3 (turbo) model. You can download it using the built-in model downloader.
- This model has comparable accuracy to the Whisper (large) model and runs significantly faster.
- Real-time processing mode now only updates the changed portion of the dictation, making dictation less distracting.
10/4/2024 - Version 6.3.3 (macOS)
- SpeechPulse (macOS) now comes with the Whisper V3 (turbo) model as the default model. This model has comparable accuracy to the Whisper (large) model and runs significantly faster.
10/1/2024 - Version 7.0.1 (Windows)
- Several performance improvements for the real-time processing.
9/30/2024 - Version 7.0.0 (Windows)
* Please uninstall any older SpeechPulse versions and delete the installation folder (e.g., C:\Program Files\SpeechPulse) before installing this new version. During the uninstall procedure, you have the option to keep the downloaded models and CUDA libraries—there is no need to delete them.
- Now supports real-time transcription in live mode.
- Real-time processing significantly improves the accuracy of both Auto and Manual punctuation modes.
- A new option to edit/delete the saved diarization speakers.
9/18/2024 - Version 6.3.2 (macOS)
- Fixed an issue that caused SpeechPulse (macOS) to crash when pressing the caps lock key.
9/17/2024 - Version 6.3.1 (macOS)
- Reduce text hallucinations/repetitions in the Live mode.
- Now automatically re-transcribes any incorrectly transcribed portions of the text in the File mode (e.g., text repetitions/hallucinations).
- A new transcription editor similar to the diarization editor.
- Both transcription and diarization editors now support re-transcribing selected portions of the text. You can select sentences, paragraphs, or any block of text and re-transcribe using the context menu.
- SpeechPulse uses different internal settings for each re-transcription, making it possible to correct any text repetitions/hallucinations.
- You can also restore a previous transcription using the context menu.
- Misc. improvements.
9/17/2024 - Version 6.3.1 (Windows)
- Several performance improvements for the transcription editor to support lengthy audio files.
9/14/2024 - Version 6.3.0 (Windows)
- A new transcription editor similar to the diarization editor.
- Both transcription and diarization editors now support re-transcribing selected portions of the text. You can select sentences, paragraphs, or any block of text and re-transcribe using the context menu.
- Misc. improvements.
9/11/2024 - Version 6.2.0 (Windows)
- Reduce text hallucinations/repetitions in the Live mode.
- Now automatically re-transcribes any incorrectly transcribed portions of the text in the File mode (e.g., text repetitions/hallucinations).
- Also supports manually re-transcribing audio segments in the diarization window. You can use this feature to re-transcribe any incorrectly transcribed segments.
- SpeechPulse uses different internal settings for each re-transcription, making it possible to correct any text repetitions/hallucinations.
- You can also restore a previous transcription for a segment using the context menu.
9/9/2024 - Version 6.1.5 (Windows)
- A new option to play a notification sound after inserting text into the text edit area (in Live Mode).
- A new option to change the diarization editor font family (via "Settings->Options->General settings").
- Misc. improvements.
9/9/2024 - Version 6.1.5 (macOS)
- Misc. improvements.
9/8/2024 - Version 6.1.4 (macOS)
- A new option to play a notification sound after inserting text into the text edit area (in Live Mode).
- Fixed an issue that caused the audio slider in the diarization window to get stuck with 0 duration.
9/7/2024 - Version 6.1.3 (macOS)
- Better accuracy for word-level timestamps.
- Word highlighting in the diarization window now has better sync with the audio.
- Changed the app name from "speechpulse" to "SpeechPulse" (please remove the previous version from the Application folder before installing this new one).
9/6/2024 - Version 6.1.2 (macOS)
- A new option to change the diarization editor font family (via "Settings->Options->General settings").
- Fixed an issue that caused SpeechPulse to crash during app exit after running speaker diarization.
- Misc. improvements.
9/5/2024 - Version 6.1.1 (macOS)
- Now highlights the current word during playback in the diarization window.
- Supports playback from an arbitrary position in the dirization window. Simply right-click on a word and select Play to start playback from the current word.
9/5/2024 - Version 6.1.1 (Windows)
- Several improvements for word highlighting in the diarization window.
9/4/2024 - Version 6.1.0 (Windows)
- Now highlights the current word during playback in the diarization window.
- Supports playback from an arbitrary position in the dirization window. Simply right-click on a word and select Play to start playback from the current word.
- Now supports the .caf (Core Audio Format) file format in File Mode.
9/2/2024 - Version 6.0.0 (macOS)
- SpeechPulse now supports a new light theme, allowing you to choose between dark and light modes.
- Now supports the .caf (Core Audio Format) file format in File Mode.
9/1/2024 - Version 6.0.0 (Windows)
- SpeechPulse now supports a new light theme, allowing you to choose between dark and light modes.
8/29/2024 - Version 5.3.8 (macOS)
- Remove any preceding spaces in the Whisper speech API responses.
8/29/2024 - Version 5.3.8 (Windows)
- Custom vocabularies and custom prompts now work with Whisper speech APIs.
- Remove any preceding spaces in the Whisper speech API responses.
8/28/2024 - Version 5.3.7 (Windows and macOS)
- Saved diarizations now use half the file size to save audio files.
8/27/2024 - Version 5.3.6 (Windows)
- Improved the speaker diarization implementation to prevent misaligned diarization boundaries.
8/26/2024 - Version 5.3.6 (macOS)
- Improved the speaker diarization implementation to prevent misaligned diarization boundaries.
8/25/2024 - Version 5.3.5 (macOS)
- File mode and speaker diarization on macOS now use available memory more efficiently. This prevents a possible slowdown on base model M series Macs with 8 GB RAM.
8/24/2024 - Version 5.3.4 (macOS)
- A new context menu option to independently edit speaker names for each cell in the diarization window.
- Fixed a decoding issue that caused speaker diarization to fail on some specific video files.
- Misc. improvements for diarization.
8/24/2024 - Version 5.3.4 (Windows)
- A new context menu option to independently edit speaker names for each cell in the diarization window.
- Misc. improvements for diarization.
8/21/2024 - Version 5.3.3 (Windows and macOS)
- Batch file transcription now ignores any invalid audio/video files and continues to process all valid files.
- Auto speaker name detection now saves speaker names in the diarization window of previously saved diarizations.
8/16/2024 - Version 5.3.2 (Windows)
- Fixed an issue that caused SpeechPulse to crash/hang on some Windows PCs/laptops.
8/15/2024 - Version 5.3.1 (macOS)
- Now you can save the speaker diarization output for future editing.
- A new option to change the font size of the speaker diarization editor.
- Misc. improvements.
8/14/2024 - Version 5.3.1 (Windows)
- Now you can save the speaker diarization output for future editing.
- A new option to change the font size of the speaker diarization editor.
- Misc. improvements.
8/10/2024 - Version 5.3.0 (macOS)
- Now supports custom vocabularies in the Auto punctuation mode.
- Speaker diarization now supports automatic speaker tagging. You only need to enter each speaker's name once. SpeechPulse will automatically add speaker names for future transcriptions.
- Fixed an issue that caused incorrect diarization and incorrect subtitles with the OpenAI Whisper API.
- Fixed an issue of missing punctuation in the Auto punctuation mode.
- Misc. improvements.
8/9/2024 - Version 5.3.0 (Windows)
- Now supports custom vocabularies in the Auto punctuation mode.
- Speaker diarization now supports automatic speaker tagging. You only need to enter each speaker's name once. SpeechPulse will automatically add speaker names for future transcriptions.
- Fixed an issue that caused incorrect diarization and incorrect subtitles with the OpenAI Whisper API.
- Fixed an issue of missing punctuation in the Auto punctuation mode.
- Misc. improvements.
8/7/2024 - Version 5.2.2 (Windows)
- Fixed an issue that caused the edit cursor to follow the mouse in the AI template and diarization windows.
- UI improvements.
8/1/2024 - Version 5.2.3 (macOS)
- UI improvements.
7/30/2024 - Version 5.2.2 (macOS)
- Fixed an issue that caused the edit cursor to follow the mouse in the AI template and diarization windows.
7/22/2024 - Version 5.2.1 (macOS)
- History file list now supports navigation with arrow keys.
- A new button to open the history folder in Finder.
- Re-added custom prompts.
- Misc. improvements.
7/16/2024 - Version 5.2.0 (Windows)
- History file list now supports navigation with arrow keys.
- A new button to open the history folder in file explorer.
- Re-added custom prompts.
7/14/2024 - Version 5.1.0 (macOS)
- Notify the user when new SpeechPulse versions are available.
7/13/2024 - Version 5.1.0 (Windows)
- Notify the user when new SpeechPulse versions are available.
- Fixed an issue that caused SpeechPulse to reset model settings.
7/12/2024 - Version 5.0.0 (macOS)
- A new UI with a modern look.
- Misc. improvements.
7/10/2024 - Version 5.0.0 (Windows)
- A new UI with a modern look.
- Misc. improvements.
Live Mode File Mode History View Speaker Diarization View
7/2/2024 - Version 4.5.6 (Windows)
- Fixed an issue that caused SpeechPulse to crash silently in some specific CPUs and Windows versions.
6/27/2024 - Version 4.5.3 (Windows and macOS)
- SpeechPulse can now process the currently copied text in the clipboard using AI language models. First, use CTRL+C to copy the text to the clipboard. Then, start clipboard processing using the clipboard processing hotkey (configurable via "Settings->Options->Hotkeys").
- Fixed an issue that caused incorrect formatting of phone numbers.
6/25/2024 - Version 4.5.2 (Windows)
- Fixed an issue that caused UI artifacts when moving SpeechPulse UI between two monitors with different DPI scaling.
6/22/2024 - Version 4.5.1 (Windows and macOS)
- Fixed an issue that caused significant text hallucinations and poor accuracy in the Auto punctuation mode.
- The automatic microphone on/off feature in the push-to-talk mode is now optional. It can be enabled/disabled via "Settings->Options->General settings".
6/20/2024 - Version 4.5.0 (Windows)
- SpeechPulse (Windows) now comes with a built-in model/library downloader.
- Model downloader can be used to download speech models, language models, and CUDA GPU libraries.
6/16/2024 - Version 4.4.0 (macOS)
- SpeechPulse (macOS) now supports OpenAI compatible Whisper speech APIs.
- Fixed an issue that prevented AI template editing.
- UI improvements.
6/14/2024 - Version 4.4.0 (Windows)
- SpeechPulse (Windows) now supports OpenAI compatible Whisper speech APIs.
- UI improvements.
6/11/2024 - Version 4.3.1 (macOS)
- SpeechPulse (macOS) now supports OpenAI compatible external AI language (LLM) APIs. For example, you can connect to your Ollama server using SpeechPulse.
- Improved internal implementation for the AI language (LLM) feature. Now delivers more precise results for your AI templates.
- Push-to-talk mode now automatically turns on/off the microphone.
- A new option to keep the speech model loaded in RAM for faster transcription.
- Misc. improvements.
6/09/2024 - Version 4.3.1 (Windows)
- Push-to-talk mode now automatically turns on/off the microphone.
- Better error handling when connecting to LLM APIs.
- Fixed an issue that caused the auto-microphone-off feature to stop working.
- UI improvements.
6/08/2024 - Version 4.3.0 (Windows)
- SpeechPulse (Windows) now supports OpenAI compatible external AI language (LLM) APIs. For example, you can connect to your Ollama server using SpeechPulse.
- Improved internal implementation for the AI language (LLM) feature. Now delivers more precise results for your AI templates.
- Fixed an issue that caused missing words near the diarization boundaries.
- Misc. improvements.
6/03/2024 - Version 4.2.0 (macOS)
- SpeechPulse (macOS) now supports automatic speaker diarization.
- File mode now supports generating all output formats in a single pass.
- A new option to limit the number of words per subtitle line.
- UI improvements.
5/26/2024 - Version 4.1.4 (macOS)
- You can configure how many days of previous recordings to keep or disable history recordings via "Settings->Options->General settings". (this option was missing in the previous version)
- A new indicator to inform users when the push-to-talk mode is active.
5/26/2024 - Version 4.2.0 (Windows)
- SpeechPulse (Windows) now supports automatic speaker diarization.
- Supports both CPU and GPU execution for diarization.
- File mode now supports generating all output formats in a single pass.
- A new option to limit the number of words per subtitle line.
- A new indicator to inform users when the push-to-talk mode is active.
- UI improvements.
- Removed the support for onnx models due to their limitations.
5/16/2024 - Version 4.1.3 (macOS)
- SpeechPulse now supports recording your dictations as WAV files inside the "Documents/SpeechPulse" folder.
- You can configure how many days of previous recordings to keep or disable history recordings via "Settings->Options->General settings".
- The new history window allows you to process your previous dictation recordings using different speech models, language models, AI Templates, or any other different settings.
- Now supports importing other compatible offline language models. For example, you can run LLAMA 3 using SpeechPulse. Just place the GGUF language model file inside the SpeechPulse models directory. SpeechPulse will automatically detect the model on startup. For AI language models on macOS, it's recommended to have at least 16GB of RAM.
- Fixed an issue that caused SpeechPulse to skip AI formatting when dictated to the built-in editor.
- UI changes for better usability on high DPI screens.
5/14/2024 - Version 4.0.1 (macOS)
- Fixed an issue that caused SpeechPulse to output text in all lowercase letters in the manual punctuation mode.
5/14/2024 - Version 4.1.3 (Windows)
- Fixed an issue that caused SpeechPulse to output text in all lowercase letters in the manual punctuation mode.
5/12/2024 - Version 4.1.2 (Windows)
- Fixed an issue that caused SpeechPulse to crash with the latest NVIDIA drivers. Now supports all the recent driver versions.
- UI improvements.
5/10/2024 - Version 4.1.0 (Windows)
- SpeechPulse now supports recording your dictations as WAV files inside the "Documents/SpeechPulse" folder.
- You can configure how many days of previous recordings to keep or disable history recordings via "Settings->Options->General settings".
- The new history window allows you to process your previous dictation recordings using different speech models, language models, AI Templates, or any other different settings.
- Now supports importing other compatible offline language models. For example, you can run LLAMA 3 using SpeechPulse. Just place the GGUF language model file inside the SpeechPulse models directory. SpeechPulse will automatically detect the model on startup.
- Fixed an issue that caused SpeechPulse to skip AI formatting when dictated to MS Word, WordPad, and the built-in editor.
- UI changes for better usability on high DPI screens.
5/05/2024 - Version 4.0.0 (macOS)
- SpeechPulse (macOS) now supports offline AI language models for text formatting (currently only for the English language). You can use them to enhance your dictated text in real time.
- Common use cases include grammar, spelling, and punctuation correction, summarizing text, formatting text for Email, chat, notes, etc.
- AI language models support prompting to get the desired output. SpeechPulse has an "AI Templates" feature where you can customize the prompts for your specific use cases.
- File mode also supports AI language models for text processing.
- A new option to customize the deactivation delay in the Automatic speech input mode.
- A new option to increase the context length of the AI language models (longer context lengths require more RAM).
- Use memory more efficiently.
5/03/2024 - Version 4.0.1 (Windows)
- A new option to increase the context length of the AI language models (longer context lengths require more RAM/VRAM).
5/02/2024
- Replaced the original English (standard) AI language model with a slightly better one.
5/01/2024 - Version 4.0.0 (Windows)
- SpeechPulse (Windows) now supports offline AI language models for text formatting (currently only for the English language). You can use them to enhance your dictated text in real time.
- Common use cases include grammar, spelling, and punctuation correction, summarizing text, formatting text for Email, chat, notes, etc.
- AI language models support prompting to get the desired output. SpeechPulse has an "AI Templates" feature where you can customize the prompts for your specific use cases.
- AI language models support both CPU and GPU execution. However, a GPU is recommended for faster live transcription.
- File mode also supports AI language models for text processing.
- A new option to customize the deactivation delay in the Automatic speech input mode.
4/25/2024 - Version 3.7.2
- Fixed an issue that caused SpeechPulse to ignore audio files with capital letters in the file extension.
4/25/2024 - Version 3.7.1
- A new option to enable/disable automatic number formatting in the manual punctuation mode.
- Better formatting for numbers that include the words million and billion.
- Fixed an issue that caused incorrect date formatting.
4/24/2024 - Version 3.7.0 (macOS)
* You may have to modify your existing custom mappings with this update (especially the mappings that include the period symbol).
- Manual punctuation mode now supports automatically formatting numbers, dates, currency values, etc.
- Fixed an issue that caused missing punctuation marks in the manual punctuation mode.
- Fixed an issue where SpeechPulse would insert a letter when the push-to-talk hotkey was released.
4/23/2024 - Version 3.7.0 (Windows)
* You may have to modify your existing custom mappings with this update (especially the mappings that include the period symbol).
- Manual punctuation mode now supports automatically formatting numbers, dates, currency values, etc.
- Fixed an issue that caused missing punctuation marks in the manual punctuation mode.
- A new option to press the ENTER key after inserting text into the text edit area.
- Allow file mode even if no microphone is connected.
- UI improvements.
4/16/2024 - Version 3.6.9 (macOS)
- A new option to press the ENTER key after inserting text into the text edit area.
- Allow file mode even if no microphone is connected.
- UI improvements.
4/14/2024 - Version 3.6.7 (macOS)
- Fixed a crash that occurred in the activation window.
- UI improvements.
4/10/2024 - Version 3.6.8 (Windows)
- Fixed a crash that occurred when switching models or quitting the program on a GPU.
- Misc. improvements.
4/9/2024 - Version 3.6.7 (Windows)
- Fixed an issue that prevented graceful termination.
- Misc. improvements.
4/8/2024 - Version 3.6.6 (macOS)
- Fixed an issue that caused SpeechPulse to return a previous transcription result for failed transcriptions.
- Preserve punctuation mode when switching languages.
- Export/import custom mappings to easily transfer mappings from one installation to another.
- A new controls overview window to explain different punctuation, spacing, and speech input modes.
- Fixed an issue that caused missing words after a hyphen in the manual punctuation mode.
- Capitalization correction now supports phrases with hyphens (e.g. Wi-Fi).
- Fixed an issue that prevented graceful termination.
- Misc. improvements.
4/7/2024 - Version 3.6.6 (Windows)
- Preserve punctuation mode when switching languages.
- Export/import custom mappings to easily transfer mappings from one installation to another.
- A new controls overview window to explain different punctuation, spacing, and speech input modes.
- Fixed an issue that prevented graceful termination.
- Faster model load times.
- Misc. improvements.
4/5/2024 - Version 3.6.5 (Windows)
- Fixed an issue that caused missing words after a hyphen in the manual punctuation mode.
- Capitalization correction now supports phrases with hyphens (e.g. Wi-Fi).
- UI improvements.
4/1/2024 - Version 3.6.4
- Fixed an issue that caused incorrect mappings.
3/29/2024 - Version 3.6.3
- Fixed an issue caused by incompatible character encodings on different systems.
3/29/2024 - Version 3.6.2 (macOS)
- Fixed an issue that caused incorrect transcription for longer audio segments.
- File mode now supports manual punctuation (For English full-text transcription).
- Misc. improvements.
3/28/2024 - Version 3.6.2 (Windows)
- File mode now supports manual punctuation (For English full-text transcription).
- Faster initial load times.
- Misc. improvements.
3/25/2024 - Version 3.6.1 (Windows)
- On Windows, prevent hotkeys from making a notification sound.
3/24/2024 - Version 3.6.0
- Allows you to add a list of proper names for capitalization correction.
- Prevents subword replacements in the Mappings feature (non-RegEx mode).
3/21/2024 - Version 3.5.4 (Windows)
- Fixed an issue that caused incorrect capitalization for the first-person pronoun "I".
3/20/2024 - Version 3.5.3
- A new option to automatically reduce system output volume during push-to-talk.
- Improved capitalization in the manual punctuation mode.
3/15/2024 - Version 3.5.2 (macOS)
- Stop hotkeys and auto spacing from making a notification sound on macOS.
3/14/2024 - Version 3.5.1
- Fixed the issue of missing period punctuation marks in the manual punctuation mode.
- Reduce text hallucinations.
3/2/2024 - Version 3.5.0
- Now supports manual spacing.
- A new option to load SpeechPulse in the background and automatically minimize to the system tray.
- A taskbar notification to indicate if the microphone is not working (need to enable via Settings).
- Two new hotkeys for push-to-talk with auto punctuation and manual punctuation.
2/26/2024 - Version 3.4.2
- Reduce text hallucinations.
2/21/2024 - Version 3.4.1
- A history page to display previously dictated text of the current session.
- Disabled Speech Profiles in the push-to-talk mode.
- UI improvements.
2/15/2024 - Version 3.4.0
- Push-to-talk speech input with a new hotkey.
2/12/2024 - Version 3.3.7
- A new option to display audio buffer filling.
- New options to enable/disable voice activity detection.
- Now supports replacing text selections with dictation.
2/06/2024 - Version 3.3.6
- Automatically transfer focus from the SpeechPulse window to the last active text edit.
- Displays a "No text edit in focus!" message if there's no text edit in focus.
2/04/2024 - Version 3.3.5
- Disabled automatic turn-off in file mode.
- UI improvements.
2/03/2024 - Version 3.3.4
- A new option to automatically stop listening when the user starts typing.
- Taskbar notifications to indicate start/stop listening.
- Notification sounds to indicate start/stop listening.
- Now remember the file mode output format between sessions.
- A new option to make the processing indicator white when dictating to a dark background.
2/01/2024 - Version 3.3.3
- Now supports customizable subtitle widths in file mode.
- Supports timestamps in live mode (With the Auto punctuation).
- Direct text insertion in Microsoft WordPad.
- Uses a faster implementation for type mode.
1/27/2024 - Version 3.3.2
- Fixed an issue that prevented text generation when dictating to Microsoft Word.
- The mappings feature now supports case-insensitive replacement and wildcards (regular expressions).
- Better capitalization in the manual punctuation mode.
1/23/2024 - Version 3.3.1
- If you are in a noisy environment and SpeechPulse generates random text, you can use the Speech Profiles feature to add a new speech profile that only detects your own voice.
- Reduce text hallucinations.
1/19/2024 - Version 3.3.0
- Now supports direct text insertion, editing, and formatting in Microsoft Word.
- A new button to minimize SpeechPulse to the system tray.
1/14/2024 - Version 3.2.0
- Improved capitalization in the manual punctuation mode.
- Improved editor with voice commands for text formatting.
- Keyboard press commands to press keys and hotkeys with your voice. (e.g. "Press Enter", "Press Control Z")
- Displays a processing indicator and a dictated text label next to the cursor for better user experience.
- Better and more reliable voice command implementation.
- A new hotkey to switch punctuation modes.
12/13/2023 - Version 3.1.0
- Now remembers language model, device, microphone, and file mode output folder settings between sessions.
12/10/2023 - Version 3.0.0
- With the manual punctuation mode, you can dictate common punctuation marks like commas, periods, question marks, exclamation marks, colons, semicolons, etc.
- Supports "new line" and "new paragraph" commands within a continuous speech segment.
12/02/2023 - Version 2.5.0
- Added language hotkeys.
- New voice command "Transfer text" for SpeechPulse editor.
- UI improvements.
11/29/2023 - Version 2.4.0
- Now comes with a built-in text editor.
11/14/2023 - Version 2.3.0
- New Custom Voice Hotkeys feature can trigger custom keyboard shortcuts with voice commands.
- Text Inserter can insert custom text snippets with voice commands.
- Misc. improvements.
11/11/2023 - Version 2.2.0
- New Custom Mappings feature can replace SpeechPulse's text output with your own words/phrases. For example, you can replace the phrase "speech pulse" with "SpeechPulse" using custom mappings.
11/08/2023 - Version 2.1.1
- Trial version now supports all language models.
- Changed the hotkey implementation.
11/06/2023 - Version 2.1.0
- Added voice commands.
- Lower latency in Live Mode.
- More robust against background noise.
11/01/2023 - Version 2.0.0
- Added hotkeys.