Runtime Speech Recognizer
Documentation for the Runtime Speech Recognizer plugin. Get it on Fab.
📄️ Overview
Runtime Speech Recognizer Documentation
📄️ How to use the plugin
The Runtime Speech Recognizer plugin is designed to recognize words from incoming audio data. It uses a slightly modified version of whisper.cpp to work with the engine. To use the plugin, follow these steps:
📄️ How to use language models
Selecting, downloading and packaging models
📄️ Recognition parameter list
These parameters can only be set while the recognizer is not running.
📄️ Supported languages
This is the complete list of languages supported by the available language models.
📄️ Commands recognition
Compute Levenshtein Similarity
📄️ Minimizing freezes
This guide addresses two common sources of freezing in the RuntimeSpeechRecognizer plugin and provides practical solutions to mitigate the performance impact.
📄️ Troubleshooting
Most of the issues are related to the language model staging, which can cause problems in the packaged build. Namely, you might encounter the following logs:
📄️ Demo Project
Note: RuntimeAudioImporter is required for this demo to obtain audio data as an example. Please download this plugin before using the demo (Fab).
📄️ How to improve performance
1. Decrease Step Size