vosk-api

Open source, Apache-2.0. Primary language: Jupyter Notebook. 14.8K stars, 1.7K forks, 592 issues, 142 watchers. Last commit: 1 week ago.

About vosk-api

Vosk-api is an offline, open-source speech recognition toolkit that runs in a wide range of environments, from small devices such as Android smartphones, iOS devices, and the Raspberry Pi to large server clusters and data centers. It supports continuous, large-vocabulary transcription with zero-latency response through a streaming API, and it offers a reconfigurable vocabulary and speaker identification. It currently supports more than 20 languages and dialects, including English, Spanish, French, German, Chinese, Russian, Japanese, and Hindi, with compact models of about 50 MB. Vosk provides bindings for many programming languages, including Python, Java, C, Node.js, C++, Rust, and Go. Typical use cases include chatbots, smart home appliances, and virtual assistants, as well as generating subtitles for media and transcribing lectures or interviews. Its ability to operate without an…

Platforms

Web, Self-hosted, iOS, Android

Languages

Jupyter Notebook

Vosk Speech Recognition Toolkit

Vosk is an offline, open-source speech recognition toolkit. It enables speech recognition for 20+ languages and dialects: English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech, and Polish. More to come.

Vosk models are small (about 50 MB) but provide continuous large-vocabulary transcription, zero-latency response through a streaming API, a reconfigurable vocabulary, and speaker identification.
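As a rough illustration of the streaming API and the reconfigurable vocabulary, here is a minimal Python sketch. It assumes the `vosk` package is installed and a model has been downloaded and unpacked locally. The helper names `make_recognizer` and `stream_transcribe` and the `phrases` parameter are hypothetical and exist only for this example. The recognizer calls (`AcceptWaveform`, `PartialResult`, `Result`, `FinalResult`) are those of the Python binding.

```python
import json


def make_recognizer(model_path, sample_rate, phrases=None):
    """Build a Vosk recognizer (needs the `vosk` package and a local model)."""
    # Imported lazily so stream_transcribe below can be used without vosk.
    from vosk import Model, KaldiRecognizer

    model = Model(model_path)
    if phrases is None:
        return KaldiRecognizer(model, sample_rate)
    # Restrict recognition to the given phrases; "[unk]" absorbs other words.
    grammar = json.dumps(list(phrases) + ["[unk]"])
    return KaldiRecognizer(model, sample_rate, grammar)


def stream_transcribe(recognizer, chunks):
    """Feed audio chunks one by one and yield (kind, text) pairs as they arrive."""
    for chunk in chunks:
        if recognizer.AcceptWaveform(chunk):
            # The recognizer detected the end of an utterance.
            text = json.loads(recognizer.Result()).get("text", "")
            if text:
                yield ("final", text)
        else:
            # Still inside an utterance: report the hypothesis so far.
            partial = json.loads(recognizer.PartialResult()).get("partial", "")
            if partial:
                yield ("partial", partial)
    # Flush whatever audio is still buffered after the last chunk.
    text = json.loads(recognizer.FinalResult()).get("text", "")
    if text:
        yield ("final", text)
```

In practice the chunks would typically be blocks of 16-bit mono PCM read from a WAV file or a microphone, and the sample rate passed to `make_recognizer` would have to match that audio. Partial results are what make the response feel immediate: text is available before the speaker finishes the utterance.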

Speech recognition bindings are implemented for various programming languages, including Python, Java, Node.js, C#, C++, Rust, Go, and others.

Vosk supplies speech recognition for chatbots, smart home appliances, and virtual assistants. It can also create subtitles for movies and transcriptions of lectures and interviews.
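For the subtitle use case, one possible approach is to group word-level timings into SRT cues. The sketch below is not part of Vosk. It assumes each word is a dict with `word`, `start`, and `end` keys (times in seconds), which is the shape the recognizer is expected to return when word timings are enabled. The function names and the `words_per_cue` parameter are hypothetical.

```python
def format_timestamp(seconds):
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    ms = int(round(seconds * 1000))
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words, words_per_cue=7):
    """Group timed words into numbered SRT cues of at most words_per_cue words."""
    cues = []
    for i in range(0, len(words), words_per_cue):
        group = words[i:i + words_per_cue]
        start = format_timestamp(group[0]["start"])  # first word starts the cue
        end = format_timestamp(group[-1]["end"])     # last word ends the cue
        text = " ".join(w["word"] for w in group)
        cues.append(f"{len(cues) + 1}\n{start} --> {end}\n{text}\n")
    # A blank line separates consecutive cues.
    return "\n".join(cues)
```

Splitting on a fixed word count keeps the example short. A real subtitle tool would more likely break cues at pauses or sentence boundaries.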

Vosk scales from small devices such as a Raspberry Pi or an Android smartphone to large clusters.

Documentation

For installation instructions, examples, and documentation, visit the Vosk website.