Saturday, May 22, 2021

Vosk Speech Recognition Toolkit

Vosk is a speech recognition toolkit. Its main strengths are:

  1. Supports 18 languages and dialects: English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian. More to come.
  2. Works offline, even on lightweight devices such as Raspberry Pi, Android, and iOS.
  3. Installs with a single command: pip3 install vosk
  4. Portable per-language models are only 50 MB each, and much bigger server models are also available.
  5. Provides a streaming API for the best user experience (unlike popular speech-recognition Python packages).
  6. Has bindings for other programming languages, including Java, C#, and JavaScript.
  7. Allows quick reconfiguration of the vocabulary for best accuracy.
  8. Supports speaker identification in addition to plain speech recognition.
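
To make the streaming API and vocabulary reconfiguration points above concrete, here is a minimal Python sketch. It assumes the vosk package is installed and that a model has been downloaded and unpacked locally; the helper names (build_recognizer, transcribe_stream, wav_chunks) are made up for this illustration and are not part of Vosk. Restricting the vocabulary by passing a JSON phrase list to KaldiRecognizer is assumed to work only with models that support it, which is typically the small portable ones.

```python
import json
import wave


def build_recognizer(model_path, sample_rate, phrases=None):
    """Create a Vosk recognizer, optionally restricted to a list of phrases."""
    # Imported lazily so the streaming helper below does not require vosk at import time.
    from vosk import Model, KaldiRecognizer

    model = Model(model_path)  # model_path: directory of an unpacked Vosk model (assumption)
    if phrases is None:
        return KaldiRecognizer(model, sample_rate)
    # The vocabulary is passed as a JSON array of phrases;
    # "[unk]" stands in for anything outside the list.
    grammar = json.dumps(list(phrases) + ["[unk]"])
    return KaldiRecognizer(model, sample_rate, grammar)


def transcribe_stream(recognizer, chunks):
    """Feed audio chunks to the recognizer and yield (kind, text) pairs as they arrive."""
    for chunk in chunks:
        if recognizer.AcceptWaveform(chunk):
            # The recognizer decided an utterance has ended: a finished result is ready.
            yield "final", json.loads(recognizer.Result()).get("text", "")
        else:
            # Still inside an utterance: report the current best guess.
            yield "partial", json.loads(recognizer.PartialResult()).get("partial", "")
    # Flush whatever audio is still buffered after the last chunk.
    yield "final", json.loads(recognizer.FinalResult()).get("text", "")


def wav_chunks(path, frames_per_chunk=4000):
    """Yield raw PCM chunks from a WAV file (expected: 16-bit mono PCM)."""
    with wave.open(path, "rb") as wf:
        while True:
            data = wf.readframes(frames_per_chunk)
            if not data:
                break
            yield data
```

A typical call, with hypothetical paths, would be build_recognizer("model", 16000, phrases=["turn on the light", "turn off the light"]) followed by a loop over transcribe_stream(rec, wav_chunks("test.wav")). The sample rate passed to the recognizer should match the audio file. Partial results are what make live captions possible: text can be shown while the speaker is still talking and replaced once the final result arrives.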

Documentation

See the following sections for more information:

If you have any questions, feel free to:

  • Post an issue on GitHub
  • Send us an e-mail at contact@alphacephei.com
  • Join our group dedicated to speech recognition on Telegram @speech_recognition
  • Ask to join our WeChat group, which is large and therefore invitation-only. E-mail us with some information about yourself to request an invitation.


from Hacker News https://ift.tt/346Ij9w
