An open-source automatic speech recognition system from OpenAI that converts audio into text with high accuracy.
Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. It transcribes speech in many languages, translates non-English speech into English, and identifies the language being spoken, with strong recognition performance across the languages it covers.
Multilingual speech recognition
Translation capabilities
Robust to accents and background noise
Open-source accessibility
Available in various model sizes
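As a rough illustration of the features above, the sketch below shows how transcription and translation might be invoked through the open-source `openai-whisper` Python package. It assumes the package and `ffmpeg` are installed. The helper names `build_options` and `run_whisper`, the `MODEL_SIZES` tuple, and the file name `meeting.mp3` are hypothetical and chosen only for this example. The `whisper` import is deferred so the option helper can be used without the package installed.

```python
from typing import Optional, Tuple

# Model size names assumed for this sketch; check the package's own list
# before relying on them.
MODEL_SIZES = ("tiny", "base", "small", "medium", "large")


def build_options(task: str = "transcribe", language: Optional[str] = None) -> dict:
    """Build keyword options for a transcription call (hypothetical helper).

    task: "transcribe" keeps the spoken language, "translate" outputs English.
    language: optional language code such as "de"; when omitted, the model
    is left to identify the language itself.
    """
    if task not in ("transcribe", "translate"):
        raise ValueError(f"unsupported task: {task!r}")
    options = {"task": task}
    if language is not None:
        options["language"] = language
    return options


def run_whisper(
    audio_path: str,
    model_size: str = "base",
    task: str = "transcribe",
    language: Optional[str] = None,
) -> Tuple[str, str]:
    """Return (text, language code) for one audio file (hypothetical helper)."""
    if model_size not in MODEL_SIZES:
        raise ValueError(f"unknown model size: {model_size!r}")

    # Imported lazily so the rest of the module works without the package.
    import whisper

    model = whisper.load_model(model_size)
    result = model.transcribe(audio_path, **build_options(task, language))
    return result["text"], result["language"]


# Example calls (not executed here; "meeting.mp3" is a placeholder path):
#   text, lang = run_whisper("meeting.mp3")                    # transcribe
#   english, lang = run_whisper("meeting.mp3", task="translate")
```

Smaller model sizes generally load and run faster at some cost in accuracy, so starting with a small one and moving up only if the output quality is insufficient is a reasonable default.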
Automatic Speech Recognition