Automatic Speech Recognition
Transcribe spoken audio to text using speech recognition models
ASR models transcribe audio recordings into text and support multiple spoken languages. They can also translate non-English speech directly into English text.
Available Models
- Whisper Large v2 – OpenAI's multilingual speech recognition model with word-level timestamps
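
Word-level timestamps are typically consumed by grouping words into caption-sized segments. The sketch below illustrates one way to do that, under stated assumptions: the `Word` record, the `group_words` helper, and the `max_gap` and `max_chars` thresholds are all hypothetical names chosen for this example, and the actual response shape returned by a given ASR endpoint may differ.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Word:
    """Hypothetical shape for one timestamped word (times in seconds)."""
    text: str
    start: float
    end: float


def group_words(
    words: List[Word], max_gap: float = 0.6, max_chars: int = 42
) -> List[Tuple[float, float, str]]:
    """Group timestamped words into (start, end, text) caption segments.

    A new segment starts when the pause before a word exceeds max_gap
    seconds, or when adding the word would exceed max_chars characters.
    """
    segments: List[List[Word]] = []
    current: List[Word] = []
    for word in words:
        if current:
            gap = word.start - current[-1].end
            length = len(" ".join(w.text for w in current)) + 1 + len(word.text)
            if gap > max_gap or length > max_chars:
                segments.append(current)
                current = []
        current.append(word)
    if current:
        segments.append(current)
    # Each segment spans from its first word's start to its last word's end.
    return [
        (seg[0].start, seg[-1].end, " ".join(w.text for w in seg))
        for seg in segments
    ]
```

Splitting on pauses keeps captions aligned with natural phrasing, while the character limit prevents overly long lines; both thresholds are illustrative defaults and would need tuning for real audio.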