📖 The AI Tool Bible

Whisper

OpenAI's open-source speech-to-text — the de-facto baseline.

FreeAudioWhisper large-v38.6 / 10
Visit website →

Whisper is OpenAI's open-source speech recognition model. Free to self-host, multilingual, and the baseline against which everything else is measured. Also available via OpenAI's API for those who don't want to run a GPU.

Pros

  • Free, open weights
  • Multilingual
  • Strong baseline accuracy

Cons

  • ⚠️ No diarisation built in
  • ⚠️ Hallucinations on silent segments

Use cases

transcriptionself-hostedmultilingual

Compare with similar tools

All in Audio