Question 1

What languages are supported?

Accepted Answer

Whisper handles 90+ languages and detects the spoken language automatically. You can also set it manually for best accuracy.

Question 2

Is my audio stored?

Accepted Answer

No — your audio is processed on our server to create the transcript and is not saved.

Question 3

How long can the audio be?

Accepted Answer

Clips up to roughly 10 minutes work best. For longer recordings, split them into parts.

Question 4

Which file types work?

Accepted Answer

Most common audio and video files (mp3, wav, m4a, webm, ogg, mp4, and more).

Speech to Text

About Speech to Text