Fastest model. Best for short files under 30 minutes that do not require accurate timestamps.
Most reliable model. Best for both transcriptions and captions workflows
Most versatile model. Best for recordings with noisy environments, dialects and accents, singing voices, and Chinese/Cantonese.
Most accurate model. Best for clean audio recordings that require best-in-class accuracy
Please do not include personally identifiable information in the job name or file names.
Language options slightly differ based on model type.