Convert audio file to text, returns complete recognition result
Authenticate using X-Api-Key header
Language or locale code (optional). Supports both ISO 639-1 language codes (e.g. "en", "zh") and BCP 47 locale codes (e.g. "en-US", "zh-CN"). When a language code is provided, the system will automatically resolve it to the most common locale (e.g. "en" -> "en-US"). Improves recognition performance if provided, otherwise auto-detected by service.
"en-US"
Sample rate (required, unit: Hz, e.g. 16000, 22050, 44100, 48000)
x >= 116000
Whether to return timestamps (optional, default false)
Whether to enable speaker diarization (optional, default false)
The body is of type file.
Successfully returns recognition result
Request ID
"req_1234567890"
Recognized text
"Hello, this is a test message."
Audio duration in milliseconds
2500
Creation time
"2024-01-01T12:00:00Z"
Detected language code, e.g. en, en-US
"en"
Word-level information (if timestamps are enabled)
Speaker information (if speaker diarization is enabled)