API pricing
Transparent, usage-based pricing for the all-in-one audio API. One wallet, every capability — credits never expire and a failed job is never charged.
The wedge
The fairest economics in audio AI — designed so experimenting costs nothing and production stays predictable.
Create a key and get $1 of free API credit on your first key. Build before you pay.
Every job is prepaid and refunded automatically if processing fails. You only pay for output.
Top up your API wallet once; the balance carries forever. No monthly reset, no use-it-or-lose-it.
Speech, transcription, music, stems, voices and more — billed from a single wallet, no per-product contracts.
Rates
Billed per minute of audio processed, deducted from your prepaid API wallet. No tiers to decode, no seats, no minimums.
| Capability | Per minute | Per hour | Minutes per $1 |
|---|---|---|---|
| Speech to Text | $0.01/min | $0.60 | 100 min |
| Text to Speech | $0.04/min | $2.40 | 25 min |
| Music Generation | $0.04/min | $2.40 | 25 min |
| Voice Cloning | $0.04/min | $2.40 | 25 min |
| Stem Separation | $0.10/min | $6.00 | 10 min |
| Speaker Separation | $0.20/min | $12.00 | 5 min |
| Voice Conversion | $0.13/min | $7.80 | 7.7 min |
| Speech Translation | $0.40/min | $24.00 | 2.5 min |
| Media Conversion | $0.01/min | $0.60 | 100 min |
OpenAI-compatible /v1/audio/speech, /transcriptions and /translations bill at the Text to Speech and Speech to Text rates above.
Calculator
Enter the minutes you expect to process per month. Live rates, no signup.
Enter the minutes of audio you expect to process per month.
Speech to Text
$0.01/min
Text to Speech
$0.04/min
Music Generation
$0.04/min
Voice Cloning
$0.04/min
Stem Separation
$0.10/min
Speaker Separation
$0.20/min
Voice Conversion
$0.13/min
Speech Translation
$0.40/min
Media Conversion
$0.01/min
Estimated monthly total
Prepaid from your API wallet · failed jobs are never charged.
FAQ
Create an API key, claim your $1 of free credit, and ship your first call in minutes.