AI voice agent pricing compared: why the headline rate lies
Vapi, Retell, and most voice AI platforms bill per component, so the headline per-minute number isn't your real cost. Here's how pricing actually works — and how flat-rate compares.
The first thing to know about AI voice agent pricing is that the headline number usually isn't your real cost. Most platforms bill per component — so the rate on the homepage is one slice of an invoice, not the whole thing. Here's how the models actually work, and how to compare them honestly.
Call2Me is flat $0.10/min voice + $0.05/min telephony — one number, no spreadsheet. $5 in free credits, no card.
The two pricing models
Per-component (usage-based). This is how most developer-first platforms bill. Each layer — speech-to-text (STT), the LLM, text-to-speech (TTS), and telephony — has its own rate, often from a different provider.
- Vapi charges a $0.05/min platform fee on top of the STT/LLM/TTS/telephony you assemble. The fee is predictable; the components on top are not, because they depend on the models you pick. (Vapi alternative →)
- Retell bills per component too — STT, LLM, TTS, and telephony each at their own rate. (Retell AI alternative →)
- Bland leans on a more bundled, proprietary stack, which trades component-level visibility for a simpler-looking bill. (Bland AI alternative →)
The upside of per-component pricing is flexibility — you only pay for the exact models you use. The downside is predictability: a month's cost is a small spreadsheet you reconcile across providers, and two teams on the "same" plan can pay very different amounts.
Flat-rate. Call2Me bills a flat $0.10/min for the voice pipeline (STT + LLM + TTS), plus $0.05/min when a phone number is involved, plus $0.05/min for recording. No per-provider reconciliation — your cost is minutes × a rate you already know.
How to actually compare cost
Headline rates are marketing; invoices are reality. To compare honestly:
- Build the same agent on each candidate. Same prompt, same rough voice quality.
- Run a few hundred real minutes. Short test calls under-represent LLM and TTS usage.
- Read the invoice, not the homepage. Add up every line — STT, LLM, TTS, telephony, recording, storage.
- Model your real volume. A flat rate is trivial to project; a per-component bill you should extrapolate from the test invoice, not the headline.
With modern platforms the difference in caller experience is small, so the decision usually comes down to pricing predictability and setup time — which is exactly where flat-rate, batteries-included platforms separate from composable ones. For the full landscape, see our best AI voice agents guide.
So which pricing model should you pick?
- Per-component (Vapi, Retell) if you want to pay only for the exact models you use and don't mind reconciling a multi-line bill — great for teams optimizing every layer.
- Flat-rate (Call2Me) if you'd rather model your cost in one multiplication and avoid surprise line items — great for teams that want predictability and a fast start.
Build your agent on Call2Me free — $5 in credits, no card — and the per-minute math is the rate on the page. No spreadsheet required.
Frequently asked
Q.How much does an AI voice agent cost per minute?
It depends on the billing model. Platforms like Vapi and Retell bill per component — speech-to-text, the LLM, text-to-speech, and telephony each have their own rate — so the real per-minute cost is the sum, often well above the headline. Call2Me is a flat $0.10/min for the voice pipeline plus $0.05/min when a phone number is involved, so your per-minute cost is one number you can predict.
Q.Why is Vapi's pricing hard to predict?
Vapi's $0.05/min is a platform fee, charged on top of the STT, LLM, TTS, and telephony you assemble — each billed separately, often by a different provider. Your monthly total depends on which models you pick and how chatty your calls are, so two teams on the same plan can pay very different amounts.
Q.What's the cheapest AI voice agent?
There's no single cheapest, because per-component platforms vary with the models you choose. The only honest comparison is to build the same agent on each candidate and run a few hundred real minutes, then read the actual invoice rather than the headline rate. Flat-rate pricing like Call2Me's is easiest to model in a spreadsheet up front.
Q.Does call recording or telephony cost extra?
On per-component platforms, telephony is almost always a separate line, and recording or storage can be too. Call2Me keeps it explicit: $0.10/min voice pipeline, +$0.05/min when a phone number is involved, +$0.05/min for recording — so there are no surprise line items at the end of the month.
Keep reading
All posts- Comparison
Best AI voice agents in 2026: an honest, use-case-by-use-case guide
Vapi, Retell, Bland, Synthflow, Air, and Call2Me each win for a different team. Here's which AI voice agent platform fits which use case — no fake rankings, just the real trade-offs.
Jun 11, 20262 min - Comparison
A Vapi alternative with flat pricing and a built-in knowledge base
Vapi gives you maximum composability. If assembling STT/LLM/TTS and your own RAG is more work than you wanted — and a four-line invoice is hard to predict — here's the honest comparison.
Jun 11, 20264 min - Comparison
Vapi vs Retell: how they differ, and the flat-rate third option
Vapi and Retell are both developer-first voice AI platforms with usage-based billing. Here's where they actually differ — and where a batteries-included, flat-priced alternative fits.
Jun 11, 20263 min