Rime
Enterprise text-to-speech built for high-stakes phone calls, where a mispronounced name loses the customer.
Paid link, we may earn a commission. How this works.
Scored on the same voice-agent rubric as the full platforms, so a building block like this scores low on the axes it does not address. Read its value score against its job.
See how it stacks up · Full rankings →The accuracy specialist. Rime is a voice engine you wire into your own stack (Vapi, LiveKit, Pipecat), pitched at getting names and numbers right on calls you cannot afford to fumble. Per-character pricing from $0.03 per 1,000 characters, with a HIPAA option for healthcare.
About $0.04 to 0.10 for a minute of conversation, once the phone line and the AI are added in.
That's roughly $2.40–6.00 an hour. Plans: $0/mo (Starter).
Pricing
Show the cost breakdown
| What the platform charges to run the agent, before the phone line and the AI usage are added on. | — |
|---|---|
| The step that turns what the caller says out loud into text the AI can read. | — |
| The AI 'brain' that reads what the caller said and works out what to say back. | $0.01 /min |
| The step that turns the AI's written reply back into a spoken voice. | $0.02 /min |
| The phone line itself: the service that connects the call to a real phone number. Usually billed on top of the platform. | $0.01 /min |
| The total you actually pay for one minute of conversation once every piece is added up: the platform, the AI, the voice and the phone line. | $0.04–0.10 /min |
Rime is text-to-speech billed per character, not an all-in platform, so the per-minute figures here are a representative model, not a quoted rate. On the pricing page the three production models are Mist at $0.03/1k characters, Arcana at $0.04/1k characters and Coda at $0.05/1k characters (captured 2026-05-31). A minute of agent speech is roughly 900 to 1,000 characters, and an agent speaks for about half of a phone minute, so Mist works out near $0.015 to 0.02 of TTS per conversation-minute. Wire in your own language model (about $0.01/min) and a phone line such as Twilio (about $0.014/min) and a realistic all-in lands around $0.04 to 0.10. Rime brings no speech-to-text, language model or telephony of its own, so those are not Rime charges. The Dec 2025 launch post quoted Mist from $20/million and Arcana from $30/million on the Growth plan; the live pricing page is the figure used here. HIPAA BAA and SOC 2 reports are stated for the Enterprise tier on the pricing page; GDPR is not stated there, so it is left unticked.
Every plan in one place: the monthly fee, what each one includes, and the features it unlocks. Anything beyond a plan's allowance, or on a pay-as-you-go tier, is billed at the per-minute rate above. A blank in the features means the vendor's plan page does not state it for that plan, not that it is unavailable.
| Starter | Enterprise | |
|---|---|---|
| Price | Free | Custom |
| Included | 3,000 minutes | — |
| Plan notes | Pay as you go from $0.03/1k characters; 3,000 minutes free to start; 20 concurrent TTS generations; public Slack support. | Custom per-character rates with volume discounts; unlimited concurrency and custom voice clones; SLAs and dedicated support; cloud, on-prem or VPC; BAA (HIPAA) and SOC 2 reports. |
| What each plan unlocks | ||
| Voice cloning | — | Unlimited custom clones |
| Concurrent calls | 20 concurrent TTS generations | Up to unlimited |
| Priority support | Public Slack | SLAs + dedicated support |
- Starter Free3,000 minutes
Pay as you go from $0.03/1k characters; 3,000 minutes free to start; 20 concurrent TTS generations; public Slack support.
- Voice cloning
- —
- Concurrent calls
- 20 concurrent TTS generations
- Priority support
- Public Slack
- Enterprise Custom—
Custom per-character rates with volume discounts; unlimited concurrency and custom voice clones; SLAs and dedicated support; cloud, on-prem or VPC; BAA (HIPAA) and SOC 2 reports.
- Voice cloning
- Unlimited custom clones
- Concurrent calls
- Up to unlimited
- Priority support
- SLAs + dedicated support
Each plan bundles a set amount of talk time a month.
Prices in USD as set by the vendor · last checked 2026-06-03 · vendor pricing →
Slide your expected monthly volume to see roughly what Rime would cost.
A rough estimate from Rime's sourced rates, not a quote. Always confirm on the vendor's own pricing page before you commit.
At a glance
- Speech-to-text
- Text-to-speech
- Rime Mist, Rime Arcana, Rime Coda
- Languages
- en, es, fr, pt, de, ja, he
- Integrations
- Vapi, LiveKit, Pipecat, SignalWire, Together AI, Native API / SDK, Rime CLI
Compliance
Our full take
Rime is a voice engine, not a full agent platform. You do not log in and build a phone bot. You take Rime’s text-to-speech and plug it into something else (Vapi, LiveKit, Pipecat) that handles the call. What Rime sells is the voice itself, and specifically a voice that gets the hard words right. Its own line is “built for calls you can’t afford to get wrong,” and that is the honest centre of the pitch: pronunciation accuracy on names, medications, account numbers and addresses, the kind of detail that quietly loses a customer when an agent fumbles it.
The pricing is usage-based and billed per character, so there is no single all-in number to quote. On the pricing page today there are three production models. Mist is the low-latency one for live phone agents at $0.03 per 1,000 characters. Arcana is the multilingual, more expressive model at $0.04 per 1,000 characters. Coda is the newest flagship at $0.05 per 1,000 characters. New accounts start on Starter with 3,000 minutes free and 20 concurrent generations, then pay as they go. Enterprise is a custom per-character rate with volume discounts.
Because every comparison on this site works in per-minute terms, here is the workings. A minute of spoken agent audio is roughly 900 to 1,000 characters. In a real phone call the agent only talks for about half the time, the caller has the other half, so a conversation-minute generates closer to 450 to 500 characters of speech. On Mist at $0.03 per 1,000 that is about $0.015 to 0.02 of voice per minute. Add your own language model (call it $0.01 a minute) and a phone line such as Twilio (about $0.014 a minute) and a realistic all-in sits around $0.04 to 0.10 a minute. Two things to be clear about: that all-in is a model, not a rate Rime quotes, and the language-model and telephony parts are not Rime charges, they are what you pay other suppliers in the stack Rime sits inside.
One wrinkle worth flagging. Rime’s December 2025 launch post quoted Mist from $20 per million characters and Arcana from $30 per million on the Growth plan. The live pricing page today shows the per-model rates above (Mist at $30 per million, which is the $0.03 per 1,000 figure). Where the two disagree, this page uses the live pricing page, captured 2026-05-31. If you are negotiating, get the current rate card in writing rather than trusting either number second-hand.
On voices and reach, Rime advertises a library of 300-plus voices, with Arcana v3 carrying 94 flagship voices on its own. The language list is short though: seven languages (English, Spanish, French, Portuguese, German, Japanese and Hebrew, per the docs as of 19 May 2026). If you need to sound native across twenty markets, this is not the engine for that yet. Where it concentrates its effort is English-language realism and getting domain-specific words right, which is a deliberate trade rather than a gap.
Compliance is where Rime is interesting for the regulated buyer, with a caveat. The pricing page states that the Enterprise tier includes a BAA (HIPAA) and SOC 2 reports, alongside on-prem and VPC deployment options. Rime also leans hard into healthcare in its marketing and runs on Oracle Cloud Infrastructure for that market. So the HIPAA claim is sourced and ticked here. SOC 2 is mentioned as “reports” on the Enterprise tier, but we have not yet linked a primary certification letter or stated a Type 1 versus Type 2 level, so both SOC 2 boxes stay unticked until we see the paperwork. GDPR is not mentioned on the pricing page at all, so it is unticked too. If you are in healthcare or finance, treat the HIPAA line as the starting point of a conversation with their sales team, not the finish line, and get the BAA and the SOC 2 report scope in writing before you build.
A few practical limits. Rime brings no speech-to-text, no language model and no telephony of its own, so the headline per-character price genuinely is just the voice. That is fine if you are a developer wiring up a stack, and real setup work if you are a non-technical buyer expecting to plug in and go. There is no documented self-serve affiliate or partner programme either, so the link from this page is the plain website, not a tracked one. And the ease-of-use score reflects that Rime is a building block, not a finished product: you need a framework around it.
My read: Rime earns a place on the shortlist when the deciding factor is pronunciation accuracy on high-value, often regulated calls, and you are already building your own agent rather than buying an all-in platform. The per-character pricing is competitive, the HIPAA path is real, and Mist is built for the low-latency phone use case. Look elsewhere if you need a large multilingual voice library, voice work in languages outside the seven, or a platform that hands you the phone line and the language model in one bill.
The 1 to 10 scores and the latency figure on this page are an editorial preview, our provisional read to get the framework in place, not a measured result. We have not run Rime through our own listening or call tests yet, so there is no Voxrater latency figure here. The pricing, voice and compliance detail is sourced from Rime’s pricing, docs and homepage plus the LiveKit, Pipecat and Vapi integration pages, all captured 2026-05-31.
Rime compared
Our in-depth pieces that put Rime side by side with the field, with the sourced numbers and a clear pick.
Alternatives to Rime
Other platforms that overlap with Rime on the same kind of work, ranked by how many capabilities they share, then by cheaper all-in cost per minute. Compare any of them side by side on the compare page.
Tracking Rime? Get the next test result
We re-test and re-price the platforms we cover. Join the list and the next dated update lands in your inbox.
Newsletter launching soon.
Sources
- Rime pricing page re-captured 2026-06-02 for the quarterly re-verification; pricing reviewed against the live page (screenshot in evidence/). · captured 2026-06-02
- Rime pricing page: Mist $0.03/1k, Arcana $0.04/1k, Coda $0.05/1k chars; Starter 3,000 free minutes, 20 concurrent; Enterprise custom with BAA (HIPAA) and SOC 2 reports, on-prem/VPC · captured 2026-05-31
- New pricing launch (19 Dec 2025): Starter/Growth/Enterprise plans, Mist from $20/million and Arcana from $30/million per-character on Growth · captured 2026-05-31
- Rime voices docs (as of 19 May 2026): 7 languages (en, es, fr, pt, de, ja, he); Arcana v3 has 94 flagship voices; Mist v3/v2, Arcana v3, Coda models · captured 2026-05-31
- Rime homepage: enterprise TTS for healthcare/finance/telecom; named customer outcome stats (Domino's, ConverseNow, Fortune 500 containment) · captured 2026-05-31
- LiveKit Agents integration guide: Rime as a TTS provider · captured 2026-05-31
- Pipecat integration: RimeTTSService (WebSocket, word-level timing, interruption) and RimeHttpTTSService · captured 2026-05-31
- Vapi docs: Rime listed as a voice/TTS provider · captured 2026-05-31