Millis AI Pricing Overview

At Millis AI, we designed our pricing to be transparent and straightforward. Here’s a breakdown of how our pricing works, including the costs associated with using different Large Language Models (LLMs), Text-to-Speech (TTS), and Speech-to-Text (STT) providers.

Price Breakdown

Base Charge

We charge a base rate of $0.02 per minute for using the Millis AI platform. This rate is in addition to the fees charged by other providers such as LLM, TTS, and STT services.

LLM Model Pricing

The pricing for LLMs is based on the number of tokens processed. Here, we translate token-based pricing into an estimated cost per minute based on typical usage:

GPT-4o:
- Input: $5.00 per 1 million tokens
- Output: $15.00 per 1 million tokens
- Estimated costs per minute: $0.004 per minute
GPT-4 Turbo:
- Input: $10.00 per 1 million tokens
- Output: $30.00 per 1 million tokens
- Estimated costs per minute: $0.008 per minute
GPT-3.5 Turbo:
- Input: $0.50 per 1 million tokens
- Output: $1.50 per 1 million tokens
- Estimated costs per minute: $0.0004 per minute
Meta Llama-3:
- Input: $0.90 per 1 million tokens
- Output: $0.90 per 1 million tokens
- Estimated costs per minute: $0.00018 per minute
Your LLM: No charge for using your own custom LLM
Choose By Millis - Optimize for Best Latency:
- Millis automatically selects the best available LLM model from above with the lowest latency for your specific configuration and functions.
- This option is perfect for users looking for the most efficient performance without the need to manually switch between models.

TTS Provider Pricing

TTS pricing is based on the number of characters or words that the agent speaks. This means longer responses from your agent will cost more. Each provider has its own pricing structure:

Service	Cost per 1,000 Characters (USD)	Approx. Cost per Minute (USD)
Eleven Labs	0.10	~$0.05
OpenAI	0.015	~$0.0075
Cartesia	0.0392	~$0.0196
PlayHT	0.25	~$0.125
Deepgram	0.015	~$0.0075
Neets	0.001	~$0.0005
Rime	0.075	~$0.0375

A typical minute of voice interaction might involve processing around 500 characters.

We offer volume discounts for customers with high usage, which can reduce Eleven Labs costs by up to 25%. Reach out if you’re interested.

STT Pricing

For converting spoken inputs into text, we charge $0.0043 per minute.

Example Calculation

Let’s calculate the total cost for a 10-minute session using GPT-4o and Eleven Labs TTS, with an average speech rate. Assuming each party speaks for approximately 5 minutes during the 10-minute session:

Millis AI base charge: 10 min x $0.02/min =$ 0.20
GPT-4o LLM charge: 10 min x $0.004/min =$ 0.04
Eleven Labs TTS charge: Since the agent only speaks for about half of the session (5 minutes), and assuming an average speech rate translates to 4,000 characters spoken by the agent:
- 4,000 characters x $0.10/1,000 characters =$ 0.40
STT charge: Since the human also speaks for approximately half of the session (5 minutes):
- 5 minutes x $0.0043/min =$ 0.0215
Total cost: $0.20 (base) +$ 0.04 (LLM) + $0.40 (TTS) +$ 0.0215 (STT) = $0.6615

Overview

Core Concepts

Integration Guides

Telephony

Millis AI Pricing Overview

Price Breakdown

Base Charge

LLM Model Pricing

TTS Provider Pricing

STT Pricing

Example Calculation

Overview

Core Concepts

Integration Guides

Telephony

​Price Breakdown

​Base Charge

​LLM Model Pricing

​TTS Provider Pricing

​STT Pricing

​Example Calculation

Price Breakdown

Base Charge

LLM Model Pricing

TTS Provider Pricing

STT Pricing

Example Calculation