Question 1

How is LLM API cost calculated?

Accepted Answer

Providers bill per token, separately for input (your prompt plus any context) and output (the model's response). Cost per request is (input tokens ÷ 1M × input price) + (output tokens ÷ 1M × output price). Multiply by your request volume to get daily, monthly, and annual cost. This calculator does that across the major models so you can compare.

Question 2

Why is my AI bill higher than expected?

Accepted Answer

Usually one of three things: bloated prompts and context (input tokens add up fast when you stuff in documents or chat history), using a top-tier model for routine work that a cheaper model handles fine, and no caching of repeated requests. Long system prompts sent on every call are a common hidden cost.

Question 3

What is the cheapest way to run an AI feature?

Accepted Answer

Route simple or high-volume requests to a small, cheap model (GPT-4o mini, Claude Haiku, Gemini Flash) and reserve the expensive models for genuinely hard tasks. Cache identical or near-identical responses, trim context to what the model actually needs, and batch where latency allows. Those three moves often cut a bill by more than half.

Question 4

How many tokens is a typical request?

Accepted Answer

Roughly, one token is about 0.75 words (or about 4 characters). A short chat turn might be a few hundred tokens; a retrieval-augmented request with injected documents can be several thousand input tokens. Output is whatever the model generates. Check your provider dashboard for real per-request numbers.

Question 5

What if I do not know my token counts?

Accepted Answer

Use the "Paste my text" mode: drop in your system prompt, a typical user message, and a typical response, and the calculator counts the tokens for you. Or copy the ready-made prompt on the page and give it to ChatGPT, Claude, or an AI agent with access to your app — it will return the system, input, and output token values to plug in.

Question 6

Are these prices exact?

Accepted Answer

They are approximate list prices for planning and change often, so confirm current rates with your provider. Real spend also depends on caching, retries, batching, and system-prompt overhead. Use this to ballpark a budget and compare models, not as a billing forecast.

AI / LLM API
Cost Calculator

How do you want to enter usage?

Model

What one request looks like

How much it gets used

Don't know your numbers?

Same workload, by model

The bill that surprises founders

Frequently asked questions

How is LLM API cost calculated?

Why is my AI bill higher than expected?

What is the cheapest way to run an AI feature?

How many tokens is a typical request?

What if I do not know my token counts?

Are these prices exact?

Want your AI feature to be cheap and reliable?

Turn your idea into revenue

AI / LLM APICost Calculator

How do you want to enter usage?

Model

What one request looks like

How much it gets used

Don't know your numbers?

Same workload, by model

The bill that surprises founders

Frequently asked questions

How is LLM API cost calculated?

Why is my AI bill higher than expected?

What is the cheapest way to run an AI feature?

How many tokens is a typical request?

What if I do not know my token counts?

Are these prices exact?

Want your AI feature to be cheap and reliable?

Turn your idea into revenue

AI / LLM API
Cost Calculator