AI Token Cost Calculator

To estimate the cost of an LLM API call, choose a model, enter input and output token counts, and optionally set a repeated workload (requests per day or per month). The calculator returns per-request cost, input cost, output cost, cached-prompt cost and savings, and daily and monthly projections. It works as an OpenAI token calculator, Claude token calculator, or general LLM cost calculator for ChatGPT-style API workflows.

Estimate token cost by model
Choose a provider and model, then enter the token counts you expect per request. This works well for OpenAI token cost checks, Claude token cost estimates, and repeated daily or monthly workload planning.

Calculation mode

This calculator covers OpenAI, Anthropic Claude, and Google Gemini model pricing, with multiple models available for each supported provider.

Model pricing is driven by a local pricing table verified against the official source links shown below.

Input tokens: Use billable prompt tokens before any cache savings are applied.

Output tokens: Use the completion or response tokens you expect back from the model.

Cached input tokens: Count only the input tokens billed at the cached-input rate. Cached tokens should not exceed total input tokens.

Requests: Used only in repeated mode. Enter how many requests you expect per day or per month.
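The cached-token rule above can be checked before any pricing is applied. The following is a minimal sketch, not the calculator's actual code; the function name `split_input_tokens` is hypothetical and only illustrates the constraint that cached tokens are a subset of input tokens.

```python
def split_input_tokens(input_tokens: int, cached_tokens: int = 0) -> tuple[int, int]:
    """Return (uncached_tokens, cached_tokens) for one request.

    Hypothetical helper: cached tokens are treated as part of the
    input tokens, so they can never exceed the input total.
    """
    if input_tokens < 0 or cached_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed total input tokens")
    return input_tokens - cached_tokens, cached_tokens


# Example: 10,000 prompt tokens, of which 4,000 are billed at the cached rate.
uncached, cached = split_input_tokens(input_tokens=10_000, cached_tokens=4_000)
```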

Workload horizon

Results are approximate. Provider billing rules, tokenizer differences, retries, and add-on features can change your final bill.

Pricing is currently sourced from @pydantic/genai-prices plus our model mapping layer. Official provider pricing pages remain the source of truth.

Accuracy notice

This calculator is a best-effort estimate, not an exact billing system. Prices can be incomplete or drift because model providers do not publish perfectly machine-readable pricing data for every billing rule and pricing tier.

If you are making a finance, procurement, or production rollout decision, verify the selected model against the provider's official pricing page before relying on the estimate.

Estimated cost breakdown
The result card shows per-request cost, the selected workload total, and how input, output, and cached-input pricing contribute to the estimate.
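As a rough illustration of how such a breakdown might be computed, here is a minimal sketch. It assumes prices are quoted in USD per one million tokens, which is a common convention but not something this page confirms, and the prices, class names, and function name are all made up for the example. It also assumes the cached count has already been checked against the input count.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class ModelPrice:
    """Hypothetical USD prices per one million tokens (not real rates)."""
    input_per_mtok: float
    cached_input_per_mtok: float
    output_per_mtok: float


@dataclass(frozen=True)
class CostBreakdown:
    uncached_input: float
    cached_input: float
    output: float

    @property
    def total(self) -> float:
        return self.uncached_input + self.cached_input + self.output


def estimate_request(price: ModelPrice, input_tokens: int,
                     output_tokens: int, cached_tokens: int = 0) -> CostBreakdown:
    """Estimate one request; assumes cached_tokens <= input_tokens."""
    uncached_tokens = input_tokens - cached_tokens
    return CostBreakdown(
        uncached_input=uncached_tokens * price.input_per_mtok / TOKENS_PER_MILLION,
        cached_input=cached_tokens * price.cached_input_per_mtok / TOKENS_PER_MILLION,
        output=output_tokens * price.output_per_mtok / TOKENS_PER_MILLION,
    )


# Made-up prices, for illustration only.
example_price = ModelPrice(input_per_mtok=2.00, cached_input_per_mtok=0.50,
                           output_per_mtok=8.00)
breakdown = estimate_request(example_price, input_tokens=10_000,
                             output_tokens=2_000, cached_tokens=4_000)
print(f"per-request total: ${breakdown.total:.4f}")
```

With these illustrative prices, the per-request total comes to about $0.03, and output tokens contribute the largest share even though they are the smallest count.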

Choose an OpenAI, Claude, or Gemini model to see the estimate

The calculator will estimate per-request token cost first, then scale it to a daily or monthly workload if you enable repeated mode.

How this AI token cost calculator works

The first release stays simple on purpose: you bring the token counts, the page applies the selected model's local pricing table, and the result scales to a daily or monthly workload when needed. In practice, that makes it useful as an OpenAI token calculator, Claude token calculator, or general token cost estimator.

Choose a provider and model
Pick one of the supported models from OpenAI, Anthropic Claude, or Google Gemini.
Enter token counts
Fill in input, output, and optional cached-input tokens for one request.
Scale the estimate
Switch to repeated mode when you want the same request estimate projected across a daily or monthly workload.
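The scaling step can be sketched as a simple multiplication. This is an assumption-laden example rather than the page's real logic: the function name `project_workload` is hypothetical, and converting between daily and monthly figures with a 30-day month is an assumption made here, not a rule stated by the calculator.

```python
def project_workload(per_request_cost: float, requests: int, period: str,
                     days_per_month: int = 30) -> dict[str, float]:
    """Scale a per-request estimate to daily and monthly totals.

    `period` says whether `requests` is a per-day or per-month count.
    The 30-day month is an illustrative assumption.
    """
    if requests < 0:
        raise ValueError("requests must be non-negative")
    if period == "day":
        daily = per_request_cost * requests
        monthly = daily * days_per_month
    elif period == "month":
        monthly = per_request_cost * requests
        daily = monthly / days_per_month
    else:
        raise ValueError("period must be 'day' or 'month'")
    return {"daily": daily, "monthly": monthly}


# Example: a $0.03 request repeated 500 times per day.
projection = project_workload(per_request_cost=0.03, requests=500, period="day")
print(f"daily: ${projection['daily']:.2f}, monthly: ${projection['monthly']:.2f}")
```

Because the projection is a straight multiple of the single-request estimate, any error in the token counts or prices scales with it, so it is worth checking the per-request figure first.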

Notes and boundaries

This page estimates token-based API cost from a third-party aggregated pricing dataset plus our current model mapping. It is useful for OpenAI and Claude token cost estimates, but it does not attempt to reconcile exact provider invoices, billing exports, tool-use charges, or retries.

Cached-input pricing is provider-specific. In this calculator, cached tokens are treated as a discounted portion of input tokens rather than extra tokens on top of the prompt.
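To make the "discounted portion" treatment concrete, the sketch below compares input cost with and without caching. The function name and the prices are hypothetical, and prices are assumed to be in USD per one million tokens. Cached tokens replace full-price input tokens in the calculation instead of being added to them.

```python
def cache_savings(input_tokens: int, cached_tokens: int,
                  input_per_mtok: float, cached_input_per_mtok: float) -> dict[str, float]:
    """Compare input cost with and without the cached-input discount.

    Hypothetical helper; cached tokens are a subset of input tokens.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed total input tokens")
    without_cache = input_tokens * input_per_mtok / 1_000_000
    with_cache = (
        (input_tokens - cached_tokens) * input_per_mtok
        + cached_tokens * cached_input_per_mtok
    ) / 1_000_000
    return {
        "input_cost_without_cache": without_cache,
        "input_cost_with_cache": with_cache,
        "savings": without_cache - with_cache,
    }


# Made-up prices: $2.00 per million input tokens, $0.50 per million cached tokens.
result = cache_savings(input_tokens=10_000, cached_tokens=4_000,
                       input_per_mtok=2.00, cached_input_per_mtok=0.50)
print(f"savings per request: ${result['savings']:.4f}")
```

The total number of input tokens stays at 10,000 in both cases; only the rate applied to the cached 4,000 changes.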

If you searched for a ChatGPT token calculator, treat this page as an API pricing estimator for OpenAI models rather than a calculator for ChatGPT subscription plans.

Some providers apply special long-context, batch, grounded-search, or regional-processing pricing. The assumptions panel shows what is modeled in the current estimate, but official provider pages still win when there is any mismatch.

AI token cost calculator FAQ

Can I use this page as an OpenAI token calculator?
Yes. If you choose a supported OpenAI model, the page estimates prompt, completion, and cached-input cost from the model's current pricing data, then scales the result by request volume when repeated mode is enabled.
Can I estimate Claude token cost here?
Yes. Choose a supported Anthropic Claude model and enter the input, output, and optional cached-input tokens you expect per request. The calculator then estimates per-request, daily, or monthly token cost.
Does this work as a ChatGPT token calculator?
For API usage, yes. If you are pricing OpenAI API models used in a ChatGPT-like product flow, this page works as a ChatGPT token calculator. It is not meant to estimate ChatGPT Plus, Pro, Team, or Enterprise subscription fees.
What is the difference between an OpenAI token calculator and a token cost calculator?
An OpenAI token calculator usually refers to pricing one OpenAI model. A token cost calculator is broader: it applies the same token-based math across supported providers such as OpenAI, Anthropic, and Google.
What are cached tokens?
Cached tokens are prompt tokens billed at a discounted cached-input rate when the provider supports prompt caching or an equivalent cache mechanism.
Why might this estimate differ from my actual bill?
Real bills can differ because of tokenizer differences, retries, tool calls, grounding, long-context tiers, regional processing, provider-side pricing changes, and differences between aggregated pricing data and the provider's final billing logic.
Which providers and models are included?
The page currently focuses on OpenAI, Anthropic Claude, and Google Gemini. It aims to cover their priced text and reasoning models and excludes categories such as image, audio, embedding, and other non-comparable SKUs.

Related AI planning tools

These planned tools complement the cost calculator and help you move from pricing a single request to comparing models and planning token budgets.

AI Model Pricing Comparison (coming soon)
Compare model pricing and specs side by side when the task moves from one estimate to cross-model research.
Context Budget Planner (coming soon)
Plan how much context room and token budget a recurring workload needs before you run it in production.
PDF Token Estimator (coming soon)
Estimate how many tokens a document may consume before it reaches an API workflow.