Free Finance Tool

LLM API Cost Calculator

Model monthly token spend across Claude, GPT-5, and Gemini. See your effective cost per request, monthly burn, and gross margin impact.

8 model families · Prompt cache impact · COGS % of revenue

Model & Pricing

Traffic

Optimization

Cached input tokens get a 90% discount on most providers
Monthly LLM API Spend
$5,040
$61,320 per year
Daily spend
$168
Cost per request
$0.0168
% of revenue (COGS impact)
Caution
10.1%
Under 8% maintains SaaS-typical 65-75% gross margins. Above 15% pushes you into AI-services territory.
Token usage
Monthly input tokens: 750,000,000
Monthly output tokens: 240,000,000
Effective input price: $1.92 / 1M
Output price: $15.00 / 1M
Pricing reflects April 2026 published rates and may have changed since. A 40% prompt-cache hit rate is conservative for well-engineered systems; heavy-system-prompt workloads commonly reach 60-80%.
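The calculator's headline numbers above can be reproduced with a few lines. This is a sketch assuming the example's inputs: $3.00/1M list input price (Sonnet-class), $15.00/1M output price, a 40% cache hit rate, and a 90% discount on cached input tokens.

```python
# Sketch: reproduce the calculator's headline numbers.
# Assumed inputs mirror the worked example above; prices are $/1M tokens.
input_tokens = 750_000_000
output_tokens = 240_000_000
input_price = 3.00          # uncached (list) input price
output_price = 15.00
cache_hit_rate = 0.40
cache_discount = 0.90       # cached input tokens cost 10% of list price

# Blend cached and uncached input tokens into one effective rate.
effective_input_price = input_price * (1 - cache_hit_rate * cache_discount)

monthly = (input_tokens / 1e6) * effective_input_price \
        + (output_tokens / 1e6) * output_price

print(f"${effective_input_price:.2f} / 1M effective input")  # $1.92 / 1M
print(f"${monthly:,.0f} per month")                          # $5,040 per month
```

Swapping in your own token volumes and list prices gives the same monthly figure the tool computes.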

Why LLM Costs Need Their Own Tracking

LLM API spend is the single fastest-growing line item in startup budgets — up 380% year-over-year for the median company in 2026. Unlike per-seat SaaS, API spend scales with product traffic, not headcount. A successful product launch can spike monthly LLM bills from $2,000 to $40,000 within weeks, and founders without dedicated tracking get blindsided by the bill before they understand the unit economics.

The most important question isn't "how much do we spend on AI?" — it's "what percentage of revenue does our LLM spend represent?" If your AI feature is customer-facing, every token is cost of goods sold. Keep that ratio below 12% to maintain SaaS-typical 65-75% gross margins. Above 15%, gross margins compress to 50-60% and investors begin pricing your company as AI-services rather than SaaS — a multiple haircut of 30-50%. Cross-reference with our AI tooling spend benchmark for stage-specific context.
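As a sketch, the ratio and the margin bands described above can be checked directly; the function name and revenue figure here are illustrative, not a standard metric.

```python
# Sketch: LLM spend as a % of revenue, with the thresholds quoted above.
def llm_cogs_ratio(monthly_llm_spend: float, monthly_revenue: float) -> float:
    """Return LLM API spend as a percentage of monthly revenue."""
    return 100 * monthly_llm_spend / monthly_revenue

ratio = llm_cogs_ratio(5_040, 50_000)   # e.g. $50k MRR (illustrative)
print(f"{ratio:.1f}% of revenue")       # 10.1% of revenue
if ratio < 12:
    print("SaaS-typical 65-75% gross margins remain achievable")
elif ratio <= 15:
    print("Margin pressure: prioritize caching and model mix")
else:
    print("AI-services territory: expect a valuation multiple haircut")
```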

Prompt caching is the highest-leverage optimization most teams skip. Anthropic and OpenAI charge 90% less for cached input tokens, with a cache TTL of roughly 5 minutes. Well-engineered systems with stable system prompts and shared context achieve 60-80% cache hit rates, cutting input token costs by 54-72% with zero behavior change. Use the slider above to see how cache hit rate moves your monthly bill. Pair this with our AI agent operating cost calculator if you also have agent platform fees on top of raw API spend.
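The savings figures above follow directly from the discount math: at a flat 90% discount on cached input tokens, input-cost savings equal hit rate times 0.9. A minimal sketch:

```python
# Sketch: input-token savings as a function of prompt-cache hit rate,
# assuming a flat 90% discount on cached input tokens.
CACHE_DISCOUNT = 0.90

def input_savings(hit_rate: float) -> float:
    """Fraction of input-token cost saved at a given cache hit rate."""
    return hit_rate * CACHE_DISCOUNT

for hit in (0.40, 0.60, 0.80):
    print(f"{hit:.0%} hit rate -> {input_savings(hit):.0%} cheaper input tokens")
# 40% -> 36%, 60% -> 54%, 80% -> 72%, matching the ranges quoted above.
```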

Model selection is the other major lever. The cost gap between frontier models (Opus, GPT-5) and fast models (Haiku, GPT-5 mini, Gemini Flash) is 10-30x. Most production workloads do fine on cheap models for 70-80% of requests, reserving frontier models for hard reasoning tasks. Build a model-routing layer once your LLM bill crosses $5,000 per month — payback is typically under 60 days. See how this affects your runway with our burn rate calculator.
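To see why routing pays back quickly, a blended-price sketch helps. Prices here are illustrative ($/1M input tokens, roughly Haiku-class vs Opus-class); "cheap_share" is the fraction of requests the router sends to the cheap model.

```python
# Sketch: blended input price under a two-tier model-routing layer.
# Prices are illustrative $/1M input tokens, not quotes for any provider.
def blended_price(cheap: float, frontier: float, cheap_share: float) -> float:
    """Average price per 1M tokens when cheap_share of traffic routes cheap."""
    return cheap_share * cheap + (1 - cheap_share) * frontier

blended = blended_price(cheap=1.00, frontier=15.00, cheap_share=0.75)
print(f"${blended:.2f} / 1M blended vs $15.00 frontier-only")
# Routing 75% of requests to a ~15x-cheaper model cuts this line ~3x.
```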

Frequently Asked Questions

How do you calculate LLM API costs?

LLM API costs equal (monthly input tokens / 1,000,000 × input price) plus (monthly output tokens / 1,000,000 × output price). Most providers price input and output tokens differently, with output tokens 3-5x more expensive. Prompt caching can discount cached input tokens by 90% on Anthropic and OpenAI. Multiply per-request token usage by your monthly request volume and apply the formula across all model tiers you use.

What percentage of revenue should LLM API costs be?

Keep customer-facing LLM API spend below 12% of revenue to maintain SaaS-typical 65-75% gross margins. Above 15% of revenue, gross margins compress to 50-60% and investors begin pricing the company as AI-services rather than SaaS. See our SaaS gross margin improvement guide for tactical levers.

How does prompt caching reduce LLM costs?

Prompt caching reuses processed input tokens across requests with the same prefix (system prompts, tool definitions, RAG context). Anthropic and OpenAI charge 90% less for cached input tokens, with cache TTL of roughly 5 minutes. Well-engineered systems achieve 40-80% cache hit rates, which can cut input token costs by 36-72% without changing the underlying model behavior.

What is the cheapest LLM API for production?

Among frontier models, Gemini 2.5 Flash is the cheapest at $0.50/$3 per 1M input/output tokens, followed by Claude Haiku 4.5 at $1/$5. For most production workloads, the right choice is the cheapest model that meets quality requirements. Many production systems use cheap models for routine work and reserve frontier models for hard reasoning tasks via a model-routing layer.

Track LLM Spend in Your Live Financials

Categorize API spend by vendor, model tier, and feature — see real-time gross margin impact alongside every other line item.

Start Free Trial