Updated March 2026
Claude API Pricing in 2026
Anthropic offers three Claude models ranging from $0.80 to $15 per million input tokens. This guide breaks down every cost, explains caching and batch discounts, and helps you choose the right model for your workload.
- Haiku 3.5: $0.80/MTok input
- Sonnet 4: $3.00/MTok input
- Opus 4: $15.00/MTok input
Complete Claude API Pricing Table
All prices are per 1 million tokens. Prompt caching write costs 25% more; cache reads cost 90% less. Batch API is 50% off standard prices.
| Model | Input | Output |
|---|---|---|
| Claude Opus 4 | $15.00 | $75.00 |
| Claude Sonnet 4 (Most Popular) | $3.00 | $15.00 |
| Claude Haiku 3.5 | $0.80 | $4.00 |
How Claude API Pricing Works
The Claude API uses token-based pricing. Every piece of text you send to the API (your prompt, system instructions, conversation history) is split into tokens—small chunks of text that the model processes. You pay separately for input tokens (what you send) and output tokens (what Claude generates back).
Output tokens are always more expensive than input tokens because generating text requires more computation than reading it. On Claude Sonnet 4, output costs 5x more than input ($15 vs $3 per million tokens). This means a long prompt with a short response is cheaper than a short prompt with a long response.
Prices are quoted per million tokens (MTok). To put that in perspective: one million tokens equals roughly 750,000 words, which is about 1,500 pages of standard text or roughly 10-15 full-length novels. A single customer support interaction typically uses 500-2,000 input tokens and 200-1,000 output tokens—a tiny fraction of a million.
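Using the rough rule of thumb above (one million tokens is about 750,000 words), a quick estimator can turn a word count into an approximate token count. This is a sketch only; actual tokenization varies by content:

```python
def estimate_tokens(word_count: int) -> int:
    """Rough token estimate from the rule of thumb:
    1,000,000 tokens ~= 750,000 words (about 1.33 tokens per word)."""
    return round(word_count * 1_000_000 / 750_000)

# A 1,500-word article is roughly 2,000 tokens
print(estimate_tokens(1_500))  # → 2000
```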
The cost formula
cost = (input tokens ÷ 1,000,000 × input price per MTok) + (output tokens ÷ 1,000,000 × output price per MTok)
Example: 2,000 input tokens + 500 output tokens on Sonnet 4 = (2,000 / 1M × $3) + (500 / 1M × $15) = $0.0135 per request.
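The formula above can be sketched as a small helper. The prices are passed in as parameters, so the same function works for any model in the table:

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in dollars for one request; prices are per million tokens (MTok)."""
    return (input_tokens / 1_000_000) * input_price \
         + (output_tokens / 1_000_000) * output_price

# The Sonnet 4 example from the text: 2,000 input + 500 output tokens
cost = request_cost(2_000, 500, input_price=3.00, output_price=15.00)
print(f"${cost:.4f}")  # → $0.0135
```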
There is no monthly subscription, no minimum spend, and no commitment. You pay only for the tokens you consume. Anthropic bills based on actual usage, making it easy to start small and scale as your application grows. You can also reduce costs significantly through prompt caching (save up to 90% on input) and the Batch API (50% off everything).
Quick Cost Examples
What common API requests actually cost, calculated with real pricing.
Chatbot conversation
500 input + 200 output tokens on Sonnet 4
<$0.01
per request
Summarise a 50-page document
~30,000 input + 1,000 output tokens on Sonnet 4
$0.10
per request
10,000 support tickets via Batch API
2,000 input + 500 output each on Haiku Batch (50% off)
$18.00
total
The batch figure is the combined cost for all 10,000 tickets ($18.00 total, about $0.0018 per ticket), not per request.
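The arithmetic behind that $18.00 figure, assuming Haiku 3.5 batch prices of half the standard $0.80 input / $4.00 output per MTok:

```python
# Haiku 3.5 Batch API prices: 50% off the standard $0.80 / $4.00 per MTok
BATCH_INPUT, BATCH_OUTPUT = 0.40, 2.00

tickets = 10_000
# Each ticket: 2,000 input + 500 output tokens
per_ticket = (2_000 / 1_000_000) * BATCH_INPUT + (500 / 1_000_000) * BATCH_OUTPUT
total = per_ticket * tickets
print(f"${per_ticket:.4f} per ticket, ${total:.2f} total")
# → $0.0018 per ticket, $18.00 total
```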
Choose the Right Claude Model
Each model sits at a different point on the cost-quality spectrum. Pick the cheapest one that meets your quality bar.
Claude Sonnet 4
$3 input / $15 output per MTok
The sweet spot. Handles complex coding, analysis, and content generation with quality close to Opus at a fraction of the price.
Best for:
- Production chatbots and assistants
- Code generation and review
- Content creation at scale
- RAG-powered Q&A systems
Claude Opus 4
$15 input / $75 output per MTok
Maximum intelligence for tasks where quality matters more than cost. Excels at multi-step reasoning, research synthesis, and architectural decisions.
Best for:
- Legal and scientific analysis
- Complex multi-step reasoning
- Agentic coding workflows
- High-stakes decision support
Claude Haiku 3.5
$0.80 input / $4 output per MTok
Fast and affordable. Purpose-built for high-volume workloads where speed and cost matter more than deep reasoning.
Best for:
- Classification and routing
- Entity extraction
- Content moderation
- Real-time data formatting
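The advice above ("pick the cheapest model that meets your quality bar") is easy to check numerically. This sketch compares a typical request across the three models using the prices from the table; the model identifier strings are illustrative labels, not official API model names:

```python
# Per-MTok prices from the pricing table above: (input, output)
PRICES = {
    "haiku-3.5":  (0.80, 4.00),
    "sonnet-4":   (3.00, 15.00),
    "opus-4":     (15.00, 75.00),
}

def cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one request on the given model."""
    inp, out = PRICES[model]
    return (input_tokens * inp + output_tokens * out) / 1_000_000

# Compare a typical support request (2,000 input / 500 output tokens)
for model in PRICES:
    print(f"{model}: ${cost(model, 2_000, 500):.4f}")
# haiku-3.5 ≈ $0.0036, sonnet-4 ≈ $0.0135, opus-4 ≈ $0.0675
```

At this request size Opus costs about 19x more than Haiku, which is why routing easy requests to a cheaper model pays off quickly at volume.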
Save Money with Advanced Features
Two built-in features can dramatically reduce your Claude API bill.
Prompt Caching
Cache your system prompt and shared context across requests. Cached input tokens cost just 10% of the standard input price. A chatbot sending a 2,000-token system prompt with 100,000 requests/month on Sonnet saves approximately $540/month on input costs alone.
Learn about prompt caching pricing →
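The arithmetic behind the ~$540/month savings, using the cache-read price of 10% of standard input (and ignoring the one-time 25% cache-write premium, which is negligible at this volume):

```python
SONNET_INPUT = 3.00                 # $/MTok, standard input price
CACHE_READ = SONNET_INPUT * 0.10    # cache reads cost 10% of input price

prompt_tokens = 2_000               # cached system prompt size
requests = 100_000                  # requests per month
total_mtok = prompt_tokens * requests / 1_000_000  # 200 MTok/month

uncached = total_mtok * SONNET_INPUT  # $600.00 without caching
cached = total_mtok * CACHE_READ      # $60.00 with caching
print(f"saves ${uncached - cached:.2f}/month")  # → saves $540.00/month
```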
Batch API
Submit requests in bulk and get a flat 50% discount on both input and output tokens. Results are processed within a 24-hour window. Ideal for content generation, data processing, bulk classification, and evaluation pipelines that do not need real-time responses.
Learn about batch API pricing →
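Because the batch discount is a flat 50% on both input and output tokens, it can be applied on top of any standard cost. A minimal sketch:

```python
def with_batch_discount(standard_cost: float) -> float:
    """Batch API: flat 50% off both input and output tokens."""
    return standard_cost * 0.50

# Standard Sonnet 4 cost for 1M input + 200k output tokens:
standard = 1.0 * 3.00 + 0.2 * 15.00   # $6.00
print(f"batch: ${with_batch_discount(standard):.2f}")  # → batch: $3.00
```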
How Does Claude Compare?
See how Claude API pricing stacks up against the other major LLM providers, token-for-token.