The Convention
All major providers quote prices as dollars per million tokens ($/M). When you see “$2.50 / $10.00” it means $2.50 per million input tokens and $10.00 per million output tokens. This convention exists because individual tokens cost fractions of a cent — per-million makes the numbers readable.
// Cost calculation formula
cost = (input_tokens × input_price_per_M
+ output_tokens × output_price_per_M)
/ 1,000,000
// Example: 2,000 input + 500 output on GPT-5.4
cost = (2000 × $2.50 + 500 × $15.00) / 1,000,000
= ($5,000 + $7,500) / 1,000,000
= $0.0125 per request
Scaling the Math
That $0.0125 per request seems trivial. But at 10,000 requests/day, it’s $125/day or $3,750/month. At 100,000 requests/day, it’s $37,500/month. The per-request cost is always small; the monthly bill is where reality hits. Always multiply by your expected daily volume to get the real picture.
Key insight: A common mistake is evaluating models by per-request cost alone. The right metric is monthly cost at production volume. A model that’s $0.001 cheaper per request saves $30,000/month at 1M requests/day.