tech calculator

AI Inference Cost Calculator

Estimate AI inference cost across tokens, images, or runtime seconds using per-unit rates.

Results

Token cost
$0
Image cost
$0
Runtime cost
$0
Total cost
$0

How to use this calculator

  1. Enter tokens and per-1k token rate (or leave zero if not applicable).
  2. Enter images and per-image rate if applicable.
  3. Enter runtime seconds and per-second rate if time-based billing applies.
  4. Review component costs and the total.

Inputs explained

Tokens
Total tokens processed (prompt + completion) for text models.
Cost per 1k tokens
Rate from your provider for the specific model.
Images
Count of images processed/generated.
Cost per image
Per-image rate from your provider.
Runtime (seconds)
Inference runtime if billed per second (common for hosted GPUs).
Cost per second
Per-second rate from your provider/instance type.

How it works

Token cost = (tokens ÷ 1000) × cost per 1k tokens.

Image cost = images × cost per image.

Runtime cost = seconds × cost per second.

Total = sum of all three components.

Formula

Token cost = (Tokens ÷ 1000) × Rate
Image cost = Images × Rate
Runtime cost = Seconds × Rate
Total = Sum of components

When to use it

  • Estimating batch job cost across text + image modalities.
  • Comparing providers by plugging in their rates for the same workload.
  • Budgeting for API calls or hosted GPU inference runs.

Tips & cautions

  • Set unused modalities to zero to focus on the ones you use.
  • If your provider uses input/output token pricing, sum them into the per-1k token rate or split tokens accordingly.
  • For GPU time, include idle padding or overhead if your platform bills wall-clock seconds.
  • No tiered pricing or volume discounts; enter an effective rate if tiers apply.
  • Does not include storage, bandwidth, or fine-tune training costs.
  • Assumes simple linear pricing; burst/idle nuances may differ by provider.

Worked examples

100k tokens at $0.002/1k

  • Token cost = 100,000 ÷ 1,000 × $0.002 = $0.20
  • Total = $0.20 if images/time are zero

50k tokens + 10 images + 500 sec

  • Token cost = 50,000 ÷ 1,000 × $0.002 = $0.10
  • Image cost = 10 × $0.02 = $0.20
  • Time cost = 500 × $0.0004 = $0.20
  • Total ≈ $0.50

Deep dive

Estimate AI inference cost by entering tokens, images, and runtime with their per-unit rates to see component and total cost.

Use it to budget batch jobs, compare providers, or sanity check API bills.

FAQs

How do I handle different token rates for input vs output?
Sum an effective rate (weighted by expected input/output tokens) or enter separate token counts with separate runs.
Can I model tiered pricing?
Enter an average effective rate for your expected volume; this tool doesn’t model tier steps.
Does this include storage/bandwidth?
No. Add those separately if your provider bills them.
What about concurrency or reserved instances?
This models per-unit usage only. Reserved capacity or concurrency commitments aren’t included.
Can I include fine-tuning training costs?
This tool is for inference. Training costs aren’t included.

Related calculators

Cost estimate only. Check your provider’s pricing (tiered rates, region, discounts) for accurate billing.