TokenTally · builder preview
Feed your typical prompt + completion sizes, choose a model, and TokenTally will show per-request, monthly, and annual costs instantly. The comparison table lets you see how switching models affects spend, and every estimate is backed by our internally maintained pricing dataset.
Ad slot · Hero
728×90
Placeholder shown until ads are approved
Estimated spend
Per request
$0.0005
Per month
$2.51
Per year
$30.13
Token totals
Prompt: 2,000,000 tokens (1,500,000 words)
Completion: 1,250,000 tokens (937,500 words)
Context window for Qwen3-Max (Global): 262,144 tokens
Last pricing sync: 3/13/2026
Optional: paste a sample prompt
Text stays in your browser—we only use it to approximate token counts.
Presets
Model comparison
| Model | $/request | $/month | Provider | Latency |
|---|---|---|---|---|
| Qwen3.5-Flash (Global) | $0.0001 | $0.4167 | Alibaba Cloud | economy |
| Llama 3.1 8B Instruct (Fireworks) | $0.0001 | $0.585 | Meta | economy |
| GPT-5 Nano | $0.0001 | $0.60 | OpenAI | economy |
| Gemini 2.5 Flash-Lite | $0.0001 | $0.70 | economy | |
| Grok 4.1 Fast | $0.0002 | $1.03 | xAI | standard |
| GPT-4o Mini | $0.0002 | $1.05 | OpenAI | economy |
| Qwen3.5-Plus (Global) | $0.0002 | $1.09 | Alibaba Cloud | standard |
| Grok 3 Mini | $0.0002 | $1.22 | xAI | economy |
Ad slot · Inline
300×250
Placeholder shown until ads are approved
Ad slot · Footer
728×90
Placeholder shown until ads are approved