TokenTally

Google Gemini

Gemini 2.5 Flash costs

Great for 1M-context assistant loops that still need adaptive thinking + Search grounding.

Last pricing check: Mar 13, 2026

$0.30 per 1M prompt tokens$2.50 per 1M completion tokens1,000,000 token context

Why teams choose this model

Realtime assistants
Agent loops with Search
Analytics copilots

Scenario planning

Realistic cost examples

Numbers use Gemini 2.5 Flash pricing

Realtime assistant

Customer-facing chatbots rely on Flash for balanced cost/perf.

Per request

$0.0018

Per month

$157.95

Tokens sent

130,500,000

850 prompt tokens600 completion tokens90,000 requests/mo

Analytics copilot

BI teams ask Flash to summarize dashboards and highlight anomalies.

Per request

$0.0026

Per month

$105.60

Tokens sent

88,000,000

1300 prompt tokens900 completion tokens40,000 requests/mo

Agent loop

Flash keeps multi-step workflows affordable while still hitting Search + Maps.

Per request

$0.0032

Per month

$80.00

Tokens sent

65,000,000

1500 prompt tokens1100 completion tokens25,000 requests/mo

Compare with

FAQs

Do I still get 1M context?

Yes—Google positions Flash as a hybrid reasoning model with 1M tokens, but surcharges kick in beyond 200K.

Cache pricing?

$0.03/M tokens (text) for cache hits; storage is $1/M tokens per hour.

Grounding costs?

Search: 1,500 RPD free (shared with Flash-Lite), then $35 per 1K prompts. Maps: 1,500 RPD free, then $25 per 1K.

Pricing sources