TokenTally

Google Gemini

Gemini 3 Flash costs

Balanced throughput and accuracy for live assistants, with audio surcharges when needed.

Last pricing check: Mar 13, 2026

$0.50 per 1M prompt tokens$3.00 per 1M completion tokens1,000,000 token context

Why teams choose this model

Enterprise chat
Realtime copilots
Moderate-cost agent loops

Scenario planning

Realistic cost examples

Numbers use Gemini 3 Flash Preview pricing

Enterprise chat

Internal chatbots rely on Flash for faster answers than Pro at lower cost.

Per request

$0.0023

Per month

$180.00

Tokens sent

120,000,000

900 prompt tokens600 completion tokens80,000 requests/mo

Research aide

Teams generate briefs and citations while grounding to Search.

Per request

$0.0033

Per month

$148.50

Tokens sent

94,500,000

1200 prompt tokens900 completion tokens45,000 requests/mo

Realtime assistant

Voice copilots leverage Flash’s audio pricing for rapid responses.

Per request

$0.0019

Per month

$191.00

Tokens sent

122,000,000

700 prompt tokens520 completion tokens100,000 requests/mo

Compare with

FAQs

How do audio prices work?

Audio inputs cost $1/M tokens vs. $0.50 for text/images/videos. Budget accordingly for voice-heavy flows.

What about caching?

$0.05/M tokens for cache hits; $1/M tokens per hour for storage when you pin caches.

Is Search grounding included?

Google gives 5,000 grounded prompts free per month; beyond that Search calls cost $14/1K queries.

Pricing sources