TokenTally

Google Gemini

Gemini 3.1 Flash-Lite costs

Suited to high-volume automation, translation, and agent loops, with beta support for a 1M-token context window.

Last pricing check: Mar 13, 2026

$0.25 per 1M prompt tokens
$1.50 per 1M completion tokens
1,000,000 token context

Why teams choose this model

Notification digests
Tier-1 support bots
Batch localization

Scenario planning

Realistic cost examples

Numbers use Gemini 3.1 Flash-Lite Preview pricing

Support front line

Flash-Lite handles canned support replies while Search handles exceptional cases.

Per request

$0.0005

Per month

$95.85

Tokens sent

131,400,000

450 prompt tokens
280 completion tokens
180,000 requests/mo

Localization sweeps

Marketing teams translate release notes for a fraction of a cent per request.

Per request

$0.0007

Per month

$60.30

Tokens sent

79,200,000

520 prompt tokens
360 completion tokens
90,000 requests/mo

Notification digest

Ops digests condense monitoring data into summaries for chat channels.

Per request

$0.0005

Per month

$67.90

Tokens sent

89,600,000

380 prompt tokens
260 completion tokens
140,000 requests/mo
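The scenario figures above follow from the listed per-token rates. The sketch below reproduces them, assuming cost is simply prompt tokens times the prompt rate plus completion tokens times the completion rate, with no caching, surcharges, or free-tier credits. The function and variable names are illustrative and not part of any Google or TokenTally API.

```python
from decimal import Decimal

# Listed rates in USD per 1M tokens (from the pricing line on this page).
PROMPT_PRICE_PER_M = Decimal("0.25")
COMPLETION_PRICE_PER_M = Decimal("1.50")
MILLION = Decimal(1_000_000)


def request_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """USD cost of one request at the listed rates (assumes no caching)."""
    return (
        prompt_tokens * PROMPT_PRICE_PER_M
        + completion_tokens * COMPLETION_PRICE_PER_M
    ) / MILLION


def monthly_summary(prompt_tokens: int, completion_tokens: int,
                    requests_per_month: int) -> dict:
    """Per-request cost, monthly cost, and total tokens for one scenario."""
    per_request = request_cost(prompt_tokens, completion_tokens)
    return {
        "per_request": per_request,
        "per_month": per_request * requests_per_month,
        "tokens_sent": (prompt_tokens + completion_tokens) * requests_per_month,
    }


# (prompt tokens, completion tokens, requests per month) for each scenario.
SCENARIOS = {
    "Support front line": (450, 280, 180_000),
    "Localization sweeps": (520, 360, 90_000),
    "Notification digest": (380, 260, 140_000),
}

if __name__ == "__main__":
    for name, args in SCENARIOS.items():
        s = monthly_summary(*args)
        print(f"{name}: ${s['per_request']:.4f}/request, "
              f"${s['per_month']:.2f}/month, {s['tokens_sent']:,} tokens")
```

Decimal is used rather than float so the monthly totals match the figures on the page exactly. The per-request values shown in the tiles are these results rounded to four decimal places.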


FAQs

Does Flash-Lite Preview also have long-context surcharges?

No. Google doesn’t list long-context uplifts for this preview tier, so pricing stays flat. Prompts in these workloads rarely cross 200K tokens in any case.

Any cache benefits?

Yes. Cached reads cost $0.025 per 1M tokens, which keeps shared prompts cheap. Cache storage costs $1 per 1M tokens per hour if you pin caches.
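Whether a pinned cache pays off depends on how often the shared prompt is reused. The sketch below compares hourly cost with and without caching for a shared prefix, using the rates quoted on this page. It assumes storage is billed linearly per token-hour, every request reads the whole cached prefix at the cached rate, and there is no minimum cache size or creation fee. Those billing details and all names in the code are assumptions, not confirmed behavior.

```python
from decimal import Decimal

# Rates in USD per 1M tokens, as quoted on this page.
PROMPT_PRICE_PER_M = Decimal("0.25")        # uncached prompt tokens
CACHED_READ_PRICE_PER_M = Decimal("0.025")  # cached token reads
STORAGE_PRICE_PER_M_HOUR = Decimal("1.00")  # pinned cache storage, per hour
MILLION = Decimal(1_000_000)


def hourly_cost_without_cache(shared_tokens: int, requests_per_hour: int) -> Decimal:
    """Hourly cost of resending the shared prefix at the full prompt rate."""
    return shared_tokens * requests_per_hour * PROMPT_PRICE_PER_M / MILLION


def hourly_cost_with_cache(shared_tokens: int, requests_per_hour: int) -> Decimal:
    """Hourly cost of cached reads plus storage (assumed linear billing)."""
    reads = shared_tokens * requests_per_hour * CACHED_READ_PRICE_PER_M / MILLION
    storage = shared_tokens * STORAGE_PRICE_PER_M_HOUR / MILLION
    return reads + storage


def break_even_requests_per_hour() -> Decimal:
    """Requests per hour at which storage cost equals the read savings."""
    savings_per_m = PROMPT_PRICE_PER_M - CACHED_READ_PRICE_PER_M
    return STORAGE_PRICE_PER_M_HOUR / savings_per_m


if __name__ == "__main__":
    print(f"Break-even: {break_even_requests_per_hour():.2f} requests/hour")
    for rate in (4, 5, 50):
        plain = hourly_cost_without_cache(2_000, rate)
        cached = hourly_cost_with_cache(2_000, rate)
        print(f"{rate} req/h: uncached ${plain:.5f}, cached ${cached:.5f}")
```

Under these assumptions the break-even point is 1 / 0.225, or about 4.44 reuses per hour, regardless of prefix length. Below that rate, pinning the cache costs more than resending the prompt.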

Do preview caps differ?

Yes. Preview tiers have lower rate limits, so use them for pilots before migrating to stable Flash or Flash-Lite.
