Google Gemini
Gemini 3.1 Flash-Lite costs
Great for high-volume automation, translation, and agent loops, with beta support for a 1M-token context window.
Last pricing check: Mar 13, 2026
Scenario planning
Realistic cost examples
Numbers use Gemini 3.1 Flash-Lite Preview pricing
Support front line
Flash-Lite handles canned support replies while Search handles exceptional cases.
Per request: $0.0005
Per month: $95.85
Tokens sent: 131,400,000
Localization sweeps
Marketing teams translate release notes at a fraction of a cent per request.
Per request: $0.0007
Per month: $60.30
Tokens sent: 79,200,000
Notification digest
Ops digests collapse monitoring data for chat channels.
Per request: $0.0005
Per month: $67.90
Tokens sent: 89,600,000
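To compare the three scenarios on one scale, the monthly figures can be turned into a blended cost per million tokens sent. The sketch below does only that arithmetic. It assumes that "Per month" and "Tokens sent" cover the same monthly period, and the function name `blended_usd_per_million` is made up for illustration. The monthly cost presumably also includes output tokens, so the result is a blended ratio and not Google's listed input rate.

```python
# Figures copied from the three scenario cards on this page.
SCENARIOS = {
    "Support front line": {"monthly_usd": 95.85, "tokens_sent": 131_400_000},
    "Localization sweeps": {"monthly_usd": 60.30, "tokens_sent": 79_200_000},
    "Notification digest": {"monthly_usd": 67.90, "tokens_sent": 89_600_000},
}


def blended_usd_per_million(monthly_usd: float, tokens_sent: int) -> float:
    """Return monthly spend divided by millions of tokens sent.

    Assumption: both figures describe the same monthly period.
    """
    if tokens_sent <= 0:
        raise ValueError("tokens_sent must be positive")
    return monthly_usd / (tokens_sent / 1_000_000)


if __name__ == "__main__":
    for name, figures in SCENARIOS.items():
        rate = blended_usd_per_million(
            figures["monthly_usd"], figures["tokens_sent"]
        )
        print(f"{name}: ${rate:.2f} per million tokens sent")
```

On the listed figures, all three scenarios land between roughly $0.73 and $0.76 per million tokens sent.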
FAQs
Does Flash-Lite Preview also have long-context surcharges?
No. Google doesn’t list long-context uplifts for this preview tier, so pricing stays flat. In practice, prompts rarely cross 200K tokens anyway.
Any cache benefits?
Yes. Cached reads cost $0.025 per million tokens, which keeps shared prompts cheap. Storage is $1 per million tokens per hour if you pin caches.
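As a rough illustration of how the two cache rates combine, the sketch below estimates monthly cache spend for one shared prompt. The rates are the ones quoted in the answer above. The function name, the parameters, and the example workload (a 10,000-token prompt read 100,000 times and pinned for 720 hours) are hypothetical, and the sketch ignores every other charge such as uncached input and output tokens.

```python
# Rates quoted in the FAQ answer above (USD).
CACHE_READ_USD_PER_M = 0.025         # per million cached tokens read
CACHE_STORAGE_USD_PER_M_HOUR = 1.00  # per million tokens stored, per hour


def cache_cost_usd(cached_tokens: int, reads: int, hours_pinned: float) -> dict:
    """Estimate cache read and storage cost for one pinned prompt.

    cached_tokens: size of the shared prompt held in the cache
    reads:         number of requests that reuse the cached prompt
    hours_pinned:  how long the cache entry is kept alive
    """
    millions = cached_tokens / 1_000_000
    read_cost = millions * CACHE_READ_USD_PER_M * reads
    storage_cost = millions * CACHE_STORAGE_USD_PER_M_HOUR * hours_pinned
    return {
        "reads": read_cost,
        "storage": storage_cost,
        "total": read_cost + storage_cost,
    }


if __name__ == "__main__":
    # Hypothetical workload, not taken from the scenarios on this page.
    cost = cache_cost_usd(cached_tokens=10_000, reads=100_000, hours_pinned=720)
    for label, usd in cost.items():
        print(f"{label}: ${usd:.2f}")
```

For that hypothetical workload, reads come to about $25 and storage to about $7.20. Storage is charged by the hour whether or not the cache is used, so pinning only pays off when the cached prompt is reused often enough.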
Do preview caps differ?
Yes. Preview tiers have lower rate limits, so use them for pilots before migrating to stable Flash or Flash-Lite.
Pricing sources
- https://ai.google.dev/gemini-api/docs/pricing
Checked Mar 13, 2026