Google DeepMind
Gemini Flash pricing
Flash delivers a million-token context for pennies. Use this page to model costs for newsletter bots, content moderators, and QA helpers.
Last pricing check: Mar 13, 2026
Scenario planning
Realistic cost examples
Numbers use Gemini 1.5 Flash pricing
Content guardrails
Flag tone, compliance, or toxicity before publishing user copy.
Per request: $0.0004
Per month: $24.36
Tokens sent: 43,200,000
Newsletter bot
Weekly digests sourced from product and community updates.
Per request: $0.0012
Per month: $1.84
Tokens sent: 2,850,000
QA helper
Docs-aware assistant answering product questions inside the app.
Per request: $0.0006
Per month: $5.51
Tokens sent: 9,450,000
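Each of the three scenarios above pairs a monthly cost with a token count, so a rough blended rate per million tokens can be backed out of them. The sketch below does that arithmetic using only the figures on this page. It assumes that "Tokens sent" is the monthly total and that the monthly cost covers every billed token, neither of which the page states explicitly. The class and function names are illustrative, not part of any tool mentioned here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Scenario:
    name: str
    per_request_usd: float  # "Per request" figure from the page (rounded)
    per_month_usd: float    # "Per month" figure from the page
    tokens_sent: int        # "Tokens sent", assumed to be a monthly total


# Figures copied from the scenario cards above.
SCENARIOS = [
    Scenario("Content guardrails", 0.0004, 24.36, 43_200_000),
    Scenario("Newsletter bot", 0.0012, 1.84, 2_850_000),
    Scenario("QA helper", 0.0006, 5.51, 9_450_000),
]


def blended_rate_per_million(s: Scenario) -> float:
    """Monthly cost divided by monthly tokens, scaled to 1M tokens."""
    return s.per_month_usd / s.tokens_sent * 1_000_000


def implied_requests_per_month(s: Scenario) -> float:
    """Rough request volume; imprecise because per-request cost is rounded."""
    return s.per_month_usd / s.per_request_usd


for s in SCENARIOS:
    print(
        f"{s.name}: ~${blended_rate_per_million(s):.3f} per 1M tokens, "
        f"~{implied_requests_per_month(s):,.0f} requests/month"
    )
```

Because the per-request figures are rounded to four decimal places, the implied request counts are only order-of-magnitude estimates. The blended rate is the more reliable number for comparing scenarios.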
FAQs
When should I choose Flash over Pro?
Pick Flash when latency and price matter more than maximum reasoning depth. It's ideal for moderation, summarization, and FAQ-style bots.
Does Flash support tool use?
Yes. Tool calls are billed by tokens, so you pay only for the tokens consumed invoking those tools.
Is there a tokens-per-minute quota?
Yes. Google enforces tokens-per-minute (TPM) caps per project. Model your peak usage in TokenTally to stay within those limits.
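As a back-of-the-envelope version of that check, peak TPM is roughly peak requests per minute times tokens per request. The sketch below is a minimal illustration, not TokenTally's own calculation. The cap value, the traffic figures, and the function names are all hypothetical, so substitute the quota shown for your own project.

```python
def peak_tokens_per_minute(peak_requests_per_minute: float,
                           tokens_per_request: float) -> float:
    """Estimate peak token throughput from request rate and request size."""
    return peak_requests_per_minute * tokens_per_request


def tpm_headroom(peak_tpm: float, tpm_cap: float) -> float:
    """Fraction of the cap left unused at peak; negative means over the cap."""
    if tpm_cap <= 0:
        raise ValueError("tpm_cap must be positive")
    return 1.0 - peak_tpm / tpm_cap


# Hypothetical inputs: 200 requests/minute at peak, 720 tokens each,
# against an assumed (not official) cap of 1,000,000 TPM.
HYPOTHETICAL_TPM_CAP = 1_000_000
peak = peak_tokens_per_minute(200, 720)
headroom = tpm_headroom(peak, HYPOTHETICAL_TPM_CAP)

print(f"Peak usage: {peak:,.0f} TPM, headroom: {headroom:.1%}")
if headroom < 0:
    print("Over the cap: throttle, batch, or request a higher quota.")
```

A negative headroom signals that peak traffic would exceed the cap, which is the case to plan for before launch.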
Pricing sources
- https://ai.google.dev/pricing
Checked Mar 13, 2026