Question 1

How do cache-hit prices factor in?

Accepted Answer

When Grok 4.20 recognizes a cached prompt prefix, the input rate drops to $0.20/M tokens. TokenTally stores that discount so you can compare best vs. default cases.

Question 2

Does the 2M context increase costs?

Accepted Answer

Only if you fill it. The scenarios keep prompts under 3K tokens, but you can paste any size prompt into the calculator to see real-time totals.

Question 3

Are tool invocations billed separately?

Accepted Answer

Yes—xAI charges $5 per 1K calls for Web/X search or code tools. Those fees stack on top of the token totals shown here.

Grok 4.20 Beta token costs

Realistic cost examples