Question 1

Is GPT-4o billed differently for streaming vs. JSON mode?

Accepted Answer

No. OpenAI bills GPT-4o strictly on prompt vs. completion tokens regardless of the output format or response mode.

Question 2

Can I mix GPT-4o with a cheaper model?

Accepted Answer

Yes—many teams route easy prompts to GPT-4o mini or gpt-4.1 mini and reserve GPT-4o for premium flows. TokenTally’s share button lets you compare both side by side.

Question 3

When should I look at GPT-4.1 instead?

Accepted Answer

Choose GPT-4.1 when you need 4.1’s reasoning upgrades or the 128k context from a single API. GPT-4o shines for multi-modal chat and speech.

GPT-4o pricing explained

Realistic cost examples