TokenTally

Alibaba Cloud Model Studio

Qwen3.5-Flash pricing

Use the million-token context at $0.029/M input and $0.287/M output to keep unit economics tight.

Last pricing check: Mar 13, 2026

$0.029 per 1M prompt tokens$0.287 per 1M completion tokens1,000,000 token context

Why teams choose this model

Tier-1 support bots
Notification summarizers
Bulk localization

Scenario planning

Realistic cost examples

Numbers use Qwen3.5-Flash (Global) pricing

Tier-1 handoff bot

Frontline chat bot clears routine issues before escalating to humans.

Per request

$0.0001

Per month

$8.55

Tokens sent

70,800,000

380 prompt tokens210 completion tokens120,000 requests/mo

Alert digestor

Ops teams condense alert storms into channel-ready summaries every few minutes.

Per request

$0.0001

Per month

$6.51

Tokens sent

51,000,000

420 prompt tokens260 completion tokens75,000 requests/mo

Localization sweeps

Marketing pushes product copy through Flash for rapid translations + tone checks.

Per request

$0.0001

Per month

$9.05

Tokens sent

72,000,000

500 prompt tokens300 completion tokens90,000 requests/mo

Compare with

FAQs

How cheap can requests get?

With prompts under 1K tokens you’re often well under a quarter-cent per call because the input rate is $0.029/M tokens.

Is Flash still multimodal?

Yes—Flash handles text/images at the same rate, so you can embed screenshots without separate billing.

What about regional pricing?

Switch the calculator to your preferred currency/region once we add it, but today we mirror the global (US) numbers straight from Alibaba’s doc.

Pricing sources