Back to Models
qwen·Apr 27, 2026
Qwen: Qwen3.6 Flash
qwen/qwen3.6-flashcontext
1M
Max Output
65.5K
Input / 1M
$0.25
Output / 1M
$1.50
About
Qwen3.6 Flash is the speed-first variant in Alibaba's Qwen 3.6 series. It accepts text, image, and video input with a 1M token context window and includes an integrated thinking mode. The model supports tool use and structured output, and prompt caching is available with separate cache read and cache write pricing. Pricing is tiered, with higher rates applied to requests above the 256K token threshold.
Capabilities
- Context Length
- 1M
- Max Output
- 65.5K
- Reasoning
- Yes
- In
- text, image, video
- Out
- text
Pricing
Full pricing| Type | ≤256K | >256K |
|---|---|---|
| Input | $0.25 | $1.00 |
| Output | $1.50 | $4.00 |
| Cache Write | $0.3125 | $0.3125 |
OpenAI-compatible · Model ID qwen/qwen3.6-flash
curl https://api.elliotgate.com/v1/chat/completions \
-H "Authorization: Bearer sk-omg-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "qwen/qwen3.6-flash",
"messages": [{"role": "user", "content": "Hello!"}]
}'