Head to head
Gemini 3.1 Pro Preview vs Qwen 3.6 Plus
Gemini 3.1 Pro Preview (Google) and Qwen 3.6 Plus (Alibaba) compared on intelligence, speed, context, and price — and which to choose. Both run on just4o.chat from one chat.
| Metric | Gemini 3.1 Pro Preview | Qwen 3.6 Plus |
|---|---|---|
| Intelligence (AA index) | 57 ✓ | 50 |
| Output speed (tokens/sec) | 132.6 ✓ | 52.5 |
| Context window | 1.0M ✓ | 1M |
| Max output | 66K | 66K |
| Input price / 1M | $2 | $0.5 ✓ |
| Output price / 1M | $12 | $3 ✓ |
| Released | 2026-02-19 | 2026-03-31 |
Choose Gemini 3.1 Pro Preview if you want…
- Higher intelligence (Artificial Analysis index 57)
- Faster output (~132.6 tokens/sec)
- Larger context window (1.0M)
Choose Qwen 3.6 Plus if you want…
- Lower price ($1.13 / 1M blended)
Gemini 3.1 Pro Preview
At the top of the Artificial Analysis Intelligence Index — ahead of every other model evaluated — Gemini 3.1 Pro Preview earns its ranking not just on raw ability but on the economics of getting there. It ran a full benchmark suite at less than half the cost of comparable frontier models, which makes it the clearest answer to the question of whether top-tier intelligence requires top-tier spend. A 38-percentage-point drop in hallucination rate over its predecessor and a 94.3% GPQA Diamond score in graduate-level scientific reasoning make it a serious tool for complex research, deep software engineering, and agentic workflows that need to get things right. Its 1-million-token context window handles entire codebases or lengthy document sets without batching. The honest caveat: time-to-first-token averages nearly 25 seconds, well above the median for comparable models, and some developers report extended waits or reliability issues under high API load. If your work is iterative and deep rather than real-time and conversational, the trade-off is usually worth it.
Full Gemini 3.1 Pro Preview details →Qwen 3.6 Plus
At $0.50 per million input tokens, Qwen 3.6 Plus punches well above its price band — scoring 78.8 on SWE-bench Verified and 61.6 on Terminal-Bench 2.0, where it outpaces Claude 4.5 Opus on agentic coding tasks. The 1 million token context window lets you drop in entire codebases for security audits, multi-file refactors, or long-horizon agent sessions without chunking or worrying about cost. Always-on chain-of-thought reasoning is baked into the architecture rather than toggled per request, and native tool-calling makes it well-suited for multi-step workflows. Developers building high-volume API applications have reported generating hundreds of millions of tokens during its preview period — its first-day usage crossed one trillion tokens across platforms. That said, the long context is not a silver bullet: retrieval accuracy degrades in the middle of very long inputs, and real-world testing has surfaced instruction-following inconsistencies and occasional tool-calling failures that more mature providers handle more reliably. For cost-sensitive production deployments where coding and document analysis are the core workload, few models compete at this price.
Full Qwen 3.6 Plus details →FAQ
Which is better, Gemini 3.1 Pro Preview or Qwen 3.6 Plus?
Gemini 3.1 Pro Preview leads on 3 of the headline metrics (higher intelligence (artificial analysis index 57); faster output (~132.6 tokens/sec); larger context window (1.0m)), while Qwen 3.6 Plus wins on lower price ($1.13 / 1m blended). The right pick depends on whether you prioritise capability, speed, or cost.
Is Gemini 3.1 Pro Preview or Qwen 3.6 Plus cheaper?
Qwen 3.6 Plus is cheaper at $1.13 per 1M tokens (blended), versus $4.5.
Can I use both Gemini 3.1 Pro Preview and Qwen 3.6 Plus?
Yes. Both are available on just4o.chat from a single chat — you can switch between them per message with no separate subscriptions.