Head to head

Gemini 3.1 Pro Preview vs Gemini 3.5 Flash

Gemini 3.1 Pro Preview (Google) and Gemini 3.5 Flash (Google) compared on intelligence, speed, context, and price — and which to choose. Both run on just4o.chat from one chat.

Metric	Gemini 3.1 Pro Preview	Gemini 3.5 Flash
Intelligence (AA index)	57 ✓	55
Output speed (tokens/sec)	132.6	280 ✓
Context window	1.0M	1.0M
Max output	66K	66K
Input price / 1M	$2	$1.5 ✓
Output price / 1M	$12	$9 ✓
Released	2026-02-19	2026-05

Choose Gemini 3.1 Pro Preview if you want…

Higher intelligence (Artificial Analysis index 57)

Choose Gemini 3.5 Flash if you want…

Faster output (~280 tokens/sec)
Lower price ($3.38 / 1M blended)

Gemini 3.1 Pro Preview

At the top of the Artificial Analysis Intelligence Index — ahead of every other model evaluated — Gemini 3.1 Pro Preview earns its ranking not just on raw ability but on the economics of getting there. It ran a full benchmark suite at less than half the cost of comparable frontier models, which makes it the clearest answer to the question of whether top-tier intelligence requires top-tier spend. A 38-percentage-point drop in hallucination rate over its predecessor and a 94.3% GPQA Diamond score in graduate-level scientific reasoning make it a serious tool for complex research, deep software engineering, and agentic workflows that need to get things right. Its 1-million-token context window handles entire codebases or lengthy document sets without batching. The honest caveat: time-to-first-token averages nearly 25 seconds, well above the median for comparable models, and some developers report extended waits or reliability issues under high API load. If your work is iterative and deep rather than real-time and conversational, the trade-off is usually worth it.

Full Gemini 3.1 Pro Preview details →

Gemini 3.5 Flash

The first Flash-tier model to outperform a Pro on coding and agentic benchmarks, Gemini 3.5 Flash rewrites expectations for what a speed-optimized model can do. At over 280 tokens per second — roughly 4x faster than comparable frontier models — it sustains the throughput that production agent loops demand, while benchmark results on Terminal-Bench 2.1 (76.2%) and MCP Atlas (83.6%) put it ahead of Gemini 3.1 Pro on the tasks developers actually care about. Early users call it "an insane value" for delivering near-frontier intelligence at roughly a third of Pro's cost. The 31-point drop in hallucination rate over its predecessor makes it meaningfully more reliable in practice. The honest caveat: time to first token sits around 19 seconds, which stings in latency-sensitive interactions, and aggressive rate limiting has frustrated users hitting it hard. Deep reasoning, hard analytical problems, and ultra-long context retrieval still favor the Pro. But for teams running iterative coding agents, structured data pipelines, or high-throughput chatbots where cost and speed are the binding constraints, Flash 3.5 is the practical choice.

Full Gemini 3.5 Flash details →

FAQ

Which is better, Gemini 3.1 Pro Preview or Gemini 3.5 Flash?

Gemini 3.5 Flash leads on 2 of the headline metrics (faster output (~280 tokens/sec); lower price ($3.38 / 1m blended)), while Gemini 3.1 Pro Preview wins on higher intelligence (artificial analysis index 57). The right pick depends on your priorities.

Is Gemini 3.1 Pro Preview or Gemini 3.5 Flash cheaper?

Gemini 3.5 Flash is cheaper at $3.38 per 1M tokens (blended), versus $4.5.

Can I use both Gemini 3.1 Pro Preview and Gemini 3.5 Flash?

Yes. Both are available on just4o.chat from a single chat — you can switch between them per message with no separate subscriptions.

Compare interactively All models