Head to head

Claude Sonnet 4.6 vs Gemini 3.1 Pro Preview

Claude Sonnet 4.6 (Anthropic) and Gemini 3.1 Pro Preview (Google) compared on intelligence, speed, context, and price — and which to choose. Both run on just4o.chat from one chat.

MetricClaude Sonnet 4.6Gemini 3.1 Pro Preview
Intelligence (AA index)4457
Output speed (tokens/sec)44.1132.6
Context window1M1.0M
Max output64K66K
Input price / 1M$3$2
Output price / 1M$15$12
Released2026-022026-02-19

Choose Claude Sonnet 4.6 if you want…

  • A comparable all-rounder — they trade blows on the headline metrics.

Choose Gemini 3.1 Pro Preview if you want…

  • Higher intelligence (Artificial Analysis index 57)
  • Faster output (~132.6 tokens/sec)
  • Lower price ($4.5 / 1M blended)
  • Larger context window (1.0M)

Claude Sonnet 4.6

Sonnet 4.6 sits at the sweet spot where coding and agentic work get done without paying Opus prices. On SWE-bench Verified it scores 79.6% — within one point of Opus 4.6 (80.8%) — at roughly a third of the cost, which is why developers running automated pipelines tend to reach for it first. The self-correction training is the headline improvement: when a tool call fails, the model recognizes and recovers rather than cycling through the same error. Users also praise the 1M-token context window for swallowing entire codebases or large document sets in a single pass. The honest caveat is that this context window has edges — retrieval quality degrades on adversarial tests beyond about 700K tokens, so vector-based RAG is still the safer bet for critical long-context searches. Speed is also a known tension: at 44 tokens per second, it runs slower than the median for its tier, which can feel noticeable in real-time applications. Still, for teams that need high-quality code generation, browser automation, and multi-step agentic workflows without Opus-level spend, Sonnet 4.6 is the practical default.

Full Claude Sonnet 4.6 details →

Gemini 3.1 Pro Preview

At the top of the Artificial Analysis Intelligence Index — ahead of every other model evaluated — Gemini 3.1 Pro Preview earns its ranking not just on raw ability but on the economics of getting there. It ran a full benchmark suite at less than half the cost of comparable frontier models, which makes it the clearest answer to the question of whether top-tier intelligence requires top-tier spend. A 38-percentage-point drop in hallucination rate over its predecessor and a 94.3% GPQA Diamond score in graduate-level scientific reasoning make it a serious tool for complex research, deep software engineering, and agentic workflows that need to get things right. Its 1-million-token context window handles entire codebases or lengthy document sets without batching. The honest caveat: time-to-first-token averages nearly 25 seconds, well above the median for comparable models, and some developers report extended waits or reliability issues under high API load. If your work is iterative and deep rather than real-time and conversational, the trade-off is usually worth it.

Full Gemini 3.1 Pro Preview details →

FAQ

Which is better, Claude Sonnet 4.6 or Gemini 3.1 Pro Preview?

Gemini 3.1 Pro Preview leads on 4 of the headline metrics (higher intelligence (artificial analysis index 57); faster output (~132.6 tokens/sec); lower price ($4.5 / 1m blended); larger context window (1.0m)), while Claude Sonnet 4.6 wins on other factors. The right pick depends on your priorities.

Is Claude Sonnet 4.6 or Gemini 3.1 Pro Preview cheaper?

Gemini 3.1 Pro Preview is cheaper at $4.5 per 1M tokens (blended), versus $6.

Can I use both Claude Sonnet 4.6 and Gemini 3.1 Pro Preview?

Yes. Both are available on just4o.chat from a single chat — you can switch between them per message with no separate subscriptions.