Frontier Models

9 articles

analysisMar 17, 2026

Claude Opus 4: What Practitioners Need to Know

Anthropic's flagship model sets new benchmarks across reasoning, coding, and extended context — here's what changes in practice.

3 min read
briefingMar 17, 2026

Introducing the AI Daily Intelligence Briefing

We built a research pipeline that synthesises hundreds of AI sources daily. Now we're opening it up. Here's what to expect.

2 min read
newsMar 16, 2026

Mistral Small 4: A 119B MoE Model That Unifies Reasoning, Multimodal, and Agentic Workloads

Mistral Small 4 ships under Apache 2.0 with 256k context, 40% lower latency, and a unified architecture for instruct, reasoning, multimodal, and coding tasks.

4 min read
analysisMar 09, 2026

NVIDIA GTC 2026: Blackwell Ultra, NeMo Overhaul, and the Inference War

NVIDIA's annual GPU conference delivered Blackwell Ultra with 2.5x inference throughput, a rebuilt NeMo framework, and a clear signal that the company sees inference — not training — as the next bottleneck.

3 min read
newsMar 05, 2026

OpenAI Launches GPT-5.4 with Native Computer Use and 1M-Token Context

GPT-5.4 arrives with computer use that beats human baselines, a million-token context window, and integrated coding — here's what it means for production systems.

4 min read
analysisMar 05, 2026

Gemini 2.5 Pro: Google's Best Model Yet, and What It Reveals About the Race

Google's Gemini 2.5 Pro sets new benchmarks on coding, math, and long-context reasoning — but the real story is what it tells us about where the frontier model competition is heading.

4 min read
newsMar 02, 2026

DeepSeek V4: 1T-Parameter MoE on Domestic Chips, but Release Date Remains Uncertain

Sources say DeepSeek is preparing V4 — a 1T-parameter MoE model with 1M context and native multimodal, built on Huawei and Cambricon chips — but multiple predicted launch dates have passed.

4 min read
analysisFeb 28, 2026

DeepSeek R2: A 671B Open-Weight Model That Matches the Frontier

DeepSeek releases R2 with 671B parameters in a mixture-of-experts architecture under an open licence, posting benchmark scores within striking distance of the best closed models — and the implications ripple far beyond the leaderboard.

3 min read
analysisFeb 20, 2026

Mistral Large 3: Europe's Bid for AI Sovereignty Gets Serious

Mistral releases Large 3, a 123B parameter model that outperforms GPT-4o on several benchmarks — and signals that Europe's AI sovereignty ambitions now have technical substance behind them.

3 min read