Claude Opus 4.6 Sets New Benchmark Records Across Coding and Reasoning
Anthropic Blog · 10mo ago
Anthropic's latest flagship model, Claude Opus 4.6, achieves state-of-the-art performance on the SWE-bench, GPQA, and MATH benchmarks, cementing its position as the most capable coding assistant available.