
No ratings yet
Be the first to review this model
A significant leap in small model performance, beating GPT-4o on many benchmarks while reducing latency by nearly half and cost by 83%. Features the same 1M token context window as the full GPT-4.1. The sweet spot for developers needing strong intelligence at production-friendly prices.
Released
April 14, 2025
Parameters
Unknown
Context
1M
Pricing
Paid
Last updated: March 15, 2026
Benchmark scores may vary based on evaluation methodology and conditions.