
marginlab.ai
January 29, 2026
1 min read
72/100
Summary
The Claude Code Opus 4.5 Performance Tracker provides daily benchmarks on a curated subset of SWE-Bench-Pro to monitor performance changes. It utilizes statistical testing to detect significant degradations in performance, benchmarking directly in the Claude Code CLI with the Opus 4.5 model.
Key Takeaways
Community Sentiment
Positives
Concerns