
entropicthoughts.com
March 12, 2026
Summary
LLM performance drops sharply when the success criterion shifts from "passes all tests" to "would be approved by the maintainer." Under the stricter criterion, the task length at which models reach a 50% success rate falls from 50 minutes to 8 minutes.