CursorBench 3.1 evaluates AI agents on ambiguous, multi-file tasks based on real Cursor sessions, with scores indicating performance. Fable 5 Max achieved the highest score of 72.9%, while GPT-5.5 Extra High scored 64.3%.
cursor.com
3 min
13h ago
CursorBench 3.1 evaluates AI agents on ambiguous, multi-file tasks based on real Cursor sessions, with scores indicating performance. Fable 5 Max achieved the highest score of 72.9%, while GPT-5.5 Extra High scored 64.3%.
cursor.com
3 min
13h ago
CursorBench 3.1 evaluates AI agents on ambiguous, multi-file tasks based on real Cursor sessions, with scores indicating performance. Fable 5 Max achieved the highest score of 72.9%, while GPT-5.5 Extra High scored 64.3%.
cursor.com
3 min
13h ago
No more articles to load