Aletheia, powered by Gemini 3 Deep Think, autonomously solved 6 out of 10 problems in the FirstProof challenge. Expert assessments confirmed the accuracy of Aletheia's solutions for the problems completed.
arxiv.org
1 min
2/25/2026
Top mathematicians have issued a challenge to AI systems to solve a set of unsolved mathematical problems within a week, as part of an initiative called “First Proof.” These problems are newly formulated and are not present in any large language model's training data.
scientificamerican.com
3 min
2/11/2026
Aletheia, powered by Gemini 3 Deep Think, autonomously solved 6 out of 10 problems in the FirstProof challenge. Expert assessments confirmed the accuracy of Aletheia's solutions for the problems completed.
arxiv.org
1 min
2/25/2026
Top mathematicians have issued a challenge to AI systems to solve a set of unsolved mathematical problems within a week, as part of an initiative called “First Proof.” These problems are newly formulated and are not present in any large language model's training data.
scientificamerican.com
3 min
2/11/2026
Aletheia, powered by Gemini 3 Deep Think, autonomously solved 6 out of 10 problems in the FirstProof challenge. Expert assessments confirmed the accuracy of Aletheia's solutions for the problems completed.
arxiv.org
1 min
2/25/2026
Top mathematicians have issued a challenge to AI systems to solve a set of unsolved mathematical problems within a week, as part of an initiative called “First Proof.” These problems are newly formulated and are not present in any large language model's training data.
scientificamerican.com
3 min
2/11/2026
No more articles to load