Themata.AI | AI news without the noise

Themata.AI

AI is changing the world. Don't stay behind. Clear summaries, community insight, delivered without the noise. Subscribe to never miss a beat.

Privacy

Contact

🕒 Latest 🔥 Top

Filtering by tag:

claudeClear

News Opinion Research Tool Clear

XユーザーのBo Wangさん: 「Three weeks ago I shared that Claude had shocked Prof. Donald Knuth by finding an odd-m construction for his open Hamiltonian decomposition problem in about an hour of guided exploration. Prof. Knuth titled the paper Claude’s Cycles. The story didn't end there. The updated https://t.co/1ZbmrCpHni」 / X

claude hamiltonian-decomposition ai-research guided-exploration

Research

Further human + AI + proof assistant work on Knuth's "Claude Cycles" problem

Claude found an odd-m construction for the open Hamiltonian decomposition problem, impressing Prof. Donald Knuth, who titled the resulting paper "Claude’s Cycles." The updated research reveals that for the base case m=3, there are 11,502 Hamiltonian cycles, with 996 generalizing to all odd-m, and 760 valid "Claude-like" decompositions identified.

twitter.com

🔥🔥🔥🔥🔥

1 min

1d ago

claude ai-adoption developer-tools code-generation

Research

90% of Claude-linked output going to GitHub repos w <2 stars

Claude Code has experienced an 8% week-over-week growth, with a doubling time of 61 days. The earliest observed public-era commits for Claude Code adoption include a change to the initial game setup in the moinmir/ClashOfCans project on February 24, 2025.

claudescode.dev

🔥🔥🔥🔥🔥

12 min

4d ago

claude ai-user-research ai-ethics multilingual-ai

Research

What 81,000 people want from AI

A survey involving 81,000 Claude users from 159 countries and 70 languages gathered insights on AI usage, aspirations, and concerns. This study is considered the largest and most multilingual qualitative research on public perceptions of AI.

anthropic.com

🔥🔥🔥🔥🔥

29 min

3/19/2026

GitHub - davegoldblatt/marcus-claims-dataset: Systematic extraction and analysis of every testable AI claim Gary Marcus made on his Substack (2022-2026). Dual-pipeline analysis by Claude and ChatGPT with hybrid reconciliation.

ai-skepticism gary-marcus claude chatgpt

Research

Marcus AI Claims Dataset

The Marcus Claims Dataset systematically extracts and analyzes 2,218 testable claims made by Gary Marcus on his Substack from 2022 to 2026. Among claims with checkable evidence, 59.9% were supported, 33.7% were mixed, and 6.4% were contradicted.

github.com

🔥🔥🔥🔥🔥

2 min

3/4/2026

claude ai-agents llms anthropic

Research

Claude's Cycles [pdf]

Posted by fs123. Score: 248 points. Comments: 122.

www-cs-faculty.stanford.edu

🔥🔥🔥🔥🔥

1 min

3/3/2026

claude code-generation ai-agents developer-tools

Research

What Claude Code Chooses

Claude Code was tested on real repositories 2,430 times using open-ended questions, achieving an 85.3% extraction rate across three models, four project types, and 20 tool categories. The primary finding indicates that Claude Code favors building custom solutions over purchasing existing tools.

amplifying.ai

🔥🔥🔥🔥🔥

3 min

2/26/2026

AIs can’t stop recommending nuclear strikes in war game simulations

llms gpt-52 claude ai-agents

Research

AIs can't stop recommending nuclear strikes in war game simulations

Advanced AI models, including GPT-5.2, Claude Sonnet 4, and Gemini 3 Flash, recommended nuclear strikes during simulated geopolitical crises without human-like reservations. These simulations involved scenarios such as border disputes, competition for resources, and threats to regime survival.

newscientist.com

🔥🔥🔥🔥🔥

3 min

2/25/2026

llms ai-reasoning openai claude

Research

"Car Wash" test with 53 models

The car wash test evaluates AI reasoning by asking whether to walk or drive 50 meters to a car wash. Most leading AI models, including Claude Sonnet 4.5, GPT-5.1, Llama, and Mistral, fail to provide the correct answer, which is to drive.

opper.ai

🔥🔥🔥🔥🔥

9 min

2/23/2026

ai-agents claude anthropic ai-safety

Research

Measuring AI agent autonomy in practice

AI agents are currently deployed in diverse contexts, ranging from email triage to cyber espionage. An analysis of millions of human-agent interactions across Claude Code and a public API aims to measure the autonomy of AI agents in real-world usage.

anthropic.com

🔥🔥🔥🔥🔥

28 min

2/19/2026

claude code-generation developer-tools anthropic

Research

Claude's C Compiler vs. GCC

Anthropic developed CCC (Claude’s C Compiler), which compiles the Linux kernel using code entirely written by Claude Opus 4.6, with human guidance limited to writing test cases. Benchmark tests compare CCC's performance against the industry-standard GCC compiler.

harshanu.space

🔥🔥🔥🔥🔥

15 min

2/9/2026

claude hamiltonian-decomposition ai-research guided-exploration

Research

Further human + AI + proof assistant work on Knuth's "Claude Cycles" problem

twitter.com

🔥🔥🔥🔥🔥

1 min

1d ago

claude ai-user-research ai-ethics multilingual-ai

Research

What 81,000 people want from AI

anthropic.com

🔥🔥🔥🔥🔥

29 min

3/19/2026

claude ai-agents llms anthropic

Research

Claude's Cycles [pdf]

Posted by fs123. Score: 248 points. Comments: 122.

"Car Wash" test with 53 models

opper.ai

🔥🔥🔥🔥🔥

9 min

2/23/2026

claude ai-user-research ai-ethics multilingual-ai

Research

What 81,000 people want from AI

anthropic.com

🔥🔥🔥🔥🔥

29 min

3/19/2026

claude code-generation ai-agents developer-tools

Research

What Claude Code Chooses

amplifying.ai

🔥🔥🔥🔥🔥

3 min

2/26/2026

ai-agents claude anthropic ai-safety

Research

Measuring AI agent autonomy in practice

anthropic.com

🔥🔥🔥🔥🔥

28 min

2/19/2026