
quesma.com
February 22, 2026
14 min read
60/100
Summary
Backdoors were hidden in ~40MB binaries to test AI and Ghidra's capabilities in malware detection. The experiment involved collaboration with Michał “Redford” Kowalczyk, a reverse engineering expert, to establish a benchmark for identifying malicious code in binaries.
Key Takeaways
Community Sentiment
Positives
Concerns

We reproduced Anthropic's Mythos findings with public models
Apr 17, 2026

Evaluating and mitigating the growing risk of LLM-discovered 0-days
Feb 5, 2026

How We Broke Top AI Agent Benchmarks: And What Comes Next
Apr 11, 2026

Opus 4.6 uncovers 500 zero-day flaws in open-source code
Feb 5, 2026

Mythos Finds a Curl Vulnerability
May 11, 2026