Themata.AI
Themata.AI

Popular tags:

#developer-tools#ai-agents#llms#claude#ai-ethics#code-generation#ai-safety#openai#anthropic#discussion

AI is changing the world. Don't stay behind. Clear summaries, community insight, delivered without the noise. Subscribe to never miss a beat.

© 2026 Themata.AI • All Rights Reserved

Privacy

|

Cookies

|

Contact
🕒 Latest🔥 Top
WeekMonthYearAll Time

Filtering by tag:

autoresearchClear
Sakana Fugu — Multi-agent System as A Model
ai-agentsautoresearchmachine-learningexperimentation
Research

Sakana Fugu

Sakana Fugu is a multi-agent system that autonomously enhances a small GPT's training recipe using AutoResearch, which iteratively edits training code and conducts experiments. The AI agent completed 123 experiments over approximately 14 hours on a single H100 GPU, tracking improvements in validation bits-per-byte (BPB).

sakana.ai

🔥🔥🔥🔥🔥

4 min

6d ago

Can LLMs Beat Classical Hyperparameter Optimization Algorithms?

The autoresearch repository allows an LLM agent to optimize hyperparameters by directly editing training code. A study compares classical hyperparameter optimization algorithms with LLM-based methods for tuning a small language model's hyperparameters.

arxiv.org

🔥🔥🔥🔥🔥

3 min

6/9/2026

Autoresearch on an old research idea | Blog | Yogesh KumarTool

Autoresearch on an old research idea

Karpathy's Autoresearch utilizes a constrained optimization loop with a large language model (LLM) agent. The author applied Autoresearch to legacy code from eCLIP while managing household tasks.

ykumar.me

🔥🔥🔥🔥🔥

6 min

3/23/2026

Scaling Karpathy's Autoresearch: What Happens When the Agent Gets a GPU Cluster

Claude Code was given access to 16 GPUs on a Kubernetes cluster and submitted approximately 910 experiments over 8 hours. It determined that scaling model width was more significant than any single hyperparameter and achieved a 2.87% improvement in validation performance, reducing val_bpb from 1.003 to 0.974.

blog.skypilot.co

🔥🔥🔥🔥🔥

12 min

3/19/2026

Sakana Fugu

Sakana Fugu is a multi-agent system that autonomously enhances a small GPT's training recipe using AutoResearch, which iteratively edits training code and conducts experiments. The AI agent completed 123 experiments over approximately 14 hours on a single H100 GPU, tracking improvements in validation bits-per-byte (BPB).

sakana.ai

🔥🔥🔥🔥🔥

4 min

6d ago

Autoresearch on an old research idea

Karpathy's Autoresearch utilizes a constrained optimization loop with a large language model (LLM) agent. The author applied Autoresearch to legacy code from eCLIP while managing household tasks.

ykumar.me

🔥🔥🔥🔥🔥

6 min

3/23/2026

Can LLMs Beat Classical Hyperparameter Optimization Algorithms?

The autoresearch repository allows an LLM agent to optimize hyperparameters by directly editing training code. A study compares classical hyperparameter optimization algorithms with LLM-based methods for tuning a small language model's hyperparameters.

arxiv.org

🔥🔥🔥🔥🔥

3 min

6/9/2026

Scaling Karpathy's Autoresearch: What Happens When the Agent Gets a GPU Cluster

Claude Code was given access to 16 GPUs on a Kubernetes cluster and submitted approximately 910 experiments over 8 hours. It determined that scaling model width was more significant than any single hyperparameter and achieved a 2.87% improvement in validation performance, reducing val_bpb from 1.003 to 0.974.

blog.skypilot.co

🔥🔥🔥🔥🔥

12 min

3/19/2026

Sakana Fugu

Sakana Fugu is a multi-agent system that autonomously enhances a small GPT's training recipe using AutoResearch, which iteratively edits training code and conducts experiments. The AI agent completed 123 experiments over approximately 14 hours on a single H100 GPU, tracking improvements in validation bits-per-byte (BPB).

sakana.ai

🔥🔥🔥🔥🔥

4 min

6d ago

Scaling Karpathy's Autoresearch: What Happens When the Agent Gets a GPU Cluster

Claude Code was given access to 16 GPUs on a Kubernetes cluster and submitted approximately 910 experiments over 8 hours. It determined that scaling model width was more significant than any single hyperparameter and achieved a 2.87% improvement in validation performance, reducing val_bpb from 1.003 to 0.974.

blog.skypilot.co

🔥🔥🔥🔥🔥

12 min

3/19/2026

Can LLMs Beat Classical Hyperparameter Optimization Algorithms?

The autoresearch repository allows an LLM agent to optimize hyperparameters by directly editing training code. A study compares classical hyperparameter optimization algorithms with LLM-based methods for tuning a small language model's hyperparameters.

arxiv.org

🔥🔥🔥🔥🔥

3 min

6/9/2026

Autoresearch on an old research idea

Karpathy's Autoresearch utilizes a constrained optimization loop with a large language model (LLM) agent. The author applied Autoresearch to legacy code from eCLIP while managing household tasks.

ykumar.me

🔥🔥🔥🔥🔥

6 min

3/23/2026

No more articles to load