Autoresearch

ResearchfreeOpen SourceResearch, Software Development, Education

An open-source project that lets AI agents autonomously run LLM training experiments and keep the best model changes.

GitHub ↗

🛡️ AgentReady threat assessment

MAESTRO 7-layer threat model + OWASP AIVSS risk score for Autoresearch, derived from its capabilities.

AIVSS 9.5 · Critical

View MAESTRO 7-layer threat model →

Overview

Autoresearch is an open-source project by Andrej Karpathy that lets AI agents run autonomous machine learning research loops on a small but real LLM training setup. The repository is designed so an agent edits the main training file, launches a fixed 5-minute experiment, evaluates whether the result improved, and then keeps or discards the change before repeating the cycle. Its README describes the setup as a lightweight autonomous research organization driven by instructions in a program.md file rather than traditional manual code iteration. The project is built around a simplified single-GPU nanochat training workflow and is aimed at developers and researchers exploring automated model improvement, agent-driven experimentation, and compact research loops on their own hardware. :contentReference[oaicite:0]{index=0}

Key features

open-source
LLM training
autonomous research
machine learning
single GPU
nanochat
experiment loops
PyTorch
model optimization
research automation

Use cases

Running autonomous overnight experiments to improve small language model training setups.
Testing agent-driven code changes in a controlled single-file training workflow.
Exploring automated research loops for architecture, optimizer, and hyperparameter changes.
Studying how AI systems can manage iterative machine learning experimentation with minimal human intervention.