AutoAgent is an autonomous framework designed for agent engineering, functioning similarly to autoresearch but focused on building and iterating on agent harnesses. The system allows a user to assign a task to an AI agent, which then autonomously modifies system prompts, tools, agent configurations, and orchestration over time. By running benchmarks and checking scores, the meta-agent performs a hill-climbing optimization, keeping improvements and discarding failures. The core workflow involves programming via a Markdown file called program.md, which provides context and directives to the meta-agent, while the meta-agent directly edits the agent.py harness file. This approach minimizes manual engineering by allowing the agent to optimize its own performance through continuous, automated experimentation.
AutoAgent is a revolutionary open-source library designed to automate the tedious process of agent engineering and prompt tuning. By employing a meta-agent, the library allows for the autonomous optimization of an agent's harness, including system prompts, tool definitions, and orchestration strategies, all without human intervention. During a 24-hour run, AutoAgent achieved impressive results, including the top score on SpreadsheetBench and a leading GPT-5 score on TerminalBench. This technology effectively transitions the human's role from a manual engineer to a high-level director, enabling rapid, self-improving agent development across various domains and benchmarks.