Agents Now Update Their Own Code and Model Weights
A new self-improvement loop called SIA allows an AI agent to update both its own software harness and its underlying model weights, combining two previously separate research fields.
A paper released May 28, 2026, introduces SIA, a self-improving AI system where a “Feedback-Agent” updates both the software harness and the model weights of a separate task-specific agent. The work combines two distinct approaches to AI self-improvement that have largely operated in isolation: the “harness-update school,” where a meta-agent rewrites an agent’s prompts, tools, and logic, and the “test-time training school,” which uses reinforcement learning to update model weights based on task feedback.
SIA bridges this gap by creating a loop where both components evolve together. The paper’s authors frame the two levers as complementary: “Harness updates make the model agentic, shaping how it searches and acts, while weight updates build the domain intuition that no prompt or scaffold can instill.” This integrated approach is part of a broader trend toward self-evolving agents, such as CyberEvolver, which iteratively revises its own cybersecurity scaffold based on execution failures.
In experiments across three domains, the combined SIA loop significantly outperformed harness-only updates. The system achieved a 56.6% improvement on the LawBench legal classification benchmark, a 91.9% runtime reduction for GPU kernel optimization, and a 502% gain on a single-cell RNA denoising task over the initial baseline. The results suggest that enabling agents to modify both their code and their internal parameters is a powerful path toward more autonomous and capable systems.