Agents Learn From Experience Without Retraining
A new framework called ExpGraph allows agents to learn from past successes and failures using a graph-based memory, avoiding costly fine-tuning.
Large language model agents can now learn from experience without requiring parameter updates, using a new model-agnostic framework called ExpGraph. The system summarizes historical agent trajectories into reusable skills and failure lessons, organizing them as nodes in an evolving graph. This allows a frozen, off-the-shelf LLM to improve its performance by retrieving relevant experiences at runtime.
The framework uses a retrieval copilot, trained with reinforcement learning, to query the experience graph. This copilot learns to identify which past experiences are most useful for a new task. The graph itself is updated online based on the outcomes of downstream tasks, allowing the agent's knowledge base to grow and adapt over time.
Evaluated on tasks including question answering, code generation, and multi-step agent environments like ALFWorld, ExpGraph improved performance over strong baselines by 12.2% on static tasks and 21.4% in agentic environments. The approach points toward more modular and efficient agent development, where core reasoning models can remain fixed while their capabilities are extended through external, structured experience.