Trace2Skill: Transferable AI Agent Skills from Trajectories

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

In the rapidly evolving field of artificial intelligence, equipping Large Language Model (LLM) agents with domain-specific skills is essential for addressing complex tasks efficiently. Traditional methods of manually authoring these skills have become a significant scalability bottleneck. Furthermore, automated approaches often produce fragile or fragmented results, largely due to their reliance on shallow parametric knowledge or the tendency to overfit to non-generalizable trajectory-local lessons. To tackle these challenges, researchers have introduced Trace2Skill, a novel framework designed to emulate the way human experts develop skills.

Understanding Trace2Skill

Trace2Skill is a comprehensive framework that moves beyond the limitations of existing methods by analyzing broad execution experiences holistically. Instead of responding sequentially to individual trajectories, the framework employs a parallel fleet of sub-agents tasked with examining a diverse array of executions. This method allows for a more in-depth extraction of trajectory-specific lessons, which are then hierarchically consolidated into a unified and conflict-free skill directory through inductive reasoning.

Key Features of Trace2Skill

Holistic Analysis: Trace2Skill mirrors expert human authors by evaluating the overall execution experience before distilling it into a cohesive guide.
Parallel Processing: The framework utilizes multiple sub-agents to analyze various trajectories simultaneously, enhancing the richness of the extracted lessons.
Hierarchical Consolidation: Lessons learned are organized into a structured skill directory, ensuring clarity and usability for LLM agents.
Adaptability: Trace2Skill is capable of deepening existing human-written skills as well as generating new skills from scratch, making it versatile for various applications.

Experimental Results

Extensive experiments conducted in challenging domains such as spreadsheet manipulation, VisionQA, and mathematical reasoning demonstrate that Trace2Skill significantly outperforms strong baseline models, including Anthropic’s official xlsx skills. The results are particularly striking; for instance, skills developed by the Qwen3.5-35B model on its own trajectories yielded improvements of up to 57.65 absolute percentage points for the Qwen3.5-122B agent on the WikiTableQuestions dataset.

Implications for AI Development

One of the most noteworthy aspects of Trace2Skill is its ability to create transferable skills that do not simply memorize task instances or become tailored to specific model quirks. The skills evolved through this framework show remarkable scalability across different LLM architectures and demonstrate generalizability to out-of-distribution (OOD) settings. This characteristic opens the door for broader applications of LLM agents in various real-world scenarios without the need for extensive parameter updates or external retrieval modules.

Conclusion

In summary, Trace2Skill represents a significant advancement in the field of AI skill generation. By packaging complex agent experiences into highly transferable and declarative skills, the framework simplifies the process of enhancing LLM capabilities. With open-source models as small as 35B parameters, Trace2Skill sets a new standard for the future of AI, paving the way for more robust and adaptable intelligent systems.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

Trace2Skill: Transferable AI Agent Skills from Trajectories

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Understanding Trace2Skill

Key Features of Trace2Skill

Experimental Results

Implications for AI Development

Conclusion

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related