Researchers at Huawei have introduced a novel approach to building artificial intelligence agents that continuously refine their problem-solving abilities by treating skills as dynamic, experience-rich resources rather than static tools.

The framework, called MUSE-Autoskill Agent, reimagines how large language model based agents acquire and deploy capabilities. Instead of viewing skills as one-time creations, the system manages skills through a structured lifecycle that includes creation, memory storage, intelligent selection, evaluation, and iterative refinement. This lifecycle approach transforms skills into assets that accumulate experience and improve over time.

How the System Works

According to research published on arXiv by Huawei Lin, Peng Li, Jie Song, Fuxin Jiang, and Tieying Zhang, the architecture introduces several novel capabilities. Agents can create new skills on demand when encountering unfamiliar problems, then store these skills in a shared repository for reuse across different tasks and even between different agents. The system organizes and retrieves skills efficiently, selecting the most relevant ones for each challenge.

A key innovation is the integration of skill-level memory that tracks how each skill performs across multiple tasks. This memory mechanism enables agents to learn which skills work best in which contexts, adapting and refining approaches based on accumulated evidence rather than relying on isolated experiences.

Validation and Impact

  • Skills are evaluated through automated unit tests and real-time feedback during execution
  • The framework enables cross-agent skill transfer, allowing one agent's learnings to benefit others
  • Experiments demonstrate improvements in task success rates and operational efficiency
  • Reusability metrics show enhanced skill deployment across diverse problem domains

The research team tested their framework using SkillsBench, a benchmark designed to measure agent performance across complex, multi-step tasks. Results indicated that skills managed through this lifecycle approach consistently outperformed approaches treating skills as static artifacts.

Why This Matters

Current AI agents often struggle with scalability and long-term improvement because they lack mechanisms to learn from experience systematically. By creating skills that remember their history and adapt based on feedback, MUSE-Autoskill addresses a fundamental limitation in how AI systems currently operate.

The framework's emphasis on testability is particularly significant. By implementing unit tests for individual skills and monitoring runtime performance, the system provides quality assurance mechanisms that could increase reliability in real-world deployments.

The ability to transfer skills between agents also opens possibilities for collaborative learning environments where improvements made by one system benefit others, potentially accelerating overall capability development across multiple AI systems.

This research contributes to a broader shift in AI development toward agents that operate more like evolving systems than static applications. As language models become more central to enterprise workflows, frameworks that enable continuous improvement and systematic knowledge reuse could prove essential for practical deployment at scale.