SkillOpt: Self-Evolving Agent Skills
Microsoft's new system lets AI agents rewrite their own tools when the world changes.
AI agents are brittle. They break the moment a database schema changes, an API updates, or an environment drifts. That is not a bug — it is a fundamental design flaw. All their tool-handling logic is hardcoded, and when the world moves, the agent falls over.
Microsoft Research published SkillOpt in May 2026 to fix this. The core idea is simple but radical: the agent does not just use tools, it improves them. After every failed execution, SkillOpt runs a "text-space optimizer" that analyzes what went wrong and rewrites the skill's code to handle the new environment. No retraining, no human in the loop.
The four-stage loop is the heart of the system. First, the agent executes a skill. If it fails, it collects feedback from the environment. It then runs the optimizer, which edits the skill's source code directly. Finally, it saves the improved version as a new external package. Next time, the improved skill loads instantly with zero runtime overhead.
This is a genuine shift in how we think about AI agents. Instead of agents that need constant babysitting when environments drift, SkillOpt moves toward infrastructure that can self-repair. The agentic web — where AI systems manage real-world infrastructure without breaking — is becoming more plausible, one self-rewritten skill at a time.