Openclaw proved that agents can do things; Hermes Agent proved that agents can remember and learn. — someone on the Internet
I first heard about this when Hermes-Agent started gaining momentum, but never looked into it. On its face, Hermes-Agent creates and patches skills as the conversation goes.
On the last day of my vacation, I decided to open its codebase and take a look at how this is achieved.
Where Are Skills Created?
This blog assumes some knowledge of the basic components of an AI agent. If you need a refresher, consider reading my earlier blog and its corresponding project to pick up the key concepts.
If you’ve ever used Hermes-Agent, you may have noticed skills getting created while chatting with the agent.
My first guess was that there is a tool registered for skill creation, or that the skill toolset has some magic system prompt. But it turns out skills are created as part of the chat loop, gated by a counter.
# run_agent.py
class AIAgent:
    def run_conversation(self, ...):
        # thousands of lines of code ...
        _should_review_skills = False
        if (self._skill_nudge_interval > 0
                and self._iters_since_skill >= self._skill_nudge_interval
                and "skill_manage" in self.valid_tool_names):
            _should_review_skills = True
            self._iters_since_skill = 0
        if final_response and not interrupted and (_should_review_memory or _should_review_skills):
            try:
                self._spawn_background_review(
                    messages_snapshot=list(messages),
                    review_memory=_should_review_memory,
                    review_skills=_should_review_skills,
                )
            except Exception:
                pass
Basically: at the end of the chat loop, every N iterations, review the conversation and update skills. And as you might have noticed, memory updates follow the same pattern.
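The gating pattern itself is simple enough to isolate. Here is a minimal, self-contained sketch of an interval-gated nudge counter; the class and names are mine, not Hermes-Agent's:

```python
# Minimal sketch of the interval-gating pattern (hypothetical names,
# not the actual Hermes-Agent implementation).
class NudgeCounter:
    def __init__(self, interval: int):
        self.interval = interval  # 0 (or negative) disables the nudge
        self.count = 0

    def tick(self) -> bool:
        """Return True every `interval` iterations, then reset."""
        if self.interval <= 0:
            return False
        self.count += 1
        if self.count >= self.interval:
            self.count = 0
            return True
        return False

counter = NudgeCounter(interval=3)
fires = [counter.tick() for _ in range(7)]
# fires == [False, False, True, False, False, True, False]
```

The counter resets on every fire, so a long session triggers reviews at a steady cadence rather than once.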
How Are Skills Created?
Short answer: a forked agent, similar to what Claude Code has, although I don't know who implemented this first.
def _spawn_background_review(
    self,
    messages_snapshot: List[Dict],
    review_memory: bool = False,
    review_skills: bool = False,
) -> None:
    # ...
    if review_memory and review_skills:
        prompt = self._COMBINED_REVIEW_PROMPT
    elif review_memory:
        prompt = self._MEMORY_REVIEW_PROMPT
    else:
        prompt = self._SKILL_REVIEW_PROMPT
    # ...
    review_agent = AIAgent(
        model=self.model,
        max_iterations=16,
        quiet_mode=True,
        platform=self.platform,
        provider=self.provider,
        api_mode=_parent_runtime.get("api_mode") or None,
        base_url=_parent_runtime.get("base_url") or None,
        api_key=_parent_runtime.get("api_key") or None,
        credential_pool=getattr(self, "_credential_pool", None),
        parent_session_id=self.session_id,
        enabled_toolsets=["memory", "skills"],
    )
    review_agent._memory_write_origin = "background_review"
    review_agent._memory_write_context = "background_review"
    review_agent.run_conversation(
        user_message=prompt,
        conversation_history=messages_snapshot,
    )
- Restricted toolset. The review agent can only use the memory and skills tools.
- Provenance tag. _memory_write_origin = "background_review" is a ContextVar that flows through every tool call, which tells the skill_manage tool to mark the created skill as agent-owned.
And the prompt is huge, so I’ll only pull out the essential pieces:
Review the conversation above and update the skill library. Be ACTIVE — most sessions produce at least one skill update, even if small. A pass that does nothing is a missed learning opportunity, not a neutral outcome.
Signals to look for (any one of these warrants action):
• User corrected your style, tone, format, legibility, or verbosity.
• User corrected your workflow, approach, or sequence of steps.
• Non-trivial technique, fix, workaround, debugging path, or tool-usage pattern emerged that a future session would benefit from.
• A skill that got loaded or consulted this session turned out to be wrong, missing a step, or outdated. Patch it NOW.
Preference order — prefer the earliest action that fits, but do pick one when a signal above fired:
1. UPDATE A CURRENTLY-LOADED SKILL.
2. UPDATE AN EXISTING UMBRELLA.
3. ADD A SUPPORT FILE under an existing umbrella.
4. CREATE A NEW CLASS-LEVEL UMBRELLA SKILL when no existing skill covers the class. The name MUST be at the class level. The name MUST NOT be a specific PR number, error string, feature codename, library-alone name, or 'fix-X / debug-Y / audit-Z-today' session artifact. If the proposed name only makes sense for today's task, it's wrong — fall back to (1), (2), or (3).
'Nothing to save.' is a real option but should NOT be the default.
Who Created This Skill?
Skills the user asked to create are fundamentally different from skills that grow out of background review. The provenance system distinguishes the two.
# tools/skill_provenance.py
_write_origin: contextvars.ContextVar[str] = contextvars.ContextVar(
    "skill_write_origin",
    default="foreground",
)
BACKGROUND_REVIEW = "background_review"
When skill_manage(action="create") runs inside the background review fork, it checks this context:
# tools/skill_manager_tool
from tools.skill_provenance import is_background_review

if is_background_review():
    mark_agent_created(name)
Two provenance classes: skills written during background_review are agent-owned; everything else is user-owned.
What If It’s a Bad Skill?
The skill library could bloat in the blink of an eye, and skills created through background review are not necessarily ones the user actually needs. The curator is a separate, idle-triggered background task that manages this, somewhat like a garbage collector for skills.
# agent/curator.py
DEFAULT_INTERVAL_HOURS = 24 * 7 # runs every 7 days
DEFAULT_MIN_IDLE_HOURS = 2 # only when agent is idle
DEFAULT_STALE_AFTER_DAYS = 30 # stale after 30 days
DEFAULT_ARCHIVE_AFTER_DAYS = 90 # archived after 90 days
The curator only touches agent-created skills. It runs an auxiliary LLM that reviews the skill collection. Transitions:
active → stale (30 days no activity) → archived (90 days)
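The lifecycle can be sketched as a small state machine. This is my reconstruction from the constants above, assuming "activity" means the skill was loaded or edited (an assumption; I have not verified this against agent/curator.py):

```python
# Hypothetical sketch of the curator's lifecycle transitions
# (state names and thresholds from the blog; logic is my guess).
from datetime import datetime, timedelta

STALE_AFTER_DAYS = 30
ARCHIVE_AFTER_DAYS = 90

def next_state(state: str, last_activity: datetime, now: datetime) -> str:
    idle_days = (now - last_activity).days
    if state == "active" and idle_days >= STALE_AFTER_DAYS:
        return "stale"
    if state == "stale" and idle_days >= ARCHIVE_AFTER_DAYS:
        return "archived"
    return state

now = datetime(2026, 4, 1)
assert next_state("active", now - timedelta(days=45), now) == "stale"
assert next_state("stale", now - timedelta(days=100), now) == "archived"
assert next_state("active", now - timedelta(days=5), now) == "active"
```

Note a skill passes through stale before it can be archived; a single long idle period is not enough to jump straight from active to archived in this sketch.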
The Trajectory System
Hermes-Agent also has a trajectory system. trajectory_compressor.py and the save_trajectories flag collect conversation traces, compress them within token budgets, and output JSONL files for RL fine-tuning via Tinker-Atropos.
I didn't plan to cover the trajectory system here, but it's worth mentioning as the other evolution path: model improvement through training data. It's orthogonal to the skill system; one happens at the model level, the other at the harness level.
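For intuition, JSONL output for training pipelines usually means one JSON object per line. The record shape below is purely illustrative; I haven't inspected trajectory_compressor.py's actual schema:

```python
# Illustrative sketch of writing trajectory records as JSONL
# (field names are my guesses, not the real schema).
import json

def write_trajectories(records: list[dict], path: str) -> None:
    with open(path, "w") as f:
        for rec in records:
            f.write(json.dumps(rec) + "\n")  # one JSON object per line

records = [
    {"messages": [{"role": "user", "content": "fix the build"}],
     "reward": 1.0, "tokens": 812},
]
write_trajectories(records, "trajectories.jsonl")
```

The one-object-per-line format is what makes these files easy to stream into a trainer without loading everything into memory.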
Brief
Every few turns of the chat loop, the agent forks itself as a background job to review the conversation so far and add or patch skills. Skills created in the background are marked as "agent-owned", and a curator system reviews and manages them during off-peak hours, marking them active, stale, or archived.
Related

Learn From Claude Code: Agent Spawning (Apr 2026)
Learning Claude Code's agent spawning by inspecting its leaked source code.

Learn From Claude Code: App State Machine (Apr 2026)
Learning Claude Code's App State by inspecting its leaked source code.

Learn From Claude Code: MCP Integration (Apr 2026)
Anthropic invented MCP a year ago. Now they're experimenting with skill-ifying it. Here's what the leaked source reveals about the pivot.

Learn From Claude Code: Memory (Apr 2026)
Learning Claude Code's memory system by inspecting its leaked source code.