Personal Agent Ontology (PAO): Mapping the Shared Architecture of AI Agents

A formal OWL 2 ontology with 115 classes, 173 properties, and 75 named individuals reveals the converging domain model behind every personal AI agent β€” from Claude Code to OpenClaw to Letta


Every personal AI agent β€” OpenClaw, Claude Code, LangGraph, CrewAI, Letta β€” makes the same design choices: sessions, memory tiers, tool invocations, permission policies. Then each invents its own vocabulary. A "session" in one system is a "thread" in another. "Memory" means three different things depending on who you ask. The architecture is converging, but the language is not.

The Personal Agent Ontology (PAO) is a formal OWL 2 DL ontology that maps this converging domain. Created by Michel at Mepuka, it captures the shared structure that every agent implementation assumes β€” not as code, but as a domain model. It draws from academic research (cognitive architectures, BDI models, dialog act theory) and real implementations (Claude Code's session model, Letta's tiered memory, MCP's capability discovery).

115 classes · 173 properties · 75 named individuals · 9 modules

The ontology is available as browsable HTML documentation and, in OWL 2 DL form, as Turtle, JSON-LD, and OWL/XML serializations at purl.org/pao/.

What Is a Personal Agent Ontology?

An ontology, in the formal knowledge engineering sense, is a structured vocabulary that defines the concepts in a domain and the relationships between them. It's not code β€” it's a map. PAO is written in OWL 2 DL, the description-logic profile of the W3C Web Ontology Language, the same formal framework used for biomedical knowledge bases (the OBO Foundry ontologies), industrial standards, and the semantic web.

PAO answers a specific question: what concepts does every personal AI agent implementation assume? Not how to build an agent, but what the domain contains. When you look across OpenClaw, Claude Code, Letta, CrewAI, and LangGraph, you find the same entities β€” sessions, memory tiers, tool invocations, permission policies, context windows β€” described with different vocabulary.

πŸ’‘ Why OWL 2 DL?

OWL 2 DL provides decidable reasoning β€” you can computationally verify that your ontology is internally consistent, that class hierarchies are sound, and that no contradictions exist. This is the same foundation that biomedical ontologies (SNOMED CT, Gene Ontology) use to maintain rigorous, machine-verifiable knowledge structures.

PAO aligns every top-level class to BFO 2020 (Basic Formal Ontology, ISO 21838-2), the same upper ontology used by the OBO Foundry's biomedical ontologies. This means an AIAgent is classified as a Generically Dependent Continuant β€” an information entity that depends on physical substrate (a server, a process) for its existence. A HumanUser is an Object. The distinction sounds academic until you reason about deployment migration: the agent persists across servers because it is information, not matter.

The 9 Modules

PAO organizes its 115 classes into 9 modules, each covering a distinct architectural concern that appears across every agent implementation:

πŸ€– Identity & Agents

Agent, AIAgent, HumanUser, SubAgent, AgentRole, Persona, Organization

πŸ’¬ Conversation

Conversation, Session, Turn, Message, ContentBlock, ContextWindow, CompactionEvent

🧠 Memory

WorkingMemory, EpisodicMemory, SemanticMemory, ProceduralMemory, Encoding, Retrieval, Consolidation, Forgetting

🎯 Goals, Plans & Tasks

Belief, Desire, Intention, Goal, Plan, Task, Deliberation (BDI model)

πŸ›‘οΈ Governance & Safety

PermissionPolicy, SafetyConstraint, ConsentRecord, RetentionPolicy, AuditLog

πŸ”Œ External Integration

ExternalService, ToolDefinition, CapabilityDiscovery, MCP alignment

πŸ”§ Error Recovery

ErrorRecoveryEvent, RetryAttempt, ReplanEvent, RollbackEvent, Checkpoint

🏷️ Model Identity

FoundationModel, ModelProvider, ModelDeployment, GenerationConfiguration

⏰ Scheduling

Schedule, RecurrencePattern, Trigger, ScheduledExecution, ConcurrencyPolicy

Identity & Agents

The identity module starts with the simplest question: what is a personal AI agent? Agent is the base class, splitting into AIAgent and HumanUser. The distinction matters because their properties differ β€” an AI agent has a Persona (system prompt, behavioral configuration), belongs to a ModelDeployment, and can spawn a SubAgent. A human user has none of these.

SubAgent is a subclass of AIAgent, not a separate type. It has everything an agent has, including its own session, memory, and tool access, plus a delegation relationship to its parent. This directly models systems like Claude Code, where sub-agents spun up for background tasks need the same architectural scaffolding as the primary agent.

Three concepts that most implementations leave implicit get first-class treatment: AgentRole (the same agent can act as researcher, coder, or reviewer), Organization (agents belong to tenants), and Persona (the behavioral configuration that makes an agent act like "a helpful assistant" or "a senior engineer").

The Conversation Stack

Most agent systems model conversation as "a list of messages." PAO exposes five distinct layers, each with its own architectural concerns:

  • Conversation β€” spans multiple sessions, holds long-running thread identity and participants
  • Session β€” a bounded period of interaction with status transitions (active, suspended, completed), owns the ContextWindow
  • Turn β€” records who spoke and links to any ToolInvocations
  • Message β€” the actual content within a turn
  • ContentBlock β€” text, code, images, or tool calls within a message

The most interesting piece is context overflow. In PAO, ContextWindow is a first-class entity with token capacity, current usage, and a CompactionDisposition β€” the policy that determines what happens when the window fills. A CompactionEvent records that tokens were reclaimed, how many, and what was lost. Most architectures bury this as an implementation detail. PAO elevates it to an architectural concern because memory, tool tracing, and conversation continuity all degrade when context compacts.
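A minimal sketch of this policy, using hypothetical Python dataclasses (field names and the halve-on-overflow disposition are assumptions, not PAO property names):

```python
from dataclasses import dataclass, field

@dataclass
class CompactionEvent:
    tokens_reclaimed: int
    summary: str  # what replaced the evicted turns

@dataclass
class ContextWindow:
    capacity: int
    used: int = 0
    events: list = field(default_factory=list)

    def admit(self, tokens, compact_to=0.5):
        """Admit new tokens; if the window would overflow, apply the
        compaction disposition and record a CompactionEvent."""
        if self.used + tokens > self.capacity:
            target = int(self.capacity * compact_to)
            self.events.append(
                CompactionEvent(self.used - target, "summary of evicted turns"))
            self.used = target
        self.used += tokens

w = ContextWindow(capacity=1000)
for _ in range(5):
    w.admit(300)
print(w.used, len(w.events))  # 800 2
```

Making the event a first-class record, rather than silently shrinking the window, is what lets memory and tool tracing account for what was lost.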

The conversation module also includes a pragmatics layer drawn from DIT++ dialog act theory: DialogAct, CommunicativeFunction, CommonGround, and GroundingAct. These model why an utterance was made (requesting information, confirming understanding), not just what was said.

Channels add another dimension. The same agent might interact through CLI, Discord, web chat, email, or REST API. PAO models six channel types through a closed enumeration (owl:oneOf), which means a reasoner can verify that every channel has a recognized type. This matters for governance β€” a permission policy might allow tool execution through CLI but block it through email.
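In code, a closed enumeration behaves like a fixed membership test: anything outside the set is invalid by definition, not merely unknown. The channel individuals below are illustrative; see the ontology for the actual six:

```python
# Sketch of a closed channel enumeration in the spirit of owl:oneOf.
# The class is defined as exactly these individuals, so a reasoner (or
# this membership check) can reject anything else outright.
CHANNEL_TYPE = frozenset({
    "pao:CLIChannel", "pao:DiscordChannel", "pao:WebChatChannel",
    "pao:EmailChannel", "pao:RESTAPIChannel", "pao:VoiceChannel",
})

def validate_channel(individual):
    return individual in CHANNEL_TYPE

print(validate_channel("pao:EmailChannel"))  # True
print(validate_channel("pao:SMSChannel"))    # False: closed enumeration
```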

Memory Architecture

Agent memory looks deceptively simple from the outside: store things, retrieve things. PAO exposes an architecture closer to cognitive science than database engineering, drawing heavily from Letta (formerly MemGPT), the CoALA framework, and LangMem.

Letta's core insight, from the MemGPT paper, is that agent memory should work like an operating system's virtual memory: the LLM gets a fixed context window (analogous to RAM), and a tiered storage system pages data in and out. The LLM itself manages the paging.

PAO generalizes this into four memory tiers:

| Tier | What It Stores | Example |
| --- | --- | --- |
| Working Memory | Current context β€” volatile, bounded by the context window | Active conversation turns |
| Episodic Memory | Records of specific interactions | "What did we do last Tuesday?" |
| Semantic Memory | General knowledge extracted from episodes | "User prefers dark mode" (a Claim with confidence) |
| Procedural Memory | How to do things β€” learned patterns, workflows | Tool usage sequences, workflow templates |

The operations between tiers matter more than the tiers themselves:

  • Encoding β€” moves information from working memory into storage
  • Retrieval β€” pulls information back into working memory
  • Consolidation β€” transforms episodic memories into semantic knowledge
  • Forgetting β€” removes items by policy or by request (privacy-aware deletion)
  • Rehearsal β€” strengthens frequently accessed items

Each operation is a first-class process with provenance tracking via PROV-O derivation chains. When a Claim lives in semantic memory, PAO tracks which episode, turn, and message produced the evidence. If a user requests data deletion, the system can follow provenance chains to find every derived memory item.
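The backward trace can be sketched with plain derivation edges. Identifiers here are hypothetical, and a real system would store these as PROV-O triples (e.g. with rdflib) rather than a dict:

```python
# Each entry reads: derived item -> what it was derived from
# (the spirit of prov:wasDerivedFrom, pointing from claim back to evidence).
DERIVED_FROM = {
    "claim:prefers-dark-mode": "episode:42",
    "episode:42": "turn:7",
    "turn:7": "msg:user-123",
}

def provenance(item):
    """Follow derivation edges from a semantic-memory claim back to
    the message that produced the evidence."""
    chain = [item]
    while chain[-1] in DERIVED_FROM:
        chain.append(DERIVED_FROM[chain[-1]])
    return chain

print(provenance("claim:prefers-dark-mode"))
# ['claim:prefers-dark-mode', 'episode:42', 'turn:7', 'msg:user-123']
```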

⚠️ The Hardest Unsolved Problem

Memory remains the most challenging problem in agent architecture. Every framework tackles it differently: Letta with self-directed virtual context, Mem0 with automatic fact extraction, Zep with temporal knowledge graphs. The write-back problem is especially acute β€” allowing an agent to write "facts" into long-term memory without rigorous validation creates self-reinforcing errors. PAO maps the full design space rather than solving this, so builders can see the choices they're making and the ones they're ignoring.

Goals, Plans & Tasks (BDI Model)

This module borrows from the Belief-Desire-Intention (BDI) model in philosophy of mind. An agent holds:

  • Beliefs β€” what it thinks is true about the world
  • Desires β€” what it wants to achieve
  • Intentions β€” what it commits to doing

Goals decompose into Plans, plans contain Tasks with dependencies and status. Deliberation is modeled explicitly: the act of choosing which goals to pursue and which plans to adopt. BDI gives agents a vocabulary for why they act, not just what they do.

The BDI model has deep roots in AI research, originating from Michael Bratman's philosophical work on practical reasoning and formalized by Rao and Georgeff in the 1990s. PAO adapts these concepts for the LLM agent era, where "beliefs" include the agent's context about user preferences and environmental state.
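One deliberation step can be sketched as follows. The class names echo PAO's Belief/Desire/Intention vocabulary, but the selection logic (priority plus a believed precondition) is illustrative, not something the ontology prescribes:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Desire:
    goal: str
    precondition: str  # a belief that must hold before committing
    priority: int

def deliberate(beliefs, desires) -> Optional[str]:
    """Commit to the highest-priority feasible desire; the returned
    goal is the agent's Intention."""
    feasible = [d for d in desires if d.precondition in beliefs]
    if not feasible:
        return None
    return max(feasible, key=lambda d: d.priority).goal

beliefs = {"tests-exist", "repo-cloned"}
desires = [
    Desire("run-test-suite", "tests-exist", priority=2),
    Desire("deploy", "ci-green", priority=3),  # infeasible: belief missing
    Desire("lint", "repo-cloned", priority=1),
]
print(deliberate(beliefs, desires))  # run-test-suite
```

The highest-priority desire (deploy) loses here because the agent lacks the supporting belief, which is exactly the distinction between desire and intention.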

Governance & Safety

PAO treats governance as architecture, not an afterthought. A PermissionPolicy governs what tools an agent can use. A SafetyConstraint defines non-negotiable behavioral limits. ConsentRecord and RetentionPolicy handle privacy. AuditLog and AuditEntry record every authorization decision.

The ontology aligns to ODRL (the W3C Open Digital Rights Language) for policy representation. This matters because governance policies need to be machine-verifiable. A reasoner can check whether a given tool invocation is permitted under the active policies β€” not just at code review time, but at runtime.
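A runtime check in this spirit can be sketched as a deny-overrides evaluation. The rule fields and evaluation order below are assumptions for illustration, not ODRL's actual vocabulary:

```python
from dataclasses import dataclass

@dataclass
class Rule:
    action: str   # e.g. "invoke-tool"
    target: str   # e.g. a tool name
    channel: str  # constraint on the delivery channel, "*" for any
    effect: str   # "permit" or "deny"

def is_permitted(rules, action, target, channel):
    """Deny-overrides: any matching deny wins; otherwise a matching
    permit is required (default deny)."""
    matches = [r for r in rules
               if r.action == action and r.target == target
               and r.channel in (channel, "*")]
    if any(r.effect == "deny" for r in matches):
        return False
    return any(r.effect == "permit" for r in matches)

rules = [
    Rule("invoke-tool", "shell", "*", "permit"),
    Rule("invoke-tool", "shell", "email", "deny"),
]
print(is_permitted(rules, "invoke-tool", "shell", "cli"))    # True
print(is_permitted(rules, "invoke-tool", "shell", "email"))  # False
```

This mirrors the channel example above: the same tool is permitted through CLI but blocked through email.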

External Services & Integration

This module covers how agents connect to the outside world. An ExternalService exposes capabilities (tool capabilities, resource capabilities, prompt capabilities) discovered through CapabilityDiscoveryEvent.

This module aligns closely with the Model Context Protocol (MCP), modeling the same connection lifecycle: discovery, authentication, invocation, disconnection. MCP has emerged as the standard for tool and resource discovery in agent systems, and PAO provides the formal domain model that MCP's protocol-level specification doesn't cover.

Error Recovery & Observability

Things go wrong. PAO classifies failures through ErrorRecoveryEvent (timeout, authentication, rate limiting) and tracks recovery strategies: RetryAttempt, ReplanEvent, RollbackEvent. Checkpoint and CheckpointDecision capture the agent's state before risky operations. OperationalMetric and MetricObservation provide observability hooks.

Model Identity

This module tracks which FoundationModel, from which ModelProvider, through which ModelDeployment, produced each conversation turn. GenerationConfiguration captures temperature, top-p, and max tokens β€” the parameters that affect output quality. This is essential for reproducibility and auditing.

Scheduling & Automation

This module handles recurring and triggered agent tasks. A Schedule binds a RecurrencePattern to an action. Triggers fire on cron expressions, intervals, or external events. ScheduledExecution tracks each run's outcome. ConcurrencyPolicy determines what happens when a new execution overlaps with one still running.
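The overlap decision can be sketched as a small policy table; the policy names here are illustrative, not taken from the ontology:

```python
from enum import Enum

class Concurrency(Enum):
    SKIP = "skip"          # drop the new run
    QUEUE = "queue"        # start it after the current run finishes
    PARALLEL = "parallel"  # run both at once

def on_trigger(running, policy):
    """Decide what a new trigger does given whether a run is active."""
    if not running:
        return "start"
    return {Concurrency.SKIP: "skip",
            Concurrency.QUEUE: "enqueue",
            Concurrency.PARALLEL: "start"}[policy]

print(on_trigger(True, Concurrency.QUEUE))    # enqueue
print(on_trigger(False, Concurrency.SKIP))    # start
```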

Academic Foundations

PAO doesn't exist in a vacuum. It draws from several established research traditions:

| Foundation | Contribution to PAO | Reference |
| --- | --- | --- |
| BFO 2020 | Upper ontology alignment (ISO 21838-2) | basic-formal-ontology.org |
| BDI Model | Goal/intention reasoning (Bratman, Rao & Georgeff) | Philosophy of mind β†’ agent planning |
| CoALA Framework | Cognitive architecture for language agents | arxiv.org/abs/2309.02427 |
| MemGPT / Letta | Tiered memory with LLM-directed paging | arxiv.org/abs/2310.08560 |
| DIT++ Dialog Acts | Pragmatics layer for conversation modeling | ISO 24617-2 |
| PROV-O | Provenance tracking for memory derivation chains | W3C PROV-O |
| ODRL | Machine-verifiable permission policies | W3C ODRL |

The CoALA framework from Princeton (Sumers et al., 2023) is particularly influential. It describes language agents with modular memory components, a structured action space, and a generalized decision-making process. PAO takes CoALA's conceptual framework and renders it in formal OWL 2, making the concepts machine-verifiable rather than just descriptively useful.

Real Implementations Mapped by PAO

PAO's value becomes clear when you see how it maps across real agent systems:

| PAO Concept | Claude Code | OpenClaw | Letta | CrewAI |
| --- | --- | --- | --- | --- |
| Session | Session | Session | Agent State | Task Execution |
| SubAgent | Background task agent | Subagent | β€” | Agent in Crew |
| WorkingMemory | Context window | Context window | Core Memory | Task context |
| SemanticMemory | CLAUDE.md | SOUL.md / AGENTS.md | Archival Memory | Long-term memory |
| ToolInvocation | Tool use blocks | Tool calls | Function calls | Tool execution |
| CompactionEvent | Context compaction | Context management | Recursive summarization | β€” |
| PermissionPolicy | Tool permissions | Tool policies | Sandbox rules | Agent delegation rules |

The table illustrates PAO's core thesis: the architecture converges even when the vocabulary doesn't. Every implementation has sessions, memory tiers, tool invocations, and permission policies β€” they just call them different things.

Why This Matters

PAO is descriptive, not prescriptive. It maps what the domain contains, not how to build an agent. That matters because the domain is stabilizing.

Here's what a formal ontology gives you that documentation and code can't:

  • Shared vocabulary β€” When a team discusses "memory," they can reference specific PAO classes (WorkingMemory vs. EpisodicMemory vs. SemanticMemory) instead of talking past each other
  • Completeness checking β€” When you design your agent's memory system, you can check whether you've accounted for consolidation and forgetting, not just storage and retrieval
  • Machine reasoning β€” OWL 2 DL supports decidable reasoning, meaning tools can verify your agent model for consistency automatically
  • Interoperability β€” A standard vocabulary enables different agent frameworks to exchange information about their capabilities and state
  • Governance auditing β€” Every tool invocation can be traced through permission policies and audit logs using formal relationships
"The ontology makes the convergence visible. It gives agent builders a shared map β€” not to constrain design, but to clarify it."

βœ… Practical Example

Consider designing an agent's privacy compliance. Without PAO, you might implement data deletion for chat messages and think you're done. PAO's memory model shows you also need to trace Consolidation operations β€” claims in semantic memory that were derived from those messages. The provenance chain (via PROV-O) means every downstream knowledge artifact must also be identified and potentially deleted.
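That deletion sweep amounts to a transitive walk over derivation edges, sketched here with hypothetical identifiers (a real system would query its PROV-O triples instead of a list of pairs):

```python
# Each pair reads: (derived item, source it was derived from).
DERIVED_FROM = [
    ("episode:42", "msg:user-123"),
    ("claim:prefers-dark-mode", "episode:42"),
    ("workflow:dark-mode-setup", "claim:prefers-dark-mode"),
]

def downstream(source):
    """Transitive closure of everything derived, directly or indirectly,
    from the given source β€” the full set that a deletion must cover."""
    found, frontier = set(), {source}
    while frontier:
        nxt = {d for d, s in DERIVED_FROM if s in frontier}
        frontier = nxt - found
        found |= nxt
    return found

print(sorted(downstream("msg:user-123")))
# ['claim:prefers-dark-mode', 'episode:42', 'workflow:dark-mode-setup']
```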

Getting Started

PAO is freely available and ready to explore:

  1. Browse the documentation β€” mepuka.com/pao/ provides interactive HTML documentation of all 115 classes, 173 properties, and 75 named individuals
  2. Read the companion blog post β€” mepuka.com/blog/personal-agent-ontology/ walks through each module with architectural context
  3. Download the ontology β€” Available at purl.org/pao/ in Turtle, JSON-LD, and OWL/XML formats
  4. Load it into ProtΓ©gΓ© β€” The standard ontology editor lets you explore class hierarchies, run reasoners, and extend the ontology for your use case
  5. Map your own agent β€” Take your agent's architecture and identify which PAO classes correspond to your components. The gaps are where you may be missing architectural concerns
# Quick start: download and explore with Python
pip install rdflib
python3 -c "
from rdflib import Graph, RDF, OWL
g = Graph()
g.parse('https://purl.org/pao/', format='turtle')
print(f'Classes: {len(list(g.subjects(predicate=RDF.type, object=OWL.Class)))}')
print(f'Properties: {len(list(g.subjects(predicate=RDF.type, object=OWL.ObjectProperty)))}')
"

Pros & Cons

βœ… Strengths

  • First comprehensive formal model of the personal agent domain
  • Grounded in established standards (BFO, PROV-O, ODRL)
  • Covers 9 architectural concerns most implementations leave implicit
  • Machine-verifiable via OWL 2 DL reasoning
  • Maps real implementations (Claude Code, Letta, OpenClaw, MCP)
  • Memory model captures operations, not just storage
  • Provenance tracking enables privacy compliance
  • Free, open, and available in multiple serialization formats

⚠️ Limitations

  • Descriptive, not prescriptive β€” doesn't tell you how to build
  • Ontology engineering has a steep learning curve (OWL, ProtΓ©gΓ©)
  • No runtime enforcement β€” requires tooling to operationalize
  • Agent domain is still evolving rapidly; ontology may need frequent updates
  • Limited adoption so far β€” value increases with community use
  • Some modules (ProceduralMemory) are less well-modeled in practice

References

  1. Mepuka β€” Mapping the Domain of a Personal AI Agent (2026)
  2. Personal Agent Ontology β€” HTML Documentation
  3. PAO Persistent URL β€” OWL 2 DL Downloads
  4. Sumers et al. β€” Cognitive Architectures for Language Agents (CoALA) (2023)
  5. Packer et al. β€” MemGPT: Towards LLMs as Operating Systems (2023)
  6. Basic Formal Ontology (BFO 2020) β€” ISO 21838-2
  7. Model Context Protocol (MCP) β€” Anthropic
  8. Claude Code β€” Anthropic Documentation
  9. OpenClaw β€” GitHub Repository
  10. Letta (formerly MemGPT) β€” GitHub Repository
  11. CrewAI β€” GitHub Repository
  12. LangGraph β€” GitHub Repository
  13. LangMem β€” Cognitive Memory Framework
  14. W3C PROV-O β€” Provenance Ontology
  15. W3C ODRL β€” Open Digital Rights Language