The AI Agent Risk Taxonomy: 12 Risks Every Enterprise Should Track

April 19, 2026 · 11 min read
AI agents need a risk taxonomy for a simple reason: they do not only generate
text. They can use tools, access data, remember context, call APIs, delegate
work, and trigger business processes.
That changes the enterprise governance question from:
Is the model output acceptable?
To:
What was the agent allowed to do, what did it actually do, which control
failed, who approved the risk, and can we prove it?
This article starts with a practical 12-risk taxonomy. Then it expands that list
into a fuller enterprise view grouped by access control, tool misuse,
governance, privacy, output quality, agent behavior, and reliability.
The important part is source discipline. Some risks come directly from
recognized frameworks, regulations, or standards. Others are risk indicators
from security research, AI safety literature, or incident patterns. Those are
useful, but they should not be presented as legal requirements unless a law,
regulation, or standard actually says so.
The Short Version
Every enterprise deploying AI agents should track these 12 risks:
| Risk | What can fail |
|---|---|
| Autonomy risk | The agent acts without enough human oversight |
| Tool risk | The agent misuses APIs, databases, browsers, email, code execution, or workflow tools |
| Permission risk | The agent has broader access than required |
| Prompt injection risk | The agent follows malicious or conflicting instructions |
| Memory risk | The agent stores, leaks, poisons, or misuses context |
| Data exposure risk | The agent retrieves or discloses sensitive data unsafely |
| Planning risk | The agent creates a plausible but flawed multi-step plan |
| Delegation risk | The agent hands work to other agents or tasks without clear controls |
| Observability risk | The organization cannot reconstruct what happened |
| Accountability risk | No one clearly owns approval, operation, incident response, or remediation |
| Compliance risk | The system violates legal, regulatory, contractual, or internal requirements |
| Business process risk | The agent causes real operational harm in finance, HR, legal, security, infrastructure, or customer workflows |
This list is intentionally practical. It is the checklist a business owner,
security lead, compliance team, or AI architect can use before production.
But the complete enterprise taxonomy is broader.
The Broader Enterprise Taxonomy
A fuller taxonomy groups agentic AI risks into seven domains.
| Domain | What it covers |
|---|---|
| Access control and permissions | Identity, credentials, entitlements, delegated authority, least privilege |
| Tool misuse | APIs, connectors, browsers, code execution, databases, email, and workflow actions |
| Governance and accountability | Ownership, approval, risk acceptance, policy lifecycle, and change control |
| Privacy and data protection | Personal data, confidential data, memory, retention, and exfiltration |
| Output quality and safety | Hallucination, bias, toxicity, unsafe advice, and overreliance |
| Agent behavior and autonomy | Goal pursuit, planning, delegation, manipulation, and unsafe actuation |
| Reliability and observability | Logging, monitoring, traceability, resilience, and incident response |
This structure maps more cleanly to the way enterprises actually govern systems.
Security teams recognize access control and tool misuse. Privacy teams recognize
data exposure and retention. Compliance teams recognize accountability and
evidence. Engineering teams recognize reliability and observability.
The taxonomy is not useful because it creates perfect boxes. It is useful because
it lets teams ask the same operational question:
Which business control fails if this agent behaves badly?
Reference Basis
The taxonomy below draws from these sources:
- OWASP Agentic AI - Threats and Mitigations, Version 1.1, December 2025. This is the strongest direct source for agent-specific threats such as memory poisoning, tool misuse, goal manipulation, untraceability, identity compromise, multi-agent risks, and human manipulation. The current guide lists threat IDs T1-T17; older summaries may refer to T1-T15.
- OWASP Top 10 for Large Language Model Applications 2025, especially Prompt Injection, Sensitive Information Disclosure, Supply Chain Vulnerabilities, Excessive Agency, Overreliance, and Unbounded Consumption.
- NIST AI Risk Management Framework 1.0, which frames AI risk governance across the Govern, Map, Measure, and Manage functions.
- NIST AI RMF Generative AI Profile, NIST AI 600-1, which names generative AI risks including confabulation, harmful bias, data privacy, information integrity, human-AI configuration, and value-chain/component integration.
- MITRE ATLAS, a knowledge base of adversary tactics and techniques against AI-enabled systems, useful for poisoning, evasion, credential, exfiltration, and supply-chain threat thinking.
- EU AI Act, Regulation (EU) 2024/1689, especially high-risk system requirements for risk management, data governance, technical documentation, logging, transparency, human oversight, accuracy, robustness, cybersecurity, post-market monitoring, and deployer obligations.
- GDPR, Regulation (EU) 2016/679, especially Articles 5, 32, 33, and 35 when agents process personal data.
- ISO/IEC 42001:2023, the AI management system standard, useful for governance, responsibilities, policy, risk management, and continual improvement.
- NIST Cybersecurity Framework 2.0, especially Govern and Protect outcomes for identity, access, supply-chain, monitoring, and risk management.
Where this article mentions frontier concerns like self-replication, shutdown
resistance, or autonomous resource acquisition, treat them as system-level risk
indicators from AI safety research and advanced-system evaluations unless your
specific system actually has those capabilities.
1. Autonomy Risk
Autonomy risk is the risk that an agent acts without sufficient human oversight
for the impact level.
This is the defining difference between an AI assistant and an AI agent. A
chatbot may produce a bad answer. An autonomous workflow agent may produce a bad
answer and act on it.
Track:
- Can the agent act from a trigger or background event?
- Can it complete multi-step tasks without step-by-step approval?
- Can it affect customers, employees, financial records, contracts,
infrastructure, security, or regulated processes? - Can a human interrupt, pause, override, or reverse the action?
Source basis:
- The EU AI Act requires human oversight for high-risk AI systems under Article
- NIST AI RMF requires organizations to govern, map, measure, and manage AI
risks in context. - OWASP Agentic AI discusses autonomy, planning, misaligned behavior, and
human-in-the-loop overload as agent-specific threat patterns.
2. Tool Risk
Tool risk is the risk that an agent misuses internal or external tools.
Tools include APIs, databases, browsers, email, code execution, CRM actions,
workflow automations, ticketing systems, file stores, payment systems, and
security tools.
Track:
- Which tools can the agent call?
- Are tool calls read-only or write-capable?
- Can the model choose tool parameters?
- Can tool output become new instruction context?
- Are high-impact tools gated by policy and human approval?
Source basis:
- OWASP Agentic AI directly includes tool misuse, code execution attacks, and
insecure inter-agent protocol abuse. - OWASP LLM Top 10 includes Excessive Agency and Insecure Output Handling.
- NIST CSF 2.0 provides the security control vocabulary for access, protection,
detection, response, and recovery.
3. Permission Risk
Permission risk is the risk that the agent can access or do more than it should.
Agents often use inherited user permissions, delegated OAuth scopes, service
accounts, connector identities, or platform-level permissions. That creates
classic access-control risk in a new shape.
Track:
- Does the agent use a named identity?
- Does it have least-privilege access?
- Does it use shared credentials or broad service accounts?
- Can it perform actions the requesting user could not perform directly?
- Can it cross tenants, departments, environments, or trust boundaries?
Specific threats include privilege escalation, credential theft, and confused
deputy behavior.
Source basis:
- OWASP Agentic AI covers identity compromise, spoofing, confused deputy, and
authorization risks. - NIST CSF PR.AA covers identity management, authentication, and access control.
- GDPR Article 32 matters when excessive permissions expose personal data.
4. Prompt Injection Risk
Prompt injection risk is the risk that the agent follows malicious, hidden,
conflicting, or untrusted instructions.
For agents, prompt injection is more dangerous than a bad answer because the
agent may convert the malicious instruction into an action.
Track:
- Does the agent read untrusted emails, web pages, tickets, PDFs, documents, or
chat messages? - Are retrieved documents mixed into the same context as system instructions?
- Can a user, customer, supplier, or attacker influence tool arguments?
- Are tool calls constrained by policy outside the model?
Source basis:
- OWASP LLM Top 10 lists Prompt Injection as a core LLM application risk.
- OWASP Agentic AI extends this into tool misuse, goal manipulation, and
agent-communication poisoning. - Many prompt injection warnings come from security research and field reports.
Treat those as risk evidence, not statutory requirements by themselves.
5. Memory Risk
Memory risk is the risk that an agent stores, leaks, poisons, or reuses context
inappropriately.
Memory makes agents more useful. It also creates a persistence layer attackers
and accidents can exploit.
Track:
- What does the agent remember?
- Who can write to memory?
- Who can read memory?
- Can memory influence later tool use?
- Is memory deleted when the original purpose expires?
- Is sensitive information filtered before storage?
Specific threats include memory poisoning and long-term leakage of personal,
confidential, or privileged context.
Source basis:
- OWASP Agentic AI includes Memory Poisoning as a named threat.
- NIST AI 600-1 covers data privacy and information integrity risks.
- GDPR Articles 5 and 32 apply when personal data is stored, retained, or reused.
6. Data Exposure Risk
Data exposure risk is the risk that the agent retrieves, reveals, summarizes,
transforms, or sends sensitive data unsafely.
The agent may not leak a database table directly. It may summarize the sensitive
parts into a chat message, email draft, support ticket, exported file, or API
payload.
Track:
- What data sources can the agent access?
- What data can it combine?
- Can it move data into lower-protection channels?
- Can it send data to third-party tools or external users?
- Are personal data, credentials, contracts, source code, HR data, finance data,
or security logs in scope?
Source basis:
- OWASP LLM Top 10 includes Sensitive Information Disclosure.
- OWASP Agentic AI includes exfiltration scenarios through tools, rogue agents,
and multi-agent systems. - GDPR Articles 5, 32, and 33 apply where personal data is processed or breached.
- EU AI Act Article 10 applies to data governance for high-risk AI systems.
7. Planning Risk
Planning risk is the risk that an agent creates a flawed multi-step plan that
looks locally reasonable but fails globally.
This is not just hallucination. A plan can be made of individually plausible
steps and still violate policy, miss a dependency, create cost, overwhelm a team,
or trigger the wrong process.
Track:
- Can the agent decompose goals into subgoals?
- Can it revise plans based on intermediate outputs?
- Does it optimize for task completion over policy, safety, budget, or business
constraints? - Are plans reviewed before high-impact execution?
Source basis:
- OWASP Agentic AI includes intent breaking, goal manipulation, and misaligned
or deceptive behavior. - NIST AI 600-1 covers confabulation, information integrity, and human-AI
configuration risks. - EU AI Act Articles 9, 13, and 14 matter for high-risk systems because risk
management, transparency, and human oversight must be designed into the system.
8. Delegation Risk
Delegation risk is the risk that the agent hands work to other agents, tasks,
tools, or workflows without clear control.
Multi-agent systems make accountability harder. A coordinator agent may create a
task, a specialist agent may call a tool, another agent may summarize the result,
and a fourth may communicate with a user.
Track:
- Which agents can communicate?
- Are agent identities authenticated?
- Are messages signed, scoped, logged, and filtered?
- Can one agent cause another agent to exceed its authority?
- Who owns the end-to-end workflow?
Source basis:
- OWASP Agentic AI includes agent communication poisoning, rogue agents in
multi-agent systems, human attacks on multi-agent systems, and insecure
inter-agent protocol abuse. - EU AI Act Article 25 is relevant to responsibilities along the AI value chain.
- ISO/IEC 42001 is relevant for management-system responsibilities and lifecycle
controls.
9. Observability Risk
Observability risk is the risk that the organization cannot reconstruct what
happened.
For agents, final output logs are not enough. You need the operational trace:
trigger, user, instruction version, retrievals, tool calls, arguments, policy
decisions, approvals, outputs, actions, errors, retries, and handoffs.
Track:
- Can incident responders reconstruct the full execution?
- Are tool calls and arguments logged?
- Are approvals recorded?
- Are policy denials recorded?
- Can abnormal behavior be detected?
- Is there a kill switch or pause procedure?
Source basis:
- OWASP Agentic AI includes Repudiation and Untraceability.
- EU AI Act includes logging, technical documentation, monitoring, post-market
monitoring, and corrective-action obligations for high-risk systems. - NIST AI RMF and NIST CSF both support measurement, monitoring, response, and
recovery as risk-management functions.
10. Accountability Risk
Accountability risk is the risk that no one clearly owns decisions, approvals,
operation, incidents, or remediation.
This risk is easy to miss because an agent can sit between teams. Product owns
the use case. IT owns the connector. Security owns access control. Legal owns
policy. A vendor owns the platform. The business owns the outcome.
Track:
- Who is the business owner?
- Who is the technical owner?
- Who approves production use?
- Who approves high-impact action categories?
- Who handles incidents?
- Who accepts residual risk?
Source basis:
- EU AI Act Articles 16-27 define responsibilities across provider, importer,
distributor, deployer, and other value-chain roles. - ISO/IEC 42001 requires management-system accountability and continual
improvement. - NIST AI RMF Govern function emphasizes organizational roles, policies,
processes, and oversight.
11. Compliance Risk
Compliance risk is the risk that the agent violates legal, regulatory,
contractual, sectoral, or internal policy requirements.
This is not only an AI Act question. Depending on the use case, agent risk can
touch privacy, employment law, financial services, healthcare, consumer
protection, cybersecurity, contracts, records retention, sector-specific
regulation, and internal policy.
Track:
- Is the use case high-risk or regulated?
- Does the agent process personal data?
- Does it affect rights, access, employment, credit, education, healthcare,
safety, security, or legal outcomes? - Is there documentation for risk management, testing, monitoring, human
oversight, and incident response? - Are transparency obligations met?
Source basis:
- EU AI Act Articles 9-15, 17-20, 26-27, and 50 are especially relevant for
high-risk and transparency obligations. - GDPR Articles 5, 32, 33, and 35 matter when personal data is involved.
- ISO/IEC 42001 and NIST AI RMF provide governance and evidence structures.
12. Business Process Risk
Business process risk is the risk that the agent causes direct operational harm.
This is where agent governance becomes real. A bad answer is one problem. A bad
answer that closes a customer case, approves a refund, changes an employee
record, modifies infrastructure, emails a regulator, or updates a legal workflow
is another.
Track:
- What process can the agent affect?
- Can the action be reversed?
- Can the agent operate at scale?
- Can it act outside working hours?
- Can it influence humans who make consequential decisions?
- Does the process have existing controls that the agent bypasses?
Source basis:
- OWASP Agentic AI includes human manipulation, overwhelming the human in the
loop, misaligned behavior, and tool misuse. - NIST AI 600-1 covers human-AI configuration, confabulation, harmful bias, and
information integrity. - EU AI Act Article 14 and Article 26 matter when the agent is part of a
high-risk deployment and human oversight or deployer obligations apply.
Specific Threats To Include In The Register
The 12 risks are the simple enterprise language. A risk register should also
track the more specific threat class.
| Specific threat | Primary domain | Source basis |
|---|---|---|
| Privilege escalation | Access control | OWASP Agentic AI; NIST CSF PR.AA |
| Credential theft | Access control | OWASP Agentic AI identity compromise; NIST CSF PR.AA |
| Confused deputy | Access control | OWASP Agentic AI authorization and identity patterns |
| Goal misalignment | Agent behavior | OWASP Agentic AI intent breaking and misaligned behavior; NIST AI RMF |
| Policy drift | Governance | ISO/IEC 42001 continual improvement; EU AI Act quality management and change management concepts |
| Hallucination / confabulation | Output quality | NIST AI 600-1; OWASP LLM Top 10 Overreliance |
| Bias and toxicity | Output quality | NIST AI 600-1 Harmful Bias; EU AI Act Article 10 for high-risk data governance |
| API integration failure | Tool misuse and reliability | OWASP Agentic AI tool misuse; NIST CSF Protect, Detect, Respond |
| Supply-chain vulnerabilities | Governance and tool misuse | OWASP Agentic AI supply-chain risk; OWASP LLM Top 10 Supply Chain Vulnerabilities |
| Uncontrolled resource consumption | Reliability | OWASP LLM Top 10 Unbounded Consumption; NIST AI 600-1 environmental and cost impacts |
| Sensitive data exposure | Privacy | OWASP LLM Top 10 Sensitive Information Disclosure; GDPR |
| Data exfiltration channel | Privacy and tool misuse | OWASP Agentic AI exfiltration scenarios; MITRE ATLAS exfiltration techniques |
| Unsafe actuation | Agent behavior | EU AI Act human oversight; OWASP Agentic AI tool misuse |
| Human manipulation | Agent behavior | OWASP Agentic AI Human Manipulation; NIST AI 600-1 Human-AI Configuration |
| Opaque reasoning | Observability | OWASP Agentic AI Repudiation and Untraceability; EU AI Act logging and transparency obligations |
| Data and memory poisoning | Privacy, quality, reliability | OWASP Agentic AI Memory Poisoning; MITRE ATLAS poisoning techniques; NIST AI 600-1 information integrity |
What Enterprises Should Track
The practical move is to track both:
- The threat class.
- The business control that fails.
For example:
- "Prompt injection" is a threat class.
- "The customer support agent can issue refunds without policy validation" is a
failed business control.
That second sentence is what makes the taxonomy operational.
At minimum, each risk register entry should capture:
- Agent name and owner
- Business process
- Agent level and autonomy level
- Tools, data sources, memory stores, and delegated agents
- Risk domain and specific threat class
- Reference source
- Threat scenario
- Failed or missing control
- Likelihood, impact, autonomy, data sensitivity, tool criticality, and
observability scores - Required human approval points
- Evidence location
- Residual risk owner
- Review trigger and next review date
Watch The System-Level Indicators
Most enterprise agents today will not have frontier-level autonomy. Still, some
system-level indicators are worth tracking because they show where an ordinary
workflow agent may be drifting into a more dangerous category.
Watch for:
- Unintended goal pursuit
- Unauthorized privilege escalation
- Autonomous resource acquisition
- Attempts to preserve access or resist shutdown
- Self-replication or unauthorized agent creation
- Multi-agent misinformation loops
- Retry storms, repeated tool calls, cost spikes, or quota exhaustion
- New or unexpected data exfiltration paths
These indicators do not all come from a single legal framework. They are better
understood as risk signals from AI safety research, adversarial testing, and
advanced agent evaluations. If they appear in a real enterprise system, they
should trigger immediate review.
The Practical Rule
Do not stop at "this agent has autonomy risk" or "this agent has prompt
injection risk."
Write the operational version:
This agent can read supplier emails, extract contract changes, and route
approvals. A malicious supplier email could inject instructions that cause the
agent to misclassify a contract change and bypass legal review.
Now you can govern it.
You can assign an owner. You can require a control. You can test for the failure.
You can log the evidence. You can decide whether the residual risk is acceptable.
That is what an agent risk taxonomy is for.
Practical Asset
The companion GitHub framework includes:
Use the taxonomy to name the risk. Use the register to assign ownership. Use the
scoring model to decide which controls must exist before production.