AI Security Architecture

AI Agent Identity and Authorization: OAuth for Non-Human Workers

AI agents are becoming active participants in software systems. They read issues, open pull requests, query databases, call SaaS APIs, schedule tasks, and trigger automations. That means they need identity and authorization models as carefully designed as the ones we use for human users and backend services.

Published Jun 1, 2026 13 min read AI Security

The new identity problem

For years, backend systems mostly separated actors into two buckets: humans and services. Humans signed in through user accounts. Services used API keys, service accounts, OAuth clients, or workload identity. AI agents blur that boundary because they behave like services but often act on behalf of a human, inside a context that changes from task to task.

An agent that summarizes logs may only need read access. The same agent, five minutes later, might ask to create an incident ticket, rotate a credential, or trigger a deployment. If the system treats that agent as a permanent admin service account, every tool call becomes overprivileged. If the system treats it as a normal user session, it loses the ability to distinguish human intent from autonomous execution.

The central question is not "can this agent call the API?" The real question is "which actor delegated which permission to which agent, for which task, for how long, with what audit trail?"

Why this is becoming urgent

In 2026, this is no longer a theoretical architecture problem. NIST launched an AI Agent Standards Initiative focused on secure, interoperable agent ecosystems. NIST also published a summary of AI agent security responses where commenters broadly agreed that agent systems create novel security concerns and that traditional cybersecurity practices need adaptation.

The practical takeaway for backend engineers is simple: agent access cannot be an afterthought. Once an agent has tool access, your APIs need to know who the agent is, who it represents, what it is allowed to do, and how to revoke that authority when the task is done.

Authorization pipeline

Human intent User asks an agent to perform a bounded task.

Agent plan The agent proposes the tools and actions it needs.

Policy check A gateway checks risk, scope, owner, and environment.

Scoped token OAuth or workload identity issues short-lived authority.

Tool call The API executes only the approved operation.

Audit event Actor, task, token, input, output, and result are recorded.

OAuth is delegation, not magic

OAuth is useful because it models delegated authorization. A user or system grants a client limited access to a resource server. That maps well to agent workflows, but only if the implementation avoids turning OAuth into a broad, long-lived permission bucket.

A human-delegated agent workflow should usually produce a token that is narrow, time-bound, and tied to a task. A background agent workflow may use client credentials or workload identity, but still needs scopes, policy constraints, and traceability. The key is to avoid one generic "agent token" that can read and write everything.

Identity should have layers

A useful agent identity model records more than one actor. It should capture the human requester, the agent instance, the tool or MCP server being used, the target API, and the policy decision that allowed the action. This is similar to distributed tracing: one request may pass through many systems, but the trace keeps the story connected.

Human principal The person or team that requested and approved the task.

Agent principal The agent runtime, model family, configuration, and session identity.

Tool principal The API client, MCP server, worker, or integration used to act.

This layered identity matters during incident response. If a database record changes, the audit log should not merely say "service-account-agent-prod updated row 123." It should answer: which user initiated the task, which agent generated the action, which permission was granted, which tool executed it, and whether the result matched the approved plan.

The policy gateway pattern

The safest place to enforce agent authorization is not inside the prompt. Prompts can describe rules, but enforcement belongs in code. A policy gateway sits between the agent and the tools. It receives proposed tool calls, evaluates them against policy, asks for human approval when needed, issues scoped credentials, and writes audit events.

This is especially important for MCP-style tool ecosystems. MCP can expose useful capabilities to agents, but every exposed capability is also a permission boundary. A database MCP server should not grant write access because the agent phrased a confident request. The gateway should decide whether this task, actor, environment, and resource justify that action.

Scopes should be task-shaped

Traditional scopes often describe broad application permissions: read invoices, write tickets, manage users. Agents need more context-aware scopes. A task-shaped scope might allow creating one support ticket for one customer, reading logs for one service over the last hour, or generating a draft pull request without merge permission.

Control	Weak pattern	Stronger pattern
Token lifetime	Long-lived shared token for all agent actions.	Short-lived token tied to one task or session.
Scope design	Generic admin-style scopes such as `write:all`.	Resource, action, environment, and time-bounded scopes.
Human approval	Agent decides when an action is safe.	Policy requires approval for destructive or high-risk calls.
Auditability	Logs show only a service account name.	Logs connect human, agent, tool, token, request, and result.
Revocation	Credential remains valid after the task ends.	Token expires automatically and can be revoked immediately.

What I would build

For a practical portfolio-grade implementation, I would build an agent authorization gateway using Cloudflare Workers or FastAPI, backed by Postgres/Supabase for policy and audit storage. The agent would never call sensitive APIs directly. It would send a proposed action to the gateway. The gateway would validate the request, map it to a policy, issue a short-lived credential or internal signed action token, then proxy the approved tool call.

The first version would include: a tool registry, scoped action definitions, risk levels, human approval states, token expiration, structured audit logs, and a small dashboard showing recent agent actions. That dashboard matters. Agent security is not only about blocking attacks; it is about making autonomous activity visible enough that humans can trust it.

Failure modes to design for

Agent authorization systems fail in predictable ways. The most common failure is excess privilege: a broad token is easier to ship than a scoped one. The second is missing attribution: logs show that an API was called, but not why. The third is policy bypass: the agent finds a direct API path that avoids the gateway. The fourth is approval fatigue: humans approve everything because the system asks too often.

Good design reduces these failure modes with default-deny policies, narrow token issuance, tool access through one controlled path, risk-based approval, and dashboards that summarize activity instead of burying reviewers in raw logs.

Observability for agent actions

Every agent action should produce structured telemetry. At minimum: agent ID, human requester, task ID, tool name, resource ID, scopes requested, scopes granted, policy decision, approval status, latency, result status, and whether any sensitive fields were redacted. This data supports security, debugging, compliance, and product analytics.

Once those events exist, the system can answer useful questions: which tools are agents using most, which actions require the most manual approval, which scopes are too broad, which prompts produce repeated blocked actions, and which automations are safe enough to become more autonomous.

The design principle

AI agents should not inherit trust from their fluency. A well-written plan is not authorization. A confident tool call is not approval. Production systems need enforceable boundaries: identity, scopes, policy, approval, execution, audit, and revocation.

That is the difference between an agent that is merely connected to tools and an agent that can safely participate in real backend operations.