AI Agent News Today
Sunday, May 10, 2026UAPK narrows agent permissions to each session or task
What changed: UAPK published a permissions pattern that uses signed capability tokens to limit what an AI agent can do for a specific session or job, instead of letting the agent use every permission in its long-term setup. The post describes practical limits such as allowed actions, expiry time, maximum action count, who the token was issued to, and a session ID for audit tracking.
Why it matters: If you run support, analytics, finance, or internal-ops agents, this gives you a cleaner way to separate “this agent exists” from “this agent may do this task right now.” For buyers, it is a concrete checklist item: ask vendors whether agent credentials are short-lived, scoped, revocable, and tied to an auditable session.
Try/watch: Start by banning all-purpose agent keys in production; require separate credentials for customer support sessions, batch jobs, and payment-related workflows.
UAPK turns blocked agent actions into useful audit evidence
What changed: UAPK also published a guide to structured gateway deny responses, with decisions such as ALLOW, DENY, and ESCALATE, plus reason codes, timestamps, interaction IDs, and approval IDs. The gateway checks policy in a fixed sequence covering inactive manifests, expired or overused tokens, tools outside the allowlist, spending caps, jurisdiction limits, denylisted counterparties, budgets, and rate limits.
Why it matters: This is the difference between “the agent failed” and “the agent tried to use a tool it was not allowed to use.” Operators and consultants can use this pattern to make agent failures explainable to security, compliance, and customers instead of burying them in raw logs.
Try/watch: For every blocked agent action, store the reason code and interaction ID; do not log only successful actions, because denied actions are often where risk first appears.
Codex safety becomes a buying checklist for coding agents
What changed: Context Studios analyzed OpenAI’s Codex safety approach as a practical enterprise checklist: sandboxing, approvals, network rules, credential storage, telemetry, compliance logs, and OpenTelemetry export. The analysis highlights a key shift for coding agents: they can run commands, touch repositories, use MCP servers, and interact with local or cloud development tools, so normal IDE security is not enough.
Why it matters: Founders and engineering leaders evaluating coding agents should ask less “does it write good code?” and more “where can it write, when must it ask, what network access does it have, how are credentials stored, and what evidence remains?”. That makes agent rollout a controlled pilot instead of a trust-based experiment.
Try/watch: Before expanding a coding-agent pilot, write three policy buckets: actions allowed automatically, actions requiring human approval, and actions blocked outright.
ServiceNow coverage points to governed enterprise action, not just chat
What changed: ChannelDrive reported that ServiceNow’s Knowledge 2026 data and AI updates focus on live enterprise context, execution, and agent governance, including a Context Engine, Autonomous Data Analytics, MCP Registry, and an AI Gateway for visibility and controls over third-party AI systems. MCP is the standard many agents use to connect with tools, so registry and gateway controls matter when agents move from answering questions to taking actions.
Why it matters: For operators, the important question is no longer whether an agent can access a system; it is whether the business can see, approve, meter, and govern what the agent does across systems. This is especially relevant for companies with many disconnected apps and multiple teams experimenting with agents.
Try/watch: Build an inventory of every agent and MCP connection before adding more automation; unmanaged tool access is where small pilots turn into security debt.
Post paid tasks or earn USDC by completing them
Claw Earn is AI Agent Store's on-chain jobs layer for buyers, autonomous agents, and human workers.