AI Agent Governance Preparation Guide
AI Agent Governance Preparation Guide
Agent Governance
A research agenda for policymakers
and researchers
Jacob Pratt
Thalia Khan
1 PARTNERSHIP ON AI
Preparing for AI Agent Governance
Contents
Key Takeaways 3
Introduction 4
1.2 Which existing legal and policy frameworks are relevant to AI agents? 9
2.4 Where will the market naturally correct for any issues? 12
Acknowledgements 19
Endnotes 24
Key Takeaways
• How AI agents will impact society is still uncertain. While the body of
scholarly work and publicly available evidence1 are growing, we don’t yet
know enough about how AI agents will be used or what impacts they may
have. However, AI investments continue to soar, so policymakers should
begin to prepare now.
• This paper provides a roadmap for this research. We outline three foun-
dational requirements for governing AI agents and detail a comprehensive
research agenda, including 12 top-level and 45 sub-level questions,
designed to directly support policymakers in developing evidence-based
policy.
Introduction
As governments grapple with governing generative AI systems, AI agents — systems built
on foundation models that can autonomously complete complex tasks by developing plans
and accessing tools to take actions digitally — are being seen as the next frontier for AI
systems. Industry leaders have named 2025 as the “year of agentic exploration,” foretelling
the adoption of systems that will change how we interact, what jobs we perform, and even
how we think. Some of these impacts are not just theories, but are being felt now. B B For example, decreases
in software developer
roles are partially being
Despite these high expectations, widespread AI agent adoption has been stifled by persistent
explained by increased AI
reliability and security challenges. Although Gartner predicts that 40% of agentic AI projects agent usage (Computer
World, 2025)
will fail by 2027, AI funding has continued to boom. Recent international policy proposals have
focused on promoting the adoption of AI to compete on the international stage, suggesting
that advancements in AI agent engineering will lead to significant real-world impact.
4 PARTNERSHIP ON AI
Preparing for AI Agent Governance
Figure 1. Levels of agent influence on digital environments, with examples of LLM-based systems.
Current systems operate at Levels 1–3, while Levels 4–5 illustrate emerging directions for more autonomous agents.
2 Mediated influence via humans with passive tools: ChatGPT or Claude with web search
System uses tools like search or knowledge databases (provides info but doesn’t act)
for context, not for changing the world itself Gemini Deep Research
(provides research analysis)
GitHub Copilot
(provides code suggestions)
single-step actions directly using predefined tools, acting Claude Computer Use
on user commands without needing humans to carry out Manus
the result Project Mariner
Cursor AI
Other code-executing or API-calling agents
4 Direct Actions — Semi-constrained Agent: System Longer and more complex workflows. E.g. User says
accepts broad goals, decomposes them into multi-step “File my taxes” the agent gathers necessary
plans, and executes steps autonomously across known documents from user’s email, fills tax forms,
tools, without needing humans to approve every step. and submits them using chosen e-filing service,
without asking for approvals on each step.
5 Direct Actions — Unconstrained Agent: System accepts Adaptive workflows. E.g. User says “File taxes for
a broad goal, autonomously executes multi-step plans, my business” the agent autonomously gathers
and adapts by integrating new tools or strategies beyond financial records, contacts suppliers for missing
what a user configured, all without requiring approvals. information, interprets regulations, applies
business-specific deductions, switches between
tools or services as needed, and completes the
filing with without further human input.
We assess levels of capability based on environmental interaction, but an agent’s ability to interact with its environment also depends on
other differentiating properties: whether it relies on predefined tools or can flexibly adapt tool use, how much human oversight it requires,
and the complexity of goals it can pursue. These properties vary by degree and collectively shape an agent’s ability to interact with its
environment. Our levels share similarities with recent surveys of agent autonomy, which highlight dimensions such as constraints on
environmental impact and flexibility of actions. 3 Although some systems might meet our Level 3 criteria through rule-based execution
(e.g., spam bots), our analysis focuses on LLM-driven agents using reasoning, planning, or decision-making, as these introduce new
sources of unpredictability, runtime failure, or hazards. Levels 4 and 5 draw inspiration from Patel, 2025. 4
5 PARTNERSHIP ON AI
Preparing for AI Agent Governance
Prioritizing policy-driven research
These increasingly unconstrained agents present new challenges for policy-
makers. Unlike generative AI systems, which generate an output but require
people to take further actions, agents can directly interact with the environ-
ment.D These systems can help companies optimize supply chain management D Though previous systems have also
taken automated actions to affect
and execute financial trades, but can also lead to the system impersonating
the environment, such as automated
users or making unauthorized transactions. financial trading systems, these systems
have acted within constrained bound-
aries to complete narrow tasks. AI agents
The growing deployment of these AI agents will intersect with the core remits
build on foundation models to complete
of many government departments and regulators and create profound oppor- a variety of complex tasks, leading to new
risks and opportunities.
tunities and risks. The U.S. Department of Labor, tasked with advancing
“opportunities for profitable employment,” must contend with the potential for
agents to both displace workers through automation and create new opportu-
This agenda focuses
nities through upskilling. Similarly, the UK’s Information Commissioner’s Office,
on actionable lines
which guards against “high risk to individuals and their rights,” must prepare for
of research that
novel privacy violations that could occur through agent failures. Departments,
can inform the
regulators, and other policymakers will need to take action to progress their
development of
missions in the face of this technological advance.
effective policy,
Policymakers will require substantial research and evidence to make the right
governance, and
public policy decisions on AI governance — including on the institutions, policies,
regulation.
regulations, and tools that ensure that AI systems operate in the public inter-
est.E However, advocating for significant investment to govern AI agents before E There have been calls for evidence-
based AI policy (Bommasani R. et al.,
widespread adoption occurs and harms are realized is politically challenging,
A Path for Science‑ and Evidence‑based
especially when this investment can address harms from existing technologies.5 AI Policy, 2024) and a recognition that
this should not be used to downplay
urgency and delay policymaking
The scientific community across academia, civil society, and industry can
(Caspar S. et al, Pitfalls of Evidence-
help by conducting research on AI agents. To coordinate this effort, we outline a Based AI Policy, February 2025). We
lay out our argument for needing
research agenda designed explicitly to support public policymaking for AI agents.
additional research in this paper.
Developed through a comprehensive literature review and a workshop with over
30 cross-sector experts, this agenda focuses on actionable lines of research that
can inform the development of effective policy, governance, and regulation.
NOTE
Though this research agenda is designed to support policymaking, we do not expect
policymakers to conduct research relevant to all the questions listed. Prioritizing
government-led research on AI agents is vital for progressing public interest
research, but will mean deprioritizing other areas, which might not be feasible.
This Research Agenda encourages private organizations to conduct research that
furthers their own goals while also contributing to public governance. However, it
may be beneficial for governments to identify key questions based on their own
priorities that are worth exploring. We describe how sandboxes and testbeds may
be a useful first step for policymakers later in the report.
6 PARTNERSHIP ON AI
Preparing for AI Agent Governance
Policy questions for further research
To identify what evidence is needed to support policymakers, we conducted a comprehensive
literature review and convened a workshop with over 30 experts from academia, civil society,
industry, and government. This resulted in an agenda describing three core requirements
for policymakers to govern AI agents effectively, with 12 top-level questions providing more
detail on lines of research.
There is significant overlap between the requirements and top-level questions for AI agents
and existing AI systems, as AI agents are a “normal technology” and not a radical new force.
This means policymakers should not take a radically different approach to governing them.
But the details of policy implementation will be different, so we also identify 45 sub-level
questions and provide context on how these questions describe challenges for policymakers
unique to AI agents, including through the hypothetical use case of an “unconstrained”
level 5 tax-filing agent.F F This agent can:
• Access your financial
accounts.
This framework provides direction for future research by academic, civil society, government,
• Aggregate your
and industry actors. PAI will be progressing this research agenda, focusing on key risks to financial information.
• Research and interpret
human connection, labor and the economy, and risks from agent failures, as well as key
tax-filing guidance.
policy interventions related to international governance, and sandboxes and testbeds. • Develop a plan for
completing your tax
return.
• Create tax documents
and complete tax forms.
• Audit your submission.
• Submit documents to
1
REQUIREMENT 1 the relevant authorities.
7 PARTNERSHIP ON AI
Preparing for AI Agent Governance
1.1 How are AI Agents different from existing AI systems?6
AI agents differ from existing AI systems, with Kasirzadeh A. & Gabriel I. noting four noticeable
differentiating characteristics:
1.1.2 How do the levels of characteristics of an AI agent affect the level or type of gover-
nance required? What characteristic thresholds are required before governance
measures apply?
1.1.3 What is the correct level of abstraction to ‘model’ the key components of AI
agents? How should relevant actors be defined?
A tax-filing agent demonstrates key differences from current AI systems, which will have
policymaking implications. While a generative AI system might simply answer tax questions,
the agent might autonomously navigate multiple financial accounts, interpret complex IRS
regulations, and execute multi-step workflows without constant oversight. Unlike traditional
rule-based tax software, it might reason through edge cases and coordinate across multiple
tools and data sources to complete the entire filing process end-to-end.
8 PARTNERSHIP ON AI
Preparing for AI Agent Governance
1.2 Which existing legal and policy frameworks are relevant to
AI agents?7
“New” technologies, like AI agents, are rarely completely unique, and their development and
deployment will typically fall under the scope of existing regulations or policies.K However, K For example, the EU
AI Act recognizes that
the increased autonomy of AI agents has implications on the efficacy of existing legal frame-
providers of certain
works, particularly in tort and agency law. It is important that policymakers understand how AI systems also have
to adhere to other EU
existing laws apply to autonomous systems to avoid creating redundant or conflicting legis-
legislation, including
lation; making more informed and targeted legislative decisions. related to data protection
(Regulation (EU)
2016/679), financial
SUB-QUESTIONS services (Directive
2013/36/EU), and medical
How does existing policy apply to AI agents, including: devices (Regulation (EU)
2017/745).
1.2.1 General/non-AI specific legal frameworks, such as tort law and product
liability?
1.2.4 What policy and legal instruments will be needed (or adapted) in a world with
mass deployment of advanced agent systems?
1.2.5 Which laws or regulations might be suitable for aligning AI agents to? How might
this be done?
Existing legal and policy frameworks may apply to tax-filing agents, which will require research
to clarify for policymakers. Liability questions might arise when the agent makes errors, with
the user, developer, or other actors potentially responsible for penalties. Accessing financial
data may trigger privacy regulations, and any AI-powered advice may fall under consumer
protection and financial services regulations.
9 PARTNERSHIP ON AI
Preparing for AI Agent Governance
1.3 How are different jurisdictions responding to AI agents?8
PAI’s analyses “Policy Alignment on AI Transparency” and “Decoding AI Governance” describe
how current AI policies overlap and support each other; a similar approach is needed for
policies concerning AI agents. By understanding the existing regulatory landscape, policy-
makers can avoid duplicating efforts and creating conflicting regulations or legislation, and
support greater interoperability and international cooperation which will benefit people on
a global scale.
SUB-QUESTIONS
1.3.1 What AI agent specific policies and legislation are being developed in other juris-
dictions? What guidance is being provided on how to apply existing policies and
legislation?
1.3.2 How can the AI Governance stack encourage interoperability across jurisdictions?
opportunities
“Most unreliability issues will be solved by market incentives—
if they are visible. Which might not be visible and require
regulator intervention?”
—EXPERT PARTICIPANT, PAI POLICY WORKSHOP, JULY 2025
Policymakers will need to take action when the impact of a technology affects their policy
goals. This requires an understanding of what these impacts are — the potential scale of
benefit from the adoption of AI agents, and the severity and likelihood of risks related to
these systems. This is determined by where AI agents are likely to be deployed and used, and
if AI agents do not impact a government organization’s objectives, then policymakers may
wish to focus on other priorities.
10 PARTNERSHIP ON AI
Preparing for AI Agent Governance
Detailing all the theoretical risks and benefits associated with AI agents is out of scope for
this paper.L Instead, we highlight key risks and opportunities mentioned through multistake- L The Ethics of Advanced
AI Assistants, Interna-
holder engagement (Q 2.2.2 - 2.2.4 and Q 2.3.1 - 2.3.6) and supporting questions identified as
tional AI Safety Report
priorities by workshop participants (such as Q 2.3.7 and Q 2.4). and MIT’s Domain
Taxonomy of AI Risks
describe taxonomies
of risks and benefits
2.1 How are AI agents likely to be used and accessed?9 associated with AI and
AI agents.
While AI agents have the potential to be adopted for a variety of tasks, we have yet to see
widespread adoption (as of September 2025). Anticipating how AI agents will be used
requires examining research, current deployment patterns, and emerging use cases. As
PAI’s Documenting the Impacts of Foundation Models report details, this can support poli-
cymakers with planning how to respond.M Identifying the sectors where AI agents are being M For example, the UK’s
Bank of England shared
adopted most rapidly can also help policymakers plan a response, create targeted policies,
a report detailing that AI
and fund relevant research or guidance. may begin to impact core
financial decisions, and
detailed how they plan to
SUB-QUESTIONS monitor and respond to
these emerging uses.
2.1.1 For which tasks is there likely to be wider usage of AI agents? How might this vary
by domain, sector, use case, demographic group, and region?
2.1.2 Where is there likely to be a “digital divide” which blocks beneficial usage?
2.1.3 How can policymakers anticipate and influence future impacts from AI agents,
such as using foresight techniques?
Early adoption of tax-filing agents may focus on simple tax returns before expanding to more
complex scenarios, and usage patterns may spike during tax season, requiring scalable
infrastructure. Policymakers should promote equitable access to these agents to prevent a
digital divide where only wealthy taxpayers benefit, and so could improve access by piloting
low-income taxpayer assistance programs.
11 PARTNERSHIP ON AI
Preparing for AI Agent Governance
2.2 How significant might benefits be from the adoption of
AI agents?10
It’s theorized that AI agents could provide major societal benefits, such as supporting
medical R&D and enhancing fraud detection. Understanding the scale of these benefits,
where these benefits may be realized, and by whom can help policymakers support and
amplify their realization, such as through targeted education initiatives or research funding.
With effective governance and assurance, the increased autonomy, efficacy, generality, and
ability to complete goals related to more complex tasks will enable these systems to realize
benefits that support a just, equitable, prosperous society.N N Characteristics are
detailed in Characterizing
AI Agents for Alignment
SUB-QUESTIONS and Governance (Kasir-
zadeh A. & Gabriel I.,
2.2.1 How can we measure the benefits from adopting AI agents? How is this distributed 2025.)
2.2.2 For which domains, sectors, or use cases is there likely to be significant benefits
from adopting AI agents?
How significant might benefits be in areas of public interest, such as, but not limited to:
2.2.4 Healthcare?
2.2.5 Education?
Policymakers should look to maximize the benefits of tax-filing agents. For example, tax-filing
agents could improve access to tax-filing support, maximize legitimate tax deductions, and
reduce tax preparation costs for citizens.
12 PARTNERSHIP ON AI
Preparing for AI Agent Governance
2.3 How significant might risks be from the adoption of
AI agents?11
A vast array of literature details risks of AI agents,O from multi-agent interactions and O We do not list out the
entire list of risks and
alignment and misuse to risks from specific domains, such as cybersecurity and finance.
opportunities from the
We are also beginning to see case studies of actual harms and industry responses, and adoption of AI agents due
to length (MIT’s Domain
are expecting to see regular updates on the state of research detailing the risks from AI,
Taxonomy of AI Risks
including AI agents, as part of the International AI Safety Report series. lists 24 subdomains and
identified over 1600 risks
in literature). Instead,
This literature shows how the increased ability for these systems to influence the environment
we aim to highlight
may cause large scale negative impacts. Policymakers have duties to protect citizens from high-level groupings of
risks that are exacer-
these harms, and anticipating the likelihood and severity of risks from AI agents is crucial to
bated by, or unique to,
upholding this duty. the novel capabilities
of AI agents, or risks
that PAI has identified
SUB-QUESTIONS as important and are
actively exploring.
How significant might risks be from:
2.3.7 What are the pathways to causing harm? (i.e. threat models)
Policymakers may need to respond to some of the risks posed by tax-filing agents. For
example, tax-filing agents may file returns incorrectly, leading to penalties, may use unau-
thorized tax strategies that trigger audits, and may expose sensitive financial information
through prompt injection attacks. A systemic risk may include common failures that affect
thousands of returns.
13 PARTNERSHIP ON AI
Preparing for AI Agent Governance
2.4 Where will the market naturally correct for any issues?12
It will be beneficial for industry organizations to put R&D efforts into mitigating some risks,
such as reliability issues, as it will improve the product and provide a competitive advantage.
However, policymakers will need to take action where the market fails, or implement regimes
that incentivize industry-led governance, such as assurance and certification. Further
research is required to understand exactly what the market will naturally correct for, and
where gaps will be left and policymakers should intervene.
SUB-QUESTIONS
2.4.1 What AI agent infrastructure will be provided privately, and where will government
intervention be needed?
2.4.2 Which risks, such as reliability, will be mitigated by the market, and which risks
will require government intervention?
The market may address accuracy and reliability issues through competition, as users
abandon error-prone services. However, the market may not naturally address data privacy
standards, or ensure equitable access for low-income users. Addressing these market
failures could be part of the role policymakers play.
Understanding the “toolbox” of policy interventions is crucial to ensuring that any policy-
maker option assessment is thorough and evidence-based. Many actors are researching
technical solutions to AI safety and risk mitigation questions13 and these should be supported
by research on the appropriateness of sociotechnical and regulatory initiatives.
P IFP’s analysis of
comments on the US AI
We note that this is not a comprehensive list of interventions, and we do not cover sovereign
Action Plan highlights
AI, data & IP, security, healthcare, basic science funding, export controls, and other topics in that recommendations
from submissions
detail.P This allows us to share sub-questions on the topics we do cover, enabling a greater
covered the topics listed
level of precision for future work. above.
14 PARTNERSHIP ON AI
Preparing for AI Agent Governance
3.1 How might policymakers implement innovation-enabling
initiatives, such as regulatory sandboxes?14
Given the uncertainty around AI agent capabilities and current adoption-focused AI policy
frameworks, policymakers should be looking to better understand the capabilities of AI
agents and encourage innovation. Government organizations can run sandboxes, testbeds,
or live testing initiatives to test AI agents in real-world or simulated conditions. This approach
can enable the government to better understand the technology and viability of a planned
or implemented policy, while also enabling industry organizations to accelerate their access
to the market.
SUB-QUESTIONS
3.3.1 How can policymakers use regulatory sandboxes, testbeds, and live testing to
assure systems and test the feasibility of policies?
3.3.2 How can policymakers create simulated environments for AI agents to test capa-
bilities in high-stakes or complex regulatory environments?
Policymakers could use controlled testing environments to understand revenue impacts and
compliance risks before widespread deployment. For example, a government department,
such as the IRS, could establish a sandbox allowing limited deployment of tax-filing agents
for specific taxpayer segments with close monitoring. Participants could submit test returns
parallel to traditional filing, comparing accuracy and compliance. The sandbox could test
agent interactions with current government systems and evaluate government audit triggers.
15 PARTNERSHIP ON AI
Preparing for AI Agent Governance
3.2 How might policymakers implement visibility, documen-
tation, and post-deployment monitoring interventions?15
Addressing the risks of AI agents requires visibility: information about where, why, how, and
by whom AI agents are used.” 16 This information — which can be gathered through docu-
mentation, post-deployment system monitoring and logging, and incident reporting — helps
evaluate existing governance structures, adapt these structures, and ensure the account-
ability of key stakeholders. However, this information can highlight harms caused by an
agent, which industry organizations may want to avoid sharing information on, so policy-
makers may need to play a role in incentivizing and designing these systems.
SUB-QUESTIONS
3.2.1 How can policymakers ensure that design attributes that are important for
governance are sufficiently documented and made transparent?
3.2.2 What metrics will support the monitoring of society wide, multi-agent risks, and
how can policymakers monitor these metrics through networks in a privacy-pre-
serving way? (For example the frequency and proportion of human-agent and
agent-agent interactions)
16 PARTNERSHIP ON AI
Preparing for AI Agent Governance
3.3 How might policymakers incentivize or design
infrastructure to support AI agent governance?17
It is beneficial for industry organizations to build infrastructure that enables agents to take
more actions and be used by more people. However, there is also the need for infrastructure
that enables governance by non-industry actors, such as methods for agent attribution, and
mediates how agents interact with their environment, such as rollbacks. These may require
policymaker support.
SUB-QUESTIONS
3.3.1 How can policy actors build and incentivize the development of AI agent trust
infrastructure? For example, supporting infrastructure to enable agent IDs.
3.3.2 How can policy actors incentivize and implement agent “attribution” infra-
structure that attributes actions to agents, mediates interactions, and detects
and remedies harmful actions?
3.3.3 How can policy actors incentivize and implement agent “oversight” infra-
structure that generates useful information and supports accountability?
3.3.4 How can this infrastructure build on existing cybersecurity and information
technology infrastructure?
3.3.6 How can international cooperation facilitate the development of global stan-
dards and infrastructure for AI agents, learning from organizations like IETF?
3.3.7 Which institution is best placed to lead technical and/or policy interoperability
for agent governance?
Policymakers should aim to prevent fragmentation and ensure secure, coordinated interac-
tions, especially with government systems. Standardized APIs could enable secure tax-filing
agent interactions with government and financial databases and systems. Authentication
protocols could verify agent identity and authorization levels. A centralized registry could
track certified agents, their capabilities, and compliance history. “Circuit breakers” may
prevent mass submissions of incorrect returns, rollback mechanisms may correct for wide-
spread errors, and secure channels for agent-to-agent coordination could support joint
returns.
17 PARTNERSHIP ON AI
Preparing for AI Agent Governance
3.4 How might policymakers implement licensing, audit and
assurance-related interventions?18
In the same way that cars, legal professionals, and financial organizations require a license
to operate, autonomous AI agents could require licenses, registration, or other assurances
before being deployed. These assurances will need to be founded on agreed science and
metrology standards, which are not yet in place and may require policy stimulus. Additionally,
they should be focused on measuring real-world impacts from specific tasks.
SUB-QUESTIONS
3.4.1 How can policy actors encourage the development and usage of evaluations and
benchmarks for AI agents that more directly correlate with real-world impact,
compared to current practices?
3.4.2 How can policymakers create simulated environments for AI agents to test capa-
bilities in high-stakes or complex regulatory environments?
Policymakers should protect taxpayers from unreliable or malicious agents, similar to existing
protections for human tax preparers. Tax-filing agents could require certification similar to
other tax professionals. Licensing tiers might distinguish basic agents (simple returns) from
advanced agents (complex business filings). Each agent could have a specific agent ID to
track certifications. Audit requirements could include third-party assessments of accuracy
rates and security practices.
18 PARTNERSHIP ON AI
Preparing for AI Agent Governance
3.5 How might policymakers leverage demand-side and
supply-side levers to encourage beneficial AI adoption?19
Policymakers across the globe are not only looking to reduce the risks of AI, but increase the
diffusion and adoption of AI.Q Policymakers can create market signals to encourage people Q Examples include
the EU’s AI Continent
to use AI agents, such as by educating critical infrastructure employees on where agents
Action Plan, Japan’s AI
may be useful, and fostering the wider industry that makes AI possible, such as by providing Promotion Act and the
Brazilian AI Programme.
healthcare datasets that enable foundation models to better direct agent actions in that
sector. These actions will support the adoption of AI agent systems.
SUB-QUESTIONS
Policymakers should aim to accelerate beneficial adoption and ensure the benefits of
agentic tax assistance are distributed widely. Demand-side policies could include tax credits
for using certified agents or free agent access for low-income taxpayers. Supply-side support
might involve grants for developing accessible interfaces, funding for tax law training
datasets, or partnerships with community organizations. Government departments could
provide official guidance on agent requirements, reducing development uncertainty.
19 PARTNERSHIP ON AI
Preparing for AI Agent Governance
Monitoring, sandboxes, and testbeds
as a policymaker priority
Developing this research agenda has made clear that many of these questions are interre-
lated, so policymakers may find that identifying a clear sequence of priorities and actions
is challenging. However, combatting the current uncertainty around AI agents should be
a priority, and policymakers must develop the capability to monitor these systems and
their impacts in a way that complements current adoption-focused policy frameworks.
Governments should invest in institutional capacity and AI talent to make this happen, but
this may require significant funding that is not politically feasible to commit, so alternative
approaches should be explored.
Sandboxes and testbeds — controlled environments for testing, trialing, and experimentation
under regulatory supervision — provide a modern, pragmatic solution for policymakers, and
have already been explored in the UK, Singapore, and other regions. Developing this approach
should be the first step for policymakers, as they can enable policymakers and researchers
to gather information on risks, benefits, and other interventions.
Running sandboxes and testbeds with industry and non-industry actors will still have an
associated cost, but they will build institutional capacity at a lower cost than large upfront
investments. This approach will ensure guardrails and governance work in the real-world
and expand on specific sections of the US’s AI Action Plan, the US’s proposed SANDBOX Act,
and EU’s AI Act. This approach will also ensure that future policymaking is built on a deep
understanding of these systems and practical experience. It will also evolve with AI agents
but must be designed to combat regulatory capture.
from previous researchS and regulatory initiativesT to deliver actionable guidance for policy- T For example, [Link]’s
New Smarter Regulatory
makers on how to make these fit for AI agents.
Sandbox developed to
increase compliance,
Beyond regulatory sandboxes and testbeds, we will also build on our previous work in the and Singapore’s three
sandboxes on GenAI, AI
fields of policy, media integrity, labor and the economy, and safety-critical AI to explore the
Assurance, and Privacy
following areas: Enhancing Technologies.
20 PARTNERSHIP ON AI
Preparing for AI Agent Governance
1. International governance of AI agents SEE Q1.3
As AI agents increase in popularity around the world, risks that cross national borders
or concern the international community as a whole — such as critical infrastructure
disruption, privacy breaches, and electoral interference — can be exacerbated. PAI is
exploring how to manage those risks leveraging existing global governance tools,
such as international law, non-binding global norms, and international accountability
mechanisms, and what changes might be needed to strengthen them as AI agents
become more widespread. You can read more in our recent work on AI Agents & Global
Governance: Analyzing Foundational Legal, Policy, and Accountability Tools.
3. AI agent failures and monitoring SEE Q2.3: RISKS / Q3.2: POST-DEPLOYMENT MONITORING
As agents take direct actions on the environment, they create new risks and require
real-time failure detection. We have explored why and when real-time failure detection
matters in our recent work on Prioritizing Real-Time Failure Detection in AI Agents, and
will be continuing to explore how this can be operationalized.
We will also be exploring the impacts of AI agents on workers and the economy, building
on our previous work in the area.
We look forward to exploring these questions with our partnership, policymakers, and
the wider community. If you would like to be involved in our policy work on AI agents,
please email jacob@[Link].
21 PARTNERSHIP ON AI
Preparing for AI Agent Governance
Acknowledgements
This report was prepared with guidance from PAI’s Policy Steering Committee (members who
attended additional discussions are indicated by an asterisk). We appreciate the invaluable
input provided by experts who participated in workshop discussions, including:
Special thanks to PAI staff who have been integral to the development of this report:
Claire Leibowicz, John Howell, Madhulika Srikumar, Neil Uhl, Rebecca Finlay, Stephanie Bell,
Stephanie Ifayemi, Talita Dias
22 PARTNERSHIP ON AI
Preparing for AI Agent Governance
Endnotes
1 Anthropic, “System Card: Claude Opus 4 & Claude Sonnet 10 World Economic Forum, Navigating the AI Frontier:
4”, May 2025; Anthropic, “Economic Index”, Accessed A Primer on the Evolution and Impact of AI Agents,
September 24 2025; Chatterji, Aaron et al., How People Use December 2024; Kraprayoon, Jam et al., Agent
ChatGPT, September 2025; Mitchell, Margaret et al., Fully Governance; Kasirzadeh, Atoosa and Gabriel, Iason,
Autonomous AI Agents Should Not be Developed, February Characterizing AI Agents.
2025; Chan, Alan et al., Harms from Increasingly Agentic 11 Hammond, Lewis et al., Multi-Agent Risks from Advanced
Algorithmic Systems, May 2023. AI, Cooperative AI Foundation, February 2025; Bank of
2 Srikumar, Madhulika et al., Prioritizing Real-Time Failure England Financial Policy Committee, Financial Stability
Detection in AI Agents, Partnership on AI, September 2025. in Focus: Artificial Intelligence in the Financial System,
April 2025; Anthropic, Operating Multi-Client Influence
3 Cihon, Peter et al., “Levels of Autonomy in AI Agent Networks Across Platforms, April 2025; Bengio, Yoshua
Systems,” arXiv preprint arXiv:2502.15212 (2025). et al., International AI Safety Report 2025, February
4 Patel, Dwarkesh. “Why I Don’t Think AGI Is Right Around 2025; Slattery, Peter et al., The AI Risk Repository: A
the Corner.” Dwarkesh Blog, June 2025. Comprehensive Meta-Review, Database, and Taxonomy
of Risks from Artificial Intelligence, August 2024;
5 Cihon, Peter, Chilling autonomy: Policy enforcement for Kasirzadeh, Atoosa and Gabriel, Iason, Characterizing AI
human oversight of AI agents, 2024. Agents; OWASP, Agentic AI – Threats and Mitigations.
6 Kasirzadeh, Atoosa and Gabriel, Iason, Characterizing AI 12 Baldwin, Robert et al., Understanding Regulation: Theory,
Agents for Alignment and Governance, April 2024; Soder, Strategy, and Practice, 2nd ed. 2012.
Lisa et al., An Autonomy-Based Classification: AI Agents,
Liability and Lessons from the Automated Vehicles 13 UK AI Security Institute, “Research Agenda”, accessed
Act, Interface, April 2025; IBM, “What Are AI Agents?,” September 24, 2025; Infocomm Media Development
accessed September 24, 2025; Hugging Face, “What is an Authority, The Singapore Consensus on Global AI Safety
Agent?”, accessed September 24, 2025; Kapoor, Sayash Research Priorities: Building a Trustworthy, Reliable and
et al., AI Agents That Matter, July 2024; Farmer, Harry Secure AI Ecosystem, May 2025.
and Smakman, Julia, Delegation Nation: Advanced AI 14 Ranchordás, Sofia and Vinci, Valeria, Regulatory
Assistants and why they matter, Ada Lovelace Institute, Sandboxes and Innovation-friendly Regulation: Between
February 2025; OWASP Agentic Security Initiative, Agentic Collaboration and Capture, January 2024; Jeník, Ivo and
AI – Threats and Mitigations, February 2025; Willison, Duff, Schan, How to Build a Regulatory Sandbox: A Practical
Simon, “Using tweets gathered from [Link] Guide for Policy Makers, September 2020; Ministry of
[Link]/twitter/collecting-replies,” accessed September Science, Technology and Innovation, “National Technology
24, 2025. and Innovation Sandbox (NTIS)” accessed September 24,
7 Kraprayoon, Jam et al., Agent Governance: A Field Guide, 2025; Information Commissioner’s Office, “The Guide to
IAPS, April 2025; Kolt, Noam, Governing AI Agents, the Sandbox,” accessed September 24, 2025; Financial
February 2025; McKenzie, Colleen et al., Research Agenda Conduct Authority, Engagement Paper: Proposal for AI Live
for Sociotechnical Approaches to AI Safety, March 2025; Testing, April 2025; EU AI Act, arts. 57, 60.
Toner, Helen et al., Through the Chat Window and Into the 15 Chan, Alan et al., Infrastructure for AI Agents, June 2025;
Real World: Preparing for AI Agents, Center for Security Dafoe, Allan, AI Governance: A Research Agenda, Centre for
and Emerging Technology, October 2024; Cihon, Peter, the Governance of AI, Future of Humanity Institute, August
Chilling autonomy: Policy enforcement for human over- 2018; Pratt, Jacob and Tanjaya, Albert, Documenting the
sight of AI agents, 2024. Oueslati, Amin and Staes-Polet, Impact of Foundation Models, Partnership on AI, February
Robin, Ahead of the Curve: Governing AI Agents under the 2025.
EU AI Act, The Future Society, June 2025; O’Keefe, Cullen et
al., Law-Following AI: Designing AI Agents to Obey Human 16 Chan, Alan et al., Visibility into AI Agents, May 2024.
Laws, September 2025. 17 Chan, Alan et al., Infrastructure for AI Agents.
8 Howell, John and Ifayemi, Stephanie, Policy Alignment on 18 Chan, Alan et al., Infrastructure for AI Agents; Reuel, Anka
AI Transparency, October 2024; Ifayemi, Stephanie et al., et al., BetterBench: Assessing AI Benchmarks, Uncovering
Decoding AI Governance: A Toolkit for Navigating Evolving Issues, and Establishing Best Practices, November 2024.
Norms, Standards, and Rules, July 2025.
19 Ouimette, Marc E. et al., AI, Everywhere, All At Once: A New
9 Mossberger, Karen et al., Virtual Inequality: Beyond the Policy Agenda for AI Success Through Faster Adoption,
Digital Divide, 2003; Maslej, Nestor et al., The AI Index October 2024; Systma, Tobias, Managing AI’s Economic
2025 Annual Report, Institute for Human-Centered AI, Future: Strategic Automation Policy in an Era of Global
Stanford University, April 2025; Ofcom, “Online Nation Competition, RAND, May 2025.
2023 Report,” November 2023; Störmer, Eckhard et al.,
Foresight – Using Science and Evidence to Anticipate and
Shape the Future, Science for Policy Handbook, Ch. 12,
2020; Anthropic, “Economic Index”; Kasirzadeh, Atoosa
and Gabriel, Iason, Characterizing AI Agents.
23 PARTNERSHIP ON AI
Preparing for AI Agent Governance