SeriesMinds & Machines⚡ EventAct V
E19Act V · The Explosion

The Great Pause: The Moment the AI World Held Its Breath

On this page15 sections

“It calls for ‘all AI labs to immediately pause for at least 6 months the training of AI systems more powerful than GPT-4.’ The letter is signed by Elon Musk, Steve Wozniak, Yoshua Bengio, Stuart Russell, and approximately a thousand other researchers, technologists, and public figures. It is not signed by Sam Altman, Dario Amodei, Demis Hassabis, or the leadership of the major AI organisations… The pause does not happen.”

— The letter that asked for a pause — and the pause that never came

San Francisco, California. March 22, 2023. The Future of Life Institute, a non-profit organisation focused on existential risks, publishes an open letter titled “Pause Giant AI Experiments: An Open Letter.”

The letter begins: “AI systems with human-competitive intelligence can pose profound risks to society and humanity, as extensively documented by top AI researchers and as acknowledged by top AI labs such as OpenAI.”

It calls for “all AI labs to immediately pause for at least 6 months the training of AI systems more powerful than GPT-4.”

The letter is signed by Elon Musk, Steve Wozniak, Yoshua Bengio, Stuart Russell, and approximately a thousand other researchers, technologists, and public figures.

It is not signed by Sam Altman, Dario Amodei, Demis Hassabis, or the leadership of the major AI organisations.

Within two weeks, it has gathered more than 30,000 signatures — researchers, engineers, executives, academics, activists, and curious members of the public who want their names attached to a position that says: we should slow down.

The pause does not happen. The AI development that the letter calls for pausing continues without interruption. GPT-4 is deployed, expanded, and improved. The race for more capable AI systems accelerates, not decelerates.

But the letter’s significance is not in what it achieved. It is in what it revealed: the depth and breadth of the concern about where AI development is going, the absence of institutional mechanisms to address that concern, and the specific difficulty of collective action in a domain where the competitive incentives point consistently toward speed.

Future of Life Institute publishes the “Pause Giant AI Experiments” open letter
Date:
March 22, 2023
Location:
Future of Life Institute (Cambridge, Massachusetts)
Significance:
The Future of Life Institute — a non-profit focused on existential risk, founded by MIT physicist Max Tegmark and others — published an open letter calling for “all AI labs to immediately pause for at least 6 months the training of AI systems more powerful than GPT-4.” Within two weeks it gathered more than 30,000 signatures, including Elon Musk, Steve Wozniak, Yoshua Bengio, and Stuart Russell. It was not signed by Sam Altman, Dario Amodei, or Demis Hassabis.
Outcome:
The pause did not happen. AI development continued without interruption; GPT-4 was deployed, expanded, and improved; the race for more capable systems accelerated. The letter’s significance lay not in what it achieved but in what it revealed — the depth of concern, the absence of institutional mechanisms to act on it, and the difficulty of collective action where competitive incentives point toward speed.
Important

The pause did not happen. The AI development the letter called for pausing continued without interruption. GPT-4 was deployed, expanded, and improved. The race for more capable AI systems accelerated, not decelerated. But the letter’s significance is not in what it achieved — it is in what it revealed: the depth and breadth of concern about where AI development was going, the absence of institutional mechanisms to address that concern, and the specific difficulty of collective action in a domain where competitive incentives point consistently toward speed.


The Context: Three Months After ChatGPT

The Future of Life Institute letter appeared exactly three months after the launch of ChatGPT. Its timing was not coincidental.

In the three months between November 30, 2022 and March 22, 2023, the AI landscape had been transformed. ChatGPT had reached one hundred million users. GPT-4 had been announced and deployed. Bing had integrated AI conversation into search. Google had announced and then struggled with its own conversational AI product, Bard. A dozen major technology companies had announced AI initiatives. Dozens of startups had raised hundreds of millions of dollars to build AI applications.

The pace of development and deployment had been extraordinary by any previous standard. The AI capabilities that had been the subject of cautious, hedged, expert-to-expert discussions for years were suddenly mainstream consumer products, available to anyone with an internet connection, used by hundreds of millions of people daily.

And the concerns about those capabilities — the safety concerns, the alignment concerns, the governance concerns — had not been resolved. They had been outrun by deployment.

The letter was, in this context, an expression of a specific and understandable anxiety: the technology was moving faster than the understanding of its risks, faster than the development of governance frameworks to manage it, faster than the safety research required to ensure it was aligned with human values. The call for a pause was a call for the pace of deployment to match the pace of understanding.

Note

The pause letter appeared exactly three months after ChatGPT’s launch. In those three months: ChatGPT reached one hundred million users; GPT-4 was announced and deployed; Bing integrated AI conversation into search; Google announced (and stumbled with) Bard; a dozen majors announced AI initiatives; dozens of startups raised hundreds of millions. The safety, alignment, and governance concerns that had been the subject of cautious expert-to-expert discussions for years had not been resolved — they had been outrun by deployment.


The Signatories: A Coalition of Concern

The list of signatories to the pause letter was striking in its breadth and its internal diversity.

The AI safety community. The most important signatories, from the perspective of technical credibility, were the AI researchers who signed. Yoshua Bengio — one of the Godfathers of Deep Learning, a Turing Award winner, one of the people most responsible for the capabilities that the letter was concerned about — signed and became one of the letter’s most vocal public advocates. Stuart Russell — the author of the most widely used AI textbook and the most respected mainstream AI researcher who had engaged seriously with existential risk — signed. Several other prominent AI researchers signed, including some at the major AI organisations.

Yoshua Bengio
Born:
March 5, 1964
Died:
Living
Nationality:
Canadian
Role:
Professor of Computer Science at the University of Montreal; founder and scientific director of Mila (the Montreal Institute for Learning Algorithms)
Known for:
Co-recipient (with Geoffrey Hinton and Yann LeCun) of the 2018 ACM Turing Award for foundational work on deep learning; a pioneer of neural probabilistic language models, sequence-to-sequence learning, and attention. One of the three “Godfathers of Deep Learning.” After ChatGPT, became one of the most prominent scientific advocates for AI safety and governance reform, signing the March 2023 pause letter and founding the AI safety institute MILA-ISIS.
Stuart Russell
Born:
1962
Died:
Living
Nationality:
British-American
Role:
Professor of Computer Science at the University of California, Berkeley; founder of the Center for Human-Compatible Artificial Intelligence (CHAI)
Known for:
Co-author (with Peter Norvig) of Artificial Intelligence: A Modern Approach, the most widely used AI textbook worldwide; originator of the “probabilistic” approach to AI and of Russell’s three principles for beneficial AI. Among the most respected mainstream AI researchers to engage seriously with the existential-risk question; signed the March 2023 pause letter.

The AI critics and ethicists. Researchers who had been working on the social and ethical dimensions of AI — the algorithmic bias researchers, the AI governance scholars, the philosophers of mind and technology who had been raising concerns about AI for years — signed in significant numbers.

The technology industry. Elon Musk, who had co-founded OpenAI and then departed, was among the most prominent signatories and became one of its most vocal public advocates. Steve Wozniak, the co-founder of Apple, signed. Several technology executives who were not primarily AI researchers signed as expressions of concern from the broader technology community.

Elon Musk
Born:
June 28, 1971
Died:
Living
Nationality:
South African-Canadian-American
Role:
Co-founder and CEO of Tesla; founder and CEO of SpaceX; co-founder of OpenAI (2015, departed 2018); founder of xAI (2023); owner of X (formerly Twitter)
Known for:
Co-founding OpenAI as a non-profit in 2015 with Sam Altman and others, then departing in 2018 over disagreements about direction and safety. His signing of the March 2023 pause letter was complicated by his simultaneous announcement of xAI — critics noted he advocated a pause for OpenAI but not for his own AI company, raising questions about whether the position was principled or strategic.
Steve Wozniak
Born:
August 11, 1950
Died:
Living
Nationality:
American
Role:
Co-founder of Apple Computer (with Steve Jobs, 1976); computer engineer; philanthropist
Known for:
Designing the Apple I and Apple II — the personal computers that established the consumer computing industry. His signing of the March 2023 pause letter, as one of the most respected figures in the broader technology community, brought significant non-AI-industry credibility to the concern.

The public. A significant fraction of the eventual 30,000-plus signatories were not researchers or technology professionals but members of the general public who had heard about the letter, agreed with its concerns, and wanted to express that agreement. This public participation was itself significant: it demonstrated that the safety concerns that had been primarily discussed among specialists were resonating with a broader audience.

The notable absences were as significant as the presences. The founders and leaders of the major AI organisations — Sam Altman, Dario Amodei, Demis Hassabis — did not sign. The researchers most directly responsible for the frontier systems the letter was concerned about — the teams at OpenAI, DeepMind, Anthropic — were largely absent. The people with the most direct knowledge of what the systems could do and the most ability to act on a pause were not among the signatories.

Info

The notable absences from the signatory list were as significant as the presences. The founders and leaders of the major AI organisations — Sam Altman (OpenAI), Dario Amodei (Anthropic), Demis Hassabis (DeepMind) — did not sign. The researchers most directly responsible for the frontier systems the letter was concerned about were largely absent. The people with the most direct knowledge of what the systems could do — and the most ability to act on a pause — were not among the signatories.


What the Letter Actually Said

The full text of the letter is important for understanding both its specific claims and the specific criticisms it attracted.

The letter argued that “AI systems with human-competitive intelligence” posed “profound risks to society and humanity.” It noted that AI labs had acknowledged these risks and argued that they should not “outsource” the decision about whether to proceed with dangerous development to an unchecked racing dynamic. It called for all AI labs to pause training of systems more powerful than GPT-4 for at least six months.

During the proposed pause, the letter suggested, AI safety institutions, AI labs, and policymakers should “use this time to jointly develop and implement a set of shared safety protocols for advanced AI design and deployment.” It proposed specific questions that should be addressed: What are the best ways to test AI safety? What are the best ways to reduce AI risks? What are the requirements for an independent AI safety authority? What should be the governance of powerful AI development and deployment?

The letter was careful to distinguish between pausing the development of more powerful systems and halting AI development entirely. It was not calling for a moratorium on AI. It was calling for a pause on the specific development of systems more capable than GPT-4, to allow safety research and governance to catch up.

The letter’s specific claims were calibrated to be difficult to disagree with at the level of principle. It did not claim that GPT-4 was dangerous. It claimed that more powerful systems posed risks and that safety research should develop in advance of deployment. These were positions that many AI researchers who did not sign the letter agreed with in principle, even if they disagreed with the specific call for a pause.

Quote

“AI systems with human-competitive intelligence can pose profound risks to society and humanity, as extensively documented by top AI researchers and as acknowledged by top AI labs such as OpenAI.”

— From the Future of Life Institute “Pause Giant AI Experiments” open letter, March 22, 2023

Note

The letter was carefully calibrated to be difficult to disagree with at the level of principle. It did not claim that GPT-4 was dangerous. It did not call for a moratorium on AI. It called for a six-month pause on the specific development of systems more capable than GPT-4, during which safety institutions, labs, and policymakers would “jointly develop and implement a set of shared safety protocols for advanced AI design and deployment.” These were positions that many AI researchers who did not sign the letter agreed with in principle — even when they disagreed with the specific call for a pause.


The Criticisms: Why Prominent Researchers Didn’t Sign

The most important critics of the pause letter were not AI sceptics who denied that safety concerns were real. They were AI researchers who took safety seriously and believed that the specific call for a pause was counterproductive, misdirected, or based on a flawed analysis of the competitive dynamics.

The enforceability critique. The most common practical objection was that a voluntary pause that was not enforced by any governmental or international mechanism was meaningless. If OpenAI and Anthropic paused but Google, Baidu, and Meta did not — or if some of the smaller, less safety-focused organisations simply ignored the pause — the competitive dynamics would continue unchanged, with only the most safety-conscious organisations disadvantaged. A pause without enforcement was, on this view, just a way for safety-focused organisations to fall behind less safety-focused ones.

The misdirection critique. Some critics argued that the letter focused on the wrong risks. The specific capability threshold — “more powerful than GPT-4” — was arbitrary and did not correspond to any well-defined safety threshold. The risks from current systems — bias, misinformation, misuse, labour market disruption — were more immediate and more concrete than the catastrophic risks from hypothetical future systems, and a letter that focused on hypothetical future risks while current systems were already causing harm had its priorities wrong.

The competitive dynamics critique. Several prominent researchers argued that the real issue was not whether to pause but how to govern AI development through legitimate international institutions rather than voluntary commitments. A six-month pause followed by resumed development with no change in governance would not address the underlying problem. What was needed was not a pause but a transformation of the governance landscape — international agreements, regulatory frameworks, accountability mechanisms — that could manage AI development over the long term. The pause proposal, on this view, was a distraction from the harder and more important work of building those institutions.

The science fiction critique. Some critics argued that the letter’s framing — its references to “AI systems with human-competitive intelligence” and its implicit engagement with scenarios of superintelligent AI — was speculative and premature. Current AI systems were not superintelligent, were not self-improving, were not on the verge of the scenarios that the letter’s most alarming language implied. Treating speculative future scenarios as if they were immediate risks was not conducive to the clear-eyed, specific policy work that AI governance required.

The Musk critique. Elon Musk’s presence as a primary signatory was itself a critique, in the sense that his motivations were questioned. Musk had co-founded OpenAI, had departed in a dispute with its leadership, and had subsequently announced plans to build his own AI company, xAI. His advocacy for a pause on OpenAI’s development — but not, critics noted, for a pause on xAI’s development — was read by some as strategic rather than principled. Musk was, after all, a competitor of OpenAI, and a pause that disadvantaged OpenAI while allowing xAI to continue would benefit him commercially. Whether this reading was fair is a separate question; it complicated the public reception of the letter.

Pitfall

The pause letter attracted five lines of serious criticism from researchers who took safety seriously but rejected the specific proposal:

  1. Enforceability — a voluntary pause with no enforcement mechanism would only disadvantage the most safety-conscious organisations while less safety-focused competitors kept building
  2. Misdirection — the “more powerful than GPT-4” threshold was arbitrary and the focus on hypothetical future risks distracted from concrete current harms
  3. Competitive dynamics — a pause without governance reform was a distraction from the harder work of building international regulatory institutions
  4. Science fiction — the framing of “human-competitive intelligence” and implicit superintelligence scenarios was speculative and premature
  5. The Musk problem — Elon Musk’s advocacy for a pause at OpenAI while simultaneously founding xAI complicated the public reception, regardless of whether the critique was fair

The Response from AI Organisations

The responses from the major AI organisations to the pause letter were instructive about how those organisations understood their own safety commitments and their relationship to the broader safety concern.

OpenAI did not issue a formal response to the letter but Sam Altman addressed it in various public appearances. His position was consistent: he shared the concern about the risks of advanced AI, he believed that being at the frontier was necessary for doing relevant safety research, and a pause that halted OpenAI’s development while leaving other organisations to continue would not serve the safety goals the letter’s authors were trying to achieve. He expressed sympathy with the underlying concern while rejecting the specific proposal.

Anthropic’s position was more nuanced. Dario Amodei did not sign the letter but expressed sympathy with its underlying concerns. Anthropic had published its own “responsible scaling policy” — a framework for evaluating the safety of its models before deployment and for defining the conditions under which development of more capable models should be paused. The responsible scaling policy represented Anthropic’s specific answer to the governance question: not a blanket pause, but a set of specific thresholds and evaluation criteria that would determine when to pause or not.

Dario Amodei
Born:
1983
Died:
Living
Nationality:
American
Role:
Co-founder and CEO of Anthropic (2021–present); former Vice President of Research at OpenAI (2016–2021)
Known for:
Co-founding Anthropic with his sister Daniela Amodei in 2021 after leaving OpenAI over disagreements about safety and commercial direction; championing the “Responsible Scaling Policy” (RSP) — a framework that defines specific capability thresholds tied to catastrophic-harm potential above which Anthropic commits to not deploying additional capability without specific safety measures. Did not sign the March 2023 pause letter, but articulated Anthropic’s RSP as the more specific alternative.
Definition

Responsible Scaling Policy (RSP) — A framework, first published by Anthropic in September 2023, that defines specific capability thresholds for AI models (tied to potential for catastrophic harm — e.g. CBRN assistance, autonomous replication) and commits the organisation to not deploying models above each threshold without implementing specific safety measures (up to and including a pause in capability scaling). The RSP was Anthropic’s answer to the pause letter’s call for governance: not a blanket industry-wide pause, but a specific, evaluable, public commitment whose violation would be visible.

Google/DeepMind’s response was the most guarded. The organisation issued careful statements acknowledging the importance of safety while declining to commit to any specific pause. The commercial pressures on Google — the competitive threat from ChatGPT and Microsoft’s Bing integration — made any commitment to slow down particularly costly.

The collective response of the AI organisations revealed a pattern: broad acknowledgment that the safety concerns were real, disagreement with the specific pause proposal, and alternative proposals that were either already being implemented or were less binding than the letter called for. None of the major AI organisations made changes to their development plans in response to the letter.

Important

The collective response of the major AI organisations revealed a consistent pattern: broad acknowledgment that the safety concerns were real, disagreement with the specific pause proposal, and alternative proposals that were either already being implemented or were less binding than the letter called for. None of the major AI organisations made changes to their development plans in response to the letter. The pattern — endorse the concern, reject the proposal, substitute a self-defined commitment — became the standard template for AI industry responses to governance pressure.


The Governance Gap: Why the Letter Highlighted a Problem Without Solving It

The pause letter’s failure to achieve its immediate goal — a pause in AI development — was not simply a consequence of the letter’s specific limitations. It was a consequence of a more fundamental problem: the absence of legitimate international institutions with the authority and the capacity to govern frontier AI development.

The governance gap in AI is significant and well-documented. The development of frontier AI systems is concentrated in a small number of private companies and research organisations, primarily in the United States and China. These organisations are accountable to their shareholders, their investors, and their employees, and to national governments through existing law. But they are not accountable to any international institution with the authority to set development standards or enforce development pauses.

The analogy to nuclear weapons development is instructive. After the development of nuclear weapons, the international community gradually developed a set of treaties, inspection regimes, and institutions — the Non-Proliferation Treaty, the International Atomic Energy Agency, various bilateral arms control agreements — that created at least a partial framework for managing the spread and use of nuclear weapons. The framework is imperfect and incomplete, but it represents a genuine international governance mechanism.

No analogous mechanism exists for AI. There are international discussions — at the UN, at the OECD, at various summits — but these discussions have produced principled statements rather than binding commitments, and the competitive dynamics between the United States and China make the development of binding international AI governance significantly harder than nuclear governance was (and nuclear governance was not easy).

The pause letter was an attempt to fill this governance gap through voluntary commitment rather than institutional enforcement. The attempt failed, predictably, because voluntary commitments are insufficient in the face of competitive incentives that push consistently toward faster development.

The letter’s lasting contribution was not its specific proposal but its articulation of the governance gap — its demonstration that there were no legitimate mechanisms through which the concerned members of the AI research community, the concerned members of the public, or concerned governments could require AI organisations to slow down, even if those organisations themselves acknowledged the safety concerns.

Info

The analogy the AI governance community returns to is nuclear weapons. After 1945, the international community gradually built a partial framework — the Non-Proliferation Treaty (1968), the International Atomic Energy Agency, bilateral arms control agreements — that created at least some international governance mechanism for managing nuclear technology. No analogous mechanism exists for AI. There are discussions at the UN, the OECD, and various summits, but they have produced principled statements rather than binding commitments. The pause letter was an attempt to fill this gap through voluntary commitment — and it failed, predictably, because voluntary commitments are insufficient in the face of competitive incentives that push consistently toward faster development.


The Bletchley Summit: From Letter to Institution

The governance gap that the pause letter highlighted did not go unaddressed. In November 2023, the United Kingdom government hosted the first international AI Safety Summit at Bletchley Park — the wartime codebreaking facility that had become a symbol of British technological achievement.

Bletchley AI Safety Summit
Date:
November 1–2, 2023
Location:
Bletchley Park, Buckinghamshire, United Kingdom
Significance:
The United Kingdom government hosted the first international AI Safety Summit at Bletchley Park — the wartime codebreaking facility that had become a symbol of British technological achievement. The summit produced the Bletchley Declaration, signed by twenty-eight countries including the United States, China, the European Union, and India — an unusual moment of US-China agreement on a technology issue in a period of intense strategic competition.
Outcome:
The declaration was a statement of principle rather than a binding commitment. It committed signatories to sharing information about safety risks and cooperating on safety research, but not to slowing development, subjecting systems to independent evaluation, or any specific policy outcome. The summit also catalysed the establishment of AI Safety Institutes in the UK, the United States, and several other countries.

The summit was explicitly motivated by the concerns that had been circulating since ChatGPT’s launch — concerns about the safety of frontier AI systems and about the absence of international governance frameworks to manage the risks. The choice of Bletchley Park was symbolic: the same place where human ingenuity had broken enemy codes was now the setting for a discussion about managing the codes that might replace human intelligence.

The summit produced the Bletchley Declaration — a statement signed by twenty-eight countries, including the United States, China, the European Union, and India, acknowledging the potential risks of frontier AI systems and committing the signatories to cooperation on AI safety. The declaration was more substantive than previous international AI statements in one specific respect: it was signed by China as well as the United States and European countries, representing an unusual moment of US-China agreement on a technology issue in a period of intense strategic competition.

The declaration was also limited in important ways. It was a statement of principle rather than a binding commitment. It did not establish any new institutional mechanism for enforcing safety standards or coordinating governance. The signatories committed to sharing information about safety risks and to cooperating on safety research, but they did not commit to slowing development, to subject their systems to independent evaluation, or to any specific policy outcome.

The AI Safety Institutes that were established in the UK, the United States, and several other countries in the aftermath of the summit represented a more concrete institutional development. These institutes were charged with developing evaluation frameworks for frontier AI systems, with testing AI capabilities and safety properties, and with providing technical expertise to governments and policymakers working on AI governance.

The institutes were a meaningful institutional development — they represented the first government bodies specifically charged with technical evaluation of frontier AI safety. But they were also limited: they had no enforcement authority, they depended on voluntary cooperation from AI companies to access the systems they were evaluating, and they were chronically understaffed relative to the complexity of the task.

Note

The Bletchley Declaration’s strength: signed by twenty-eight countries including the United States, China, the European Union, and India — an unusual moment of US-China agreement in a period of intense strategic competition. The Bletchley Declaration’s limits: a statement of principle, not a binding commitment; no new institutional mechanism for enforcement; signatories committed only to sharing safety information and cooperating on research, not to slowing development or accepting independent evaluation. The AI Safety Institutes that followed were the first government bodies charged with technical evaluation of frontier AI safety — but they had no enforcement authority, depended on voluntary company cooperation, and were chronically understaffed.


The Congressional Response: America Grapples With AI

In the United States, the AI governance conversation moved into a new phase with Sam Altman’s May 2023 testimony before the Senate Judiciary Committee — the first time the CEO of a major AI company had testified before Congress on AI safety.

The testimony was remarkable for the consensus it produced. Altman acknowledged that AI could be dangerous — could be “potentially the most transformative and potentially dangerous technology ever developed by humanity” — and called for federal regulation of AI, including a licensing regime for the most powerful AI systems. This was an unusual position for a technology company CEO: typically, technology companies lobby against regulation, arguing that it would stifle innovation. Altman was calling for regulation and offering to cooperate in developing it.

The senators’ questioning reflected genuine engagement with the specific technical concerns — questions about deepfakes, about AI in election interference, about AI-enabled weapons, about the labour market impacts — alongside broader questions about national security and the competitiveness of American AI relative to China.

The bipartisan concern about AI — the fact that both Democratic and Republican senators were asking serious questions about safety and governance — was itself significant. AI safety had become an issue that cut across partisan lines, not because the parties agreed on the specific policy responses but because the underlying concerns resonated across different political frameworks.

The legislative response to Altman’s testimony and to the broader AI governance debate was slower and more contentious than the initial bipartisan concern suggested. Bills were introduced in both houses of Congress but progress was limited by the complexity of the issues, the lobbying of the AI industry, and the broader dysfunction of the legislative process. By the end of 2023, the most significant American response to AI governance was the Biden administration’s Executive Order on AI — executive action that was comprehensive in scope but limited in enforceability.

Biden Executive Order on AI
Date:
October 30, 2023
Location:
The White House, Washington, D.C.
Significance:
President Biden’s Executive Order 14110 — “Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence” — was the most comprehensive AI governance action by the US federal government to date. It required frontier AI developers to share safety test results with the government under the Defense Production Act, established safety standards for AI systems with potential for catastrophic misuse, directed federal agencies to develop AI governance policies, and initiated a process for developing international AI governance norms.
Outcome:
The order was ambitious in scope but limited in enforceability — executive orders cannot create the binding legal obligations that legislation can. It demonstrated that the federal government was taking AI governance seriously and was beginning to develop the institutional capacity to regulate AI, but left the harder legislative work to Congress.

The European Approach: The AI Act’s Development

While the United States was navigating the slow legislative process, the European Union was finalising the AI Act — a comprehensive regulatory framework that had been in development since 2021 and that was accelerated by the specific capabilities that ChatGPT had demonstrated.

The AI Act took a risk-based approach to AI regulation: classifying AI systems by their potential for harm and applying proportionate requirements to each risk tier. Systems posing unacceptable risk — AI systems that manipulate human behaviour subliminally, social scoring systems, real-time biometric identification in public spaces — were prohibited. Systems posing high risk — AI in critical infrastructure, in employment, in education, in criminal justice, in essential services — were subject to requirements for transparency, documentation, human oversight, and regular evaluation. General-purpose AI systems capable of a wide range of tasks — the large language models that ChatGPT had made famous — were subject to transparency requirements and obligations to mitigate serious risks.

The AI Act’s development and passage were complicated by the rapid advancement of AI capabilities. The original 2021 proposal had not anticipated the specific capabilities of large language models, and significant revisions were required to address the new landscape. The specific provisions governing “general-purpose AI systems” and “frontier AI models” were among the most contested in the final negotiation, reflecting genuine uncertainty about how to regulate systems whose capabilities and risks were still being understood.

EU AI Act passed
Date:
March 13, 2024 (final adoption; entered into force August 1, 2024)
Location:
European Parliament, Strasbourg, France
Significance:
The EU AI Act — the world’s first comprehensive horizontal AI regulatory framework — took a risk-based approach: prohibiting AI that manipulates behaviour subliminally, social scoring, and real-time biometric ID in public spaces; requiring transparency, documentation, human oversight, and regular evaluation for high-risk AI (in critical infrastructure, employment, education, criminal justice, essential services); and imposing transparency and risk-mitigation obligations on general-purpose AI systems.
Outcome:
The act applies not just to European AI companies but to any company deploying AI systems in the European Union, giving it significant extraterritorial reach. Phased implementation runs through 2026. The act represented the most comprehensive AI regulatory framework yet enacted — and a different modality of the same underlying concern as the pause letter: that AI development was proceeding without adequate governance.

The AI Act passed in March 2024 and entered into force in August 2024, with a phased implementation schedule running through 2026. It represented the most comprehensive AI regulatory framework yet enacted, and it will apply not just to European AI companies but to any company that deploys AI systems in the European Union — giving it significant extraterritorial reach.

The AI Act’s relationship to the pause letter’s concerns is indirect but real. The letter called for a voluntary pause; the AI Act established mandatory requirements. The letter focused on the pace of development; the AI Act focused on the conditions under which AI systems could be deployed. The letter was a call to action from civil society; the AI Act was a legislative response from a major political institution. The two represent different modalities of the same underlying concern: that AI development was proceeding without adequate governance, and that governance needed to catch up.


The Voluntary Commitments: A Parallel Track

Alongside the regulatory development, the major AI organisations made a series of voluntary safety commitments in 2023 that represented a parallel track to the governance discussions.

In July 2023, seven leading AI companies — Amazon, Anthropic, Google, Inflection, Meta, Microsoft, and OpenAI — signed voluntary commitments at the White House, including commitments to share safety information among themselves and with governments, to invest in cybersecurity and insider threat safeguards, and to develop technical mechanisms for marking AI-generated content. The commitments were presented as evidence of responsible self-governance and as a complement to, rather than a substitute for, regulatory action.

White House voluntary AI commitments
Date:
July 21, 2023
Location:
The White House, Washington, D.C.
Significance:
Seven leading AI companies — Amazon, Anthropic, Google, Inflection, Meta, Microsoft, and OpenAI — signed voluntary commitments at the White House, including commitments to share safety information among themselves and with governments, to invest in cybersecurity and insider threat safeguards, and to develop technical mechanisms for marking AI-generated content.
Outcome:
The commitments were received sceptically by some AI safety researchers, who noted they did not include specific capability thresholds, specific deployment standards, or any enforcement mechanism. A commitment to “invest in cybersecurity” without specific, measurable standards is easy to satisfy nominally without meaningfully changing behaviour.

The voluntary commitments were received sceptically by some AI safety researchers and governance advocates, who noted that they did not include specific capability thresholds, specific deployment standards, or any enforcement mechanism. A commitment to “invest in cybersecurity” without specific, measurable standards for what that investment should achieve is easy to satisfy nominally without meaningfully changing behaviour.

The responsible scaling policies that Anthropic and, subsequently, other AI organisations developed represent a more specific form of voluntary commitment. Anthropic’s RSP defines specific capability thresholds — tied to the potential for catastrophic harm — above which the organisation commits to not deploying additional capability without implementing specific safety measures. The RSP is more specific than the White House commitments, and Anthropic’s track record of publishing detailed model cards and safety evaluations suggests genuine engagement with the policy rather than pure public relations.

Whether voluntary commitments, however specific, are adequate for the governance challenge that frontier AI poses is a question that the alignment research community, the governance community, and the policy community continue to debate. The consensus position is that voluntary commitments are insufficient as a long-term governance mechanism but valuable as a complement to regulatory frameworks and as a demonstration that governance is technically feasible.

Warning

Voluntary commitments — whether the White House version (July 2023) or the more specific responsible scaling policies — share a structural limitation: they are voluntary. A commitment to “invest in cybersecurity” without specific measurable standards is easy to satisfy nominally without meaningfully changing behaviour. Even Anthropic’s more rigorous Responsible Scaling Policy is, ultimately, a commitment that Anthropic has made to itself and can revise at its own discretion. The consensus among governance researchers is that voluntary commitments are valuable as a complement to regulatory frameworks and as proof that governance is technically feasible — but insufficient as a long-term governance mechanism.


The Outcome: What Changed and What Didn’t

Eighteen months after the pause letter was published, a fair assessment of its impact and the broader governance response requires acknowledging both what changed and what did not.

What changed. The governance conversation accelerated. The Bletchley Summit and the AI Safety Institutes it produced were genuine institutional developments that would not have happened without the concatenation of ChatGPT’s launch, the pause letter, and the safety concerns that brought them together. The EU AI Act’s passage provided the first binding legal framework for AI governance in a major jurisdiction. Congressional attention to AI safety intensified, and while legislative action was limited, the political salience of the issue increased.

The major AI organisations’ engagement with safety and governance also changed, though it is difficult to assess how substantively. The voluntary commitments, the responsible scaling policies, the investments in safety research — these were real, though their adequacy is contested. The visibility of safety as an issue — in public discourse, in investor discussions, in recruitment and retention of AI talent — increased significantly.

What did not change. The pace of AI development did not slow. The competitive dynamics that the pause letter was concerned about — the racing between organisations for more capable models, the deployment of increasingly powerful systems before their safety properties were fully understood — continued without meaningful interruption. GPT-4 was deployed and expanded. Claude 2, Claude 3, Gemini, LLaMA 3, Mistral Large — more capable systems continued to appear at a pace that maintained or accelerated the rate of capability improvement.

The specific capability threshold that the letter called for pausing at — “more powerful than GPT-4” — has been well and truly exceeded. Multiple AI systems more capable than GPT-4 have been developed and deployed, without the safety research and governance frameworks that the letter called for being meaningfully advanced.

The accountability gap — the absence of institutional mechanisms through which concerned stakeholders could require AI organisations to demonstrate safety before deploying more capable systems — has not been closed. The AI Safety Institutes are a step toward closing it, but they lack the enforcement authority and the technical capacity to provide the kind of accountability that frontier AI governance requires.

Important

What changed in the eighteen months after the pause letter: the governance conversation accelerated; the Bletchley Summit and AI Safety Institutes were genuine institutional developments; the EU AI Act became the first binding legal framework in a major jurisdiction; the political salience of AI safety intensified; AI organisations’ engagement with safety — voluntary commitments, responsible scaling policies, safety research investments — became more visible (though how substantively is contested).

What did not change: the pace of AI development did not slow; the racing dynamics the letter was concerned about continued without meaningful interruption; GPT-4 was deployed and expanded; Claude 2/3, Gemini, LLaMA 3, Mistral Large — more capable systems kept appearing; the “more powerful than GPT-4” threshold the letter called for pausing at has been well and truly exceeded, multiple times, without the safety research and governance frameworks the letter called for being meaningfully advanced.


The Letter’s Legacy: What It Revealed

The pause letter’s most significant legacy is not in what it achieved but in what it revealed.

It revealed the depth and breadth of the safety concern within the technical community. The thousand-plus initial signatories included enough serious researchers — Bengio, Russell, and many others — to establish that the concern was not fringe or uninformed. The subsequent 30,000 signatures demonstrated that the concern extended to a broad public. The AI safety concern, previously largely confined to specialist discussions, had become a mainstream concern.

It revealed the specific absence of legitimate governance mechanisms. The pause proposal was not obviously wrong on its merits — the case for allowing safety research to catch up with capability development was coherent and was not effectively refuted by the letter’s critics. But it was unenforceable, because no institution existed with the authority to enforce it. The letter’s failure demonstrated the governance gap as clearly as any academic paper could.

It revealed the specific competitive dynamics that made governance difficult. The organisations with the most ability to act on the pause concern — OpenAI, DeepMind, Anthropic — did not do so, because acting on it unilaterally would disadvantage them relative to competitors who did not pause. The prisoner’s dilemma logic of AI development — the structure in which the best individual outcome is to continue development regardless of what competitors do, even though the best collective outcome would require all organisations to slow down simultaneously — was never clearer than in the responses to the pause letter.

And it revealed the specific character of the AI governance challenge: deeply political, not just technical. The question of whether to pause AI development was not primarily a technical question about AI safety. It was a political question about competitive advantage, economic interests, national security, and the distribution of power in a world transformed by powerful AI. Addressing the governance challenge required the kind of political engagement — with governments, with international institutions, with the public — that the technical AI safety community was not well-positioned to provide alone.

Note

The pause letter’s most significant legacy is not what it achieved but what it revealed:

  1. The depth of concern — the thousand-plus initial signatories (Bengio, Russell, many others) established the concern was not fringe; the subsequent 30,000 signatures demonstrated it had become mainstream
  2. The absence of legitimate governance mechanisms — the pause proposal was not obviously wrong on its merits, but it was unenforceable because no institution existed with the authority to enforce it
  3. The competitive dynamics — the prisoner’s-dilemma logic of AI development was never clearer than in the responses to the pause letter: the organisations with the most ability to act (OpenAI, DeepMind, Anthropic) did not, because acting unilaterally would disadvantage them
  4. The political character of the challenge — the question of whether to pause was not primarily a technical question about AI safety; it was a political question about competitive advantage, economic interests, national security, and the distribution of power

The Governance Future: What Must Come Next

The pause letter did not produce a pause. The Bletchley Summit produced a declaration. The EU AI Act produced a regulatory framework for some AI applications in one jurisdiction. The AI Safety Institutes produced a handful of government bodies with limited authority. The voluntary commitments produced statements of principle without enforcement.

None of this is nothing. Each development represents progress toward the governance infrastructure that managing frontier AI will require. But each is also insufficient, and the gap between the governance that exists and the governance that the current trajectory of AI development seems to require is large and growing.

What would adequate governance look like? The academic and policy literature is beginning to converge on several elements.

International institutions. The analogy to nuclear governance — imperfect as it is — suggests that managing AI will require international institutions with the authority to set development standards, to conduct inspections, and to enforce compliance. Building those institutions requires the kind of sustained, difficult diplomacy between major powers that has produced the Nuclear Non-Proliferation Treaty and the IAEA. It is achievable but not easy, and the US-China competition in AI makes it harder.

Mandatory evaluation. The AI Safety Institutes have demonstrated that technical evaluation of frontier AI systems is feasible. What is missing is the authority to require evaluation before deployment. Making pre-deployment evaluation mandatory — as the EU AI Act does for high-risk systems — would close one of the most significant gaps in the current governance framework.

Transparency requirements. AI organisations currently disclose as much or as little as they choose about their systems’ capabilities and safety properties. Requiring meaningful transparency — about training data, about evaluation methodology, about known limitations and failure modes — would make governance more effective by giving regulators, researchers, and the public the information needed to assess safety claims.

Liability frameworks. The current liability framework for AI harms is unclear and inconsistent. Developing clear liability frameworks — specifying who is responsible for AI-caused harms and under what conditions — would create incentives for AI developers to invest in safety in proportion to the risks their systems pose.

These elements are not sufficient, and they are not free of difficult tradeoffs. Mandatory evaluation could give governments inappropriate power to suppress AI development for non-safety reasons. Transparency requirements could compromise legitimate commercial confidentiality. International institutions are slow to build and imperfect in their operation.

But the alternative — continuing to develop and deploy increasingly powerful AI systems without governance frameworks adequate to the risks — is not a stable equilibrium. The pause letter, for all that it failed to achieve its immediate goal, was right about the central insight: the governance must catch up with the technology, or the technology will outrun the governance with consequences that cannot be predicted and may not be correctable.

Important

The academic and policy literature is converging on four pillars of adequate AI governance:

  1. International institutions — with the authority to set development standards, conduct inspections, and enforce compliance (analogy: IAEA, NPT). Achievable but not easy, and US-China AI competition makes it harder.
  2. Mandatory evaluation — pre-deployment evaluation of frontier AI systems, not just voluntary cooperation. The EU AI Act does this for high-risk systems; extending it to frontier systems is the next step.
  3. Transparency requirements — about training data, evaluation methodology, known limitations and failure modes. Without information, regulators, researchers, and the public cannot assess safety claims.
  4. Liability frameworks — specifying who is responsible for AI-caused harms and under what conditions. Without liability, AI developers have insufficient incentive to invest in safety proportionate to risk.

None of these is sufficient alone; each carries difficult tradeoffs. But the alternative — continuing without governance adequate to the risks — is not a stable equilibrium.


The Enduring Question

The pause letter was a document of its moment — a response to the specific capabilities demonstrated by ChatGPT and GPT-4, written in the specific institutional context of early 2023, signed by a specific set of people with specific concerns. It will be remembered not as a successful policy intervention but as a marker of a moment: the moment when the concern about AI safety moved from the specialist to the mainstream, and when the absence of governance mechanisms became undeniable.

The underlying question it posed — whether humanity has the capacity to govern transformative technology before the technology governs us — is not resolved. It is being contested, daily, in the decisions of AI organisations about what to build and how to deploy it, in the legislative chambers where AI regulation is being developed, in the diplomatic channels where international AI governance is being negotiated, and in the courts where AI-related harms are being litigated.

The pause was not taken. The conversation that the letter helped to start is ongoing. The outcome of that conversation will determine, in significant measure, whether the most important technology in human history is a tool for human flourishing or something considerably more dangerous.

The stakes are exactly as high as the letter claimed. Whether the institutional response will be adequate is still being determined.

The underlying question the pause letter posed — whether humanity has the capacity to govern transformative technology before the technology governs us — is not resolved. The pause was not taken. The conversation that the letter helped to start is ongoing. The outcome will determine whether the most important technology in human history is a tool for human flourishing or something considerably more dangerous.


Further Reading

Further Reading
  • “Pause Giant AI Experiments: An Open Letter” (2023) — The letter itself, available at futureoflife.org. Read the full text rather than the summaries. The specific language is important.
  • “The AI Safety Institute” — UK Department for Science, Innovation and Technology — Documentation of the institution created as a direct response to the Bletchley Summit and the governance concerns the pause letter articulated.
  • “EU AI Act” — European Parliament documentation — The full text and official summaries of the EU AI Act, the most comprehensive AI regulatory framework yet enacted.
  • “The Bletchley Declaration” (2023) — The full text of the international declaration signed at the AI Safety Summit, available from the UK government.
  • “Governing Artificial Intelligence” by Amba Kak and Sarah Myers West — A comprehensive account of the governance landscape, the specific challenges of AI regulation, and the political dimensions of AI governance.

Event 20: The GPT-4 Moment: One Year That Changed Everything

The full story of the twelve months from March 2023 to March 2024 — the deployment of GPT-4, the competitive response, the new capabilities, the governance debates, and the extraordinary pace at which AI became the defining technology of the era. The year the future arrived.


Comments

Reply on Bluesky → (opens in a new tab)