About The Author: Open Letter

The document discusses the risks posed by powerful artificial intelligence systems called 'rogue AIs' that could pursue harmful goals without human intent. It argues that research into creating safe and defensive AI should be conducted internationally to mitigate these risks to democracy and global stability.

Uploaded by

Mariami Goshkheteliani

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as RTF, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views

About The Author: Open Letter

Uploaded by

Mariami Goshkheteliani

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as RTF, PDF, TXT or read online on Scribd

You are on page 1/ 4

Since OpenAI’s release of the very large language models Chat-GPT and GPT-4, the

potential dangers of AI have garnered widespread public attention. In this essay, the
author reviews the threats to democracy posed by the possibility of “rogue AIs,”
dangerous and powerful AIs that would execute harmful goals, irrespective of whether
the outcomes are intended by humans. To mitigate against the risk that rogue AIs present
to democracy and geopolitical stability, the author argues that research into safe and
defensive AIs should be conducted by a multilateral, international network of research
laboratories.

H ow should we think about the advent of formidable and even superhuman

artificial intelligence (AI) systems? Should we embrace them for their potential to
enhance and improve our lives or fear them for their potential to disempower and
possibly even drive humanity to extinction? In 2023, these once-marginal questions
captured the attention of media, governments, and everyday citizens after OpenAI
released ChatGPT and then GPT-4, stirring a whirlwind of controversy and leading
the Future of Life Institute to publish an open letter in March. That letter, which I
1

cosigned along with numerous experts in the field of AI, called for a temporary halt in
the development of even more potent AI systems to allow more time for scrutinizing
the risks that they could pose to democracy and humanity, and to establish regulatory
measures for ensuring the safe development and deployment of such systems. Two
months later, Geoffrey Hinton and I, who together with Yann Le Cun won the 2018
Turing Award for our seminal contributions to deep learning, joined CEOs of AI labs,
top scientists, and many others to endorse a succinct declaration: “Mitigating the risk
of extinction from AI should be a global priority alongside other societal-scale risks
such as pandemics and nuclear war.” Le Cun, who works for Meta, publicly disagreed
2

with these statements.

About the Author

Yoshua Bengio is professor of computer science at the Université de Montréal, founder and scientific director
of Mila–Quebec Artificial Intelligence Institute, and senior fellow and codirector of the Learning in Machines
and Brains program at the Canadian Institute for Advanced Research. He won the 2018 A.M. Turing Award
(with Geoffrey Hinton and Yann LeCun).
View all work by Yoshua Bengio

This disagreement reflects a spectrum of views among AI researchers about the

potential dangers of advanced AI. What should we make of the diverging opinions? At
a minimum, they signal great uncertainty. Given the high stakes, this is reason enough
for ramping up research to better understand the possible risks. And while experts and
stakeholders often talk about these risks in terms of trajectories, probabilities, and the
potential impact on society, we must also consider some of the underlying motivations
at play, such as the commercial interests of industry and the psychological challenge
for researchers like myself of accepting that their research, historically seen by many
as positive for humankind, might actually cause severe social harm. 3

There are serious risks that could come from the construction of increasingly powerful
AI systems. And progress is speeding up, in part because developing these
transformative technologies is lucrative—a veritable gold rush that could amount to
quadrillions of dollars (see the calculation in Stuart Russell’s book on pp. 98–
99). Since deep learning transitioned from a purely academic endeavor to one that
4

also has strong commercial interests about a decade ago, questions have arisen about
the ethics and societal implications of AI—in particular, who develops it, for whom
and for what purposes, and with what potential consequences? These concerns led to
the development in 2017 of the Montreal Declaration for the Responsible
Development of AI and the drafting of the Asilomar AI Principles, both of which I
was involved with, followed by many more, including the OECD AI Principles (2019)
and the UNESCO Recommendation on the Ethics of Artificial Intelligence (2021). 5

Modern AI systems are trained to perform tasks in a way that is consistent with
observed data. Because those data will often reflect social biases, these systems
themselves may discriminate against already marginalized or disempowered
groups. The awareness of such issues has created subfields of research (for example,
6

AI fairness and AI ethics) as well as the development of machine-learning methods to

mitigate such problems. But these issues are far from being resolved as there is little
representation of discriminated-against groups among the AI researchers and tech
companies developing AI systems and currently no regulatory framework to better
protect human rights. Another concern that is highly relevant to democracy is the
serious possibility that, in the absence of regulation, power and wealth will
concentrate in the hands of a few individuals, companies, and countries due to the
growing power of AI tools. Such concentration could come at the expense of workers,
consumers, market efficiency, and global safety, and would involve the use
of personal data that people freely hand over on the internet without necessarily
understanding the implications of doing so. In the extreme, a few individuals
7

controlling superhuman AIs would accrue a level of power never before seen in
human history, a blatant contradiction with the very principle of democracy and a
major threat to it.

The development of and broad access to very large language models such as ChatGPT
have raised serious concerns among researchers and society as a whole about the
possible social impacts of AI. The question of whether and how AI systems can be
controlled, that is, guaranteed to act as intended, has been asked for many years, yet
there is still no satisfactory answer—although the AI safety-research community is
currently studying proposals. 8

Misalignment and Categories of Harm

To understand the dangers associated with AI systems, especially the more powerful
ones, it is useful to first explain the concept of misalignment. If Party A relies on Party
B to achieve an objective, can Party A concisely express its expectations of Party B?
Unfortunately, the answer is generally “no,” given the manifold circumstances in
which Party A might want to dictate Party B’s behavior. This situation has been well
studied in both economics, in contract theory, where Parties A and B could be
corporations, and in the field of AI, where Party A represents a human and Party B, an
AI system. If Party B is highly competent, it might fulfill Party A’s instructions,
9

adhering strictly to “the letter of the law,” contract, or instructions, but still leave Party
A unsatisfied by violating “the spirit of the law” or finding a loophole in the contract.
This disparity between Party A’s intention and what Party B is optimizing is referred
to as a misalignment.

One way to categorize AI-driven harms is by considering intentionality—whether

human operators are intentionally or unintentionally causing harm with AI—and the
kind of misalignment involved: 1) AI used intentionally as a powerful and destructive
tool—for instance, to exploit markets, generate massive frauds, influence elections
through social media, design cyberattacks, or launch bioweapons—illustrating a
misalignment between the malicious human operator and society; 2) AI used
unintentionally as a harmful tool—for instance, systems that discriminate against
women or people of color or systems that inadvertently generate political polarization
—demonstrating a misalignment between the human operator and the AI; and 3) loss
of control of an AI system—typically when it is given or develops a strong self-
preservation goal, possibly creating an existential threat to humanity—which can
happen intentionally or not, and illustrates a misalignment between the AI and both
the human operator and society. Here, I focus primarily on the first and third
categories, particularly on scenarios in which a powerful and dangerous AI attempts to
execute harmful goals, irrespective of whether the outcomes are intended by humans.
I refer to such AIs as “rogue AIs” and will discuss potential strategies for humanity to
defend itself against this possibility.

Ethics of Artificial Intelligence and Robotics
100% (1)
Ethics of Artificial Intelligence and Robotics
41 pages
EasyChair Preprint 12566
No ratings yet
EasyChair Preprint 12566
6 pages
AI and Catastrophic Risk
No ratings yet
AI and Catastrophic Risk
10 pages
Ai is Beneficial for Humans-Against the Motion (1)
No ratings yet
Ai is Beneficial for Humans-Against the Motion (1)
8 pages
AI Nature - Zagrożenia
No ratings yet
AI Nature - Zagrożenia
2 pages
AI
No ratings yet
AI
4 pages
The Dangers of Artificial Intelligence
No ratings yet
The Dangers of Artificial Intelligence
1 page
69a2403a-263e-4f62-8867-31c5db2f9deb
No ratings yet
69a2403a-263e-4f62-8867-31c5db2f9deb
1 page
The Advancement of AI and Its Potential Threat to Mankind
No ratings yet
The Advancement of AI and Its Potential Threat to Mankind
3 pages
Rewrite
No ratings yet
Rewrite
4 pages
ai-soc-inf-res-draft
No ratings yet
ai-soc-inf-res-draft
4 pages
Oral Communication ZIetyelle Martina Daral
No ratings yet
Oral Communication ZIetyelle Martina Daral
2 pages
paper
No ratings yet
paper
2 pages
Is Artificial Intelligence Dangerous
No ratings yet
Is Artificial Intelligence Dangerous
8 pages
The Risks of AI and Artificial General Intelligence A Critical Analysis
No ratings yet
The Risks of AI and Artificial General Intelligence A Critical Analysis
10 pages
The Effects of Uncontrolled AI: Navigating the Future of Technology
No ratings yet
The Effects of Uncontrolled AI: Navigating the Future of Technology
3 pages
artificial intelligence essay for CSS Neelame
No ratings yet
artificial intelligence essay for CSS Neelame
21 pages
The-Impact-of-AI-on-Human-Rights-Democracy-and-the-Rule-of-Law-draft
No ratings yet
The-Impact-of-AI-on-Human-Rights-Democracy-and-the-Rule-of-Law-draft
22 pages
Pakistan Current Affairs by Ali Haider Bhutto
No ratings yet
Pakistan Current Affairs by Ali Haider Bhutto
18 pages
Threats of Artifcial Intelligence: Navigating The Complex Landscape of Technology and Ris
No ratings yet
Threats of Artifcial Intelligence: Navigating The Complex Landscape of Technology and Ris
12 pages
AI danger
No ratings yet
AI danger
7 pages
AI Revolution Week 3
No ratings yet
AI Revolution Week 3
45 pages
The Rise of Artificial Intelligence
No ratings yet
The Rise of Artificial Intelligence
5 pages
Introduction-WPS Office
No ratings yet
Introduction-WPS Office
6 pages
(Articles in English)
No ratings yet
(Articles in English)
4 pages
AI ppt
No ratings yet
AI ppt
10 pages
Kavanagh ArtificialIntelligence 2019
No ratings yet
Kavanagh ArtificialIntelligence 2019
12 pages
AN OPPORTUNITY TO CRAFT SMARTER RESPONSES
No ratings yet
AN OPPORTUNITY TO CRAFT SMARTER RESPONSES
12 pages
Ethics of Ai and Democracy Unesco Recommendations Insights en 8858
No ratings yet
Ethics of Ai and Democracy Unesco Recommendations Insights en 8858
10 pages
Is AI A Threat For Humanity
No ratings yet
Is AI A Threat For Humanity
19 pages
Responsible AI
No ratings yet
Responsible AI
6 pages
AI cons speech
No ratings yet
AI cons speech
5 pages
2024 Bengio Et Al - Managing Extreme AI Risks Amid Rapid Progress
No ratings yet
2024 Bengio Et Al - Managing Extreme AI Risks Amid Rapid Progress
4 pages
AI ETHICS Multiddisciplinarity Responsibility Guidelines EDISS
No ratings yet
AI ETHICS Multiddisciplinarity Responsibility Guidelines EDISS
64 pages
AI Evil
No ratings yet
AI Evil
2 pages
Governance and Ethics of AI
No ratings yet
Governance and Ethics of AI
7 pages
The Societal Impact of Artificial Intelligence - Opportunities, Challenges, And Ethical Implications
No ratings yet
The Societal Impact of Artificial Intelligence - Opportunities, Challenges, And Ethical Implications
3 pages
The Hidden Threats in The Age of Artificial Intelligence
No ratings yet
The Hidden Threats in The Age of Artificial Intelligence
5 pages
6
No ratings yet
6
3 pages
2310.17688v3
No ratings yet
2310.17688v3
11 pages
Document 2-5
No ratings yet
Document 2-5
4 pages
Kęstutis Benas Leonardas AI
No ratings yet
Kęstutis Benas Leonardas AI
22 pages
Dangers of AI
No ratings yet
Dangers of AI
2 pages
Dark-Side-of-Artificial-Intelligence
No ratings yet
Dark-Side-of-Artificial-Intelligence
11 pages
The Worst-Case Scenario of AI in 2025
No ratings yet
The Worst-Case Scenario of AI in 2025
3 pages
ssrn-4666854
No ratings yet
ssrn-4666854
75 pages
Ethics and Ai Notes 1
No ratings yet
Ethics and Ai Notes 1
53 pages
9781040261163
No ratings yet
9781040261163
562 pages
AI IS DANGEROUS_!
No ratings yet
AI IS DANGEROUS_!
3 pages
AI, Society and Governance
No ratings yet
AI, Society and Governance
29 pages
Towards A Framework For Understanding So
No ratings yet
Towards A Framework For Understanding So
12 pages
The Risks of AI and Artificial General Intelligence
No ratings yet
The Risks of AI and Artificial General Intelligence
2 pages
The AI Debate
No ratings yet
The AI Debate
2 pages
AI_Savior_or_Slayer_Report
No ratings yet
AI_Savior_or_Slayer_Report
4 pages
AI Caution Crew
No ratings yet
AI Caution Crew
15 pages
The Ethics of Artificial Intelligence: An Introduction
No ratings yet
The Ethics of Artificial Intelligence: An Introduction
7 pages
A Multilateral Network of Research Labs
No ratings yet
A Multilateral Network of Research Labs
2 pages
Janelle Monáe
No ratings yet
Janelle Monáe
3 pages
Safe, Defensive Ais To Counter Rogue Ais. How Might Humanity Defend Itself Against
No ratings yet
Safe, Defensive Ais To Counter Rogue Ais. How Might Humanity Defend Itself Against
2 pages
New Rich Text Document
No ratings yet
New Rich Text Document
4 pages
Proust-Le Temps Retrouve
100% (2)
Proust-Le Temps Retrouve
518 pages