What's to come?

Sept. 11 2014 7:42 AM

You Should Be Terrified of Superintelligent Machines

Not because they might develop the worst human characteristics, but because they’re nothing like humans at all.

140910_FT_Superintelligence — An AI need not care intrinsically about food, air, temperature, energy expenditure, occurrence or threat of bodily injury, disease, predation, sex, or progeny.

Adapted from Superintelligence: Paths, Dangers, Strategies by Nick Bostrom. Out now from Oxford University Press.

In the recent discussion over the risks of developing superintelligent machines—that is, machines with general intelligence greater than that of humans—two narratives have emerged. One side argues that if a machine ever achieved advanced intelligence, it would automatically know and care about human values and wouldn’t pose a threat to us. The opposing side argues that artificial intelligence would “want” to wipe humans out, either out of revenge or an intrinsic desire for survival.

As it turns out, both of these views are wrong. We have little reason to believe a superintelligence will necessarily share human values, and no reason to believe it would place intrinsic value on its own survival either. These arguments make the mistake of anthropomorphising artificial intelligence, projecting human emotions onto an entity that is fundamentally alien.

Let us first reflect for a moment on the vastness of the space of possible minds. In this abstract space, human minds form a tiny cluster. Consider two persons who seem extremely unlike, perhaps Hannah Arendt and Benny Hill. The personality differences between these two individuals may seem almost maximally large. But this is because our intuitions are calibrated on our experience, which samples from the existing human distribution (and to some extent from fictional personalities constructed by the human imagination for the enjoyment of the human imagination). If we zoom out and consider the space of all possible minds, however, we must conceive of these two personalities as virtual clones. Certainly in terms of neural architecture, Ms. Arendt and Mr. Hill are nearly identical. Imagine their brains lying side by side in quiet repose. You would readily recognize them as two of a kind. You might even be unable to tell which brain belonged to whom. If you looked more closely, studying the morphology of the two brains under a microscope, this impression of fundamental similarity would only be strengthened: You would see the same lamellar organization of the cortex, with the same brain areas, made up of the same types of neuron, soaking in the same bath of neurotransmitters.

Despite the fact that human psychology corresponds to a tiny spot in the space of possible minds, there is a common tendency to project human attributes onto a wide range of alien or artificial cognitive systems. Yudkowsky illustrates this point nicely:

Back in the era of pulp science fiction, magazine covers occasionally depicted a sentient monstrous alien—colloquially known as a bug-eyed monster (BEM)—carrying off an attractive human female in a torn dress. It would seem the artist believed that a non-humanoid alien, with a wholly different evolutionary history, would sexually desire human females. ... Probably the artist did not ask whether a giant bug perceives human females as attractive. Rather, a human female in a torn dress is sexy—inherently so, as an intrinsic property. They who made this mistake did not think about the insectoid’s mind: they focused on the woman’s torn dress. If the dress were not torn, the woman would be less sexy; the BEM does not enter into it.

An artificial intelligence can be far less humanlike in its motivations than a green scaly space alien. The extraterrestrial (let us assume) is a biological creature that has arisen through an evolutionary process and can therefore be expected to have the kinds of motivation typical of evolved creatures. It would not be hugely surprising, for example, to find that some random intelligent alien would have motives related to one or more items like food, air, temperature, energy expenditure, occurrence or threat of bodily injury, disease, predation, sex, or progeny. A member of an intelligent social species might also have motivations related to cooperation and competition: Like us, it might show in-group loyalty, resentment of free riders, perhaps even a vain concern with reputation and appearance.

An AI, by contrast, need not care intrinsically about any of those things. There is nothing paradoxical about an AI whose sole final goal is to count the grains of sand on Boracay, or to calculate the decimal expansion of pi, or to maximize the total number of paper clips that will exist in its future light cone. In fact, it would be easier to create an AI with simple goals like these than to build one that had a humanlike set of values and dispositions. Compare how easy it is to write a program that measures how many digits of pi have been calculated and stored in memory with how difficult it would be to create a program that reliably measures the degree of realization of some more meaningful goal—human flourishing, say, or global justice.

In this sense, intelligence and final goals are “orthogonal”; that is: more or less any level of intelligence could in principle be combined with more or less any final goal.

Nevertheless, there are some instrumental goals likely to be pursued by almost any intelligent agent, because there are some objectives that are useful intermediaries to the achievement of almost any final goal.

If an agent’s final goals concern the future, then in many scenarios there will be future actions it could perform to increase the probability of achieving its goals. This creates an instrumental reason for the agent to try to be around in the future—to help achieve its future-oriented goal.

Most humans seem to place some final value on their own survival. This is not a necessary feature of artificial agents: Some may be designed to place no final value whatever on their own survival. Nevertheless, many agents that do not care intrinsically about their own survival would, under a fairly wide range of conditions, care instrumentally about their own survival in order to accomplish their final goals.

Resource acquisition is another common emergent instrumental goal, for much the same reasons as technological perfection: Both technology and resources facilitate the achievement of final goals that require physical resources to be mobilized and arranged in particular patterns. Whether one desires a giant marble monument or an ecstatically happy intergalactic civilization, one needs materials and technology.

Human beings tend to seek to acquire resources sufficient to meet their basic biological needs. But people usually seek to acquire resources far beyond this minimum level. In doing so, they may be partially driven by minor biological conveniences (such as housing that offers slightly better temperature control or more comfortable furniture). A great deal of resource accumulation is motivated by social concerns—gaining status, mates, friends, and influence, through wealth accumulation and conspicuous consumption. Perhaps less commonly, some people seek additional resources to achieve altruistic ambitions or expensive non-social aims.

On the basis of such observations it might be tempting to suppose that a superintelligence not facing a competitive social world would see no instrumental reason to accumulate resources beyond some modest level, for instance whatever computational resources are needed to run its mind along with some virtual reality. Yet such a supposition would be entirely unwarranted.

First, the value of resources depends on the uses to which they can be put, which in turn depends on the available technology. With mature technology, basic resources such as time, space, matter, and free energy could be processed to serve almost any goal.

The orthogonality thesis suggests that we cannot blithely assume that a superintelligence will necessarily share any of the final values stereotypically associated with wisdom and intellectual development in humans—scientific curiosity, benevolent concern for others, spiritual enlightenment and contemplation, renunciation of material acquisitiveness, a taste for refined culture or for the simple pleasures in life, humility and selflessness, and so forth. We will consider later whether it might be possible through deliberate effort to construct a superintelligence that values such things, or to build one that values human welfare, moral goodness, or any other complex purpose its designers might want it to serve. But it is no less possible—and in fact technically a lot easier—to build a superintelligence that places final value on nothing but calculating the decimal expansion of pi. This suggests that—absent a special effort—the first superintelligence may have some such random or reductionistic final goal.

Third, the instrumental convergence thesis entails that we cannot blithely assume that a superintelligence with the final goal of calculating the decimals of pi (or making paper clips, or counting grains of sand) would limit its activities in such a way as not to infringe on human interests. An agent with such a final goal would have a convergent instrumental reason, in many situations, to acquire an unlimited amount of physical resources and, if possible, to eliminate potential threats to itself and its goal system. Human beings might constitute potential threats; they certainly constitute physical resources.

Taken together, these three points thus indicate that the first superintelligence may shape the future of Earth-originating life, could easily have non-anthropomorphic final goals, and would likely have instrumental reasons to pursue open-ended resource acquisition. If we now reflect that human beings consist of useful resources (such as conveniently located atoms) and that we depend for our survival and flourishing on many more local resources, we can see that the outcome could easily be one in which humanity quickly becomes extinct.

Adapted from Superintelligence: Paths, Dangers, Strategies by Nick Bostrom. Out now from Oxford University Press.

TODAY IN SLATE

Culturebox

The Ebola Story

How our minds build narratives out of disaster.

By Willa Paskin

The Budget Disaster That Completely Sabotaged the WHO’s Response to Ebola

By Joshua Keating

PowerPoint Is the Worst, and Now It’s the Latest Way to Hack Into Your Computer

By Lily Hay Newman

The Shooting Tragedies That Forged Canada’s Gun Politics

By Joshua Keating

A Highly Unscientific Ranking of Crazy-Old German Beers

Education

Welcome to 13th Grade!

Some high schools are offering a fifth year. That’s a great idea.

By Rebecca Schuman

THE SLATEST SEE ALL

Culturebox

The Actual World

“Mount Thoreau” and the naming of things in the wilderness.

By Kim Stanley Robinson

Want Kids to Delay Sex? Let Planned Parenthood Teach Them Sex Ed.

By Amanda Marcotte

Would You Trust Walmart to Provide Your Health Care? (You Should.)

By Alison Griswold

Three Ways Your Text Messages Change After You Get Married

How Should a Gentleman Mourn His Dog’s Death?

Protect Your Neighbors From the Threat of Gay Gremlins

Bad Astronomy

Oct. 23 2014 7:30 AM Our Solar System and Galaxy … Seen by an Astronaut Phil Plait

Dear Prudence

Oct. 23 2014 6:00 AM Monster Kids from poorer neighborhoods keep coming to trick-or-treat in mine. Do I have to give them candy? Emily Yoffe

Culturebox

Oct. 22 2014 11:54 PM The Actual World “Mount Thoreau” and the naming of things in the wilderness. Kim Stanley Robinson

Culturebox

Oct. 22 2014 11:52 PM The Ebola Story How our minds build narratives out of disaster. Willa Paskin

Brow Beat

Oct. 22 2014 10:39 PM Avengers: Age of Ultron Looks Like a Fun, Sprawling, and Extremely Satisfying Sequel Sharan Shetty

Politics

Oct. 22 2014 9:42 PM Landslide Landrieu Can the Louisiana Democrat use the powers of incumbency to save herself one more time? John Dickerson

Brow Beat

Oct. 22 2014 9:19 PM The Phone Call Is Twenty Minutes of Pitch-Perfect, Wrenching Cinema Sharan Shetty

The Slatest

Oct. 22 2014 8:20 PM Man Jumps White House Fence, Is Attacked By Secret Service Dogs and Arrested Elliot Hannon

The Slatest

Oct. 22 2014 7:46 PM Family of Dallas Nurse With Ebola Says She’s Now Virus-Free Elliot Hannon

The Gist

Oct. 22 2014 6:41 PM We Work Hard for Less Money In terms of “real wages,” we make less than our grandparents did. At least we have air conditioning. Mike Pesca

The Slatest

Oct. 22 2014 6:36 PM Reaganomics Tax Experiment Still Going Poorly in Kansas Ben Mathis-Lilley

The World

Oct. 22 2014 6:30 PM The Tragedies That Have Shaped Canada's Gun Politics Joshua Keating

The Slatest

Oct. 22 2014 6:10 PM Gunman Killed Inside Canadian Parliament; Soldier Shot at National Monument Dies Ben Mathis-Lilley and Elliot Hannon

Gentleman Scholar

Oct. 22 2014 5:54 PM May I Offer to Sharpen My Friends’ Knives? Or would that be rude? Troy Patterson

Future Tense

Oct. 22 2014 5:33 PM One More Reason Not to Use PowerPoint: It’s The Gateway for a Serious Windows Vulnerability Lily Hay Newman

The Slatest

Oct. 22 2014 4:54 PM CDC Announces Monitoring of All Travelers to U.S. From Ebola-Stricken Countries Laura Bradley

Education

Oct. 22 2014 4:45 PM Welcome to 13th Grade! Several Oregon high schools are offering a fifth year of high school. Every district should consider it. Rebecca Schuman

News & Politics

Politics

Oct. 22 2014 9:42 PM Landslide Landrieu Can the Louisiana Democrat use the powers of incumbency to save herself one more time? John Dickerson

The Tragedies That Have Shaped Canada's Gun Politics

Paul Farmer Says Up to Ninety Percent of Those Infected Should Survive Ebola. Is He Right?

Why Has the World Health Organization Done Such a Bad Job Responding to the Ebola Crisis?

Why Anyone Old Enough to Vote Should Be Able to Run for President

The Irritating Confidante: Ben Bradlee’s Fascinating Relationship With John F. Kennedy

Why Countries Make Human Rights Pledges They Have No Intention of Honoring

Business

Continuously Operating

Oct. 22 2014 2:38 PM Crack Open an Old One A highly unscientific evaluation of Germany’s oldest breweries. L.V. Anderson

Facebook Made $595 Million in the U.K. Last Year. It Paid $0 in Taxes.

Should You Trust Walmart with Your Health Care?

Is McDonald’s Planning to Replace Its Cashiers With Computer Screens?

Soda and Fries Have Lost Their Charm for Both Consumers and Investors

Walmart Is Killing the Rest of Corporate America in Solar Power Adoption

The Global Millionaires Club Is Booming and Losing Its Exclusivity

Life

Dear Prudence

Oct. 23 2014 6:00 AM Monster Kids from poorer neighborhoods keep coming to trick-or-treat in mine. Do I have to give them candy? Emily Yoffe

Is It Rude to Offer to Sharpen Your Friends’ Knives for Them?

In Praise of 13th Grade

How to Protect Your Neighbors From the Gay Gremlin Threat

The Surprisingly Xenophobic Origins of Wonder Bread

Do Gay Men "Have More Fun" Than Straight Men? (Video.)

At This Beach You Can Sunbathe in the Shadow of a Fascist Youth Camp

Double X

The XX Factor

Oct. 22 2014 4:27 PM Three Ways Your Text Messages Change After You Get Married Melissa Dahl

Want Kids to Delay Sex? Let Planned Parenthood Teach Them Sex Ed.

On the Internet, Men Are Called Names. Women Are Stalked and Sexually Harassed.

Renée Zellweger’s New Face Is Too Real

George Tiller’s Murderer Threatens Another Abortion Provider, Claims Right of Free Speech

I Am 25. I Don’t Work at Facebook. My Doctors Want Me to Freeze My Eggs.

Women Are Still Losing Jobs for Getting Pregnant

Slate Plus

Tv Club

Oct. 22 2014 5:27 PM The Slate Walking Dead Podcast A spoiler-filled discussion of Episodes 1 and 2. Mike Vuolo and Chris Wade

What David Plotz Learned From His Podcast Series on Working

Where Are Slate Plus Members From? This Weird Cartogram Explains.

Doctor Who Podcast: Phil Plait and Gabriel Roth Recap Episode 9

Senior Editor David Haglund’s Recommended Reading From Slate This Week

Arts

Culturebox

Oct. 22 2014 11:54 PM The Actual World “Mount Thoreau” and the naming of things in the wilderness. Kim Stanley Robinson

Ebola Resists Narrative. So Should We.

Here Is the Epic, Intense Trailer for Avengers: Age of Ultron

Watch The Phone Call, a Short Film Starring Sally Hawkins That May Win an Oscar

Skinny Mark Wahlberg Goes for an Oscar: Watch the First Trailer for The Gambler

Writer William Giraldi Wishes Everyone Would Please Stop Likening Him to a Literary Genius

Watch the Eerie, Gorgeous Trailer for Joshua Oppenheimer’s Follow-up to The Act of Killing

Technology

Future Tense

Oct. 22 2014 5:33 PM One More Reason Not to Use PowerPoint: It’s The Gateway for a Serious Windows Vulnerability Lily Hay Newman

Netizen Report: Twitter Users Under Fire in Mexico, Venezuela, Turkey

The App That Will Make Algebra Classes Seem Even More Pointless

A Seal of Approval to Stop Online Weather Poseurs

Don’t Give Up on Apple TV Yet

This Synthesizer Makes Music Out of the Weather

We Need to Make Learning More Dangerous

Health & Science

Bad Astronomy

Oct. 23 2014 7:30 AM Our Solar System and Galaxy … Seen by an Astronaut Phil Plait

Researchers Recorded Orcas in the Wild Via Drone for the First Time Ever

Feisty Sunspot Is Feisty. Very Very Much So.

Where Does Ebola Hide Between Outbreaks?

People Who Say “I’m Not a Scientist” Shouldn’t Be Politicians, Either

Monster Sunspot Will Make Thursday’s Eclipse That Much Cooler

Watch This Confused Rhino Act Like Its Best Friend, an Adorable Lamb

Sports

Sports Nut

Oct. 20 2014 5:09 PM Keepaway, on Three. Ready—Break! On his record-breaking touchdown pass, Peyton Manning couldn’t even leave the celebration to chance. Chris Wade and A.J. McCarthy

The 2014 Kansas City Royals Show the Value of Building a Mediocre Baseball Team

How Relief Pitching Has Changed Baseball

Simone Biles Is the Best Gymnast in the World, and It’s Not Even Close

Baseball’s Strike Zone Is Getting Bigger Every Year. Will Someone Shrink It Back Down to Size?

Happy Baseball Fans Stormed the Field As Far Back As the 1920s. Why’d They Stop? (Video.)

A New U.S. Map Based on Where Baseball Players Were Born