#184 – Zvi Mowshowitz on sleeping on sleeper agents, and the biggest AI updates since ChatGPT

Many of you will have heard of Zvi Mowshowitz as a superhuman information-absorbing-and-processing machine — which he definitely is. As the author of the Substack Don’t Worry About the Vase, Zvi has spent as much time as literally anyone in the world over the last two years tracking in detail how the explosion of AI has been playing out — and he has strong opinions about almost every aspect of it.

Links to learn more, summary, and full transcript.

In today’s episode, host Rob Wiblin asks Zvi for his takes on:

  • US-China negotiations
  • Whether AI progress has stalled
  • The biggest wins and losses for alignment in 2023
  • EU and White House AI regulations
  • Which major AI lab has the best safety strategy
  • The pros and cons of the Pause AI movement
  • Recent breakthroughs in capabilities
  • In what situations it’s morally acceptable to work at AI labs

Whether you agree or disagree with his views, Zvi is super informed and brimming with concrete details.


Zvi and Rob also talk about:

  • The risk of AI labs fooling themselves into believing their alignment plans are working when they may not be.
  • The “sleeper agent” issue uncovered in a recent Anthropic paper, and how it shows just how hard alignment actually is.
  • Why Zvi disagrees with 80,000 Hours’ advice about gaining career capital to have a positive impact.
  • Zvi’s project to identify the most strikingly horrible and neglected policy failures in the US, and how he founded a new think tank (Balsa Research) to develop innovative solutions for overturning the terrible status quo in areas like domestic shipping, environmental reviews, and housing supply.
  • Why Zvi thinks that improving people’s prosperity and housing can make them care more about existential risks like AI.
  • An idea from the online rationality community that Zvi thinks is really underrated and more people should have heard of: simulacra levels.
  • And plenty more.

Chapters:

  • Zvi’s AI-related worldview (00:03:41)
  • Sleeper agents (00:05:55)
  • Safety plans of the three major labs (00:21:47)
  • Misalignment vs misuse vs structural issues (00:50:00)
  • Should concerned people work at AI labs? (00:55:45)
  • Pause AI campaign (01:30:16)
  • Has progress on useful AI products stalled? (01:38:03)
  • White House executive order and US politics (01:42:09)
  • Reasons for AI policy optimism (01:56:38)
  • Zvi’s day-to-day (02:09:47)
  • Big wins and losses on safety and alignment in 2023 (02:12:29)
  • Other unappreciated technical breakthroughs (02:17:54)
  • Concrete things we can do to mitigate risks (02:31:19)
  • Balsa Research and the Jones Act (02:34:40)
  • The National Environmental Policy Act (02:50:36)
  • Housing policy (02:59:59)
  • Underrated rationalist worldviews (03:16:22)

Producer and editor: Keiran Harris
Audio Engineering Lead: Ben Cordell
Technical editing: Simon Monsour, Milo McGuire, and Dominic Armstrong
Transcriptions and additional content editing: Katy Moore

Episodes (333)

'95% of AI Pilots Fail': The hidden agenda behind the viral stat that misled millions

You might have heard that '95% of corporate AI pilots' are failing. It was one of the most widely cited AI statistics of 2025, parroted by media outlets everywhere. It helped trigger a Nasdaq selloff ...

28 Apr 10min

#242 – Will MacAskill on how we survive the 'intelligence explosion,' AI character, and the case for 'viatopia'

Hundreds of millions already turn to AI on the most personal of topics — therapy, political opinions, and how to treat others. And as AI takes over more of the economy, the character of these systems ...

22 Apr 3h 9min

Risks from power-seeking AI systems (article narration by Zershaaneh Qureshi)

Hundreds of prominent AI scientists and other notable figures signed a statement in 2023 saying that mitigating the risk of extinction from AI should be a global priority. At 80,000 Hours, we’ve consi...

16 Apr 1h 29min

How scary is Claude Mythos? 303 pages in 21 minutes

With Claude Mythos we have an AI that knows when it's being tested, can obscure its reasoning when it wants, and is better at breaking into (and out of) computers than any human alive. Rob Wiblin work...

10 Apr 21min

Village gossip, pesticide bans, and gene drives: 17 experts on the future of global health

What does it really take to lift millions out of poverty and prevent needless deaths? In this special compilation episode, 17 past guests — including economists, nonprofit founders, and policy advisors...

7 Apr 4h 6min

What everyone is missing about Anthropic vs the Pentagon. And: The Meta leaks are worse than you think.

When the Pentagon tried to strong-arm Anthropic into dropping its ban on AI-only kill decisions and mass domestic surveillance, the company refused. Its critics went on the attack: Anthropic and its s...

3 Apr 20min

#241 – Richard Moulange on how AI now codes viable genomes from scratch and outperforms virologists at lab work — what could go wrong?

Last September, scientists used an AI model to design genomes for entirely new bacteriophages (viruses that infect bacteria). They then built them in a lab. Many were viable. And despite being entirel...

31 Mar 3h 7min

#240 – Samuel Charap on how a Ukraine ceasefire could accidentally set Europe up for a bigger war

Many people believe a ceasefire in Ukraine will leave Europe safer. But today's guest lays out how a deal could potentially generate insidious new risks — leaving us in a situation that's equally dang...

24 Mar 1h 12min
