Future of Life Institute Podcast podcast

Dan Hendrycks on Catastrophic AI Risks

11/3/2023

Future of Life Institute Podcast

0:00

2:07:24

Dan Hendrycks joins the podcast again to discuss X.ai, how AI risk thinking has evolved, malicious use of AI, AI race dynamics between companies and between militaries, making AI organizations safer, and how representation engineering could help us understand AI traits like deception. You can learn more about Dan's work at https://www.safe.ai Timestamps: 00:00 X.ai - Elon Musk's new AI venture 02:41 How AI risk thinking has evolved 12:58 AI bioengeneering 19:16 AI agents 24:55 Preventing autocracy 34:11 AI race - corporations and militaries 48:04 Bulletproofing AI organizations 1:07:51 Open-source models 1:15:35 Dan's textbook on AI safety 1:22:58 Rogue AI 1:28:09 LLMs and value specification 1:33:14 AI goal drift 1:41:10 Power-seeking AI 1:52:07 AI deception 1:57:53 Representation engineering

More episodes from "Future of Life Institute Podcast"

More Episodes

Get the whole world of podcasts with the free GetPodcast app.

Subscribe to your favorite podcasts, listen to episodes offline and get thrilling recommendations.

Dan Hendrycks on Catastrophic AI Risks

Future of Life Institute Podcast

More episodes from "Future of Life Institute Podcast"

Dan Faggella on the Race to AGI

Liron Shapira on Superintelligence Goals

Annie Jacobsen on Nuclear War - a Second by Second Timeline

Katja Grace on the Largest Survey of AI Researchers

Holly Elmore on Pausing AI, Hardware Overhang, Safety Research, and Protesting

Sneha Revanur on the Social Effects of AI

Roman Yampolskiy on Shoggoth, Scaling Laws, and Evidence for AI being Uncontrollable

Special: Flo Crivello on AI as a New Form of Life

Carl Robichaud on Preventing Nuclear War

Frank Sauer on Autonomous Weapon Systems