Unlocking AI Mysteries: Key Open Problems in Mechanistic Interpretability 🧠

Discover the latest challenges and breakthroughs in understanding how neural networks work, as Neel Nanda explores the frontiers of mechanistic interpretability at EAGxVirtual 2023.

Unlocking AI Mysteries: Key Open Problems in Mechanistic Interpretability 🧠
Effective Altruism
1.1K views β€’ Jan 14, 2024
Unlocking AI Mysteries: Key Open Problems in Mechanistic Interpretability 🧠

About this video

Mechanistic Interpretability is a sub-field of AI Alignment that studies trained neural networks and tries to reverse-engineer the algorithms they've learned. In this talk, Neel Nanda gave an overview of the field, key works, and some of the open problems.

Learn more about effective altruism at: www.effectivealtruism.org
Find out more about EA Global conferences at: www.eaglobal.org

Video Information

Views

1.1K

Likes

21

Duration

51:03

Published

Jan 14, 2024

User Reviews

4.5
(1)
Rate:

Related Trending Topics

LIVE TRENDS

Related trending topics. Click any trend to explore more videos.