A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic Interpretability. I'm joined by my co-author Lawrence Chan. In this part, we give an overview of the paper and discuss the key takeaways.
Part 2: • A Walkthrough of Progr...
Part 3: • A Walkthrough of Progr...
If you want to learn more about mechanistic interpretability, check out neelnanda.io/getting-started
Our paper: arxiv.org/abs/2301.05217
Original grokking paper: arxiv.org/abs/2201.02177
AdamW: pytorch.org/docs/stable/gener...
Walkthrough of toy models of superposition: • A Walkthrough of Toy M...
Danny Hernandez paper on scaling laws for repeated data: arxiv.org/abs/2205.10487
Jermyn & Schlegeris on S-Shaped Curves: www.alignmentforum.org/posts/...
Unifying Grokking and Double Descent: arxiv.org/abs/2303.06173
Omnigrok: arxiv.org/abs/2210.01117
0:00 - Intro
0:50 - What is grokking?
9:53 - Mechanistic interpretability
11:47 - Paper overview, modular addition algorithm
15:08 - Progress measures
21:41 - Why this work is bullshit
29:30 - Predicting when it will grok?
33:45 - Why does grokking happen?
40:27 - Lottery ticket hypothesis
42:43 - Conclusion

