Wednesday Feb 22, 2023

Episode 8.52: Attention is all you need: how tokens find matches that will come next.

The “Attention is All You Need” paper lies behind Andrew Karpathy’s excellent YouTube video “Let’s build GPT: from scratch, in code, spelled out”. We discuss some implications.

Comment (0)

No comments yet. Be the first to say something!

Copyright 2025 All Rights Reserved

Podcast Powered By Podbean

Version: 20241125