Super Prompt: Generative AI

LLM Training: Superman's Kryptonite-Proof Suit

Tony Wan | Season 1, Episode 13

Length: 18:56

Why isn't Superman's suit Kryptonite-proof? The question turns out to be a useful way into how large language models are trained. In this solo episode on LLM architecture, we break down transformers (the T in GPT), self-attention mechanisms, and the inference process, using Superman to explain why GPT-3 can generate coherent answers to questions it has never seen before.

To stay in touch, sign up for our newsletter at https://www.superprompt.fm