Building Anthropic: Safety-First AI at the Frontier

Ben Mann

Stanford CS153: Technology Entrepreneurship — Infra @ Scale (Winter 2025) · Day 9 · Jordan Hall 420-040

In this insightful talk from CS153 Infra @ Scale 2025, Ben Mann, Co-Founder of Anthropic, provides a comprehensive look into the formidable engineering and safety challenges inherent in building state-of-the-art AI systems at an unprecedented scale. Mann, a key figure in the development of GPT-3 at OpenAI before co-founding Anthropic, shares his unique perspective on the journey from early AI research to the current frontier of large language models, emphasizing the critical role of safety as a foundational principle. The discussion delves into the technical breakthroughs that have enabled explosive growth in AI capabilities and the complex infrastructure required to support it.

AI review

Ben Mann is clearly a credible person with real experience, and there are interesting ideas buried in here — Constitutional AI, elicitation overhang, mechanistic interpretability. But the talk as described is a founder retrospective, not an engineering session. The article reads like a press release that learned to use bullet points. There's no implementation depth, no reproducible methodology, no code, and no framework an engineer could actually act on. What's here is mostly: 'we take safety seriously, scaling laws are real, RLAIF is better than RLHF.' That's a LinkedIn post, not a…

Watch on YouTube