Dwarkesh Podcast Episode Summary
Title: Jeff Dean & Noam Shazeer – 25 Years at Google: From PageRank to AGI
Host: Dwarkesh Patel
Release Date: February 12, 2025
Guests: Jeff Dean (Google’s Chief Scientist) & Noam Shazeer (Co-lead of Gemini at Google DeepMind)
1. Introduction
In this engaging episode of the Dwarkesh Podcast, host Dwarkesh Patel welcomes two of Google's most influential figures: Jeff Dean and Noam Shazeer. Both have dedicated over a quarter-century to Google, contributing to groundbreaking projects that have shaped modern computing and artificial intelligence. They currently co-lead the Gemini project at Google DeepMind, pushing the boundaries toward Artificial General Intelligence (AGI).
2. Journey at Google: Growth and Evolution
Key Discussion Points:
- Understanding Company Growth: Jeff and Noam reflect on their 25-year tenure at Google, discussing how the company's expansion transformed their roles and interactions.
Notable Quotes:
- Jeff Dean [01:06]: "When I joined, we were 25 people, 26 people... at some point then you kind of lose track of everyone's name of the company."
- Noam Shazeer [02:13]: "Usually it's a very good surprise. Like you're like, wow, Project Platypus? Like, I had no idea we were doing that."
Insights:
- Both guests highlight the shift from a small team where everyone knew each other to a vast organization with myriad projects, necessitating a robust internal network and high-level awareness of ongoing initiatives.
3. Recruitment Stories: Joining Google
Key Discussion Points:
- Jeff's Recruitment: Jeff reached out to Google himself.
- Noam's Recruitment: Initially hesitant after encountering Google at a job fair, Noam sent in a resume on a whim in 2000 because it was already his favorite search engine.
Notable Quotes:
- Jeff Dean [02:42]: "I kind of reached out to them, actually."
- Noam Shazeer [04:50]: "I sent them a resume on a whim in 2000 because I figured it was my favorite search engine."
Insights:
- Both Jeff and Noam emphasize proactive approaches to joining Google, driven by their passion for the company's mission to organize the world's information.
4. Evolution of Computational Hardware and Moore's Law
Key Discussion Points:
- Impact of Moore's Law: Discussion on how the pace of hardware advancement has influenced system design and project feasibility.
- Shift to Specialized Hardware: Transition from general-purpose CPUs to machine learning accelerators like TPUs.
Notable Quotes:
- Jeff Dean [05:47]: "The fabrication processes improvements are now taking three years instead of every two years."
- Noam Shazeer [07:35]: "Pretty much all of deep learning has taken off roughly because arithmetic is very, very cheap and moving data around is comparatively much more expensive."
Insights:
- The guests explain that while traditional CPU advancements have slowed, the rise of specialized hardware like TPUs has enabled more efficient and powerful machine learning computations, facilitating the training of large-scale models.
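Noam's point that arithmetic is cheap while data movement is expensive is often quantified as arithmetic intensity (FLOPs per byte moved). The sketch below uses illustrative numbers and a simplifying assumption (each matrix crosses memory exactly once); it is not a model of any specific chip, but it shows why large matrix multiplies suit accelerators while batch-1 inference tends to be memory-bound.

```python
# Back-of-envelope arithmetic-intensity estimate for a matrix multiply.
# All numbers here are illustrative assumptions, not specs of a real chip.

def matmul_intensity(m, n, k, bytes_per_elem=2):
    """FLOPs per byte moved for an (m x k) @ (k x n) matmul,
    assuming each operand and the result cross memory exactly once."""
    flops = 2 * m * n * k  # one multiply + one add per output term
    bytes_moved = bytes_per_elem * (m * k + k * n + m * n)
    return flops / bytes_moved

# A large square matmul is compute-bound: intensity grows with size.
print(round(matmul_intensity(4096, 4096, 4096)))   # -> 1365 FLOPs/byte

# A batch-1 matrix-vector product (m=1) stays memory-bound.
print(round(matmul_intensity(1, 4096, 4096), 2))   # -> 1.0 FLOPs/byte
```

When intensity falls below the hardware's compute-to-bandwidth ratio, the chip waits on memory rather than doing math, which is one motivation for the specialized memory systems in accelerators like TPUs.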
5. Early AI and Neural Networks at Google
Key Discussion Points:
- Jeff’s Undergraduate Thesis: Implementation of parallel backpropagation for neural networks.
- 2007 Translation Model: Building an n-gram language model for machine translation trained on roughly 2 trillion tokens.
Notable Quotes:
- Jeff Dean [13:09]: "I implemented model parallelism and data parallelism on a 32 processor hypercube machine."
- Noam Shazeer [19:07]: "The spelling correction system he built in 2001 was amazing."
Insights:
- Jeff and Noam recount their early ventures into neural networks and language models, laying the groundwork for Google's later advancements in AI-driven applications like search and translation.
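The n-gram approach mentioned above predicts each word from counts of its preceding context. A toy count-based version (a minimal sketch of the general technique, not Google's production system, which operated at trillion-token scale with smoothing and distributed storage) looks like this:

```python
# Minimal count-based n-gram language model: a toy sketch of the
# statistical-era technique, not Google's 2007 translation system.
from collections import Counter, defaultdict

def train_ngram(tokens, n=3):
    """Map each (n-1)-token context to counts of the token that follows."""
    counts = defaultdict(Counter)
    for i in range(len(tokens) - n + 1):
        context = tuple(tokens[i:i + n - 1])
        counts[context][tokens[i + n - 1]] += 1
    return counts

def predict(counts, context):
    """Most frequent continuation of a context, or None if unseen."""
    nexts = counts.get(tuple(context))
    return nexts.most_common(1)[0][0] if nexts else None

corpus = "the cat sat on the mat and the cat sat on the chair".split()
model = train_ngram(corpus, n=3)
print(predict(model, ["cat", "sat"]))  # -> 'on'
```

Real systems of that era added smoothing to handle unseen contexts; the contrast with neural models, which generalize across contexts instead of memorizing counts, is what the later sections of the conversation build on.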
6. Breakthroughs in Language Models and Gemini Project
Key Discussion Points:
- Development of Large Language Models: Transition from n-gram models to Transformers and beyond.
- Gemini’s Role: Current efforts in leading AI research towards AGI.
Notable Quotes:
- Jeff Dean [19:07]: "Once we built that for translation, the serving of large language models started to be used for other things."
- Noam Shazeer [20:10]: "We're going to have to be very, very flexible and dynamic as we improve the capabilities of these models."
Insights:
- The conversation delves into how foundational language models evolved into multifaceted tools used across Google’s ecosystem, setting the stage for the Gemini project’s ambitious goals.
7. Inference Time Compute and Scaling AI
Key Discussion Points:
- Inference vs. Training: The importance of compute dedicated to running models versus training them.
- Scalability Challenges: Techniques to handle increasing computational demands during inference.
Notable Quotes:
- Noam Shazeer [54:17]: "We've been exploiting and improving pre-training a lot in the past and post-training and those things will continue to improve."
- Jeff Dean [55:45]: "We've got a bunch of techniques now that seem like they can kind of do that."
Insights:
- Jeff and Noam discuss the critical role of inference time compute in enhancing model performance, exploring algorithmic improvements and hardware optimizations to manage scaling effectively.
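A common rule of thumb (an approximation widely used in the field, not a figure from the episode) is that a dense decoder spends about 2 FLOPs per parameter per generated token, which makes the scale of inference compute easy to estimate:

```python
# Rule-of-thumb inference cost for a dense decoder-only model:
# roughly 2 * parameters FLOPs per generated token. This is a common
# approximation, not a figure quoted in the episode.

def inference_flops(n_params, n_tokens):
    return 2 * n_params * n_tokens

# Illustrative, assumed numbers: a 70B-parameter model serving
# 1 million responses of 500 tokens each per day.
daily = inference_flops(70e9, 500) * 1_000_000
print(f"{daily:.1e} FLOPs/day")  # -> 7.0e+19 FLOPs/day
```

Estimates like this are why the guests treat inference as a growing class of computation in its own right, with its own hardware and algorithmic optimizations distinct from training.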
8. Model Architectures and Mixture of Experts
Key Discussion Points:
- Mixture of Experts (MoE): Utilizing specialized sub-models within a larger architecture to enhance efficiency.
- Pathways System: Google's approach to modular and scalable model design.
Notable Quotes:
- Jeff Dean [10:26]: "We're getting better at quantizing or having much more reduced precision models."
- Noam Shazeer [102:07]: "Actually in the past I found experts to be relatively easy to understand."
Insights:
- The duo explains how MoE architectures allow for dynamic allocation of computational resources, enabling models to handle diverse tasks more efficiently by activating only relevant experts.
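The "activate only relevant experts" idea can be shown in a few lines: a router scores the experts per input, and only the top-k experts actually run. This is a toy sketch of the general MoE technique, not the Gemini or Pathways implementation:

```python
# Toy mixture-of-experts layer: a learned router selects the top-k
# experts per input, so only a fraction of parameters are active.
# A sketch of the general idea, not Google's implementation.
import numpy as np

rng = np.random.default_rng(0)
d, n_experts, k = 8, 4, 2

experts = [rng.standard_normal((d, d)) for _ in range(n_experts)]  # expert weights
router = rng.standard_normal((d, n_experts))                       # routing weights

def moe_forward(x):
    logits = x @ router
    top = np.argsort(logits)[-k:]          # indices of the top-k experts
    weights = np.exp(logits[top])
    weights /= weights.sum()               # softmax over selected experts only
    # Only the chosen experts compute; the rest are skipped entirely.
    return sum(w * (x @ experts[i]) for w, i in zip(weights, top))

y = moe_forward(rng.standard_normal(d))
print(y.shape)  # -> (8,)
```

With k=2 of 4 experts active, this layer does half the expert computation of a dense layer of the same total parameter count, which is the efficiency gain the guests describe.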
9. Continual Learning and Modular Models
Key Discussion Points:
- Organic Model Growth: Strategies for models to adapt and specialize over time.
- Continual Learning: Enabling models to learn from new data without retraining from scratch.
Notable Quotes:
- Jeff Dean [95:00]: "We have this amazing system with mixture of experts, but I think we should probably have a more organic structure in these things."
- Noam Shazeer [118:32]: "There’s got to be some way to get more from the same data."
Insights:
- Jeff and Noam explore the future of model design, advocating for more flexible and modular architectures that can evolve organically, integrating specialized knowledge while maintaining overall coherence.
10. AI's Role in Organizing Information and Moving Toward AGI
Key Discussion Points:
- Beyond Information Retrieval: AI's expanding role in synthesizing and creating new information.
- AGI Aspirations: The path toward developing more generalized and capable AI systems.
Notable Quotes:
- Jeff Dean [27:40]: "I think our multimodal capabilities are showing that it's more than just text."
- Noam Shazeer [29:13]: "These systems can actually go and do something for you or write your code or figure out problems that you wouldn't have been able to figure out yourself."
Insights:
- The conversation highlights how AI is transitioning from merely retrieving information to actively assisting in complex tasks, paving the way toward more autonomous and intelligent systems that embody AGI characteristics.
11. AI Safety, Responsible AI, and Mitigating Risks
Key Discussion Points:
- Potential Risks: Concerns about misinformation, automated hacking, and misuse of AI.
- Responsible AI Principles: Google's framework for ensuring AI benefits society while minimizing harm.
Notable Quotes:
- Jeff Dean [72:55]: "We need to put as many safeguards and mitigations and understand the capabilities of the models in place as we can."
- Noam Shazeer [76:18]: "As these systems do get more powerful, you've got to be more and more careful."
Insights:
- Jeff and Noam emphasize the importance of proactively addressing AI's ethical and safety challenges, advocating for engineering robust safeguards and fostering responsible deployment practices to harness AI's benefits while mitigating its risks.
12. Future of AI Research and Enhancing Researchers' Productivity
Key Discussion Points:
- AI-Assisted Research: How AI tools can exponentially increase researchers' productivity.
- Autonomous Software Engineering: The potential for AI to autonomously generate and refine codebases.
Notable Quotes:
- Jeff Dean [38:27]: "If you prompt it and say, 'I'd like you to implement a SQL processing database system,' it actually did a quite good job."
- Noam Shazeer [37:07]: "We therefore have to be very, very flexible and dynamic as we improve the capabilities of these models."
Insights:
- The guests discuss visions of a future where AI not only assists but actively enhances the research process, automating complex tasks and enabling researchers to focus on higher-level problem-solving and innovation.
13. Reflections and Personal Anecdotes
Key Discussion Points:
- Most Exciting Periods: Jeff reminisces about the early days of search at Google and current advancements with the Gemini team.
- Work Environment: Noam shares experiences of collaborative workspaces, like the "micro kitchen," fostering spontaneous idea exchanges.
Notable Quotes:
- Jeff Dean [85:02]: "Seeing the growth and usage of our systems was really just personally satisfying."
- Noam Shazeer [85:36]: "I love being in person, work with a bunch of great people and build something that's helping millions to billions of people."
Insights:
- Both Jeff and Noam express deep satisfaction derived from witnessing and contributing to their projects' tangible impacts, highlighting the importance of collaborative environments and continuous innovation in their careers.
14. Final Thoughts and Future Directions
Key Discussion Points:
- AI Development Paradigms: Exploring the balance between gradual improvement and rapid advancements through feedback loops.
- Publishing Research: Balancing openness with strategic disclosure to maintain competitive advantages.
- Continual Innovation: Emphasizing the need for ongoing algorithmic and hardware advancements to sustain AI progress.
Notable Quotes:
- Jeff Dean [123:12]: "Having a lot of brilliant people working together enables us to collectively do something that none of you could do individually."
- Noam Shazeer [134:37]: "We've got a model that's pretty much already trained and you want to do inference on. It is going to be a growing and important class of computation."
Insights:
- The episode concludes with reflections on the future trajectory of AI, underscoring the necessity for sustained collaboration, strategic research dissemination, and adaptive infrastructure to navigate and shape the evolving landscape of artificial intelligence.
Conclusion
Jeff Dean and Noam Shazeer provide a comprehensive overview of their extensive careers at Google, shedding light on the evolution of computational systems, the advancement of artificial intelligence, and the strategic directions leading towards AGI. Their insights into hardware specialization, model scalability, AI safety, and the future of research offer valuable perspectives for enthusiasts and professionals alike. This episode not only chronicles significant milestones in AI development but also anticipates the transformative impact of future innovations spearheaded by visionary leaders at Google DeepMind.
