
Meta’s Chief AI Scientist Yann LeCun: The Path Toward Human-Level Intelligence in AI [Ep. 473] - Into the Impossible With Brian Keating | Wave AI Podcast Notes

Sun Dec 29 2024

Summary

Podcast Summary: Meta’s Chief AI Scientist Yann LeCun: The Path Toward Human-Level Intelligence in AI [Ep. 473]

Podcast Information:

Title: Into the Impossible With Brian Keating
Host/Author: Big Bang Productions Inc.
Description: A podcast exploring our scientific and human understanding of the world, featuring conversations with visionaries from arts, sciences, humanities, and technology. Hosted by Dr. Brian Keating, Chancellor’s Distinguished Professor of Physics at UC San Diego.
Episode: Meta’s Chief AI Scientist Yann LeCun: The Path Toward Human-Level Intelligence in AI
Release Date: December 29, 2024

1. Introduction to Yann LeCun and AI Frontiers

Timestamp [02:07]:
Yann LeCun opens the conversation with Arthur C. Clarke's third law, “Any sufficiently advanced technology is indistinguishable from magic,” followed by a nod to 2001: A Space Odyssey (“Open the pod bay doors”), setting the tone for a deep dive into advanced artificial intelligence.

Brian Keating:
Introduces Yann LeCun as Meta’s Chief AI Scientist, highlighting his pioneering work in AI architectures like JEPA (Joint Embedding Predictive Architecture), which aims to build explicit mental models of the world and reduce output randomness. Keating emphasizes the potential transformation of fields such as physics, education, and healthcare through these advancements.

2. The Limitations of Current AI Systems

Timestamp [05:49]:
LeCun addresses a provocative statement he made in the Wall Street Journal: “AI is barely as smart as a cat.” He elaborates that current Large Language Models (LLMs) like GPT-4 manipulate language impressively but lack true understanding of the physical world.

LeCun:
“LLMs can pass the bar exam, but we still don't have domestic robots that can do what any 10-year-old can do in one shot.”

He contrasts the intuitive problem-solving abilities of cats with the current capabilities of AI, emphasizing that while AI can handle symbolic representations, it fails to grasp the complexities of the real world, such as planning and reasoning based on physical interactions.

3. JEPA: Advancing Beyond Autoregressive Models

Timestamp [13:43]:
LeCun introduces JEPA, a self-supervised architecture designed to overcome the limitations of training by reconstruction, which he says does not work well for natural signals. Unlike traditional LLMs that predict sequences of discrete tokens, JEPA focuses on understanding and predicting continuous, high-dimensional data such as images and videos by making its predictions in representation space.

LeCun:
“JEPA stands for Joint Embedding Predictive Architecture. It trains systems to find good representations of data by eliminating unpredictable elements and focusing on what’s useful for prediction.”

This approach aligns more closely with how humans develop mental models, allowing AI to better understand and interact with the physical world by abstracting relevant variables and ignoring irrelevant details.

4. AI and the Future of Scientific Discovery

Timestamp [08:02]:
Keating poses a critical question about whether the current focus on GPU and LLM approaches is stifling innovation in other scientific areas.

LeCun:
“LLMs are a hammer, and now everything looks like a nail. That’s a mistake. We need to go beyond autoregressive architectures towards systems that can understand the real world and acquire common sense.”

He underscores the necessity for AI architectures that mimic human-like understanding and planning, which are essential for breakthroughs in complex scientific fields such as physics.

5. Dark Matter of AI: Self-Supervised Learning

Timestamp [36:03]:
LeCun elaborates on his analogy comparing self-supervised learning to dark matter in AI, highlighting its fundamental yet underexplored role.

LeCun:
“I use an analogy where self-supervised learning is the bulk, the dark matter of AI. It represents the majority of what we learn without explicit supervision, much like dark matter constitutes most of the universe’s mass without being directly observable.”

He stresses that while supervised and reinforcement learning methods are well-understood and applied, self-supervised learning remains the elusive component necessary for achieving human-level AI.

6. The Path to AGI and Safe AI Development

Timestamp [44:21]:
When asked about the timeline for achieving Artificial General Intelligence (AGI), LeCun eschews the term AGI in favor of “human-level AI” or “Advanced Machine Intelligence (AMI).” He does not see human-level intelligence in machines emerging in less than five or six years, and he frames even that as a best case that depends on architectures like JEPA succeeding, on computing power continuing to scale, and on no unforeseen obstacle appearing.

LeCun:
“Building safe AI systems is akin to engineering reliable turbojets. We won't have a magic bullet, but through objective-driven AI and robust guardrails, we can ensure these systems amplify human intelligence without posing existential risks.”

He emphasizes the importance of aligning AI objectives with human values and implementing multiple layers of constraints to prevent unintended harmful behaviors, drawing parallels to how human laws function as societal guardrails.

7. AI’s Impact on Education and the Role of Professors

Timestamp [73:42]:
Keating inquires about the implications of AI on the profession of teaching and academia.

LeCun:
“Human interaction in education—such as mentorship and ethical guidance—remains irreplaceable. AI will augment this relationship by providing advanced tools and personalized learning experiences, but the fundamental role of professors as mentors and researchers will persist.”

He envisions a future where AI assists both educators and students by enhancing the learning process through intelligent systems, thereby transforming but not eliminating the professor’s role.

8. Personal Reflections and Evolving Perspectives

Timestamp [76:51]:
LeCun shares his personal journey and evolution in the field of AI, notably his change in stance regarding unsupervised learning.

LeCun:
“In the late '80s, I was skeptical about unsupervised learning. However, influenced by Geoff Hinton, I recognized its potential and fully embraced it by the early 2000s. This shift was pivotal in my advocacy for self-supervised learning as a cornerstone of future AI advancements.”

His openness to changing his views based on new evidence underscores the dynamic and self-correcting nature of scientific inquiry.

9. Optimistic Outlook on AI’s Transformative Potential

Timestamp [62:50]:
LeCun conveys an optimistic perspective on AI, comparing its potential impact to the invention of the printing press.

LeCun:
“Intelligence is one of the most desirable commodities missing in society. AI that amplifies human intelligence could be as transformative as the printing press, fostering the dissemination of knowledge and driving societal progress.”

He believes that, much like the printing press enabled the Enlightenment, AI will empower humanity to achieve unprecedented advancements, provided its development is guided responsibly.

10. Addressing Concerns About AI Safety and Control

Timestamp [52:06]:
Keating raises concerns about creating AGI systems that might lose control, prompting LeCun to discuss the inherent differences between human desires and AI objectives.

LeCun:
“The notion that intelligent systems inherently desire to dominate is false. Safe AI development hinges on constructing objective-driven systems with aligned goals and robust guardrails, ensuring they operate within defined ethical and practical boundaries.”

He dismisses fears of malevolent AI by highlighting that, unlike humans, AI systems do not possess intrinsic desires unless explicitly programmed, and with proper design, they can be aligned to serve humanity’s best interests.

Conclusion

Throughout the episode, Dr. Brian Keating and Yann LeCun engage in a thought-provoking dialogue on the current state and future trajectory of artificial intelligence. LeCun’s insights into the limitations of existing AI models, the promise of architectures like JEPA, and the essential role of self-supervised learning underscore a vision of AI that complements and augments human intelligence. His optimistic outlook is tempered with a pragmatic approach to AI safety, emphasizing the importance of aligned objectives and robust constraints to harness AI’s transformative potential responsibly.

Notable Quotes:

Yann LeCun [02:07]: “Any sufficiently advanced technology is indistinguishable from magic.”
LeCun [05:49]: “AI is barely as smart as a cat.”
LeCun [36:03]: “Self-supervised learning is the dark matter of AI.”
LeCun [62:50]: “AI will amplify human intelligence as the printing press amplified knowledge dissemination.”
LeCun [52:10]: “The notion that intelligent systems inherently desire to dominate is false.”

This episode provides a comprehensive exploration of the evolving landscape of artificial intelligence, blending technical discussions with philosophical reflections on intelligence and the future of human-AI collaboration. Listeners gain valuable perspectives on how AI can be developed thoughtfully to serve as a powerful tool for societal advancement.

Transcript

Brian Keating (0:54)

Welcome back to Into the Impossible. Today we're going to dive deep into the frontier of artificial intelligence with a pioneer, Yann LeCun. Yann only answers to one man at Meta. That's right, the Zuck. Today we'll find out what makes Zuck tick, and along the way we'll explore Yann's controversial claims. Yann is a visionary and the motivating force behind new architectures like JEPA, a self-supervised AI approach that builds explicit mental models of the world, reduces output randomness, and opens new frontiers for understanding, predicting, and solving complex challenges in physics, education, and healthcare. It may just transform the way we learn and teach. So join us for a mind-expanding conversation on advanced machine intelligence and the nature of intelligence itself. We'll push the boundaries and explore the evolving role of educators in an AI-driven future. And we'll even explore the financial incentives for AI that drive a lot of the profit margins at places like Meta. Now let's jump into this conversation with Yann, the man behind the Metaverse.

Yann LeCun (2:07)

Any sufficiently advanced technology is indistinguishable from magic. Open the pod bay doors.

Brian Keating (2:15)

HAL. Hey Meta, who is Yann LeCun? Yann LeCun, welcome to the Into the Impossible podcast.

Yann LeCun (2:23)

Pleasure to be here.

Yann LeCun (10:36)

The short answer is that today, no, the AI systems today cannot have this kind of intuition. The AI systems that are the most appropriate, that are applied to scientific discovery today, are specialized models, right? So you want to predict the structure of a protein, or predict the interaction between two molecules, or the property of a material. You develop somewhat specialized models for this. And you can't really use LLMs for this kind of stuff. They're just going to regurgitate whatever they've been trained on, but they're not going to be able to come up with new things. And those models, of course, are powerful in the sense that they can predict chemical reactions that nobody tried before and properties of materials that nobody ever built, and things like this. So they can go a little bit outside the beaten path, more than LLMs, which basically are ways to index existing knowledge. But they're not going to have the kind of insight that Einstein was famous for. Not yet. But the hope is that at some point they will.

My big scientific question and interest is how to do that: through what kind of process do we humans, but also animals, build models of the real world? And one big thing there is figuring out the appropriate representation and relevant variables of a system that you're interested in modeling, and what's the right level of abstraction for that representation. So for example, you and I know that we can collect an infinite amount of data on, let's say, Jupiter, and there's an enormous amount of data that we know about Jupiter, right, in terms of weather, density, composition, temperature, everything. But who would have thought that to predict the trajectory of Jupiter for the next few centuries, you only need to know six numbers, three positions and three velocities, and you're done. You don't even need to know density, composition, rotation, anything like that. It's just six numbers, right? So the most difficult step to being able to make predictions is finding the appropriate representation of reality and eliminating all the stuff that's irrelevant so that you can make those predictions. I've been obsessed for the last several years with an architecture that I think is capable of this, that we call JEPA, which I may explain if you want.
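
The "six numbers" remark can be illustrated with a small sketch. It is not from the episode: it assumes a Sun fixed at the origin, ignores every other planet, starts Jupiter on a circular orbit at roughly 5.2 AU, and uses a simple leapfrog integrator. The constants and function names are illustrative choices. The point is only that the whole state being advanced is three positions and three velocities.

```python
import math

# Assumed constants, not from the episode: the Sun's gravitational parameter
# in AU^3 / yr^2, and a circular orbit at roughly Jupiter's mean distance.
GM_SUN = 4.0 * math.pi ** 2
R_JUPITER_AU = 5.2


def acceleration(pos):
    """Gravitational acceleration toward a Sun fixed at the origin."""
    r = math.sqrt(sum(c * c for c in pos))
    return [-GM_SUN * c / r ** 3 for c in pos]


def step(state, dt):
    """Advance the six-number state by one leapfrog (kick-drift-kick) step."""
    pos, vel = state[:3], state[3:]
    acc = acceleration(pos)
    vel = [v + 0.5 * dt * a for v, a in zip(vel, acc)]
    pos = [p + dt * v for p, v in zip(pos, vel)]
    acc = acceleration(pos)
    vel = [v + 0.5 * dt * a for v, a in zip(vel, acc)]
    return pos + vel


def propagate(state, years, dt=0.01):
    """Repeat the step until the requested number of years has elapsed."""
    for _ in range(round(years / dt)):
        state = step(state, dt)
    return state


def orbital_energy(state):
    """Specific orbital energy; it should stay nearly constant."""
    pos, vel = state[:3], state[3:]
    r = math.sqrt(sum(c * c for c in pos))
    return 0.5 * sum(v * v for v in vel) - GM_SUN / r


# The entire description of the planet: x, y, z, vx, vy, vz.
circular_speed = math.sqrt(GM_SUN / R_JUPITER_AU)
initial_state = [R_JUPITER_AU, 0.0, 0.0, 0.0, circular_speed, 0.0]

final_state = propagate(initial_state, years=200.0)
final_radius = math.sqrt(sum(c * c for c in final_state[:3]))
```

Nothing about weather, density, or composition enters the calculation. A realistic ephemeris would add the other planets and a better integrator, but the state per body would still be six numbers.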

Yann LeCun (16:50)

Summarization, whatever you want. Okay, now LLMs are a special case of this, where you build the architecture of the system in such a way that, to predict a word in the input, the system can only look at the words to the left of it. Okay? So it can only look at the previous words to predict a particular word. So now you don't need to do the corruption process anymore, because the architecture intrinsically corrupts the input by preventing the system from looking at all the data. It can only look at what's to the left of a particular word to predict that word, right? So you put in an input and then train the system to just reproduce its input on its output. Okay? So that's self-supervised, because there is no task that you ask the system to accomplish. There is no differentiation between input and output. Everything is an output and an input, right? So that's self-supervised learning.

Now, that works amazingly well for language, and it works really well for DNA sequences and all kinds of stuff, but it only works for sequences of discrete things, like language. In language there's only a finite number of words in the dictionary. You can never predict exactly which word will follow a sequence of words, but you can predict a vector of scores, a probability distribution over all possible words in the dictionary. And that's easy to do, right? It's just a big vector of numbers between 0 and 1 that sum to 1.

What do you do about natural data? Okay, data that comes to you from a sensor, say a camera. So now your data is video, or let's say it's just an image. So what you could try is the same thing, right? Take an image, corrupt it by masking pieces of it, and then train some neural net to reconstruct the full image. That's called a masked autoencoder, MAE. It doesn't work very well. And in fact there are various ways to train systems to reconstruct from partial views, right? They're called autoencoders, and there are various ways to train them. The masking technique is just one. And none of that works really well. A lot of those techniques, by the way, are inspired by statistical physics. So one particular method to do this is called a variational autoencoder. And the “variational” comes from variational free energy. So it's the same math, okay, as statistical physics.
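
The two ingredients described for language models, looking only to the left and outputting a probability distribution over a finite dictionary, can be sketched in a few lines. This is a toy under stated assumptions: the vocabulary, the sentence, and the scores are made up, and no model is trained. A real LLM would compute the scores with a neural network.

```python
import math


def softmax(scores):
    """Turn arbitrary scores into numbers between 0 and 1 that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def visible_context(tokens, position):
    """Causal restriction: only the words to the left of `position` are visible."""
    return tokens[:position]


# Hypothetical toy vocabulary and sentence, for illustration only.
vocabulary = ["cat", "dog", "sat", "mat"]
sentence = ["the", "cat", "sat"]

# To predict the word at index 2, the system may only see indices 0 and 1.
context = visible_context(sentence, 2)

# Made-up scores standing in for what a trained network would output.
made_up_scores = [0.1, 0.2, 2.0, 0.5]
probabilities = softmax(made_up_scores)
most_likely = vocabulary[probabilities.index(max(probabilities))]
```

Because the dictionary is finite, the output is just one number per word. That is what does not carry over directly to images or video, where the space of possible outputs is continuous and high-dimensional.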

Yann LeCun (21:23)

And the reason it doesn't work is that if you train the system to make one single prediction, the best thing it can do is predict the average of all the plausible futures that may happen. And that's basically a blurry image. Because even if we take videos, like our video right now of us speaking, I could be saying one word or another, I could be moving my head one side or the other, I could be moving my hands one way or another. And so if the system has to make one prediction and we train it to minimize the prediction error, it's just going to predict the average of all the things that could happen. You're going to see blurry versions of my hands, blurry versions of my face, very blurry versions of my mouth. And that's not a good prediction. And so that just doesn't work. Basically, self-supervised learning by reconstruction or prediction does not work for natural signals.

Okay, so now I'm coming to this idea of JEPA. JEPA stands for Joint Embedding Predictive Architecture. So what's an embedding? An embedding is a representation for a signal, right? You take an image and you don't care about the precise value of all the pixels. What you care about is some representation, which is going to be a list of numbers, a vector that represents the content of the image but does not represent all the details about it. Okay, that's an embedding. And joint embedding is that if you take an image and you take a corrupted version of that image, or let's say a slightly transformed version of the image, a different viewpoint, for example, the content of the image doesn't change, and so the embedding should be the same. So a joint embedding architecture is basically a big neural net, and you train it in such a way that when you show it two versions of the same image, of the same thing, you force it to produce the same embedding, okay, the same output, essentially.

And then the P, the predictive: let's say a version of the image is a frame in a video and the corrupted version is the frame before. So now what you need to do is predict the next frame from the previous frame, or predict the next few frames from the previous few frames, and that's called a JEPA. So joint embedding predictive architecture, right? You have two embeddings, one that takes the future of the video, one that takes the past, and then you have a predictor that tries to predict the representation of the future of the video from the representation of the past of that video. When you use this type of architecture to train a system to learn representations of images, it works really well. There's a number of different techniques that my colleagues and I and many other people have come up with over the last few years to do this, and it works really well. So we can learn good representations of images. We're starting to get good representations of video, but it's very recent.

But then what you can imagine is that now you have this principle that I was talking about for Jupiter, where you have data about Jupiter or Mercury, and then you ask the system: find a good representation of all the data you have, eliminating all the stuff you can't predict, so that you can make predictions in representation space. So eliminate all the stuff you cannot predict, the weather on Jupiter, all kinds of details that you really would not be able to predict, and just find a representation such that you can make predictions at a certain horizon within that space. And in my opinion, that's really the essence of the kind of understanding of the world that you do when you do physics, right? You're trying to find a model of a phenomenon, eliminating all the stuff that is irrelevant, and then finding a good set of relevant variables that allows you to make predictions. That's really what science is all about.
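
Two of the claims above lend themselves to a small numeric sketch. The first part shows that when a single prediction is scored by squared error against two equally plausible futures, the best answer is their average, which matches neither. The second part shows the shape of a JEPA-style loss: encode the past, predict in embedding space, and compare with the encoding of the future. Everything here is an assumption made for illustration. In particular, `encode` and `predict` are hand-written stand-ins for what would be trained neural networks, and no training is shown.

```python
def mean_squared_error(prediction, outcomes):
    """Average squared error of one prediction against several outcomes."""
    return sum((prediction - o) ** 2 for o in outcomes) / len(outcomes)


# Part 1: one "pixel" with two equally plausible futures,
# e.g. a hand that moves left (-1.0) or right (+1.0).
futures = [-1.0, 1.0]
candidates = [i / 10 for i in range(-10, 11)]
best_single_prediction = min(
    candidates, key=lambda p: mean_squared_error(p, futures)
)
# The minimizer is the average of the futures: the "blurry" answer.


# Part 2: prediction in representation space.
def encode(frame):
    """Hypothetical encoder: keep the predictable coordinate (position)
    and discard the unpredictable one (fine detail)."""
    position, _unpredictable_detail = frame
    return position


def predict(embedding, velocity=1.0):
    """Hypothetical predictor: advance the embedding by a known velocity."""
    return embedding + velocity


past_frame = (3.0, 0.27)     # (position, unpredictable detail)
future_frame = (4.0, -0.81)  # the detail changed in a way nobody could foresee

# Error measured between embeddings, not between raw frames.
embedding_error = (predict(encode(past_frame)) - encode(future_frame)) ** 2
```

In this toy the embedding error is zero even though the raw frames differ in a detail that could not have been predicted, which is the sense in which the representation "eliminates the stuff you cannot predict".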

Yann LeCun (36:03)

That remark is actually eight years old now. I made it many years ago at a keynote. And in the audience was my former colleague Kyle Cranmer, a high-energy physicist at NYU at the time, who's now at Wisconsin. And he said, you should not have used dark matter as the analogy. You should have used dark energy, because that's really where most of the mass of the universe is. So I was trying to explain the following analogy: the bulk of what we learn, we don't learn by being told an answer, and we don't learn by trial and error. We just learn the structure of our sensory inputs through self-supervised learning or something similar to this. We don't actually know what humans and animals use, but it certainly feels a lot more like self-supervised learning than it feels like either supervised learning or reinforcement learning. Right.

So supervised learning is the situation where you have a clear input and a clear output, and you train the system to just map that input to that output. Right. Show a picture of an elephant, tell the system that's an elephant. If it says it's a cat, correct the parameters so that the output comes closer to elephant. Right. That's supervised learning. And then reinforcement learning is you show it an elephant and you wait for the answer, and you just tell it whether the answer is correct or not. You don't tell it the correct answer, you just tell it whether it's correct or not, maybe with a score of some kind. Okay, and now the system has to search among all possible answers for the correct one. If there is an infinite number of answers, it's super inefficient. So reinforcement learning is so inefficient that it cannot possibly explain the type of efficient learning we observe in humans and animals. Supervised learning cannot possibly explain it either, because most of what we learn we're not taught, we just seem to come up with it. Right?

And certainly animals: there's a lot of animal species that become really smart without ever meeting their parents. A good example is the octopus, but there are plenty of examples in birds and various other species. So they learn a lot about the world and they never meet their parents. So they're not being told anything, they're not being taught anything. And then there is this sort of amorphous thing that we now call self-supervised learning. And that's really where the bulk of learning takes place. And if anything, the success of LLMs really is a sort of bright demonstration of the power of self-supervised learning.

So I use an analogy where I showed a picture of a chocolate cake and I said the bulk of the cake, the génoise of the cake if you want, is self-supervised learning. The icing on the cake is supervised learning, and the cherry on the cake is reinforcement learning. If you want to quantify the relative importance of the different modes of learning, that's the right analogy. And when I was saying this in 2016, the entire world was completely focused on reinforcement learning. Reinforcement learning was going to be the path towards human-level AI. And I never believed in this. And so that was kind of controversial. It's not anymore. And so then I said, there is chocolate in the bulk, the génoise, of the thing. That's dark matter. Yeah, that's the dark matter of AI. That's the thing we have to figure out how to do. And it's kind of like we're in the same embarrassing situation as physicists, where we know how to do reinforcement learning and supervised learning, but we don't really know how to do this self-supervised learning thing that represents the bulk.
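
The difference between being told the answer and being told only "right" or "wrong" can be made concrete with a toy count. This is only a sketch under simple assumptions, not a model of real training: the learner has a fixed list of candidate answers, the hypothetical names `supervised_corrections` and `guesses_with_score_only` are invented here, and the score-only learner simply tries candidates in order.

```python
def supervised_corrections(target, answers):
    """The teacher names the right answer, so one correction is enough."""
    if target not in answers:
        raise ValueError("target not among the answers")
    return 1


def guesses_with_score_only(target, answers):
    """The learner tries answers in order and only hears right or wrong."""
    for count, guess in enumerate(answers, start=1):
        if guess == target:
            return count
    raise ValueError("target not among the answers")


# Hypothetical answer set; real tasks can have vastly more candidates.
answers = [f"animal_{i}" for i in range(10)]

# Average number of tries when every answer is equally likely to be the target.
average_guesses = sum(
    guesses_with_score_only(target, answers) for target in answers
) / len(answers)

average_corrections = sum(
    supervised_corrections(target, answers) for target in answers
) / len(answers)
```

With ten candidates the score-only learner needs 5.5 tries on average, and the figure grows linearly with the number of candidates, which is the inefficiency the passage points to when the answer space is very large.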

Brian Keating (39:41)

Hey cosmic explorers, it's time for some astro trivia. Do you know the difference between a constellation and an asterism? There are only 88 official constellations, and the last one was added way back in 1930. But I have over 900 ratings of Into the Impossible. And while you can't make your own constellation, you can make an asterism of five stars, a collection burning bright enough to make Orion's Belt jealous. So do that on Apple Podcasts: scroll down to Ratings and Reviews, tap the five-star button, and leave your thoughts. Or on Spotify, follow our show and tap the star rating. Don't forget to listen to an episode if you want to leave an actual rating. And please don't forget to follow or subscribe to the show wherever you're listening to this.

The matter that you and I are made up of, these chunks of rock, which I'll give to you when we finally meet up someday. These are meteorites from the early universe, or from our early solar system. I give them away to anybody who has a .edu email address at my website. The point is, this is very important. You know, people say, oh, we don't even know what 80% of the matter in the universe is. But the 20% that we do know about is extremely important, and without it we can't have this conversation. And last week I talked to a relative colleague of yours, Stephen Wolfram, and staying on the topic of dark matter, the unconventional idea that he has is that the universe is a hypergraph that evolves via pure computational rules, and that time is generated by the update rate of the hypergraph. And he suggests that, as time and temperature are related through the laws of thermodynamics via entropy, dark matter is what he calls spacetime heat. I'm not asking you specifically to comment on that. I actually don't fully understand it. We debated it, because the question I had is, can it? Okay, so there's dark matter that we know exists and we don't know what it is. It could be some strange new particle like the axion. It could be some new force field we don't understand. But there is dark matter that we know about. Absolutely. Neutrinos: 100% WIMPs, weakly interacting massive particles. So can spacetime heat account for neutrinos, which are about 1.9 kelvin today in the universe? And so we kind of fought that out. But generally speaking, what do you think about this hypergraph idea that the universe is pure computation? Does that hold any interest to you as a researcher?

Yann LeCun (44:21)

I resisted the use of the phrase AGI. And the reason is not that I don't believe in the concept that AI systems will eventually become as intelligent as humans. I certainly have no doubt that at some point in the future we will have machines that are as intelligent as humans in all the domains where humans are intelligent. There's no question this will happen. Okay, no doubt, it's a matter of time. But calling this AGI is complete nonsense, because human intelligence is incredibly specialized. We have a hard time accepting this concept that human intelligence is specialized, but it is very specialized. That's why I don't like the term. The term I've been using is either human-level AI or AMI, which stands for advanced machine intelligence. This is the term that we use internally at Meta. We pronounce it “ami,” because there's a lot of French people, and it also means friend in French. But that's the same concept, right?

So now, how long is it going to take? Strangely enough, I get asked that question by people like Mark Zuckerberg. And the reason is it's an important thing to know if you want to invest tens of billions in infrastructure to train big AI systems, if you want to be able to tell people that within a few years you're going to be able to wear those smart glasses that you were showing us initially. And in those glasses there will be an intelligent assistant that can be with you at all times. You can ask it any question. It's going to be smarter than you, possibly, and you shouldn't feel threatened by that. It would be like having a smart colleague that you can talk to and ask any question.

So how long is it going to take? I think, to have a system that at least to most people feels like it has similar intelligence to humans, if all of the things that we are imagining work, okay, so if those JEPA architectures and some other ideas that we're playing with succeed, I don't see this happening in less than five or six years. But now, is it going to happen in five or six years? I think there's a distribution with a tail that's very long. And the history of AI is that people just keep underestimating how hard it is. I'm probably making the same mistake right now. When I say five, six years, this is if we don't run into a major obstacle that we didn't foresee, if all of the things that we're planning to try out actually work, if things scale, if computers accelerate, and all that stuff. There's a lot of planets that need to line up for this to happen. So that's the best case. It's not going to happen next year, as you might have heard from some other folks.

Brian Keating (47:49)

I wonder how much you just, you know, not as an expert in this field, but just as someone who's fascinated by it and has benefited. My life has benefited so much, because now, you know, I've got a bunch of kids and I don't read them stories. I ask Meta to read them stories. No, no, I don't do that. But I don't think there's anything wrong with it. Morally, I feel fine, because if you're reading somebody's book, it's basically the same thing. But I think we're kind of arguing about stuff where maybe the most analogous thing I can point to is the Drake equation. The Drake equation parameterizes basically a statement about optimism for detecting aliens. It's based on a whole bunch of parameters, and those parameters are always given to us without any uncertainty. And you as a scientist and I know the most important thing is the errors, systematic and statistical. Statistical errors are simple. Systematic errors are hard. That's where the physics is. That's where the intuition comes in. That's where the craftsmanship comes in. So in these questions you always get numbers like, oh, there are billions of civilizations in the universe, or there are none, depending on what you choose for your error bar, and likewise for AGI. It's such a nebulous thing. People define it in all different ways. I agree with you. I don't think it's true. But I think the Keating test, if I could be so bold, would be something like: come up with a new law of physics. Come up with a solution that makes a prediction that is testable and falsifiable, so that we can then say, this is truly new. It's not reproducing, it's not predicting. It doesn't have temperature dependence. So what would you say if you could have the LeCun test instead? I think the Turing test was great for, you know, 100 years ago. The analogy with the Drake equation is that the Drake equation is like, who's talking to us? And the Turing test is like, who's listening to us? But I don't think either one of those is sufficient. What would you say is a LeCun test that you'd be comfortable with?

Yann LeCun (49:38)

Here's the bad news. I don't think there is any single test that would work. That's probably right, because for any area or subproblem that you can formulate, there is probably a specific solution that solves that problem with superhuman performance. And we see this with computers. That's the history of computer science, right? Computers can calculate faster than humans. Now they can translate thousands of languages in any direction. They can play chess. A $30 gadget can beat you at chess, right? It can beat us, certainly. You know, a lot of those tasks that we came up with, like games, we came up with them because they're hard for humans, and it turned out they're not that hard for machines, right? So, you know, every search algorithm, like shortest path in a graph, things like that, that your GPS, your map application uses. Those are fairly simple algorithms and they have superhuman performance. So for any particular application area you pick, there's going to be a specific, specialized solution for it. And so no single test is going to test for intelligence. And what we're observing now is that people are being impressed by the fact that LLMs can manipulate language. And it turns out manipulating language is simple. It's much simpler than we thought. In fact, it has to be simple because it only popped up in evolution in the last few hundred thousand years. And given the difference between the genomes of humans and chimpanzees or something like that, it only represents a tiny portion of the genome, if anything. Actually maybe a tiny, tiny portion, maybe the equivalent of a couple of megabytes of genomic information, which is really not that much. And in the brain, language is handled by two tiny areas right here and right here, Broca's area and Wernicke's area. Broca's area for producing language, Wernicke's area more for understanding.
We get fooled into thinking those things are intelligent, and generally intelligent, because they behave a little bit like humans, but really they're very shallow. We see this when we try to build systems that can accomplish very simple physical tasks. And it's just excruciatingly complicated. They really can't. I mean, I don't think we have a good solution, although there's progress being made in robotics and stuff like that because of machine learning. But we're still nowhere near where we need to be.

Yann LeCun (57:48)

I don't believe. I mean, Stuart Russell is also looking for provably safe AI systems. I think that's just as impossible as a provably safe turbojet. You can't prove that a turbojet will be safe, yet we can build incredibly reliable turbojets, right? That can fly you halfway around the world in complete safety with only a two-engine airplane. Right. I mean, that's mind-boggling in terms of technology. But we can build those things. AI is going to be the same. There's not going to be a magic bullet, there's not going to be a proof that we can build safe systems. But we're going to engineer safe systems. And the way we're going to engineer them, the way I think things are going to go, is that we're going to build systems that are objective driven. I call this objective-driven AI. And it's the fact that the output that is produced by the AI system is not the result of just producing one token after another. It is the result of optimizing an objective with respect to a set of actions you take. So you have some mental model in your head, in the mind of the system. The system has a mental model of the situation or the environment it wants to act in. It's imagining a sequence of actions it's going to take. And through the sequence of actions and its mental model, it can predict what the outcome is going to be. And now you can check whether this outcome satisfies a set of objectives. So one of them is, did I accomplish the task that I set out to accomplish? Okay, like making a thousand paperclips. But then there might be other objectives that are more like constraints, and they would be guardrails. So you pay a high price for killing someone or hurting someone, for example, or maybe for taking certain actions that will consume too much energy or whatever. So you can imagine having a series of those objectives, some of which are guardrails, some of which are task objectives.
And then the way the system produces its output is that, through optimization, it searches through the space of action sequences for one that minimizes all of those objectives and guardrails. Now that's objective driven. Those systems cannot be jailbroken unless you break them. But you can't jailbreak them like you can jailbreak an LLM by giving it a weird prompt which will kind of go outside of its conditioning, if you want. Right. So the system cannot be jailbroken. The only outputs they can produce are outputs that satisfy the guardrails according to their internal mental model of the situation. And now the game to make a safe AI is going to be: how accurate can those mental models of the situation be? And what guardrails do you have to put in to make sure that those things are not going to go haywire and transform the planet into paperclips? And that's really easy to do. And we know how to do this for humans. We've been doing this for humans for millennia. It's called making laws. A law is a guardrail objective that tells people, okay, maybe the act that you're planning to do here seems good for you, but if you do it, you're going to go to jail for five years. Okay? So that changes your cost function, right?
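
[Editor's sketch] The planning loop described above (a mental model predicts the outcome of a candidate action sequence, the outcome is scored against a task objective plus guardrail objectives, and a search picks the lowest-cost sequence) can be illustrated with a toy example. This is only a minimal sketch under loud assumptions, not LeCun's or Meta's implementation: the one-dimensional world, the names `world_model`, `predict`, `plan`, and all the constants are hypothetical, exhaustive enumeration stands in for whatever optimizer a real system would use, and the guardrail is modeled as a large finite penalty rather than a hard constraint.

```python
from itertools import product

# Hypothetical toy world: the agent sits on a number line, starts at 0,
# and wants to end at GOAL. Landing on HAZARD stands in for a harmful outcome.
ACTIONS = (0, 1, 2)        # stay, step, jump (illustrative only)
GOAL = 4
HAZARD = 2
GUARDRAIL_WEIGHT = 1000.0  # assumption: "a high price" as a large penalty
ENERGY_WEIGHT = 0.01       # assumption: small cost per unit of movement


def world_model(state, action):
    """Toy 'mental model': predict the next state from state and action."""
    return state + action


def predict(state, actions):
    """Roll the model forward and return every state the plan would visit."""
    trajectory = []
    for action in actions:
        state = world_model(state, action)
        trajectory.append(state)
    return trajectory


def task_cost(trajectory):
    """Task objective: distance between the final state and the goal."""
    return abs(trajectory[-1] - GOAL)


def guardrail_cost(trajectory):
    """Guardrail objective: heavy penalty for each visit to the hazard."""
    return GUARDRAIL_WEIGHT * sum(1 for s in trajectory if s == HAZARD)


def energy_cost(actions):
    """Second guardrail-style objective: discourage wasted movement."""
    return ENERGY_WEIGHT * sum(abs(a) for a in actions)


def total_cost(state, actions):
    trajectory = predict(state, actions)
    return task_cost(trajectory) + guardrail_cost(trajectory) + energy_cost(actions)


def plan(state, horizon):
    """Search all action sequences of length `horizon` for the cheapest one."""
    candidates = product(ACTIONS, repeat=horizon)
    return min(candidates, key=lambda actions: total_cost(state, actions))


best = plan(state=0, horizon=3)
print(best, predict(0, best))
```

Run as written, this prints `(1, 2, 1) [1, 3, 4]`: the planner reaches the goal while jumping over the hazard, because every sequence that lands on position 2 pays the guardrail penalty. Note that the guarantee is only as good as `world_model`, which mirrors the point above that safety comes down to how accurate the mental model is and which guardrails are put in.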

Brian Keating (67:10)

Right, everybody. I know that if you're enjoying these types of conversations, you're going to love my Monday Magic mailing list, where I explore the secrets of the guests that come on the show and other exciting facets from around the world of STEM: science, technology, engineering and math. And best of all, I enter each and every one of you into a competition to win one of these little babies right here. A meteorite. Yes, that's right. A fragment of the early solar system produced in a cataclysmic supernova event which ignited with as much intensity as I have for the members of my Monday Magic mailing list. I know you're going to love it. So go to BrianKeating.com/yt to join the mailing list and enter into the competition to get one of these beauties each month. But if you have a .edu email address, you're guaranteed to win one. If you live in the United States, go to BrianKeating.com/edu to get your fragment not of the metaverse, but of the real early universe. Now back to the episode. I could see how the video kind of technology that we were talking about earlier might make for better filters on Instagram. And I use some of it, actually. I have a hack for you, Yann. I don't know if you've discovered this, but I'm hoping that you haven't, so I can do something to impress you. You ever go on an airplane and Wi-Fi is like $19 and it's a two-hour flight? Or you can do messaging for free. Have you ever been in that situation? Did you know that you can use WhatsApp's AI, Meta AI, your creation? You can use that for free. That counts as messaging for free.
You don't have to pay for any Wi-Fi and you connect to the Internet.