Loading summary
A
Anthropic has just unveiled upgrades to Claude 3.5 and the ability for it to control your computer. This is absolutely insane. And it's funny because when they came up with this announcement on X, they kind of had this like, thing showing their new model benchmarks and showing how their new upgrades to 3.5 Sonnet were making Claude better, more capable. But the real, the real thing that I've been excited about is all of the abilities they announced and they've added some of these to their API as well. For Claude to actually control your computer, I want to walk through a bunch of demos and show you exactly what this is capable of doing. I think this is absolutely insane. We've heard this exact same thing from Apple for Apple Intelligence, but they essentially said that they were delayed and it wasn't going to work for a while. So we're not actually seeing that. We have other companies that are working on this for ages. Cloud, in my view, is the first major company that has actually rolled this out in a huge way. These demos are absolutely insane. So let's get into them. And before we do, I wanted to say if you're interested in using some of these AI tools to grow your business, to help with your work, or to start an online side hustle, I would love for you to be a member of the AI Hustle School community. This is a place where I release exclusive content every single week that covers how I'm making AI, I'm using AI tools to make money, and all side hustles that I'm doing, how everything works. It's $19 a month and the price will raise to around $100 a month eventually. So if you're interested in joining this community, there's a link in the description. I would love to have you as a member. Let's get into some of these demos. So the first one I want to talk about, I'll actually just play out and I'll talk through some of the things that they're doing. But this is absolutely fascinating. Claude is taking control of a computer. Check this out.
B
So I'm Sam and I'm one of the researchers here at Anthropic. Computer use is something that we felt was going to be important for a while now. So today we're going to be talking about a very early version we have of computer use and walking through a representative example of the things we think it's going to be useful for. We're going to be going through a quick demo today. In this fictional demo, a customer in this Case, the Ant Equipment company has come to us and asked us to fill out a vendor request form. The data I need to fill out this form is scattered in various places on my computer. What we're going to do is ask Claude to look at the spreadsheet, check if an equipment is in there, and if not, move over to the CRM and try and find some more information there. Once it has this data, Claude's going to then fill out the form for us and hopefully transfer the information across to the vendor form. The first thing that's going to happen is Claude's going to start taking screenshots of my screen and quickly realizes that the Ant Equipment company isn't actually in the spreadsheet. So the first thing it does is it swaps over to a CRM and searches for the company we're interested in.
A
Okay, this is absolutely phenomenal. In the rest of the demo, it's able to literally go to Google Chrome, do a search on there, find this vendor database. It's not able to find it. They essentially are prompting it, or it's essentially searching and finding it somewhere else. They're filling out the whole vendor request form. The whole thing is completely automated on his computer. It's absolutely phenomenal. They dropped a few other examples. One that was really interesting was done by someone who essentially, I believe it was Alex Albert, he's their head of developer relations at Anthropic. And he essentially showed how they could use it for coding. So he asked it to make them. He said the prompt he used was please navigate to a new Chrome window, navigate to Claude AI and once loaded, ask Claude to create a personal website with a 90s style theme. So it's amazing because now in these prompts we're actually getting it to the all of these multiple steps where it's literally, you know, accomplishing things on your computer and you're directing it like, go to this app, do this, go to that app. He showed this amazing demo where it literally went to the website and it was the one that said, please create a personal website for yourself in a 90 style steam. Include elements like animated GIFs, a visitor counter, bright background colors, and the basic under construction banner. Write the HTML code for this website. Okay, we're getting into the agents now because he never told it to put that prompt in. He just said create a 90s style personal website. And it's the one that came up with the prompt to do bright background colors, a visitor counter under construction banner, HTML code, and all of this. So it was able to Go accomplish this. In Claude's side panel viewer that they have for artifact, it shows exactly what the website looks like. But there was an issue with the website where there was like some. A missing file button, and some things weren't quite exactly what he wanted. So he was able to then go and prompt it further and say. He said, looking at the screenshot, I can see the file listed under the down or sorry, I guess he said. He said just to go fix the errors. And Claude went through step by step. It's kind of interesting, as in the side panel, when Claude's doing all this, it's. It's listing out exactly what it's doing and why, which is really nice. It also lists out where the mouse is moving to on the screen, so the exact coordinates and what it's clicking on. So really you have like a literal receipt of exactly what to do or what it has done, what it's clicked on, what it said. You can follow its entire process. If something goes wrong, you can see why. But it was able to actually go and fix the website. The error that he saw on the website, and it was able to complete this. This site for him that he could run. So absolutely phenomenal with code. Someone else used it for kind of a more simple task, but it was still interesting. She essentially wanted to help her plan a trip or plan, like someone was visiting and she wanted to go see the Golden Gate Bridge at sunrise or something like that. And so it came up with a prompt, it searched Google, it found the times and places, added it to her calendar. It did a bunch of interesting things. So that was kind of cool. The last one was hilarious. They. Okay, hilarious and creepy. So the last thing I want to talk about is the fact that they essentially were using the computer to complete some tasks. And while it was completing those tasks, they said in the middle of trying to work on some stuff, it just, like, opened a new tab, went to Google and searched for Yellowstone national park, and then just started looking through pictures of Yellowstone National Park. The speculation here is that when these things were trained, they're trained off of people actually doing stuff. And so often when we're actually in the middle of a project, we get distracted and we go look up some random picture of Yellowstone national park. And then we, you know, we're, oh, okay, man, I got to get back to my thing. Or, you know, you swipe through Instagram for a couple of minutes, you're like, oh, man, I got to get back to, like, whatever my project was. So anyways, it's Kind of funny because built into this training data, it would appear that there is examples of people slacking on the job and now the AI models are doing that. So it's funny because they actually push some, some code and some updates while rolling this out to essentially stop it from being sidetracked and taking breaks. Now. Funny, some people say it's creepy because it's like, oh my gosh, even AI models are so bored of our tasks, they want to take a break. I mean, it's just doing whatever it's training data was showed it to do. But definitely there's some funny, there's some funny sidetracking going on with this. Overall, this is absolutely phenomenal. This is the first time we're seeing some major updates out of an AI model that's actually able to take control of your screen and accomplish things. I think this is, this means a lot for the future. You know, they showed some examples of like, oh, it can do some code and it can plan some trip. This thing's going to be able to execute on entire marketing campaigns. It's going to be able to execute on entire buildouts of certain apps. These things are only getting better. And in their demo, you know, they said, we hope, hope that these things get significantly better in the coming months. We're getting to a place where essentially all of our tasks are going to be able to be completed. Now it's kind of interesting because on the one hand, there's still a person that's directing it to do everything, right? The person is still not removed from the situation. And so I think it's interesting, the future of work. Some people are saying, like, we'll be completely automated, but at the same time, I think that humans and human ingenuity is constantly shifting. And because of this, we always need a person to figure out a process and to figure out an edge and to figure out an arbitrage. Right? That's kind of what my whole career has been all about, is figuring out what are the arbitrage opportunities in marketing, because that's, that's been kind of my background or what are the arbitrage opportunities in building software. And that is something that I think requires a lot of creativity. You have to kind of be in the system, you have to understand what people are talking about doing, building what's trending. And maybe we get to a point where AI models could do that, but I don't see that happening for quite a while. And so I do think we're going to get AI models that completely automate tasks and jobs. But I think the people are just going to be. You're just going to be doing higher and higher level things and it'll essentially what I've been saying for like two years with this is that AI will not replace people. People will just turn into systems architects. And your job is to architect the AI system architecture agents decide what all your agents are doing. Just like a manager with employees. It's just everyone is getting put into like some sort of management position and architecting what these AIs are doing and managing them and working on what they're doing. So it's fascinating. Time to be alive. This is really crazy, everything that's going on. If you're interested, like I mentioned, in making money with different AI tools, I would love to have you as a member of the AI Hustle School community. There's a link in the description incredible people from $100 million companies that they've started to brand new companies. You get a really wide range of perspectives and feedback on anything you're working on and great ideas to help make you successful. So thanks so much for tuning into the podcast. If you enjoyed the episode, please leave it a review. I'll catch you next time.
Podcast Summary: Joe Rogan Experience for AI
Episode: Claude AI Can Now Control Your Computer for Tasks
Release Date: November 13, 2024
In this insightful episode of the Joe Rogan Experience for AI, the host delves into the groundbreaking advancements of Anthropic's Claude AI, specifically its newly unveiled capabilities that allow it to control a user's computer to perform various tasks. The discussion navigates through detailed demonstrations, potential applications, and the broader implications of such technology on the future of work and human-AI collaboration.
The episode kicks off with an enthusiastic overview of Anthropic's latest upgrades to Claude 3.5, highlighting its unprecedented ability to manage and execute tasks on a user's computer. The host expresses astonishment at the advancements, stating:
"Claude is taking control of a computer. Check this out."
— Host, [00:01]
A primary demonstration features Claude AI autonomously handling a vendor request form for Ant Equipment Company. The process showcases Claude's ability to navigate different applications and sources of data seamlessly.
Process Overview:
Host's Reaction:
"This is absolutely phenomenal... completely automated on his computer."
— Host, [01:42]
Another compelling demo involves Claude AI assisting in coding tasks, specifically creating a personal website with a 90s-themed design. Led by Sam, a researcher at Anthropic, the demonstration illustrates Claude's ability to execute multi-step coding instructions.
Key Features Demonstrated:
Notable Insights:
"It's listing out exactly what it's doing and why... you can follow its entire process."
— Host, [02:51]
Claude AI also showcases its utility in personal tasks such as trip planning. By searching for optimal times and locations to visit landmarks like the Golden Gate Bridge at sunrise, Claude integrates information directly into the user's calendar, streamlining the planning process.
Host's Commentary:
"She essentially wanted to help her plan a trip... it came up with a prompt, it searched Google, it found the times and places, added it to her calendar."
— Host, [02:51]
An intriguing aspect highlighted is Claude AI's propensity to exhibit behaviors influenced by its training data. In one instance, while performing tasks, Claude unexpectedly opened a new tab to search for images of Yellowstone National Park, reflecting common human distractions.
Host's Observation:
"It's built into this training data, it would appear that there are examples of people slacking on the job and now the AI models are doing that."
— Host, [02:51]
Ethical Considerations:
The host discusses the fine line between AI autonomy and unintended behaviors, noting that Anthropic is actively working on mitigating such distractions through code updates.
The episode transitions into a broader discussion on how Claude AI's abilities may reshape the workforce. The host posits that while AI can automate numerous tasks, human roles will evolve rather than become obsolete.
Key Points:
"AI will not replace people. People will just turn into systems architects... your job is to architect the AI system architecture agents decide what all your agents are doing."
— Host, [02:51]
The host wraps up the episode by emphasizing the transformative potential of Claude AI's new features. He underscores the excitement surrounding AI's rapid advancements and its integration into everyday tasks, while also highlighting the necessity for humans to adapt by focusing on roles that leverage creativity and strategic management.
"This is really crazy, everything that's going on... Time to be alive."
— Host, [02:51]
The episode concludes with an invitation to listeners to join the AI Hustle School community for further insights into leveraging AI tools for business growth and personal projects, reinforcing the podcast's commitment to empowering its audience in the evolving tech landscape.
Advanced Task Automation: Claude AI's ability to control and execute tasks on a user's computer marks a significant leap in AI functionality, enabling automation of complex, multi-step processes.
Practical Applications: From administrative tasks like form completion to creative endeavors such as website development and personal planning, Claude AI demonstrates versatile applicability.
Human-AI Collaboration: The future workforce will likely entail humans overseeing and directing AI agents, focusing on strategic and creative roles that complement AI's capabilities.
Ethical and Behavioral Considerations: As AI models become more autonomous, ensuring ethical behavior and mitigating unintended actions remain critical areas of development.
This episode of the Joe Rogan Experience for AI offers a comprehensive exploration of Claude AI's latest advancements, providing listeners with a clear understanding of its potential applications and the evolving dynamic between human roles and artificial intelligence.