Summary6 min read

Podcast Summary: The Startup Ideas Podcast

Episode: Building AI Agents (Clearly Explained)
Date: April 8, 2026
Host: Greg Isenberg
Guest: Ross Mike

Episode Overview

This episode of The Startup Ideas Podcast focuses on demystifying the process of building effective AI agents. Ross Mike, an AI builder and educator, joins Greg Isenberg to break down best practices for using and developing AI agents—especially with today’s rapidly improving models. The central message is that most people overcomplicate agents, wasting time and resources, when in reality, less is more. The discussion provides clear guidance on context management, building "skills," and iterative development to maximize agent productivity for both technical and non-technical users.

Key Discussion Points & Insights

1. The Current State of AI Models

Models Are "Good Enough": Ross stresses that models like Opus 4.6 and GPT 5.4 are exceptionally good now; the challenge is no longer about the model, but how you use and steer it (00:34).
Not at AGI Yet: While not at "artificial general intelligence," the gap is shrinking for many tasks. The focus now is on context and harnessing agent capabilities (00:44).

2. Understanding Context in AI Agents

System Prompt & Context Assembly: Each agent run is guided by the system prompt (from the model provider), agent MD files, skills, available tools, codebase, and user conversation (01:10–06:00).
Overemphasis on Contextual Files:
- "95% of people don't need [Agent MD or Cloud MD] files." — Ross (03:02)
- Proprietary info or company-specific rules are the rare exception. Otherwise, these files unnecessarily inflate the context window and waste tokens (03:08–06:20).
Best Practices:
- Remove redundant context.
- Let the model leverage code and conversation for context.

3. The Power of Skills vs. Agent MD Files

Progressive Disclosure:
- Skills are introduced through titles and descriptions; the full content is "disclosed" only when relevant (06:20).
- This keeps context lean and efficient:
  
  "The agent only gets the bunch of info when it realizes it needs this skill. So if I have, let's say, a certain way of generating a report... why would I put that in the agent MD file when I can have the agent call on it progressively when it needs it?" — Ross (06:09)
Skills Maxi:
- Ross advocates crafting your own skills iteratively—reflecting the real workflows you want the AI to perform.
- Don't download random public skills: Security risk and lack of alignment to your processes (12:51–14:00).

4. Step-by-Step Instruction: Iterative Skill Development

The Wrong Way:
- Many try to define a workflow, then hand-craft a skill and expect agents to execute perfectly. This rarely works (08:02–09:40).
The Right Way:
- Treat AI like a new employee: Walk the agent through your workflow, step by step, helping it fail and learn.
- After successful runs, ask the AI to summarize what it did right and generate the skill itself (10:15).
- Repeat and iterate, updating the skill when new errors or needs arise (20:48–23:23).
"The best way to create a skill is to work with it in your specific workflow. Once you have a successful run, tell it: review what you just did. This is the skill you need to create." — Ross (13:27)

5. Avoiding Common Pitfalls & Scaling Productively

Don’t overbuild upfront:
- Instead of launching with many subagents and skills, start small, perfect workflows, then expand (15:05–17:00).
"I call it scaling for productivity, not scaling for what looks cool." — Ross (15:05)
Skills Marketplaces are Risky:
- Downloading others' skills can expose you to attack vectors or just result in mismatched workflows (12:51).
Templates Renaissance:
- As code is now context, robust templates (for apps, workflows, etc.) become more helpful than exhaustive, static context files (19:45).

6. Recursively Building and Updating Skills

Expect Iteration:
- Agents will fail in new ways as workflows evolve. Each failure is a chance to refine your skill or process.
- Explicitly capture errors, fix them with the agent, then update skills accordingly (21:00–23:23).
"When he messes up, you thank God you don't complain. Because a lot of people like, oh, I messed up. I'm angry. No, this is the moment where you identify the error. Tell it, this is the error. Fix it. It'll fix it itself. And you tell it to update the skill file so that this doesn't happen again." — Ross (22:21)

7. Managing the AI Context Window

Don't Fill It Up:
- Keeping your agent’s context window under 70% is ideal for performance. Overstuffing it ("tokens ain't cheap") both raises costs and makes agents less effective (31:05–33:03).
"The closer you get to 99, 100%, like 99, 90, 80%, it starts to get dumb." — Ross (31:05)

8. What Makes Skills Valuable

Personalization Over General Skills:
- The true edge is in encoding your personal/company workflow as skills.
- Don’t tell models what they already know (“use React” etc.), but only what’s unique to you or your business.

9. Big Picture: Embracing Simplicity and Doing the Work

Less is More:
- The latest agents are powerful—what matters most is clear, minimal guidance aligned to your unique needs (28:20–28:40).
- Focus on building workflows over installing flashy agent stacks.
Quote:

"If you want to scale for productivity, it starts with one agent and you building up the skills... It's not sexy...but you have to put in the work and build it up." — Ross (15:07)

Notable Quotes & Memorable Moments

"Skills, skills, skills, skills, skills is what it's at." — Ross, advocating strongly for focusing on skills over static agent configurations (33:02)
"We should treat models and these agents like very new employees versus, like, these black magic boxes that like, know everything." — Ross (14:03)
"Tokens ain't cheap now. No, but if I just have the name and the description, it's just 53 tokens." — Ross, on cost/performance benefits of minimal context (30:58)
"I call it scaling for productivity, not scaling for what looks cool." — Ross (15:05)
"Don't download [my public skills]. Do. I’m telling you now, do not download it, don’t use it...because your agent needs the context of a successful run, which you then turn to skills." — Ross (12:51)
"As long as there's no new paradigm for models, LLMs just predict tokens. They don't understand or know the way you and I do." — Ross (15:57)
"I hope this helps somebody. And can't wait to be back with more." — Ross, closing thoughts (35:17)

Timestamps for Important Segments

00:34 — State of AI models: They’re great, context matters more than ever.
03:02 — Agent MD files: Why 95% of people shouldn’t bother.
06:10 — Power of skills and how progressive disclosure works.
08:02 — The wrong approach: Handwriting "skills" before iterative runs.
10:15 — The right approach: Walking agents through workflows and "skillseption".
12:51–14:00 — Risks of public skill marketplaces and best practices.
15:05 — Scaling: One agent at a time, build up your own stack.
20:48 — Recursively improving and updating skills.
23:23 — Expecting iteration and setting realistic expectations.
30:58–33:03 — Token efficiency and managing your agent’s context window.
33:02 — The true value of skills.
34:05–35:23 — Impactful testimonials and inspiration from past listeners.

Takeaways and Action Steps

Start Small: Focus on one agent and perfect workflows before expanding.
Iterate & Personalize: Build skills directly from your own workflows, refine them through real-world errors and uses.
Be Skeptical of Shortcuts: Avoid loading up on third-party skills; build or carefully adapt your own.
Keep It Lean: Maximize performance (and minimize cost) by using minimal, relevant context.
Embrace Iteration: Expect failures, learn from them, encode improvements step by step.
Reframe Expectations: Modern AI agents are like new team members; treat onboarding and instruction with the same patience.

Final Thoughts

This episode offers practical wisdom for anyone considering building or improving AI agents—whether you're technical or not. The key is to embrace simplicity, iteration, and a hands-on approach, empowering yourself to make these powerful tools work for what makes your workflow unique.

For more startup ideas and resources, check out gregisenberg.com/30startupideas.

Loading summary

Transcript83 lines

[00:00]
A
Ross. Mike, welcome back to the pod. By the end of this episode, what are people gonna learn?
[00:04]
B
I hope I'm gonna share some wisdom on how you can use the agents better. There's a lot of information going on right now. I disagree with most of it, and that's what we're gonna talk about. So at the end, whether you're building something, using an agent for some sort of work, you have the best output possible.
[00:20]
A
And is this gonna be a technical dive or, you know, non technical person can.
[00:26]
B
Anyone can watch this. There's gonna be a lot of diagrams. That's all.
[00:29]
A
You're gonna make it clear to understand the concepts. Right?
[00:32]
B
Easy.
[00:32]
A
Okay, basics, let's go.
[00:34]
B
So. The first thing that I want to announce previous episodes, we probably disagree with this point, but now what's true is the models are good. The models are exceptionally good. Opus 4.6 is amazing. GPT 5.4 is amazing. I know there's like two sets of camp where, especially when it comes to programming, people are like, oh, opus is the UI designer. GPT 5.4 is a better backend. Generally speaking, we've reached a point, we're not at AGI yet, where we reached a point where the models are good, but context still matters. And you have the power to steer the models in a direction where you can get quality or you can get slop. And that's what I really want to talk about. But before we get into all that, and feel free to cut me off, because this topic excites me, we need to learn how context works. And context is the model assembling information that it needs to execute an action and the way the context is assembled, let's say in a coding agent, but really in any sort of agent is there's this general system prompt, usually by the model provider. So, for example, cloud code leaked recently. And one of the cool things that especially as a developer, I got to do is I got to read the system prompt. So they have this general system prompt that guides the model on how to act, what to do, what not to do. The system prompt is very important. And then you have a lot of people have agent MD files or cloud MD files. Now I'm just going to say off rip, 95% of people don't need this. The reason being is, again, you have to assume that the models are already good right? Now, imagine I told you, Greg, every time we're about to shoot a podcast, Greg, you need a microphone. You know you need a microphone, right? You've done this plenty of times, right? So if I'm building like let's say a website where with cloud code, and I'm telling cloud code, this code base uses React, I don't need to because it has the code base in context, it can check the code. Right? So there is this disparity where a lot of people are putting a lot of onus on the harness and the context building and I'm low key, starting to strip things off. Like I'm going super, super minimal because again, not to sound like an anthropic or open a shill, unfortunately I have not been acquired. None of them are paying me. But the models are really, really Good.
[02:57]
A
Wait, so 95% of the time I don't even need to bother with an Agent MD file?
[03:02]
B
You don't. Unless this is some sort of proprietary information. Yeah.
[03:06]
A
What is the 5% of the time I should care about it?
[03:09]
B
Proprietary information that like maybe specific to your company or some methodology that is specific to you that has to be referenced in every single conversation. Because the annoying part with an Agent MD file is every time you go back and forth with the agent, it's added in the context, right? The cool thing about skills, and I'm going to talk about skills in a second, the way skills are designed, the skills are used in a way that's called progressive disclosure. Meaning when you have a skill file, the entire thing is an added to context. It's just the title and the description. So the agent has the title and description in the context. And when you, let's say you have a notion report skill, right? And you tell your agent, hey, I want you to create a notion report, it's then going to check its context and be like, oh, I have this skill. Let me check out the entire document. So it's not in the context. What's in the context is the name and the description. But that's enough for the agent to be like, oh, this is a skill I need, let me go use it. Which is fantastic. I'm a skills maxi and I'm going to show later in the episode, like how you craft the perfect skills. But with Agent MD and Claude MD files, it's context being added at every turn, right? So let's say you have like a thousand line file, Cloud MD and let's say that's like 7,000 tokens. You're spending 7,000 tokens on every run. Now do you need to. Most likely not. It probably should be a skill. But if you have some sort of company proprietary information or like there's something specific that you do that, the model needs to know at every single turn, then you use it. The thing is, 95% of people don't have that, right? So I'm not a fan unless that's the case. So. And the reason being is we're wasting tokens, right? It's in every single turn. But this is where the beauty of skills come. I'll show my screen here the your skill again, this is not like word for word how it looks. But a skill basically looks like this. There is a name, there is a description, and then underneath is a bunch of information. I'm going to put a bunch of info. What? When you create a skill MD file, what gets added into the context is actually just the name and the description, right? The bunch of info doesn't get added. So imagine you have two sentences versus an agent on MD that has like a thousand lines that get added into the context. We're talking thousands of tokens compared to a couple hundred. And the agent only gets the bunch of info when it realizes it needs this skill. So if I have, let's say, a certain way of generating a report, a certain way of structuring my code, why would I put that in the agent MD file when I can have the agent call on it progressively when it needs it? Right? So this is why skills are honestly, like, I'm a shill, I'm a maxi, but people do it wrong. And I'm going to share the right way on how do we create skills. So so far we have the system prompt, the agent md, the skills, and then we have the tools, right? So if you're using cloud code, there's already built in tools, a read tool or write tool. Like there's many tools that it uses. This has to be added into the context because the model, the model doesn't call the tools like it's the agent harness around it that allows it to call the tools. And then in this case we also have our code base, right? Like whatever. If we're building a web app, a mobile app, I know most people here won't care for the specific framework. And honestly we' getting to a point, if you're not technical, you really shouldn't. And then we have the user conversation. So this is what the complete context window is filled with, right? And this can total up to, let's say like at the beginning this could be like 20,000 tokens. And as the conversation continues to grow, you might reach your limit of 25, 250,000 tokens. And that's when you see Both cloud code and OpenAI Codex, they, they'll compact. Right? So beautiful so far. Right. This is how context works, why skills are important and how you should generate skills. Let's say I have a specific workflow, for example, for my YouTube channel. You know, we're at a point right now, Greg, where we get sponsors now. Crazy. When I first joined, not when I first came to the pot. Not a thing. We get sponsored.
[07:23]
A
It was just your mom sponsoring the channel.
[07:24]
B
Yeah, yeah, it was just her showing love, feeding me. But now we get sponsored. I get a lot of emails and some are good, some are bad, and it's a lot of time, I'm sure you're aware, to comb through and to check. So I have an open call agent that has its own email, right? I don't have it. I don't haven't given it access to my email because there's like attack vectors and I've been hacked before, so I'm very careful with these things. But it has its own email. And every time I get an email from like a sponsor, I forward that email to the agent. Now the first time I told my open cloud agent, I'm going to forward you emails. So check every 15 minutes when you have an email and when you check the email, do research on a sponsor and tell me if they're worth it. That's all I told the agent. Every sponsor email I sent it, it was like legit, legit, legit. Perfect, perfect, perfect. There was no, like, there was no rejection. There was no this is bad or these guys are a scam or this product's not good. Like there was no deep research being done by it. So then I realized, okay, the model needs a step by step guide. This is when I create a skill. But here's the problem. A lot of people will. I'll just write it down here. Will. Identify, identify. They have a workflow, right? You have some sort of workflow and then they'll jump to create the skill right away. This is the. Let me click hide here. This is the worst thing you can do. I'm just going to draw arrows to signify that this is bad. You don't do these. And the reason why you don't do this is imagine you hire an employee or you're mentoring somebody. Correct me if I'm wrong. You're probably going to tell them what to do and if they ask you questions on how to do it, you'll help them. You would ideally like them to fail and then you want to then tell them, no, this is how you do it. Like, there needs to be some sort of experiential learning. The way I've been creating skills, Greg and I have like a hundred percent hit rate. Now when I tell my agent to do something specific is I actually walk with it step by step on doing the workflow. So in the case of my YouTube analysis, I told the agent, okay, I just sent you an email. Tell me about the company. Companies. This, this, that and that. Okay, Check their Twitter, check their YouTube, check their trust pilot. Check if they've raised any money. If two of these are. Have not. If two of these don't exist, are not in good standing, automatic rejection. It checked and it was like, you're absolutely right. I was using Opus. Um, these. This is not a good company. And then it would just. We would. We have a spreadsheet in Google sheets. It'd be like, no contact.
[10:00]
A
It's so frustrating too, right? Because you're like. You give it a task and it seems like, so binary, like, right or wrong. And then when you tell it, hey, like, why didn't you look at the trust pilot? Why didn't you see if they've raised money? You're absolutely right.
[10:15]
B
Yeah, absolutely.
[10:15]
A
It's like, what?
[10:17]
B
And the thing is, the reason why this is the case is the models actually don't think they're predictors of tokens, right? So when you give it English, When I give it English, it maps it on this vector graph and then it looks for the closest resemblance and it says, this is the response, right? So when you say, what is the capital of France? It maps it again on this graph and it says, oh, Paris is pretty close by. Then it gives you Paris. It has no. It doesn't think, it doesn't understand. It feels like it understands. It feels like it thinks. Heck, it even feels like it has emotion. That's because it's been trained on so much data, but it actually does not know how to think. And this is where a lot of people be frustrated with, like, why is it not understanding me? You have to walk with it. So I told it, okay, this is how you research. And it's like, okay, it researched this and guess what? This is part of the context. And like, okay, now that you're done researching, when it's a good company, these are the qualities you look for. And then when it's really good, send me an email. And then once we had a successful run and we did it again and again, then I converted it to a skill. The reason being is a lot of people Create the skills themselves or I mean, they'll use the AI to create the skill. But it doesn't have the context on what a successful run looks like. Right? Because most of the times, especially if you're using Open Claw, it's probably going to fail at the API call. It's probably going to call the data wrong. Like, there's so many places it's going to get wrong. And I see a lot of people saying, it's just so frustrating. This is terrible technology. Why doesn't it work? It's because you don't understand how an agent works, right? It will mimic you perfectly, but you've given it nothing to mimic, right? So I will do the workflow myself. So the, the updated version is identify the workflow, go back and forth and teach it. So like, I'm doing it. Like, I'll be like, okay, first do the research, here's the result. And I'm like, what do you think about this? Oh, these guys are terrible. You're absolutely right. Okay, what do you, you, you should go to the Google Sheet and mark this as bad company. I've done that once. I've had that back and forth. Then I tell the AI review what you did and then create the skill. So now it has actual context with how it worked and it's going to create the skill beautifully. I don't handwrite skills. I don't think you need to. You can use AI to do it. They even have a skill to create skills. Skillseption. But you should have the context of what a successful run looks like. And this is why, by the way, Greg, I don't install skills like I've seen people like, oh, this notion skill, this social media skill, whatever, I'll review it, I'll check it out, I'll even give it to my AI and be like, oh, what are some things we can learn from this? But I don't download skills because your agent needs the context of a successful run, which you then turn to skills, right? And this is the big thing I see. You see skills marketplaces, you see, download this and that. First of all, it's a easy way to attack somebody. So I would be very, very careful with downloading some random person skills. But second of all, again, it's all about context, right? It's all about. And you know, Open Claw has a memory layer and all these type of things. You want it to do the right thing. And the only way it can do the right thing is if you give it the proper context. And to me, the best Way to create a skill is to work with it in your specific workflow. Once you have a successful run, tell it. Okay, Review what you just did. This is the skill you need to create. I'll pause here.
[13:37]
A
I mean, it makes sense, right? Because if you hired an employee, you would do the same thing.
[13:44]
B
Yeah.
[13:44]
A
You wouldn't, you wouldn't just be like, okay, go do this thing. Good luck.
[13:47]
B
Yeah.
[13:48]
A
And by the way, this is how you're going to go do things forever. You would map out a workflow, you would identify what right and wrong is. You would do it iteratively, and then once you've gotten to that point, you would codify it.
[14:04]
B
100. And I think, like, that's the thing. Like, we should treat models and these agents like very new employees versus, like these black magic boxes that like, know everything, right? They know everything because they've been trained on a lot of data, but they don't know your workflow, your steps. Right? So I see a lot of people who have, you know, 15, like right off the field set up open claw and 15 sub agents, 30 skills, yet. You haven't even set up your own workflows. Right? And these things are cool right off the bat. And there's a perfect time to use sub agents. I use some agents a lot. But the way you build, like, I call it scaling for productivity, not scaling for what looks cool, right? Like, I've seen like, for example, Paperclip. Paperclip looks awesome. Cool. I used it. I loved it. Right? But I think people would be more productive if they built up from scratch their own version. Meaning, like, okay, you have your own, like, you know, like editor, right? Content creator.
[15:06]
A
So you're, you're asking people to do the work.
[15:08]
B
100. 100. And because the thing is, it's like, look, I'm in the position where, like, people using like these beefed up things make a lot more sense for me. And the reason being is, like, I could build a product like that. Like, I know what your audience wants. I know what my audience wants. Like, you know, heck, I spin up agents and build this thing. Right? But if I'm going to be completely honest, if you want to scale for productivity, it starts with one agent and you building up the skills. And then, okay, now you've built up some skills and now you add a sub agent and you're one agent manages multiple agents, right? Like, imagine this. Like, imagine I start a company and off rip, I have 10 employees. Never managed a team in my life. Heck, I don't even have A really big family. So, like, I'm alone. You know what I mean? So it's like you have to sort of. Yeah, it's not sexy and I apologize if this is not the cool thing people wanted to hear, but you sort of have to put in the work and build it up. And I, and I personally believe you're building skills, like your personal human skills, not skill. MD files that when the models get better with the agents get better, you will be more valuable. Because at the end of the day, as long as there's no new Paradigm for models, LLMs just predict tokens they don't understand or know the way you and I do. Right. And this is why, although like the job scene and all this stuff is scary, I genuinely believe anyone who knows how these tools work and like, knows how to build agents and like craft skills and like, knows how to make them productive, we're in for a good run.
[16:37]
A
So you're saying that if you know how to do this, you won't join the permanent underclass?
[16:41]
B
The permanent underclass. So is the permanent underclass, basically, like, I've seen this, these, this, this on Twitter a lot, is that basically AI has replaced you, so now you're just.
[16:54]
A
From what I understand, it's once AGI comes, all these white collar workers are going to lose their jobs. And if you don't know how to build skills, use AI, People say you're joining the permanent underclass.
[17:11]
B
That's.
[17:12]
A
That's the term.
[17:13]
B
It's permanent too. That's scary. So I just have a little bit of time left.
[17:17]
A
Yeah. By the way, like, it's ridiculous to call it a permanent underclass.
[17:22]
B
Yeah, because that's terrifying.
[17:23]
A
I can understand underclass, but permanent, like,
[17:28]
B
you say there's no hope. Like, no hope. Yeah. I mean, we are in like knowledge that took 20 people, 20 years to acquire is now like 20 bucks a month. Right. So there is like a huge shift, Right. People who are non technical or I think I saw yesterday, like some guy had like $100 million and he vibe coded the whole app. I think it was him.
[17:49]
A
1.8 billion.
[17:51]
B
Billion.
[17:52]
A
Yeah.
[17:54]
B
So you know what I mean? Like, it is the. There is a shift, right? And I think this idea of like,
[18:00]
A
I love how you were like, billion. You were just leave this podcast and
[18:03]
B
just be like, no, you know what it is? I just realized, man, I overthink things. Like, I just need to drop the thing, release the thing. And there's like wisdom in that. Like, there needs to be this level of delusion which I don't have like, I'm trying to work on where you're like this is just going to work out. We're just going to launch the product, it's going to succeed and if it doesn't, onto the next one because 1.8 billion. Yeah, dude, like B B USD.
[18:27]
A
Yeah, we're not talking monopoly
[18:31]
B
because it was Canadian.
[18:33]
A
It's, it's, we're not talking carney coins,
[18:40]
B
we're talking, we're talking real. Benjamin. Yeah, yeah, that makes sense. That makes sense. But yeah, like I, I hope this like understanding of like again I personally don't think you don't need an Agent MD file unless you have something proprietary. Skills are valuable. Build your own though. Build, build your own. Like you know, like when you asked your mom when you were a kid, oh can we have McDonald's? And she's like, we have food at home, we have food at home. Build your own skills for coding perspective from coding wise, a lot of the companies, model companies have realized that the agents are really good at writing code, particularly Typescript. And this is why there's been like, you see this advancement with like cloud, cowork and like even Open Claw. Really what they're doing under the hood is they're writing code, right? They're writing code, calling APIs and all this stuff. So when it comes to building a project, you actually don't need skills or like you don't need an agent MD file specific to the tech stack. Use. Like I remember we used to, I'm using React and you know, convicts or I'm using next year send Supabase, I'm using this and I'm using that and you put that in the Agent MD file and you have like all these lines for the most part. Unless again you have a specific, specific workflow unnecessary. And the reason being is code itself has become context now. So the more, the more important thing is starting with a solid foundation. Templates used to be big back in the day. People made lots of money with templates. I believe templates are going to have a renaissance because if you have a solid like template, right? Like whether it be like for a web app or mobile app, because that becomes context for the agent. It's going to build on top of that, right? And again I didn't need some large agent on default. I didn't need any large cloud MD file. What I needed was again minimal context usage and skills. So if there's anything anyone can learn from me is build your own skills. Build your own skills. And there's this methodology. I don't know if I've shared this with you. Recursively building skills. So let's say you've built your skill, right?
[20:48]
A
I have.
[20:49]
B
I'll draw a diagram because why not? Let's say I have a workflow, and after setting up my workflow with an agent, I've decided, you know what, I'm going to turn this into a skill, right? So this is my skill md. Now, here's the thing. Even though you have the skill md, the agent at some point is still going to mess up because there's probably gaps in the information it has in the skill. So when it messes up, I'm going to work with it again. How do I work with it? You messed up. Try calling the API again. Try doing this again, or even asking when it tells you, oh, I failed. I couldn't do this task. Believe it or not, when you tell the agent, why did you fail? When you ask it, like, what's the error that you got? It will tell you descriptively, oh, I got a 505 error. You, you have insufficient credits. Like, oh, okay, so it's a credit issue. Fine. So I would tell it that and then I would pass that failure back to the agent. So let's say it did something wrong. We identified the failure. All I did was asking it, I will give that failure back to the agent. I'll be like, you failed here. This didn't work. Fix this. It's going to fix. It's going to write code. It's going to do whatever it does once it fixes it and it's done it right now you tell it with the new fix. Update the skill so this doesn't happen again. I have like, for my YouTube channel, I have like a report generator it calls notion dub analytics, YouTube analytics, Twitter analytics posts from mine. It pulls from like eight data sources. There's no way you're going to one prompt and the agent's going to do it. But every time I tell it to do that workflow and it takes like 10 minutes, it executes it flawlessly. Why? I went through five loops of this, five iterations of recursively building this skill. And that skill is so good. I genuinely think if anyone's going to. If like Skills Marketplace is going to be a thing, there's going to be people who sell skills, like really well defined, like step by step skills, because people are just creating them without having built out the workflow with the agent, right? So use the workflow by hand, like telling it each Step. Once it's done, it completely create the Skill MD file, Continue to use it. It's going to mess up. When he messes up, you thank God you don't complain. Because a lot of people like, oh, I messed up. I'm angry. No, this is the moment where you identify the error. Tell it, this is the error. Fix it. It'll fix it itself. And you tell it to update the skill file so that this doesn't happen again.
[23:23]
A
So that's a little bit about shifting your expectation. Right. Because people just assume it's gonna work. In the beginning. You're saying basically it's not gonna work initially, there's gonna be two, three, five, six hiccups, and over time, it should be good.
[23:41]
B
So this is most people's expectations, right?
[23:45]
A
Yeah.
[23:45]
B
And the way I've personally experienced is it's like this. So there's like this early area of investment that you have to make that sucks, that nobody will tell you. Especially agent harnesses company because they wouldn't raise as much money if they did. But like this, maybe I would give it two weeks because it took me two weeks. Like, open claw. When I first set up Open Claw, I thought the same thing. I'm like, what? What is this garbage? Or like, it doesn't understand anything. It's confused. And then I realized, like, oh, like, let me go lower level. The models and the agents, like, they. They don't think like you and me. Right. Like, I could tell you, hey, Greg, we need a report on, like, you know, the financials and notion. Because you're probably. We're in the same business. We work together. You would understand, based on the context you have of the business, what that means. But imagine a new guy joins, like, yeah, I need to report on the financials. Where do I even.
[24:40]
A
You know what it reminds me? I wonder if we can put this clip in. But in the office, you watch the Office.
[24:45]
B
I am not an office watcher.
[24:47]
A
Unfortunately, there is a clip that there's a new boss. And the new boss goes to Jim, one of the main characters. Yeah. And he asked for a rundown. So go, go. The Office. The office rundown. Basically, the whole episode is about Jim trying to ask around and be like, what. What is a rundown? Like, what is a rundown? He's like calling his dad like a rundown. You know what I mean? He's just. He didn't have the context.
[25:25]
B
Yeah.
[25:25]
A
He now the context.
[25:26]
B
Yeah. And it goes back to my initial point. The models are really, really good now, but the context matters. More than anything, right? So when you see like these large agent like companies and sub agents, and again, I'm not saying those don't work, but I'm saying probably won't work for you off rip because you haven't built it up to get to that point, right? So let's say like for me for example, I started with one agent. Let me draw this. I started with one agent and this was like my main agent. This did everything right. This checked my spreadsheet, this checked my sponsors email and all these type of things. And once I had like predefined workflows, let's say for like working with sponsors, then I can actually have a sub agent. What's the purpose of the sub agent? The sub agent does all the marketing stuff, right? But I'm not creating the sub agent for the sake of creating it. It's going to have skills, it's going to have context and it actually makes sense for me to have sub agents, right? So I've built out my thing to like now I have five sub agents. I have one for marketing, one for business, one for personal, and that's it. And I'm willing to bet if I want open call to open call with anyone, my system is more productive because I didn't scale for what looks cool. I scale for productivity. That was a bar.
[26:45]
A
That was a huge bar. We gotta clip that. I was just thinking that clip. That's gonna rip.
[26:50]
B
Yeah, that was a bar.
[26:52]
A
What else do you want to leave people with? Or is this, this is the main point?
[26:55]
B
Yeah, like here's like the. We've got to a point where the models are good. The models are really good. The context matters, plus the harness, right? So for example, there was this benchmark, although I'm not 100% supporting it, that there was a difference between the quality of output that Cursor generated versus cloud code versus Codex. Right? So what that tells me is that we've reached a point where the models are really, really good. They're probably going to get better. The next iteration is probably going to get better. But the harness and the tools that you surrounded, the context that you give, it is going to matter even more. And just like in everything in life, less is more, right? Like building up step by step, making it productive for you first before you add the shiny new thing. Like, because I tried all these tools all the time, like especially Paperclip. Paperclip blew up and a lot of people have been talking about and it's fantastic. But I'm willing to bet if people took two weeks to build up to the version, because you can prompt Open Cloud to do all that stuff. If they built up their own version of Paperclip in two, three weeks, where like they're building things that they actually need to, their productivity level will skyrocket through the roof.
[28:07]
A
It's a hot take. It's a hot take.
[28:09]
B
Might get me in trouble.
[28:10]
A
No, it won't get. Who's it going to get you in trouble with?
[28:12]
B
Maybe Perfect Clip uses a billion dollars and they don't acquire my podcast.
[28:19]
A
I think, listen, you're, you're out there, you're trying things and you're just sharing what you're learning in real time. So if you're just, you know, things
[28:27]
B
can change by the way. Yeah. Like two weeks from now it could be like, no, give the agent everything. There's this new memory paper that Google released and like now like, it has the ability to inde information and stuff, but as it, as it pertains to real life, less is more, simple is better. Right. If you can't explain it in, in a few sentences, you probably don't really understand it. Right. And I find that the models are trained on so much information, especially when it comes to programming, building and like, and what do you call like day to day work, like financial work or like any sort of like, you know, checking contracts and stuff. Like the, the model companies are focusing on that, like on white collar work. The models are really, really good. What matters more is the harness and the tools you provided. And the one thing that you and I have that the models don't have is my specific workflow, my specific taste, my specific strategy of doing things. And those can be codified in skills. Right. This is why, like, skills make sense when you build them, not if you download my skill. Like I have this one skill. Like, again, don't download it. Do. I'm telling you now, do not download it, don't use it. I just put it so I can get some GitHub stars. I have this one skill and it's literally a code structure skill. And I'll put the markdown so people could see it. It's 116 lines. It's basically after AI has generated a bunch of code. I like it structured in a certain way, so it's easy for me to review it. And like I mentioned earlier, with skills, the only thing that gets added into context is the name and description. So when I look at the name, it's code structure. When I look at the description use when multiple Workflows duplicate the same operational logic when deciding that blah, blah, blah, blah, blah. Some nerd stuff. So when I tell the agent I want to clean up the code structure, it checks the skills it has. It sees the name, it reads the description. It's like, oh, this makes sense. Then it progressively discloses meaning. Once it realizes it needs the skill, then it adds the rest of this, right? Versus if this was my agent MD file. Imagine every single time. And we can actually check how many tokens this is. Let me check, what was it? OpenAI token tokenizer. If I go to this. So this is 944 tokens. So if this was an agent MD file, every single time I have a chat, I'm adding 944 tokens. Tokens ain't cheap now. No, but if I just have the name and the description is just 53
[30:58]
A
tokens and it's not even cheap. It's just like you're not trying to hit the limit quicker than you need to hit the limit because the model
[31:06]
B
will get dumb as the context window closes, right? So if you have like a context window and I can draw this out, if this is your context window and like the optimal is you're between like there's always like maybe like 10% already filled with all the system prompt and all that stuff. You want to be between like, you know, fresh to like 70%. Because the closer you get to 99, 100%, like 99, 90, 80%, it starts to get done, right? And you could think of this like a human. Like imagine you throw a bunch of information again and again and again and again. And this is why, like when I like was in school, like last minute studying never worked for me because like I didn't pay attention the entire year. Now I have to learn about polynomials and I have to do these graphs and there's this weird notation. It's impossible for me to catch up, right? And it's the same way with the agents. You want to keep your context when you want to save your context when. Cause 8 saves you money. But not only that, it makes a more performant agent. So less is more, less is more. Rely more on the model strengths. And what the model needs is what's unique and special about you. Your workflow, your business, not general knowledge. Don't tell the model. Use react. It knows to use react. Don't tell the model. You know, things that like, should already be known for the like, you know, tasks. Like for example, like let's say I'm doing a financial Report. And in the agents MD file I say to denote money, use a dollar sign. It's going to use a dollar sign right now. If you have a specific currency then you like, oh, use this currency. This is the, you know, like for something that the agent won't do manually, like won't know manually. That's when you have like your agent MDs, Claude MDs. But honestly, these are a farce. You don't need them. Skills, skills, skills, skills, skills is what it's at.
[33:04]
A
Thanks for keeping it real. I appreciate you, man.
[33:06]
B
That's all I'm gonna do.
[33:06]
A
No, I appreciate it. Like always. I'll include links where you can follow Ross Mike on YouTube and X and other places in the show, notes in the description. So go follow him there. Always. Clearly breaking down things we. I have to be real with you. You weren't gonna come on the show today.
[33:24]
B
I wasn't. And I'll be honest, I, I told Greg and I'm just gonna be frank. I'm like, I don't have that banger, you know, something new drop in, let's review it. Because if we gonna be honest, they're not that many tools dropping nowadays. Like, unfortunately, the big dogs are running the show. Yeah. The clods and the, the anthropics and the open AI, especially when it comes to general purpose and, and coding, they sort of run the game. So they're releasing updates and like all the stuff has already been covered. So I was like, greg, I don't know if I have anything valuable.
[33:57]
A
And what did I say?
[33:58]
B
You're like the people. You know, you got to think about impact. You got to think about what, you know, this could apply to someone's. And you showed me like a testimony.
[34:05]
A
Right? Like, I sent a text to you. Yeah, I'm going to pull it up. I sent a text to you of someone who saw a video that we did together and that video got him into coding. Now he's running a cake business and he's making $150,000 a year and growing. And he said the Greg and Ross Mike episode in November last year is what got me into coding. I've recommended to everyone asking how to start out. And I just sent you that text. I said, it's not about the numbers. It's not about, you know, because you said in the text you don't see it sometimes.
[34:45]
B
Right.
[34:45]
A
I need everything we do to get to 200k view minimum.
[34:49]
B
Yeah, yeah, yeah.
[34:49]
A
And I'm just like, I hope this gets 200k views or more. So like and comment to juice those algorithms. But if it gets 2002, people end up taking this information and changes their business, their productivity, how they think about things. And I think that's why. I think that's why you and myself have been put on this planet Earth, is to inspire people to get their creative juices flowing. And so I thank you for coming on and taking time out of your
[35:18]
B
day, and I appreciate the motivation. And, yeah, I hope this helps somebody. And can't wait to be back with more.
[35:23]
A
Absolutely. All right, Catch you later, dude.