In this episode, I sit down with my friend Rohit Krishnan - writer of the Substack newsletter Strange Loop Canon - for a hands-on conversation about what it actually looks like to build with AI agents today. Between us we're burning through tens of billions of tokens a month - I hit nearly 100 million in a single day this week - and we share what we're each running on our own machines. We dig into the quirks and surprising power of tools like OpenClaw, Claude Code, and Cowork, debate why AI remains stubbornly bad at good writing, and zoom out to ask what a world of trillions of agents might actually look like — and what economic infrastructure it will need.
Loading summary
A
On Wednesday, Rohit, I fell just shy of 100 million tokens.
B
You know, it is intelligence on a tap. Computer used to be a job that people did. There used to be like room full of people who were computers. They would compute things. And now it's machine. The next step is analyst.
A
I think it's not unreasonable to think that we will have hundreds of millions and then billions and then tens of billions and hundreds of billions of agentic systems running around the Internet as automated infrastructure in a few years time in that world. What do you think an agent meaningfully is?
B
I called it Homo agenticus. Suddenly the world becomes one where agents are the ones that are interacting mostly with each other. Going to websites for us or like doing transactions for us using your credit card. All of these things are the things that the agents need to be doing. Which means like when you have a trillion of them, we need different coordination guardrails for them to be able to do tasks for each other.
A
Agents will need some mechanism to exchange value between themselves. Where it won't so much be about transaction costs and communication costs between employees, but it will be about security and verifiability costs.
B
I get LLMs who review all of my essays and I throw away almost all of the comments that it gives me. I can see that it pushes it towards a meme. It's like, oh, you were too colloquial. Here, take it out. It's like, no, like without that, you know, it'll read like the back of a breakfast cereal army.
A
Arnold actually moves house this weekend into a new Mac Mini. It can see my lights and I can turn them on as I walk to my studio from the house. And a few other things.
B
The hardest leaps for my wife was I don't need to think about what I need to ask it before I ask it. It's like, don't worry about it, just talk to it like you know it's your analyst and it'll just do things for you or your husband. But it listens better than most husbands that I know, including myself. It at least remembers things.
A
Today my guest is my friend Rohit Krishnan. Many of you may know him from Strange Loop Canon. It's where he writes his essays on substat. It's very hard to describe what Strange Loop Canon is about. Let's just say there's not a lot of canons in it, but it is quite strange. I absolutely love the essays. Business, Technology, Economics with rigor and clarity. Rohit, you're so distinctive. You're an engineer, you're an economist, you're a hedge fund operator, you're a builder, and today a coffee drinker. Now, we have both been experimenting extensively with AI and more recently with what people are calling AI agents. Thinking through practical questions like how do you get these damn things to do useful work through to theoretical ones like what would an agent economy look like? And of course we've been playing around with openclaw. We are going to try to cover all of that. So let's get started. Ravit, thank you for making your Friday morning available.
B
My absolute pleasure. Thank you for having me. I mean there's very few things that is more fun on a Friday morning than to do this.
A
Oh yeah, no, I, I've been looking forward to this. I've been building a lot using, using AI tools, you know, Claude code and replay and other things. And you have as well. So, so, so what's on your screen right now? What are you working on?
B
I have a codecs thing running right there. I have three screens that is. That is basically doing a music creation little thing. I just made it for me where I can play on my. A computer keyboard and actually play music. So that's number one. I have another thing running here which is my polymarket prediction and foresight forge pun. That is the daily prediction thing that I run for myself. I have two telegrams running there which is talking. One is talking to my open claw, one is talking to my own telegram, one for all my daily to dos and all that kind of stuff, normal stuff that you might do. And I have a last one running here which is a long standing project that a hobby horse I have of like, I call it Horus. I wrote a post about it a while back trying to see whether I can get AI to write better, which is as we all know, the hardest thing for an AI to do.
A
I love what you've got there. I have committed about 300,000 lines of code this year. It has been really, really nonstop. And I was reflecting that my coding world has changed remarkably with Opus 4. 5, now Opus 4.6, but also now with these Openclaw agents. Openclaw, correct me if I get this wrong. Rovit, it's an agent orchestration framework that you can run locally on your machine or in the cloud. It has a persistent memory and tool use and you can. It kind of works the way a science fiction software agent works and people have gone crazy for it. So you've been doing some open core work. Did I get the definition right by the way?
B
Yes. I mean basically openclaw you know, it went through a bunch of naming changes but basically what OpenClaw does is that like you, you have these LLMs that can do useful work, but useful work that you wanted to do on your computer is different to like if you can actually access it on your WhatsApp, on your Telegram, on your Slack, you know, signal, whatever, what email, what have you. OpenClub basically is a way that, that like it's a kind of built this to actually connect to all of these normal data sources that you would need it to talk to and you can make it run persistently in some environment, wherever you like. You know, I'm running it on my laptop, you're running it on a Mac Mini. People run it on the cloud, whatever you like and then you just talk to it just like you normally talk to an LLM. And it can take actions for you like scheduling meetings or like responding to messages, cleaning up your inbox, getting a daily to do list. I mean all of these things that I do on a daily basis now. I mean I have tried using to do lists for example, something that I've tried using for 20 years now. I have never ended up sticking with one because they all suck and like they make me do different things and I like the longest I use them for is like four days. But this one, it works because I need to change zero habits and it does the work that I should be doing which is like read through my emails and tell me what I'm supposed to be doing today. Compared to the history it's not a hard job but it does require a little bit of grunt work that somebody needs to do. Until like three weeks ago it was me.
A
Right. So tell me about how you, you work with your agent. Do you have a name for your agent?
B
I don't, I haven't. The main reason is I haven't settled on because like, I mean right now I call it Morpheus because it's, it's the name that I used to use back in the day but I don't know whether it's going to stick or not. But I do want, I have it just because like you know, it's on my telegram. So like I need, I need to call it something and it's on a bunch of my groups so I have to say at something before I can call it. It's called Morpheus.
A
My agent has had a name change as well. So my agent was called Mini Arnold because in the Second Terminator the Arnold Schwarzenegger character comes back to protect humans and Protect John Connor. And then Chantal, who's on my team, pointed out that we had agreed a year or two ago that we should, if we do have agents, we should call them R something per the Asimov rule. R. Daniel Olive Oil. So Mini Arnold has been renamed R. Mini Arnold, which is quite a mouthful, but I still call it Armini Arnold. So when you're working with this Agent Morpheus, what is the type of thing that you do? When do you interact with it? Just get us to some specificity.
B
Sure. So let me put it this way. Like a year ago, two years ago, actually, the most used app on my phone would have been probably like WhatsApp or Slack, right? Like email a little bit, but like WhatsApp or Slack. Sometime over the last year, the most used app kind of became chatgpt. Because I would use it for everything, as a. As an interface to ask questions, learns, etc. If I get 5 minutes standing in a grocery shop, I would be asking something to kind of response and so on and so forth. Today it's almost like I'm moving back to the Slack world, except that now ChatGPT is in there. So now I can ask it any question that I want about anything that I want, and it becomes the interface with which I kind of get against the world. So I do three kinds of things with it today. And I don't think this is the like height of what can be done. This is just what I feel happy doing today because I've given it all my. I have a personal laptop here. I said, like, this is your home, you know, you get to do whatever you want. You want to mess up the file system, I don't care. Go have fun, right? Everything is backed up. So now I run it here. For one is daily tasks to dos, that kind of stuff. The second thing that I do is like email triage and inbox and responses and stuff like that. It occasionally gets something wrong, but like it's fine, right? I mean, it's not the end of the world. We can deal with these kinds of things. So figuring out which emails I should respond to, how deleting all the junky ones on run sort of regularly, that kind of stuff. The third thing that I use it for is like interacting with this machine to see whether I have stuff running. Like I have a piece of code running, can you interact with it and tell me how that's going and like rerun it or you know, get the terminal access and kind of talk to those kinds of stuff. So like the third one is probably the least personal assistant, if that makes sense. Because you're. Even if you had a personal assistant, you're not going to tell them like, hey, go check my computer and see how that terminal code is running. Give me the response, then I'll tell you what to do. And you go, right. That doesn't happen. The first two are very personal assistant. It's like, it's like having a, you know, your, your own genius personal demon kind of go and do tasks on your behalf and just like clean up your admin life so that you can kind of solve that.
A
These are like, how fast is it responding to you and how often are you pinging it?
B
Oh man, I don't even know. I'm. I ping it like regularly, maybe once an hour at least, sometimes very regularly. If I have something immediate, ongoing in terms of sort of stuff. Like anytime I'm walking around and I'm like, oh, I need to do that, I throw it into this. And then it kind of takes care of everything else that needs to happen at a stateful way in the backend.
A
This is your open core Agent Morpheus. Have you added any new skills to it? Because there are loads of skills out there that you, you can add.
B
I haven't actually, because I haven't found the. Because I've, I've hand rolled a few things, but I haven't really needed to. I found that like when I add skills it is for some specific thing that I want to kind of rerun repeatedly. And what I find now is that it's all my ad hoc stuff. Right. Because if I know something I need to do repeatedly, then there's usually a workflow that I've built and kind of runs like the prediction one or like the daily, daily news forecast one. I already have an agent running for it, so I don't need to do that in my openclaw. Openclaw is like my personal butler. I mean I get to ask Morpheus anything that I needed to do today. If it was something that I would do every day, then that becomes an ongoing thing that can be run somewhere else anyway, I don't need Open Claw to do it.
A
So I tend to take the view when I'm using these tools that I want them to do the most useful things for me, not the, not the trivial thing. Right. So a couple of the skills I've given Armini Arnold are the Foundry library and the reflection library. So these are self learning skills. And so during the context discussion I can say learn this. And so it now has Learned that I like executive summaries and bullet points when it was sent back to me. I like the date format which is reverse date YY MNBD before every piece that if it's asked a task that is complicated, even if I don't specify a plan, it needs to plan and give me the plan. And then I've had some really, really surprising things. I had it designed effectively an email fortress, like a protection fortress against prompt injection attacks. And it has these systems, six or seven layers. It went off and found some academic papers and some red teaming data and built a test rig and we've got that in place. I gave it one of my apps which was a multi agent deliberation system. So you can put a question to a panel of agents with different personalities and they argue over 20 minutes. And it used it and it said this could be very useful for the analysis I do for you Azim. However, this is designed for a human so can I just build it as an API driven version for me and I, you know, guys, I've got. And it did in four minutes replace my work which had taken me a day, right. I gave it a credit card. It's too nervous to spend any, any money. It's, it's got a 50 prepaid credit card and you know, it's still like should I do that test later or not? And like you, so I don't do the email work but I do do detailed ad hoc things with it and I cannot understand why it is as good as it is. So one example I need, I got it to do a market report on Propic and I said here is access to Prism. So Prism is the exponential view research backplane and my open core agent Armini Arnold went to prism, downloaded over 500 analyses we have on Anthropic and wrote the best thing in 10,000 words that I have seen about Anthropic. Far better than what I get out of a GPT 5.2 Pro deep research query or the Manus Deep Research query. And I have no idea what's going on under the hood in openclaw but it was remarkable. It has kind of changed my way of working.
B
The one way that I think about this is that with Codex or cloud code, if you use it and you ask it to kind of go do a piece of task, it is remarkably good at being agentic and going and doing it right. And in one way that I like Open Claw is like you now have a front end and a bunch of connections that you've given it so like, ultimately, like I do this all the time of, you know, I have a, like various databases of different things that I have running. Some for fun, some for work. I would routinely just spin up a Codex instance because I use Codex quite a bit and say, like, hey, go write me a report about this or tell me something about this. Go figure something out. And it'll run for, you know, 20 minutes, 30 minutes, go.
A
Can I just check? So this is a little bit of a hack because for people who may not know, Codex is a code development interface that OpenAI runs. It's their version of CLAUDE code. And you're getting it to do market research, essentially.
B
I can get yeah, everything. You know, one things that Frontier Labs are really bad at, just like LLMs are naming things. So cloud code is not for code and Codex is not for code. Right? Not just for code. They're amazing at code. But what they are is effectively like LLMs in a loop that works. So you can give it a task and it'll figure out some version, it'll create a plan. The plan will have 25 steps. It'll go through, like in other files, it'll go research the Internet, whatever, keep doing things in a loop until it gets an output that you actually want. Obviously it works really well for code, but also it works remarkably well if it's not code. Right? I mean, some, like your report is not code, it's words. But the process to get to the report would require a bunch of iteration and these things are really good at doing it. So something that I do regularly is like in what in the folder? I'll right click the folder open in Terminal Codex and then I can ask it any question that I want that about stuff in the Codex. I did my taxes this way. I did like, I did my taxes last year this way as well. Because I am not. I don't care enough and I don't want to spend that much time like, just go, you go do it.
A
Can I ask you that the Implaud Cowork, which is the platform that Anthropic has that was spun out of Claude code co op, I found incredibly powerful as well. And I use that extensively and I was blown away by a report it did on some of my investments. But it has some tool use and it can call on tools. Can the code versions like Codex use tools as well for analytics?
B
It can do the same thing. Like it can, you can give it skills, you can give it tools, you can give it whatever. And it kind of does it. What I find is that like Claude Cowork is more of a. It's a better interface because OpenAI is also pretty terrible at like products as we know from their naming schema. Right? I mean, like you just look at it, but they do roughly the same thing because you can give them, you know, MCP access to kind of various things like Player.
A
I can just explain what MCP access is just so you may.
B
If you have a piece of code running in a loop on your computer and you want it to talk to something, a different application, it needs to go talk to that application somehow. It needs to either use an API, which is an application programming interface like it. It has to talk to that API, it has to talk to that database somehow. MCP is basically just a model context protocol. It's just a wrapper that you put around that such that like the models know this is the protocol by which you should be talking to something. So you want to use Chrome. Google has just shipped Web MCP such that like agents will now know how to use Chrome better. Rather than trying to act like human, you know, open it up, go to X, Y coordinate, click. Which is kind of a terrible way to do things. But I think cloud code codecs, these things are really, really good at like using the tools of our given device because that's how they're trained, right? I mean, they're trained on a computer. So there. Anything you can do with a terminal, they should be able to do. And as we all know from every horror story, you can do anything you
A
want from a terminal, including indeed the two of us. You know, we are standing on the edge. We might not be at the absolute bleeding edge, but I don't think we are in the laggards category of using these tools. So I think it's helpful for people who are listening to give a sense practically of what you can now do with. With a tool like Claude Cowork, which I think you can get for 25 bucks a month or something, or what I'm able with my open core agent, Armini Arnold. I'll give you an example. I read a story in the financial press yesterday that said there's high dispersion in the stock market. Now. What does this mean? It means that the NASDAQ index is staying about flat, but stocks are moving 11 or 12% over time because sums are going up and some are going down. And that is a warning signal that it's a 99th percentile dispersion. And it may mean that there is a severe correction about to show up Not a bubble bursting, but a 15, 20, 10, May, 60% like it. Perhaps I didn't really understand this idea. I'm talking with more sense now than I would have done yesterday. I just copied that and I threw that into open court and I said, go and make sense of this for me, right? Go and compare it to my portfolio and explain what's going on. Take your time, build a plan, spin up sub agents, because open core can spin up multiple sub agents that go off and do other things, which is a better way of doing things. And it came back in, in 20 minutes and it was a really, really good report. It had gone out and got historical data, they've got stock data and assembled this for me. You know, like you, I was leaving a car journey, fired that off, finish the journey. There's. There's a report in place which I've now acted on, right? I've now acted on the basis of that. So that's a kind of practical, very hard thing that might have taken me a day and I would have never got around to it. How about you? What's it, What's a great example for you?
B
There's this essay that I wrote, I think three years ago. It basically said, like, computer used to be a job that people did. Like when we went to the moon first, there used to be like room full of people who were computers. They would compute things. And now it's machine that, like, you just, you do everything on the machine. And I said like, the next step is analyst. That was my kind of provocation in that, in that particular essay. This was sometime in 2023, and I feel like today we kind of have that. It's, it's remarkable, the speed, like anything that I want to analyze about any subject, any topic, anywhere is effectively at my fingertips. The only thing that remains is for me to ask, like, that is the thing that remains, right? So, I mean, it's really hard to explain, you know, where would you use it? Because I use it like, this is my prism. This is the prism with which I interface with the every part of the world. Anything that I don't understand, I ask. Anytime I see something similar to that that I kind of want to find out, I go and either search or create a report or get it to kind of do some work for me. Like if I'm trying to figure out, hey, I want to do, you know, this particular project, which book should I purchase or buy? Can you kind of compare and contract? These are simple ones or more complex, if there is any analytic work That I kind of need to do either as work or like for myself. Financial analysis. It's not like the components are, you know, quantum physics. I mean these are solvable problems. And there is a 10, 12, 20 step process that I would have taken if I had enough time and energy to kind of go off and solve that problem. But I don't have the time, which means most of them would have not gotten done except now I can just do it, right? I mean I can spin up a report. It's amazing that like one way to think about it is that you have an analyst at your disposal and you can spin up a report about anything in the world in 20 minutes. Will this be the best report in the world? Maybe not. But like it's a extremely well researched 90th percentile report on any subject and topic that you want in 20 minutes. I built one for paleontology recently just because like my son and I were talking about it and I had this idea that like when climate change kept happening in like history, what is the relationship between the variance in climate changing and like the number of taxa, the species that actually existed? Like I'm not a paleontologist, right. I love the subject but like I can now solve that. I have, I have a project for you to actually solve that problem for me.
A
It is amazing and I just wanted to give a flavor to people who perhaps have not had a chance to get a handle on Claude Cowork or on Codex or an Open Court agent. How it changes your behavior. I'm going to give you a little bit of a shocking statistic here. It may not shock you, but my token usage has increased a little bit since I started using openclaw. So just again for people to understand, a token is about three quarters of a word. If you go back and forth, typically with ChatGPT, you might use four or five thousand tokens in a single instance. If you use a deep research query, it might be tens of thousands or hundreds of thousands of tokens. Personally, I was probably using a million tokens a day with various workflows that are running up until openclaw and on Wednesday Rohit, I fell just shy of 100 million tokens personal usage per day. And in fact this week I've averaged about 80 million tokens per day. And each one of those is generating value because I know how much I'm paying. I'm paying tens of bucks a day, so it's thousands a month and I'm getting that value back and I can see it and it is Unending. It's what spurred me to write this update to the boom or bubble question. Saying, listen, demand is really off the charts and it's going to continue to go off the charts. So 100 million tokens a day is my new baseline.
B
It is crazy that like I did this analysis I think last year. So I built a tool to kind of monitor all my agents usage on my computer. Again, one of those things you can do because you have a question, you can build a tool for it. So I did it and I built it in Rust. For those of you who are kind of listening. It's not a language that I know how to code in, but I was like, now I can, because I don't need to know because I know Python and I know C, but I don't know Rust. I can do it anyway. So I built this tool and I learned that in the last quarter, the Q4 of 2025, I spent about 17 billion tokens, give or take, in Q4. Then I realized in month one, in January of this year, I spent 50 billion. The spike is. And once you see the trend line here, and the thing is, I distinctly remember because it was only like two years ago when you would like do token counting, when you're sending a query, when you would be like, oh man, it's like 10,000 tokens, should I kind of send it? It was like two, three years ago, it wasn't that far back, but now it's like it said 50 billion last month. You know, it is intelligence on a tap that you can kind of turn on and you can do anything you want with it. I mean, why would I ever do a research report on like marine mammal, you know, fossil convergence? Like there is no logical reason for me to do that except that now I can because I have a piece of curiosity that I can actually solve for. And that applies both for work as well as for like life, right? I mean, you can solve anything you like.
A
A quick note. If you want to support us in bringing more of these conversations to the world, please consider subscribing to the show. I thought, Horace, your writing tool was called Walter.
B
No, no. So I have a, I have, I have two. Walter was my first attempt at trying to do something that actually works properly. Now I have a different one called Horace. And the purpose behind it is basically to say, like, can you measure literary cadence across different texts, learn from AI and be able to sort of test it? It's a hobby horse, right? Because we are both writers. And one of the things, one of the weird things about the last few years as being writers is that there is a text generation machine that someone created, right? Like it is the world's best text generation machine. Turns out what it's worst at is actually generating good text. But what it is amazing at is like generating code, generating images, even videos. It's getting really good. But if I ask it to write like a four paragraph like essay even it's. It's not very. It's not engaging, it's not fun to read and it just annoyed me. So that is why try to figure out, can you get the cadences of these like writings over a period of time and then try to capture from it some lessons, train a model against it and then be able to deploy it so that I can test various pieces of writing, saying, is this literary enough, is this well written enough, does it have the right cadence enough, etc. Basically just to get AI to say like what AI generated writing is, is slop.
A
So Nathan Lambert, who runs Interconnected, wrote this great essay about AI writing and he said AI writing is distinctly mid. And that has been my experience as well, right. It is in the middle of a statistical distribution. It doesn't just delve if this is. And then that's quite a lot. There are lots of these signature tells that uses a lot of Latinate words. And so, like you, I have been building a tool just as an experiment and mine is called Brocker. So Brocker is the language center of the brain. And the ambition of Brocker is to be able to get one of these things to write well. So I think we have a chance to maybe compare notes.
B
So, yes, we should absolutely do. That is a fantastic name, by the way. It is a brilliant, brilliant name.
A
I'll tell you where all of the names of our products come from. They come from scientific themes that are connected to the thing that we're trying to do. So there's another one that is called Prism. And Prism is the research backplane, because you kind of look through the Prism, the research backbone. There's another one called Scintilla, which was the early signals detection. The trouble is I've built so many, I actually can't remember what any of them, any of them.
B
See, when the names are too cool, you can't remember all of them. But you know what's really funny? I have a daily running thing that generates predictions that I read sort of as a different lens to watch news. Right. This is the only one where I let the AI name itself and change it over time. Yeah, guess what the name was Foresight Forge. It is the most poppiest of the slop names. I kind of love it because like, you know, I give you full freedom and control and you can revise it as many times as you want and you still land on slop. Like that tells you everything you need to know.
A
Yeah, absolutely. And I think if anyone's been using Replit, which is something I use from time to time, the names that Replit automatically gives the software are exactly that sort of structure, you know, like Finance Boundary. And they have to illiterate. They have to be two words. So. So let's get into Horace, which is your. Your new version. How well is it working?
B
It's working. Okay, here's a discovery that I've made before I get into how well it's working. What I found is that my hypothesis was if I take like essays from, you know, or Charlie Strauss's short stories, and if I get AI to write a short story in similar fashion, how well will it be able to distinguish? What is kind of interesting is that if you get the best AIs to actually write short stories, it mimics the cadences really well in the sense that it has learned some underlying structure. It's just that it's like a child putting a leg together. You know, they don't care about the right colors or they might not care about the right shapes. In some ways they just kind of build them and say like, here's a. Whatever, you know, here's a spaceship or here's a castle, here's a dinosaur. And you kind of have to look at that and go like, if somebody wanted to really build a dinosaur, they would have had to change a bunch of these things to actually make it look good. So Horus works okay. Today you can extract some per token metrics. Kind of works well. It's not ready for primetime yet, but it is something that I'm intrigued by because like, if it works, it's a good counterpoint to something like a pangram Labs, right, Where you can take a piece of writing and say like, oh, if I do these things, it actually will sound good. Not just like non slop, but will actually sound good. One maybe snippet of what was interesting is like I first tried it with pure token and sentence based metrics and
A
that was working super well.
B
Then I thought like, just like you said, answer to everything in life is more tokens. So why don't I take a bunch of short stories, essays, novels, poems that I love and just extract the main sentiment of that, give it to AI to rewrite it. And then I swap them. So I can say, here is the sentence, this is what good looks like, this is what bad looks like. And instantly it was like a big improvement.
A
If you think about how a transformer works, it's next token prediction. It's clearly able to divine some lower level structure, some substructure that we don't see, whether it's because of the high dimensionality of the space or some other structure it's finding, but it's still a statistical game. And one of the things about writing is that of course I have a style of writing and you can recognize my writing. If you compared an essay from Tolstoy to mine, well, his is better anyway, but you'd know which one is mine. And so the approach that I've taken has been to take a bunch of my writing. Unfortunately, I've got hundreds of thousands of words and view. The writing is a kind of fractal process. There are lots of layers. There's paragraph length, there's threads that run through, there's sentence length, there's word cadence, there is lexical etymology. Do I use Latinate or Germanic words? And I use some really traditional natural language processing tools, sentence parsers, fragmentors, tone concept, argumentation, style, to build a fingerprint of my writing. And then I use that writing to those parameters to pass as you must or you shouldn't, to the LLM. And effectively we've gone through. I have a little bit of a testing rig. So it writes, it writes an essay, I look at it, I go, I mark it up, it goes back through and it figures out how to adjust the weights and how good is it? It's a B plus at three paragraphs right now. It's been a fun, fun process. It is teaching me a lot about my own writing, frankly, which is quite interesting. I didn't know, for example, that I use 80% Germanic root words rather than Latinate words. Uh, most LLMs are about 60% Latinate, it turns out.
B
That is a fascinating. I mean, the LLMs have these weird, like peccadillas, right? And different ones have different sort of interests. Like, for example, I get LLMs to review all of my essays and I throw away almost all of the comments that it gives me, because all comments, I can see that it pushes it towards a mean. It's like, oh, you were too colloquial here, take it out. It's like, no, it's meant to, like, without that, you know, it'll read like the Back of a breakfast cereal. Who wants to read that? It has to be interesting for somebody to actually understand it. Or I think writing weirdly is like the hardest eval for all of them because for math, for science, for research, you can almost still do like LLM as a judge and get them to figure out if it's going well for writing. You kind of still have to do it because they have no taste yet.
A
I don't think they have taste. But I think the fact that you identified as being so hard is, is also true about writing. Well, in, in general. So all of us write and, and, and, but most, most people don't write well enough. And I think if you go off and you look at really great writers, whether it's looking at a George Orwell or you look at literary fiction and you, you think about what goes on in the construction of that, that world model, that mental model of the world. What gets disclosed, what doesn't get disclosed. When does the author use a sentence that runs over four pages? When they use a sentence that is one word long, these are deliberate choices. This is not page rank with a teleport every 10th link. And that model is built in the head of the writer. And I think it's extremely hard. I think that's why you'll often find places which have got exceptionally brilliant people, like an investment bank, the type you used to work in, or a top consultancy. And the written output is terrible, which is why these consultancies hire so many writers to try to polish the work afterwards. So I think it is a testament of, I mean, what is it, is it that when we write we are combining our assessment of empirical space that we are considering our experience and some measure of interiority that is not captured anywhere else? Is that what's going on?
B
I suspect so. I think one way that I've been thinking about it is that like LLMs are incredibly good at learning patterns. You throw any data at it, it can identify hidden patterns in there, if it has seen it in the training data in some sort. And whether that's pre training, post training, RL, whatever, it kind of doesn't matter. It's really good at that. But every test that I have done kind of shows that it is very happy to identify a set of those patterns and then once it identifies, it just kind of applies them. Right. I mean, there is no, there is no interiority necessarily in the way that you say it now. It's really hard to talk about it because anytime you understand a concept well enough that you can nail it down to, like, some specifics, obviously, that can then go into the training data again and like, that loop continues. So it's a little bit like catching water. But at the same time, I do feel that, like, when I'm reading, you know, if it's Orwell or like Charlie Strauss or whomever, if I'm reading an author that I love, there is a sense of who the author is in the book that I'm reading. It is not just a construction that usually stands completely separate to who the author is. There is a personality that kind of emerges. And LLMs are still kind of flat in the amount of personality that they actually are able to kind of push out and emit. They have such a strong tendency to come back to the median. The main patterns that they have learned that these kind of, I don't know, non linear, non easily interpretable patterns that might exist in writing is the hardest one for them to stump out. Because the. The point of, like, can you get this point across? They're very good at it. But can you get this point across like a Zim? They're not so good because they're like, now they have to kind of mimic Azim, and maybe they can mimic you in the surface. But, like, there's a. There's a lot going on. Right. The essay that you write on a Friday is different to the essay that you would write on a Thursday.
A
Absolutely. I mean, and we know this because we test it all the time and it's like, well, yeah, that's one framework that I used 10 years ago. It's suitable. Then you don't bring it out. Now. I want to move on to something else in a second. But I'll give you one thing that I found quite helpful. So this is a tool that I built called Mirror and Mirrors back to you. So it reflects a process that I go through when I'm writing myself, which is I ask myself questions or my team asks me questions. And what I do with Mirror is I will come up with, like, the thing I want to write about, the argument I want to make.
B
Right.
A
And I will then connect it to one of our data stores, so maybe it gets connected to Prism so it can look at all of our research. And then there is an agent that plays the role of a good editor and is entitled to ask me up to six questions, right? And it's quite tough. It asks me these questions and what it comes out with is a structure. So not an essay, a. An outline structure which I can then look at and use as a Critique of is that the structure that I, that I want. And I think what I said to you earlier that like writing is very fractal and can you get the LLM? I think it's very difficult in the same way as using an LLM for code. If you just try to get the LLM to write the complex bit of software, it will fail because it sticks everything together. If you break the task apart architecture, front end, back end, database security, it's going to have much higher degree of success. And I think the same is true with that human writing where I separate out the structural component from, you know, the, the other components. And that has been one little insight I've had in the last few months.
B
I think that's a very interesting one. I feel like, I mean I do use them to ask like, is this logical? And that is something that they get. They're better at answering than like, is it written well, if that makes sense. But the thing that you said about them being fractal is super important because that's something I think about quite a lot. Because like LLMs now are very, they have gotten good at like sentence level. If you ask them to generate a sentence, they're, they're, you know, it's still not always, but 50, 50, you'll get some good sentences. What you don't get is that sentence in a paragraph, they're not there yet. Or that. What you don't get is that paragraph in like a section or that section in an essay or you know, like as you go zoom out more and more they get worse and worse. Now the funny thing about code is that like, I mean, think about the amount of effort that has gone into getting them to write code, right? Like they've read every single piece of code that is existing out there. They self generate the data and learn again from that code. We are spending, you know, billions, tens of billions of dollars a year trying to get them to write code better. So it kind of is like, yes, they, it, it worked because of the amount of effort that we have put into it. There is no similar effort that we have put into writing because like we can't. Because you need something to judge it before it can go, right?
A
Yeah, that verifiability. But the code breakthrough has been pretty amazing. If we'd had this conversation in September, it would be completely different to December and it certainly would be different to today. You and I have got these, this, this, this at least one agent that can spawn multiple sub agents. I think it's not unreasonable to think that we will have hundreds of millions and then billions and then tens of billions and hundreds of billions of agentic systems running around the Internet and this as automated infrastructure in a few years time in that world. What do you think an agent meaningfully is? What is the mece definition of an agent?
B
Really good question. This is something I'm spending a lot of time working on and actually I'm writing a paper that'll come out hopefully at some point soon about agentic economy and why it might take to coordinate, et cetera. So one starting point is to say that like there are 8 billion humans on the planet and if 8 billion humans actually start using agents in any meaningful sense, there is no reason we shouldn't get like a trillion agents. This sounded fanciful even like a quarter ago or a year ago, but like I already have like 20 of them run and you know, I'm barely kind of touching the surface.
A
Can I challenge you there? You've got 20 agents running. You've got 20 agents, or have you got one agent and a bunch of cron jobs?
B
No, so many of them are like full fledged agents that are actually going off and doing things individually, I think. See, the second part is important because like what is an agent? This is the kind of definitional question, right? Fundamentally right now what an agent is is some version of a persistent LLM that has its context changing continuously, relatively autonomously. Because like ultimately, like your Open Claw instance, you're still asking a question. The question, along with a bunch of context is going into Claude Opus 4. 6 that is doing some analysis and sending you back some information. Like the fundamental unit is still Claude 4.6. Into it some question comes and then it gets says, oh, you know, Azim asked a hard question. I might spin up like six more to kind of go do different things, come back. But it's context management all the way through. All of us have these agents kind of representing us in the world. Suddenly the world becomes one where agents are the ones that are interacting mostly with each other. And if agents are the, you know, we are in the mix, don't get me wrong, but like agents are the ones that are doing the jobs for us or interacting with us, going to websites for us, or like doing transactions for us using your credit card. All of these things are the things that the agents need to be doing. Suddenly as more and more autonomy comes towards the agents, we have to figure out what are the kind of institutions we need. Institutions are the word I use for agents such that they are able to do the task that you gave them to do. Because agents just like you said, are weird, right? I called it Homo agenticus in the sense that like it is a new genus, it does things differently. They would much rather build versus buy. They're scared of spending cash at this moment. They don't really like trade. Like there are these weird things that show up in their behavior, which means like when you have a trillion of them, we need different coordination guardrails for them to be able to do tasks for each other. But if you're having this conversation again in like a year, two year, three years time frame like that 20 will become 200 because we would just have a bunch of stuff that like they would be doing in the background. Things that would cost $1,000 a day today would cost like a dollar a day like two years from now. And you can just, that's that scarcity is gone, right? And then you can suddenly go up and like for as much as my coffee costs, you can kind of do all sorts of incredibly complicated analyses. That is the speed with which this changes.
A
That's a fantastically articulate picture which I agree with so much of it. We may have one or a handful of agents that are the primary agent. I called it the AI Chief of Staff because I don't actually want to remember all of these tools that I have. I want the performance, I want someone else to log it. And I don't want to be poking around, well, which tool do I pull out at this moment? So there is a sense that perhaps that's what the open claw agent or this is exactly a great position for Apple to find itself in. Which would be, we know we can own that, that agent interface and that trust interface. And then there is an enormous number of agents out there. And I think you're absolutely right. You do need to have some transaction mechanism. Transaction mechanism is required because there are still resources and those resources may be latency. Right. Responding 1 millisecond is more expensive than a thousand milliseconds compute energy. There will be things that are non rival risk, but we've made rival because of law. So in certain types of information and data. So agents will need some mechanism to exchange value between themselves and we will replicate it would be my bet some of the features we'll replicate a cosian boundary of where it won't so much be about transaction costs and communication costs between employees, but it will be about security and verifiability costs that I will want internal facing agents that are less sandboxed than the ones that actually interface to the outside world. And then the outside world ones will be interfacing on much tighter contracts and be spending some kind of resource or money in order to get things done.
B
You need some way for the agents to kind of discuss, negotiate, like talk to each other such that they can actually interact. You need some version of identity, you need some version of verifiability that like they did something. You need a unit of exchange. In fact, one of the essays that I wrote like I think two, three weeks ago with Alex Ima's was on Will we need money in an agentic economy? Because we were trying to show that like theoretically agents are like humans in the sense that they can talk to each other, but if you want them to do stuff together, they need some shared medium of exchange so that they can, you know, it's like the Hayekian price signal thing. Like you need something so that they don't need to solve everything from first principles every time, which is not a sensible way to kind of run anything. We are going to start seeing that. We'll start seeing that this year. That's my prediction.
A
It might actually be the use of a certain class of crypto. Right. You know, as a way of exchanging back and forth because I'm not sure the scale. Right. The quantum of some of these transactions. I mean it's really important that there is a, there is some value ascribed, otherwise you can't ascribe value. And what will that be? You know, maybe PayPal and Stripe will deliver Rails for payments with sort of fractional sense, fractose sense we could call them. But I like you agree that there will. There's quite likely to need to be this means and mechanism of exchange. So it's kind of the world that we already know.
B
I mean plausibly that is one way that it could be. Like there is no reason that it should be so de novo new that like because we know how the economies work, we know how human economies work and the human economy that we are having this conversation in 2026 has evolved over like several hundred years, thousands of years where we have tested different things and we have stuck with certain things and they keep coming back. Like medium of exchange keeps coming back, identity keeps coming back. Like various checks and balances keeps coming back. The separation of fiscal versus monetary policy, if you want to kind of go all the way up, keeps coming back. But like every time something like these are in a weird way like economic invariants, just like there are physical invariants. So I would not be surprised at all I would expect to see something similar show up in the agentic economy as well, where we have similar invariants. Now, will it look like a greenback? Probably not. Will it have the picture of the queen on it? Maybe we have pictures of the king. Now I got to get the new pants. But like these are, these are things that will change over a period of time. But like ultimately a function that they solve will not necessarily shift because ultimately that is kind of what we'll still need to solve for like they do need to solve same problems that we need to solve.
A
A lot of questions from people about how should they get started with openclaw and how can they use it securely. So I don't want to pretend it's easy. It's easy to do the one click download and get something running and thereafter there is a world of pain, config files, env variables and so on. But one of the easier ways, especially if you're nervous about putting it on your home network, is you can go and get a VPS, which is a virtual server. A company like Hostinger, Hostinger and Hosting or DigitalOcean both offer a VPS. It's 7 to $15 a month where you could get started and then you can decide how many resources you actually want to give that agent. Do you want to give it your email logins? Do you want to give it your logins to your PayPal or not? And perhaps don't do that at that point. So that's kind of quite a quick way to get started. You sync it up with a Slack channel or a WhatsApp channel. There are, there are instructions. Although it's pretty easy if you download the Mac installer as well to do it that way around. Let's talk about securing it. What do you do about securing your agents?
B
I just, I am more free rolling in this case. I live YOLO because I gave it access and now it's like let's see how much you can screw it up. It's fine. But I don't recommend that to most folks. I would, I would highly, highly say. I mean my basic philosophy is as follows. You need like LLMs basically work on the basis of context that it it is given. If somebody poisons the context like LLMs will not know what is poison versus what is not very easily. As the models get smarter, they get better at distinguishing this. But it's still really really hard. However, on the other side, I want to talk to it on my telegram. I want it to be able to access my emails. I want it to read through all of my previous history and my documents and stuff. If I wanted to do that without detracting from functionality, it's really hard to put a security layer on top of it. So step one that I would say is like connected to something where you're okay with like it, not sort of it going off into the world and like screwing up something. Right. So the easiest way that like I have onboarded people is to say like, choose a folder on your computer, download Claude code or codecs, open it in terminal. I know it looks scary, black, whatever if you're. But it's not. It is the easiest thing in the world to use because if you open up a terminal and type, your interface is literally just typing. It just looks like it was built in the 1990s because it was. But like it's still a much easier way in some ways than to kind of install openclaw. Because installing openclaw it's again, it's not that hard. You gotta install something on your thing, but you got to connect it to Telegram. Then I would say like, just use it for a day so that you can just talk to it. Get used to that interaction method. And then over a period of time you can say like, oh, I wanted to connect to my Google Calendar as well. Okay, I need to download Gog and you check and somebody will tell you NPM installs here and like it'll work, start working, etc. And then you can kind of do it. Then you can say like, okay, maybe I should. I'm doing it long enough. I need a virtual server. I need to get Hetzner or DigitalOcean or whatever. Then I can like, I would say like go slow because people get scared. Because computer installation instructions, if you're non technical, look like gibberish, because they are gibberish. Right? Like none of these words mean anything, but you can just stop worrying about it. Think of them as like a magical incantation, copy, paste it in and go. That's at least my advice.
A
Yeah, I think that there's some sense also in getting to use Claude code or cowork in the first instance so you get a sense of what the agent experience is like before you jump into openclaw. So you know, get the premium version, get Claude code and cowork, try some experiments, build up your ability. The trouble with the BPS hosted service is that it doesn't see any of the stuff inside your home. And so our Mini Arnold is on a Mac. Our Mini Arnold actually moves house this weekend into Its new Mac Mini, which has got much more RAM because it needs to run lots of sub agents. And it kept hitting memory pressure and PS kills that were killing processes. But as a consequence, it can see my lights. It can see and I can turn them on as I walk to my studio from the house. And a few other things which I can't do if it's in a vps. I have a setup here which is probably a little bit more complex than the average home. You know, I have got a couple of firewalls and a fiber line and I can isolate things. And so that may not be your experience, but I would suggest for anybody who wants to do this, start with Claude code or cowork or Codex. Your recommendation to get a sense of what you can do agentically before you fall down the open floor route.
B
Yeah. Can I just say, like, one thing that I have found is that I set up my wife with this late last year at some point late last year, and I think the hardest leap for her to make, it took a week. But the hardest leap was for her to be like, oh, I don't need to think about what I need to ask it before I ask it. Because it's very common. Like, how should I phrase xyz? Should I do this? Or it's like, don't worry about it. Just talk to it like you know, it's your analyst and it'll just do things for you. It took up. Or your husband, your husband. But it listens better than most husbands that I know, including myself. It at least remembers things.
A
Romit, thank you so much for joining your Substack is Strange loop canon. I cannot recommend it highly enough and of course for everyone else. If you're not subscribed to Exponential View, please subscribe to Exponential View. We're on substack, we're on YouTube, and wherever you get your podcasts. Robin, thank you very much.
B
Thanks, Azeem.
A
This is awesome. Cheers. Thanks for listening all the way to the end. If you want to know when the next conversation is released, just hit subscribe wherever you're listening. That's all for now, and I'll catch you next time.
Podcast: Azeem Azhar's Exponential View
Host: Azeem Azhar
Guest: Rohit Krishnan (Writer: Strange Loop Canon)
Date: February 19, 2026
In this thought-provoking episode, Azeem Azhar and Rohit Krishnan delve into the rapidly approaching era of AI agents: autonomous, task-completing systems proliferating across digital life. Together, they examine the practical and philosophical shifts that come with a “trillion-agent economy”, covering personal workflows, future economic models, agent-to-agent transactions, and the stubborn limits of machine-generated writing. The conversation is rich with hands-on experiences, visionary projections, and candid peer-to-peer exchanges—a must-listen for those at the intersection of AI, business, and society.
Krishnan: “It’s amazing that... you have an analyst at your disposal... you can spin up a report about anything in the world in 20 minutes... an extremely well-researched 90th percentile report on any subject and topic that you want.”
Azhar: “I want them to do the most useful things for me, not the trivial thing.”
Agents as Analysts:
Tool Chains:
| Timestamp | Topic | |------------|------------------------------------------------------------| | 00:00 | Massive token usage; AI as ‘intelligence on tap’ | | 04:09 | Azhar & Krishnan’s agent-powered workflows | | 07:43 | How agents have replaced traditional productivity tools | | 13:37 | How Openclaw & Codex enable richer workflows | | 19:25 | Example of reports and personal research with agents | | 23:15 | Quantifying the explosion of token usage | | 26:11 | The struggle for literary quality in AI-generated text | | 34:04 | The challenge of interiority and real personality in writing| | 37:28 | Fractal, compositional approach to writing with AI | | 39:26 | Defining an agent in the upcoming economy | | 44:01 | Why agents need economic coordination and money | | 46:33 | How to get started with agents, security advice | | 49:59 | Using local/cloud and learning to trust AI agents | | 51:21 | Helping non-technical users adapt to the new paradigm |
For listeners eager to dive deeper:
[End of Summary]