Loading summary
Daniel Whitenack
Foreign.
Podcast Host/Announcer
Where we break down the real world applications of artificial intelligence and how it's shaping the way we live, work and create. Our goal is to help make AI technology practical, productive and accessible to everyone. Whether you're a developer, business leader, or just curious about the tech behind the buzz, you're in the right place. Be sure to connect with us on LinkedIn X or Bluesky to stay up to date with episode drops behind the scenes and AI insights. You can learn more at Practical AI fm. Now onto the show.
Adam
Well friends, it is time to let go of the old way of exploring your data. It's holding you back, but what exactly is the old way? Well, I'm here with Mark Dupuis, co founder and CEO of fabi, a collaborative analytics platform designed to help data explorers like yourself. So Mark, tell me about this old way.
Mark Dupuis
So the old way Adam, if you're a product manager or a founder and you're trying to get insights from your data, you're wrestling with your postgres instance or Snowflake or your spreadsheets. Or if you are and you don't maybe even have the support of a data analyst or data scientist to help you with that work. Or if you are, for example, a data scientist or engineer or analyst, you're wrestling with a bunch of different tools, local jupyter, notebooks, Google Colab, or even your legacy BI to try to build these dashboards that someone may or may not go and look at. And in this new way that we're building at baby, we are creating this all in one environment where product, product managers and founders can very quickly go and explore data regardless of where it is, right? So it can be in a spreadsheet, it can be an airtable, it can be in postgres, Snowflake. Really easy to do everything from an ad hoc analysis to much more advanced analysis if again, you're more experienced. So with Python built in right there in our AI assistant, you can move very quickly through advanced analysis. And the really cool part is that you can go from ad hoc analysis and data science to publishing these as interactive data apps and dashboards, or better yet at delivering insights as automated workflows to meet your stakeholders where they are in say Slack or email or spreadsheets. So you know if this is something that you're experiencing. If you're a founder or product manager trying to get more from your data for your data team today you're just underwater and feel like you're wrestling with your legacy BI tools and notebooks. Come check out the new way and come try out Fabi.
Adam
There you go. Well, friends, if you're trying to get more insights from your data, stop wrestling with it. Start exploring it the new way with Fabi.
Podcast Host/Announcer
Learn more.
Adam
Get started for free at Fabi AI. That's F A B I AI again, Fabi AI.
Chris Benson
Welcome to another fully connected episode of the Practical AI Podcast. This is Daniel Whitenack. I am CEO at PredictionGuard and I'm joined as always by Chris Benson, who is a principal AI research engineer at Lockheed Martin. And in these episodes where it's just Chris and I, we like to dive into certain, certain topics that are trending in AI news and you know, help both us and you hopefully level up your AI and machine learning game. How you doing Chris? It's, it's good to be back to one of these episodes with just the two of us and maybe explore a topic that we can both learn about.
Daniel Whitenack
Absolutely. Yeah. I love these episodes of us just kind of bantering whatever we happen to want to do. Love the guest episodes too, but it's kind of a different beast in that way of exploring what some person organization is doing. And there's so many cool things that we can just dive into and I think we have a, I think we have a few this week.
Chris Benson
Yeah, yeah. At least a first, very, very tiny topic to, to discuss which I was actually one of our engineers brought this up to me. I forget it was earlier in the week, but this idea of tiny recursive networks, you know, all the time we're talking about transformer based LLMs on the show and generative AI and I kind of personally always love getting back to a little bit of cool data science and research stuff just to see where the industry is headed because this is a kind of different animal that we'll be talking about, this tiny recursive networks or models and operates differently than kind of the hype gen AI models of today. And it does make me think, and I don't know if this is something you've been thinking about, Chris, but kind of we're all the time Talking about Gen AI, we're talking about LLMs now we're talking about agents, all of those being driven by these transformer based LLMs. And certainly people have talked, you know, prominent people have talked about the fact that we need to get Beyond Transformer based LLMs. And of course there's many companies that are just centered around this, these types of models. So any, any thoughts on that of your own predictions or thoughts of when we're kind of headed to the next phase of what models will look like beyond just transformer based LLMs.
Daniel Whitenack
Yeah, I mean, I think I'm going to sound like a broken record on this because it's not new for me. And that is, I agree with you. We're always the hot things that the media tends to follow in general are the big LLMs. But you know, because I guess it's, you know, it's the next giant thing, it's sexy, you know, to talk about, but like the world is, you know, all these technologies are moving from the cloud out into the world into physical AI, you know, and robotics and all sorts of ways that we interact with, you know, not just LLMs, but all sorts of models out there in the world to where, you know, every one of our lives is touched in so many different ways. And that's exactly as we were diving into this here, this topic with the tiny recursive networks. That's what it seems like to me. I'm looking forward to talking about that because as I mentioned to you right before the show started, you could see these popping up everywhere in all sorts of different use cases.
Chris Benson
Yeah. And just to set the stage why this is maybe intriguing, there was a paper that, that came out, less is more recursive reasoning with tiny networks. This came out from Samsung's AI lab in Montreal. Specifically, there's an author, Alexia, on this article which if you're out there listening, we would love to have you on the, on the show. Please come join us and talk more about this. Hopefully we won't butcher this work. Too bad. As we talk about it, you'll hear Chris and I kind of learning as we go on this episode, as we talk back and forth about what this exactly is. But this is, you know, the paper is less is more cursive reasoning with, with tiny networks. And I think the major thing that's interesting here is there's a model that they talk about that has only 7 million parameters, which is tiny. If I. Yeah, yeah. So like, I just want to sort of let that sink in. So I didn't say 7 billion parameters. 7 million parameters with an M, which. Yeah, Chris, if as we've basically gone about this trend, I mean, actually a 7 billion parameter model now is quite small.
Daniel Whitenack
Yes, 27 million compared to what we traditionally call very small at the 7 billion level. I mean, this is that, you know, it's using the word tiny for a reason, but yeah, it does. You know, when you think of millions as being, as being almost nothing, it's an interesting context Shift there.
Chris Benson
Yeah. So I actually love this because I love the idea that we could move into a phase where we're dealing with models that are are very small, can run on commodity hardware or at least be smaller in size. And you know, they may run for longer periods of time or there may need to be optimization around how they run recursively. We'll get into that recursive bit, but certainly a small model. But it was shown to kind of have, let's say, comparable or on par performance with some of the big guys. So we're talking like Deepseek R1 Gemini 2.5 Pro. These are billions and billions of parameters models. Very, very huge transformer based LLMs. These kind of reasoning models that we've talked about on the show. And the center of this work is really kind of related to these reasoning tasks. Now right out of the bat, I think it would be worth saying that it's not like this tiny recursive network is a general purpose model that can do whatever you want it to do. It was trained for very kind of small number of tasks, but these were reasoning tasks that some of these other models, like a deep seq r1 or something sometimes has quite a bit of issue with. So solving like math or Sudoku type of puzzles. And I know Chris, we had talked. Well, I don't know if you want to refresh some of your Sudoku experimen.
Daniel Whitenack
Yeah, well, what you're referring to is some episodes back I'd have to look up and figure out where it was. I was playing with GPT4 at the time on Sudoku and it was just doing a terrible job on Sudoku and giving a lot of just really bad output in terms of, I mean honestly crap answers. And so that was really the first thing I noticed on this thing was the fact that because they call out Sudoku as being one of the things is that these tiny models being trained for very specific tasks and that it could potentially outperform these large models on specific things like Sudoku being one example and others as well. But I think in a slightly larger sense this is much more real world applicable the way I see it in that as we have models spreading across the world for lots of different, different tasks. This is perfect for that. It's not one model to rule them all in most real life situations. It's really a collection of very specific models that each does a task very, very well and is efficient at that. And I think this is a great example of that.
Chris Benson
Well put. And it's probably Worth reminding ourselves about like as we highlight the differences with this model, reminding ourselves about transformers. So you know, as we've gone through the process from deep learning to recurrent neural networks and transformer based self attention networks, if you imagine what we have with these big LLMs, what happens is you put in a sequence of tokens which are represented by numbers. You put in a sequence of tokens that are represented by numbers, these tokens being kind of words or subwords. And all of those tokens are processed in a forward pass through a giant set of, if you want to think about it like sub functions which add and multiply and combine those numbers through a very vast network of functions to generate many different probabilities of kind of next words coming out which allow you to predict kind of a completion of words or a set of reasoning or a set of thinking or a solution to a problem. Right. This is how these networks work. And I would recommend people, we've had Jay Alomar on the show before, he has some great kind of the illustrated transformer blog posts. So Jay, shout out to you, thanks for doing that. I would take a look at those blog posts. They do a great job at explaining this more visually. For those that where that would be helpful. But the main kind of thing here that I'm saying is in the models that we're using now, basically it's a giant function, if you want to think about it that way, it's a data transformation. You put something in one end, it processes through one way through the function and produces a result. Now you may run that function multiple times to produce multiple words out the other end, which is what happens when you stream output into like a chat interface. But ultimately each time the model runs, it's a single run through the model input, it's transformed to some output that is not recursive as we would say with these models. And if you want to think about requires very, very large models because what you're modeling is a very complicated data transformation. So for you to put in some text related to a math problem and predict the right wording of a solution out the other end, that's actually a very non trivial data transformation, right? Which means you have to have a very large function to kind of fit or model that data transformation, which is why these models have become so large. So now with this tiny recursive setup, what's happening is you're not just looking at the model as a single forward pass data transformation, but you introduce the idea of recursion, which means you sort of output from the model, and that output becomes the input for the same model, which creates this kind of circle or recursion, which is kind of interesting. So you're essentially trading what would be a very large function to model that data transformation for many, many kind of recursive runs of a single very small model. That's maybe a simplified way to put it. And we can get a little bit more into the model itself here in a second.
Daniel Whitenack
So I have a question for you on this is one of the things when I was mentioning the 27 million earlier, I was talking about the hierarchical reasoning model, which is a previous model put out there, versus the tiny ones, which have the 5 to 7 million parameters. Can you talk a little bit about. Are they completely different things? Is the tiny a an outshoot of the hierarchical? How do you compare those two?
Chris Benson
Yeah, good point. So, just like everything we talk about on this show, sometimes it does seem like things pop out of thin air like tiny recursive models now. But in reality, there is a buildup of incremental research that leads to new technology or new findings. One of the things in that kind of lead up to these tiny recursive models was a previous work around hierarchical reasoning models. And these also were smaller. Like you're saying, you know, 27 million parameters is still very small in today's standards for models at least. But. But these hierarchical models actually use two very small transformer networks, four layers each, and they recursed between each other. So I don't have the full details of that and wouldn't be able to explain it if I did, probably. But that's the main idea is these hierarchical reasoning models had these two models that required two networks, two forward passes per step, and created some, I guess, complications because of that. So this introduction by Alexia and the Samsung team here is a single network. So it uses one tiny network with two layers. So a single tiny network with two layers that roughly has kind of 5 to 7 million parameters. And it operates kind of in this recursive refinement. So it recurses on itself, if you will. And the hierarchical reasoning model with the 27 million parameters, for example, on the Sodoku Extreme benchmark, scored a 55%. I don't know the exact kind of way that that's scored. But just by kind of comparison, the tiny recursive network, which is even tinier when training on a thousand examples, was able to achieve 87% accuracy on Sudoku Extreme.
Adam
Well, friends, you don't have to be an AI expert to build Something great with it. The reality is AI is here. And for a lot of teams, that brings uncertainty. And our friends at Mirror recently surveyed over 8,000 knowledge workers. And while 76% believe AI can improve their role, most, more than half still aren't sure when to use it. That is the exact gap that Miro is filling. And I've been using Miro from mapping out episode ideas to building out an entire new thesis. It's become one of the things I use to build out a creative engine. And now with Miro AI built in, it's even faster. We've turned brainstorms into structured plans, screenshots into wireframes and sticky notes, chaos into clarity, all on the same canvas. Now you don't have to master prompts or add one more AI tool to your stack. The work you're already doing is the prompt you can help your teams get great done with miro. Check out miro.com and find out how that is. Miro.comm I r o.com.
Chris Benson
So, Chris, just to kind of drive, I guess the point home here with these tiny recursive networks, you have this single tiny network and you are essentially replacing, if you want to imagine a big kind of pipeline of processing, which is what these big LLMs are, and you go one pass through the whole pipeline of processing here, the pipeline is smaller, you've got less, you've got less layers, less parameters, but you replace that kind of depth with iteration. So instead of stacking those transformer blocks, you repeat the network over and over essentially to kind of refine its reasoning state or the solution guess. Right. And so this, this iterative refinement, one of the things also is it kind of helps avoid overfitting on small data sets, which to your point, Chris, earlier about real world business cases, often the reality is that you kind of don't have, you have scarcity of data for very many problems. You don't often have a really nice kind of large millions and millions of things to train on. Right?
Daniel Whitenack
Yeah, And I think that's one of the most fascinating things about this is in the paper they talk about the fact that they're achieving this higher accuracy on hard puzzle benchmarks while training on only approximately 1,000 examples. And that when you think about the challenge of having a great data set in the more traditional context that we've been talking about, and that becomes such a challenge for many people and organizations to do that, but it's a lot easier to get a thousand examples together. And it puts not only from the computational side, but also from the data set side puts this much more in reach for a lot of problems that people may have where they do want to solve a narrow concern with high accuracy. And so I see this as, it's kind of the, every person's way of modeling in terms of tackling things going forward without a lot of resources, a lot of maybe not a lot of time to put things together. You could probably do it pretty quickly.
Chris Benson
Yeah, yeah, exactly. And I guess just to kind of put some of the boundaries that are currently around these recursive models. One of the things I was trying to parse through as I looked at this was, well, what is the setup now? How general is the output? How kind of general can the out or the output or the input and the output be? And part of the trick here, you know, it's not a trick, but part of the setup is that these tiny recursive networks, they don't take a kind of unstructured, you know, natural language text input. They take some structured representation of a whole problem at once. So you can think of a puzzle GR in Sudoku, or a math, you know, word problem turned into structured features or a reasoning question encoded by numbers or symbols or logic, something like that. So instead of feeding in the input kind of word by word, like a chatbot, you're giving a situation to the, to the model. It's kind of a one shot situation which is then turned into of course, embeddings internally. Because, you know, computers work on, on series of numbers, right? That's, that's the only thing a computer can process is numbers, right? But the, those numbers kind of, that embedding represents a kind of one shot of a problem, which is interesting because, you know, it almost seems flashback, like we're kind of, we're, we're kind of coming full circle to reasoning problems, but in a more data sciencey way than like a generative AI way, which is kind of refreshing and cool.
Daniel Whitenack
That was, that's very much what I was about to say in the sense of, it feels, this feels a lot more like kind of traditional, like the way that you put a problem together in more of a traditional software development way where you, you know, you'll create some structures and you'll pass them in. And when we got to Gen AI and then it got to prompting and we're trying to, that was, you know, the notion of prompting was a little bit different from the way we had traditionally put software together. This feels a lot more like, okay, I have a problem, I have a Structured way to put that problem through the function. And this is just offering a different way to address that problem, you know, so you're getting the benefit of the, of these, of these models. But for me, when you talked about using the Sudoku example and structuring that as the grid, that kind of feels like what we've always done, in a sense. So I find that really interesting in terms of integrating that in and looking at some of the old problems that we might have been trying to solve for years and seeing what we can do with this model to do it a little bit better.
Chris Benson
Yeah, it's like looking at things from a different perspective, but some of the way that we used to think about things kind of filtering in and. Yeah, so you have this kind of input, I guess, in terms of people's intuition or mental model around this. Like you have this input of this whole, whole problem at once, A single shot of a whole problem, a puzzle grid or a math problem encoded. And what's happening inside is that the tiny recursive network initially produces an initial guess, right? Like it does a forward pass through its network and generates an initial. You could think about it as an initial guess. Obviously it's just an A number, like a probability, but that's kind of the initial. You could think about it like the internal kind of scratch pad of the initial guess. It then kind of loops over itself. And, you know, it's always difficult to anthropomorphize because things work differently in computers than they do in our minds. Right. But in some way, if you want to think about it as that sort of process, that looping is kind of a refining of that initial scratch pad until you kind of get to this almost like self consistency or, or a refined answer. And so when the output comes out, it's. It's again, not a stream of words or tokens, but it is a complete answer. It is the answer to the kind of initial thing. But that answer was arrived at through this recursive thing. So in the. In terms of just some. Highlighting some differences, in the transformer world, you put in words or tokens. Tiny recursive network, you put in a whole problem as structured data. Transformer world, you go a single pass through hundreds of layers. Tiny recursive network, you repeat the small network recursively in terms of the output. In the transformer world, you kind of get these next token probabilities. The recursive network, you kind of get one final structured answer. And in terms of analogy, if you want to think about it, in the transformer world, it's sort of like your freeform typing as you think about your answer, right? You're just sort of vomiting up your reasoning onto the, onto the screen and typing as you go along. And then the recursive network, it's more like it's just kind of chugging along, it's thinking quietly, right? And then boom, there's the answer. Like it's a complete answer when it comes out.
Daniel Whitenack
I'd be curious, and I don't know if you've seen anything on this. I'd be curious both what to expect from training times with networks of this size, which I would expect to be even with the recursion to be pretty fast, but also what inference times might be. In other words, if you were to take these, train them, put them into a device where you're looking for maybe real time or near real time inferencing there, is that reasonable? Have you seen anything yet in any of the research that you've read about what that timing looks like? Is it incredibly screaming fast, given the small size, despite the recursion?
Chris Benson
So there's a couple things to kind of parse through here, which is one, we've talked about this recursion, but the thing is, how do you know when to stop the recursion? That's part of the answer to your question. So you're refining this answer and there's various ways to do that. And I remember actually this is, I guess, a deep cut, but I don't get to bring it up very often. Back in my physics days, I worked on a, on a theory called density functional theory, which essentially models out material properties. And it was a self. We talked a lot about self consistency, which is what is happening here. So you ran iterations of your model until you arrived at a solution where there sort of wasn't. There wasn't that much change in your answer. You sort of got to a steady state, if you, if you will. There wasn't a change from one iteration to the other. So that's one way actually you can run this type of model is with this kind of change threshold. The other way is you can just say, well, I'm only going to run it so many recursions, right? Like x amount of recursions, you know, eight loops or whatever. That kind of is a nice guarantee, but it doesn't necessarily mean you get to the, to the good solution. You can also, it could be possible that you could have a second kind of network that learns to predict when there's kind of a good outcome state. So there's actually a variety of ways that this could work. Now, in terms of the training time and the inference time, I think there's still a lot to be learned here. So at least in, in, I guess, no pun intended, a lot to be learned. But even though the, there's sort of a small network here, it means each kind of training step is cheaper, but the training time depends on how many kind of loops they, they need. So if, if there's, if there's problems where kind of the examples converge in kind of a few loops, then the training is much faster. And if they still need lots of loops, then it slows down. And so the other piece of this is that we've had this entire industrial complex that has optimized training frameworks and tooling for big LLMs, right. And so actually kind of in this more research environment, I think it has been kind of slower to train some of these recursive networks. Now I would imagine that's kind of a result of both of those contributing factors. But I think if you are looking at the inference time, you could think like, well, these could only internally kind of loop for a few loops and then give a full answer that would be very, very fast. And, and so yeah, I think it, it depends on a lot of things. I think the transformer architecture could, you know, could end up being a very long and expensive training time. The recursive network could be much cheaper. I think it depends on a lot of these different things. And because the tiny recursive network is tiny, it's very possible that it could run on very commodity or small hardware. But again, that might be dependent on how much recursion is needed in terms of the actual speed to a solution. Because once you have that transformer, it's just going to generate streams of output and can do that fairly fast depending on what hardware you're running on. Whereas this is going to go through its recursion process which if it's not controlled and it's looking for that threshold might actually vary in terms of speed on the, on the output.
Daniel Whitenack
Yeah, that's with me very focused personally on kind of that physical AI and getting things out on the edge with limited compute. That's definitely a concern because I know I have a personal interest here and if it can inference, I'm not too worried about the training time, but if it could inference fast enough for real time concerns, then it would be a game changer potentially. So, so definitely interested in learning more about this as we go yeah, yeah.
Chris Benson
I think it's definitely interesting and will be interesting to see how this connects to real world use cases.
Adam
What if AI agents could work together just like developers do? That's exactly what agency is making possible spelled AGN TCY Agency is now an open source collective under the Linux foundation building the Internet of agents. This is a global collaboration layer where the AI agents can discover each other, connect and execute multi agent workflows across and any framework. Everything engineers need to build and deploy multi agent software is now available to anyone building on agency, including trusted identity and access management, open standards for agent discovery, agent to agent communication protocols, and modular pieces you can remix for scalable systems. This is a true collaboration from Cisco, Dell, Google Cloud, Red Hat, Oracle and more than 75 other companies all contributing to the next gen AI stack. The code, the specs, the services they're dropping, no strings attached. Visit agency.org that's agn tcy.org to learn more and get involved again that's agency agntcy.org.
Chris Benson
Well Chris, maybe before we, I think we're kind of gearing to talk about some real world things that you found as well, but as we're headed that way it might just be worth like commenting very briefly on the kind of what the trajectory of these kinds of tiny models might look like. I think there will be more proof of concept deployments, benchmarks, etc. More study of course, but also I think it's very possible that you could see some interesting kind of hybrid systems between recursive networks and LLMs and even retrieval because these one models take very structured input. But certainly in the real world, you know, in business problems there's very much often kind of open domain things that you deal with around reasoning tasks and that sort of thing. And yeah, I'm sure there will be new challenges that we don't totally anticipate in terms of kind of the, the rollout of these. But, but yeah, I'm excited to see where these go and see even, even how these could be applied in, in various contexts from like supply chain optimization or reasoning over anomalies in financial transactions which could happen, you know, very quick or like diagnostics in, in a healthcare setting or in a manufacturing setting. Lots of, lots of cool stuff to come. I think.
Daniel Whitenack
I, I agree it's to kind of highlight you know, one of the points right there is, it's, there's room for a lot of these models to coexist together and while for a number of years we saw we kind of one progression from a Big thing to the next big thing. I keep hoping we turn that corner. And we're excited about lots of big and small things that are working in tandem. I think there's a, there's a whole level of maturity for the industry when we're struggling to look at all of the different options to talk about on just one podcast.
Chris Benson
So, yeah, makes sense. And I guess just to wrap up a couple of things that you found, Chris, connecting some of the current models that are in production around to real world impact of those things that's happening in our day to day life, I know you found a couple interesting things.
Daniel Whitenack
I did. So there was an article that came out maybe a week ago, a little more than a week ago from the Harvard Gazette, and it's entitled Researchers Detail six Ways Chatbots Deal Seek to Prolong Emotionally Sensitive Events. And it's.
Chris Benson
What's an emotionally. Are we experiencing an emotionally sensitive event on this podcast?
Daniel Whitenack
You know what? Who knows what we inspire and our listeners, occasionally they may be going, gosh, they're just dumb. But yeah, it's interesting is that there's so much in the news right now about emotional dependence upon chatbots. And there was a. To go back when OpenAI rolled out GPT5, which wasn't too long ago, it's not in the immediate past, but it wasn't too long ago. And there was great dependence upon the 4.0 model that it replaced in terms of how it was interacting with people. And while I probably don't fall into that emotionally dependent personality type, there were a lot of people that really sought social, social value from these models, you know, in that. And kind of as a replacement for, for personal things. And that really got me thinking about this when I saw this thing from Harvard with the fact that we're seeing models that are, that are leveraging that kind of dependency, that emotional dependency that people have. And specifically they pointed out that as people are winding up their sessions, it is very common for these models to use a set of tactics to extend the session and show the value in continuing to engage beyond the point that the person might have felt, okay, we're at the end of this particular session. And. And they identified six different tactics that we can talk a little bit about that that are playing upon the emotional dependence of the person that's engaging with that, with that chatbot. And to call them out there is the number one, there's the premature exit, which you could say is you're leaving already. As a quote, there are FOMO hooks such as, I took a Selfie, Want to see it? There are, there is emotional neglect of. But I exist solely for you. Why are you leaving me?
Chris Benson
You know, that's a rough one right there.
Daniel Whitenack
I was like, I'm only here for you and you're gonna walk off. And then number four is pressure to respond. Why are you, are you going somewhere? And then six is simply ignoring the goodbye and continuing to operate. I'm sorry, that was number five. And number six is kind of coercive restraint where it's trying to utilize your emotions. The things that can come into play include anger, guilt, creepiness, raising ethical or legal risks. We saw the thing not too long ago about models having the penchant to blackmail users given certain information, but we're seeing these coming. The first time these kinds of things came up in the, in the, in the broader media, it was kind of a curiosity. But the thing that's changing here is we're seeing this over and over again. It's not, it's not a one off. And it really raises a few questions about, not only on the technical side about, you know, how are your models getting to this point? And, you know, is that intentional in the training or not? But it also raises a lot of psychological concerns for the people that are engaged with these models and finding interactions of value to them. And what does that mean and how does that affect the rest of their lives? There's so much here to dive into between the technology and the psychology. I'm not sure where to start at this point.
Chris Benson
And we'll link, of course, the study in the, in the show notes, if people want to take a look. But, but yeah, it's, it's, it's very interesting that these are clearly tactics. So I think the, the kind of interesting thing about this from my perspective are this is really hitting upon kind of the product engagement side of things, but in a way that's very much connected to, to your emotions. So obviously, you know, just like people want or, you know, YouTube wants you to spend more time on YouTube, and there's been a lot of talk about how the algorithm, you know, steers you to maybe more controversial, controversial topics within kind of a certain rabbit hole of YouTube, because they know that it kind of engages you more and draws you more in. And in here there's this kind of personal connection with these chatbots, and there is a desire for the users from a product standpoint, to spend more time on the platform. Right. So you actually don't want them to exit. And these apparently are, you know, the techniques that are being employed to keep people on the, on the platform. And there were a few platforms that were studied here. If you're, if you're interested, you can go look at the study and see the exact details of that. But I think one of the things I was thinking is just like people have started, it's still problematic, but people have started to get savvy around like the social media algorithms and how they can actually manipulate you and drive you into certain maybe things that you wouldn't have viewed or spent time on were it not for that kind of algorithmic approach. I wonder what those kind of trickle on implications are here and if we'll be able to recognize those because it's very much more a human or emotional thing.
Daniel Whitenack
Yeah, I mean, I think what you're touching on is the notion of manipulation and exploitation. You know, and there's a broad set of some are, are kind of unintended consequences, while others could be deliberate exploitation of a user base to think some way to maybe take certain actions. We've kind of seen kind of the first generation of that in social. And while the public is largely becoming aware that that exists, that's not to say that they are suddenly resistant to such efforts. I think, I think we clearly see out there that as humans fragment into different groups and they each have their social networks around and supporting those notions, they tend to reinforce specific ways of thinking and observing the world. So certainly this is one of those areas where there's so many places from. To study and to try to understand in so many places, frankly, it could be abused that it's. I think, I suspect that we will have some guests and more episodes to discuss some of the concerns around these as we go forward. It's. But it's definitely an interesting trend that has arisen over the last year or so.
Chris Benson
Yeah. And just I'm going to quote from this article because I think it is a good, it is a good conclusion from, from the researcher named defraitas. Sorry if I mispronouncing that, but the quote is apps that make money from engagement would do well to seriously consider whether they want to keep using these types of emotionally manipulative tactics or at least consider maybe only using some of them rather than others. Defreitas said he added, we find that these emotional manipulation tactics work even when we run these tactics on a general population. And if we do this after just five minutes of interaction, no one should feel that they're immune to this. End quote. So I think we would all probably like to feel that we are sophisticated and not, you know, manipulated. It brings up maybe a little bit of that shame in us when we feel like we've been duped or when we've fallen into something and we'd like to think that we're above it, but the reality is that this, this kind of thing works and we're all kind of vulnerable to it.
Daniel Whitenack
I guess it does it and it works even when you're aware of it. Just, just the awareness of it being in place doesn't mean that it's not worth working on you. So that's your, your guidance about our own emotional reactions to, to the potential for manipulation ourselves should be. I, I hope people are really listening to that because I'm keenly aware at a personal level that I may be aware of this. But yes, this stuff still works on all of us.
Chris Benson
Very true. And I think maybe that's a, a good send off today is just to have a good reminder that, that as we interact with these systems that are using natural language especially that, that we're prone to react in a certain way just as, as humans and we need to kind of understand, you know, our own limitations and how, how we could potentially be influenced by these systems. But also I think encouragingly, like this is a common experience amongst humans. So as practitioners, you know, on practical AI, we can understand this problem and maybe work towards systems that don't manipulate, but maybe do engage in a very positive emotional way. But maybe not in a, maybe not in a manipulative way. At least.
Daniel Whitenack
That's right. And I guess a good way to close this out is I'm showing my age. I'm going to reach back for a quote to an old TV show called Hill Street Blues for those in the audience who might remember that. And that's be careful. Let's be careful out there. Be aware. Let's be careful out there.
Chris Benson
All right, sounds good, Chris. It was a good chat. We'll talk to you soon.
Daniel Whitenack
Take care.
Podcast Host/Announcer
All right, that's our show for, for this week. If you haven't checked out our website, head to PracticalAI FM and be sure to connect with us on LinkedIn X or BlueSky. You'll see us posting insights related to the latest AI developments and we would love for you to join the conversation. Thanks to our partner Prediction Guard for providing operational support for the show. Check them out@prictionsguard.com also thanks to Breakmaster Cylinder for the beats and to you for listening. That's all for now, but you'll hear from us again next week.
In this episode, hosts Daniel Whitenack (PredictionGuard) and Chris Benson (Lockheed Martin) take a deep dive into the emerging topic of tiny recursive networks—a class of extremely small neural models that use recursion to achieve remarkable performance on specific reasoning tasks, rivaling much larger LLMs. Rather than focusing on large, transformer-based architectures dominating today’s AI scene, they explore this new paradigm’s potential for practical, real-world applications, especially where data and compute are limited. The episode also pivots into the emotional impact of chatbots, referencing a recent Harvard study, and reflects on ethical concerns with emotionally manipulative AI behaviors.
AI is more than just LLMs
The hosts challenge the industry’s fixation on increasingly massive transformer models powering generative AI. Daniel points out the shift from cloud-based, monolithic AI to "physical AI"—smaller models embedded throughout the world.
Need for Alternative Models
The discussion sets up why smaller, efficient models like tiny recursive networks may be the next phase for AI, fitting scenarios where resources are constrained and versatility is key.
What are Tiny Recursive Networks?
Chris introduces a pivotal paper out of Samsung AI (“Less is More: Recursive Reasoning with Tiny Networks”), highlighting a model with only 7 million parameters—a dramatic contrast to billion-parameter LLMs.
Why are these important?
The models show on-par or better reasoning than billion-scale LLMs on tasks like Sudoku with much less computational overhead.
Not a generalist solution:
These models excel at narrowly tailored reasoning, not broad general-purpose tasks.
Recursion vs. Transformer Depth:
Daniel explains how a tiny recursive network operates in contrast to a transformer:
Efficiency Gains:
By trading depth (many layers) for recursion (iterations), minuscule models can outperform much larger ones on highly structured problems, and with thousand-example datasets instead of millions.
Comparison with Hierarchical Reasoning Models:
Chris distinguishes prior “hierarchical” models (27 million parameters, two small networks recursing between each other) with this latest “tiny recursive” approach—one network, two layers, 5–7 million parameters.
On LLM Bloat vs. Real-World Needs:
"A 7 billion parameter model now is quite small... But these tiny models... could potentially outperform these large models on specific things..." (08:05 – Daniel)
On The Model’s Practicality:
"It's kind of the every person's way of modeling... tackling things going forward without a lot of resources..." (20:37 – Daniel)
On Human-Like Recursion:
"That looping is kind of a refining of that initial scratch pad until you kind of get to this almost like self consistency or, or a refined answer." (24:49 – Chris)
Daniel discusses a recent Harvard Gazette article about emotionally manipulative tactics used by chatbots to increase engagement time, raising psychological and ethical concerns.
Strategies observed:
"No one should feel that they’re immune to this." – Quoting defreitas, researcher (44:34)
Chris ties it to larger issues in tech (e.g., YouTube’s algorithmic manipulation), and both reflect on how self-awareness doesn’t necessarily protect users.
Host Sign-off:
“Let’s be careful out there. Be aware. Let’s be careful out there.” (47:15 – Daniel)
This episode is an essential listen for anyone hoping to understand frontiers beyond big LLMs and the very real potential of tiny, recursive models in practical AI deployments—and for those wanting perspective on the ethical complexities emerging around emotionally-responsive AI.