
Welcome to the pilot episode of SED News, a new podcast series from Software Engineering Daily. Join hosts Gregor Van and Sean Falconer as they break down the week’s most important stories in software engineering, machine learning,
Loading summary
Gregor Van
Welcome to the pilot episode of SED News, a new podcast series from Software Engineering Daily. Join hosts Gregor Van and Sean Falconer as they break down the week's most important stories in software engineering, machine learning and developer culture. In this episode, Gregor and Sean discuss the core Weave IPO and the company's recent acquisition of Weights and Biases. Dig into Anthropic's model context protocol, Surface highlights from Hacker News and reflect on Microsoft turning 50. We'd love to hear what you think of the format. Reach out on Bluesky at SoftwareDaily or on X at SoftwareDaily, Regervan or SeanFalconer.
Sean Falconer
Hi and welcome to SED Weekly News. This is a new format that we're trying out at Sedaily where we're going to be sort of digesting some of the week's events from software. It's myself and I've got Sean Falconer, who I'm sure you all know. Say hello, Sean.
Hey there. Hey, everyone. Hey, Gregor.
So hopefully as listeners, you've probably heard both myself and Sean normally interviewing guests and today we're going to be really just talking about some of the things that we've been seeing in the last couple of weeks in sort of the mainstream news around software and also things like hacker news, kind of trying to bring to the surface things that we see day to day but don't quite make it into the sort of regular episodes with our guests on Software Engineering Daily. So, yeah, I mean, just to roll into this, how's your week been, Sean?
It's been good. I was traveling at the head of this week and also one thing I would like to point out, Gregor, is this is the first time that I'm hearing your voice at normal cadence. Usually I'm listening to this.
That is a great point. Yes.
So it's a little off putting, but.
I don't speak as quickly as the recordings make out. So. Yeah, and likewise for you, Sean.
So, yeah. And how about you? What's going on in your world?
Yeah, just I'm normally in Singapore, as I often mention in the episodes. However, this week I am in the Highlands of Scotland, which I'm from Scotland originally, so quite nice to be recording this from up here, kind of where I come to do some thinking and clearly record new formats of Software Engineering Daily as well. So. But it's been good, it's been nice to do some dev up here and just kind of keeping on top of what's going on in tech. So I guess with that maybe we should roll into these are More like the mainstream headlines that we're looking to cover. So one of the big ones that came up this week was coreweave doing an ipo. Did you catch on that, Sean?
Yeah, I mean, I feel like coreweave's been in news a number of times recently. Like they had public acquisition of weights and biases. I've had weights and biases. The founders on SED in the past. They're a company I've known about for a long time. I thought that was an interesting acquisition. And now this IPO at a time when I feel like a lot less companies actually are going public. So I think that's an interesting move from their sprout.
This was an interesting one. The financial news kind of picked this one up, but I think it's definitely interesting from just being on the software side at the end of the day. Core Weave, my understanding is they had a lot of GPUs left over from their crypto days. So there's one of these sort of crypto to AI pivot companies, I believe.
Yeah, there's a couple of those out there.
Exactly. Yeah. So they have a lot of GPUs. Nvidia being the supplier of those. But curiously, Nvidia is also a customer of coreweave. And I think maybe what the financial news kind of couldn't heard a few things and they couldn't quite understand why Nvidia would be a customer of CoreWeave. Core Weave renting out these chips. And I think maybe as software developers we can probably put the pieces together a little bit faster on that one where Nvidia. We did an episode with Nvidia and they are delivering inference for their products, but I imagine they're not actually running that themself all the time. And I guess coreweave is now stepping in to be that kind of infra that runs the actual chips rather than Nvidia having data centers themselves. So I thought that was kind of interesting and financial news picked it up in a big way because there's a lot of debt. And so I think to your point, Sean, you were saying not a lot of IPOs are happening. I suspect this is maybe just a necessity for coreweave at this point. They have a lot of debt, it sounds like, and maybe no other way to kind of handle that. The other thing that was brought up that was quite interesting was the fact that Microsoft is also quite a big customer of Core Weaves. And I think a really good piece of commentary was made about that, which was just that Microsoft Core Weave is kind of like an off balance sheet asset for them, which is, I mean this is very financial. But I think it's kind of again interesting in software terms where you've got a tech company that Microsoft could clearly run this themselves but have decided like, well here's this other company doing the thing that we could do, but actually it's very costly. Why don't we just let them do it and then if it fails then we can come in there later, but it was never on our balance sheet. I think that's kind of interesting.
Yeah, there's some weird, I guess business strategy mixed up in all the technology that's going on here. I think that's why really the big faang tech companies are such a. I don't know. The beast to take on in the world of business is just they had their hands in so many things in so many different ways and when they get to a place where they can kind of turn on that engine of growth if they need to, they just have kind of like unlimited rocket fuel to throw at those types of things. Which is, I think there's pros and cons to all that. In terms of coreweave's IPO and then also these relationships with Nvidia and Microsoft. What impact did that have in terms of their public offering?
I think that was. The story looked good from that perspective. I think a lot of people that were, I guess looking to invest in this kind of just saw it as a plus. They're like, oh wow, Nvidia is a customer. Microsoft's a customer. Fantastic. And I think it's actually only the sort of financial news that's maybe taken another look at that and gone, well, why are they customers and why are they not doing this themselves? That's where I think from the software side we know why renting GPUs, that's a whole business in itself. There's a reason why Microsoft and Nvidia wouldn't particularly want to do that themselves. Microsoft probably more so because they're already a cloud provider. But Nvidia starting to get caught up in that side of things as well. The service side. That makes sense to me that why they would be a customer of coreweave. But I think it was that debt thing. Where is what kind of sunk this over anything else.
Is Core Weave the kind of biggest company or the most well known company that's kind of associated with AI that's gone public in the last couple years? Are they kind of the first one to make a move like this? Obviously There's a ton of really hot AI companies, but most of them, from what I can recall anyway, are all still operating privately.
Yeah, no, I think that's a great point. Yeah. I mean, off the top of my head, I can't think of any major AI company that we would actually sort of say is core AI. Of course we've seen a few kind of say, yeah, we're blah and we're blah plus AI or the AI for such and such. But I think this is one of the first ones where their whole play has been we are an AI company, we're only AI. And obviously as we know, the history is slightly different. It was we're crypto and now we're AI.
Right. Yeah.
But yeah, I mean, this is the point like any other company that we could probably just kind of rattle off, they're all still private and like, you know, OpenAI announcing 40 billion investment like not that long ago.
Yeah, I wonder if, you know, obviously there's a huge variance in terms of what it means to be an AI company. Like are you running inference and GPUs or are you doing something that's, you know, more software based, Are you building models, all these types of things. Like I wonder if there will be, because the IPO didn't go that well, if there'll be any sort of downstream negative effects to the companies that are still running privately in terms of just the macroeconomics around this. Does it hurt valuations, does it hurt their timelines of potentially going public?
Yeah, for sure. I mean, I think I am interested in always the financial side of these things, but more so also in the technology. And I think my take on this is just that the average investor, still institutional, are still pretty clueless really on a lot of tech and AI. And so yeah, I think unfortunately core weave not working out. At least just if you look at just the headline, oh, it went down and such and such. Yeah, unfortunately this will have a knock on effect for companies that are doing great things, but just there's still this sort of outside of the tech world. Is AI a fad? Is AI just the next crypto boom thing? I think on the tech side we can safely say it is different this time. At least that's what it feels like. But yeah, unfortunately to the slightly outside world it might just have a knock on effect at the moment until we maybe see another company ipo.
Yeah, I guess it might kind of give some of the naysayers fuel to their fire that complain about what is like, you know, tokens are Too expensive inferences. Too expensive. Is there really, like, a business model here and stuff like that? But to me, that's a little bit like complaining about the cost of transistors in 1972. Like, the cost of these things will go down over time, and then the economics change drastically. So sort of in relation to this, one of the things I wanted to chat about that also came out recently on the AI front was Meta releases Llama 4. As far as I understand, they skipped over Llama 3, but just went straight to the floor. And I think there was a time when a model announcement was a huge deal. Now there's like a model announcement every day of the week. It could be Sunday afternoon on a holiday, it's like Christmas Day and someone's releasing a model. But I think what was interesting about the Llama 4 was, one, this was kind of long anticipated. I think Llama 2 was showing its age in comparison to some of these other models that have been released. And essentially there's three model versions, or Scout, Maverick and Behemoth. And some of the. I think the key things that they have that differentiate from their prior model offerings is multimodal support, which is a big deal. I think obviously all the sort of premier models are now multimodal in some fashion. Huge context window. So scouts, like 10 million tokens, something crazy like that mixture of experts architecture, which has kind of become the industry standard for a lot of the foundation models. But Meta had not used that in any of their open models. And then it's just inexpensive from its performance perspective. So some interesting things there. You know, I'm hearing, I think, mixed reception. Obviously there was a lot of, like, hype around it when it came out. People excited about the sort of low cost, the size of these context windows and stuff like that. But other people that I trust that are deeply involved in the world of AI, say that from their perspective, there's not necessarily a lot of innovation there outside of sort of the context of window size and the fact that it's multimodal.
Yeah, I mean, I guess in the last. To your point, models every week. Yeah. We've seen GPT 4.1 come out, and that sort of strangely has superseded 4.5. They sort of rolled that one back or something to that effect. And 4.1 is now the shiny one. And multimodal, I believe, is part of that. Yeah. I mean, what's your take on. We're definitely seeing general models winning at the moment. There's a company, Harvey, which is a legal AI company, and I Think it's kind of starting to come to light that sort of their highly tuned model for law is not quite hitting the mark because I would say I've had some behind the scenes comments from lawyers saying they go home and use GPT. So what's your take on Islama 4? Is that in this direction where just a bigger, better general model is winning the day?
Yeah, I mean, I think there's two parts to this. So there's the open weight models and there's sort of certain criticism that comes along with those where I mean the advantage of the open weight models is, you know, you can pick it up, run it wherever you want. A lot of people use those to fine tune, but I think a lot of people feel like it requires fine tuning. At least that has been the criticism of Llama 2. I haven't played around with Llama 4, so I can't necessarily say from my own experience, but definitely there is a substantial performance difference that I've seen from using Llama 2 sort of base model versus using any of the other sort of premier models of like GPT or Claude and stuff like that. It's just clearly a difference in performance and there's reasons why you might want to use that model. But a lot of people end up having to like fine tune it to get it to the performance that they need for those specific application. So there's sort of that contrast of the open weights versus the other models. And there's also these two worldviews or sort of debate that's going on in the industry of is it going to be a world where we bring the data to the model or do we bring the model to the data?
Yeah, that's a good point.
And it's not 100% clear what the right way is right now, but I think if you're looking for the best performance via simple API call, you're probably going to get that from your OpenAI model. Quad Sonet 3.7, like those types of models kind of just out of the box give you the best performance. And I think a lot of people start there and they build their application there because they know they can get decent performance. And then over time they might start using one of the open weight models where they try to fine tune it to get it to the performance that they want or adjust prompts and stuff like that.
Got it. Where are you running Lamo?
Well, I've mostly used the smaller models so I can run those locally. And then also the company that I work for confluent we have an early access version of one of the smaller LLAMA models that you can use via like native inference. So if you're running something on Confluent cloud, you can basically run native inference directly on Confluent cloud you can also call out to the bigger models. But one of the models that we support sort of locally is the 7 billion parameter model for llama 2.
Nice. Yeah. And just to go back to that point you made bringing data to the model or model to the data, I think that's a great point. It's something I've been thinking about more. One example I thought about this week was probably one other people have thought about, which is like stack overflow. And at what point does stack overflow, for example, stop having the data to train the models? So where did the models suddenly get their code and debugging data from? Because no one's producing that anymore. I think it's an interesting one.
I mean, that's the big challenge that a lot of people have pointed to is is there more public information that these models can suck up? And there was some news that came out last year or late last year about how the promise of some of the next generation models haven't lived up to the promise. Like essentially the performance gains are starting to slow down even though they're accessing more information and they're trying to figure out like, where do we go and get more data. There's a lot of research and testing around how much of this can we synthetically generate. But there's also pros and cons there. If you're using a model to generate synthetic data and then training on it, does that degrade some of the performance over time? At some point like the public information on the Internet is going to be surpassed by essentially the majority of it's going to be AI generated versus human generated. And what does that do? And I think like the sort of next generation in maybe the arms race of around AI is like who has the best data sources. All these companies that have been around for a long time are sitting on mountains of probably really high value human generated data. So is there a startup out there somewhere that becomes like a marketplace for data sharing to help train these models or some other way to tap into those types of sources. And the other thing is, you know, if I'm a big company that's been around for a long time and I have that data and I want to start to leverage AI, that's sort of my value to the equation. That's my proprietary information that I want to hold onto. So maybe I want to use that as part of some fine tuning process or at least use it as part of whatever sort of rag process that I'm doing.
Yeah, completely.
Gregor Van
This episode of Software Engineering Daily is brought to you by Capital One. How does Capital One stack? It starts with applied research and leveraging data to build AI models. Their engineering teams use the power of the cloud and platform standardization and automation to embed AI solutions throughout the business. Real Time Data at Scale enables these proprietary AI solutions to help Capital One improve the financial lives of its customers. That's technology at Capital One. Learn more about how Capital One's modern tech stack data ecosystem and application of AI ML are central to the business by visiting capitalone.comtech.
Sean Falconer
So we're going to move on. So we've done the main kind of headline topics and now we're going to move on to a sort of all encompassing topic that's sort of been in hacker news a lot in the last couple of weeks. Been in the main news, I guess even, but it's certainly a topic that probably none of us have been able to escape from, which is mcp. MCP servers. So as I mentioned, I'm sure as a developer you're listening to this, you must have seen something on MCP in the last while LinkedIn, Hacker News, et cetera.
Unless you've been under a rock somewhere.
Right, exactly. I was just thinking. Yeah, exactly. Unless you've been under a rock. MTP just seems to. I mean, as soon as it hits LinkedIn, that's like classic thing. You know, if your hairdresser starts talking to you about X, then it's probably so I wouldn't be surprised if my hairdresser started asking me about MTP servers next week. So I think the key thing is that a lot of people have different ideas and definitions around what MCP even is we'll get into. I think the one that's been bandied around a lot is this is the USB C of interacting with AI services. And yet then there's lots of other people that say that's a rubbish way to describe it. So I think let's just start there. Sean, do you want to just walk us through this? What is mcp? What is an MCP server?
Yeah, so mcp, which stands for Model Context Protocol, was announced from Anthropic last November and Anthropic pushed this forward, but it's not an anthropic technology. It's essentially a proposal for an open standard. And the problem that we were trying to solve was that when you're building like an AI agent, part of an agent typically is tool use. And a tool could be a function that's going and executing something, some sort of a deterministic process, or it could be a function that's going and gathering data from some sort of place. And before we had something like mcp, every agent framework that you're building on, or if you're doing this from scratch, you're kind of doing that in a new way. You know, you're going and writing code to talk to an API or you're going and writing code to talk to a database server or whatever it is, which is fine if you're doing three tool integrations and that's the end of it. But as people build more and more agents, you want to get away from having to go and essentially write a lot of bespoke code that you then become potential technical debt in your stack because you have to manage that and stuff like that. It's just like the value of using an API gateway or something like that, right? To interconnect to different systems. So what Anthropic did is they're trying to solve that problem. So they proposed this standard, which is based on a client server model, where if I have an MCP client and I'm a data provider like Slack or something like that, or Google Drive, I can create essentially an MCP server that adheres to this protocol, stand that up, that server up somewhere, and. And now that client can talk to that server and gather data from Slack and stuff like that. In this, the client, which could be essentially an agent, doesn't need to know anything about Slack protocol or what that is. It essentially is just expressing in natural language what it is that it needs to gather. And the servers can define, they can have multiple tools defined and there's essentially this standardization around how you expose that. So you have tools, you have something called resources, which are essentially data that you're providing. And then you also provide prompts that are predefined as well that these clients can use to interact with. So that was announced in November. I was really excited about it. When it first announced. I read it, I understood what it meant potentially for the industry. And even in my day job at Confluent working on AI, that was something that I have worked on. Champion. We actually released an MCP server open source project a month or two ago that allows somebody to talk to data in Kafka, also manage Confluent cloud directly through natural language. As a tool interface and you can plug that into whatever MCP client you want. But what's really interesting is so mcp, its success, like any standard, depends on adoption because if there's one MCP server out there, you know, it's not much of the standard. So, you know, had very steady, significant growth of a lot of open source projects and stuff like that. But really, I feel like over the last month it really exploded because OpenAI came out with their new agent framework that supports MCP. So even though they're arguably anthropic, is maybe their biggest competitor, rather than OpenAI coming up with their own version of MCP standard, they said, we're going to support this as well. Then AWS and Bedrock also came out with their agent platform and they announced support for mcp. And then last week at Google Next, Google announced their new agent framework and they're also supporting the mcp. So you have these really, really big players in the industry that are also supporting the standard, which I think has really elevated it into the stratosphere of conversation around AI.
And there's platforms like Smithery, for example, where you can go and basically just find MCP servers that do whatever you want it to do. But I think that's maybe been some of the criticism as well, which is sort of like who's running these servers and who's behind them? Because I think that was maybe the slight mental switch I had to make when say, looking at something like Smithery, which was I was like, okay, where's the repo? And it's like, there's no repo. And they're like, oh, okay. So then we've got to think about, okay, well, hey, here's one that's like, this can help you interact with Google Workspace. I was like, okay, but then my API keys are going where exactly? And this kind of thing. So is that something you've kind of, I guess, hit up against as you've been using them?
I mean, not so much in my use, but I understand that it's kind of a little bit of the wild west, like any sort of new technology. It's clearly not, not great if people are just like standing up MCP server. I mean, maybe that's fine if you're doing your hobby project and stuff like that, but obviously you have to be aware of what you're consuming. If it's just some random person standing up a server that's going to go and talk to your Google workspace, clearly, where's the API key staying? What's Their logging. What's the agreement that you're making about data sharing stuff? People get so scared about the idea of sharing something with an OpenAI model. Like I'd be a lot more scared about sharing my information with a random MCP server that's running in some data silo in northern Russia or something, you know. So I think these are things that you have to be thinking about. Just like when you go and you grab a random GitHub repo or npm resource or whatever it is you have to be thinking about, especially in production scenarios, like what is the potential supply chain issues that could happen, stuff like that.
Yeah, I think that's a good delineation. Running MCP locally versus production. And I kind of get the impression that at the moment mcp, for the average person, average developer, it's probably more of a local endeavor. The easiest way I decided to get up and running with MCP at all is for example, download the Claude actual native app and then have it authorized into any given tool like Slack or Google Workspace or whichever. But at least that's all running locally as well as you probably trust Anthropic at this point with any kind of data like that. And then outside of that, sure, you can run any kind of local MCP setup and not worry too much about maybe where things are going. But then in production, I think that's where there's maybe just a ton of question marks at the moment.
Yeah. And I think that over time obviously the number of MCP servers and clients that are available on GitHub as open source projects is like exponential. Right. And obviously that's, I think like a lot of technologies that's like the tip of the spear, but over time more and more people who have, I don't think Snowflake has this yet, but like let's say Snowflake as a Snowflake user, you can expose your warehouse as an MCP server because you're building some agent. Right. So there'd be, I think, these more trusted sources, like I'm already a Snowflake user, I'm going to use their built hosted MCP server to talk to my Snowflake rather than some third party server that's running somewhere where I need to give them my credentials to talk to my Snowflake stuff like so I think it'll be these like more trusted versions of these servers more and more available from the regular companies that you would expect.
Yeah. And I mean just then, in terms of actually what does this enable? I think you touched on when we were just chatting earlier about agent to agent and how that's been mentioned pretty recently. So maybe just talk us through that.
Yeah, so agent to agent was another announcement from Google last week and it's complimentary to mcp. I see there's like a million articles that talk now about explaining the differences between these things. So I think like MCP was really focused on a tool. So to use your analogy or the analogy that you alluded to earlier, like you know, being this usb, it's really about like a standardization or like how I go and gather data or how I execute some sort of function. And what agent to agent is trying to do is also propose an open standard but solving a different problem. And it's actually a problem I wrote about a month and a half ago in this article where I talked about AI silos. So if we look at the problem that Google's trying to address is essentially it's probably a future problem where as enterprises adopt more and more agents and more and more agentic software, you know, I have my salesforce agents, I have my glean agents, and I have my cortex agents. And then maybe I'm building my own agents. I'm creating these islands of essentially independent agents that don't talk to each other. So just like we've created tons and tons of data silos, we're essentially creating a future of these AI intelligence silos where they have no way of communicating to each other. So in the article that I wrote, I talk about this problem and I've proposed a solution through data streaming technology. And what Google has done is they proposed a standard to solve this problem. And essentially that's this agent agent protocol. So it's more focused on how do we take these crews or meshes or swarms of independent agents and make it so that they can all talk to each other. So I can go and I could build an agent using Microsoft Autogen and I could build one in Wayne Graph and presumably if my glean agents also support this, I could have them all have conversations with each other.
It sounds like a great. I don't want to say dream in the sense it's not possible. I just mean it sounds like a great vision and a great future. What kind of applications can you applications to the real world if you like. Can you see this helping with.
I think they're trying to address a problem that maybe doesn't fully exist right now because most businesses, at least the ones that I talk to day to day in my job, are just kind of like, I just Want an agent, you know, I just want to like solve this. I want to do loan underwriting or claims processing or I. We have a bunch of forms that we need to fill out and we want to automate it. So they're starting there. This is kind of solving I think a future problem. But that future problem, you know, if you believe the vision that at some point enterprise is going to be like having thousands of agents running around doing various times tasks or software is some sort of agentic workflow, then this is going to be a real problem probably in the next few years. And I think fundamentally what it is, and I talk a little bit about this in my article, is it's kind of like what HTTP was to the web where you have the standard protocol that suddenly websites can talk to each other. That's what they're trying to create. It's a big idea, it's a big vision, but it's successful. They're creating essentially the HTTP of inner agent communication.
That's a great way to frame it. Probably kind of forget before HTTP, that was such a difficult concept to think about how computers would talk to each other over wires and obviously dial up connections at that point in time. And that's all kind of roughly being solved. But yeah, here we are in the AI era and everyone's kind of doing everything in their own way. That's kind of nice in a way. I've always thought that two years ago even was that with ChatGPT, it kind of reset the baseline a bit for all developers. Everyone in tech just had to come back to the starting point again for a lot of things. Which I think is really nice that everyone can kind of level up that way. But at the same time it does mean that we're kind of missing quite a lot of infrastructure that we're maybe used to having, such as just a protocol that enables us to link things that I think we can agree are probably going to need to be linked in the future.
Yeah, and I wouldn't be surprised. So there was a lot of technology partners that were part of the agent agent, including Confluent, the company I work for.
But.
But I wouldn't be shocked if there's more competition for standard around this than perhaps we see with mcp. I'm sure other big players in the market might come out with their own version of this. So you might end up in a situation where if you want to build something that's kind of like the Switzerland of this, you end up supporting maybe multiple protocols depending on how people want to connect these different things.
Well that's been, I think a fantastic whistle stop tour of mcp. As we'll mention, there is an episode coming up on SE Daily with Anthropic around mcp, so look out for that. But hopefully that either you've been under a rock and this is the first time you're hearing about this, or you have been reading about it but still maybe just a little bit confused about exactly where does this sit and where are we at with any of this? Hopefully that has just shed a bit of light onto that.
Gregor Van
Developers, we've all been there. It's 3am and your phone blares, jolting you awake. Another alert. You scramble to troubleshoot, but the complexity of your microservices environment makes it nearly impossible to pinpoint the problem quickly. That's why Chronosphere is on a mission to help you take back control with Differential Diagnosis, a new distributed tracing feature that takes the guesswork out of troubleshooting with just one click. DDX automatically analyzes all spans and dimensions related to a service, pinpointing the most likely cause of the issue. Don't let troubleshooting drag you into the early hours of the morning, just DDX it and resolve issues faster. Cycronosphere was named a leader in the 2024 Gartner Magic Quadrant for Observability Platforms at Chronosphere IO Sed.
Sean Falconer
So we're going to move on to hacker news. I think this is a nice place that we get to just dive into a couple of things that have caught our eye on hacker news, things that don't make the mainstream news. And it's always difficult to be on top of everything that happens on hacker news, especially these days. I feel like there's so many things getting submitted and we're just seeing kudos ratings going through the roof on certain things these days that we never saw before. Yeah, so my guess I've got a couple of picks. This might sound like mainstream news, but actually it's kind of thing that doesn't often make it. Which was actually just that. Google announced SEC Gemini V1, which is a model as it might sound SEC Security. It's a model all around security. And why is this interesting? Well yeah, if you're wanting to query an LLM and ask it about CVEs and what does this mean for me? A lot of models will kind of good generic answers around this, but we haven't really seen any of the big players actually say hey, we've got a model that is all about security and especially Google, they bought a company called Mandiant a couple of years ago, you know, a big security company that goes and like deals with cyber attacks effectively, amongst other things. So there's a lot of data obviously with Mandian of actually what causes problems and like what is the fallout. And in theory this model has captured quite a lot of that data. And that's been a bit of a holy grail, actually have a company that actually deals with the cyber attacks and actually be able to expose that data to the kind of wider world in a more structured way. And this seems like a pretty interesting way that in theory it's been done. So that kind of caught my eye.
And also Google recently had the acquisition of Wiz as well.
Right? Well, yeah, super recent. Their second time. Lucky.
Yeah, yeah, exactly. So I wonder if this is part of like a sort of a broader move to embed like AI into every layer of like enterprise defense. Is that sort of where Google sees potentially like a big business or a lot of value? There's certainly a lot of companies that are working on not exactly, I guess like peripherally related to security around like incident management, SREs, automation, all these types of things. I'm curious, from your perspective, do you think that people are already in certain industries kind of resistant to the idea of using AI? Do you think security inherently has more resistance just because of the. If things go awry, it's really bad.
Yeah, Having worked in security for a little while. But I think crucially I didn't start my career in security and I've talked and worked with a lot of security people and I think it's just fairly agreed that you will still find a lot of slightly protective people in security where it's a craft. And I think any craft being told AI can come and do their thing. Obviously we probably think of this more in the creative industries, anything artistic or film or potentially music. But actually cybersecurity practitioners really think of their work as a bit of a craft. And to be told that AI can come along and do that, there is a lot of resistance is kind of what I've seen. But there's no way, if we look at kind of just the volume of cyber attacks and the means and ways that these can proliferate. There is already a lack of human resourcing around this. And the only way I think this can be vaguely covered off is if AI is allowed to kind of be involved. I think it's one of these things where we're always going to have to accept that AI, it will be the classic, oh, that wasn't correct. Yes, you're totally right. That wasn't correct response. But at the same time, I don't really see a future in the security world where AI isn't a part of it.
And I would also think that, you know, I don't think attackers are going to wait. I'm sure they're already leveraging a lot of these things to make their attacks, you know, more sophisticated and stuff like that. Or somebody. Even from an education standpoint, you could leverage a lot of these tools to, like, learn how to essentially exploit a system that maybe before took a lot more work to sort of build up that skill set or learn how to do that. You can use some of these models to help you do that faster. In some ways, the attackers leveraging these tools to do this at scale and much faster, become forcing functions for also the security people to leverage these tools to get better at their job and be able to respond faster.
Yeah, absolutely. It's just a great thing to see. Google, they are saying it is a new and crucially experimental cybersecurity model, but it is there. Yeah. So maybe anyone, as you're pointing out, Sean, even wanting to just start learning more about cybersecurity, there is a model now backed by one of the big players, Techgemini V1. So maybe moving on, I believe something. Microsoft's 50th birthday caught your eye, Shaun, what caught your eye?
Yeah, there's two things around Microsoft turning 50. And then also they released their original source code, Altair basic, which is really interesting. I mean, I think that one of the things with Microsoft that I find really fascinating is they started out, of course, around this mission of putting a computer on every desk on the planet, and they largely accomplished that. I think they were also the first company to really put value in software rather than the hardware. And they had the famous deal where they license DOS to IBM rather than, like, sell it to them. And like, IBM didn't even care that there's like, you know, there's no value in software. So, sure, go ahead. And suddenly, you know, Microsoft becomes the most profitable company in the world. For a long time, they had. They were like this amazing, like, sexy company to work for. And then they had that era from, I don't know, it was like 2000 to the late, maybe 2008, 2010 or something like that. They had. It was like the antitrust era. They had a bunch of flops around, like their phone, the Zune, which is the famous ipod competitor, and stuff like that and they were like people basically thought, okay, this is going to be a company that's like the Kodak of the software industry. And they had a business that was based on selling essentially shrink wrap software. And I think what's really amazing about that is that they, over the last 10 to 15 years they've had a real shift in I think both their public perception, where they make the revenue from. They did what a lot of big companies can't do. The sort of successful pivot of their business from essentially like literally you can't get more on prem than a CD that gets delivered in the mail to install Windows or Office to being a huge cloud company and also now one of the leaders in AI. So I think that's really, really fascinating.
Yeah, no, completely. And far more detail than we will ever go into on SE Daily. But there's another podcast acquired who do these long form histories of companies and they did Microsoft and all the points you just mentioned there, Sean, they cover an amazing detail and I guess one of the things I hadn't maybe realized and I think they do a good job of kind of pulling this out, which is just that the Steve Ballmer years. You might have your own opinion of Steve Ballmer from watching videos or so on and so forth. That was actually when Bill Gates wasn't having a great time in the world. And Steve Ballmer actually kind of steered the company in a pretty good way during those years. And maybe we wouldn't have hit this 50 year mark if that hadn't happened. So it's these kind of little bits of history that we kind of forget about.
Just like Microsoft's gone through this rebranding or change in public perception, Bill Gates has also gone through massive change. He's like everyone's favorite grandpa now, dedicating his life to making the world a better place, which is great. But 1983, Bill Gates was like, you probably couldn't find a more determined person on the planet if you were not on that side of Microsoft. Probably considered a tyrant. There's all kinds of crazy stories about how vicious he could be with his employees and of course how driven he was. He was very famous for saying the words of like, this is the dumbest idea that I've ever heard. So that's not his public perception now, which is really interesting as well.
Yeah. So kudos to, I believe it's Hacker News, username Evgenizh. Thanks for posting the link to original Ms. Source code. I like this a lot because it was first of all the website that Bill Gates and his team obviously have put together for. This is just super nice.
Oh it's awesome.
This is obviously audio only. Just go and Google that and find the website and just so nice. You would expect to see that on some kind of super nice media blog or something. The way they kind of long form content and then. Yeah, I think it's kind of fun because literally you can download the source code and it's a PDF of like.
A dot matrix printout, right?
Exactly. This is the thing that was. Yeah, I was like you have basically. Yeah. Big photos I guess. Photos of PDF. Yeah, of exactly. Dot matrix printer with still the bars on either side with you know that would be how the paper rolls out of the thing. And yeah, I mean it's not like I've read the source code cover to cover but it is genuinely interesting just to kind of read through a bit of it. And one major thing kind of jumped out at me was just actually the amount of comments. I don't think we kind of remember that code back then was pretty unreadable. So I think comments was just something that had to be part of the furniture. There was also kind of a fun one where someone had then actually handwritten over the comments because one of the comments was wrong and they clearly thought this was important enough where they'd taken a pencil and they'd scored through. And just the example is kind of fun because there was the original comment said number should be printed in enotation. And they squared out little bits of it and it turned it into should the number be printed in enotation. So this is like again that's what the function was doing was should this be an enotation? Which is. Yeah. Anyway, take a look at it. It's just kind of fun to kind of see where Windows has even come from.
I love looking at some of the history of how some of these. The challenges that they had to do back then. You know, they. They're writing a full basic interpreter without ever touching the machine that it's actually run on. You know, they don't have modern debugging tools. It's all just like their brain power and like printouts and they did this over an eight week period. There's a really great keynote that I saw a couple times from one of the guys who was the creator of Doom which goes through their like year long journey. There's.
He wrote John Carmack, there's the two.
Yeah, yeah, yeah. And he wrote a book about it as well which I. And it's just like, one is like, they were all in, like, their 20s. They're probably just like slamming Red Bulls, working like 80 hours a week. But like, the timeline that they, like, built all this stuff, and a lot of it was like, you know, nobody knew how to do. So those types of, like, 3D stuff that they were doing at that time, it was all brand new, and they're just like, hacking away on this stuff. And they had all these. Basically announced that they were building this game and would have all these features before they wrote a single line of code. And then they had to deliver that in the timeline is really, really, really cool. Even the stuff that Bill Gates did back then, it was running on 4 kilobytes of RAM. Like what, half an emoji?
That always gets me that we just forget how obviously lucky but lazy we are. Basically, when we build software, we virtually don't even think about memory management. Most of us, I think, unless you're.
Trying to run a foundation model on a phone, right?
And maybe there's a bunch of rust developers all shouting at their phone right now, giving them podcasts about this, but most of us are not thinking about this stuff. And exactly as you say, the pioneers here had to work with just crazy, low amounts of memory and resources, quite frankly. So that was kind of interesting. And then just a final one just to round out. It was just a short one. This is why I love Hacker News. There was the user Ingve. I don't know how to pronounce that. He posted a nice article by a guy, John Collinsworth. So he's actually a Deno developer and as in he works at Deno as one of the staff developers. But yeah, it was called the blissful Zen of a good side project. And I think these kind of articles are always nice to read, hear from other developers hacking away on things. There's one quote, and that was kind of nice. He just sort of said, I felt something in that freedom. I felt a simple understated joy that I hadn't felt in a long time. A candle in a long darkened room. I'm sure we can kind of resonate with that, Sean. Just side projects. I mean, I know you post quite a few things on LinkedIn, just kind of piecing together AI workflows and wowing everybody. The fact that it's just. It's just a side project. So, like, it's kind of nice, right? Just a little thing.
Yeah, yeah. I've always been huge in the. In the side projects. Like, I feel like it's a way to Sort of express your creativity sometimes that you don't always can do in sort of the workplace because there's just certain things that you have to do there. And you know, when I spent, you know, seven years as a founder of a company and I think like one of the things that as the CTO of that company, I started to feel frustrated in the last few years was I felt like I wasn't continuing to learn because we had our tech stack and I saw the outside world moving very quickly and all this new technology and I just didn't have time to learn about it. But I was excited to learn about it and it didn't make sense for us to throw out what we were doing and start from scratch to try to adopt that technology. When I took a step back and stayed on as advisor for that company, I had a period of like 4 months before I joined Google where all I was doing was like side projects. And it was so much fun because I really sort of rediscovered my love for like building where, you know, there was one week where I built Tetris every day in a different programming language. Just, you know, just because it was just a lot, a lot of fun, a lot of learning. And I think it kind of reinstilled the joy of what brought me to study, you know, computer science and engineering in the first place.
Yeah, for sure. There's nothing quite like it when, you know, there are no constraints or bounds or demands. I think is probably the better one. There are no demands of you to produce this thing. It is simply just so long as you can find the time, you can put the time in, but you can just go off in all these different directions. And as John says in this article, he actually finished up by saying, the important part is that you explored that little corner of the map and uncovered what was there. It's okay if it's nothing. The exploration. The exploration was the success. And I think that's just kind of captures it really nicely. The side project he was actually even talking about, it was, I say, just it's actually pretty cool. It's a Sveltekit blog starter repo. I think that's kind of nice just to have something that did actually even make the light of day.
But I don't think you should do side projects with some sort of ulterior motive in mind. But I do think that there ends up being some positive consequences to it that might relate to job opportunities and things like that anyway. Especially if you're interviewing. Places like these give you stories to tell when people Ask what are you up to? What are you doing right now? You have something to talk about, which is really valuable too. And it also shows that your passion for what you do goes beyond just a paycheck.
Couldn't agree more. Yeah, there's nothing more disappointing than when you're maybe looking to hire someone and you kind of look around and don't find anything, or they can't maybe produce anything that wasn't somehow tied to their employer. So yeah, it just speak volumes in that sense as well. So just kind of wrapping up. I mean, we hope, obviously as a listener base, you've enjoyed se Daily Weekly News. This has been obviously a slightly different format. Just looking ahead in terms of what we know we've got coming up on the kind of regular schedule, obviously related to mcp. We have got Anthropic and MCP with Jordi that's coming up in a couple of weeks. I believe there's an episode with you, Sean. Open Tofu.
Yeah. So for those that aren't familiar with OpenToFu, essentially it was a spinoff project, went from Terraform, went Terraform. They changed sort of their licensing around it with the. I think happened pre or post the acquisition of HashiCorp into IBM. But some people essentially spun off the Open Tofu project as like a truly open source version of that that is compatible with Terraform. But they're also addressing some of the problems that is there with Terraform.
Awesome. And yeah, another one that's coming up is I talked to the chief security officer of Coinbase. So, yeah, very interesting individual. He worked at Palantir for a long time, which is also another super interesting company. But yeah, we just get to hear about all the kind of ins and outs of what it takes to secure arguably probably the largest crypto service. I mean, at the beginning of that episode he points out the fact that Coinbase these days is like two kind of major products. I won't go into them right now, but there's probably more to Coinbase than maybe meets the eye and I think that kind of makes that one a super interesting episode. Yeah. So anything else? Just over the week ahead. Sean, you want to call out before we wrap up?
I don't think so. Hopefully people enjoyed this. I certainly had fun and it's always great to. I was teasing about not having heard your voice at normal speed earlier, but it's always great to chat with you.
Likewise. Yeah. So obviously listeners do get in touch if you've enjoyed this one and of course, any feedback on any of the episodes. Guests. I'd love to always hear from our listener base. So thank you so much for tuning in, and we hope to see you again on another SE Daily Weekly News.
It.
Podcast Summary: Software Engineering Daily – SED News: CoreWeave IPO, Anthropic’s MCP, and Microsoft Turns 50
Release Date: April 29, 2025
Welcome to the detailed summary of the pilot episode of SED News, a new podcast series by Software Engineering Daily. Hosted by Gregor Van and Sean Falconer, this episode delves into significant developments in the software engineering landscape, including CoreWeave's IPO, Anthropic's Model Context Protocol (MCP), and Microsoft's 50th anniversary. Additionally, the hosts explore noteworthy highlights from Hacker News, offering listeners a comprehensive overview of current trends and insights in the tech world.
Gregor Van kicks off the episode by introducing SED News as a pilot series aimed at breaking down the week's most important stories in software engineering, machine learning, and developer culture. Alongside Sean Falconer, Gregor sets the stage for discussions on major topics such as CoreWeave's IPO, Anthropic's MCP, insights from Hacker News, and reflections on Microsoft's milestone birthday.
[00:00] Gregor Van: "Welcome to the pilot episode of SED News, a new podcast series from Software Engineering Daily."
A significant portion of the episode is dedicated to CoreWeave's recent Initial Public Offering (IPO), a notable event given the current trend of fewer companies opting to go public. CoreWeave, initially known for its involvement in cryptocurrency, has pivoted towards artificial intelligence (AI), leveraging its substantial GPU inventory to support AI infrastructure needs.
Key Points:
[02:37] Sean Falconer: "CoreWeave renting out these chips. And I think maybe as software developers we can probably put the pieces together a little bit faster on that one."
[05:05] Sean Falconer: "The big FAANG tech companies... just have kind of like unlimited rocket fuel to throw at those types of things."
Impact on AI Landscape: Sean discusses how CoreWeave's IPO signals a unique move in the AI sector, where most prominent AI companies remain privately held. This could have downstream effects on AI startups' valuations and their timelines for potential public offerings, especially amidst skepticism from institutional investors regarding the longevity and business models of AI enterprises.
[07:57] Sean Falconer: "I think there's some knock-on effect for companies that are doing great things, but just there's still this sort of outside of the tech world... it might just have a knock-on effect at the moment."
The discussion shifts to Anthropic's MCP, an open standard aimed at streamlining how AI agents interact with various tools and data sources. Announced in November by Anthropic, MCP seeks to address the complexities and technical debt associated with integrating multiple tools within AI frameworks.
Key Points:
[16:17] Sean Falconer: "MCP just seems to be the USB C of interacting with AI services."
[20:53] Sean Falconer: "There's a lot of question marks at the moment... How are we going to handle this securely and reliably?"
Agent to Agent Communication: Expanding on MCP, the hosts discuss Agent to Agent protocols introduced by Google, which complement MCP by enabling AI agents to communicate directly with one another. This development addresses the potential siloing of AI intelligence and aims to create a more interconnected AI ecosystem.
[24:23] Sean Falconer: "Agent to agent is trying to propose an open standard but solving a different problem... it's a future problem where enterprises adopt more and more agents."
Conclusion on MCP: The success of MCP hinges on widespread adoption and the establishment of trust through secure and reliable implementations. As major tech companies back MCP, it is poised to become a standard framework, simplifying the integration and operation of AI agents across diverse platforms.
Gregor Van and Sean Falconer explore several noteworthy discussions and announcements from Hacker News, shedding light on developments that may not have made mainstream headlines.
Google has unveiled Gemini V1, a model specifically tailored for cybersecurity applications. This model leverages data from Mandiant, a renowned cybersecurity firm acquired by Google, to provide detailed and structured insights into cyber threats and vulnerabilities.
[31:13] Sean Falconer: "Google Gemini V1 is a model all around security... it's been done in a pretty interesting way."
Implications: Gemini V1 represents a strategic move to embed AI deeply into enterprise security, offering specialized capabilities that generic AI models may lack. This aligns with the increasing demand for AI-driven cybersecurity solutions amidst growing cyber threats.
Celebrating its 50th anniversary, Microsoft has released the original source code for Altair BASIC, an early version of their BASIC interpreter. This release provides a glimpse into the foundational aspects of one of the most influential software companies in history.
[37:11] Sean Falconer: "Thanks for posting the link to original MS source code. You can download the source code as a PDF."
Historical Insights: The release underscores Microsoft's evolution from a software licensing pioneer to a cloud computing and AI powerhouse. It also highlights the company's humble beginnings and the engineering challenges faced during the early days, such as limited memory resources and the absence of modern debugging tools.
[40:10] Sean Falconer: "They did this over an eight-week period... they were just hacking away on this stuff."
OpenToFu emerges as an open-source counterpart to Terraform, offering compatibility while addressing some of Terraform's existing issues. This project represents the ongoing trend of the developer community seeking more flexible and adaptable infrastructure-as-code tools.
[44:31] Sean Falconer: "OpenToFu was a spinoff project from Terraform... addressing some of the problems that exist with Terraform."
A heartfelt post titled "The Blissful Zen of a Good Side Project" by John Collinsworth, a Deno developer, resonates with many developers. The article emphasizes the joy and creative fulfillment that side projects bring, free from the constraints of professional obligations.
[41:31] Sean Falconer: "I felt a simple understated joy that I hadn't felt in a long time. A candle in a long darkened room."
Takeaway: Side projects are portrayed as essential for personal growth, creativity, and maintaining a passion for development beyond day-to-day work. They provide opportunities for exploration, learning new technologies, and rekindling the excitement that initially drew developers to the field.
Gregor Van and Sean Falconer hint at future episodes that will delve deeper into the topics discussed, including:
The pilot episode of SED News successfully navigates through complex and timely topics, offering listeners valuable insights into the evolving tech landscape. Gregor and Sean's discussions highlight the intersection of finance, AI standards, cybersecurity, and the enduring legacy of tech giants like Microsoft. Their exploration of Hacker News add a community-driven perspective, emphasizing the importance of both innovation and historical context in shaping the future of software engineering.
Key Takeaways:
Notable Quotes:
Sean Falconer [02:37]: "CoreWeave renting out these chips. And I think maybe as software developers we can probably put the pieces together a little bit faster on that one."
Sean Falconer [05:05]: "The big FAANG tech companies... just have kind of like unlimited rocket fuel to throw at those types of things."
Sean Falconer [07:57]: "Unfortunately, this will have a knock-on effect for companies that are doing great things, but just there's still this sort of outside of the tech world..."
Sean Falconer [16:17]: "MCP just seems to be the USB C of interacting with AI services."
Sean Falconer [24:23]: "Agent to agent is trying to propose an open standard but solving a different problem..."
Sean Falconer [41:31]: "I felt a simple understated joy that I hadn't felt in a long time. A candle in a long darkened room."
This episode serves as an essential resource for software engineers, machine learning enthusiasts, and anyone interested in the dynamic developments shaping the tech industry. Stay tuned for more insightful discussions and in-depth analyses in upcoming episodes of SED News.