
Loading summary
A
Welcome everyone to another special edition of the Sans Cyber Leaders Podcast. I'm James Lyon and I hope you're here listening because you've recently enjoyed our episode on Iran with the great Tim Conway, another topical example of cyber leaders. But I'm sitting here in rural England and a few miles away there's a train station that connects straight to London. And like that city's famous bus system, you wait ages for one and then two come along at once. And things are so frenzied in the cybersecurity world right now. Here is our second special edition of the Cyber Leaders Podcast.
B
That's right, James. I'm Kieran Martin. Welcome to the show. Thanks for joining. So what's brought us to this point? Well, this episode is about.
A
What is it about?
B
That's a good question. Is it about Mythos Preview, the new model from Anthropic they judge too dangerous on cybersecurity grounds to release? Or is it about AI tools as a whole? After all, OpenAI came out with their own announcement about their quite different approach to their new model security and warnings about the hacking capabilities of AI models are not entirely new. But this warning is a lot louder and has, as they say in media and political circles, it's cut through, I think.
A
Kieran. It is all ultimately about the rise of Skynet, but. But anyway, look, regular listeners will know, of course, always. I start normal episodes on this podcast with the announcement that I've been, well, hacking and breaking things since as long as I can remember. And according to some interpretations of the last month's developments, now AI is going to do all of that for me. I can sit on a beach and sip a cocktail. So, before I have to come up with new introduction, let's set out what's happened, why it matters and what it means for the cyber defence community. So let's start quickly with what's happened. On 8 April, Anthropic, the AI giant that runs the series of Claude LLM capabilities, announced that its new model, Claud Mythos, was going to be delayed. Why? Not because of some engineering challenges or some excuses? Well, basically because it was too good at hacking. Now, you have to grossly oversimplify these things to summarize them, I admit, but basically they said two things. One was it was really good at finding vulnerabilities. Zero days scary stuff that would be incredibly useful to attackers. Stuff that no one has ever found or at least never reported on a scale we haven't seen before. In the flagship example, they reported in their press Release, they said they'd found one vulnerability in core open source Internet infrastructure that was 27 years old. It's actually probably older than some of the analysts that are going to be tasked with deploying patches. And as a result, the time taken from discovery of vulnerability to exploit reputation, it's going to collapse even further. It's already gone from 28 months to 28 hours in the last eight years. And now we're heading down to minutes. What was the other thing, Kieran? There were two.
B
You've basically already said it. I'm very impressed.
A
Nope, I've lost it already, have I? What was the other bit?
B
The bit about it being really good at hacking.
A
That's it. I suppose, in some ways, and perhaps this is surprising, the bit received less attention. But to cyber defenders out there, it really matters. Essentially, you know, Mythos Preview was very good at the full attack attack chain, getting through to actual exploitation. You know, whether you're using zero days or all the existing weaknesses we already know about. But people don't always defend being able to leverage them and quickly with potentially less expertise at scale is a big deal.
B
So to cut a long story short, for those two reasons anthropic delayed to do what? Maybe they were trying to do what I normally do when I've got a problem, and that's to ignore it long enough in the hope that it will go away.
A
If only. Kieran. No, no. They announced something called Project Glasswing. That's a partnership based on privile access to a few dozen big American companies, tech giants, cybersecurity companies, banks, you name it, to try and work on mitigations. And they do this with privileged access to the model before it comes out.
B
So, James, is everybody else going to do this?
A
Well, sort of, but not quite. I mean, OpenAI have said that they will also delay their next model, but they want a much more open partnership based on a trusted accreditation model, which I'm sure we'll get into with our guests shortly. And of course, the new Chinese models will inevitably coming. I mean, history suggests they'll be very powerful and we've no idea how they'll approach that release necessarily.
B
So what does it all mean? Well, it dropped in an unsuspecting world at the height of one of the most serious wars in recent history. So it's a lot to take in. But as people across the world have started to take it in, there are some points of consensus and some ongoing points of debate that are emerging. First, this is a big deal. Now, James, when I was Writing this up, I hit the wrong key on my keyboard and I promise you I'm not making this up. You know me well enough. I don't have that imagination. But I typed this is a bug deal. I and you are beside each other in the keyboard. I thought about leaving that. So anyway, it's a big bug deal. A new T shirt. And maybe that's going to be the title of our podcast today.
A
Very punny. But hey, look, get to the point, my subordinate. I mean, friend.
B
Haha. Thank you. Yes, boss. It's a big bug deal. It's a serious acceleration of something that was already on our radar. The hacking power of new AI models. How much of a transformation is it really? That's hard to know. As you've said, James, for understandable reasons, Anthropic have kept access to the model on a privileged basis. So when the news first came out, we only had the analysis, as they did with their partners, to go on. There's only been one independent assessment of Mythos Preview based on having actual access to the model. And you and I, James, I think we can take a bit of pride here because it was done by the UK's very own AI Security Institute.
A
That's a remarkably good bit of progressive government there. Building a government agency capable of winning the trust of an AI giant based in another country to do an independent analysis of their tool and publishing it within a week. Well, frankly, I'm flabbergasted, Kieran.
B
Well, well done them. And I'll gently suggest they've set a standard for the rest of government and hopefully for other governments too. But the a A SI work was a markedly, perhaps calmer assessment than some of the more headline grabbing stuff reported. When Anthropic's release came out, it focused, I think, on the second part of your two points, the general hacking capability rather than the discovery of zero day vulnerabilities. And it concluded that Mythos Preview is really, really good at hacking. And it was the first AI model to fully complete its full range of attack tasks across the whole chain. But it said it had some shortcomings and if you look at their charts, it was once again a notable acceleration of something we've already known about for a while. I'm struck by what Heather Adkins, that brilliant SISU at Google, has said about the clock started ticking months ago. And they also caveated by saying that no one, including them, has really tested Mythos Preview against actual cyber defenses.
A
That's right, Kieran. So look what to make of it all. I mean, clearly this is a big bug deal, definitely a T shirt we're making. But there's different, you know, versions of this and, you know, some are saying it changes everything. Some are saying it's marketing hype and a good way of trying to cover up for recent issues over anthropic on the security side. And few have lost money in our industry by predicting the apocalypse. It's certainly been an attractive habit for marketers over my 20 odd years. And then some are even saying it's actually a wonderful opportunity to fix some of the serious bugs in the Internet and more broadly, the technology we rely on every day we've never been able to fix before. So we've got everything from dire straits through to wonderful optimism and apocalypse mixed in the middle. Some are saying it's a combination of all of these.
B
Well, what does it mean for cyber defenders for this community? And that's where we're here to help.
A
Indeed. Well, look, folks, we sometimes talk about cyber defense as a community and that's a real thing. It's who this podcast is aimed at. And key parts of the community have raced into action rapidly. Experts from all sorts of different backgrounds with different perspectives arguing through and trying to figure out what really matters here or the line I use very often what works and what doesn't. And they came up with a paper that was published jointly by us here at SANS and our wonderful friends at the Cloud Security alliance, who let us into the party to provide some expertise. And it gives a balanced, informative and actionable account as we could make of this remarkable development.
B
Yes, thank you, everyone involved in that. And that brings us to our guests finally. Now, it's a new format today, so we had to set out the scene and we have three guests and all of them are involved in this report, the AI Vulnerability Storm. We're hugely grateful to them. Now let's introduce them. One is our very own Rob Lee, Chief AI officer and chief of research at sams. Welcome back to the show, Rob.
C
Hey, thank you for having me here.
A
And we're all also extremely privileged to have Gaddy Evron, founder and CEO at the AI agent security company Gnostic, and CISO in residence for the Cloud Security alliance, who led the charge on this wonderful paper and, well, indeed, lead author of the paper. So welcome. G. Thanks for making some time for us today.
D
Thank you. And I appreciate the accolades. Although honestly, I just herded cats. It was everybody, truly, 250 people working together over a weekend. I am happy to take credit for the herding of the cats. But not for the whole paper.
A
I'm excited to hear how you might have accelerated cat herding with Mythos. But that's a question question for later in the podcast, isn't it, Kieran?
B
Indeed. And speaking as a cat, last but not least, it's great to welcome Ed Scudis of Sans. Now, when you mention Ed when discussing penetration testing or incident response, everybody knows who you're talking about. So for the first but not the last, I hope, time on the show, let's welcome Ed Scudus. Ed, hello.
E
Hello. Thank you. I'm honored to be here. Looking forward to this discussion.
A
Fantastic. Well, look, let's get into this. Kieran and I have done some scene setting, but it's time to get to the experts. So. So, first of the two things I highlighted in the introduction that the Mythos preview, it's very good at finding bugs that we don't currently know about. That seems to me to be pretty proven at this point, and a theme that frankly exists even beyond frontier models. But what are the implications of that? Gaddy, why don't you start us off on this?
D
I believe the first implication really is about the model itself. This is now the Mythos problem, A big bug mythos problem, because Mythos is what people know. That's what broke through, breached, if I'm to use that term lightly. The New York Times and the CNN barrier. So the problem itself has been around for a long time. You don't need metas to be able to find bugs or exploit bugs. It is indeed more capable. It didn't do a lot more than previous models, but you can do a ton with what we already had. So thus, this is the Mythos problem, because this is how it's now known. This is how we discovered it. As to the implications, I believe that we'll get into that throughout today. But the most important thing to realize, which you already touched on, is, number one, there is market here, obviously, and there are a lot of skeptics. And let me promise you, there is hype. But the truth is larger than any hype. We have all been in this industry for a long time. We survived how many apocalypses. However you pronounce the multiple of apocalypse, many. All the way back to Y2K and beyond. With that said, it is also real. And we must recognize that to start moving on this, because it shifts so many of our assumptions in cyber defense that we have to start working on this now, even if it takes us a while to get there.
A
That makes sense to me, and I agree. We've seen quite a few of these, certainly over the tenure of my career. But Ed, if I might come to you here, you've been around even longer than me, one of the titans on which this industry was built.
D
Ah, shucks.
A
Are you watching this play out thinking, oh, here we go again, another overhyped capability. Or, you know, like Gaddy, do you think underneath that hype and marketing there really is a significant trend here? How are you thinking about it?
E
I think there's a real there there. Like Gaddy said, there is some marketing happening here on the anthropics part and others. But we've been doing source assisted pen testing with AI for 15 months. Me and my team putting aside Mythos, using current models, we have found massive vulnerabilities, you know, major issues in not only open source code bases, but also, you know, closed source, where our customers give us the source code and we start combing through it. We've had things that we've had human pen testers test year after year after year for five, seven, ten years. And then we apply these techniques using current models, you know, nothing Mythos or beyond. And on the first day of the pen test, we'll discover five or ten critical vulnerabilities that we've never seen before. Because the AI is able to find subtle flaws. And this is not something like traditional SAST would find, you know, source assisted software analysis, which would be like cross site scripting or SQL injection. Sure, these tools can find that, but they're finding subtler bugs like strange authentication bypasses or authorization flaws or. One of the big things we're finding is in SaaS applications, cross tenant access. So we're able to go in as one user of the SaaS application and through manipulation of a bug found by the AI, we're able to start accessing other parts and other customers of the SaaS application. So there's a whole raft of bugs that current models are really good at finding. And my worry is that a lot of our listeners here will say, well, I don't have access to Mythos, so that's a future problem for me. But the fact is you can use current models, turning them against your own code bases, to find vulnerabilities today and eradicate them so that when future attackers get Mythos or beyond capabilities, those vulns are away from you. They're already gone. You need to kind of drain the swamp. Now we've been given this little reprieve and warning and we need to take advantage of that.
A
I do love that point and it's funny. We got to admit, we're talking out of both sides of our mouth a little bit, aren't we? We're kind of saying, oh, look, there's a big new piece of technology. We got our eyes on Mythos, and then we're kind of simultaneously going. And a lot of it was already capable with prior models, and there's a bigger trend and probably ought to get on with solving this, even though there's a lot of marketing hype. Rob, if I might come to you quickly on this. You spent a lot of time playing with A.I. you know, I'd say you're an A.I. pragmatist, but certainly on the optimistic side of it as well. You know, I have my bouts of cynicism, and I need a dose of Rob to lift me out of my frustrations as a grumpy curmudgeon with AI how do you think about this impacting the cybersecurity profession? I mean, is it the case, as I said in my intro, that I just need to be able to type good prompts now, and I can go and find as many zero days as I like. Gone are the days of needing skilled humans. Where do you sit compared to Ed and Gaddy on this? Similar perspective or different?
C
I'll put it this way. So one of the things that Gaddy and Heather Adkins. I hope I get this right, Gotti, which is the cataclysm. You know, where you guys wrote that paper was about six months ago. What was the official name of that paper again?
D
So I believe we called it something like preparing for or surviving through positive notes, because we call it the cataclysm. We made three choices, right, that we thought about deeply. Number one, we call this a cataclysm, an AI vulnerability cataclysm. Number two, we put a number next to it. Six months and stopping there. Making these two choices means fud, fear, uncertainty, and doubt. And in our careers, we really try to both learn and strive to avoid that, to speak with metrics, to speak with risk, to speak to executives, really. But the warning had to come out, and we really tried to get the point across of this is already here. And that was the risk we took back then, and that's why we used words like cataclysm. Today, we're no longer using this terminology, but the problem is no less urgent. It's just insecurity. You are often, Cassandra, you're blessed with foresight, but you're cursed for nobody to ever believe you and then blame you after it happens, right?
A
Stay with us.
B
We'll be right back.
A
Hi everyone, James Lyon here, the CEO of the SANS Institute. A quick thought for you. Cybercriminals have networks, dark web forums where they share what works, what doesn't, and where they're constantly sharpening their playbooks against us. So why shouldn't we do the same? That's exactly what the SANS Cyber Leaders Network is about. It's a place where CISOs and security leaders share what's actually working inside their organizations and what isn't, while getting access to world class experts sharing insights into the latest threats and trends. You'll find me in there surfing around, sharing what works. So come join us us@go.sans.org CLN that's Charlie Lima November. And if you're enjoying the show, one teeny tiny small favor hit subscribe. That's genuinely all we'll ever ask of you. And in return, we'll keep fighting to bring you the guests and conversations that you want to hear. Appreciate it all. Now let's get on with the show.
C
That's one of the things I loved about that, which was the way you're describing it, but also the foresight because I mean, in terms of where you guys were, pretty crystal balling that down the line. And even with what Ed said, I think his team, in using it for so long highlights a couple of things here. And I'll go back and when we're writing the paper, something that really struck me, which I loved, describing a new capability inside organizations that they should develop called vulnerability operations. So vulnabs for short. So when you end up taking a look at what Ed's doing and the potential, what are people, you know, calling the apocalypse or you know, people are overworked trying to patch all these things, it's hard to predict exactly what the workload change is going to be. A couple weeks ago, actually I think three weeks now, I watched a really good documentary called the AI Doc, which highly recommend everyone watch this, but there's a keyword in there which kind of describes me and how just this situation is described. The subtitle of that is How I Became Apocalyptomist.
A
Definitely a T shirt. Just putting it out now.
C
The combination of both you believe in the apocalypse, we became really optimistic simultaneously. So I've adopted that word in terms of my own viewpoint with this, which is right now everyone's feeling the worry, the stress, the, you know, there's still a lot of FUD that's surrounding this. But the positive angle that I look at Is number one. Anthropic's given us a gift which we're on this podcast and we're able to talk about something that, that Gaddy and others were referencing six months ago. Ed, you were talking to me about it. Now. Josh Wright, you know, did his keynote about it. There's this influx of people, knew this was coming and now it's here. And Anthropic's given this gift because we were struggling in getting the word out. You know, Gaddy, how many people asking you in the press about this? Now we're talking about this and now we have a chance to highlight this to other executives into the board. You know, should there be an additional investment?
B
Sure.
C
It's marketing hype, but now it's a focus and now you're able to make these asks. I'm also optimistic on what is this going to change for code development in the future when you have these capabilities at the front end before you deploy your capabilities, that you'll be a lot more aggressive, you know, in vulnops, working directly with SEC DevOps and the ability for toting to potentially find more flaws before they're hit to production. I think that is likely going to happen. We just need to ride out this initial storm to get to that point where I believe there's going to be a lot more secure code, not 100% but a lot more accurate secure code that's going to be released. You know, Jenny Easterly has said this again and again and again. It's not a cybersecurity problem. It is a bad code writing problem. And with that in mind, you know, and a hat to everyone is kind of in here and kind of like bringing my own thoughts together. That's why an apocalyptomist here.
B
Right. So hold those thoughts. Just as I was trying to work out what the plural of apocalypse was, you've now taken it a step further and given me a bigger challenge. But we are coming back to vulnerabilities. We are coming back to those points that you raised. We need to go into detail in them about what defenders should do, what organizations can do, and we will come back absolutely to the optimism. But as an optimist, it's my sad duty to take us back to the other part of the storm before we do that, so that we can get a full picture of what we're up against here. So I'll move us on to the second bit. I used to head up a public authority, as you know in the UK for cybersecurity. We always Told people zero days. Yeah. Don't get overexcited about them. It's all the existing weaknesses. And so I want to ask you, maybe me, Rob, go first, about Mythos as a very good hacker. The three. Three out of ten simulations in the UK's AI Security Institute, it managed the full attack chain pretty quickly. No AI model has done this before. Again, we've seen this coming for a while. What does this mean?
C
Well, first of all, I think most of us in here are looking at a capability that we've not directly put our hands on. So, you know, I go take a look at the capabilities that currently exist and really emphasize that Mythos is likely a change in logarithmic speed going from, say, we're traveling at 50 kilometers an hour. You know, you're now breaking the sound barrier. But I really want to highlight that 50 kilometers an hour or for a lot of things, it's really fast. And even in my own development and working with the previous models, we're able to get extremely accurate reports, even pointed at compromised systems. And so when you end up taking a look at holistically, we need to remind ourselves that cybersecurity, you know, we do have this capability, and it has launched the hacking capability forward. And Anthropic noted that in their report GTT1002 back in November, that not only are we looking at the vulnerability exposing at a really short wind.
B
Yeah.
C
We're also looking at autonomous hacking that is extremely sped up. You combine that peanut butter and chocolate to make your Snickers bar. That is what I'm really concerned about. It's not just finding zero days. It's the speed of attack in the chain that would be able to be accomplished.
B
All right, well, the listeners can't see, but I saw it because it's on video for me. Sorry, Audio podcast for everybody else. Ed, you were nodding vigorously when we started talking about this. Give me your perspective.
A
Almost aggressively, I would say.
B
With a smile. With a smile.
E
I think it was pretty aggressively. Yeah. So that's really interesting area of this. You know, I had mentioned earlier, let's not get obsessed with Mythos ability to find zero days, because current models are really good at it. That's good. That's true. We have found in applying current models to our active pen testing, that they're okay at it. They're not great. We have created some internal tooling, we call it Sidekick, where, you know, it rides alongside a pen tester, and the pen tester compares notes with it and so forth. And we found Some stuff we found, some medium vulnerabilities, some low risk stuff. We did have our little sidekick find a critical vulnerability a few weeks ago that the pen tester working on the thing didn't find himself, and that's good. Mythos could really be a game changer here, though. So I wouldn't discourage anybody from using current models to augment their current pen test team. But Mythos, I think, is the real game changer here. You know, another thing, if we're emphasizing some more of the pessimism before the optimism.
B
Later for the optimism. Yeah, I think that's the right order.
E
Yes, exactly. I'm a little concerned with, you know, the promise that Mythos and current models are going to find a lot of zero day vulnerabilities, because I think that's going to force those who have an arsenal of the zero day vulnerabilities, especially the ones that they know are easy to find, to use them now. Now, use them or lose them, because the swamp is going to be drained over the next 3, 612 months. And this is kind of completely independent of Mythos and next generation models, but it's being pushed by them, that is, take your current zero days and use them, the ones that you may have spent tens of thousands, hundreds of thousands, millions of dollars on. Use them now because they're probably going to evaporate when Mythos and other models turn their attention to finding them.
D
Brilliant.
B
So, gaddy, I think we're at the border checkpoint between optimism and pessimism. So you can stay where you want for a while. But I really love your perspective on this. If you like the normal zero day part of this and how you evaluate it.
D
When it comes down to it, we're security professionals, we're risk professionals, we're technologists. Depending on who we are, we might be none or neither. But as we started, we survived the multiple apocalypse. I don't know if that's the way it would be announced, but you'll update me, right? Love it. And it comes down to optimism because we did survive. Those people will come together, the sun will rise, but not make it any less serious.
B
Yeah.
D
And up to now, we covered the specific risk of vulnerability in code. And now we can turn that capability of attackers as they kind of hit their singularity moment. Maybe a micro singularity. The singularity is not evenly distributed, whatever you want to call it. And defense hasn't. We have seen a lot of AI technologies go into defense. None of them are yet mature. And we're still imagining at the level of we can use AI for something we've done before as opposed to something completely new and different. So attackers are there. They can find vulnerabilities, they can exploit them, and they can run autonomous operations. Malware is now coming in. Nothing is perfect. It will keep advancing, but we understand it now. So the first thing, just in summary is take the technology, build your vuln ops capabilities and start. Start by just pointing an agent at your code and saying find something just to get started. Then start using whether it's commercial tools like from Tropic Codec Security or Cloud, Cloud Security Aardvark, you can use open source, which we actually released to not compete with them. From Gnostic, which is called openant A&T, you can use anything but stuff start. But then there is the second point and there are three.
B
Always three.
D
That's good, always three. I was in the military. I don't know if there will be three, but I'll always keep it to three. So the second point is we don't really have these other defensive technologies yet. We are borrowing from the attackers. What we do have is our people. And our people can be accelerated. Today we can't allow or ford ourselves to move at human speed anymore. We have to run at machine speed. And if there is one thing that helps everybody across the board and attackers for sure, whether you're in GRC or audit or incident response or threat hunting or threat intelligence. And of course coding is coding agents. These are agents. But the vast majority of agents out there, the best agents in the world right now, are coding agents, cloud code, cursor, copilot, whichever one you like. If we don't ask, suggest, encourage, make it mandatory, and force people to start using agents now. Now to empower themselves to be two times to 100, 200, 300 times, depending on what's better, more capable, more efficient, able to understand faster as they did before. We're behind this, of course, needs to be done securely and I of course biased on that, but I truly believe that. But use agents. I am on the optimistic side because I believe that is truly the number one thing we need to do as we develop these new capabilities to a defense.
A
I love that, Gary. There's something in there I want to pick up on and I want to bring Rob into as well, because it's a little discussion we had over the last couple of weeks.
B
Weeks.
A
This moving to machine speed. Now, I've seen a bit of this movie before. I was there in the early days of malware, where we used to kind of reverse engineer and try to write identities or signatures as they might have been called back then by hand for each piece of malicious code. And then attackers started generating them using generators, you know, programming solutions. So we had to build kind of machine learning rigs and expert systems as the number of samples went from thousands a day to hundreds of thousands a day. And it changed some of the makeup of the skills that were needed of the analysts. But we fundamentally needed more analysts to deal with the scale of the problem. Gaddy Rob, up to you. Who goes first? I shall allow first mover advantage. I'm curious when you talk about machine speed, how you think this might change the shape of cybersecurity professional skills over the next few years in accomplishing Gaddy, what you were just describing, which I think is going to be the right approach to defense.
C
So I guess my point we have to go back and I'm a big fan of history and how we landed in our careers here. When the cybersecurity quote unquote industry began, sysadmins were the cybersecurity professionals. It was a side job. You know, many cases early on it wasn't even side job. They couldn't even describe it. They were just thrown at problems that were related to security. And then late 90s, early 2000s, you started seeing very slow. You have to go back in early the ask, you know, when did the job security analyst and even forensicators and so forth, instant responders, pen testers. Ed, I think you even helped coin the term pen tester, if I'm correct came to exist, and I see that currently going on here is that the skills where people are dabbling in trying to learn is not unlike what we saw at the beginning, is that you end up having this separate skill set that is being developed now. Whether those become pull up AI, cybersecurity analysts, machine learning, I think that'll metastasize over the next few years. There may be even a split of cybersecurity. It would be almost like how it split from cybersecurity in two different functions that and again have a little bit of a reach, which is almost a debate for another day, is will you have AI SecOps and Security Operations separate from cybersecurity capabilities. And it sounds like that's impossible, but given how different AI is, is especially once you get beyond, you know, like in the singularity world, working and utilizing AI becomes a lot different because, you know, you're reasoning with it, you have to train it, you have to do a lot more introspection. But we still need someone focusing in on the core cybersecurity stuff. So I don't know. I think it's skill based, James, but I also think there's room to make the analogy that it could go separate. It could be, you know, clear career fields. I think we're in the sysadmin days where people are of kind of figuring out, hey, I'm a cyber security person.
A
It's really interesting whether we'll end up with AI folks who are bilingual with cyber or you know, cyber with AI merged in or what spectrum of separation ed. You are no doubt going to have some thoughts here, having seen a few of these movies over the years and the evolution of our industry.
E
Exactly. And I've been thinking a lot along the lines of what Rob just said. You know, 30 years ago we saw people and you could think of it as skill sets if you'd like, but I've been thinking about it as workflow. So if you go back 30 years ago, people started putting down the workflows of intrusion analyst or incident handling or. Rob was hugely involved in digital forensics and other friends of ours did cyber defense. And maybe 25 years ago we formalized the workflow of what a penetration test is and what's a web app pen test versus a network pen test and so forth. And there's others, right? Mobile, cloud, ICS and so forth. But what I think the industry needs now is to define what the workflows are, are the AI centric workflows and they may be very different from the current workflows for given jobs. One example I'm very familiar with is this whole AI enabled source analysis and assisted penetration testing. It's a completely inverted workflow where you start with the AI looking at the source code as opposed to a traditional, say, web app pen test where you have a human looking at the target environment. Also, this whole discussion about vulnops, I think what vulnops fundamentally is, is a new work workflow around which there are skills. But I would love to see more people who are very focused on vulns define AI centric vulnop's workflows so that then we can figure out what the skills are around it and build up those skills in ourselves and in our workmates. If I had to bet on whether AI splits out so that there's sort of a cyberized AI versus traditional cybersecurity, my bet is it won't split. I understand you know, James and Rob's comment that it split because it did in the sysadmin days. However, I don't think there will be enough for traditional cyber cybersecurity people to do without AI, so that they're all going to move over and have AI centric workflows and provide the unique capability that the humans can provide in those workflows. So what I'm really interested in is AI enabled workflows. In fact, I don't want to sound too, you know, haughty about this or anything, but if you're just sprinkling AI on an existing workflow, maybe that'll help, but I don't find that that interesting. I want to see AI workflows built from the ground up. Dan Guido of Trail of Bits did an amazing presentation at Unprompted Con about six weeks ago or so and it was all about how do you build AI centric workflows around your business and not just AI, enable existing workflows? And I've been thinking a lot about that and how it applies to penetration testing, cyber defense, digital forensics. I know Rob's been thinking about it for digital forensics and incident response. Heather Barnhart has been thinking about it a lot for digital forensics and incident response. But I think creating those workflows and then the brand new concept of vulna
A
ops, creating a workflow for that, that's fascinating, Ed. And I, I do buy what you're selling, I have to say. I think, you know, I've been using this expression that over the next few years as AI is adopted by the good guys and the bad and we figure out the best ways to supercharge our various efforts and reinvent workflows. You know, when everyone has AI, the edge again is human critical thinking, humans with interesting ideas and training and problem domain expertise. And, and that would suggest that people that understand AI and their particular cyber security problem domain are likely going to be to be able to define those new workflows, processes and systems. Ed, if I might come back to you on a related question. I suppose so now we've reassured the cybersecurity community that, you know, it isn't Skynet. We might need some people.
B
It's all fine. No, sorry.
A
What happens now? I mean, this has been accelerating for a while. Gaddy covered that in his opening remarks that this has been a really effective way to draw attention to an underlying trend that's been swept, swelling. OpenAI have just come out with their own delay and a different model for how the community handles the challenge. We then are going to have the wave of Chinese models and open models and so on and then the whole cycle starts again. So for security leaders and practitioners listening, what should we be looking out for months down the line or 12 or 18 out from here, do you think?
E
Ed, this is where I get really excited. I think we're going to go through some rough times in three months, six months, nine months, maybe a year, as we adapt to new attacker capabilities augmented by AI and we implement things like vulnops and improved AI centric workflows. So that might happen over the space of the next year or so. But what I see, two, three, four, five years out, I'm highly optimistic about. I saw Phil Venables quoted, he put something on X about after he watched the videos and such from Unprompted. He said he's short term pessimistic and he's never been more long term optimistic. And I agree wholeheartedly with that idea. I think three or four years out we're actually going to drain the swamp of many major vulnerabilities. We got to get there first. And you know, I say this to a lot of my friends who, you know, look at this with pessimism over the short term or even what happens to jobs in the long term. Job loss in cybersecurity, what mean. And I say this, I've been doing cybersecurity work for 30 years now and I've worked with some of the most amazing people in the industry and we've given the best years of our lives to try to make the world safer and more secure. At best, we've barely treaded one. I mean, it is true that we're more secure than we were then in absolute terms. If you look at the security capabilities of Windows 11 versus the security capabilities of Windows 2000, oh my goodness. We've come a long way. However, the attackers have gotten so much better that at best we've gotten better at the same rate they have. Or maybe they've gotten a little better than we have. So we're just treading water. I see. With these new capabilities that are coming online and the significant lowering of vulnerabilities in our software base space, it's possible that three or four years from now we can actually have a fundamentally more secure environment so that attackers will figure out other ways to attack us on top of it, but the baseline will be more secure. And that's the first time I think that's fundamentally happened in my career. We have a chance to get a little bit ahead here.
B
So Ed, just to help amplify this for our listeners, we all know He's a household name in our industry. We all know who the grateful Venables is, but we have, we know some listeners who aren't full time cybersecurity professionals. We believe most of them are serving community orders and this is part of their judgment undated sentence that they have to listen to us. But for people not steeped in cybersecurity. Tell us about who Phil is and also we'll put his comments, we'll link that to them in the show notes and then I'd love to bring in Gaddy.
E
I've been following Phil for a couple of decades and he's sort of a CISO's CISO. Yeah. Very well steeped in financial services CISO. Kind of where I think he cut his teeth years ago. He is very much influenced my thinking over the years. He's certainly one worth following on X and his other postings and blog articles. But he's where a lot of very serious CISOs look to for guidance and influence of the industry.
A
Great.
B
Gary. We're in an optimistic mood now with Ed speaking for himself and quoting Phil. What's your take on the whole what's next?
D
So who the hell knows, man? Yeah, but now that, now that I said that I have opinions, right? I mean science fiction and I can go as far as say there are levels of science fiction to reality. Six months ago people would have disagreed on other things but I can now comfortably say what? Six months ago people would have looked at me very oddly instead of oddly. For example, are we going to be able to wait for vendors with their binary not code to patch their problems or find their problems for them before we do it on our own. But before I go that far, two points really that matter. I love that we went through the people element because I think that's our only moat. There is no moat. Code is nothing. Everybody can develop whatever they want. So Skynet. I choose to believe as a belief system Skynet won't happen. I can't do anything about it if it will. But I can be around people. I can take care of community. And looking at how they jobs become redundant, not might become redundant. It's no longer 2025 where we thought I will not be replaced by AI. I will replaced by a human using AI. I think some of us will be. My fiance is a vulnerability researcher and a pretty good one. She's looking at should I become a washing machine technician or something like. I'm not sure many vulnerability researchers will still be here. So looking at people, looking at how we Implement this in the future is critical, but also strategically. When we are now as CISOs, keen off what you mentioned about Phil, who is also an author of this paper, are we, for example, we have communicated to the board and the risk committee and the CIO and the chief legal officer and finance officer that we are at a certain risk level at red, yellow and green. A lot of the reds are now yellows. A lot of the yellows are now greens. A lot of the greens are now yellows. This has shifted drastically. It's outdated. We need to consider what our risk metrics are, both to communicate with the board, maybe even with shareholders, in my reports to nasdaq. So a lot needs to be considered about communication among stakeholders. And the second thing is government, governance wise, we have to move faster. We have to be able to bring on vendors, new technologies in a decent way, but also think about our organizations. Because while we want to be positive, we need to understand this is just the first wave. And as we build resiliency for this wave, let's make sure we build security programs that are resilient to the next one.
B
That's fantastic, Ganim. Thank you.
D
I get that a lot. Thank you.
B
It is fantastic because that nuance is what we need. This is really complicated. I loved your first answer. Who the hell knows? Because anybody who predicts with confidence where
D
this is going, I'll cut you off and double down the complication.
C
Fine.
D
But how are we supposed to give advice when we tell people, for example, patch faster, when our advice now is often wait on your patching, cool them down to avoid supply chain issues. Right. This is complex. And we came up with this paper. Actually, everybody here contributed the paper.
B
Yeah.
D
And we need to realize the industry came together, 250 people, CISOs, others wrote this together to be ready with information, to educate, to have an external document somebody can put on the table, say the industry says so, not just me, to be able to establish some guidelines for tomorrow morning so we can build our programs. But this is just the start. We need to go for this mode, the community to move forward and adjust.
B
So this is perfect. You should actually host the show. You're much better at it than I am. Because this pivots us perfectly to, I think in the US used to call it the $64 million question. But in AI terms, that'll buy you about half an hour on some model or something. So let's call it the $64 trillion question and I'm going to turn it over to Rob. So Ganny set out beautifully how the community came together. You're central to this. And it says a lot of stuff we've already talked about. We don't need to go into the diagnosis of what these things can do. There's some great stuff about what organizations need to do to prepare for this wave. Summarize it for us, Rob.
C
I think one of the things, you know, going back to both what Ed and Gadi referring to organizations need to take a look at, this is a people thing. You have a great team. You're going to be working through a significant challenge over the next six to 12 months. But I also lean back to what was just proven to us. Something I alluded to in, you know, talks I've given over the past couple months. Is that a very optimistic way of looking at this is that everyone is talking about asymmetry between the attackers and the defenders. And it exists, it will exist. However, we do have a structural advantage over the attackers, which there's a hell of a lot more of us than there are of them. And what we've shown over the past week is when you have a call to action, you know, this is where, you know, with what you just said, what's your organizational look to all, it's the community, is that we can come together and do great things and come up with great ideas. And the thinking, using our structural advantage, I think, is going to empower us and enable us much more than before. And that is, you know, from coming up with new ideas and developing them. My theory is that we can, quote, unquote, open claw solutions by having a lot of us go hands on hand board, rapidly produce things that will potentially get in front of some of these offensive capabilities as quick as they're able to develop them. These teams are developed in stovetop environments. You don't really want to share your capabilities. I've worked in one, you know, limited number of people that you would expose to the really good onslaught that you developed on the defender side. As soon as we get, you know, a little hint of what they're doing, there could be a call to action across the community. And that is something that daddy, you know, herding cats or whatever you want to call it, demonstrably shown over the past week and it should not go unlooked. To me, it's the path of the future. We as a community can do this. And it's going to take a lot of smart humans. AI is not going to coordinate to all the agents together to figure this out. Smart people, community, structural advantage. That's the path Forward well on that
A
wonderful trio of staccato sentences there. I know we're charging through time here and this feels like a topic we could probably spend or many days on, honestly with this group. But maybe Let me give the last question here to Ed. You said something interesting before that I think gels with what our other guests are sharing. There's going to be a lot of change, a lot of disruption, and that could lead to some real pain for the cybersecurity community in the short term. Actually quite worried about some of the misinterpretation from hype causing some of that. And then you have reasons to be optimistic in the long term. I also agree with that view, having seen similar versions of this movie before. If you were advising security leaders right now on the things they should do in the short term to try to make the long term better, what types of things would you suggest they do? Aside from obviously reading the paper we keep referencing?
E
Yeah, I think there's three areas to focus on. I mean there's more, but you know, you can't really focus on more than three things. One is this whole vulnerability discovery, patching, et cetera and optimizing that as they're calling it vulnops, and looking at your vulnerability discovery and patching systems today and figuring out how you can radically improve. And I think as a community we have to flesh out what this phone ops really means and do it quickly. Gaddy, no pressure. We got to get together and figure that out. The second thing is limiting the blast radius to segment your network where you can, to do detection quicker where you can, so that if something does get owned with a zero day, it doesn't hurt your entire environment. That's, you know, nothing fundamentally new. You should have been doing that for while, but if you haven't gotten around to it, now absolutely is the time. And then the third thing is on the incident response side of things, doing tabletop exercises to make sure you're ready for a single incident. But we're recommending now to folks, is what if you had two or three simultaneous incidents of different kinds? What would you do with that? Are you prepared to even contemplate that? Those are three things in the order that I think you should focus on them in. And I'm sure there's something we could debate about all of that. But. But those are the three things that really bubble up to the top for me in what I see in my enterprise customers. And all of them are covered in more detail in that cloud security alliance, sans unprompted paper we've been talking about.
A
Love it.
D
If I may.
A
Please do.
D
Here is my call to action. Question. Challenge to people and then to the community. And maybe sans can lead that forward if somebody can figure it out. The first one is for ourselves. Many people feel outmoded. Many cisos feel outmoded. We feel old. We feel never catch up. I think that's bs. We can. All we need is English. Kick yourself in the bum. Go download a coding agent right now. It's not about coding. You don't need to know code, cursor, cloud code, use it, ask it a question, do one task each day different with it. Learn it. It will empower you. That's my challenge to you. Do it now, please. And take somebody with you for the journey. Don't leave people behind. And the second thing is speaking about people. People are being left behind right now not just because disappearing jobs but rather because the disappearing junior and sons in an expert in training this industry. How do we allow people to come in? What do they need to do? What do they need to know? How do they find a job? We were already an aging industry before this. It was so hard to find a job before this. So that's kind of a challenge I would like to leave out there that I think matters a lot. Please go kick yourself in the bum and start now. Download an agent and try talk to it about something different every day. And two, let's think about the future of who is in the industry.
B
That sounded like a takeaway to me. But you've already had two goes at the final question. James. Neither of them were because I going to ask the final question. And there is a twist. We have three guests.
A
That's right. So three guests, three takeaways, right?
B
Nope, not going to do that. Three takeaways is too many. G said it's always three, but he was whittling it down to two and you put on weight if you have three takeaways, we all know that.
A
Well, I do like three takeaways, I must admit. But what are we going to do instead then, Kieran?
B
That's easy. You're going to do it. Punishment for all those times you've hid our poor guests. They're exhausted. At the end of the long recording, they've said everything you want to say and you just say to them, oh, go for it. And describe everything in 30 seconds. So James James Line CEO, SA Gans Institute sir, this has been described as the most important event in cybersecurity for years. Organizations have to act. Tell Them everything they need to know. You have 30 seconds. Go.
A
Oh, that is just so brutal. I understand why our guests hate and love this now and I feel like daddy just did such a wonderful job of it, but. Okay, all right. Okay, Here we go. 30 seconds. Let me comment in macro here for security leaders. There's going to be change, there's going to be overhyping. Don't worry about that. Like you're hearing from world class practitioners here that there is something, something real here. There's short term, substantial risk and long term reasons to be optimistic. So go read the paper. It's in the show Notes. Watch Ed and his team's wonderful webcast doing a live demonstration of the actual practical utility of these things. We're talking about the dynamics of cybersecurity. They're changing in front of us. We're going to have whole new areas like Volnox. Roles are going to compress, they're going to expand, skills are going to change and if we throw our hands in the air, declare BS and ignore it, we're going to be weaker for it. So make sure you make time for your teams to spend real time here and learn. Just like Gary said, most security teams are running at 100% capacity or more. They're not getting to do enough here. This isn't the end of cyber security. It's not solved in time to go home. It's not Armageddon. So shooting down the middle, we've got to be ready. And as leaders, you need to make space for your team to be ready to keep upskilling and following this for more developments. It's going to move fast, so bet on people and ignore AI cybersecurity impact at your.
B
Well, that was wonderful. The zero day clock may be collapsing. Your 30 second stopwatch is going in the opposite direction. But I'll let you off because that was genuinely excellent. But I think we have to thank Gaddy, we have to thank Rob, we have to thank Ed. Absolutely wonderful guests, thank you all.
D
And thanks to Sans for stepping in, helping with this document last minute, taking the risk to work on this over the weekend. All of you were writers and authors of this and supported us. Kieran, James, Ed, of course, Rob, who stepped in and did all the work. Thank you.
B
Absolute pleasure. Thank you.
A
Thank you for herding those cats. Although I did want to make a cataclysm joke at some point here. I'm sure we can use AI to generate a new graphic there, but you guys have been wonderful and I suspect we'll be back this is not going to be the last discussion on this topic, but I do believe we've been very useful to security leaders today, don't you think? Kieran?
B
Well, I hope so. So if you did find this useful, please leave us a rethink wherever you got this podcast people who understand modern communication technology tells us that that sort of thing helps, especially if it's a good rating. And if you have any suggestions more prosaically longer form feedback or follow ups on our show, you can email us on cyber leaders podcastans.org and with that,
A
thank you very much for listening.
B
Thank you for listening. Keep cybering in the AI world For
A
me, Kieran Martin, and me, James Line. It's goodbye and avoid the cataclysm.
Podcast: Cyber Leaders
Host: SANS Institute
Episode: Weathering the AI Vulnerability Storm with Gadi Evron, Rob Lee and Ed Skoudis
Date: April 24, 2026
This special edition episode addresses the recent alarming developments in artificial intelligence (AI) and cybersecurity, particularly surrounding the delayed release of Anthropic’s Claude Mythos model due to its extraordinary hacking capabilities. The panel—comprised of leading cyber experts—unpacks what this means for the security community, the practical and philosophical implications for defense, and actionable strategies for security leaders as AI-powered vulnerability discovery reaches new heights.
Anthropic's Decision
On April 8, Anthropic announced a delay in releasing their new large language model, Claude Mythos. The reason wasn’t technical or business—as is typical—but based on cybersecurity risk: the model was just "too good at hacking" (01:11).
Full Attack Chain Capability
The model doesn’t just find vulnerabilities; it can execute sophisticated, full-spectrum attack chains—including exploitation (02:58).
Project Glasswing
Anthropic created a privilege-access consortium (“Project Glasswing”), working with select large companies to develop mitigations (03:37).
Other Models
OpenAI delayed their own model for security review. Chinese AI models are expected to follow but with uncertain risk controls (03:56).
Independent Review
The UK’s AI Security Institute provided a notably balanced, independent analysis—confirming Mythos' exceptional hacking skills while highlighting the need for more testing against defenses (05:28).
Hype vs. Reality
Some voices call this overblown marketing, others a real paradigm shift. The consensus: it’s both an opportunity to fix legacy vulnerabilities and a call to action against accelerating threats (06:30).
Defender Mobilization
SANS and the Cloud Security Alliance collaborated rapidly on a joint paper, “AI Vulnerability Storm,” providing actionable guidance to practitioners (07:22, 08:00).
The "Mythos Problem"
The issue isn’t that AI can find vulnerabilities; it’s the scale, speed, and public awareness marked by Mythos (09:44).
Industry Hype & Real Stakes
Acknowledges hype but stresses the urgent need for new defense paradigms and a re-examination of assumptions.
Current AI Tools Already Powerful
AI-augmented pen tests have already outperformed years of manual efforts (11:16).
Call to Action: “Drain the Swamp”
Organizations should use existing AI to fix vulnerabilities before more powerful models (like Mythos) become widespread.
Optimism and “VulnOps”
Introduces the concept of “Vulnerability Operations” (VulnOps): proactively operationalizing continuous vulnerability discovery and remediation (16:20).
“Apocalyptomist” Mindset
Recognizes the risks but sees an inflection point to make future code dramatically more secure.
Workflow Inversion
AI-centric workflows are fundamentally changing traditional approaches (e.g., starting analysis with AI on source code before human review).
“VulnOps” as a Discipline
The rise of “VulnOps” could parallel past evolutions like “SecOps,” establishing a new standard in cyber operations (31:15).
Boards and Stakeholders Need Updated Risk Metrics
Old red/yellow/green risk levels may be outdated due to the rapid change in vulnerability discovery, and comms with boards and regulators must be rethought (35:18).
Community Action
Collaboration and speed matter: the rapid assembly of 250 experts for the AI Vulnerability Storm paper is proof (37:53, 38:49).
Upgrade Vulnerability Management
Limit Blast Radius
Prepare for Parallel Incidents
Quote (Ed):
“These are three things in the order I think you should focus on… All covered in more detail in the Cloud Security Alliance/SANS paper.” (41:37)
Personal Challenge (Gadi):
“Kick yourself in the bum. Go download a coding agent right now… Do one task each day different with it. Learn it. It will empower you.” (43:04)
Community Challenge:
“Let’s think about the future of who is in the industry. How do we let new people in?” (44:10)
“It’s a big bug deal. A new T-shirt—and maybe that’s going to be the title of our podcast today.”
— Kieran Martin (04:20)
“Apocalyptomist: You believe in the apocalypse, but you’re also optimistic.”
— Rob Lee (17:15)
“Kick yourself in the bum. Go download a coding agent right now… Do one task each day different with it.”
— Gadi Evron (43:04)
“This is really complicated. I loved your first answer, ‘Who the hell knows?’”
— Kieran Martin (37:28)
“If you’re just sprinkling AI on an existing workflow, maybe that’ll help. But I don’t find that interesting. I want to see AI workflows built from the ground up.”
— Ed Skoudis (28:46)
Key Takeaways:
Action for Security Leaders:
Memorable Final Words:
“As leaders, you need to make space for your team to be ready, to keep upskilling and following this for more developments. Bet on people and ignore AI cybersecurity impact at your peril.”
— James Line (44:58)
For further resources and to watch Ed’s demonstration webcast, see the show notes.