Loading summary
Patrick Gray
Foreign and welcome to this soapbox edition of the Risky Business podcast. My name is Patrick Gray. For those of you who are unfamiliar, these soapbox editions of the show are wholly sponsored. And that means everyone you hear in one of these podcasts paid to be here. And today we're speaking with Ed Wu, who is the founder of a company called Dropzone. Dropzone makes a really interesting AI platform that you can deploy into your SOC that basically acts as a tier one SOC analyst, Right? And it works really well. I also should disclose at this point that I'm an advisor to DropZone, which means I have an extra vested interest in them doing well. But yeah, I mean, I regularly meet with today's guest, Ed Wu, and talk to him about all manner of stuff. And I can promise you he's a really sharp guy who understands this problem, problem space very, very well and has been in it longer than most. In fact, before he was a founder of Drop Zone, he worked at Extra Hop Networks where he was part of the team, or I think led the team that took Extra Hop platform and took it from being a network oriented product into being a security oriented product. And if you want to see, like, how happy they were with his work when he was at Extra Hop, one of the founders. Well, I'm sorry, one of the investors in Drop Zone is actually one of the founders of Extra Hop. So, you know, that's a solid endorsement there. Ed, thank you for joining me. I thought today what we could really talk about is not just about Dropzone and what it does in the SoC. Obviously we'll touch on that. But I wanted to talk about the use of AI in cybersecurity more generally. What it's good for, what it's not good for. But let's start with the SoC, right? Because I think it's one area where not only is the use case clear, but people are already using it in the SoC. And not just drop zone, like AI when it comes to, like processing logs, looking at alerts, things like that, triaging. I mean, people are using LLMs in a lot of socks already. Do you think that's a fair statement?
Ed Wu
Yeah, yeah, absolutely. To best answer this, I think actually using cursor or AI coding tools, I think that's like a very good analogy. So a lot of us might remember a couple years ago where if you are using ChatGPT to help you write code, you get laughed at. Because the consensus back then is if you are using ChatGPT to write code, vibe code, you are just creating more bugs that will end up costing you more time than actually if you would have to have done this yourself. But now fast forward to today. I think it's pretty clear every single hand head of engineering, every single CTO is strongly advocating developers to use AI coding tools. Whether it's cursor, whether it's GitHub, Copilot, and I think a lot of this is ultimately with any new technologies there's always hesitation and skepticism. But over time as the early adopters start to see return, so words get spread out and then rest of the community start to pick up. All these success stories with AI in soc specifically, I think two years ago, probably around this time, I remember the RSAC two years ago where Microsoft just launched Security Copilot and all it was was a chatbot. You can ask it to enrich a particular IP address, you could ask it to summarize a particular log line, but that's pretty much it. But yeah, I would say in the last two years the technology has matured extensively where there are a number of organizations using AI agents within SOC in production and as they see more know actual real world impact, see words get spread across the community and I think nowadays the percentage of people who are skeptical of the technology has, has dramatically decreased compared to even a year ago.
Patrick Gray
I'm wondering though, like to what degree people feel comfortable using it right in a SOC context. Because as you point out, you know, stuff like Copilot, stuff like cursor, like that is just workaday now, right? Everybody kind of uses it, but they can dial up and dial down like where they want to use it because it's like it's one of those sort of tools, right, where you use it in a development environment. You can just say, well I want to use it here, but this bit I'll do manually. You know, SOC work is really sort of workflow based, right? So I'm guessing, you know, it's a little bit different in that you have to think ahead of like, well, where do I want to, to use an LLM to do this and where do I want it to step back and kick it to a human? Like is that part of the whole question of how this stuff is winding up in the SOC at the moment?
Ed Wu
Yeah, yeah it is. You're absolutely right. Like with coding copilots to some extent, every time a developer is working on a project, they are making a decision, a two way decision, whether they want Cursor to give it a try first or they should just wing it themselves. So they are kind of making this decision like whether I delegate this to cursor to take a first step or I just do this myself manually. But we saw specifically most of the time what we have seen is the human analysts are not looking at each alert and making a dynamic decision. Oh, for this alert I want to delegate it to Dropzone.
Patrick Gray
Well, but I mean that's the problem you're trying to solve, right? Which is there's too many alerts. So trying to, if you're actually in a position where you have to decide which alert you want to AI triage, that's kind of useless, right?
Ed Wu
Yeah, absolutely. And that's kind of why, at least from what we have seen, the chatbots, the security chatbots of the world has not been tremendously successful. Because again, the challenge is there are so much to do in security. If you have to micromanage a chatbot and tell it exactly what to do, like every 30 seconds, then that kind of more or less defeat the purpose. And this is where for AI agents one of the most common way is to treat it as a new tier one. So feed all your alerts to an AI agent, the agent will perform the investigations, it will dismiss or close the false positives and then only escalate the suspicious or the malicious alerts. So that's kind of the most common workflow like deployment model we have seen, which is leveraging AI agents as a new tier one. Or you can say the AI filter or the AI meat shield that shields the rest of the team from the vast majority of the noise. It is interesting that most security products historically really focus on true positives. When you look at the detection product, most of them are showing you how they were able to detect a five step sophisticated kind of apt attacks. But in reality for AI soc agents the biggest value proposition is not detecting true positives or sophisticated multi month multi hop intrusions. But instead the biggest value proposition is on reducing false positives. Because by the virtue of removing hay from the haystack it makes finding the needles much easier.
Patrick Gray
Now I bet already some people are listening to this and saying, well hold on buddy, because what happens if you start dismissing true positives and flagging them as false positives? And that's always going to be the concern when you're looking at plugging in an AI model and trusting it to sit at the top of your detection stack and you give it authority to dismiss stuff like how can you assuage fears that there is some genuine attack going on and the model just doesn't know about it, doesn't think it's a big deal and just gets rid of it. Because I'm guessing that's a huge barrier when you're trying to sell into a new place is convincing them that it's actually accurate enough that it's not going to give you a bad result there. How can you assuage those fears?
Ed Wu
Yeah, there's definitely a couple components. First and foremost, AI soc agent vendors are including us all prioritize minimizing false negatives. Meaning when we say a security Alert is benign, 99.9% of the time, it is actually benign. And this is where I will be transparent and frank at this moment, looking at the technology, there will always be a degree of hallucination. There is no way to completely remove all hallucinations from the large language models. And any vendor who claim they have figured out the magic way to remove all hallucinations, they should be acquired by OpenAI for like $20 billion. Because I'm sure OpenAI and Google would love to know the magic sauce to remove all hallucinations.
Patrick Gray
Yeah. So this is not a little problem that a security startup is going to fix. This is a fundamental large language model issue.
Ed Wu
Correct. But what security startups can do is build processes, systems and engineers in modules in a way where the level of hallucination is controllable and manageable. And this is where you ask like, hey, how can I trust an AI SOC agent to not make mistakes? And our perspective is an AI SOC agent will make mistakes, but it's not about whether it will make mistakes or not, but it's more about the probability of making mistakes. And this is where I was introduced to a concept recently that talks about the trade off between leverage and uncertainty. So some of us who have been like a manager or business owner or tech lead are very familiar with this concept, which is sometimes you are given a project and then you might have somebody else working for you, and then you are doing this mental calculus in your head, how long does it take me to do it? How long would it take my employee to do it? And how much can I trust my employee on doing the right thing or solving this problem in the same way that I want it to be solved? And anytime I think, whether it's delegating tasks to another human or delegating tasks to an AI agent, there's always this trade off. Anytime you want to increase leverage, you're kind of sacrificing uncertainty or increasing uncertainty and increasing potential errors. So from our perspective, our goal is to build a system, and we have already achieved it. Consistently, that is at or above the accuracy compared to a typical human tier one security analyst.
Patrick Gray
Yeah, I mean you can benchmark this, right? Because socks are well logged. Right. Like decisions are well recorded. So you can actually benchmark an LLM based product against people.
Ed Wu
Absolutely. And some of our especially MSSP or MDR service providers, when they were poc, our technology, we often get put into like a Bake Off. So the service provider will gather 100 security alerts, they will run 100 through our system, and then they will build a spreadsheet. One column is what dropzone has found, the other column is what they're, their team has found. And we will compare and contrast. And I will definitely tell you that oftentimes when you run through exercises like these, the first thing you notice is even different members of the team might mark the same alert in different ways because there's always a difference in opinion. But even beyond that, the accuracy of an AI like our AI Soc analyst is definitely on par, if not sometimes meaningfully better than the human team members.
Patrick Gray
Now you just touched on something interesting there, which is that you, you know, you have to manage an LLM or have expectations around an LLM similarly to how you would have expectations of a human staff member. What I'm seeing, you know, I'm seeing some interesting stuff in AI around multi, multi agent sort of deployments where, where you almost have an AI that can play that role of being a supervisor to the core LLM that's doing most of the work. I mean, is that something that you've played with as well at dropzone, which is having a supervisory model observing your sort of log processing and investigation model? And can you even have multiple models doing the investigations and then you can evaluate, like if there is some sort of disagreement between them, you might want to kick that up to a human. So I guess my question is, you know, what's the role of sort of multi agent in a tech stack like this?
Ed Wu
Yeah, so kind of similar to kind of how we operate as humans. I think sometimes we feel like there are multiple voices in our head. Right. As a father, I should prioritize X over Y. As an entrepreneur I should prioritize Z over X. Right, Stuff like that. So yes, absolutely. And what we have seen with large language models is giving them different Personas really helps them to specialize. And by doing that you are able to build more complex end to end workflows that you couldn't have with a single Persona. So we definitely leverage what we call multi Personas within our system that's specialized, each being specialized in a specific function and things like self reflection, which is you ask model to do one thing and then you ask it or another module to critique itself. Is a very common technique to increase the accuracy of the output of specific functions. One very common example is, for example, you want a large language model to generate an, an SPL query. So a splunk query, the model might generate something and that query might or might not work. And a very common technique to improve the accuracy of that query is use another module to nitpick the query generated by the first module to spot kind of mistakes. Hey, you misspelled user Instead of username, it should be user space name, for example, as a field and stuff like that. Very similar to, I think most of us when we were in schools, when we are taking exams, especially math exams, I think most of us when we complete all the questions, we will go back and revisit our answers like again critiquing ourselves. So this kind of self reflection is definitely a very common technique. And then using multimodal, like different prompts, different temperatures, different models to generate the same output and then compare and contrast. It's kind of a little bit like voting, like when you ask three people about a certain topic and you pick kind of the most agreed upon answer that is going to further boost the accuracy of the outputs.
Patrick Gray
Yeah, right. So this is absolutely a thing that's happening because I mean a friend of mine, he went to some Microsoft demo which he was blown away by where they got it to build like a Scrabble game or something. But it was the multi model part that was incredible to him where there's like a, you know, there's like a model that's a project manager that deals with the other models and you know, yells at them when they get stuff wrong. And he just said it was incredible watching all these little AIs going off and doing stuff. I mean, are you currently doing this? Are you with the multiple model approach?
Ed Wu
Yeah, absolutely. To give you another example, alert investigation. It's a little bit like being a detective. You can kind of technically go on forever. You can investigate to the nth degree. So one module we have is kind of like an accountant where it's keeping track of the progress made by the investigator components and trying to identify when the marginal utility of additional CPU cycles or additional time spent on this Alert.
Patrick Gray
Yeah. At the point that it's analyzing 1 gigabyte crash dumps, it might be time to tell it to chill out. Right?
Ed Wu
Yeah. Or looking at IPs associated with another username, associated with another IP that might correlate to alert. Again, a lot of these, after a certain point there is decreasing marginal utility.
Patrick Gray
Yeah, no, that makes a lot of sense. So when we start looking outside the SoC. Right. Which I know is not what you do. You know, obviously I'm working now with Decibel, which is one of the backers of your company. Right. And you know, everybody's all looking for ways to invest in AI companies that are doing interesting things. I think it's got some applicability pretty much everywhere. I think the clearest use case, you know, day one, is the SoC stuff. It's the type of stuff that you're doing. But. But obviously as someone who is running an AI startup, you've got your finger on the pulse, I'm guessing, of where people are making progress in other areas of cybersecurity. Where do you see the exciting stuff happening there?
Ed Wu
Yeah, from what I've seen is obviously there are different ways to prioritize different chunk of tasks. But from our perspective, what we have seen is most people are prioritizing the work that's the most manual as well as highest quantity. Because if we were to build a module or product that automates stuff, you might as well start with the most laborious and the highest quantity tasks within the security program. So we have seen definitely pen testing. That's one where throwing spaghetti at the wall is not the most fun thing, you know, or being a manual fuzzer is not the most fun thing that somebody could do. That's one see other one. We have seen a lot of success so far is in code reviews again. Code review. I don't think any of us wake up in the morning and gets excited about reviewing code. And this is where like. But at the end of the day, for any fast growing application or business, there are a lot of code commits that will love or benefit from security reviews.
Patrick Gray
Man. I got a friend who has just played around with some generic models and figured out how to prompt them in such a way that he thinks it's the end of the SAST industry. And he says there's no moat. So it's not something he's going to turn into a startup because it's all done with commodity models. And he's like, if you know what you're doing, you, you could throw code into them and just all the bugs fall out. It's coming. That's definitely coming.
Ed Wu
Yeah, like code analysis. I did my PhD in program analysis, so definitely spent a lot of Times in a previous life looking at code, looking at syntax trees, basic blocks and stuff like that. Yeah, large language models are very good at understanding code. I do think there are still challenges, especially where the code base is very large. So like if you have like a hundred line Python script, I would not be surprised if ChatGPT as is already does a tremendous job of spotting the issues. But when you have a more complex code base with complex interactions with internal libraries or proprietary libraries or APIs.
Patrick Gray
Yeah, a million dependencies and dependencies on dependencies and yeah, you're just going to run out of space, aren't you?
Ed Wu
Yeah. And also this requires the model to really understand the different, the context of your code. And this is where even in soc, what we have seen is initially most of the AI SOC startups like us focus first on building integrations, but we are getting to a place where most of the integrations are already built. And what we have seen is the difference between a mature product and immature product now moves on to ability to build context. Because a mature AI SOC analyst will be able to come into your environment through a combination of integrations and other means, really understand your organizational policies, preferences and practices versus a naive or immature AI SOC analyst or AI SOC product will come in and be like, okay, I marked this alert as malicious because I saw it as malicious. Even though the company might have a policy saying this kind of logging activity is actually expected.
Patrick Gray
I mean, it's probably worth pointing out too that one of the issues that you've had running this business is I think some people expect AI magic to fix their problems when they just have a terrible detection stack. Right. So you go in there and the source data is patchy, like really patchy, so your agent can't collect the context it needs to make decisions and whatnot. So just to be clear, like an AI SoC analyst is only going to work well when you've got a detection stack that's pulling in the right information to begin with. I mean, people are, you know, some people expect a little bit too much, Right. Which is that an AI agent is going to be able to infer things without actually collecting good context.
Ed Wu
Yeah, yeah, I would say we have run into a number of cases. For example, our technology is asked to investigate AWS alerts when there are no AWS logs at all, either in AWS itself or within their siem. So obviously in that case, it's technically impossible to investigate those alerts if there are no logs at all. So yes, like an AI SOC agent is not going to fix the visibility problem. If you don't have logs in certain parts of your business, then an AI agent is not going to be able to fix it for you. With regards to patchy detections, we have seen cases like for example, within our product when we see the same false positive happening over and over and over again, our technology will propose recommendations like tweaks on the detection rules to help tone down the noise. So I would say that's actually a.
Patrick Gray
Little bit easier to solve the opposite problem. Yeah, yeah, yeah.
Ed Wu
Than like trying to, you know, ask is the opposite problem, which is you are asked to cook a dish when you don't have any hard ingredients.
Patrick Gray
Yeah. Now look, another thing I wanted to ask you about and it's been, you know, quite the thing on social media over the last week, is this paper that was written, I think by an Apple Internet looking at large reasoning models and about how they're not actually, they don't really appear to be more accurate than large language models when asked to do reasoning tasks. And in fact when tasks get to a certain level of complexity, both LLMs and LRMs are not all that useful. Right. Which I don't quite understand why people are so surprised by this because when we see where the Wins are with LLMs, it's the stuff that you're talking about, like high volume kind of menial stuff that nobody wants to do that's sort of semi repetitive and requires diligence, you know, I mean a lot of the reason people miss sock alerts is because sitting in front of a sim console all day is boring and mind numbing. And this isn't a problem experienced by computers. Like, it just isn't. Yeah, but I wanted to ask you what you made of that paper. Like was there anything in there that was surprising to you? Anything you agree with or disagree with as someone who's using these sorts of models?
Ed Wu
Yeah, I think there are different ways to. That was definitely an interesting paper. Some say Apple is just jealous of kind of being a little bit left behind by everybody else. But yeah, I think from our perspective, part of the art of using large language models is task decomposition. And what I mean by that is similar to asking a single person to build a business. That will be very difficult. But most modern projects, whether it's a modern business or Manhattan Project, involves a large number of different type of specialists doing their special thing but working in unison to really achieve a very complex end to end or solve a complex problem end to end. So generally if you expect a single large language model invocation to be able to perform very complex tasks. I think that's kind of misaligned expectation. Most of the large language model or AI agent developers like us are decomposing complex tasks into small cognitive steps. Each of them, frankly should be trivially solvable by a middle schooler. So, for example, when our AI SoC agent is looking at an alert and trying to make sense of this alert and investigate it, on average, our system makes close to 100 distinct large language model invocations. Again, by breaking down alert investigation into small cognitive steps.
Patrick Gray
Yeah, I mean, it's interesting when you said pen testing earlier, like as something that's ripe for sort of disruption with LLMs, I know that there's a lot of pen testers who would wince at that and say, no, that's not possible. And look, I mean, I think to a degree they're right. Like real elite level sort of pen testing is going to require that pen tester brain, which is a rare type of brain. But there's so much of the pen testing workflow where the tricky part is understanding which steps to do next and why. But the steps themselves are actually quite simple. So I think, you know, I think we might wind up in a situation where a lot of the cool technology work is actually teaching the LLMs how to do certain things. I can see that as being something that if you're a pen tester, you might teach a model. Hey, there's this type of check that I figured out how to do. You teach the model how to do it and then when you actually want to get around to doing the check, it's just as simple as asking the model to do it. And then of course, with these multimodal approaches, you might be able to have models which will understand better which checks you want to apply in which contexts and whatnot. So, but I think you're right. It's about breaking those things down, isn't it? Into those simple steps and just thinking about those problems in terms of I have an army of middle schoolers who will do whatever I want at basically infinite speed. Like, how can I instruct these 14 year olds on how to do stuff? Is that, that's kind of the way I think about it. Is it the way you think about it as well?
Ed Wu
Yeah, yeah, absolutely. I think a lot of people use phrases like force, multiplication, or up leveling. One analogy we generally use is we want to uplevel human security engineers and human security analysts to be like the generals and special forces where they have an army of AI middle schoolers or AI foot soldiers that's kind of listening to their commands and doing whatever they instructed. And this is also where one thing we have seen as we work with different organizations of different sizes and maturity is actually making sure the AI agent is coachable. Like listening to instructions is quite important.
Patrick Gray
You and I have talked about that before because you actually had to do quite a lot of work there to get that coachability into the models that you're using.
Ed Wu
Yeah. And I also think it's a very key component of this trust building. I think I use analogy like everybody has experience working with smart jerks that are very stubborn and do not take any inputs or feedback or suggestions from team members. But I think all of us probably also have experience working with somebody who's junior but tremendously coachable. And after a couple of months, that junior person is actually outperforming somebody who is more senior because they are so coachable and they are absorbing everything, you know, you taught them. And we are kind of seeing something similar within the AI SoC agent space where there are, you know, every environment is different and sometimes a very coachable AI SOC agent can kind of actually become significantly more valuable to an organization than maybe a smarter, out of the box agent. But that's very stubborn.
Patrick Gray
Yeah, yeah. No, I mean, it's. I think we're actually at the fun part from my perspective when it comes to AI because we've got a better understanding of what it's useful for. And of course that's going to change. Right. But yeah, we're getting a better idea of how to use it, what it's good at, what it's not so good at yet. Edwoo, we're going to wrap it up there. Always a pleasure to chat to you, my friend. And pick your brain on this stuff. We learn a lot. So thanks a lot for your time and I'll be chatting to you again soon.
Ed Wu
Thank you for having me.
Risky Business Podcast Summary
Title: Soap Box: AI has entered the SOC, and it ain't going anywhere
Host: Patrick Gray
Guest: Ed Wu, Founder of Dropzone
Release Date: June 16, 2025
In this special soapbox edition of the Risky Business podcast, host Patrick Gray welcomes Ed Wu, the founder of Dropzone. Dropzone offers an innovative AI platform designed to function as a Tier One Security Operations Center (SOC) analyst. Patrick discloses his role as an advisor to Dropzone, highlighting his vested interest and longstanding collaboration with Ed Wu. Ed’s expertise stems from his tenure at Extra Hop Networks, where he played a pivotal role in transforming the platform from a network-oriented product to a security-focused solution. Patrick emphasizes Ed’s credibility by noting the endorsement from one of Extra Hop’s founders, who is now an investor in Dropzone.
Patrick Gray [00:00]: "Ed, thank you for joining me. I thought today what we could really talk about is not just about Dropzone and what it does in the SoC. Obviously, we'll touch on that. But I wanted to talk about the use of AI in cybersecurity more generally."
Patrick initiates the conversation by addressing the integration of AI within SOCs, noting that AI tools like Large Language Models (LLMs) are increasingly being adopted for tasks such as log processing, alert analysis, and triaging.
Patrick Gray [00:00 - 02:09]: "Do you think that's a fair statement?"
Ed Wu agrees, drawing parallels between the evolving acceptance of AI in software development and its growing role in cybersecurity operations.
Ed Wu [02:09]: "With any new technologies, there's always hesitation and skepticism. But over time as the early adopters start to see return, words get spread out and then the rest of the community start to pick up."
He references Microsoft’s Security Copilot as an early example and observes significant maturation in AI technology over the past two years, leading to widespread production use in SOCs.
Patrick raises a critical concern regarding the delegation of alert handling to AI agents: the risk of AI misclassifying true positives as false positives.
Patrick Gray [07:56]: "Now I bet already some people are listening to this and saying, well hold on buddy...how can you assuage those fears?"
Ed acknowledges the inherent limitations of LLMs, particularly the issue of hallucinations, where AI might inaccurately dismiss genuine threats.
Ed Wu [08:41]: "There will always be a degree of hallucination. There is no way to completely remove all hallucinations from the large language models."
Ed emphasizes Dropzone’s commitment to minimizing false negatives, ensuring that benign alerts are accurately identified while acknowledging the impossibility of completely eliminating errors.
Ed Wu [09:34]: "Our goal is to build a system...at or above the accuracy compared to a typical human tier one security analyst."
Patrick highlights the importance of benchmarking AI performance against human analysts, to which Ed confirms Dropzone’s participation in comparative evaluations, often demonstrating parity or superiority in accuracy.
Ed Wu [11:49]: "When you run through exercises like these, the accuracy of an AI like our AI SOC analyst is definitely on par, if not sometimes meaningfully better than the human team members."
Patrick explores the concept of multi-agent AI systems, where supervisory models oversee investigative models to enhance accuracy and reliability.
Patrick Gray [12:51]: "...where you almost have an AI that can play that role of being a supervisor to the core LLM that's doing most of the work."
Ed elaborates on Dropzone’s use of multiple specialized personas within their AI system, enabling complex workflows and self-reflection to improve task accuracy.
Ed Wu [13:49]: "Using multimodal, like different prompts, different temperatures, different models to generate the same output and then compare and contrast...like voting."
He likens this approach to dividing responsibilities among human specialists to manage complex investigations effectively.
Shifting focus beyond SOCs, Patrick inquires about other cybersecurity domains ripe for AI disruption. Ed identifies penetration testing and code reviews as prime areas where AI can automate laborious and high-quantity tasks.
Ed Wu [18:32]: "Pen testing...manual fuzzer...code reviews...a lot of code commits...benefit from security reviews."
Patrick adds insights on AI’s potential to revolutionize Static Application Security Testing (SAST), highlighting the diminishing moats as commodity models become adept at code analysis.
Patrick Gray [20:17]: "...a friend...endedustry'sclearsast ...turn into a startup because it's all done with commodity models."
Ed acknowledges the strengths of LLMs in understanding code but points out challenges with large, complex codebases requiring deep contextual understanding.
Ed Wu [20:58]: "Large language models are very good at understanding code...complex interactions with internal libraries or proprietary libraries or APIs."
Patrick underscores a common misconception: AI cannot compensate for inadequate detection stacks or poor data quality. Effective AI SOC agents require robust, comprehensive logging and context.
Patrick Gray [22:16]: "...an AI SoC analyst is only going to work well when you've got a detection stack that's pulling in the right information to begin with."
Ed concurs, providing examples where missing logs render AI investigation impossible and highlighting Dropzone’s capability to recommend detection rule adjustments to reduce noise.
Ed Wu [22:56]: "It's technically impossible to investigate those alerts if there are no logs at all...our technology will propose recommendations like tweaks on the detection rules."
Patrick references a recent paper by Apple critiquing large reasoning models, prompting Ed to discuss the importance of task decomposition in leveraging AI effectively.
Ed Wu [25:12]: "The art of using large language models is task decomposition...our AI SoC agent is looking at an alert and trying to make sense of this alert and investigate it, on average, our system makes close to 100 distinct large language model invocations."
He envisions AI as force multipliers for human security professionals, enhancing their capabilities rather than replacing them.
Ed Wu [28:25]: "We want to uplevel human security engineers and human security analysts to be like the generals and special forces where they have an army of AI middle schoolers or AI foot soldiers."
Patrick concurs, emphasizing the evolving understanding of AI’s practical applications and limitations in the cybersecurity landscape.
Patrick Gray [30:18]: "We're getting a better idea of how to use it, what it's good at, what it's not so good at yet."
Patrick and Ed conclude their insightful discussion by acknowledging the transformative potential of AI in SOCs and broader cybersecurity domains. They emphasize the importance of realistic expectations, robust data infrastructure, and the collaborative synergy between human expertise and AI capabilities.
Patrick Gray [30:44]: "Ed, we're going to wrap it up there. Always a pleasure to chat to you...we learn a lot."
Ed Wu [30:44]: "Thank you for having me."
This episode of Risky Business delves deep into the integration of AI within SOCs, exploring both the advancements and challenges posed by AI-driven security operations. Ed Wu provides a nuanced perspective on leveraging AI as a powerful tool to augment human analysts, while also candidly addressing the limitations inherent in current AI technologies.