Transcript
A (0:01)
Welcome to to the Point Cybersecurity Podcast. Each week, join Jonathan Neffer and Rachel Lyon to explore the latest in global cybersecurity news, trending topics and cyber industry initiatives impacting businesses, governments and our way of life. Now, let's get to the Point. Hello, everyone. Welcome to this week's episode of to the Point Podcast. I I'm Rachel Lyon, here with my co host, Jon Neffer. John, I've missed you. It's been a couple of weeks in your worldly travels.
B (0:34)
Exactly. I'm glad to be back home, though, back from visiting the kids out in Granada where everywhere you go out you get tapas with all of your drinks.
A (0:45)
Isn't that wonderful? You spend like €5, right, on a beer and then you get like this delicious buffet meal.
B (0:51)
Yeah, exactly.
C (0:53)
It was fantastic.
A (0:54)
I love that. I am all about that. Well, we have another awesome guest this week, you guys. I am so excited to welcome Matt Lee. He's the creator of cloudware Games, an online training platform designed to help cloud engineers and DevOps professionals develop their problem solving skills by fixing realistic issues in simulated cloud environments. And my favorite thing, please go to his LinkedIn profile where he talks about helping CTOs running on AWS sleep better at night. Which is something we all want, right? We all absolutely want. So welcome. Welcome to the podcast, Matt.
C (1:26)
Thanks for having me. Yeah, I'm glad you guys took an interest.
B (1:31)
Yeah, Matt, thanks for joining us. Let's kick it right off though, here. Outages and service delivery problems are not only costly, but they get a lot of attention. Customers get upset. What do you do and how do you help responding teams stay calm and make the right decisions when things go badly?
C (1:54)
So it's an excellent question and I'll start off, I guess where the inspiration for cloud war games came from is I was training up these more junior cloud professionals, DevOps type people. And when you get a client, my biggest client, they're a big E commerce company. If they go down, they lose about $100,000 an hour. So you can imagine how many guys in Ties or C Suite type people are breathing down your neck right there. And so I saw these junior guys, I try and give them a chance to handle some of the outage, but after a certain point, you have to step in and move. And they were having trouble just getting stage fright of sorts. You know, I don't know what switches to push or anything like that. So I thought, wouldn't it be great if there was a way we could simulate this disaster, you know, in our staging environment? Or testing environment, or we could rerun the same disaster we just hit. So over the years, I'm also a journaler. I write down tons of things. And so I actually have a stack of all these 3am problems. The big headaches. Hard coded read replicas and stuff like that. Or hard code, the right replica and it couldn't switch over. Oh, just the things that spent hours of your time and you hate it. But, you know, you might as well. If you get lemons, you might as well make some lemonade. So then I started designing these simulations, and the first iteration of it, I just used a big whiteboard and actually just drew out a network diagram and kind of Dungeons and Dragons, the whole thing. And then after a while I'm like, why am I. Why not use real infrastructure? I can design something that could run for a couple pennies and simulate the whole disaster. And so that kind of expanded. And then more people wanted to be the. The inframaster. We say, you know, this is a dungeon master. And so kind of kept expanding from there, and it became a fun little thing to do. And eventually I said, well, I was. I was talking to a couple of colleagues and they're like, why don't you make this into a business? And there we go. Cloud War Games was born.
