Transcript
A (0:00)
This is Catalin Cimpanu. Welcome to Risky Business Talks, a podcast series where we interview people from the infosec community. Today our guest is Brian A. Coleman, Senior Director for Insider Risk, Information Security and Digital Forensics at Pfizer. Welcome, Brian.
B (0:21)
Hey, how are you?
A (0:22)
I should have said welcome back, Brian, because you first appeared on one of our podcasts three months ago, when you talked to us on behalf of one of our sponsors, enterprise browser maker Island. After that interview, Brian mentioned to me how his team was slowly incorporating AI into their daily workflows. I wanted to have this talk because Pfizer is not an infosec vendor and doesn't sell AI products, so Brian won't have a reason to overhype anything in this conversation. I thought it would be a breath of fresh air to hear from somebody using AI without any hidden selling points anywhere in the discussion. So, Brian, can you tell me more about what exactly Pfizer is doing with AI?
B (1:03)
Yeah. It came out of a need to respond quicker and to understand the data related to the matters we were investigating. So we started partnering with some vendors on how we could leverage language models, plus a bit of AI, to complement what analysts do on a daily basis and help them respond more intelligently to a matter. What we've started building out is the capability to take various types of data and put them into language models that help categorize those documents as a specific type of document, whether it's an HR-type document, a pay stub, or a STEM-type document, and then there's a bunch of subcategories under those. We're also building out the ability to leverage an internal AI platform that we have to do document summarization. As an analyst, I don't need to be an expert on the scientific processes, but if we can train the language models and the AI to summarize that for us, I can now come to a business owner with a more intelligent set of facts about what happened and potentially engage the right people. So instead of, as we were talking about earlier, sending someone 20,000 documents or 20,000 emails to review, we're now coming in with a very detailed summary of what we believe the data to be. It takes a lot of partnering with the business to make sure you get it right, though.
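The two-stage triage flow Brian describes, first categorize each document, then summarize it so an analyst gets facts instead of 20,000 raw emails, could be sketched roughly as below. This is a hypothetical illustration, not Pfizer's actual system: the `classify` and `summarize` functions are keyword and truncation stubs standing in for real language-model calls, and the category names are invented for the example.

```python
from dataclasses import dataclass

# Coarse document categories of the kind described in the interview
# (the exact taxonomy here is invented for illustration).
CATEGORIES = ["hr", "pay_stub", "scientific", "other"]

def classify(text: str) -> str:
    """Stand-in for a language-model classification call (keyword stub)."""
    lowered = text.lower()
    if "salary" in lowered or "pay period" in lowered:
        return "pay_stub"
    if "performance review" in lowered or "offer letter" in lowered:
        return "hr"
    if "assay" in lowered or "batch record" in lowered:
        return "scientific"
    return "other"

def summarize(text: str, limit: int = 80) -> str:
    """Stand-in for a language-model summarization call (truncation stub)."""
    return text[:limit] + ("..." if len(text) > limit else "")

@dataclass
class TriagedDoc:
    category: str
    summary: str

def triage(documents: list[str]) -> list[TriagedDoc]:
    """Categorize then summarize every document, so an analyst reviews
    a short labeled digest instead of the raw corpus."""
    return [TriagedDoc(classify(d), summarize(d)) for d in documents]

docs = [
    "Pay period 2024-03: salary details for employee 1123.",
    "Batch record for assay run 47, deviations noted in step 3.",
]
for item in triage(docs):
    print(item.category, "->", item.summary)
```

In a real deployment the stubs would be replaced by calls to the internal AI platform, and the summaries would feed the report that goes to the business owner.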
A (2:54)
So what are you using this for? Detecting insider risk phishing attempts.
B (2:59)
So right now we're leveraging it specifically on my team for insider threat, though there's definitely appetite to see what the other use cases are. Currently the use case is that a lot of insider threat cases are people trying to do the right thing, maybe doing it the wrong way, and some of those documents can have terms and document classifications that are inaccurate. So we're using language models on top of, let's call it, traditional DLP, which is not reliable on a straight keyword basis alone. By combining the DLP with language models, we can respond more quickly to the true high-priority matters. For example, someone could have a term in their resume saying they worked on a very high-priority project, but that same term could also appear in some kind of batch record, and the two are going to be treated very differently from an analyst's perspective, because one is a resume and the other is a very important document to the company. So we're leveraging it to help weed out, I don't want to say the non-important matters, but the false positives, so that we can focus on the matters that truly involve the data we care about. And the early signs so far are that it takes a lot of time to get these models calibrated and responding the right way with the types of documents we have; in some cases, some of the documents aren't well understood by some of the models, so we're working to understand that as well.
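The resume-versus-batch-record example above could be sketched as a small re-scoring layer: a keyword DLP rule fires on both documents, and a context classifier then decides which hit actually matters. This is a minimal illustration, not Pfizer's implementation; `classify_context` is a keyword stub standing in for a language-model call, and the watchlist term "project aurora" is invented for the example.

```python
def dlp_keyword_hit(text: str, watchlist: set[str]) -> bool:
    """Traditional keyword DLP: fires on any watchlist term, regardless
    of what kind of document it appears in."""
    lowered = text.lower()
    return any(term in lowered for term in watchlist)

def classify_context(text: str) -> str:
    """Stand-in for a language-model call that labels the document type."""
    lowered = text.lower()
    if "work experience" in lowered or "resume" in lowered:
        return "resume"
    if "batch record" in lowered:
        return "batch_record"
    return "unknown"

# Document types where a keyword hit is usually a false positive.
LOW_RISK_CONTEXTS = {"resume"}

def priority(text: str, watchlist: set[str]) -> str:
    """Re-score a DLP hit using document context."""
    if not dlp_keyword_hit(text, watchlist):
        return "none"
    if classify_context(text) in LOW_RISK_CONTEXTS:
        return "low"   # hit, but in a context analysts can deprioritize
    return "high"      # hit in a sensitive document type: escalate

watchlist = {"project aurora"}
resume = "Resume, work experience: contributed to Project Aurora."
record = "Batch record 47 for Project Aurora manufacturing run."
print(priority(resume, watchlist))  # low
print(priority(record, watchlist))  # high
```

The design choice mirrors what Brian describes: the keyword layer stays cheap and high-recall, while the model layer only runs on hits, pushing the false positives down so analysts see the true high-priority matters first.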
