
Loading summary
Tyson
The Voices of Search Podcast is a proud member of the I Hear Everything Podcast network. Looking to launch or scale your podcast, I Hear Everything delivers podcast production, growth and monetization solutions that transform your words into profit. Ready to give your brand a voice? Then visit iheareverything.com My name is Tyson
and joining me today is Kaspar Siminsky, Senior Director at Search Brothers and former member of Google Search Team. Today, Casper and I are going to be talking about the latest changes in the industry and how you can take that into practice, especially in the enterprise arena. What are your thoughts on blocking LLMs from proprietary data?
Kaspar Siminsky
Well, it really depends, right? How proprietary is the data? I mean, if it's really proprietary and if it's something that we do not want to get scraped and crawled, ultimately then it shouldn't be accessible in the first place. This is a hypothetical situation.
Sponsor/Pre Visible Representative
Time for a one minute break to hear from our sponsor, Pre Visible. So you're looking for SEO help and you got a couple of options. You could start replying to spam from agencies that claim they can get you to rank number one on Google. You can pay an hourly rate for a consultant who will inevitably nickel and dime you with hourly charges. Or you can work with a cookie cutter agency to quickly launch a strategy less project with low success rate. None of those sound very good now do they? Well, well that's where Pre Visible's integrated consulting model comes in. Pre Visible draws From a collective 40 years of SEO and digital marketing experience to unlock your organic growth opportunities. They build custom solutions that combine strategy, technical expertise, content and reporting to effectively operationalize SEO for your business. Pre Visible's four stage approach ensures that your SEO programs thrive by starting off with a strategy first approach. Then they support you in your efforts to create quality content, help you identify technical issues, and most importantly, they'll work with your cross functional teams to integrate your SEO strategies to make sure that your SEO budget actually drives results, not just your agency's bottom line. So join brands like Yelp, eBay, Canva, Atlassian Square, all who rely on the SEO consultants at Pre Visible. For more information go to Previsible IO. That's Pre Visible. P R E V I S I B L E I O
Kaspar Siminsky
If it's public, if it's accessible, it's going to get crawled. Ultimately, it's kind of like a binary choice. If we don't want stuff for stuff to be crawled, it probably shouldn't be crawlable to begin with. If it's crawled by some bots, chances are it's going to leak.
Tyson
Ultimately, that's going to wrap up this episode of the Voice of Search podcast. Thanks again to Caspar Ziminski, senior Director at Search Brothers, for joining us. If you'd like to get in contact with Caspar, you can find a link to his LinkedIn profile in the show notes, or be sure to go over and check out his company's website@searchbrothers.com if you haven't subscribed yet and you'd like a daily stream of SEO and content marketing knowledge in your podcast feed, hit that subscribe button in your podcast app or on YouTube and we'll be back in your feed in the following day with that. That's all for today. Thanks for stopping by and we'll see
you on the next episode.
In this concise episode, Tyson and Kaspar Siminsky discuss the challenges and realities of keeping proprietary data away from Large Language Models (LLMs) and crawlers, especially for enterprise-level organizations. The conversation addresses the binary nature of web visibility and offers practical advice for those concerned about sensitive or proprietary content.
Defining Proprietary Data
Kaspar opens by questioning how proprietary the data is:
Visibility Equals Crawlability
The Inevitability of Leaks
Protecting Truly Proprietary Data
Limits of Technical Barriers
On the Hypothetical of Blocking LLMs:
On Public Content Risks:
The conversation is frank and practical—Kaspar avoids technical jargon and opts for real-world logic: Unless you're willing to keep your proprietary data offline, you must assume that it could eventually be accessed by LLMs and crawlers. For enterprises especially, this means rethinking which assets are truly suitable for online exposure.
Bottom line: If you don't want it scraped, don’t let it be visible online—no technical fix is foolproof.
For more information about Kaspar Siminsky, visit Search Brothers or check the show notes for his LinkedIn profile.