Transcript
A (0:00)
This is the Unknown Secrets of Internet Marketing. Your insider guide to the strategies top marketers use to crush the competition. Ready to unlock your business full potential? Let's get started. Howdy. Welcome back to another fun filled episode of the Unknown Secrets of Internet Marketing which I am trying to drop that piece of it. Okay, so best SEO podcast. The best SEO podcast. We have all those handles and Internet marketing secrets. We've decided to drop that. So I need to change the bumpers but you can find us anywhere@fest seopodcast. This is the Unknown Secrets of Internet Marketing. We've been running for 12 years straight and we talk about everything, digital marketing and SEO and well, AI because AI has taken over and I thought it would be good as we're continuing to have these AI discussions to hold up, pump the brakes and say, okay, plagiarism, AI content generation, how are we ranking? What's going on? I even remember a publicly traded client we have early on was like 92% human generated rejected. I was like, well he wrote it. You know he wrote IT and the LLMs are trained on human writing. So a lot of it's going to be human written. Like I knew this person written the article like he knew nothing about AI. I was like there's no way that he wrote this. And so that started an education process, AI governance process to understand how we need to frame these things, how we need to look at these things. And I wanted to bring on John Gilham from Originality AI. He's got a AI checker, plagiarism checker, fact checker. Because well, there's an issue with, we were talking to the pre interview, the, the integrity of the Internet as a whole as LLMs are referencing LLM sources. You said you just completed a study on, on how many AI overviews are referencing AI generated content.
B (2:01)
Yeah, thanks man. Thanks for having me. Yeah, we, yeah, we did, just did a study. I think it's a sort of, we find it infinitely fascinating to sort of understand how AI generated content is sort of proliferating across the, across the Internet. And so we look at sort of studies on where, where that's happening, where it isn't happening. We saw with this study, we looked at hundreds of thousands of a overviews and then the websites that they had cited and then ran those websites through our AI detector. Our detector, highly accurate but not perfect but across a large data set is, is very telling and can be relied on for, for understanding. And we saw sort of 15, 15 at times 20% of your money, your life searches citing AI generated piece of content. And so it sort of, yeah, definitely raises the question of sort of the snake eating its own tail as some sort of visualize it. But if AI is rooting itself in AI, there's a whole world of problems that can come from that.
A (3:04)
Yeah, the degradation of data as AI feeds itself. I would love to even go down the rabbit hole later on synthetic data and how that works. That's something that we're starting to get into and test out some different tools on. But like I remember Elon Musk, right, bought Twitter and then, you know, for freedom or whatever. Like I think it's great move, but I think he, I mean it's clear and I believe this to be true. He bought it to train Grok, right. And he also was tweeting about how many bots there were on Twitter and he had to get all of that synthetic data or that AI generated data out of there because it needs to be trained on real humans. That's also why Google did the deal with Reddit because, you know, real humans need, are providing the inputs. And I think that that's probably why ChatGPT or OpenAI made it free for so many people because they, they need the engagement with real humans to, to train, to train these models.
