cmonewshubb
Advertisement
  • Home
  • CMO News
  • Growth Marketing
  • Industry News
  • Market Research
  • Contact us
No Result
View All Result
  • Home
  • CMO News
  • Growth Marketing
  • Industry News
  • Market Research
  • Contact us
No Result
View All Result
cmonewshubb
No Result
View All Result
Home Industry News

Dozens of big brands have blocked GPTBot, OpenAI’s new web crawler

admin by admin
August 27, 2023
in Industry News


At least 69 of the 1,000 most popular websites in the world have blocked GPTBot, the new web crawler OpenAI introduced Aug. 7, according to a new analysis.

And the percentage of sites is increasing by about 5% per week, according to AI content and plagiarism service Originality.ai.

Why we care. To block or not to block ChatGPT? That has been the big question for many SEOs. Clearly, several popular websites have already blocked GPTBot, presumably because they don’t want OpenAI scraping their data to help train its models – at least not without compensation. Additionally, ChatGPT does not cite or link to its sources.

By the numbers. The 15 most popular sites blocking ChatGPT, according to the analysis, are:

  • amazon.com
  • quora.com
  • nytimes.com
  • shutterstock.com
  • wikihow.com
  • cnn.com
  • foursquare.com
  • healthline.com
  • scribd.com
  • businessinsider.com
  • reuters.com
  • medicalnewstoday.com
  • goodhousekeeping.co
  • amazon.co.uk
  • tumblr.com

But. Even though many sites are blocking GPTBot, they are not also blocking CCbot, Common Crawl’s web crawler. Part of the training data used by OpenAI, Google and others comes from Common Crawl.

There are a few noteworthy exceptions that block both bots, such as the New York Times, which clearly does not want its content used to train AI systems. Other popular websites blocking both GPTBot and CCbot include shutterstock.com, reuters.com and goodhousekeeping.com.

  • At least 62 of the top 1,000 websites have blocked CCBot.

Limitations. 241 robots.txt files out of the 1,000 websites were not identified/inspected as part of this analysis. (That’s why I wrote “at least” in the opening sentence.)

Originality.ai’s analysis. Websites That Have Blocked OpenAI’s GPTBot – 1000 Website Study

Dig deeper. Should you block ChatGPT’s web browser plugin from accessing your website?



Source link

Previous Post

3 steps for effective PPC reporting and analysis

Next Post

The 2 Simple & Straightforward Methods for Market Sizing Your Business

Next Post

The 2 Simple & Straightforward Methods for Market Sizing Your Business

The Future Of Retail Is Hyper-Personalized

Trending

The New Heavyweight Latino Artist & The Boom Of Regional Mexican Music

by admin
September 22, 2023

5 Must-Read Books for Building Brands and Wealth by Entrepreneurs of Color

by admin
September 22, 2023

Microsoft unveils new AI tools to ‘transform search & advertising’

by admin
September 22, 2023

AMA with Google’s Gary Illyes: 15 quick SEO takeaways

by admin
September 22, 2023
CMO-62

© CMO News Hubb All rights reserved.
Use of these names, logos, and brands does not imply endorsement unless specified. By using this site, you agree to the Privacy Policy and Terms & Conditions.

Navigate Site

  • Home
  • CMO News
  • Growth Marketing
  • Industry News
  • Market Research
  • Contact us

Newsletter Sign Up.

No Result
View All Result
  • Home
  • CMO News
  • Growth Marketing
  • Industry News
  • Market Research
  • Contact us

© 2022 CMO News Hubb All rights reserved.