New post Need visibility? Apply for a FREE post for your Startup.  Apply Here

News

You can now stop Google from using the content on your website to train Bard and future AIs

1 Mins read
  • You can now stop Google from using the content on your website to train its AI machine
  • Now the Google-Extended flag in robots.txt can tell Google’s crawlers to include a site in search without using it to train new AI models like the ones powering Bard.
  • Some websites, including The New York Times, have opted to legally prohibit companies from utilizing their content for AI training by updating their terms of service.

Google has unveiled a new tool named Google-Extended, providing website publishers with the option to exclude their data from being utilized to train Google’s AI models while still ensuring accessibility through Google Search. This tool enables sites to continue being scraped and indexed by web crawlers like Googlebot while preventing their data from contributing to AI model training.

Google-Extended offers publishers control over whether their websites are used to enhance Bard and Vertex AI generative APIs. It empowers web publishers to manage access to their content while exempting their data from AI training purposes. Google had previously disclosed its intention to train its AI chatbot, Bard, using publicly accessible web data.

Danielle Romain, Google’s VP of Trust, explained in a blog post that the company recognizes the desire of web publishers for greater choice and control regarding how their content is employed in emerging generative AI applications. To utilize Google-Extended, publishers can simply disallow “User-Agent: Google-Extended” in their site’s robots.txt file, which instructs automated web crawlers on accessible content.

Google expressed its commitment to exploring additional machine-readable approaches to offer web publishers more choices and control as AI applications expand. The company emphasized that it would share further developments in this regard soon.

Several websites have already taken steps to block web crawlers used for data scraping and AI model training, including those used by OpenAI’s ChatGPT. Notable sites such as The New York Times, CNN, Reuters, and Medium have implemented measures to restrict access to their content for AI training purposes. Blocking Google, however, presents unique challenges since complete exclusion from Google’s crawlers would result in a loss of search engine indexing. Some websites, including The New York Times, have opted to legally prohibit companies from utilizing their content for AI training by updating their terms of service.

Medium recently announced its universal blocking of web crawlers until more nuanced solutions become available, echoing concerns expressed by numerous other websites grappling with the balance between indexing and data protection.

Google’s introduction of Google-Extended offers web publishers a more selective approach to participating in AI training data, aligning with evolving preferences in the digital publishing landscape.

Don’t miss any tech news ever!

We don’t spam! Read our privacy policy for more info.

395 posts

About author
There's this unexplainable joy I get whenever I write, knowing fully well that my copy will transform people's life and destiny. This rare feeling elates me and encourages me to write more value-packed pieces. I think a divine being has possessed me to write, that is why I write, Therefore, I will advise every of my piece should be regarded as a divine message.
Articles
Related posts
News

Elon Musk's AI startup acquires X

1 Mins read
Elon Musk announced Friday that his artificial intelligence startup xAI has acquired X, formerly Twitter, in an all-stock transaction. The deal values…
News

FG gets ₦1 Billion grant from Airtel to empower Nigerian tech talent

1 Mins read
The Nigerian government has received a ₦1 billion grant from Airtel Africa Foundation to support the 3 Million Technical Talent (3MTT) program,…
News

TikTok moves to intensify digital safety efforts in Sub-Saharan Africa

1 Mins read
TikTok has reaffirmed its commitment to online safety during the Second Annual Sub-Saharan Africa Safer Internet Summit in Cape Town, showcasing significant…
Newsletter Subscription

🤞 Don’t miss any update!

We don’t spam! Read more in our privacy policy

Join our Telegram channel here - t.me/TechpadiAfrica

Leave a Reply

×
News

Meta AI Used Public Posts from Facebook and Instagram for Training, Avoid Private Data