The war against AI bots is still really about privacy versus money
Is this the real life? Is this technology?
Artificial intelligence bots are running rampant on the internet as they scour for data to train language models. Much of that data includes content created by real-deal humans, and many of them are unhappy with their data being used that way. To combat this, companies are creating tools to block AI bots from accessing data on both their websites and their products.
Why are people worried about AI bots?
Training generative AI requires significant amounts of data. To collect the information, several companies have AI bots scouring the web for content. Data comes in two forms: public and private. Public data is readily available on the internet for anyone to glean, while private data "includes things like text messages, emails and social media posts made from private accounts," said The New York Times. The problem is that public data is running out, which is leading to the creation of AI bots bent on scouring the internet for the private alternative.
"As companies look to train their AI models on data that is protected by privacy laws, they're carefully rewriting their terms and conditions to include words like 'artificial intelligence,' 'machine learning' and 'generative AI,'" said the Times. Essentially, companies including Google and Meta have begun using private user data such as social media posts to train their AI models. People are worried that using private data to train generative AI could render AI capable of replicating content created by humans, especially in areas like art, music and literature. "In three, four, five years' time, there might not be entire segments of this creative industry because we'll just be decimated," Sasha Yanshin, a YouTube personality and co-founder of a travel recommendation site, said to the Times.
How are companies fighting back?
Generative AI's data thirst has presented a lucrative opportunity for companies that have a strong stock of private data. "Thanks to the scarcity of high-quality data and the immense pressure and demand to build even bigger and better models, we're in a rare moment where data owners actually have some leverage," said MIT Technology Review. For example, music labels have opted to sue the AI music companies Suno and Udio, claiming the two companies "made use of copyrighted music in their training data 'at an almost unimaginable scale,' allowing the AI models to generate songs that 'imitate the qualities of genuine human sound recordings.'"
In a bigger step, Cloudflare, a content delivery network and cloud security platform, created a tool designed to block AI bots from scraping text from websites. "We hear clearly that customers don't want AI bots visiting their websites and especially those that do so dishonestly," said Cloudflare in a blog post. While this is not a surefire solution because more advanced bots can mimic how a real person uses a website, such a block could nonetheless limit a significant amount of bot activity.
However, several content owners are "torn between their instinct to protect their intellectual property and their eagerness to take money from those AI makers," said Axios. Platforms like Reddit and Stack Overflow are attempting to balance the use of AI with the protection of data, but the bot "free-for-all over access to web data is just the opening salvo of what will be an increasingly hot war."
Devika Rao has worked as a staff writer at The Week since 2022, covering science, the environment, climate and business. She previously worked as a policy associate for a nonprofit organization advocating for environmental action from a business perspective.