The British Broadcasting Corporation (BBC) has reportedly issued a legal warning to U.S.-based artificial intelligence start-up Perplexity AI, accusing the company of unauthorized content scraping from its news website. The move marks the latest flashpoint in the growing tension between traditional media outlets and AI companies over content usage rights in the age of generative technology.
According to a report by Wired, the BBC has formally contacted Perplexity AI, expressing concerns over the company’s practices of using web-crawling tools to extract content from its news platform without obtaining the appropriate licensing or attribution. The public broadcaster views this as a potential violation of its copyright and editorial policies.
Perplexity AI, known for its chatbot and AI-powered search interface that provides real-time answers to user queries, has grown rapidly in popularity by combining generative AI with live internet data. However, it is now facing scrutiny over the sources of its information and whether it is complying with copyright laws while aggregating content from publishers around the globe.
The BBC alleges that Perplexity’s systems are bypassing its robots.txt file — a standard web protocol used by websites to control how search engines and bots index their content. Bypassing this directive could be interpreted as intentional circumvention of access restrictions, raising serious legal and ethical questions.
In its communication, the BBC is believed to have demanded immediate clarification from Perplexity and may pursue legal remedies if the alleged scraping continues. A BBC spokesperson stated.
“We are aware of AI platforms using BBC content without permission. We take the protection of our journalism seriously and are exploring all appropriate responses.”
As of this writing, Perplexity AI has not publicly responded to the BBC’s claims. However, in previous discussions with media outlets, Perplexity has defended its practices, stating that it is working to balance innovation with respect for intellectual property. The company claims that it does not aim to infringe on content ownership but instead provide AI-generated summaries or answers based on publicly accessible web data.
