Allegations Of Unauthorized Data Extraction
Reddit, the influential social media platform known for its myriad user communities, has initiated legal proceedings in New York federal court against artificial intelligence firm Perplexity. The complaint alleges that Perplexity unlawfully scraped user-generated content to train its AI model, marking a significant confrontation in the ongoing debate over data rights.
Multiple Defendants In The Crosshairs
The lawsuit names not only Perplexity, but also three ancillary entities: Oxylabs, a Lithuanian data scraper; AWMProxy, linked to a former Russian botnet; and Texas-based startup SerpApi. According to Reddit, these defendants covertly extracted copyrighted material by disguising their identities and locations to mimic ordinary browsing activity.
Follow THE FUTURE on LinkedIn, Facebook, Instagram, X and Telegram
Industry Response And Legal Denials
In response, Perplexity has refuted the allegations, asserting that its platform only summarizes and cites publicly accessible Reddit discussions rather than using them to train AI models. Perplexity has further decried the suit as an act of extortion designed to impede an open internet. Similarly, SerpApi has expressed strong disagreement with Reddit’s claims and indicated plans to vigorously defend itself in court.
Data Licensing And Market Implications
This lawsuit is one among several legal challenges targeting the use of copyrighted materials for AI training purposes. Reddit has proactively engaged in similar disputes, having recently filed a comparable lawsuit against AI startup Anthropic. Ben Lee, Reddit’s Chief Legal Officer, has emphasized that the intensifying competition for high-quality human content has generated an industrial-scale data laundering economy.
Strategic Licensing And Revenue Opportunities
Amid these legal disputes, Reddit continues to capitalize on its vast reservoir of user-generated data by negotiating licensing agreements with major industry players including OpenAI and Google. This strategy not only reinforces Reddit’s central role in AI development but also highlights its evolution into a significant revenue stream, with recent reports noting that licensing arrangements now account for nearly 10% of the company’s revenue.

