Reddit is suing Perplexity AI and others over alleged unauthorized content scraping

Reddit has filed a lawsuit against the artificial intelligence company Perplexity AI, accusing it of illegally collecting large amounts of Reddit’s content to train and power its AI systems. The complaint (filed on October 22 in a federal court in Manhattan) also names three other companies, which include Oxylabs, SerpApi, and AWMProxy, as co-defendants. Reddit claims these firms extracted millions of posts and comments from the platform and supplied them to Perplexity without permission. According to Reddit, such action violated its terms of service and ignored technical safeguards meant to block automated data harvesting.

In the filing, Reddit argues that Perplexity and its partners engaged in ‘industrial-scale scraping’ for profit, bypassing anti-bot protections and using indirect methods to extract information. The company alleges that the scrapers not only accessed Reddit directly but also exploited Google’s search results to gather content in a way that evaded detection. Reddit’s lawyers claim that the data collection was done deliberately and systematically, targeting valuable discussions across millions of subreddit communities to enhance Perplexity’s AI products (like its ‘answer engine’) without permission or compensation.

Importantly, Reddit stated that it discovered the scraping through its own investigation. To test this, the company created a hidden post that could only be found through Google search, not on Reddit itself. When that post later appeared in Perplexity’s AI results, Reddit took it as proof that Perplexity was using data obtained without permission. As per Reddit, it tried to resolve the issue before going to court. Even the company sent Perplexity a cease-and-desist letter in May 2024, instructing it to stop using Reddit’s content without a license. But after that, Perplexity’s tool reportedly cited Reddit data about forty times more often, which Reddit saw as proof that the company ignored the warning and continued using its content for profit.

In the lawsuit, Reddit is asking for financial damages and a permanent injunction to prevent Perplexity and the data-scraping firms from using its content without permission. The complaint does not specify an amount but states that the defendants caused significant harm. Reddit notes that it has already entered into paid licensing deals with other technology firms (including Google and OpenAI) for the responsible use of its data.

This is not the first time Reddit has filed a lawsuit over unauthorized use of its content. Earlier, in June 2025, the company filed a similar case against Anthropic, another AI startup, for allegedly using its posts and comments without permission to train its AI models. And cases like these are showing a growing pattern of disputes between online platforms and AI companies over the use of publicly available user-generated content. Actually, other major firms like OpenAI, Microsoft, and Meta are also facing similar lawsuits from authors, publishers, and media groups.

Considering this scenario, it is worth noting that recently Anthropic agreed to pay a historic $1.5 billion to settle a similar case, which is considered one of the largest settlements of its kind. The case involved authors who claimed their books were used without permission to train the company’s AI systems.

The Tech Portal is published by Blue Box Media Private Limited. Our investors have no influence over our reporting. Read our full Ownership and Funding Disclosure →

Ashutosh Singh

Ashutosh is a Senior Writer at The Tech Portal, largely reporting on new tech, and intersection of technology and business. Ashutosh’s career spans across nearly a decade of technology writing across multiple platforms and languages.

Reddit is suing Perplexity AI and others over alleged unauthorized content scraping

Up next

Tesla to build AI5 chips with Samsung and TSMC in US

Author

Ashutosh Singh

Tags

OpenAI upgrades ‘Codex’ with multi-agent workflows and desktop app control to challenge Anthropic’s Claude Code

Meta hikes Quest 3 price by $100 and Quest 3S by $50 amid RAM shortage

Anthropic launches ‘Claude Opus 4.7’ with stronger coding and vision capabilities

YouTube adds option to turn off Shorts with new ‘Zero-minute’ timer

IPO-bound Anthropic receives investor offers at $800Bn valuation: Report

OpenAI upgrades ‘Codex’ with multi-agent workflows and desktop app control to challenge Anthropic’s Claude Code

Meta hikes Quest 3 price by $100 and Quest 3S by $50 amid RAM shortage

Anthropic launches ‘Claude Opus 4.7’ with stronger coding and vision capabilities

YouTube adds option to turn off Shorts with new ‘Zero-minute’ timer