Perplexity accused of breaking a major online AI scraping rule – but it says it has done nothing wrong


  • Perplexity seen to be ignoring signals like robot.txt to scrape online sites
  • It even found protected and hidden test sites from Cloudflare
  • OpenAI adheres to responsible crawling, but Perplexity quiet for now

Cloudflare has accused AI giant Perplexity of scraping websites which explicitly disallowed crawling via robots.txt and other network-level rules by hiding its identity and conducting obfuscated crawling activity.

Researchers from the company said they observed Perplexity using multiple user agents, including one impersonating Google Chrome on macOS, as well as rotating IP addresses and ASNs to evade detection.

Leave a Comment