What happens if I block PerplexityBot in robots.txt?

Your content will not appear in Perplexity AI answers. Perplexity respects robots.txt directives, and blocking its crawler is the single most common reason brands with otherwise well-structured content fail to appear in Perplexity citations despite ranking well in Google and Bing.

Can I allow PerplexityBot on some pages but block it on others?

Yes. You can use Disallow directives with specific URL paths to restrict PerplexityBot from crawling certain sections of your site while allowing access to your main content pages. This is common for brands that want AI visibility on their public content but not on internal admin pages.

PerplexityBot Robots.txt Setup: Exact Configuration Guide

Setting up PerplexityBot in your robots.txt file means adding the correct user-agent directive and crawl rules that tell Perplexity's web crawler which pages it can access and index. PerplexityBot is the crawler that Perplexity AI uses to build its search index and retrieve sources when generating answers. If your robots.txt blocks PerplexityBot, your content cannot appear in Perplexity citations regardless of how well it is optimized. DataReportal's Digital 2026 report confirms Perplexity attracts over 12 million unique monthly visitors, a discovery surface that requires explicit crawler access configuration.

What Is the Exact PerplexityBot User-Agent String?

The user-agent string for Perplexity's web crawler is PerplexityBot. Some older documentation references a variation, but the canonical and current user-agent is a single word. Add this directive to your robots.txt file.

User-agent: PerplexityBot Allow: /

If your robots.txt uses a catch-all Disallow directive followed by specific Allows, the order matters. Place the PerplexityBot Allow directive after any catch-all Disallow to ensure it takes priority. The following configuration correctly allows PerplexityBot while maintaining a restrictive default policy.

User-agent: * Disallow: /admin/ Disallow: /api/ Allow: /

User-agent: PerplexityBot Allow: /

The Princeton Generative Engine Optimization study found that technical crawl access issues are the single largest preventable cause of zero AI visibility. Brands that allow all major AI crawlers in their robots.txt see a measurable increase in citation frequency within 30 days of correcting access issues.

How Do You Verify PerplexityBot Can Access Your Site?

Verification takes two steps. First, check your robots.txt file by visiting yourdomain.com/robots.txt in a browser. Verify the PerplexityBot section exists and contains the correct Allow directives.

Second, check your server access logs for PerplexityBot crawl events. Search your logs for the user-agent string PerplexityBot. If you see crawl events after adding the robots.txt directive, Perplexity is successfully accessing your site. If you see no PerplexityBot entries within 14 days of configuration, check whether your CDN, WAF, or hosting provider is blocking the crawler at a different layer than robots.txt.

Some hosting providers and security plugins block known crawler user-agents by default. If you run Cloudflare, check your firewall rules and WAF settings for any rules that block bot user-agents. Add an exception for PerplexityBot if needed.

What Crawl Rate Should You Expect From PerplexityBot?

PerplexityBot crawls less aggressively than Googlebot or Bingbot. Expect crawl events on your site 1-3 times per week for established domains, and less frequently for new or low-authority domains. Unlike Google's continuous crawling, Perplexity crawls pages on a more deliberate schedule driven by query demand.

A page that receives a PerplexityBot crawl is likely being considered for citation within the following 24-48 hours. If you publish a new page and see a PerplexityBot crawl within 48 hours, consider it a strong leading indicator that your content is in Perplexity's retrieval pipeline.

If you are not seeing PerplexityBot crawl events, submit your sitemap URL through Perplexity's sitemap submission endpoint if available, or ensure your sitemap is discoverable from your robots.txt file with a Sitemap directive pointing to your full sitemap URL.

PerplexityBot Robots.txt Setup: Exact Configuration Guide

What Is the Exact PerplexityBot User-Agent String?

How Do You Verify PerplexityBot Can Access Your Site?

What Crawl Rate Should You Expect From PerplexityBot?

Frequently asked questions

Keep reading

New guides, straight to your inbox.