Robots.txt Checker

Fetch and analyse the robots.txt file for any domain — see crawl rules, user-agent blocks, and sitemap references.

Domain

Did this tool work for you?

AdSense336 × 280

How to use this calculator

1
Enter the domain name (without https://).
2
Click "Check Now" to fetch https://domain.com/robots.txt.
3
Results show the number of user-agent rules, disallow/allow directives, and sitemap references.
4
The raw file content is also shown for detailed inspection.

AdSense · 728 × 90

Frequently asked questions

What is robots.txt?

robots.txt is a text file at the root of your domain (/robots.txt) that tells crawlers which pages or sections they're allowed or not allowed to crawl. It's a directive, not a security measure — compliant bots respect it; malicious scrapers may not. Google, Bing, and other search engine crawlers all respect robots.txt.

What does Disallow: / mean?

Disallow: / blocks the specified user-agent from crawling any page on your site. If applied to User-agent: *, this blocks all crawlers from the entire site. This is a common mistake that prevents search engines from indexing any of your content.

Should I block crawlers from my admin area?

Yes — but robots.txt alone is not sufficient security. Bots (including malicious ones) can ignore it. Use server-side authentication and IP allowlisting for admin areas. robots.txt's role is to save crawl budget, not provide security. Add Disallow: /admin/ to prevent it from appearing in search results.

Does robots.txt affect indexing?

Blocking a URL in robots.txt prevents crawling but not necessarily indexing. Google can still index a blocked URL if it sees links pointing to it — it just can't read the content. To prevent indexing, add a noindex meta tag on the page itself (requires the page to be crawlable) or use the X-Robots-Tag header.

About robots.txt checker

Robots.txt Checker — Analyse crawl rules for any domain

Robots.txt best practices

Every public website should have a robots.txt, even if it just allows all crawlers (User-agent: * / Allow: /). Always include a Sitemap: directive pointing to your sitemap. Block duplicate content, admin sections, search result pages, and any pages with thin or private content. Use specific user-agents to apply different rules to Googlebot vs. Bingbot.

Robots.txt syntax guide

User-agent: specifies which crawler the following rules apply to (* = all). Disallow: specifies paths the crawler should not visit. Allow: overrides Disallow for specific sub-paths. Sitemap: points to your XML sitemap URL. Crawl-delay: suggests a delay between requests (not supported by Google). Comments start with #.

Learn more from an authoritative source:

Google Search Central

Related tools

Keyword Density Calculator

Calculate keyword density and frequency for any text. Paste your content and keyword to see how often it appears and whether density is in the recommended SEO range.

Meta Description Checker

Check your meta description length, character count, pixel width, and SEO quality. Get instant feedback on whether it will display fully in Google search results.

Title Tag Checker

Check your SEO title tag for optimal length, pixel width, and keyword placement. See how it will appear in Google search results.

Reading Level Calculator

Calculate the reading level and readability score of any text using the Flesch-Kincaid formula. Find out what grade level your content targets.

See all SEO Tools tools

Results are estimates for informational purposes only and do not constitute professional financial, medical, legal, or technical advice. Read full disclaimer →