Utinzo

Robots.txt Checker

Fetch and analyse the robots.txt file for any domain — see crawl rules, user-agent blocks, and sitemap references.

Did this tool work for you?

AdSense336 × 280
AdSense336 × 280

How to use this calculator

  1. 1

    Enter the domain name (without https://).

  2. 2

    Click "Check Now" to fetch https://domain.com/robots.txt.

  3. 3

    Results show the number of user-agent rules, disallow/allow directives, and sitemap references.

  4. 4

    The raw file content is also shown for detailed inspection.

AdSense · 728 × 90

Frequently asked questions

What is robots.txt?

robots.txt is a text file at the root of your domain (/robots.txt) that tells crawlers which pages or sections they're allowed or not allowed to crawl. It's a directive, not a security measure — compliant bots respect it; malicious scrapers may not. Google, Bing, and other search engine crawlers all respect robots.txt.

What does Disallow: / mean?

Disallow: / blocks the specified user-agent from crawling any page on your site. If applied to User-agent: *, this blocks all crawlers from the entire site. This is a common mistake that prevents search engines from indexing any of your content.

Should I block crawlers from my admin area?

Yes — but robots.txt alone is not sufficient security. Bots (including malicious ones) can ignore it. Use server-side authentication and IP allowlisting for admin areas. robots.txt's role is to save crawl budget, not provide security. Add Disallow: /admin/ to prevent it from appearing in search results.

Does robots.txt affect indexing?

Blocking a URL in robots.txt prevents crawling but not necessarily indexing. Google can still index a blocked URL if it sees links pointing to it — it just can't read the content. To prevent indexing, add a noindex meta tag on the page itself (requires the page to be crawlable) or use the X-Robots-Tag header.

About robots.txt checker

Robots.txt Checker — Analyse crawl rules for any domain

Robots.txt best practices

Every public website should have a robots.txt, even if it just allows all crawlers (User-agent: * / Allow: /). Always include a Sitemap: directive pointing to your sitemap. Block duplicate content, admin sections, search result pages, and any pages with thin or private content. Use specific user-agents to apply different rules to Googlebot vs. Bingbot.

Robots.txt syntax guide

User-agent: specifies which crawler the following rules apply to (* = all). Disallow: specifies paths the crawler should not visit. Allow: overrides Disallow for specific sub-paths. Sitemap: points to your XML sitemap URL. Crawl-delay: suggests a delay between requests (not supported by Google). Comments start with #.

Robots.txt Checker – Utinzo

Learn more from an authoritative source:

Google Search Central
Related tools

Results are estimates for informational purposes only and do not constitute professional financial, medical, legal, or technical advice. Read full disclaimer →

Robots.txt Checker – Free Online SEO Tool | Utinzo