Robots.txt and crawlability audit

Screaming Frog crawl benchmark report showing 620 nodes, detail score 80/100, depth p90 and orphan count — The report summarizes crawl signals including node count, depth, orphan URLs and overall score.

What this page covers

A robots.txt and crawlability audit checks whether important URLs can be found, crawled, and understood without avoidable technical blockers.

It looks at practical barriers such as missing or blocked sitemap signals, disallowed paths, overly strict bot rules, JavaScript-heavy navigation, and deep site structures that hide key pages.

In brief

Review XML sitemaps and robots.txt together. Missing sitemap signals or blocked paths can make it harder for crawlers to find the URLs that matter.
Keep robots.txt and noindex rules separate. Robots.txt blocks crawling, while noindex usually requires Google to crawl the page before keeping it out of the index.
Check structure, JavaScript navigation, internal links, speed, mobile usability, and URL patterns together. Crawlability is not controlled by one file alone.

What to do

A useful audit starts by identifying the URLs the site treats as important, including URLs listed in XML sitemaps. Crawling that set shows which pages are reachable, which are blocked by robots.txt, and where sitemap coverage does not match crawl reality.

For larger sites, crawl budget choices need to be intentional. Pages blocked in robots.txt and pages marked noindex serve different purposes, so each rule should be checked against the page’s search role and the site’s URL structure.

The review should also look beyond robots.txt. JavaScript-heavy navigation, deep click paths, weak internal linking, slow pages, mobile issues, and unclear URL structures can all make important pages harder to discover and index.

What to keep in mind

This audit is most useful when a site has indexation gaps, blocked sections, unclear sitemap coverage, deep architecture, or recent technical changes that may have affected crawler access.

It is not a ranking promise. Technical cleanup can remove obstacles, but search performance also depends on content quality, query relevance, freshness, internal links, and other SEO factors.

A practical review can catch edge cases such as URLs blocked in robots.txt, pages marked noindex, pages returning 404 errors, and parameter rules that keep valuable pages from being indexable.

Next step

Try Radar for free to map your URL structure and identify crawlability blockers before moving into broader SEO planning.

Try Radar for free

Section

Search intents

Robots.txt and crawlability audit

What this page covers

In brief

What to do

What to keep in mind

Next step

Related pages in this section