robots.txt Generator
Generate a robots.txt file for your website instantly. Add user-agent rules, allow/disallow paths, crawl-delay, and sitemap URL. Free, real-time robots.txt generator.
Generate a robots.txt file for your website instantly. Add user-agent rules, allow/disallow paths, crawl-delay, and sitemap URL. Free, real-time robots.txt generator.
robots.txt is a plain text file at the root of a website that tells crawlers which paths they may visit and which they should avoid. The format dates to 1994 and is supported by virtually every search engine and well-behaved crawler. The file is advisory — malicious crawlers ignore it — but legitimate crawlers (Google, Bing, etc.) honor its directives reliably.
Common uses: blocking crawlers from admin areas (/wp-admin/, /admin/), excluding faceted search URL parameters that produce duplicate content, preventing indexing of staging or development paths, declaring sitemap location, and allowing specific user agents while blocking others. The file is read at the start of each crawl session.
This generator builds a syntactically correct robots.txt from form inputs. Common patterns (allow all, block all, block specific paths) are templates; custom rules can be added per user agent. The output goes at /robots.txt of your site root.
Hand-writing robots.txt is error-prone. Syntax mistakes (case sensitivity, exact path matching, ordering of rules) silently produce wrong behavior — paths you meant to block remain crawled, or paths you wanted indexed get excluded. A generator that produces correct syntax avoids these pitfalls.
robots.txt also has subtle interactions with other SEO tools. Disallowing a path in robots.txt does not prevent it from appearing in search results (Google may index the URL without crawling it); meta noindex requires the page to be crawled first. Knowing which tool to use for which intent matters; the generator can guide you.
Pick a template, customize, deploy.
Format: User-agent: <name> followed by Allow/Disallow directives. * matches all bots. Specific names (Googlebot, Bingbot) target specific crawlers. Multiple User-agent blocks can stack rules.
Disallow: <path> blocks paths starting with the given prefix. Disallow: / blocks the entire site. Disallow: /admin/ blocks anything under /admin/. Trailing slash matters; Disallow: /admin (no slash) also matches /administrator.
Crawl-delay (in seconds) requests slower crawling. Sitemap (absolute URL) declares your sitemap location. # starts a comment line.