Sitemap: http://www.ross.ws/sitemap.xml

## Require all bots to wait 20 seconds between requests. Disallow all bots from accessing certain pages. Set a honeypot for misbehaving bots. Rules within this group must not be separated by blank lines.
User-agent: *
Crawl-delay: 20
Disallow: /contact/index.php
Disallow: /?f=site_map.html
Disallow: /scripts/bot_honeypot.php

## Prevent the Firefox extension Fasterfox from prefetching pages linked from the current page.
User-agent: Fasterfox
Disallow: /

## Disallow Internet Archive (http://www.archive.org/).
User-agent: ia_archiver
Disallow: /

## Disallow bad bots. Separate groups with blank lines.
User-agent: aipbot
Disallow: /

User-agent: BecomeBot
Disallow: /

User-agent: psbot
Disallow: /

## Disallow foreign search engine spiders.
## Baidu (China).
User-agent: Baiduspider
User-agent: Baiduspider-video
User-agent: Baiduspider-image
Disallow: /

## Goo (Japanese).
User-agent: moget
User-agent: ichiro
Disallow: /

## Link analysis service Majestic-SEO uses distributed search engine Majestic-12.
User-agent: MJ12bot
Disallow: /

## Naver (Korean).
User-agent: NaverBot
User-agent: Yeti
Disallow: /

## SoGou (China).
User-agent: sogou spider
Disallow: /

## Yandex (Russian).
User-agent: Yandex
Disallow: /

## Youdao (a.k.a. Yodao; China).
User-agent: YoudaoBot
Disallow: /