# Default robots.txt file
# Last updated: 04.23.07
# Generated by: Ron Pemberton

# The below file is in place by default to block bad robots
# from indexing your site while allowing good robots to browse
# freely. Remove the below entries at your own risk.

User-agent: digout4u
User-agent: extractorpro
User-agent: GetRight
User-agent: go-ahead-got-it
User-agent: grub
User-agent: HTTPClient
User-agent: LinkWalker
User-agent: nearsite
User-agent: netattache
User-agent: NEWT ActiveX
User-agent: sitesnagger
User-agent: teleport
User-agent: TovekTools Web Indexer
User-agent: UbiCrawler
User-agent: Web Downloader
User-agent: WebTrends
User-agent: webwhacker
User-agent: webzip
Disallow: /

Sitemap: http://ronpemberton.com/sitemap.xml

User-agent: *
Disallow:

User-agent: Mediapartners-Google*
Disallow:

User-agent: *
Disallow: /wp-
Disallow: /search
Disallow: /feed
Disallow: /comments/feed
Disallow: /feed/$
Disallow: /*/feed/$
Disallow: /*/feed/rss/$
Disallow: /*/trackback/$
Disallow: /*/*/feed/$
Disallow: /*/*/feed/rss/$
Disallow: /*/*/trackback/$
Disallow: /*/*/*/feed/$
Disallow: /*/*/*/feed/rss/$
Disallow: /*/*/*/trackback/$

# BEGIN XML-SITEMAP-PLUGIN
Sitemap: http://ronpemberton.com/sitemap.xml.gz
# END XML-SITEMAP-PLUGIN

User-agent: *
Disallow: /

User-agent: Googlebot
Allow: /