DTIbot
  Overview
 
DTIbot is the official name for the web crawlers we have placed, and will place, in service.

If you are visiting this page because DTIbot accessed your web site, please be assured we make every effort to limit our use of your bandwidth.

Our web crawler is used, and will continue to be used, in connection with current and future services.
 
  Webcrawler DTIbot
 
Our current objectives:

Run live testing of DTI Site Search.
Build our experience in navigating our web crawlers across the Internet.
 
  Support
 
We understand that your bandwidth and server time are valuable. It is important to us that our bot be welcomed on the Internet as a well-behaved crawler. If you notice any unusual activity from DTIbot, please report it to:
 
  Access
 
If you have a problem or concern about DTIbot, we would prefer to have the chance to address it. If you need to block DTIbot, however, we do respect robots.txt exclusion rules.

To block DTIbot from some parts of your site you can use the following example:

User-agent: DTIbot
Disallow: /logs/
Disallow: /cgi-bin/

In this example, DTIbot will not crawl the /logs/ and /cgi-bin/ directories. Other parts of your site will still be crawled.
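
If you want to check rules like these before publishing them, a minimal sketch using Python's standard urllib.robotparser module is shown here. The helper name is_allowed and the sample paths are illustrative assumptions, and the standard-library parser may not match DTIbot's own matching in every edge case.

```python
from urllib.robotparser import RobotFileParser

# The partial-block rules from the example, as they would appear in robots.txt.
RULES = """\
User-agent: DTIbot
Disallow: /logs/
Disallow: /cgi-bin/
"""


def is_allowed(rules: str, user_agent: str, path: str) -> bool:
    """Return True if the robots.txt text lets user_agent fetch path.

    Hypothetical helper for local testing only; it is not part of DTIbot.
    """
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    # Sample paths are made up for illustration.
    for path in ("/logs/access.log", "/cgi-bin/form.cgi", "/index.html"):
        print(path, is_allowed(RULES, "DTIbot", path))
```

Paths under /logs/ and /cgi-bin/ should report False, and any other path should report True.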

To block DTIbot from your entire web site you can use this:

User-agent: DTIbot
Disallow: /
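
A site-wide block written this way names DTIbot only, so other crawlers are not affected by it. The sketch below illustrates that with Python's standard urllib.robotparser module; "OtherBot" is a hypothetical user agent chosen purely for the comparison, and agents_blocked is an illustrative helper name.

```python
from urllib.robotparser import RobotFileParser

# The whole-site block from the example.
BLOCK_ALL = """\
User-agent: DTIbot
Disallow: /
"""


def agents_blocked(rules: str, agents: list[str], path: str = "/") -> dict[str, bool]:
    """Map each user agent to True if the rules block it from path.

    Hypothetical helper for local testing only.
    """
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return {agent: not parser.can_fetch(agent, path) for agent in agents}


if __name__ == "__main__":
    # "OtherBot" is a made-up crawler name used only for contrast.
    print(agents_blocked(BLOCK_ALL, ["DTIbot", "OtherBot"]))
```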
 
  Site Index
 

If DTIbot has crawled your site and you would like to view what has been indexed, visit our search page and enter "site:yourdomain" as your search query, replacing yourdomain with your site's domain name.

We periodically purge our index in preparation for the next round of testing, so search results for your site may return empty.

 
  Sitemap.xml
 

DTIbot supports the sitemap protocol.

Note: At this time, DTIbot uses the sitemap feature only at random.

You can specify the location of your Sitemap using a robots.txt file.
To do this, add the following line:

Sitemap: <sitemap_location>
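
To confirm that a Sitemap line in robots.txt is readable, the following sketch uses Python's standard urllib.robotparser module (the site_maps() method requires Python 3.8 or later). The URL https://www.example.com/sitemap.xml is a placeholder standing in for <sitemap_location>, and sitemap_urls is an illustrative helper name, not part of DTIbot.

```python
from urllib.robotparser import RobotFileParser

# Placeholder robots.txt; the sitemap URL is an assumed example location.
ROBOTS_TXT = """\
User-agent: DTIbot
Disallow: /logs/

Sitemap: https://www.example.com/sitemap.xml
"""


def sitemap_urls(robots_txt: str) -> list[str]:
    """Return the sitemap locations declared in the robots.txt text.

    Hypothetical helper for local testing only.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no Sitemap line is present.
    return parser.site_maps() or []


if __name__ == "__main__":
    print(sitemap_urls(ROBOTS_TXT))
```

An empty list here would suggest the Sitemap line is missing or malformed.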

More information can be found at http://www.sitemaps.org
  Robots.txt
 
More information on robots.txt can be found at http://www.robotstxt.org

Thank you,
DTIbot
 
 
 
Copyright © 2010 Dorn Technologies Incorporated. All rights reserved.