Category Archive for 'web-robot'

WebFilter

Monday, June 11th, 2007

Owner of the robot : Telemate.net Software, Inc.
Country : USA
Robot type : probe (does not spider sites)
Description : WebFilter lets managers prevent employees from accessing URLs that contain content the company finds objectionable.
User Agent transmitted to the visited web server :

WebFilter Robot

 

 
IP address range : from 216.248.177.128 to 216.248.177.191 ()
URL for more information : [...]

Walhello

Monday, June 11th, 2007

Owner of the robot : Walhello
Country : Netherlands
Robot type : search engine
Description : Walhello combines a search engine and informations from other sources like the DMOZ drectory, as well as products sold by several on-line shops.
User Agent transmitted to the visited web server :

appie 1.1 (www.walhello.com)

 

 
IP address range : from 81.205.0.0 to 81.205.255.255 (planet.nl)
URL for [...]

Wadaino

Monday, June 11th, 2007

Owner of the robot : wadaino.jp
Country : Japan
Robot type : search engine
Description : Japanese portal site.
User Agent transmitted to the visited web server :

wadaino.jp-crawler 0.2 (http://wadaino.jp/)

 

 
IP address range : from 202.51.8.0 to 202.51.15.255 (wadaino.jp)
URL for more information : http://wadaino.jp/
Access control options understood by the robot :

User Agent to use in the robots.txt file : [...]

Voyager

Monday, June 11th, 2007

Owner of the robot : Kosmix Corporation
Country : USA
Robot type : search engine
Description : The Kosmix search engine focusses on health information.
Although the FAQ page of the company says that they do not use the cfetch/1.0 user agent any more since end 2005, it is still active.

User Agent transmitted to the visited web server :

cfetch/1.0

IP [...]

Voila

Monday, June 11th, 2007

Owner of the robot : France T?l?com S.A.
Country : France
Robot type : search engine
Description : France Telecom search engine.
User Agent transmitted to the visited web server :

Mozilla/4.0 (compatible; MSIE 5.0; Windows 95) VoilaBot BETA 1.2 (http://www.voila.com/)

 

 
IP address range : from 193.252.148.0 to 193.252.148.255 ()
URL for more information : http://www.voila.com/
Access control options understood by the [...]

Twiceler

Monday, June 11th, 2007

Owner of the robot : Cuill, Inc.
Country : USA
Robot type : search engine
Description : Twiceler is the experimental robot of Cuill, a young company that pioneers a new approach to search. It is led by former IBM and Google engineers.

User Agent transmitted to the visited web server :

Mozilla/5.0 (Twiceler-0.9 http://www.cuill.com/twiceler/robot.html)

IP address range :

from 64.0.0.0 to [...]

Turnitin

Monday, June 11th, 2007

Owner of the robot : iParadigms, LLC.
Country : USA
Robot type : search engine
Description : Turnitinbot spiders web sites for the sole purpose of helping educational institutions prevent plagiarism (in particular, searching for similarities beteween student papers and content found on the Internet).
User Agent transmitted to the visited web server :

TurnitinBot/2.1 (http://www.turnitin.com/robot/crawlerinfo.html)
TurnitinBot/2.0 (http://www.turnitin.com/robot/crawlerinfo.html)
TurnitinBot/2.0 http://www.turnitin.com/robot/crawlerinfo.html

 

 
IP address range [...]

TMCrawler

Monday, June 11th, 2007

Owner of the robot :
Country : Taiwan
Robot type : unknown
Description : No information available.
User Agent transmitted to the visited web server :

TMCrawler

 

 
IP address range : from 59.124.0.0 to 59.127.255.255 (hinet.net)
URL for more information :
Access control options understood by the robot :

User Agent to use in the robots.txt file :
 
Last visit of [...]

TLink

Monday, June 11th, 2007

Owner of the robot : Thibault Kummer
Country : France
Robot type : search engine software
Description : Tlink original task is the detection of dead links. Thanks to a system of plugins, TLink can be used to perform other maintenance tasks.
User Agent transmitted to the visited web server :

TLink (http://www.coldsource.net/projets/tlink/

 

 
IP address range : from to [...]

SygolBot

Monday, June 11th, 2007

Owner of the robot : Giorgio Galeotti
Country : Italy
Robot type : search engine
Description : Search engine (in English and Italian) developped by an Italian impassioned hobbyist.
User Agent transmitted to the visited web server :

SygolBot http://www.sygol.net

 

 
IP address range : from 81.208.26.0 to 81.208.26.63 (fastwebnet.it)
URL for more information : http://www.sygol.net/SygolBot.asp
Access control options understood by the robot [...]