Category Archive for 'web-robot'

WebFilter

Monday, June 11th, 2007

Owner of the robot : Telemate.net Software, Inc.
Country : USA
Robot type : probe (does not spider sites)
Description : WebFilter lets managers prevent employees from accessing URLs that contain content the company finds objectionable.
User Agent transmitted to the visited web server : WebFilter Robot
IP address range : from 216.248.177.128 [...]

Walhello

Monday, June 11th, 2007

Owner of the robot : Walhello
Country : Netherlands
Robot type : search engine
Description : Walhello combines a search engine with information from other sources, such as the DMOZ directory, as well as products sold by several online shops.
User Agent transmitted to the visited web server : appie 1.1 (www.walhello.com)
IP address [...]

Wadaino

Monday, June 11th, 2007

Owner of the robot : wadaino.jp
Country : Japan
Robot type : search engine
Description : Japanese portal site.
User Agent transmitted to the visited web server : wadaino.jp-crawler 0.2 (http://wadaino.jp/)
IP address range : from 202.51.8.0 to 202.51.15.255 (wadaino.jp)
URL for more information : http://wadaino.jp/
Access control options understood by the robot [...]
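
An IP address range like the one listed for Wadaino can be checked against a server log entry, or converted to CIDR notation for a firewall rule. The following is a minimal sketch using Python's standard `ipaddress` module; the function names `in_listed_range` and `as_cidr_blocks` are hypothetical helpers made up for this illustration, and the sample addresses in the usage note are arbitrary.

```python
import ipaddress

# Range copied from the Wadaino entry; nothing else is taken from the robot itself.
RANGE_START = ipaddress.IPv4Address("202.51.8.0")
RANGE_END = ipaddress.IPv4Address("202.51.15.255")


def in_listed_range(ip):
    """Return True if the dotted-quad string ip lies inside the listed range (inclusive)."""
    return RANGE_START <= ipaddress.IPv4Address(ip) <= RANGE_END


def as_cidr_blocks():
    """Express the listed range as the smallest set of CIDR blocks that covers it exactly."""
    networks = ipaddress.summarize_address_range(RANGE_START, RANGE_END)
    return [str(net) for net in networks]
```

For example, `in_listed_range("202.51.10.7")` is true, `in_listed_range("202.51.16.0")` is false, and `as_cidr_blocks()` returns `["202.51.8.0/21"]`. A matching IP address only suggests that a request came from the listed network; the User Agent string can be set to anything by the client, so neither check alone proves which robot made the request.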

Voyager

Monday, June 11th, 2007

Owner of the robot : Kosmix Corporation
Country : USA
Robot type : search engine
Description : The Kosmix search engine focuses on health information. Although the company's FAQ page says that the cfetch/1.0 user agent has not been used since the end of 2005, it is still active.
User Agent transmitted to [...]

Voila

Monday, June 11th, 2007

Owner of the robot : France Télécom S.A.
Country : France
Robot type : search engine
Description : France Telecom search engine.
User Agent transmitted to the visited web server : Mozilla/4.0 (compatible; MSIE 5.0; Windows 95) VoilaBot BETA 1.2 (http://www.voila.com/)
IP address range : from 193.252.148.0 to 193.252.148.255
URL for more [...]

Twiceler

Monday, June 11th, 2007

Owner of the robot : Cuill, Inc.
Country : USA
Robot type : search engine
Description : Twiceler is the experimental robot of Cuill, a young company that pioneers a new approach to search. It is led by former IBM and Google engineers.
User Agent transmitted to the visited web server : Mozilla/5.0 (Twiceler-0.9 http://www.cuill.com/twiceler/robot.html) [...]

Turnitin

Monday, June 11th, 2007

Owner of the robot : iParadigms, LLC.
Country : USA
Robot type : search engine
Description : TurnitinBot spiders web sites for the sole purpose of helping educational institutions prevent plagiarism (in particular, by searching for similarities between student papers and content found on the Internet).
User Agent transmitted to the visited web server : TurnitinBot/2.1 [...]

TMCrawler

Monday, June 11th, 2007

Owner of the robot :
Country : Taiwan
Robot type : unknown
Description : No information available.
User Agent transmitted to the visited web server : TMCrawler
IP address range : from 59.124.0.0 to 59.127.255.255 (hinet.net)
URL for more information :
Access control options understood by the robot : User Agent to use [...]

TLink

Monday, June 11th, 2007

Owner of the robot : Thibault Kummer
Country : France
Robot type : search engine software
Description : TLink's original task is detecting dead links. Thanks to a plugin system, TLink can also be used to perform other maintenance tasks.
User Agent transmitted to the visited web server : TLink (http://www.coldsource.net/projets/tlink/ [...]

SygolBot

Monday, June 11th, 2007

Owner of the robot : Giorgio Galeotti
Country : Italy
Robot type : search engine
Description : Search engine (in English and Italian) developed by a passionate Italian hobbyist.
User Agent transmitted to the visited web server : SygolBot http://www.sygol.net
IP address range : from 81.208.26.0 to 81.208.26.63 (fastwebnet.it)
URL for more information [...]