Category Archive for 'web-robot'
Monday, June 11th, 2007
Owner of the robot : Telemate.net Software, Inc. Country : USA Robot type : probe (does not spider sites) Description : WebFilter lets managers prevent employees from accessing URLs that contain content the company finds objectionable. User Agent transmitted to the visited web server : WebFilter Robot IP address range : from 216.248.177.128 [...]
Posted in web-robot | Comments Off
Monday, June 11th, 2007
Owner of the robot : Walhello Country : Netherlands Robot type : search engine Description : Walhello combines a search engine and informations from other sources like the DMOZ drectory, as well as products sold by several on-line shops. User Agent transmitted to the visited web server : appie 1.1 (www.walhello.com) IP address [...]
Posted in web-robot | Comments Off
Monday, June 11th, 2007
Owner of the robot : wadaino.jp Country : Japan Robot type : search engine Description : Japanese portal site. User Agent transmitted to the visited web server : wadaino.jp-crawler 0.2 (http://wadaino.jp/) IP address range : from 202.51.8.0 to 202.51.15.255 (wadaino.jp) URL for more information : http://wadaino.jp/ Access control options understood by the robot [...]
Posted in web-robot | Comments Off
Monday, June 11th, 2007
Owner of the robot : Kosmix Corporation Country : USA Robot type : search engine Description : The Kosmix search engine focusses on health information. Although the FAQ page of the company says that they do not use the cfetch/1.0 user agent any more since end 2005, it is still active. User Agent transmitted to [...]
Posted in web-robot | Comments Off
Monday, June 11th, 2007
Owner of the robot : France T?l?com S.A. Country : France Robot type : search engine Description : France Telecom search engine. User Agent transmitted to the visited web server : Mozilla/4.0 (compatible; MSIE 5.0; Windows 95) VoilaBot BETA 1.2 (http://www.voila.com/) IP address range : from 193.252.148.0 to 193.252.148.255 () URL for more [...]
Posted in web-robot | Comments Off
Monday, June 11th, 2007
Owner of the robot : Cuill, Inc. Country : USA Robot type : search engine Description : Twiceler is the experimental robot of Cuill, a young company that pioneers a new approach to search. It is led by former IBM and Google engineers. User Agent transmitted to the visited web server : Mozilla/5.0 (Twiceler-0.9 http://www.cuill.com/twiceler/robot.html) [...]
Posted in web-robot | 12 Comments »
Monday, June 11th, 2007
Owner of the robot : iParadigms, LLC. Country : USA Robot type : search engine Description : Turnitinbot spiders web sites for the sole purpose of helping educational institutions prevent plagiarism (in particular, searching for similarities beteween student papers and content found on the Internet). User Agent transmitted to the visited web server : TurnitinBot/2.1 [...]
Posted in web-robot | Comments Off
Monday, June 11th, 2007
Owner of the robot : Country : Taiwan Robot type : unknown Description : No information available. User Agent transmitted to the visited web server : TMCrawler IP address range : from 59.124.0.0 to 59.127.255.255 (hinet.net) URL for more information : Access control options understood by the robot : User Agent to use [...]
Posted in web-robot | Comments Off
Monday, June 11th, 2007
Owner of the robot : Thibault Kummer Country : France Robot type : search engine software Description : Tlink original task is the detection of dead links. Thanks to a system of plugins, TLink can be used to perform other maintenance tasks. User Agent transmitted to the visited web server : TLink (http://www.coldsource.net/projets/tlink/ [...]
Posted in web-robot | Comments Off
Monday, June 11th, 2007
Owner of the robot : Giorgio Galeotti Country : Italy Robot type : search engine Description : Search engine (in English and Italian) developped by an Italian impassioned hobbyist. User Agent transmitted to the visited web server : SygolBot http://www.sygol.net IP address range : from 81.208.26.0 to 81.208.26.63 (fastwebnet.it) URL for more information [...]
Posted in web-robot | Comments Off