Twiceler


URL for more information : http://www.cuill.com/twiceler/robot.html

12 Responses to “Twiceler”

  1. T├Ánu Samuel says:

    Twiceler is broken. Today it went closed loop and made many queries per second agains front page of my site. Had to block IP. Even tried to call this company and mailed. NOTHING.

  2. Alexander Higginsd says:

    This Bot is identified as a search engine, when and where can or will the results be available for visitors? Is there anymore information on the type of search engine. I assumme it will be available to the general public?

  3. Marcos says:

    They don’t obey robots. They don’t stop from hammering your site. They hit your site a zillion thousand times a day. They claim to be a search engine but where’s the page to search for something? BS. My site just went offline due to excess bandwidth due to this thing. I sent them an e-mail but I am honestly not expecting much.

  4. Jean-Luc says:

    If you don’t like this bot, block it with these lines at the top of your .htaccess file:

    Options +FollowSymlinks
    RewriteEngine on

    RewriteCond %{HTTP_USER_AGENT} twiceler
    RewriteRule ^.*$ - [F]

  5. blah says:

    Twiceler has to be the most retarded bot ever.
    Instead of just blocking them (like the comment above), do a 301 redirect to their own site. Hopefully the bot is broken enough so that it will visit their own site instead.

    RewriteEngine on
    RewriteCond %{HTTP_USER_AGENT} ^Mozilla/5(.*)Twiceler
    RewriteRule . http://www.cuill.com/your_bot_sucks [R=301]

  6. Darcy says:

    Their crawler is pure garbage. It’s crawling one of my sites with all kinds of garbage query strings and made up URL’s.

    I’d say it’s more likely that it’s surfing for security holes.

    They are obviously a bunch of flakes as they are just internet noise.

  7. JAG says:

    well this week i’ve been visited by that crap and i want to kick it out of our site…
    but, i’m really new, and don’t know where to find this .htaccess file you mentioned before.

  8. Jean-Luc says:

    .htaccess files can only be used with Apache web servers. In most cases, the .htaccess file is placed in the top directory of the web site. Search for .htaccess in Google and you will find a lot of information.

  9. Marco says:

    I use this on my sites, works pretty well:
    if (!(strpos(strtolower($_SERVER["HTTP_USER_AGENT"]), 'twiceler') === false)) {
    sleep(25);
    echo 'Twiceler is bad software. Use Google instead. Support proper search engines!';
    die();
    }

  10. Alex Higgins says:

    Twiceler’s such engine just launched at cuil.com and are being touted as the new Google. What a joke, what until all these buried posts about how the the twiceler bot has been abusing webmasters over the last year start to surface.

  11. Quietust says:

    Interestingly, Twiceler does not seem to know how to follow HTTP “301 Moved Permanently” redirects.

    My site currently gets very few hits from it (about 40 per week), but all of those hits are directed to a retired hostname which automatically supplies known bots with 301 redirects to equivalent URLs on its new hostname (other clients get a “300 Multiple Choices” so they read the page, followed by a 5 second Refresh if they don’t click the link). Googlebot and Yahoo! Slurp understand them properly (they immediately rerequest it from the new URL and update their index), but when Twiceler receives a 301 response, it simply gives up and goes to the next page in its list without ever following the redirect.

    I don’t know if it’s smart enough to follow temporary redirects (302 and 303), since I don’t use them anywhere on my site (except for login forms and such, which bots aren’t going to follow).

  12. Bytetips says:

    Is been two days Twiceler crawling my site constantly.. I decided to block this Bot. Now i know how to block this Bot.I hope i can stop it.