<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: SoGou</title>
	<atom:link href="http://www.internetofficer.com/web-robot/sogou/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.internetofficer.com/web-robot/sogou/</link>
	<description>Tools and Articles for Webmasters and SEO's</description>
	<lastBuildDate>Thu, 22 Jul 2010 17:36:36 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: Abel Cheung</title>
		<link>http://www.internetofficer.com/web-robot/sogou/comment-page-1/#comment-1463</link>
		<dc:creator>Abel Cheung</dc:creator>
		<pubDate>Sat, 02 Aug 2008 09:49:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.internetofficer.com/web-robot/sogou/#comment-1463</guid>
		<description>Not only nasty, it even intentionally ignores my robots.txt (which has not been updated for 1.5 yr) and directly crawls all pages explicitly disallowed in robots.txt.

Thus I&#039;m not nice to them as well.

&lt;code&gt;Rewritecond %{HTTP_USER_AGENT} &quot;^Sogou&quot;
RewriteRule .* http:/&lt;em&gt;&lt;/em&gt;/ww&lt;em&gt;&lt;/em&gt;w.sogou.com/ [L,R=301]&lt;/code&gt;

This is implemented with firewall using packet rate limiting on ACK packets as well (yes, not SYN).</description>
		<content:encoded><![CDATA[<p>Not only nasty, it even intentionally ignores my robots.txt (which has not been updated for 1.5 yr) and directly crawls all pages explicitly disallowed in robots.txt.</p>
<p>Thus I&#8217;m not nice to them as well.</p>
<p><code>Rewritecond %{HTTP_USER_AGENT} "^Sogou"<br />
RewriteRule .* http:/<em></em>/ww<em></em>w.sogou.com/ [L,R=301]</code></p>
<p>This is implemented with firewall using packet rate limiting on ACK packets as well (yes, not SYN).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jacco V.</title>
		<link>http://www.internetofficer.com/web-robot/sogou/comment-page-1/#comment-1115</link>
		<dc:creator>Jacco V.</dc:creator>
		<pubDate>Wed, 12 Sep 2007 15:04:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.internetofficer.com/web-robot/sogou/#comment-1115</guid>
		<description>this robot is quite nasty

It somehow sniffs internet trafic and tries to access it.
It even tries to pickup a copy of session-bound pages
&lt;br /&gt;&#160;&lt;br /&gt;
Webtravellog User:
&lt;code&gt;124.115.220.*** - - [12/Sep/2007:14:02:13 +0200] &quot;GET /loginSuccess.php?sessionId=a222d271f54ef1809cc567300fe9ba3f HTTP/1.1&quot; 200 17595 &quot;https://*****.webtravellog.com/login/&quot; &quot;Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)&quot;             SSLv3 RC4-MD5&lt;/code&gt;
&lt;br /&gt;&#160;&lt;br /&gt;
And, the robot:
&lt;code&gt;220.181.19.72 - - [12/Sep/2007:14:11:03 +0200] &quot;GET /loginSuccess.php?sessionId=a222d271f54ef1809cc567300fe9ba3f HTTP/1.1&quot; 200 8093 &quot;-&quot; &quot;Sogou Orion spider/3.0(+http://www.sogou.com/docs/help/webmasters.htm#07)&quot;             SSLv3 DHE-RSA-AES256-SHA&lt;/code&gt;</description>
		<content:encoded><![CDATA[<p>this robot is quite nasty</p>
<p>It somehow sniffs internet trafic and tries to access it.<br />
It even tries to pickup a copy of session-bound pages<br />
<br />&nbsp;<br />
Webtravellog User:<br />
<code>124.115.220.*** - - [12/Sep/2007:14:02:13 +0200] "GET /loginSuccess.php?sessionId=a222d271f54ef1809cc567300fe9ba3f HTTP/1.1" 200 17595 "https://*****.webtravellog.com/login/" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"             SSLv3 RC4-MD5</code><br />
<br />&nbsp;<br />
And, the robot:<br />
<code>220.181.19.72 - - [12/Sep/2007:14:11:03 +0200] "GET /loginSuccess.php?sessionId=a222d271f54ef1809cc567300fe9ba3f HTTP/1.1" 200 8093 "-" "Sogou Orion spider/3.0(+<a href="http://www.sogou.com/docs/help/webmasters.htm#07" rel="nofollow"></a><a href='http://www.sogou.com/docs/help/webmasters.htm#07'>http://www.sogou.com/docs/help/webmasters.htm#07</a>)"             SSLv3 DHE-RSA-AES256-SHA</code></p>
]]></content:encoded>
	</item>
</channel>
</rss>

