GoFuckYourself.com - Adult Webmaster Forum

GoFuckYourself.com - Adult Webmaster Forum (https://gfy.com/index.php)
-   Fucking Around & Business Discussion (https://gfy.com/forumdisplay.php?f=26)
-   -   How Do You Stop Fucking Baido Indexing Sites?.. (https://gfy.com/showthread.php?t=1136551)

EddyTheDog 03-22-2014 10:13 AM

How Do You Stop Fucking Baido Indexing Sites?..
 
It ignores robots.txt - Anyone know what IPs it uses?....

Magnetron 03-22-2014 10:31 AM

What do you have against Scott Baio?

WDF 03-22-2014 10:35 AM

Don't you like all that CN traffic?

freecartoonporn 03-22-2014 10:35 AM

here it is .

Stop Baidu crawler

brassmonkey 03-22-2014 10:42 AM

fucking racist!! :disgust







































:1orglaugh:1orglaugh:1orglaugh

bean-aid 03-22-2014 12:35 PM

Just have host blacklist china traffic.

ErectMedia 03-22-2014 08:51 PM

robots.txt is like asking your neighbor to keep his dog off your lawn, htaccess is like installing an electric fence :2 cents:

rowan 03-22-2014 09:39 PM

Quote:

Originally Posted by beaner (Post 20024044)
Just have host blacklist china traffic.

Make sure they have clue before they do that, and don't end up blacklisting all of Asia Pacific.

My own solution to the problem is to firewall any IP that presents a Baidu user-agent.

fuzebox 03-22-2014 09:55 PM

I would never turn down free traffic. If you don't want china leeching your resources, redirect it somewhere useful :2 cents:

rowan 03-23-2014 04:23 AM

Quote:

Originally Posted by fuzebox (Post 20024299)
I would never turn down free traffic. If you don't want china leeching your resources, redirect it somewhere useful :2 cents:

Dunno about the OP but I'm assuming that he's in the same boat as I am - the issue is that the Baidu web spider trawls over the whole site, but never actually sends any (or very little) human traffic.

If your site has a decent number of pages and/or it is dynamically generated then Baidu really is just wasting resources.

Phoenix 03-23-2014 05:41 AM

i might be interested in taking all chinese traffic.

FlowerKid 03-23-2014 06:12 AM

If baidu spider teaffic is relevant for your server performance, maybe it's time to upgrade.

hdbuilder 03-23-2014 08:44 PM

Just put this in your htaccess file in the root of each domain:

# Block bad spiders
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} Sosospider [NC,OR]
RewriteCond %{HTTP_USER_AGENT} Baiduspider [NC,OR]
RewriteCond %{HTTP_USER_AGENT} Sogou
RewriteRule ^.* - [F,L]

You can add as many as you want , make sure the lines ends with [NC,OR] and with nothing for the last one

Using it for years and been tested ...

EddyTheDog 03-24-2014 12:39 AM

Quote:

Originally Posted by hdbuilder (Post 20024983)
Just put this in your htaccess file in the root of each domain:

# Block bad spiders
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} Sosospider [NC,OR]
RewriteCond %{HTTP_USER_AGENT} Baiduspider [NC,OR]
RewriteCond %{HTTP_USER_AGENT} Sogou
RewriteRule ^.* - [F,L]

You can add as many as you want , make sure the lines ends with [NC,OR] and with nothing for the last one

Using it for years and been tested ...

Thanks - That looks like it will do the trick...

The main reason is that it inflates my traffic to sponsors so much its hard to see what the real conversions are - If you are as into stats as me it is a real pain in the ass...

I am moving towards using GeoIp scripts and sorting traffic that way but it takes time.....


All times are GMT -7. The time now is 10:58 AM.

Powered by vBulletin® Version 3.8.8
Copyright ©2000 - 2025, vBulletin Solutions, Inc.
©2000-, AI Media Network Inc