View Single Post
  #19 (permalink)  
Old 01-09-2008, 07:15 PM
Rodney Rodney is offline
Pligg Donor
 
Join Date: Feb 2006
Posts: 67
Thanks: 8
Thanked 4 Times in 3 Posts
Quote:
Originally Posted by chuckroast View Post
Good question Divisive Cotton

It starts by blocking 174 of the most widely used User_Agents these bad bots use. If the bot isn't listed, it then set's a trap by giving bot certain instructions to follow in the robots.txt file. Like a test. If the bot does not follow these instructions correctly the trap is sprung and it will activate Scarecrow.

If a bot springs the trap, Scarecrow will generate 2,500 random character email addresses to poison the harvester. Then it will block the IP address by writing it to the .htaccess, providing them with nothing but a 404 error if they attempt to reconnect.

You are sent an email each time a bot gets caught in the trap. The email provides all the available information collected about the bot. User_agent, IP, Time, etc.

If for some reason a good bot gets blocked by mistake, you can simply remove the blocked IP from the .htaccess

Hope that clears things up.
Is there any configuration in this module? I'd like to turn off the "poisoning" feature. It's enough for me to just identify them and block them.

I don't want my server sending out any more emails than it has to
Reply With Quote