Lifelesspeople.com


Bad web robots and what you can do to protect your website

 
Lifelesspeople.com Forum Index -> News & Announcements
LP-Trel
Posted: Sun Jan 08, 2006 6:22 am

As many of you are likely aware, not all of your visitors are human. Some are robots such as Googlebot, Yahoo! Slurp, or MSNbot, and they can be quite hungry for bandwidth when browsing or "crawling" your websites.

Note: Not all spiders are good. ;)

Some of them intentionally attempt to spam your contact forms or blog comments, while others attempt to download your entire website's contents. Still others are just "dumb bots" that keep eating bandwidth for no good reason, ignoring robots.txt files completely.

Many of these are already blocked from reaching dynamic websites (PHP, Perl, Ruby, etc.) by our application firewall, but some, such as the bandwidth-hungry bots, can eat enough bandwidth on static content to take your website offline (bandwidth exceeded) in just a few short hours.

To protect yourself the following resources may be of help:

http://www.javascriptkit.com/h.....ss13.shtml
http://www.google.com/search?q.....+.htaccess

Note: Adding the following line to your list of blocked robots could help save bandwidth. We have found that this robot (OmniExplorer_Bot) is very aggressive and can eat away gigabytes of bandwidth in just a few days.

Code:

RewriteCond %{HTTP_USER_AGENT} ^OmniExplorer\_Bot [OR]
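
If you do not have a list of blocked robots yet, the line above needs a few companions before it does anything. Here is a minimal sketch of a complete block for the .htaccess file in public_html, assuming mod_rewrite is available on your account. "ExampleBadBot" is a made-up placeholder and not a real robot name; swap in whichever robots you actually see in your logs. Keep [OR] on every condition except the last one.

```apache
# Sketch of a complete block for .htaccess; assumes mod_rewrite is enabled.
RewriteEngine On

# Match robots by the start of their User-Agent string.
# [NC] ignores case; [OR] chains this condition to the next one.
RewriteCond %{HTTP_USER_AGENT} ^OmniExplorer_Bot [NC,OR]
# "ExampleBadBot" is a hypothetical placeholder, not a real robot name.
RewriteCond %{HTTP_USER_AGENT} ^ExampleBadBot [NC]

# Answer matching requests with 403 Forbidden and stop processing rules.
RewriteRule .* - [F,L]
```

Robots matching either condition should receive a small 403 response instead of the requested file, which is where the bandwidth saving comes from.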

Also, placing a robots.txt file in the top-level directory (public_html) containing at least:

User-agent: *
Crawl-delay: 5

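can help your website be spidered without killing the server or causing your website to be suspended. ;) Crawl-delay asks a robot to wait that many seconds between requests, so it only helps with robots that choose to honor it.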
All trademarks and copyrights on this page are owned by their respective owners. Posts and comments are owned by the poster. Everything else © 2001 - 2007 Lifelesspeople.com