LinuxQuestions.org
Review your favorite Linux distribution.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Server
User Name
Password
Linux - Server This forum is for the discussion of Linux Software used in a server related context.

Notices


Reply
  Search this Thread
Old 01-12-2010, 05:51 AM   #1
belda
LQ Newbie
 
Registered: Dec 2006
Posts: 15

Rep: Reputation: 0
apache overloaded by robots


hi,
im running vps with debian lenny, with apache2 webserver, memcached, postgres and django-framework using mod-wsgi.
Some of the pages served by django take long to generate (aprox 15 s), but there is also the memcached, compensating this.

My problem is that, when a robot visits the site, it starts traversing the site visiting all the pages and also the nongenerated, thus slowing it down to point where it is not responding.


What Im looking for is a solution to identify that the request comes from a robot (user-agent, ips, etc) and limit the resources, so that f.e. only one thread serves the robot etc...

Is it possible? has anyone come across similar problem? Any other solution?
 
Old 01-12-2010, 06:45 AM   #2
Dave_Devnull
Member
 
Registered: May 2009
Posts: 142

Rep: Reputation: 24
How about robots.txt (and a sitemap) taking them to optimal 'static' versions? Or, failing that, using .htaccess to steer them elsewhere?
 
Old 01-12-2010, 07:20 AM   #3
belda
LQ Newbie
 
Registered: Dec 2006
Posts: 15

Original Poster
Rep: Reputation: 0
well, i need them to get the correct content and all sites, because I want to be indexed in search results. So I cannot block them or give them different results, because google would penalize it.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
C++ template functions and overloaded operators PatrickNew Programming 4 08-06-2008 06:09 AM
Ideas about event logging on overloaded servers anebi Linux - Server 5 04-03-2008 12:41 PM
Recovery from overloaded / JMCraig Linux - Newbie 2 04-01-2003 11:26 AM
CPU Overloaded jayakrishnan Linux - General 4 03-03-2003 12:33 AM
overloaded irq assignment manojrkrish Linux - Hardware 0 06-27-2002 11:12 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Server

All times are GMT -5. The time now is 08:02 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration