Linode Forum
Linode Community Forums
 FAQFAQ    SearchSearch    MembersMembers      Register Register 
 LoginLogin [ Anonymous ] 
Post new topic  Reply to topic
Author Message
PostPosted: Thu Jun 05, 2014 1:10 am 
Offline
Junior Member
User avatar

Joined: Wed Oct 20, 2010 7:10 pm
Posts: 36
Location: Sebastopol, CA
Hi,

Recently, I have had two spikes in traffic to my website. Average connections per second have been about 180. These spikes were at about 600 and 800 connections per second. There seemed to be some funny business going on after looking at the server logs of the content that was served; likely scraping content from my website.

I was wondering have any of you experienced this. How did you or would you deal with it? Do any of you have experience with investigating and taking action other than beefing up servers (maybe legal action)?

My server configuration has been as follows:
NodeBalancer with two 1GB web servers (serving HTTP on port 80 and HTTPS on port 443) running Apache. These servers connect to one 2GB database server running MySQL.

I am migrating to two 2GB SSD web servers running LiteSpeed and optimizing my web app to server more static files from a CDN to reduce overall connections.

So, again. Does anyone have experience and recommendations for handling this load? Anyone have experience with researching this kind of activity to determine who is inadvertently knocking down my site?

Let me know if you need more information.

Thanks, Josh


Top
   
PostPosted: Thu Jun 05, 2014 6:00 am 
Offline
Senior Member

Joined: Mon Jan 02, 2012 12:45 pm
Posts: 365
Not that it's much consolation, but I doubt they (whomever 'they' are) were trying to knock down your site. This game of 'block the scrapers' is always going to be pretty much reactionary unless you start blocking large segments of IPs in advance.

When we see this type of activity we block the entire range that owns the offending IP addresses unless it appears to be a consumer/residential IP address. It's not perfect, but it prevents return visits and gives us at least a little bit of a sense of doing something about it.


Top
   
PostPosted: Thu Jun 05, 2014 7:40 pm 
Offline
Senior Member

Joined: Thu Feb 20, 2014 5:06 pm
Posts: 58
I had the same problem (to a lesser extent), and used ModSecurity along with the OWASP Core Ruleset to deal with it. Modsecurity is a pain to setup, but it's free and very powerful. The OWASP ruleset has a ton of rules to deal with bots, and they're pretty good at keeping it up-to-date.


Top
   
PostPosted: Thu Jul 03, 2014 7:21 pm 
Offline

Joined: Thu Jul 03, 2014 7:19 pm
Posts: 1
You might look into setting up Cloudflare in front of your site. With it's CDN and caching and WAF rules you'd probably be all set without having to install anything locally...


Top
   
PostPosted: Wed Jul 30, 2014 3:57 pm 
Offline

Joined: Wed Jul 30, 2014 3:55 pm
Posts: 1
I HIGHLY suggest cloudflare and then varnish in front of the web server. We run 4 servers, 1 with varnish 2 apps and a database server. We can handle some really massive spikes with minimal impact.


Top
   
Display posts from previous:  Sort by  
Post new topic  Reply to topic


Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum

Search for:
Jump to:  
RSS

Powered by phpBB® Forum Software © phpBB Group