Linode Forum
Linode Community Forums
Posted: Mon Jan 26, 2009 2:36 pm
Newbie

Joined: Mon Jan 26, 2009 2:27 pm
Posts: 2
I've been seeing a lot of mysterious load spikes on my Ubuntu 8.04 linode over the past 24+ hours, with the load average ranging from 0.25 up to 1.5+, almost all of it coming from IO wait (as far as I can tell). I can ameliorate the problem a lot by turning off swap, but that's clearly not a good long-term solution. I've tried several kernels (2.6.18, 2.6.27, and 2.6.28) with no change. I've shut down all of my server processes to no effect, and both rkhunter and chkrootkit (freshly reinstalled) came back clean. Finally, I've installed munin to track things going forward.

Is there something else I should be looking for on my server? I'm at a loss at this point for other things that I can do. Is anyone else having similar problems on Atlanta36?


Posted: Mon Jan 26, 2009 4:03 pm
Senior Member

Joined: Fri Dec 07, 2007 1:37 am
Posts: 385
Location: NC, USA
You might take a look at the output from
Code:
$ vmstat 5
and see whether it is io or swap activity that is loading you down.

To be honest, if you are actively swapping you probably need to cut down your memory usage or upgrade to a bigger linode.
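If you'd rather not eyeball the columns, here is a minimal Python sketch that averages the swap-in/swap-out (si, so) and IO wait (wa) columns from captured vmstat output. The function name and the sample numbers are made up for illustration; the column names are the ones vmstat prints in its header row.

```python
# Hypothetical sample of `vmstat 5` output; the numbers are invented.
SAMPLE = """\
procs -----------memory---------- ---swap-- -----io---- -system-- ----cpu----
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa
 0  1   2048  10240   5120 180000    0    0   120    40  100  200  2  1 67 30
 0  2   2048  10100   5120 180100    0    0   200    60  110  210  3  1 56 40
"""


def summarize_vmstat(output: str) -> dict:
    """Average the si, so and wa columns over all sample rows."""
    columns = None
    rows = []
    for line in output.splitlines():
        fields = line.split()
        if not fields:
            continue
        if "si" in fields and "wa" in fields:
            columns = fields  # header row that names each column
            continue
        is_data = columns and len(fields) == len(columns) \
            and all(f.isdigit() for f in fields)
        if is_data:
            rows.append(dict(zip(columns, map(int, fields))))
    if not rows:
        raise ValueError("no vmstat sample rows found")
    n = len(rows)
    return {
        "swap_in": sum(r["si"] for r in rows) / n,
        "swap_out": sum(r["so"] for r in rows) / n,
        "iowait_pct": sum(r["wa"] for r in rows) / n,
    }


if __name__ == "__main__":
    print(summarize_vmstat(SAMPLE))
```

Non-zero si/so averages point at swapping; a high wa with si/so near zero points at plain disk IO. Keep in mind that the first row vmstat prints is an average since boot, so you may want to drop it before feeding the output in.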


Posted: Mon Jan 26, 2009 4:09 pm
Senior Member

Joined: Thu Apr 03, 2008 12:02 am
Posts: 103
AOL: derole
I'm on atlanta12 and I noticed a spike in CPU usage, loadavg and iostat at around 2am-4am this morning; no idea why.
The Linode dashboard graph shows it as flat, but munin caught it.


Posted: Mon Jan 26, 2009 4:09 pm
Newbie

Joined: Mon Jan 26, 2009 2:27 pm
Posts: 2
Thanks. I'll check out vmstat if things get bad again. I'm not actively swapping, though -- about 1/2 of my physical memory is disk cache, so there's plenty that's available for use. When swap is enabled, a few MB get used up (as I've seen on every linux box I've worked on), but it's never been really used.
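For anyone who wants to check the same two numbers (how much of RAM is cache, and how much swap is really in use), this is a rough Python sketch that reads them out of /proc/meminfo-style text. The function name and the sample values are invented for illustration; the field names are the standard ones in /proc/meminfo.

```python
# Hypothetical /proc/meminfo excerpt; the values are invented.
SAMPLE = """\
MemTotal:       368640 kB
MemFree:         20480 kB
Buffers:         12288 kB
Cached:         172032 kB
SwapTotal:      262144 kB
SwapFree:       258048 kB
"""


def memory_picture(meminfo: str) -> dict:
    """Return the cache share of RAM and the swap in use (MB)."""
    values = {}
    for line in meminfo.splitlines():
        key, _, rest = line.partition(":")
        parts = rest.split()
        if parts and parts[0].isdigit():
            values[key.strip()] = int(parts[0])  # values are in kB
    total = values["MemTotal"]
    cache = values.get("Cached", 0) + values.get("Buffers", 0)
    swap_used = values.get("SwapTotal", 0) - values.get("SwapFree", 0)
    return {
        "cache_fraction": cache / total,
        "swap_used_mb": swap_used / 1024,
    }


if __name__ == "__main__":
    # On a live box: memory_picture(open("/proc/meminfo").read())
    print(memory_picture(SAMPLE))
```

A large cache fraction together with only a few MB of swap in use matches the "not actively swapping" picture described above.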


Posted: Wed Jan 28, 2009 3:18 pm
Junior Member

Joined: Sat May 05, 2007 8:53 pm
Posts: 49
I've been seeing very poor performance on my linode at newark62 as well. I've seen my loads spike when the vps isn't doing anything.


Posted: Wed Jan 28, 2009 3:40 pm
Senior Member

Joined: Fri May 02, 2008 8:44 pm
Posts: 1121
Me too. I'm on fremont59 and my load average is a ridiculous 3.20. I'm not doing anything out of the ordinary, I'm not swapping at all. (I'm using only half of my RAM.) I don't see any unfamiliar processes running in the background, so I know I wasn't hacked. But the CPU is spending 20-40% of its time waiting for disk I/O.

Edit: The load average is now below 1.00. Whatever the problem was, it seems to be going away. But I don't like things like this, anyway!

Can somebody among the staff look into the issue, please? Now you have a list of hosts (newark62, atlanta12, atlanta36, fremont59) to look into, and a corresponding list of times at which the I/O problems seem to have occurred. What was going on on these hosts?
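If you want to attach numbers to a ticket, one way to measure the "time waiting for disk I/O" figure yourself is to take two readings of the aggregate "cpu" line in /proc/stat and look at how much of the elapsed time went to iowait. This is only a sketch: the function name and the sample lines are made up, and it assumes the usual field order (user, nice, system, idle, iowait, irq, softirq, steal).

```python
def iowait_percent(before: str, after: str) -> float:
    """Share of CPU time spent in iowait between two 'cpu' lines."""

    def ticks(line: str) -> list:
        fields = line.split()
        if fields[0] != "cpu":
            raise ValueError("expected the aggregate 'cpu' line")
        return [int(x) for x in fields[1:]]

    deltas = [b - a for a, b in zip(ticks(before), ticks(after))]
    # Sum only the first eight counters: guest time, where reported,
    # is already included in user time.
    total = sum(deltas[:8])
    if total <= 0:
        raise ValueError("no CPU time elapsed between the samples")
    return 100.0 * deltas[4] / total  # index 4 is iowait


if __name__ == "__main__":
    # Invented samples; on a live box, read the first line of
    # /proc/stat twice, a few seconds apart.
    first = "cpu  1000 0 500 8000 500 0 0 0"
    second = "cpu  1100 0 550 8500 800 0 0 50"
    print(iowait_percent(first, second))
```

The steal counter (the last of the eight) is worth watching on a VPS as well, since it reflects time the hypervisor gave to other guests.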


Posted: Wed Jan 28, 2009 3:51 pm
Senior Member

Joined: Tue Apr 29, 2008 6:26 pm
Posts: 58
Website: http://blog.shadypixel.com/
You can use iotop to see which processes are doing IO and which are waiting on it.
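If iotop won't run on your kernel, a cruder alternative is to list the processes currently in uninterruptible sleep (state D), which is usually where processes blocked on disk IO sit. Here is a minimal Python sketch that reads /proc/[pid]/stat; the function names are my own, and it is no substitute for iotop's per-process throughput numbers.

```python
import glob


def parse_proc_stat(line: str) -> tuple:
    """Return (pid, command, state) from a /proc/[pid]/stat line."""
    # The command name is wrapped in parentheses and may itself
    # contain spaces or ')', so split on the first '(' and last ')'.
    open_idx = line.index("(")
    close_idx = line.rindex(")")
    pid = int(line[:open_idx])
    comm = line[open_idx + 1:close_idx]
    state = line[close_idx + 1:].split()[0]
    return pid, comm, state


def blocked_processes() -> list:
    """List (pid, command) for processes currently in state D."""
    blocked = []
    for path in glob.glob("/proc/[0-9]*/stat"):
        try:
            with open(path) as f:
                pid, comm, state = parse_proc_stat(f.read())
        except OSError:
            continue  # the process exited while we were looking
        if state == "D":
            blocked.append((pid, comm))
    return blocked


if __name__ == "__main__":
    for pid, comm in blocked_processes():
        print(pid, comm)
```

Run it a few times in a row; a process that shows up in state D over and over is the one stuck waiting.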


Posted: Wed Jan 28, 2009 6:46 pm
Senior Member

Joined: Fri May 02, 2008 8:44 pm
Posts: 1121
Problem persists on fremont59.

Operations that require even a moderate amount of CPU + IO are feeling very sluggish. All MySQL queries are slower than usual, even though my DB is sitting idle at this time. Public key based SSH logins take several seconds longer than usual. Apt-get update takes forever. Rsync takes ~100 times longer than usual to generate a file list.

My linode usually has a load average of 0.1 or lower. Right now my load average is 0.8, and a reboot doesn't do anything to solve the problem. I waited a few minutes after the reboot before checking my load average, so that all those daemons starting up after the reboot wouldn't affect my short-term load average. No noticeable drop. Oh, and it took several times longer than usual for my daemons to start up after the reboot.

I've been monitoring this using top, iotop, vmstat, and munin. I'm not using any more CPU or IO than usual -- CPU average 4%, IO average 200 according to the Dashboard. I'm not doing anything unusual with my linode, nor does anyone else seem to be doing something nasty with my account. Nonetheless, many of my processes are waiting for IO to complete. As if I were trying to do some work while a degraded RAID array was being rebuilt!

Which makes me suspect..........

Is somebody fsck'ing a disk on fremont59 (and the other hosts mentioned above)?


Posted: Wed Jan 28, 2009 7:25 pm

Joined: Thu Nov 20, 2008 4:52 am
Posts: 1
I also noticed this behavior last night on newark74.


Posted: Wed Jan 28, 2009 10:37 pm
Senior Member

Joined: Tue Apr 29, 2008 6:26 pm
Posts: 58
Website: http://blog.shadypixel.com/
@hybinet

Doesn't sound like the problem's on your linode. I'd open a ticket about it.


Posted: Thu Jan 29, 2009 12:50 am
Senior Member

Joined: Fri May 02, 2008 8:44 pm
Posts: 1121
@btmorex

Already did, three hours ago. I'll post in this thread if I get an answer that might be helpful to others as well. Or better yet, a staff member should let us know what's going on.


Posted: Thu Jan 29, 2009 1:50 pm
Senior Member

Joined: Fri May 02, 2008 8:44 pm
Posts: 1121
UPDATE

Might be kernel related. I was running the new 2.6.27.4 kernel when I came across the anomalies described above. I switched back to "Latest 2.6 series" twelve hours ago, and the problem has disappeared since then. (I've been monitoring my load averages at 10 minute intervals throughout the night.)
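For anyone who wants to do the same kind of overnight check, sampling the load average at a fixed interval can be sketched in Python roughly like this. It is not the exact method used here; the function names, the default of six samples and the injectable sleep parameter are all choices made for this example.

```python
import time


def read_loadavg(text: str) -> tuple:
    """Parse the 1-, 5- and 15-minute averages from /proc/loadavg."""
    one, five, fifteen = (float(x) for x in text.split()[:3])
    return one, five, fifteen


def log_loadavg(interval_s: int = 600, samples: int = 6,
                path: str = "/proc/loadavg", sleep=time.sleep) -> list:
    """Collect timestamped load averages, one every interval_s seconds."""
    readings = []
    for i in range(samples):
        with open(path) as f:
            stamp = time.strftime("%Y-%m-%d %H:%M:%S")
            readings.append((stamp,) + read_loadavg(f.read()))
        if i < samples - 1:
            sleep(interval_s)  # 600 s = the 10 minute interval
    return readings


if __name__ == "__main__":
    for row in log_loadavg(interval_s=600, samples=6):
        print(*row)
```

Comparing the logged values before and after a kernel switch gives you something concrete to put in a ticket.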

Caker also says that Linux kernels newer than 2.6.20 may have obscure IO issues, so I for one am willing to attribute my problems to my premature adoption of the bleeding edge kernel.

But what about others? @patrickpkt, oliver, astrashe3 : Which kernel version have you been running?


Posted: Thu Jan 29, 2009 2:03 pm
Senior Member

Joined: Thu Apr 03, 2008 12:02 am
Posts: 103
AOL: derole
2.6.18.8-linode10 (SMP) i686


Posted: Thu Jan 29, 2009 5:33 pm
Junior Member

Joined: Sat May 05, 2007 8:53 pm
Posts: 49
Whatever was going on isn't nearly as bad today -- it might not be happening at all.

I'm running 2.6.18.8-linode16. I didn't run iotop because it complained that my kernel was too old.

I reinstalled my linode's OS yesterday with a 32 bit ubuntu 8.10 -- I had the problem before and after the reinstall.

I don't know about everyone else, but mine is running pretty well now. I don't know what was going on, but I haven't changed anything to fix it. It got better on its own.


Posted: Sun Feb 01, 2009 11:06 am
Junior Member

Joined: Fri Oct 22, 2004 6:40 am
Posts: 20
I'm on newark61 and over the last 30 days my IO has averaged 10K by looking at the graphs in my control panel.

Three weeks ago this was less than 1K, and I'm not doing anything different on my server. It just seems to be using swap space more than normal!

CPU usage is currently at 10% (5% average for the last 30 days).

