Linode Forum
Linode Community Forums
 FAQFAQ    SearchSearch    MembersMembers      Register Register 
 LoginLogin [ Anonymous ] 
Forum locked  This topic is locked, you cannot edit posts or make further replies.
Author Message
 Post subject: Host26 reboot
PostPosted: Tue Mar 01, 2005 4:00 am 
Offline
Linode Staff
User avatar

Joined: Tue Apr 15, 2003 6:24 pm
Posts: 3090
Website: http://www.linode.com/
Location: Galloway, NJ
OOM Killer is eating Linodes on host26 now.

Upgrading the host kernel -- Linodes will be shutdown properly, host will be rebooted, and Linodes will be restarted.

Updates in a few.

-Chris


Top
   
 Post subject:
PostPosted: Tue Mar 01, 2005 4:19 am 
Offline
Linode Staff
User avatar

Joined: Tue Apr 15, 2003 6:24 pm
Posts: 3090
Website: http://www.linode.com/
Location: Galloway, NJ
Host26's kernel has been upgraded. Linodes are restarting now.

-Chris


Top
   
 Post subject: OOM Killer details
PostPosted: Tue Mar 01, 2005 1:49 pm 
Offline
Junior Member

Joined: Tue Jun 29, 2004 2:27 pm
Posts: 34
Chris,
Is there an overview explaination somewhere of the OOM Killer issues? I checked the forums and there seem to be occasional complaints about the OOM Killer being overly agressive, but I was looking for more details.

What are the specific issues with regard to UML hosts? Is the host OS's OOM Killer causing the problems, or a specific Linode instance's OOM Killer? Are there steps a Linode user can take to reduce the OOM Killer's wrath?

Like others, I have noticed during some more intense work like compiling, I will occasionally get out of memory errors even though there is plenty of swap configured (Linode 64 + 256M swap) and available.

Thanks,

--John


Top
   
 Post subject:
PostPosted: Tue Mar 01, 2005 2:04 pm 
Offline
Linode Staff
User avatar

Joined: Tue Apr 15, 2003 6:24 pm
Posts: 3090
Website: http://www.linode.com/
Location: Galloway, NJ
This is an issue with Linux kernel 2.6.10 having bugs in the OOM killer. None of these hosts are actually out of memory, but once the OOM killer comes alive it will eventually kill everything on the box. At least that's been our experience. As for an explaination, look for the various OOM killer related threads on LKML.

A bunch of OOM killer patches went into 2.6.11, and that seems to have fixed things.

-Chris


Top
   
 Post subject: OOM Killer
PostPosted: Tue Mar 01, 2005 2:21 pm 
Offline
Junior Member

Joined: Tue Jun 29, 2004 2:27 pm
Posts: 34
So the big problem is with the host kernel killing an entire UML instances?

And for the people who have seen out of memory compile problems (when there is enough swap), it's most likely their own UML's OOM Killer that may be killing processes?

Which means that some of my memory problems described above may be due to my UML kernel (using 2.4 latest) not have the OOM Killer patches backported? (I don't know how bad the OOM Killer problems are in 2.4, but there seem to be some issues based on what I've read.)


--John


Top
   
 Post subject: Re: OOM Killer
PostPosted: Tue Mar 01, 2005 2:33 pm 
Offline
Linode Staff
User avatar

Joined: Tue Apr 15, 2003 6:24 pm
Posts: 3090
Website: http://www.linode.com/
Location: Galloway, NJ
jpw wrote:
So the big problem is with the host kernel killing an entire UML instances?

Yup. The OOM killer "decides" that the host needs more memory (when they're is *plenty* of free memory/swap), and starts killing UML processes. Once it starts, it never stops.

jpw wrote:
And for the people who have seen out of memory compile problems (when there is enough swap), it's most likely their own UML's OOM Killer that may be killing processes?

Well, this might be another problem. Are you using Gentoo? If so, what host are you on?

jpw wrote:
Which means that some of my memory problems described above may be due to my UML kernel (using 2.4 latest) not have the OOM Killer patches backported? (I don't know how bad the OOM Killer problems are in 2.4, but there seem to be some issues based on what I've read.)

Hmm, 2.4 had and then didn't have the OOM killer. If you're referring to "page_alloc" failures in your console, I believe those are genuine OOM scenarios (no RAM/swap). Or, perhaps the memory allocator in 2.4 is susceptible to the same problems.

-Chris


Top
   
 Post subject: Re: OOM Killer
PostPosted: Tue Mar 01, 2005 3:08 pm 
Offline
Junior Member

Joined: Tue Jun 29, 2004 2:27 pm
Posts: 34
caker wrote:
Well, this might be another problem. Are you using Gentoo? If so, what host are you on?

host26, running Fedora Core 1, kernel 2.4 latest

I see the out of memory errors occasionally when compiling various things (named or sendmail, for example). Rerunning 'make' again usually finishes without problems. Restarting the build completely from scratch (to try and reproduce the problem) doesn't usually result in the out of memory error. So it seems less reproducable than I would expect if it was simply just running out of RAM + swap, which is why I thought it might be OOM Killer related.

caker wrote:
Hmm, 2.4 had and then didn't have the OOM killer. If you're referring to "page_alloc" failures in your console, I believe those are genuine OOM scenarios (no RAM/swap). Or, perhaps the memory allocator in 2.4 is susceptible to the same problems.

I don't see anything in the logs or the system console, just in the shell as part of the make. There always seems to be a lot of unused swap when I see the error. Perhaps it's not OOM Killer related at all.

--John


Top
   
Display posts from previous:  Sort by  
Forum locked  This topic is locked, you cannot edit posts or make further replies.


Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum

Search for:
Jump to:  
RSS

Powered by phpBB® Forum Software © phpBB Group