Linode Forum
Linode Community Forums
 FAQFAQ    SearchSearch    MembersMembers      Register Register 
 LoginLogin [ Anonymous ] 
Post new topic  Reply to topic
Author Message
 Post subject: fremont outage
PostPosted: Sun Nov 21, 2010 1:49 pm 
Offline
Senior Member

Joined: Sat Feb 10, 2007 7:49 pm
Posts: 96
Website: http://www.arbitraryconstant.com/
A company having the people and spare hardware to recover from a truly spectacular outage like this isn't something I take for granted.

Cheers.


Top
   
 Post subject:
PostPosted: Sun Nov 21, 2010 2:42 pm 
Offline
Senior Member
User avatar

Joined: Fri Oct 24, 2003 3:51 pm
Posts: 965
Location: Netherlands
+1. Judging from the time it took to come back up, my primary mail server must have been on one of the hosts that got damaged during the outage. Really satisfying to have the problem just fixed in a few hours using redundant spare gear, instead of having to wait for repairs or shipment of replacement gear, as has happened to me at other providers.

Well done, Linode team.

_________________
/ Peter


Top
   
 Post subject:
PostPosted: Sun Nov 21, 2010 4:11 pm 
Offline
Newbie

Joined: Sun Nov 21, 2010 4:06 pm
Posts: 2
While its nice that the pseudo redundancy was there, I'm still disappointed that I had to learn of the outage by searching twitter for "linode". No email, phone call, or a minimum @linode tweet. Unless I'm blind, there isn't a link to the status page on the main linode.com site.


Top
   
 Post subject:
PostPosted: Sun Nov 21, 2010 4:53 pm 
Offline
Senior Member
User avatar

Joined: Fri Dec 11, 2009 7:09 pm
Posts: 168
I use the rss feed from http://status.linode.com/
I have to say- every other host I have ever used would have had, at minimum, 48 hours of downtime from this. Having everything back up and running in 6 hours after a catastrophic failure is impressive.

_________________
--
Chris Bryant


Top
   
 Post subject:
PostPosted: Sun Nov 21, 2010 4:55 pm 
Offline
Senior Member
User avatar

Joined: Sun Dec 27, 2009 11:12 pm
Posts: 1038
Location: Colorado, USA
JasonTokoph wrote:
No phone call

Where's Miss Hathaway when you need her.

I think it's pretty naive to think that a low cost VPS service is going to call you personally with every hiccup.

Look into monitoring apps (locally run or outsourced).


Top
   
 Post subject:
PostPosted: Sun Nov 21, 2010 5:45 pm 
Offline
Senior Member

Joined: Wed May 13, 2009 1:18 am
Posts: 681
JasonTokoph wrote:
While its nice that the pseudo redundancy was there, I'm still disappointed that I had to learn of the outage by searching twitter for "linode". No email, phone call, or a minimum @linode tweet. Unless I'm blind, there isn't a link to the status page on the main linode.com site.

Yeah, with something like this where the affected Linodes should be pretty clearly identified, having an email go out to their owners seems like it would be a good feature, providing it was internally simple enough so it didn't distract staff from actually addressing the problem.

With that said, if you need to be aware of outages for your node, I'd strongly suggest doing some monitoring (either yourself or with any of the free and/or paid online services). There are a number of problems - especially network related - that can arise for which Linode can't necessarily know for sure which nodes are affected, not to mention more localized issues. I know for my Linodes that I need to stay up, I monitor all the services I care about myself, and it's always my own monitoring that triggers me to go look at status.linode.com to see if it's a more general problem.

I'd also suggest throwing in a boot time cron entry on your Linode to spit out a message to you any time it boots, just as a head's up.

-- David


Top
   
 Post subject:
PostPosted: Sun Nov 21, 2010 8:09 pm 
Offline
Junior Member

Joined: Mon May 24, 2010 6:39 pm
Posts: 20
I hope they really look into why the UPS system failed. I was shocked to hear a power outage would take down a hosting company. I can understand now that I see the UPS failed. Still a shock that the UPS systems failed says to me that something failed along the lines of making sure they where tested to ensure they worked.

Glad to hear they had hot spares ready to go for servers and got things fixed up for clients.


Top
   
 Post subject:
PostPosted: Sun Nov 21, 2010 10:03 pm 
Offline
Senior Member
User avatar

Joined: Sat Aug 30, 2008 1:55 pm
Posts: 1739
Location: Rochester, New York
Datacenters exist in the real world and are built by humans. They can, and will, fail in unanticipated ways. That's life. Don't be surprised when it happens, or think that it won't happen just because of a company's reputation, a facility's history, etc. It will.

From my perspective, the next few days/weeks are the interesting part... figuring out the causes, the contributing factors, and all that stuff. I live for the RFO. :-)

_________________
Code:
/* TODO: need to add signature to posts */


Top
   
 Post subject:
PostPosted: Sun Nov 21, 2010 11:48 pm 
Offline
Newbie

Joined: Sun Nov 21, 2010 4:06 pm
Posts: 2
db3l wrote:
With that said, if you need to be aware of outages for your node, I'd strongly suggest doing some monitoring (either yourself or with any of the free and/or paid online services).


Don't get me wrong. I have monitors in place for all of my systems, Linodes or otherwise. These monitors kicked off alerts that I needed. The issue is that I had no idea why the alerts were being generated.

Also, while I didn't expect a phone call, I did list it along with other forms of communication as I didn't receive anything.

Its even worse that I have yet to receive any form of communication from Linode regarding the outage. Fine, use 100% of your resources to fix the issue, but once its fixed, affected customers should receive some sort of notification.

I'll be happy when a post mortem is in my inbox along with an apology for lack of communication.


Top
   
 Post subject:
PostPosted: Mon Nov 22, 2010 12:36 am 
Offline
Senior Member
User avatar

Joined: Sun Dec 27, 2009 11:12 pm
Posts: 1038
Location: Colorado, USA
JasonTokoph wrote:
I'll be happy when a post mortem is in my inbox along with an apology for lack of communication.

You're kidding right?

They have a SLA, which they didn't break.

They have a Status page - that pretty much tells you what's going on.

They sell a UNMANAGED service.

Unless you have a few DOZEN HIGH END VPS's that were effected, I think you're vastly over reacting.

They were down, they fixed it, they reported it, it's a rare event, get over it already.


Top
   
 Post subject:
PostPosted: Mon Nov 22, 2010 1:10 am 
Offline
Senior Member

Joined: Wed May 13, 2009 1:18 am
Posts: 681
JasonTokoph wrote:
Don't get me wrong. I have monitors in place for all of my systems, Linodes or otherwise. These monitors kicked off alerts that I needed. The issue is that I had no idea why the alerts were being generated.

Oh, sorry - I got the wrong impression from your earlier phrasing, which made it sound like you didn't have anything yourself to let you notice the outage even on your own Linodes.

For what it's worth, when my monitors fire, and after the obvious stuff like issues on the monitor's own connection or the network, first thing I do is check status.linode.com, and then if there is nothing there, get on IRC and lurk a bit or ask about it. I'm not a tweeter, so I rarely look there. Typically one of the two will give me the low down fairly quickly (IRC in particular gets busier during the larger outages, and at a minimum you can probably find out if you aren't alone), but I'm willing to accept some latency also since I may have actually noticed things myself first or the issue may be small enough not to prompt a status update.

Also, the good news is that now you know about status.linode.com for the future. I agree some additional links to it couldn't hurt, though it's a fairly common naming convention among providers in the last few years, so not a bad name to try by default. Also, depending on the outage, the main Linode site may not itself respond - e.g., if the recent outage had been in Texas, so depending on using a link from the main site won't always work (status.linode.com is hosted elsewhere).

Quote:
I'll be happy when a post mortem is in my inbox along with an apology for lack of communication.

Wouldn't necessarily wait for the inbox, but I'm sure some summary information will be posted or made available when possible. For example http://www.linode.com/forums/viewtopic. ... 4778#24778 when there was an outage affecting a decent number of machines in the Newark DC last year.

-- David


Top
   
 Post subject:
PostPosted: Mon Nov 22, 2010 2:39 am 
Offline
Newbie

Joined: Mon May 24, 2010 12:29 pm
Posts: 3
Location: Here.
Does anyone even use email for service updates anymore?


Top
   
 Post subject:
PostPosted: Mon Nov 22, 2010 3:49 am 
Offline
Senior Member

Joined: Sat Jun 05, 2004 12:49 am
Posts: 333
Jay3ld wrote:
I hope they really look into why the UPS system failed. I was shocked to hear a power outage would take down a hosting company.


You must be new to the internet


Top
   
 Post subject:
PostPosted: Mon Nov 22, 2010 6:25 am 
Offline
Senior Member
User avatar

Joined: Tue Aug 17, 2004 11:37 pm
Posts: 262
Website: http://www.our-lan.com
WLM: nf@our-lan.com
Location: Brisbane, Australia
email is useless as a status message medium.. esepcially given my frontend client access server for mail was one of the nodes that went down.. Such is life though :)

_________________
ServerAdmin - www.our-lan.com
"Diplomacy is the art of saying nice doggy whilst looking for a really big stick"
"In my experiece, any attempt to make any system idiot proof will only challenge God to make a better idiot"


Top
   
 Post subject:
PostPosted: Mon Nov 22, 2010 7:12 am 
Offline
Senior Member
User avatar

Joined: Fri Oct 24, 2003 3:51 pm
Posts: 965
Location: Netherlands
bryantrv wrote:
I use the rss feed from http://status.linode.com/

+1. This is hosted at typepad.com, in a different datacenter to all Linodes.

_________________
/ Peter


Top
   
Display posts from previous:  Sort by  
Post new topic  Reply to topic


Who is online

Users browsing this forum: No registered users and 2 guests


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum

Search for:
Jump to:  
RSS

Powered by phpBB® Forum Software © phpBB Group