OverlordQ wrote:
A) Unplanned outage, how do they warn against those?
By creating maint@linode, which lists _all_ maintenance, without exception. That way I can look at that mail archive, see today's date and 'upgrading libraries on xen hosts' and go... ahhhh!
Quote:
B) Do you want them to fix your box or post here?
Actually, I want them to post here first, then fix the problem.
I've just had to tell a customer "your servers seem ok but _may_ be rebooted at some point" because I don't know if they are going to reboot all hosts. I had to tell them that because there is _NO_ official information that I can find on the status of the problem (or even much on the cause, fix ETA, etc).
Take 5 mins to post _all_ the info you have. Say what is going to happen to rebooted hosts (are they ok now?) and what about un-rebooted hosts (will they be rebooted later, or are they fine?).
Part of taking the credit and the fanboy love we have for this wonderful thing (and I think Linode is great) is also taking responsibility for the fsck-ups that happen along the way.
I'm a professional sysadmin, so I understand things 'go wrong' and that is fine, people have to live with that. But what you *can do* is be open, honest and fully informative about the problem. It takes 5 mins to do, and often stopping and thinking about the problem enough to lay it out clearly can actually help.
For example, xen instances can be _saved_ to disk. Is there some reason that the admins can't do the following:
* save all xen instances on a host
* reboot the host
* restore the xen instances
If that was workable, then maybe each instance would just 'hang' for 2 minutes and _not need_ a reboot. *shrugs* Maybe Linode should look at trying that (xm save _domain_ _savefile_) and see if it could be used to reduce the impact next time.
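To make the three steps above concrete, here is a rough sketch of what I mean. It only _prints_ the commands for a given list of guests, it does not run anything. The `xm save` and `xm restore` commands are real; the function names, the guest names and the SAVE_DIR path are just made up for illustration, and I have no idea whether a saved image restores cleanly after whatever was upgraded on the host.

```shell
#!/bin/sh
# Sketch only: print a save / reboot / restore plan for some Xen guests.
# SAVE_DIR and the function names are hypothetical, not anything Linode uses.
SAVE_DIR="${SAVE_DIR:-/var/lib/xen/save}"

plan_save() {
    # One "xm save <domain> <checkpoint file>" line per guest name given.
    for dom in "$@"; do
        echo "xm save $dom $SAVE_DIR/$dom.chk"
    done
}

plan_restore() {
    # One "xm restore <checkpoint file>" line per guest name given.
    for dom in "$@"; do
        echo "xm restore $SAVE_DIR/$dom.chk"
    done
}

plan_host_maintenance() {
    # Save every guest, reboot the host, then restore every guest.
    plan_save "$@"
    echo "reboot"
    plan_restore "$@"
}
```

Calling `plan_host_maintenance web1 db1` prints two `xm save` lines, then `reboot`, then two `xm restore` lines. In real life the restore half would have to run after the host comes back up (from an init script, say), since the reboot kills whatever script started it.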
Quote:
C) You get what you pay for.
D) Run it yourself if you think you can do better.
I pay for a service and part of that service involves updating the:
* forums
* outage announcements
* blogs
* Twitter
None of which have any useful information.
I have taken 5 mins to email my clients and say "Linode-hosted servers are to be taken as 'unreliable' until further notice".

There. I did better.
