943,336 Members | Top Members by Rank

Ad:
Feb 2nd, 2009
0

A New Way To Look At Service Outage Post-Mortems

Expand Post »
From angry customers to irate CIOs, service outages are an IT nightmare to be avoided at all costs. It's impossible to prevent them entirely, so post-mortem assessment is critical to understanding how to minimize their impact in the future.

While not strictly a service "outage," Google's search engine mishap last weekend got me thinking about how the company will dissect the incident and what methods they will put in place so similar situations won't happen again.

That got me wondering about how other, smaller companies do post-mortems and I came across a fascinating paper [PDF] presented at an IT conference hosted recently by the Computer Measurement Group (CMG).

Charles Foy, a service level manager with Siemens Healthcare, wrote a paper called, "Say Goodbye to Post Mortems, Say Hello to Effective Problem Management" that takes an in-depth look at how his company investigates service outages and learns from them.

Foy says that although there were already methods in place, the need for a new system became evident after Y2K. At first, Foy's team planned to simply house standard post-mortem documents in a centralized folder. They soon realized, however that they could design and entirely new process and database that would eventually lower the amount of unscheduled downtime.

"The benefits of a database of post-mortems were numerous. When implemented, we would have a central repository with records of all outages, the customers affected, downtime incurred, hardware involved, root causes, and the preventive measures implemented," wrote Foy in his paper.

"Our new goal then was to define a process and database that reduces unscheduled outages, increases availability, and communicates the root cause and preventive measures implemented to internal and external audiences," he goes on to say.

To get all the details on how Foy's team developed its new process on problem management, you'll need to download the 11-page paper [PDF]. Its well worth taking the time to read because, although some of the methods may be overkill for small organizations, it's easy to glean plenty of tips that can be applied to companies of any size.
Similar Threads
Reputation Points: 10
Solved Threads: 0
Junior Poster
Lisa Hoover is offline Offline
104 posts
since Apr 2008

This thread is more than three months old

No one has posted to this discussion for at least three months. Please let old threads die and do not reply to them unless you feel you have something new and valuable to contribute that absolutely must be added to make the discussion complete. Otherwise, please start a new thread in this forum instead.
Message:
Previous Thread in IT Professionals' Lounge Forum Timeline: Smartass satnav and the 32 mile traffic jam
Next Thread in IT Professionals' Lounge Forum Timeline: The unhappy webmaster: a modern fairy tale





About Us | Contact Us | Advertise | Acceptable Use Policy
Forum Index | Build Custom RSS Feed


Follow us on Twitter


© 2011 DaniWeb® LLC