Government Technology

Battery Failure, Human Error Still Cause Most Data Center Outages



Battery failure lead cause of data center outages

September 17, 2013 By

Data center outages remain common and three major factors — uninterruptable power supply (UPS) battery failure, human error and exceeding UPS capacity — are the root causes, according to a new study released earlier this month.

Study of Data Center Outages, released by the Ponemon Institute on Sept. 10, and sponsored by Emerson Network Power, revealed that 91 percent of respondents experienced an unplanned data center outage within the last 24 months, a slight dip from the 2010 survey results, when 95 percent of respondents had reported an outage.

“That to me is probably a wake-up call to most data center professionals who should think seriously about what happens when they have unplanned outages,” said Peter Panfil, vice president of global power sales for Emerson Network Power.

The study reported findings on survey responses from IT professionals nationwide, including 8 percent from public sector. Fifty-five percent of the survey’s respondents claimed that UPS battery failure was the top root cause for data center outages, while 48 percent felt human error was the root cause. Forty-six percent of those surveyed cited exceeding UPS capacity as a major problem.

During the last few months, two major data center outages occurred within state governments. According to local media, Oregon government services encountered major setbacks after the state’s data center went down in July, resulting in the delay of unemployment payments and a temporary loss of state employee email access. On Sept. 13, a power outage in one of New Jersey’s three state data centers caused a temporary shutdown of the state's websites and computers.

Panfil said a typical data center operating on 1 megawatt of UPS will have about five strings of batteries, each string containing 40 batteries. Much like a string of Christmas lights wired in a series, if one of those batteries fails, the entire string will also fail. 

“In many cases, that failure cannot be detected if the battery is just sitting there at an idle state not delivering power,” Panfil said. “As a battery ages and as it starts to fail, its internal resistance goes up and that’s one of its failure mechanisms.”

How to Prevent an Outage

According to the survey responses, IT professionals of high-performing data centers recommend the following actions for preventing outages:

  • Consider data center availability their highest priority above all others, including cost minimization and improving energy efficiency;
  • Utilize all best practices in data center design and redundancy to maximize availability;
  • Dedicate ample resources to bring their data center up and running in case of an unplanned outage;
  • Have complete support from senior management on efforts to prevent and manage unplanned outages;
  • Regularly test generators and switchgear to ensure emergency power in case a utility outage does occur;
  • Regularly test or monitor UPS batteries; and
  • Implement data center infrastructure management (DCIM).


“No single technology or best practice can completely remove the risk of downtime,” said Larry Ponemon, founder and chairman of the Ponemon Institute. “However, what this report shows us is that by committing the necessary investment in infrastructure technology and resources and taking a number of actions, organizations can dramatically reduce the frequency and duration of unplanned data center outages that can potentially cost data centers thousands of dollars per minute.”
 

 


| More

You May Also Like

Comments

Add Your Comment

You are solely responsible for the content of your comments. We reserve the right to remove comments that are considered profane, vulgar, obscene, factually inaccurate, off-topic, or considered a personal attack.

In Our Library

White Papers | Exclusives Reports | Webinar Archives | Best Practices and Case Studies
Cybersecurity in an "All-IP World" Are You Prepared?
In a recent survey conducted by Public CIO, over 125 respondents shared how they protect their environments from cyber threats and the challenges they see in an all-IP world. Read how your cybersecurity strategies and attitudes compare with your peers.
Maintain Your IT Budget with Consistent Compliance Practices
Between the demands of meeting federal IT compliance mandates, increasing cybersecurity threats, and ever-shrinking budgets, it’s not uncommon for routine maintenance tasks to slip among state and local government IT departments. If it’s been months, or even only days, since you have maintained your systems, your agency may not be prepared for a compliance audit—and that could have severe financial consequences. Regardless of your mission, consistent systems keep your data secure, your age
Best Practice Guide for Cloud and As-A-Service Procurements
While technology service options for government continue to evolve, procurement processes and policies have remained firmly rooted in practices that are no longer effective. This guide, built upon the collaborative work of state and local government and industry executives, outlines and explains the changes needed for more flexible and agile procurement processes.
View All

Featured Papers