Amazon Cloud Service Outage Cripples Multiple Web Sites
Latency and connectivity issues affecting Amazon Web Services' Elastic Compute Cloud (EC2) and Relational Database Service (RDS) are thought to be the cause of many Web site outages.
The trouble occurred at its northern Virginia datacenter early Thursday morning, and Amazon continued to report issues throughout the business day, according to the company's Service Health Dashboard.
Customers such as Foursquare, Reddit, Quora and Hootsuite were among those affected by the disruptions, according to published reports. Three of the four sites remained down as of 6:00 p.m. Eastern Daylight Time Thursday.
Only Reddit attributed its service disruption to Amazon's outage. According to a notice on Reddit's site: "Reddit is in 'emergency read-only mode' right now because Amazon is experiencing a degradation. They are working on it but we are still waiting for them to get to our volumes. You won't be able to log in. We're sorry and will fix the site as soon as we can."
Amazon attributed the problem to a single availability zone in the Virginia datacenter. According to a status message posted at 4:48 Eastern time:
"All other Availability Zones are operating normally. Customers with snapshots of their affected volumes can re-launch their volumes and instances in another zone. We recommend customers do not target a specific Availability Zone when launching instances. We have updated our service to avoid placing any instances in the impaired zone for untargeted requests."
Service at Amazon's northern California datacenter and its facilities in Europe and the Asia Pacific appeared to be running normally, according to the dashboard. Experts say customers can avoid the impact of such problems by deploying across multiple availability zones or, even better, multiple cloud providers.
"If your business relies on a Web site to be up, why do you allow a failure in a single availability zone to shut down your business?" wrote Scott Sanchez, security and privacy officer at ScaleUp Cloud, in a blog post. "There are so many tools out there at this point to simplify deployment, scaling and resiliency across multiple availability zones or even across multiple cloud providers – frankly, you have no excuse."
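To make the multiple-zone advice concrete, here is a minimal sketch of the idea in Python. It is not Amazon's placement logic or any provider's API: the function name `spread_across_zones` and the zone names are hypothetical, chosen only for illustration. It simply spreads a number of instances evenly across whichever zones are not marked impaired, so that losing one zone takes down only a share of capacity.

```python
from collections import Counter
from itertools import cycle


def spread_across_zones(instance_count, zones, impaired=()):
    """Assign instances round-robin to every zone not marked impaired.

    Hypothetical helper for illustration; it returns a list with one
    zone name per instance and makes no calls to any cloud service.
    """
    impaired_set = set(impaired)
    healthy = [zone for zone in zones if zone not in impaired_set]
    if not healthy:
        raise ValueError("no healthy availability zone to place instances in")
    zone_cycle = cycle(healthy)
    return [next(zone_cycle) for _ in range(instance_count)]


# Hypothetical zone names, used only for illustration.
ZONES = ["zone-a", "zone-b", "zone-c"]

# Six instances, with one zone treated as impaired.
placement = spread_across_zones(6, ZONES, impaired=["zone-a"])
print(Counter(placement))  # Counter({'zone-b': 3, 'zone-c': 3})
```

With all three zones healthy, the same call would place two instances in each, so an outage in any single zone would leave two-thirds of the instances running.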
Jeffrey Schwartz is editor of Redmond magazine and also covers cloud computing for Virtualization Review's Cloud Report. In addition, he writes the Channeling the Cloud column for Redmond Channel Partner. Follow him on Twitter @JeffreySchwartz.