The recent outage in the Amazon Web Services (AWS) US East region has taken on
dramatic monikers such as "cloudgate" and "cloudburst," and has even
triggered a creative commiserative competition (http://lat.ms/g7aTDC). Most
of us, though, are not surprised that an outage occurred, but remain a bit
puzzled by how long it has taken the engineers to right the
situation. We look forward to post-mortem reports from AWS that will
hopefully help us understand what actually happened. Was there an elusive
heisenbug that sprinkled some corrosive pixie dust on the block storage
devices? Or was it simply the case of someone making like an air traffic
controller and falling asleep at the switch? In any case, full transparency
should be the modus operandi here.
Two main themes, though, quickly emerge from this episode.
First, there are a heck of a lot of enterprise...