<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-5018869173202937575</id><updated>2012-02-16T09:30:58.259-08:00</updated><category term='crash'/><category term='tesco'/><category term='retail'/><category term='Oregon'/><category term='corrosion'/><category term='MIT'/><category term='florida'/><category term='pos terminals'/><category term='england'/><category term='blackberry'/><category term='paypal'/><category term='servers'/><category term='411'/><category term='bahrain'/><category term='uk'/><category term='unemployment'/><category term='outage'/><category term='power'/><category term='debit card'/><category term='batelco'/><category term='OED'/><category term='NOC'/><category term='fail'/><category term='bell'/><category term='cellular'/><category term='sexism'/><category term='gmail'/><category term='hardware'/><category term='google'/><category term='key west'/><category term='RIM'/><title type='text'>Pinkston's Law</title><subtitle type='html'>"Most outages begin as upgrades"</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://pinkstonslaw.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://pinkstonslaw.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Steve Pinkston</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://2.bp.blogspot.com/-YsS8EKgykFI/TkhbfyPEnHI/AAAAAAAAAF8/lM3mYCZ1Hvo/s220/SP2009.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>8</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-5018869173202937575.post-4403123804963434834</id><published>2009-12-25T16:18:00.000-08:00</published><updated>2009-12-26T18:18:31.542-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='OED'/><category scheme='http://www.blogger.com/atom/ns#' term='unemployment'/><category scheme='http://www.blogger.com/atom/ns#' term='outage'/><category scheme='http://www.blogger.com/atom/ns#' term='Oregon'/><category scheme='http://www.blogger.com/atom/ns#' term='fail'/><category scheme='http://www.blogger.com/atom/ns#' term='crash'/><category scheme='http://www.blogger.com/atom/ns#' term='servers'/><title type='text'>Oregon Employment Division Servers and Phones Crash: 10/04/2009</title><content type='html'>&lt;ul&gt;&lt;li&gt;                                 &lt;p&gt;&lt;b&gt;Length of outage: Officially, 10 hours. In                                   reality, 24+ hours&lt;/b&gt;&lt;/p&gt;                                &lt;/li&gt;&lt;li&gt;&lt;b&gt;Number of people affected: 165,000 Unemployment                                   recipients, plus OED staffers. &lt;/b&gt;&lt;/li&gt;&lt;/ul&gt;                                                                  &lt;p&gt;I happened to be one of those affected by                                    this outage, because at the time, I was drawing                                    unemployment!&lt;/p&gt;                                                                  &lt;p&gt;The original article in the Oregonian the                                   following Monday spun the story to make it sound                                   as if it was the extra load of new people                                   applying for benefits that crashed the system.                                   Even in this later, edited version, you don't                                   find the truth until well down the page:&lt;/p&gt;                                                                   &lt;p&gt;Original news story &lt;a href="http://www.oregonlive.com/education/index.ssf/2009/10/employment_department_phones_a.html"&gt;&lt;b&gt;HERE&lt;/b&gt;&lt;/a&gt;&lt;b&gt;.&lt;/b&gt;&lt;/p&gt;                                                                  &lt;p&gt;Here's where the truth comes out:&lt;/p&gt;                                                                  &lt;blockquote&gt;Problems started Sunday when a computer server crashed while state workers were                                  doing maintenance on the state's computer network. The 60 percent of unemployed who                                  usually file online for their weekly checks turned to the telephone to file their claims on the                                  state's interactive voice response system. At the same time, the group looking for emergency                                  extensions also were swamping the phone lines.&lt;/blockquote&gt;                                                                   &lt;p&gt;So, they don't &lt;i&gt; explicitly&lt;/i&gt; say it was an upgrade, but the system was down when I tried to use it early on Sunday morning, indicating that they had been working on the system during the overnight shift. This smells suspiciously like an upgrade was being applied. &lt;b&gt; Pinkston's Law!&lt;/b&gt;&lt;/p&gt;                                                                   &lt;p&gt;Also, it is an interesting example of the cascading failure effect; when people could not file online, they moved to the phones to file on Monday (so much for the 10-hour outage -- the system was still down Monday morning). The phone system is not sized to handle all of the traffic that the online system handles, so it crashed, too.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5018869173202937575-4403123804963434834?l=pinkstonslaw.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://pinkstonslaw.blogspot.com/feeds/4403123804963434834/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/oregon-employment-division-servers-and.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/4403123804963434834'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/4403123804963434834'/><link rel='alternate' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/oregon-employment-division-servers-and.html' title='Oregon Employment Division Servers and Phones Crash: 10/04/2009'/><author><name>Steve Pinkston</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://2.bp.blogspot.com/-YsS8EKgykFI/TkhbfyPEnHI/AAAAAAAAAF8/lM3mYCZ1Hvo/s220/SP2009.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5018869173202937575.post-7966044985317047914</id><published>2009-12-25T16:17:00.000-08:00</published><updated>2009-12-26T18:29:48.935-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='outage'/><category scheme='http://www.blogger.com/atom/ns#' term='fail'/><category scheme='http://www.blogger.com/atom/ns#' term='gmail'/><category scheme='http://www.blogger.com/atom/ns#' term='google'/><category scheme='http://www.blogger.com/atom/ns#' term='crash'/><title type='text'>Google's Gmail Outage Caused by Upgrade Error: 09/01/2009</title><content type='html'>&lt;ul&gt;&lt;li&gt;                                 &lt;br /&gt;&lt;b&gt;Length of outage: Two hours&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Number of people affected: Unknown - certainly millions&lt;/b&gt;&lt;/li&gt;&lt;/ul&gt;Gmail is a very popular free webmail services                                  that many people use daily. What is less                                  well-known is that Gmail is also used                                  extensively in business as a paid,                                  enterprise-grade services.&lt;br /&gt;So, while folks who use the free personal                                   email side of Gmail are annoyed when it goes                                   down, business users are -- understandably --                                   furious.&lt;br /&gt;News story:&lt;br /&gt;&lt;a href="http://www.eweekeurope.co.uk/news/google-s-gmail-outage-caused-by-upgrade-error-1738"&gt;http://www.eweekeurope.co.uk/news/google-s-gmail-outage-caused-by-upgrade-error-1738&lt;/a&gt;&lt;br /&gt;While Google does have the admirable mission                          statement, "Don't be evil," they are sometimes                          quite tight-lipped about specific causes of outages.                          This time they made it clear, in a statement by Ben                          Treynor, "VP Engineering and Site Reliability Czar":&lt;br /&gt;&lt;a href="http://gmailblog.blogspot.com/2009/09/more-on-todays-gmail-issue.html"&gt;http://gmailblog.blogspot.com/2009/09/more-on-todays-gmail-issue.html &lt;/a&gt;&lt;br /&gt;&lt;blockquote&gt;Here's what happened: This morning (Pacific Time) we took a small  fraction of Gmail's servers offline to perform routine upgrades. This  isn't in itself a problem — we do this all the time, and Gmail's web  interface runs in many locations and just sends traffic to other  locations when one is offline.&lt;br /&gt;&lt;br /&gt;However, as we now know, we had  slightly underestimated the load which some recent changes (ironically,  some designed to improve service availability) placed on the request  routers — servers which direct web queries to the appropriate Gmail  server for response. At about 12:30 pm Pacific a few of the request  routers became overloaded and in effect told the rest of the system  "stop sending us traffic, we're too slow!". This transferred the load  onto the remaining request routers, causing a few more of them to also  become overloaded, and within minutes nearly all of the request routers  were overloaded. As a result, people couldn't access Gmail via the web  interface because their requests couldn't be routed to a Gmail server.  IMAP/POP access and mail processing continued to work normally because  these requests don't use the same routers.&lt;br /&gt;&lt;br /&gt;The Gmail engineering  team was alerted to the failures within seconds (we take monitoring  very seriously). After establishing that the core problem was  insufficient available capacity, the team brought a LOT of additional  request routers online (flexible capacity is one of the advantages of  Google's architecture), distributed the traffic across the request  routers, and the Gmail web interface came back online.&lt;br /&gt;&lt;/blockquote&gt;I have to commend Google -- and Mr. Treynor                                  in particular -- for being forthright about the                                  outage, and providing a textbook case of                                  Pinkston's Law. This case also illustrates the                                  tendency for failures in one part of a network                                  to cascade to other parts, often in an                                  unexpected fashion.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5018869173202937575-7966044985317047914?l=pinkstonslaw.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://pinkstonslaw.blogspot.com/feeds/7966044985317047914/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/googles-gmail-outage-caused-by-upgrade.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/7966044985317047914'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/7966044985317047914'/><link rel='alternate' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/googles-gmail-outage-caused-by-upgrade.html' title='Google&apos;s Gmail Outage Caused by Upgrade Error: 09/01/2009'/><author><name>Steve Pinkston</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://2.bp.blogspot.com/-YsS8EKgykFI/TkhbfyPEnHI/AAAAAAAAAF8/lM3mYCZ1Hvo/s220/SP2009.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5018869173202937575.post-911835210163409029</id><published>2009-12-25T14:32:00.000-08:00</published><updated>2009-12-26T18:30:39.179-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='tesco'/><category scheme='http://www.blogger.com/atom/ns#' term='uk'/><category scheme='http://www.blogger.com/atom/ns#' term='retail'/><category scheme='http://www.blogger.com/atom/ns#' term='outage'/><category scheme='http://www.blogger.com/atom/ns#' term='fail'/><category scheme='http://www.blogger.com/atom/ns#' term='england'/><category scheme='http://www.blogger.com/atom/ns#' term='crash'/><category scheme='http://www.blogger.com/atom/ns#' term='pos terminals'/><title type='text'>Tesco IT upgrade causes till outage: May 11, 2009</title><content type='html'>&lt;ul&gt;&lt;li&gt;                                 &lt;br /&gt;&lt;b&gt;Length of outage: &lt;/b&gt;4-24 hours&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Number of people affected: &lt;/b&gt;100                                  retail stores forced to close&lt;/li&gt;&lt;/ul&gt;&lt;a href="http://www.tesco.com/"&gt;Tesco&lt;/a&gt; is                                  a major grocery and general merchandise retailer                                  in the UK. North American readers might compare                                  it to Wal-mart or Costco. Tesco launched a                                  big  "loyalty scheme" promotion                                  in UK  newspapers to its millions of                                  Clubcard holders, which required an upgrade of                                  their software, which caused their tills (cash                                  registers) to malfunction just as the stores                                  opened at 8:00 AM.&lt;br /&gt;Original news story is &lt;a href="http://www.computerworlduk.com/management/it-business/it-department/news/index.cfm?newsid=14708"&gt;&lt;b&gt;HERE&lt;/b&gt;&lt;/a&gt;.&lt;br /&gt;The official statement from Tesco was terse                                  but candid:&lt;br /&gt;&lt;blockquote&gt;"A number of stores were                                    affected by a routine IT upgrade this morning                                    at various locations in the country,” said a                                    Tesco spokesperson. &lt;br /&gt;&lt;/blockquote&gt;She might just as well have said,                                  "Blimey! We were struck down by Pinkston's                                  Law!"&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5018869173202937575-911835210163409029?l=pinkstonslaw.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://pinkstonslaw.blogspot.com/feeds/911835210163409029/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/tesco-it-upgrade-causes-till-outage-may.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/911835210163409029'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/911835210163409029'/><link rel='alternate' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/tesco-it-upgrade-causes-till-outage-may.html' title='Tesco IT upgrade causes till outage: May 11, 2009'/><author><name>Steve Pinkston</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://2.bp.blogspot.com/-YsS8EKgykFI/TkhbfyPEnHI/AAAAAAAAAF8/lM3mYCZ1Hvo/s220/SP2009.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5018869173202937575.post-9035814185003995251</id><published>2009-12-25T14:31:00.000-08:00</published><updated>2009-12-26T18:31:39.226-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='key west'/><category scheme='http://www.blogger.com/atom/ns#' term='power'/><category scheme='http://www.blogger.com/atom/ns#' term='florida'/><category scheme='http://www.blogger.com/atom/ns#' term='corrosion'/><category scheme='http://www.blogger.com/atom/ns#' term='hardware'/><title type='text'>Florida Keys Electric Cooperative  Power Outage: Oct. 11, 2004</title><content type='html'>&lt;ul&gt;&lt;li&gt;                                 &lt;br /&gt;&lt;b&gt;Length of outage: &lt;/b&gt;Approx. 1 hour&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Number of people affected: &lt;/b&gt;Unknown                                   -- most residents of Florida Keys&lt;/li&gt;&lt;/ul&gt;Here is a relatively rare instance of a &lt;i&gt;hardware&lt;/i&gt;                                  upgrade causing an outage. The unique geography                                  and climate of the Florida Keys was clearly a                                  factor.&lt;br /&gt;Read the original article &lt;b&gt;&lt;a href="http:///"&gt;HERE&lt;/a&gt;&lt;/b&gt;.&lt;br /&gt;Here, I think it best to quote directly from                                  the article to give you a sense of what                                  happened:&lt;br /&gt;&lt;blockquote&gt;One strand of a corroded shield wire                                    unraveled during its removal from service                                    today, causing a power outage from Islamorada                                    to Key West. Florida Keys Electric Cooperative                                    was pulling the wire for replacement when one                                    of its seven twisted strands failed.&lt;br /&gt;&lt;br /&gt;The broken strand swung into the energized                                    transmission lines below it, causing a short                                    in the transmission line. The shorted line                                    caused a power outage beginning at 12:40 p.m.                                    The outage began south of Snake Creek Bridge                                    at mile marker 86.&lt;br /&gt;&lt;br /&gt;The strand of shield wire failed over water                                    while being pulled along Long Key Channel,                                    complicating correction of the problem.       &lt;br /&gt;&lt;/blockquote&gt;As a little background, the "shield                                  wire" is the un-insulated wire that runs                                  from pole to pole above the wires that carry the                                  actual current. It is intended to reduce service                                  interruptions and equipment damage by                                  intercepting lightning strikes. In a salt-air                                  environment such as one finds along coastlines,                                  these conductors tend to corrode fairly quickly.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5018869173202937575-9035814185003995251?l=pinkstonslaw.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://pinkstonslaw.blogspot.com/feeds/9035814185003995251/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/florida-keys-electric-cooperative-power.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/9035814185003995251'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/9035814185003995251'/><link rel='alternate' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/florida-keys-electric-cooperative-power.html' title='Florida Keys Electric Cooperative  Power Outage: Oct. 11, 2004'/><author><name>Steve Pinkston</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://2.bp.blogspot.com/-YsS8EKgykFI/TkhbfyPEnHI/AAAAAAAAAF8/lM3mYCZ1Hvo/s220/SP2009.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5018869173202937575.post-2222957891227940350</id><published>2009-12-25T14:29:00.000-08:00</published><updated>2009-12-26T18:32:28.491-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sexism'/><category scheme='http://www.blogger.com/atom/ns#' term='paypal'/><category scheme='http://www.blogger.com/atom/ns#' term='outage'/><category scheme='http://www.blogger.com/atom/ns#' term='fail'/><category scheme='http://www.blogger.com/atom/ns#' term='debit card'/><category scheme='http://www.blogger.com/atom/ns#' term='crash'/><title type='text'>PayPal Upgrade Causes Major Outage, Affects Debit-Card Users: Oct 8, 2004</title><content type='html'>&lt;ul&gt;&lt;li&gt;                                 &lt;br /&gt;&lt;b&gt;Length of outage: &lt;/b&gt;At least 4 days&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Number of people affected: &lt;/b&gt;Unknown,                                   but clearly many hundreds of thousands.&lt;/li&gt;&lt;/ul&gt;PayPal has become such a major part of our                                  lives for online commerce that we often think of                                  it as something that is just "always there                                  to use" like ATMs. But, of course, it runs                                  on a complex network of servers and other                                  equipment, and with those come upgrades.&lt;br /&gt;Read the original article &lt;b&gt;&lt;a href="http://www.auctionbytes.com/cab/abn/y04/m10/i12/s01"&gt;HERE&lt;/a&gt;&lt;/b&gt;.&lt;br /&gt;Here's the official word from PayPal (I find                                  it interesting that companies in this situation                                  invariably send out a &lt;i&gt;female&lt;/i&gt; staffer to                                  read the official statement to the press.                                  Perhaps they reason that it puts a sweeter face                                  on their &lt;strike&gt;weasel&lt;/strike&gt;                                  carefully-chosen words?) :&lt;br /&gt;&lt;blockquote&gt;PayPal spokesperson Amanda Pires said in                                    addition to the new home page, PayPal                                    "added some features on the backend"                                    on Friday that were the cause of the problem.                                   Pires said, "Everyone is working fast                                    and furiously to get it all fixed." The                                    problems are intermittent, she said, but                                    declined to describe their nature or reveal                                    the features that were added on Friday.      &lt;br /&gt;&lt;/blockquote&gt;Paypal is owned by eBay, and they now have                                   little in the way of competition to keep them on                                   their toes. As something of an                                   "insider" in one of my jobs, I                                   witnessed some PayPal outages and service                                   degradations that were never publicly                                   acknowledged, so I will not cover them here.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5018869173202937575-2222957891227940350?l=pinkstonslaw.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://pinkstonslaw.blogspot.com/feeds/2222957891227940350/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/paypal-upgrade-causes-major-outage.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/2222957891227940350'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/2222957891227940350'/><link rel='alternate' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/paypal-upgrade-causes-major-outage.html' title='PayPal Upgrade Causes Major Outage, Affects Debit-Card Users: Oct 8, 2004'/><author><name>Steve Pinkston</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://2.bp.blogspot.com/-YsS8EKgykFI/TkhbfyPEnHI/AAAAAAAAAF8/lM3mYCZ1Hvo/s220/SP2009.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5018869173202937575.post-1103558923637863752</id><published>2009-12-25T14:26:00.000-08:00</published><updated>2009-12-26T18:33:25.032-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='MIT'/><category scheme='http://www.blogger.com/atom/ns#' term='bell'/><category scheme='http://www.blogger.com/atom/ns#' term='sexism'/><category scheme='http://www.blogger.com/atom/ns#' term='outage'/><category scheme='http://www.blogger.com/atom/ns#' term='fail'/><category scheme='http://www.blogger.com/atom/ns#' term='crash'/><category scheme='http://www.blogger.com/atom/ns#' term='411'/><title type='text'>Newly Installed Software Causes Outages in MIT's 411 Directory Services: Feb, 1998</title><content type='html'>&lt;ul&gt;&lt;li&gt;                                 &lt;br /&gt;&lt;b&gt;Length of outage: &lt;/b&gt;Several                                   outages, up to four weeks&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Number of people affected: &lt;/b&gt;Unknown.                                   All MIT campus phone services affected&lt;/li&gt;&lt;/ul&gt;This older article chronicles the problems                                   MIT was having with Bell Atlantic's 411                                   (directory assistance) services in 1997-1998.                                   Apparently there had been a number of failures                                   leading up to the major one in February, 1998.&lt;br /&gt;Read the original article &lt;b&gt;&lt;a href="http://tech.mit.edu/V118/N24/bsoftware.24n.html"&gt;HERE&lt;/a&gt;&lt;/b&gt;.&lt;br /&gt;Here's a statement from MIT's point of view:&lt;br /&gt;&lt;blockquote&gt;"This was caused by a software                                    change. Since the new software did not                                    interface with ours, we had to reroute                                    traffic," said Valerie L. Hartt,                                    Supervisor of Operator Services in Information                                    Systems. &lt;br /&gt;&lt;/blockquote&gt;It seems that Bell Atlantic would                                  periodically perform upgrades on their own                                  equipment which would render it incompatible                                  with the calls they were receiving from MIT's                                  system.&lt;br /&gt;&lt;blockquote&gt;"Part of the problem with this was that Bell Atlantic never informed MIT's 5ESS service team that it would be performing this [upgrade] service ... Therefore, we could not inform the community, nor be available during the upgrade to perform our own testing." &lt;br /&gt;&lt;/blockquote&gt;For me, the funniest part of this outage is                                  that fact that both Bell Atlantic and MIT were                                  using &lt;i&gt;identical&lt;/i&gt; telephone switches: The &lt;a href="http://en.wikipedia.org/wiki/5ESS_switch"&gt;AT&amp;amp;T                                  5ESS&lt;/a&gt;, which is still in widespread use.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5018869173202937575-1103558923637863752?l=pinkstonslaw.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://pinkstonslaw.blogspot.com/feeds/1103558923637863752/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/newly-installed-software-causes-outages.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/1103558923637863752'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/1103558923637863752'/><link rel='alternate' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/newly-installed-software-causes-outages.html' title='Newly Installed Software Causes Outages in MIT&apos;s 411 Directory Services: Feb, 1998'/><author><name>Steve Pinkston</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://2.bp.blogspot.com/-YsS8EKgykFI/TkhbfyPEnHI/AAAAAAAAAF8/lM3mYCZ1Hvo/s220/SP2009.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5018869173202937575.post-5541219053916667955</id><published>2009-12-25T14:23:00.000-08:00</published><updated>2009-12-26T18:33:57.034-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='bahrain'/><category scheme='http://www.blogger.com/atom/ns#' term='cellular'/><category scheme='http://www.blogger.com/atom/ns#' term='outage'/><category scheme='http://www.blogger.com/atom/ns#' term='fail'/><category scheme='http://www.blogger.com/atom/ns#' term='crash'/><category scheme='http://www.blogger.com/atom/ns#' term='batelco'/><title type='text'>Batelco (Bahrain) Cellular Network Outage: May 19-20, 2007</title><content type='html'>&lt;ul&gt;&lt;li&gt;                                 &lt;br /&gt;&lt;b&gt;Length of outage: Unknown&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Number of people affected: Unknown, up                                   to 600,000 possible&lt;/b&gt;&lt;/li&gt;&lt;/ul&gt;Link to the original news story, which quotes                                  the &lt;i&gt;Gulf Daily News&lt;/i&gt;:&lt;br /&gt;&lt;a href="http://www.cellular-news.com/story/23887.php"&gt;http://www.cellular-news.com/story/23887.php&lt;/a&gt;&lt;br /&gt;The outage caused a bit of out&lt;i&gt;rage&lt;/i&gt;:&lt;br /&gt;&lt;blockquote&gt;An influential business source told the newspaper that "a company with nearly BD100 million net profit should have a back-up service because what happened affected the communications of thousands of mobile owners. This is not acceptable nowadays," he said&lt;br /&gt;&lt;/blockquote&gt;The outage was blamed on "migration to a New                              Generation Network (NGN)." I wonder how one                              says &lt;i&gt;Pinkston's Law&lt;/i&gt; in the local language...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5018869173202937575-5541219053916667955?l=pinkstonslaw.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://pinkstonslaw.blogspot.com/feeds/5541219053916667955/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/batelco-bahrain-cellular-network-outage.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/5541219053916667955'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/5541219053916667955'/><link rel='alternate' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/batelco-bahrain-cellular-network-outage.html' title='Batelco (Bahrain) Cellular Network Outage: May 19-20, 2007'/><author><name>Steve Pinkston</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://2.bp.blogspot.com/-YsS8EKgykFI/TkhbfyPEnHI/AAAAAAAAAF8/lM3mYCZ1Hvo/s220/SP2009.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5018869173202937575.post-7104011156486270738</id><published>2009-12-25T09:36:00.000-08:00</published><updated>2009-12-26T18:35:12.889-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='RIM'/><category scheme='http://www.blogger.com/atom/ns#' term='blackberry'/><category scheme='http://www.blogger.com/atom/ns#' term='outage'/><category scheme='http://www.blogger.com/atom/ns#' term='fail'/><category scheme='http://www.blogger.com/atom/ns#' term='crash'/><category scheme='http://www.blogger.com/atom/ns#' term='NOC'/><title type='text'>Blackberry E-mail Outage: 02/11/2008</title><content type='html'>&lt;ul&gt;&lt;li&gt;                                 &lt;br /&gt;&lt;b&gt;Length of outage: 3 hours&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Number of people affected: Unknown, North                                  American users of Blackberry's email &lt;/b&gt;&lt;/li&gt;&lt;/ul&gt;Original Story: &lt;a href="http://www.cnbc.com/id/23134603"&gt;http://www.cnbc.com/id/23134603&lt;/a&gt;&lt;br /&gt;This outage should probably count as at least                                  two examples of Pinkston's Law, based on this                                  quote:&lt;br /&gt;&lt;blockquote&gt;It was the second major outage for the service in less than a year. In April, a minor software upgrade crashed the system for all users. A smaller disruption in September also was caused by a software glitch. &lt;br /&gt;&lt;/blockquote&gt;I find it interesting that at least one                                   analyst zeroed in on the existence of a Network                                   Operations Center (NOC) as a contributing factor                                   in the outage:&lt;br /&gt;&lt;blockquote&gt;Any time you got a system that's got a NOC, a Network Operations Center, you have the potential for a single point of failure.  What's a bit surprising to me is that with all the work they've been doing over time ... that they haven't been able to have enough redundancy in the NOC so that there isn't a single point of failure. &lt;br /&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5018869173202937575-7104011156486270738?l=pinkstonslaw.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://pinkstonslaw.blogspot.com/feeds/7104011156486270738/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/blackberry-e-mail-outage-02112008.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/7104011156486270738'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5018869173202937575/posts/default/7104011156486270738'/><link rel='alternate' type='text/html' href='http://pinkstonslaw.blogspot.com/2009/12/blackberry-e-mail-outage-02112008.html' title='Blackberry E-mail Outage: 02/11/2008'/><author><name>Steve Pinkston</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://2.bp.blogspot.com/-YsS8EKgykFI/TkhbfyPEnHI/AAAAAAAAAF8/lM3mYCZ1Hvo/s220/SP2009.jpg'/></author><thr:total>0</thr:total></entry></feed>
