Embarrassing Bugs (Part 3)

7/5/2012 3:19:42 PM

Mars Climate Orbiter - $327.6 million

Mars Climate Orbiter – went 310 million miles to buy the farm

Launched in 1998, this small spacecraft was intended to circle Mars, collecting data about what passes for weather on that planet. It arrived there some 286 days into its mission and was never heard from again. Was it a victim of the mysterious powers of the red planet? No, its loss was entirely down to the Americans’ insistence on using imperial measurements, and not metric ones. The software that modelled the orbiter’s thruster firings had been programmed to report in pound-force seconds (lbf·s) instead of the specified metric units of newton-seconds (N·s), resulting in the craft being significantly closer to Mars than was healthy. Instead of entering a stable orbit, it ploughed straight into the upper atmosphere and was promptly vaporised after its 310 million mile journey. The cost of developing the probe, an associated lander, and running the mission was $327.6 million, with no appreciable scientific return on that investment.
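To see why a bare number is so dangerous here, the following is a minimal sketch of one common defence: wrapping the value in a type that always stores newton-seconds, so a figure produced in pound-force seconds has to be converted explicitly. The `Impulse` class and its method names are hypothetical illustrations, not anything taken from the actual mission software.

```python
from dataclasses import dataclass

# 1 lbf is defined as 4.4482216152605 N, so 1 lbf·s = 4.4482216152605 N·s.
LBF_S_TO_N_S = 4.4482216152605


@dataclass(frozen=True)
class Impulse:
    """Hypothetical wrapper that always stores impulse in newton-seconds."""

    newton_seconds: float

    @classmethod
    def from_n_s(cls, value: float) -> "Impulse":
        # Value is already metric, so store it unchanged.
        return cls(value)

    @classmethod
    def from_lbf_s(cls, value: float) -> "Impulse":
        # Convert imperial input to metric before storing it.
        return cls(value * LBF_S_TO_N_S)


if __name__ == "__main__":
    reported = 100.0  # a figure that was really produced in lbf·s

    misread = Impulse.from_n_s(reported)    # treated as if it were N·s
    correct = Impulse.from_lbf_s(reported)  # converted properly

    # The misread value understates the real impulse by a factor of about 4.45.
    print(correct.newton_seconds / misread.newton_seconds)
```

The point of the design is that there is no way to build an `Impulse` without saying which unit the number came in, which is exactly the information that went missing between the two pieces of software.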

BlackBerry Blackout – Incalculable Cost

Research In Motion (RIM) had carved itself a very enviable slice of the smartphone market, especially among business users who liked the immediacy of its ‘push’ communications system.

That was until its entire subscriber network went down for four days due to a software issue in 2011. What became very apparent during the outage was that RIM didn’t understand what the problem was, or how to fix it, undermining its credibility severely.

The outage started in the BlackBerry datacenter in Slough, but soon spread to the Middle East and Africa, and eventually made its way to Latin America, the US and Canada. In the end, three-quarters of RIM’s 70 million users couldn’t communicate using the BlackBerry messaging service or receive e-mail.

After days of complete silence, RIM eventually offered the excuse of a ‘core switch failure’, which did little to calm the anger of its customers. The company has never really elaborated on that, and in many respects this was truly a communications failure that wasn’t hardware or software based, but one purely between the company and its paying customers. In business, credibility is everything, and this outage dented RIM in a way that can’t easily be fixed.

By way of recompense, RIM offered all those affected $100 of free applications on their phones, but the damage the outage did to a company that was already struggling dwarfs this expenditure.

Mercedes M-Class – 187,000 Cars Recalled

Cruise control isn’t a feature that many in this country use, but in the United States it’s a major selling point for anyone who travels long distances by car. As such, Mercedes-Benz put it in its 2000-2004 M-Class SUV, but with a slight software twist.

It’s the general norm with these systems that touching the brake or accelerator disengages the cruise control, but due to a bug on the M-Class this didn’t happen.

In an emergency, drivers discovered that while they were attempting to stop the car, the cruise control system was attempting to maintain the same speed – a conflict that could only be resolved by excessive braking force.

Realising that it would be responsible for any injuries or fatalities that this fault contributed to, Mercedes-Benz issued a recall that encompassed 137,000 vehicles in the USA and another 50,000 in Germany, the exact cost of which the company has never revealed.
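The expected behaviour is simple enough to sketch in a few lines. What follows is a toy model, not Mercedes-Benz code: the `CruiseControl` class, its method names and the throttle gain are all hypothetical, and only the brake pedal case is modelled. It shows the rule the M-Class broke: once the brake is touched, the system must stop asking for throttle.

```python
from dataclasses import dataclass


@dataclass
class CruiseControl:
    """Toy cruise control; every name and constant here is illustrative."""

    engaged: bool = False
    target_kph: float = 0.0

    def engage(self, current_kph: float) -> None:
        # Hold whatever speed the car is doing when the driver engages.
        self.engaged = True
        self.target_kph = current_kph

    def on_brake_pedal(self) -> None:
        # The step that matters: any brake input switches the system off.
        self.engaged = False

    def throttle_demand(self, current_kph: float) -> float:
        # Returns a throttle request between 0.0 (none) and 1.0 (full).
        if not self.engaged:
            return 0.0
        error = self.target_kph - current_kph
        return max(0.0, min(1.0, 0.1 * error))


if __name__ == "__main__":
    cruise = CruiseControl()
    cruise.engage(current_kph=110.0)

    # Car has slowed to 100 km/h, so the system asks for throttle.
    print(cruise.throttle_demand(current_kph=100.0) > 0.0)

    # Driver brakes: the system should drop out and stop fighting the brakes.
    cruise.on_brake_pedal()
    print(cruise.throttle_demand(current_kph=100.0))
```

If `on_brake_pedal` is never called, or does nothing, the last call would still return a positive throttle request while the driver is braking, which is the conflict described above.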

Parole Software Glitch – 450 Dangerous Criminals Freed

450 dangerous criminals freed because of errors in the parole software

As with any modern society, California doesn’t have infinite jail space to house inmates, so it’s keen to return to society those it feels are the least threat as quickly as possible. That was the logic behind a parole system it developed, which would help it identify who to release and when. Unfortunately, it had some pretty serious bugs and set about releasing all manner of nefarious people who would normally be having a prolonged stay at the big house.

Included in its recommendations, which the state then followed, were 450 criminals classed as violent and dangerous, and a further 1,000 who were incarcerated for drug-related and other less serious crimes.

These events came off the back of a Supreme Court instruction to California’s prisons to reduce their inmate numbers by 33,000 over two years. The system wasn’t supposed to include violent offenders, those with gang associations or sex criminals, but it decided they deserved a break too.
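The exclusion rule itself is easy to state in code. The sketch below is purely illustrative: the `Inmate` record, its field names and the `eligible_for_release` function are hypothetical and bear no relation to how California's system was actually built. It simply shows the check that should have kept the three excluded groups off the release list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Inmate:
    """Hypothetical record; field names are invented for illustration."""

    name: str
    violent_record: bool = False
    gang_association: bool = False
    sex_offence: bool = False


def eligible_for_release(inmate: Inmate) -> bool:
    # Any one of the three exclusions is enough to rule an inmate out.
    excluded = (
        inmate.violent_record
        or inmate.gang_association
        or inmate.sex_offence
    )
    return not excluded


if __name__ == "__main__":
    inmates = [
        Inmate("A"),
        Inmate("B", violent_record=True),
        Inmate("C", gang_association=True),
    ]
    release_list = [i.name for i in inmates if eligible_for_release(i)]
    print(release_list)
```

A rule this short is also easy to test against known records before anyone is released, which is the kind of safeguard the story suggests was missing.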

AT&T – $60m In Lost Calls

In early January 1990, 60,000 Americans tried to make long-distance calls using AT&T, only to find it wasn’t something the provider could do. At the heart of the problem were 114 long-distance telecom switches, all of which refused to route calls.

The switches were designed to detect a fault, put up a ‘do not disturb’ sign, hand their calls to the next switch and reset. A code change to speed up the reset process altered the exact order of events, sending two messages rather than one. The second message hit the neighbouring switch while it was still busy dealing with the first, making it assume a fault of its own and starting the cycle again. As a result, all 114 switches were caught in a cascade failure, as each resetting switch knocked over the next.
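The cascade described above can be mimicked with a very small simulation. This is a deliberately crude model under stated assumptions, not a reconstruction of AT&T's software: the switches sit in a ring, each recovering switch notifies only the next one, and the hypothetical bug is that a second back-to-back message is treated as a fault. The function name `simulate` and its parameters are invented for the example.

```python
from collections import deque


def simulate(num_switches: int, messages_on_recovery: int,
             max_resets: int = 1000) -> int:
    """Count the resets that follow a single initial fault.

    Assumed model: a recovering switch sends `messages_on_recovery`
    messages to the next switch in a ring. One message is handled
    cleanly; more than one is misread as a fault, so the receiver
    resets too. `max_resets` caps a cascade that would never stop.
    """
    resets = 0
    pending = deque([0])  # switch 0 suffers the original fault

    while pending and resets < max_resets:
        switch = pending.popleft()
        resets += 1
        neighbour = (switch + 1) % num_switches

        # The modelled bug: the extra message knocks over the neighbour.
        if messages_on_recovery > 1:
            pending.append(neighbour)

    return resets


if __name__ == "__main__":
    # Original behaviour: one message, one reset, fault contained.
    print(simulate(num_switches=114, messages_on_recovery=1))

    # After the 'speed-up': resets keep circulating until the cap is hit.
    print(simulate(num_switches=114, messages_on_recovery=2))
```

With one message the fault stays local; with two, the reset count runs straight up to the cap, because by the time the wave has gone round the ring the first switch is ready to be knocked over again.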

The cost on the day was $60m in lost revenue, plus the annoyance of its customers. AT&T then took a further hit by offering 33% off long-distance calls on Valentine’s Day that year in an attempt to make amends.

Final Thoughts

Software bugs are all around us, messing up cash dispensers, bricking phones and generally making life more complicated than it already is. But as long as humans are involved in coding systems, those systems will be flawed, just like us.

Thankfully, most of the programming mistakes we make are an inconvenience, rather than commercially damaging or life-threatening. When they are that serious, the bigger mistake is usually the failure to spot a single point of failure, rather than the error that’s ultimately exposed as the culprit.
