Mobile WorkHorse

About this Blog:

Al Sacco writes about (and drools over) anything and everything mobile or wireless as it applies to the global workforce--with a focus on BlackBerry smartphones

Al Sacco

Details on Yesterday’s EMEA BlackBerry Outage from Zenprise, BoxTone

to Mobile/Wireless |

On Tuesday, April 28, users of Research In Motion’s (RIM) BlackBerry service throughout Europe, the Middle East and Asia suffered, to varying degrees, a widespread data outage. Service stayed down for a couple hours for most affected parties, longer for others. The cause of the downtime has since been attributed to a RIM Server Router Protocol (SRP) outage, which occurred at roughly 1:35 PM (GMT) on Tuesday, April 28.

image BoxTone Dashboard Showing Two BES with SRP Failure
BoxTone Dashboard Showing Two BES with SRP Failure

SRP is RIM's proprietary network protocol employed to transfer data between the company's BlackBerry infrastructure and organizations’ BlackBerry Enterprise Servers.

BlackBerry outages aren’t exactly uncommon nowadays, nor was Tuesday’s service disruption particularly serious since the problem was resolved relatively quickly for most of those impacted.

Shortly after news of the outage, I spoke with both Zenprise and BoxTone, which offer competing BlackBerry infrastructure management and support products. I’ve covered both Zenprise and BoxTone frequently on CIO.com. (Read, “Eyes on Zenprise: How the Red Sox Keeps BlackBerrys in the Game,” and “myBoxTone Expert: The On-Device IT Help Desk” for more.)

Ahmed Datoo, VP of marketing for Zenprise, says the company’s U.K. customers were first alerted to the SRP disconnect at 13:32 (1:32 GMT). From Datoo:

“One of our customers in the UK got an alert from Zenprise...that the RIM SRP network went down. That network looked to be back up and running around 15:00. One of our U.S. customers (who) supports users in Europe received an alert from Zenprise (at) roughly the same time, but service for them was restored at 14:13."

“One of the automated diagnostics that our product runs before triggering an alert is to telnet to port 3101 to test RIM connectivity (that’s the port the BES server talks to the RIM network on). It looks like one of the advertised IP addresses of the RIM network went down, and the traffic was rerouted to the secondary IP address. The propagation of the DNS changes may have taken some time, which is why some customers saw service restore faster than others.”

image of Zenprise Dashboard Showing BES with SRP Failure
Zenprise Dashboard Showing BES with SRP Failure

Mitch Berk, director of product management with BoxTone, shared information that mostly coincides with Zenprise’s findings.

BoxTone customers in the U.K. first received alerts and specifics via e-mail updates at 1:35 PM (GMT), along with possible resolution instructions. The following is an example of what one of those BoxTone alerts looked like.

Alert : BES XXX : Critical -> Unavailable
Explanation = The SRP connection to the BES infrastructure has been lost due to network conditions such as packet loss, latency, or other symptoms of poor network conditions. The BES will automatically attempt to reconnect.
Possible Action = If this event is observed repeatedly or for a long duration, there may be network access issues from the BES to the RIM SRP Host (or the RIM NOC itself may be experiencing issues). The following items can be tested to check the connection with RIM:
1: Ping the SRP host via the windows command prompt (Ping Hostname)
2: Use the bbsrptest.exe utility (See RIM KB KB00804).
3: Telnet to the SRP host on port 3101 to verify connectivity to the SRP server.
4: Verify you aren't experiencing network outages or firewall configuration changes.

That specific BoxTone customer saw the SRP connection restored in roughly 2.5 hours, according to Berk.

Berk also

Continue Reading

Print

Browse CIO Blogs

See all CIO Blogs »

Cloud computing has emerged as one of the most significant game changers to hit the technology landscape in the past 20 years. With this massive expansion of the cloud, the perception of the IT organization is shifting from a utility player to a change agent. This eBook breaks down five ways progressive organizations are using cloud-based IT Management solutions to help drive innovation and become more strategic, including: adding visibility and analytics, speeding up time-to-value, lowering costs, improving prioritization, and providing a blueprint for future cloud deployments.
Read the white paper to see how IBM helped Citigroup deliver new services and enhancements to their 200 million customers faster.
There are 3 ways to modernize legacy applications: rewrite completely, acquire packaged solutions or migrate existing code. This paper explains why it's best to migrate and how IBM® Rational® software can help.
Accommodating specific lines of business can result in a hybrid ecosystem of applications and servers. The resulting complexity of this architecture makes for an environment that is costly to maintain and difficult to change when addressing new challenges.
This whitepaper will help you to define a mobile device passcode policy. Security managers must attempt to reconcile two opposing goals. They must: 1) create a passcode policy that is strong enough to protect the device if it is lost or stolen, while: 2) not annoying users with needless length or complexity.
This whitepaper, authored by The Radicati Group, looks at the key reasons organizations should consider moving to a cloud-based archiving solution. Email archiving solutions enable organizations to store, monitor, and collect electronic data exchanged by their users to comply with internal policies and regulations.
ATERNITY will showcase a 30-minute demo on how Fortune 500 companies are leveraging its award-winning FPI Platform to deliver a user-centric approach to Proactive IT Management.
For businesses to move forward and tap into the ever-expanding universe of Internet users and network-enabled devices, it's critical to learn how to make the transition to IPv6. Learn the critical steps your organization must take to make a seamless transition-and keep your business world connected.
Learn how IT teams can protect against spear phishing tactics. Harry Sverdlove, chief technology officer of Bit9 offers a frank discussion about spear phishing - the most common technique used in today's advanced attacks.
Learn how to build a solid business case for your migration to Red Hat Enterprise Linux so you can run leaner, innovate faster, be more flexible and own the New Now.
Social media isn't about you; it's about everything around you. As you consider how your customers want to communicate with you, social media is something that can't be ignored. But what should your strategy be? Is social media "just another channel?" What kind of a plan makes sense for your contact center and for your customers? Join our experts as they share their insight and research results.
Hardware tokens were a popular method of strong authentication in past years but the cumbersome provisioning and distribution tasks, high support requirements and replacement costs have limited their growth. The additional log-in steps that hardware tokens require and the resulting user frustrations have limited adoption and make them impractical for larger scale partner and customer applications.

Newsletter Sign-Up »

Receive the latest news test, reviews and trends on your favorite technology topics

Choose a newsletter
  1. View all Newsletters | Privacy Policy