Anticipating Traffic Surges - Lessons Learned from ESPN Crash
September 13, 2016

Michelle McLean
ScaleArc

ESPN made headlines this past weekend – the bittersweet kind. The news was that ESPN's fantasy football app crashed on the first Sunday of the NFL season. Where's the "sweet" part? The crash likely signals huge user popularity.

We see these types of stories often during so-called "surge" events, like when Black Friday takes down a retailer. Why? Often, it's the database that's been swamped in the process.

The application-to-database connection is fragile, because applications tie directly into the database and the application code must match the database infrastructure. For example, if the database tier has multiple servers that can all respond to an inbound request, the application needs to know which type of server should receive each request. Routing requests this way can improve response time, but the work isn't trivial – a programmer may have to go through hundreds of thousands of lines of code to program how to handle reads vs. writes – and it can introduce errors.
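To make the reads-vs.-writes problem concrete, here is a minimal sketch of the routing decision an application would have to make for every query. The class name, the server names and the simple "first word of the SQL" rule are all assumptions for illustration; a real application would also have to account for transactions, stored procedures and replication lag.

```python
import itertools


class ReadWriteRouter:
    """Send plain reads to replicas in round-robin order and everything else to the primary."""

    def __init__(self, primary, replicas):
        self.primary = primary
        # itertools.cycle repeats the replica list forever; None means "no replicas".
        self._replicas = itertools.cycle(replicas) if replicas else None

    def route(self, sql):
        # Hypothetical rule: classify a statement by its first keyword.
        words = sql.split(None, 1)
        verb = words[0].upper() if words else ""
        is_plain_read = verb == "SELECT" and "FOR UPDATE" not in sql.upper()
        if is_plain_read and self._replicas is not None:
            return next(self._replicas)
        # Writes, locking reads and anything unrecognized go to the primary.
        return self.primary


# Server names are placeholders, not real hosts.
router = ReadWriteRouter("primary-db", ["replica-1", "replica-2"])
print(router.route("SELECT name FROM players"))
print(router.route("UPDATE players SET points = 12 WHERE id = 7"))
```

Even this toy version shows why the work is error-prone: one statement that is misclassified as a read ends up on a server that cannot handle it.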

Any recent changes by ESPN to increase database capacity or update the app could jeopardize that fragile connection. If ESPN recently modified the application to talk to different database servers, for example, the team might have accidentally introduced a "bad" query that the database can't handle or might have changed how the application talks to the database and broken that connection.

Organizations that are anticipating a surge in traffic have a number of best practices they should follow to ensure a smooth experience for their customers, including:

1. Freezing code early

Despite the understandable desire to make the app or site as current as possible, it's essential for engineering to force a code freeze many weeks before the "go live" date. Quality assurance (QA) and other testing require adequate time to ensure the updated site or app is working as needed.

2. Load testing

A big part of that testing work needs to come in the form of load testing. After a QA team has performed functional testing – that is, checking that each feature works – the next step is to see how the code performs when it's swamped with traffic. The key is to perform this load testing with traffic that's as close to production traffic as possible.
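As a rough illustration of what a load test measures, the sketch below replays a list of requests concurrently and reports error counts and latency percentiles. The function name `run_load_test`, the `handler` callable and the sample request list are assumptions made for this example; in practice the request list would come from recorded production traffic and the handler would call the real service.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def run_load_test(handler, requests, concurrency=8):
    """Replay requests concurrently and summarize latency and errors."""

    def timed_call(request):
        start = time.perf_counter()
        try:
            handler(request)
            ok = True
        except Exception:
            ok = False
        return time.perf_counter() - start, ok

    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        results = list(pool.map(timed_call, requests))

    latencies = sorted(latency for latency, _ in results)
    errors = sum(1 for _, ok in results if not ok)
    if not latencies:
        return {"requests": 0, "errors": 0, "p50": 0.0, "p95": 0.0}

    def percentile(fraction):
        # Nearest-rank style lookup into the sorted latency list.
        index = min(len(latencies) - 1, int(fraction * len(latencies)))
        return latencies[index]

    return {
        "requests": len(results),
        "errors": errors,
        "p50": percentile(0.50),
        "p95": percentile(0.95),
    }


def fake_handler(request):
    # Stand-in for a real service call; sleeps briefly to simulate work.
    time.sleep(0.001)


print(run_load_test(fake_handler, ["/scoreboard"] * 50, concurrency=10))
```

Watching how the 95th-percentile latency and the error count change as concurrency rises is one simple way to find where the system starts to struggle.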

3. Increasing resiliency at the data tier

The lifeblood of any app or site is data; without it, you're down. To build in resiliency at this layer, organizations need to employ techniques such as database scale out, which keeps multiple copies of the data available, and database load balancing, which ensures each request is serviced by the fastest-responding server.
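One way to picture "fastest-responding server" selection is a balancer that keeps a running average of each server's response time and picks the lowest. The sketch below assumes an exponentially weighted moving average and hypothetical server names; it is an illustration of the idea, not a description of how any particular load balancing product works.

```python
class LatencyAwareBalancer:
    """Pick the database server with the lowest smoothed response time."""

    def __init__(self, servers, smoothing=0.3):
        self.smoothing = smoothing
        # None means the server has not been measured yet.
        self.avg_latency = {server: None for server in servers}

    def record(self, server, latency_seconds):
        previous = self.avg_latency[server]
        if previous is None:
            self.avg_latency[server] = latency_seconds
        else:
            # Exponentially weighted moving average: recent samples count more.
            self.avg_latency[server] = (
                self.smoothing * latency_seconds
                + (1 - self.smoothing) * previous
            )

    def pick(self):
        # Try unmeasured servers first so every copy of the data gets sampled.
        for server, average in self.avg_latency.items():
            if average is None:
                return server
        return min(self.avg_latency, key=self.avg_latency.get)


# Server names and latencies are made up for the example.
balancer = LatencyAwareBalancer(["db-a", "db-b"])
balancer.record("db-a", 0.200)
balancer.record("db-b", 0.050)
print(balancer.pick())
```

Because the average is updated after every response, a server that slows down under load gradually loses traffic to its faster peers.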

4. Enabling redundancy in all network services

Beyond the data tier, organizations need to make sure the rest of the technology stack has as much redundancy built in as possible. Web server infrastructure and web load balancers are critical, as is network redundancy into both the web farms and the database server clusters. If you're hosting the app or service in the cloud, ensure a redundant version is available in an alternate cloud region.
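The value of a redundant copy in an alternate region shows up in the failover path. The sketch below tries each endpoint in order and moves on when one is unreachable. The function name, the region names and the `send` callable are hypothetical; real deployments usually handle this in DNS or a global load balancer rather than in application code.

```python
def call_with_failover(endpoints, send):
    """Try each endpoint in order; return (endpoint, response) from the first that answers."""
    failed = []
    for endpoint in endpoints:
        try:
            return endpoint, send(endpoint)
        except ConnectionError:
            # This endpoint is unreachable; remember it and try the next one.
            failed.append(endpoint)
    raise RuntimeError(f"all endpoints failed: {failed}")


def fake_send(endpoint):
    # Stand-in for a real network call: pretend the primary region is down.
    if endpoint == "primary-region":
        raise ConnectionError("primary region unreachable")
    return "ok"


print(call_with_failover(["primary-region", "alternate-region"], fake_send))
```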

Michelle McLean is VP of Marketing at ScaleArc.
