Site icon The Next Platform

When Data Needs More Firepower: The HPC, Analytics Convergence

Big data arguably originated in the global high-performance computing (HPC) community in the 1950s for government applications such as cryptography, weather forecasting, and space exploration. High-performance data analysis (HPDA)—big data using HPC technology—moved into the private sector in the 1980s, especially for data-intensive modeling and simulation to develop physical products such as cars and airplanes. In the late 1980s, the financial services industry (FSI) became the first commercial market to use HPC technology for advanced data analytics (as opposed to modeling and simulation). Investment banks began to use HPC systems for daunting analytics tasks such as optimizing portfolios of mortgage-backed securities, pricing exotic financial instruments, and managing firm-wide risk. More recently, high-frequency trading joined the list of HPC-enabled FSI applications.

The invention of the cluster by two NASA HPC experts in 1994 made HPC technology far more affordable and helped propel HPC market growth from about $2 billion in 1990 to more than $20 billion in 2013. More than 100,000 HPC systems are now sold each year at starting prices below $50,000, and many of them head into the private sector.

What’s New?

It’s widely known that industrial firms of all sizes have adopted HPC to speed the development of products ranging from cars and planes to golf clubs and potato chips. But lately, something new is happening.Leading commercial companies in a variety of market segments are turning to HPC-born parallel and distributed computing technologies — clusters, grids, and clouds — for challenging big data analytics workloads that enterprise IT technology alone cannot handle effectively. IDC estimates that the move to HPC has already saved PayPal more than $700 million and is saving tens of millions of dollars per year for some others.

The commercial trend isn’t totally surprising when you realize that some of the key technologies underpinning business analytics (BA) originated in the world of HPC. The evolution of these HPC-born technologies for business analytics has taken two major leaps and is in the midst of a third. The advances have followed this sequence:

Why Businesses Turn to HPC for Advanced Data Analytics

High-performance data analysis is the term IDC coined to describe the formative market for big data workloads that exploit HPC resources. The HPDA market represents the convergence of long-standing, data-intensive modeling and simulation (M&S) methods in the HPC industry/application segments that IDC has tracked for more than 25 years and newer high-performance analytics methods that are increasingly employed in these segments as well as by commercial organizations that are adopting HPC for the first time. HPDA may employ either long-standing numerical M&S methods, newer methods such as large-scale graph analytics, semantic technologies, and knowledge discovery algorithms, or some combination of long-standing and newer methods.

The factors driving businesses to adopt HPC for big data analytics (i.e., HPDA) fall into a few main categories:

Examples of HPDA in the Enterprise

The following examples illustrate the expanding range of HPC usage for advanced business analytics/business intelligence (BA/BI):

Economically Important HPDA Use Cases

It’s easy enough to cite interesting examples, but much more important for the future of HPDA is determining which examples represent repetitive use cases that are evolving into pursuable market segments. After closely tracking the formation of the HPDA market for more than five years, especially actual sales of HPC compute and storage systems, IDC has added the following new commercial HPDA applications to the established HPC segments we have reported on for more than 25 years. In 2014, for the first time, we produced detailed five-year forecast for the new HPDA segments:

HPDA is a fast-growing, formative, worldwide market that is still heavily in motion. One thing is nearly certain; however, HPDA is evolving from static searches to an emerging era of higher-value, dynamic pattern discovery. The challenge presented by these problems is to discover hidden patterns and relationships — things you didn’t know were there — and then to track patterns dynamically as they form and evolve or dissolve. Many of the commercial examples cited previously in this document — from fraud detection and BA/BI to marketing — already benefit from graph analytics and other pattern discovery methods.

Perhaps no field has stronger potential for benefiting from HPDA in general, and pattern discovery in particular, than bioscience. HPDA applications already in motion in this varied field range from advanced research — notably in genomics, proteomics, epidemiology, and systems biology — to commercial initiatives to develop new drugs and medical treatments, agricultural pesticides, and other bioproducts.

One of the world’s most socially and economically important HPDA thrusts will almost surely be the multiyear transition from today’s procedures-based medicine to personalized, outcomes-based healthcare. Identifying highly effective treatments in near real time (while the patient is still in the office) by comparing an individual’s genetic makeup, health history, and symptomology against tens of millions of archived patient records poses enormous HPDA challenges that may take another decade to master. The goal here is for the computer to process all this data and generate efficacy ratings for a range of treatment options. The options will eventually be highly personal: what constitutes a good outcome for a broken hand will vary, depending on whether the patient is an office worker or a concert violinist. When this capability matures, it will likely serve as a decision-support tool of unprecedented utility for the global healthcare community.

HPDA Market Prospects

The HPDA vendor scene is becoming increasingly heterogeneous and vibrant. The analytics side of the formative HPDA market is where traditional HPC users and first-time commercial adopters are converging most rapidly. Established vendors that have served each of these customer groups are exploiting this convergence by following their buyers into the new HPDA analytics territory.

IDC forecasts that revenue for HPDA-focused servers will grow robustly (13.3% CAGR), increasing from $743.8 million in 2012 to reach $2.7 billion in 2018. HPDA storage revenue will approach $1.6 billion in the latter year. The most serious technical challenge to liberating HPDA growth is data movement and management, although the HPDA market should be seen more fundamentally as a war among clever algorithms.

The growing market for HPDA is already enlarging HPC’s long record of contributions to science, commerce, and society. HPDA promises to play a major role in helping commercial firms to address the major opportunities and challenges of the 21st century.

Key Considerations

Competitive forces will increasingly drive leading firms to follow the examples of PayPal and other commercial pioneers that are using HPC technology to move beyond today’s static searches to exploit higher-value, dynamic pattern discovery. Knowledge-based businesses should explore HPC technology to see if it is a good fit for their evolving big data opportunities and challenges.

Exit mobile version