SlideShare a Scribd company logo
Case study
HP finds big value in HP
Vertica big data solution
Time to analyze clickstream data 				
reduced from days to minutes
Industry
Technology
Objective
Streamline processing of hp.com clickstream data
Approach
Implement big data analytics and storage solutions
IT matters
•	Solution easily accommodates billions of rows 	
of data generated by hp.com visitor clicks
•	Queries returned in minutes instead of days,
allowing users to perform more complex, 		
iterative data analysis
•	Industry-standard SQL ensures user familiarity,
maximizing acceptance and ROI
Business matters
•	Improved ability to identify and correct 		
issues with website hardware or software, 	
which reduces risks of degraded customer
experience and lost sales
•	Improved ability to deliver interactive, 	
personalized website experience, which	
improves sales conversions and drives 		
sales and revenue
“Our clickstream implementation of HP Vertica and 	
Apache Hadoop demonstrates the enormous value 		
of these technologies. We expect more and more HP
customers will follow suit and adopt the same 		
approach for their big data analytics.”
—John Lormand, director, HP.com Technology
Big data has value, but to realize that value, businesses need to
evolve from legacy batch processing technologies to solutions
that support real-time interactive analysis. HP Vertica Analytics
and the open source Apache Hadoop software offer a big data
solution that HP has leveraged internally to improve its
clickstream analytics capabilities.
2
It is true that “time is money.”
But so is data.
And today’s companies are increasingly aware
that the more data they collect, the more
value it has. “Big Data”—the enormous data
sets generated when companies capture
highly granular digital information—can help
companies drive innovation and productivity.
Big data can also help companies identify
new opportunities and markets, and deepen
their understanding of customer needs and
behaviors. And big data can give companies
a competitive edge and help them better
understand risk.
But to mine the value of big data, companies
need cutting-edge technology. They must be
able to analyze data sets that dwarf the size of
traditional databases. And that analysis must
be speedy, to ensure that companies can act
on it in a timely fashion.
That is why HP has integrated its own
technology, the HP Vertica Analytics
Platform, with the open-source Apache
Hadoop software, to create a robust and
comprehensive big data analytics solution.
Billions of clicks
As is true for many companies today, HP’s
public face is its corporate website hp.com.
The site is visited by millions of people each
month and functions as one of the company’s
most important marketing communications
vehicles. The site allows HP to offer thousands
of pages of searchable information about its
products and services directly to the public.
The site also serves as a virtual storefront,
enabling HP to engage customers and 	
transact business with them.
During the course of the millions of
interactions with site visitors, hp.com
generates “clickstream” data, including
information on what pages visitors load,
how much time they spend on each page,
what links they click, and how they exit the
site. Analyzing this clickstream data, in turn,
allows HP to deepen its understanding of
website visitors. The company can better
understand how visitors interact with the site.
As a result, the data can be used to improve
the site itself—making it more usable, for
example, or ensuring visitors can easily find
the information they need.
More broadly, analyzing clickstream data
yields insight into customer behavior, such as
buying behaviors. This enables HP to refine
its sales and marketing campaigns, or even
its products and services themselves. “The
biggest consumer of clickstream data is our
marketing analysts,” notes John Lormand,
director, HP.com Technology. “It is broadly
recognized for its value in helping us refine
how we communicate to the public and
position our solutions.”
At one time, HP stored its clickstream data
using traditional Oracle databases, and
performed modeling and analytics with 	
SAS Analytics software.
But the data sets are enormous, due to a
number of factors.
First is the volume of hp.com traffic and the
number of clicks each visitor makes per visit.
“We capture 11 to 12 billion clicks per month,”
Lormand says. To fully support trending and
comparative analysis, HP must store around
five years’ worth of clickstream data; analysts
typically want to work with about 15 months’
worth at a time to perform year over year
trend analysis. This allows the analysts to
account for seasonality and show correlation
to previous year’s traffic.
The site itself is also extremely complex—
more a collection of services than a single
application—and is not a static environment.
Many pages are generated dynamically,
based on information provided by the visitor
or visitor behavior. “HP.com is an integrated
environment, with some pieces generated by
HP and others served up by service providers,”
Lormand explains.
HP’s clickstream database, in fact, was HP’s
largest Oracle instance.
The sheer volume of the data collected created
issues, however. The database performance
was sluggish; queries could take days to
process. “Query results were taking at least
48 hours after each day’s transactions were
completed,” Lormand notes. “And more
complex analytics were impossible to do, in
practical terms—they simply took too long.
“We knew we needed to improve our
clickstream analytics capabilities.”
Case study | HP Vertica big data solution
3
Infrastructure Management Insight
Intelligent data centers of
the future
Intelligent, workload
optimized solutions
Intelligent Business
Decisions, Faster
Case study | HP Vertica big data solution
Big data analytics, 	
user-friendly model
So HP turned to its own HP Software
portfolio, leveraging the HP Vertica Analytics
Platform, an industry-leading big data
solution that supports real-time querying
and loading, advanced in-database analytics,
and sophisticated storage and execution
functionality to speed queries 50 to 1,000
times faster than traditional databases. HP
integrated the Vertica solution with Apache
Hadoop as its distributed file system. “Both
applications are massively parallel processing
systems designed for low-cost big data
processing,” notes Lormand. “And in terms of
functionality, they are highly complementary.
Hadoop enables efficient loading of structured
and unstructured data. Vertica enables
efficient, extreme analytics.”
As a result, instead of waiting days to perform
queries, analysts can now get query results 	
in hours or even minutes—even when 		
working with the extremely large data 		
sets that are stored within the hp.com
clickstream database.
Another key advantage of Vertica is that
it is based on ANSI SQL, a structure that is
familiar to HP analysts. “Vertica offers big
data analytics capabilities in a user-friendly
engagement model,” Lormand explains.
“This helped ensure user acceptance of the
technology. As soon as we rolled out the
solution, it was embraced by our analysts.”
Faster, more 		
flexible analytics
Today, HP’s big data solution—which
easily accommodates the billions of rows
of clickstream data generated by hp.com
visitors—is delivering enhanced analytics
capabilities to the company’s business users.
Because the combined HP Vertica and Hadoop
solution performs analysis much more quickly,
it allows HP to interact with its data more
flexibly and fluidly. “Our HP Vertica solution
allows more recursive, repetitive types
of analysis on our clickstream database,”
Lormand notes. “So now, when analysts notice
something of interest, they can easily perform
iterative queries. This lets them follow a
particular train of thought because they don’t
have to wait for days between queries. This
creates a ‘conversation with the data’ that
helps us uncover those hidden insights in the
massive amounts of clickstream data.”
Faster and more flexible analytics, in turn,
means HP’s understanding of clickstream
data is more sophisticated and nuanced. “Our
analysts can correlate data points in ways
they never could before because our Oracle
solution simply couldn’t process the requests,”
Lormand explains.
The business benefits of these enhanced
analytics capabilities will be significant. HP
will be better equipped to improve its website
functionality and architecture. It can more
easily correlate events across its server farms,
for example, which will allow it to identify
and isolate anomalies that will yield insights
into how website functionality is affecting
user interactions. “Our HP Vertica solution
gives us a true, end-to-end picture of our
environment,” says Lormand. “And because
it gives us faster results, we can respond to
issues more quickly.”
HP will be able to better tailor its website
interactivity to the needs of individual visitors,
delivering a more precise and granular
shopping experience. In the past, for example,
the site guided visitors to information on the
Fast, flexible 	
analytics unleashes
big data’s power
HP Big Data Solutions
Rate this documentShare with colleagues
Sign up for updates
hp.com/go/getupdated
basis of broad categories. If the visitor seemed
to fit the profile of a typical retail customer,
that visitor would be guided to one set of
solutions. Visitors fitting the profile of a 	
home office user would be led to a different
subset of products.
But some visitors don’t always fit neatly 	
into these categories. Now, thanks to the
insight gained via the HP Vertica solution, 	
HP can build website functionality that 	
ensures the site responds appropriately 	
to all kinds of visitors. And this, in turn, will
enhance 	visitor satisfaction and improve 	
sales conversion rates.
HP also uses HP Vertica to analyze channel
rebate data, a task that requires matching
serial numbers on rebate claims to 135 million
rows of shipment data. This has improved
HP’s ability to forecast rebates and to execute
quarter-end financial adjustments.
Given the importance of hp.com to the
company’s revenue and brand, it’s likely that
managing clickstream data will continue to
be an important use of its HP Vertica solution.
“We know that our website must deliver an
interactive and personalized experience to
our visitors,” Lormand concludes. “It’s a key
strategic goal, and HP Vertica gives us critical
capabilities that we need to achieve it.”
Customer at a glance
Application
Big data analytics
Software
HP IT Performance Suite—Information
Management
•	HP Vertica Analytics Platform
•	Apache Hadoop
© Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only
warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein
should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein.
4AA4-5388ENW, February 2013
Case study | HP Vertica big data solution

More Related Content

PPTX
Big Data : a 360° Overview
Juvénal CHOKOGOUE
 
PDF
HP Vertica
Satya Harish
 
PDF
Customer 360
Dave Birckhead
 
PDF
Infrastructure To Cloud Transformation
Michael Graber
 
PDF
The Connected Consumer – Real-time Customer 360
Capgemini
 
PDF
HP Vertica and IIS: Big Data = Big Literacy
IIS International Integrated Solutions
 
PDF
Customer Event Hub - the modern Customer 360° view
Guido Schmutz
 
PDF
Accelerate Actionable Insights with the Business Data Lake
Capgemini
 
Big Data : a 360° Overview
Juvénal CHOKOGOUE
 
HP Vertica
Satya Harish
 
Customer 360
Dave Birckhead
 
Infrastructure To Cloud Transformation
Michael Graber
 
The Connected Consumer – Real-time Customer 360
Capgemini
 
HP Vertica and IIS: Big Data = Big Literacy
IIS International Integrated Solutions
 
Customer Event Hub - the modern Customer 360° view
Guido Schmutz
 
Accelerate Actionable Insights with the Business Data Lake
Capgemini
 

What's hot (20)

PDF
Big Data Analytics for Contact Centers
Rajender K Salgam
 
PDF
Mir 1808 cus_datplat
EvoLife.bg
 
PDF
S ba0881 big-data-use-cases-pearson-edge2015-v7
Tony Pearson
 
PDF
All Customers are Not Alike: Gaining a 360 Degree View
G3 Communications
 
PPTX
Monitizing Big Data at Telecom Service Providers
DataWorks Summit
 
PDF
Extended 360 degree view of customer
Trisha Dutta
 
PDF
Driving Customer Loyalty with Azure Machine Learning
CCG
 
PPTX
From Data to Data Driven - Applications that will change your business
NG DATA
 
PDF
ANTS - 360 view of your customer - bigdata innovation summit 2016
Dinh Le Dat (Kevin D.)
 
PPT
8.17.11 big data and hadoop with informatica slideshare
Julianna DeLua
 
PPTX
Business Visualization: Dashboard & Storyboarding
NMIMS Global Access School of Continuing Education (NGA-SCE)
 
PDF
Hortonworks hadoop big data_retail__white_paper
Shyam Babu
 
PPTX
M&A Trends in Telco Analytics
Open Analytics
 
PPTX
Big Data Hadoop Customer 360 Degree View
BP PODDAR INSTITUTE OF MANAGEMENT AND TECHNOLOGY
 
PDF
Lead to Cash: The Value of Big Data and Analytics for Telco
Sam Thomsett
 
PDF
Crowdstar case-study
Satya Harish
 
PPTX
Big Data Analytics with Microsoft
Caserta
 
PDF
Semantic 'Radar' Steers Users to Insights in the Data Lake
Cognizant
 
PDF
Big data solutions explained for marketeers & business executives
Agile Delivery
 
PDF
Retail Big Data and Analytics
Cloudera, Inc.
 
Big Data Analytics for Contact Centers
Rajender K Salgam
 
Mir 1808 cus_datplat
EvoLife.bg
 
S ba0881 big-data-use-cases-pearson-edge2015-v7
Tony Pearson
 
All Customers are Not Alike: Gaining a 360 Degree View
G3 Communications
 
Monitizing Big Data at Telecom Service Providers
DataWorks Summit
 
Extended 360 degree view of customer
Trisha Dutta
 
Driving Customer Loyalty with Azure Machine Learning
CCG
 
From Data to Data Driven - Applications that will change your business
NG DATA
 
ANTS - 360 view of your customer - bigdata innovation summit 2016
Dinh Le Dat (Kevin D.)
 
8.17.11 big data and hadoop with informatica slideshare
Julianna DeLua
 
Business Visualization: Dashboard & Storyboarding
NMIMS Global Access School of Continuing Education (NGA-SCE)
 
Hortonworks hadoop big data_retail__white_paper
Shyam Babu
 
M&A Trends in Telco Analytics
Open Analytics
 
Big Data Hadoop Customer 360 Degree View
BP PODDAR INSTITUTE OF MANAGEMENT AND TECHNOLOGY
 
Lead to Cash: The Value of Big Data and Analytics for Telco
Sam Thomsett
 
Crowdstar case-study
Satya Harish
 
Big Data Analytics with Microsoft
Caserta
 
Semantic 'Radar' Steers Users to Insights in the Data Lake
Cognizant
 
Big data solutions explained for marketeers & business executives
Agile Delivery
 
Retail Big Data and Analytics
Cloudera, Inc.
 
Ad

Viewers also liked (11)

PDF
MEM's Trends 2013
Satya Harish
 
PDF
G06.2014 magic quadrant for the wired and wireless lan access infrastructure
Satya Harish
 
PDF
G12.2012 magic quadrant for enterprise information archiving
Satya Harish
 
PDF
G06.2014 magic quadrant for secure web gateways
Satya Harish
 
PDF
Quantimetrica ultra low_power_voice_switch_feb2015
Satya Harish
 
PDF
Aims technology.wp2.2015
Satya Harish
 
PDF
Gartner HH 2015 - 2005 Hype Cycle
Satya Harish
 
PDF
BOOK - IBM zOS V1R10 communications server TCP / IP implementation volume 1 b...
Satya Harish
 
PDF
BOOK - IBM DB2 9 FOR zOS
Satya Harish
 
PDF
G02.2013 magic quadrant for enterprise network firewall
Satya Harish
 
PDF
BOOK - IBM Sterling B2B Integration and Managed File Transfer Solutions
Satya Harish
 
MEM's Trends 2013
Satya Harish
 
G06.2014 magic quadrant for the wired and wireless lan access infrastructure
Satya Harish
 
G12.2012 magic quadrant for enterprise information archiving
Satya Harish
 
G06.2014 magic quadrant for secure web gateways
Satya Harish
 
Quantimetrica ultra low_power_voice_switch_feb2015
Satya Harish
 
Aims technology.wp2.2015
Satya Harish
 
Gartner HH 2015 - 2005 Hype Cycle
Satya Harish
 
BOOK - IBM zOS V1R10 communications server TCP / IP implementation volume 1 b...
Satya Harish
 
BOOK - IBM DB2 9 FOR zOS
Satya Harish
 
G02.2013 magic quadrant for enterprise network firewall
Satya Harish
 
BOOK - IBM Sterling B2B Integration and Managed File Transfer Solutions
Satya Harish
 
Ad

Similar to Hp big data_casestudy (20)

PDF
Next-Gen Analytics: Conversing with Big Data
IIS International Integrated Solutions
 
PPTX
How Startups can leverage big data?
Rackspace
 
PDF
HP Software Performance Tour 2014 - See the Big Picture in Big Data
HP Enterprise Italia
 
PPTX
Big data an elephant business opportunities
Bigdata Meetup Kochi
 
PPTX
Evlotion of Big Data in Big data vs traditional Business
BackiyalakshmiVenkat
 
PDF
The Story of HPE Haven OnDemand
Alon Mei-raz
 
PDF
The_Story_of_HavenOndemand_External
Fernando Lucini
 
PDF
Hortonworks and HP Vertica Webinar
Hortonworks
 
PDF
TDWI checklist - Evolving to Modern DW
Jeannette Browning
 
PDF
Hadoop 2.0: YARN to Further Optimize Data Processing
Hortonworks
 
PDF
Come fare business con i big data in concreto
HP Enterprise Italia
 
PDF
Splunk-hortonworks-risk-management-oct-2014
Hortonworks
 
PPT
Hadoop Demo eConvergence
kvnnrao
 
PDF
Big Data Tools: A Deep Dive into Essential Tools
FredReynolds2
 
PPTX
Cognitive Procurement Masterclass with IBM - SID 51774
SAP Ariba
 
PDF
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Hortonworks
 
PPTX
Getting Started with BI Analytics on HANA
Dickinson + Associates
 
PDF
A Winning Strategy for the Digital Economy
Eric Kavanagh
 
PDF
Hortonworks.HadoopPatternsOfUse.201304
James Kenney
 
PPTX
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014
 
Next-Gen Analytics: Conversing with Big Data
IIS International Integrated Solutions
 
How Startups can leverage big data?
Rackspace
 
HP Software Performance Tour 2014 - See the Big Picture in Big Data
HP Enterprise Italia
 
Big data an elephant business opportunities
Bigdata Meetup Kochi
 
Evlotion of Big Data in Big data vs traditional Business
BackiyalakshmiVenkat
 
The Story of HPE Haven OnDemand
Alon Mei-raz
 
The_Story_of_HavenOndemand_External
Fernando Lucini
 
Hortonworks and HP Vertica Webinar
Hortonworks
 
TDWI checklist - Evolving to Modern DW
Jeannette Browning
 
Hadoop 2.0: YARN to Further Optimize Data Processing
Hortonworks
 
Come fare business con i big data in concreto
HP Enterprise Italia
 
Splunk-hortonworks-risk-management-oct-2014
Hortonworks
 
Hadoop Demo eConvergence
kvnnrao
 
Big Data Tools: A Deep Dive into Essential Tools
FredReynolds2
 
Cognitive Procurement Masterclass with IBM - SID 51774
SAP Ariba
 
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Hortonworks
 
Getting Started with BI Analytics on HANA
Dickinson + Associates
 
A Winning Strategy for the Digital Economy
Eric Kavanagh
 
Hortonworks.HadoopPatternsOfUse.201304
James Kenney
 
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014
 

More from Satya Harish (20)

PDF
Workday-hrtechnologyconferencedebihirshlagflextronics
Satya Harish
 
PDF
WorkDay-surviving and thriving in a world of change
Satya Harish
 
PDF
Book scrum tutorial
Satya Harish
 
PDF
O - Oracle application testing suite test starter kits for oracle e business ...
Satya Harish
 
PDF
Qualcomm
Satya Harish
 
DOCX
Book HH - SQL MATERIAL
Satya Harish
 
PDF
Book HH- vb2008me preview
Satya Harish
 
PDF
Book HH- vb6 preview
Satya Harish
 
PDF
G03.2014 Intelligent Business Process Management Suites
Satya Harish
 
PDF
G05.2013 Critical Capabilities for SIEM
Satya Harish
 
PDF
G07.2013 Application Security Testing
Satya Harish
 
PDF
G05.2015 Secure Web Gateways
Satya Harish
 
PDF
G11.2013 Application Development Life Cycle Management
Satya Harish
 
PDF
G10.2013 Application Delivery Controllers
Satya Harish
 
PDF
G06.2014 Security Information and Event Management
Satya Harish
 
PDF
G05.2013 Security Information and Event Management
Satya Harish
 
PDF
G05.2015 - Magic quadrant for cloud infrastructure as a service
Satya Harish
 
PDF
G05.2014 - Magic quadrant for cloud infrastructure as a service
Satya Harish
 
PDF
PERIODIC TABLE OF SEO SUCCESS FACTOR
Satya Harish
 
PDF
BOOK - IBM tivoli netcool service quality manager data mediation gateway deve...
Satya Harish
 
Workday-hrtechnologyconferencedebihirshlagflextronics
Satya Harish
 
WorkDay-surviving and thriving in a world of change
Satya Harish
 
Book scrum tutorial
Satya Harish
 
O - Oracle application testing suite test starter kits for oracle e business ...
Satya Harish
 
Qualcomm
Satya Harish
 
Book HH - SQL MATERIAL
Satya Harish
 
Book HH- vb2008me preview
Satya Harish
 
Book HH- vb6 preview
Satya Harish
 
G03.2014 Intelligent Business Process Management Suites
Satya Harish
 
G05.2013 Critical Capabilities for SIEM
Satya Harish
 
G07.2013 Application Security Testing
Satya Harish
 
G05.2015 Secure Web Gateways
Satya Harish
 
G11.2013 Application Development Life Cycle Management
Satya Harish
 
G10.2013 Application Delivery Controllers
Satya Harish
 
G06.2014 Security Information and Event Management
Satya Harish
 
G05.2013 Security Information and Event Management
Satya Harish
 
G05.2015 - Magic quadrant for cloud infrastructure as a service
Satya Harish
 
G05.2014 - Magic quadrant for cloud infrastructure as a service
Satya Harish
 
PERIODIC TABLE OF SEO SUCCESS FACTOR
Satya Harish
 
BOOK - IBM tivoli netcool service quality manager data mediation gateway deve...
Satya Harish
 

Recently uploaded (20)

PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PDF
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
PDF
The Future of Artificial Intelligence (AI)
Mukul
 
PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PDF
Brief History of Internet - Early Days of Internet
sutharharshit158
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
PDF
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
PDF
Doc9.....................................
SofiaCollazos
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
PDF
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
PPTX
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PPTX
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
The Future of Artificial Intelligence (AI)
Mukul
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
Brief History of Internet - Early Days of Internet
sutharharshit158
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
Doc9.....................................
SofiaCollazos
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 

Hp big data_casestudy

  • 1. Case study HP finds big value in HP Vertica big data solution Time to analyze clickstream data reduced from days to minutes Industry Technology Objective Streamline processing of hp.com clickstream data Approach Implement big data analytics and storage solutions IT matters • Solution easily accommodates billions of rows of data generated by hp.com visitor clicks • Queries returned in minutes instead of days, allowing users to perform more complex, iterative data analysis • Industry-standard SQL ensures user familiarity, maximizing acceptance and ROI Business matters • Improved ability to identify and correct issues with website hardware or software, which reduces risks of degraded customer experience and lost sales • Improved ability to deliver interactive, personalized website experience, which improves sales conversions and drives sales and revenue “Our clickstream implementation of HP Vertica and Apache Hadoop demonstrates the enormous value of these technologies. We expect more and more HP customers will follow suit and adopt the same approach for their big data analytics.” —John Lormand, director, HP.com Technology Big data has value, but to realize that value, businesses need to evolve from legacy batch processing technologies to solutions that support real-time interactive analysis. HP Vertica Analytics and the open source Apache Hadoop software offer a big data solution that HP has leveraged internally to improve its clickstream analytics capabilities.
  • 2. 2 It is true that “time is money.” But so is data. And today’s companies are increasingly aware that the more data they collect, the more value it has. “Big Data”—the enormous data sets generated when companies capture highly granular digital information—can help companies drive innovation and productivity. Big data can also help companies identify new opportunities and markets, and deepen their understanding of customer needs and behaviors. And big data can give companies a competitive edge and help them better understand risk. But to mine the value of big data, companies need cutting-edge technology. They must be able to analyze data sets that dwarf the size of traditional databases. And that analysis must be speedy, to ensure that companies can act on it in a timely fashion. That is why HP has integrated its own technology, the HP Vertica Analytics Platform, with the open-source Apache Hadoop software, to create a robust and comprehensive big data analytics solution. Billions of clicks As is true for many companies today, HP’s public face is its corporate website hp.com. The site is visited by millions of people each month and functions as one of the company’s most important marketing communications vehicles. The site allows HP to offer thousands of pages of searchable information about its products and services directly to the public. The site also serves as a virtual storefront, enabling HP to engage customers and transact business with them. During the course of the millions of interactions with site visitors, hp.com generates “clickstream” data, including information on what pages visitors load, how much time they spend on each page, what links they click, and how they exit the site. Analyzing this clickstream data, in turn, allows HP to deepen its understanding of website visitors. The company can better understand how visitors interact with the site. As a result, the data can be used to improve the site itself—making it more usable, for example, or ensuring visitors can easily find the information they need. More broadly, analyzing clickstream data yields insight into customer behavior, such as buying behaviors. This enables HP to refine its sales and marketing campaigns, or even its products and services themselves. “The biggest consumer of clickstream data is our marketing analysts,” notes John Lormand, director, HP.com Technology. “It is broadly recognized for its value in helping us refine how we communicate to the public and position our solutions.” At one time, HP stored its clickstream data using traditional Oracle databases, and performed modeling and analytics with SAS Analytics software. But the data sets are enormous, due to a number of factors. First is the volume of hp.com traffic and the number of clicks each visitor makes per visit. “We capture 11 to 12 billion clicks per month,” Lormand says. To fully support trending and comparative analysis, HP must store around five years’ worth of clickstream data; analysts typically want to work with about 15 months’ worth at a time to perform year over year trend analysis. This allows the analysts to account for seasonality and show correlation to previous year’s traffic. The site itself is also extremely complex— more a collection of services than a single application—and is not a static environment. Many pages are generated dynamically, based on information provided by the visitor or visitor behavior. “HP.com is an integrated environment, with some pieces generated by HP and others served up by service providers,” Lormand explains. HP’s clickstream database, in fact, was HP’s largest Oracle instance. The sheer volume of the data collected created issues, however. The database performance was sluggish; queries could take days to process. “Query results were taking at least 48 hours after each day’s transactions were completed,” Lormand notes. “And more complex analytics were impossible to do, in practical terms—they simply took too long. “We knew we needed to improve our clickstream analytics capabilities.” Case study | HP Vertica big data solution
  • 3. 3 Infrastructure Management Insight Intelligent data centers of the future Intelligent, workload optimized solutions Intelligent Business Decisions, Faster Case study | HP Vertica big data solution Big data analytics, user-friendly model So HP turned to its own HP Software portfolio, leveraging the HP Vertica Analytics Platform, an industry-leading big data solution that supports real-time querying and loading, advanced in-database analytics, and sophisticated storage and execution functionality to speed queries 50 to 1,000 times faster than traditional databases. HP integrated the Vertica solution with Apache Hadoop as its distributed file system. “Both applications are massively parallel processing systems designed for low-cost big data processing,” notes Lormand. “And in terms of functionality, they are highly complementary. Hadoop enables efficient loading of structured and unstructured data. Vertica enables efficient, extreme analytics.” As a result, instead of waiting days to perform queries, analysts can now get query results in hours or even minutes—even when working with the extremely large data sets that are stored within the hp.com clickstream database. Another key advantage of Vertica is that it is based on ANSI SQL, a structure that is familiar to HP analysts. “Vertica offers big data analytics capabilities in a user-friendly engagement model,” Lormand explains. “This helped ensure user acceptance of the technology. As soon as we rolled out the solution, it was embraced by our analysts.” Faster, more flexible analytics Today, HP’s big data solution—which easily accommodates the billions of rows of clickstream data generated by hp.com visitors—is delivering enhanced analytics capabilities to the company’s business users. Because the combined HP Vertica and Hadoop solution performs analysis much more quickly, it allows HP to interact with its data more flexibly and fluidly. “Our HP Vertica solution allows more recursive, repetitive types of analysis on our clickstream database,” Lormand notes. “So now, when analysts notice something of interest, they can easily perform iterative queries. This lets them follow a particular train of thought because they don’t have to wait for days between queries. This creates a ‘conversation with the data’ that helps us uncover those hidden insights in the massive amounts of clickstream data.” Faster and more flexible analytics, in turn, means HP’s understanding of clickstream data is more sophisticated and nuanced. “Our analysts can correlate data points in ways they never could before because our Oracle solution simply couldn’t process the requests,” Lormand explains. The business benefits of these enhanced analytics capabilities will be significant. HP will be better equipped to improve its website functionality and architecture. It can more easily correlate events across its server farms, for example, which will allow it to identify and isolate anomalies that will yield insights into how website functionality is affecting user interactions. “Our HP Vertica solution gives us a true, end-to-end picture of our environment,” says Lormand. “And because it gives us faster results, we can respond to issues more quickly.” HP will be able to better tailor its website interactivity to the needs of individual visitors, delivering a more precise and granular shopping experience. In the past, for example, the site guided visitors to information on the Fast, flexible analytics unleashes big data’s power HP Big Data Solutions
  • 4. Rate this documentShare with colleagues Sign up for updates hp.com/go/getupdated basis of broad categories. If the visitor seemed to fit the profile of a typical retail customer, that visitor would be guided to one set of solutions. Visitors fitting the profile of a home office user would be led to a different subset of products. But some visitors don’t always fit neatly into these categories. Now, thanks to the insight gained via the HP Vertica solution, HP can build website functionality that ensures the site responds appropriately to all kinds of visitors. And this, in turn, will enhance visitor satisfaction and improve sales conversion rates. HP also uses HP Vertica to analyze channel rebate data, a task that requires matching serial numbers on rebate claims to 135 million rows of shipment data. This has improved HP’s ability to forecast rebates and to execute quarter-end financial adjustments. Given the importance of hp.com to the company’s revenue and brand, it’s likely that managing clickstream data will continue to be an important use of its HP Vertica solution. “We know that our website must deliver an interactive and personalized experience to our visitors,” Lormand concludes. “It’s a key strategic goal, and HP Vertica gives us critical capabilities that we need to achieve it.” Customer at a glance Application Big data analytics Software HP IT Performance Suite—Information Management • HP Vertica Analytics Platform • Apache Hadoop © Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein. 4AA4-5388ENW, February 2013 Case study | HP Vertica big data solution