SlideShare a Scribd company logo
BIGDATA
The Technological Renaissance
Dr. Ritu Bhargava
Department Of Computer Science
Sophia Girls’ College,
Ajmer(Autonomous)
What is Big
Data?
According to Gartner,
information becomes big data
when the volume can no longer
be managed with normal
database tools.
DEFINATION
Big data is high-volume, high-
velocity, and high-variety
information assets that demand
cost-effective, innovative forms of
information processing for
enhanced insight and decision-
making.
Bigdata the technological renaissance
5 V’s Of Big Data
» Volume: Raw Data
» Velocity: Change over time
» Variety: Data types
» Veracity:Data Quality
» Value: Information for Decision
Making.
DATA IS EVERYWHERE
» The digital universe will grow from 3.2 zettabytes to 40 zettabytes in only six
years.
» Every day, we create 2.5 quintillion bytes of data — so much that 90% of the
data in the world today has been created in the last two years alone.
» This data comes from everywhere: sensors used to gather climate
information, posts to social media sites, digital pictures and videos, purchase
transaction records, and cell phone GPS signals .
Origins of Big Data Infrastructure
● The value generated by a social network is proportional to the number of
contacts between users of the social network, rather than the number of
users. According to Metcalfe’s Law[3], and its variants, the number of
contacts for N users is proportional to N*logN. Thus, the growth of contacts,
and therefore the interactions within a social network, which results in data
generation, is nonlinear with respect to number of users. As the world gets
more connected, one can expect the number of interactions to grow, resulting
in even more accelerated data growth.
RECENT STUDY
Google’s search index exploded from 26 Million pages in 1998, to more than 1
Trillion in less than a decade, this content was “multi-structured”, consisting of
natural language text, images, video, geo-spatial, and even renderings of
structured data.
Google had to develop , the Google File System (GFS), and MapReduce
programming framework.
These two publications became the blueprint for Apache Hadoop, an open
source framework that has become a de facto standard for big data platforms
deployed today.
Apache Hadoop
Yahoo adopted Apache Hadoop in January 2006, and made significant
contributions to make it a scalable and stable platform.
Today, Yahoo has the largest footprint of Apache Hadoop, running more than
45,000 servers managing more than 370 Petabytes of data with Hadoop.
Being an open source system, licensed under the liberal Apache Software
License, governed by the Apache Software Foundation.
The scalability and flexibility of Apache Hadoop prompted growing Internet
companies such as Facebook, Twitter, and LinkedIn to adopt it for their data
infrastructure.
Industrial Internet: The Next Frontier
The Big Data use-cases today are analysing customer behaviour, their buying
patterns, their likes and dislikes as expressed in social media,their clickstreams
and location information from mobile devices, machine-generated data could be
the next frontier for Big Data systems.
For example, in an automobile , thousands of signals being captured by 70+
sensors that generate more than 25 gigabytes of data every hour, and processed
by 70 on-board computers .
While most of this data is transient, and needs to be acted upon in real-time,
recognizing patterns within the data to improve safety and usability of the
automobile implies aggregating and analysing it offline.
Facts!
CEO
» Zuckerberg noted that 1 billion pieces of content are shared via Facebook’s
Open Graph daily .
» Facebook puts up over 10 million photographs every hour and around 3
billion ‘like’ buttons are pushed everyday .
» Google process more than 24 petabytes of data every day .
» 48 hours of video are uploaded to YouTube every minute, resulting in nearly
8 years of content every day .
» 70% of data is created by individuals – but enterprises are responsible for
storing and managing 80% of it .
» Every day, we create 2.5 quintillion bytes of data — so much that 90% of the
data in the world today has been created in the last two years alone.
Drivers and Opportunities
» Real-time prediction.
» Increase operational and supply chain efficiencies
» Deep insights into customer behaviour based on pattern and purchase
analysis
» Information aggregation
» Better and more scientific customer segmentation for targeted marketing and
product offering.
» Improve productivity and innovation
» McKinsey predicts an increase in job opportunities ranging from 140K to
190K
» Uncover hidden patterns and rapidly respond to changing scenarios.
» Multi-channel and multi-dimensional information aggregation
» Data convergence
CUSTOMER SEGMENTATION
Bigdata the technological renaissance
Bigdata the technological renaissance
Bigdata the technological renaissance
Bigdata the technological renaissance
Bigdata the technological renaissance
Market Opportunity
Big Data offer bigger opportunities. Here is a snapshot of some of the
predictions done by market research firms
IDC predicts the Big Market to grow to $16.9 Billion by 2019
Digital reasoning estimates that Big Data market would be worth $48.3 billion in
2019
Applications
» Better financial data management
» Investment banking using aggregated information from various sources likes
financial forecasting, asset pricing and portfolio management.
» More accurate pricing adjustments based on vast amount of real-time data
» Stock advises based on huge amount of stock data analysis, unstructured
data like social media content etc.
» Customer segmentation based on previous transactions and profile
information
» Analysis of purchase patterns and tailor made product offerings
» Unstructured data analysis from social media, multi-media to understand the
tastes, preferences, and customer patterns and do sentiment analysis
» Targeted marketing based on user segmentation
Thank you

More Related Content

What's hot (18)

PPT
The Pros and Cons of Big Data in an ePatient World
PYA, P.C.
 
KEY
Big Data Trends
David Feinleib
 
PDF
Big Data for Smart City
Koltiva
 
PDF
Fun Facts about Big Data
Crayon Data
 
PDF
7 Big Facts About Data-Driven Innovation
U.S. Chamber of Commerce
 
PDF
Big Data & Analytics (Conceptual and Practical Introduction)
Yaman Hajja, Ph.D.
 
PPTX
A Big Data Timeline
Big Cloud
 
PPTX
big data
sai praneeth
 
PDF
Towards a big data roadmap for europe
BIG Project
 
PDF
Big data overview external
Brett Colbert
 
PPTX
Data Mining With Big Data
Muhammad Rumman Islam Nur
 
PPTX
Big data characteristics, value chain and challenges
Musfiqur Rahman
 
PPTX
Big data introduction
vikas samant
 
PPTX
BIG DATA(PPT)
josnapv
 
PDF
P02 | Big Data | Anurag Gupta | BCA
ANURAGGUPTA570
 
PPT
Data mining with big data
Sandip Tipayle Patil
 
The Pros and Cons of Big Data in an ePatient World
PYA, P.C.
 
Big Data Trends
David Feinleib
 
Big Data for Smart City
Koltiva
 
Fun Facts about Big Data
Crayon Data
 
7 Big Facts About Data-Driven Innovation
U.S. Chamber of Commerce
 
Big Data & Analytics (Conceptual and Practical Introduction)
Yaman Hajja, Ph.D.
 
A Big Data Timeline
Big Cloud
 
big data
sai praneeth
 
Towards a big data roadmap for europe
BIG Project
 
Big data overview external
Brett Colbert
 
Data Mining With Big Data
Muhammad Rumman Islam Nur
 
Big data characteristics, value chain and challenges
Musfiqur Rahman
 
Big data introduction
vikas samant
 
BIG DATA(PPT)
josnapv
 
P02 | Big Data | Anurag Gupta | BCA
ANURAGGUPTA570
 
Data mining with big data
Sandip Tipayle Patil
 

Similar to Bigdata the technological renaissance (20)

PPTX
Bigdata " new level"
Vamshikrishna Goud
 
PPTX
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Hritika Raj
 
PPTX
Big data
Mahmudul Alam
 
PPTX
Data mining with big data implementation
Sandip Tipayle Patil
 
PPT
Big data and Internet
Sanoj Kumar
 
PDF
Big data analytics with Apache Hadoop
Suman Saurabh
 
PPTX
Understanding big data
Praneet Samaiya
 
PPTX
Big data Ppt
Prashant Navatre
 
PPT
130214 copy
Arpit Arora
 
PPTX
Big Data World
Hossein Zahed
 
PDF
Big Data Analytics Orientation. .pdf
080msdsa024yatru
 
PPTX
Big data Analytics
Guduru Lakshmi Kiranmai
 
PPT
Research issues in the big data and its Challenges
Kathirvel Ayyaswamy
 
DOCX
Introduction to big data – convergences.
saranya270513
 
PPTX
big data.pptx
ParasSundriyal2
 
PDF
Big Data Analytics Introduction chapter.pdf
NikulZinzuvadiya
 
DOCX
Big data lecture notes
Mohit Saini
 
PPTX
Foundations of Big Data: Concepts, Techniques, and Applications
hoisala6sludger
 
PDF
Big data and analytics
Bohitesh Misra, PMP
 
PPTX
Big data
mmuthuraj
 
Bigdata " new level"
Vamshikrishna Goud
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Hritika Raj
 
Big data
Mahmudul Alam
 
Data mining with big data implementation
Sandip Tipayle Patil
 
Big data and Internet
Sanoj Kumar
 
Big data analytics with Apache Hadoop
Suman Saurabh
 
Understanding big data
Praneet Samaiya
 
Big data Ppt
Prashant Navatre
 
130214 copy
Arpit Arora
 
Big Data World
Hossein Zahed
 
Big Data Analytics Orientation. .pdf
080msdsa024yatru
 
Big data Analytics
Guduru Lakshmi Kiranmai
 
Research issues in the big data and its Challenges
Kathirvel Ayyaswamy
 
Introduction to big data – convergences.
saranya270513
 
big data.pptx
ParasSundriyal2
 
Big Data Analytics Introduction chapter.pdf
NikulZinzuvadiya
 
Big data lecture notes
Mohit Saini
 
Foundations of Big Data: Concepts, Techniques, and Applications
hoisala6sludger
 
Big data and analytics
Bohitesh Misra, PMP
 
Big data
mmuthuraj
 
Ad

More from RituBhargava7 (9)

PPTX
Client server architecture
RituBhargava7
 
PPTX
File organization
RituBhargava7
 
PDF
Data Models
RituBhargava7
 
PDF
Data models
RituBhargava7
 
PPTX
Database abstraction
RituBhargava7
 
PPTX
Open Source Concepts
RituBhargava7
 
PDF
Role of a DBA
RituBhargava7
 
PPTX
File systems versus a dbms
RituBhargava7
 
PPTX
Database tachnologies
RituBhargava7
 
Client server architecture
RituBhargava7
 
File organization
RituBhargava7
 
Data Models
RituBhargava7
 
Data models
RituBhargava7
 
Database abstraction
RituBhargava7
 
Open Source Concepts
RituBhargava7
 
Role of a DBA
RituBhargava7
 
File systems versus a dbms
RituBhargava7
 
Database tachnologies
RituBhargava7
 
Ad

Recently uploaded (20)

PPTX
quantum computing transition from classical mechanics.pptx
gvlbcy
 
PDF
Biodegradable Plastics: Innovations and Market Potential (www.kiu.ac.ug)
publication11
 
PDF
Air -Powered Car PPT by ER. SHRESTH SUDHIR KOKNE.pdf
SHRESTHKOKNE
 
PPTX
Chapter_Seven_Construction_Reliability_Elective_III_Msc CM
SubashKumarBhattarai
 
PDF
Zero carbon Building Design Guidelines V4
BassemOsman1
 
PPTX
Ground improvement techniques-DEWATERING
DivakarSai4
 
PDF
STUDY OF NOVEL CHANNEL MATERIALS USING III-V COMPOUNDS WITH VARIOUS GATE DIEL...
ijoejnl
 
PDF
Packaging Tips for Stainless Steel Tubes and Pipes
heavymetalsandtubes
 
PDF
Jual GPS Geodetik CHCNAV i93 IMU-RTK Lanjutan dengan Survei Visual
Budi Minds
 
DOCX
SAR - EEEfdfdsdasdsdasdasdasdasdasdasdasda.docx
Kanimozhi676285
 
PPTX
Module2 Data Base Design- ER and NF.pptx
gomathisankariv2
 
PPTX
Introduction to Fluid and Thermal Engineering
Avesahemad Husainy
 
PPTX
cybersecurityandthe importance of the that
JayachanduHNJc
 
PPTX
Water resources Engineering GIS KRT.pptx
Krunal Thanki
 
PPTX
Information Retrieval and Extraction - Module 7
premSankar19
 
PDF
勉強会資料_An Image is Worth More Than 16x16 Patches
NABLAS株式会社
 
PDF
4 Tier Teamcenter Installation part1.pdf
VnyKumar1
 
PDF
20ME702-Mechatronics-UNIT-1,UNIT-2,UNIT-3,UNIT-4,UNIT-5, 2025-2026
Mohanumar S
 
PPTX
Precedence and Associativity in C prog. language
Mahendra Dheer
 
PDF
CAD-CAM U-1 Combined Notes_57761226_2025_04_22_14_40.pdf
shailendrapratap2002
 
quantum computing transition from classical mechanics.pptx
gvlbcy
 
Biodegradable Plastics: Innovations and Market Potential (www.kiu.ac.ug)
publication11
 
Air -Powered Car PPT by ER. SHRESTH SUDHIR KOKNE.pdf
SHRESTHKOKNE
 
Chapter_Seven_Construction_Reliability_Elective_III_Msc CM
SubashKumarBhattarai
 
Zero carbon Building Design Guidelines V4
BassemOsman1
 
Ground improvement techniques-DEWATERING
DivakarSai4
 
STUDY OF NOVEL CHANNEL MATERIALS USING III-V COMPOUNDS WITH VARIOUS GATE DIEL...
ijoejnl
 
Packaging Tips for Stainless Steel Tubes and Pipes
heavymetalsandtubes
 
Jual GPS Geodetik CHCNAV i93 IMU-RTK Lanjutan dengan Survei Visual
Budi Minds
 
SAR - EEEfdfdsdasdsdasdasdasdasdasdasdasda.docx
Kanimozhi676285
 
Module2 Data Base Design- ER and NF.pptx
gomathisankariv2
 
Introduction to Fluid and Thermal Engineering
Avesahemad Husainy
 
cybersecurityandthe importance of the that
JayachanduHNJc
 
Water resources Engineering GIS KRT.pptx
Krunal Thanki
 
Information Retrieval and Extraction - Module 7
premSankar19
 
勉強会資料_An Image is Worth More Than 16x16 Patches
NABLAS株式会社
 
4 Tier Teamcenter Installation part1.pdf
VnyKumar1
 
20ME702-Mechatronics-UNIT-1,UNIT-2,UNIT-3,UNIT-4,UNIT-5, 2025-2026
Mohanumar S
 
Precedence and Associativity in C prog. language
Mahendra Dheer
 
CAD-CAM U-1 Combined Notes_57761226_2025_04_22_14_40.pdf
shailendrapratap2002
 

Bigdata the technological renaissance

  • 1. BIGDATA The Technological Renaissance Dr. Ritu Bhargava Department Of Computer Science Sophia Girls’ College, Ajmer(Autonomous)
  • 2. What is Big Data? According to Gartner, information becomes big data when the volume can no longer be managed with normal database tools.
  • 3. DEFINATION Big data is high-volume, high- velocity, and high-variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision- making.
  • 5. 5 V’s Of Big Data » Volume: Raw Data » Velocity: Change over time » Variety: Data types » Veracity:Data Quality » Value: Information for Decision Making.
  • 6. DATA IS EVERYWHERE » The digital universe will grow from 3.2 zettabytes to 40 zettabytes in only six years. » Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone. » This data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and cell phone GPS signals .
  • 7. Origins of Big Data Infrastructure ● The value generated by a social network is proportional to the number of contacts between users of the social network, rather than the number of users. According to Metcalfe’s Law[3], and its variants, the number of contacts for N users is proportional to N*logN. Thus, the growth of contacts, and therefore the interactions within a social network, which results in data generation, is nonlinear with respect to number of users. As the world gets more connected, one can expect the number of interactions to grow, resulting in even more accelerated data growth.
  • 8. RECENT STUDY Google’s search index exploded from 26 Million pages in 1998, to more than 1 Trillion in less than a decade, this content was “multi-structured”, consisting of natural language text, images, video, geo-spatial, and even renderings of structured data. Google had to develop , the Google File System (GFS), and MapReduce programming framework. These two publications became the blueprint for Apache Hadoop, an open source framework that has become a de facto standard for big data platforms deployed today.
  • 9. Apache Hadoop Yahoo adopted Apache Hadoop in January 2006, and made significant contributions to make it a scalable and stable platform. Today, Yahoo has the largest footprint of Apache Hadoop, running more than 45,000 servers managing more than 370 Petabytes of data with Hadoop. Being an open source system, licensed under the liberal Apache Software License, governed by the Apache Software Foundation. The scalability and flexibility of Apache Hadoop prompted growing Internet companies such as Facebook, Twitter, and LinkedIn to adopt it for their data infrastructure.
  • 10. Industrial Internet: The Next Frontier The Big Data use-cases today are analysing customer behaviour, their buying patterns, their likes and dislikes as expressed in social media,their clickstreams and location information from mobile devices, machine-generated data could be the next frontier for Big Data systems. For example, in an automobile , thousands of signals being captured by 70+ sensors that generate more than 25 gigabytes of data every hour, and processed by 70 on-board computers . While most of this data is transient, and needs to be acted upon in real-time, recognizing patterns within the data to improve safety and usability of the automobile implies aggregating and analysing it offline.
  • 11. Facts! CEO » Zuckerberg noted that 1 billion pieces of content are shared via Facebook’s Open Graph daily . » Facebook puts up over 10 million photographs every hour and around 3 billion ‘like’ buttons are pushed everyday . » Google process more than 24 petabytes of data every day . » 48 hours of video are uploaded to YouTube every minute, resulting in nearly 8 years of content every day . » 70% of data is created by individuals – but enterprises are responsible for storing and managing 80% of it . » Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone.
  • 12. Drivers and Opportunities » Real-time prediction. » Increase operational and supply chain efficiencies » Deep insights into customer behaviour based on pattern and purchase analysis » Information aggregation » Better and more scientific customer segmentation for targeted marketing and product offering. » Improve productivity and innovation » McKinsey predicts an increase in job opportunities ranging from 140K to 190K » Uncover hidden patterns and rapidly respond to changing scenarios. » Multi-channel and multi-dimensional information aggregation » Data convergence
  • 19. Market Opportunity Big Data offer bigger opportunities. Here is a snapshot of some of the predictions done by market research firms IDC predicts the Big Market to grow to $16.9 Billion by 2019 Digital reasoning estimates that Big Data market would be worth $48.3 billion in 2019
  • 20. Applications » Better financial data management » Investment banking using aggregated information from various sources likes financial forecasting, asset pricing and portfolio management. » More accurate pricing adjustments based on vast amount of real-time data » Stock advises based on huge amount of stock data analysis, unstructured data like social media content etc. » Customer segmentation based on previous transactions and profile information » Analysis of purchase patterns and tailor made product offerings » Unstructured data analysis from social media, multi-media to understand the tastes, preferences, and customer patterns and do sentiment analysis » Targeted marketing based on user segmentation