
Big Data

Big data is large, diverse data that requires new techniques and technologies to manage and analyze. It is characterized by high volume, variety, and velocity. The volume of data is growing exponentially and was estimated to increase from 0.8 zettabytes in 2009 to 35 zettabytes by 2020. Big data comes from a variety of sources and formats, including text, images, videos, sensor data, and social media. It is also being generated faster than ever from sources like mobile devices, scientific instruments, and the Internet of Things. Analyzing big data in a timely manner allows for real-time insights and decisions that were not previously possible.

Uploaded by

k_kara

Big Data Definition

No single standard definition



Big Data is data whose scale, diversity, and complexity
require new architecture, techniques, algorithms, and
analytics to manage it and extract value and hidden
knowledge from it

Characteristics of Big Data: 1-Scale (Volume)
Data Volume
44x increase from 2009 to 2020
From 0.8 zettabytes to 35 zettabytes
Data volume is increasing exponentially
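
As a quick check on the figures above, the short Python sketch below recomputes the overall growth multiple and the compound annual growth rate it implies. The only inputs are the slide's own numbers (0.8 ZB in 2009, 35 ZB in 2020); the assumption that growth compounds evenly each year is ours, made purely for illustration.

```python
# Figures taken from the slide: 0.8 ZB in 2009, 35 ZB estimated for 2020.
start_zb = 0.8
end_zb = 35.0
years = 2020 - 2009

# Total multiple over the whole period (the slide rounds this to 44x).
growth_factor = end_zb / start_zb

# Implied compound annual growth rate, assuming steady year-on-year growth.
annual_rate = growth_factor ** (1 / years) - 1

print(f"Total growth: {growth_factor:.2f}x")
print(f"Implied annual growth: {annual_rate:.1%}")
```

Under that even-compounding assumption, the estimate works out to data volume growing by roughly 41% every year.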

Characteristics of Big Data: Complexity (Variety)
Various formats, types, and structures
Text, numerical, images, audio, video,
sequences, time series, social media data,
multi-dim arrays, etc
Static data vs. streaming data
A single application can be
generating/collecting many types of data


To extract knowledge, all these types
of data need to be linked together
Characteristics of Big Data: Speed (Velocity)
Data is being generated fast and needs to be processed fast
Online Data Analytics
Late decisions mean missed opportunities
Examples
E-Promotions: Based on your current location, your purchase history, and what you
like, send promotions right now for the store next to you

Healthcare monitoring: sensors monitor your activities and body; any
abnormal measurements require immediate reaction
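
The healthcare example can be illustrated with a minimal Python sketch that checks each reading as it arrives instead of waiting for a batch. Everything in it is a hypothetical illustration and not something the slides specify: the function name `detect_anomalies`, the window size, the three-standard-deviation threshold, and the sample heart-rate values.

```python
from collections import deque
from statistics import mean, stdev


def detect_anomalies(readings, window=5, threshold=3.0):
    """Yield (index, value) for readings far from the recent window's mean.

    Hypothetical rule for illustration: a reading is abnormal if it lies more
    than `threshold` standard deviations from the mean of the last `window`
    normal readings.
    """
    recent = deque(maxlen=window)
    for i, value in enumerate(readings):
        if len(recent) == window:
            mu = mean(recent)
            sigma = stdev(recent)
            if sigma > 0 and abs(value - mu) > threshold * sigma:
                yield i, value
                continue  # keep the outlier out of the baseline window
        recent.append(value)


# Made-up heart-rate stream with one obvious spike.
heart_rate = [72, 74, 71, 73, 75, 72, 140, 74, 73]
alerts = list(detect_anomalies(heart_rate))
print(alerts)
```

Because the function is a generator, an alert is produced as soon as the abnormal reading is seen, which is the point of the velocity characteristic: the decision is made while the data is still fresh.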

Big Data: 3Vs
Some Make it 4Vs
Harnessing Big Data
OLTP: Online Transaction
Processing (DBMSs)
OLAP: Online Analytical
Processing (Data Warehousing)
RTAP: Real-Time Analytics
Processing (Big Data
Architecture & technology)

Who's Generating Big Data?
Social media and networks (all of us are generating data)
Scientific instruments (collecting all sorts of data)
Mobile devices (tracking all objects all the time)
Sensor technology and networks (measuring all kinds of data)
Progress and innovation are no longer hindered by the ability to collect data

But by the ability to manage, analyze, summarize, visualize, and discover knowledge from the collected
data in a timely manner and in a scalable fashion
The Model Has Changed
The Model of Generating/Consuming Data has Changed

Old Model: Few companies are generating data, all others are consuming data
New Model: all of us are generating data, and all of us are consuming data
What's Driving Big Data?
- Ad-hoc querying and reporting
- Data mining techniques
- Structured data, typical sources
- Small to mid-size datasets
- Optimizations and predictive analytics
- Complex statistical analysis
- All types of data, and many sources
- Very large datasets
- More real-time in nature
Value of Big Data Analytics
Big data is more real-time in nature than traditional DW applications
Traditional DW architectures (e.g. Exadata, Teradata) are not well-suited for big data apps
Shared-nothing, massively parallel processing, scale-out architectures are well-suited for big data apps
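
To make the shared-nothing idea concrete, here is a minimal single-process Python sketch. It is an illustration under stated assumptions, not a description of any particular product: the helper names (`partition`, `local_aggregate`, `scale_out_sum`), the CRC32 hash partitioning, and the sample sales records are all hypothetical. Each "node" is just a list here; in a real system each would be a separate machine with its own CPU, memory, and disk, working in parallel.

```python
from collections import Counter
import zlib


def partition(records, num_nodes):
    """Assign each (key, value) record to exactly one node by hashing its key."""
    nodes = [[] for _ in range(num_nodes)]
    for key, value in records:
        node_id = zlib.crc32(key.encode()) % num_nodes
        nodes[node_id].append((key, value))
    return nodes


def local_aggregate(node_records):
    """Work a single node performs using only its own partition of the data."""
    totals = Counter()
    for key, value in node_records:
        totals[key] += value
    return totals


def scale_out_sum(records, num_nodes):
    """Sum values per key by aggregating per node, then merging the partials."""
    # Simulated sequentially here; real nodes would run this step in parallel.
    partials = [local_aggregate(n) for n in partition(records, num_nodes)]
    merged = Counter()
    for partial in partials:
        merged.update(partial)  # Counter.update adds counts key by key
    return dict(merged)


# Made-up sample records: (store, sale amount).
sales = [("store_a", 10), ("store_b", 5), ("store_a", 7), ("store_c", 3)]
result = scale_out_sum(sales, num_nodes=3)
print(result)
```

Since no node needs another node's data to do its local work, adding nodes changes only how the records are spread out, not the answer. That independence is what lets this style of architecture scale out.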

Challenges in Handling Big Data
The bottleneck is in technology
New architectures, algorithms, and techniques are needed
Also in technical skills
Experts are needed in using the new technology and dealing with big data

Thank You