0% found this document useful (0 votes)
63 views21 pages

Big Data Intro

Big data refers to large data sets that originate from various sources and are difficult to process using traditional data processing approaches. It requires specialized technologies and techniques to capture, store, analyze, and visualize such large and complex data sets. Big data comes in structured, unstructured, and semi-structured forms and is characterized by its volume, velocity, and variety. Proper structuring of big data allows organizations to gain customized insights from user behaviors and preferences to make personalized recommendations.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
63 views21 pages

Big Data Intro

Big data refers to large data sets that originate from various sources and are difficult to process using traditional data processing approaches. It requires specialized technologies and techniques to capture, store, analyze, and visualize such large and complex data sets. Big data comes in structured, unstructured, and semi-structured forms and is characterized by its volume, velocity, and variety. Proper structuring of big data allows organizations to gain customized insights from user behaviors and preferences to make personalized recommendations.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 21

BIG DATA

ANDHRA LOYOLA
COLLEGE
BIG DATA

› Big Data is a field dedicated to the analysis,


processing, and storage of large collections of
data sets that frequently originate from disparate
sources.
BIG DATA

› It is required when traditional technologies and


techniques are insufficient.

› DataficationCapturing/Collecting Big Data


BIG DATA - Sources

› Social Data  FB, Twitter, Instagram, LinkedIn

› Machine Data  RFID, Sensors, GPS

› Transactional Data  Amazon, Flipkart, e-Bay


Structuring Big Data
› Arrangement of the available data in a manner that is easy to study,
analyze and derive conclusions from it.

› Todays Information Processing Systems can analyze and structure a large


amount of data specially for you on the basis of your interests and
search criteria.

› It helps in understanding user behaviours, requirements and preferences


to make personalized recommendations for every individual.
Structuring Big Data

 Ex: Recommended list of products based on earlier


purchases

Big Data can be useful for structuring the data and


presenting a specially customized recommendation set for
every user.
Types of Big Data

› 1. Structured Data

› 2. UnStructured Data

› 3. Semi-Structured Data

› *Meta Data
Types of Big Data
› 1. Structured Data:
› Conforms to a data model or schema.

› It is often stored in a tabular form.

› Makes it easier for any program to sort, read and process the data.

› It is most often stored in a relational database

› ERP and CRM systems


Types of Big Data
› 1. Structured Data:
› Represented using the following figure:

› Ex; Banking transactions, invoices, and customer records.


Types of Big Data
› 2. UnStructured Data:
› Does NOT Conform to any data model

› It has a faster growth rate

› Ex: Some common types of Un-Structured data:


Types of Big Data
› 2. UnStructured Data:

› This form of data is either textual or binary

› Texual may contain the contents of various tweets or


blog postings.

› Binary  may be the media files that contain image,


audio or video data
Types of Big Data
› 2. UnStructured Data:

› This form of data is either textual or binary

› Stored in BLOB

› NoSQL
Types of Big Data
› 3. Semi-Structured Data:
› It has a defined level of structure and consistency
› A semi-structured data is hierarchical or graph-based.
› For example stored in XML and JSON files

› EDI Files, Spreadsheets


Types of Big Data
› 4. Meta Data

› Provides information about a dataset’s characteristics and


structure

› Important for Semi, Unstructured data processing.


Elements of Big Data
› The Five Vs of Big Data

For a dataset to be considered Big Data:


1. Volume
2. Velocity
3. Variety
4. Veracity
5. Value
Elements of Big Data
› The Five Vs of Big Data
1. Volume :

Volume refers to the scale (amount) of data

generated each second

from social media, smart phones, cars, credit cards,

M2M sensors, photographs, video, etc.


Elements of Big Data
› The Five Vs of Big Data
2. Velocity :

In Big Data environments, Velocity refers to the

speed at which vast amounts of data are being

generated, collected and analyzed.


Elements of Big Data
› The Five Vs of Big Data
2. Velocity :

In Big Data environments, Velocity refers to the

speed at which vast amounts of data are being

generated, collected and analyzed.


Elements of Big Data
› The Five Vs of Big Data
2. Velocity :

Figure: Examples of High Velocity Big data sets


THANK YOU

You might also like