This document discusses big data, providing examples and characteristics. It defines big data as data that is huge in volume and growing exponentially over time. Examples given include trade data from the New York Stock Exchange and data from Facebook. The characteristics of big data are described as volume, variety, velocity, and variability. Advantages of processing big data are also summarized.
Topic: Big Data
Submitted By: Muhammad Saleem (Math211101090)
Submitted To: Dr Saima Noreen Khosa
Dated: 1/3/2024
Big Data is a collection of data that is huge in volume and growing exponentially with time. It is data of such large size and complexity that no traditional data management tool can store or process it efficiently.

Following are some examples of Big Data:

The New York Stock Exchange generates about one terabyte of new trade data per day.

Social Media: statistics show that more than 500 terabytes of new data are ingested into the databases of the social media site Facebook every day. This data is generated mainly by photo and video uploads, message exchanges, comments, etc.
Types Of Big Data
1. Structured
2. Unstructured
3. Semi-structured

Structured
Any data that can be stored, accessed and processed in a fixed format is termed 'structured' data. Over time, computer science has achieved great success in developing techniques for working with such data (where the format is well known in advance) and in deriving value from it. Nowadays, however, issues arise when such data grows to a huge extent, with typical sizes in the range of multiple zettabytes (10^21 bytes = 1 zettabyte = 1 billion terabytes).

Examples Of Structured Data
An 'Employee' table in a database is an example of structured data.

Unstructured
Any data with an unknown form or structure is classified as unstructured data. In addition to its huge size, unstructured data poses multiple challenges when it comes to processing it to derive value. A typical example of unstructured data is a heterogeneous data source containing a combination of simple text files, images, videos, etc. Today, organizations have a wealth of data available to them but, unfortunately, do not know how to derive value from it, since this data is in its raw, unstructured form.

Examples Of Unstructured Data
The output returned by 'Google Search'.

Semi-structured
Semi-structured data can contain both forms of data. It appears structured in form but is not actually defined by, for example, a table definition in a relational DBMS.

Examples Of Semi-structured Data
Personal data stored in an XML file.

Data Growth over the years
Note that web application data, which is unstructured, consists of log files, transaction history files, etc. OLTP systems are built to work with structured data, where data is stored in relations (tables).
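The three types above can be illustrated with a short sketch using only Python's standard library. The record fields, file contents and names here are hypothetical, chosen purely for illustration:

```python
import csv
import io
import xml.etree.ElementTree as ET

# Structured data: a fixed, known format, like rows of an 'Employee'
# table (represented here as CSV text with a fixed set of columns).
csv_text = "id,name,department\n1,Ali,Finance\n2,Sara,IT\n"
employees = list(csv.DictReader(io.StringIO(csv_text)))
print(employees[0]["name"])  # every row has the same known fields

# Semi-structured data: tagged and self-describing, but not bound to
# a table definition in a relational DBMS (XML in this example).
xml_text = "<person><name>Ali</name><email>ali@example.com</email></person>"
person = ET.fromstring(xml_text)
print(person.find("name").text)

# Unstructured data: raw text with no predefined schema; any meaning
# must be extracted, e.g. by a simple keyword search.
raw_text = "Meeting notes: discuss Q3 targets with the finance team."
print("finance" in raw_text.lower())
```

The key difference visible in the sketch: the structured rows can be queried by known column names, the XML record carries its own tags but no enforced schema, and the raw text offers no structure at all until processing imposes one.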
Characteristics Of Big Data
Big data can be described by the following characteristics: volume, variety, velocity and variability.

(i) Volume – The name Big Data itself relates to an enormous size. The size of data plays a crucial role in determining its value; whether particular data can actually be considered Big Data depends on its volume.

(ii) Variety – Variety refers to heterogeneous sources and the nature of data, both structured and unstructured. In earlier days, spreadsheets and databases were the only sources of data considered by most applications. Nowadays, data in the form of emails, photos, videos, monitoring devices, PDFs, audio, etc. is also considered in analysis applications. This variety of unstructured data poses certain issues for storing, mining and analyzing data.

(iii) Velocity – 'Velocity' refers to the speed of data generation. How fast data is generated and processed to meet demands determines its real potential. Big Data velocity deals with the speed at which data flows in from sources such as business processes, application logs, networks, social media sites, sensors, mobile devices, etc.

(iv) Variability – This refers to the inconsistency that data can show at times, hampering the process of handling and managing the data effectively.

Advantages Of Big Data Processing
The ability to process Big Data in a DBMS brings multiple benefits, such as:

Businesses can utilize outside intelligence when making decisions. Access to social data from search engines and sites like Facebook and Twitter enables organizations to fine-tune their business strategies.

Improved customer service. Traditional customer feedback systems are being replaced by new systems designed with Big Data technologies. In these new systems, Big Data and natural language processing technologies are used to read and evaluate consumer responses.

Early identification of risk to products or services, if any.

Better operational efficiency. Big Data technologies can be used to create a staging area or landing zone for new data before deciding which data should be moved to the data warehouse. In addition, such integration of Big Data technologies with a data warehouse helps an organization offload infrequently accessed data.