0% found this document useful (0 votes)

76 views

Chapter 01: Types of Digital Data

The document discusses different types of digital data, including structured, semi-structured, and unstructured data. Structured data is organized in rows and columns like in databases and spreadsheets. Unstructured data lacks a predefined structure or schema and includes text, images, videos, and audio files. Semi-structured data has some structure but does not conform fully to predefined data models like structured data. The majority of organizational data is unstructured. Challenges with unstructured data include storage requirements, scalability, indexing, searching, and interpretation.

Uploaded by

Shivananda V Seeri

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

76 views

Chapter 01: Types of Digital Data

Uploaded by

Shivananda V Seeri

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 80

CHAPTER 01: TYPES OF DIGITAL DATA

Data
• Any data that can be processed by digital
computer and stored in the sequences of 0's and
1's (Binary language) is knowns as digital data.
• Whenever you send an email, read a social media
post, or take pictures with your digital camera,
you are working with digital data.
• In general, data can be any character, text,
numbers, voice messages, SMS, WhatsApp
messages, pictures, sound, or video.
Data
• Byte is the basic unit of information
in computer storage and processing, and is
composed of eight bits; a kilobyte is 1,000 bytes;
one megabyte is 1,000 kilobytes . (GB, TB, PB, EB,
ZB, YB)
• Digitizing is the process of converting information
into digital form and is necessary for a computer to
be able to process and store the information.
Data
• It is an invaluable asset of any enterprise (big or small).
• Data is present internal to the enterprise and also exists
outside the firewalls of the enterprise.
• Data may be in homogeneous or heterogeneous.
• Need of the hour is to
– Understand, manage, process,
– and take the data for analysis
– to draw valuable insights.
Types of digital data
1. Structured Data : data stored in the form of
rows and columns (databases, Excel)
2. Un-structured Data: No pre-defined schema
(PPTs, images, Videos, pdfs)
3. Semi-structured Data: Hybrid schema (JSON,
HTML, XML, Email, and so on),
Distribution of digital data (in %)
(by Gartner)

10
10 Unstructured

Semi-structured
80 Structured
Structured Data
• Data which is in an organized form (In rows & columns).
• Computer programs can use this data easily.
• Relationships exists between entities of data.
• Example
– Data stored in databases
– ERP
– CRM
– DW
– Data Cube
Structured Data
• The data conforms to a pre-defined schema or structure
is known as structured data.
• The data can be processed, stored, and retrieved in a
fixed format. This data can be processed easily by
programs.
• Conforms to a relational data model.
• Structured data is organized in semantic chunks/entities
with similar entities grouped together to form
relations/tables.
structured Data
• Descriptions for all entities in a group
• Have the same defined format
• Have a predefined length
• Follow the same order.
Example
Sources of Structured Data

Databases

Structured Excel
Data

OLTP
systems
Ease with structured data
Indexing/ Transaction
Searching processing
(ACID)
Ease with Scalability
structured
data
Security

Insert/Update/
delete
Database (RDBMS)
• Oracle Corp. – Oracle
• IBM – DB2, IBM-Informix
• Microsoft – SQL
• EMC – Greenplum
• Teradata – Teradata
• Open source- MySQL, PostgresSQL
• Sqlite
• Sequel Pro
• Amazon Aurora
• SAP SQL Anywhere, SAP IQ (Sybase)
Semi-structured Data
• Data which does not conform to a data model but has
some structure.
• Computer programs can not use this data easily.
• Example
– emails
– XML
– HTML
– JSON, and so on.
Semi-structured data (SSD)
• It is referred to as self describing structure.
• It is a form of structured data that does not
conform with the formal structure of data models
associated with relational databases or other
forms of data tables.
• It uses metadata and tags to provide semantic
information.
Characteristics of semi-structured data
(SSD)
• Does not conform to a data model
• Cannot be stored in the form of rows and columns
as in a database.
• The tags and elements are used to describe data.
• Attributes in a group may not be the same.
• Similar entities are grouped.
• Size of the same attributes in a group may differ
• Type of same attributes in group may differ.
• Evolving Schema
• Schema and data are tightly coupled.
Example (Names & Emails)
• One way is:
Name: Raju Patil
Email : [email protected], [email protected]

• Another way is:

First Name: Raju
Last Name :Patil
Email : [email protected]
Sources of SSD

• Email
• XML
• TCP/IP
• Zipped files
• Mark-up languages
• Integration of data from heterogeneous sources.
Example: Email format

To: <Name>
From: <Name>
Subject: <Text>
CC: <Name>
Body: <Text, Graphics, Images, etc.><Name>
ABC Healthcare Blood Test Report
<> ----
Date
<> -----
Department
<> <>
Patient Name Attending Doctor
<> <>
Hemoglobin Patient Age
content
<>
RBC count
<>
WBC count
<>
Platelet count
Diagnosis <notes>
Conclusion <notes>
XML & JSON
Integration of data from heterogeneous
sources
User

Mediator : Uniform access to multiple data sources

Structured Legacy
RDBMS OODBMS
file system
Getting to know Unstructured data
• Over the past few days, Dr. Ben and Dr. Stanley
had been exchanging long emails about a
particular case of gastro-intestinal problem.
• Email contains procedure practiced by Dr. Stanley,
about combination of drugs that has successfully
cured gastro-intestinal disorders in patients.
• Dr. Mark has a patient in the “GoodLife”
emergency unit with quite similar case of gastro-
intestinal disorder.
Unstructured Data
• Unstructured data refers to the data that lacks any
specific form or structure.
• This makes it very difficult and time-consuming to
process and analyze unstructured data.
• Data which does not conform to any data model is USD.
• Computer programs can not use this data directly.
• About 80-90% data of an organization is in this format.
• An enormous amount of knowledge is hidden in this
data.
• Hence finding useful knowledge/insight from USD is very
crucial.
Unstructured Data
• Unstructured data is a generic label for describing data
that is not contained in a database or some other type
of data structure.
• Unstructured data can be textual or non-textual.
• Textual unstructured data is generated in media like
email messages, PowerPoint presentations, Word
documents, comments in social media, etc.
• Non-textual unstructured data is generated in media
like images, CCTV footage, audio files and video files.
• Anything in a non-database form is unstructured data.
Unstructured Data
• Two types:
1. Bitmap objects : image, video, or audio files
2. Textual objects : word, emails, ppts and so on.
Unstructured Data
• Example
– Memos, QR code (Quick Response), Blogs
– Chat rooms, Tweets, Comments, likes, tags
– PPTs, emoji's, emoticons (emotion icons)
– Images, log files, social media posts
– Videos, sensor data (raw), weather data
– Doc files, geospatial data, surveillance data
– Body of email , GPS data, sensor data, etc.
– WhatsApp messages, CCTV footage and so on.
Getting to know Unstructured data
Characteristics of Unstructured data

• This data cannot be stored in the form of rows

and columns as in a database and does not
conform to any data model.
• It is difficult to determine the meaning of the
data.
• It does not follow any rule or semantics, i.e. Not
in any particular format or sequence.
• Not easily usable by a program.
Sources of Unstructured data
• Web pages • Social media data
• Audio and Videos • White papers
• Images • Surveys
• Body of an email • SMS
• Word document • Free form text
• PPT and reports • Server Log files
• Chats and text messages • Product reviews
Web page is unstructured data

Multimedia Image

Web Page XML

Text
Database
Challenges
• Storage space: A lot of space is required to store USD.
• Scalability: As the data grows, scalability becomes an
issue and the cost of storing USD increases.
• Retrieve information: Difficult to retrieve required
information from USD
• Security: Ensuring security is difficult due to varied
sources of data. E.g. emails, web pages, etc.
• Indexing & searching: Very difficult and error-prone
as the structure of the USD is not clear.
Challenges
• Interpretation : USD is not easily interpreted by
conventional search algorithms.
• Classification : Different naming conventions
followed across the organization make it difficult to
classify data.
• Deriving meaning : Computer programs cannot
automatically derive meaning or structure from USD.
• File formats : Increasing number of file formats
makes it difficult to interpret data.
Portion of Unstructured data

USD
Dealing with USD
1. Data mining
2. Text mining /Text Analytics
3. NLP
4. Noisy text analytics
5. Manual tagging with meta data Possible
6. Part of speech tagging Solutions
7. UIMA
8. Web Scraping
Data Mining
• It is the computing process of discovering patterns
in large data sets involving methods at the
intersection of AI, machine learning &
DL, statistics, and database systems.
• Popular algorithms:
– Association rule mining (MBA)
– Regression Analysis (Y=mX+ c)
– Collaborative filtering
Collaborative filtering
• collaborative filtering uses similarities between users and
items simultaneously to provide recommendations.
• It is a method of making automatic predictions (filtering)
about the interests of a user by collecting preferences
or taste information from many users (collaborating).
• Collaborative filtering works on a fundamental
principle: you are likely to like what someone similar to
you likes.
Collaborative filtering
• Collaborative filtering (CF) is a technique commonly used
• Collaborative filtering (CF) is a technique used
by recommender systems to build personalized
recommendations on the Web.
• Companies that employ CF model include Amazon,
Facebook, Twitter, LinkedIn, Spotify, Google News,
Netflix, iTunes.
Collaborative filtering
Text analytics or text mining
• It is the process of converting
unstructured text data into meaningful data for
analysis, to measure customer opinions, product
reviews, feedback and sentimental analysis to
support fact based decision making.
• Uses many linguistic, statistical, and machine
learning techniques such as clustering, pattern
recognition, tagging, association analysis,
predictive analytics, etc.
Text analytics or text mining
• It helps organizations to find potentially valuable
business insights in corporate documents, customer
emails, call center logs, survey comments, social
network posts, medical records and other sources of
text-based data.
• Text mining capabilities are also being incorporated
into AI chatbots/virtual agents that companies deploy
to provide automated responses to customers as part
of their marketing, sales and customer service
operations.
Natural Language Processing (NLP)
• Natural language processing (NLP) is the ability of a
computer program to understand human language as
it is spoken. NLP is a component of artificial
intelligence (AI).
• It is a field of computer science, artificial
intelligence and computational linguistics concerned
with the interactions between computers and human
(natural) languages (HCI domain).
• NLP strives to build machines that understand and
respond to text or voice data.
Natural Language Processing (NLP)
Noisy text analytics
• It is the process of extracting structured or semi-
structured information from noisy unstructured text data
such as online chat, text messages, emails, message
boards, blogs, wikis, etc.
• The noisy unstructured data comprises one or more of
the followings:
– Spelling mistakes,
– Acronyms
– Non-standard words (HBD, K, GN, GM, VGM, etc.)
– Missing punctuations,
– Missing letters and so on.
Manual tagging with metadata
• It is the process of tagging manually with adequate
metadata to provide the semantics to understand
unstructured data.

Road Accident
Part of Speech Tagging
• It is also called as POS or POST or grammatical
tagging.
• It is the process of reading text and tagging each
word in the sentence as belonging to a particular
part of speech such as “noun”, “verb”, “adjective”,
“pronoun”, etc.

.
Unstructured Information
Management Architecture(UIMA)
• It is an open source platform from IBM, which
integrates different kinds of analysis engines to
provide a complete solution for knowledge
discovery from USD.
• It bridge the gap between structured and USD.
Uses of UIMA
• Used to convert unstructured data such as
repair logs and service notes
into relational tables.
• These tables can then be used
by automated tools to detect maintenance or
manufacturing problems.
Uses of UIMA
• Used in medical contexts to analyze clinical notes,
such as the Clinical Text Analysis and Knowledge
Extraction System ( Apache CTAKES).
• CTAKES is an open-source Natural Language
Processing (NLP) system that extracts clinical
information from electronic health/medical
record free-text (Users are free to type whatever
they want in any form).
UIMA block diagram
Analysis

Transformed into
Acquired from Subjected to
USD various semantic
sources analysis

Delivery

Structured
Query and Structured
information
presentation information
access

Users
Web Scraping
Big Data
• Big data is a term that describes large, hard-
to-manage volumes of data – both structured
and unstructured - none of traditional data
management tools can store it or process it
efficiently.
• experts now predict that 74 zettabytes of
data will be in existence by 2021.
Big Data
• Every day, we create 2.5 quintillion(1018)
bytes of data —90% of the data in the world
today has been created in the last two years
alone.
• This data comes from everywhere: sensors
used to gather climate information, posts to
social media sites, digital pictures and videos,
purchase transaction records, and cell phone
GPS signals, WhatsApp, IOT and so on.
Characteristics of Data
• Composition: Deals with structure of data, i.e.,
sources of data, the granularity(Ex. Postal
address), the types, nature of data (Static or real-
time).
• Condition: Deals with the state of data, that is,
“Can one use data as it is for analysis?” or “Does it
require cleansing for further enhancement and
enrichment?”.
Characteristics of Data
• Context: Deals with
– Where, this data has been generated?
– Why this data generated?
– How sensitive is this data?
– What are the events associated with this data?
– And so on.
Gartner
• Is a global research and advisory firm
providing insights, advice, and tools for
leaders in IT, Finance, HR, Customer Service
and Support
Big data definition- Gartner
• Big data is high-volume, high-velocity, and high-
variety information assets that demand cost
effective, innovative forms of information
processing for enhanced insight and decision
making.
• Cost effective and innovative forms of
information processing: Talks about embracing
new techniques and technologies to capture,
store, process, persevere, integrate and visualize
the big data(3vs).
Definition of Big data by Gartner
• Enhanced insight and decision making: Talks
about deriving deeper, richer, and meaningful
insights and then using these insights to make
faster and better decisions to gain business value
and thus a competitive edge.
Big data formula

Actionable Better
DATA Information
Intelligence Decisions

Enhanced
Business
Value
Challenges with Big Data
• Capture
• Storage (Solution: Cloud Computing)
• Curation ( Management of data + Data retention)
• Search
• Analysis
• Transfer
• Visualization
• Privacy violations
3 Vs
3 V’s of Big data
• The data that is big in Volume, Velocity and
Variety is known as big data.
Sources of big data
• Archives: Archives of scanned documents,
customer correspondence records, patient’s
health records, student’s admission records,
students’ assessment records and so on.
• Sensor data: Car sensors, smart electric meters,
office buildings, washing m/c, other electronic
appliances and so on.
• Machine log data: Event logs, application logs,
audit logs, server logs, etc.
Sources of big data
• Public web: Wikipedia, Weather, regulatory, census, etc.
• Data storage: File systems, SQL database, NoSQL
database (Mongo DB, Cassandra) and so on.
• Media: Audio, Video, image, etc.
• Docs: CSV, word docs, PDF, PPT, XLS, etc.
• Business Apps: ERP, CRM, HR, Google Docs, etc.
• Social media: Twitter blogs, Facebook, LinkedIn,
YouTube, Instagram, etc.
• IOT
Other characteristics of big data
• Veracity and Validity: Refers to the accuracy
(quality) and correctness of the data.
• Volatility: Deals with how long the data is valid?,
and how long should it be stored?. (OTP, Aadhar
No., PW)
• Variability: Data flows can be highly inconsistent
with periodic peaks. (In total 7V’s of big data)
Why Big data
More Data

More Accurate analysis

More confidence in decision making

Greater operational efficiency, cost reduction, time

reduction, new product development, optimized
offerings, etc.
Three reasons for leveraging big data

1. Competitive Advantage.
2. Decision making
3. To create new business value out of data.
Typical data warehouse Environment
Typical Hadoop Environment
• It is different from DW environment.
• Here data sources are web logs, images, audios,
videos, social media, doc files, pdfs, etc.
Hadoop Environment
Big data & DW coexistence
Big data & DW coexistence

Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet
Unit 1
No ratings yet
Unit 1
59 pages
Paper - II Web Technology
No ratings yet
Paper - II Web Technology
248 pages
PSA 5 Final
No ratings yet
PSA 5 Final
36 pages
Unit I: Web and Internet Technologies (15A05605) III B.Tech II Sem (CSE)
No ratings yet
Unit I: Web and Internet Technologies (15A05605) III B.Tech II Sem (CSE)
50 pages
Unix Course Material - Tata Elxsi
No ratings yet
Unix Course Material - Tata Elxsi
112 pages
Nta Ugc Net KEY TO SUCCESS in Computer Science
No ratings yet
Nta Ugc Net KEY TO SUCCESS in Computer Science
61 pages
Unit 1
No ratings yet
Unit 1
80 pages
Java Session Notes
No ratings yet
Java Session Notes
48 pages
Django: Writing Your First Django App, Part 2
No ratings yet
Django: Writing Your First Django App, Part 2
12 pages
The Complete Reference Java - Lecture Notes, Study Material and Important Questions, Answers
No ratings yet
The Complete Reference Java - Lecture Notes, Study Material and Important Questions, Answers
208 pages
Java
No ratings yet
Java
149 pages
Introduction To Algorithms
No ratings yet
Introduction To Algorithms
8 pages
I B.Sc.,Python Lab (2023-2024)
No ratings yet
I B.Sc.,Python Lab (2023-2024)
50 pages
OS06
No ratings yet
OS06
19 pages
OA Lab Manual Jul-Dec2018-19
No ratings yet
OA Lab Manual Jul-Dec2018-19
41 pages
Dbms Complete Lab Manual
No ratings yet
Dbms Complete Lab Manual
172 pages
Unit-1 Question Bank
No ratings yet
Unit-1 Question Bank
4 pages
Prog Lab-II-Data Structures Using C-Lab Manual
No ratings yet
Prog Lab-II-Data Structures Using C-Lab Manual
140 pages
Cs 72 Mobile Computing 2 Mark Questions Unit I 1) What Are The Categories of Mobile Services?
No ratings yet
Cs 72 Mobile Computing 2 Mark Questions Unit I 1) What Are The Categories of Mobile Services?
20 pages
6.Text Processing and Pattern Searching
No ratings yet
6.Text Processing and Pattern Searching
33 pages
02 Fundamentals of The Analysis of Algorithm Efficiency
No ratings yet
02 Fundamentals of The Analysis of Algorithm Efficiency
49 pages
Unit - Iv Software Quality Planning & Control
No ratings yet
Unit - Iv Software Quality Planning & Control
11 pages
Python Unit-4
No ratings yet
Python Unit-4
22 pages
HR Syl 5 Pages 2023-2024 20.09.2023
No ratings yet
HR Syl 5 Pages 2023-2024 20.09.2023
5 pages
IOT TITLES NEW-2024-2025
No ratings yet
IOT TITLES NEW-2024-2025
38 pages
Environmental Studies Notes
No ratings yet
Environmental Studies Notes
23 pages
Ques1. Explain UNIX System Architecture.: Ans. at The Center of The UNIX Onion Is Program
No ratings yet
Ques1. Explain UNIX System Architecture.: Ans. at The Center of The UNIX Onion Is Program
9 pages
23PCA11 Unit 1 Cloud Computing
No ratings yet
23PCA11 Unit 1 Cloud Computing
49 pages
Unix Command Unit 1
No ratings yet
Unix Command Unit 1
40 pages
4.Array Techniques
No ratings yet
4.Array Techniques
26 pages
Bca-601: Multimedia Concepts & Application
0% (1)
Bca-601: Multimedia Concepts & Application
4 pages
Daa Uniti
No ratings yet
Daa Uniti
41 pages
22CS302 - UNIT 1 To 3 - Material
No ratings yet
22CS302 - UNIT 1 To 3 - Material
93 pages
01-Introduction To Soft Computing PDF
100% (2)
01-Introduction To Soft Computing PDF
61 pages
Ads Lab Record
No ratings yet
Ads Lab Record
145 pages
DSA - Question Bank - 2023
No ratings yet
DSA - Question Bank - 2023
4 pages
PHP Basics
0% (1)
PHP Basics
37 pages
2.Fundamental Algorithms
No ratings yet
2.Fundamental Algorithms
20 pages
Prog 1: Write C++ Programs To Implement The Stack ADT Using An Array
No ratings yet
Prog 1: Write C++ Programs To Implement The Stack ADT Using An Array
47 pages
DWDM Unitwise Qns
No ratings yet
DWDM Unitwise Qns
3 pages
Q.1. Define Problem. What Are Steps in Problem Solving? Definition of Problem
100% (1)
Q.1. Define Problem. What Are Steps in Problem Solving? Definition of Problem
30 pages
PST Book - Unit 1 - 5
No ratings yet
PST Book - Unit 1 - 5
192 pages
Software Engineering SYIT SEM-IV
No ratings yet
Software Engineering SYIT SEM-IV
7 pages
Unit 1 Introduction To Database: Characteristics
No ratings yet
Unit 1 Introduction To Database: Characteristics
79 pages
Unix MCQs PDF
No ratings yet
Unix MCQs PDF
33 pages
Chapter-9 Creating A Django Based Basic Web Application
No ratings yet
Chapter-9 Creating A Django Based Basic Web Application
10 pages
Office Automation Tools Short Study Materials
No ratings yet
Office Automation Tools Short Study Materials
5 pages
BCA 403 (File & Data Structure)
100% (1)
BCA 403 (File & Data Structure)
94 pages
PST Unit 4
No ratings yet
PST Unit 4
10 pages
Distributed Operating System
100% (1)
Distributed Operating System
24 pages
Data Warehouse
No ratings yet
Data Warehouse
2 pages
UNIX Question Bank - Final PDF
No ratings yet
UNIX Question Bank - Final PDF
6 pages
M.C.A Data Structures Question Paper: Madras University
No ratings yet
M.C.A Data Structures Question Paper: Madras University
2 pages
Data Structures With Java Unit - 1
No ratings yet
Data Structures With Java Unit - 1
22 pages
Tower of Hanoi: Problem
No ratings yet
Tower of Hanoi: Problem
4 pages
2 Ip 12 Notes RDBMS 2022 PDF
No ratings yet
2 Ip 12 Notes RDBMS 2022 PDF
16 pages
Part - A (Short Answer Questions) : Unit - I
No ratings yet
Part - A (Short Answer Questions) : Unit - I
7 pages
Advanced Unix Programming
From Everand
Advanced Unix Programming
Prof. N. B Venkateswarlu
No ratings yet
Optimizing Hadoop for MapReduce
From Everand
Optimizing Hadoop for MapReduce
Khaled Tannir
No ratings yet
01.DBMS Environment
No ratings yet
01.DBMS Environment
15 pages
Sap Business Bydesign Implementation Process
No ratings yet
Sap Business Bydesign Implementation Process
2 pages
HDM - 4 Modelling Framework
No ratings yet
HDM - 4 Modelling Framework
9 pages
Test PDF
No ratings yet
Test PDF
2 pages
NAME: Sayali Milind Deo. SEAT NO.:704163 LAB1 DBMS (External Practical Exam)
No ratings yet
NAME: Sayali Milind Deo. SEAT NO.:704163 LAB1 DBMS (External Practical Exam)
4 pages
Batch Apex in Salesforce
No ratings yet
Batch Apex in Salesforce
3 pages
The Basic Objectives of The Database Ans.: A Database Is A Collection of Interrelated Data Stored With Minimum Redundancy To Serve
No ratings yet
The Basic Objectives of The Database Ans.: A Database Is A Collection of Interrelated Data Stored With Minimum Redundancy To Serve
2 pages
Quick Fields 8
No ratings yet
Quick Fields 8
11 pages
Structured Analysis Part 1
No ratings yet
Structured Analysis Part 1
19 pages
Data Warehousing and OLAP Technology
No ratings yet
Data Warehousing and OLAP Technology
12 pages
Lecture 2 Hci in The Software Process
No ratings yet
Lecture 2 Hci in The Software Process
4 pages
Introduction To Oracle: Lecturer: J. Mutai
No ratings yet
Introduction To Oracle: Lecturer: J. Mutai
12 pages
Store Name Store Owner Name Age Contact No. Street Barangay Town/City N O. Store Code
No ratings yet
Store Name Store Owner Name Age Contact No. Street Barangay Town/City N O. Store Code
101 pages
Reporting and Budgeting With SAP BPC
No ratings yet
Reporting and Budgeting With SAP BPC
31 pages
Testing PPT 3 Test Process
No ratings yet
Testing PPT 3 Test Process
9 pages
Maintenance Management System (CMMS) Pada
No ratings yet
Maintenance Management System (CMMS) Pada
6 pages
Big Data Pgdca
No ratings yet
Big Data Pgdca
23 pages
DBE Lab Experiment 5: Name: Implement Queries On Constraints
No ratings yet
DBE Lab Experiment 5: Name: Implement Queries On Constraints
11 pages
SharePoint Administration
No ratings yet
SharePoint Administration
6 pages
Question Type: True/False
No ratings yet
Question Type: True/False
28 pages
Instagram Data Analysis Using Panoply and Mode - by Ka Hou Sio - Towards Data Science
No ratings yet
Instagram Data Analysis Using Panoply and Mode - by Ka Hou Sio - Towards Data Science
25 pages
DBMS Architecture: Database Models
No ratings yet
DBMS Architecture: Database Models
6 pages
Data Scientist - Docx .2
No ratings yet
Data Scientist - Docx .2
10 pages
Lecture 03
No ratings yet
Lecture 03
14 pages
Chapters 15-17: Transaction Management: Transactions, Concurrency Control and Recovery
No ratings yet
Chapters 15-17: Transaction Management: Transactions, Concurrency Control and Recovery
67 pages
Sap History
No ratings yet
Sap History
1 page
JPA Mini Book
No ratings yet
JPA Mini Book
3 pages
Experiment 9: Bank Database Objective
No ratings yet
Experiment 9: Bank Database Objective
4 pages
Fdocuments - in - Data Mining MCQ
50% (2)
Fdocuments - in - Data Mining MCQ
34 pages
Information and Knowledge Management in Libraries
No ratings yet
Information and Knowledge Management in Libraries
3 pages

Chapter 01: Types of Digital Data

Uploaded by

Chapter 01: Types of Digital Data

Uploaded by

CHAPTER 01: TYPES OF DIGITAL DATA

• Another way is:

Mediator : Uniform access to multiple data sources

• This data cannot be stored in the form of rows

Web Page XML

More Accurate analysis

More confidence in decision making

Greater operational efficiency, cost reduction, time

You might also like