
Sources of Data

This document discusses different sources of data for data analysis. It explains that data can come from primary sources, which provide raw data collected directly through questionnaires, interviews, and surveys, or from secondary sources, which provide previously collected data that is reused. Primary data collection methods include interviews, surveys, and observation. Secondary data comes from internal sources within an organization or from external sources such as government publications. Other sources of data mentioned include sensor data, satellite data, and web traffic data.

INSTITUTE - University School of Business
DEPARTMENT - AIT
M.B.A
Business Analytics

Faculty Name: Shailja Gera

Lecture on: Different Sources of Data

DISCOVER . LEARN . EMPOWER


UNIT-2

Different Sources of Data for Data Analysis

Data collection is the process of acquiring, extracting, and storing
large volumes of data, which may be structured or unstructured (text,
video, audio, XML files, records, or image files), for use in later
stages of data analysis.
In big data analysis, data collection is the initial step, carried out
before any patterns or useful information in the data can be analyzed.
The data to be analyzed must be collected from valid sources.
Data collection starts with two questions:
Q - What type of data is to be collected?
Q - What is the source of collection?
Most collected data falls into two types:
(A) Qualitative data - non-numerical data such as words and sentences,
mostly focused on the behavior and actions of a group.
(B) Quantitative data - data in numerical form, which can be calculated
using different scientific tools and sampling methods.
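The distinction between the two types can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the records, the field names (`age`, `rating`, `comment`), and the helper `split_fields` are hypothetical and do not come from the lecture. Numeric fields are treated as quantitative and can be averaged; everything else is treated as qualitative.

```python
from statistics import mean

# Hypothetical records: "age" and "rating" are numeric, "comment" is free text.
responses = [
    {"age": 34, "rating": 4, "comment": "Easy to use"},
    {"age": 27, "rating": 5, "comment": "Fast delivery"},
    {"age": 45, "rating": 2, "comment": "Too expensive"},
]


def split_fields(records):
    """Return (quantitative, qualitative) field names based on value types."""
    quantitative, qualitative = [], []
    for field in records[0]:
        values = [record[field] for record in records]
        is_numeric = all(
            isinstance(v, (int, float)) and not isinstance(v, bool) for v in values
        )
        (quantitative if is_numeric else qualitative).append(field)
    return quantitative, qualitative


quantitative, qualitative = split_fields(responses)

# Only quantitative fields support numerical summaries such as the mean.
averages = {field: mean(r[field] for r in responses) for field in quantitative}

print("Quantitative fields:", quantitative)
print("Qualitative fields:", qualitative)
print("Averages:", averages)
```

Qualitative fields such as `comment` would need a different kind of analysis, for example manual coding into themes, which this sketch does not attempt.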
By source, data is divided into two main types:
1. Primary data:
• Data that is raw, original, and extracted directly from official
sources is known as primary data.
• This type of data is collected directly through techniques such as
questionnaires, interviews, and surveys.
• The data collected must match the requirements of the analysis and of
the target audience on which it is performed; otherwise it becomes a
burden during data processing.
(a) Interview method: Data is collected by interviewing the target
audience. The person who asks the questions is the interviewer, and the
person who answers them is the interviewee.
Some basic business or product-related questions are asked, and the
answers are recorded as notes, audio, or video and stored for processing.
(b) Survey method: The survey method is a research process in which a
list of relevant questions is asked and the answers are recorded as text,
audio, or video. Surveys can be conducted online, for example through
website forms and email, or offline. The survey answers are then stored
for analysis. Examples are online surveys and social media polls.
(c) Observation method: The observation method is a method of data
collection in which the researcher closely observes the behavior and
practices of the target audience using a data collection tool and
stores the observed data as text, audio, video, or another raw format.
The data is collected directly from the participants, in some cases by
also posing a few questions to them. An example is observing a group of
customers and their behavior towards the products. The data obtained
is then sent for processing.
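As a rough illustration of how stored survey answers might be prepared for analysis, the sketch below tallies the answers in a small CSV export. The column names (`respondent_id`, `channel`, `answer`), the sample rows, and the helper `tally_answers` are assumptions made for this example and are not part of the lecture.

```python
import csv
import io
from collections import Counter

# Hypothetical survey export; in practice it might come from a website form.
raw_csv = """respondent_id,channel,answer
1,website,Yes
2,email,No
3,website,Yes
4,social_poll,Yes
"""


def tally_answers(csv_text):
    """Count how often each answer appears in the survey export."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return Counter(row["answer"] for row in reader)


counts = tally_answers(raw_csv)

# List the answers from most to least frequent.
for answer, count in counts.most_common():
    print(answer, count)
```

The same approach would apply to interview notes or observation logs once they are stored in a tabular form.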
2. Secondary data:
Secondary data is the data which has already been collected and reused
again for some valid purpose. This type of data is previously recorded
from primary data and it has two types of sources named internal source
and external source.
(a) Internal source:
This type of data can easily be found within the organization, such as
market records, sales records, transactions, customer data, and
accounting resources. Obtaining data from internal sources takes less
cost and time.
(b) External source:
Data that cannot be found within the organization and is obtained
through external third-party resources is external source data.
The cost and time consumption is higher because external sources
contain a huge amount of data. Examples of external sources are
government publications, news publications, the Registrar General of
India, the Planning Commission, the International Labor Bureau,
syndicate services, and other non-governmental publications.
Other sources:
• Sensor data: With the advancement of IoT devices, the sensors in these
devices collect data that can be used for sensor data analytics to track
the performance and usage of products.
• Satellite data: Satellites collect terabytes of images and data daily
through surveillance cameras, which can be used to extract useful
information.
• Web traffic: Thanks to fast and cheap internet access, the many formats
of data that users upload to different platforms can be collected, with
their permission, for data analysis. Search engines also provide data on
the keywords and queries searched most often.
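
For the web traffic case, a minimal sketch of finding the most-searched queries is shown below. It assumes a hypothetical list of logged request URLs in which the search term is carried in a `q` query parameter; the URLs and the helper `top_queries` are illustrative only.

```python
from collections import Counter
from urllib.parse import parse_qs, urlparse

# Hypothetical request URLs from a site's search log.
logged_urls = [
    "https://example.com/search?q=running+shoes",
    "https://example.com/search?q=laptop",
    "https://example.com/search?q=Running+Shoes",
    "https://example.com/product/42",
]


def top_queries(urls, n=3):
    """Return the n most frequent search queries, ignoring letter case."""
    counts = Counter()
    for url in urls:
        params = parse_qs(urlparse(url).query)
        # URLs without a "q" parameter contribute nothing.
        for query in params.get("q", []):
            counts[query.strip().lower()] += 1
    return counts.most_common(n)


top = top_queries(logged_urls)
print(top)
```

Normalizing case before counting is a design choice of this sketch, so that "Running Shoes" and "running shoes" are counted as the same query.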
