What Are The Different Sources of Data?

The document discusses different types of data sources and scales of measurement. It describes two main sources of data - internal sources from within an organization and external sources from outside. It also discusses primary and secondary data. Four scales of measurement are defined - nominal, ordinal, interval, and ratio scales. Probability sampling methods like simple random sampling, systematic sampling, stratified sampling, and clustered sampling are also explained.

Uploaded by

Jesien Morales

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views5 pages

What Are The Different Sources of Data?

Uploaded by

Jesien Morales

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

Sources of Data

Sources of Data can be classified into 2 types. Statistical sources refer to data that are gathered for
some official purposes and incorporate censuses and officially administered surveys. Non-statistical
sources refer to the collection of data for other administrative purposes or for the private sector.

What are the different sources of data?

Following are the two sources of data:
1. Internal Source

 When data are collected from reports and records of the organization itself, it is known as the
internal source.
 For example, a company publishes its ‘Annual Report’ on Profit and Loss, Total Sales,
Loans, Wages etc.
2. External Source

 When data are collected from outside the organization, it is known as the external source.
 For example, if a Tour and Travels Company obtains information on ‘KarnatakaTourism’ from
Karnataka Transport Corporation, it would be known as external sources of data.

Types of Data
A) Primary Data

 Primary data means ‘First-hand information’ collected by an investigator.

 It is collected for the first time.
 It is original and more reliable.
 For example Population census conducted by the government of India after every 10 years.
B) Secondary Data

 Secondary data refers to ‘Second-hand information’.

 These are not originally collected rather obtained from already published or unpublished
sources.
 For example the Address of a person taken from the Telephone Directory or Phone number
of a company taken from ‘Just Dial’.
https://byjus.com/commerce/what-are-the-sources-of-data/
In statistics, the term measurement is used more broadly and is more appropriately termed scales of
measurement. Scales of measurement refer to ways in which variables/numbers are defined and
categorized. Each scale of measurement has certain properties which in turn determines the
appropriateness for use of certain statistical analyses. The four scales of measurement are nominal,
ordinal, interval, and ratio.

 Nominal: Categorical data and numbers that are simply used as identifiers or names represent a
nominal scale of measurement. Numbers on the back of a baseball jersey (St. Louis Cardinals 1 =
Ozzie Smith) and your social security number are examples of nominal data. If I conduct a study
and I'm including gender as a variable, I will code Female as 1 and Male as 2 or visa versa when I
enter my data into the computer. Thus, I am using the numbers 1 and 2 to represent categories
of data.
 Ordinal: An ordinal scale of measurement represents an ordered series of relationships or rank
order. Individuals competing in a contest may be fortunate to achieve first, second, or third
place. First, second, and third place represent ordinal data. If Roscoe takes first and Wilbur takes
second, we do not know if the competition was close; we only know that Roscoe outperformed
Wilbur. Likert-type scales (such as "On a scale of 1 to 10 with one being no pain and ten being
high pain, how much pain are you in today?") also represent ordinal data. Fundamentally, these
scales do not represent a measurable quantity. An individual may respond 8 to this question and
be in less pain than someone else who responded 5. A person may not be in half as much pain if
they responded 4 than if they responded 8. All we know from this data is that an individual who
responds 6 is in less pain than if they responded 8 and in more pain than if they responded 4.
Therefore, Likert-type scales only represent a rank ordering.
 Interval: A scale which represents quantity and has equal units but for which zero represents
simply an additional point of measurement is an interval scale. The Fahrenheit scale is a clear
example of the interval scale of measurement. Thus, 60 degree Fahrenheit or -10 degrees
Fahrenheit are interval data. Measurement of Sea Level is another example of an interval scale.
With each of these scales there is direct, measurable quantity with equality of units. In addition,
zero does not represent the absolute lowest value. Rather, it is point on the scale with numbers
both above and below it (for example, -10 degrees Fahrenheit).
 Ratio: The ratio scale of measurement is similar to the interval scale in that it also represents
quantity and has equality of units. However, this scale also has an absolute zero (no numbers
exist below the zero). Very often, physical measures will represent ratio data (for example,
height and weight). If one is measuring the length of a piece of wood in centimeters, there is
quantity, equal units, and that measure can not go below zero centimeters. A negative length is
not possible.
 Probability Sampling Methods
 1. Simple random sampling
 In this case each individual is chosen entirely by chance and each member of the
population has an equal chance, or probability, of being selected. One way of
obtaining a random sample is to give each individual in a population a number, and
then use a table of random numbers to decide which individuals to include.1 For
example, if you have a sampling frame of 1000 individuals, labelled 0 to 999, use
groups of three digits from the random number table to pick your sample. So, if the
first three numbers from the random number table were 094, select the individual
labelled “94”, and so on.
 As with all probability sampling methods, simple random sampling allows the
sampling error to be calculated and reduces selection bias. A specific advantage is
that it is the most straightforward method of probability sampling. A disadvantage of
simple random sampling is that you may not select enough individuals with your
characteristic of interest, especially if that characteristic is uncommon. It may also
be difficult to define a complete sampling frame and inconvenient to contact them,
especially if different forms of contact are required (email, phone, post) and your
sample units are scattered over a wide geographical area.

 2. Systematic sampling
 Individuals are selected at regular intervals from the sampling frame. The intervals
are chosen to ensure an adequate sample size. If you need a sample size n from a
population of size x, you should select every x/nth individual for the sample. For
example, if you wanted a sample size of 100 from a population of 1000, select every
1000/100 = 10th member of the sampling frame.
 Systematic sampling is often more convenient than simple random sampling, and it
is easy to administer. However, it may also lead to bias, for example if there are
underlying patterns in the order of the individuals in the sampling frame, such that
the sampling technique coincides with the periodicity of the underlying pattern. As a
hypothetical example, if a group of students were being sampled to gain their
opinions on college facilities, but the Student Record Department’s central list of all
students was arranged such that the sex of students alternated between male and
female, choosing an even interval (e.g. every 20th student) would result in a sample
of all males or all females. Whilst in this example the bias is obvious and should be
easily corrected, this may not always be the case.

 3. Stratified sampling
 In this method, the population is first divided into subgroups (or strata) who all
share a similar characteristic. It is used when we might reasonably expect the
measurement of interest to vary between the different subgroups, and we want to
ensure representation from all the subgroups. For example, in a study of stroke
outcomes, we may stratify the population by sex, to ensure equal representation of
men and women. The study sample is then obtained by taking equal sample sizes
from each stratum. In stratified sampling, it may also be appropriate to choose non-
equal sample sizes from each stratum. For example, in a study of the health
outcomes of nursing staff in a county, if there are three hospitals each with different
numbers of nursing staff (hospital A has 500 nurses, hospital B has 1000 and
hospital C has 2000), then it would be appropriate to choose the sample numbers
from each hospital proportionally (e.g. 10 from hospital A, 20 from hospital B and 40
from hospital C). This ensures a more realistic and accurate estimation of the health
outcomes of nurses across the county, whereas simple random sampling would over-
represent nurses from hospitals A and B. The fact that the sample was stratified
should be taken into account at the analysis stage.
 Stratified sampling improves the accuracy and representativeness of the results by
reducing sampling bias. However, it requires knowledge of the appropriate
characteristics of the sampling frame (the details of which are not always available),
and it can be difficult to decide which characteristic(s) to stratify by.

 4. Clustered sampling
 In a clustered sample, subgroups of the population are used as the sampling unit,
rather than individuals. The population is divided into subgroups, known as clusters,
which are randomly selected to be included in the study. Clusters are usually already
defined, for example individual GP practices or towns could be identified as clusters.
In single-stage cluster sampling, all members of the chosen clusters are then
included in the study. In two-stage cluster sampling, a selection of individuals from
each cluster is then randomly selected for inclusion. Clustering should be taken into
account in the analysis. The General Household survey, which is undertaken annually
in England, is a good example of a (one-stage) cluster sample. All members of the
selected households (clusters) are included in the survey.1
 Cluster sampling can be more efficient that simple random sampling, especially
where a study takes place over a wide geographical region. For instance, it is easier
to contact lots of individuals in a few GP practices than a few individuals in many
different GP practices. Disadvantages include an increased risk of bias, if the chosen
clusters are not representative of the population, resulting in an increased sampling
error.

 Non-Probability Sampling Methods
 1. Convenience sampling
 Convenience sampling is perhaps the easiest method of sampling, because
participants are selected based on availability and willingness to take part. Useful
results can be obtained, but the results are prone to significant bias, because those
who volunteer to take part may be different from those who choose not to (volunteer
bias), and the sample may not be representative of other characteristics, such as
age or sex. Note: volunteer bias is a risk of all non-probability sampling methods.

 2. Quota sampling
 This method of sampling is often used by market researchers. Interviewers are given
a quota of subjects of a specified type to attempt to recruit. For example, an
interviewer might be told to go out and select 20 adult men, 20 adult women, 10
teenage girls and 10 teenage boys so that they could interview them about their
television viewing. Ideally the quotas chosen would proportionally represent the
characteristics of the underlying population.
 Whilst this has the advantage of being relatively straightforward and potentially
representative, the chosen sample may not be representative of other characteristics
that weren’t considered (a consequence of the non-random nature of sampling). 2

 3. Judgement (or Purposive) Sampling
 Also known as selective, or subjective, sampling, this technique relies on the
judgement of the researcher when choosing who to ask to participate. Researchers
may implicitly thus choose a “representative” sample to suit their needs, or
specifically approach individuals with certain characteristics. This approach is often
used by the media when canvassing the public for opinions and in qualitative
research.
 Judgement sampling has the advantage of being time-and cost-effective to perform
whilst resulting in a range of responses (particularly useful in qualitative research).
However, in addition to volunteer bias, it is also prone to errors of judgement by the
researcher and the findings, whilst being potentially broad, will not necessarily be
representative.

 4. Snowball sampling
 This method is commonly used in social sciences when investigating hard-to-reach
groups. Existing subjects are asked to nominate further subjects known to them, so
the sample increases in size like a rolling snowball. For example, when carrying out a
survey of risk behaviours amongst intravenous drug users, participants may be
asked to nominate other users to be interviewed.
 Snowball sampling can be effective when a sampling frame is difficult to identify.
However, by selecting friends and acquaintances of subjects already investigated,
there is a significant risk of selection bias (choosing a large number of people with
similar characteristics or views to the initial individual identified).

Digital Image Processing Assign9-1
No ratings yet
Digital Image Processing Assign9-1
5 pages
Piping Interview Questions
100% (8)
Piping Interview Questions
17 pages
Tws PDF
100% (1)
Tws PDF
60 pages
Nature of Stat
No ratings yet
Nature of Stat
11 pages
STATISTICS
No ratings yet
STATISTICS
10 pages
Ocram
No ratings yet
Ocram
4 pages
Basic Concept of Statistics
No ratings yet
Basic Concept of Statistics
2 pages
Probability and Statistics Lesson 1 2
No ratings yet
Probability and Statistics Lesson 1 2
47 pages
Methodology 1
No ratings yet
Methodology 1
29 pages
Research Methodology (Chapter 3
No ratings yet
Research Methodology (Chapter 3
34 pages
APPLIED STATISTICS FOR BUSINESS AND ECONOMICS Midterms Reviewer
No ratings yet
APPLIED STATISTICS FOR BUSINESS AND ECONOMICS Midterms Reviewer
23 pages
Statistics For Research: Data and Variables
No ratings yet
Statistics For Research: Data and Variables
7 pages
EM-104-Module
No ratings yet
EM-104-Module
12 pages
4683-Methods of Social Research-I (Assignment-II)
No ratings yet
4683-Methods of Social Research-I (Assignment-II)
22 pages
6 Sampling and Basic Descriptive Statistics
No ratings yet
6 Sampling and Basic Descriptive Statistics
38 pages
Data Collection UGC - NET Paper-1
No ratings yet
Data Collection UGC - NET Paper-1
17 pages
MMW-Module-4
No ratings yet
MMW-Module-4
51 pages
MMWChapter4 6
No ratings yet
MMWChapter4 6
66 pages
Use The Regression Line To Make Predictions and Evaluate How Reliable These Predictions Are
No ratings yet
Use The Regression Line To Make Predictions and Evaluate How Reliable These Predictions Are
51 pages
Statistics 2: DR Taher
No ratings yet
Statistics 2: DR Taher
42 pages
MM7 WEEK 1 Statistics Overview Levels of Measurements and Sampling
No ratings yet
MM7 WEEK 1 Statistics Overview Levels of Measurements and Sampling
41 pages
1 Statistics Introduction
No ratings yet
1 Statistics Introduction
36 pages
Variable and Types of Statistical Variables
100% (1)
Variable and Types of Statistical Variables
9 pages
DMBA103
No ratings yet
DMBA103
9 pages
Collection of Data: Objectives
No ratings yet
Collection of Data: Objectives
7 pages
Sample - Is The Subset of The Entire Population
No ratings yet
Sample - Is The Subset of The Entire Population
6 pages
NSTA 51516 Slides
No ratings yet
NSTA 51516 Slides
97 pages
module_7
No ratings yet
module_7
49 pages
STATISTICS Is A Group of Methods Used To Collect
No ratings yet
STATISTICS Is A Group of Methods Used To Collect
17 pages
Chapter 1
No ratings yet
Chapter 1
4 pages
Measurement and Scales and Selecting Sampling 21102022 095103am
No ratings yet
Measurement and Scales and Selecting Sampling 21102022 095103am
21 pages
Sampling Human Pop1
No ratings yet
Sampling Human Pop1
49 pages
Mathematics As A Tool: Data Management
No ratings yet
Mathematics As A Tool: Data Management
29 pages
Statistics Lecture Notes
No ratings yet
Statistics Lecture Notes
6 pages
BA 302: Chapter-1 Instructions: Introduction & Definitions by Dr. Kishor Guru-Gharana
No ratings yet
BA 302: Chapter-1 Instructions: Introduction & Definitions by Dr. Kishor Guru-Gharana
5 pages
01. Basic Concepts in Statistics (1)
No ratings yet
01. Basic Concepts in Statistics (1)
41 pages
Statistics - MMW
No ratings yet
Statistics - MMW
15 pages
Statistics PDF
No ratings yet
Statistics PDF
13 pages
Chapter 1 INTRODUCTION TO STATISTICS (New)
No ratings yet
Chapter 1 INTRODUCTION TO STATISTICS (New)
34 pages
Methods of Sampling From A Population
No ratings yet
Methods of Sampling From A Population
3 pages
1) Unit 1. Introduction PDF
No ratings yet
1) Unit 1. Introduction PDF
7 pages
Statistics for Business and Economics
No ratings yet
Statistics for Business and Economics
6 pages
Chapter 2 Stat
No ratings yet
Chapter 2 Stat
6 pages
Statistics For Beginners 2024
No ratings yet
Statistics For Beginners 2024
37 pages
Introduction To Statistics: February 21, 2006
No ratings yet
Introduction To Statistics: February 21, 2006
34 pages
Stats Week 1 - Notes
No ratings yet
Stats Week 1 - Notes
7 pages
Stat 2 PDF
No ratings yet
Stat 2 PDF
41 pages
GEC4 Mathematics in the Modern World CHAPTER 4
No ratings yet
GEC4 Mathematics in the Modern World CHAPTER 4
35 pages
NOTES STATISTICS Final
No ratings yet
NOTES STATISTICS Final
11 pages
Sampling-Method
No ratings yet
Sampling-Method
4 pages
Methods of Sampling From A Population
No ratings yet
Methods of Sampling From A Population
4 pages
Social Work Stat
No ratings yet
Social Work Stat
52 pages
Nature of Statistics: Sample Population Parameter, Statistic
No ratings yet
Nature of Statistics: Sample Population Parameter, Statistic
3 pages
Statistics
No ratings yet
Statistics
52 pages
1 Statistics Collection N Presentation of Data
No ratings yet
1 Statistics Collection N Presentation of Data
15 pages
Research: Strategies and Methods
No ratings yet
Research: Strategies and Methods
34 pages
MMW GE 4 Week 10 PPT 23 24
No ratings yet
MMW GE 4 Week 10 PPT 23 24
23 pages
EDA-Lecture 1
No ratings yet
EDA-Lecture 1
52 pages
STAT-II Week End
100% (2)
STAT-II Week End
57 pages
Topic 2 - Data Collection and Sampling Techniques
No ratings yet
Topic 2 - Data Collection and Sampling Techniques
15 pages
STATISTICS ANALYSIS - Reviewer Week 1
No ratings yet
STATISTICS ANALYSIS - Reviewer Week 1
5 pages
Sampling in Statistics
From Everand
Sampling in Statistics
Stephanie Glen
No ratings yet
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Karakter Goku
No ratings yet
Karakter Goku
8 pages
AMCOL MCST TR HeviSand Sieve Comparison Chart
No ratings yet
AMCOL MCST TR HeviSand Sieve Comparison Chart
1 page
CEMAT PresentationV71474389
0% (1)
CEMAT PresentationV71474389
112 pages
Drop Shape Analyzer - Dsa25: Specifications
No ratings yet
Drop Shape Analyzer - Dsa25: Specifications
4 pages
Kenny and Osborne Music Performance B
No ratings yet
Kenny and Osborne Music Performance B
10 pages
Deborah Ball PDF
No ratings yet
Deborah Ball PDF
19 pages
1.8 Complex Numbers: Question Paper
No ratings yet
1.8 Complex Numbers: Question Paper
10 pages
Engineering Physics - : BPHY101L
No ratings yet
Engineering Physics - : BPHY101L
7 pages
Chapter 2 - Traffic Survey and Data Collection Methods
No ratings yet
Chapter 2 - Traffic Survey and Data Collection Methods
26 pages
4q Mathematics 10 PT
No ratings yet
4q Mathematics 10 PT
5 pages
Answer All Questions in This Section
No ratings yet
Answer All Questions in This Section
13 pages
ISO9001+ISO14001 Internal Audit Programme Sample
No ratings yet
ISO9001+ISO14001 Internal Audit Programme Sample
1 page
Aim of Project
No ratings yet
Aim of Project
5 pages
ZSI: The Zolera Soap Infrastructure Developer's Guide
No ratings yet
ZSI: The Zolera Soap Infrastructure Developer's Guide
71 pages
Vishwakarma Institute of Technology, Pune: Academic Time Table 2010-2011 First Term
No ratings yet
Vishwakarma Institute of Technology, Pune: Academic Time Table 2010-2011 First Term
2 pages
Combined Maths - E - (Paper I & II Answer Guides)
No ratings yet
Combined Maths - E - (Paper I & II Answer Guides)
41 pages
[FREE PDF sample] (eBook PDF) Chemical Structure and Reactivity: An Integrated Approach 2nd Edition ebooks
100% (2)
[FREE PDF sample] (eBook PDF) Chemical Structure and Reactivity: An Integrated Approach 2nd Edition ebooks
41 pages
27-650 ENG Manual PCD1P1001-J30 PQA 01 PDF
No ratings yet
27-650 ENG Manual PCD1P1001-J30 PQA 01 PDF
92 pages
Enzyme Electrode
100% (1)
Enzyme Electrode
10 pages
Conversion Table For Potato Specific Gravity
No ratings yet
Conversion Table For Potato Specific Gravity
7 pages
Affinity
No ratings yet
Affinity
7 pages
Sub Inspector in Delhi Police and Central Armed Police Forces Examination 2023 Paper I
No ratings yet
Sub Inspector in Delhi Police and Central Armed Police Forces Examination 2023 Paper I
335 pages
4885 - Qa - 1
100% (1)
4885 - Qa - 1
27 pages
The Mostly Complete Chart of Neural Networks
100% (1)
The Mostly Complete Chart of Neural Networks
19 pages
PLZ Series: Vishay Semiconductors
No ratings yet
PLZ Series: Vishay Semiconductors
10 pages
Detection and Location of Faults in Underground Cable Using Matlab/Simulink/Ann and Orcad
No ratings yet
Detection and Location of Faults in Underground Cable Using Matlab/Simulink/Ann and Orcad
5 pages
AISC-BOLT SPACING FOR COMPOSITE SECTION-(Batten bolts & Lacing bolts)
No ratings yet
AISC-BOLT SPACING FOR COMPOSITE SECTION-(Batten bolts & Lacing bolts)
2 pages

What Are The Different Sources of Data?

Uploaded by

What Are The Different Sources of Data?

Uploaded by

Sources of Data

What are the different sources of data?

 Primary data means ‘First-hand information’ collected by an investigator.

 Secondary data refers to ‘Second-hand information’.

You might also like