Quantitative Data Analysis Approaches
Quantitative Data Analysis Approaches
net/publication/299534063
CITATIONS READS
0 41,442
4 authors:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Babak Taheri on 31 March 2016.
Research Methods
for Business
& Management
Second edition
This is a sample only. Full information and purchase details for this title
are available at http://www.goodfellowpublishers.com and for other
titles in the series at the Global Management Series page.
THE GLOBAL MANAGEMENT SERIES
Research Methods
for Business
& Management
Second edition
ISBN: 978-1-910158-51-7
All trademarks used herein are the property of their repective owners, The
use of trademarks or brand names in this text does not imply any affiliation
with or endorsement of this book by such owners.
This is a sample only. Full information and purchase details for this
title are available at http://www.goodfellowpublishers.com and
for other titles in the series at the Global Management Series page.
Contents
10 Quantitative Data
Analysis Approaches
In order to understand data and present findings in an accurate way, researchers and
managers need to develop an awareness of statistical analysis techniques. The previ-
ous chapter concentrated on quantitative data collection, this chapter delves into the
statistical tools used to analyse the data once collected. It focuses on two sets of the
most widely used statistical tools – exploring relationships and comparing groups – as
shown in the ‘Deductive’ section in the Data Analysis area of the Methods Map (see
Chapter 4). Finally, we briefly explain the nature of Big Data.
Data preparation
Real-life data generally cannot be used directly for data analysis – they are
unorganised and filled with different types of problems and errors. We
discuss three pre-processing steps that prepare data for further analysis:
data entry, data cleaning and data formatting.
Data entry
A conventional way to organise data is to use tables, with records as rows
and attributes as columns. A record is an identifiable piece of information
which contains a set of values of attributes to the record. For example, one
may organise the information collected from questionnaires in the follow-
ing way: each record corresponds to all the answers from a respondent, with
each attribute associated with the answer to one question.
Quantitative Data Analysis Approaches 175
Table 10.1: Information required for each variable in the questionnaire in variable view in
SPSS
Figure 10.1: Example of (top) variable view and (bottom) data view in SPSS software
Quantitative Data Analysis Approaches 177
Data cleaning
Even if there are no errors introduced during entry phase, real-life data
need to be cleaned because they are often incomplete, noisy and inconsistent
(Han, Kamber, & Pei, 2011). Incompleteness arises when for some records
the values for some attributes are missing. There are mainly two ways to
deal with this issue. First, delete the whole record that misses data; this
could be viable when the number of records with missing data is relatively
small compared to the whole dataset. Second, fill the missing values; one
can use the expected value on the corresponding attribute or regression on
other attributes to predict the missing value. Noises refer to random factors
that can only be quantified in a probabilistic way. Noises confound obser-
vations and cause outliers that are far away from normal observations. A
primary task of data cleaning is to identify and ‘smooth’ out these outliers.
Inconsistencies often arise when one combines information from different
sources. For example, combining datasets with both American and British
date information may cause confusion (i.e. the 3rd of April 1990 could be
displayed as both 4/3/90 and 3/4/90).
Preliminary analysis
Describing data
To present a sample in an illustrative way one can either use descriptive
statistics (numbers) or graphs, or both; it is a matter of personal preference –
some prefer descriptive statistics because they are quantifiable while others
prefer graphs because they are more intuitive. Therefore, when deciding
which form to present data, it is important to know who your target audi-
ence is.
10
If the sample is of a nonmetric type (for example an ordinal scale as
described in Chapter 9), frequency and ratio are two commonly used descrip-
tive statistics. Frequency counts the number of occurrences of a specific
category, and ratio calculates the corresponding percentage of frequency
in the entire sample. Nonmetric data can be visualised through pie charts
or bar charts. We give an example on the cut quality of diamonds based
on a dataset with 53940 records (Source: http://vincentarelbundock.github.
io/Rdatasets/datasets.html). The cut quality of diamonds is a nonmetric
measurement and has five categories: fair, good, very good, premium and