FET 401 Week 8 Lecture Note

1) The document discusses data analysis and descriptive statistics, including defining data analysis, variables, and types of statistics. 2) Descriptive statistics are used to summarize and describe data through measures of central tendency, dispersion, frequency distributions and charts. 3) SPSS software can be used to conduct descriptive statistics on data, including calculating means, standard deviations, ranges and creating frequency tables.

Uploaded by

Johnpraise

MODULE 4: DATA ANALYSIS AND RESULTS PRESENTATION

WEEK 8: DATA PREPARATION AND DESCRIPTIVE STATISTICS

Data Analysis
Data analysis is the critical examination of assembled and grouped data in order to study the
characteristics of the object under study and to determine the pattern of relationships among
the variables relating to it. It involves categorizing, ordering, manipulating, and summarizing
data. Its purpose is to reduce large quantities of raw data to a manageable and interpretable
form so that the characteristics of situations, events, and people can be richly described and
the relations among variables studied and interpreted. Analysis and interpretation of data are
crucial aspects of research. The activities of data analysis cannot be separated from
statistics: scoring, categorizing, ordering, manipulating, summarizing, and interpreting data
all involve the use of statistics.

What is Statistics?
Statistics is a set of quantitative methods for describing, analyzing, and drawing inferences
(conclusions) from data. Using statistics, one can infer population characteristics on the basis
of sample observations. No meaningful conclusion can be drawn by merely looking at
experimental data; hence, appropriate statistical techniques are used to draw meaningful
inferences. Thus, statistics provides the know-how for the scientific collection, compilation,
and analysis of data.
Practitioners need to understand statistics:
• To know how to properly present and describe information,
• To know how to draw conclusions about large populations based only on information
obtained from samples,
• To know how to solve key industry-related problems and make sensible, valid, and
reliable decisions on the basis of the statistical analysis conducted.

Types of Statistics: There are two general types or categories of statistics that are often referred
to when making a statistical decision or working on a statistical problem:

1. Descriptive Statistics: statistical procedures used for summarizing, organizing,
graphing, and describing data. Descriptive statistics use numerical and graphical
methods to look for patterns in a data set, to summarize the information it reveals,
and to present that information in a convenient form that individuals can use to
make decisions. The main goal of descriptive statistics is to describe a data set. Thus,
the class of descriptive statistics includes both numerical measures (e.g. the mean or the
median) and graphical displays of data (e.g. pie charts or bar graphs).

2. Inferential Statistics: statistical procedures that allow one to draw inferences about a
population on the basis of sample data. They are typically expressed as tests of
significance, which test relationships and differences. Inferential statistics use sample
data to make estimates, decisions, predictions, or other generalizations about a larger
set of data. Examples of inferential statistics include the z statistic and the t statistic.
Variables: A variable is any characteristic that is measured or captured in a dataset, that is,
any factor or issue under investigation: e.g., tourist arrivals per year, expenditure, tourist
satisfaction, nationality, gender, number of nights in destination, perceived quality, likelihood
to return to the destination, etc.

The variables can be measured in one of three general ways: categorical, discrete and
continuous as discussed in our previous class.

Variables can also be classified as independent or dependent variables. Independent variables
are also called explanatory variables and are examined to see how they explain, predict, or
influence other variables, called dependent variables. Dependent variables are the variables
that are believed to be influenced by independent variables.

Data in statistics is sometimes classified according to how many variables are in a particular
study. For example, “height” might be one variable and “weight” might be another variable.
Depending on the number of variables being looked at, the data might be univariate, bivariate,
or multivariate.

• Univariate analysis is the analysis of one (“uni”) variable.
• Bivariate analysis is the analysis of exactly two variables.
• Multivariate analysis is the analysis of more than two variables.

SPSS Overview

1. Data View
   • Used to display data
   • Columns represent variables
   • Rows represent individual units or groups of units that share common values of
     variables
2. Variable View
   • Used to display information on the variables in the dataset
   • TYPE: Allows for various styles of displaying
   • LABEL: Allows for a longer description of the variable name
   • VALUES: Allows for a longer description of the variable levels
   • MEASURE: Allows choice of measurement scale
3. Output View
   • Displays the results of analyses and graphs

Entering Variables in SPSS

Importing data from Excel
• Select File > Open > Data
• Choose Excel as the file type
• Select the file you want to import
• Then click Open

Descriptive Statistics
Descriptive statistics describe and summarize data. They can be used to describe a single
variable (univariate analysis) or more than one variable (bivariate/multivariate analysis).

Descriptive analysis explores each variable in a data set. It looks at the range of values as
well as their central tendency, and it describes the pattern of responses to each variable.
Patterns in univariate data can be described with measures of central tendency (mean, median,
and mode) and measures of dispersion or variability (range, variance, standard deviation,
minimum, maximum, and quartiles, including the interquartile range).

Example: A researcher wants to describe the pattern and summarize the main features of
400-level (400 L) students of the Faculty of Engineering (FENG), Lead City University (LCU).
He collected and analyzed the following data.
Data: LCU FENG DATA.xls (available in the Sample Data folder).

In SPSS, the Descriptives procedure computes a select set of basic descriptive statistics for one
or more continuous numeric variables. In all, the statistics it can produce are:
• N valid responses (Number of valid responses/samples)
• Mean
• Sum
• Standard deviation
• Variance
• Minimum
• Maximum
• Range
• Standard error of the mean (or S.E. mean)
• Skewness
• Kurtosis
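
As a rough cross-check of what each of these statistics means, the sketch below computes them in Python using only the standard library. The data values are hypothetical (they are not taken from LCU FENG DATA.xls), the function name `describe` is made up for illustration, and the skewness and kurtosis formulas are the usual bias-adjusted sample versions, which are assumed here to correspond to the values SPSS reports.

```python
import math
import statistics

def describe(values):
    """Return the statistics listed above for a list of numbers (needs n >= 4)."""
    n = len(values)
    mean = statistics.mean(values)
    sd = statistics.stdev(values)          # sample standard deviation (n - 1)
    z = [(v - mean) / sd for v in values]  # standardized scores
    # Bias-adjusted sample skewness and excess kurtosis.
    skew = n / ((n - 1) * (n - 2)) * sum(t ** 3 for t in z)
    kurt = (n * (n + 1) / ((n - 1) * (n - 2) * (n - 3)) * sum(t ** 4 for t in z)
            - 3 * (n - 1) ** 2 / ((n - 2) * (n - 3)))
    return {
        "N": n,
        "Mean": mean,
        "Sum": sum(values),
        "Std. Deviation": sd,
        "Variance": statistics.variance(values),
        "Minimum": min(values),
        "Maximum": max(values),
        "Range": max(values) - min(values),
        "S.E. Mean": sd / math.sqrt(n),
        "Skewness": skew,
        "Kurtosis": kurt,
    }

# Hypothetical ages, for illustration only.
ages = [19, 20, 20, 21, 22, 25, 32]
for name, value in describe(ages).items():
    print(f"{name}: {value:.4f}" if isinstance(value, float) else f"{name}: {value}")
```

The standard error of the mean is simply the standard deviation divided by the square root of N, which is why it shrinks as the sample grows.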

Steps to conduct Descriptive Statistics in SPSS
Running the Procedure
1. Click Analyze > Descriptive Statistics > Descriptives (or Frequencies).
2. Add the variables to the Variables box.
3. Click OK when finished.

To run the Descriptives procedure, select Analyze > Descriptive Statistics > Descriptives.

The Descriptives window lists all of the variables in your dataset in the left column. To select
variables for analysis, click on the variable name to highlight it, then click on the arrow button
to move the variable to the column on the right. Alternatively, you can double-click on the name
of a variable to move it to the column on the right.

Outputs

Descriptive Statistics
                                  N     Range   Minimum   Maximum   Mean      Std. Deviation
Age_of_Students_Years             102   13.00   19.00     32.00     22.0098   2.86493
Height_of_Students_feets_Inches   102   2.20    4.90      7.10      5.7569    .56857
Valid N (listwise)                102

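Interpretation
Here we see a side-by-side comparison of the descriptive statistics for the two numeric variables.
This allows us to quickly make the following observations about the 102 sampled 400 L FENG
students:
• The maximum age and height observed are 32 years and 7 feet 1 inch, respectively.
• The minimum age and height observed are 19 years and 4 feet 9 inches, respectively.
• The mean age is about 22 years and the mean height is approximately 5 feet 8 inches.
Note that these height readings treat the digit after the decimal point as inches (7.10 read as
7 feet 1 inch), as the variable name suggests. If the values were instead decimal feet, 7.10
would be about 7 feet 1.2 inches and the mean of 5.7569 about 5 feet 9 inches.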

Frequencies
Output
Statistics
                     Age_of_Students_Years   Height_of_Students_feets_Inches
N       Valid        102                     102
        Missing      0                       0
Mean                 22.0098                 5.7569
Median               21.0000                 5.8000
Mode                 20.00                   5.20a
Std. Deviation       2.86493                 .56857
Range                13.00                   2.20
Minimum              19.00                   4.90
Maximum              32.00                   7.10
a. Multiple modes exist. The smallest value is shown.

Frequency Table
Age_of_Students_Years
                Frequency   Percent   Valid Percent   Cumulative Percent
Valid   19.00   18          17.6      17.6            17.6
        20.00   20          19.6      19.6            37.3
        21.00   19          18.6      18.6            55.9
        22.00   12          11.8      11.8            67.6
        23.00   5           4.9       4.9             72.5
        24.00   5           4.9       4.9             77.5
        25.00   11          10.8      10.8            88.2
        26.00   9           8.8       8.8             97.1
        32.00   3           2.9       2.9             100.0
        Total   102         100.0     100.0
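
Because the frequency table lists every observed age with its count, the summary statistics can be recomputed from it by hand or in code. The following is a minimal Python sketch (standard library only) that expands the counts back into 102 individual values and recomputes the mean, median, mode, standard deviation, range, and the percent columns; the variable names are illustrative and not part of the SPSS output.

```python
import statistics

# Frequency counts copied from the Age_of_Students_Years table above.
age_counts = {19: 18, 20: 20, 21: 19, 22: 12, 23: 5, 24: 5, 25: 11, 26: 9, 32: 3}

# Expand the table back into one value per student.
ages = [age for age, count in age_counts.items() for _ in range(count)]
n = len(ages)

summary = {
    "N": n,
    "Mean": statistics.mean(ages),
    "Median": statistics.median(ages),
    "Mode": statistics.mode(ages),
    "Std. Deviation": statistics.stdev(ages),
    "Range": max(ages) - min(ages),
}

# Percent and cumulative percent columns of the frequency table.
rows = []
running = 0
for age, count in age_counts.items():
    running += count
    rows.append((age, count, round(100 * count / n, 1), round(100 * running / n, 1)))

print(summary)
for row in rows:
    print(row)
```

The cumulative percent for a row is the running total of frequencies up to and including that row, divided by N; this is why the last row always reaches 100.0.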

Height_of_Students_feets_Inches
                Frequency   Percent   Valid Percent   Cumulative Percent
Valid   4.90    4           3.9       3.9             3.9
        5.00    9           8.8       8.8             12.7
        5.10    5           4.9       4.9             17.6
        5.20    10          9.8       9.8             27.5
        5.30    10          9.8       9.8             37.3
        5.40    1           1.0       1.0             38.2
        5.60    3           2.9       2.9             41.2
        5.70    7           6.9       6.9             48.0
        5.80    6           5.9       5.9             53.9
        5.90    8           7.8       7.8             61.8
        6.00    2           2.0       2.0             63.7
        6.10    6           5.9       5.9             69.6
        6.20    8           7.8       7.8             77.5
        6.30    8           7.8       7.8             85.3
        6.40    4           3.9       3.9             89.2
        6.50    2           2.0       2.0             91.2
        6.60    3           2.9       2.9             94.1
        6.70    4           3.9       3.9             98.0
        7.10    2           2.0       2.0             100.0
        Total   102         100.0     100.0
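
The height table can be checked the same way, and it also shows why SPSS flags multiple modes. The sketch below is illustrative only. It assumes that the digit after the decimal point is a number of inches (so 5.7 is read as 5 feet 7 inches), which is one possible reading of the variable name and is not confirmed by the data file; the helper `to_inches` is hypothetical. Under that assumption the heights are converted to inches before averaging, and the mean comes out at about 68.3 inches.

```python
import statistics

# Frequency counts copied from the Height_of_Students_feets_Inches table above.
height_counts = {
    4.9: 4, 5.0: 9, 5.1: 5, 5.2: 10, 5.3: 10, 5.4: 1, 5.6: 3, 5.7: 7, 5.8: 6,
    5.9: 8, 6.0: 2, 6.1: 6, 6.2: 8, 6.3: 8, 6.4: 4, 6.5: 2, 6.6: 3, 6.7: 4, 7.1: 2,
}
heights = [h for h, count in height_counts.items() for _ in range(count)]

# All values tied for the highest frequency (SPSS prints only the smallest).
modes = statistics.multimode(heights)

def to_inches(coded):
    """ASSUMPTION: 5.7 means 5 feet 7 inches, not 5.7 decimal feet."""
    feet, inches = divmod(round(coded * 10), 10)
    return feet * 12 + inches

mean_coded = statistics.mean(heights)                      # mean of the raw codes
mean_inches = statistics.mean(to_inches(h) for h in heights)
feet, inches = divmod(mean_inches, 12)

print(modes)
print(round(mean_coded, 4))
print(f"{int(feet)} ft {inches:.1f} in")
```

Converting to a single unit before computing the mean avoids averaging a mixed feet-and-inches code directly.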

Bar Chart
A bar chart (bar diagram) is a graph in which rectangular bars are drawn with lengths
proportional to the values they represent. The bars can be drawn vertically or horizontally.
A bar chart is used to compare the magnitudes of discrete groups, whether the quantity
compared is measured on a discrete or a continuous scale.

Example of a simple bar chart

A quality engineer for an automotive supply company wants to decrease the number of car door
panels that are rejected because of paint flaws. As part of the initial investigation, the engineer
creates a bar chart to compare the counts of paint flaws.

Data: PaintFlaws.xls (available in the Sample Data folder).

Procedure
1. Click Graphs > Legacy Dialogs > Bar
2. Click Define
3. Select the variable for which you wish to create a bar chart, and move it into the
“Category Axis” box.
4. Select “Titles” to add a title (Optional)
5. Click Continue after you have added a title
6. Click OK
7. Your bar chart will appear in the SPSS viewer window

Output

Interpretation
This bar chart shows that Peel is the most common paint flaw and that Smudge and Other are
the least common paint flaws.
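
What a simple bar chart of counts represents can be sketched in a few lines of Python. The records below are hypothetical (the actual contents of PaintFlaws.xls are not reproduced in these notes, and the "Scratch" category is invented for illustration); the point is only that each bar's length equals the number of cases in its category.

```python
from collections import Counter

# Hypothetical inspection records; the real PaintFlaws.xls values are not shown in the text.
flaws = ["Peel", "Scratch", "Peel", "Smudge", "Peel", "Scratch", "Other", "Peel"]

counts = Counter(flaws)

# One bar per category, with length equal to the count it represents.
for flaw, count in counts.most_common():
    print(f"{flaw:<8} {'#' * count} {count}")
```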

Example of a Clustered bar chart
A researcher wants to describe the pattern and summarize the main features of 400 L students of
the Faculty of Engineering, Lead City University. He collected and analyzed the following data.
Data: LCU FENG DATA.xls (available in the Sample Data folder).

Procedure
1. Click Graphs > Legacy Dialogs > Bar
2. Select “Clustered” and “Summaries for groups of cases”
3. Click Define
4. Select the variable you wish to display on the horizontal axis, and move it into the
“Category Axis” box
5. Select the second variable, and move it to the “Define Clusters by” box
6. Select your desired option under “Bars Represent”
7. Select “Titles” to add a title (Optional)
8. Click OK

Output

Interpret the results
The civil and electrical engineering departments have the highest numbers of second-class upper
degrees, while mechanical engineering has the highest number of second-class lower degrees.
No student in the mechanical engineering department currently has a pass degree.
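
A clustered bar chart is a picture of a two-way table of counts: one cluster per category of the first variable and one bar per category of the second. The Python sketch below builds such a table from hypothetical (department, degree class) pairs; the records are invented for illustration and are not taken from LCU FENG DATA.xls.

```python
from collections import Counter

# Hypothetical (department, degree class) pairs; not taken from LCU FENG DATA.xls.
records = [
    ("Civil", "Second Class Upper"), ("Civil", "Second Class Upper"),
    ("Civil", "Second Class Lower"), ("Electrical", "Second Class Upper"),
    ("Mechanical", "Second Class Lower"), ("Mechanical", "Second Class Lower"),
    ("Mechanical", "Second Class Upper"), ("Electrical", "Pass"),
]

cell_counts = Counter(records)                      # one count per bar
departments = sorted({dept for dept, _ in records})
classes = sorted({cls for _, cls in records})

# Each department is a cluster; each degree class is a bar within the cluster.
for dept in departments:
    print(dept)
    for cls in classes:
        print(f"  {cls:<20} {'#' * cell_counts[(dept, cls)]} {cell_counts[(dept, cls)]}")
```

A combination that never occurs simply has a count of zero and therefore no bar in its cluster.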

Pie Chart
A pie chart displays data as a circle divided into slices. The size of each slice is
proportional to the fraction of the whole that falls in that category. In other words, each
slice of the pie reflects the size of that category relative to the group as a whole. The entire
“pie” represents 100% of a whole, while the pie “slices” represent portions of the whole.
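
The proportionality described above can be written out directly: a category with count c out of a total T takes up 100 × c / T percent of the pie, or 360 × c / T degrees. A minimal Python sketch, using hypothetical counts and an illustrative function name `pie_slices`:

```python
def pie_slices(counts):
    """Return {category: (percent of whole, slice angle in degrees)}."""
    total = sum(counts.values())
    return {
        category: (100 * count / total, 360 * count / total)
        for category, count in counts.items()
    }

# Hypothetical category counts, for illustration only.
example = {"A": 50, "B": 30, "C": 20}
for category, (percent, angle) in pie_slices(example).items():
    print(f"{category}: {percent:.1f}% of the pie, {angle:.1f} degrees")
```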

Example of Pie Chart

Procedure
1. Click Graphs > Legacy Dialogs > Pie
2. Select “Summaries for groups of cases”
3. Click Define
4. Click “Reset” (recommended)
5. Move the variable for which you are creating a pie chart into the “Define slices by” box
6. Select your desired option under “Slices Represent”
7. Select “Titles” to add a title (recommended)
8. Click “OK”

Output

Interpret the results

This pie chart shows that Peel is the most common paint flaw and that Smudge and Other are
the least common paint flaws.

The Scattergram/Scatterplot
The scattergram is a visual expression of the correlation coefficient. It shows the pattern of
the relationship between two variables and is obtained by plotting the paired data on X and Y
axes, with the independent variable along the X axis and the dependent variable along the
Y axis.
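
The number that a scatterplot visualizes, the Pearson correlation coefficient, and the least-squares line often drawn through the points can both be computed from the paired data. The Python sketch below is a plain-formula illustration with hypothetical data and illustrative function names; it is not the SPSS procedure itself.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def least_squares_line(x, y):
    """Slope and intercept of the least-squares line y = intercept + slope * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    slope = (sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
             / sum((a - mean_x) ** 2 for a in x))
    return slope, mean_y - slope * mean_x

# Hypothetical paired data (x = independent, y = dependent), for illustration only.
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 8.1, 9.8]
print(round(pearson_r(x, y), 3))
print(least_squares_line(x, y))
```

A coefficient near +1 corresponds to points rising along a straight line, a coefficient near -1 to points falling along one, and a coefficient near 0 to no linear pattern.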

Example of a simple scatterplot

A medical researcher studies obesity in adolescent girls. Because body fat percentage is difficult
and expensive to measure directly, the researcher wants to determine whether the body mass
index (BMI)—a measurement that is easy to take—is a good predictor of body fat percentage.
The researcher collects BMI, body fat percentage, and other personal variables of 92 adolescent
girls.

As part of the initial investigation, the researcher creates a scatterplot of the body fat percentage
vs. BMI to evaluate the relationship between the two variables.

Procedure
1. Open the sample data, BodyFatPercentage.
2. Choose Graphs > Legacy Dialogs > Scatter/Dot > Simple Scatter > Define.
3. Under Y variables, enter %Fat.
4. Under X variables, enter BMI.
5. Click OK.

Output

Output with Regression

Interpret the results

The scatterplot of the BMI and body fat data shows a strong positive and linear relationship
between the two variables. Body mass index (BMI) may be a good predictor of body fat
percentage.

Example of a scatterplot (with regression and groups)

A quality engineer for a camera manufacturer wants to shorten the flash recovery time. Flash
recovery time is the least amount of time that is required between flashes. The engineer wants to
determine whether a relationship exists between the voltage that remains in the camera battery
immediately after a flash and the flash recovery time. The engineer also wants to determine
whether there are differences in flash recovery time between old and new formulations of the
battery. The engineer collects random samples of batteries made with the old and new
formulations. The engineer measures the volts remaining immediately after a flash and the flash
recovery time for each.

As part of the initial investigation, the engineer creates a scatterplot of volts remaining after a
flash versus flash recovery time, grouped by battery formulation, to assess the relationship
between the two variables for the two formulations.
Data: FlashRecoveryTime.xls (available in the Scatter Plot Sample Data folder).

Procedure
1. Open the sample data, FlashRecoveryTime.
2. Choose Graphs > Legacy Dialogs > Scatter/Dot > Simple Scatter > Define.
3. Under Y variables, enter Flash Recovery.
4. Under X variables, enter Volts After.
5. In Set Markers by, enter the categorical grouping variable, Formulation.
6. Click OK.

Outputs

Interpret the results
The scatterplot shows a negative linear relationship between the volts remaining after the flash
and the flash recovery time: as the voltage remaining increases, the recovery time decreases.
The new formulation appears to require a shorter flash recovery time than the old formulation.
