STATS 201
2. Frequency Distribution
● Definition: A table or graph that shows the number of times each value or group
of values occurs in a dataset.
● Types:
○ Ungrouped: Lists each distinct value and its frequency.
○ Grouped: Organizes data into class intervals and shows the frequency within
each interval. Includes class limits, class boundaries, class width, and class
midpoint.
○ Relative Frequency: Proportion of observations in each category/interval
(frequency / total number of observations).
○ Cumulative Frequency: Total number of observations up to and including a
specific category/interval.
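The definitions above can be sketched in a few lines of Python. This is a minimal illustration (the function name `frequency_table` and the sample data are hypothetical, not from the course material): it builds an ungrouped table with frequency, relative frequency (frequency / total), and cumulative frequency for each distinct value.

```python
from collections import Counter

def frequency_table(data):
    """Ungrouped frequency distribution: for each distinct value, return
    (value, frequency, relative frequency, cumulative frequency)."""
    counts = Counter(data)
    total = len(data)
    table = []
    cumulative = 0
    for value in sorted(counts):
        freq = counts[value]
        cumulative += freq  # running total up to and including this value
        table.append((value, freq, freq / total, cumulative))
    return table

# Example: 8 observations
scores = [3, 1, 2, 3, 3, 2, 1, 3]
for value, f, rel, cum in frequency_table(scores):
    print(value, f, rel, cum)
```

Note that the relative frequencies sum to 1 and the final cumulative frequency equals the sample size, which is a quick consistency check for any frequency table.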
3. Graphical Representation
● Purpose: To visually summarize and present data, making patterns and trends
easier to understand.
● Types:
○ Categorical Data:
■ Bar Chart: Compares frequencies of different categories (bars don't
touch).
■ Pie Chart: Shows proportions of different categories as slices of a circle.
○ Numerical Data:
■ Histogram: Represents the frequency distribution of continuous data (bars
touch).
■ Frequency Polygon: Line graph connecting the midpoints of the tops of
histogram bars.
■ Ogive (Cumulative Frequency Curve): Line graph showing cumulative
frequencies.
■ Scatter Plot: Shows the relationship between two quantitative variables.
■ Box Plot (Box and Whisker Plot): Displays the distribution of data based on
quartiles, median, and outliers.
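As a rough text-only sketch of how a histogram summarizes grouped numerical data, the snippet below counts observations per class interval and draws each bar with `#` characters (the function names and data are illustrative assumptions; a plotting library such as matplotlib would normally be used instead).

```python
def grouped_frequencies(data, low, width, n_classes):
    """Count observations falling in each class interval
    [low + i*width, low + (i+1)*width), for i = 0 .. n_classes-1."""
    freqs = [0] * n_classes
    for x in data:
        i = int((x - low) // width)
        if 0 <= i < n_classes:
            freqs[i] += 1
    return freqs

def ascii_histogram(freqs, low, width):
    """Render one text 'bar' per class interval (bars touch, as in a histogram)."""
    lines = []
    for i, f in enumerate(freqs):
        lo, hi = low + i * width, low + (i + 1) * width
        lines.append(f"{lo:5.1f}-{hi:5.1f} | {'#' * f}")
    return "\n".join(lines)

data = [12, 15, 17, 21, 22, 22, 25, 28, 31, 34]
print(ascii_histogram(grouped_frequencies(data, low=10, width=5, n_classes=5), 10, 5))
```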
5. Binomial Distribution
● Definition: A discrete probability distribution that describes the probability of
obtaining a certain number of successes in a fixed number of independent
Bernoulli trials (experiments with only two possible outcomes: success or failure).
● Conditions (BINS):
○ Binary outcome (success or failure).
○ Independent trials.
○ Number of trials is fixed (n).
○ Same probability of success (p) for each trial.
● Probability Mass Function: P(X=k) = C(n,k) · p^k · (1−p)^(n−k), where:
○ n = number of trials
○ k = number of successes
○ p = probability of success in a single trial
○ C(n,k) = n! / (k!(n−k)!) (binomial coefficient)
● Mean: μ=np
● Variance: σ2=np(1−p)
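The probability mass function, mean, and variance can be computed directly with the standard library (`math.comb` gives the binomial coefficient C(n,k)); the function name `binomial_pmf` and the n = 10, p = 0.5 example are illustrative assumptions.

```python
from math import comb

def binomial_pmf(k, n, p):
    """P(X = k) = C(n, k) * p^k * (1 - p)^(n - k)."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

n, p = 10, 0.5
mean = n * p                 # mu = np
variance = n * p * (1 - p)   # sigma^2 = np(1 - p)

# Probability of exactly 5 successes in 10 fair coin flips
print(binomial_pmf(5, n, p))  # → 0.24609375
```

Summing the PMF over k = 0, …, n gives 1, a useful sanity check that the distribution is valid.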
6. Hypothesis Testing
● Definition: A formal procedure used to determine whether there is enough
statistical evidence to reject a null hypothesis in favor of an alternative
hypothesis.
● Hypotheses:
○ Null Hypothesis (H0): A statement of no effect or no difference (the status
quo).
○ Alternative Hypothesis (H1 or Ha): A statement that contradicts the null
hypothesis (what the researcher wants to find evidence for). Can be
one-tailed (directional) or two-tailed (non-directional).
● Types of Errors:
○ Type I Error (False Positive, α): Rejecting a true null hypothesis. The
probability of making a Type I error is the significance level (α).
○ Type II Error (False Negative, β): Failing to reject a false null hypothesis. The
power of a test is 1−β, the probability of correctly rejecting a false null
hypothesis.
● Significance Level (α): The probability of rejecting the null hypothesis when it is
true (commonly 0.05).
● P-value: The probability of obtaining test results at least as extreme as the
observed results, assuming the null hypothesis is true. If the p-value is less than
α, we reject H0.
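As a worked example of the decision rule, the sketch below runs a two-tailed z-test (chosen here instead of a t-test so that only the standard library is needed; it assumes the population standard deviation is known). The function name `z_test_p_value` and the sample numbers are illustrative assumptions, not from the course material.

```python
from math import erf, sqrt

def z_test_p_value(sample_mean, mu0, sigma, n):
    """Two-tailed p-value for H0: mu = mu0, assuming known population sigma.
    Uses the standard normal CDF, Phi(z) = 0.5 * (1 + erf(z / sqrt(2)))."""
    z = (sample_mean - mu0) / (sigma / sqrt(n))
    # P(|Z| >= |z|) under the null distribution
    return 2 * (1 - 0.5 * (1 + erf(abs(z) / sqrt(2))))

# H0: mu = 50 vs H1: mu != 50, with sample mean 52, sigma 5, n = 30
p = z_test_p_value(52.0, 50.0, 5.0, 30)
alpha = 0.05
print(p, "reject H0" if p < alpha else "fail to reject H0")
```

Here p ≈ 0.028 < α = 0.05, so H0 is rejected; with a sample mean equal to μ0 the p-value would be 1 and H0 would not be rejected.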
● Parametric Tests: Statistical tests that assume the data follows a specific
distribution (usually normal) and make assumptions about population parameters.
Examples: t-tests, ANOVA, Pearson correlation.
● Non-Parametric Tests: Statistical tests that do not rely on specific distributional
assumptions. Used when data is not normally distributed or is ordinal/nominal.
Examples: Chi-square tests, Mann-Whitney U test, Kruskal-Wallis test, Spearman
correlation.
7. Correlation
● Definition: A statistical measure that describes the extent to which two or more
variables fluctuate together. It indicates the strength and direction of a linear
relationship.
● Types:
○ Positive Correlation: Both variables increase or decrease together (e.g.,
height and weight).
○ Negative Correlation: As one variable increases, the other decreases (e.g.,
study time and exam anxiety).
○ No Correlation: No linear relationship between the variables.
● Measures:
○ Pearson Correlation Coefficient (r): Measures the strength and direction of
a linear relationship between two continuous variables. Ranges from -1 to +1.
■ r=+1: Perfect positive correlation
■ r=−1: Perfect negative correlation
■ r=0: No linear correlation
○ Spearman's Rank Correlation Coefficient (ρ): Measures the strength and
direction of a monotonic relationship (not necessarily linear) between two
ordinal or continuous variables after ranking them.
● Advantages: Helps identify relationships between variables, useful for prediction
(in conjunction with regression).
● Importance: Provides insights into how variables are associated, guides further
research.
● Limitations: Correlation does not imply causation! Can be affected by outliers.
Only measures linear (Pearson) or monotonic (Spearman) relationships.
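The Pearson coefficient can be computed from its definition, r = Σ(x−x̄)(y−ȳ) / (√Σ(x−x̄)² · √Σ(y−ȳ)²). A minimal pure-Python sketch (the function name `pearson_r` and the sample data are illustrative assumptions):

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation coefficient for two equal-length samples."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))  # numerator
    sx = sqrt(sum((xi - mx) ** 2 for xi in x))
    sy = sqrt(sum((yi - my) ** 2 for yi in y))
    return cov / (sx * sy)

x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]   # perfectly linear increasing relationship
print(pearson_r(x, y))  # → 1.0 (perfect positive correlation)
```

Reversing y gives r = −1 (perfect negative correlation); Spearman's ρ is the same formula applied to the ranks of x and y rather than the raw values.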
8. Regression
● Definition: A statistical method used to model the relationship between a
dependent variable (outcome) and one or more independent variables
(predictors). It aims to predict the value of the dependent variable based on the
values of the independent variables.
● Types:
○ Simple Linear Regression: One independent variable predicts a dependent
variable. The model is a straight line: Y=a+bX, where:
■ Y = dependent variable
■ X = independent variable
■ a = y-intercept (value of Y when X=0)
■ b = slope (change in Y for a one-unit change in X)
○ Multiple Linear Regression: Two or more independent variables predict a
dependent variable.
● Purpose: Prediction, explanation of relationships between variables.
● Assumptions of Linear Regression: Linearity, independence of errors,
homoscedasticity (constant variance of errors), normality of errors.
● R-squared (R2): Coefficient of determination, represents the proportion of the
variance in the dependent variable that is predictable from the independent
variable(s). Ranges from 0 to 1.
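Simple linear regression can be fitted with the least-squares formulas b = Σ(x−x̄)(y−ȳ) / Σ(x−x̄)² and a = ȳ − b·x̄, with R² = 1 − SS_res/SS_tot. A minimal sketch (the function name and data are illustrative assumptions):

```python
def simple_linear_regression(x, y):
    """Least-squares fit of Y = a + bX; returns (a, b, r_squared)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    b = sxy / sxx                      # slope: change in Y per unit change in X
    a = my - b * mx                    # intercept: Y when X = 0
    ss_res = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - my) ** 2 for yi in y)
    r2 = 1 - ss_res / ss_tot           # coefficient of determination
    return a, b, r2

x = [1, 2, 3, 4]
y = [3, 5, 7, 9]   # exactly y = 1 + 2x, so a = 1, b = 2, R^2 = 1
print(simple_linear_regression(x, y))
```

For a perfectly linear dataset R² = 1; with noisy data R² falls below 1, reflecting the share of variance left unexplained by the line.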