
P values

Definition of a P value

Consider an experiment where you've measured values in two samples, and the means are different. How sure are you that the population means are different as well? There are two possibilities:

- The populations have different means.
- The populations have the same mean, and the difference you observed is a coincidence of random sampling.

The P value is a probability, with a value ranging from zero to one. It is the answer to this question: if the populations really have the same mean overall, what is the probability that random sampling would lead to a difference between sample means as large as (or larger than) the one you observed?

How are P values calculated?

There are many methods, and you'll need to read a statistics text to learn about them. The choice of statistical test depends on how you express the results of an experiment (measurement, survival time, proportion, etc.), on whether the treatment groups are paired, and on whether you are willing to assume that measured values follow a Gaussian bell-shaped distribution.

Common misinterpretation of a P value

Many people misunderstand what question a P value answers. If the P value is 0.03, that means there is a 3% chance of observing a difference as large as the one you observed even if the two population means are identical. It is tempting to conclude, therefore, that there is a 97% chance that the difference you observed reflects a real difference between populations and a 3% chance that the difference is due to chance. Wrong. What you can say is that random sampling from identical populations would lead to a difference smaller than you observed in 97% of experiments and larger than you observed in 3% of experiments. You have to choose. Would you rather believe in a 3% coincidence? Or that the population means are really different?
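To make this concrete, here is a minimal simulation sketch (not the method of any particular statistics package): it repeatedly draws two samples from identical populations and counts how often random sampling alone produces a difference between sample means at least as large as the one observed. The sample sizes, standard deviation, and observed difference are made-up values for illustration.

import numpy as np

rng = np.random.default_rng(0)
n_per_group = 10        # subjects per group (made up)
sd = 1.0                # common standard deviation under the null hypothesis
observed_diff = 0.9     # the difference between sample means you actually observed

# Simulate many experiments in which the null hypothesis is true:
# both samples come from the same Gaussian population.
n_experiments = 100_000
a = rng.normal(0.0, sd, size=(n_experiments, n_per_group))
b = rng.normal(0.0, sd, size=(n_experiments, n_per_group))
diffs = a.mean(axis=1) - b.mean(axis=1)

# Two-tail P value: the fraction of null experiments whose difference is at
# least as large as the observed one, in either direction.
p_two_tail = np.mean(np.abs(diffs) >= observed_diff)
print(f"simulated two-tail P value: {p_two_tail:.3f}")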

"Extremely significant" results

Intuitively, you probably think that P=0.0001 is more statistically significant than P=0.04. Using strict definitions, this is not correct. Once you have set a threshold P value for statistical significance, every result either is statistically significant or is not statistically significant. Some statisticians feel very strongly about this. Many scientists are not so rigid, and refer to results as being "very significant" or "extremely significant" when the P value is tiny. Often, results are flagged with a single asterisk when the P value is less than 0.05, with two asterisks when it is less than 0.01, and with three asterisks when it is less than 0.001. This is not a firm convention, so when you see asterisks you need to check the figure legend to find the definitions the author used.

One- vs. two-tail P values

When comparing two groups, you must distinguish between one- and two-tail P values. Start with the null hypothesis that the two populations really are the same and that the observed discrepancy between sample means is due to chance.

The two-tail P value answers this question: assuming the null hypothesis, what is the chance that randomly selected samples would have means as far apart as observed in this experiment, with either group having the larger mean?

To interpret a one-tail P value, you must predict which group will have the larger mean before collecting any data. The one-tail P value answers this question: assuming the null hypothesis, what is the chance that randomly selected samples would have means as far apart as observed in this experiment, with the specified group having the larger mean?
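As a sketch of the two questions above, the snippet below computes both P values for two small invented samples using SciPy's independent-samples t test (the alternative argument requires SciPy 1.6 or later); the data and the predicted direction are assumptions made only for illustration.

from scipy import stats

control   = [4.1, 3.8, 4.5, 4.0, 4.2, 3.9]
treatment = [4.6, 4.9, 4.4, 5.1, 4.8, 4.7]

# Two-tail: a difference in either direction counts as "at least this extreme".
two_tail = stats.ttest_ind(treatment, control, alternative="two-sided")

# One-tail: only differences in the predicted direction (treatment > control) count.
# Legitimate only if that direction was predicted before the data were collected.
one_tail = stats.ttest_ind(treatment, control, alternative="greater")

print(f"two-tail P = {two_tail.pvalue:.4f}")
print(f"one-tail P = {one_tail.pvalue:.4f}")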

A one-tail P value is appropriate only when previous data, physical limitations or common sense tell you that a difference, if any, can only go in one direction. The issue is not whether you expect a difference to exist - that is what you are trying to find out with the experiment. The issue is whether you should interpret increases and decreases the same way. You should only choose a one-tail P value when you believe both of the following:

- Before collecting any data, you can predict which group will have the larger mean (if the means are in fact different).
- If the other group ends up with the larger mean, you should be willing to attribute that difference to chance, no matter how large the difference.

It is usually best to use a two-tail P value for these reasons:

- The relationship between P values and confidence intervals is clearer with two-tail P values.
- Some tests compare three or more groups, which makes the concept of tails inappropriate (more precisely, the P values have many tails). A two-tail P value is more consistent with the P values reported by these tests.
- Choosing a one-tail P value can pose a dilemma. What would you do if you chose a one-tail P value, but observed a large difference in the opposite direction to the experimental hypothesis? To be rigorous, you should conclude that the difference is due to chance and is not statistically significant. But most people would be tempted to switch to a two-tail P value or to reverse the direction of the experimental hypothesis. You avoid this situation by always using two-tail P values.
Statistical hypothesis testing

The P value is a fraction. In many situations, the best thing to do is report that number to summarize the results of a comparison. If you do this, you can totally avoid the term "statistically significant", which is often misinterpreted. In other situations, you'll want to make a decision based on a single comparison. In these situations, follow the steps of statistical hypothesis testing:

1. Set a threshold P value before you do the experiment. Ideally, you should set this value based on the relative consequences of missing a true difference or falsely finding a difference. In fact, the threshold value (called alpha) is traditionally almost always set to 0.05.
2. Define the null hypothesis. If you are comparing two means, the null hypothesis is that the two populations have the same mean.
3. Do the appropriate statistical test to compute the P value.
4. Compare the P value to the preset threshold value. If the P value is less than the threshold, state that you "reject the null hypothesis" and that the difference is "statistically significant". If the P value is greater than the threshold, state that you "do not reject the null hypothesis" and that the difference is "not statistically significant".

Note that statisticians use the term hypothesis testing very differently than scientists do.
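Here is a minimal sketch of those four steps in code, assuming a two-sample comparison and SciPy's t test; alpha, the choice of test, and the data are assumptions made only for illustration.

from scipy import stats

alpha = 0.05                                   # 1. threshold set before the experiment

# 2. Null hypothesis: the two populations have the same mean.
group_a = [23.1, 25.4, 24.8, 22.9, 26.0]       # invented measurements
group_b = [27.2, 28.1, 26.5, 29.0, 27.8]

# 3. Do the appropriate statistical test to compute the P value.
p_value = stats.ttest_ind(group_a, group_b).pvalue

# 4. Compare the P value to the preset threshold.
if p_value < alpha:
    print(f"P = {p_value:.4f} < {alpha}: reject the null hypothesis (statistically significant)")
else:
    print(f"P = {p_value:.4f} >= {alpha}: do not reject the null hypothesis (not statistically significant)")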

Statistical significance

The term significant is seductive, and it is easy to misinterpret it. A result is said to be statistically significant when the P value is less than a preset threshold value, that is, when the result would be surprising if the populations were really identical. It is easy to read far too much into the word significant because the statistical use of the word has a meaning entirely distinct from its usual meaning. Just because a difference is statistically significant does not mean that it is important or interesting. And a result that is not statistically significant (in the first experiment) may turn out to be very important. If a result is statistically significant, there are two possible explanations:

- The populations are identical, so there really is no difference. You happened to randomly obtain larger values in one group and smaller values in the other, and the difference was large enough to generate a P value less than the threshold you set. Finding a statistically significant result when the populations are identical is called making a Type I error.
- The populations really are different, so your conclusion is correct.

There are also two explanations for a result that is not statistically significant:

- The populations are identical, so there really is no difference. Any difference you observed in the experiment was a coincidence. Your conclusion of no significant difference is correct.
- The populations really are different, but you missed the difference due to some combination of small sample size, high variability and bad luck. The difference in your experiment was not large enough to be statistically significant. Finding results that are not statistically significant when the populations are different is called making a Type II error.
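The sketch below makes the two error types concrete by simulation: it runs many t tests on samples drawn either from identical populations (so every rejection is a Type I error) or from genuinely different populations (so every failure to reject is a Type II error). The sample size, effect size, and alpha are made-up values for illustration.

import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
alpha, n, n_sims = 0.05, 10, 5_000

def rejection_rate(true_diff):
    # Fraction of simulated experiments whose P value falls below alpha.
    rejections = 0
    for _ in range(n_sims):
        a = rng.normal(0.0, 1.0, n)
        b = rng.normal(true_diff, 1.0, n)
        if stats.ttest_ind(a, b).pvalue < alpha:
            rejections += 1
    return rejections / n_sims

# Identical populations: any rejection is a Type I error (expect roughly 5%).
print("Type I error rate :", rejection_rate(true_diff=0.0))

# Truly different populations: any failure to reject is a Type II error.
print("Type II error rate:", 1 - rejection_rate(true_diff=1.0))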

Confidence intervals
Statistical calculations produce two kinds of results that help you make inferences about the populations from the samples. You've already learned about P values. The second kind of result is a confidence interval.

95% confidence interval of a mean

Although the calculation is exact, the mean you calculate from a sample is only an estimate of the population mean. How good is the estimate? It depends on how large your sample is and how much the values differ from one another. Statistical calculations combine sample size and variability to generate a confidence interval for the population mean. You can calculate intervals for any desired degree of confidence, but 95% confidence intervals are used most commonly. If you assume that your sample is randomly selected from some population, you can be 95% sure that the confidence interval includes the population mean. More precisely, if you generate many 95% CIs from many data sets, you expect the CI to include the true population mean in 95% of the cases and not to include the true mean value in the other 5%. Since you don't know the population mean, you'll never know for sure whether or not your confidence interval contains the true mean.
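As a sketch of the calculation, the snippet below computes a 95% confidence interval for a population mean from a small invented sample, using the t distribution via SciPy; the data are assumptions made only for illustration.

import numpy as np
from scipy import stats

sample = np.array([9.8, 10.4, 10.1, 9.5, 10.9, 10.2, 9.9, 10.6])   # invented measurements

mean = sample.mean()
sem = stats.sem(sample)   # standard error of the mean (combines variability and sample size)

# 95% confidence interval for the population mean, based on the t distribution.
ci_low, ci_high = stats.t.interval(0.95, len(sample) - 1, loc=mean, scale=sem)

print(f"sample mean = {mean:.2f}, 95% CI = ({ci_low:.2f}, {ci_high:.2f})")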

Other situations

When comparing groups, calculate the 95% confidence interval for the difference between the population means. Again, the interpretation is straightforward. If you accept the assumptions, there is a 95% chance that the interval you calculate includes the true difference between population means. Methods exist to compute a 95% confidence interval for any calculated statistic, for example the relative risk or the best-fit value in nonlinear regression. The interpretation is the same in all cases: if you accept the assumptions of the test, you can be 95% sure that the interval contains the true population value. Or more precisely, if you repeat the experiment many times, you expect the 95% confidence interval to contain the true population value in 95% of the experiments.

Why 95%?

There is nothing special about 95%. It is just convention that confidence intervals are usually calculated for 95% confidence. In theory, confidence intervals can be computed for any degree of confidence. If you want more confidence, the intervals will be wider. If you are willing to accept less confidence, the intervals will be narrower.
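Here is a small simulation sketch of that "repeat the experiment many times" interpretation: draw many samples from a population with a known mean and count how often the computed interval contains it. The population parameters, sample size, and number of repeats are made-up values for illustration; note that the 99% intervals, being wider, contain the true mean more often.

import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
true_mean, n, n_experiments = 50.0, 12, 10_000

def coverage(confidence):
    # Fraction of experiments whose confidence interval contains the true mean.
    hits = 0
    for _ in range(n_experiments):
        sample = rng.normal(true_mean, 5.0, n)
        low, high = stats.t.interval(confidence, n - 1,
                                     loc=sample.mean(), scale=stats.sem(sample))
        hits += (low <= true_mean <= high)
    return hits / n_experiments

print("95% CI coverage:", coverage(0.95))   # expect about 0.95
print("99% CI coverage:", coverage(0.99))   # wider intervals, expect about 0.99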

What is "Statistical Significance" (p-value)?


The statistical significance of a result is the probability that the observed relationship (e.g., between variables) or difference (e.g., between means) in a sample occurred by pure chance ("luck of the draw"), and that in the population from which the sample was drawn, no such relationship or difference exists. In less technical terms, the statistical significance of a result tells us something about the degree to which the result is "true" (in the sense of being "representative of the population"). More technically, the p-value represents a decreasing index of the reliability of a result (see Brownlee, 1960). The higher the p-value, the less we can believe that the observed relation between variables in the sample is a reliable indicator of the relation between the respective variables in the population. Specifically, the p-value represents the probability of error involved in accepting our observed result as valid, that is, as "representative of the population." For example, a p-value of .05 (i.e., 1/20) indicates that there is a 5% probability that the relation between the variables found in our sample is a "fluke." In other words, assuming that in the population there was no relation between those variables whatsoever, and we repeated experiments such as ours one after another, we could expect that in approximately one of every 20 replications the relation between the variables in question would be equal to or stronger than in ours. (Note that this is not the same as saying that, given that there IS a relationship between the variables, we can expect to replicate the results 5% or 95% of the time; when there is a relationship between the variables in the population, the probability of replicating the study and finding that relationship is related to the statistical power of the design. See also Power Analysis.) In many areas of research, a p-value of .05 is customarily treated as a "borderline acceptable" error level.
