0% found this document useful (0 votes)
83 views

MATH408: Probability & Statistics Summer 1999 Week 1

This document outlines the objectives and goals of the MATH408 Probability & Statistics course taught in the summer of 1999. The course aims to develop an understanding of probability, statistics, and their application to process improvement. It will cover topics like data analysis, probability distributions, confidence intervals, hypothesis testing, and statistical software. Students will apply concepts through projects and learn to use MINITAB for statistical analysis. Examples are provided to illustrate statistical experimentation and applications in various fields.

Uploaded by

Erin Walker
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
83 views

MATH408: Probability & Statistics Summer 1999 Week 1

This document outlines the objectives and goals of the MATH408 Probability & Statistics course taught in the summer of 1999. The course aims to develop an understanding of probability, statistics, and their application to process improvement. It will cover topics like data analysis, probability distributions, confidence intervals, hypothesis testing, and statistical software. Students will apply concepts through projects and learn to use MINITAB for statistical analysis. Examples are provided to illustrate statistical experimentation and applications in various fields.

Uploaded by

Erin Walker
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 54

MATH408: Probability & Statistics

Summer 1999
WEEK 1
Dr. Srinivas R. Chakravarthy
Professor of Mathematics and Statistics
Kettering University
(GMI Engineering & Management Institute)
Flint, MI 48504-4898
Phone: 810.762.7906
Email: [email protected]
Homepage: www.kettering.edu/~schakrav
OBJECTIVES & GOALS

• Develop an understanding and need for the


use of probability and statistics in process
improvement.
• Develop an understanding between
variation and the quality of a product.
• Develop a thorough understanding of basic
concepts in probability and statistics.
OBJECTIVES & GOALS (cont'd)

• Get a proper insight into data collection,


analyzing the data and interpreting the data.
• Get exposed to basic probability
distributions such as binomial, Poisson and
normal (or Gauss), student’s t, chi-square,
and Fisher’s F.
• Know how to construct confidence intervals
and interpret these.
OBJECTIVES & GOALS (cont'd)

• Know the meaning of testing hypotheses.


• Exposed to basic techniques in ANOVA.
• Develop an understanding of regression
analysis.
• Get exposed to basic Design of
experiments.
OBJECTIVES & GOALS (cont'd)
• Develop an understanding of statistical process
control and process capability.
• Be able to use statistical package such as MINITAB
and be familiar with the commands needed to use
the statistical tools seen in the course.
• The statistical package will be fully integrated into
the course and regular laboratory classes will give
hands-on experience with the software and the
statistical tools.
OBJECTIVES & GOALS (cont'd)

• Practical data sets will be used throughout


the course and a detailed term project will
be required as part of the course.
• A number of illustrative examples using
practical data and former students’
projects will be presented.
OBJECTIVES & GOALS (cont'd)

During the course, the students will be able to


• apply the concepts in practice.
• complete class projects and a detailed term
project.
• use MINITAB.
TEXTBOOK
Engineering Statistics
D. C. Montgomery, G. C. Runger & N. F. Hubele.
• SOFTWARE: MINITAB for Windows,
Release 11/12.
• Detailed outline of topics to be covered can
be seen in your handout. You are highly
encouraged to go through these before the
class.
FIRST WEEK
• What is Applied Statistics?
• Applications from various fields.
• What is statistics?
• What is probability?
• Relationship between probability and
Statistics.
What is Applied Statistics?
• Collection of (statistical) techniques used in
practice.
• Range from very simple ones such as
graphical display, summary statistics, and
time-series plots, to sophisticated ones such
as design of experiments, regression
analysis, principal component analysis, and
statistical process control.
Applied Statistics (cont'd)
• Successful application of statistical methods
depends on the close interplay between
theory and practice.
• There should be interplay (communication
and understanding) between engineers and
statisticians.
Applied Statistics (cont'd)
• Engineers should have adequate statistics
background to (a) know what questions to
ask; (b) mix engineering concepts with
statistics to optimize productivity; (c) get
help and understand the implementation.
Applied Statistics (cont'd)
• The object of statistical methods is to make
the scientific process as efficient as
possible. Thus, the process will involve
several iterations, each of which will consist
of an “hypothesis”, data collection, and
“inference”. The iterations stop when
satisfactory results are obtained.
WHY WE NEED STATISTICS?

• Quality is something we all look for in any


product or service we get.
– What is Quality?
– It is not static and changes with time.
– Continuous quality improvement program is a
MUST to stay competitive in these days.
NEED STATISTICS (cont'd)
– Final quality and cost of a product are pretty
much dependent on the (engineering) designs
and the manufacture of the products.
– Variability is present in machines, materials,
methods, people, environment, and
measurements.
– Manufacturing a product or providing a service
involves at least one of the above 6 items (may
be some other items in addition to these)
NEED STATISTICS (cont'd)
– Need to understand the variability.
– Statistically designed experiments are used to
find the optimum settings that improve the
quality.
– In every activity, we see people use (or abuse?)
statistics to express satisfaction (or
dissatisfaction) towards a product.
– There is no such a thing as good statistics or bad
statistics.
NEED STATISTICS (cont'd)

– It is the people who report the statistics


manipulate the numbers to their
advantage.
– Statistics properly used will be more
productive.
EXPLORE, ESTIMATE and
CONFIRM
• Statistical experiments are carried out to
• EXPLORE: gather data to study more
about the process or the product.
• ESTIMATE: use the data to estimate
various effects.
• CONFIRM: gather additional data to
verify the hypotheses.
EXAMPLE 1 (EEC)
• Bonding Example: An engineer working for a
chemical company has the following diary of
activities with regard to a “new bonding
method” that is under consideration by the
company.
• Hypothesis 1: A new bonding method to bond
two films is expected to yield a higher bonding
strength compared to the current method.
EXAMPLE 1 (cont'd)
• KEY FACTORS: Bonding glue,
Temperature, Density and thickness of the
films, and Pressure setting.
• Experiment 1: Two films were bonded
together by choosing bonding glue type A,
temperature level to be 300oC, the thickness
of the two films to be 4 mils, and a pressure
setting to be 200 psi.
EXAMPLE 1 (cont'd)
• Data 1: The bonding strength measured
was lower than the current method.
• Question 1: Why is data 1 not supportive of
the hypothesis 1?
• Induction 1: The temperature setting may
be low causing the glue to perform at below
optimum level.
EXAMPLE 1 (cont'd)
• Experiment 2: Three sets of two films were
bonded together by choosing bonding glue type A,
the thickness of the two films to be 4 mils, and a
pressure setting to be 200 psi. The temperature
settings for these three sets were taken to be
400oC, 450oC and 500oC, respectively.
• Data 2: The bonding strengths for the three
specimens were as follows:
EXAMPLE 1 (cont'd)
– At 400oC the strength was still lower than the
current one;
– At 450oC the strength was higher than the
current one;
– At 500oC the strength was lower than the
current one;
EXAMPLE 1 (cont'd)
• Induction 2: The temperature setting at 450oC
seems to give a better bonding strength when all
other variables are set at the above mentioned
levels.
• The above investigation in various steps illustrates
the basic ideas in a statistical experiment
conducted in a scientific way.
EXAMPLE 1 (cont'd)
• The remaining series of steps, with possible
modifications including varying the settings of the
variables simultaneously, form the basis of an
experimental design. This will be seen in great
detail later.
EXAMPLE 1 (cont'd)
BASIC IDEAS
• Constraint: the films should not peel off under
“normal” usage.
• Key variables: bonding glue, temperature, density
and thickness of the films, and pressure setting.
• Goal: the effectiveness of such bonding method.
• Procedure: All possible configurations in actual
production setup should be considered in the
study.
EXAMPLE 1 (cont'd)
• EXPLORE: Bond specimens of films at several
settings and measure the bonding strength.
• ESTIMATION: Suppose our study shows that
the bonding strength is affected by glue,
temperature and setting, then we would like to
estimate the strength.
• CONFIRMATION: Once we find the optimal
settings, we run additional experiments to verify
that the settings are in fact “best”.
EXAMPLE 1 (cont'd)
• Recommendation: If the study is done scientifically,
then we may have one of the following:
• (a) Continue with the production.
• (b) Not to use the method.
• (c) Suggest appropriate modification in the process.
• However, if it is not scientifically done, the
conclusion may be totally false.
APPLICATIONS
• Statistical methods have applications in many
areas: industrial, medical, behavioral, sociological
and economic.
• General principles and strategies to be adopted in
these areas will all be the same. However, certain
problems can call for some special techniques.
• Some detailed engineering applications are given
in the handout. You may want to add more to
these as we go along.
BRAINSTORMING SESSION
• This is a starting point for any analysis, more
so in a statistical study.
• Gather information about the problem by
assembling a group of people involved.
• Simple statement of the problem; get all ideas;
group these into several classes.
• Draw a cause-and-effect diagram. The
following is an example.
Cause-and-effect Diagram
Cause-and-effect Diagram for Gas Mileage

Measurements Materials Men


Driver Training
Fuel tank Gasoline type

Driver
Speedometer type oil type
Number of passengers

Gage for pressure fuel additive

GAS MILEAGE
Engine Warm-up time Tire type
Weather conditions
Tune-ups Transmission type
Tire pressure
Tire rotation
Terrain Air conditioner
Ride breaks
Front/Rear drive
Maintenance Weight
Highway or city
Fast start carburator

Environment Methods Machines


BASIC CONCEPTS IN
STATISTICS
• What is a variable?
• What is data?
• How to collect data?
• What do we do with the data?
STATISTICS(cont'd)
• Why investigate relationship about
variables?
• How to use Statistics?
• What is Exploratory Data Analysis?
• What is descriptive statistics?
• What is inferential statistics?
MINITAB
• We will go to the laboratory (Applied
Mathematics Lab) to give a brief
introduction to MINITAB.
• Make sure that you bring your class
handout on MINITAB to the lab.
OBSERVATIONAL STUDIES
• The objectives here are to establish the
current process (or the performance of the
process or equipment), to identify areas, if
any, for improvement, to identify sources of
variation, and to set the direction for further
experimentation, if needed. This study is
also referred to as passive data collection.
EXPERIMENTAL STUDIES
• In this the study is conducted through a
designed experiment. Here data is collected
on the process under study by deliberately
varying the controllable variables and then
inferences are made on the process.
Usually, a sequence of experimental study
is conducted before a product is made.
WHAT IS DATA?
• Data is collection of information pertaining to a
specific problem under study.
• For example, in a study of MPG of a new model
car, the data would be the miles per gallon of the
cars that were tested.
• Suppose we are interested in the braking distance
(at 35mph) of that particular model car, then the
data would comprise the braking distances of the
tested cars.
DATA (cont'd)
• Study the income level of people in a city (to see
whether it is profitable to start a new business), data
would be the income of all people living in the city.
• A new drug is being planned and the interest would
be to see the reception for it. The company
performs a pilot study through contacting a number
of physicians and gathers information (data) to see
the impact of the drug.
• The data can be quantitative or qualitative.
DATA (cont'd)
• Variables, such as the MPG of a new model
car, number of defective in a lot sampled, the
weight of a cereal box, etc, is quantitative.
• Quantitative variable can be discrete or
continuous.
• Variables, which cannot be quantified such
as the color of the eyes, location, etc., are
classified as qualitative variables.
DATA (cont'd)
• A qualitative variable which can be ordered
(according to some scale) is referred to as
ordinal.
• An unordered qualitative variable (such as the
color of the hair) is referred to as nominal.
• In dealing with data one has to be aware of
major types of problems such as data errors,
outliers and missing observations.
DATA (cont'd)
• A data error is an observation that is
incorrectly recorded.
• Recording error, typing error, transcription
(copying) error, repetition error and
deliberate (falsification) error.
• An outlier is an observation that falls away
from the rest of the data.
DATA (cont'd)
• Missing observations arise for a number of reasons.
• In response to a questionnaire people may forget to
answer some questions.
• In agricultural experiments the crops may suddenly
die in some plots leading to no yield, which cannot
be taken as 0 yield.
• Some analysis becomes more involved due to
missing observations.
DATA (cont'd)
• There are two kinds of data: raw and
grouped.
• Raw data: not compiled in any way.
• Grouped data: classified into several groups
or classes according to some criteria.
UNI- AND MULTI-VARIATE
DATA
• Study of only on one variable, such as the
MPG of a new model car as a function of
the size of the car then we are dealing with
univariate data.
• Study deals with more than one variable at
a time, then we are dealing with
multivariate data.
UNI- AND MULTI-VARIATE
DATA
• Study of MPG as a function of the engine
size, HP, passenger capacity, fuel capacity,
etc, then the study deals with multivariate
data.
MULTIVARIATE ANALYSIS
• Deals with study involving simultaneous
measurements on many variables.
• Multivariate statistical techniques differ
from univariate in the sense that the
attention is drawn away from the analysis of
mean and variance of a single variable.
MULTIVARIATE ANALYSIS
(cont'd)
• Instead, the attention is focused on:
• There are several multivariate techniques
available for investigating the above three
areas.
• These include:
– (a) multiple regression;
– (b) discriminant analysis;
MULTIVARIATE ANALYSIS
(cont'd)
– (c) multivariate ANOVA;
– (d) correlation analysis;
– (e) logit analysis;
– (e) principal component analysis;
– (f) factor analysis;
– (g) cluster analysis;
– (h) metric multidimensional scaling.
HOW TO USE STATISTICS
(efficiently)?
• What is the main objective of the study?
• Then, we ask:
– (a) What information is available on this
problem?
– (b) Do we have data on this problem? If so how
the data was selected?
– (c) Has any study been done on this problem
before?
INVESTIGATION STAGES
• Proper statistical study of a problem
involves:
– 1. Understanding of the problem and the goals
of the study.
– 2. Determine the type of data to be used for the
study.
– 3. Assess the structure and the quality of the
data.
INVESTIGATION STAGES
(cont'd)
– 4. Perform an initial examination of the data.
– 5. Carry out a number of formal statistical
procedures.
– 6. Compare with any previous findings.
– 7. Summarize the findings through report
writings and presentations.
POPULATION
• Population: is a collection of all units defined by
some characteristic, which is the subject under
study.
• In the study of the MPG of a new model car, the
population consists of the MPG's of all cars of that
model.
• To study the income level of a particular city the
population consists of the incomes of all working
people in that city.
POPULATION (cont’d)
• Parameter: is a fixed but unknown
quantity.
• Examples: mean, standard deviation, range,
median, mode, proportion.
POPULATION

SAMPLE

You might also like