Chapter 1 Statistic
Chapter 1 Statistic
What is Statistics? The word statistics derives from classical Latin roots, status which means state. Statistics has become the universal language of the sciences. As potential users of statistics, we need to master both the science and the art of using statistical methodology correctly. These method include:
Carefully defining the situation Gathering data Accurately summarizing the data Deriving and communicating meaningful conclusions
Specific definition: Statistics is a collection of procedures and principles for gathering data and analyzing information to help people make decisions when faced with uncertainty.
CHAPTER 1 INTRODUCTION
Nowadays statistics is used in almost all fields of human effort such as: education health business agriculture..etc. Example applications of Statistics Sport => A statistician may keeps records of the number of hits a baseball player gets in a season. Financial => Financial advisor uses several statistic information to make reliable predictions in investment. Public Health => An administrator would be concerned with the number of residents who contract a new strain of flu virus during a certain year. Others => Economics The national gross profit for year 2007 was increase by 2.2%
CHAPTER 1 INTRODUCTION
Statistics has Two Aspects 1) Theoretical / Mathematical Statistics 2) Applied Statistics
Theoretical / Mathematical Statistics => Deals with the development, derivation and proof of statistical theorems, formulas, rules and laws. Applied Statistics => Involves the applications of those theorems, formulas, rules and laws to solve real world problems.
CHAPTER 1 INTRODUCTION
Applied Statistics can be divided into two main areas, depending on how data are used.
(2) Inferential statistics Refers to the technique of interpreting the values resulting from the descriptive techniques and making decisions and drawing conclusions about the population
CHAPTER 1 INTRODUCTION
ASPECTS OF STATISTICS
Theoretical/Mathematical Statistics Deals with the development, derivation and proof of statistical theorems, formulas, rules and laws.
Applied Statistics Involves the applications of those theorems, formulas, rules and laws to solve real world problems. Inferential Statistics Consist of methods that use results obtained from sample to make decisions or conclusions about a population
Descriptive Statistics
CHAPTER 1 INTRODUCTION
Example 1
Determine which of the following statements is descriptive in nature and which is inferential. Of all U.S kindergarten teachers, 32% say that knowing the alphabet is an essential skill.
inferential
CHAPTER 1 INTRODUCTION
Why do we have to study statistics? To read and understand various statistical studies in related field. To communicate and explain the results of study in related field using our own words. To become better consumers and citizens.
CHAPTER 1 INTRODUCTION
Basic Terms of Statistics
Population versus Sample Population a collection of all individuals about which information is desired. -individuals are usually people but could also be schools, cities, pet dogs, agriculture fields, etc. there are two kinds of population: - When the membership of a population can be (or could be) physically listed. finite population:- e.g. the books in library. - When the membership is unlimited. infinite population:- e.g. the population of all people who might use aspirin. Sample a subset of the population.
CHAPTER 1 INTRODUCTION
Parameter versus Statistic
Parameter a numerical value summarizing all the data of an entire population. - often a Greek letter is used to symbolize the name of parameter. e.g. the average age at time of admission for all students who have ever attended our college. Statistic a numerical value summarizing the sample data. - English alphabet is used to symbolize the name of statistic e.g. the average height, found by using the set of 25 heights.
CHAPTER 1 INTRODUCTION
Variable a characteristics of interest about each individual element of a population or sample. e.g. a students age at entrance into college, the color of students hair, etc.
Data value the value of variable associated with one element of a population or sample. This value may be a number, a word, or a symbol. e.g. Farah entered college at age 23, her hair is brown, etc.
Data
the set of values collected from the variable from each of the elements that belong to sample. e.g. the set of 25 heights collected from 25 students.
CHAPTER 1 INTRODUCTION
Example 2
A statistics student is interested in finding out something
about the average ringgit value of cars owned by the faculty members of our university. Each of the seven terms just describe can be identified in this situation. i) population: the collection of all cars owned by all faculty members at our university. ii) sample: any subset of that population. For example, the cars owned by members the statistics department. iii) variable: the ringgit value of each individual car.
CHAPTER 1 INTRODUCTION
iv) data value: one data value is the ringgit value of a particular car. Alis car, for example, is value at RM 45 000. v) data: the set of values that correspond to the sample obtained (45,000; 55,000; 34,0000;). vi) parameter: which we are seeking information is the average value of all cars in the population. vii) statistic: will be found is the average value of the cars in the sample.
Census: a survey includes every element in the population. Sample survey: a survey includes every element in
CHAPTER 1 INTRODUCTION
Types of Variables
CHAPTER 1 INTRODUCTION
Quantitative (numerical) Variables
A variable that quantifies an element of a population. - e.g. the total cost of textbooks purchased by each student for this semesters classes. Arithmetic operations such as addition and averaging are meaningful for data that result from a quantitative variable. Can be subdivided into two classification: discrete variables and continuous variables.
CHAPTER 1 INTRODUCTION
Discrete Variables
A quantitative variable that can assume a countable number of values. Can assume any values corresponding to isolated points along a line interval. That is, there is a gap between any two values.
Example 3
CHAPTER 1 INTRODUCTION
Continuous Variables
A quantitative variable that can assume an uncountable number of values. Can assume any value along a line interval, including every possible value between any two values.
Example 4
Weight of books and supplies you are carrying as you attend class today.
CHAPTER 1 INTRODUCTION
Qualitative (attribute, categorical) variables A variable that describes or categorizes an element of a population.
Example 5 A sample of four hair-salon customers was surveyed for their hair color, hometown and level of satisfaction.
CHAPTER 1 INTRODUCTION
Exercise 1 1. Of the adult U.S. population, 36% has an allergy. A sample of 1200 randomly selected adults resulted in 33.2% reporting an allergy.
What is sample?
1200
CHAPTER 1 INTRODUCTION
CHAPTER 1 INTRODUCTION
2. The faculty members at Universiti Utara Malaysia were surveyed on the question How satisfied were you with this semester schedule? Their responses were to be categorized as very satisfied, somewhat satisfied, neither satisfied nor dissatisfied, somewhat dissatisfied, or very dissatisfied.
Name the variable interest. satisfaction Identify the type of variable. qualitative
CHAPTER 1 INTRODUCTION
3. Identify each of the following as an example of (1) attribute (qualitative) or (2) numerical (quantitative) variables.
The breaking strength of a given type of string The hair color of children auditioning for the musical Annie. The number of stop signs in town of less than 500 people. Whether or not a faucet is defective. The number of questions answered correctly on a standardized test. The length of time required to answer a telephone call at a certain real estate office.
CHAPTER 1 INTRODUCTION
DATA The set of values collected from the variable from each of the elements that belong to sample.
e.g. the set of 25 heights collected from
25 students.
From a survey or an experiment.
CHAPTER 1 INTRODUCTION
Two types of data:
Secondary data: data obtained from published material by governmental, industrial or individual sources
CHAPTER 1 INTRODUCTION
PRIMARY DATA
Primary Data Collection Techniques Data is collected by researcher Data is obtained from respondent (i) Face to face interview -Two ways communication. - Researcher(s) asks question directly to respondent(s). Advantages: Precise answer. Appropriate for research that requires huge data collection. Increase the number of answered questions. Disadvantages: Expensive. Interviewer might influence respondents responses. Respondent refuse to answer sensitive or personal question.
CHAPTER 1 INTRODUCTION
(ii) Telephone interview Advantages:
Quick. Less costly. Wider respondent coverage.
Disadvantages:
Information obtained might not represent the whole population. Limited interview duration. Not appropriate for long and contemplate question. Demonstration cannot be performing. Telephone is not answered.
CHAPTER 1 INTRODUCTION
(iii) Postal questionnaire
A set of questions to obtain related information of conducted study. Questionnaires are posted to every respondent. Wider respondent coverage. Respondent have enough time to answer questions. Interviewer influences can be avoided. Lower cost. One way interaction. Low response rate. Not suitable for numerous and hard questions. Time consuming. Questionnaire is answered by unqualified respondent.
Advantages:
Disadvantages:
CHAPTER 1 INTRODUCTION
(iv) Observation
Observing and measuring specific characteristics without attempting to modify the subjects being studied. Records human behaviors, objects and situations without contact with respondent.
CHAPTER 1 INTRODUCTION
SECONDARY DATA Published records from governmental, industrial or individual sources. Historical data. Various resources. Experiment is not required. Advantages: Lower cost. Save time and energy. Disadvantages: Obsolete information. Data accuracy is not confirmed.
CHAPTER 1 INTRODUCTION
Data also can be classified by how they
are categorized, counted or measured. This type of classification uses measurement scales with 4 common types of scales: nominal, ordinal, interval and ratio.
CHAPTER 1 INTRODUCTION
Nominal Level of Measurement
A qualitative variable that characterizes (or
describes/names) an element of a population. Arithmetic operations not meaningful for data. Order cannot be assigned to the categories. Example: - Survey responses:- yes, no, undecided, - Gender:- male, female
CHAPTER 1 INTRODUCTION
Ordinal Level of Measurement
A qualitative variable that incorporates and
ordered position, or ranking. Differences between data values either cannot be determined or are meaningless. Example: - Level of satisfaction:- very satisfied, satisfied, somewhat satisfied, etc. - Course grades:- A, B, C, D, or F
CHAPTER 1 INTRODUCTION
Interval Level of Measurement
Involve a quantitative variable.
are meaningful. Differences make sense, but ratios do not (e.g., 30-20=20-10, but 20/10 is not twice as hot!). No natural zero
CHAPTER 1 INTRODUCTION
Example: Temperature scales are interval data with 25oC warmer than 20oC and a 5oC difference has some physical meaning. Note that 0oC is arbitrary, so that it does not make sense to say that 20oC is twice as hot as 10oC. The year 0 is arbitrary and it is not sensible to say that the year 2000 is twice as old as the year 1000.
CHAPTER 1 INTRODUCTION
Ratio Level of Measurement
A scale in which both intervals between values and ratios
of values are meaningful. A real zero point. Example: - Temperature measured in degrees Kelvin is a ratio scale because we know a meaningful zero point (absolute zero). - Physical measurements of height, weight, length are typically ratio variables. It is now meaningful to say that 10 m is twice as long as 5 m. This is because there is a natural zero.
CHAPTER 1 INTRODUCTION
Levels of Measurement Nominal - categories only Ordinal - categories with some order Interval - differences but no natural starting point Ratio differences and natural starting point
CHAPTER 1 INTRODUCTION
Exercise 2 Classify each as nominal-level, ordinal-level, interval-level or ratiolevel. a. Ratings of newscasts in Malaysia. (poor, fair, good, excellent) b. Temperature of automatic popcorn poppers. c. Marital status of respondents to a survey on saving accounts. d. Age of students enrolled in a marital arts course. e. Salaries of cashiers of C-Mart stores. ordinal interval nominal
ratio
ratio