0% found this document useful (0 votes)
11 views

Final Test to 2024

Uploaded by

Khánh Long
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views

Final Test to 2024

Uploaded by

Khánh Long
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 56

HCMC UNIVERSITY OF FINAL EXAM, SEMESTER 1, 2024-2025

TECHNOLOGY AND EDUCATION Subject: Mathematical Statistics for Engineers


Faculty of International Education Course code: MATH132901E
Number of pages: 02 pages.
-------------------------
Duration: 90 minutes.
Permitted to use two handwritten A4 sheets.
Question 1 ( 1 /10 pts)
At a certain supermarket, 40% of the customers buy food, 35% buy goods for the home,
and 25% buy clothes. Of those customers buying food, only 30% use cash for payment. Of those
customers buying goods for the home, 20% use cash for payment, whereas of those buying
clothes, 50% use cash for payment. If a customer use cash for payment, what is the probability
that he/she bought food?

Question 2 ( 1.5 /10 pts)


A student never comes to class on time. Let X (min) be the time that the student arrives
late to class. Suppose the probability density function of X is given by:
𝑘 𝑥 (𝑥 + 3), 𝑖𝑓 𝑥 ∈ [2 ; 30]
𝑓(𝑥) =
0, 𝑒𝑙𝑠𝑒
a. Find the value of k.
b. What is the probability that the student arrives late by 10 to 20 minutes?

Question 3 ( 1/10 pts)


In a certain population, the IQ (as measured by a standard test) is known to be
approximately normally distributed with a mean μ = 105 and standard deviation σ = 12. Random
individuals are selected until 4 individuals with an IQ greater than 124.8 are found. What is the
probability that the total number of individuals selected is 20?
Question 4 ( 1/10 pts)
A class of 40 students, including a monitor and a vice-monitor, is attending a seminar in
room Y. The room has 9 rows, with 5 chairs in each row. The students will sit randomly in the
room. Calculate the probability that the monitor and vice-monitor will sit in the same row.

Question 5 (2.5 /10 pts)


The IELTS overall band score ( X ) of a final-year student at university A is normally
distributed. The following data was observed for a random sample of final-year students at
University A:
X (band score) 5.0 5.5 6.0 6.5 7.0 7.5
Number of students 10 30 50 60 10 5
a/ Calculate a 95% confidence interval for the true average overall band score of a final-year student
at university A.
b/ Calculate a lower confidence bound at the 99% confidence level for the true proportion of such
final-year students who get overall band score greater than 6.5 .

Question 6 (2.5 /10 pts)


A sample of 50 scores on a statistics test was observed, yielding a sample mean score 𝑥̅ =
7.12357 and a sample standard deviation 𝑠 = 0.87195. Suppose that the distribution of score is
normal. The statistics instructor believes that the population mean score differs from 7. Is the data
different from the instructor’s claim at a 5% significance level? Test this hypothesis.

Document ID: BM1/QT-PĐBCL-RĐTV-E Page 1/2


Question 7 (1 /10 pts)
Two types of medication, A and B, for hives are being tested to determine if there is a
difference in the true proportions of adult patient reactions. A random sample of 300 adults given
medication A, there are 30 adults still having hives 30 minutes after taking the medication. A
random sample of 400 adults given medication B, there are 50 adults still having hives 30 minutes
after taking the medication. Test the hypothesis that there is a difference in the true proportions
of adult patient reactions between the two medications A and B at the 1% level of significance.

Question 8 (1 /10 pts)


Let X (in people) be the number of people watching the livestream of shop A on weekend.
Let Y (in million VND) be the revenue of shop A by livestream on that day. Observe a sample
(X,Y):
X 200 250 300 350 400 450 500 550
Y 20 23 28 36 45 43 49 54
Determine the correlation coefficient of X and Y. Find the linear regression equation of X and
Y, and calculate a point estimate of the revenue of shop A when X = 610.

Note that: Some values of 𝑧 and t critical values.

α 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045 0.05
𝑧 2.576 2.326 2.170 2.054 1.960 1.881 1.812 1.751 1.695 1.645
𝑡 , 2.977 2.624 2.438 2.326 2.145 2.052 1.975 1.909 1.853 1.761

Notice: Invigilators should not explain the questions on the exam papers.

Learning outcome mapping Assessed in


[LO 2.1.1]: Compute mean, median, mode, standard deviation, Question 1,2,3,4
variance, and know their function.
[LO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[LO 2.1.1; 2.1.2; 2.4.3; 2.4.4]: Use binomial, negative binomial,
normal, Poisson distributions, exponential distribution and their
relationships.
[LO 2.1.1, 2.1.2]: Calculate the confidence intervals for proportion, Question 5,6,7,8
mean, variance based on a sample collected.
[LO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[LO 2.4.4]: Use linear regression model.

December 11th, 2024

Approved by program chair

Document ID: BM1/QT-PĐBCL-RĐTV-E Page 1/2


HCMC UNIVERSITY OF TECHNOLOGY FINAL EXAM, SEMESTER 2, 2022-2023
AND EDUCATION Subject: Mathematical Statistics for Engineers
FACULTY OF APPLIED SCIENCES Course code: MATH132901E
Number of pages: 02 pages.
DEPARTMENT OF MATHEMATICS
Duration: 90 minutes.
------------------------- Materials are allowed during the exam.
Question I (4.5 points)
1. In 2023, at university A, 50% of students live in dormitory campus, 30% of students rent
apartments or houses off-campus; 20% live in their own houses. For students living in
dormitory campus, the percentage of students passing Maths exam is 70%. For renting
apartments or houses off-campus, the percentage is 80%, whereas for living in their houses
the percentage is 60%. If select randomly a student passed Maths exam, what is the
probability that he lives in dormitory campus?
2. Let X (minute) denote the time to fully charge the 6 cell Li-ion battery of a Dell laptop.
The pdf of X is
𝑘 𝑥 (𝑥 − 20) , 0 ≤ 𝑥 ≤ 240 ;
𝑓(𝑥) =
0 , 𝑥 < 0 𝑜𝑟 𝑥 > 240.
a/ Find the value of 𝑘 .
b/ Select randomly laptops until finding 4 laptops having full charge time less than 3 hours.
What is the probability that we must select 20 laptops?
3. The Toeic test scores of students at HCM University of Technology and Education is
normally distributed with 𝜇 = 570 and 𝜎 = 80.
a/ Calculate the probability that there are at least three test scores above 650 in 50 Toeic
tests selected.
b/ What value 𝑐 is such that the interval (570 − c; 570 + c) includes 98% of all Toeic test
scores?
Question II (5.5 points)
1. A store conducted a survey of the lifespan of laptops priced from 700-1000 USD and got
the following data:
Lifespan (year) 2.5-3 3-3.5 3.5-4 4-4.5 4.5-5
Laptops 20 15 28 30 10
Suppose that the distribution of lifespan is normal.
a/ Calculate a 95% upper confidence bound for the true average lifespan of this laptop.
b/ Is there compelling evidence for concluding that true average lifespan is 3.5 years? Carry
out a test of hypotheses using a significance level of 0.02.
c/ Calculate a 97% confidence interval for the true proportion of laptops having lifespan
greater than 4 years.
2. Shopee, Lazada are the leading e-commerce platforms using online payment and cash
payment. This year, Lazada offers more discount codes for online payment than Shopee.
In a sample of 1200 randomly selected consumers shopping at Lazada, there are 950 of
these people choosing online payment. Observe a sample of 1000 randomly selected
consumers shopping at Shopee, there are 750 of these people choosing online payment.
Does this data strongly suggest that the true proportion of such consumers using online
payment at Lazada is greater than at Shopee? Test the hypotheses at significance level
0.05.
3. The following data decribes the expenditure X (million VND/month) on food per month
by employees at a certain company, and their corresponding monthly incomes Y(million
VND/month).

Number: BM1/QT-PĐBCL-RĐTV
X 6 8 7 8 8 9 11 10
Y 12 13 14 15 16 17 18 19
Do two variables X and Y have a linear regression relationship? Using the equation for
the linear regression, estimate the monthly income of an employee at this company who
spends 7 million VND/month on food.

Notice: Invigilators should not explain the questions on the exam papers.

Expected Learning Outcomes Questions


[ELO 2.1.1]: Compute mean, median, mode, standard deviation, Question I
variance, and know their function.
[ELO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[ELO 2.4.3; 2.4.4]: Use binomial, normal, Poisson distributions,
Hyper geometric distribution and their relationships.
[ELO 2.1.1, 2.1.2]: Draw probability distribution table for discrete
random variables.
[ELO 2.1.1, 2.1.2]: Calculate the confidence intervals for proportion, Question II
mean, variance based on a sample collected.
[ELO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[ELO 2.4.4]: Use linear regression model.

May 21th, 2023

Approved by program chair

Phạm Văn Hiển

Number: BM1/QT-PĐBCL-RĐTV
HCMC UNIVERSITY OF TECHNOLOGY FINAL EXAM, SEMESTER 2, 2021-2022
AND EDUCATION Subject: Mathematical Statistics for Engineers
FACULTY OF APPLIED SCIENCES Course code: MATH132901E
Number of pages: 02 pages.
DEPARTMENT OF MATHEMATICS
Duration: 90 minutes.
------------------------- Materials are allowed during the exam.
Question I (4.5 points)
1. A box contains four 40-W bulbs, five 60-W bulbs, and three 75-W bulbs. If bulbs are
selected one by one in random order, what is the probability that at least two bulbs must
be selected to obtain one that is rated 60 W?
2. A friend who lives in Dalat city makes frequent consulting trips to Hochiminh City; 50%
of the time she travels on airline #1, 30% of the time on airline #2, and the remaining
20% of the time on airline #3. For airline #1, flights are late into Hochiminh city 30% of
the time. For airline #2, the percentage is 25% , whereas for airline #3 the percentage is
40%.
If we learn that on a particular trip she arrived late, what are the probability of having
flown on airline #2?
3. Let X (gram) denote the weight of a product type A. The pdf of X is
𝑘(𝑥 + 3𝑥 − 7) ; 2 ≤ 𝑥 ≤ 7
𝑓(𝑥) =
0 ; 𝑒𝑙𝑠𝑒
a/ Find the value of k
b/ Select randomly products until finding a product greater than 5 gram. What is the
probability that we must select 11 products.
4. Suppose the diameter at height (inch) of trees of a certain type is normally distributed
with 𝜇 = 6.8 and 𝜎 = 1.8 . If eleven trees are independently selected, what is the
probability that at least two has a diameter exceeding 8.8 inch?

Question II (5.5 points)


1. A sample of 50 sprinkler systems used for fire protection in office buildings, when tested,
yields a sample average activation temperature of 120.08°F ; a sample standard
deviation of 2.5°F. Suppose that the distribution of activation temperature is normal.
a/ Calculate a 97% confidence interval for the true average system-activation
temperature of sprinkler systems.
b/ A manufacturer of sprinkler systems claims that the true average system-activation
temperature is 118° F. Does the data differs from the manufacturer’s claim at
significance level 5% ? Test the opinion.
2. This year, VNPAY-QR offers many discount codes than E-wallets. In a sample of 1200
randomly selected consumers who make payments by VNPAY-QR Code, there are 500
of these people get promotional codes. Observe 1000 randomly selected consumers who
make payments by E-wallets (MoMo, ZaloPay), there are 400 of these people get
promotional codes.
a/ Does this data strongly suggest that the true proportion of such consumers who get
promotional codes after making payments by VNPAY-QR Code is greater than
payments by E-wallets? Test the hypotheses at significance level 0.05.
b/ Calculate an lower confidence bound at the 99% confidence level for the true
proportion of such consumers who get promotional codes after making payments by
VNPAY-QR code.
3. A college bookstore must order books two months before each semester starts. They
believe that the number of books that will ultimately be sold for any particular course is
related to the number of students registered for the course when the books are ordered.

Number: BM1/QT-PĐBCL-RĐTV
They would like to develop a linear regression equation to help plan how many books to
order. From past records, the bookstore obtains the number of students registered, X, and
the number of books actually sold for a course, Y. These data are below.
X 36 28 35 40 31 29 37 33
Y 31 22 30 35 29 30 33 32
Do two variables X and Y have a linear regression relationship ? Find the linear
regression equation of X and Y?

Notice: Invigilators should not explain the questions on the exam papers.

Expected Learning Outcomes Questions


[ELO 2.1.1]: Compute mean, median, mode, standard deviation, Question I
variance, and know their function.
[ELO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[ELO 2.4.3; 2.4.4]: Use binomial, normal, Poisson distributions,
Hyper geometric distribution and their relationships.
[ELO 2.1.1, 2.1.2]: Draw probability distribution table for discrete
random variables.
[ELO 2.1.1, 2.1.2]: Calculate the confidence intervals for Question II
proportion, mean, variance based on a sample collected.
[ELO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[ELO 2.4.4]: Use linear regression model.

May 30th, 2022

Approved by program chair

Number: BM1/QT-PĐBCL-RĐTV
HCMC UNIVERSITY OF TECHNOLOGY FINAL EXAM, SEMESTER 2, 2021-2022
AND EDUCATION Subject: Mathematical Statistics for Engineers
FACULTY OF INTERNATIONAL Course code: MATH132901E
Number of pages: 02 pages.
EDUCATION
Duration: 90 minutes.
------------------------- Materials are allowed during the exam.
Question I (4.5 points)
1. A production facility employs 22 workers on the day shift, 14 workers on the swing
shift, and 10 workers on the graveyard shift. Suppose that three workers are randomly
selected. What is the probability that one worker of each shift is selected?
2. The Reviews editor for a certain scientific journal decides whether the review for any
particular book should be short (1–2 pages), medium (3–4 pages). Data on recent
reviews indicates that 60% of them are short, 40% are medium. Reviews are submitted
in either Word or LaTeX. For short reviews, 70% are in Word, whereas 30% of medium
reviews are in Word. Suppose a recent review is randomly selected.
What is the probability that the selected review was submitted in Word format?
3. Let X (gram) denote the weight of a product type A. The pdf of X is
𝑘(−𝑥 + 8𝑥 − 14) ; 3 ≤ 𝑥 ≤ 6
𝑓(𝑥) =
0 ; 𝑒𝑙𝑠𝑒
a/ Find the value of k
b/ Select randomly 12 products. What is the probability that we have 7 products less
than 4 gram.
4. Suppose the diameter at height (inch) of trees of a certain type is normally distributed
with 𝜇 = 8.2 and 𝜎 = 2.1 . What is the probability that the diameter of a randomly
selected tree will be less than 10.7 inch? If five trees are independently selected, what is
the probability that at least one has a diameter less than 10.7 inch ?
Question II (5.5 points)
1. A sample of 60 sprinkler systems used for fire protection in office buildings, when tested,
yields a sample average activation temperature of 132°F ; a sample standard deviation of
2.1°F. Suppose that the distribution of activation times is normal.
a/ Calculate a 99% confidence interval for the true average system-activation
temperature of sprinkler systems.
b/ A manufacturer of sprinkler systems claims that the true average system-activation
temperature is 130°F. Does the data is something other than the manufacturer’s claim at
significance level 5% ? Test the opinion.
2. Among 600 helmets of company A, there are 550 helmets meeting quality standards.
a/ Calculate a 97% CI for the true proportion of helmets meeting quality standards
among helmets.
b/ The manager of company A claims that the population proportion of helmets meeting
quality standards is 87%. Does the data is something other than the manager’s claim at
significance level `1% ? Test the opinion.
3. Let X be the number of weeks a student studied and Y be the grades achieved by the
students of a Probability and Statistics course. Observe a sample (X,Y):
X 14 10 13 11 12 12 15 14
Y 8 6 8 7 7 6 8 8
Do two variables X and Y have a linear regression relationship ? Find the linear
regression equation of X and Y?

Notice: Invigilators should not explain the questions on the exam papers.

Number: BM1/QT-PĐBCL-RĐTV
Expected Learning Outcomes Questions
[ELO 2.1.1]: Compute mean, median, mode, standard deviation, Question I
variance, and know their function.
[ELO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[ELO 2.4.3; 2.4.4]: Use binomial, normal, Poisson distributions,
Hyper geometric distribution and their relationships.
[ELO 2.1.1, 2.1.2]: Draw probability distribution table for discrete
random variables.
[ELO 2.1.1, 2.1.2]: Calculate the confidence intervals for Question II
proportion, mean, variance based on a sample collected.
[ELO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[ELO 2.4.4]: Use linear regression model.

May 30th, 2022

Approved by program chair

Number: BM1/QT-PĐBCL-RĐTV
TRƯỜNG ĐẠI HỌC SƯ PHẠM KỸ THUẬT ĐỀ THI CUỐI KỲ HỌC KỲ I NĂM HỌC 2020-2021
THÀNH PHỐ HỒ CHÍ MINH Môn: Xác suất thống kê ứng dụng
KHOA KHOA HỌC ỨNG DỤNG Mã môn học: MATH132901
Đề thi có 2 trang. Thời gian: 90 phút.
BỘ MÔN TOÁN
Được sử dụng tài liệu.
-----------*--------------

Câu I: ( 4,5 điểm)


1. Có 8 bạn cùng ngồi xung quanh một cái bàn tròn, mỗi bạn cầm một đồng xu như nhau. Tất
cả 8 bạn cùng tung đồng xu của mình, bạn có đồng xu ngửa thì đứng, bạn có đồng xu sấp thì
ngồi. Tính xác suất để không có hai bạn liền kề cùng đứng.
2. Ở một trạm xăng, 45% khách đổ xăng A95, 35% khách đổ xăng A92, và 20% khách đổ
xăng E5. Trong số những khách đổ xăng A95 chỉ có 30% khách đổ đầy bình; với xăng A92
thì chỉ có 60% khách đổ đầy bình và với xăng E5 thì có 50% khách đổ đầy bình. Biết người
khách đến trạm xăng đã đổ đầy bình, tính xác suất người này đổ xăng A92.
3. Thời gian một người tham quan Thảo Cầm Viên là biến ngẫu nhiên có phân phối chuẩn với
thời gian tham quan trung bình là 95 phút, độ lệch chuẩn là 18 phút. Trong 80 người đến
tham quan hãy tính xác suất có 20 người có thời gian tham quan nhỏ hơn 77 phút.

Number: BM1/QT-PĐBCL-RĐTV
4. Gọi X là tuổi thọ (đơn vị: năm) của con người. Một công trình nghiên cứu cho biết
kx 2 (100  x )2 ; 0  x  100
hàm mật độ của X là f ( x)  
0 ; x  0 hay x  100

Tính xác suất cho một người có tuổi thọ lớn hơn tuổi thọ trung bình.
Câu II: (5,5 điểm)
1. Đo nồng độ cholesterol (đơn vị mg/dL ) cho một nhóm người ở vùng A trong năm 2020, ta
được dữ liệu sau
Cholesterol 190-200 200-210 210-220 220-230 230-240 240-250
Số người 10 5 12 18 20 15
Cho rằng nồng độ cholesterol tuân theo phân phối chuẩn.
a/ Tìm khoảng ước lượng đối xứng cho nồng độ cholesterol trung bình của mỗi người dân
vùng A với độ tin cậy 95%
b/ Nồng độ cholesterol trên 240 mg/dL được coi là cao. Hãy ước lượng đối xứng cho tỉ lệ
người có cholesterol cao ở vùng A với độ tin cậy 2%.
c/ Tài liệu cho biết lượng cholesterol trung bình của dân vùng A năm 2019 là 215mg/dL. Có
ý kiến cho rằng nồng độ cholesterol trung bình người dân của vùng A năm 2020 đã tăng lên.
Dựa vào dữ liệu trên hãy kiểm định ý kiến này với mức ý nghĩa 1%.
2. Điều tra 620 sản phẩm của nhà máy I có 20 sản phẩm phải bảo hành. Điều tra ngẫu nhiên 710
sản phẩm của nhà máy II có 40 sản phẩm phải bảo hành. Hãy kiểm định xem tỉ lệ sản phẩm bảo
hành của hai nhà máy có khác nhau không với mức ý nghĩa 1%
3. Gọi X số giờ một sinh viên tự học môn Xác suất thống kê (ngoài thời gian học chính thức
trên lớp do giảng viên giảng dạy). Y là điểm số trung bình sinh viên đạt được cho môn học này.
Quan sát số giờ tự học và điểm trung bình của một số sinh viên ta được bảng sau:
X (giờ) 45 40 30 35 20 25 37 28 42
Y 9 10 8 9 7 7 8 6 8
Dựa vào số liệu này có thể dự đoán được điểm số trung bình của một sinh viên qua số giờ tự
học bằng hàm hồi quy tuyến tính thực nghiệm được hay không? Nếu được, hãy viết hàm hồi
quy tuyến tính thực nghiệm này.

Ghi chú: Cán bộ coi thi không được giải thích đề thi.

Chuẩn đầu ra của học phần (về kiến thức) Nội dung kiểm tra
[CĐR 2.1] Sử dụng được giải tích tổ hợp để tính xác suất Câu I
theo quan điểm đồng khả năng
[CĐR 2.2] Sử dụng được các công thức tính xác suất, đặc Câu I
biệt là xác suất có điều kiện
[CĐR 2.5] Sử dụng được phân phối siêu bội, nhị thức, Câu I
Poisson, chuẩn và mối liên hệ giữa các phân phối này.
[CĐR 2.3] Lập được bảng phân phối xác suất của biến Câu I
ngẫu nhiên rời rạc. Sử dụng được hàm phân phối xác suất
và hàm mật độ xác suất của biến ngẫu nhiên liên tục.
[CĐR 2.4] Tính được kỳ vọng, phương sai, median, mod
của biến ngẫu nhiên và cách sử dụng các số đặc trưng này.
[CĐR 2.6] Tính được giá trị trung bình mẫu, phương sai Câu II
mẫu bằng máy tính bỏ túi
[CĐR 2.8] Sử dụng được các tiêu chuẩn kiểm định giả
thiết để giải quyết các bài toán liên quan và áp dụng được

Number: BM1/QT-PĐBCL-RĐTV
trong thực tế.
[CĐR 2.7] Tìm được (giá trị) của khoảng tin cậy cho tỉ lệ, Câu II
trung bình, và phương sai ứng với số liệu thu được.
[CĐR 2.9] Sử dụng được hàm hồi quy tuyến tính thực Câu II
nghiệm.

Ngày tháng năm

Trưởng bộ môn

Nguyễn Văn Toản

Number: BM1/QT-PĐBCL-RĐTV
HCMC UNIVERSITY OF TECHNOLOGY FINAL EXAM, SEMESTER 2, 2017-2018
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH131401E
Number of pages: 02 pages.
FOUNDATION SCIENCE GROUP
Duration: 90 minutes.
------------------------- Materials are allowed during the exam.
Question I (4.5 points)
1. A car is selected at random from a car park. The probability of the car being blue is
0.25 and the probability of it being an estate is 0.15. The probability of the car being a
blue estate is 0.08. What is the probability of the car being neither blue nor an estate?
2. A box of 10 biscuits chocolate biscuits and 1 lemon biscuit. Jenny takes out a biscuit
at random and eats it. She then takes out another biscuit at random.
a. Draw a tree diagram to show this information.
b. Find the probability that the second biscuit she takes is chocolate.
3. Let X denote the amount of space occupied by an article placed in a 1-packing
container and have range [0,1]. The pdf of X is f(x) = kx8 (1-x). Find k, EX, VX, and
the 70th percentile of X.
4. A multiple-choice test has three possible answers to each question, only one of
which is correct. A student guesses the answer to each of the twelve questions at
random. The random variable X is the number of correct answers.
a. State the distribution of X and explain why this model is suitable.
b. Find the probability that the student gets fewer than three questions correct.

Question II (5.5 point)


1. A survey of 2253 adults of a country revealed that 1262 of the respondents had at
some point used wireless means for online access. Calculate a 97% confidence interval
for the proportion of all adults in the country who at the time of the survey had used
wireless means for online access.
2. For a sample of 10 water specimens selected for treatment by coagulation, the
sample mean arsenic concentration was 24.3 mg/L, and the sample standard deviation
was 4.1. Using t-based methods to analyze the data to calculate and interpret a 95% CI
for true average arsenic concentration in all such water specimens. Assume that the
distribution of arsenic concentration was normal.
3. A hot-tub manufacturer advertises that with its heating equipment, a temperature of
100°F can be achieved in at most 15 min. A random sample of 42 tubs is selected, and
the time necessary to achieve a 100°F temperature is determined for each tub. The
sample average time and sample standard deviation are 16.5 min and 2.2 min,
respectively. Does this data cast doubt on the company’s claim? (using α = 0.05).
4. Urban storm water can be contaminated by many sources, including discarded
batteries. When ruptured, these batteries release metals of environmental significance.
A test for two different brands of size D batteries gave the following summary data on
zinc mass (g):
Assuming that both zinc mass
Brand Duracell Energizer
distributions are at least
Sample size 41 51
approximately normal, carry
Sample Mean 17.9 19.4 out a test at significance level
Sample Standard Deviation 2.5 3.4 .02 using the P-value approach
to decide whether true average zinc mass is different for the two types of batteries.

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


5. Toughness and fibrousness of asparagus are major determinants of quality. This was
the focus of a study reported in “Post-Harvest Glyphosphate Application Reduces
Toughening, Fiber Content, and Lignification of Stored Asparagus Spears” (J. of the
Amer. Soc. of Hort. Science, 1988: 569–572). The article reported the accompanying
data (read from a graph) on x shear force (kg) and y percent fiber dry weight
x 48 57 60 72 81 85 109 121 137 148 149 184 185 187
y 2.1 2.28 2.34 2.53 2.28 2.62 2.5 2.66 2.8 3.01 2.98 3.34 3.49 3.26
Determine the correlation coefficient for the above set of results and the equation of
the least squares regression line for predicting the percent fiber dry weight. Use your
regression line to estimate the value of the percent fiber dry weight when shear force is
132 kg?
Notice: Invigilators should not explain the questions on the exam papers.

Expected Learning Outcomes Questions


[ELO 2.1.1]: Compute mean, median, mode, standard deviation, Question I
variance, and know their function.
[ELO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[ELO 2.4.3; 2.4.4]: Use binomial, normal, Poisson distributions,
Hyper geometric distribution and their relationships.
[ELO 2.1.1, 2.1.2]: Draw probability distribution table for discrete
random variables.
[ELO 2.1.1, 2.1.2]: Calculate the confidence intervals for Question II
proportion, mean, variance based on a sample collected.
[ELO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[ELO 2.4.4]: Use linear regression model.

May 30th, 2017

Head of foundation science group

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


HCMC UNIVERSITY OF TECHNOLOGY FINAL EXAM, SEMESTER 2, 2018-2019
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH131401E
Number of pages: 02 pages.
FOUNDATION SCIENCE GROUP
Duration: 90 minutes.
------------------------- Materials are allowed during the exam.
Question I (4.5 points)
1. A class has 20 male students and 30 female students. Choose randomly 5 students from this
class. Find the probability that there are both genders (male and female) in 5 students.
2. A test gives a positive result with probability 97% when the patient is indeed affected by
H1N1, while it gives a negative result with 98% probability when the patient is not affected
by H1N1. If a patient is selected at random from a population in which 0,5% of individuals
are affected by H1N1 and he is found positive, what is the probability that he is indeed
affected by H1N1?
3. The weight of a duck is a continuous random variable X (kg) . Suppose that X has the

kx( x 2  8) , if x  [1, 4]
following pdf: f ( x)  


0
 , if x  [1, 4]
a. Find the value of k
b. Find the probability that in 10 ducks, there are 4 ducks which are greater than 3 kg in
weight.
4. The lifetime of a light bulb is a continuous random variable X (years) normally distributed
with N (6; 1.25) . Find the probability that in 100 light bulbs there are at most 25 bulbs which
have lifetime greater than 7 years?

Question II (5.5 points)


1. Observe the salary X (million/month) of some people in Company A. We have the
following data.
X 4.0 - 4.5 4.5 – 5.0 5.0 – 5.5 5.5 – 6.0 6.0 – 6.5 6.5-7.0
The number 23 33 55 73 45 28
of people
a/ Calculate a 99% CI for true average salary of a person in Company A ? ( X has normal
distribution)
b/ It was also stated that the salary of a person in Company A was 6 million/month. Does the
data strongly suggest that the true average salary of a person in Company A is something
other than the stated value? Test the opinion using the significance level 0.02
2. In one experiment, 110 questionnaires with no incentive resulted in 75 respones, whereas
98 questionnaires that included a chance to win a lottery yielded 66 responses.
a/ Does this data suggest that the proportion of respones with no incentive resulted is less than
the proportion of response including a chance to win a lottery ? Test the hypotheses at
significance level 0.05
b/ What the confidence level is required if the precision of one-sided confidence for response
with no incentive resulted is 0.10347 ?
3. Let X (million Euro) be the money spending on advertising of a company. Let Y (million
euro) be the revenue of this company. Observe a sample (X,Y):
X 23 26 30 41 43 48 52 57
Y 651 762 856 1063 1190 1298 1421 1440
Do two variables X and Y have a linear regression relationship ? Find the linear regression
equation of X and Y? When X=70 , find Y=?

Notice: Invigilators should not explain the questions on the exam papers.

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


Expected Learning Outcomes Questions
[ELO 2.1.1]: Compute mean, median, mode, standard deviation, Question I
variance, and know their function.
[ELO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[ELO 2.4.3; 2.4.4]: Use binomial, normal, Poisson distributions,
Hyper geometric distribution and their relationships.
[ELO 2.1.1, 2.1.2]: Draw probability distribution table for discrete
random variables.
[ELO 2.1.1, 2.1.2]: Calculate the confidence intervals for Question II
proportion, mean, variance based on a sample collected.
[ELO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[ELO 2.4.4]: Use linear regression model.

June 10th, 2019

Head of foundation science group

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


HCMC UNIVERSITY OF TECHNOLOGY FINAL EXAM, SEMESTER 3, 2018-2019
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH131401E
Number of pages: 02 pages.
FOUNDATION SCIENCE GROUP
Duration: 90 minutes.
------------------------- Materials are allowed during the exam.
Question I (4.5 points)
1. A class has 30 students. In this class, there are 8 students who can understand Japanese, 7
students who can understand Chinese and 3 students who can understand both Japanese and
Chinese. Choose randomly 4 students from this class to play a game. The rules of the game are
written a half in Japanese and a half in Chinese. Find the probability that this group can
understand the rules.
2. A certain company sends 30% of its overnight mail parcels via express mail service E1; 50%
of the overnight parcels via express mail service E2 and the remaining 20% via E3. Of those
sent via E1, only 3% arrive late, whereas 1% of the parcels handled by E2 arrive late and 2% of
the parcels handled by E3 arrive late.
If a record of an overnight mailing is randomly selected from the company’s file, what is the
probability that the parcel arriving late was E2 ?
3. Let X denote the temperature at which a certain chemical reaction takes place. Suppose that X

 2

 k(5  x 2 ) , if x  [1, 2]
has pdf : f ( x)   7


0
 , if x  [1, 2]
a. Find the value of k
b. Suppose this reaction is independently carried out once in each of fourteen different labs
and that the pdf of reaction time in each lab is as given. Find the probability that there are at least
2 labs which the temperature exceeds 1.
4. An article suggests a normal distribution with mean 130.2 oz and standard deviation 1.2 oz
for the actual contents of jars of a certain type. Stop selecting randomly jars until get 4 jars more
than 132 oz. Find the probability that the total number of jars is selected is 16 .
Question II (5.5 points)
1. Electric energy consumption ( X ) of a household in city A has the normal distribution.
Observe some households in city A randomly:
X(kwh/month) 60-110 110-170 170-220 220-270 270-320 320-370
The number of 20 32 62 71 80 115
households
a/ Calculate a 97% CI for the true average electric energy consumption of a household in city A.
b/ It was also stated that the electric energy consumption of a household in city A was 250
kwh/month. Does the data strongly suggest that the true average electric energy consumption of
a household is greater than the stated value? Test the opinion using the significance level 0.02
2. Researchers sent some resumes in response to job ads that appeared in the Boston Globe and
Chicago Tribune. The resumes were identical except that 2400 of them had “white sounding”
first names, such as Brett and Emily, whereas the other 2600 had “black sounding” names such
as Tamika and Rasheed. The resumes of the first type elicited 220 responses and the resumes of
the second type only 160 responses .
a/ Does this data strongly suggest that a resume with a “black” name is less likely to result in a
response than is a resume with a “white” name? Test the hypotheses at significance level 0.05
b/ Calculate a 99% lower bound for the population proportion of responses of the first type.
3. Let X (people) be the number of people attending a trip to Europe. Let Y (million VND) be
the revenue of the travel agency. Observe a sample (X,Y):
X 12 15 17 19 20 25 27 29
Y 720 895 1053 1100 1210 1480 1590 1780

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


Do two variables X and Y have a linear regression relationship ? Find the linear regression
equation of X and Y? Calculate a point estimate of the revenue of the travel agency when
X=30.

Notice: Invigilators should not explain the questions on the exam papers.

Expected Learning Outcomes Questions


[ELO 2.1.1]: Compute mean, median, mode, standard deviation, Question I
variance, and know their function.
[ELO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[ELO 2.4.3; 2.4.4]: Use binomial, normal, Poisson distributions,
Hyper geometric distribution and their relationships.
[ELO 2.1.1, 2.1.2]: Draw probability distribution table for discrete
random variables.
[ELO 2.1.1, 2.1.2]: Calculate the confidence intervals for Question II
proportion, mean, variance based on a sample collected.
[ELO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[ELO 2.4.4]: Use linear regression model.

August 2st, 2019

Head of foundation science group

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


HCMC UNIVERSITY OF TECHNOLOGY FINAL EXAM, SEMESTER 1, 2020-2021
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH131401E
Number of pages: 02 pages.
FOUNDATION SCIENCE GROUP
Duration: 90 minutes.
------------------------- Materials are allowed during the exam.
Question I (4.5 points)
1. The line  1 is parallel to  2 . Let A1 , B1 , C1 , D1 , E1 , F1 be on the line  1 and
A2 , B2 , C 2 , D2 , E 2 , F2 , G 2 be on the line  2 . Select randomly 3 points on these lines.
Find the probability that we get a triangle.
2. At a certain gas station, 40% of the customers use A95, 35% use A92, and 25% use E5.
Of those customers using A95, only 35% fill their tanks. Of those customers using A92,
65% fill their tanks, whereas of those using E5, 55% fill their tanks.
If the next customer fills the tank, what is the probability that A92 is requested?
3. Let X (gram) denote the weight of a product type A. The pdf of X is
k ( x 2  10 x  24) ; 4  x  6
f ( x)  
 0 ; else
a/ Find the value of k
b/ Select randomly the products until finding a product less than 5 gram. What is the
probability that we must select 12 products?
4.It is known that the cholesterol level in males 30 years old follows a normal distribution
with mean 220 mg/dl and standard deviation 30 mg/dl. Find the probability that at most 2
males in 20 males has cholesterol level greater than 240 mg/dl.
Question II(5.5 points)
1.An article reported that for a sample of 70 kitchens with gas cooking appliances
monitored during a one-week period, the sample mean CO2 level (ppm) was 620.12 , and
the sample standard deviation was 162.
a/ Calculate a 95% confidence interval for true average CO2 level in the population of
all homes from which the sample was selected.
b/ It was also stated that the CO2 level in the population of all homes was 600 (ppm).
Does the data strongly suggest that the true average CO2 level in the population of all
homes is something other than the stated value? Test the opinion using the significance
level 0.02
2. Last year, in a sample of 1200 randomly selected consumers who had opportunities to
send in a rebate claim form after purchasing a product A, there are 300 of these people said
they never did. This year, if a consumer sends in a rebate claim form, he will get 5 reward
points. Observe 1000 randomly selected consumers who had opportunities to send in a
rebate claim form after purchasing a product A this year, 200 of these people said they never
did.
a/ Does this data strongly suggest that the true proportion of such consumers who never
apply for a rebate claim after purchasing the product A last year is greater than this
year? Test the hypotheses at significance level 0.05
b/ Calculate an lower confidence bound at the 97% confidence level for the true
proportion of such consumers who never apply for a rebate claim after purchasing the
product A this year.
4. Let X (dollars) be the expenditure per month and Y (dollars) be the income per month
of an employee at a certain company. Observe a sample (X,Y):

X 2400 2600 2300 4900 3100 2500 5100 3200


Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2
Y 41000 45000 39000 59000 44000 43000 60000 48000
Do two variables X and Y have a linear regression relationship ? Find the linear
regression equation of X and Y?

Notice:Invigilators should not explain the questions on the exam papers.

Expected Learning Outcomes Questions


[ELO 2.1.1]: Compute mean, median, mode, standard deviation, Question I
variance, and know their function.
[ELO 2.4.3; 2.4.4]: Become familiar with various graphical
representations of data and learn to recognize misleading graphs.
[ELO 2.4.3; 2.4.4]: Use binomial, normal, Poisson distributions,
Hyper geometric distribution and their relationships.
[ELO 2.1.1, 2.1.2]: Draw probability distribution table for discrete
random variables.
[ELO 2.1.1, 2.1.2]: Calculate the confidence intervals for Question II
proportion, mean, variance based on a sample collected.
[ELO 2.1.3, 2.1.4]: Use test procedures to solve and develop
proficiency in its applications
[ELO 2.4.4]: Use linear regression model.

January8th, 2021

Head of foundation science group

Số hiệu: BM1/QT-PĐBCL-RĐTV Page 1/2


HCMC UNIVERSITY OF TECHNOLOGY SOLUTION For FINAL EXAM, SEMESTER 2, 17-18
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH130401E
-------------------------
Q Id Content pts
ea
1 B: “Car is blue” ; E: “Car is an estate”
P  B or E   P  B  E   P  B   P  E   P  B  E  0,5
0,5
 0.25  0.15  0.08  0.32.
2 0.5

P  second biscuit is chocolate   P  C1C2   P  L1C2 


10 9 1 10 0.5
    1
11 10 11 11

I
3 1

 kx
8
(1  x ) dx  1  k  90
(4,5
0
pts) 1
9 0.5
EX   90 x 9 (1  x )dx  ;
11
0
0.5
1
15
 VX  EX 2   EX   0.0124.
2
EX 2   90 x10 (1  x ) dx 
0
22
y
14 0.5
Let the 70th percentile of the thickness be y. 90 x8 (1  x )dx  0.7  y   0.56
0
25
4. a. X~B(12,1/3) because these following reason:
- 12 answers: either ‘correct’ or ‘incorrect’. 0,5
- Questions are answered independently of each other
- Probability of a correct answer is 1/3.
0,5
b. P(X<3)=P(X=0)+P(X=1)+P(X=2)=0.181 (to 3 s.f)
1 Let p be the proportion of all adults in the country who at the time of the survey had
used wireless means for online access.

II
p  1262
  2.17
p 1  p

 0.0227
 0,5
(5,5
2253 n
pts)
A 97% confidence interval for the proportion of all adults in the country who at the 0,5
time of the survey had used wireless means for online access is  0.5374;0.5828 
2 Let µ be the true average arsenic concentration in all such water specimens.
n  10, s  4.1, x  24.3
0,5
s
  2.262  2.9328
n
Hence, the 95% confidence interval for the true average arsenic concentration in all 0.5

 
such water specimens is x   ; x     21.3672;27.2328  .

3 Let µ be the true average time necessary to achieve a 100°F temperature of the
heating equipment.
H 0 :   15 ; H a :   15 .
0,5
n  42, s  2.2, x  16.5

z
 x  15 42  4.4187 0.5
s
 P _ val  0  0.05    or z  ztb  1.96  0.5
We reject H0. There is the evidence to reject the claim.
Hence, the data cast doubt on the company’s claim.
4 Let µ1 , µ2 be the true average zinc mass of the two brands of size D batteries: 0,25
Duracell and Energizer, respectively.
H 0 : 1  2 ; H a : 1  2 .
0,5
n1  41, s1  2.5, x1 17.9 ; n2  51, s2  3.4, x2  19.4
0.25
z
x  x 
1 2
 3.9567  P_value  0    0.02
2 2
s s
1
 2

n1 n2
We reject H0 at  = .02. There is evidence of a difference in true average zinc mass
for two types of batteries.
5 r = 0.9623895711: Strong relationship 0,25
Slope: B=0.008349892; y-Intercept: A= 1.759866075
The “best-fit” linear model is: y=1.759866075+0.008349892x; 0,25
Interpretation of the slope: Slope: B=0.008349892>0: Direct relationship 0,25
For every additional shear force, the fiber dry weight acquires an estimated
1.759866075 percent per kilogram.
If x = 132, the point estimate for the percent fiber dry weight is y=2.862051838. 0,25
HCMC UNIVERSITY OF TECHNOLOGY SOLUTION For FINAL EXAM, SEMESTER 2, 18-19
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH130401E
-------------------------
Q Content pts
1 Let A be an event that there are both genders (male and female) in 5 students.
There are   C505 ways to choose 5 students randomly.
0,5
There are A  C205  C305 ways to choose 5 male or 5 female students.
5A
C20  C305
The probability : P( A)  1   1  0.92542
 C505 0,5
2 Let A1 be an event of selecting an individual affected by H1N1.
Let A2 be an event of selecting an individual not affected by H1N1.
Let C be an event of positive result.
P ( A1 )  0.005 ; P( A2 )  0.995
P (C | A1 )  0.97 ; P (C | A2 )  0.02
By the law of total probability we find the probability of positive result:
0.5
P (C)  P ( A1 ) P (C | A1 )  P( A2 ) P (C | A2 )
 (0.005)(0.97)  (0.995)(0.02)  0.02475
Using Bayes formula, we find the probability that he is indeed affected by H1N1:
I P( A1.C) P( A1 ) P(C | A1 ) (0.005)(0.97) 0.5
P ( A1 | C)     0.1959
P(C) P(C) 0.02475
(4,5
3  4
4
pts) a/ 

f ( x)  1   k . x .( x 2  8)  1  k 
1
495 0.5
4
4
b/ P ( X  3)   x( x 2  8) dx  0.5798
3
495 0.5
Let Y be the number of ducks greater than 3 kg in 10 ducks. Then Y has the
binomial distribution with n=10; p= 0.5798
P (Y  4)  C104 (1  0.5798)6 (0.5798) 4  0.13063 0.5
4. X  N (6; 1.25) ;   6 ;   1.25
The probability that a light bulb has lifetime greater than 7 years
 X  76  X 
P ( X  7)  P    with Z   N (0,1)
  1.25  
 P( Z  0.89)  1   (0.89)  0.1867 0,5
Let Y be the number of light bulbs greater than 7 years in 100 light bulbs. Then Y
has the binomial distribution with n=100; p=0.1867
   n. p  18.67
Approximate to normal distribution  2 0,5
  n. p.(1  p)  15.1843
 X   25.5  18.67  X 
P ( X  25)  P ( X  25.5)  P    with Z   N (0,1)
  15.1843  
 P ( Z  1.75)   (1.75)  0.9599
1a n  257; x  5.5768 ; s  0.7127 0,5
Let µ be the true average salary of a person in Company A.
s 0,5
  2.58  0.1147
n
Hence, a 99% CI for true average salary of a person in Company A is
 
  x   ; x     5.4621;5.6915  .
0,5

1b Let µ be the true true average salary of a person in Company A


H0 :   6 ; H a :   6 .

z
 x  6 n  9.519
s 0.5

  0.02   ( z / 2 )  1   0.99  z / 2  2.33
2
Because z  z /2 : We reject H0 ; accept H a .
There is the evidence to reject the claim.
Hence, true average salary of a person in Company A is not 6 million/ month . 0.5
II 2a Let p1 be the proportion of respones with no incentive resulted.
(5,5 p2 the proportion of response including a chance to win a lottery .
pts)
H 0 : p1  p2 ; H a : p1  p2
75 66 75  66 141
n1  110 ; n2  98 ; f1  ; f2  ; f  
110 98 110  98 208

z
 f1  f 2   0.1286
1 1  0.5
f (1  f )   
 n1 n2 
  0.05 ;  ( z )  1    0.95  z  1.65
Because z   z : We accept H0 .
0.5
The claim is wrong. The proportion of respones with no incentive resulted is equal to the
proportion of response including a chance to win a lottery .
2b Let p be the proportion of respones with no incentive resulted.
p  75
110 0.5

  z

p 1  p
  0.10347  z  2.33

n
 ( z )     (2.33)    0.99   0.5

3 r = 0.99238: two variables X and Y have a linear regression relationship 0.5


Slope: B=24.0421; y-Intercept: A= 123.44
The linear regression equation is: y=123.44+24.0421x;
If x = 70, then y=1806.382 0.5
HCMC UNIVERSITY OF TECHNOLOGY SOLUTION For FINAL EXAM, SEMESTER 3, 18-19
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH130401E
-------------------------
Q Content Pts
1 Let A be an event that this group can understand the rules.
There are   C304 ways to choose 4 students randomly. 0,25

There are A  C234  C224  C184 0,5


A 0,25
The probability : P( A)  1   0.47837

2 Let A1 be an event of the overnight mail parcels via express mail service E1.
Let A2 be an event of the overnight parcels are sent via express mail service E2.
Let A3 be an event of the overnight parcels are sent via express mail service E3.
Let C be an event of the parcel arriving late.
P ( A1 )  0.3 ; P ( A2 )  0.5 ; P ( A3 )  0.2
P (C | A1 )  0.03 ; P (C | A2 )  0.01 ; P (C | A3 )  0.02
0.5
By the law of total probability we find the probability of the parcel arriving late:
P (C)  P ( A1 ) P (C | A1 )  P( A2 ) P (C | A2 )  P ( A2 ) P(C | A2 )  0.018
Using Bayes formula, we find the probability that the parcel arriving late was E2:
P( A2 .C) P ( A2 ) P(C | A2 ) (0.5)(0.01) 5 0.5
I P ( A2 | C)    
P(C) P(C) 0.018 18
(4,5 3  2
2 7
a/   7 (5  x
2
pts) f ( x)  1  ) 1 k  0.5
24

2
1
0.25
2 7 2
b/ P( X  1)   . (5  x 2 ) 
1
7 24 9
Let Y be the number of labs which the temperature exceeds 1. 0.5
Then Y has the binomial distribution with n=14; p= 2/9.
P (Y  2)  1  P (Y  1)  P(Y  0)  1  C114 (2 / 9)(7 / 9)13  (7 / 9)14  0.851766
0.25
4. Let X be actual contents of jars of a certain type.
X  N (130.2; 1.2 2 ) ;   130.2 ;   1.2
The probability thata jar more than 132 oz.
 X   132  130.2  X 
P ( X  132)  P    with Z   N (0,1)
  1.2   0,5
 P( Z  1.5)  1   (1.5)  1  0.9332  0.0668
Let Y be the jars total number of jars is selected. Then Y has the negative binomial
distribution with p=0.0668 ; r=4
0,5
P(Y  16)  C153 (0.0668)3 (1  0.0668)12 (0.0668)  0.0039519
II 1a n  380; x  260.3684 ; s  77.5626 0,5
(5,5
pts) Let µ be the true average electric energy consumption of a household in city A.
  97%  z /2  2.17 0,5
s
  2.17  8.6341
n 0,5
Hence, a 97% CI for the true average electric energy consumption of a household in city A
 
is   x   ; x     251.7343; 269.0025  .
1b Let µ be the true average electric energy consumption of a household in city A.
H 0 :   250 ; H a :   250 .

z
 x  250 n  2.6058
s 0.5
  0.02   ( z )  1    0.98  z  2.06
Because z  z : We reject H0 ; accept H a .
The true average electric energy consumption of a household is greater than the stated
value . The claim is right.
0.5
2a Let p1 be the proportion of respones of resumes with “white” name.
p2 the proportion of response of resumes with “black” name.
H 0 : p1  p2 ; H a : p1  p2
220 160 220  160 19
n1  2400 ; n2  2600 ; f1  ; f2  ; f  
2400 2600 2400  2600 250

z
 f1  f 2   4.0164
1 1 0.5
f (1  f )   
 n1 n2 
  0.05 ;  ( z )  1    0.95  z  1.65
Because z  z : We reject H0 ; accept H a .
0.5
The claim is right. The resume with a “black” name is not less likely to result in a
response than is a resume with a “white” name.
2b Let p be the proportion of responses of the first type. (“white” name)
p  220 ; z  z  2.33
 
2400 0.5

  z
 
p 1  p
 0.01372
n
The 99% lower bound for the population proportion of responses of the first type: 0.5
p  p    p  0.105386
3 r = 0.996885: two variables X and Y have a linear regression relationship 0.5
Slope: B=60.0873; y-Intercept: A= -3.28968
The linear regression equation is: y=-3.28968+60.0873;
If x = 30, then y=1799.32932 0.5
HCMC UNIVERSITY OF TECHNOLOGY SOLUTION For FINAL EXAM, SEMESTER 1, 2020-2021
AND EDUCATION Subject: Mathematical Statistics for Engineers
HIGH QUALITY TRAINING FACULTY Course code: MATH130401E
-------------------------
Q Content Pts
1 Let A be an event that we get a triangle.
There are   C133 ways to choose 3 points randomly. 0,25
There are A  C62 .C71  C61 .C72 =231 0,5
A 21
The probability : P( A)    0.8076923077 0,25
 26
2 Let A1 , A2 , A3 be the event of the customers use A95; A92; E5, respectively
Let C be the event thatthe customers fill their tanks
P( A1 )  0.4 ; P( A2 )  0.35 ; P( A3 )  0.25
P(C | A1 )  0.35 ; P(C | A2 )  0.65 ; P(C | A3 )  0.55
By the law of total probability we find the probability that the customers fill their
tanks:
P(C)  P ( A1 ) P(C | A1 )  P( A2 ) P(C | A2 )  P( A2 ) P(C | A2 )  0.505 0.5
Using Bayes formula, we find the probability that these customers use A92:
P( A2 .C) P( A2 ) P(C | A2 ) (0.35)(0.65)
P( A2 | C)     0.4504950495
P(C) P(C) 0.505 0.5
I
3  6
3
(4,5
a/ 

f ( x)  1   k ( x 2  10 x  24)dx  1  k 
4
4
0.5
pts) 5
3
b/ P( X  5)   ( x2  10 x  24) dx  0.5
4
4
Let Y be the number of the products that we must select until finding a product less 0.5
than 5 gram.

Then Y has the negative binomial distribution with p= 0.5


0.5
1
P (Y  12)  (1  0.5)11 0.5 
4096
4. Let X be the cholesterol level in males 30 years old.
X  N (220; 302 ) ;   220 ;   30
The probability that males have cholesterol level greater than 240 mg/dl
 X   240  220  X 
P ( X  240)  P    with Z   N (0,1)
  30   0,5
 1   (0.67)  1  0.7486  0.2514
Let Y be the number of males having cholesterol level greater than 240 mg/dl. Then Y
has the binomial distribution with p=0.2514 ; n=12
0,5
2
P(Y  2)   C202 (0.2514)i (1  0.2514) 20i  0.0089
i 0
1a n  70; x  620.12 ; s  162
Let µ be the true average CO2 level in the population of all homes
  95%  z /2  1.96
0,5
s
  1.96  37.9508988
n
Hence, a 95% CI for the true average CO2 level in the population of all homesis 0,5

 
  x   ; x     582.1691012;658.0708988  .
1b Let µ be the true average CO2 level in the population of all homes
H 0 :   600 ; H a :   600 .
0.25
z
 x  600  n  1.039111095
s 0.5

  0.02   ( z /2 )  1   0.99  z /2  2.33
2
Because z  z / 2 : We accept H0.
the true average CO2 level in the population of all homes is equal to the stated value. The claim
is wrong. 0.5
II 2a Let p1 , p2 be the true proportion of such consumers who never apply for a rebate claim
(5,5 after purchasing the product A last year and this year, respectively.
pts)
H 0 : p1  p2 ; H a : p1  p2
300 200 300  200 5
n1  1200 ; n2  1000 ; f1  ; f2  ; f  
1200 1000 1200  1000 22 0.25

z
 f1  f2   2.786522184
1 1 0.5
f (1  f )   
 n1 n2 
  0.05 ;  ( z )  1    0.95  z  1.65
Because z  z : We reject H0 ; accept H a .
0.5
The claim is right. The true proportion of such consumers who never apply for a rebate
claim after purchasing the product A last year is greater than this year.
2b Let p be the true proportion of such consumers who never apply for a rebate claim after
purchasing the product A this year
p  200 ; ( z )  0.97  z  1.89 0.5
 
1000

0.5
  z

p 1  p
  0.02390681911
n
The 97% lower bound for the population proportion of consumers who never apply for a
rebate claim after purchasing the product A this year:

p  p    p  0.1760931809
3 r = 0.98282017: two variables X and Y have a linear regression relationship 0.5
Slope: B=6.972803645; y-Intercept: A= 24626.22811
The linear regression equation is: y=24626.22811+6.972803645x;
0.5
SOLUTION TO FINAL EXAMINATION
Course name: MATHEMATICAL STATISTICS FOR ENGINEERS
SEMESTER 2 – ACADEMIC YEAR 2019-2020
---------------------------------------------------------------------------------


Q1: Let Ai = hard drives from plant i , i = 1,2, 3 , 

B = a randomly selected hard drive is defective  .25 points

3
157
( )  P (A ) P (B | A ) = .54  .04 + .35  .08 + .11  .12 = 2500
a. P B =
i =1
i i
= .0628 .25 points

P ( A ) P ( B | A ) .35  .08 70
b. P (A | B ) =
2 2
= =  .4459 .5 points
P (B )
2
.0628 157

Q2:

X = number of defective tires you find before you finding 4 good tires
a.
→ X ~ NegBin 4,.95 ( )
2

(
P X 2 = ) C x =0
4 −1
x +4 −1
 .954  .05x  .99777 .5 point

r (1 − p ) 4 r 1−p( )= 80
b. E X ( ) = = ,
p 19
( )
V X =
p 2
361
.5 points

Q3: Let X = number of integrated circuits ICs is faulty → X ~ H 25, 5, 4 ( ) ( ) .5 points

The probability this shipmet of 25 ICs will be accepted:

1 C 5xC 204−x 2109


(
P X 1 = )  x =0 C 4
=
2530
 .8336 .5 points
25

X − 118
Q4: Let X = free speeds canbest → X ~ N 118,13.12 , Z = ( ) 13.1
( )
→ Z ~ N 0,1 .25 points

 180 
(
a. P X  100 = 1 − −)  = .915286 .25 points
 131 
b. Y = number of vehicles is not exceeding the posted speed limit

( )
→ Y ~ Bin 5, p , with p = P(X  100) = .084714 .5 points

( ) C ( )
5 −x
P Y 1 = x
5
px 1 − p  .3576 .5 points
x =1
Q5:

a. n = 16, x = 301.1875, sx = 91.3668202 .25 points


= .95 → = .05 → t /2,n −1
= 2.131 .25 points
A 95% confidence interval for the population mean is:
s
x t= 252.5118265, 349.8631735
/2,n −1 ( ) .5 points
n
b. Let = theaverage amount of water is filled in 1liter bottles .25 points

Null hypothesis H 0 : =1 Alternative hypothesis Ha : 1 .25 points


= .01 → z = 2.3265

20
(
z = .996 − 1 ) .008  −2.2361 Reject H a .25 points
That the average fill is 1 liter. .25 points
Q6:

a. A
= average worker productivity after wearing noise reduction device

B
= average worker productivity before wearing noise reduction device .25 points

Let d
= A
− B

Null hypothesis H 0 : d
=0 Alternative hypothesis Ha : d
 0 .25 points
we have: nd = 15, d = 1.066666667, sd = 3.731462116
= .05 → t ,n −1
= 1.761

( )s
nd
t = d −0  1.1071  t ,n −1
→ Reject H a .5 points
d
That wearing the noise reduction head gear increases worker productivity is
wrong. .25 points
b. pm = proportions of men said " yes ", pw = proportions of women said " yes " .25 points

291 217 508


We have: fm = , fw = ,f =
349 336 685
Null hypothesis H 0 : pm = pw Alternative hypothesis Ha : pm  pw .25 points
= .02 → z /2
 2.3265
fm − fw
z =
 1
 5.6184  −z ( /2
,z /2 )→ Reject H 0 .5 points
1 
(
f 1− f  + )
 349 336 
That different proportions of men and women in this student population would
be willing to marry beneath their class. .25 points

Q7: The correlation coefficient: r = 0.8764526758 .25 points


The equation of least-squares regression line for predicting metabolic rate
from body mass: .25 points

y = 201.1615996 + 24.02606662x .25 points

When x = 45 → y = 1282.334598 .25 points


HCMC UNIVERSITY OF TECHNOLOGY SOLUTION For FINAL EXAM, SEMESTER 1, 2024-2025
AND EDUCATION Subject: Mathematical Statistics for Engineers
FACULTY OF APPLIED SCIENCES Course code: MATH132901E
-------------------------
Q Content Pts
1 Let A1 , A2 , A3 be the event of the customers buy food, goods for the home, clothes,
respectively.
Let C be the customers use cash for payment. 0.25
𝑃(𝐴 ) = 0.4 ; 𝑃(𝐴 ) = 0.35 ; 𝑃(𝐴 ) = 0.25
𝑃(𝐶|𝐴 ) = 0.3 ; 𝑃(𝐶|𝐴 ) = 0.2 ; 𝑃(𝐶|𝐴 ) = 0.5
By the law of total probability we find the probability that the customers use cash for
payment
𝑃(𝐶) = 𝑃(𝐴 )𝑃(𝐶|𝐴 ) + 𝑃(𝐴 )𝑃(𝐶|𝐴 ) + 𝑃(𝐴 )𝑃(𝐶|𝐴 ) = 0.315 0.5
Using Bayes formula, we find the probability that the customers buy food:
( . ) ( ) 𝐶𝐴
𝑃(𝐴 |𝐶) = ( ) = ( )
= = 0.380952381

2
a/ 

f ( x )  1 ⇔ ∫ 𝑘𝑥(x + 3)𝑑𝑥 = 1 0.5
⇔𝑘= 3/31024 0.25
b/ 𝑃(10 < 𝑋 < 20) = 𝑥(𝑥 + 3) 𝑑𝑥
0.5
4175
= = 0.2691464673 0.25
15512

3 Let X be IQ in a particular population.


𝑋 ∼ 𝑁(105; 12 ) ; 𝜇 = 105 ; 𝜎 = 12
The probability that the individual has IQ greater than 125
.
𝑃(𝑋 > 124.8) = 𝑃 > 𝑤𝑖𝑡ℎ 𝑍 = ∼ 𝑁(0,1)
= 𝑃(𝑍 > 1.65) = 1 − 𝜙(1.65) = 1 − 0.9505 = 0.0495 0,5
Let Y be the number of individuals selected until get 4 individuals having an IQ greater than
124.8
Then Y has the negative binomial distribution with p=0.0495 ; r=4
𝑃(𝑌 = 20) = 𝐶 (𝑝) (1 − 𝑝) = 2.582132 ∗ 10 0,5

4. Let A be the event that the monitor and vice-monitor sitting in the same row.
40 students sit randomly in the hall: 𝑛 = 𝐴 0,25
the monitor and vice-monitor sitting in the same row : 𝑚 = 45 ∗ 4 ∗ 𝐴 0,5
The probability: 𝑃(𝐴) = =
0,25

5a 𝑛 = 165; 𝑥̅ = 6.1363636364 ; 𝑠 = 0.5559095940 0,5


Let µ be the true average overall band score of a final-year student at university A.
𝛼 = 95% ⇒ 𝑧 ⁄ = 1.96
𝜀 = 1.96 = 0.084823906 0,5

Hence, a 95% CI for the true average overall band score is 0,5
𝜇 ∈ (𝑥̅ − 𝜀; 𝑥̅ + 𝜀) = (6.0515397301; 6.2211875426).
5b Let p be the true proportion of such final-year students who get overall band score greater
than 6.5
15
𝑝̂ = ; 𝑧 = 2.33 0.5
165
( )
𝜀=𝑧 = 0.052146032
The lower confidence bound for the true proportion of such final-year students who get 0.5
overall band score greater than 6.5 is 𝑝 > 𝑝̂ − 𝜀 ⇒ 𝑝 > 0.038763059

6 Let µ be the population mean score


H 0 : 𝜇 = 7; 0.25
H a : 𝜇 ≠ 7.
( ̅ )
𝑧= √𝑛 = 1.002089397 ; n = 50 0.25
𝛼
𝛼 = 0.05 ⇒ 𝜙(𝑧 ⁄ ) = 1 − = 0.95 ⇒ 𝑧 ⁄ = 1.96 0.25
2
Because |𝑧| < 𝑧 ⁄ : Accept Ho : 𝜇 = 7
The population mean score is 7 . The opinion is wrong. 0.25

7 Let p1 , p2 be the true proportion of adults having hives 30 minutes after taking the
medication A , B, respectively.
H0 : 𝑝 = 𝑝 ;
0.25
Ha : 𝑝 ≠ 𝑝 .
30 50 30 + 50 4
𝑛 =300 ; 𝑛 = 400 ; 𝑓 = ; 𝑓 = ; 𝑓̅ = =
300 400 300 + 400 35
𝑧=
( )
= −1.028818615 0.25
̅( ̅)

𝛼 = 0.01 ⇒ 𝑧 / = 2.58 0.25


Because
|𝑧| < 𝑧 / : We accept H0.
There is no difference in the true proportions of adult patient reactions between two types of 0.25
medication A and B.

8 r = 0.9818840228: two variables X and Y have a linear regression relationship 0.5


A= -0.428571428 ; B=0.1004761905
The linear regression equation is: y= -0.428571428 + 0.1004761905 x;
When X=610 then Y= 60.86190478 0.5

You might also like