Exp 5-6-7-8

The document contains code snippets for performing regression, Z-test, T-test, and ANOVA using Python and R. Each section includes the necessary imports, data generation, hypothesis testing, and outputs for statistical analysis. The results include the estimated regression coefficients, rejection of the null hypothesis in the Z-test, the T-test statistics, and the ANOVA setup with conclusions drawn from p-values.


Experiment No: 5

REGRESSION

Program:
import numpy as np
import matplotlib.pyplot as plt

def estimate_coef(x, y):
    # number of observations/points
    n = np.size(x)
    # mean of x and y vector
    m_x = np.mean(x)
    m_y = np.mean(y)
    # calculating cross-deviation and deviation about x
    SS_xy = np.sum(y*x) - n*m_y*m_x
    SS_xx = np.sum(x*x) - n*m_x*m_x
    # calculating regression coefficients
    b_1 = SS_xy / SS_xx
    b_0 = m_y - b_1*m_x
    return (b_0, b_1)

def plot_regression_line(x, y, b):
    # plotting the actual points as scatter plot
    plt.scatter(x, y, color = "m", marker = "o", s = 30)
    # predicted response vector
    y_pred = b[0] + b[1]*x
    # plotting the regression line
    plt.plot(x, y_pred, color = "g")
    # putting labels
    plt.xlabel('x')
    plt.ylabel('y')
    # function to show plot
    plt.show()

def main():
    # observations / data
    x = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
    y = np.array([1, 3, 2, 5, 7, 8, 8, 9, 10, 12])
    # estimating coefficients
    b = estimate_coef(x, y)
    print("Estimated coefficients:\nb_0 = {}\nb_1 = {}".format(b[0], b[1]))
    # plotting regression line
    plot_regression_line(x, y, b)

if __name__ == "__main__":
    main()

Output:

Estimated coefficients:
b_0 = 1.2363636363636363
b_1 = 1.1696969696969697
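
As a quick cross-check (not part of the original program), the same least-squares line can be fitted with NumPy's polyfit; for the x and y arrays used above, the intercept and slope it returns should agree with estimate_coef:

import numpy as np

x = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
y = np.array([1, 3, 2, 5, 7, 8, 8, 9, 10, 12])

# np.polyfit with degree 1 returns [slope, intercept] of the least-squares line
b_1, b_0 = np.polyfit(x, y, 1)
print("b_0 = {}\nb_1 = {}".format(b_0, b_1))
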
Experiment No: 6

Z-TEST

Program:
# imports
import math
import numpy as np
from numpy.random import randn
from statsmodels.stats.weightstats import ztest

# Generate a random array of 50 numbers centred at mean 110 with
# spread 15/sqrt(50), similar to the IQ scores data assumed in the problem
mean_iq = 110
sd_iq = 15/math.sqrt(50)
alpha = 0.05
null_mean = 100
data = sd_iq*randn(50) + mean_iq
# print mean and sd
print('mean=%.2f stdv=%.2f' % (np.mean(data), np.std(data)))

# now we perform the test: we pass the data, and in the value parameter we pass
# the mean under the null hypothesis; the alternative hypothesis checks whether
# the mean is larger
ztest_Score, p_value = ztest(data, value=null_mean, alternative='larger')
# the function outputs a z-score and the corresponding p-value; we compare the
# p-value with alpha: if it is greater than alpha we do not reject the null
# hypothesis, else we reject it
if (p_value < alpha):
    print("Reject Null Hypothesis")
else:
    print("Fail to Reject Null Hypothesis")

Output:
Reject Null Hypothesis
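
What ztest computes here can also be written out directly. Below is a minimal sketch of the one-sample z statistic and one-sided p-value, regenerating data the same way as above (the seed and variable names are illustrative, not from the original program):

import math
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)        # illustrative seed
null_mean = 100
data = (15 / math.sqrt(50)) * rng.standard_normal(50) + 110

# z = (sample mean - hypothesised mean) / (sample std / sqrt(n))
n = len(data)
z = (np.mean(data) - null_mean) / (np.std(data, ddof=1) / np.sqrt(n))
p_one_sided = 1 - norm.cdf(z)         # alternative: mean larger than null_mean
print("z = %.3f  p = %.6f" % (z, p_one_sided))
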
Experiment No: 7

T-TEST

Program:
# Importing the required libraries and packages
import numpy as np
from scipy import stats

# Defining two random distributions
# Sample Size
N = 10
# Gaussian distributed data with mean = 2 and var = 1
x = np.random.randn(N) + 2
# Gaussian distributed data with mean = 0 and var = 1
y = np.random.randn(N)

# Calculating the Standard Deviation
# Calculating the variance to get the standard deviation
var_x = x.var(ddof = 1)
var_y = y.var(ddof = 1)
# Standard Deviation
SD = np.sqrt((var_x + var_y) / 2)
print("Standard Deviation =", SD)

# Calculating the T-Statistic
tval = (x.mean() - y.mean()) / (SD * np.sqrt(2 / N))
# Comparing with the critical T-Value
# Degrees of freedom
dof = 2 * N - 2
# p-value after comparison with the T-Statistic
pval = 1 - stats.t.cdf(tval, df = dof)
print("t = " + str(tval))
print("p = " + str(2 * pval))

## Cross Checking using the internal function from SciPy Package
tval2, pval2 = stats.ttest_ind(x, y)
print("t = " + str(tval2))
print("p = " + str(pval2))

Output:
Standard Deviation = 0.7642398582227466
t = 4.87688162540348
p = 0.0001212767169695983
t = 4.876881625403479
p = 0.00012127671696957205
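
The comments in the program mention comparing with the critical t-value, but only the p-value is reported. A minimal sketch of that comparison (assuming alpha = 0.05, a two-sided test, and dof = 18 as computed above):

from scipy import stats

alpha = 0.05
dof = 18                                      # 2*N - 2 with N = 10
t_crit = stats.t.ppf(1 - alpha / 2, df=dof)   # two-sided critical value, about 2.101
print("critical t =", t_crit)
# reject H0 if |t| > t_crit, equivalently if the two-sided p-value < alpha
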


Experiment No: 8

ANOVA

Program:
# Installing the package
install.packages("dplyr")
# Loading the package
library(dplyr)

# Variance in mean within group and between group
boxplot(mtcars$disp~factor(mtcars$gear),
        xlab = "gear", ylab = "disp")

# Step 1: Set up Null Hypothesis and Alternate Hypothesis
# H0: mu = mu01 = mu02 (there is no difference between average
#     displacement for different gears)
# H1: Not all means are equal

# Step 2: Calculate test statistic using the aov function
mtcars_aov <- aov(mtcars$disp~factor(mtcars$gear))
summary(mtcars_aov)

# Step 3: Find the F-critical value for significance level alpha = 0.05
# Step 4: Compare the test statistic with the F-critical value and conclude:
#         if p < alpha, reject the Null Hypothesis

Output:



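For comparison with the R program above, the same one-way ANOVA workflow can be sketched in Python with scipy.stats.f_oneway. The three groups below are illustrative values, not the mtcars data, so the numbers will differ from the R output:

import numpy as np
from scipy import stats

# Illustrative groups (hypothetical displacement values for three gear levels)
g1 = np.array([160.0, 168.0, 258.0, 360.0, 225.0])
g2 = np.array([140.8, 120.3, 108.0, 147.0, 121.0])
g3 = np.array([ 95.1,  75.7, 120.1,  79.0,  71.1])

# Step 2: test statistic and p-value
F, p = stats.f_oneway(g1, g2, g3)

# Step 3: F-critical value for alpha = 0.05
alpha = 0.05
df_between = 3 - 1                                  # k - 1 groups
df_within = len(g1) + len(g2) + len(g3) - 3         # N - k observations
F_crit = stats.f.ppf(1 - alpha, df_between, df_within)

# Step 4: compare and conclude
print("F = %.3f  p = %.4f  F-critical = %.3f" % (F, p, F_crit))
print("Reject Null Hypothesis" if p < alpha else "Fail to Reject Null Hypothesis")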