0% found this document useful (0 votes)

19 views67 pages

BBABB602 Study Material and Syllabus

The document outlines the course objectives and outcomes for Advanced Data Analytics, focusing on the role of data analytics in business decision-making and the principles of information analysis. It covers various statistical methods such as linear regression, logistic regression, factor analysis, and cluster analysis, along with their applications and assumptions. Additionally, it emphasizes the importance of ethical, social, and security considerations in data analytics systems.

Uploaded by

vsetthupati

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views67 pages

BBABB602 Study Material and Syllabus

Uploaded by

vsetthupati

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 67

Advanced Data Analytics

STUDY MATERIAL
COURSE OBJECTIVES:

1. To describe the role of data analytics and decision support systems in business and record the current
issues with those of the firm to solve business problems.
2. To introduce the fundamental principles of computer-based information analysis and design and
develop an understanding of the principles and techniques used.
3. To enable students to understand the various knowledge representation methods and different expert
system structures as strategic weapons to counter the threats to business and make business more
competitive.
4. To enable the students to use of data analysis to assess the impact of Technology on electronic
commerce and electronic business and understand the specific threats and vulnerabilities of
computer systems.
.

COURSE OUTCOMES:

CO1: The students will be able to relate the basic concepts and technologies used in the field of data
analytics.
CO2: The students will be able to compare the processes of developing and implementing data analytics
algorithms.
CO3: The students will be able to examine the role of the ethical, social, and security issues of data
analytics systems.
CO4: The students will be able to investigate and translate the role of data analytics in organizations,
and the strategic management processes, with the implications for the management.

Course Content:

2|Page
Module Number Description of Topic Page No.
1 Simple Linear Regression: 9-35
Introduction – Overview –
Importance -Least Square Method–
Normal Equations - Calculation of
Regression Coefficients –
Properties of Regression Line –
Uses of Regression;
• Multiple Linear Regression:
Overview – Importance -
Least Square Method –-
Normal Equations –
Calculation of Regression
Coefficients - Properties of–
Testing Relevance of an
Additional Explanatory
Variable
2 Basic concept of Logistic 35-50
Regression – Assessing the
Model –
• log-likelihood statistic –
deviance statistic – R and R2
– Wald Statistic – odds ratio –
Sources of Bias and Common
Problems - Interpreting
Binary Logistic Regression
3 • Basic concept of Factor 50-71
Analysis, Factor Analysis
Model, Statistics Associated
with Factor Analysis, Factor
Analysis Process – Formulate
the Problem – Construct the
Correlation Matrix-
Determine the method of
Factor Analysis –Determine
the number of Factors –
Factor Extraction eigenvalues
and scree plot- Factor
Rotation – Interpret Factors –
Calculate Factor Scores -
Determine Model Fit.
4 • Basic concept of Cluster 71-82
Analysis, Statistics
Associated with Cluster
Analysis, Cluster Analysis
Process - Formulate the
Problem – Select a distance
measure – Select a clustering
procedure – Decide on the
number of Clusters – Interpret
and Profile Cluster – Asses
the reliability and validity .

3|Page
Module Topic Sub-topics Mapping with Industry and Lecture Correspond
number International Standard Hours ing
Assignment

Linear Simple Linear International Academia: 12 Simple

Regressio Regression: https://ocw.mit.edu/courses/18- Linear
n Introduction – s096-topics-in-mathematics-with- Regression:
1 Analysis: Overview – applications-in-finance-fall- Introductio
Importance -Least 2013/resources/lecture-6- n–
Square Method– regression-analysis/ Overview –
Normal Equations Industry Mapping: Importance
- Calculation of Creating a Predictive model -Least
Regression Square
Coefficients – Method–
Properties of Normal
Regression Line – Equations -
Uses of Calculation
Regression; of
• Multiple Regression
Linear Coefficients
Regression: – Properties
Overview – of
Regression
Importance -
Line – Uses
Least Square of
Method –- Regression;
Normal Multiple
Equations – Linear
Calculation Regressi
of Regression on:
Coefficients - Overvie
Properties w–
of– Testing Importan
Relevance of ce - Least
Square
an Additional
Method –
Explanatory
-Normal
Variable
Equatio
ns

4|Page
Binary Basic concept of International Academia: 12 Basic
2 Logistic Logistic https://ocw.mit.edu/courses/15-071- concept
Regression – the-analytics-edge-spring- of
Regression 2017/pages/logistic-regression/
Assessing the Logistic
Model – Industrial Mapping : Predictive Regressi
• log- model creation on –
likelihood Assessin
statistic – g the
deviance Model –
statistic – R log-
and R2 – likeliho
Wald od
Statistic – statistic
odds ratio – –
Sources of devianc
Bias and e
Common statistic
Problems - – R and
Interpreting R2
Binary
Logistic
Regression
3 Factor • Basic International Academia: 12 Basic
Analysis concept of https://ocw.mit.edu/courses/18- concept
Factor s096-topics-in-mathematics-with- of
Analysis, applications-in-finance-fall- Factor
Factor 2013/resources/lecture-15-factor- Analysi
modeling/
Analysis s,
Industrial Mapping : Predictive
Model, Factor
model creation
Statistics Analysi
Associated s
with Factor Model,
Analysis, Statisti
Factor cs
Analysis Associ
Process – ated
Formulate with
the Problem Factor
– Construct Analysi
the s,
Correlation Factor
Matrix- Analysi
Determine s
the method of Process
Factor –
Analysis – Formul
Determine ate the
the number Proble
of Factors – m–
Factor Constr
Extraction uct the
eigenvalues Correla
5|Page
and scree tion
plot- Factor Matrix-
Rotation – Determ
Interpret ine the
Factors – method
Calculate of
Factor Scores Factor
- Determine Analysi
Model Fit. s–
Cluster • Basic International Academia: 12 Basic
4 Analysis concept of https://ocw.mit.edu/courses/6- concept
Cluster 0002-introduction-to- of
Analysis, computational-thinking-and-data- Cluster
Statistics science-fall- Analysi
2016/resources/lecture-12-
Associated s,
clustering/
with Cluster Statistic
Analysis, Industrial Mapping : Predictive s
Cluster model creation Associat
Analysis ed with
Process - Cluster
Formulate Analysi
the Problem s,
– Select a Cluster
distance Analysi
measure – s
Select a Process
clustering
procedure –
Decide on the
number of
Clusters –
Interpret and
Profile
Cluster –
Asses the
reliability
and validity .

Learning Resources:

Text Book:

References:

6|Page
CO-PO Mapping:

CO PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8

BBABA602C01 2 3 1 2

BBABA602CO2 2 3 2 2

BBABA602CO3 1 1 1 2

BBABA602CO4 1 1 3 3

1=Low(Slight) 2=Moderate(Medium) 3=Substantial (High)

7|Page
MODULE -1

8|Page
Multiple Linear Regression

1. Derive normal equations of multiple linear regressions in matrix method. (BL: 6,

Create)

(BL6, Create)

3. Find an expression of R square. (BL: 5, Evaluate)

4. Prove that OLS estimator is unbiased. (BL: 5, Evaluate)

5. Prove that OLS estimator is a minimum variance estimator. (BL: 5, Evaluate)

6. Explain the assumptions of multiple linear regression models using matrix notation.

7. How can you test the overall significance of regression model? (BL: 5, Evaluate)

27 | P a g e
MODULE – 2

ADVANCED DATA ANALYTICS

In multiple regression, in which there are several predictors, a similar equation is derived in which
each predictor has its own coefficient. As such, Y is predicted from a combination of each predictor
variable multiplied by its respective regression coefficient.

Logistic regression is commonly used for prediction and classification problems. Some of these
use cases include:

• Fraud detection: Logistic regression models can help teams identify data anomalies,
which are predictive of fraud. Certain behaviors or characteristics may have a higher
association with fraudulent activities, which is particularly helpful to banking and other
financial institutions in protecting their clients. SaaS-based companies have also started to
adopt these practices to eliminate fake user accounts from their datasets when conducting
data analysis around business performance.

• Disease prediction: In medicine, this analytics approach can be used to predict the
likelihood of disease or illness for a given population. Healthcare organizations can set up
preventative care for individuals that show higher propensity for specific illnesses.

• Churn prediction: Specific behaviors may be indicative of churn in different functions of

an organization. For example, human resources and management teams may want to know
if there are high performers within the company who are at risk of leaving the organization;
this type of insight can prompt conversations to understand problem areas within the
company, such as culture or compensation. Alternatively, the sales organization may want
to learn which of their clients are at risk of taking their business elsewhere. This can prompt
teams to set up a retention strategy to avoid lost revenue.

ASSUMPTIONS OF LOGISTIC REGRESSION

Logistic regression does not make many of the key assumptions of linear regression and general
linear models that are based on ordinary least squares algorithms – particularly regarding linearity,
normality, homoscedasticity, and measurement level.

Firstly, it does not need a linear relationship between the dependent and independent variables.
Logistic regression can handle all sorts of relationships, because it applies a non-linear log
transformation to the predicted odds ratio. Secondly, the independent variables do not need to be
multivariate normal – although multivariate normality yields a more stable solution. Also the error
terms (the residuals) do not need to be multivariate normally distributed. Thirdly,
homoscedasticity is not needed. Logistic regression does not need variances to be heteroscedastic
for each level of the independent variables. Lastly, it can handle ordinal and nominal data as
independent variables. The independent variables do not need to be metric (interval or ratio
scaled). However some other assumptions still apply.

35 | P a g e
Binary logistic regression requires the dependent variable to be binary and ordinal logistic
regression requires the dependent variable to be ordinal. Reducing an ordinal or even metric
variable to dichotomous level loses a lot of information, which makes this test inferior compared
to ordinal logistic regression in these cases.

Secondly, since logistic regression assumes that P(Y=1) is the probability of the event occurring,
it is necessary that the dependent variable is coded accordingly. That is, for a binary regression,
the factor level 1 of the dependent variable should represent the desired outcome.

Thirdly, the model should be fitted correctly. Neither over fitting nor under fitting should occur.
That is only the meaningful variables should be included, but also all meaningful variables should
be included. A good approach to ensure this is to use a stepwise method to estimate the logistic
regression.

Fourthly, the error terms need to be independent. Logistic regression requires each observation to
be independent. That is that the data-points should not be from any dependent samples design, e.g.,
before-after measurements, or matched pairings. Also the model should have little or no
multicollinearity. That is that the independent variables should be independent from each other.
However, there is the option to include interaction effects of categorical variables in the analysis
and the model. If multicollinearity is present centering the variables might resolve the issue, i.e.
deducting the mean of each variable. If this does not lower the multicollinearity, a factor analysis
with orthogonally rotated factors should be done before the logistic regression is estimated.

Fifthly, logistic regression assumes linearity of independent variables and log odds. Whilst it does
not require the dependent and independent variables to be related linearly, it requires that the
independent variables are linearly related to the log odds. Otherwise the test underestimates the
strength of the relationship and rejects the relationship too easily, that is being not significant (not
rejecting the null hypothesis) where it should be significant. A solution to this problem is the
categorization of the independent variables. That is transforming metric variables to ordinal level
and then including them in the model. Another approach would be to use discriminant analysis, if
the assumptions of homoscedasticity, multivariate normality, and absence of multicollinearity are
met.

Lastly, it requires quite large sample sizes. Because maximum likelihood estimates are less
powerful than ordinary least squares (e.g., simple linear regression, multiple linear regression);
whilst OLS needs 5 cases per independent variable in the analysis, ML needs at least 10 cases per
independent variable, some statisticians recommend at least 30 cases for each parameter to be
estimated.

36 | P a g e
TYPES OF LOGISTIC REGRESSION

There are three types of logistic regression models, which are defined based on categorical
response.

• Binary logistic regression: In this approach, the response or dependent variable is

dichotomous in nature—i.e. it has only two possible outcomes (e.g. 0 or 1). Some popular
examples of its use include predicting if an e-mail is spam or not spam or if a tumor is
malignant or not malignant. Within logistic regression, this is the most commonly used
approach, and more generally, it is one of the most common classifiers for binary
classification.
• Multinomial logistic regression: In this type of logistic regression model, the dependent
variable has three or more possible outcomes; however, these values have no specified
order. For example, movie studios want to predict what genre of film a moviegoer is likely
to see to market films more effectively. A multinomial logistic regression model can help
the studio to determine the strength of influence a person's age, gender, and dating status
may have on the type of film that they prefer. The studio can then orient an advertising
campaign of a specific movie toward a group of people likely to go see it.
• Ordinal logistic regression: This type of logistic regression model is leveraged when the
response variable has three or more possible outcome, but in this case, these values do have
a defined order. Examples of ordinal responses include grading scales from A to F or rating
scales from 1 to 5.

Exercises:

1. Explain the probability value of logistic regression. (BL: 4, Analyze)

2. What do you mean by Wald Statistic? (BL: 4, Analyze)

3. Analyze the concept of odds ratio. (BL: 4, Analyze)

4. Discuss the uses of logistic regression. (BL: 5, Evaluate)

37 | P a g e
MODULE – 3

ADVANCED DATA ANALYTICS

1. Why is it useful to rotate the factors? Which is the most common method of rotation? (BL: 5,
Evaluate)

2. What guidelines are available for interpreting the factors? (BL: 4, Analyze)

3. What is the major difference between principal components analysis and common factor
analysis? (BL: 4, Analyze)

4. What hypothesis is examined by Bartlett’s test of sphericity? For what purpose is this test
used? (BL: 5, Evaluate)

5. For what purpose is the Kaiser–Meyer–Olkin measure of sampling adequacy used? (BL: 5,
Evaluate)

52 | P a g e
MODULE – 4

ADVANCED DATA ANALYTICS

1. Why is the average linkage method usually preferred to single linkage and complete linkage? (BL:
5, Evaluate)

2. What guidelines are available for deciding the number of clusters? (BL4:, Analyze)

3. Upon what basis may a researcher decide which variables should be selected to formulate a
clustering problem? (BL: 5, Evaluate)

4. What are some of the uses of cluster analysis in marketing? (BL4:, Analyze)

5. Compare different clustering procedures. (BL4:, Analyze)

M348 Applied Statistical Modelling - Linear Models
No ratings yet
M348 Applied Statistical Modelling - Linear Models
504 pages
APPLIED REGRESSION ANALYSIS AND GENERALIZED LINEAR MODELS Fox 2008
0% (1)
APPLIED REGRESSION ANALYSIS AND GENERALIZED LINEAR MODELS Fox 2008
103 pages
MATH6183 Introduction+Regression
No ratings yet
MATH6183 Introduction+Regression
70 pages
Ms 236 N 0
No ratings yet
Ms 236 N 0
63 pages
Regression Analysis Willey Publication
20% (5)
Regression Analysis Willey Publication
15 pages
Linear Algebra Spring Project 2024099270 Chominhyeok
No ratings yet
Linear Algebra Spring Project 2024099270 Chominhyeok
4 pages
13704416
No ratings yet
13704416
81 pages
83566
No ratings yet
83566
51 pages
Data Analytivs-Unit-2
No ratings yet
Data Analytivs-Unit-2
24 pages
DA-3rd unit
No ratings yet
DA-3rd unit
16 pages
Lecture 6-Revisions Chapter 1-5
No ratings yet
Lecture 6-Revisions Chapter 1-5
62 pages
LinearRegressionUsing R
No ratings yet
LinearRegressionUsing R
91 pages
Da Unit-3
No ratings yet
Da Unit-3
27 pages
Unveiling The Power of Regression Analysis - A Comprehensive Exploration
No ratings yet
Unveiling The Power of Regression Analysis - A Comprehensive Exploration
5 pages
Regression Analysis PDF
No ratings yet
Regression Analysis PDF
3 pages
1 Advanced Data Analysis-Course Outline
No ratings yet
1 Advanced Data Analysis-Course Outline
7 pages
Lec 1 Course Overview
No ratings yet
Lec 1 Course Overview
23 pages
Ba Numerical Ques Ans
No ratings yet
Ba Numerical Ques Ans
41 pages
Chapter 2
No ratings yet
Chapter 2
136 pages
BA unit 2 notes (1)
No ratings yet
BA unit 2 notes (1)
5 pages
Da 2
No ratings yet
Da 2
31 pages
Regression Course outline
No ratings yet
Regression Course outline
5 pages
DA_UNIT_3_R22
No ratings yet
DA_UNIT_3_R22
15 pages
PREDECTIVE ANALYTICS
No ratings yet
PREDECTIVE ANALYTICS
11 pages
1.descriptive Statistics and Probability Distributions:: Datascience Course Content
No ratings yet
1.descriptive Statistics and Probability Distributions:: Datascience Course Content
10 pages
Week01 Lecture BB
No ratings yet
Week01 Lecture BB
70 pages
2csbs3104 Computational Statistics
No ratings yet
2csbs3104 Computational Statistics
2 pages
22CB340
No ratings yet
22CB340
4 pages
DA UNIT-III
No ratings yet
DA UNIT-III
14 pages
Research Tools and Techniques: Comsats Institute of Information Technology, Wah Cantt Department of Management Sciences
No ratings yet
Research Tools and Techniques: Comsats Institute of Information Technology, Wah Cantt Department of Management Sciences
4 pages
Fox 2016 PDF
100% (1)
Fox 2016 PDF
817 pages
XSTK Project PDF
No ratings yet
XSTK Project PDF
26 pages
Unit v -Update
No ratings yet
Unit v -Update
53 pages
Unit-III
No ratings yet
Unit-III
13 pages
Data Science & Machine Learning by Using R Programming
No ratings yet
Data Science & Machine Learning by Using R Programming
6 pages
Bda Unit 5
No ratings yet
Bda Unit 5
14 pages
Applied Regression Analysis (Juran) SP2016
No ratings yet
Applied Regression Analysis (Juran) SP2016
3 pages
P-1.3.1 Linear Regression Analysis
No ratings yet
P-1.3.1 Linear Regression Analysis
9 pages
SS ZG536 - January 2019
No ratings yet
SS ZG536 - January 2019
8 pages
Model Development
No ratings yet
Model Development
80 pages
Da Unit III Data Analytics Unit 1
No ratings yet
Da Unit III Data Analytics Unit 1
39 pages
Practice Question
No ratings yet
Practice Question
21 pages
Machine Learning and Linear Regression
100% (1)
Machine Learning and Linear Regression
55 pages
DA-MODULE-3
No ratings yet
DA-MODULE-3
54 pages
Data Analytics Unit 3 Notes
100% (3)
Data Analytics Unit 3 Notes
28 pages
Arnav MLlab02
No ratings yet
Arnav MLlab02
6 pages
CSE3506 - Essentials of Data Analytics: Facilitator: DR Sathiya Narayanan S
No ratings yet
CSE3506 - Essentials of Data Analytics: Facilitator: DR Sathiya Narayanan S
36 pages
CSE3506 - Essentials of Data Analytics: Facilitator: DR Sathiya Narayanan S
No ratings yet
CSE3506 - Essentials of Data Analytics: Facilitator: DR Sathiya Narayanan S
158 pages
Business Analytics 2nd Edition Evans Test Bank - Quickly Download For The Best Reading Experience
100% (1)
Business Analytics 2nd Edition Evans Test Bank - Quickly Download For The Best Reading Experience
45 pages
Regression Analysis and Forecasting Models
No ratings yet
Regression Analysis and Forecasting Models
28 pages
4. Pa_ppt Unit 4 (1)
No ratings yet
4. Pa_ppt Unit 4 (1)
96 pages
2 4 Module Lectures
No ratings yet
2 4 Module Lectures
10 pages
Techniques of Statistical Analysis 1 Group 2 2014-15
No ratings yet
Techniques of Statistical Analysis 1 Group 2 2014-15
3 pages
5 - Part II - Regression Analysis w-notes(1)
No ratings yet
5 - Part II - Regression Analysis w-notes(1)
10 pages
Linear Regression: What Is Regression Analysis?
100% (1)
Linear Regression: What Is Regression Analysis?
21 pages
DA Notes 3
No ratings yet
DA Notes 3
12 pages
Progression Linaire
No ratings yet
Progression Linaire
187 pages
Control Charts: Six Sigma Thinking, #7
From Everand
Control Charts: Six Sigma Thinking, #7
Sumeet Savant
4/5 (1)
Bundle Adjustment: Optimizing Visual Data for Precise Reconstruction
From Everand
Bundle Adjustment: Optimizing Visual Data for Precise Reconstruction
Fouad Sabry
No ratings yet
Techniques of Counting: Definitive Reference for Developers and Engineers
From Everand
Techniques of Counting: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Uncertainty Analysis - B PDF
100% (1)
Uncertainty Analysis - B PDF
91 pages
Who Boys Rda + Interpret
No ratings yet
Who Boys Rda + Interpret
34 pages
4glm3 Ha Online
No ratings yet
4glm3 Ha Online
51 pages
Instant download Essential Statistics for Public Managers and Policy Analysts Wang pdf all chapter
80% (5)
Instant download Essential Statistics for Public Managers and Policy Analysts Wang pdf all chapter
55 pages
Statistics
No ratings yet
Statistics
32 pages
Boletín Científico de La Escuela Superior Atotonilco de Tula
No ratings yet
Boletín Científico de La Escuela Superior Atotonilco de Tula
5 pages
Box and Whisker Notes
No ratings yet
Box and Whisker Notes
5 pages
Problem 3.6
No ratings yet
Problem 3.6
4 pages
Model Selection and Model Averaging
No ratings yet
Model Selection and Model Averaging
16 pages
Chapter three
No ratings yet
Chapter three
35 pages
Course Outline
No ratings yet
Course Outline
1 page
Application Nordmal Distribution
No ratings yet
Application Nordmal Distribution
4 pages
How To Choose The Right Statistical Tool in Prism
No ratings yet
How To Choose The Right Statistical Tool in Prism
3 pages
Forecasting Beta: How Well Does The Five-Year Rule of Thumb' Do?
No ratings yet
Forecasting Beta: How Well Does The Five-Year Rule of Thumb' Do?
37 pages
STAT 251 Course Text
No ratings yet
STAT 251 Course Text
179 pages
Pms - Contoh Tubes Pms
No ratings yet
Pms - Contoh Tubes Pms
74 pages
Curve Estimation Explained
50% (2)
Curve Estimation Explained
4 pages
The Hadamard Product and Some of Its Applications in Statistics PDF
No ratings yet
The Hadamard Product and Some of Its Applications in Statistics PDF
10 pages
DATA 1. Pre-Board Scores of The Selected BS Education Students (Per Section) Section 1 Section 2 Section 3 Section 4 Section 5
100% (1)
DATA 1. Pre-Board Scores of The Selected BS Education Students (Per Section) Section 1 Section 2 Section 3 Section 4 Section 5
10 pages
Lecture-14 (Test for Population Variances) (2)
No ratings yet
Lecture-14 (Test for Population Variances) (2)
6 pages
Unit II HONOR- Continuous Distibution
No ratings yet
Unit II HONOR- Continuous Distibution
14 pages
Exercices Supplementaires
No ratings yet
Exercices Supplementaires
4 pages
Lazy Learning (Or Learning From Your Neighbors)
No ratings yet
Lazy Learning (Or Learning From Your Neighbors)
3 pages
EViews Workshop
No ratings yet
EViews Workshop
26 pages
Athlete Anxiety Questionnaire The Development and
No ratings yet
Athlete Anxiety Questionnaire The Development and
9 pages
Analisis Statistika: Materi 4 Sebaran Penarikan Contoh (Sampling Distribution)
No ratings yet
Analisis Statistika: Materi 4 Sebaran Penarikan Contoh (Sampling Distribution)
19 pages
STA301-Statistics and Probability: Solved MCQS From Final Term Papers
0% (1)
STA301-Statistics and Probability: Solved MCQS From Final Term Papers
65 pages
bpharma-8-sem-biostatistics-and-research-methodology-79764-dec-2022
No ratings yet
bpharma-8-sem-biostatistics-and-research-methodology-79764-dec-2022
4 pages
UDPG1673 - Tutorial 3 - 201601
No ratings yet
UDPG1673 - Tutorial 3 - 201601
3 pages
Macro 1 - Bootstrap
No ratings yet
Macro 1 - Bootstrap
10 pages

BBABB602 Study Material and Syllabus

Uploaded by

BBABB602 Study Material and Syllabus

Uploaded by

Advanced Data Analytics

Linear Simple Linear International Academia: 12 Simple

CO PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8

1=Low(Slight) 2=Moderate(Medium) 3=Substantial (High)

1. Derive normal equations of multiple linear regressions in matrix method. (BL: 6,

3. Find an expression of R square. (BL: 5, Evaluate)

4. Prove that OLS estimator is unbiased. (BL: 5, Evaluate)

5. Prove that OLS estimator is a minimum variance estimator. (BL: 5, Evaluate)

ADVANCED DATA ANALYTICS

• Churn prediction: Specific behaviors may be indicative of churn in different functions of

ASSUMPTIONS OF LOGISTIC REGRESSION

• Binary logistic regression: In this approach, the response or dependent variable is

1. Explain the probability value of logistic regression. (BL: 4, Analyze)

2. What do you mean by Wald Statistic? (BL: 4, Analyze)

3. Analyze the concept of odds ratio. (BL: 4, Analyze)

4. Discuss the uses of logistic regression. (BL: 5, Evaluate)

ADVANCED DATA ANALYTICS

ADVANCED DATA ANALYTICS

5. Compare different clustering procedures. (BL4:, Analyze)

You might also like