BA ZG523 Introduction To Data Science
BA ZG523 Introduction To Data Science
Course Objectives
No
CO1 Gain basic understanding of the role of Data Science in various contexts
CO2 Understand the role of various concepts like Statistics, Machine learning etc in Data
Science
CO4 Understanding the key terms and tools used by Data Scientist
CO5 Understand the process of collecting data from unstructured sources and store it using
appropriate structure such as relational databases, graphs, matrices, etc
Text Books
T1 An Introduction to Data Science by Jeffrey Stanton(free ebook)
T2 Practical Data Science with R by Nina Zumel and John Mount, Indian Edition by
Dreamtech Press, 2014.
T3 The Art of Data Science by Roger D Peng and Elizabeth Matsui
T4 Analytics in a Big Data World, Bart Baesens,Wiley
Reference Book
R1 Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, Vipin Kumar,
Pearson
Content Structure
1. Introduction to Data Science
1.1 Definition
1.2 Motivating Examples
1.3 Roles and responsibilities of a Data Scientist
1.4 Ethical guidelines for Data Scientist
1.5 Data Science concerns
3. Data
3.1 Data driven decision making
3.2 Data acquisition from various sources
3.3 Data Preparation
3.4 Data formats
3.5 Data quality
3.6 High dimensional data
3.7 Data Models
3.7.1. Models as expectation
3.7.2. Comparing models to reality
3.7.3. Reactions to Data
3.7.4. Refining our expectations
4. Data Representation
4.1 Graphs & Networks
4.2 Matrices
4.3 Vectors
4.4 Libraries of Graph, Matrices and vectors
5. Data Analytics
5.1 Definition
5.2 Types of Analytics
5.3 Analytics Methodology
5.4 Analytics terminology
5.5 Applications
6. Sampling Techniques
6.1 Need for Sampling
6.2 Applications
6.3 Sampling Techniques
6.3.1 Weighted random sampling
6.3.2 Priority sampling
6.3.3 Non uniformity sampling
No Learning Outcomes-
LO3 understand key terms and apply the tools used by a Data Scientist
1. This course will feature experiential learning components in the form of assignments like data modeling
using Microsoft Excel. This will also form part of the Evaluation Components for the course.
Pre CH 1.1 Review Course Handout/ List your Chapter 1 of T1, Chapter 1 of
expectations from this course T4
Post 1.1 Review reference chapters from textbook Chapter 1 of T1, Chapter 1 of
CH T4, Class notes
Pre CH 1.2 Review of Previous weeks topics & the Chapter 1 of T1, Chapter 1 of
present week’s content T4
Post 1.2 Review reference chapters from textbook Chapter 1 of T1, Chapter 1 of
CH T4, Class notes
Pre CH 1.3 & 1.4 Review of Previous weeks topics & the Chapter 1 of T1, Chapter 1 of
present week’s content T4
During 1.3 ,1.4 Roles & responsibilities of a Data Scientist Chapter 1 of T1, Chapter 1 of
CH Ethical Guidelines for Data Scientist T4, Class notes
Post 1.3 ,1.4 Review reference chapters from textbook Chapter 1 of T1, Chapter 1 of
CH T4, Class notes
Pre CH 1.5 Review of Previous weeks topics & the Chapter 1 of T1, Chapter 1 of
present week’s content T4
Post CH 1.5 Review reference chapters from textbook Chapter 1 of T1, Chapter 1 of
T4, Class notes
Pre CH 2.1 Review of Previous weeks topics & the Chapter 1-T2, class notes
present week’s content
During CH 2.1 Roles in a Data Science project Chapter 1-T2, class notes
Post CH 2.1 Review reference chapters from textbook Chapter 1-T2, class notes
During CH 2.2 & 2.3 Stages of a Data Science project Chapter 1-T2, class notes
Setting expectations
Post CH 2.2 & 2.3 Review reference chapters from textbook Chapter 1-T2, class notes
Pre CH 3.1 & 3.2 Review of Previous weeks topics & the Chapter2-T4,Chapter 2- R1,
present week’s content class notes
During CH 3.1 & 3.2 Data driven decision making Chapter2-T4,Chapter 2- R1,
Data acquisition class notes
Post CH 3.1 & 3.2 Review reference chapters from textbook Chapter2-T4,Chapter 2- R1,
class notes
Pre CH 3.3 & 3.4 Review of Previous weeks topics & the Chapter2-T4,Chapter 2- R1,
present week’s content class notes
Post CH 3.3 & 3.4 Review reference chapters from textbook Chapter2-T4,Chapter 2- R1,
class notes
Pre CH 3.5 & 3.6 Review of Previous weeks topics & the Chapter2-T4,Chapter 2 & 3-
present week’s content R1, class notes
Post CH 3.5 & 3.6 Review reference chapters from textbook Chapter2-T4,Chapter 2 & 3-
R1, class notes
Contact Hour 10 (Data)
Pre CH 3.7 Review of Previous weeks topics & the Chapter 5-T3
present week’s content
Pre CH 4.1 Review of Previous weeks topics & the Mathematics preliminaries-
present week’s content class notes
Pre CH 4.2 Review of Previous weeks topics & the Mathematics preliminaries-
present week’s content class notes
Pre CH 4.3 & 4.4 Review of Previous weeks topics & the Mathematics preliminaries-
present week’s content class notes
Pre CH 5.1 & 5.2 Review of Previous weeks topics & the Chapter 3 & 4- T4, class
present week’s content notes
Post CH 5.1 & 5.2 Review reference chapters from textbook Chapter 3 & 4- T4, class
notes
Pre CH 5.2 Review of Previous weeks topics & the Chapter 3 & 4- T4, class
present week’s content notes
During CH 5.2 Types of Analytics – Discussion with Chapter 3 & 4- T4, class
Examples notes
Post CH 5.2 Review reference chapters from textbook Chapter 3 & 4- T4, class
notes
During CH 5.3 & 5.4 Analytics Methodology Chapter 3 & 4- T4, class
notes
Post CH 5.3 & 5.4 Review reference chapters from textbook Chapter 3 & 4- T4, class
notes
Pre CH 5.5 Review of Previous weeks topics & the Chapter 3 & 4- T4, class
present week’s content notes
Post CH 5.5 Review reference chapters from textbook Chapter 3 & 4- T4, class
notes
Pre CH 6.1 & 6.2 Review of Previous weeks topics & the Chapter 2 -T4, Chapter 2 –
present week’s content R1, class notes
During CH 6.1 & 6.2 Sampling & Applications Chapter 2 -T4, Chapter 2 –
R1, class notes
Post CH 6.1 & 6.2 Review reference chapters from textbook Chapter 2 -T4, Chapter 2 –
R1, class notes
Pre CH 6.3 Review of Previous weeks topics & the Chapter 2 -T4, Chapter 2 –
present week’s content R1, class notes
Post CH 6.3 Review reference chapters from textbook Chapter 2 -T4, Chapter 2 –
R1, class notes
Pre CH 7.6 Review of Previous weeks topics & the Appendix B–R1, class notes
present week’s content
Post CH 7.6 Review reference chapters from textbook Appendix B–R1, class notes
Pre CH 7.1 & 7.2 Review of Previous weeks topics & the Chapter 3- T4, Chapter 4 &
present week’s content Appendix D –R1, class notes
Post CH 7.1 & 7.2 Review reference chapters from textbook Chapter 3- T4, Chapter 4 &
Appendix D –R1, class notes
Pre CH 7.4 Review of Previous weeks topics & the Chapter 3 & 4- T4, Chapter 4
present week’s content & Chapter 8 –R1, class notes
During CH 7.4 Decision trees, rules- rule based classifiers Chapter 3 & 4- T4, Chapter 4
& Chapter 8 –R1, class notes
Post CH 7.4 Review reference chapters from textbook Chapter 3 & 4- T4, Chapter 4
& Chapter 8 –R1, class notes
During CH 7.4 Decision trees, rules- rule based classifiers Chapter 3 & 4- T4, Chapter 4
& Chapter 8 –R1, class notes
Post CH 7.4 Review reference chapters from textbook Chapter 3 & 4- T4, Chapter 4
& Chapter 8 –R1, class notes
Pre CH 7.3 Review of Previous weeks topics & the Chapter 3 & 4- T4, Chapter 4
present week’s content & Chapter 8 –R1, class notes
Post CH 7.3 Review reference chapters from textbook Chapter 3 & 4- T4, Chapter 4
& Chapter 8 –R1, class notes
Pre CH 7.8 &7.9 Review of Previous weeks topics & the Chapter 3- T4, Chapter 5–R1,
present week’s content class notes
During CH 7.8 &7.9 Deep Learning, Neural networks Chapter 3- T4, Chapter 5–R1,
class notes
Post CH 7.8 &7.9 Review reference chapters from textbook Chapter 3- T4, Chapter 5–R1,
class notes
During CH 7.5 & 7.7 Introduction to PCA & Text mining Class notes
Post CH 7.5 & 7.7 Review reference chapters from notes Class notes
Pre CH 7.10 Review of Previous weeks topics & the Chapter 3- T4, Chapter 5–R1,
present week’s content class notes
Post CH 7.9 & 7.10 Review reference chapters from textbook Chapter 3- T4, Chapter 5–R1,
class notes
Pre CH 8.1 – 8.3 Review of Previous weeks topics & the T2, Class notes
present week’s content
During CH 8.1 – 8.3 Introduction to SPSS, SAS, R – Console & T2, Class notes
Studio
Post CH 8.1 – 8.3 Review reference chapters from notes T2,Class notes
Pre CH 8.4 – 8.7 Review of Previous weeks topics & the T2, Class notes
present week’s content
Post CH 8.4 – 8.7 Review reference chapters from notes T2, Class notes
Post CH
Evaluation Scheme:
Legend: EC = Evaluation Component; AN = After Noon Session; FN = Fore Noon Session
No Name Type Duration Weight Day, Date, Session, Time
EC-1 Quiz-I/ Assignment-I Online - 5% September 10 – 20, 2018
Quiz-II Online - 5% October 20 – 30, 2018
Experiential Online - 15% November 10 – 20, 2018
Learning
EC-2 Mid-Semester Test Closed 2 hours 30% 29/09/2018 (FN) 10 AM – 12 Noon
Book
EC-3 Comprehensive Open 3 hours 45% 24/11/2018 (FN) 9 AM – 12 Noon
Exam Book
Notes:
Syllabus for Mid-Semester Test (Closed Book): Topics in Session Nos. 1 to 16 (contact hours)
Syllabus for Comprehensive Exam (Open Book): All topics (Session Nos. 1 to 32) (contact hours)
It shall be the responsibility of the individual student to be regular in maintaining the self study schedule as
given in the course handout, attend the online lectures, and take all the prescribed evaluation components such
as Assignment/Quiz, Mid-Semester Test and Comprehensive Exam according to the evaluation scheme
provided in the handout.
Instructor-in-charge