100% found this document useful (1 vote)
277 views

BA ZG523 Introduction To Data Science

The document outlines the content design for an introductory course on data science. [1] It includes the course objectives, textbooks, content structure, learning outcomes and experiential learning components. [2] The content is divided into 8 modules covering topics such as the data science process, data, data representation, analytics, algorithms and tools. [3] A learning plan is also provided, which details the pre, during and post class activities for the initial contact hours covering introductions to data science and the data science process.

Uploaded by

Clitt Orise
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
277 views

BA ZG523 Introduction To Data Science

The document outlines the content design for an introductory course on data science. [1] It includes the course objectives, textbooks, content structure, learning outcomes and experiential learning components. [2] The content is divided into 8 modules covering topics such as the data science process, data, data representation, analytics, algorithms and tools. [3] A learning plan is also provided, which details the pre, during and post class activities for the initial contact hours covering introductions to data science and the data science process.

Uploaded by

Clitt Orise
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 12

BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI

WORK INTEGRATED LEARNING PROGRAMMES


Part A: Content Design

Course Title Introduction to Data Science


Course No(s) BA ZG523
Credit Units 4
Credit Model
Content Authors

Course Objectives
No

CO1 Gain basic understanding of the role of Data Science in various contexts

CO2 Understand the role of various concepts like Statistics, Machine learning etc in Data
Science

CO3 Understand the roles and stages in a Data Science Project

CO4 Understanding the key terms and tools used by Data Scientist

CO5 Understand the process of collecting data from unstructured sources and store it using
appropriate structure such as relational databases, graphs, matrices, etc

Text Books
T1 An Introduction to Data Science by Jeffrey Stanton(free ebook)
T2 Practical Data Science with R by Nina Zumel and John Mount, Indian Edition by
Dreamtech Press, 2014.
T3 The Art of Data Science by Roger D Peng and Elizabeth Matsui
T4 Analytics in a Big Data World, Bart Baesens,Wiley

Reference Book
R1 Introduction to Data Mining, Pang-Ning Tan, Michael Steinbach, Vipin Kumar,
Pearson

Content Structure
1. Introduction to Data Science

1.1 Definition
1.2 Motivating Examples
1.3 Roles and responsibilities of a Data Scientist
1.4 Ethical guidelines for Data Scientist
1.5 Data Science concerns

2. Data Science Process

2.1 Roles in a Data Science Project


2.2 Stages of a Data Science Project
2.3 Setting expectations

3. Data
3.1 Data driven decision making
3.2 Data acquisition from various sources
3.3 Data Preparation
3.4 Data formats
3.5 Data quality
3.6 High dimensional data
3.7 Data Models
3.7.1. Models as expectation
3.7.2. Comparing models to reality
3.7.3. Reactions to Data
3.7.4. Refining our expectations

4. Data Representation
4.1 Graphs & Networks
4.2 Matrices
4.3 Vectors
4.4 Libraries of Graph, Matrices and vectors

5. Data Analytics
5.1 Definition
5.2 Types of Analytics
5.3 Analytics Methodology
5.4 Analytics terminology
5.5 Applications

6. Sampling Techniques
6.1 Need for Sampling
6.2 Applications
6.3 Sampling Techniques
6.3.1 Weighted random sampling
6.3.2 Priority sampling
6.3.3 Non uniformity sampling

7. Algorithms for mass data problems


7.1 Regression
7.2 Classification - Supervised
7.3 Clustering -
7.4 Decision trees / rules
7.5 PCA
7.6 Time series
7.7 Text mining
7.8 Deep learning
7.9 Neural networks
7.10 Random forest

8. Tools of Data Science


8.1 SPSS
8.2 SAS
8.3 R
8.4 Python
8.5 Tableau
8.6 Excel
8.7 Qlickview
Learning Outcomes:

No Learning Outcomes-

Students should be able to

LO1 understand and apply the principles of Data Science

LO2 describe the structure of a Data Science project

LO3 understand key terms and apply the tools used by a Data Scientist

LO4 understand and apply the algorithms used in Data Science

Experiential Learning Components:

1. This course will feature experiential learning components in the form of assignments like data modeling
using Microsoft Excel. This will also form part of the Evaluation Components for the course.

Module Topic Experiential Learning Component


Algorithms for mass
7 Data Modeling
data problems

Part B: Learning Plan

Academic Term First Semester 2018-2019


Course Title Introduction to Data Science
Course No BA ZG523
Lead Instructor

Contact Hour 1 (Introduction to Data Science)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 1.1 Review Course Handout/ List your Chapter 1 of T1, Chapter 1 of
expectations from this course T4

During 1.1 Introduction to Data Science Chapter 1 of T1, Chapter 1 of


CH T4, Class notes

Post 1.1 Review reference chapters from textbook Chapter 1 of T1, Chapter 1 of
CH T4, Class notes

Contact Hour 2(Introduction to Data Science)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 1.2 Review of Previous weeks topics & the Chapter 1 of T1, Chapter 1 of
present week’s content T4

During 1.2 Motivating examples - Discussion Chapter 1 of T1, Chapter 1 of


CH T4, Class notes

Post 1.2 Review reference chapters from textbook Chapter 1 of T1, Chapter 1 of
CH T4, Class notes

Contact Hour 3 (Introduction to Data Science)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 1.3 & 1.4 Review of Previous weeks topics & the Chapter 1 of T1, Chapter 1 of
present week’s content T4

During 1.3 ,1.4 Roles & responsibilities of a Data Scientist Chapter 1 of T1, Chapter 1 of
CH Ethical Guidelines for Data Scientist T4, Class notes

Post 1.3 ,1.4 Review reference chapters from textbook Chapter 1 of T1, Chapter 1 of
CH T4, Class notes

Contact Hour 4 (Introduction to Data Science)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 1.5 Review of Previous weeks topics & the Chapter 1 of T1, Chapter 1 of
present week’s content T4

During CH 1.5 Data Science concerns Chapter 1 of T1, Chapter 1 of


T4, Class notes

Post CH 1.5 Review reference chapters from textbook Chapter 1 of T1, Chapter 1 of
T4, Class notes

Contact Hour 5 (Data Science Process)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 2.1 Review of Previous weeks topics & the Chapter 1-T2, class notes
present week’s content

During CH 2.1 Roles in a Data Science project Chapter 1-T2, class notes

Post CH 2.1 Review reference chapters from textbook Chapter 1-T2, class notes

Contact Hour 6 (Data Science Process)

Type Content Ref. Topic Title Study/HW Resource


Reference
Pre CH 2.2 & 2.3 Review of Previous weeks topics & the Chapter 1-T2
present week’s content

During CH 2.2 & 2.3 Stages of a Data Science project Chapter 1-T2, class notes
Setting expectations

Post CH 2.2 & 2.3 Review reference chapters from textbook Chapter 1-T2, class notes

Contact Hour 7 (Data)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 3.1 & 3.2 Review of Previous weeks topics & the Chapter2-T4,Chapter 2- R1,
present week’s content class notes

During CH 3.1 & 3.2 Data driven decision making Chapter2-T4,Chapter 2- R1,
Data acquisition class notes

Post CH 3.1 & 3.2 Review reference chapters from textbook Chapter2-T4,Chapter 2- R1,
class notes

Contact Hour 8 (Data)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 3.3 & 3.4 Review of Previous weeks topics & the Chapter2-T4,Chapter 2- R1,
present week’s content class notes

During CH 3.3 & 3.4 Data preparation Chapter2-T4,Chapter 2- R1,


Data formats class notes

Post CH 3.3 & 3.4 Review reference chapters from textbook Chapter2-T4,Chapter 2- R1,
class notes

Contact Hour 9 (Data)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 3.5 & 3.6 Review of Previous weeks topics & the Chapter2-T4,Chapter 2 & 3-
present week’s content R1, class notes

During CH 3.5 & 3.6 Data quality Chapter2-T4,Chapter 2 & 3 -


High dimensional data R1, class notes

Post CH 3.5 & 3.6 Review reference chapters from textbook Chapter2-T4,Chapter 2 & 3-
R1, class notes
Contact Hour 10 (Data)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 3.7 Review of Previous weeks topics & the Chapter 5-T3
present week’s content

During CH 3.7 Data models Chapter 5-T3

Post CH 3.7 Review reference chapters from textbook Chapter 5-T3

Contact Hour 11 (Data Representation)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 4.1 Review of Previous weeks topics & the Mathematics preliminaries-
present week’s content class notes

During CH 4.1 Data Representation - Graphs & Networks Mathematics preliminaries-


class notes

Post CH 4.1 Problems Mathematics preliminaries-


class notes

Contact Hour 12 (Data Representation)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 4.2 Review of Previous weeks topics & the Mathematics preliminaries-
present week’s content class notes

During CH 4.2 Data Representation -Matrices Mathematics preliminaries-


class notes

Post CH 4.2 Problems Mathematics preliminaries-


class notes

Contact Hour 13 (Data Representation)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 4.3 & 4.4 Review of Previous weeks topics & the Mathematics preliminaries-
present week’s content class notes

During CH 4.3 & 4.4 Data Representation -Vectors, Mathematics preliminaries-


Libraries of Graph, Matrices & vectors class notes
Post CH 4.3 & 4.4 Problems Mathematics preliminaries-
class notes

Contact Hour 14 (Data Analytics)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 5.1 & 5.2 Review of Previous weeks topics & the Chapter 3 & 4- T4, class
present week’s content notes

During CH 5.1 & 5.2 Definition Chapter 3 & 4- T4, class


Types of Analytics notes

Post CH 5.1 & 5.2 Review reference chapters from textbook Chapter 3 & 4- T4, class
notes

Contact Hour 15 (Data Analytics)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 5.2 Review of Previous weeks topics & the Chapter 3 & 4- T4, class
present week’s content notes

During CH 5.2 Types of Analytics – Discussion with Chapter 3 & 4- T4, class
Examples notes

Post CH 5.2 Review reference chapters from textbook Chapter 3 & 4- T4, class
notes

Contact Hour 16 (Revision)_MID SEM

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 1.1 to 5.2 Review of all the topics covered

During CH 1.1 to 5.2 Discussion & review of all the topics


before mid semester

Post CH 1.1 to 5.2

Contact Hour 17 (Data Analytics)

Type Content Ref. Topic Title Study/HW Resource


Reference
Pre CH 5.3 & 5.4 Review of Previous weeks topics & the Chapter 3 & 4- T4, class
present week’s content notes

During CH 5.3 & 5.4 Analytics Methodology Chapter 3 & 4- T4, class
notes

Post CH 5.3 & 5.4 Review reference chapters from textbook Chapter 3 & 4- T4, class
notes

Contact Hour 18 (Data Analytics)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 5.5 Review of Previous weeks topics & the Chapter 3 & 4- T4, class
present week’s content notes

During CH 5.5 Analytics - Applications Chapter 3 & 4- T4, class


notes

Post CH 5.5 Review reference chapters from textbook Chapter 3 & 4- T4, class
notes

Contact Hour 19 (Sampling Techniques)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 6.1 & 6.2 Review of Previous weeks topics & the Chapter 2 -T4, Chapter 2 –
present week’s content R1, class notes

During CH 6.1 & 6.2 Sampling & Applications Chapter 2 -T4, Chapter 2 –
R1, class notes

Post CH 6.1 & 6.2 Review reference chapters from textbook Chapter 2 -T4, Chapter 2 –
R1, class notes

Contact Hour 20 (Sampling Techniques)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 6.3 Review of Previous weeks topics & the Chapter 2 -T4, Chapter 2 –
present week’s content R1, class notes

During CH 6.3 Sampling Techniques Chapter 2 -T4, Chapter 2 –


R1, class notes

Post CH 6.3 Review reference chapters from textbook Chapter 2 -T4, Chapter 2 –
R1, class notes

Contact Hour 21 (Algorithms)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 7.6 Review of Previous weeks topics & the Appendix B–R1, class notes
present week’s content

During CH 7.6 Appendix B–R1, class notes


Time Series

Post CH 7.6 Review reference chapters from textbook Appendix B–R1, class notes

Contact Hour 22 (Algorithms)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 7.1 & 7.2 Review of Previous weeks topics & the Chapter 3- T4, Chapter 4 &
present week’s content Appendix D –R1, class notes

During CH 7.1 & 7.2 Regression Chapter 3- T4, Chapter 4 &


Classification - supervised Appendix D –R1, class notes

Post CH 7.1 & 7.2 Review reference chapters from textbook Chapter 3- T4, Chapter 4 &
Appendix D –R1, class notes

Contact Hour 23 (Algorithms)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 7.4 Review of Previous weeks topics & the Chapter 3 & 4- T4, Chapter 4
present week’s content & Chapter 8 –R1, class notes

During CH 7.4 Decision trees, rules- rule based classifiers Chapter 3 & 4- T4, Chapter 4
& Chapter 8 –R1, class notes

Post CH 7.4 Review reference chapters from textbook Chapter 3 & 4- T4, Chapter 4
& Chapter 8 –R1, class notes

Contact Hour 24 (Algorithms)

Type Content Ref. Topic Title Study/HW Resource


Reference
Pre CH 7.4 Review of Previous weeks topics & the Chapter 3 & 4- T4, Chapter 4
present week’s content & Chapter 8 –R1, class notes

During CH 7.4 Decision trees, rules- rule based classifiers Chapter 3 & 4- T4, Chapter 4
& Chapter 8 –R1, class notes

Post CH 7.4 Review reference chapters from textbook Chapter 3 & 4- T4, Chapter 4
& Chapter 8 –R1, class notes

Contact Hour 25 (Algorithms)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 7.3 Review of Previous weeks topics & the Chapter 3 & 4- T4, Chapter 4
present week’s content & Chapter 8 –R1, class notes

During CH 7.3 Clustering - unsupervised Chapter 3 & 4- T4, Chapter 4


& Chapter 8 –R1, class notes

Post CH 7.3 Review reference chapters from textbook Chapter 3 & 4- T4, Chapter 4
& Chapter 8 –R1, class notes

Contact Hour 26 (Algorithms)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 7.8 &7.9 Review of Previous weeks topics & the Chapter 3- T4, Chapter 5–R1,
present week’s content class notes

During CH 7.8 &7.9 Deep Learning, Neural networks Chapter 3- T4, Chapter 5–R1,
class notes

Post CH 7.8 &7.9 Review reference chapters from textbook Chapter 3- T4, Chapter 5–R1,
class notes

Contact Hour 27 (Algorithms)

Type Content Ref. Topic Title Study/HW Resource


Reference
Pre CH 7.5 & 7.7 Review of Previous weeks topics & the Class notes
present week’s content

During CH 7.5 & 7.7 Introduction to PCA & Text mining Class notes

Post CH 7.5 & 7.7 Review reference chapters from notes Class notes

Contact Hour 28 (Algorithms)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 7.10 Review of Previous weeks topics & the Chapter 3- T4, Chapter 5–R1,
present week’s content class notes

During CH 7.10 Introduction to Random forest Chapter 3- T4, Chapter 5–R1,


class notes

Post CH 7.9 & 7.10 Review reference chapters from textbook Chapter 3- T4, Chapter 5–R1,
class notes

Contact Hour 29 (Tools of Data Science)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 8.1 – 8.3 Review of Previous weeks topics & the T2, Class notes
present week’s content

During CH 8.1 – 8.3 Introduction to SPSS, SAS, R – Console & T2, Class notes
Studio

Post CH 8.1 – 8.3 Review reference chapters from notes T2,Class notes

Contact Hour 30 (Tools of Data Science)

Type Content Ref. Topic Title Study/HW Resource


Reference

Pre CH 8.4 – 8.7 Review of Previous weeks topics & the T2, Class notes
present week’s content

During CH 8.4 – 8.7 Introduction to R, Python, Tableau, Excel , T2,Class notes


Qlickview

Post CH 8.4 – 8.7 Review reference chapters from notes T2, Class notes

Contact Hour 31 & 32 (Revision)


Type Content Ref. Topic Title Study/HW Resource
Reference

Pre CH Review of all topics

During CH Discussion & Revision

Post CH

Evaluation Scheme:
Legend: EC = Evaluation Component; AN = After Noon Session; FN = Fore Noon Session
No Name Type Duration Weight Day, Date, Session, Time
EC-1 Quiz-I/ Assignment-I Online - 5% September 10 – 20, 2018
Quiz-II Online - 5% October 20 – 30, 2018
Experiential Online - 15% November 10 – 20, 2018
Learning
EC-2 Mid-Semester Test Closed 2 hours 30% 29/09/2018 (FN) 10 AM – 12 Noon
Book
EC-3 Comprehensive Open 3 hours 45% 24/11/2018 (FN) 9 AM – 12 Noon
Exam Book

Notes:
Syllabus for Mid-Semester Test (Closed Book): Topics in Session Nos. 1 to 16 (contact hours)
Syllabus for Comprehensive Exam (Open Book): All topics (Session Nos. 1 to 32) (contact hours)

Important links and information:


Elearn portal: https://elearn.bits-pilani.ac.in
Students are expected to visit the Elearn portal on a regular basis and stay up to date with the latest
announcements and deadlines.
Contact sessions: Students should attend the online lectures as per the schedule provided on the Elearn portal.
Evaluation Guidelines:
1. EC-1 consists of either two Assignments or three Quizzes. Students will attempt them through the
course pages on the Elearn portal. Announcements will be made on the portal, in a timely manner.
2. For Closed Book tests: No books or reference material of any kind will be permitted.
3. For Open Book exams: Use of books and any printed / written reference material (filed or bound) is
permitted. However, loose sheets of paper will not be allowed. Use of calculators is permitted in all
exams. Laptops/Mobiles of any kind are not allowed. Exchange of any material is not allowed.
4. If a student is unable to appear for the Regular Test/Exam due to genuine exigencies, the student should
follow the procedure to apply for the Make-Up Test/Exam which will be made available on the Elearn
portal. The Make-Up Test/Exam will be conducted only at selected exam centres on the dates to be
announced later.

It shall be the responsibility of the individual student to be regular in maintaining the self study schedule as
given in the course handout, attend the online lectures, and take all the prescribed evaluation components such
as Assignment/Quiz, Mid-Semester Test and Comprehensive Exam according to the evaluation scheme
provided in the handout.

Instructor-in-charge

You might also like