Dssyll
Dssyll
2023
@#01112023
CSE 28.06.2023
Data Mining Concepts: Overview of Data Mining Technology, Association Rules, Classification, Clustering,
Approaches to Other Data Mining Problems, Applications of Data Mining, Commercial Data Mining Tools. Overview
of Data Warehousing and OLAP: Introduction, Definitions, and Terminology, Characteristics of Data Warehouses,
Data Modelling for Data Warehouses, building a Data Warehouse, Typical Functionality of a Data Warehouse, Data
Warehouse versus Views, Difficulties of Implementing Data Warehouses.
Teaching-
Learning Chalk and Talk/ PPT / Case Study:
Process https://www.researchgate.net/publication/47393965_Data_warehousing_and_data_mining_A_case_
study
2. Database Management Systems, Raghu Ramakrishnan and Johannes Gehrke, McGraw-Hill, 3rd Edition, 2013.
3. Database System Concepts, Abraham Silberschatz, Henry F. Korth, S. Sudarshan, McGraw Hill, 6th Edition,
2010.
@#01112023
CSE 28.06.2023
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
4 Conduct investigations of complex problems: Use research-based knowledge and research PO4
methods including design of experiments, analysis and interpretation of data, and
synthesis of the information to provide valid conclusions.
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent responsibilities
relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or leader PO9
in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as a
member and leader in a team, to manage projects and in multidisciplinary environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X X
CO2 X X
CO3 X X
@#01112023
CSE 28.06.2023
PREDICTIVE ANALYTICS
Course Code 22SDS14 CIE Marks 50
Teaching Hours/Week (L:P:SDA) 2:0:2 SEE Marks 50
Total Hours of Pedagogy 40 Total Marks 100
Credits 03 Exam Hours 03
Teaching-
Learning Chalk and Talk/ PPT / Web resources
Process
Module-3
.Additive Models,Trees,and Boosting: Generalized additive models, Regression and classification trees , Boosting
methods-exponential loss and AdaBoost, Numerical Optimization via gradient boosting ,Examples ( Spam data,
California housing , New Zealand fish, Demographic data)
Teaching-
Learning Chalk and Talk/ PPT / Web resources
Process
Module-4
Neural Networks(NN) , Support Vector Machines(SVM),and K-nearest Neighbour: Fitting neural networks, Back
propagation, Issues in training NN, SVM for classification, Reproducing Kernels, SVM for regression, K-nearest –
Neighbour classifiers( Image Scene Classification)
Teaching-
Learning Chalk and Talk/ PPT / Web resources
Process
Module-5
Unsupervised Learning and Random forests: Association rules, Cluster analysis, Principal Components, Random
forests and analysis.
Teaching-
Learning Chalk and Talk/ PPT / Web resources
Process
@#01112023
CSE 28.06.2023
5. L.Wasserman-All of statistics.
@#01112023
CSE 28.06.2023
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X X
CO2 X X
CO3 X X
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
@#01112023
CSE 28.06.2023
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
Module-1
Fundamentals of Machine Learning: The introduction basic machine learning concepts, tasks, and workflow using
an example classification problem based on the K-nearest neighbors method, and implemented using the scikit-
learn library.
Teaching-
Learning Chalk and Talk/ PPT
Process
Module-2
Supervised Machine Learning: supervised learning methods for both classification and regression, learning about
the connection between model complexity and generalization performance, the importance of proper feature
scaling, and how to control model complexity by applying techniques like regularization to avoid overfitting.
Teaching-
Learning Chalk and Talk/ PPT
Process
Module-3
k-nearest neighbors, this week covers linear regression (least-squares, ridge, lasso, and polynomial regression),
logistic regression, support vector machines, the use of cross-validation for model evaluation, and decision trees.
Teaching-
Learning Chalk and Talk/ PPT / Web resources
Process
Module-4
Evaluation: Evaluation and model selection methods to understand and optimize the performance of your machine
learning models.
Teaching-
Learning Chalk and Talk/ PPT / Web resources
Process
Module-5
Supervised Machine Learning - Part 2
Advanced supervised learning methods that include ensembles of trees (random forests, gradient boosted trees),
@#01112023
CSE 28.06.2023
and neural networks (with an optional summary on deep learning). You will also learn about the critical problem of
data leakage in machine learning and how to detect and avoid it.
Teaching-
Learning Chalk and Talk/ PPT / Web resources
Process
Assessment Details (both CIE and SEE)
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%. The
minimum passing mark for the CIE is 50% of the maximum marks. Minimum passing marks in SEE is 40% of the
maximum marks of SEE. A student shall be deemed to have satisfied the academic requirements and earned the
credits allotted to each subject/ course if the student secures not less than 50% (50 marks out of 100) in the sum
total of the CIE (Continuous Internal Evaluation) and SEE (Semester End Examination) taken together.
Continuous Internal Evaluation:
1. Three Unit Tests each of 20 Marks
2. Two assignments each of 20 Marks or one Skill Development Activity of 40 marks
to attain the COs and POs
The sum of three tests, two assignments/skill Development Activities, will be scaled down to 50 marks
CIE methods /question paper is designed to attain the different levels of Bloom’s taxonomy as per the outcome
defined for the course.
1. Kevin Murphy. Machine Learning: A Probabilistic Perspective. The MIT Press. 2012.
2. Shai Shalev-Shwartz and Shai Ben-David. Understanding Machine Learning: From Theory to Algorithms.
Cambridge University Press. 2014.
3. Trevor Hastie, Robert Tibshirani and Jerome Friedman. The Elements of Statistical Learning: Data Mining,
Inference, and Prediction, 2nd Edition. Springer. 2009..
5. Ian Goodfellow, YoshuaBengio and Aaron Courville. Deep Learning. The MIT Press. 2016.
@#01112023
CSE 28.06.2023
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X
CO2 X X
CO3 X X
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
@#01112023
CSE 28.06.2023
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
Module-2
Reviewing the literature: Place of the literature review in research, Bringing clarity and focus to your research
problem, Improving research methodology, Broadening knowledge base in research area, Enabling contextual
findings, How to review the literature, searching the existing literature, reviewing the selected literature, Developing
a theoretical framework, Developing a conceptual framework, Writing about the literature reviewed.
Research Design: Meaning of Research Design, Need for Research Design, Features of a Good Design, Important
Concepts Relating to Research Design, Different Research Designs, Basic Principles of Experimental Designs, Important
Experimental Designs.
Module-3
@#01112023
CSE 28.06.2023
Design of Sampling: Introduction, Sample Design, Sampling and Non-sampling Errors, Sample Survey
versus Census Survey, Types of Sampling Designs. Measurement and Scaling: Qualitative and
Quantitative Data, Classifications of Measurement Scales, Goodness of Measurement Scales, Sources of
Error in Measurement Tools, Scaling, Scale Classification Bases, Scaling Technics, Multidimensional
Scaling, Deciding the Scale. Data Collection: Experimental and Surveys, Collection of Primary Data,
Collection of Secondary Data, Selection of Appropriate Method for Data Collection, Case Study Method.
Module-4
Testing of Hypotheses: Hypothesis, Basic Concepts Concerning Testing of Hypotheses, Testing of Hypothesis, Test
Statistics and Critical Region, Critical Value and Decision Rule, Procedure for Hypothesis Testing, Hypothesis
Testing for Mean, Proportion, Variance, for Difference of Two Mean, for Difference of Two Proportions, for
Difference of Two Variances, P-Value approach, Power of Test, Limitations of the Tests of Hypothesis. Chi-square
Test: Test of Difference of more than Two Proportions, Test of Independence of Attributes, Test of Goodness of Fit,
Cautions in Using Chi Square Tests
Module-5
@#01112023
CSE 28.06.2023
CIE methods /question paper is designed to attain the different levels of Bloom’s taxonomy as per the
outcome defined for the course.
Text Books:
1. Research Methodology: Methods and Techniques, C.R. Kothari, Gaurav Garg,New Age International,4th Edition,
2018.. Douglas E Comer, “Internetworking with TCP/IP, Principles, Protocols and Architecture,” PHI, 6th
Edition
2. Research Methodology a step-by-step guide for beginners. (For the topic Reviewing the literature under
module 2), RanjitKumar,SAGE Publications,3rd Edition, 2011.
Reference Books:
1. Research Methods: the concise knowledge base, Trochim, Atomic Dog Publishing, 2005.
2. Conducting Research Literature Reviews: From the Internet to Paper, Fink A, Sage Publications, 2009.
@#01112023
CSE 28.06.2023
CO2 Choose research designs, sampling designs, measurement and scaling techniques L2
and also different methods of data collections.
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 x x x
CO2 x x x
CO3 x x x
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
@#01112023
CSE 28.06.2023
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
@#01112023
CSE 28.06.2023
7 Split the Iris dataset into two the datasets - IrisTest_TrainData.csv, IrisTest_TestData.csv. Read them as
two separate data frames named Train_Data and Test_Data respectively.
Answer the following questions:
a) How many missing values are there in Train_Data?
b) What is the proportion of Setosa types in the Test_Data?
c) What is the accuracy score of the K-Nearest Neighbor model (model_1) with 2/3 neighbors using
Train_Data and Test_Data?
d) Identify the list of indices of misclassified samples from the ‘model_1’.
e) Build a logistic regression model (model_2) keeping the modelling steps constant. Find the accuracy of
the model_2
8
Implementation of the problem solving strategies: either using Forward Chaining or Backward Chaining
@#01112023
CSE 28.06.2023
https://www.dataquest.io/data-science-resources/
@#01112023
CSE 28.06.2023
DEEP LEARNING
Course Code 22SDS21 CIE Marks 50
Teaching Hours/Week (L:P:SDA) 2:0:2 SEE Marks 50
Total Hours of Pedagogy 40 Total Marks 100
Credits 03 Exam Hours 03
Teaching-
Learning Chalk and Talk /PPT
Process
Module-3
Optimization for Training Deep Models: How Learning Differs from Pure Optimization, Challenges in Neural
Network Optimization, Basic Algorithms. Parameter Initialization Strategies, Algorithms with Adaptive
Learning Rates. Convolutional Networks: The Convolution Operation, Motivation, Pooling, Convolution and
Pooling as an Infinitely Strong Prior, Variants of the Basic Convolution Function,
Structured Outputs, Data Types, Efficient Convolution Algorithms, Random or Unsupervised Features
Teaching-
Learning Chalk and Talk /PPT
Process
Module-4
Sequence Modelling: Recurrent and Recursive Nets: Unfolding Computational Graphs, Recurrent Neural
Networks, Bidirectional RNNs, Encoder-Decoder Sequence-to-Sequence Architectures, Deep Recurrent
Networks, Recursive Neural Networks. Long short-term memory.
Teaching-
Learning Chalk and Talk /PPT
Process
Module-5
Practical Methodology: Performance Metrics, Default Baseline Models, Determining Whether to GatherMore
Data, Selecting Hyperparameters, Debugging Strategies, Example: Multi-Digit Number
Recognition. Applications: Vision, NLP, Speech.
Teaching-
Learning Chalk and Talk /PPT
@#01112023
CSE 28.06.2023
Process
CO2 Implement deep learning algorithms and solve real-world problems. L3, L4
@#01112023
CSE 28.06.2023
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
4 Conduct investigations of complex problems: Use research-based knowledge and research PO4
methods including design of experiments, analysis and interpretation of data, and
synthesis of the information to provide valid conclusions.
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent responsibilities
relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or leader PO9
in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as a
member and leader in a team, to manage projects and in multidisciplinary environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X
CO2 X X
CO3 X X
@#01112023
CSE 28.06.2023
DATA VISUALIZATION
Course Code 22SDS22 CIE Marks 50
Teaching Hours/Week (L:P:SDA) 3:2:0 SEE Marks 50
Total Hours of Pedagogy 40 hours Theory + 10 hours Lab Total Marks 100
Credits 04 Exam Hours 03
Course objectives:
Develop skills to both design and review visualizations.
Recognize the elements that go into visualising design.
Recognize how the type of visualisation is impacted by the type of data.
MODULE-1
What Is Visualization?, History of Visualization, Relationship between Visualization and Other Fields, The
Visualization Process, Types of Data, Structure within and between Records, Data Preprocessing, Perception in
Visualization, Metrics, The Visualization Process in Detail, Semiology of Graphical Symbols, The Eight Visual
Variables, Taxonomies.
Teaching-
Learning Chalk and Talk/ PPT / Web resources: https://www.shiksha.com/it-software/data-visualization-
Process chp
MODULE-2
Visualization Techniques for Spatial Data, Visualization Techniques for Geospatial Data, Visualization Techniques
for Multivariate Data.
MODULE-3
Visualization Techniques for Time-Oriented Data, Visualization Techniques for Trees, Graphs, and Networks,
Text and Document Visualization.
@#01112023
CSE 28.06.2023
Process
MODULE-5
Comparing and Evaluating Visualization Techniques, Visualization Systems, Research Directions in Visualization
Sl.NO Experiments
1
Two students plotted histograms for the exact same data in R with the same bin width and boundary
values; however their plots have completely different shape. What could be the cause of this?
2 Load the room temperature dataset into R, Python or MATLAB, or whichever software tool you prefer to
plot with.
1. Plot the 4 trajectories, FrontLeft, FrontRight, BackLeft and BackRight on the same plot.
2. Comment on any features you observe in your plot.
3. Be specific and describe how spark lines of these same data would improve the message the data
is showing.
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%. The
minimum passing mark for the CIE is 50% of the maximum marks. Minimum passing marks in SEE is 40% of the
maximum marks of SEE. A student shall be deemed to have satisfied the academic requirements and earned the
credits allotted to each subject/ course if the student secures not less than 50% (50 marks out of 100) in the sum
total of the CIE (Continuous Internal Evaluation) and SEE (Semester End Examination) taken together
@#01112023
CSE 28.06.2023
.
SEE for IPCC
Theory SEE will be conducted by University as per the scheduled timetable, with common question papers for the
course (duration 03 hours)
1. The question paper will be set for 100 marks and marks scored will be scaled down proportionately to 50
marks.
2. The question paper will have ten questions. Each question is set for 20 marks.
3. There will be 2 questions from each module. Each of the two questions under a module (with a maximum of 3
sub-questions), should have a mix of topics under that module.
4. The students have to answer 5 full questions, selecting one full question from each module.
The theory portion of the IPCC shall be for both CIE and SEE, whereas the practical portion will have a CIE
component only. Questions mentioned in the SEE paper shall include questions from the practical component).
The minimum marks to be secured in CIE to appear for SEE shall be the 15 (50% of maximum marks-30) in
the theory component and 10 (50% of maximum marks -20) in the practical component. The laboratory
component of the IPCC shall be for CIE only. However, in SEE, the questions from the laboratory
component shall be included. The maximum of 04/05 questions to be set from the practical component of
IPCC, the total marks of all questions should not be more than the 20 marks.
SEE will be conducted for 100 marks and students shall secure 40% of the maximum marks to qualify in
the SEE. Marks secured will be scaled down to 50. (Student has to secure an aggregate of 50% of maximum
marks of the course(CIE+SEE)
Suggested Learning Resources:
Books
1. Interactive Data Visualization: Foundations, Techniques, and Applications, Matthew O. Ward, Georges
Grinstein, Daniel Keim, CRC Press 2015
2. The Visual Display of Quantitative Information Edward Tufte Graphics Press 2001
@#01112023
CSE 28.06.2023
1. https://www.classcentral.com/course/datavisualization-2737
2. https://www.shiksha.com/it-software/data-visualization-chp
3. https://www.youtube.com/watch?v=7kPqESo1vRw
Queries:
(Describe how water on earth cycles in different forms and in different locations, including underground
and in the atmosphere.)
@#01112023
CSE 28.06.2023
Module-1
Meet Hadoop: Data!, Data Storage and Analysis, Querying All Your Data, Beyond Batch, Comparison with Other
Systems: Relational Database Management Systems, Grid Computing, Volunteer Computing Hadoop
Fundamentals MapReduce: A Weather Dataset: Data Format, Analyzing the Data with Unix Tools, Analyzing the
Data with Hadoop: Map and Reduce, Java MapReduce, Scaling Out: Data Flow, Combiner Functions, Running a
Distributed MapReduce Job, Hadoop Streaming The Hadoop Distributed File system The Design of HDFS, HDFS
Concepts: Blocks, Namenodes and Datanodes, HDFS Federation, HDFS High-Availability, The Command-Line
Interface, Basic Filesystem Operations, Hadoop Filesystems Interfaces, The Java Interface, Reading Data from a
Hadoop URL, Reading Data Using the FileSystem API, Writing Data, Directories, Querying the Filesystem, Deleting
Data, Data Flow: Anatomy of a File Read, Anatomy of a File Write.
Teaching-
Learning Chalk and Talk/ PPT/ Web resources
Process
Module-2
YARN Anatomy of a YARN Application Run: Resource Requests, Application Lifespan, Building YARN
Applications, YARN Compared to MapReduce, Scheduling in YARN: The FIFO Scheduler, The Capacity Scheduler,
The Fair Scheduler, Delay Scheduling, Dominant Resource Fairness. Hadoop I/O Data Integrity, Data Integrity in
HDFS, Local FileSystem, Checksum File System, Compression, Codecs, Compression and Input Splits, Using
Compression in MapReduce, Serialization, The Writable Interface, Writable Classes, Implementing a Custom
Writable, Serialization Frameworks, File-Based Data Structures: SequenceFile.
Teaching-
Learning Chalk and Talk/ PPT/ Web resources
Process
Module-3
Developing a MapReduce Application The Configuration API, Combining Resources, Variable Expansion, Setting
Up the Development Environment, Managing Configuration, Generic Options Parser, Tool, and Tool Runner,
Writing a Unit Test with MRUnit: Mapper, Reducer, Running Locally on Test Data, Running a Job in a Local Job
Runner, Testing the Driver, Running on a Cluster, Packaging a Job, Launching a Job, The MapReduce Web UI,
Retrieving the Results, Debugging a Job, Hadoop Logs, Tuning a Job, Profiling Tasks, MapReduce Workflows:
Decomposing a Problem into MapReduce Jobs, JobControl, Apache Oozie How MapReduce Works Anatomy of a
MapReduce Job Run, Job Submission, Job Initialization, Task Assignment, Task Execution, Progress and Status
Updates, Job Completion, Failures: Task Failure, Application Master Failure, Node Manager Failure, Resource
Manager Failure, Shuffle and Sort: The Map Side, The Reduce Side, Configuration Tuning, Task Execution: The
Task Execution Environment, Speculative Execution, Output Committers.
Teaching-
@#01112023
CSE 28.06.2023
@#01112023
CSE 28.06.2023
3. Apache Flume: Distributed Log Collection for Hadoop, D'Souza and Steve Hoffman, O'Reilley 2014
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X
CO2 X X
CO3 X
@#01112023
CSE 28.06.2023
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
@#01112023
CSE 28.06.2023
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
Module-1
Overview And Language Modeling: Overview: Origins and challenges of NLP-Language and Grammar Processing
Indian Languages- NLP Applications-Information Retrieval. Language Modeling: Various Grammar- based
Language Models-Statistical Language Model.
Teaching-
Learning Chalk and Talk/ PPT/ Web resources
Process
Module-2
Word Level And Syntactic Analysis: Word Level Analysis: Regular Expressions-Finite-State
AutomataMorphological Parsing-Spelling Error Detection and correction-Words and Word classes- Part-of
Speech Tagging. Syntactic Analysis: Context-free Grammar-Constituency- Parsing- Probabilistic Parsing.
Teaching-
Learning Chalk and Talk/ PPT/ Web resources
Process
Module-3
Extracting Relations from Text: From Word Sequences to Dependency Paths: Introduction, Subsequence Kernels
for Relation Extraction, A Dependency-Path Kernel for Relation Extraction and Experimental Evaluation. Mining
Diagnostic Text Reports by Learning to Annotate Knowledge Roles: Introduction, Domain Knowledge and
Knowledge Roles, Frame Semantics and Semantic Role Labeling, Learning to Annotate Cases with Knowledge
Roles and Evaluations. A Case Study in Natural Language Based Web Search: InFact System Overview, The
GlobalSecurity.org Experience.
Teaching-
Learning Chalk and Talk/ PPT/ Web resources
Process
Module-4
Evaluating Self-Explanations in iSTART: Word Matching, Latent Semantic Analysis, and Topic Models: Introduction,
iSTART: Feedback Systems, iSTART: Evaluation of Feedback Systems, Textual Signatures: Identifying Text-Types
Using Latent Semantic Analysis to Measure the Cohesion of Text Structures: Introduction, Cohesion, Coh-Metrix,
Approaches to Analysing Texts, Latent Semantic Analysis, Predictions, Results of Experiments. Automatic
Document Separation: A Combination of Probabilistic Classification and Finite-State Sequence Modeling:
Introduction, Related Work, Data Preparation, Document Separation as a Sequence Mapping Problem, Results.
Evolving Explanatory Novel Patterns for Semantically based Text Mining: Related Work, A Semantically Guided
Model for Effective Text mining.
@#01112023
CSE 28.06.2023
Teaching-
Learning Chalk and Talk/ PPT/ Web resources
Process
Module-5
Information Retrieval and Lexical Resources: Information Retrieval: Design features of Information Retrieval
Systems-Classical, non classical, and Alternative Models of Information Retrieval – valuation Lexical Resources:
World Net-Frame Net- Stemmers-POS Tagger- Research Corpora.
Teaching-
Learning Chalk and Talk/ PPT/ Web resources
Process
Assessment Details (both CIE and SEE)
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%. The
minimum passing mark for the CIE is 50% of the maximum marks. Minimum passing marks in SEE is 40% of the
maximum marks of SEE. A student shall be deemed to have satisfied the academic requirements and earned the
credits allotted to each subject/ course if the student secures not less than 50% (50 marks out of 100) in the sum
total of the CIE (Continuous Internal Evaluation) and SEE (Semester End Examination) taken together.
Continuous Internal Evaluation:
1. Three Unit Tests each of 20 Marks
2. Two assignments each of 20 MarksoroneSkill Development Activity of 40 marks to attain the COs and
POs
The sum of three tests, two assignments/skill Development Activities, will be scaled down to 50 marks
CIE methods /question paper is designed to attain the different levels of Bloom’s taxonomy as per the outcome
defined for the course.
@#01112023
CSE 28.06.2023
https://hackr.io/blog/best-nlp-courses
https://www.youtube.com/watch?v=igKTO7lQxNo
https://www.youtube.com/watch?v=6I-Alfkr5K4
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X X
CO2 X
CO3 X
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
@#01112023
CSE 28.06.2023
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
Teaching-
Learning Chalk and Talk/ PPT/ Web resources
Process
Module-3
Genetic algorithms: Introduction, Basic operations, Traditional algorithms, Simple GA General Genetic Algorithm,
Operators in Genetic Algorithm, Stopping Condition for Genetic Algorithm Flow, Constraints in Genetic
Algorithm, Problem Solving Using Genetic Algorithm, The schema theorem, Genetic programming, applications.
Teaching-
Learning Chalk and Talk/ PPT/ Web resources
Process
Module-4
Swarm Intelligence System: Introduction, background of SI, Ant colony system Working of ant colony optimization,
ant colony for TSP.
Teaching-
Learning Chalk and Talk/ PPT/ Web resources
Process
Module-5
Unit commitment problem, particle Swarm Intelligence system Artificial bee colony system, Cuckoo search system.
Teaching-
Learning Chalk and Talk/ PPT/ Web resources
@#01112023
CSE 28.06.2023
Process
@#01112023
CSE 28.06.2023
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X
CO2 X X
CO3 X X
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
@#01112023
CSE 28.06.2023
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
PATTERN RECOGNITION
Course Code 22SDS234 CIE Marks 50
Teaching Hours/Week (L:P:SDA) 2:0:2 SEE Marks 50
Total Hours of Pedagogy 40 Total Marks 100
Credits 03 Exam Hours 03
Module-1
Introduction: Definition of PR, Applications, Datasets for PR, Different paradigms for PR, Introduction to
probability, events, random variables, Joint distributions and densities, moments. Estimation minimum risk
estimators, problems
Teaching-
Learning Chalk and Talk/ PPT
Process
Module-2
Representation: Data structures for PR, Representation of clusters, proximity measures, size of patterns,
Abstraction of Data set, Feature extraction, Feature selection, Evaluation of Classifiers and Clustering
Teaching-
Learning Chalk and Talk/ PPT
Process
Module-3
Nearest Neighbour based classifiers & Bayes classifier: Nearest Neighbour Algorithm, Variants of NN Algorithm,
Use of NN for Transaction Databases, Efficient Algorithms, Data Reduction, Prototype Selection, Bayes theorem,
Minimum Error Rate Classifier, Estimation of Probabilities, Comparison with NNC, Naive Bayes classifier,
Bayesian belief network.
Teaching-
Learning Chalk and Talk/ PPT / Web resources : https://www.youtube.com/watch?v=ygwgm7oxKhs
Process
Module-4
Hidden Markov models: Markov Models for Classification, Hidden Markov Models and Classification Using HMMS.
Decision Trees: Introduction, Decision Trees for Pattern Recognition, Construction of Decision Trees, Splitting at
the Nodes, Over fitting & Pruning, Example of Decision Tree Induction.
Teaching-
Learning Chalk and Talk/ PPT / Web Resources: https://www.ukessays.com/essays/engineering/hmm-
Process pattern-recognition-
9997.php#:~:text=A%20Hidden%20Markov%20Model%20HMM,of%20a%20set%20of%20obser
vations.
Module-5
Clustering: Hierarchical (Agglomerative, single/complete/average linkage, wards, Partitional (Forgy’s, k-means,
Isodata), Clustering Large Data Sets, examples, An application: Handwritten Digit recognition.
@#01112023
CSE 28.06.2023
Teaching-
Learning Chalk and Talk/ PPT
Process
Assessment Details (both CIE and SEE)
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%. The
minimum passing mark for the CIE is 50% of the maximum marks. Minimum passing marks in SEE is 40% of the
maximum marks of SEE. A student shall be deemed to have satisfied the academic requirements and earned the
credits allotted to each subject/ course if the student secures not less than 50% (50 marks out of 100) in the sum
total of the CIE (Continuous Internal Evaluation) and SEE (Semester End Examination) taken together.
Continuous Internal Evaluation:
1. Three Unit Tests each of 20 Marks
2. Two assignments each of 20 Marks or oneSkill Development Activity of 40 marks to attain the COs and
POs
The sum of three tests, two assignments/skill Development Activities, will be scaled down to 50 marks
CIE methods /question paper is designed to attain the different levels of Bloom’s taxonomy as per the outcome
defined for the course.
2. Pattern Recognition and Image Analysis, Earl Gose, Richard Johnsonbaugh, Steve Jost, PHI 1996
3. Pattern Classification, Duda R. O., P.E. Hart, and D. G. Stork, Wiley 2000.
@#01112023
CSE 28.06.2023
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X
CO2 X X
CO3 X X
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
@#01112023
CSE 28.06.2023
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
@#01112023
CSE 28.06.2023
2. Healthcare Analytics: From Data to Knowledge to Healthcare Improvement, Hui Yang and Eva K. Lee,
Wiley 2016
@#01112023
CSE 28.06.2023
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X X
CO2 X
CO3 X X
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
@#01112023
CSE 28.06.2023
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
Teaching-
Learning Chalk and Talk/ PPT / Web resources: https://www.youtube.com/watch?v=wXB-V_Keiu8
Process
Module-3
Key Management and Distribution: Symmetric key distribution using Symmetric encryption, A key distribution
scenario, Hierarchical key control, session key lifetime, a transparent key control scheme, Decentralized key
control, controlling key usage, Symmetric key distribution using asymmetric encryption, simple secret key
distribution, secret key distribution with confidentiality and authentication, A hybrid scheme, distribution of
public keys, public announcement of public keys, publicly available directory, public key authority, public keys
certificates, X-509 certificates. Certificates, X-509 version 3, Public Key infrastructure.
@#01112023
CSE 28.06.2023
An Introduction to privacy preserving data mining: Privacy-Preserving Data Mining Algorithms, The
Randomization Method, Group Based Anonymization.
2. Privacy Preserving Data Mining: Models and Algorithms, Charu C. Aggarwal, Philip S Yu, Kluwer
Academic 2008
@#01112023
CSE 28.06.2023
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X
CO2 X X
CO3 X X
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
@#01112023
CSE 28.06.2023
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
Teaching-
Learning Chalk and Talk/ PPT / Web resources: https://www.scribbr.com/statistics/multiple-linear-
Process regression/
Module-3
Time Series Regression: Modeling Trend by Using Polynomial Functions. Detecting Autocorrelation. Types of
Seasonal Variation. Modeling Seasonal Variation by Using Dummy Variables and Trigonometric Functions.
Growth Curves. Handling First-Order Autocorrelation. Decomposition Methods: Multiplicative Decomposition.
Additive Decomposition. The X-12-ARIMA Seasonal Adjustment Method. Exercises. Exponential Smoothing:
Simple Exponential Smoothing. Tracking Signals. Holt’s Trend Corrected Exponential Smoothing. Holt-Winters
Methods. Damped Trends and Other Exponential.
Teaching-
Learning Chalk and Talk/ PPT / Web resources
Process
Module-4
Non-seasonal Box-Jenkins Modeling and Their Tentative Identification: Stationary and Non-stationary Time Series.
The Sample Autocorrelation and Partial Autocorrelation Functions: The SAC and SPAC. An Introduction to
Nonseasonal Modeling and Forecasting. Tentative Identification of Non-seasonal Box-Jenkins Models. Estimation,
Diagnostic Checking, and Forecasting for Non-seasonal Box-Jenkins Models: Estimation. Diagnostic Checking.
Forecasting. A Case Study. Box-Jenkins Implementation of Exponential Smoothing.
Teaching- Chalk and Talk/ PPT / Web resources : https://link.springer.com/chapter/10.1007/978-0-85729-
Learning 974-1_8
@#01112023
CSE 28.06.2023
Process
Module-5
Box-Jenkins Seasonal Modeling: Transforming a Seasonal Time Series into a Stationary Time Series. Examples of
Seasonal Modeling and Forecasting. Box-Jenkins Error Term Models in Time Series Regression. Advanced
BoxJenkins Modeling: The General Seasonal Model and Guidelines for Tentative Identification. Intervention
Models. A Procedure for Building a Transfer Function Model Causality in time series: Granger causality. Hypothesis
testing on rational expectations. Hypothesis testing on market efficiency.
Teaching-
Learning Chalk and Talk/ PPT / Web resources
Process
Assessment Details (both CIE and SEE)
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%. The
minimum passing mark for the CIE is 50% of the maximum marks. Minimum passing marks in SEE is 40% of the
maximum marks of SEE. A student shall be deemed to have satisfied the academic requirements and earned the
credits allotted to each subject/ course if the student secures not less than 50% (50 marks out of 100) in the sum
total of the CIE (Continuous Internal Evaluation) and SEE (Semester End Examination) taken together.
Continuous Internal Evaluation:
1. Three Unit Tests each of 20 Marks
2. Two assignments each of 20 MarksoroneSkill Development Activity of 40 marks to attain the COs and
POs
The sum of three tests, two assignments/skill Development Activities, will be scaled down to 50 marks
CIE methods /question paper is designed to attain the different levels of Bloom’s taxonomy as per the outcome
defined for the course.
2. The Econometric Modelling of Financial Time Series, Terence C. Mills, Raphael N. Markellos,
Cambridge University Press, 2008
@#01112023
CSE 28.06.2023
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X
CO2 X
CO3 X X
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
@#01112023
CSE 28.06.2023
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
Teaching-
Learning Chalk and Talk/ PPT
Process
Module-3
Meta-Networks and Relational Algebra: Modes of Data, Source, Target, Direction, Multimode Networks, Bridging
a Meta-Network, Strength of Ties. Sources of Data: Network Sampling, Measuring Links, Data Quality, Additional
Ethnographic Data Collection Methods, Anonymity Issues.
Teaching-
Learning Chalk and Talk/ PPT / Web resource : example like
Process https://www.researchgate.net/publication/221615377_A_Relational_Algebra_for_DataMetadata_I
ntegration_in_a_Federated_Database_System
Module-4
Information Networks and the World Wide Web: The Structure of the web, Link Analysis and Web Search,
Sponsored Search Markets
Teaching-
Learning Chalk and Talk/ PPT / Web resources: https://web.stanford.edu/class/cs344g/www-1992.pdf
Process
@#01112023
CSE 28.06.2023
Module-5
Network Dynamics: Structural Models, The Small-World Phenomenon, Epidemics.
Teaching-
Learning Chalk and Talk/ PPT
Process
Assessment Details (both CIE and SEE)
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%. The
minimum passing mark for the CIE is 50% of the maximum marks. Minimum passing marks in SEE is 40% of the
maximum marks of SEE. A student shall be deemed to have satisfied the academic requirements and earned the
credits allotted to each subject/ course if the student secures not less than 50% (50 marks out of 100) in the sum
total of the CIE (Continuous Internal Evaluation) and SEE (Semester End Examination) taken together.
Continuous Internal Evaluation:
1. Three Unit Tests each of 20 Marks
a. Two assignments each of 20 Marks or one Skill Development Activity of 40 marks to attain the COs
and POs
The sum of three tests, two assignments/skill Development Activities, will be scaled down to 50 marks
CIE methods /question paper is designed to attain the different levels of Bloom’s taxonomy as per the outcome
defined for the course.
2. Networks, Crowds, and Markets: Reasoning About a Highly Connected World, David Easley and John
Kleinberg, Cambridge University Press 2010
3. Social and Economic Networks, Matthew O. Jackson, Princeton University Press 2008
@#01112023
CSE 28.06.2023
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X X
CO2 X
CO3 X X
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
@#01112023
CSE 28.06.2023
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
Teaching-
Learning Chalk and Talk/ PPT / https://www.youtube.com/watch?v=eDyJsFHYFFI
Process
Module-3
Detection of Objects of Interest, Tracking of Objects of Interest in a Sequence of Images, Tracking Objects of
Interest Through a Camera Network.
Teaching-
Learning Chalk and Talk/ PPT
Process
Module-4
Biometric Techniques Applied to Video Surveillance, Vehicle Recognition in Video Surveillance, Activity
Recognition.
Teaching-
Learning Chalk and Talk/ PPT
Process
Module-5
Unsupervised Methods for Activity Analysis and Detection of Abnormal Events, Analysis of Crowded Scenes in
Video, Detection of Visual Context, Example of an Operational Evaluation Platform: PPSL
@#01112023
CSE 28.06.2023
Teaching-
Learning Chalk and Talk/ PPT
Process
Assessment Details (both CIE and SEE)
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%. The
minimum passing mark for the CIE is 50% of the maximum marks. Minimum passing marks in SEE is 40% of the
maximum marks of SEE. A student shall be deemed to have satisfied the academic requirements and earned the
credits allotted to each subject/ course if the student secures not less than 50% (50 marks out of 100) in the sum
total of the CIE (Continuous Internal Evaluation) and SEE (Semester End Examination) taken together.
Continuous Internal Evaluation:
1. Three Unit Tests each of 20 Marks
2. Two assignments each of 20 Marks or one Skill Development Activity of 40 marks to attain the COs and
POs
The sum of three tests, two assignments/skill Development Activities, will be scaled down to 50 marks
CIE methods /question paper is designed to attain the different levels of Bloom’s taxonomy as per the outcome
defined for the course.
@#01112023
CSE 28.06.2023
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X
CO2 X X
CO3 X X
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
@#01112023
CSE 28.06.2023
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
CSE 28.06.2023
CLOUD COMPUTING
Course Code 22SDS245 CIE Marks 50
Teaching Hours/Week (L:P:SDA) 2:0:2 SEE Marks 50
Total Hours of Pedagogy 40 Total Marks 100
Credits 03 Exam Hours 03
Module-1
Introduction, Cloud Infrastructure: Cloud computing, Cloud computing delivery models and services, Ethical
issues, Cloud vulnerabilities, Cloud computing at Amazon, Cloud computing the Google perspective, Microsoft
Windows Azure and online services, Open-source software platforms for private clouds, Cloud storage diversity
and vendor lock-in, Energy use and ecological impact, Service level agreements, User experience and software
licensing. Exercises and problems.
Teaching-
Learning Chalk and Talk/ PPT
Process
Module-2
Cloud Computing: Application Paradigms.: Challenges of cloud computing, Architectural styles of cloud
computing, Workflows: Coordination of multiple activities, Coordination based on a state machine model: The
Zookeeper, The Map Reduce programming model, A case study: The Gre The Web application, Cloud for science
and engineering, High-performance computing on a cloud, Cloud computing for Biology research, Social
computing, digital content and cloud computing.
Teaching-
Learning Chalk and Talk/ PPT
Process
Module-3
Cloud Resource Virtualization: Virtualization, Layering and virtualization, Virtual machine monitors, Virtual
Machines, Performance and Security Isolation, Full virtualization and paravirtualization, Hardware support for
virtualization, Case Study: Xen a VMM based paravirtualization, Optimization of network virtualization, vBlades,
Performance comparison of virtual machines, The dark side of virtualization, Exercises and problems
Teaching-
Learning Chalk and Talk/ PPT / Web reference link: https://www.youtube.com/watch?v=Sb5SO3WRSws
Process
Module-4
Cloud Resource Management and Scheduling: Policies and mechanisms for resource management, Application of
control theory to task scheduling on a cloud, Stability of a two-level resource allocation architecture, Feedback
control based on dynamic thresholds, Coordination of specialized autonomic performance managers, A utility-
based model for cloud-based Web services, Resourcing bundling: Combinatorial auctions for cloud resources,
Scheduling algorithms for computing clouds, Fair queuing, Start-time fair queuing, Borrowed virtual time, Cloud
scheduling subject to deadlines, Scheduling MapReduce applications subject to deadlines, Resource management
and dynamic scaling, Exercises and problems.
@#01112023
CSE 28.06.2023
Teaching-
Learning Chalk and Talk/ PPT
Process
Module-5
Cloud Security, Cloud Application Development: Cloud security risks, Security: The top concern for cloud users,
Privacy and privacy impact assessment, Trust, Operating system security, Virtual machine Security, Security of
virtualization, Security risks posed by shared images, Security risks posed by a management OS, A trusted virtual
machine monitor, Amazon web services: EC2 instances, Connecting clients to cloud instances through firewalls,
Security rules for application and transport layer protocols in EC2, How to launch an EC2 Linux instance and
connect to it, How to use S3 in java, Cloud-based simulation of a distributed trust algorithm, A trust management
service, A cloud service for adaptive data streaming, Cloud based optimal FPGA synthesis .Exercises and problems
Teaching-
Learning Chalk and Talk/ PPT
Process
Assessment Details (both CIE and SEE)
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%. The
minimum passing mark for the CIE is 50% of the maximum marks. Minimum passing marks in SEE is 40% of the
maximum marks of SEE. A student shall be deemed to have satisfied the academic requirements and earned the
credits allotted to each subject/ course if the student secures not less than 50% (50 marks out of 100) in the sum
total of the CIE (Continuous Internal Evaluation) and SEE (Semester End Examination) taken together.
Continuous Internal Evaluation:
1. Three Unit Tests each of 20 Marks
a. Two assignments each of 20 MarksoroneSkill Development Activity of 40 marks to attain the COs
and POs
The sum of three tests, two assignments/skill Development Activities, will be scaled down to 50 marks
CIE methods /question paper is designed to attain the different levels of Bloom’s taxonomy as per the outcome
defined for the course.
@#01112023
CSE 28.06.2023
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 X X
CO2 X X
CO3 X
2 Problem analysis: Identify, formulate, review research literature, and analyze complex PO2
engineering and business problems reaching substantiated conclusions using first
principles of mathematics, natural sciences, and engineering sciences.
3 Design/development of solutions: Design solutions for complex engineering problems and PO3
design system components or processes that meet the specified needs with appropriate
consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
@#01112023
CSE 28.06.2023
5 Modern tool usage: Create, select, and apply appropriate techniques, resources, and PO5
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations
6 The engineer and society: Apply reasoning informed by the contextual knowledge to PO6
assess societal, health, safety, legal and cultural issues and the consequent
responsibilities relevant to the professional engineering and business practices.
7 Environment and sustainability: Understand the impact of the professional engineering PO7
solutions in business societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8 Ethics: Apply ethical principles and commit to professional ethics and responsibilities and PO8
norms of the engineering and business practices.
9 Individual and team work: Function effectively as an individual, and as a member or PO9
leader in diverse teams, and in multidisciplinary settings.
11 Project management and finance: Demonstrate knowledge and understanding of the PO11
engineering, business and management principles and apply these to one’s own work, as
a member and leader in a team, to manage projects and in multidisciplinary
environments.
12 Life-long learning: Recognize the need for, and have the preparation and ability to engage PO12
in independent and life-long learning in the broadest context of technological change.
@#01112023
Deep Learning Laboratory
Course Code 22SDSL26 CIE Marks 50
Teaching Hours/Week (L:T:P: S) 01:02:00 SEE Marks 50
Credits 02 Exam Hours 03
Course objectives:
Handle large amounts of datasets using various deep learning methods in NLP, Neural Networks etc.
Demonstrate several libraries and datasets publicly available will be used to illustrate the application of
these algorithms.
Sl.NO Experiments
1 Medical Image classification of any 2 diseases using any two repository using python as tool
2
Two experiments on text extraction from a large file using Mat Lab or Python.
3
Solving any two optimization problems using deep learning techniques like TSP,NapSac etc
4
Implementation of Perceptron Networks using Matlab
5
Implementation of Adeline network for system identification using Matlab
6 For the given dataset mtcars.csv (www.kaggle.com/ruiromanini/mtcars), plot a histogram to check the
frequency distribution of the variable „mpg‟ (Miles per gallon)
7 Train an SVM classifier on the iris dataset using sklearn. Try different kernels and the associated
hyperparameters. Train model with the following set of hyperparameters RBF-kernel, gamma=0.5, onevs-
rest classifier, no-feature-normalization. Also try C=0.01,1,10C=0.01,1,10. For the above set of
hyperparameters, find the best classification accuracy along with total number of support vectors on the
test data
8 Train a regularized logistic regression classifier on the iris dataset
(https://archive.ics.uci.edu/ml/machine- earning-databases/iris/ or the inbuilt iris dataset) using
sklearn.Train the model with the following hyperparameter C = 1e4 and report the best classification
accuracy.
Demonstration Experiments ( For CIE ) if any
9 Consider the dataset spiral.txt (https://bit.ly/2Lm75Ly). The first two columns in the dataset corresponds
to the co-ordinates of each data point. The third column corresponds to the actual cluster label. Compute
the rand index for the following methods: K – means Clustering Single – link Hierarchical Clustering
Complete link hierarchical clustering. Also visualize the dataset and which algorithm will be able to
recover the true clusters
10 Implementation of the problem solving strategies: either using Forward Chaining or Backward Chaining.
12 Project related to application of machine learning in Time Series Analysis & Forecasting.
@#01112023
Assessment Details (both CIE and SEE)
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is 50%. The
minimum passing mark for the CIE is 50% of the maximum marks. A student shall be deemed to have satisfied the
academic requirements and earned the credits allotted to each course. The student has to secure not less than 40%
of maximum marks in the semester-end examination(SEE). In total of CIE and SEE student has to secure 50%
maximum marks of the course.
Continuous Internal Evaluation (CIE):
CIE marks for the practical course is 50 Marks.
The split-up of CIE marks for record/ journal and test are in the ratio 60:40.
Each experiment to be evaluated for conduction with observation sheet and record write-up. Rubrics for the
evaluation of the journal/write-up for hardware/software experiments designed by the faculty who is handling the
laboratory session and is made known to students at the beginning of the practical session.
Record should contain all the specified experiments in the syllabus and each experiment write-up will be evaluated
for 10 marks.
Total marks scored by the students are scaled downed to 30 marks (60% of maximum marks).
Weightage to be given for neatness and submission of record/write-up on time.
Department shall conduct 02 tests for 100 marks, the first test shall be conducted after the 8th week of the semester
and the second test shall be conducted after the 14th week of the semester.
In each test, test write-up, conduction of experiment, acceptable result, and procedural knowledge will carry a
weightage of 60% and the rest 40% for viva-voce.
The suitable rubrics can be designed to evaluate each student’s performance and learning ability.
The average of 02 tests is scaled down to 20 marks (40% of the maximum marks).
The Sum of scaled-down marks scored in the report write-up/journal and average marks of two tests is the total
CIE marks scored by the student.
@#01112023
CSE 28.06.2023
Semester-III
Scheme of Teaching and Examinations and
Syllabus
M. Tech in Data Science (SDS)
Copy (Effective from the Academic year 2022-23)
right
Registrar
,
Visvesvaraya Technological
University JnanaSangam, Machhe,
Belagavi-590018eMail:
[email protected]
Contact: 0831-2498112
@#01112023
CSE 28.06.2023
@#01112023
CSE 28.06.2023
01.02.2023
13072022
/V1
VISVESVARAYA TECHNOLOGICAL
UNIVERSITY, BELAGAVI
Scheme of Teaching and Examinations – 2022
M. Tech in Data Science (SDS)
Choice Based Credit System (CBCS) and Outcome Based Education(OBE)
III SEMESTER
Teaching Hours Examination
/Week
Duration in hours
Practical/Semin
Total Marks
Development
Course
Credits
SEE Marks
Course
CIE Marks
Theory
Sl. No
(Hours are
Cour Title
Activities
se
Skill
Cod
ar
e
L P SDA
1 PCC 22SDS31 Business Intelligence and 03 00 02 03 50
50 100 4
Analytics
2 PEC 22SDS32 Professional Elective 3 03 00 00 03 50 50 100 3
X
3 OEC 22SDS33 Open Elective Courses-1 03 00 00 03 50 50 100 3
X
PROJ 22SDS34 Project Work phase -1 00 06 00 -- 100 -- 100 3
4
5 S 22SDS35 Societal Project 00 06 00 -- 100 -- 100 3
P
@#01112023
CSE 28.06.2023
@#01112023
CSE 28.06.2023
semesters.)
@#01112023
CSE 28.06.2023
01.02.2023
13072022
/V1
Professional Elective 3 Open
Elective 1
Course Code Course Course Code Course
under title under title
22SDS32X 22SDS33X
22SDS321 High Performance Computing 22SDS331 IoT and Applications
22SDS322 Cyber Security and Cyber Law 22SDS332 Fundamentals of Artificial
Intelligence
22SDS323 Cloud Computing for Data Analysis 22SDS333 Big Data Analytics
22SDS324 Blockchain Technology 22SDS334 Web Technologies
22SDS325 Spatial Data Analysis 22SDS335 Natural Language Processing
@#01112023
CSE 28.06.2023
Note:
1. Project Work Phase-1:The project work shall be carried out individually. However, in case a disciplinary or
interdisciplinary projectrequires more participants, then a group consisting of not more than three shall be
permitted.
Students in consultation with the guide/co-guide (if any) in disciplinary project or guides/co-guides (if any) of all
departments in caseof multidisciplinary projects, shall pursue a literature survey and complete the preliminary
requirements of the selected Project work. Each student shall prepare a relevant introductory project document,
and present a seminar.
CIE marks shall be awarded by a committee comprising of HoD as Chairman, all Guide/s and co-guide/s (if any)
and a senior faculty of the concerned departments. The CIE marks awarded for project work phase -1, shall be
based on the evaluation of Project Report,Project Presentation skill, and performance in the Question and Answer
session in the ratio of 50:25:25.
2. Societal Project: Students in consultation with the internal guide as well as with external guide (much
preferable) shall involve in applying technology toworkout/proposing viable solutions for societal problems.
CIE marks shall be awarded by a committee comprising of HoD as Chairman, Guide/co-guide if any, and a senior
faculty of the department. The CIE marks awarded, shall be based on the evaluation of Project Report, Project
Presentation skill, and performance in the Question and Answer session in the ratio of 50:25:25.
Those, who have not pursued /completed the Societal Project, shall be declared as fail in the course and have to
complete the same during subsequent semester/s after satisfying the Societal Project requirements. There is no
SEE (University examination) for this course.
3. Internship: Those, who have not pursued /completed the internship, shall be declared as fail in the internship
course and have to
complete the same during subsequent University examinations after satisfying the internship requirements.
Internship SEE
@#01112023
CSE 28.06.2023
01.02.2023
13072022
/V1
(University examination) shall be as per the University norms.
CIE marks shall be awarded by a committee comprising of HoD as Chairman, Guide/co-guide if any, and a senior faculty of the
department. The CIE marks awarded for project work phase -1, shall be based on the evaluation of Project Report, Project
Presentation skill, and performance in the Question and Answer session in the ratio of 50:25:25.
@#01112023