Stream Processing and Analytics - Regular-HO
Stream Processing and Analytics - Regular-HO
Course Description
Data is moving at a very rapid pace because of which necessity of scalable systems capable of
processing and analyzing this fast, streaming data has arisen. The course introduces the students with
the architecture of streaming data processing systems. The course also enables students to understand
the complete end-to-end solution for cost-effective analysis and visualization of streaming data with
the help of various open source solutions available in this space. The course also helps students to
learn the implementation and application of algorithms and data structures required for the streaming
applications. Advanced streaming applications like Streaming SQL, Streaming Machine Learning will
be discussed at proper length.
Course Objectives
No
CO4 To present survey of tools and techniques required for streaming data analytics
Text Book(s)
R1 Big Data – Principles and best practices of scalable real-time data systems,
Nathan Marz, James Warren, 2017, Manning Publications
R2 Designing Data Intensive Applications, Martin Kleppmann, O’Reilly
Page | 1
Learning Outcomes:
No Learning Outcomes
LO1 Understand the components of streaming data systems with their capabilities and
characteristics
LO2 Learn the relevant architecture and best practices for processing and analysis of
streaming data
LO3 Gain knowledge about the development of system for data aggregation, delivery
and storage using Open source tools
LO4 Get familiarity with the advance streaming applications like Streaming SQL,
Streaming machine learning
Academic Term
Course Title STREAM PROCESSING AND ANALYTICS
Course No
Lead Instructor
Glossary of Terms
Modular Structure
Page | 2
No. Title of the Module
M1 Scalable Streaming Data Systems
M2 Streaming Data Systems Architecture
M3 Streaming Data Frameworks
M4 Streaming Analytics
M5 Advanced Streaming Applications
Page | 3
✔ What is stream processing?
✔ complex-event-processing
Page | 4
✔ Cassandra
● Implement the streaming data pipeline using the Kafka Kafka Streaming
Streaming library Guide
Page | 5
12 ● Time Series Analysis
Page | 6
Session 16 / Contact Hour 31 - 32
Evaluation Scheme:
Notes:
Syllabus for Mid-Semester Test (Closed Book): Topics in Session Nos. 1 to 8 (contact hours 1 to 16)
Syllabus for Comprehensive Exam (Open Book): All topics
It shall be the responsibility of the individual student to be regular in maintaining the self study
schedule as given in the course handout, attend the online lectures, and take all the prescribed
evaluation components such as Assignment/Quiz, Mid-Semester Test and Comprehensive
Exam according to the evaluation scheme provided in the handout.
Page | 7