0% found this document useful (0 votes)
3 views

Data Analytics

The document outlines the course outcomes and detailed syllabus for a Data Analytics course, focusing on key concepts such as data analytics pipelines, classification and regression techniques, and mining techniques for streaming data. It includes various units covering topics like data analysis, mining data streams, clustering, and visualization frameworks, with an emphasis on practical applications and tools like R programming. Additionally, it lists recommended textbooks and references for further reading.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Data Analytics

The document outlines the course outcomes and detailed syllabus for a Data Analytics course, focusing on key concepts such as data analytics pipelines, classification and regression techniques, and mining techniques for streaming data. It includes various units covering topics like data analysis, mining data streams, clustering, and visualization frameworks, with an emphasis on practical applications and tools like R programming. Additionally, it lists recommended textbooks and references for further reading.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

BADS601 DATA ANALYTICS

Course Outcome (CO) Bloom’s Knowledge Level (KL)

At the end of course, the student will be able to

Discuss various concepts of data analytics pipeline


CO1 K 1 , K2
CO2 Apply classification and regression techniques K3
Explain and apply mining techniques on streaming data
CO3 K 2 , K3
Compare different clustering and frequent pattern mining algorithms
CO4 K4
CO5 Describe the concept of R programming and implement analytics on Big data using R. K 2 , K3
DETAILED SYLLABUS 3-0-0
Unit Topic Proposed
Lecture
I Introduction to Data Analytics: Sources and nature of data, classification of data (structured,
semi-structured, unstructured), characteristics of data, introduction to Big Data platform, need of
data analytics, evolution of analytic scalability, analytic process and tools, analysis vs reporting,
modern data analytic tools, applications of data analytics. Data Analytics Lifecycle: Need, key 08
roles for successful analytic projects, various phases of data analytics lifecycle – discovery, data
preparation, model planning, model building, communicating results, operationalization.

II Data Analysis: Regression modeling, multivariate analysis, Bayesian modeling, inference and
Bayesian networks, support vector and kernel methods, analysis of time series: linear systems
analysis & nonlinear dynamics, rule induction, neural networks: learning and generalisation, 08
competitive learning, principal component analysis and neural networks, fuzzy logic: extracting
fuzzy models from data, fuzzy decision trees, stochastic search methods.
III Mining Data Streams: Introduction to streams concepts, stream data model and architecture,
stream computing, sampling data in a stream, filtering streams, counting distinct elements in a
stream, estimating moments, counting oneness in a window, decaying window, Real-time 08
Analytics Platform (RTAP) applications, Case studies – real time sentiment analysis, stock
market predictions.
IV Frequent Item sets and Clustering: Mining frequent item sets, market-based modelling,
Apriori algorithm, handling large data sets in main memory, limited pass algorithm, counting
frequent item sets in a stream, clustering techniques: hierarchical, K-means, clustering high 08
dimensional data, CLIQUE and ProCLUS, frequent pattern-based clustering methods, clustering
in noneuclidean space, clustering for streams and parallelism.
V Frame Works and Visualization: MapReduce, Hadoop, Pig, Hive, HBase, MapR, Sharding,
NoSQL Databases, S3, Hadoop Distributed File Systems, Visualization: visual data analysis
techniques, interaction techniques, systems and applications.
08
Introduction to R - R graphical user interfaces, data import and export, attribute and data types,
descriptive statistics, exploratory data analysis, visualization before analysis, analytics for
unstructured data.
Text books and References:
1. Michael Berthold, David J. Hand, Intelligent Data Analysis, Springer
2. Anand Rajaraman and Jeffrey David Ullman, Mining of Massive Datasets, Cambridge University
Press.
3. Bill Franks, Taming the Big Data Tidal wave: Finding Opportunities in Huge Data Streams with
Advanced Analytics, John Wiley & Sons.
4. Michael Minelli, Michelle Chambers, and Ambiga Dhiraj, "Big Data, Big Analytics: Emerging
Business Intelligence and Analytic Trends for Today's Businesses", Wiley
5. David Dietrich, Barry Heller, Beibei Yang, “Data Science and Big Data Analytics”, EMC Education
Series, John Wiley
6. Frank J Ohlhorst, “Big Data Analytics: Turning Big Data into Big Money”, Wiley and SAS Business
Series
7. Colleen Mccue, “Data Mining and Predictive Analysis: Intelligence Gathering and Crime Analysis”,
Elsevier
8. Anil Maheshwari, “Data Analytics”, McGraw Hill Education
9. Paul Zikopoulos, Chris Eaton, Paul Zikopoulos, “Understanding Big Data: Analytics for Enterprise
Class Hadoop and Streaming Data”, McGraw Hill
10. Trevor Hastie, Robert Tibshirani, Jerome Friedman, "The Elements of Statistical Learning", Springer
11. Mark Gardner, “Beginning R: The Statistical Programming Language”, Wrox Publication
12. Pete Warden, Big Data Glossary, O’Reilly
13. Glenn J. Myatt, Making Sense of Data, John Wiley & Sons
14. Pete Warden, Big Data Glossary, O’Reilly.
15. Peter Bühlmann, Petros Drineas, Michael Kane, Mark van der Laan, "Handbook of Big Data", CRC
Press 16. Jiawei Han, Micheline Kamber “Data Mining Concepts and Techniques”, Second Edition,
Elsevier

You might also like