Data Analytics
Data Analytics
II Data Analysis: Regression modeling, multivariate analysis, Bayesian modeling, inference and
Bayesian networks, support vector and kernel methods, analysis of time series: linear systems
analysis & nonlinear dynamics, rule induction, neural networks: learning and generalisation, 08
competitive learning, principal component analysis and neural networks, fuzzy logic: extracting
fuzzy models from data, fuzzy decision trees, stochastic search methods.
III Mining Data Streams: Introduction to streams concepts, stream data model and architecture,
stream computing, sampling data in a stream, filtering streams, counting distinct elements in a
stream, estimating moments, counting oneness in a window, decaying window, Real-time 08
Analytics Platform (RTAP) applications, Case studies – real time sentiment analysis, stock
market predictions.
IV Frequent Item sets and Clustering: Mining frequent item sets, market-based modelling,
Apriori algorithm, handling large data sets in main memory, limited pass algorithm, counting
frequent item sets in a stream, clustering techniques: hierarchical, K-means, clustering high 08
dimensional data, CLIQUE and ProCLUS, frequent pattern-based clustering methods, clustering
in noneuclidean space, clustering for streams and parallelism.
V Frame Works and Visualization: MapReduce, Hadoop, Pig, Hive, HBase, MapR, Sharding,
NoSQL Databases, S3, Hadoop Distributed File Systems, Visualization: visual data analysis
techniques, interaction techniques, systems and applications.
08
Introduction to R - R graphical user interfaces, data import and export, attribute and data types,
descriptive statistics, exploratory data analysis, visualization before analysis, analytics for
unstructured data.
Text books and References:
1. Michael Berthold, David J. Hand, Intelligent Data Analysis, Springer
2. Anand Rajaraman and Jeffrey David Ullman, Mining of Massive Datasets, Cambridge University
Press.
3. Bill Franks, Taming the Big Data Tidal wave: Finding Opportunities in Huge Data Streams with
Advanced Analytics, John Wiley & Sons.
4. Michael Minelli, Michelle Chambers, and Ambiga Dhiraj, "Big Data, Big Analytics: Emerging
Business Intelligence and Analytic Trends for Today's Businesses", Wiley
5. David Dietrich, Barry Heller, Beibei Yang, “Data Science and Big Data Analytics”, EMC Education
Series, John Wiley
6. Frank J Ohlhorst, “Big Data Analytics: Turning Big Data into Big Money”, Wiley and SAS Business
Series
7. Colleen Mccue, “Data Mining and Predictive Analysis: Intelligence Gathering and Crime Analysis”,
Elsevier
8. Anil Maheshwari, “Data Analytics”, McGraw Hill Education
9. Paul Zikopoulos, Chris Eaton, Paul Zikopoulos, “Understanding Big Data: Analytics for Enterprise
Class Hadoop and Streaming Data”, McGraw Hill
10. Trevor Hastie, Robert Tibshirani, Jerome Friedman, "The Elements of Statistical Learning", Springer
11. Mark Gardner, “Beginning R: The Statistical Programming Language”, Wrox Publication
12. Pete Warden, Big Data Glossary, O’Reilly
13. Glenn J. Myatt, Making Sense of Data, John Wiley & Sons
14. Pete Warden, Big Data Glossary, O’Reilly.
15. Peter Bühlmann, Petros Drineas, Michael Kane, Mark van der Laan, "Handbook of Big Data", CRC
Press 16. Jiawei Han, Micheline Kamber “Data Mining Concepts and Techniques”, Second Edition,
Elsevier