Cloudera Introduction To Data Science: Building Recommender Systems
Cloudera Introduction To Data Science: Building Recommender Systems
Hands-On Hadoop
Through instructor-led discussion and interactive, hands-on exercises, participants will
navigate the Hadoop ecosystem, learning topics such as:
The role of data scientists, vertical use cases, and business applications of data
science
Where and how to acquire data, methods for evaluating source data, and data
transformation and preparation
Types of statistics and analytical methods and their relationship
Machine learning fundamentals and breakthroughs, the importance of algorithms, and
data as a platform
How to implement and manage recommenders using Apache Mahout and how to set
up and evaluate data experiments
Steps for deploying new analytics projects to production and tips for working at scale
TRAINING SHEET
Data Transformation
Anonymization
Joining Datasets
Implementing Recommenders
with Apache Mahout
Overview
Similarity Metrics for Binary Preferences
Similarity Metrics for Numeric Preferences
Scoring
Finance
Retail
Descriptive Statistics
Advertising
Inferential Statistics
Use Cases
Overview
Project Lifecycle
Steps in the Project Lifecycle
Lab Scenario Explanation
Deploying to Production
Recommender Overview
Acquisition Techniques
Conclusion
Fundamental Concepts
Appendix A :
Hadoop Overview
Data Acquisition
Data Quantity
Data Quality
cloudera.com
1-888-789-1488 or 1-650-362-0488
Cloudera, Inc., 1001 Page Mill Road, Palo Alto, CA 94304, USA
Appendix B:
Mathematical Formulas
Appendix C :
Language and Tool Reference
2015 Cloudera, Inc. All rights reserved. Cloudera and the Cloudera logo are trademarks or registered trademarks of Cloudera Inc. in the USA
and other countries. All other trademarks are the property of their respective companies. Information is subject to change without notice.
cloudera-training-sheet-introduction-to-data-science-103