The document discusses several machine learning topics including successes in games, autonomous vehicles, text analysis and more. It covers major machine learning paradigms like induction, clustering, analogy and reinforcement learning. It also describes different types of learning problems including supervised, unsupervised and reinforcement learning.
Download as PPTX, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
6 views
Machine Learning
The document discusses several machine learning topics including successes in games, autonomous vehicles, text analysis and more. It covers major machine learning paradigms like induction, clustering, analogy and reinforcement learning. It also describes different types of learning problems including supervised, unsupervised and reinforcement learning.
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 13
Machine Learning Successes
• Games: chess, go, poker
• Autonomous vehicles • Text sentiment analysis • Individual face recognition • Email spam detection • Understanding digital images • Recommender systems (e.g., • Credit card fraud detection Netflix, Amazon) • Showing annoying ads • Machine translation • Speech understanding • SIRI, Alexa, Google Assistant, … The Big Idea and Terminology Given some data, learn a model of how the world works that lets you predict new data
• Training Set: Data from which you learn initially
• Model: What you learn; a “model” of how inputs are associated with outputs • Test set: New data you test your model against • Corpus: A body of text data (pl.: corpora) • Representation: The computational expression of data Major Machine learning paradigms (1) • Rote: 1-1 mapping from inputs to stored representation, learning by memorization, association-based storage & retrieval • Induction: Use specific examples to reach general conclusions • Clustering: Unsupervised discovery of natural groups in data Major Machine learning paradigms (2) • Analogy: Find correspondence between different representations • Discovery: Unsupervised, specific goal not given • Genetic algorithms: Evolutionary search techniques, based on survival of the fittest • Reinforcement: Feedback (positive or negative reward) given at the end of a sequence of steps • Deep learning: artificial neural networks with representation learning for ML tasks Types of learning problems • Supervised: learn from training examples • Regression: • Classification: Decision Trees, SVM • Unsupervised: learn w/o training examples • Clustering • Dimensionality reduction • Word embeddings • Reinforcement learning: improve performance using feedback from actions taken • Lots more we won’t cover • Hidden Markov models, Learning to rank, Semi-supervised learning, Active learning, … Supervised learning • Given training examples of inputs & corresponding outputs, produce “correct” outputs for new inputs • Two important scenarios: • Classification: outputs typically labels (goodRisk, badRisk); learn decision boundary to separate classes • Regression: aka curve fitting or function approxima-tion; Learn a continuous input-output mapping from examples, e.g., for a zip code, predict house sale price given its square footage Unsupervised Learning Given only unlabeled data as input, learn some sort of structure, e.g.: • Clustering: group Facebook friends based on similarity of post texts and friends • Topic modeling: Induce N topics and words most common in documents about each • Embeddings: Find sets of words whose meanings are related (e.g., doctor, hospital) • Large Language Models: Predict text that might follow a given test sequence (e.g., BERT, GPT-3) Machine Learning • ML’s significance in AI has gone up and down over the last 75 years • Today it’s very important for AI and data science • Driving ML are three trends: • Cheaper and more powerful computing systems • Open-source ML tools & models (e.g., Weka, scikit-learn, TensorFlow, Huggingface, SpaCy, BERT …) • Availability of large amounts of data • Understanding ML concepts and tools allow many to use them with success Decision Trees (DTs) • A supervised learning method used for classification and regression • Given a set of training tuples, learn model to predict one value from the others • Learned value typically a class (e.g., goodRisk) • Resulting model is simple to understand, interpret, visualize, and apply • D:\6th semester\CSE-3635 Artificial Intelligence 3.00+1.00\sheet\ 2021\14_machine_learning 14_2_dt