We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 70
MACHINE LEARNING
Machine learning is a branch of artificial
intelligence(AI) that enables computers to “self learn” from training data and improve overtime without explicitly programmed. Machine learning algorithms are able to detect patterns in data and learn from them, in order to make their own predictions. In short, ML algorithms and models learn through experience. Supervised learning: Supervised learning is a category of machine learning that uses labeled datasets to train algorithms to predict outcomes and recognize patterns. Unlike unsupervised learning, supervised learning algorithms are given labeled training to learn the relationship between the input and the outputs. Unsupervised learning: Unsupervised learning in artificial intelligence is a type of machine learning that learns from data without human supervision. Unlike supervised learning, unsupervised machine learning models are given unlabeled data and allowed to discover patterns and insights without any explicit guidance or instruction. Reinforcement learning: Reinforcement Learning (RL) is the science of decision making. It is about learning the optimal behavior in an environment to obtain maximum reward. It's about taking the best possible action or path to gain maximum rewards and minimum punishment through observations in a specific situation. It acts as a signal to positive and negative behaviors. • Applications of Machine Learning Machine learning jobs and skills Jobs: Machine learning Engineer Data Engineer Machine learning scientist Required skills Python (or) R Probability and Statistics Maths Companies using Machine Learning
Google, Salesforce, Amazon, IBM, Pinterest,
Netflix, face book Features of ML: ML uses data to detect various patterns in a given dataset. It can learn from past data and improve automatically. It’s a data driven technology. Machine learning is much similar to data mining as it also deals with the huge amount of the data. Well posed learning Problem Well Posed Learning Problem – A computer program is said to learn from experience E in context to some task T and some performance measure P, if its performance on T, as was measured by P, upgrades with experience E. • Any problem can be segregated as well-posed learning problem if it has three traits – – Task – Performance Measure – Experience • Certain examples that efficiently defines the well-posed learning problem are – 1. To better filter emails as spam or not T -> Identifying whether or not an email is spam. P -> The percentage of emails correctly categorized as spam or nonspam. E -> Observing how you categorize emails as spam or nonspam. 2. A checkers learning problem T -> Playing checkers game P -> Percent of games won against opponent E -> Playing practice games against itself 3. Handwriting Recognition Problem T -> Recognizing and classifying handwritten words from images. P -> Percentage of correctly identified words. E-> Set of handwritten words with their classifications in a database. 4. A Robot Driving Problem T -> To drive on public four-lane highways using sight scanners. P -> The average distance progressed before an error. E -> The order of images and steering instructions noted down while observing a human driver. 5. Face Recognition Problem – Task – predicting different types of faces – Performance Measure – able to predict maximum types of faces – Experience – training machine with maximum amount of datasets of different face images 6. Automatic Translation of documents – Task – translating one type of language used in a document to other language – Performance Measure – able to convert one language to other efficiently – Experience – training machine with a large dataset of different types of languages Designing a learning system • According to Arthur Samuel – “Machine Learning enables a Machine to Automatically learn from Data, Improve performance from an Experience and predict things without explicitly programmed.” • In Simple Words, – When we fed the Training Data to Machine Learning Algorithm, this algorithm will produce a mathematical model and with the help of the mathematical model, the machine will make a prediction and take a decision without being explicitly programmed. – Also, during training data, the more machine will work with it the more it will get experience and the more efficient result is produced. Why a designing a learning system is important? For a perfect & efficient system, it is important to design the learning system. If not expected output will not come & the system without function properly. Designing a learning system: There are 5 basic steps to be followed while designing a learning system. 1) Choosing the Training Experience 2) Choosing the Target Function 3) Choose a Representation for the Target Function 4) Choosing a Function Approximation Algorithm 5) The Final Design Step1:Choosing a training experience – The first important task is to choose the training data or training experience which will be fed to the machine learning algorithm. – It will impact on the success or failure of the model(machine) – So training data(or) experience should be chosen wisely. – Below are the attributes which will impact on success and failure of data. • Type of feedback • Degree • Distribution of examples. 1) Type of feedback: the training experience will be able to provide direct (or) indirect feedback regarding choices. For example: while playing chess, the training data will provide feedback to itself like “Instead of this move, if this is chosen, the chances of success increases ” 2) Degree: It is the degree capability to which the learner will control the sequence of training.
ex: driving a car
with trainer help with trainers help completely
100% dependence partially 50% on your own 100% similarly, when training data is fed to the machine then at that time accuracy is very less -but when it gains experience while doing task again and again will get feedback and control the task accordingly. 3) Distribution of examples to increase performance: How well it represent the distribution of examples over which the performance of final system is measured. More possible combinations More situations, more examples Learning over distributed range of examples. Ex: Learning driving A machine learning algorithm will get experience which going through a no.of different cases and different examples. Thus, ML algorithm will get more and more experience by passing through more and more examples and hence its performance will increases. Step:2 Choosing the target function: It means according to what ever knowledge we fed to the algorithm, the machine will choose next action/move function which will describe what type of legal moves should be taken. Ex: Checkers game While moving -diagonally, set of all possible moves is called legal moves. - Travel only in forward direction - Only one move per chance - Only in diagonal direction - Jump over opponent Target function => v(b) Board State => b Legal moves set => B b is final board state that is won, then v(b) = 100 b is final board state that is lost, then v(b) = -100 b is final board state that is draw, then v(b) = 0 If b is not final state then v(b)= v(b’) b’ ->best final state Step:3 Choosing a representation for target function: For any board state, we calculate function ‘c’ as linear combination of following board features i.e. c(b). Features: x1- No. of black pieces on board x2 - No .of red pieces on board x3- No .of black kings on board x4- No. of red kings on board x5- No. of black pieces threatened by red (blacks which can be beaten by red) x6- No. of red pieces threatened by black. • V^(b)=w0+w1x1+ w2x2 + w3x3 + w4x4 + w5x5 + w6x6 (representation of c(b)) (w0-w6)= numerical coefficients (or) weights of each feature. determine the importance / weightage of features w0 – additive constant.
Step:4 Choosing a Function Approximation Algorithm:
In this step, we choose a learning algorithm that can approximate the target function chosen. This step further consists of two sub-steps, a. Estimating the training value, b. Adjusting the weights. Step:5 Final Design: The final design of checkers learning system is described naturally by four different program modules which indicate the central components in most of the learning systems. The four modules are depicted as follows. Perspectives and Issues in ML PERSPECTIVES: • Machine learning is a subfield of artificial intelligence and machine learning algorithms are used in other related fields like natural language processing and computer vision. • In general, there are three types of learning and these are supervised learning, unsupervised learning, and reinforcement learning. • Their names tell the main idea behind them actually. • In supervised learning, your system learns under the supervision of the data outputs so supervised algorithms are preferred if your dataset contains output information. • Let me give you an example in there. • Let’s assume you have a medical statistic company and you have a dataset which contains patients’ features like blood pressure, sugar rate in their blood, heart rate per minute, etc. ISSUES: LACK OF QUALITY DATA One of the main issues in Machine Learning is the absence of good data. While, algorithms tend to make developers exhaust most of their time on artificial intelligence. FAULT IN CREDIT CARD FRAUD DETECTION Although this AI-driven software helps to successfully detect credit card fraud, there are issues in Machine Learning that make the process redundant. GETTING BAD RECOMMENDATIONS Proposal engines are quite regular today. While some might be dependable, others may not appear to provide the necessary results. TALENT DEFICIT Albeit numerous individuals are pulled into the ML business, however, there are still not many experts who can take complete control of this innovation. MAKING THE WRONG ASSUMPTIONS ML models can’t manage datasets containing missing data points. Thus, highlights that contain a huge part of missing data should be erased. supervised vs. unsupervised learning What is Supervised Learning? • Supervised Learning is the machine learning approach defined by its use of labeled datasets to train algorithms to classify data and predict outcomes. • The labeled dataset has output tagged corresponding to input data for the machine to understand what to search for in the unseen data. Supervised Machine Learning Methods • There are two main areas where supervised machine learning comes in handy: classification problems and regression problems. Classification • Classification refers to taking an input value and mapping it to a discrete value. In classification problems, our output typically consists of classes or categories. This could be things like trying to predict what objects are present in an image (a cat/ a dog) or whether it is going to rain today or not. Regression • Regression is related to continuous data (value functions). In Regression, the predicted output values are real numbers. It deals with problems such as predicting the price of a house or the trend in the stock price at a given time, etc. Some of the most common algorithms in Supervised Learning include Support Vector Machines (SVM), Logistic Regression, Naive Bayes, Neural Networks, K-nearest neighbor (KNN), and Random Forest. Supervised Machine Learning Applications Now, let's have a look at some of the popular applications of Supervised Learning: Predictive analytics (house prices, stock exchange prices, etc.) Text recognition Spam detection Customer sentiment analysis Object detection (e.g. face detection) What is Unsupervised Learning? • Unsupervised Learning is a type of machine learning in which the algorithms are provided with data that does not contain any labels or explicit instructions on what to do with it. The goal is for the learning algorithm to find structure in the input data on its own. • To put it simply—Unsupervised Learning is a kind of self-learning where the algorithm can find previously hidden patterns in the unlabeled datasets and give the required output without any interference. • Identifying these hidden patterns helps in clustering, association, and detection of anomalies and errors in data. Unsupervised Machine Learning Methods • Unsupervised Learning models can perform more complex tasks than Supervised Learning models, but they are also more unpredictable. Here are the main tasks that utilize this approach. Clustering • Clustering is the type of Unsupervised Learning where we find hidden patterns in the data based on their similarities or differences. These patterns can relate to the shape, size, or color and are used to group data items or create clusters. • There are several types of clustering algorithms, such as exclusive, overlapping, hierarchical, and probabilistic. Association • Association is the kind of Unsupervised Learning where we can find the relationship of one data item to another data item. We can then use those dependencies and map them in a way that benefits us—e.g., understanding consumers' habits regarding our products can help us develop better cross- selling strategies. • The association rule is used to find the probability of co-occurrence of items in a collection. These techniques are often utilized in customer behavior analysis in e-commerce websites and OTT platforms. Dimensionality reduction • As the name suggests, the algorithm works to reduce the dimensions of the data. It is used for feature extraction. • Extracting the important features from the dataset is an essential aspect of machine learning algorithms. This helps reduce the number of random variables in the dataset by filtering irrelevant features. • Finally, here's a nice visual recap of everything we've covered so far (plus the Reinforcement Learning).