
R22 Machine Learning Lecture Notes

UNIT-III
Learning with Trees: Decision Trees, Constructing Decision Trees, Classification and Regression Trees.
Ensemble Learning: Boosting, Bagging, Different ways to combine classifiers, Basic Statistics, Gaussian Mixture Models, Nearest Neighbour Methods.
Unsupervised Learning: K-Means Algorithm.
Decision Trees:
 Decision Tree is a Supervised learning technique that can be used for both classification and Regression problems, but mostly it is preferred for solving Classification problems.
 It is a tree-structured classifier, where internal nodes represent the features of a dataset, branches represent the decision rules and each leaf node represents the outcome.
 In a Decision tree, there are two types of nodes, which are the Decision Node and the Leaf Node.
 Decision nodes are used to make any decision and have multiple branches, whereas Leaf nodes are the outputs of those decisions and do not contain any further branches.
 The decisions or the tests are performed on the basis of the features of the given dataset.
 It is a graphical representation for getting all the possible solutions to a problem/decision based on given conditions.
 It is called a decision tree because, similar to a tree, it starts with the root node, which expands on further branches and constructs a tree-like structure.


Example:

 One of the reasons that decision trees are popular is that we can turn them into a set of logical disjunctions (if ... then rules) that then go into program code very simply.
Ex: if there is a party then go to it
if there is not a party and you have an urgent deadline then study
Constructing Decision Trees:
Types of Decision Tree Algorithms:
 ID3: This algorithm measures how mixed up the data is at a node using something called entropy. It then chooses the feature that helps to clarify the data the most.
 C4.5: This is an improved version of ID3 that can handle missing data and continuous attributes.
 CART: This algorithm uses a different measure called Gini impurity to decide how to split the data. It can be used for both classification (sorting data into categories) and regression (predicting continuous values) tasks.

ID3 Algorithm:

Entropy in Information Theory:
 Entropy measures the amount of impurity in a set of features.
 The entropy H of a set of probabilities pᵢ is:

H(p) = − Σᵢ pᵢ log₂ pᵢ

 where the logarithm is base 2 because we are imagining that we encode everything using binary digits (bits), and we define 0 log 0 = 0.


 If all of the examples are positive, then we don’t get any extra information from knowing the value of the feature for any particular example, since whatever the value of the feature, the example will be positive. Thus, the entropy of that feature is 0.
 However, if the feature separates the examples into 50% positive and 50% negative, then the amount of entropy is at a maximum, and knowing about that feature is very useful to us.
 For our decision tree, the best feature to pick as the one to classify on now is the one that gives us the most information about the class, i.e., the one with the highest information gain (defined next).
Information Gain:
 It is defined as the entropy of the whole set minus the weighted entropy when a particular feature F is chosen:

Gain(S, F) = Entropy(S) − Σ_{f ∈ values(F)} (|S_f| / |S|) Entropy(S_f)

 The ID3 algorithm computes this information gain for each feature and chooses the one that produces the highest value.
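The two formulas above can be sketched in a few lines of Python; the toy labels and feature values below are made up purely for illustration.

```python
import numpy as np
from collections import Counter

def entropy(labels):
    """H = -sum(p_i * log2(p_i)) over the classes present in `labels`."""
    total = len(labels)
    return -sum((c / total) * np.log2(c / total) for c in Counter(labels).values())

def information_gain(feature_values, labels):
    """Entropy of the whole set minus the weighted entropy after splitting
    on the given feature (the quantity ID3 maximizes)."""
    total = len(labels)
    remainder = 0.0
    for value in set(feature_values):
        subset = [lab for f, lab in zip(feature_values, labels) if f == value]
        remainder += (len(subset) / total) * entropy(subset)
    return entropy(labels) - remainder

# Toy example: a feature that perfectly separates the two classes.
labels  = ['yes', 'yes', 'yes', 'no', 'no', 'no']
feature = ['a',   'a',   'a',   'b',  'b',  'b']
print(entropy(labels))                    # 1.0 bit for a 50/50 split
print(information_gain(feature, labels))  # 1.0: the split removes all impurity
```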


C4.5 Algorithm:
 It is an improved version of ID3.
 Pruning is another method that can help us avoid overfitting.
 It helps in improving the performance of the Decision tree by cutting the nodes or sub-nodes which are not significant.
 Additionally, it removes the branches which have very low importance.
 There are mainly 2 ways of pruning (a small sketch of both follows this list):
 Pre-pruning – we can stop growing the tree earlier, which means we can prune/remove/cut a node if it has low importance while growing the tree.
 Post-pruning – once our tree is built to its depth, we can start pruning the nodes based on their significance.
 C4.5 uses a different method called rule post-pruning.
 This consists of taking the tree generated by ID3, converting it to a set of if-then rules, and then pruning each rule by removing preconditions if the accuracy of the rule increases without them.
 The rules are then sorted according to their accuracy on the training set and applied in order.
 The advantages of dealing with rules are that they are easier to read and their order in the tree does not matter, just their accuracy in the classification.
 For continuous variables, the simplest solution is to discretise the continuous variable.
 The computational complexity of a decision tree is O(dn log n), where n is the number of data points and d is the number of dimensions.
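As a hedged illustration only: scikit-learn's DecisionTreeClassifier implements CART rather than C4.5's rule post-pruning, but it lets us sketch the pre-pruning vs. post-pruning distinction described above (max_depth/min_samples_leaf stop growth early; ccp_alpha prunes a fully grown tree by cost-complexity).

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Pre-pruning: stop growing the tree early.
pre_pruned = DecisionTreeClassifier(max_depth=3, min_samples_leaf=5).fit(X_train, y_train)

# Post-pruning: grow fully, then cut back weak branches (cost-complexity pruning).
post_pruned = DecisionTreeClassifier(ccp_alpha=0.02).fit(X_train, y_train)

print(pre_pruned.score(X_test, y_test), post_pruned.score(X_test, y_test))
```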


Classification Example: construct the decision tree to decide what to do in the evening.

We start with: which feature should be selected as the root node?

First, compute the entropy of S:

Then, find which feature has the maximal information gain:


 Therefore, the root node will be the party feature, which has two feature values (‘yes’ and ‘no’), so it will have two branches coming out of it.


 When we look at the ‘yes’ branch, we see that in all five cases where there was a party we went to it, so we just put a leaf node there, saying ‘party’.
 For the ‘no’ branch, out of the five cases there are three different outcomes, so now we need to choose another feature.
 The five cases we are looking at are:

 We’ve used the party feature, so we just need to calculate the information gain of the other two over these five examples:


 Here, the Deadline feature has the maximum information gain. Hence, we select the Deadline feature for splitting the data.

 Finally, we will get the following decision tree.


Classification and Regression Trees (CART):
 It is another well-known tree-based algorithm, CART, whose name indicates that it can be used for both classification and regression.

Gini Impurity:

 It is the probability of misclassifying a randomly chosen element in a set.

 The ‘impurity’ in the name suggests that the aim of the decision tree is to have each leaf node represent a set of data points that are in the same class, so that there are no mismatches. This is known as purity.
 If a leaf is pure then all of the training data within it have just one class.
 Consider a dataset D that contains samples from k classes.
 The probability of samples belonging to class i at a given node can be denoted as pᵢ. Then the Gini Impurity of D is defined as:

Gini(D) = 1 − Σᵢ pᵢ²

 The node with a uniform class distribution has the highest impurity (0.5 for two classes).
 The minimum impurity (0) is obtained when all records belong to the same class.
 The attribute with the smallest Gini Impurity is selected for splitting the node.
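A short sketch of the Gini formula above, with made-up label lists:

```python
from collections import Counter

def gini_impurity(labels):
    """Gini(D) = 1 - sum(p_i^2) over the classes present in `labels`."""
    total = len(labels)
    return 1.0 - sum((c / total) ** 2 for c in Counter(labels).values())

print(gini_impurity(['a', 'a', 'a', 'a']))   # 0.0: pure node, minimum impurity
print(gini_impurity(['a', 'a', 'b', 'b']))   # 0.5: uniform two-class split, maximum impurity
```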


Regression in Trees:

 A Regression tree is an algorithm where the target variable is continuous and the tree is used to predict its value.

 A Regression Tree works by splitting the training data recursively into smaller subsets based on specific criteria.
 The objective is to split the data in a way that minimizes the residual error (Sum of Squared Errors) in each subset.
 Residual Reduction – Residual reduction is a measure of how much the average squared difference between the predicted values and the actual values for the target variable is reduced by splitting the subset. The greater the residual reduction, the better the model fits the data.
 Splitting Criteria – CART evaluates every possible split at each node and selects the one that results in the greatest reduction of residual error in the resulting subsets. This process is repeated until a stopping criterion is met, such as reaching the maximum tree depth or having too few instances in a leaf node.
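A minimal sketch of the splitting criterion described above, for a single feature; the helper names (sse, best_split) and the toy data are made up for illustration.

```python
import numpy as np

def sse(y):
    """Sum of squared errors of y around its mean (the residual of a node)."""
    return float(np.sum((y - y.mean()) ** 2)) if len(y) else 0.0

def best_split(x, y):
    """Scan every threshold on one feature and return the split that gives
    the greatest reduction in SSE (the CART regression criterion)."""
    parent = sse(y)
    best_thr, best_reduction = None, 0.0
    for thr in np.unique(x)[:-1]:          # no point splitting above the largest value
        left, right = y[x <= thr], y[x > thr]
        reduction = parent - (sse(left) + sse(right))
        if reduction > best_reduction:
            best_thr, best_reduction = thr, reduction
    return best_thr, best_reduction

x = np.array([1.0, 2.0, 3.0, 10.0, 11.0, 12.0])
y = np.array([1.1, 0.9, 1.0, 5.2, 4.8, 5.0])
print(best_split(x, y))   # splits at x <= 3.0, separating the two flat regions
```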


Ensemble Learning:
 Ensemble learning refers to the approach of combining multiple ML models to produce a more accurate and robust prediction compared to any individual model.
 The conventional ensemble methods include bagging, boosting, and stacking-based methods.

Boosting:
 Boosting is an ensemble technique that combines multiple weak learners to create a strong learner.


 The ensemble of weak models is trained in series such that each model that comes next tries to correct the errors of the previous model, until the entire training dataset is predicted correctly.
 One of the most well-known boosting algorithms is AdaBoost (Adaptive Boosting).
AdaBoost:
 AdaBoost, short for Adaptive Boosting, is an ensemble learning method used in machine learning for classification and regression problems.
 The main idea behind AdaBoost is to iteratively train the weak classifier on the training dataset, with each successive classifier giving more weightage to the data points that are misclassified.
 The final AdaBoost model is decided by combining all the weak classifiers that have been used for training, with the weightage given to the models according to their accuracies.
 The model which has the highest accuracy is given the highest weightage, while the model which has the lowest accuracy is given a lower weightage.
Steps in AdaBoost:

1. Weight Initialization

At the start, every instance is assigned an identical weight. These weights determine the importance of every example.

2. Model Training

A weak learner is trained on the dataset, with the aim of minimizing classification errors.

3. Weighted Error Calculation

The weighted error is then calculated by summing up the weights of the misclassified instances. This step emphasizes the importance of the samples which are tough to classify.

4. Model Weight Calculation

The weight of the weak learner is calculated based on its performance in classifying the training data. Models that perform well are assigned higher weights, indicating that they are more reliable.

5. Update Instance Weights

The instance weights are updated to give more weight to the misclassified samples from the previous step.

6. Repeat

Steps 2 through 5 are repeated for a predefined number of iterations or until a performance threshold is met.


7. Final Model Creation

The final strong model (also referred to as the ensemble) is created by combining the weighted outputs of all weak learners.

8. Classification

To make predictions on new records, AdaBoost uses the final ensemble model.
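A minimal sketch of the whole loop using scikit-learn's AdaBoostClassifier (its default weak learner is a depth-1 decision tree, a "stump"); the synthetic dataset is made up for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Each boosting round re-weights the training instances (steps 3-5 above) and
# fits another weak learner; the weighted learners form the final model (step 7).
model = AdaBoostClassifier(n_estimators=50, random_state=0)
model.fit(X_train, y_train)
print(model.score(X_test, y_test))   # accuracy of the combined ensemble (step 8)
```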

Bagging:
 Bagging is a supervised learning technique that can be used for both regression and classification tasks.

 Here is an overview of the steps in the Bagging classifier algorithm:


 Bootstrap Sampling: Creates ‘N’ subsets of the original training data by randomly sampling rows with replacement. This step ensures that the base models are trained on diverse subsets of the data.
 Base Model Training: For each bootstrapped sample, train a base model independently on that subset of data. These weak models are trained in parallel to increase computational efficiency and reduce time consumption.


 Prediction Aggregation: To make a prediction on testing data, combine the predictions of all base models. For classification tasks, this can be majority voting or weighted majority voting, while for regression it involves averaging the predictions.
 Out-of-Bag (OOB) Evaluation: Some samples are excluded from the training subset of particular base models during the bootstrapping method. These “out-of-bag” samples can be used to estimate the model’s performance without the need for cross-validation.
 Final Prediction: After aggregating the predictions from all the base models, Bagging produces a final prediction for each instance.
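A minimal sketch of these steps with scikit-learn's BaggingClassifier (its default base model is a decision tree), including the OOB estimate; the synthetic data is made up for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier

X, y = make_classification(n_samples=500, random_state=0)

model = BaggingClassifier(
    n_estimators=25,    # number of bootstrap samples / base models trained in parallel
    oob_score=True,     # evaluate each model on its out-of-bag samples
    random_state=0,
)
model.fit(X, y)
print(model.oob_score_)        # OOB accuracy estimate, no separate validation split needed
print(model.predict(X[:5]))    # final prediction = majority vote of the base models
```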
Random Forest:
 The idea is largely that if one tree is good, then many trees (a forest) should be better, provided that there is enough variety between them.
 It works by creating a number of Decision Trees during the training phase.
 Each tree is constructed using a random subset of the dataset to measure a random subset of features in each partition.
 This randomness introduces variability among individual trees, reducing the risk of overfitting and improving overall prediction performance.
 In prediction, the algorithm aggregates the results of all trees, either by voting (for classification tasks) or by averaging (for regression tasks).
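A minimal Random Forest sketch with scikit-learn; the parameters shown just mirror the two sources of randomness described above (bootstrap samples of the data, random feature subsets at each split).

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

forest = RandomForestClassifier(
    n_estimators=100,      # number of trees in the forest
    max_features="sqrt",   # random subset of features considered at each split
    bootstrap=True,        # each tree sees a bootstrap sample of the data
    random_state=0,
)
forest.fit(X, y)
print(forest.predict(X[:5]))   # aggregated (majority-vote) prediction of all trees
```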


Stacking:
 Stacking combines many ensemble methods in order to build a meta-learner.
 Stacking has two levels of learning: 1) base learning and 2) meta-learning.
 In the first one, the base learners are trained with the training data set.
 Once trained, the base learners create a new data set for a meta-learner.
 The meta-learner is then trained with that new training data set.
 Finally, the trained meta-learner is used to classify new instances.
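A minimal two-level stacking sketch with scikit-learn's StackingClassifier; the choice of base learners and meta-learner here is arbitrary, for illustration only.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)

stack = StackingClassifier(
    estimators=[("tree", DecisionTreeClassifier()),      # level-1 base learners
                ("knn", KNeighborsClassifier())],
    final_estimator=LogisticRegression(),                # level-2 meta-learner
)
stack.fit(X, y)                 # base learners' predictions become the meta-learner's inputs
print(stack.predict(X[:5]))
```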

Different ways to combine classifiers:
 If the number of classifiers is odd and the classifiers are each independent of each other, then majority voting will return the correct label if more than half of the classifiers agree.
 For regression problems, rather than taking the majority vote, it is common to take the mean of the outputs.
 However, the mean is heavily affected by outliers, with the result that the median is a more common average to use.
 It is the use of the median that produces the ‘bragging’ algorithm, a name which is meant to imply ‘robust bagging’.
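A tiny illustration of the combination rules above, with made-up predictions:

```python
import numpy as np
from collections import Counter

clf_preds = ['cat', 'dog', 'cat']                  # three independent classifiers
print(Counter(clf_preds).most_common(1)[0][0])     # majority vote -> 'cat'

reg_preds = np.array([2.1, 2.0, 9.5])              # one regressor produces an outlier
print(reg_preds.mean())                            # ~4.53, pulled up by the outlier
print(np.median(reg_preds))                        # 2.1, robust to the outlier
```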
Basic Statistics:
Mean:
 The "mean" is the average value of a dataset.
 It is calculated by adding up all the values in the dataset and dividing by the number of observations.
 The mean is a useful measure of central tendency, but it is sensitive to outliers, meaning that extreme values can significantly affect its value.
Median:

 The "median" is the middle value in a dataset.


 It is calculated by arranging the values in the dataset in order and finding the value that lies in the middle.
 If there are an even number of values in the dataset, the median is the average of the two middle values.


 The median is a useful measure of central tendency because it is not affected by outliers, meaning that extreme values do not significantly affect the value of the median.

Mode:
 The "mode" is the most common value in a dataset.
 It is calculated by finding the value that occurs most frequently in the dataset.
 If there are multiple values that occur with the same frequency, the dataset is said to be bimodal, trimodal, or multimodal.
 The mode is a useful measure of central tendency because it can identify the most common value in a dataset.
 However, it is not a good measure of central tendency for datasets with a wide range of values or datasets with no repeating values.
Variance:
 Variance is a measure of how much the data for a variable varies from its mean.

Covariance:
 Covariance is a measure of the relationship between two variables that is scale dependent, i.e. how much will a variable change when another variable changes.

Standard Deviation:
 The square root of the variance is known as the standard deviation.
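The statistics above in a few lines of NumPy, using a made-up dataset:

```python
import numpy as np
from collections import Counter

data = np.array([2, 4, 4, 4, 5, 5, 7, 9])

print(np.mean(data))                          # mean = 5.0
print(np.median(data))                        # median = 4.5
print(Counter(data.tolist()).most_common(1))  # mode: [(4, 3)] -> 4 occurs most often
print(np.var(data))                           # variance around the mean
print(np.std(data))                           # standard deviation = sqrt(variance)
print(np.cov(data, 2 * data))                 # 2x2 covariance matrix of two variables
```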
Mahalanobis Distance:
 Mahalanobis Distance is a statistical tool used to measure the distance between a point and a distribution.
 It is a powerful technique that considers the correlations between variables in a dataset, making it a valuable tool in various applications such as outlier detection, clustering, and classification.
D²=(x-μ)ᵀΣ⁻¹(x-μ)


Where D² is the squared Mahalanobis Distance, x is the point in question, μ is the mean vector of the distribution, Σ is the covariance matrix of the distribution, and ᵀ denotes the transpose of a matrix.
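A small sketch of the formula above; the sample distribution is randomly generated for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))                       # samples from the "distribution"
mu = X.mean(axis=0)                                 # mean vector
sigma_inv = np.linalg.inv(np.cov(X, rowvar=False))  # inverse covariance matrix

def mahalanobis_sq(x):
    """D^2 = (x - mu)^T Sigma^-1 (x - mu)."""
    diff = x - mu
    return float(diff @ sigma_inv @ diff)

print(mahalanobis_sq(np.array([0.0, 0.0])))   # small: close to the distribution's centre
print(mahalanobis_sq(np.array([5.0, 5.0])))   # large: a likely outlier
```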
The Gaussian/Normal Distribution:
 Normal distribution, also known as the Gaussian distribution, is a continuous probability distribution that is symmetric about the mean, depicting that data near the mean are more frequent in occurrence than data far from the mean.
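For reference, the probability density of a Gaussian with mean μ and standard deviation σ is f(x) = (1 / (σ√(2π))) exp(−(x − μ)² / (2σ²)).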

The Bias and Variance Trade-off:
 Bias is the difference between the average prediction of our model and the correct value which we are trying to predict.
 A model with high bias pays very little attention to the training data and oversimplifies the model. It always leads to high error on training and test data.
 Variance is the variability of model prediction for a given data point, or a value which tells us the spread of our data.


 A model with high variance pays a lot of attention to training data and does not generalize on data which it hasn’t seen before.
 As a result, such models perform very well on training data but have high error rates on test data.
 If our model is too simple and has very few parameters then it may have high bias and low variance.
 On the other hand, if our model has a large number of parameters then it is going to have high variance and low bias.
 So we need to find the right/good balance without overfitting and underfitting the data.

Gaussian Mixture Models:
 GMM is blending multiple Gaussian distributions to form a single model.
 A Gaussian mixture model (GMM) is a machine learning method used to determine the probability each data point belongs to a given cluster. The model is a soft clustering method used in unsupervised learning.


 In soft clustering, instead of forcefully assigning a data point to a single cluster, GMM assigns probabilities that indicate the likelihood of that data point belonging to each of the Gaussian components.

Notation:

 K: Number of Gaussian components
 N: Number of data points
 D: Dimensionality of the data

GMM Parameters:

 Means (μ): Center locations of Gaussian components.
 Covariance Matrices (Σ): Define the shape and spread of each component.
 Weights (π): Probability of selecting each component.

Model Training

 Training a GMM involves setting the parameters using available data.


 The Expectation-Maximization (EM) technique is often employed, alternating between the Expectation (E) and Maximization (M) steps until convergence.

Expectation-Maximization:

 During the E step, the model calculates the probability of each data point belonging to each Gaussian component.
 The M step then adjusts the model’s parameters based on these probabilities.

Clustering and Density Estimation:

 Post-training, GMMs cluster data points based on the highest posterior probability.
 They are also used for density estimation, assessing the probability density at any point in the feature space.
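A minimal GMM sketch with scikit-learn's GaussianMixture, which fits the means, covariances and weights by EM; the two-blob synthetic data is made up for illustration.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (100, 2)),    # component 1
               rng.normal(5, 1, (100, 2))])   # component 2

gmm = GaussianMixture(n_components=2, random_state=0).fit(X)   # EM training

print(gmm.means_)                 # means (mu) of the fitted components
print(gmm.weights_)               # mixing weights (pi)
print(gmm.predict_proba(X[:3]))   # soft assignment: P(component | data point)
print(gmm.score_samples(X[:3]))   # log probability density at each point
```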

Nearest Neighbour Methods:
K-Nearest Neighbors Algorithm:
 The K-Nearest Neighbors (KNN) algorithm is a supervised machine learning method employed to tackle classification and regression problems.


Step 1: Selecting the optimal value of K

 K represents the number of nearest neighbors that need to be considered while making a prediction.
Step 2: Calculating distance
 To measure the similarity between the target and training data points, Euclidean distance is used. The distance is calculated between each of the data points in the dataset and the target point.
Step 3: Finding Nearest Neighbors

 The k data points with the smallest distances to the target point are the nearest neighbors.
Step 4: Voting for Classification or Taking Average for Regression
 In the classification problem, the class label is determined by performing majority voting over the K nearest neighbors. The class with the most occurrences among the neighbors becomes the predicted class for the target data point.
 In the regression problem, the prediction is calculated by taking the average of the target values of the K nearest neighbors. The calculated average value becomes the predicted output for the target data point.
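The four steps above in a small, self-contained sketch (the helper name knn_predict and the toy points are made up for illustration):

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=3):
    # Step 2: Euclidean distance from the target point to every training point.
    distances = np.linalg.norm(X_train - x, axis=1)
    # Step 3: indices of the k nearest neighbours.
    nearest = np.argsort(distances)[:k]
    # Step 4: majority vote among the neighbours' class labels.
    return Counter(y_train[nearest]).most_common(1)[0][0]

X_train = np.array([[1, 1], [1, 2], [2, 1], [8, 8], [8, 9], [9, 8]])
y_train = np.array(['red', 'red', 'red', 'blue', 'blue', 'blue'])
print(knn_predict(X_train, y_train, np.array([2, 2])))   # 'red'
print(knn_predict(X_train, y_train, np.array([8, 7])))   # 'blue'
```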
K-Dimensional Tree (KD Tree):
 A KD Tree is a space-partitioning data structure for organizing points in K-dimensional space.
 It speeds up nearest-neighbour search compared with a brute-force KNN search.
 It is useful for representing data efficiently.
 In a KD Tree the data points are organized and partitioned on the basis of some specific conditions.
 The purpose of the tree is to store spatial data with the goal of accomplishing:

1. Nearest neighbor search.
2. Range queries.
3. Fast look-up.

Example:
As a simple example to showcase insertion into a K-Dimensional Tree, we will use k = 2. The points we will be adding are: (7,8), (12,3), (14,1), (4,12), (9,1), (2,7), and (10,19).
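A sketch of the example above using SciPy's KDTree, which supports nearest-neighbour search and range queries:

```python
import numpy as np
from scipy.spatial import KDTree

points = np.array([(7, 8), (12, 3), (14, 1), (4, 12), (9, 1), (2, 7), (10, 19)])
tree = KDTree(points)               # builds the 2-d space-partitioning tree

dist, idx = tree.query((9, 2))      # nearest-neighbour search
print(points[idx], dist)            # [9 1] at distance 1.0

print(tree.query_ball_point((7, 8), r=5))   # range query: indices of points within radius 5
```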


Unsupervised Learning:
 Unsupervised learning is a type of machine learning that learns from unlabeled data.
 This means that the data does not have any pre-existing labels or categories.
 The goal of unsupervised learning is to discover patterns and relationships in the data without any explicit guidance.
Types of Unsupervised Learning:
Unsupervised learning is classified into two categories of algorithms:

 Clustering: A clustering problem is where you want to discover the inherent groupings in the data, such as grouping customers by purchasing behavior. Clustering is a type of unsupervised learning that is used to group similar data points together.
 Association: An association rule learning problem is where you want to discover rules that describe large portions of your data, such as people that buy X also tend to buy Y.

Applications of Unsupervised learning:

 Anomaly detection: Unsupervised learning can identify unusual patterns or deviations from normal behavior in data, enabling the detection of fraud, intrusion, or system failures.
 Scientific discovery: Unsupervised learning can uncover hidden relationships and patterns in scientific data, leading to new hypotheses and insights in various scientific fields.
 Recommendation systems: Unsupervised learning can identify patterns and similarities in user behavior and preferences to recommend products, movies, or music that align with their interests.
 Customer segmentation: Unsupervised learning can identify groups of customers with similar characteristics, allowing businesses to target marketing campaigns and improve customer service more effectively.
 Image analysis: Unsupervised learning can group images based on their content, facilitating tasks such as image classification, object detection, and image retrieval.

K-Means Algorithm:
 K-Means Clustering is an Unsupervised Learning algorithm which groups the unlabeled dataset into different clusters. Here K defines the number of pre-defined clusters.


The k-means clustering algorithm mainly performs two tasks:

 Determines the best value for the K center points or centroids by an iterative process.
 Assigns each data point to its closest k-center. Those data points which are near to the particular k-center create a cluster.
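A minimal K-Means sketch with scikit-learn, run on made-up two-blob data; fit() performs the iterative centroid update and assignment described above.

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (100, 2)),    # cluster around (0, 0)
               rng.normal(6, 1, (100, 2))])   # cluster around (6, 6)

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)

print(kmeans.cluster_centers_)   # the K centroids found by the iterative process
print(kmeans.labels_[:10])       # cluster assignment of the first 10 data points
```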


Applications of K-Means Clustering:
 Customer Segmentation
 Document Clustering
 Image Segmentation
 Recommendation Engines
 Image Compression

Advantages of K-Means Clustering:

 Simple and Easy to implement: The K-means algorithm is easy to understand and implement.
 Fast and Efficient: K-means is computationally efficient and can handle large datasets with high dimensionality.
 Scalability: K-means can handle large datasets with many data points and can be easily scaled to handle even larger datasets.
 Flexibility: K-means can be easily adapted to different applications and can be used with varying metrics of distance and initialization methods.

Disadvantages of K-Means Clustering:

 Sensitivity to initial centroids: K-means is sensitive to the initial selection of centroids and can converge to a suboptimal solution.


 Requires specifying the number of clusters: The number of clusters k needs to be specified before running the algorithm, which can be challenging in some applications.
 Sensitive to outliers: K-means is sensitive to outliers, which can have a significant impact on the resulting clusters.

*****
