Ass bigd
Ass bigd
Diagram:
(Data Collection) → (Preprocessing) → (Feature Selection) → (Training) → (Evaluation) →
(Deployment).
6. Advanced ML Concepts
Algorithm Characteristic
K-means Partition-based clustering.
Hierarchical Builds nested clusters.
DBSCAN Density-based clustering.
a) Parameters:
1. Content-based filtering.
2. Collaborative filtering.
3. Hybrid methods.
9. MapReduce Concepts
a) MapReduce Overview:
Would you like a detailed explanation or code for any specific section?
Machine Learning (ML) is a branch of artificial intelligence that enables systems to learn
patterns from data and improve their performance on specific tasks without being explicitly
programmed.
Example Use Case Predicting house prices. Modeling biological growth rates.
1. Mean Absolute Error (MAE): Average absolute difference between actual and predicted values.
2. Mean Squared Error (MSE): Average squared difference between actual and predicted values.
3. R-squared (R²): Proportion of variance explained by the model.
4. Root Mean Squared Error (RMSE): Square root of MSE, measures prediction error magnitude.
Feature Selection Can shrink coefficients to zero. Retains all features. Selects and retains.
b) Cluster Classification Algorithms:
a) Euclidean Distance:
b) K-means Clustering:
9. MapReduce Concepts
a) What is MapReduce?
A programming model for distributed data processing that divides tasks into:
a) Class Hierarchies:
Input Format Class: Defines how input data is read (e.g., TextInputFormat).
Output Format Class: Defines how output data is written (e.g., TextOutputFormat).
Key Components:
Let me know if you'd like detailed examples or explanations for specific parts.