
Artificial Intelligence and Machine Learning

Question Bank-2
1. How does constraint propagation contribute to solving Constraint Satisfaction Problems (CSPs)?
Concept: Constraint propagation reduces the search space by enforcing constraints across variables, pruning candidate values that cannot appear in any solution.
Example: Sudoku—once a number is assigned to a cell, it is removed from the candidates of every other cell in the same row, column, and 3x3 box.
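A minimal Python sketch of this idea for Sudoku, assuming each cell keeps a set of candidate values (the grid representation and helper names below are illustrative, not part of the original answer):

def peers(r, c):
    """Return the coordinates that share a row, column, or box with (r, c)."""
    same_row = {(r, j) for j in range(9)}
    same_col = {(i, c) for i in range(9)}
    br, bc = 3 * (r // 3), 3 * (c // 3)
    same_box = {(i, j) for i in range(br, br + 3) for j in range(bc, bc + 3)}
    return (same_row | same_col | same_box) - {(r, c)}

def assign(domains, r, c, value):
    """Assign `value` to cell (r, c) and propagate the elimination to its peers."""
    domains[(r, c)] = {value}
    queue = [(r, c)]
    while queue:
        cr, cc = queue.pop()
        fixed = next(iter(domains[(cr, cc)]))
        for p in peers(cr, cc):
            if fixed in domains[p] and len(domains[p]) > 1:
                domains[p].discard(fixed)
                if len(domains[p]) == 1:  # a new singleton triggers further propagation
                    queue.append(p)
    return domains

# Usage: every cell starts with candidates 1-9; assigning 5 to (0, 0)
# removes 5 from the candidate sets of all 20 peers of that cell.
domains = {(i, j): set(range(1, 10)) for i in range(9) for j in range(9)}
assign(domains, 0, 0, 5)
print(5 in domains[(0, 1)])  # False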
2. Explain the importance of handling missing data in data preprocessing. Discuss two common imputation techniques.
 Importance: Missing values can bias estimates and degrade model accuracy; many algorithms cannot handle them at all.
 Techniques: Mean imputation (replace missing values with the column mean) and KNN imputation (fill a missing value from the values of the k most similar rows).
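A short sketch of both techniques using scikit-learn imputers (the toy array below is made up for illustration):

import numpy as np
from sklearn.impute import SimpleImputer, KNNImputer

X = np.array([[1.0, 2.0],
              [np.nan, 3.0],
              [7.0, 6.0],
              [4.0, np.nan]])

# Mean imputation: each NaN is replaced by its column mean
mean_imputed = SimpleImputer(strategy='mean').fit_transform(X)

# KNN imputation: each NaN is replaced using the k most similar rows
knn_imputed = KNNImputer(n_neighbors=2).fit_transform(X)

print(mean_imputed)
print(knn_imputed)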
3. What is the difference between supervised and unsupervised learning? Give an example of a problem that would be best addressed by each type of learning.
 Supervised learning: The model learns from labelled data (input-output pairs). Example: classifying emails as spam or not spam.
 Unsupervised learning: The model finds structure in unlabelled data. Example: grouping customers into segments based on purchasing behaviour.
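A brief contrast of the two settings in scikit-learn (synthetic data, not from the question bank):

from sklearn.datasets import make_classification, make_blobs
from sklearn.linear_model import LogisticRegression
from sklearn.cluster import KMeans

# Supervised: features X come with labels y, and the model learns the mapping X -> y
X, y = make_classification(n_samples=200, n_features=5, random_state=0)
clf = LogisticRegression().fit(X, y)
print("Predicted labels:", clf.predict(X[:5]))

# Unsupervised: only features are available; the model discovers groups on its own
X_unlabelled, _ = make_blobs(n_samples=200, centers=3, random_state=0)
km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X_unlabelled)
print("Cluster assignments:", km.labels_[:5])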
4. Explain the bias-variance trade-off in machine learning. How does it relate to model complexity?
Bias: Error from overly simple assumptions in the model (underfitting).
Variance: Error from the model's sensitivity to fluctuations in the training data (overfitting).
Relation to model complexity: Complex models tend to have low bias but high variance, while simple models have high bias but low variance; the goal is to choose a complexity that balances the two.
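An illustrative sketch (synthetic data; the polynomial degrees are chosen arbitrarily) of how training and test error move with model complexity:

import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.3, size=200)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

for degree in (1, 4, 15):
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    train_err = mean_squared_error(y_train, model.predict(X_train))
    test_err = mean_squared_error(y_test, model.predict(X_test))
    # a low degree underfits (high bias); a very high degree can chase noise
    # in the training set (high variance), so its test error tends to grow
    print(f"degree={degree:2d}  train MSE={train_err:.3f}  test MSE={test_err:.3f}")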
5. How does k-means clustering differ from hierarchical clustering? Discuss a scenario where k-means would be more appropriate.
K-Means: Partition-based and fast, but requires the number of clusters to be specified in advance.
Hierarchical: Builds a tree (dendrogram) of clusters and does not require a predefined number of clusters.
Scenario for K-Means: Large datasets where the number of clusters is known or easy to estimate, such as segmenting customers into a fixed number of groups.
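A minimal sketch of the k-means scenario (synthetic "customer" features generated with make_blobs; the cluster count is assumed to be known):

import numpy as np
from sklearn.datasets import make_blobs
from sklearn.cluster import KMeans

# Synthetic customer features (e.g., income vs. spending) with 3 natural groups
X, _ = make_blobs(n_samples=300, centers=3, cluster_std=1.0, random_state=42)

# k-means needs k up front, but is fast and scales well to large datasets
kmeans = KMeans(n_clusters=3, n_init=10, random_state=42).fit(X)
print("Cluster sizes:", np.bincount(kmeans.labels_))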
6. Given a dataset, perform a basic EDA. Identify potential data quality issues and suggest appropriate cleaning techniques.
Movie rating Dataset
MovieID,Title,Genre,Rating,ReleaseYear
1,Action Movie 1,Action,4.5,2020
2,Comedy Nights,Comedy,3.8,2021
3,Drama Story,Drama,4.2,2019
4,Sci-Fi World,Sci-Fi,4.0,2022
5,Action 2,Action,4.7,2023
6,Comedy Club,Comedy,3.5,2020
7,Drama Time,Drama,4.5,2021
8,Sci-Fi 2049,Sci-Fi,4.8,2017 <-- Inconsistent Release Year
9,Action 3,Action,3.9,2022
10,Comedy Central,Comedy, ,2023 <-- Missing Rating
11,Drama Life,Drama,4.3,2020
12,Sci-Fi X,Sci-Fi,4.1,2021

Issue | Description | Affected Records | Cleaning Suggestions
Missing Values | Missing movie rating | MovieID 10 | Impute the missing value using the genre average or median (Comedy ≈ 3.6).
Inconsistent Data | Unusual release year for Sci-Fi movie (2017) | MovieID 8 | Check the movie title for authenticity; adjust the year if metadata is available.
Duplicate Data | None detected | N/A | N/A
Outliers | None evident based on ratings (range 0 to 5) | N/A | N/A
Data Type Issues | None detected | N/A | N/A

Cleaning Techniques
1. Handle Missing Values:
   o MovieID 10 (Missing Rating): impute with the genre average (Comedy ≈ 3.6) or assign a default such as 3.5 as a likely median for a comedy.
     df.loc[df['MovieID'] == 10, 'Rating'] = 3.6
2. Fix Inconsistent Release Year:
   o MovieID 8 (Release Year 2017): cross-check with metadata or a movie database; if the year is confirmed to be wrong, correct it to a value consistent with the rest of the catalogue (e.g., 2022).
3. Ensure Data Types are Correct:
   o Confirm that Rating is a floating-point column and ReleaseYear is an integer.
4. Data Validation (a consolidated Pandas sketch follows this list):
   o Ensure that Ratings are within valid bounds (0 to 5).
   o Check for outliers in ReleaseYear.
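A possible consolidation of these steps in Pandas (the CSV filename is illustrative; the column names follow the sample dataset above):

import pandas as pd

df = pd.read_csv("movie_ratings.csv")  # hypothetical export of the table above

# Step 1: impute the missing rating with its genre average
df['Rating'] = pd.to_numeric(df['Rating'], errors='coerce')
df['Rating'] = df['Rating'].fillna(df.groupby('Genre')['Rating'].transform('mean'))

# Step 3: enforce the expected data types
df['Rating'] = df['Rating'].astype(float)
df['ReleaseYear'] = df['ReleaseYear'].astype(int)

# Step 4: validate rating bounds and flag suspicious release years
# (the year range below is illustrative, based on the sample data)
assert df['Rating'].between(0, 5).all(), "Rating outside the valid [0, 5] range"
print(df[~df['ReleaseYear'].between(2019, 2023)])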

7. Explain the importance of feature scaling in machine learning. Compare and contrast normalization and standardization. Provide an example where normalization would be preferred over standardization.

Feature scaling is crucial in machine learning for the following reasons:


1. Improves Model Performance: Many machine learning models rely on the distance between
data points (e.g., k-NN, SVMs). If features are on different scales, the model may give undue
importance to features with larger scales.
2. Speeds Up Convergence: Gradient descent converges faster when features are scaled because
it avoids the problem of zig-zagging during optimization.
3. Ensures Equal Weighting: Without scaling, features with larger numerical ranges may
dominate the training process.
4. Required for Distance-Based Models: Algorithms such as k-means and hierarchical clustering
are highly sensitive to the scale of features.
Normalization vs. Standardization

Aspect | Normalization (Min-Max Scaling) | Standardization (Z-score Scaling)
Formula | x′ = (x − x_min) / (x_max − x_min) | x′ = (x − μ) / σ
Range | Scales data to [0, 1] | Centers data to mean 0 and standard deviation 1
Effect on Outliers | Sensitive to outliers | Less sensitive, since scaling uses the standard deviation
Use Case | When data is bounded or needs to fit within a specific range | When data follows a Gaussian (normal) distribution or the model assumes normally distributed data
Examples | Image pixel scaling | Linear Regression, Logistic Regression

Example where normalization is preferred: image pixel intensities are naturally bounded (0-255), so min-max scaling to [0, 1] preserves that bounded range.
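As a hedged illustration of the two transforms (assuming scikit-learn; the toy column below is made up):

import numpy as np
from sklearn.preprocessing import MinMaxScaler, StandardScaler

X = np.array([[10.0], [20.0], [30.0], [1000.0]])  # note the large value

print(MinMaxScaler().fit_transform(X).ravel())    # values rescaled to the [0, 1] range
print(StandardScaler().fit_transform(X).ravel())  # values centered to mean 0 with unit variance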
8. Write Python code using Pandas to perform the following tasks: (a) Calculate the total spending for each customer. (b) Identify the top 10 most frequent customers. (c) Group the transactions by product category and calculate the average price for each category.

import pandas as pd

# Sample dataset
data = {
    'CustomerID': [101, 102, 101, 103, 102, 104, 101, 105, 104, 102],
    'ProductCategory': ['Electronics', 'Clothing', 'Electronics', 'Furniture', 'Clothing',
                        'Electronics', 'Furniture', 'Electronics', 'Clothing', 'Furniture'],
    'Price': [100.0, 50.0, 120.0, 250.0, 60.0, 110.0, 200.0, 90.0, 55.0, 300.0]
}

# Creating a DataFrame
df = pd.DataFrame(data)
print("Original DataFrame:")
print(df)

# (a) Calculate the total spending for each customer


total_spending = df.groupby('CustomerID')['Price'].sum().reset_index()
total_spending.columns = ['CustomerID', 'TotalSpending']
print("\nTotal Spending for Each Customer:")
print(total_spending)

# (b) Identify the top 10 most frequent customers


customer_frequency = df['CustomerID'].value_counts().head(10).reset_index()
customer_frequency.columns = ['CustomerID', 'Frequency']
print("\nTop 10 Most Frequent Customers:")
print(customer_frequency)

# (c) Group the transactions by product category and calculate the average price for each category
average_price_per_category = df.groupby('ProductCategory')['Price'].mean().reset_index()
average_price_per_category.columns = ['ProductCategory', 'AveragePrice']
print("\nAverage Price per Product Category:")
print(average_price_per_category)
9. You are given a dataset containing customer information, including age, income, and email addresses. Some age values are missing, some income values are negative (representing errors), and some email addresses are invalid. Describe a step-by-step process for cleaning this data using Python libraries like Pandas and NumPy. Include specific examples of code snippets you would use for imputation (for missing ages), handling negative incomes, and validating email addresses.

Step-by-Step Data Cleaning Process

1. Load the Dataset

Start by reading the dataset into a Pandas DataFrame.

import pandas as pd

# Load the dataset


df = pd.read_csv("customer_data.csv")

# Check the first few rows


print(df.head())

2. Handling Missing Age Values

Strategy: Impute missing ages using the median of the age column.
import numpy as np

# Check for missing values in the 'age' column


print("Missing Age Values:", df['age'].isna().sum())

# Impute missing ages with the median


df['age'] = df['age'].fillna(df['age'].median())

3. Handling Negative Income Values

Strategy: Set negative values to NaN and impute with the median income.

Ensure all values are positive afterward.

# Replace negative income values with NaN


df.loc[df['income'] < 0, 'income'] = np.nan
# Impute missing income with the median
df['income'] = df['income'].fillna(df['income'].median())

4. Validating Email Addresses

Strategy: Use regex to validate email formats and drop invalid entries.
import re

# Define a function to validate email addresses


def is_valid_email(email):
    pattern = r'^[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}$'
    return bool(re.match(pattern, str(email)))

# Create a mask for valid emails


df['valid_email'] = df['email'].apply(is_valid_email)

# Remove rows with invalid emails


df = df[df['valid_email']].drop(columns=['valid_email'])

5. Detect and Handle Outliers (Optional)

Use the IQR method for further cleanup.

# Compute IQR for income


Q1 = df['income'].quantile(0.25)
Q3 = df['income'].quantile(0.75)
IQR = Q3 - Q1

# Remove outliers outside 1.5 * IQR


df = df[~((df['income'] < (Q1 - 1.5 * IQR)) | (df['income'] > (Q3 + 1.5 * IQR)))]

6. Save the Cleaned Data


# Save the cleaned DataFrame
df.to_csv("cleaned_customer_data.csv", index=False)

10. You are given a dataset of sales transactions (you would provide a sample dataset). Develop a Python script using Pandas, NumPy, Matplotlib, and Seaborn to perform a comprehensive exploratory data analysis. Your script should include:
 Data loading and initial inspection.
 Handling missing values using appropriate imputation techniques.
 Identifying and handling outliers using suitable methods.

import pandas as pd

import numpy as np

import matplotlib.pyplot as plt

import seaborn as sns

# Illustrative sample data (the real transactions file would be provided;
# the column names and values below are assumptions for demonstration)
sample_data = {
    'TransactionID': [1, 2, 3, 4, 5, 6, 7, 8],
    'Date': ['2023-01-05', '2023-01-06', '2023-01-07', '2023-01-08',
             '2023-01-09', '2023-01-10', '2023-01-11', '2023-01-12'],
    'CustomerID': [101, 102, None, 103, 101, 104, 105, 102],
    'Product': ['A', 'B', 'A', 'C', 'A', 'B', 'A', 'C'],
    'Quantity': [2, 1, None, 5, 3, 2, 4, 1],
    'Revenue': [200.0, 150.0, 220.0, 250.0, 300.0, 180.0, 350.0, 5000.0],
}

# Load the dataset
df = pd.DataFrame(sample_data)

# Convert Date column to datetime
df['Date'] = pd.to_datetime(df['Date'])

print("Initial Dataset:")

print(df.head())

# Handling Missing Values

print("\nMissing Values Before Handling:")

print(df.isna().sum())

# Impute missing CustomerID with forward fill
df['CustomerID'] = df['CustomerID'].ffill()

# Impute missing Quantity with the median
df['Quantity'] = df['Quantity'].fillna(df['Quantity'].median())

print("\nMissing Values After Handling:")

print(df.isna().sum())

# Identifying and Handling Outliers

plt.figure(figsize=(10, 5))

sns.boxplot(data=df[['Quantity', 'Revenue']])

plt.title("Boxplot for Quantity and Revenue")

plt.show()

# Handle outliers using the IQR method for Revenue

Q1 = df['Revenue'].quantile(0.25)

Q3 = df['Revenue'].quantile(0.75)

IQR = Q3 - Q1

lower_bound = Q1 - 1.5 * IQR

upper_bound = Q3 + 1.5 * IQR

# Filter out outliers


df = df[(df['Revenue'] >= lower_bound) & (df['Revenue'] <= upper_bound)]

print("\nDataset after outlier handling:")

print(df)

11. You are given a dataset of sales transactions (you would provide a sample dataset). Develop a Python script using Pandas, NumPy, Matplotlib, and Seaborn to perform a comprehensive exploratory data analysis. Your script should include:
 Data transformation (if necessary).
 Visualizations to explore relationships between variables and identify trends.
 A brief report summarizing your findings and insights from the data.

import pandas as pd

import numpy as np

import matplotlib.pyplot as plt

import seaborn as sns


# ------------------------------

# Data Transformation

# ------------------------------

# Example: Create a new column for Revenue per Quantity

print("\nData Transformation: Revenue per Quantity")

df['Revenue_per_Quantity'] = df['Revenue'] / df['Quantity']

print(df[['TransactionID', 'Revenue_per_Quantity']])

# ------------------------------

# Visualization

# ------------------------------

# Line plot for revenue trend over time

plt.figure(figsize=(12, 6))

sns.lineplot(x='Date', y='Revenue', data=df, marker='o')

plt.title("Revenue Trend Over Time")

plt.show()
# Bar plot for product-wise revenue

plt.figure(figsize=(10, 5))

sns.barplot(x='Product', y='Revenue', data=df, estimator=np.sum)

plt.title("Total Revenue by Product")

plt.show()

# Heatmap for correlation between numerical features

plt.figure(figsize=(8, 6))

sns.heatmap(df.corr(numeric_only=True), annot=True, cmap='coolwarm', fmt='.2f')

plt.title("Correlation Heatmap")

plt.show()

# ------------------------------

# Report Findings and Insights

# ------------------------------

print("\nInsights from the Analysis:")

print("1. No more missing values in the dataset after appropriate imputation.")

print("2. Outliers in Revenue were detected and removed based on IQR.")

print("3. Quantity and Revenue show a positive correlation.")

print("4. Product A appears to generate higher overall revenue compared to other products.")

print("5. Revenue trend shows a steady increase over time.")

12. Compare and contrast the performance of Decision Trees and Support Vector Machines on a given classification problem (provide a small dataset). Discuss the advantages and disadvantages of each algorithm.

Performance Comparison

Aspect | Decision Tree | Support Vector Machine (SVM)
Training Speed | Faster for small datasets | Slower for large datasets
Model Complexity | Easy to interpret and visualize | Complex, harder to visualize
Decision Boundaries | Non-linear decision boundaries possible | Linear or non-linear (using kernels)
Overfitting Risk | High if not pruned | Less prone to overfitting
Sensitivity to Noise | Sensitive to noise | More robust due to margin optimization
Efficiency on Large Datasets | Good with large datasets | Slower for large datasets
Advantages and Disadvantages
Decision Trees
Advantages:
 Easy to understand and interpret.
 Fast for small to medium datasets.
 Supports non-linear decision boundaries.
Disadvantages:
 Prone to overfitting if not pruned.
 Sensitive to small data variations.
 Can be unstable with changes in data.
Support Vector Machines (SVM)
Advantages:
 Effective for high-dimensional spaces.
 Robust to outliers due to margin optimization.
 Can model complex decision boundaries using kernels.
Disadvantages:
 Slower training for large datasets.
 Requires careful kernel selection and parameter tuning.
 Less interpretable compared to decision trees.
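Since the question leaves the dataset open, the sketch below uses scikit-learn's built-in Iris data as a stand-in small dataset; the hyperparameters are illustrative, not prescribed by the answer key.

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

# Train both classifiers on the same split and compare held-out accuracy
for name, model in [("Decision Tree", DecisionTreeClassifier(max_depth=3, random_state=0)),
                    ("SVM (RBF kernel)", SVC(kernel='rbf', C=1.0))]:
    model.fit(X_train, y_train)
    acc = accuracy_score(y_test, model.predict(X_test))
    print(f"{name}: test accuracy = {acc:.3f}")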
13. Explain the concept of hierarchical clustering. How does it differ from k-means clustering?

Hierarchical clustering is a method of clustering that builds a hierarchy of clusters by either starting
from individual data points and merging them (agglomerative approach) or starting with a single cluster
and dividing it into smaller clusters (divisive approach).
Types of Hierarchical Clustering
1. Agglomerative Clustering (Bottom-Up):
o Starts with each data point as an individual cluster.
o Iteratively merges the closest clusters until a single cluster is formed.
2. Divisive Clustering (Top-Down):
o Starts with all points in a single cluster.
o Splits clusters recursively until each data point is its own cluster.
Key Steps in Agglomerative Clustering
1. Compute a distance matrix between all points (e.g., Euclidean distance).
2. Merge the two closest clusters.
3. Update the distance matrix to reflect the new cluster.
4. Repeat steps 2 and 3 until a single cluster remains.
Differences Between Hierarchical Clustering and K-Means

Aspect | Hierarchical Clustering | K-Means Clustering
Cluster Formation | Builds a hierarchy | Partitions data into k clusters
Input Parameters | No need for a predefined k | Requires k beforehand
Algorithm Type | Deterministic | Non-deterministic (depends on initialization)
Data Structure | Dendrogram | Centroid-based clustering
Performance | Slower for large datasets | Faster for large datasets
Scalability | Poor for large datasets | Good for large datasets
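A small illustrative sketch of agglomerative clustering and its dendrogram using SciPy (synthetic two-cluster data; Ward linkage is chosen just for the example):

import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram, fcluster

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=0, scale=0.5, size=(20, 2)),
               rng.normal(loc=5, scale=0.5, size=(20, 2))])

# Build the merge tree bottom-up (agglomerative) using Ward linkage
Z = linkage(X, method='ward')

# Cut the tree to obtain flat cluster labels (here: 2 clusters)
labels = fcluster(Z, t=2, criterion='maxclust')
print("Cluster sizes:", np.bincount(labels)[1:])

# The dendrogram visualizes the full hierarchy of merges
dendrogram(Z)
plt.title("Agglomerative Clustering Dendrogram")
plt.show()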
14. Explain the concept of boosting. How does it differ from bagging? Describe the AdaBoost algorithm and its key steps.
Concept of Boosting
Boosting is an ensemble learning technique that combines weak learners (usually simple models like
decision trees) sequentially to create a strong predictive model. Each weak learner focuses on improving
the mistakes made by the previous ones.
Key Idea of Boosting
 Assign higher weights to data points that were misclassified by previous models.
 Train subsequent models to correct these errors.
 Aggregate the predictions of all weak learners for the final decision.
Boosting vs. Bagging

Aspect | Boosting | Bagging
Model Combination | Sequential | Parallel
Focus | Correcting errors of previous learners | Reducing variance by averaging predictions
Complexity | Higher due to sequential training | Lower due to parallel training
Overfitting Risk | Higher if not regularized | Lower due to averaging
Example Algorithms | AdaBoost, Gradient Boosting | Random Forest, Bootstrap Aggregation
AdaBoost (Adaptive Boosting) Algorithm
AdaBoost is one of the earliest and most popular boosting algorithms. It focuses on improving the
classification performance by adjusting weights based on misclassification.
Key Steps of the AdaBoost Algorithm (labels taken as y_i ∈ {−1, +1})
1. Initialize Weights:
   Assign equal weights to all data points: w_i = 1/N, where N is the total number of training examples.
2. Train Weak Learner:
   Fit a weak learner h_t (such as a decision stump) to the weighted dataset.
3. Compute Error:
   Calculate the weighted error of the weak learner: ε_t = Σ_i w_i · 1(h_t(x_i) ≠ y_i).
4. Compute Alpha (Model Weight):
   Calculate the contribution of the weak learner: α_t = ½ · ln((1 − ε_t) / ε_t).
5. Update Weights:
   Increase the weights of misclassified points and decrease those of correctly classified ones: w_i ← w_i · exp(−α_t · y_i · h_t(x_i)), then normalize the weights to sum to 1.
6. Repeat:
   Repeat steps 2 to 5 for a specified number of iterations or until the error converges.
Final Model:
Combine the weak learners' predictions using their weights: H(x) = sign(Σ_t α_t · h_t(x)).
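A brief sketch of these steps using scikit-learn's AdaBoostClassifier (synthetic data; the number of estimators is illustrative, and the default weak learner is already a depth-1 decision stump):

from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# 100 boosting rounds; each round reweights the data toward previous mistakes
ada = AdaBoostClassifier(n_estimators=100, random_state=0)
ada.fit(X_train, y_train)
print("Test accuracy:", ada.score(X_test, y_test))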

You might also like