
Clustering with Neural Networks using Hugging Face Datasets

Assignment Prepared By
Moin Mostakim
April 25, 2025

1. Introduction
This document outlines the process and requirements to develop a neural network model
for clustering using open datasets available on Hugging Face. Clustering is an unsupervised
task where the model identifies intrinsic groupings in the data without labeled outputs.

2. Objective
To design and implement a neural network capable of performing clustering on a selected
Hugging Face dataset, evaluated with metrics such as the Silhouette Score, the Davies-Bouldin
Index, or visual cluster separation.

3. Requirements

3.1 Software and Tools


• Python (3.8+)
• PyTorch or TensorFlow
• Hugging Face datasets library
• Scikit-learn
• Matplotlib / Seaborn for visualization
• CUDA-enabled GPU (optional, for training acceleration)

3.2 Libraries
pip install torch torchvision datasets transformers scikit-learn matplotlib


3.3 Dataset
Choose a dataset from Hugging Face: https://huggingface.co/datasets
Examples (a minimal loading sketch follows this list):

• ag_news — for clustering news articles

• glue/sst2 — for sentence-level semantic clustering

• mnist — for image-based clustering
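As a quick starting point, the sketch below loads one of these datasets with the Hugging Face datasets library; the ag_news split and column names used here follow the dataset card.

from datasets import load_dataset

# Load ag_news; it comes as a DatasetDict with "train" and "test" splits
dataset = load_dataset("ag_news")
texts = dataset["train"]["text"][:1000]   # small subset for quick experiments
print(len(texts), texts[0][:80])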

4. Neural Network Design


Clustering with neural networks involves encoding data into a compact latent space where
similar instances are closer together. This section describes the model architecture depending
on the input modality.

4.1 Input Preprocessing


• Text Data: Use pre-trained transformers (e.g., BERT, RoBERTa) or sentence embeddings (e.g., Sentence-BERT) to convert text into dense vector representations (a minimal sketch follows this list).

• Image Data: Use convolutional encoders or pre-trained feature extractors (e.g., ResNet, VGG) to obtain high-dimensional embeddings.

• Tabular Data: Normalize features and use dense feed-forward layers.
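For text data, a minimal sketch of mean-pooled transformer embeddings with the transformers library could look like the following; the bert-base-uncased checkpoint and the mean-pooling strategy are illustrative assumptions (a Sentence-BERT model works equally well).

import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoder = AutoModel.from_pretrained("bert-base-uncased")

sentences = ["Stocks rallied on Friday.", "The team won the championship."]
batch = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")

with torch.no_grad():
    hidden = encoder(**batch).last_hidden_state        # (batch, seq_len, 768)
    mask = batch["attention_mask"].unsqueeze(-1)        # mask out padding tokens
    embeddings = (hidden * mask).sum(1) / mask.sum(1)   # mean-pooled sentence vectors

print(embeddings.shape)   # torch.Size([2, 768])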

4.2 Architecture Overview


We design an encoder network to project the input into a lower-dimensional latent space
suitable for clustering. The core architectures include:

4.2.1 Autoencoder-based Clustering


• Encoder: Several dense or convolutional layers reducing dimensionality.

• Latent Space: Bottleneck layer representing data embedding.

• Decoder: Mirror of the encoder for reconstruction (used only during training).

• Loss: Combination of reconstruction loss and clustering-oriented loss.

L = L_{\text{reconstruction}} + \lambda \cdot L_{\text{clustering}}
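A minimal autoencoder sketch for this setup is shown below; the layer sizes, the weight λ, and the placeholder clustering term are illustrative assumptions (the clustering term would come from the chosen method, e.g., the DEC loss in the next section).

import torch
import torch.nn as nn

class AutoEncoder(nn.Module):
    def __init__(self, input_dim=784, latent_dim=64):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(input_dim, 256), nn.ReLU(),
            nn.Linear(256, latent_dim),      # bottleneck embedding
        )
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, 256), nn.ReLU(),
            nn.Linear(256, input_dim),       # reconstruction of the input
        )

    def forward(self, x):
        z = self.encoder(x)
        return z, self.decoder(z)

model = AutoEncoder()
recon_criterion = nn.MSELoss()
lam = 0.1                                    # weight of the clustering term

x = torch.rand(32, 784)                      # dummy batch of flattened images
z, x_hat = model(x)
clustering_loss = torch.tensor(0.0)          # placeholder for a clustering-oriented loss
loss = recon_criterion(x_hat, x) + lam * clustering_loss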


4.2.2 Deep Embedding Clustering (DEC)


• Uses a deep autoencoder to learn representations.

• Learns a probability distribution over clusters using a Student’s t-distribution.

• Loss is the KL divergence between soft cluster assignments and an auxiliary target distribution:

L = \mathrm{KL}(P \,\|\, Q) = \sum_i \sum_j p_{ij} \log \frac{p_{ij}}{q_{ij}}

Where:

q_{ij} = \frac{\left(1 + \|z_i - \mu_j\|^2 / \alpha\right)^{-\frac{\alpha+1}{2}}}{\sum_k \left(1 + \|z_i - \mu_k\|^2 / \alpha\right)^{-\frac{\alpha+1}{2}}}

p_{ij} = \frac{q_{ij}^2 / \sum_i q_{ij}}{\sum_k \left( q_{ik}^2 / \sum_i q_{ik} \right)}
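A minimal sketch of these two quantities in PyTorch follows; α = 1 (the value used in the original DEC paper) and the tensor shapes are illustrative assumptions.

import torch
import torch.nn.functional as F

def soft_assignments(z, mu, alpha=1.0):
    # q_ij: Student's t kernel between embedding i and center j, normalized over j
    dist_sq = torch.cdist(z, mu) ** 2                      # (n, k) squared distances
    q = (1.0 + dist_sq / alpha) ** (-(alpha + 1.0) / 2.0)
    return q / q.sum(dim=1, keepdim=True)

def target_distribution(q):
    # p_ij: square q and renormalize by the soft cluster frequencies
    weight = q ** 2 / q.sum(dim=0)
    return weight / weight.sum(dim=1, keepdim=True)

z = torch.randn(128, 64)     # batch of latent embeddings
mu = torch.randn(10, 64)     # 10 cluster centers (e.g., initialized with K-Means)
q = soft_assignments(z, mu)
p = target_distribution(q)
kl_loss = F.kl_div(q.log(), p, reduction="batchmean")      # KL(P || Q)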

4.2.3 Siamese Network for Contrastive Clustering


• Learns whether two samples are similar or dissimilar.

• Each branch processes a different input and compares their embeddings.

• Loss: Contrastive loss:

L = y \cdot \|z_1 - z_2\|^2 + (1 - y) \cdot \max(0,\, m - \|z_1 - z_2\|)^2
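A minimal implementation sketch of this loss; the margin m and the random stand-in embeddings are illustrative assumptions.

import torch

def contrastive_loss(z1, z2, y, m=1.0):
    # y = 1 for similar pairs, 0 for dissimilar pairs
    d = torch.norm(z1 - z2, dim=1)                           # Euclidean distance per pair
    similar = y * d ** 2                                     # pull similar pairs together
    dissimilar = (1 - y) * torch.clamp(m - d, min=0) ** 2    # push dissimilar pairs apart
    return (similar + dissimilar).mean()

z1, z2 = torch.randn(16, 64), torch.randn(16, 64)   # embeddings from the two branches
y = torch.randint(0, 2, (16,)).float()              # pair labels
print(contrastive_loss(z1, z2, y))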

4.2.4 Triplet Network for Semantic Similarity


• Trained on triplets: anchor, positive, and negative examples.

• Embedding of anchor should be closer to positive than to negative.

• Loss: Triplet loss:

L = \max\left(0,\; \|f(x_a) - f(x_p)\|^2 - \|f(x_a) - f(x_n)\|^2 + \alpha\right)
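PyTorch provides nn.TripletMarginLoss for this objective; a minimal usage sketch is shown below, where the margin and the random stand-in embeddings are illustrative assumptions.

import torch
import torch.nn as nn

triplet_criterion = nn.TripletMarginLoss(margin=1.0, p=2)

# Stand-ins for encoder outputs of anchor, positive, and negative samples
anchor = torch.randn(32, 64)
positive = torch.randn(32, 64)
negative = torch.randn(32, 64)

loss = triplet_criterion(anchor, positive, negative)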




4.3 Embedding Size and Clustering


• The output of the encoder is a fixed-length embedding vector (e.g., 64 or 128 dimensions).

• Embeddings are passed to clustering algorithms such as K-Means or DBSCAN (see the sketch after this list).
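A minimal sketch of clustering frozen embeddings with scikit-learn; the number of clusters and the random stand-in embeddings are illustrative assumptions.

import numpy as np
from sklearn.cluster import KMeans

embeddings = np.random.randn(1000, 64)      # encoder outputs, shape (n_samples, dim)
kmeans = KMeans(n_clusters=10, n_init=10, random_state=0)
labels = kmeans.fit_predict(embeddings)     # one cluster id per sample
print(labels[:20])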


4.4 Training Strategy


• Phase 1: Pre-train the encoder (and decoder, if using an autoencoder) with a reconstruction or contrastive loss.

• Phase 2: Fine-tune the embeddings with a clustering loss, or perform clustering on the frozen embeddings (a combined sketch follows).
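Putting the pieces together, a high-level sketch of the two phases is shown below; it reuses the AutoEncoder and the DEC helpers sketched earlier, and the epoch counts, learning rate, and loss weight are illustrative assumptions.

import torch
import torch.nn.functional as F

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Phase 1: pre-train with the reconstruction loss only
for epoch in range(20):
    z, x_hat = model(x)
    loss = recon_criterion(x_hat, x)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# Phase 2: fine-tune with the clustering (KL) loss added on top
for epoch in range(10):
    z, x_hat = model(x)
    q = soft_assignments(z, mu)
    p = target_distribution(q).detach()      # keep the target fixed within the step
    loss = recon_criterion(x_hat, x) + 0.1 * F.kl_div(q.log(), p, reduction="batchmean")
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()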

1 Visualization

The overall pipeline:

Hugging Face Dataset → Data Preprocessing (Tokenization / Normalization) → Feature Extraction / Embedding → Encoder Network (e.g., Autoencoder, BERT, CNN) → Latent Embedding Space (visualized with t-SNE / PCA) → Clustering Algorithm (K-Means / DBSCAN / etc.) → Evaluation Metrics (Silhouette / DB Index / CH Index)


5. Clustering Algorithm
Apply a clustering algorithm to latent embeddings:

• K-Means

• DBSCAN

• Hierarchical Clustering

6. Evaluation Metrics
• Silhouette Score

• Davies-Bouldin Index

• Calinski-Harabasz Index

• Visual inspection via t-SNE or PCA
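A minimal sketch of computing these scores with scikit-learn and visualizing the embeddings; the random stand-in embeddings and the cluster count are illustrative assumptions.

import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans
from sklearn.manifold import TSNE
from sklearn.metrics import silhouette_score, davies_bouldin_score, calinski_harabasz_score

embeddings = np.random.randn(1000, 64)      # stand-in for encoder outputs
labels = KMeans(n_clusters=10, n_init=10, random_state=0).fit_predict(embeddings)

print("Silhouette:        ", silhouette_score(embeddings, labels))
print("Davies-Bouldin:    ", davies_bouldin_score(embeddings, labels))
print("Calinski-Harabasz: ", calinski_harabasz_score(embeddings, labels))

# 2-D projection for visual inspection of cluster separation
coords = TSNE(n_components=2, random_state=0).fit_transform(embeddings)
plt.scatter(coords[:, 0], coords[:, 1], c=labels, s=5)
plt.show()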

7. Expected Output
• Clustered representations

• Visualization of cluster separations

• Quantitative metric scores

2 Sample Code Introduction


This section outlines the basic structure of a neural network built using PyTorch. The
example provided is a simple Multi-Layer Perceptron (MLP) applied to the MNIST dataset.

3 Import Required Libraries


We begin by importing necessary libraries from PyTorch and Torchvision.
import torch
import torch.nn as nn
import torch.optim as optim
from torch.utils.data import DataLoader
import torchvision.transforms as transforms
import torchvision.datasets as datasets


4 Define the Neural Network


We define a basic MLP with one hidden layer using nn.Sequential for clarity and modularity.

class MyMLP(nn.Module):
    def __init__(self, input_size, hidden_size, output_size):
        super(MyMLP, self).__init__()
        self.net = nn.Sequential(
            nn.Linear(input_size, hidden_size),
            nn.ReLU(),
            nn.Linear(hidden_size, output_size)
        )

    def forward(self, x):
        return self.net(x)

5 Set Hyperparameters
Set the size of layers, learning rate, batch size, and number of training epochs.
input_size = 784      # 28x28 images flattened
hidden_size = 128
output_size = 10      # Number of classes in MNIST
learning_rate = 0.001
batch_size = 64
epochs = 5

6 Load Dataset and Create Dataloaders


Here we use the MNIST dataset and apply transformations such as normalization.
transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize((0.5,), (0.5,))
])

train_dataset = datasets.MNIST(root='./data', train=True, transform=transform, download=True)
test_dataset = datasets.MNIST(root='./data', train=False, transform=transform)

train_loader = DataLoader(train_dataset, batch_size=batch_size, shuffle=True)
test_loader = DataLoader(test_dataset, batch_size=batch_size, shuffle=False)

7 Initialize Model, Loss Function, and Optimizer


Prepare the model for training and evaluation on GPU if available.


device = torch . device ( " cuda " if torch . cuda . is_available () else " cpu " )
model = MyMLP ( input_size , hidden_size , output_size ) . to ( device )
criterion = nn . CrossEntropyLoss ()
optimizer = optim . Adam ( model . parameters () , lr = learning_rate )

8 Training Loop
Perform training for the specified number of epochs.
for epoch in range(epochs):
    model.train()
    for batch_idx, (data, targets) in enumerate(train_loader):
        data = data.view(data.size(0), -1).to(device)   # flatten 28x28 images to 784
        targets = targets.to(device)

        scores = model(data)
        loss = criterion(scores, targets)

        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

    print(f"Epoch [{epoch+1}/{epochs}], Loss: {loss.item():.4f}")

9 Evaluation
Evaluate the trained model on the test dataset.
model.eval()
correct = 0
total = 0
with torch.no_grad():
    for data, targets in test_loader:
        data = data.view(data.size(0), -1).to(device)
        targets = targets.to(device)

        outputs = model(data)
        _, predicted = torch.max(outputs.data, 1)
        total += targets.size(0)
        correct += (predicted == targets).sum().item()

print(f"Test Accuracy: {100 * correct / total:.2f}%")

References
• Hugging Face Datasets: https://huggingface.co/docs/datasets/index
• PyTorch: https://pytorch.org
