BERT Fine-Tuning

1. Fine-Tuning BERT for Classification using a local machine with an internal GPU (Reference)

STEP 1: INSTALLING REQUIRED PACKAGES


1. Transformers: Transformers (formerly known as pytorch-transformers and pytorch-pretrained-bert)
provides general-purpose architectures (BERT, GPT-2, RoBERTa, XLM, DistilBert, XLNet, ...) for Natural
Language Understanding (NLU) and Natural Language Generation (NLG), with 32+ pretrained models
in 100+ languages and deep interoperability between TensorFlow 2.0 and PyTorch.
pip install transformers

2. PyTorch:

pip install torch===1.6.0 torchvision===0.7.0 -f https://download.pytorch.org/whl/torch_stable.html

STEP 2: PREPROCESSING DATA using transformers


Import the BERT model and the BERT tokenizer from transformers. The tokenizer is used to generate tokens
from the text data for further processing.
from transformers import AutoModel, BertTokenizerFast
bert = AutoModel.from_pretrained('bert-base-uncased')
tokenizer = BertTokenizerFast.from_pretrained('bert-base-uncased')

STEP 3: TOKENIZING AND ENCODING DATA using transformers


Tokenize and encode the sequences of the train and test data using tokenizer.batch_encode_plus(),
then convert the integer sequences to tensors using torch.tensor(), as in the sketch below.
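
A minimal sketch of this step, assuming the data was already split into train_text, train_labels, test_text and test_labels (placeholder names, not from the original), a maximum length of 25 tuned for the dataset, and transformers 3.x or later for the padding/truncation arguments:

import torch

# tokenize and encode the training and test sets
tokens_train = tokenizer.batch_encode_plus(
    list(train_text),
    max_length=25,          # assumed maximum sequence length
    padding='max_length',
    truncation=True)

tokens_test = tokenizer.batch_encode_plus(
    list(test_text),
    max_length=25,
    padding='max_length',
    truncation=True)

# convert the integer sequences to tensors
train_seq = torch.tensor(tokens_train['input_ids'])
train_mask = torch.tensor(tokens_train['attention_mask'])
train_y = torch.tensor(list(train_labels))

test_seq = torch.tensor(tokens_test['input_ids'])
test_mask = torch.tensor(tokens_test['attention_mask'])
test_y = torch.tensor(list(test_labels))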

STEP 4: Freeze BERT Parameters
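
A minimal sketch of this step, assuming bert is the AutoModel loaded in STEP 2: freezing the pretrained weights so that only the layers we add on top are updated during training.

# freeze all parameters of the pretrained BERT model loaded in STEP 2
for param in bert.parameters():
    param.requires_grad = False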


STEP 5: Defining Model
Create your own neural network architecture, pass the pre-trained BERT model into that architecture, and
push the model to the GPU. The architecture can be defined with sequential layers, dropouts and
activations. Lastly, append an optimizer from Hugging Face transformers.

If the dataset is imbalanced, we can use class weights from sklearn.utils.class_weight.


Compute the class weights, convert them to a tensor using the torch.tensor function, and set an
appropriate number of epochs, as in the sketch below.
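
A minimal sketch of such an architecture and the class-weight setup, assuming a two-class task and the placeholder names from the earlier steps (BERT_Arch, train_labels, the layer sizes and the learning rate are illustrative, not from the original; in newer transformers versions torch.optim.AdamW can be used instead of transformers.AdamW):

import numpy as np
import torch
import torch.nn as nn
from sklearn.utils.class_weight import compute_class_weight
from transformers import AdamW

class BERT_Arch(nn.Module):
    def __init__(self, bert):
        super().__init__()
        self.bert = bert                      # the frozen pretrained BERT model
        self.dropout = nn.Dropout(0.1)        # dropout to reduce overfitting
        self.relu = nn.ReLU()
        self.fc1 = nn.Linear(768, 512)        # 768 = hidden size of bert-base
        self.fc2 = nn.Linear(512, 2)          # 2 output classes (assumed)
        self.softmax = nn.LogSoftmax(dim=1)

    def forward(self, sent_id, mask):
        outputs = self.bert(sent_id, attention_mask=mask)
        cls_hs = outputs[1]                   # pooled representation of the [CLS] token
        x = self.dropout(self.relu(self.fc1(cls_hs)))
        return self.softmax(self.fc2(x))

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
model = BERT_Arch(bert).to(device)            # push the model to the GPU

# optimizer from Hugging Face transformers
optimizer = AdamW(model.parameters(), lr=2e-5)

# class weights for an imbalanced dataset, converted to a tensor
class_weights = compute_class_weight('balanced',
                                     classes=np.unique(train_labels),
                                     y=train_labels)
weights = torch.tensor(class_weights, dtype=torch.float).to(device)
loss_fn = nn.NLLLoss(weight=weights)
epochs = 3                                    # set an appropriate number of epochs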

STEP 6: FineTuning BERT


When fine-tuning with PyTorch, we need to write our own training function to perform the fine-tuning, for example the sketch below.
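
A minimal sketch of such a function, assuming a train_dataloader that yields (sent_id, mask, labels) batches built from the tensors of STEP 3, plus the model, optimizer, loss_fn, device and epochs defined in STEP 5 (all placeholder names):

import torch

def train_one_epoch(model, train_dataloader, optimizer, loss_fn, device):
    model.train()
    total_loss = 0.0
    for sent_id, mask, labels in train_dataloader:
        sent_id, mask, labels = sent_id.to(device), mask.to(device), labels.to(device)
        optimizer.zero_grad()
        preds = model(sent_id, mask)
        loss = loss_fn(preds, labels)
        loss.backward()
        torch.nn.utils.clip_grad_norm_(model.parameters(), 1.0)   # avoid exploding gradients
        optimizer.step()
        total_loss += loss.item()
    return total_loss / len(train_dataloader)

for epoch in range(epochs):
    avg_loss = train_one_epoch(model, train_dataloader, optimizer, loss_fn, device)
    print(f'Epoch {epoch + 1}: training loss = {avg_loss:.4f}')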

2. Fine-Tuning BERT for Classification on the GCP Platform with Cloud TPUs (Reference)

STEP 1: Create a Cloud Storage bucket to hold the dataset and model output.

This section covers setting up a Cloud Storage bucket and a Compute Engine VM. Open a Cloud Shell window.

1. Create a variable for your project's ID.
export PROJECT_ID=project-id

2. Configure gcloud command-line tool to use the project where you want to create Cloud TPU.

gcloud config set project ${PROJECT_ID}

3. Create a Cloud Storage bucket using the following command:


gsutil mb -p ${PROJECT_ID} -c standard -l us-central1 -b on gs://bucket-name

4. Launch a Compute Engine VM and Cloud TPU using the ctpu up command.

$ ctpu up --tpu-size=v3-8 \
--machine-type=n1-standard-8 \
--zone=us-central1-b \
--tf-version=1.15.3 \
--name=bert-tutorial

5. The configuration you specified appears. Enter y to approve or n to cancel.


6. When the ctpu up command has finished executing, verify that your shell prompt has changed
from username@project to username@vm-name. This change shows that you are now logged into
your Compute Engine VM.
gcloud compute ssh bert-tutorial --zone=us-central1-b

As you continue these instructions, run each command that begins with (vm)$ in your VM session
window.

STEP 2: Clone the BERT repository and other required files.

Clone the BERT repository

From your Compute Engine virtual machine (VM), clone the BERT repository.

(vm)$ git clone https://github.com/google-research/bert

Download download_glue_data.py

This tutorial uses the General Language Understanding Evaluation (GLUE) benchmark to evaluate
and analyze the performance of the model. To use this benchmark, download
the download_glue_data.py script using the following git clone command:

(vm)$ git clone https://gist.github.com/60c2bdb54d156a41194446737ce03e2e.git download_glue_data

Download the GLUE data

Next, run the download_glue_data.py on your Compute Engine VM.

(vm)$ python3 download_glue_data/download_glue_data.py --data_dir $HOME/glue_data --tasks all

STEP 3: Run the training job
Train the model

From your Compute Engine VM, run the following command. It assumes the environment variables it references (TASK_NAME, GLUE_DIR, BERT_BASE_DIR, STORAGE_BUCKET and TPU_NAME) have already been exported as described in the referenced tutorial.

python3 ./bert/run_classifier.py \
--task_name=${TASK_NAME} \
--do_train=true \
--do_eval=true \
--data_dir=${GLUE_DIR}/${TASK_NAME} \
--vocab_file=${BERT_BASE_DIR}/vocab.txt \
--bert_config_file=${BERT_BASE_DIR}/bert_config.json \
--init_checkpoint=${BERT_BASE_DIR}/bert_model.ckpt \
--max_seq_length=128 \
--train_batch_size=32 \
--learning_rate=2e-5 \
--num_train_epochs=3.0 \
--output_dir=${STORAGE_BUCKET}/${TASK_NAME}-output/ \
--use_tpu=True \
--tpu_name=${TPU_NAME}

STEP 4: Verify the output results.

Verify your results

The training should take less than 5 minutes. When the training completes, you should see results
similar to the following:

***** Eval results *****


eval_accuracy = 0.845588
eval_loss = 0.64990824
global_step = 343
loss = 0.34979442

3. Fine-Tuning BERT for Classification using a local machine with TensorFlow (Reference)

STEP 1: Preprocessing

The BERT layer requires as input an array of sequences with a defined maximum length for each sequence. Create an instance of the BERT FullTokenizer, which requires as input the vocabulary used to train the BERT model. Using the tokenizer, the data preparation is as follows:

Split the data into a training set, training labels, a testing set and testing labels, and shuffle each sentence set. Then tokenize each sentence set using the tokenizer described above, appending the special tokens required, which depends on the dataset type. Finally, one-hot encode each label in the label set. A sketch of this step is shown below.
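
A minimal sketch of this preparation, assuming the bert-for-tf2 package, a multi_cased_L-12_H-768_A-12 checkpoint downloaded under modelBertDir (as in the layer code below), and placeholder names train_sentences / train_labels with integer class ids; the exact FullTokenizer import path can vary between bert-for-tf2 versions.

import os
import numpy as np
from bert.tokenization.bert_tokenization import FullTokenizer   # path may differ by version

bertDir = os.path.join(modelBertDir, "multi_cased_L-12_H-768_A-12")
tokenizer = FullTokenizer(vocab_file=os.path.join(bertDir, "vocab.txt"),
                          do_lower_case=False)   # cased model

max_seq_len = 128        # assumed maximum sequence length

def encode_sentence(sentence):
    # tokenize, add the special tokens, then pad/trim to max_seq_len
    tokens = ["[CLS]"] + tokenizer.tokenize(sentence) + ["[SEP]"]
    ids = tokenizer.convert_tokens_to_ids(tokens)
    return ids[:max_seq_len] + [0] * (max_seq_len - len(ids))

train_x = np.array([encode_sentence(s) for s in train_sentences])

# one-hot encode the labels
num_classes = len(set(train_labels))
train_y = np.eye(num_classes)[train_labels]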

STEP 2: BERT Layer

The BertModelLayer wrapper is used to create the Keras layer. bert_config.json contains all the parameters required for creating the layer. To freeze the original BERT weights wrapped inside the BertModelLayer class, it is better to keep all the BERT parameters non-trainable.

The code for creating the BERT layer is below:


import bert
import os

def createBertLayer():
    global bert_layer

    # directory containing the pretrained BERT checkpoint
    # (modelBertDir is the folder the BERT models were downloaded to)
    bertDir = os.path.join(modelBertDir, "multi_cased_L-12_H-768_A-12")

    # load the pretrained parameters for the BERT model
    bert_params = bert.params_from_pretrained_ckpt(bertDir)

    # create the Keras layer
    bert_layer = bert.BertModelLayer.from_params(bert_params, name="bert")

    # freeze the original BERT weights
    bert_layer.apply_adapter_freeze()

Making the Model
After creating the BERT layer, we can join sequential layers right after it and use dropouts to
reduce overfitting, as in the sketch below.
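
A minimal sketch of such a model, assuming max_seq_len and num_classes from the preprocessing sketch above and the bert_layer produced by createBertLayer(); the layer sizes, dropout rate and learning rate are illustrative, not from the original.

from tensorflow import keras

def createModel():
    global model

    model = keras.Sequential([
        keras.layers.InputLayer(input_shape=(max_seq_len,), dtype='int32', name='input_ids'),
        bert_layer,                                     # frozen BERT layer from createBertLayer()
        keras.layers.Lambda(lambda seq: seq[:, 0, :]),  # keep only the [CLS] token output
        keras.layers.Dropout(0.5),                      # dropout to reduce overfitting
        keras.layers.Dense(256, activation='relu'),
        keras.layers.Dropout(0.5),
        keras.layers.Dense(num_classes, activation='softmax'),
    ])

    model.build(input_shape=(None, max_seq_len))

    # note: the pretrained checkpoint weights still need to be loaded into the
    # built layer, e.g. with bert.load_stock_weights(bert_layer, checkpoint_path)

    model.compile(optimizer=keras.optimizers.Adam(learning_rate=1e-5),
                  loss='categorical_crossentropy',
                  metrics=['accuracy'])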

- Dinesh Yedakula
- 4th Year
- Mechanical Engineering
- IIT Dharwad
