paper 4-- Text_Classification_Based_on_Machine_Learning
ABSTRACT
The paper focuses on improving text classification methods using machine learning (ML) and
natural language processing (NLP) technologies. It introduces the Trusted Platform Module
(TPM) algorithm that combines ML and NLP techniques for better classification performance.
The TPM algorithm achieves over 95% accuracy in experiments distinguishing between spam
and legitimate emails across different datasets.
INTRODUCTION
Text is a crucial way to share and store information, especially in today's digital world with
paperless offices, online libraries, and e-commerce. It serves as a universal medium for
communication and data management.
The paper highlights the challenges posed by the rapid increase in digital text data due to the
global growth of the Internet.
It points out the need for effective text classification technologies to process massive amounts
of data and meet user-specific requirements, with applications in sentiment analysis, opinion
mining, and domain-specific recognition.
TECHNIQUES USED
MODELS USED
RESULTS
1. Combines NLP and ML technologies to enhance text feature extraction and classification
accuracy.
2. TPM uses deep learning techniques (LSTM and CNN) for semantic understanding,
augmented by Bi-GRU for dataset-specific feature extraction.
3. Graph-based semi-supervised learning algorithms support label propagation to minimize
manual data labeling efforts.
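To make point 3 concrete, here is a minimal sketch of graph-based label propagation. It is a generic textbook-style version, not the paper's own implementation, which these notes do not describe. The function name propagate_labels, the edges and seeds inputs, and the toy six-document graph are all hypothetical and chosen only for illustration. Each document is a node, edge weights stand in for text similarity, and only two documents carry a manual label.

```python
def propagate_labels(edges, seeds, n_nodes, n_iter=50):
    """Spread labels from a few seed nodes over a weighted, undirected graph.

    edges: dict mapping (i, j) -> similarity weight (hypothetical input format)
    seeds: dict mapping node index -> class label for hand-labeled nodes
    Returns one predicted label per node.
    """
    classes = sorted(set(seeds.values()))

    # Build a symmetric adjacency list from the edge dictionary.
    neighbors = [[] for _ in range(n_nodes)]
    for (i, j), w in edges.items():
        neighbors[i].append((j, w))
        neighbors[j].append((i, w))

    def one_hot(label):
        return [1.0 if c == label else 0.0 for c in classes]

    # Seeds start one-hot; unlabeled nodes start with a uniform distribution.
    uniform = [1.0 / len(classes)] * len(classes)
    scores = [one_hot(seeds[n]) if n in seeds else list(uniform)
              for n in range(n_nodes)]

    for _ in range(n_iter):
        updated = []
        for n in range(n_nodes):
            if n in seeds:
                # Clamp labeled nodes so their labels never drift.
                updated.append(one_hot(seeds[n]))
                continue
            total = sum(w for _, w in neighbors[n])
            if total == 0:
                # Isolated node: nothing to propagate, keep current scores.
                updated.append(scores[n])
                continue
            # Weighted average of the neighbors' class scores.
            updated.append([
                sum(w * scores[m][k] for m, w in neighbors[n]) / total
                for k in range(len(classes))
            ])
        scores = updated

    # Pick the highest-scoring class for every node.
    return [classes[max(range(len(classes)), key=lambda k: s[k])]
            for s in scores]


# Toy graph (assumed data): documents 0-2 and 3-5 form two tight clusters
# joined by one weak link; only documents 0 and 5 are labeled by hand.
edges = {(0, 1): 1.0, (1, 2): 1.0, (0, 2): 1.0,
         (3, 4): 1.0, (4, 5): 1.0, (3, 5): 1.0,
         (2, 3): 0.1}
seeds = {0: "spam", 5: "ham"}
labels = propagate_labels(edges, seeds, n_nodes=6)
print(labels)  # ['spam', 'spam', 'spam', 'ham', 'ham', 'ham']
```

With two hand-labeled documents out of six, the remaining four inherit the label of the cluster they sit in, which is the sense in which label propagation reduces manual labeling effort. In practice the edge weights would come from a text similarity measure over extracted features; the fixed weights here are placeholders.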
CONCLUSION