Text classification
With Apache Mahout and Lucene
Isabel Drost-Fromm

Software Engineer at Nokia Maps*
Member of the Apache Software Foundation
Co-Founder of Berlin Buzzwords and
Berlin Apache Hadoop GetTogether
Co-founder of Apache Mahout

*We are hiring, talk to me or mail careers@here.com
Text Classification Powered by Apache Mahout and Lucene
https://cwiki.apache.org/confluence/display/MAHOUT/Powered+By+Mahout

… provide your own success story online.
Classification?
January 8, 2008 by Pink Sherbet Photography
http://www.flickr.com/photos/pinksherbet/2177961471/
By freezelight, http://www.flickr.com/photos/63056612@N00/155554663/
http://www.flickr.com/photos/29143375@N05/3344809375/in/photostream/

http://www.flickr.com/photos/redux/409356158/
Image by jasondevilla
http://www.flickr.com/photos/jasondv/91960897/
How a linear classifier sees data
Image by ZapTheDingbat (Light meter)
http://www.flickr.com/photos/zapthedingbat/3028168415
Instance*
(sometimes also called example, item, or in databases a row)
Feature*
(sometimes also called attribute, signal, predictor, co-variate, or column in databases)
Label*
(sometimes also called class, target variable)
Image taken in Lisbon/ Portugal.
● Remove noise.
● Convert text to vectors.
Text consists of terms and phrases.
Encoding issues?
Chinese? Japanese?
“New York” vs. new York?
“go” vs. “going” vs. “went” vs. “gone”?
“go” vs. “Go”?
Terms? Tokens? Wait!
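The questions above are what a tokenizer has to answer before any vector can be built. As a rough Python sketch (a standard-library stand-in for what a Lucene Analyzer would do in this pipeline, not Lucene's API; the function name `tokenize` is made up for illustration), lowercasing settles "go" vs. "Go", while "New York" as a phrase and "go" vs. "went" remain unsolved here:

```python
import re
import unicodedata


def tokenize(text):
    """Normalize Unicode, lowercase, and split on non-word characters."""
    normalized = unicodedata.normalize("NFKC", text).lower()
    # \w+ is Unicode-aware for str patterns in Python 3.
    return re.findall(r"\w+", normalized)


print(tokenize("Go to New York, going fast!"))
```

Stemming ("going" to "go") and phrase detection need language-specific rules or libraries and are deliberately left out of this sketch. Text written without spaces between words would also come out as one long token, which is one reason the Chinese and Japanese questions matter.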
Now we have terms – how to turn them into vectors?
If we looked at two phrases only:
Sunny weather

High performance computing
Aaron

Zuse
Binary bag of words
● Imagine an n-dimensional space.
● Each dimension = one possible word in texts.
● Entry in vector is one, if word occurs in text:
  $b_{i,j} = \begin{cases} 1 & \text{if } x_i \in d_j \\ 0 & \text{else} \end{cases}$
● Problem:
  – How to know all possible terms in unknown text?
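A minimal Python sketch of the binary encoding, assuming whitespace tokenization and a vocabulary built from the training texts only (the names `build_vocabulary` and `binary_vector` are illustrative, not Mahout or Lucene API). It also shows the problem named on the slide: words that were never seen while building the vocabulary have no dimension.

```python
def build_vocabulary(texts):
    """Assign one dimension index to every distinct lowercased token."""
    vocab = {}
    for text in texts:
        for token in text.lower().split():
            vocab.setdefault(token, len(vocab))
    return vocab


def binary_vector(text, vocab):
    """Return a list with 1 where the word occurs in the text, else 0."""
    vector = [0] * len(vocab)
    for token in text.lower().split():
        index = vocab.get(token)
        if index is not None:  # unseen words have no dimension and are dropped
            vector[index] = 1
    return vector


training_texts = ["Sunny weather", "High performance computing"]
vocab = build_vocabulary(training_texts)
print(binary_vector("sunny sunny weather in Berlin", vocab))
```

Here "in" and "Berlin" are silently ignored, and the repeated "sunny" still only yields a one.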
Term Frequency
● Imagine an n-dimensional space.
● Each dimension = one possible word in texts.
● Entry in vector equal to the word's frequency:
  $b_{i,j} = n_{i,j}$
● Problem:
  – Common words dominate vectors.
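To make the problem concrete, here is a small Python sketch that counts raw term frequencies with the standard library (whitespace tokenization and the function name `term_frequencies` are assumptions for illustration):

```python
from collections import Counter


def term_frequencies(text):
    """Count how often each lowercased token occurs in the text."""
    return Counter(text.lower().split())


tf = term_frequencies("the cat and the dog and the bird")
print(tf.most_common(2))
```

The two largest entries belong to "the" and "and", which say nothing about the topic of the text.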
TF with stop wording
● Imagine an n-dimensional space.
● Each dimension = one possible word in texts.
● Filter stopwords.
● Entry in vector equal to the word's frequency:
  $b_{i,j} = n_{i,j}$
● Problem:
  – Common and uncommon words with same weight.
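A short Python sketch of stop wording, assuming a tiny hand-written stopword set (real stopword lists are much longer and language-specific; `STOPWORDS` and `tf_without_stopwords` are illustrative names):

```python
from collections import Counter

# Hypothetical, deliberately tiny stopword list for illustration only.
STOPWORDS = {"the", "and", "a", "of", "in"}


def tf_without_stopwords(text, stopwords=STOPWORDS):
    """Count term frequencies after dropping stopwords."""
    tokens = [t for t in text.lower().split() if t not in stopwords]
    return Counter(tokens)


print(tf_without_stopwords("the cat and the dog and the bird"))
```

All remaining words now count equally per occurrence, whether they are rare and informative or merely absent from the stopword list.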
TF-IDF
● Imagine an n-dimensional space.
● Each dimension = one possible word in texts.
● Filter stopwords.
● Entry in vector equal to the weighted frequency:
  $b_{i,j} = n_{i,j} \times \log \frac{|D|}{|\{d : t_i \in d\}|}$
● Problem:
  – Long texts get larger values.
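The weighting formula can be sketched in a few lines of Python. This is a plain reading of the formula (term count times the log of total documents over documents containing the term), assuming whitespace tokenization and no stopword filtering; the function name `tf_idf` is illustrative.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: weight} dict per document.

    weight = n_ij * log(|D| / |{d : t_i in d}|)
    """
    tokenized = [doc.lower().split() for doc in documents]
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))  # count each term once per document
    n_docs = len(tokenized)
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: count * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return vectors


docs = ["sunny weather today", "rainy weather today", "high performance computing"]
vectors = tf_idf(docs)
print(vectors[0])
```

"sunny" occurs in one of three documents and gets the largest weight in the first vector, while a term that occurs in every document gets weight zero. Because the raw count is still a factor, a longer text accumulates larger values, which is the problem the slide names.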
Hashed feature vectors
● Imagine an n-dimensional space.
● Each word in texts = hashed to one dimension.
● Entry in vector set to one, if word hashed to it.
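A minimal Python sketch of the hashing idea, assuming CRC32 as a stand-in hash function and 16 dimensions (both are arbitrary choices for illustration; Mahout's encoders use their own hashing). No vocabulary is needed, so unknown words get a dimension too, at the price of possible collisions.

```python
import zlib


def hashed_vector(text, n_dimensions=16):
    """Set the entry to one for every dimension some word hashes to."""
    vector = [0] * n_dimensions
    for token in text.lower().split():
        # zlib.crc32 is stable across runs, unlike Python's built-in hash().
        index = zlib.crc32(token.encode("utf-8")) % n_dimensions
        vector[index] = 1
    return vector


print(hashed_vector("sunny weather"))
```

With few dimensions, two different words may land on the same index; choosing a larger `n_dimensions` makes that less likely.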
How a linear classifier sees data
HTML → Apache Tika → Fulltext → Lucene Analyzer → Tokenstream+x → FeatureVector Encoder → Vector → Online Learner → Model
Goals
● Did I use the best model parameters?
  – Tune model parameters, experiment with tokenization, experiment with vector encoding.
● How well will my model perform in the wild?
  – Compute expected performance.
Performance
● Use same data for training and testing. DON'T
● Problem:
  – Highly optimistic.
  – Model generalization unknown.
Performance
● Use just a fraction for training.
● Set some data aside for testing.
● Problems:
  – Pessimistic predictor: Not all data used for training.
  – Result may depend on which data was set aside.
Performance
● Partition your data into n fractions.
● Each fraction set aside for testing in turn.
● Problem:
  – Still a pessimistic predictor.
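The partitioning scheme described here is usually called n-fold cross-validation. A small Python sketch, assuming the instances fit in a list and that a simple round-robin assignment to folds is acceptable (the function name `k_fold_splits` is illustrative):

```python
def k_fold_splits(instances, n_folds=5):
    """Yield (train, test) pairs; each fold is the test set exactly once."""
    items = list(instances)
    # Round-robin assignment: item i goes to fold i % n_folds.
    folds = [items[i::n_folds] for i in range(n_folds)]
    for i, test in enumerate(folds):
        train = [x for j, fold in enumerate(folds) if j != i for x in fold]
        yield train, test


for train, test in k_fold_splits(range(10), n_folds=5):
    print(len(train), test)
```

Each model is trained on four fifths of the data in this example, so the averaged score still tends to underestimate what a model trained on all data would achieve. In practice the data would normally be shuffled before it is split into folds.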
Performance
● Use just a fraction for training.
● Set some data aside for tuning and testing. DON'T
● Problems:
  – Highly optimistic.
  – Parameters manually tuned to testing data.
Performance
● Use just a fraction for training.
● Set some data aside for tuning.
● Set another set of data aside for testing.
● Problems:
  – Pretty pessimistic as not all data is used.
  – May depend on which data was set aside.
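A Python sketch of the three-way split, assuming 20% of the data each for tuning and testing and a fixed random seed (the fractions, the seed, and the name `three_way_split` are illustrative choices, not recommendations from the talk):

```python
import random


def three_way_split(instances, tune_fraction=0.2, test_fraction=0.2, seed=42):
    """Shuffle once, then cut into disjoint train, tune, and test sets."""
    shuffled = list(instances)
    random.Random(seed).shuffle(shuffled)
    n_tune = int(len(shuffled) * tune_fraction)
    n_test = int(len(shuffled) * test_fraction)
    tune = shuffled[:n_tune]
    test = shuffled[n_tune:n_tune + n_test]
    train = shuffled[n_tune + n_test:]
    return train, tune, test


train, tune, test = three_way_split(range(100))
print(len(train), len(tune), len(test))
```

Parameters are chosen on `tune` only and `test` is touched once at the end. Changing `seed` changes which instances land in which set, which is the second problem listed above.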
Performance Measures
                            | Correct prediction: negative | Correct prediction: positive
Model prediction: negative  | true negative                | false negative
Model prediction: positive  | false positive               | true positive
Accuracy
$ACC = \frac{\text{true positive} + \text{true negative}}{\text{true positive} + \text{false positive} + \text{false negative} + \text{true negative}}$
● Problems:
  – What if class distribution is skewed?
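A short Python illustration of the skew problem, using made-up counts: 990 negative and 10 positive instances, and a model that always predicts "negative".

```python
def accuracy(tp, fp, fn, tn):
    """Fraction of all predictions that were correct."""
    return (tp + tn) / (tp + fp + fn + tn)


# Hypothetical counts: every positive instance is missed.
print(accuracy(tp=0, fp=0, fn=10, tn=990))  # 0.99
```

The model never finds a single positive instance, yet it scores 99% accuracy.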
Precision/ Recall
$\text{Precision} = \frac{\text{true positive}}{\text{true positive} + \text{false positive}}$
$\text{Recall} = \frac{\text{true positive}}{\text{true positive} + \text{false negative}}$
● Problem:
  – Depends on decision threshold.
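The threshold dependence can be seen in a small Python sketch. The scores and labels are invented for illustration, and an instance is predicted positive when its score is at least the threshold (an assumption; other conventions exist).

```python
def precision_recall(scores, labels, threshold):
    """Compute precision and recall for one decision threshold."""
    predicted = [s >= threshold for s in scores]
    tp = sum(p and l for p, l in zip(predicted, labels))
    fp = sum(p and not l for p, l in zip(predicted, labels))
    fn = sum(not p and l for p, l in zip(predicted, labels))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


scores = [0.9, 0.8, 0.6, 0.4, 0.2]          # hypothetical model scores
labels = [True, True, False, True, False]   # True = positive class

for threshold in (0.7, 0.3):
    print(threshold, precision_recall(scores, labels, threshold))
```

At threshold 0.7 the model is precise but misses one positive; at 0.3 it finds all positives but lets one negative through. Neither number alone describes the model.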
ROC Curves
[Figure: ROC curve; x-axis: false orange rate, y-axis: true orange rate]
AUC – area under ROC
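A ROC curve sweeps the decision threshold and plots the true positive rate (the "true orange rate" on the slides) against the false positive rate. The Python sketch below builds the curve points and the area under them with the trapezoidal rule. It assumes both classes are present and no two instances share the same score; the data is invented for illustration.

```python
def roc_points(scores, labels):
    """Return (false positive rate, true positive rate) pairs, one per threshold."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    points = [(0.0, 0.0)]
    tp = fp = 0
    # Lower the threshold step by step: highest-scoring instances first.
    for _, label in sorted(zip(scores, labels), reverse=True):
        if label:
            tp += 1
        else:
            fp += 1
        points.append((fp / n_neg, tp / n_pos))
    return points


def auc(points):
    """Area under the curve by the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area


scores = [0.9, 0.8, 0.6, 0.4, 0.2]          # hypothetical model scores
labels = [True, True, False, True, False]   # True = "orange"
print(auc(roc_points(scores, labels)))
```

An AUC of 1.0 means every positive is ranked above every negative; 0.5 is what random ranking would give on average.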
Photo taken by fras1977
http://www.flickr.com/photos/fras/4992313333/
Image by Medienmagazin pro
http://www.flickr.com/photos/medienmagazinpro/6266643422
http://www.flickr.com/photos/generated/943078008/
Apache Hadoop-ready
Recommendations/ Collaborative filtering
kNN and matrix factorization based Collaborative filtering
Classification/ Naïve Bayes, random forest
Frequent item sets/ (P)FPGrowth
Classification/ Logistic Regression/ SGD
Clustering/ Mean shift, k-Means, Canopy, Dirichlet Process, Co-Location search
Sequence learning/ HMM
Math libs/ Mahout collections
LDA
Libraries to have a look at:
Vowpal Wabbit
Mallet
LibSvm
LibLinear
Libfm
Incanter
GraphLab
Scikits learn

Where to get more information:
“Mahout in Action” - Manning
“Taming Text” - Manning
“Machine Learning” - Andrew Ng
https://cwiki.apache.org/confluence/display/MAHOUT/Books+Tutorials+and+Talks
https://cwiki.apache.org/confluence/display/MAHOUT/Reference+Reading
Image by pareeerica
http://www.flickr.com/photos/pareeerica/3711741298/

Frameworks worth mentioning:
Apache Mahout
Matlab/ Octave
Shogun
Rapid-I

Apache Giraph
R
Weka
MyMediaLite

Get your hands dirty:
http://kaggle.com
https://cwiki.apache.org/confluence/display/MAHOUT/Collections

Where to meet these people:
RecSys
NIPS
KDD
PKDD
ApacheCon
O'Reilly Strata

ICML
ECML
WSDM
JMLR
Berlin Buzzwords
Get started today with the right tools.

January 8, 2008 by dreizehn28
http://www.flickr.com/photos/1328/2176949559
Discuss ideas and problems online.

November 16, 2005 [phil h]
http://www.flickr.com/photos/hi-phi/64055296
Images taken at Berlin Buzzwords 2011/12/13 by
Philipp Kaden. See you there end of May 2014.

Discuss ideas and problems in person.
Become a committer yourself
BerlinBuzzwords.de – End of May 2014 in Berlin/ Germany.

http://

Online – user/dev@mahout.apache.org, java-user@lucene.apache.org,
dev@lucene.apache.org

Interest in solving hard problems.
Being part of a lively community.
Engineering best practices.

Bug reports, patches, features.
Documentation, code, examples.
Image by: Patrick McEvoy
