Text Mining: Seminar Submitted by
• Email
• Insurance claims
• News articles
• Web pages
• Patent portfolios
• Customer complaint letters
• Contracts
• Transcripts of phone calls with customers
• Technical documents
Reasons for Text Mining
[Bar chart: Collections of Text versus Structured Data; axis: Percentage, 0 to 90]
How Text Mining Differs from Data Mining
Data Mining
• Identify data sets
• Select features
• Prepare data
• Analyze distribution

Text Mining
• Identify documents
• Extract features
• Select features by algorithm
• Prepare data
• Analyze distribution
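
The text mining steps listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method used in the seminar: the toy corpus, the function names, the word-level tokenizer, and the document-frequency threshold (min_doc_freq) are all hypothetical choices made for the example.

```python
import re
from collections import Counter

# Identify documents: a hypothetical toy corpus (not from the slides).
documents = [
    "Customer complaint about a delayed insurance claim",
    "Insurance claim approved after customer phone call",
    "Patent filing for new web page ranking method",
]

def extract_features(text):
    """Extract features: lowercase alphabetic word tokens from raw text."""
    return re.findall(r"[a-z]+", text.lower())

def select_features(token_lists, min_doc_freq=2):
    """Select features by algorithm: keep terms that occur in at least
    min_doc_freq documents (an assumed, simple selection rule)."""
    doc_freq = Counter()
    for tokens in token_lists:
        doc_freq.update(set(tokens))  # count each term once per document
    return sorted(term for term, df in doc_freq.items() if df >= min_doc_freq)

def prepare_data(token_lists, vocabulary):
    """Prepare data: one row of term counts per document,
    one column per selected term."""
    rows = []
    for tokens in token_lists:
        counts = Counter(tokens)
        rows.append([counts[term] for term in vocabulary])
    return rows

def analyze_distribution(matrix, vocabulary):
    """Analyze distribution: total count of each selected term in the corpus."""
    totals = [sum(column) for column in zip(*matrix)]
    return dict(zip(vocabulary, totals))

token_lists = [extract_features(doc) for doc in documents]
vocabulary = select_features(token_lists)
matrix = prepare_data(token_lists, vocabulary)
distribution = analyze_distribution(matrix, vocabulary)

print(vocabulary)
print(distribution)
```

The extra "extract features" step is what separates this from the data mining column: structured data arrives with its features already defined, while raw text has to be turned into terms before any selection or analysis can happen.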
Mining