DM 5th unit ppt
DM 5th unit ppt
|{Relevant}^ {Retrieved}|
Recall =
|{ Relevant}|
3. F- Score:
Recall * Precision
F-Score =
( recall + Precision)\2
Text Mining:
Fig: Relationship Between the set of relevant documents and set of
retrieved document Relevant + Retrieved
All Documents
Text Retrieval Methods:
1. Document Selection Method:
Boolean Retrieval model ( and/ OR /Not)
2. Document Ranking Method: The goal is to approximate the
degree of relevance of a document with a score computed based on
information such as the frequency of words in the document and
the whole collection.
Tokenization:
Stop list: Regularly used terms a, the, for, with
Word stem : long, longer, longest------
Multi Media Mining
• Multi media data mining is used for Extracting intresting
information for Multi media data set.
• Multi media mining is a sub field of data mining which is used to find
interesting information of implicit knowledge from multitime data
bases.
• Audio data
• Video data
• Image data
• Graphical data
• Speech data
• Text Data
Categories of Multi Media data Mining:
Multi Media Data Mining
Video
Text Dynamic Media
Static Media Mining
Ming
Audio
Image
Mining
Mining
• The Multi Media Data Mining is classified into Two categories are
Static and Dynamic media.
• Static media contains text ( digital library, creating sms & mms) and
images ( photos & media images)
• Dynamic media contains Audio ( Music & MP3 sounds) & (video like
movies).
Applications of Multimedia Mining:
• Digital Library
• Traffic video sequences
• Media Analysis
• Customer Perception
• Media Making and Broad Casting
• Mobiles
• Digital cameras
• Internet------etc
Multimedia Data Mining Processing:
• Data Collection is the initial stage of the learning s/m pre-processing is
to extract significant features from raw data, it includes data cleaning,
transformation , normalization, features extaction etc----
• Learning can be direct, if informative types can be recognized at pre-
processing data\ stage.
• Complete process depends extremely on the nature of raw data and
difficulty field
• The product of pre- processing is the training set.
Multimedia Data Mining Processing:
Data Preprocessing
-Data Cleaning
-Feature Selection
Training set
Machine Learning
Model
Architecture of Multimedia Data Mining:
• The Architecture has several components:
1. Input
2. Multimedia content
3. Spation temporal segmentation
4. Feature Extraction
5. Finding the similar pattern.
Architecture of Multimedia Data Mining:
Vid
text Im eo
Input Multimedia Contents Aud
age io
Spatiotemporal segmentation