Pattern Recognition
Pattern recognition can be defined as the classification of data based on knowledge already gained or on
statistical information extracted from patterns and/or their representation.
One of the important aspects of pattern recognition is its application potential.
Examples: speech recognition, speaker identification, multimedia document recognition (MDR), and automatic
medical diagnosis.
In a typical pattern recognition application, the raw data is processed and converted into a form that is
amenable for a machine to use.
Features may be represented as continuous, discrete or discrete binary variables.
A feature is a function of one or more measurements, computed so that it quantifies some significant
characteristics of the object.
Example: for a human face, the eyes, ears, nose, etc. are features of the face.
A set of features taken together forms the feature vector.
Example: in the face example above, if all the features (eyes, ears, nose, etc.) are taken together, the
resulting sequence is the feature vector ([eyes, ears, nose]). A feature vector is a sequence of features
represented as a d-dimensional column vector. In the case of speech, MFCCs (Mel-frequency cepstral
coefficients) are spectral features of the speech signal. The sequence of the first 13 coefficients forms a
feature vector.
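As a rough illustration of the idea, the sketch below collects a few measurements of one object into a
d-dimensional column vector. The measurement names, their values and the helper to_feature_vector are
hypothetical and chosen only for this example; they do not come from any real dataset or library.

```python
# Hypothetical measurements for one face. Names and values are
# illustrative assumptions, not taken from a real dataset.
measurements = {
    "eye_distance_cm": 6.3,   # continuous feature
    "nose_length_cm": 4.8,    # continuous feature
    "num_visible_ears": 2,    # discrete feature
    "wears_glasses": 1,       # discrete binary feature (0 or 1)
}

# A fixed ordering so that every object maps to the same vector layout.
feature_order = [
    "eye_distance_cm",
    "nose_length_cm",
    "num_visible_ears",
    "wears_glasses",
]


def to_feature_vector(measurements, feature_order):
    """Return the features as a column vector: d rows, one column each."""
    return [[float(measurements[name])] for name in feature_order]


x = to_feature_vector(measurements, feature_order)
d = len(x)  # dimensionality of the feature vector
print(d, x)
```

Fixing the order of the features matters: two objects can only be compared if the same position in the
vector always holds the same feature.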
A pattern recognition system has the following features:
It should recognise familiar patterns quickly and accurately.
Learning is the process through which a system is trained and becomes adaptable, so that it gives accurate
results.
Learning is the most important phase, because how well the system performs on the data provided to it
depends on which algorithms are applied to that data.
The entire dataset is divided into two parts: one that is used to train the model, i.e. the training set,
and one that is used to test the model after training, i.e. the testing set.
Training Set
Training set is used to build a model.
It consists of the set of images which are used to train the system.
The training rules and algorithms provide the information on how to associate input data with output
decisions.
The system is trained by applying these algorithms to the dataset: the relevant information is extracted
from the data and results are obtained.
• Generally, about 80% of the dataset is used as training data.
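The division of a dataset into a training part and a testing part can be sketched as follows. This is a
minimal example that assumes an 80/20 split and a simple random shuffle; the function train_test_split,
its parameters and the toy dataset are hypothetical names introduced for illustration only.

```python
import random


def train_test_split(samples, train_fraction=0.8, seed=0):
    """Shuffle a copy of the samples and split it into (training, testing)."""
    shuffled = list(samples)                 # leave the caller's data untouched
    random.Random(seed).shuffle(shuffled)    # fixed seed gives a repeatable split
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


dataset = list(range(10))                    # ten toy samples, assumed for the sketch
train_set, test_set = train_test_split(dataset)
print(len(train_set), len(test_set))
```

Shuffling before cutting is a design choice made here so that both parts are drawn from the whole dataset
rather than from its first and last entries.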
Testing Set
Testing data is used to test the system.
It is the set of data which is used to verify whether the system is producing the correct output after being
trained or not.
Example: if a system that identifies which category a particular flower belongs to classifies seven out of
ten flowers correctly and the remaining three incorrectly, then its accuracy is 70%.
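The accuracy figure in this example is simply the number of correct outputs divided by the number of test
samples. A minimal sketch of that calculation is given below; the flower labels and the accuracy function
are hypothetical and serve only to reproduce the seven-out-of-ten case.

```python
def accuracy(predicted, actual):
    """Fraction of test samples whose predicted label equals the true label."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("need two non-empty label lists of equal length")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


# Hypothetical true labels for ten test flowers.
actual = ["rose"] * 4 + ["tulip"] * 3 + ["daisy"] * 3
# Hypothetical system output: the first seven are right, the last three are wrong.
predicted = ["rose"] * 4 + ["tulip"] * 3 + ["rose", "tulip", "rose"]

acc = accuracy(predicted, actual)
print(f"{acc:.0%}")  # prints 70%
```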
Advantages
Pattern recognition solves classification problems.
It is useful for recognising cloth patterns for visually impaired people.