SHES2201 Lecture 3 - Data Mining in Bioinformatics
SHES2201 Lecture 3 - Data Mining in Bioinformatics
• Common Techniques
– Classification and prediction
– Clustering
– Data summarization
– Dependency modeling
– Change and deviation detection
Data Mining Techniques
• Dependency modeling.
– The aim is to derive some causal structure within the data.
– One example is functional dependency between predicates.
• Data summarization.
– The aim is
• to discover patterns that describe subsets of the data
(attribute focusing), and
• to extract rules from the data telling us how a subset
of data influences the presence of another subset
– Association Rules Mining (ARM) relate to an
undirected/unsupervised data mining technique.
– Usually produces clear and understandable results
Another example
• bacteriarhodopsin (bR) from the bacteria,
Halobacterium halobium, are now being used by
scientists to produce bioelectronic switches a thousand
times smaller and faster than current semiconductor
technologies.
• Hong, Birge and others (in Vitaliano, 1996) are
researching electronic photo-active bR systems to
develop massively parallel and massively distributed
biocomputers.
Biological Perspective