0% found this document useful (0 votes)
46 views

3types of Data Mining

Spatial data mining refers to extracting knowledge and patterns from spatial databases. It involves integrating data mining techniques with spatial databases and can be used for understanding spatial relationships in areas like geography, remote sensing, and medical imaging. A key challenge is developing efficient techniques for mining huge amounts of complex spatial data. Statistical techniques are commonly used for spatial data analysis and modeling spatial autocorrelation, and spatial data mining aims to further develop and scale up these methods.

Uploaded by

JOEL
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
46 views

3types of Data Mining

Spatial data mining refers to extracting knowledge and patterns from spatial databases. It involves integrating data mining techniques with spatial databases and can be used for understanding spatial relationships in areas like geography, remote sensing, and medical imaging. A key challenge is developing efficient techniques for mining huge amounts of complex spatial data. Statistical techniques are commonly used for spatial data analysis and modeling spatial autocorrelation, and spatial data mining aims to further develop and scale up these methods.

Uploaded by

JOEL
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 4

Spatial Data Mining

A spatial database stores a large amount of space-related data, such as maps,


preprocessed remote sensing or medical imaging data, and VLSI chip layout data.
Spatial
databases have many features distinguishing them from relational databases.
They
carry topological and/or distance information, usually organized by
sophisticated,
multidimensional spatial indexing structures that are accessed by spatial data
access
methods and often require spatial reasoning, geometric computation, and
spatial
knowledge representation techniques.
Spatial data mining refers to the extraction of knowledge, spatial relationships,
or
other interesting patterns not explicitly stored in spatial databases. Such mining
demands
an integration of data miningwith spatial database technologies. It can be used
for understanding spatial data, discovering spatial relationships and relationships
between spatial
and nonspatial data, constructing spatial knowledge bases, reorganizing spatial
databases,
and optimizing spatial queries. It is expected to have wide applications in
geographic
information systems, geomarketing, remote sensing, image database
exploration, medical imaging, navigation, traffic control, environmental studies, and many other
areas
where spatial data are used. A crucial challenge to spatial data mining is the
exploration
of efficient spatial data mining techniques due to the huge amount of spatial
data and the
complexity of spatial data types and spatial access methods.
What about using statistical techniques for spatial data mining? Statistical
spatial data
analysis has been a popular approach to analyzing spatial data and exploring
geographic
information. The termgeostatistics is often associated with continuous
geographic space,
whereas the term spatial statistics is often associated with discrete space. In a
statistical

model that handles nonspatial data, one usually assumes statistical


independence among
different portions of data. However, different from traditional data sets, there is
no such
independence among spatially distributed data because in reality, spatial
objects are often
interrelated, or more exactly spatially co-located, in the sense that the closer
the two objects
are located, the more likely they share similar properties. For example, nature
resource,
climate, temperature, and economic situations are likely to be similar in
geographically
closely located regions. People even consider this as the first lawof geography:
Everything
is related to everything else, but nearby things are more related than distant
things. Such
a property of close interdependency across nearby space leads to the notion of
spatial
autocorrelation. Based on this notion, spatial statistical modeling methods have
been
developed with good success. Spatial data mining will further develop spatial
statistical
analysis methods and extend themfor huge amounts of spatial data, with more
emphasis
on efficiency, scalability, cooperation with database and data warehouse
systems,
improved user interaction, and the discovery of new types of knowledge.

Multimedia Data Mining


What is a multimedia database? A multimedia database system stores and
manages a
large collection of multimedia data, such as audio, video, image, graphics,
speech, text,
document, and hypertext data, which contain text, text markups, and linkages.
Multimedia database systems are increasingly common owing to the popular use of
audiovideo equipment, digital cameras, CD-ROMs, and the Internet. Typical
multimedia
database systems include NASAs EOS (Earth Observation System), various
kinds of
image and audio-video databases, and Internet databases.
In this section, our study of multimedia data mining focuses on image data
mining.
Mining text data and mining the World Wide Web are studied in the two
subsequent
sections. Here we introduce multimedia data mining methods, including
similarity

search in multimedia data, multidimensional analysis, classification and


prediction
analysis, and mining associations in multimedia data.

Text Mining
Most previous studies of data mining have focused on structured data, such as
relational,
transactional, and data warehouse data. However, in reality, a substantial
portion of
the available information is stored in text databases (or document databases),
which
consist of large collections of documents from various sources, such as news
articles,
research papers, books, digital libraries, e-mail messages, andWeb pages. Text
databases
are rapidly growing due to the increasing amount of information available in
electronic
form, such as electronic publications, various kinds of electronic documents, email, and
theWorldWideWeb (which can also be viewed as a huge, interconnected,
dynamic text
database). Nowadays most of the information in government, industry, business,
and
other institutions are stored electronically, in the form of text databases.
Data stored in most text databases are semistructured data in that they are
neither
completely unstructured nor completely structured. For example, a document
may
contain a few structured fields, such as title, authors, publication date, category,
and
so on, but also contain some largely unstructured text components, such as
abstract
and contents. There have been a great deal of studies on the modeling and
implementation of semistructured data in recent database research. Moreover,
information
retrieval techniques, such as text indexing methods, have been developed to
handle
unstructured documents.
Traditional information retrieval techniques become inadequate for the
increasingly
vast amounts of text data. Typically, only a small fraction of the many available
documents will be relevant to a given individual user.Without knowing what could be
in the

documents, it is difficult to formulate effective queries for analyzing and


extracting useful
information from the data. Users need tools to compare different documents,
rank the
importance and relevance of the documents, or find patterns and trends across
multiple
documents. Thus, text mining has become an increasingly popular and essential
theme
in data mining.

You might also like