Advanced Database Models , Systems and Applications
Advanced Database Models , Systems and Applications
Spatial Database
• A spatial database is a database that is enhanced to store and access spatial
data or data that defines a geometric space. These data are often associated
with geographic locations and features, or constructed features like cities. Data
on spatial databases are stored as coordinates, points, lines, polygons and
topology. Some spatial databases handle more complex data like three
dimensional objects, topological coverage and linear networks.
• Common database systems use indexes for a faster and more efficient search
and access of data. This index, however, is not fit for spatial queries. Instead,
spatial databases use something like a unique index called a spatial index to
speed up database performance. Spatial indexing is very much required because
a system should be able to retrieve data from a large collection of objects
without really searching the whole bunch. It should also support relationships
between connecting objects from different classes in a better manner than just
filtering.
• Aside from the indexes, spatial databases also offer spatial data types in their
data model and query language. These databases require special kinds of data
types to provide a fundamental abstraction and model the structure of the
geometric objects with their corresponding relationships and operations in the
spatial environment. Without these kind of data types, the system would not be
able to support the kind of modeling a spatial database offers.
• Non-spatial data (also called attribute or characteristic data) is that information
which is independent of all geometric considerations.
– For example, a person’s height, mass, and age are non-spatial data because
they are independent of the person’s location.
– It’s interesting to note that, while mass is non-spatial data, weight is spatial
data in the sense that something’s weight is very much dependent on its
location!
Example Of Spatial Data
A road map is a visualization of geographic information. A road map is a 2-
dimensional object which contains points, lines, and polygons that can
represent cities, roads, and political boundaries such as states or provinces.
In general, spatial data can be of two types
• Vector data: This data is represented as discrete points, lines and polygons
• Rastor data: This data is represented as a matrix of square cells.
• The spatial data in the form of points, lines, polygons etc. is used by many
different databases as shown above.
Multimedia Database
• Multimedia database is the collection of interrelated multimedia data that
includes text, graphics (sketches, drawings), images, animations, video, audio etc
and have vast amounts of multisource multimedia data.
• The framework that manages different types of multimedia data which can be
stored, delivered and utilized in different ways is known as multimedia database
management system.
• There are three classes of the multimedia database which includes static media,
dynamic media and dimensional media.
Content of Multimedia Database management system :
• Media data: It is the actual data which represents an object.
• Media format data: The information such as resolution, sampling rate, encoding
system, etc. about the format of the media data under consideration after is
undergoes acquisition, processing, and encoding is the media format data.
• Media keyword data: Media keyword data are the keyword description related
to the generation of data. This data is also known as content descriptive data.
Examples of content descriptive data are place, time, date of recording.
• Media feature data: Media feature data contains data which is content
dependent such as kind of texture, distribution of, and the different shapes
present in the data.
This still leads to a number of challenges to multimedia databases. These are:
• Modeling: Work in this area can improve the database versus information
retrieval techniques.
• Design: The physical, conceptual, and logical design of multimedia databases is
not addressed entirely leading to performance and tuning issues.
• Storage: The storage of databases on a standard disc can lead to problems like
representation, mapping to disc hierarchies, compression, etc.
• Performance: Audio-video synchronization and audio playback applications are
where physical limitations dominate. Parallel processing can reduce these
problems, but these techniques have not been completely developed yet.
Multimedia databases also consume a lot of processing power and bandwidth.
• Queries and Retrieval: Multimedia such as images, audio, video lead to
retrieval and queries issues such as efficient query formation, query execution,
etc.
Multimedia Database Applications:
• Documents and record management: Industries which keep a lot of
documentation and records. Ex: Insurance claim industry.
• Knowledge dissemination: Multimedia database is an extremely efficient tool
for knowledge dissemination and providing several resources. Ex: electronic
books
• Education and training: Multimedia sources can be used to create resources
useful in education and training. These are popular sources of learning in recent
days. Ex: Digital libraries.
• Real-time monitoring and control: Multimedia presentation when coupled
with active database technology can be an effective means for controlling and
monitoring complex tasks. Ex: Manufacture control
• Marketing
• Advertisement
• Retailing
• Entertainment
• Travel
DEDUCTIVE DATABASE
• A deductive database is a database system that can make deductions (i.e.
conclude additional facts) based on rules and facts stored in the (deductive)
database. Datalog is the language typically used to specify facts, rules and
queries in deductive databases.
• Deductive databases have grown out of the desire to combine logic
programming with relational databases to construct systems that support
powerful formalism and are still fast and able to deal with very large datasets.
Deductive databases are more expressive than relational databases but less
expressive than logic programming systems.
• In recent years, deductive databases such as Datalog have found new
application in data integration, information extraction, networking, program
analysis, security, and cloud computing.
• Deductive databases reuse many concepts from logic programming; rules and
facts specified in the deductive database language Datalog look very similar to
those in Prolog.
• A Deductive Database is a type of database that can make conclusions or we can
say deductions using a sets of well defined rules and fact that are stored in the
database.
• In today’s world as we deal with a large amount of data, this deductive database
provides a lot of advantages.
• It helps to combine the RDBMS with logic programming. To design a deductive
database a purely declarative programming language called Datalog is used.
• A deductive database uses two main types of specifications: facts and rules.
• Facts are specified in a manner similar to the way relations are specified, except
that it is not necessary to include the attribute names.
• Recall that a tuple in a relation describes some real-world fact whose meaning is
partly determined by the attribute names.
• In a deductive database, the meaning of an attribute value in a tuple is
determined solely by its position within the tuple.
• Rules are somewhat similar to relational views. They specify virtual relations
that are not actually stored but that can be formed from the facts by applying
inference mechanisms based on the rule specifications.
• The main difference between rules and views is that rules may involve recursion
and hence may yield virtual relations that cannot be defined in terms of basic
relational views.
Introduction To Information Retrieval and Web Search
• Information retrieval is the process of retrieving documents from a collection
in response to a query (or a search request) by a user.
• Information Retrieval is the activity of obtaining material that can usually be
documented on an unstructured nature i.e. usually text which satisfies an
information need from within large collections which is stored on computers.
For example, Information Retrieval can be when a user enters a query into the
system.
• The IR system assists the users in finding the information they require but it
does not explicitly return the answers to the question. It notifies regarding the
existence and location of documents that might consist of the required
information. Information retrieval also extends support to users in browsing or
filtering document collection or processing a set of retrieved documents.
• The system searches over billions of documents stored on millions of computers.
A spam filter, manual or automatic means are provided by Email program for
classifying the mails so that it can be placed directly into particular folders.
• An IR system has the ability to represent, store, organize, and access information
items. A set of keywords are required to search. Keywords are what people are
searching for in search engines. These keywords summarize the description of
the information.
Components of Information Retrieval/ IR Model
Acquisition: In this step, the selection of documents and other objects from various
web resources that consist of text-based documents takes place. The required data
is collected by web crawlers and stored in the database.
• Representation: It consists of indexing that contains free-text terms, controlled
vocabulary, manual & automatic techniques as well. example: Abstracting
contains summarizing and Bibliographic description that contains author, title,
sources, data, and metadata.
• File Organization: There are two types of file organization methods.
i.e. Sequential: It contains documents by document data. Inverted: It contains
term by term, list of records under each term. Combination of both.
• Query: An IR process starts when a user enters a query into the system. Queries
are formal statements of information needs, for example, search strings in web
search engines. In information retrieval, a query does not uniquely identify a
single object in the collection. Instead, several objects may match the query,
perhaps with different degrees of relevancy.
Difference Between Information Retrieval and Data Retrieval
Information Retrieval Data Retrieval
The software program that deals with the Data retrieval deals with obtaining data from a
organization, storage, retrieval, and evaluation database management system such as ODBMS.
of information from document repositories It is A process of identifying and retrieving the
data from the database, based on the query
particularly textual information. provided by user or application.
Retrieves information about a subject. Determines the keywords in the user query
and retrieves the data.
Small errors are likely to go unnoticed. A single error object means total failure.
Does not provide a solution to the user of the Provides solutions to the user of the database
database system. system.
The results obtained are approximate matches. The results obtained are exact matches.