2.5 Pre- and Post-Coordinate Indexing
2.5 Pre- and Post-Coordinate Indexing
5
PRE- AND POST-COORDINATE INDEXING
y Provide means for the user to make selection from among all items
in any particular category, according to any chosen set of criteria
such as: most thorough, most recent, most elementary, etc.
1. Document analysis
2. Significant characteristics
3. Representing significant characteristics
4. Information system
5. User(s)
D1 D2 D3 D4 D5 D6 D7
T1 X X X
T2 X X
T3 X X X X
T4 X X
T5 X X
T6 X X
Abstract: GridVine is a semantic overlay infrastructure based on a peer-to-peer (P2P) access structure. Built following the
principle of data independence, it separates a logical layer - in which data, schemas, and schema mappings are managed
- from a physical layer consisting of a structured P2P network supporting decentralized indexing, key load-balancing,
and efficient routing. The system is decentralized, yet fosters semantic interoperability through pair-wise schema
mappings and query reformulation. GridVine's heterogeneous but semantically related information sources can be
queried transparently using iterative query reformulation. The authors discuss a reference implementation of the
system and several mechanisms for resolving queries collaboratively.
Inspec controlled grid computing - information management - open systems - peer-to-peer computing -
terms: query formulation - semantic Web
Uncontrolled GridVine - peer information management - semantic overlay infrastructure - peer-to-
terms: peer access structure - data independence - logical layer - physical layer - decentralized
indexing - key load-balancing - decentralized system - semantic interoperability - pair-
wise schema mappings - iterative query reformulation
y Specificity
◦ identification of the precise concepts (terms) to
signal content of document
◦ representation of precisely those and only those
concepts (terms) to signal content of document
Alhambra
Granada, Spain
Here we have a
document with the title: Ecclesiastical Architecture
Architecture of
Cathedrals in Spain
Architecture of Cathedrals
Pre-coordinate X
Post-coordinate, X
Assignment
Post-coordinate, X X
Derived
Free Text X X
Indexer Searcher
Pre-coordinate X
Post-coordinate, X X
Assignment
Post-coordinate, X X
Derived
Free Text X
y Precision:
◦ Relevant documents retrieved/ Total
documents retrieved = % precision
y High Specificity =
◦ High precision
◦ Low recall
y High Exhaustivity =
◦ Low precision
◦ High recall
IMT 530 | 2008 | Tennis
2.5.6 Evaluation – Challenges
y Challenges for Indexing
y Consistency in indexing