SlideShare a Scribd company logo
What is Document
Indexing?
1
in·dex /in-deks/ n.
plural in·dex·es, in·di·ces /in-duh-seez/
a list (as of bibliographical information or citations to a body of literature) arranged usually in alphabetical order of some specified datum (as
author, subject, or keyword): as a : a list of items (as topics or names)
2
in·dex /in-deks/ v.
to provide an index for (something, such as a book)
Copyright ©2014
the process of tagging or associating
information with a file so it can be used for
search and retrieval purposes later
Indexing:
Indexing creates the “searchable” information
that users will later use to find documents.
Invoice Number?
Customer/Employee Number?
Customer/Employee Name?
Date?
Site ID?
Patient Name?Doctor?
Work Order Number?
Waybill Number?
Prescription Number?
The index information is
stored or integrated into
a database or
document/records
management system
which provides a
framework for users to
locate the documents.
My Database
There are two types of Indexes.
Full-text indexing is
just what the name
implies; all the text
of the document is
indexed.
When specific words
or descriptions are
indexed to create
the searchable
index fields, the
information is
referred to as
“metadata.”
So Why is
Indexing
Important?
“Documents are the currency of business.
They are at the heart of critical workflows
and drive just about every area of
business.” -- IDC, “The Role of Documents: How They Drive
Business, Today and Tomorrow”, January 2013
Great care should be taken to
design an efficient indexing scheme.
If the process is not
designed correctly
at the outset, trying
to rectify it later
can be both
difficult and costly.
And in some
environments such
as legal, the cost of
not locating a key
document can be
monumental.
Avoid Disaster
So how can indexing information
be extracted with little to no
user intervention?
So how can indexing information
be extracted with little to no
user intervention?
• Barcodes
• Content Data Mining
• Optical Character Recognition (OCR)
• Zonal OCR
• Drag and Drop OCR
Intelligent data capture software can extract barcode
data for indexing.
Intelligent data capture software can extract barcode
data for indexing.
Barcodes can also be used for many other
purposes such as file naming, splitting,
bookmarking and routing.
Files that contain text can be mined using various data
mining techniques.
OCR tools and technology such as
Regular Expressions aid in text mining.
Regular expression (regex) scripts are
powerful tools to help identify keywords or
actual strings of text for indexing from
many source types.
OCR tools and technology such as
Regular Expressions aid in text mining.
Regular expression (regex) scripts are
powerful tools to help identify keywords or
actual strings of text for indexing from
many source types.
The scripting process can look for words
with specific characters, lengths,
character types, or preceding keywords.
OCR tools and technology such as
Regular Expressions aid in text mining.
If an inventory item should contain three alpha characters
followed by five numbers, advanced indexing solutions can
use regex to recognized this pattern and reject all
documents with items not meeting this rule.
The document can be tagged for manual inspection before
further processing is done.
Advanced indexing solutions offer Field Validation
based on Regular Expressions.
PEN21096
CAP36581
INV98453
PA568793
Used to process EOB's or other records where the same
document needs to be in multiple patient records or places.
Advanced data capture solutions such as ImageRamp allow
the operator to easily scan the EOB once, index the different
patients' information via an onscreen keyboard, drag-and-
drop OCR, or barcode reading methods, and route to the
appropriate patients' records with little to no intervention.
Advanced indexing solutions can accommodate
special needs such as Scan Once, Index Many
ImageRamp:
Multiple Indexing,
Naming and
Routing of the
Same Document
Patient A
Patient B
Patient C
Policy
EOB
Index Sources can be:
• Print streams
• Scanned documents
• Existing files such as word processing
and spreadsheets
PDF print streams can be used to
produce the source data for invoice
runs or other AP/AR functions that
can then be mined for index data
and document splits.
With OCR technology, make your scanned or
image-based file fully text-searchable or extract
data from a zone for indexing.
With most data
capture solutions, users
often select the output
file format as a
“searchable PDF” to
make a full-text index.
This uses OCR
technology to create a
PDF file with two layers,
an image layer and a
text layer that can be
used for full-text
searching.
With zonal OCR, document areas are identified for
OCR capture. Drag-and-drop OCR lets an operator
highlight document text which is automatically OCR'd
and dropped into index fields.
Now that I’ve captured my index data, what can
I do?
Now that I’ve captured my index data, what can
I do?
1. Use a simple search and retrieval system
Now that I’ve captured my index data, what can
I do?
1. Use a simple search and retrieval system
• Let’s you search on the index fields or
free form search on full-text, searchable
PDF files.
• Can be a stepping stone to a full-
fledged document management
system later without loss of investment.
Now that I’ve captured my index data, what can
I do?
2. Send it to an existing document
management or EMR/EHR system.
Now that I’ve captured my index data, what can
I do?
2. Send it to an existing document
management or EMR/EHR system.
Henry Schein, Dentrix,
Dentrix Enterprise
Dentrix Ascend, Easy
Dental
Viive, DentalVision, axiUm
Filenet
ANYONE via CSV, XML
Laserfich
e
Documentum
MyMedicalRecords
Eaglesoft
Allscripts
Epic
Dentrix
Sharepoint
CSV, XML
standard
formats
Learn more about ImageRamp,
intelligent data capture software and…
Click for information on:
• Understanding your scanning requirements
• Using Regular Expressions for Automated Data Capture and Indexing
• Make your Paperless Dreams Come True, using Fujitsu ScanSnap
scanners for document capture
• What can barcodes do for me? (in document Management/EMR Data
capture)
• 8 Must Haves for any Document Capture System
• What is document Indexing
document capture and processing:
Contact us for more information on:
• How to capture index data from print streams
• Using Regex to capture index information,
• More tutorial information on document management
• Scanning documents for document management,
• How to intelligently capture index data from your scans
• Requirements for document management scanning
• How to select a document capture or document scanning
solution
• Using touchscreen scanners such as the Fujitsu ScanSnap as an
intelligent capture solution
• Batch document scanning solutions
• Document Management cost savings
• EMR data capture
• Batch Indexing solutions
• Batch document indexing
• Index documents
• Create a document index
• Document management index
• Index from print stream
• ECM index
• Index ECM
By DocuFi,
makers of ImageRamp,
Document Management
Capture Solution
30 years’ experience in the Document Imaging market.
Find out more at ImageRamp and
www.docufi.com
Copyright ©2014
Image Credits
• Dave Gray dgray_xplane, http://bit.ly/17xKYXp
• Marcin Wichary, Alphabetical, http://bit.ly/1aILOku
• Jim Morgan, database http://bit.ly/1ai0Nm3
• Liza liza31337, Book crease, http://bit.ly/1lWj8tL
• UCL Faculty of Mathematical and Physical Sciences, Index,
http://bit.ly/19q6GiI
• Stuart Caie kyz, Indexed, http://bit.ly/Kfwbau
• Spiffie, “Fujitsu ScanSnap S300M” http://bit.ly/1ksdhhv
• Doctorwonder, “Stack O'Money!” http://bit.ly/1fgxpko
• Boston Public Library, The card index department,
http://bit.ly/1kygZq2
• Robyn Jay, robynejay Train wreck at Montparnasse 1895,
http://bit.ly/19q8CYq
• Theilr, spray, http://bit.ly/1hjGKp3
• Phil Whitehouse,Phillie Casablanca, Blue Zone, http://bit.ly/1hjGVAT
• Seiichi Kusunoki Visual Maintenance, Bunch of Papers,
http://bit.ly/1eJ8EZu
• Patrick Hoesly, “Thank you” http://bit.ly/17xKErE
All images are owned or licensed by DocuFi with acknowledgement given to:
Ad

More Related Content

What's hot (20)

CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
Anandh Arumugakan
 
Information Retrieval Evaluation
Information Retrieval EvaluationInformation Retrieval Evaluation
Information Retrieval Evaluation
José Ramón Ríos Viqueira
 
Introduction to indexing (presentation1)
Introduction to indexing (presentation1)Introduction to indexing (presentation1)
Introduction to indexing (presentation1)
Mary May Porto
 
Probabilistic information retrieval models & systems
Probabilistic information retrieval models & systemsProbabilistic information retrieval models & systems
Probabilistic information retrieval models & systems
Selman Bozkır
 
Slsh
SlshSlsh
Slsh
MahendraAdhikari7
 
Information Retrieval
Information RetrievalInformation Retrieval
Information Retrieval
ssbd6985
 
Binding standards ms
Binding standards msBinding standards ms
Binding standards ms
madhuvardhan
 
Automatic indexing
Automatic indexingAutomatic indexing
Automatic indexing
dhatchayaninandu
 
Introduction to Information Retrieval & Models
Introduction to Information Retrieval & ModelsIntroduction to Information Retrieval & Models
Introduction to Information Retrieval & Models
Mounia Lalmas-Roelleke
 
Thesaurus ppt.pptx
Thesaurus ppt.pptxThesaurus ppt.pptx
Thesaurus ppt.pptx
ApurvaShyam1
 
key word indexing and their types with example
key word indexing and their types with example key word indexing and their types with example
key word indexing and their types with example
Sourav Sarkar
 
INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.L
anujessy
 
Indexing language concept types and characteristics
Indexing language concept types and characteristicsIndexing language concept types and characteristics
Indexing language concept types and characteristics
Dr. Utpal Das
 
Thesaurus 2101
Thesaurus 2101Thesaurus 2101
Thesaurus 2101
roseline2101
 
Precis
PrecisPrecis
Precis
silambu111
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)
silambu111
 
POPSI
POPSIPOPSI
POPSI
silambu111
 
Library Automation A - Z Guide: A Hands on Module
Library Automation A - Z Guide: A Hands on ModuleLibrary Automation A - Z Guide: A Hands on Module
Library Automation A - Z Guide: A Hands on Module
Ashok Kumar Satapathy
 
Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information Retrieval
Dustin Smith
 
Reprographics
ReprographicsReprographics
Reprographics
knoxbusiness
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
Anandh Arumugakan
 
Introduction to indexing (presentation1)
Introduction to indexing (presentation1)Introduction to indexing (presentation1)
Introduction to indexing (presentation1)
Mary May Porto
 
Probabilistic information retrieval models & systems
Probabilistic information retrieval models & systemsProbabilistic information retrieval models & systems
Probabilistic information retrieval models & systems
Selman Bozkır
 
Information Retrieval
Information RetrievalInformation Retrieval
Information Retrieval
ssbd6985
 
Binding standards ms
Binding standards msBinding standards ms
Binding standards ms
madhuvardhan
 
Introduction to Information Retrieval & Models
Introduction to Information Retrieval & ModelsIntroduction to Information Retrieval & Models
Introduction to Information Retrieval & Models
Mounia Lalmas-Roelleke
 
Thesaurus ppt.pptx
Thesaurus ppt.pptxThesaurus ppt.pptx
Thesaurus ppt.pptx
ApurvaShyam1
 
key word indexing and their types with example
key word indexing and their types with example key word indexing and their types with example
key word indexing and their types with example
Sourav Sarkar
 
INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.L
anujessy
 
Indexing language concept types and characteristics
Indexing language concept types and characteristicsIndexing language concept types and characteristics
Indexing language concept types and characteristics
Dr. Utpal Das
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)
silambu111
 
Library Automation A - Z Guide: A Hands on Module
Library Automation A - Z Guide: A Hands on ModuleLibrary Automation A - Z Guide: A Hands on Module
Library Automation A - Z Guide: A Hands on Module
Ashok Kumar Satapathy
 
Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information Retrieval
Dustin Smith
 

Viewers also liked (15)

Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...
Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...
Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...
DocuFi, offering HAI and Infection Prevention Analytics
 
Scanning & document management
Scanning & document managementScanning & document management
Scanning & document management
Gautam Ganguly
 
Why you need to use document scanning management system for business?
Why you need to use document scanning management system for business?Why you need to use document scanning management system for business?
Why you need to use document scanning management system for business?
Digismartek
 
RU
RURU
RU
Ricoh University
 
What is Data Capture
What is Data CaptureWhat is Data Capture
What is Data Capture
Chris Riley ☁
 
Apa itu soft copy
Apa itu soft copyApa itu soft copy
Apa itu soft copy
johnthj
 
Image Scanning Services
Image Scanning ServicesImage Scanning Services
Image Scanning Services
Global Associates
 
Document scanning and capture (local, central, outsource) what's working best
Document scanning and capture (local, central, outsource) what's working bestDocument scanning and capture (local, central, outsource) what's working best
Document scanning and capture (local, central, outsource) what's working best
Vander Loto
 
ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an...
ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an...ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an...
ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an...
DocuFi, offering HAI and Infection Prevention Analytics
 
Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...
Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...
Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...
DocuFi, offering HAI and Infection Prevention Analytics
 
What can barcodes do for me? A look at barcodes in Document Management/EMR da...
What can barcodes do for me? A look at barcodes in Document Management/EMR da...What can barcodes do for me? A look at barcodes in Document Management/EMR da...
What can barcodes do for me? A look at barcodes in Document Management/EMR da...
DocuFi, offering HAI and Infection Prevention Analytics
 
PDF vs. TIFF, An Evaluation of Document Scanning File Formats
PDF vs. TIFF, An Evaluation of Document Scanning File FormatsPDF vs. TIFF, An Evaluation of Document Scanning File Formats
PDF vs. TIFF, An Evaluation of Document Scanning File Formats
DocuFi, offering HAI and Infection Prevention Analytics
 
Scanning Document Types | Record Nations
Scanning Document Types | Record NationsScanning Document Types | Record Nations
Scanning Document Types | Record Nations
Record Nations
 
What is Intelligent Document and Data Capture? A look at the technologies to ...
What is Intelligent Document and Data Capture? A look at the technologies to ...What is Intelligent Document and Data Capture? A look at the technologies to ...
What is Intelligent Document and Data Capture? A look at the technologies to ...
DocuFi, offering HAI and Infection Prevention Analytics
 
An Introduction to Document Scanning, Understanding Your Requirements
An Introduction to Document Scanning, Understanding Your RequirementsAn Introduction to Document Scanning, Understanding Your Requirements
An Introduction to Document Scanning, Understanding Your Requirements
DocuFi, offering HAI and Infection Prevention Analytics
 
Scanning & document management
Scanning & document managementScanning & document management
Scanning & document management
Gautam Ganguly
 
Why you need to use document scanning management system for business?
Why you need to use document scanning management system for business?Why you need to use document scanning management system for business?
Why you need to use document scanning management system for business?
Digismartek
 
Apa itu soft copy
Apa itu soft copyApa itu soft copy
Apa itu soft copy
johnthj
 
Document scanning and capture (local, central, outsource) what's working best
Document scanning and capture (local, central, outsource) what's working bestDocument scanning and capture (local, central, outsource) what's working best
Document scanning and capture (local, central, outsource) what's working best
Vander Loto
 
Scanning Document Types | Record Nations
Scanning Document Types | Record NationsScanning Document Types | Record Nations
Scanning Document Types | Record Nations
Record Nations
 
Ad

Similar to What is Document Indexing? A tutorial for intelligent data capture. (20)

What is Batch Document Processing? A tutorial for document capture.
What is Batch Document Processing?  A tutorial for document capture.What is Batch Document Processing?  A tutorial for document capture.
What is Batch Document Processing? A tutorial for document capture.
DocuFi, offering HAI and Infection Prevention Analytics
 
Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...
Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...
Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...
DocuFi, offering HAI and Infection Prevention Analytics
 
Automated Data Capture and Extraction with ChronoScan for Automated Metadata ...
Automated Data Capture and Extraction with ChronoScan for Automated Metadata ...Automated Data Capture and Extraction with ChronoScan for Automated Metadata ...
Automated Data Capture and Extraction with ChronoScan for Automated Metadata ...
DocuFi, offering HAI and Infection Prevention Analytics
 
Project report of OCR Recognition
Project report of OCR RecognitionProject report of OCR Recognition
Project report of OCR Recognition
Bharat Kalia
 
What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?
ARC Document Solutions
 
DU_SERIES_Session1.pdf
DU_SERIES_Session1.pdfDU_SERIES_Session1.pdf
DU_SERIES_Session1.pdf
RohitRadhakrishnan8
 
Applying ocr to extract information : Text mining
Applying ocr to extract information  : Text miningApplying ocr to extract information  : Text mining
Applying ocr to extract information : Text mining
Saurabh Singh
 
Document Parsing
Document ParsingDocument Parsing
Document Parsing
OliviaSmith160
 
Optical character recognization word
Optical character recognization wordOptical character recognization word
Optical character recognization word
Dhana K
 
File000162
File000162File000162
File000162
Desmond Devendran
 
CRC Final Report
CRC Final ReportCRC Final Report
CRC Final Report
Sangram Keshari Senapati
 
Im symposium presentation - OCR and Text analytics for Medical Chart Review ...
Im symposium presentation -  OCR and Text analytics for Medical Chart Review ...Im symposium presentation -  OCR and Text analytics for Medical Chart Review ...
Im symposium presentation - OCR and Text analytics for Medical Chart Review ...
Alex Zeltov
 
8 Document Capture Must Haves, a Document Management Tutorial
8 Document Capture Must Haves, a Document Management Tutorial8 Document Capture Must Haves, a Document Management Tutorial
8 Document Capture Must Haves, a Document Management Tutorial
DocuFi, offering HAI and Infection Prevention Analytics
 
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
IRJET Journal
 
How Image-to-Text Converters Work: A Comprehensive Guide
How Image-to-Text Converters Work: A Comprehensive GuideHow Image-to-Text Converters Work: A Comprehensive Guide
How Image-to-Text Converters Work: A Comprehensive Guide
imageocrcontact
 
search engines designed to support research on using statistical language models
search engines designed to support research on using statistical language modelssearch engines designed to support research on using statistical language models
search engines designed to support research on using statistical language models
CorporationMh
 
Understanding EDP (Electronic Data Processing) Environment
Understanding EDP (Electronic Data Processing) EnvironmentUnderstanding EDP (Electronic Data Processing) Environment
Understanding EDP (Electronic Data Processing) Environment
Adetula Bunmi
 
Docrecord
DocrecordDocrecord
Docrecord
guestd04e9
 
Docrecord
DocrecordDocrecord
Docrecord
Charles Clarke
 
OCR Document Reader Transforming Paper into Digital with Just One Click.docx
OCR Document Reader Transforming Paper into Digital with Just One Click.docxOCR Document Reader Transforming Paper into Digital with Just One Click.docx
OCR Document Reader Transforming Paper into Digital with Just One Click.docx
azapiai services
 
Project report of OCR Recognition
Project report of OCR RecognitionProject report of OCR Recognition
Project report of OCR Recognition
Bharat Kalia
 
What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?
ARC Document Solutions
 
Applying ocr to extract information : Text mining
Applying ocr to extract information  : Text miningApplying ocr to extract information  : Text mining
Applying ocr to extract information : Text mining
Saurabh Singh
 
Optical character recognization word
Optical character recognization wordOptical character recognization word
Optical character recognization word
Dhana K
 
Im symposium presentation - OCR and Text analytics for Medical Chart Review ...
Im symposium presentation -  OCR and Text analytics for Medical Chart Review ...Im symposium presentation -  OCR and Text analytics for Medical Chart Review ...
Im symposium presentation - OCR and Text analytics for Medical Chart Review ...
Alex Zeltov
 
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
IRJET Journal
 
How Image-to-Text Converters Work: A Comprehensive Guide
How Image-to-Text Converters Work: A Comprehensive GuideHow Image-to-Text Converters Work: A Comprehensive Guide
How Image-to-Text Converters Work: A Comprehensive Guide
imageocrcontact
 
search engines designed to support research on using statistical language models
search engines designed to support research on using statistical language modelssearch engines designed to support research on using statistical language models
search engines designed to support research on using statistical language models
CorporationMh
 
Understanding EDP (Electronic Data Processing) Environment
Understanding EDP (Electronic Data Processing) EnvironmentUnderstanding EDP (Electronic Data Processing) Environment
Understanding EDP (Electronic Data Processing) Environment
Adetula Bunmi
 
OCR Document Reader Transforming Paper into Digital with Just One Click.docx
OCR Document Reader Transforming Paper into Digital with Just One Click.docxOCR Document Reader Transforming Paper into Digital with Just One Click.docx
OCR Document Reader Transforming Paper into Digital with Just One Click.docx
azapiai services
 
Ad

More from DocuFi, offering HAI and Infection Prevention Analytics (10)

HAIvia Mobile for Infection Prevention Data Capture and Forms Management (for...
HAIvia Mobile for Infection Prevention Data Capture and Forms Management (for...HAIvia Mobile for Infection Prevention Data Capture and Forms Management (for...
HAIvia Mobile for Infection Prevention Data Capture and Forms Management (for...
DocuFi, offering HAI and Infection Prevention Analytics
 
Automated Document Indexing with ImageRamp
Automated Document Indexing with ImageRampAutomated Document Indexing with ImageRamp
Automated Document Indexing with ImageRamp
DocuFi, offering HAI and Infection Prevention Analytics
 
Custom Capture Tool Development
Custom Capture Tool DevelopmentCustom Capture Tool Development
Custom Capture Tool Development
DocuFi, offering HAI and Infection Prevention Analytics
 
Tips to Solve Common Problems Reading Barcodes
Tips to Solve Common Problems Reading BarcodesTips to Solve Common Problems Reading Barcodes
Tips to Solve Common Problems Reading Barcodes
DocuFi, offering HAI and Infection Prevention Analytics
 
Intelligent Data Capture Just Got Better, What's New in ImageRamp 6
Intelligent Data Capture Just Got Better, What's New in ImageRamp 6Intelligent Data Capture Just Got Better, What's New in ImageRamp 6
Intelligent Data Capture Just Got Better, What's New in ImageRamp 6
DocuFi, offering HAI and Infection Prevention Analytics
 
Batch Document Processing with ImageRamp Batch
Batch Document Processing with ImageRamp BatchBatch Document Processing with ImageRamp Batch
Batch Document Processing with ImageRamp Batch
DocuFi, offering HAI and Infection Prevention Analytics
 
Transformation in the Electric Utility Industry, Redevelopment of Decommissio...
Transformation in the Electric Utility Industry, Redevelopment of Decommissio...Transformation in the Electric Utility Industry, Redevelopment of Decommissio...
Transformation in the Electric Utility Industry, Redevelopment of Decommissio...
DocuFi, offering HAI and Infection Prevention Analytics
 
Automatic file naming and routing for scanned documents and existing files.
Automatic file naming and routing for scanned documents and existing files.  Automatic file naming and routing for scanned documents and existing files.
Automatic file naming and routing for scanned documents and existing files.
DocuFi, offering HAI and Infection Prevention Analytics
 
Improve OCR Accuracy, Clean Up and Enhance Scanned Images
Improve OCR Accuracy, Clean Up and Enhance Scanned ImagesImprove OCR Accuracy, Clean Up and Enhance Scanned Images
Improve OCR Accuracy, Clean Up and Enhance Scanned Images
DocuFi, offering HAI and Infection Prevention Analytics
 
Folder Watching For Automated Document Capture, Batch Scanning
Folder Watching For Automated Document Capture, Batch ScanningFolder Watching For Automated Document Capture, Batch Scanning
Folder Watching For Automated Document Capture, Batch Scanning
DocuFi, offering HAI and Infection Prevention Analytics
 

Recently uploaded (20)

FL Studio Producer Edition Crack 2025 Full Version
FL Studio Producer Edition Crack 2025 Full VersionFL Studio Producer Edition Crack 2025 Full Version
FL Studio Producer Edition Crack 2025 Full Version
tahirabibi60507
 
Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Andre Hora
 
Expand your AI adoption with AgentExchange
Expand your AI adoption with AgentExchangeExpand your AI adoption with AgentExchange
Expand your AI adoption with AgentExchange
Fexle Services Pvt. Ltd.
 
PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025
mu394968
 
Adobe Illustrator Crack FREE Download 2025 Latest Version
Adobe Illustrator Crack FREE Download 2025 Latest VersionAdobe Illustrator Crack FREE Download 2025 Latest Version
Adobe Illustrator Crack FREE Download 2025 Latest Version
kashifyounis067
 
Kubernetes_101_Zero_to_Platform_Engineer.pptx
Kubernetes_101_Zero_to_Platform_Engineer.pptxKubernetes_101_Zero_to_Platform_Engineer.pptx
Kubernetes_101_Zero_to_Platform_Engineer.pptx
CloudScouts
 
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
Egor Kaleynik
 
The Significance of Hardware in Information Systems.pdf
The Significance of Hardware in Information Systems.pdfThe Significance of Hardware in Information Systems.pdf
The Significance of Hardware in Information Systems.pdf
drewplanas10
 
Exploring Code Comprehension in Scientific Programming: Preliminary Insight...
Exploring Code Comprehension  in Scientific Programming:  Preliminary Insight...Exploring Code Comprehension  in Scientific Programming:  Preliminary Insight...
Exploring Code Comprehension in Scientific Programming: Preliminary Insight...
University of Hawai‘i at Mānoa
 
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
Andre Hora
 
Landscape of Requirements Engineering for/by AI through Literature Review
Landscape of Requirements Engineering for/by AI through Literature ReviewLandscape of Requirements Engineering for/by AI through Literature Review
Landscape of Requirements Engineering for/by AI through Literature Review
Hironori Washizaki
 
Avast Premium Security Crack FREE Latest Version 2025
Avast Premium Security Crack FREE Latest Version 2025Avast Premium Security Crack FREE Latest Version 2025
Avast Premium Security Crack FREE Latest Version 2025
mu394968
 
Maxon CINEMA 4D 2025 Crack FREE Download LINK
Maxon CINEMA 4D 2025 Crack FREE Download LINKMaxon CINEMA 4D 2025 Crack FREE Download LINK
Maxon CINEMA 4D 2025 Crack FREE Download LINK
younisnoman75
 
Automation Techniques in RPA - UiPath Certificate
Automation Techniques in RPA - UiPath CertificateAutomation Techniques in RPA - UiPath Certificate
Automation Techniques in RPA - UiPath Certificate
VICTOR MAESTRE RAMIREZ
 
Pixologic ZBrush Crack Plus Activation Key [Latest 2025] New Version
Pixologic ZBrush Crack Plus Activation Key [Latest 2025] New VersionPixologic ZBrush Crack Plus Activation Key [Latest 2025] New Version
Pixologic ZBrush Crack Plus Activation Key [Latest 2025] New Version
saimabibi60507
 
F-Secure Freedome VPN 2025 Crack Plus Activation New Version
F-Secure Freedome VPN 2025 Crack Plus Activation  New VersionF-Secure Freedome VPN 2025 Crack Plus Activation  New Version
F-Secure Freedome VPN 2025 Crack Plus Activation New Version
saimabibi60507
 
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Lionel Briand
 
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
steaveroggers
 
Designing AI-Powered APIs on Azure: Best Practices& Considerations
Designing AI-Powered APIs on Azure: Best Practices& ConsiderationsDesigning AI-Powered APIs on Azure: Best Practices& Considerations
Designing AI-Powered APIs on Azure: Best Practices& Considerations
Dinusha Kumarasiri
 
Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025
kashifyounis067
 
FL Studio Producer Edition Crack 2025 Full Version
FL Studio Producer Edition Crack 2025 Full VersionFL Studio Producer Edition Crack 2025 Full Version
FL Studio Producer Edition Crack 2025 Full Version
tahirabibi60507
 
Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Exceptional Behaviors: How Frequently Are They Tested? (AST 2025)
Andre Hora
 
Expand your AI adoption with AgentExchange
Expand your AI adoption with AgentExchangeExpand your AI adoption with AgentExchange
Expand your AI adoption with AgentExchange
Fexle Services Pvt. Ltd.
 
PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025
mu394968
 
Adobe Illustrator Crack FREE Download 2025 Latest Version
Adobe Illustrator Crack FREE Download 2025 Latest VersionAdobe Illustrator Crack FREE Download 2025 Latest Version
Adobe Illustrator Crack FREE Download 2025 Latest Version
kashifyounis067
 
Kubernetes_101_Zero_to_Platform_Engineer.pptx
Kubernetes_101_Zero_to_Platform_Engineer.pptxKubernetes_101_Zero_to_Platform_Engineer.pptx
Kubernetes_101_Zero_to_Platform_Engineer.pptx
CloudScouts
 
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
Egor Kaleynik
 
The Significance of Hardware in Information Systems.pdf
The Significance of Hardware in Information Systems.pdfThe Significance of Hardware in Information Systems.pdf
The Significance of Hardware in Information Systems.pdf
drewplanas10
 
Exploring Code Comprehension in Scientific Programming: Preliminary Insight...
Exploring Code Comprehension  in Scientific Programming:  Preliminary Insight...Exploring Code Comprehension  in Scientific Programming:  Preliminary Insight...
Exploring Code Comprehension in Scientific Programming: Preliminary Insight...
University of Hawai‘i at Mānoa
 
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
Andre Hora
 
Landscape of Requirements Engineering for/by AI through Literature Review
Landscape of Requirements Engineering for/by AI through Literature ReviewLandscape of Requirements Engineering for/by AI through Literature Review
Landscape of Requirements Engineering for/by AI through Literature Review
Hironori Washizaki
 
Avast Premium Security Crack FREE Latest Version 2025
Avast Premium Security Crack FREE Latest Version 2025Avast Premium Security Crack FREE Latest Version 2025
Avast Premium Security Crack FREE Latest Version 2025
mu394968
 
Maxon CINEMA 4D 2025 Crack FREE Download LINK
Maxon CINEMA 4D 2025 Crack FREE Download LINKMaxon CINEMA 4D 2025 Crack FREE Download LINK
Maxon CINEMA 4D 2025 Crack FREE Download LINK
younisnoman75
 
Automation Techniques in RPA - UiPath Certificate
Automation Techniques in RPA - UiPath CertificateAutomation Techniques in RPA - UiPath Certificate
Automation Techniques in RPA - UiPath Certificate
VICTOR MAESTRE RAMIREZ
 
Pixologic ZBrush Crack Plus Activation Key [Latest 2025] New Version
Pixologic ZBrush Crack Plus Activation Key [Latest 2025] New VersionPixologic ZBrush Crack Plus Activation Key [Latest 2025] New Version
Pixologic ZBrush Crack Plus Activation Key [Latest 2025] New Version
saimabibi60507
 
F-Secure Freedome VPN 2025 Crack Plus Activation New Version
F-Secure Freedome VPN 2025 Crack Plus Activation  New VersionF-Secure Freedome VPN 2025 Crack Plus Activation  New Version
F-Secure Freedome VPN 2025 Crack Plus Activation New Version
saimabibi60507
 
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Lionel Briand
 
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
How to Batch Export Lotus Notes NSF Emails to Outlook PST Easily?
steaveroggers
 
Designing AI-Powered APIs on Azure: Best Practices& Considerations
Designing AI-Powered APIs on Azure: Best Practices& ConsiderationsDesigning AI-Powered APIs on Azure: Best Practices& Considerations
Designing AI-Powered APIs on Azure: Best Practices& Considerations
Dinusha Kumarasiri
 
Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025
kashifyounis067
 

What is Document Indexing? A tutorial for intelligent data capture.

  • 1. What is Document Indexing? 1 in·dex /in-deks/ n. plural in·dex·es, in·di·ces /in-duh-seez/ a list (as of bibliographical information or citations to a body of literature) arranged usually in alphabetical order of some specified datum (as author, subject, or keyword): as a : a list of items (as topics or names) 2 in·dex /in-deks/ v. to provide an index for (something, such as a book) Copyright ©2014
  • 2. the process of tagging or associating information with a file so it can be used for search and retrieval purposes later Indexing:
  • 3. Indexing creates the “searchable” information that users will later use to find documents.
  • 4. Invoice Number? Customer/Employee Number? Customer/Employee Name? Date? Site ID? Patient Name?Doctor? Work Order Number? Waybill Number? Prescription Number?
  • 5. The index information is stored or integrated into a database or document/records management system which provides a framework for users to locate the documents. My Database
  • 6. There are two types of Indexes.
  • 7. Full-text indexing is just what the name implies; all the text of the document is indexed.
  • 8. When specific words or descriptions are indexed to create the searchable index fields, the information is referred to as “metadata.”
  • 10. “Documents are the currency of business. They are at the heart of critical workflows and drive just about every area of business.” -- IDC, “The Role of Documents: How They Drive Business, Today and Tomorrow”, January 2013
  • 11. Great care should be taken to design an efficient indexing scheme.
  • 12. If the process is not designed correctly at the outset, trying to rectify it later can be both difficult and costly. And in some environments such as legal, the cost of not locating a key document can be monumental. Avoid Disaster
  • 13. So how can indexing information be extracted with little to no user intervention?
  • 14. So how can indexing information be extracted with little to no user intervention? • Barcodes • Content Data Mining • Optical Character Recognition (OCR) • Zonal OCR • Drag and Drop OCR
  • 15. Intelligent data capture software can extract barcode data for indexing.
  • 16. Intelligent data capture software can extract barcode data for indexing. Barcodes can also be used for many other purposes such as file naming, splitting, bookmarking and routing.
  • 17. Files that contain text can be mined using various data mining techniques.
  • 18. OCR tools and technology such as Regular Expressions aid in text mining.
  • 19. Regular expression (regex) scripts are powerful tools to help identify keywords or actual strings of text for indexing from many source types. OCR tools and technology such as Regular Expressions aid in text mining.
  • 20. Regular expression (regex) scripts are powerful tools to help identify keywords or actual strings of text for indexing from many source types. The scripting process can look for words with specific characters, lengths, character types, or preceding keywords. OCR tools and technology such as Regular Expressions aid in text mining.
  • 21. If an inventory item should contain three alpha characters followed by five numbers, advanced indexing solutions can use regex to recognized this pattern and reject all documents with items not meeting this rule. The document can be tagged for manual inspection before further processing is done. Advanced indexing solutions offer Field Validation based on Regular Expressions. PEN21096 CAP36581 INV98453 PA568793
  • 22. Used to process EOB's or other records where the same document needs to be in multiple patient records or places. Advanced data capture solutions such as ImageRamp allow the operator to easily scan the EOB once, index the different patients' information via an onscreen keyboard, drag-and- drop OCR, or barcode reading methods, and route to the appropriate patients' records with little to no intervention. Advanced indexing solutions can accommodate special needs such as Scan Once, Index Many ImageRamp: Multiple Indexing, Naming and Routing of the Same Document Patient A Patient B Patient C Policy EOB
  • 23. Index Sources can be: • Print streams • Scanned documents • Existing files such as word processing and spreadsheets
  • 24. PDF print streams can be used to produce the source data for invoice runs or other AP/AR functions that can then be mined for index data and document splits.
  • 25. With OCR technology, make your scanned or image-based file fully text-searchable or extract data from a zone for indexing.
  • 26. With most data capture solutions, users often select the output file format as a “searchable PDF” to make a full-text index. This uses OCR technology to create a PDF file with two layers, an image layer and a text layer that can be used for full-text searching.
  • 27. With zonal OCR, document areas are identified for OCR capture. Drag-and-drop OCR lets an operator highlight document text which is automatically OCR'd and dropped into index fields.
  • 28. Now that I’ve captured my index data, what can I do?
  • 29. Now that I’ve captured my index data, what can I do? 1. Use a simple search and retrieval system
  • 30. Now that I’ve captured my index data, what can I do? 1. Use a simple search and retrieval system • Let’s you search on the index fields or free form search on full-text, searchable PDF files. • Can be a stepping stone to a full- fledged document management system later without loss of investment.
  • 31. Now that I’ve captured my index data, what can I do? 2. Send it to an existing document management or EMR/EHR system.
  • 32. Now that I’ve captured my index data, what can I do? 2. Send it to an existing document management or EMR/EHR system. Henry Schein, Dentrix, Dentrix Enterprise Dentrix Ascend, Easy Dental Viive, DentalVision, axiUm Filenet ANYONE via CSV, XML Laserfich e Documentum MyMedicalRecords Eaglesoft Allscripts Epic Dentrix Sharepoint CSV, XML standard formats
  • 33. Learn more about ImageRamp, intelligent data capture software and…
  • 34. Click for information on: • Understanding your scanning requirements • Using Regular Expressions for Automated Data Capture and Indexing • Make your Paperless Dreams Come True, using Fujitsu ScanSnap scanners for document capture • What can barcodes do for me? (in document Management/EMR Data capture) • 8 Must Haves for any Document Capture System • What is document Indexing document capture and processing:
  • 35. Contact us for more information on: • How to capture index data from print streams • Using Regex to capture index information, • More tutorial information on document management • Scanning documents for document management, • How to intelligently capture index data from your scans • Requirements for document management scanning • How to select a document capture or document scanning solution • Using touchscreen scanners such as the Fujitsu ScanSnap as an intelligent capture solution • Batch document scanning solutions • Document Management cost savings • EMR data capture • Batch Indexing solutions • Batch document indexing • Index documents • Create a document index • Document management index • Index from print stream • ECM index • Index ECM By DocuFi, makers of ImageRamp, Document Management Capture Solution 30 years’ experience in the Document Imaging market. Find out more at ImageRamp and www.docufi.com Copyright ©2014
  • 36. Image Credits • Dave Gray dgray_xplane, http://bit.ly/17xKYXp • Marcin Wichary, Alphabetical, http://bit.ly/1aILOku • Jim Morgan, database http://bit.ly/1ai0Nm3 • Liza liza31337, Book crease, http://bit.ly/1lWj8tL • UCL Faculty of Mathematical and Physical Sciences, Index, http://bit.ly/19q6GiI • Stuart Caie kyz, Indexed, http://bit.ly/Kfwbau • Spiffie, “Fujitsu ScanSnap S300M” http://bit.ly/1ksdhhv • Doctorwonder, “Stack O'Money!” http://bit.ly/1fgxpko • Boston Public Library, The card index department, http://bit.ly/1kygZq2 • Robyn Jay, robynejay Train wreck at Montparnasse 1895, http://bit.ly/19q8CYq • Theilr, spray, http://bit.ly/1hjGKp3 • Phil Whitehouse,Phillie Casablanca, Blue Zone, http://bit.ly/1hjGVAT • Seiichi Kusunoki Visual Maintenance, Bunch of Papers, http://bit.ly/1eJ8EZu • Patrick Hoesly, “Thank you” http://bit.ly/17xKErE All images are owned or licensed by DocuFi with acknowledgement given to: