SlideShare a Scribd company logo
With ChronoScan
Capture and Extraction:
Where ECM Begins
Capture means many things when speaking
of Document/Records Management or
Enterprise Content Management.
AIIM
Association for Information and
Image Management
“Capture boils down to entering
content into the system.”
Extraction is an important element
of Capture…
By extraction we mean pulling the
important information from the
content to use for classification or
taxonomy purposes, creation of
the appropriate metadata or tags,
and more.
Extraction is an important element
of Capture…
So Why is Capture and
Extraction so Important?
All Information Governance and Content
Management Depends on Correct Metadata
• Find key information on demand
• Apply the correct data security/privacy rules
• Determine the correct data retention
• Protect your entity regarding eDiscovery/legal
compliance issues
• Turn your content or knowledge into a
competitive advantage
You have to correctly identify the document or content to:
a comprehensive suite of software for
document scanning, data extraction
and integration into your ECM, CMIS
compliant, or line of business
database.
ChronoScan is:
The capture of
the “thing”:
• Scans
• Faxes
• Emails
• PrintStreams
Exterior Interior
Let’s categorize capture by what we’ll
call the Exterior and the Interior
The capture of the content of the
“thing”:
Actual data and information extracted
from the “thing” such as invoice
number, line items, customer number,
vendor number, patient
name…whatever your information
concerns.
This presentation
looks at the “interior”
capture accomplished
by ChronoScan’s
“extraction” features.
ChronoScan’s Extraction Features We’ll Examine
OCR technology is the
foundation for many of
ChronoScan’s auto extraction
capabilities.
Using sophisticated OCR
technologies such as Zonal OCR and
Grid OCR, ChronoScan can extract
data to classify the document and
create indexes (metadata or tags)
from structured and unstructured
documents.
Extract only data from the area of your document where your
important information is found for fast, automatic data
extraction.
Zonal OCR Capture
Use Dynamic Text Anchors to link to moving text using constant
or variable patterns, thus accommodating unstructured
documents.
Zonal OCR Capture
Here, ChronoScan finds the word “subtotal” and captures the data to the
right. Extracted data can be further manipulated and used for validation.
Optimize for your documents with multiple parameters like
image processing, OCR engine, type of data to find, regular
expression validation and more.
Zonal OCR Capture
Grid OCR is used for Line
Item Extraction and
Advanced Report
Breakdown or Dismount.
With Line Item
Extraction, extract and
manipulate line data found
on such forms as invoices
or delivery tickets.
Advanced Report Breakdown or Dismount
Convert complex reports to a structured data
format.
Convert complex PDF or scanned OCR
reports into a structured data format. With
this unique feature, ChronoScan is able to
break down complex reports automatically,
splitting every different record as an
independent processing unit. The software is
able to adapt extraction to different rules
and page limits to break down and structure
visually complex documents into a
compressible data file (CSV/XLS).
Advanced Report Breakdown or Dismount
Break Down
Extract
Converts complex reports to structured data.
ChronoScan breaks down complex reports
automatically, splitting every different record as an
independent processing unit.
Easily adapt extraction to different rules and page
limits to break down and structure visually complex
documents into a compressible data file (CSV/XLS).
(using sophisticated Grid OCR)
Nuance OCR
Plug-In Option
The world's most accurate
and robust OCR available.
• Dramatically increases zonal OCR
confidence
• Improves OCR triggers precision
• Better & faster background OCR
increases precision on regular
expression rules
• Better image orientation detection
Extract 1D/2D barcodes from your documents
and assign any part of them to fields for indexing,
database export, TXT report, file naming, etc.
Barcodes are tried and
true information tags.
Read Barcodes from Images
Assign custom actions based on the barcoded values such as set
field values, split documents, etc.
Process
Captured Data
1 2
Barcodes can be used on separator or slip sheets to designate
where documents should end and begin when a stack of
documents are scanned. And the barcode information on the
separator sheets can be extracted for indexing, naming and
routing purposes too.
ChronoScan imports
PDF files with native
text so you can easily
index the fields you
want and export your
data to TXT, CSV, Excel,
Word, HTML, and
OLE/ODBC databases
to easily feed your
indexing or database
application.
Automate PDF Processing Tasks
Automatically extract fields and tables from PDF files.
ChronoScan learns the Document
Type using comprehensive layout
recognition features to “remember”
user actions. Every different
document type can be assigned to a
different template or job to customize
OCR areas, settings and actions.
Result: Scan/import documents
together, without previous
preparation to automate repetitive
tasks and improve data input.
Automatic Document
Learning:
Training ChronoScan to identify
documents with Intelligent
Document Recognition to
automatically capture information
Type 1 Documents
Type 2 Documents
Once data is identified, it can
be used for many purposes
besides indexing or metadata
creation.
Validation
File Naming
File Splitting Routing
Classification
ECM Integration
Bookmarking
Metadata
Once data is identified, it can
be used for many purposes
besides indexing or metadata
creation.
Relying on manual scrutiny to bring this “wild content” under control simply
will not work. The failure of humans to consistently tag and classify new
documents as they are filed has created the mess in the first place.
© AIIM 2014, www.aiim.org
Remember, Everything Depends on Correct
Metadata
Relying on manual scrutiny to bring this “wild content” under control simply
will not work. The failure of humans to consistently tag and classify new
documents as they are filed has created the mess in the first place.
Remember, Everything Depends on Correct
Metadata
The Key:
Automatic Metadata Creation
With ChronoScan
© AIIM 2014, www.aiim.org
For more on:
• Automated document classification
• Automated metadata creation
• Batch Document processing
• Batch PDF mining
• Batch text mining
• Batch TIF mining
• Text mining
• Extracting metadata,
• Data extraction from unstructured data
• Intelligent data capture
• Data extraction
• Using regex to extract data
• Document scanning
• Extracting data
• Extract meta data,
• Scanner software,
• Barcode recognition,
• OCR software,
• Capture tutorial
• Pdf scanning,
• Scanning software
• Indexing
• Document indexing
• Automated capture
• Meta data
• Docufi
• Imageramp
• ChronoScan
• Data capture
• What is ChronoScan
• US Chronoscan reseller
• ChronoScan in the US
www.docufi.com info@docufi.com
Copyright ©2014
Get Started With Us
Our solutions include, ImageRamp Batch for folder processing, and
ChronoScan Capture for advanced data mining and barcode requirements.
Built on over 30 years’ experience in the Document Imaging and Capture market
DocuFi is a premier ChronoScan Solutions Partner offering
extensive professional services to configure the system to
your specific requirements. DocuFi has been providing
custom solutions into health care, financial services, retail,
educational and other markets since 2010.
Learn More:
Ad

More Related Content

What's hot (20)

Painless Document Scanning and Indexing with Alfresco
Painless Document Scanning and Indexing with AlfrescoPainless Document Scanning and Indexing with Alfresco
Painless Document Scanning and Indexing with Alfresco
BlueFishTX
 
Automatic file naming and routing for scanned documents and existing files.
Automatic file naming and routing for scanned documents and existing files.  Automatic file naming and routing for scanned documents and existing files.
Automatic file naming and routing for scanned documents and existing files.
DocuFi, offering HAI and Infection Prevention Analytics
 
Folder Watching For Automated Document Capture, Batch Scanning
Folder Watching For Automated Document Capture, Batch ScanningFolder Watching For Automated Document Capture, Batch Scanning
Folder Watching For Automated Document Capture, Batch Scanning
DocuFi, offering HAI and Infection Prevention Analytics
 
8 Document Capture Must Haves, a Document Management Tutorial
8 Document Capture Must Haves, a Document Management Tutorial8 Document Capture Must Haves, a Document Management Tutorial
8 Document Capture Must Haves, a Document Management Tutorial
DocuFi, offering HAI and Infection Prevention Analytics
 
Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...
Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...
Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...
DocuFi, offering HAI and Infection Prevention Analytics
 
Batch Document Processing with ImageRamp Batch
Batch Document Processing with ImageRamp BatchBatch Document Processing with ImageRamp Batch
Batch Document Processing with ImageRamp Batch
DocuFi, offering HAI and Infection Prevention Analytics
 
Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...
Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...
Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...
DocuFi, offering HAI and Infection Prevention Analytics
 
An Introduction to Document Scanning, Understanding Your Requirements
An Introduction to Document Scanning, Understanding Your RequirementsAn Introduction to Document Scanning, Understanding Your Requirements
An Introduction to Document Scanning, Understanding Your Requirements
DocuFi, offering HAI and Infection Prevention Analytics
 
What is Document Indexing? A tutorial for intelligent data capture.
What is Document Indexing? A tutorial for intelligent data capture.What is Document Indexing? A tutorial for intelligent data capture.
What is Document Indexing? A tutorial for intelligent data capture.
DocuFi, offering HAI and Infection Prevention Analytics
 
Improve OCR Accuracy, Clean Up and Enhance Scanned Images
Improve OCR Accuracy, Clean Up and Enhance Scanned ImagesImprove OCR Accuracy, Clean Up and Enhance Scanned Images
Improve OCR Accuracy, Clean Up and Enhance Scanned Images
DocuFi, offering HAI and Infection Prevention Analytics
 
PDF vs. TIFF, An Evaluation of Document Scanning File Formats
PDF vs. TIFF, An Evaluation of Document Scanning File FormatsPDF vs. TIFF, An Evaluation of Document Scanning File Formats
PDF vs. TIFF, An Evaluation of Document Scanning File Formats
DocuFi, offering HAI and Infection Prevention Analytics
 
DocuSolve Scanning Solutions
DocuSolve Scanning SolutionsDocuSolve Scanning Solutions
DocuSolve Scanning Solutions
Gordon Bishop
 
Oce e-Copy Barcode Recognition Services
Oce e-Copy Barcode Recognition ServicesOce e-Copy Barcode Recognition Services
Oce e-Copy Barcode Recognition Services
Andrew Bain
 
Steering Away from Bolted-On Analytics
Steering Away from Bolted-On AnalyticsSteering Away from Bolted-On Analytics
Steering Away from Bolted-On Analytics
Connexica
 
Data Warehouse Architectures
Data Warehouse ArchitecturesData Warehouse Architectures
Data Warehouse Architectures
Theju Paul
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture Notes
FellowBuddy.com
 
Denodo Platform 7.0: Redefine Analytics with In-Memory Parallel Processing an...
Denodo Platform 7.0: Redefine Analytics with In-Memory Parallel Processing an...Denodo Platform 7.0: Redefine Analytics with In-Memory Parallel Processing an...
Denodo Platform 7.0: Redefine Analytics with In-Memory Parallel Processing an...
Denodo
 
Take control over GDPR compliance with ContentMap software!
Take control over GDPR compliance with ContentMap software!Take control over GDPR compliance with ContentMap software!
Take control over GDPR compliance with ContentMap software!
Pär Eliasson
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
Aashish Rathod
 
ITGS - Data And Databases
ITGS - Data And DatabasesITGS - Data And Databases
ITGS - Data And Databases
Konrad Konlechner
 
Painless Document Scanning and Indexing with Alfresco
Painless Document Scanning and Indexing with AlfrescoPainless Document Scanning and Indexing with Alfresco
Painless Document Scanning and Indexing with Alfresco
BlueFishTX
 
DocuSolve Scanning Solutions
DocuSolve Scanning SolutionsDocuSolve Scanning Solutions
DocuSolve Scanning Solutions
Gordon Bishop
 
Oce e-Copy Barcode Recognition Services
Oce e-Copy Barcode Recognition ServicesOce e-Copy Barcode Recognition Services
Oce e-Copy Barcode Recognition Services
Andrew Bain
 
Steering Away from Bolted-On Analytics
Steering Away from Bolted-On AnalyticsSteering Away from Bolted-On Analytics
Steering Away from Bolted-On Analytics
Connexica
 
Data Warehouse Architectures
Data Warehouse ArchitecturesData Warehouse Architectures
Data Warehouse Architectures
Theju Paul
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture Notes
FellowBuddy.com
 
Denodo Platform 7.0: Redefine Analytics with In-Memory Parallel Processing an...
Denodo Platform 7.0: Redefine Analytics with In-Memory Parallel Processing an...Denodo Platform 7.0: Redefine Analytics with In-Memory Parallel Processing an...
Denodo Platform 7.0: Redefine Analytics with In-Memory Parallel Processing an...
Denodo
 
Take control over GDPR compliance with ContentMap software!
Take control over GDPR compliance with ContentMap software!Take control over GDPR compliance with ContentMap software!
Take control over GDPR compliance with ContentMap software!
Pär Eliasson
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
Aashish Rathod
 

Similar to Automated Data Capture and Extraction with ChronoScan for Automated Metadata and Classification (20)

Document Parsing
Document ParsingDocument Parsing
Document Parsing
OliviaSmith160
 
SoftTrac Synergetics
SoftTrac SynergeticsSoftTrac Synergetics
SoftTrac Synergetics
J.R. Herrington
 
AI-Driven News & Article Data Scraping: A Deep Dive into Content Extraction
AI-Driven News & Article Data Scraping: A Deep Dive into Content ExtractionAI-Driven News & Article Data Scraping: A Deep Dive into Content Extraction
AI-Driven News & Article Data Scraping: A Deep Dive into Content Extraction
Web Screen Scraping
 
Modern Document Processing | Nanonets Blog.pdf
Modern Document Processing | Nanonets Blog.pdfModern Document Processing | Nanonets Blog.pdf
Modern Document Processing | Nanonets Blog.pdf
DhanashreeBadhe
 
Automate The Process Of Textual Data Extraction From Images.pdf
Automate The Process Of Textual Data Extraction From Images.pdfAutomate The Process Of Textual Data Extraction From Images.pdf
Automate The Process Of Textual Data Extraction From Images.pdf
Data Scraping and Data Extraction
 
Drivve overview
Drivve overviewDrivve overview
Drivve overview
Lembit
 
No Code Data Transformation for Insurance with Altair Monarch
No Code Data Transformation for Insurance with Altair MonarchNo Code Data Transformation for Insurance with Altair Monarch
No Code Data Transformation for Insurance with Altair Monarch
Altair
 
Best Features of Document Management System Software | Digismartek
Best Features of Document Management System Software | Digismartek Best Features of Document Management System Software | Digismartek
Best Features of Document Management System Software | Digismartek
Digismartek
 
Existco Scan and File Utility
Existco Scan and File UtilityExistco Scan and File Utility
Existco Scan and File Utility
Existco Pty Ltd
 
Applying ocr to extract information : Text mining
Applying ocr to extract information  : Text miningApplying ocr to extract information  : Text mining
Applying ocr to extract information : Text mining
Saurabh Singh
 
Ecm model
Ecm modelEcm model
Ecm model
KaleemSarwar2
 
Data Capture Solution for Logistics
Data Capture Solution for LogisticsData Capture Solution for Logistics
Data Capture Solution for Logistics
Dokumentive
 
DU_SERIES_Session1.pdf
DU_SERIES_Session1.pdfDU_SERIES_Session1.pdf
DU_SERIES_Session1.pdf
RohitRadhakrishnan8
 
UiPath Document Understanding_Day 2.pptx
UiPath Document Understanding_Day 2.pptxUiPath Document Understanding_Day 2.pptx
UiPath Document Understanding_Day 2.pptx
RohitRadhakrishnan8
 
Effective Document Capture in SharePoint - SharePoint Saturday Cape Town - 22...
Effective Document Capture in SharePoint - SharePoint Saturday Cape Town - 22...Effective Document Capture in SharePoint - SharePoint Saturday Cape Town - 22...
Effective Document Capture in SharePoint - SharePoint Saturday Cape Town - 22...
Alistair Pugin
 
Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...
Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...
Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...
ISIS Papyrus Software
 
UiPath Document Understanding_Day 3.pptx
UiPath Document Understanding_Day 3.pptxUiPath Document Understanding_Day 3.pptx
UiPath Document Understanding_Day 3.pptx
UiPathCommunity
 
Data Warehouse for data analytics presentation
Data Warehouse for data analytics presentationData Warehouse for data analytics presentation
Data Warehouse for data analytics presentation
21132067
 
InfoDNA Everteam houston breakfast 06.29.17
InfoDNA Everteam houston breakfast 06.29.17InfoDNA Everteam houston breakfast 06.29.17
InfoDNA Everteam houston breakfast 06.29.17
Everteam
 
iData Sciences Product Overview
iData Sciences Product OverviewiData Sciences Product Overview
iData Sciences Product Overview
jvsrinivas1
 
AI-Driven News & Article Data Scraping: A Deep Dive into Content Extraction
AI-Driven News & Article Data Scraping: A Deep Dive into Content ExtractionAI-Driven News & Article Data Scraping: A Deep Dive into Content Extraction
AI-Driven News & Article Data Scraping: A Deep Dive into Content Extraction
Web Screen Scraping
 
Modern Document Processing | Nanonets Blog.pdf
Modern Document Processing | Nanonets Blog.pdfModern Document Processing | Nanonets Blog.pdf
Modern Document Processing | Nanonets Blog.pdf
DhanashreeBadhe
 
Automate The Process Of Textual Data Extraction From Images.pdf
Automate The Process Of Textual Data Extraction From Images.pdfAutomate The Process Of Textual Data Extraction From Images.pdf
Automate The Process Of Textual Data Extraction From Images.pdf
Data Scraping and Data Extraction
 
Drivve overview
Drivve overviewDrivve overview
Drivve overview
Lembit
 
No Code Data Transformation for Insurance with Altair Monarch
No Code Data Transformation for Insurance with Altair MonarchNo Code Data Transformation for Insurance with Altair Monarch
No Code Data Transformation for Insurance with Altair Monarch
Altair
 
Best Features of Document Management System Software | Digismartek
Best Features of Document Management System Software | Digismartek Best Features of Document Management System Software | Digismartek
Best Features of Document Management System Software | Digismartek
Digismartek
 
Existco Scan and File Utility
Existco Scan and File UtilityExistco Scan and File Utility
Existco Scan and File Utility
Existco Pty Ltd
 
Applying ocr to extract information : Text mining
Applying ocr to extract information  : Text miningApplying ocr to extract information  : Text mining
Applying ocr to extract information : Text mining
Saurabh Singh
 
Data Capture Solution for Logistics
Data Capture Solution for LogisticsData Capture Solution for Logistics
Data Capture Solution for Logistics
Dokumentive
 
UiPath Document Understanding_Day 2.pptx
UiPath Document Understanding_Day 2.pptxUiPath Document Understanding_Day 2.pptx
UiPath Document Understanding_Day 2.pptx
RohitRadhakrishnan8
 
Effective Document Capture in SharePoint - SharePoint Saturday Cape Town - 22...
Effective Document Capture in SharePoint - SharePoint Saturday Cape Town - 22...Effective Document Capture in SharePoint - SharePoint Saturday Cape Town - 22...
Effective Document Capture in SharePoint - SharePoint Saturday Cape Town - 22...
Alistair Pugin
 
Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...
Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...
Inbound Mail Processing - Technology Innovation Brochure by ISIS Papyrus Soft...
ISIS Papyrus Software
 
UiPath Document Understanding_Day 3.pptx
UiPath Document Understanding_Day 3.pptxUiPath Document Understanding_Day 3.pptx
UiPath Document Understanding_Day 3.pptx
UiPathCommunity
 
Data Warehouse for data analytics presentation
Data Warehouse for data analytics presentationData Warehouse for data analytics presentation
Data Warehouse for data analytics presentation
21132067
 
InfoDNA Everteam houston breakfast 06.29.17
InfoDNA Everteam houston breakfast 06.29.17InfoDNA Everteam houston breakfast 06.29.17
InfoDNA Everteam houston breakfast 06.29.17
Everteam
 
iData Sciences Product Overview
iData Sciences Product OverviewiData Sciences Product Overview
iData Sciences Product Overview
jvsrinivas1
 
Ad

Recently uploaded (20)

Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfComplete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Software Company
 
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes
 
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc
 
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
SOFTTECHHUB
 
Build Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For DevsBuild Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For Devs
Brian McKeiver
 
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
Alan Dix
 
How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?
Daniel Lehner
 
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager APIUiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPathCommunity
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx
Samuele Fogagnolo
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Big Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur MorganBig Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
Drupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy ConsumptionDrupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy Consumption
Exove
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfComplete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Software Company
 
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes
 
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc
 
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
SOFTTECHHUB
 
Build Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For DevsBuild Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For Devs
Brian McKeiver
 
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
Alan Dix
 
How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?
Daniel Lehner
 
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager APIUiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPathCommunity
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx
Samuele Fogagnolo
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Big Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur MorganBig Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
Drupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy ConsumptionDrupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy Consumption
Exove
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
Ad

Automated Data Capture and Extraction with ChronoScan for Automated Metadata and Classification

  • 1. With ChronoScan Capture and Extraction: Where ECM Begins
  • 2. Capture means many things when speaking of Document/Records Management or Enterprise Content Management.
  • 3. AIIM Association for Information and Image Management “Capture boils down to entering content into the system.”
  • 4. Extraction is an important element of Capture…
  • 5. By extraction we mean pulling the important information from the content to use for classification or taxonomy purposes, creation of the appropriate metadata or tags, and more. Extraction is an important element of Capture…
  • 6. So Why is Capture and Extraction so Important?
  • 7. All Information Governance and Content Management Depends on Correct Metadata • Find key information on demand • Apply the correct data security/privacy rules • Determine the correct data retention • Protect your entity regarding eDiscovery/legal compliance issues • Turn your content or knowledge into a competitive advantage You have to correctly identify the document or content to:
  • 8. a comprehensive suite of software for document scanning, data extraction and integration into your ECM, CMIS compliant, or line of business database. ChronoScan is:
  • 9. The capture of the “thing”: • Scans • Faxes • Emails • PrintStreams Exterior Interior Let’s categorize capture by what we’ll call the Exterior and the Interior The capture of the content of the “thing”: Actual data and information extracted from the “thing” such as invoice number, line items, customer number, vendor number, patient name…whatever your information concerns.
  • 10. This presentation looks at the “interior” capture accomplished by ChronoScan’s “extraction” features.
  • 12. OCR technology is the foundation for many of ChronoScan’s auto extraction capabilities.
  • 13. Using sophisticated OCR technologies such as Zonal OCR and Grid OCR, ChronoScan can extract data to classify the document and create indexes (metadata or tags) from structured and unstructured documents.
  • 14. Extract only data from the area of your document where your important information is found for fast, automatic data extraction. Zonal OCR Capture
  • 15. Use Dynamic Text Anchors to link to moving text using constant or variable patterns, thus accommodating unstructured documents. Zonal OCR Capture Here, ChronoScan finds the word “subtotal” and captures the data to the right. Extracted data can be further manipulated and used for validation.
  • 16. Optimize for your documents with multiple parameters like image processing, OCR engine, type of data to find, regular expression validation and more. Zonal OCR Capture
  • 17. Grid OCR is used for Line Item Extraction and Advanced Report Breakdown or Dismount.
  • 18. With Line Item Extraction, extract and manipulate line data found on such forms as invoices or delivery tickets.
  • 19. Advanced Report Breakdown or Dismount Convert complex reports to a structured data format. Convert complex PDF or scanned OCR reports into a structured data format. With this unique feature, ChronoScan is able to break down complex reports automatically, splitting every different record as an independent processing unit. The software is able to adapt extraction to different rules and page limits to break down and structure visually complex documents into a compressible data file (CSV/XLS). Advanced Report Breakdown or Dismount Break Down Extract Converts complex reports to structured data.
  • 20. ChronoScan breaks down complex reports automatically, splitting every different record as an independent processing unit.
  • 21. Easily adapt extraction to different rules and page limits to break down and structure visually complex documents into a compressible data file (CSV/XLS). (using sophisticated Grid OCR)
  • 22. Nuance OCR Plug-In Option The world's most accurate and robust OCR available. • Dramatically increases zonal OCR confidence • Improves OCR triggers precision • Better & faster background OCR increases precision on regular expression rules • Better image orientation detection
  • 23. Extract 1D/2D barcodes from your documents and assign any part of them to fields for indexing, database export, TXT report, file naming, etc. Barcodes are tried and true information tags.
  • 24. Read Barcodes from Images Assign custom actions based on the barcoded values such as set field values, split documents, etc. Process Captured Data 1 2
  • 25. Barcodes can be used on separator or slip sheets to designate where documents should end and begin when a stack of documents are scanned. And the barcode information on the separator sheets can be extracted for indexing, naming and routing purposes too.
  • 26. ChronoScan imports PDF files with native text so you can easily index the fields you want and export your data to TXT, CSV, Excel, Word, HTML, and OLE/ODBC databases to easily feed your indexing or database application. Automate PDF Processing Tasks Automatically extract fields and tables from PDF files.
  • 27. ChronoScan learns the Document Type using comprehensive layout recognition features to “remember” user actions. Every different document type can be assigned to a different template or job to customize OCR areas, settings and actions. Result: Scan/import documents together, without previous preparation to automate repetitive tasks and improve data input. Automatic Document Learning: Training ChronoScan to identify documents with Intelligent Document Recognition to automatically capture information Type 1 Documents Type 2 Documents
  • 28. Once data is identified, it can be used for many purposes besides indexing or metadata creation.
  • 29. Validation File Naming File Splitting Routing Classification ECM Integration Bookmarking Metadata Once data is identified, it can be used for many purposes besides indexing or metadata creation.
  • 30. Relying on manual scrutiny to bring this “wild content” under control simply will not work. The failure of humans to consistently tag and classify new documents as they are filed has created the mess in the first place. © AIIM 2014, www.aiim.org Remember, Everything Depends on Correct Metadata
  • 31. Relying on manual scrutiny to bring this “wild content” under control simply will not work. The failure of humans to consistently tag and classify new documents as they are filed has created the mess in the first place. Remember, Everything Depends on Correct Metadata The Key: Automatic Metadata Creation With ChronoScan © AIIM 2014, www.aiim.org
  • 32. For more on: • Automated document classification • Automated metadata creation • Batch Document processing • Batch PDF mining • Batch text mining • Batch TIF mining • Text mining • Extracting metadata, • Data extraction from unstructured data • Intelligent data capture • Data extraction • Using regex to extract data • Document scanning • Extracting data • Extract meta data, • Scanner software, • Barcode recognition, • OCR software, • Capture tutorial • Pdf scanning, • Scanning software • Indexing • Document indexing • Automated capture • Meta data • Docufi • Imageramp • ChronoScan • Data capture • What is ChronoScan • US Chronoscan reseller • ChronoScan in the US www.docufi.com [email protected] Copyright ©2014 Get Started With Us Our solutions include, ImageRamp Batch for folder processing, and ChronoScan Capture for advanced data mining and barcode requirements. Built on over 30 years’ experience in the Document Imaging and Capture market DocuFi is a premier ChronoScan Solutions Partner offering extensive professional services to configure the system to your specific requirements. DocuFi has been providing custom solutions into health care, financial services, retail, educational and other markets since 2010.