0% found this document useful (0 votes)
95 views6 pages

Utility - Itextsharp: Business Definition Object

This document provides an overview of the Itextsharp business object in Blue Prism. Itextsharp allows extracting information from PDF files, including all text, specific pages of text, the number of pages, and titles. The business object uses Itextsharp DLL files to perform actions like getting data as a collection, extracting text and titles, getting text from a specific page, and getting the total number of pages. It takes a PDF file path as input and returns the extracted information as output.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
95 views6 pages

Utility - Itextsharp: Business Definition Object

This document provides an overview of the Itextsharp business object in Blue Prism. Itextsharp allows extracting information from PDF files, including all text, specific pages of text, the number of pages, and titles. The business object uses Itextsharp DLL files to perform actions like getting data as a collection, extracting text and titles, getting text from a specific page, and getting the total number of pages. It takes a PDF file path as input and returns the extracted information as output.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Utility -

ItextSharp
Business Definition Object

ITEXTSHARP | Skill Guide


Contents
About this document....................................................................................................................................... 3
About Itextsharp ............................................................................................................................................. 3
How Itextsharp Works .................................................................................................................................... 3
Business Object Actions.................................................................................................................................. 3
Get Data as Collection .............................................................................................................................. 3
Preconditions ........................................................................................................................................ 3
Endpoint ................................................................................................................................................ 3
Parameters ............................................................................................................................................ 3
Use Case ................................................................................................................................................ 3
Get text and Title from PDF file ............................................................................................................... 4
Preconditions ........................................................................................................................................ 4
Endpoint ................................................................................................................................................ 4
Parameters ............................................................................................................................................ 4
Get text per page...................................................................................................................................... 5
Preconditions ........................................................................................................................................ 5
Endpoint ................................................................................................................................................ 5
Parameters ............................................................................................................................................ 5
Get number of pages ................................................................................................................................ 6
Preconditions ........................................................................................................................................ 6
Endpoint ................................................................................................................................................ 6
Parameters ............................................................................................................................................ 6

ITEXTSHARP | Skill Guide


About this document

About Itextsharp
This object has the usefulness of extracting all the information of a PDF file, either to read all its
content, to obtain number of pages, titles etc.

How Itextsharp Works


This object needs of DLL files itextsharp.dll.

The object needs those DLL files referenced into its Initialise page, and the files must be into the Blue
Prism installation folder.

Business Object Actions


Get Data as Collection
This action extracts the text from a PDF file and saves it in a collection.

Preconditions
The action uses a code stage with variables parameterized as the path of the pdf file as input
and output value in a variable collection, if the stage has been executed correctly will fill to True the
variable success, and in case of error will fill us the variable Message.

If the code stage fails it is collected in a recover by sending an Exception

Endpoint
This action ends in a collection with the data obtained from a PDF.

Parameters

Parameter Direction Data Type Description


PDF Input Text Path to the PDF form file
Data Output Collection result of the value in a collection

Use Case

ITEXTSHARP | Skill Guide


Get text and Title from PDF file
This action takes text from a pdf file but also adds the title of the page.

Preconditions
the action performs through a path variable of the file an extraction of the text of the pdf and
also in another variable is the title of the page, all this is converted to text type.

Endpoint
the action ends with two variables in the termination stage with the text extracted in one
variable and the title in another.

Parameters

Parameter Direction Data Type Description


PDF Input Text Path to the PDF form file
Pdf Text Output Text All text obtained from PDF File
Pdf Title Output Text PDF's Title. Title is setted as the text between
the beginning and the text "in te vullen door de
contactpersoon van de gemeente"

ITEXTSHARP | Skill Guide


Get text per page
This action returns the text extracted from a PDF file according to the page number we pass it by
parameter.

Preconditions
The action only needs two parameterized variables that is the path of the pdf file and a number
that will be the page that will extract the text.

In case of failure the action is picked up by a recover by sending an Exception message.

Endpoint
The action ends at the end stage with the text obtained in a text type variable.

Parameters

Parameter Direction Data Type Description


PDF Input Text Path of the PDF File
Page Input Number Number of page to get
PageText Output Text Text got from the specific page

ITEXTSHARP | Skill Guide


Get number of pages
This action determines the total number of pages in a pdf file.

Preconditions
The action determines the total number of pages in a pdf file loaded by a path through a
variable.

Endpoint
The action ends at the end stage with the total number of pages in a file.

Parameters

Parameter Direction Data Type Description


PDF Input Text Path to the PDF file
NumberOfPages Output Number Total number of pages which compose the pdf file

ITEXTSHARP | Skill Guide

You might also like