Setting Up A Simple OCR Server
by Real Python
Table of Contents
Why Use Python for OCR?
Beginning Steps
Downloading Dependencies
What’s Happening?
Building Leptonica and Tesseract
Leptonica
Tesseract
Environment Variable
Tesseract Packages
Web-server time!
Let’s Make an OCR Engine
Optional: Building a CLI Tool for Your New OCR Engine
Back to the Server
Let’s Test!
Example
Front-end
Conclusion and Next Steps
The following is a collaboration piece between Bobby Grayson, a software developer at Ahalogy, and Real Python.
Beginning Steps
We’ll start by developing the Flask back-end layer to serve the results of the OCR engine. From there you can just hit the
endpoint and serve the results to the end user in the manner that suits you. All of this is covered in detail by the tutorial.
We’ll also add a bit of back-end code to generate an HTML form as well as the front-end code to consume the API. This
will not be covered by the tutorial, but you will have access to the code.
First, we have to install some dependencies. As always, configuring your environment is 90% of the fun.
This post has been tested on Ubuntu 14.04, but it should work for the 12.x and 13.x versions as well. If you’re
running OS X, you can use VirtualBox or Docker (a Dockerfile along with an install guide is included) or
a droplet on DigitalOcean (recommended!) to create the appropriate environment.
Downloading Dependencies
To start, we need Tesseract and all of its dependencies, which include Leptonica, as well as some other packages
that power these two, for sanity checks.
NOTE: You can also use the _run.sh shell script to quickly install the dependencies along with Leptonica and
Tesseract. If you go this route, skip down to the Web-server time! section. But please consider manually building
these libraries if you have not done so before (for learning purposes).
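Here is a typical set of install commands for Ubuntu 14.04. Treat it as a sketch rather than gospel: the exact versioned package names (libpng12-dev, libjpeg62-dev, libtiff4-dev, and friends) vary a bit between Ubuntu releases.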
Shell
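$ sudo apt-get update
$ sudo apt-get install build-essential autoconf automake libtool
$ sudo apt-get install libpng12-dev libjpeg62-dev libtiff4-dev zlib1g-dev
$ sudo apt-get install python2.7 python2.7-dev
$ sudo apt-get install python-imaging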
What’s Happening?
Put simply, sudo apt-get update means “make sure we have the latest package listings”. We then grab a number
of libraries that allow us to toy with images, e.g., libtiff, libpng, and so on. Beyond that, we grab Python 2.7, our
programming language of choice, along with the python-imaging library for interaction with all these pieces.
Speaking of images, we need ImageMagick as well if we want to toy with (edit) the images before we throw them in
programmatically.
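On Ubuntu, that’s a one-line install: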
Shell
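$ sudo apt-get install imagemagick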
Leptonica
Now, time for Leptonica, finally!
Shell
$ wget http://www.leptonica.org/source/leptonica-1.70.tar.gz
$ tar -zxvf leptonica-1.70.tar.gz
$ cd leptonica-1.70/
$ ./autobuild
$ ./configure
$ make
$ sudo make install
$ sudo ldconfig
If this is your first time playing with tar, here’s what’s happening:
- wget pulls down the compressed Leptonica source archive.
- tar -zxvf un-gzips (z) and extracts (x) the archive file (f), verbosely listing each file as it goes (v).
- cd drops us into the extracted directory, where ./autobuild and ./configure set up the build for our system.
- make compiles the library, sudo make install copies it into the system directories, and sudo ldconfig refreshes the shared-library cache so the linker can find it.
Tesseract
And now to download and build Tesseract…
Shell
$ cd ..
$ wget https://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.02.tar.gz
$ tar -zxvf tesseract-ocr-3.02.02.tar.gz
$ cd tesseract-ocr/
$ ./autogen.sh
$ ./configure
$ make
$ sudo make install
$ sudo ldconfig
The process here mirrors the Leptonica one almost perfectly. So to keep this DRY, see the Leptonica explanation for
more information.
Environment Variable
We need to set up an environment variable to source our Tesseract data:
Shell
$ export TESSDATA_PREFIX=/usr/local/share/
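Note that this export only lasts for the current shell session. To persist it across logins, you can append it to your shell profile:

Shell
$ echo 'export TESSDATA_PREFIX=/usr/local/share/' >> ~/.bashrc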
Tesseract Packages
Finally, let’s get the relevant Tesseract English language packages:
Shell
$ cd ..
$ wget https://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.eng.tar.gz
$ tar -xf tesseract-ocr-3.02.eng.tar.gz
$ sudo cp -r tesseract-ocr/tessdata $TESSDATA_PREFIX
BOOM! We now have Tesseract. We can use the CLI to test it, as shown below; feel free to read the docs if you want to play. However, we
need a Python wrapper to truly achieve our end goal. So the next step is to set up a Flask server along with a basic API
that accepts POST requests:
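For a quick sanity check, point the tesseract CLI at any image with text in it (test.png here is just a placeholder filename); the recognized text lands in out.txt:

Shell
$ tesseract test.png out
$ cat out.txt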
Web-server time!
Now, on to the fun stuff. First, we need to build a way to interface with Tesseract via Python. We COULD use popen, but
that just feels wrong/un-Pythonic. Instead, we can use a very minimal but functional Python package that wraps Tesseract:
pytesseract.
Want to get started quickly? Run the _app.sh shell script. Or you can set up the application manually by grabbing the
boilerplate code/structure here and then running the following commands:
Shell
$ wget https://github.com/rhgraysonii/ocr_tutorial/archive/v0.tar.gz
$ tar -xf v0.tar.gz
$ mv ocr_tutorial-0/* ../home/
$ cd ../home
$ sudo apt-get install python-virtualenv
$ virtualenv env
$ source env/bin/activate
$ pip install -r requirements.txt
NOTE: The Flask Boilerplate (maintained by Real Python) is a wonderful starting point for getting a simple, Pythonic
server running. We customized this for our base application. Check out the Flask Boilerplate repository for more
info.
Python
import pytesseract
import requests
from PIL import Image
from PIL import ImageFilter
from StringIO import StringIO


def process_image(url):
    # Grab the image, sharpen it to crisp up the text, then run OCR
    image = _get_image(url)
    image = image.filter(ImageFilter.SHARPEN)
    return pytesseract.image_to_string(image)


def _get_image(url):
    # Download the image and wrap the raw bytes in a PIL Image
    return Image.open(StringIO(requests.get(url).content))
Wonderful!
So, in our main method, process_image(), we sharpen the image to crisp up the text. Note that PIL’s filter() returns a new image rather than modifying the original in place, so we assign the result back before handing it to pytesseract.
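As a quick usage sketch: assuming the engine above is saved as ocr.py, you can exercise it from a Python shell (the URL is just a placeholder):

Python
>>> from ocr import process_image
>>> print process_image('http://example.com/some_image.jpg')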
Optional: Building a CLI Tool for Your New OCR Engine
Python
import sys
import requests
import pytesseract
from PIL import Image
from StringIO import StringIO


def get_image(url):
    return Image.open(StringIO(requests.get(url).content))


if __name__ == '__main__':
    """Tool to test the raw output of pytesseract with a given input URL"""
    sys.stdout.write("""
===OOOO=====CCCCC===RRRRRR=====\n
==OO==OO===CC=======RR===RR====\n
==OO==OO===CC=======RR===RR====\n
==OO==OO===CC=======RRRRRR=====\n
==OO==OO===CC=======RR==RR=====\n
==OO==OO===CC=======RR== RR====\n
===OOOO=====CCCCC===RR====RR===\n\n
""")
    sys.stdout.write("A simple OCR utility\n")
    url = raw_input("What is the url of the image you would like to analyze?\n")
    image = get_image(url)
    sys.stdout.write("The raw output from tesseract with no processing is:\n\n")
    sys.stdout.write("-----------------BEGIN-----------------\n")
    sys.stdout.write(pytesseract.image_to_string(image) + "\n")
    sys.stdout.write("------------------END------------------\n")
This is really quite simple: we grab the text output from our engine and write it to STDOUT. Test it out
(python flask_server/cli.py) with a few image URLs, or play with your own ASCII art for a good time.
Back to the Server
Python
@app.route('/v{}/ocr'.format(_VERSION), methods=["POST"])
def ocr():
    try:
        url = request.json['image_url']
        if 'jpg' in url:
            output = process_image(url)
            return jsonify({"output": output})
        else:
            return jsonify({"error": "only .jpg files, please"})
    except:
        return jsonify(
            {"error": "Did you mean to send: {'image_url': 'some_jpeg_url'}"}
        )
Python
import os
import logging
from logging import Formatter, FileHandler
from flask import Flask, request, jsonify
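The route also needs the engine’s process_image() function. Assuming the engine module above is saved as ocr.py next to app.py, one more import takes care of it: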
Python
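from ocr import process_image  # assumes the engine module above is named ocr.py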
Now, as you can see, we just add the output of the engine’s process_image() method to the JSON response; the engine,
in turn, uses Image from PIL to open the file it downloads. And, yes, for the time being this only works with .jpg images.
NOTE: You will not have PIL itself installed; this runs off of Pillow, which allows us to do the same thing. This is
because the PIL library was at one time forked and turned into Pillow. The community has strong opinions on this
matter. Consult Google for insight - and drama.
Let’s Test!
Run your app:
Shell
$ cd ../home/flask_server/
$ python app.py
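Then, from another terminal, you can POST an image URL to the endpoint with curl. This assumes _VERSION is set to 1 and that Flask is running on its default port, 5000; swap in a real .jpg URL: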
Shell
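$ curl -X POST http://localhost:5000/v1/ocr \
    -H "Content-Type: application/json" \
    -d '{"image_url": "http://example.com/some_image.jpg"}'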
Example
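The exact text you get back depends entirely on the image you send, but a successful response carries the recognized text in the output field, along these lines (illustrative values only):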
Shell
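{
  "output": "Some recognized text from your image..."
}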
Front-end
With the back-end API done along with the OCR Engine, we can now add a basic front-end to consume the API and add
the results to the DOM via AJAX and jQuery. Again, this is not covered by this tutorial, but you can grab the code from the
repository.
Test this out with some sample images:
1. OCR Sample #0
2. OCR Sample #1
3. OCR Sample #2
4. OCR Sample #3
5. OCR Sample #4
6. OCR Sample #5
Happy hacking!