vertopal.com_2-Working-with-PDFs

The document provides an overview of working with PDF files in Python, specifically using the PyPDF2 library. It covers installation, reading text from PDFs, and limitations regarding image extraction and writing to PDFs. The document also includes examples of how to read and append pages from PDF files using PyPDF2.

Uploaded by

MuHaMMad SHouKaT

Working with PDF Files

Welcome back Agent. Often you will have to deal with PDF files. There are many libraries in
Python for working with PDFs, each with its own pros and cons; the most common one is
PyPDF2. You can install it with the command below (pip itself does not care about case in package
names, but Python's import statement does, so keep the capitalization as PyPDF2):

pip install PyPDF2

Keep in mind that not every PDF file can be read with this library. PDFs that are scanned or too
blurry, use a special encoding, are encrypted, or were created with a program that doesn't work
well with PyPDF2 may not be readable. If you find yourself in this situation, try one of the other
Python PDF libraries, but keep in mind that these may not work either. The reason is that a PDF
has many parameters and its settings can be quite non-standard; for example, text may be stored
as an image rather than as encoded characters, in which case there is no text to extract.
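One way to spot such files before relying on their text is sketched below. It assumes that a page which yields an empty string from extract_text() has no usable text (for example, a scanned image), and it uses the is_encrypted property of PdfReader. The helper names pages_without_text and describe_pdf are hypothetical and are not part of PyPDF2.

```python
def pages_without_text(reader):
    """Return the indices of pages whose extracted text is empty or only whitespace."""
    empty = []
    for index, page in enumerate(reader.pages):
        text = page.extract_text() or ""
        if not text.strip():
            empty.append(index)
    return empty


def describe_pdf(path):
    """Report whether a PDF is encrypted and which pages yield no text (requires PyPDF2)."""
    import PyPDF2  # imported lazily so pages_without_text works on its own

    with open(path, "rb") as f:
        reader = PyPDF2.PdfReader(f)
        if reader.is_encrypted:
            return "encrypted: text may not be readable without the password"
        return f"pages with no extractable text: {pages_without_text(reader)}"
```

Calling describe_pdf('Working_Business_Proposal.pdf') would return a one-line summary; an empty list means every page produced some text, though it does not guarantee that the text is accurate.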

As far as this lesson is concerned, we will use PyPDF2 only to read the text from a PDF document;
we won't be grabbing images or other media files from a PDF.

Working with PyPDF2

Let's begin by showing the basics of the PyPDF2 library.

!pip install PyPDF2

Collecting PyPDF2
Downloading pypdf2-3.0.1-py3-none-any.whl (232 kB)
------------------------------------ 232.6/232.6 kB 490.8 kB/s
eta 0:00:00
Requirement already satisfied: typing_extensions>=3.10.0.0 in c:\
users\jmpor\anaconda3\lib\site-packages (from PyPDF2) (4.3.0)
Installing collected packages: PyPDF2
Successfully installed PyPDF2-3.0.1

# note the capitalization

import PyPDF2

Reading PDFs
Similar to the csv library, we open a PDF, then create a reader object for it. Notice how we use the
binary read mode, 'rb', instead of just 'r'.

# Notice we read it as a binary with 'rb'

f = open('Working_Business_Proposal.pdf','rb')
pdf_reader = PyPDF2.PdfReader(f)

len(pdf_reader.pages)

page_number = 0
page_one = pdf_reader.pages[page_number]

We can then extract the text:

page_one_text = page_one.extract_text()

page_one_text

'Business Proposal The Revolution is Coming Leverage agile frameworks

f.close()
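The same read can also be written with a with block, which closes the file even if an error occurs, so no explicit f.close() is needed. This is a minimal sketch: the helper name read_page_text and its reader_cls parameter are hypothetical, added only so the function can be tried with a stand-in reader; by default it uses PyPDF2.PdfReader exactly as above.

```python
def read_page_text(path, page_index=0, reader_cls=None):
    """Return the text of one page of the PDF at path."""
    if reader_cls is None:
        # Default to the real PyPDF2 reader; imported here so the import
        # only happens when it is actually needed.
        from PyPDF2 import PdfReader as reader_cls

    # 'rb' opens the file in binary mode, as in the cells above.
    with open(path, "rb") as f:
        reader = reader_cls(f)
        # Extract the text while the file is still open.
        return reader.pages[page_index].extract_text()
```

For example, read_page_text('Working_Business_Proposal.pdf') would return the same string as page_one.extract_text() above.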

Adding to PDFs
We cannot easily write new text into a PDF with PyPDF2 because of the differences between the
single string type of Python and the variety of fonts, placements, and other parameters that a PDF
could have.

What we can do is copy pages and append pages to the end.

f = open('Working_Business_Proposal.pdf','rb')
pdf_reader = PyPDF2.PdfReader(f)

page_number = 0
page_one = pdf_reader.pages[page_number]

pdf_writer = PyPDF2.PdfWriter()

pdf_writer.add_page(page_one);

pdf_output = open("Some_New_Doc.pdf","wb")

pdf_writer.write(pdf_output)

(False, <_io.BufferedWriter name='Some_New_Doc.pdf'>)

# Close the output file too, so the new PDF is fully written to disk
pdf_output.close()
f.close()

Now we have copied a page and added it to another new document!
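The copy step generalizes to any selection of pages. The sketch below is one possible way to wrap it, using only the reader.pages, add_page, and write calls shown above; the function names copy_pages and save_first_page are hypothetical. The with block closes both the source and the output file automatically.

```python
def copy_pages(reader, writer, page_indices):
    """Append the chosen pages of reader to writer, in the order given."""
    for index in page_indices:
        writer.add_page(reader.pages[index])
    return writer


def save_first_page(src_path, dst_path):
    """Copy page 0 of src_path into a new PDF at dst_path (requires PyPDF2)."""
    import PyPDF2  # imported lazily so copy_pages has no hard dependency

    # Both files are closed when the with block ends.
    with open(src_path, "rb") as src, open(dst_path, "wb") as dst:
        reader = PyPDF2.PdfReader(src)
        writer = copy_pages(reader, PyPDF2.PdfWriter(), [0])
        writer.write(dst)
```

For example, save_first_page('Working_Business_Proposal.pdf', 'Some_New_Doc.pdf') would reproduce the cells above, and passing [0, 2] to copy_pages would append the first and third pages instead.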

Simple Example
Let's try to grab all the text from this PDF file:

f = open('Working_Business_Proposal.pdf','rb')

# List of every page's text.

# The index will correspond to the page number.
pdf_text = []

pdf_reader = PyPDF2.PdfReader(f)

for p in range(len(pdf_reader.pages)):
    # Index with p so that each iteration reads a different page
    page = pdf_reader.pages[p]
    pdf_text.append(page.extract_text())

f.close()

pdf_text

['Business Proposal The Revolution is Coming Leverage agile frameworks
to provide a robust synopsis for high level overviews. Iterative
approaches to corporate strategy foster collaborative thinking to
further the overall value proposition. Organically grow the holistic
world view of disruptive innovation via workplace diversity and
empowerment. Bring to the table win-win survival strategies to ensure
proactive domination. At the end of the day, going forward, a new
normal that has evolved from generation X is on the runway heading
towards a streamlined cloud solution. User generated content in real-
time will have multiple touchpoints for offshoring. Capitalize on low
hanging fruit to identify a ballpark value added activity to beta
test. Override the digital divide with additional clickthroughs from
DevOps. Nanotechnology immersion along the information highway will
close the loop on focusing solely on the bottom line. Podcasting
operational change management inside of workflows to establish a
framework. Taking seamless key performance indicators offline to
maximise the long tail. Keeping your eye on the ball while performing
a deep dive on the start-up mentality to derive convergence on cross-
platform integration. Collaboratively administrate empowered markets
via plug-and-play networks. Dynamically procrastinate B2C users after
installed base benefits. Dramatically visualize customer directed
convergence without revolutionary ROI. Efficiently unleash cross-media
information without cross-media value. Quickly maximize timely
deliverables for real-time schemas. Dramatically maintain clicks-and-
mortar solutions without functional solutions. BUSINESS PROPOSAL!1',
 ...]

# Index 3 holds the text of the fourth page
print(pdf_text[3])

Excellent work! That is all for PyPDF2 for now. Remember that this won't work with every PDF
file, and that we have only covered working with the text of PDFs.
