Python Forum
Convert a PDF files to HTML files
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Convert a PDF files to HTML files
#1
Hi everyone,

I am a new user and I started to learn Python to do something in particulary.

I would like to convert PDF Files automatically to HTML files but I don't know how to do this.

I tried to use somes library but to be honest I have lot of difficult to understand and to be capable to reproduce the example.

So if someone can help me to convert thoose PDF Files in HTML document, it would be nice of you.

Thank you for you're helping.
Reply
#2
PDF is a great way to view data, but one of the most difficult for extracting data.
There are some packages that make the conversion easier (sometimes, depending on PDF format)
One such for PDF to html is pdftotree: https://pypi.org/project/pdftotree/

There are others (you will have to reasearch to find suitable match) https://pypi.org/search/?q=%22PDF+to+HTML%22&o=
Reply
#3
(Oct-20-2020, 10:08 AM)Larz60+ Wrote: PDF is a great way to view data, but one of the most difficult for extracting data.
There are some packages that make the conversion easier (sometimes, depending on PDF format)
One such for PDF to html is pdftotree: https://pypi.org/project/pdftotree/

There are others (you will have to reasearch to find suitable match) https://pypi.org/search/?q=%22PDF+to+HTML%22&o=

Thank you for your answer.

I am going to check this package.
I hope there will be a very detailed example because I am a beginner in python programming
Reply
#4
I'm sorry but I can't understand how it works.

I am trying to use this package but I don't know what I need to install. I am using Pycharm with Python 3.8
Reply
#5
Quote:I am trying to use this package but I don't know what I need to install
pip install pdftotree

documentation and examples: https://github.com/HazyResearch/pdftotree
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  How to export files from spyder to excel? kranklin 3 1,966 Aug-04-2025, 08:51 PM
Last Post: kranklin
  How to optimize the speed of processing large JSON files in Python without using too sophia2005 3 964 Aug-02-2025, 03:25 PM
Last Post: snippsat
  Correct/proper way to create save files snakes 0 1,000 Mar-11-2025, 06:58 PM
Last Post: snakes
  Merge htm files with shutil library (TypeError: 'module' object is not callable) Melcu54 7 4,943 Mar-09-2025, 04:25 PM
Last Post: Pedroski55
  how to download large files faster? kucingkembar 3 2,354 Feb-20-2025, 06:57 PM
Last Post: snippsat
  Inserting Python Buttons into KV Files edand19941 3 1,338 Feb-19-2025, 07:44 PM
Last Post: buran
Question [SOLVED] Right way to open files with different encodings? Winfried 3 10,291 Jan-18-2025, 02:19 PM
Last Post: Winfried
  Convert Json to table formathttps://python-forum.io/thread-38313.html python_student 3 18,705 Dec-05-2024, 04:32 PM
Last Post: Larz60+
  Applications config files / Best practices aecordoba 2 5,032 Oct-23-2024, 12:56 PM
Last Post: aecordoba
  Compare 2 files for duplicates and save the differences cubangt 2 1,798 Sep-12-2024, 03:55 PM
Last Post: cubangt

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020