Page layout in Python

342 views

The first step in grabbing information from a pdf file is to translate it into text format with pdftotext -layout command.

Is it available any specific python tool or library to describe the layout of a page with ascii characters and to help in identifying and extracting the useful pieces of information? For example a function allowing to select N characters at line I starting from column Y.

If a such tool is not available, what is in your mind the best structure to describe in python a two dimensions page layout?

posted Jul 25, 2014 by Gurminder

Looking for an answer? Promote on:

Similar Questions

+2 votes

How to visit the page hoisted by python-django installed On virtual box.

I have python-django installed on ubuntu installed on top of virtual Box. Also I have apache2 installed in the same way. When i visit the page hoisted by Apache2 from my Windows Host machine it worked.
But When i am doing the same for Django pages it's not opening up in Web browser from Host machine.
In the virtual box I am using bridged adapter and Eth0.

Please Help me to sort out this problem

0 votes

Is there any way to export data from Django Admin page ?

In my Django application, I can handle my data in the admin page. Now, I like to export the admin page data into some formats like Excel, CSV or alteast JSON dump.

Is it possible? Anyone can clarify, please?

+1 vote

Telnet to remote system and format output via web page

I would like to create a web app using flask or cgi library (python) along with telnetlib to telnet to specific servers and execute commands and retrieve the output. The output will then be formatted and outputted to a webpage .

Is this the best way of getting info from a remote system to be output to a web page? Is flask over kill for project like this ?

0 votes

What is difference between session vs Http Client in Python ?

...

Page layout in Python

Your comment on this post:

Your answer

Preview