gpdftext

gPDFText is a GTK+ text editor that opens PDF documents intended for ebook readers, converts the text content into plain ASCII text, restores the original paragraphs and removes unwanted line breaks so that the text can be zoomed more easily on the reader.

Many PDF files downloaded for ebook readers still use the A4 paper size (or Letter, which is similar in size). When such a PDF is displayed on the ebook reader, the zoom level needed to fit the entire page makes the text too small. Simply exporting the PDF as text causes problems with line wrapping, and the various ways that ebook PDFs indicate page headers and footers make the conversion hard to automate.

gPDFText loads the PDF, extracts the text, reformats each paragraph into a single long line and then puts the text into a standard GTK+ editor where you can make other adjustments. Books can then be saved as a PDF at the paper size configured in the gPDFText preferences: A5, B5 or the original A4.

On the ebook reader, the plain text file then has no unwanted line breaks and can be zoomed to whatever text size you prefer.
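
The paragraph reformatting step described above can be illustrated with a short Python sketch. This is not gPDFText's own code; the function name reflow_paragraphs and the rules it applies (blank lines separate paragraphs, a trailing hyphen followed by a lowercase letter marks a word split across lines) are assumptions made for the example, and real PDFs usually need further handling for page headers and footers.

```python
import re


def reflow_paragraphs(text: str) -> str:
    """Join hard-wrapped lines so that each paragraph becomes one long line.

    Illustrative heuristic only (hypothetical helper, not part of gPDFText):
    paragraphs are assumed to be separated by at least one blank line.
    """
    # Normalise line endings so only "\n" has to be handled below.
    text = text.replace("\r\n", "\n").replace("\r", "\n")

    # Split on blank (or whitespace-only) lines to find paragraph boundaries.
    paragraphs = re.split(r"\n\s*\n", text.strip())

    result = []
    for para in paragraphs:
        joined = ""
        for line in para.split("\n"):
            line = line.strip()
            if not line:
                continue
            if joined.endswith("-") and line[:1].islower():
                # Assume the word was hyphenated across the line break and
                # rejoin it. This also merges genuine compounds such as
                # "well-" / "known", which is a known limit of the heuristic.
                joined = joined[:-1] + line
            elif joined:
                joined += " " + line
            else:
                joined = line
        if joined:
            result.append(joined)

    # Keep one blank line between the reflowed paragraphs.
    return "\n\n".join(result)


if __name__ == "__main__":
    sample = "This is a para-\ngraph that was\nwrapped.\n\nSecond one\nhere."
    print(reflow_paragraphs(sample))
```

Run on text exported from a PDF, a function like this leaves one long line per paragraph, which the reader can then wrap at whatever zoom level is chosen.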

Downloads

Future additions

Support for other ebook formats may be possible, depending on whether free software exists to read the format in the first place.

gPDFText uses Subversion at SourceForge: http://gpdftext.svn.sourceforge.net/viewvc/gpdftext/

Bugs should be filed using the Trac bugtracker at SourceForge.

There is also a Wiki and mailing list.

gPDFText is now in Debian.