04-13-2020, 01:06 PM   #1
sr1921
Junior Member
Posts: 1
Karma: 10
Join Date: Apr 2020
Device: Kindle, remarkable
Highlight problem with newlines in PDF files

When I highlight text in a PDF file, the line breaks seem to be ignored, so the word ending each line and the word starting the next line are recorded as a single word with no space between them.
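To make the symptom concrete, here is a minimal Python sketch. It is only my guess at the mechanism (lines joined with no separator), not KOReader's actual code; the `join_lines` helper and the sample text are made up for illustration.

```python
def join_lines(lines, separator):
    """Join the text of consecutive highlighted lines with the given separator."""
    return separator.join(line.strip() for line in lines)


# Two consecutive lines of a hypothetical highlighted passage.
highlighted = ["the word ending each", "line and the next"]

# Assumption: the exported note behaves as if no separator were used.
buggy = join_lines(highlighted, "")
# What I would expect instead: one space where the line break was.
expected = join_lines(highlighted, " ")

print(buggy)     # the word ending eachline and the next
print(expected)  # the word ending each line and the next
```

The exported highlights look like the first output, while copying and pasting from the PDF gives something like the second.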

The error is already present in the metadata.pdf.lua file inside the .sdr directory.
It also appears when exporting the notes with the Evernote plugin, whether you export as HTML or as a text file.

At first I thought that the problem could be the way the newline was encoded in the output file, but it seems that this is not the case (I cannot see any special character between the two words).
The problem is not related to the PDF formatting either, as copying and pasting from the PDF to a text file keeps the line breaks as expected.

To reproduce the problem: this seems to happen with any PDF file. As an example, you can use the one available at http://ceur-ws.org/Vol-1756/paper06.pdf. Highlight the first paragraph of the Introduction section and then export the notes.

The problem seems to occur under any viewing conditions (no zoom, no reflow, etc.).

This seems to be a bug, but does anyone have ideas for possible workarounds?