08-10-2018, 07:45 AM   #5
Pulpmeister
Wizard
Pulpmeister ought to be getting tired of karma fortunes by now.
Posts: 2,831
Karma: 29145056
Join Date: Nov 2011
Location: Perth Western Australia
Device: kindle
The sources for the stories in both volumes are Australian newspapers held online by the National Library of Australia, through the portal known as Trove, which has online OCR text files. Each story is from a different newspaper or issue. It's odd, looking at the HTML, that only the second 50 stories have the RUS span. I made the original version as two separate 50-story volumes, then decided to do a single one. It was obviously in the second, but...

My route is: OCR TXT file > RTF in Word > Word HTML conversion > Kompozer to create links, anchors etc. > Calibre to create the books. Not ideal, but it's the only one I can handle easily.

I can't see how RUS can creep in. I checked my language and proofing stuff in Office and it's all English (Australian). I don't think the RUS code does any harm; I've not noticed anything when reading; but it is puzzling. I'll dig through some other HTMLs and see if it's in them too.
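
For digging through the other HTMLs, something along these lines might save some eyeballing. It is only a rough sketch: it assumes the "RUS span" shows up as a lang attribute on a tag (something like lang=RU, which is how Word's HTML export usually marks language), and the names count_langs and scan_folder are made up for the example. It tallies every lang value per file, so a stray Russian tag would stand out next to the expected EN-AU.

```python
from collections import Counter
from html.parser import HTMLParser
from pathlib import Path


class LangCounter(HTMLParser):
    """Tally every lang / xml:lang attribute value seen in a document."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        for name, value in attrs:
            if name in ("lang", "xml:lang") and value:
                # Uppercase the value so "ru" and "RU" are counted together.
                self.counts[(tag, value.upper())] += 1


def count_langs(html_text):
    """Return a Counter keyed by (tag, LANG) for one HTML string."""
    parser = LangCounter()
    parser.feed(html_text)
    parser.close()
    return parser.counts


def scan_folder(folder):
    """Map each .htm/.html file name in a folder to its lang tally."""
    results = {}
    for path in sorted(Path(folder).glob("*.htm*")):
        # Assumption: decoding as UTF-8 with replacement is good enough here,
        # since only the ASCII lang attributes matter for this check.
        text = path.read_text(encoding="utf-8", errors="replace")
        results[path.name] = count_langs(text)
    return results
```

Calling scan_folder on the folder holding the exported HTML files and printing the result would show, file by file, which tags carry which language code. If the RUS marker turns out to be something other than a lang attribute (a class name, say), the check in handle_starttag would need adjusting.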

I blame Putin.