HTXT format??


kevincar
02-27-2010, 02:28 AM
Hi all-

I'm new to the world of ebook readers.
I just got a Hanvon 518 and can't find anything anywhere about the "HTXT" format -- Can someone throw me a clue?

I'm imagining it's a fast-paging format, because epub and PDF speed is ghastly.
It's not HTML or "hypertext" - it's some sort of packed text format that I can't seem to crack.

Any help is greatly appreciated. Especially in regards to tools for conversion of HTML/TXT to this format.

KC

GambitDeclined
03-07-2010, 12:43 AM
I'm in the same boat. I'm beginning to believe that htxt is exclusively Chinese.

anabelee
03-09-2010, 03:52 AM
I don't think it is Chinese. My sister has a Nintendo DS and reads books on it using htxt and other formats with dslibris (SourceForge).
I'm not sure if this can help.

kevincar
03-09-2010, 03:26 PM
Thanks for the link, anabelee -- but I don't have a Nintendo.


I think it's "Hanvon Text" format -- and Hanvon has not responded to my emails.

I found this link from hiboo:
http://www.hibnet.org/ti/hiblib/hibLib-enginedoc/node26.html

But this is prolly just for text screens on TI calculators.

And I think it's a *brief* and incomplete description of the format. At least it seems to fit what the file looks like, but I've found zilch on the Net for converters or any utilities.

KC

DaleDe
03-11-2010, 12:01 PM
I do not believe the Nintendo program really uses htxt anyway. The site says ePUB, which is XHTML, a very different thing. Interesting that you found something; however, I agree it is probably not the exact format that Hanvon uses. They also have an mtxt format, which is also undocumented. I suspect you are right that these are Chinese formats. I do notice that Hanvon lists htxt as one of the few formats that support notes, which likely means it is not an image format, since attaching a note to an image by selecting text would be more difficult.

anabelee
03-12-2010, 06:08 AM
Sorry for mistaking the formats, kevincar and dale.

mtcheung
03-27-2010, 05:22 AM
I am Chinese, and I do read Chinese in both styles.
How can I convert pdf or docx files to htxt?

I do not get useful information from the representative in Hong Kong.

Germantie

DaleDe
03-27-2010, 11:35 AM
Can you tell us anything about the HTXT format? Is it Chinese specific? If you bring it up in a text editor, do you see TXT or binary? If you see text, do you see some formatting info mixed with the TXT? In order to convert to that format we need to know what it looks like.

Dale

mothlight
04-09-2010, 11:38 PM
I'm wondering the same thing. It is a binary format. The Linux file command doesn't know anything about it and no programs associate themselves with it.

$ file winesburg.htxt
winesburg.htxt: data
$ cat winesburg.htxt
008108816827866<9d>Z<87>^\|^Q<93>^Yr<޼
.....
$ cat topics.htxt
4008108816827866<9e>|ϫ,Mi׵[:
......
and so on.

I assume it is some proprietary format for Hanvon (i.e. Hanvon-txt). Or maybe it is encoded with a wide character set. Still trying to work through it.

Annoyingly, my new reader came with hundreds of books on it, but the file names are only vaguely enlightening. I'd like to figure out a quick way to rename them by finding the author and title in the files.
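
For anyone who wants to look at the raw bytes rather than rely on cat, here is a minimal Python sketch. It assumes, based only on the output above, that a file begins with a run of ASCII digits followed by binary data. The function names and the sample bytes are made up for illustration and are not part of any Hanvon tool.

```python
from pathlib import Path


def split_header(data: bytes) -> tuple[str, bytes]:
    """Split off the run of ASCII digits at the start of the data."""
    end = 0
    while end < len(data) and 0x30 <= data[end] <= 0x39:
        end += 1
    return data[:end].decode("ascii"), data[end:]


def hex_preview(data: bytes, count: int = 16) -> str:
    """Render the first `count` bytes as space-separated hex pairs."""
    return " ".join(f"{b:02x}" for b in data[:count])


def describe(path: str) -> str:
    """Summarise one file: its leading digits and the bytes that follow."""
    digits, rest = split_header(Path(path).read_bytes())
    return f"{path}: prefix={digits!r} next={hex_preview(rest)}"


# Hypothetical sample modelled on the topics.htxt output above
# (<9e> in the cat output stands for the byte 0x9e).
sample = b"4008108816827866\x9e|"
print(split_header(sample)[0], hex_preview(split_header(sample)[1]))
```

Calling describe("winesburg.htxt") on a real file should make it easy to see whether every book shares the same leading digits.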

DaleDe
04-10-2010, 01:21 AM
Keep working on it. If you want to send me a copy of one or two books I will try and find out myself. PM me, I will be out for a few days so I may not be able to quickly get back to you.

Dale

mothlight
04-10-2010, 04:57 AM
I posted some files here
http://tinyurl.com/yeblage

I tried playing around with converting from different character set encodings, but I didn't come up with much.

Kerry.

DaleDe
04-10-2010, 11:03 AM
Thanks, I got them. Will look at them in a few days. Hopefully somehow we can get to the bottom of this. Have you tried contacting Hanvon directly? When you display the files on your computer, what language are they in?

Dale

mothlight
04-10-2010, 10:37 PM
Ahh, browsing through the manual, it appears that they are encrypted files, somehow tied to the device serial number. The translation is a little awkward, from page 18, http://bit.ly/bFJ4n8

The directory structure is
$ ls
Audio
English Book
HWDeviceID.log
hwreader
hwrecentfile.sys
Image
recorder
Simplified Chinese Book
Traditional Chinese Book

Half the books are in Chinese, but so far I haven't gotten any of those to actually display anything on the reader. The rest are in English (presumably taken from Gutenberg). There are more than enough easy-to-obtain things to read on Gutenberg, though, so it isn't a big deal.

Kerry.

DaleDe
04-10-2010, 10:59 PM
Were the ones you posted for download in English?

Dale

mothlight
04-11-2010, 04:59 AM
Yes, in English.

Like the winesburg.htxt should be pretty much exactly
http://www.gutenberg.org/files/416/416.txt
Although it has a lot of the introduction stripped out. It starts with

Winesburg, Ohio by
SHERWOOD ANDERSON

The writer, an old man with a white mustache, had some
difficulty in getting into bed. The windows of the
house in which he lived were high and he wanted to look
at the trees when he awoke in the morning. A carpenter
came to fix
....etc

DaleDe
04-11-2010, 10:41 AM
good, it helps to decode when you know what you are looking for :)

FCHYEH
05-01-2010, 08:45 PM
A Hanvon HTXT file is basically a TXT file with added security to protect the copyright.

mmq
05-22-2010, 10:46 PM
If it helps, I checked some of the files off my N516 (Hong Kong version) against the .htxt files that were posted above, and they are byte-for-byte identical, so it seems like these aren't really tied to the device serial number.
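
A byte-for-byte check like this can be repeated with a few lines of Python. This is only a sketch: the function name and the two directory arguments are hypothetical, and it assumes the same book has the same file name on both devices.

```python
import filecmp
from pathlib import Path


def compare_htxt_dirs(dir_a, dir_b):
    """Map each .htxt name found in both directories to True if the
    two copies are byte-for-byte identical, False otherwise."""
    a, b = Path(dir_a), Path(dir_b)
    results = {}
    for file_a in sorted(a.glob("*.htxt")):
        file_b = b / file_a.name
        if file_b.is_file():
            # shallow=False forces a comparison of the file contents
            # instead of trusting size and modification time.
            results[file_a.name] = filecmp.cmp(file_a, file_b, shallow=False)
    return results
```

If every value comes back True for files taken from two different readers, that supports the idea that the files are not keyed to an individual device.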

DaleDe
05-23-2010, 06:54 PM
Thanks, each time we find out something it adds to the knowledge about this elusive format. Too bad Hanvon is so cryptic about this format. I don't think it helps their cause.

Dale

Nathanael
07-25-2010, 11:37 AM
it appears that they are encrypted files, somehow tied to the device serial number..... The rest are in English

Were the English texts also in HTXT?

I live in Shanghai, and the Hanvons are everywhere. I bought a 510 last year, that came with over 800 HTXT files. All the books were in Chinese, so I just assumed it was the Chinese equivalent of plaintext (perhaps "Hanyu text").

But now I'm thinking perhaps the H stands for "Hanvon", and it's their own proprietary format. I just today bought a Sibrary G5 (another Chinese company), which also comes with its own proprietary format. Of all the readers I looked at, only the Hanvons seemed to show support for HTXT, lending credence to the theory that it's a Hanvon proprietary format.

I've tried reading the HTXTs as BigFive, GB and Unicode, and letting Firefox attempt to auto-detect, all with no luck. Nor do they seem to be in any sort of compressed format I can discover (ZIP, RAR, 7z, SIT...).
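
The same two checks can be scripted, roughly like this. It is a sketch only: the function names are made up, the signature table covers just the well-known ZIP, RAR, 7z and gzip magic bytes, and the encoding list is my guess at what "BigFive, GB and Unicode" maps to in Python codec names.

```python
# Leading bytes ("magic numbers") of a few common archive formats.
MAGIC = {
    b"PK\x03\x04": "ZIP",
    b"Rar!\x1a\x07": "RAR",
    b"7z\xbc\xaf\x27\x1c": "7z",
    b"\x1f\x8b": "gzip",
}


def identify_container(data: bytes):
    """Return the archive type whose signature starts the data, or None."""
    for signature, name in MAGIC.items():
        if data.startswith(signature):
            return name
    return None


def try_decodings(data: bytes, encodings=("big5", "gb18030", "utf-8", "utf-16")):
    """Try each encoding strictly; map its name to the text, or None on failure.

    A clean decode is not proof of anything: utf-16 in particular accepts
    most random byte strings, so the result still has to be read by eye.
    """
    results = {}
    for name in encodings:
        try:
            results[name] = data.decode(name)
        except UnicodeDecodeError:
            results[name] = None
    return results
```

Both functions take the raw bytes, so it is easy to retry them on a slice that skips the leading digit string.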

I've noticed every file starts with the same plaintext string; in my case it's "4008108816827866". However, it doesn't seem to be related to my device's key in HWDeviceID.log; mine is "device=OTIyMDEwOTA4MDAwNDU2Nw%3D%3D�Ru&key=DCF77B4AD9A44CBC586537747ED27001".
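
One thing that can be checked without knowing any encryption: the device value ends in "%3D%3D", which is the URL-escaped form of "==", the usual Base64 padding. A small sketch, assuming the value really is URL-escaped Base64 of ASCII text (the function name is made up):

```python
import base64
from urllib.parse import unquote


def decode_device_field(value: str) -> str:
    """URL-unescape, then Base64-decode, the value of the device= field."""
    return base64.b64decode(unquote(value)).decode("ascii")


device = "OTIyMDEwOTA4MDAwNDU2Nw%3D%3D"   # device= value from HWDeviceID.log
prefix = "4008108816827866"               # string at the start of each file

decoded = decode_device_field(device)
print(decoded, decoded == prefix)          # prints: 9220109080004567 False
```

So the device field decodes to a different 16-digit number than the file prefix, which fits the observation that the two don't seem to be directly related. Whether either number feeds into the encryption is still unknown.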

I know zilch about encryption so I don't know where to go from here.

--Nathanael