MobileRead Forums > E-Book Formats > ePub

Old 01-03-2013, 01:39 AM   #1
anandudapudi
Member
Posts: 10
Karma: 480352
Join Date: Jan 2013
Device: ADE, iPad
Regional language (Kannada) EPUB conversion

Hi all,

I would like to know how to get regional-language text copied exactly into an HTML or XHTML page.
I found two cases.
1. I copied web page content into an HTML page, saved as UTF-8 without a BOM. It looked like this:

ಪಾತ್ರೆಯಲ್ಲಿ ಎಣ್ಣೆ ಕಾದನಂತರ ಮೊದಲು ತೊಗರಿಬೇಳೆ ಹಾಕುವುದು. ಬೇಳೆ ಕೆಂಪಗಾಗುತ್ತಿದಂತೆ, ಮೆಣಸು,ಶುಂಠಿ,ಬೆಳ್ಳುಳ್ಳಿ,ಈರುಳ್ಳಿ ಒಂದರನಂತರ ಒಂದನ್ನು ಹಾಕಿ ಎಣ್ಣೆಯಲ್ಲಿ 3-4 ನಿಮಿಷ ಹುರಿಯಬೇಕು. ಈ ಮಿಶ್ರಣವನ್ನು ಮಿಕ್ಸರಿನಲ್ಲಿ ಹಾಕಿ,ಕಾಯಿತುರಿ,ಹುಣಸೆಹಣ್ಣು,ಉಪ್ಪಿನೊಂದಿಗೆ ಹದಕ್ಕೆ ಬೇಕಾಗುವಷ್ಟು ನೀರು (1 ಬಟ್ಟಲು) ಹಾಕಿ ನುಣ್ಣಗೆ ರುಬ್ಬಬೇಕು.ಕೊತ್ತಂಬರಿ/ಪುದೀನಾ ಸೊಪ್ಪು ಕೂಡ ರುಬ್ಬುವ ಮುಂಚೆ ಹಾಕಬಹುದು.

2. I copied text from a Kannada PDF, keeping the same encoding as above, but it looked like this:

~Kdg }}| ~tX~Kd[{ {Xgg }pm} v
~Kdg E. d uC} {p d Iyb yⰪ. {d}Q
U{ⷰ ~{ d. a{곪{ g ~Kd}Q A ~Kd{
I~곩gd"}p갩 e{d.

So I want to know why the second one did not display in Kannada. Is there a difference between static PDF content and dynamic Kannada web content?
I want the PDF content to look the same in the HTML page.

Please help me with this. Thanks in advance; any reply on this is highly appreciated.
Old 01-03-2013, 04:17 AM   #2
Jellby
frumious Bandersnatch
Posts: 5,994
Karma: 4346921
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
Quote:
Originally Posted by anandudapudi
2. I copied text from a Kannada PDF, keeping the same encoding as above, but it looked like this:

~Kdg }}| ~tX~Kd[{ {Xgg }pm} v
~Kdg E. d uC} {p d Iyb yⰪ. {d}Q
U{ⷰ ~{ d. a{곪{ g ~Kd}Q A ~Kd{
I~곩gd"}p갩 e{d.

So I want to know why the second one did not display in Kannada. Is there a difference between static PDF content and dynamic Kannada web content?
I want the PDF content to look the same in the HTML page.
It's probably because fonts in a PDF can have very exotic encodings. A PDF is a bit like a blackmail note made of cut-out letters: it simply records which squiggle goes where, and it doesn't care much about encoding as long as each squiggle looks like a letter.
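
To see this for yourself, a quick check is to count how many of the copied characters actually fall in the Unicode Kannada block (U+0C80 to U+0CFF). This is only a minimal sketch in Python; the function name `kannada_ratio` is made up for illustration, and the two samples are shortened from the ones in the first post.

```python
# Kannada occupies the Unicode block U+0C80..U+0CFF.
KANNADA_FIRST, KANNADA_LAST = 0x0C80, 0x0CFF


def kannada_ratio(text: str) -> float:
    """Return the fraction of non-whitespace characters in the Kannada block."""
    chars = [ch for ch in text if not ch.isspace()]
    if not chars:
        return 0.0
    hits = sum(KANNADA_FIRST <= ord(ch) <= KANNADA_LAST for ch in chars)
    return hits / len(chars)


web_sample = "ಪಾತ್ರೆಯಲ್ಲಿ ಎಣ್ಣೆ"   # copied from the web page (case 1)
pdf_sample = "~Kdg }}| ~tX~Kd[{"   # copied from the PDF (case 2)

print(kannada_ratio(web_sample))
print(kannada_ratio(pdf_sample))
```

The web text should score at or near 1, while the PDF text scores 0: what was copied from the PDF is a run of ASCII codes that only looks like Kannada when drawn with the PDF's own font.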
Old 01-03-2013, 05:17 AM   #3
anandudapudi
Thanks mate...!!! But...

Quote:
Originally Posted by Jellby
It's probably because fonts in a PDF can have very exotic encodings. A PDF is a bit like a blackmail note made of cut-out letters: it simply records which squiggle goes where, and it doesn't care much about encoding as long as each squiggle looks like a letter.
I agree with your points, but I want to know how I can copy text from a PDF into an HTML page exactly, without this encoding hassle. Please suggest any valid solution for this. Thanks in advance.
Old 01-03-2013, 11:45 AM   #4
DaleDe
Grand Sorcerer
Posts: 9,503
Karma: 4597184
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2
Quote:
Originally Posted by anandudapudi
I agree with your points, but I want to know how I can copy text from a PDF into an HTML page exactly, without this encoding hassle. Please suggest any valid solution for this. Thanks in advance.
To do what you asked above, you will need to make an image of the text and paste the image into the HTML. You cannot expect all the different formats to behave exactly the same, so there is always some encoding hassle.

Dale
Old 01-03-2013, 12:33 PM   #5
Jellby
Quote:
Originally Posted by anandudapudi
but I want to know how I can copy text from a PDF into an HTML page exactly, without this encoding hassle.
There's no solution guaranteed to work, other than OCR, because a PDF is more concerned with how it looks than with the underlying meaning. In some cases, if the PDF font uses a known or standard encoding, you may be able to copy and paste with the right settings, or do a conversion afterwards. You may be lucky and find that the PDF encoding is ISCII (in which case it should be possible to find a converter), but it could be some ad-hoc encoding used only in that particular document.
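
If the encoding does turn out to be a fixed font-specific one, the "conversion afterwards" could be a simple table lookup. Here is a rough sketch in Python, with the big caveat that the table below is completely invented for illustration: it is not the mapping of your PDF or of any real Kannada font, and the name `legacy_to_unicode` is hypothetical too. You would have to work out the real table by comparing the garbled text with the rendered page.

```python
# HYPOTHETICAL mapping, invented for illustration only. It does not describe
# any real Kannada font or the PDF discussed in this thread.
LEGACY_TO_UNICODE = {
    "~K": "\u0CAA",  # KANNADA LETTER PA
    "d": "\u0CBE",   # KANNADA VOWEL SIGN AA
    "~": "\u0C95",   # KANNADA LETTER KA
}


def legacy_to_unicode(text: str, table: dict = LEGACY_TO_UNICODE) -> str:
    """Replace legacy codes with Unicode, trying the longest keys first."""
    keys = sorted(table, key=len, reverse=True)
    out = []
    i = 0
    while i < len(text):
        for key in keys:
            if text.startswith(key, i):
                out.append(table[key])
                i += len(key)
                break
        else:
            # No entry for this character: keep it unchanged.
            out.append(text[i])
            i += 1
    return "".join(out)


print(legacy_to_unicode("~Kd ~"))
```

Longest keys are tried first so that a two-character code such as "~K" is not split into "~" followed by "K". A real converter would probably also have to reorder some vowel signs and rebuild conjuncts, which a plain lookup like this does not attempt.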

Last edited by Jellby; 01-03-2013 at 12:35 PM.
Old 01-04-2013, 12:40 AM   #6
anandudapudi
Thanks mate..!!

Quote:
Originally Posted by DaleDe
To do what you asked above, you will need to make an image of the text and paste the image into the HTML. You cannot expect all the different formats to behave exactly the same, so there is always some encoding hassle.

Dale
@Dale: your response is well accepted and understood. I will have to juggle with the encoding hassle; no qualms.
Old 01-04-2013, 12:44 AM   #7
anandudapudi
Thanks buddy..!!!

Quote:
Originally Posted by Jellby
There's no solution guaranteed to work, other than OCR, because a PDF is more concerned with how it looks than with the underlying meaning. In some cases, if the PDF font uses a known or standard encoding, you may be able to copy and paste with the right settings, or do a conversion afterwards. You may be lucky and find that the PDF encoding is ISCII (in which case it should be possible to find a converter), but it could be some ad-hoc encoding used only in that particular document.
@Jellby: your response is appreciated. I understand that I need to cope with the way the PDF generates its text or code; no qualms.