![]() |
#1 |
Junior Member
![]() Posts: 3
Karma: 10
Join Date: Mar 2022
Device: iPad
|
ANSI Format Copy/Paste for Indian Languages
Hello All,
I am writing an ePub book for Vedic chants. For generating the Vedic text, I am using the software Baraha (www.baraha.com). Once the material for a given chapter is done, I can export the text from Baraha Application into text file, RTF with Unicode encoding, ANSI encoding into RTF, HTML and image files. If I copy the Unicode for Telugu, Kannada or Tamil text, the text misses the Vedic ligature marks. The ANSI encoding is what works for the ligature marks as Unicode does not support Vedic markings in Telugu, Kannada or Tamil. It works only for Devanagari or Sanskrit text. But if I copy the ANSI encoded text from Baraha editor or from MS word/Word Pad, the text is all garbled up and doesn't make any sense. I know that some people have asked questions about Sanskrit text but this one is a different issue. Can we something about Sigil or PageEdit read the ANSI formatted text properly? In Sigil I have added the font file from Baraha software but still doesn't work. Please see the images from Baraha Application, Sigil and PageEdit. I have reached to Baraha developer regarding copying the ANSI formatted document but haven't heard anything yet. Please let me know what I can do in order for this to work. Baraha Application: MS word: PageEdit Window: Sigil Text Window: You will see that the Sigil text window shows weird text that is all jumbled up or garbled up. This is the issue I am trying to solve in order of the ePub to happen for me. Best, Sai Last edited by DiapDealer; 03-04-2022 at 12:54 PM. Reason: Change oversided images to thumbnails |
![]() |
![]() |
![]() |
#2 |
Sigil Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 8,759
Karma: 5706256
Join Date: Nov 2009
Device: many
|
In line images should be small according to Mobileread Posting guidelines. You have already attached them so please just reference them inline in your post.
Other than that, perhaps someone here that knows more about this can help. Alternatively since the developer of Calibre is from India, perhaps trying with the Calibre editor might bring faster results. |
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Sigil Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 8,759
Karma: 5706256
Join Date: Nov 2009
Device: many
|
Also, Can your Bahara editor export as html. If so you may have better luck trying that then trying copy and paste.
Furthermore, a search of the web found this: The Vedic extension characters defined in IS 13194:1991 Annex G—Extended Character Set for Vedic are now fully covered by the Unicode Standard. So you should be able to save your file as a utf-8 encoded html file and import it into Sigil and not lose anything. Last edited by KevinH; 03-03-2022 at 09:41 PM. |
![]() |
![]() |
![]() |
#4 |
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 441
Karma: 77256
Join Date: Sep 2011
Device: none
|
Lack of unicode ligatures for languages other than Sanskrit or character sets other than Devanagari, could it be merely support from Baraha? I’m not too familar with that particularity. Perhaps you could contact the author and ask to clarify, unless you’re clear, that Unicode export and support is for only certain scripts. That I can tell, Vedic marks used above do exist in Unicode.
|
![]() |
![]() |
![]() |
#5 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,727
Karma: 24031401
Join Date: Dec 2010
Device: Kindle PW2
|
@skrama87 According to the Baraha website, the app supports both ANSI and Unicode. Apparently, you've installed the ANSI fonts and are using the app in ANSI mode.
You'll need to install the Unicode version of these fonts and use Baraha in Unicode mode, because Sigil only supports Unicode texts. |
![]() |
![]() |
Advert | |
|
![]() |
#6 |
Junior Member
![]() Posts: 3
Karma: 10
Join Date: Mar 2022
Device: iPad
|
Thank you for the responses. This is what Baraha developer wrote to me about ANSI copy/paste.
- - - - Hi In Baraha ANSI encoding, Baraha uses many values in the (0-255) range which may not be supported by all applications. Most applications support ASCII (0-127) only. Hence when using modern programs, one has to use Unicode encoding. Unfortunately, Unicode fonts don't support Vedic characters yet! Regards Baraha Support |
![]() |
![]() |
![]() |
#7 | |
Junior Member
![]() Posts: 3
Karma: 10
Join Date: Mar 2022
Device: iPad
|
Quote:
|
|
![]() |
![]() |
![]() |
#8 |
Sigil Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 8,759
Karma: 5706256
Join Date: Nov 2009
Device: many
|
AFAIK, unicode fonts support the Vedic accents via a separate block of diacritic characters that can be composed over (on top) of characters to create what you want.
|
![]() |
![]() |
![]() |
#9 |
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 441
Karma: 77256
Join Date: Sep 2011
Device: none
|
Maybe author is mistaken. Possible some are missing though I'm not very familiar. Perhaps check Indology list. Rigveda as example:
http://www.detlef108.de/Rigveda.htm |
![]() |
![]() |
![]() |
#10 |
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 441
Karma: 77256
Join Date: Sep 2011
Device: none
|
If the issue is that few fonts themselves include Vedic accents, that is true. I've had issues but they display fine for me in Calibre reader and iOS Books.
Perhaps few if any free fonts are available. |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Copy PAste | Fizzyfi | Calibre | 2 | 04-10-2015 02:55 PM |
Copy and Paste | JDavid | Sigil | 4 | 08-23-2012 04:02 PM |
Copy Paste | giosa | Sony Reader Dev Corner | 0 | 03-24-2012 06:17 PM |