Old 05-27-2013, 09:20 AM   #1
donMaranzano
Enthusiast
Posts: 37
Karma: 1000
Join Date: May 2013
Device: Kindle 5
Creating dictionary

I have an English-Bulgarian dictionary database and want to create a dictionary for the Kindle 4. How can this be done? My database is formatted like this:

[ei]
I. 1. буквата A
from A to Z от начало до край от край до край, изцяло, напълно
2. A уч. оценка отличен
straight A ам. пълно отличие
3. A1 мор. първокласен (за плавателен съд), разг. първостепенен, превъзходен
4. A муз. ла
II. 1. indefinite article пред гласни и полугласни звуком (напр. a book, a union, a ewe, a yard, a one-volume book) (обикн. не се превежда)
2. един, една, едно (често не се превежда)
not A нито един/една/едно
not A word нито/ни (една) дума
3. за разпределяне на, за, в (често не се превежда)
he gets ... A day/week, etc. той получава по ... на ден/седмица и пр. /дневно/седмично и пр.
one pound A dozen (по) една лира дузината
four levs A kilo по четири лева килото
fifty lew A person по петдесет лева на човек
4. със собствени имена като, който прилича на
he thinks he is A Napoleon той се мисли, че е (като) Наполеон
5. (с all of) еднакъв, един и същ
they are all of A size те всички са еднакви/една и съща големина
6. известен, някакъв
in A measure в известна/някаква стелен
in A sense в известен смисъл
7. някакъв, някой си
A Mr B. has come някой си г-н Б. е дошъл
A man is asking to see you някакъв човек иска да те види
they were killed to A man те бяха избити до един
AARDVARK
['a:dva:k]
n зоол. вид юж. афр. мравояд
AARDWOLF
['a:dwulf]
n зоол. юж. афр. животно, подобно на хиена

What is the procedure? Thank you very much.
Old 05-27-2013, 09:50 AM   #2
theducks
Grand Sorcerer
Posts: 13,605
Karma: 5126946
Join Date: Aug 2009
Location: The (original) Silicon Valley, USA
Device: Galaxy Tab 2, Astak Pocket Pro, K4NT
don
Dictionaries (and Bibles) are huge documents and pose many problems. Sheer size is one. Indexing is another.

Tossing in a second language (and its character set) adds more.

You have a difficult project.

Looking quickly at the data you posted:
(outside of Calibre)
you need to create a script that wraps HTML (in your chosen visual format) around each field (line/paragraph or word).

All of that needs to be wrapped in the standard HTML boilerplate (<html><head></head><body>...</body></html>).
Keep in mind that individual file size will be an issue, so you need to break the dictionary into multiple sections.
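
A minimal sketch of the wrapping and splitting described above, in Python. The function names, the visual format (bold headword, one paragraph per line) and the entries-per-file limit are all assumptions made for illustration, not a Calibre or Kindle requirement. It only produces plain HTML; a Kindle lookup dictionary needs additional markup.

```python
from html import escape

# Hypothetical page template: the standard HTML boilerplate around a body.
PAGE = '<html><head><meta charset="utf-8"/></head><body>\n{body}\n</body></html>\n'


def wrap_entry(headword, lines):
    """Wrap one entry: bold headword, then one paragraph per source line."""
    parts = ["<p><b>{}</b></p>".format(escape(headword))]
    parts += ["<p>{}</p>".format(escape(line)) for line in lines]
    return "\n".join(parts)


def split_into_pages(entries, per_file=2000):
    """Return complete HTML documents, each holding at most per_file entries.

    per_file is an arbitrary illustrative limit, not a known Kindle constraint.
    """
    pages = []
    for start in range(0, len(entries), per_file):
        chunk = entries[start:start + per_file]
        body = "\n<hr/>\n".join(wrap_entry(head, lines) for head, lines in chunk)
        pages.append(PAGE.format(body=body))
    return pages


entries = [
    ("AARDVARK", ["['a:dva:k]", "n зоол. вид юж. афр. мравояд"]),
    ("AARDWOLF", ["['a:dwulf]", "n зоол. юж. афр. животно, подобно на хиена"]),
]
pages = split_into_pages(entries, per_file=1)
print(len(pages))  # → 2
```

Each string in `pages` can then be written to its own file, opened with `encoding="utf-8"` so the Cyrillic text survives.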
Old 05-31-2013, 03:02 AM   #3
donMaranzano
Could you show me just one example? Is there any way to batch-process these words?
Old 05-31-2013, 03:23 AM   #4
Doitsu
Wizard
Posts: 1,685
Karma: 4392001
Join Date: Dec 2010
Device: Kindle 3
AFAIK, you cannot create Kindle lookup dictionaries with Calibre. You'll need to use Mobipocket Creator or KindleGen instead, because Kindle dictionaries require a special non-standard .html format and special non-standard .opf entries.

Here's an example of a very simple English-German dictionary with inflections.

Code:
<html>
<body>

<idx:entry>
	<b><idx:orth>book
	<idx:infl>
		<idx:iform value="books"/>
	</idx:infl>
	</idx:orth> </b> 
	<i>Subst.</i> <br/>
	Buch (n)
</idx:entry>
<br/><br/>
<hr/>

<idx:entry>
	<b><idx:orth>go
	<idx:infl>
		<idx:iform value="goes"/>
		<idx:iform value="going"/>
		<idx:iform value="went"/>
		<idx:iform value="gone"/>
	</idx:infl>
	</idx:orth> </b> 
	<i>Verb</i> <br/>
	gehen
</idx:entry>
<br/><br/>

</body>
</html>
For more information, see the mobipocket.com web site. If your source data is tab-delimited, you can use the instructions on this web site to automatically generate the source files for your dictionary.
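
For batch processing, entries in the shape of the example above can be generated by a script instead of typed by hand. This is only a sketch: `idx_entry` is a hypothetical helper, and it reproduces just the tags shown in the example (it leaves out the part-of-speech line and does not add the surrounding <html> and <body> tags or the .opf file).

```python
from html import escape


def idx_entry(headword, definition, inflections=()):
    """Build one <idx:entry> block shaped like the hand-written example."""
    iforms = "".join(
        '\t\t<idx:iform value="{}"/>\n'.format(escape(form, quote=True))
        for form in inflections
    )
    # Only emit the <idx:infl> wrapper when there is at least one inflection.
    infl = "\t<idx:infl>\n{}\t</idx:infl>\n".format(iforms) if iforms else ""
    return (
        "<idx:entry>\n"
        "\t<b><idx:orth>{head}\n"
        "{infl}"
        "\t</idx:orth> </b>\n"
        "\t{definition}\n"
        "</idx:entry>\n"
    ).format(head=escape(headword), infl=infl, definition=escape(definition))


print(idx_entry("go", "gehen", ["goes", "going", "went", "gone"]))
```

Calling it once per row of the source data and joining the results gives the body of the dictionary file.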
Old 05-31-2013, 08:54 AM   #5
donMaranzano
Hi,
I get an error in the command window when I run the command "tab2opf.exe dict.exe":

Traceback (most recent call last):
File "tab2opf.py", line 240, in <module>
ValueError: need more than 1 value to unpack

Please tell me what I am doing wrong.

I did not find any suggestions on the Mobipocket site.
Old 05-31-2013, 10:10 AM   #6
Doitsu
Most likely your input file format is incorrect, i.e. it's not a tab-delimited file.
Note that each line must contain one headword followed by a tab character and the translation (the translation can be multiple words, but mustn't contain line breaks).
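
"need more than 1 value to unpack" is the message Python 2 gives when an assignment such as `a, b = line.split('\t')` receives only one value, which is what happens for a line with no tab in it. Whether tab2opf.py does exactly that at line 240 is an assumption. The sketch below (with a hypothetical helper name) lists the lines that break the one-headword, one-tab, one-translation rule so they can be fixed before running tab2opf.

```python
def find_bad_lines(lines):
    """Return (line_number, text) for each non-empty line that lacks a
    headword, a tab, or a definition."""
    bad = []
    for number, line in enumerate(lines, start=1):
        text = line.rstrip("\r\n")
        if not text:
            continue  # blank lines are ignored in this sketch
        head, sep, definition = text.partition("\t")
        if not sep or not head.strip() or not definition.strip():
            bad.append((number, text))
    return bad


sample = [
    "book\tBuch (n)\n",
    "go gehen\n",  # space instead of a tab
    "\n",
    "AARDVARK\tn зоол. вид юж. афр. мравояд\n",
]
print(find_bad_lines(sample))  # → [(2, 'go gehen')]
```

To check a real file, pass an open file object instead of the sample list, e.g. a file opened with `open("dict.txt", encoding="utf-8")` (the file name is a placeholder).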
The best way to prepare the input file is to use spreadsheet software, e.g. MS Excel or LibreOffice Calc. Just make sure that the first column contains the headword and the second column the definition(s) for that headword. When you're done, select both columns, copy the text into a text editor (e.g. Notepad) and save it as a UTF-8 Unicode file.
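
If the headword/definition pairs are already available to a script, the same two-column file can be written without a spreadsheet. This is a sketch, not part of tab2opf; `to_tab_delimited` is a hypothetical helper that collapses any tabs or line breaks inside a field into single spaces so that each entry stays on one line.

```python
def to_tab_delimited(rows):
    """Serialize (headword, definition) pairs as tab-delimited text,
    one entry per line."""
    lines = []
    for headword, definition in rows:
        # str.split() with no arguments splits on any run of whitespace,
        # so stray tabs and line breaks inside a field become single spaces.
        head = " ".join(headword.split())
        body = " ".join(definition.split())
        lines.append(head + "\t" + body + "\n")
    return "".join(lines)


rows = [("book", "Buch (n)"), ("go", "gehen\nfahren")]
text = to_tab_delimited(rows)
print(text)
```

Writing `text` to a file opened with `open(path, "w", encoding="utf-8")` gives the UTF-8 file described above.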
Depending on where you got your source file from, you might also be able to use PyGlossary to convert it to a tab-delimited file, if it's formatted in a way that PyGlossary supports. Of course, you'll also need Python.

If your source file is unstructured, you probably won't be able to convert it to a format that tab2opf can handle unless you know your way around an editor with regular expression support.
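
As an alternative to an editor, a short Python script using regular expressions can do the same restructuring. The sketch below rests on one assumption about the data posted at the top of the thread: a headword is any line immediately followed by a line that consists only of a bracketed pronunciation, and everything up to the next headword belongs to that entry. Real data may need different rules. Text before the first recognizable headword is skipped, which includes the first entry of the posted sample because it starts at the pronunciation line.

```python
import re

# A line that is nothing but a bracketed pronunciation, e.g. ['a:dva:k]
PRON = re.compile(r"^\[.*\]$")


def parse_entries(raw_lines):
    """Group raw lines into (headword, pronunciation, definition_lines)."""
    lines = [ln.strip() for ln in raw_lines if ln.strip()]
    entries = []
    i = 0
    while i < len(lines):
        if i + 1 < len(lines) and PRON.match(lines[i + 1]):
            # Assumed rule: a line followed by a pronunciation is a headword.
            entries.append((lines[i], lines[i + 1], []))
            i += 2
        elif entries:
            entries[-1][2].append(lines[i])
            i += 1
        else:
            i += 1  # nothing to attach this line to yet
    return entries


def to_tab_line(entry):
    """Flatten one entry into 'headword<TAB>pronunciation definitions'."""
    headword, pron, definitions = entry
    return "{}\t{} {}".format(headword, pron, "; ".join(definitions))


raw = """AARDVARK
['a:dva:k]
n зоол. вид юж. афр. мравояд
AARDWOLF
['a:dwulf]
n зоол. юж. афр. животно, подобно на хиена
""".splitlines()

for entry in parse_entries(raw):
    print(to_tab_line(entry))
```

Joining multi-line definitions with "; " is an arbitrary choice; the only constraint it serves is that the translation stays on a single line.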

You could also try to find an English-Bulgarian BGL file and convert it following the instructions on this page.
Old 05-31-2013, 04:09 PM   #7
donMaranzano
Thank you, I have already made one using this site. I understand how it's made, but it is a very hard job to do.