MobileRead Forums

MobileRead Forums (https://www.mobileread.com/forums/index.php)
-   Workshop (https://www.mobileread.com/forums/forumdisplay.php?f=178)
-   -   How to create your own mobipocket dictionary for any language: (https://www.mobileread.com/forums/showthread.php?t=20480)

Teom@n 01-10-2017 01:19 PM

Quote:

Originally Posted by Doitsu (Post 3456100)
That depends on your technical skills. If your MS Excel spreadsheet contains only two columns you could save it as a tab-delimited text file and process it with tab2opf.py.
If you're familiar with regular expressions, you could also convert the tab-delimited text file directly using a couple of regular expressions.

I have a computer skills so I will try to do my best.

Thanks.

Teom@n 01-10-2017 03:10 PM

Quote:

Originally Posted by Doitsu (Post 3456100)
That depends on your technical skills. If your MS Excel spreadsheet contains only two columns you could save it as a tab-delimited text file and process it with tab2opf.py.
If you're familiar with regular expressions, you could also convert the tab-delimited text file directly using a couple of regular expressions.

Do you know any script(python, etc.) for converting huge html files to other formats?

Doitsu 01-10-2017 03:14 PM

Quote:

Originally Posted by Teom@n (Post 3456176)
Do you know any script(python, etc.) for converting huge html files to other formats?

I usually use the Python bs4 library for HTML parsing.

Teom@n 01-10-2017 04:50 PM

Quote:

Originally Posted by Doitsu (Post 3456181)
I usually use the Python bs4 library for HTML parsing.

Ok, I will try it. Thanks.

Teom@n 01-22-2017 03:37 PM

Quote:

Originally Posted by Doitsu (Post 3456181)
I usually use the Python bs4 library for HTML parsing.

hi,

could you share the script/commands that you were using for converting html to text? I have a html file(124mb) which I took it from a mobi file. I installed the python and beautifulsoap.

could you guide me?

thanks

Doitsu 01-22-2017 06:26 PM

Quote:

Originally Posted by Teom@n (Post 3462662)
could you share the script/commands that you were using for converting html to text? I have a html file(124mb) which I took it from a mobi file. I installed the python and beautifulsoap.

Unfortunately, I can't help you with that, because scripts and commands will vary depending on the exact input and output formats. However, BS4 is well documented. For example:

Code:

soup.get_text()
will strip all tags from an HTML file.

If you're not a Python programmer, you could also use a text editor with regular expressions support, e.g. Notepad++, to remove unwanted tags or convert them to a different format.

Teom@n 01-23-2017 05:51 AM

Quote:

Originally Posted by Doitsu (Post 3462719)
Unfortunately, I can't help you with that, because scripts and commands will vary depending on the exact input and output formats. However, BS4 is well documented. For example:

Code:

soup.get_text()
will strip all tags from an HTML file.

If you're not a Python programmer, you could also use a text editor with regular expressions support, e.g. Notepad++, to remove unwanted tags or convert them to a different format.

Thanks mate.

sayonaroo 03-03-2019 10:06 PM

Could someone make one for me? I cannot figure out how to use python

I wanted to make a kindle dictionary out of the korean-Japanese naver dictionary

http://download.huzheng.org/babylon/japanese/

It says

You can convert any babylon .bgl file to StarDict format by the PyGlossary tool, just visit: http://pyglossary.sourceforge.net

Phionz Paranto 10-17-2019 11:05 PM

Please the update.
 
Quote:

Originally Posted by thomass (Post 1863532)
why don't you try one to see. ;)

BTW: attached an Indonesian – English dict with ~22300 words (stardict-quick)

I have downloaded and tried this dictionary to see if it works well. I found that this dictionary is very simple and basic for Indonesian-English. Do you have the most updated one? It's 2019 now. Thank you very much.
:help:
:iloveyou:

DNSB 10-18-2019 12:12 AM

Quote:

Originally Posted by Phionz Paranto (Post 3904899)
I have downloaded and tried this dictionary to see if it works well. I found that this dictionary is very simple and basic for Indonesian-English. Do you have the most updated one? It's 2019 now. Thank you very much.

The last time thomass visited Mobileread was a year ago. And yes, it's been 8 years since that dictionary was posted. The advice on how to create your own dictionary is still valid so you may want to apply it. Or if your time is worth more, you might purchase an Indonesian/English dictionary from Amazon. Check Kindle Default Dictionaries Indonesian for example.

markvdvelde 03-24-2020 04:48 PM

Croatian to English
 
Quote:

Originally Posted by thomass (Post 1564552)
English-Croatian dict:

This is a nice dictionary, but do you have it the other way around, from Croatian to English? When ever I search for Croatian-English dictionaries, I end up trying lots unknown and complicated (at least for me) software that has to run with commands etc. This is the first EN-HR dictionary that works out of the box. I would very happy with HR-EN too.


All times are GMT -4. The time now is 11:09 PM.

Powered by: vBulletin
Copyright ©2000 - 3.8.5, Jelsoft Enterprises Ltd.
MobileRead.com is a privately owned, operated and funded community.