|  08-31-2009, 09:27 AM | #1 | 
| Guru            Posts: 974 Karma: 4999999 Join Date: Mar 2009 Location: Rosario, Argentina Device: SONY PRS-T2, Kindle Paperwhite 11th gen | 
				
				Programming language for development
			 
			
			I have some experience in computer programming in several languages (ANSI C/C++, Borland Delphi/Kylix, a little Java and Python in console applications, to name some). I would like to start developing platform independent tools for ebook processing and formatting, but have not decided what language to use. Learning a new language/development environment is no problem, in fact I have to study a lot before I can start to do something worth sharing. So I thought maybe some of the great developers here can give me some advise. I have access to Windows and Linux boxes and have used VMWare Server and Virtual PC. Thanks in advance. Pablo   | 
|   |   | 
|  08-31-2009, 09:29 AM | #2 | 
| eBook Enthusiast            Posts: 85,559 Karma: 93980341 Join Date: Nov 2006 Location: UK Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6 | 
			
			Python seems to be the most popular choice.
		 | 
|   |   | 
|  08-31-2009, 09:40 AM | #3 | 
| Comparer of the Ephemeris            Posts: 1,496 Karma: 424697 Join Date: Mar 2009 Device: iPad | 
			
			I'll second Harry's suggestion of Python.  Python is free, quick, portable, interactive and multi-platform.  Calibre is written in Python, making it possible to write your own plugins or modifications. G | 
|   |   | 
|  08-31-2009, 10:02 AM | #4 | 
| Addict            Posts: 202 Karma: 4379 Join Date: May 2009 Location: Italy Device: Hanlin V3 (with lBook firmware & OpenInkPot) | 
			
			Since you already know C++, you can surely use it: C++ is an "immortal" language that always does its work, and its flexibility is unparalleled. You could also use Java: it is surely platform-independent and has an incredible amount of developement tools (both free and non-free) available, tons of APIs, easily available libraries, and so on. It also has a very wide and stable user base, which is always useful. You could also try C#, if you develop using Mono and GTK+ (instead of winforms) you end up with a platform-independent program which can be run on Windows, Linux and Mac (that's what I use when I need cross-platform programs). The downside is, the opensource community looks with suspect at Mono and C#, because the fear is that it would take a moment for Microsoft to change its claims on C# and the .NET framework and ask to pay licenses. | 
|   |   | 
|  08-31-2009, 10:21 AM | #5 | 
| creator of calibre            Posts: 45,594 Karma: 28548962 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			Python all the way. Highest productivity language I've ever found and I know over twenty    | 
|   |   | 
|  08-31-2009, 12:28 PM | #6 | 
| Lector minore            Posts: 660 Karma: 1738720 Join Date: Jan 2008 Device: Aura One, Paperwhite Signature | |
|   |   | 
|  08-31-2009, 12:33 PM | #7 | 
| eBook Enthusiast            Posts: 85,559 Karma: 93980341 Join Date: Nov 2006 Location: UK Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6 | 
			
			Undoubtedly - it would probably be 10x faster if it were written in C++. However, I don't know about you, but for me Calibre is "fast enough". If it takes 2 minutes to convert a book from Mobi to ePub for me I'm not really that bothered.
		 | 
|   |   | 
|  08-31-2009, 01:39 PM | #8 | |
| Wizard            Posts: 1,790 Karma: 507333 Join Date: May 2009 Device: none | Quote: 
 The most obvious one which in the past turned python programs of mine that should have processed under two minutes to take literally hours is the "do not build strings directly" or "do not build strings one character at a time" part. Code: output = ''
for tmpChar in longtext:
     outChar = tmpChar
     # some conditional processing of outChar here
     output += outCharCode: outputList = []
output = ''
for tmpChar in longtext:
     outChar = tmpChar
     # some conditional processing of outChar here
     outputList.append(outChar)
output = ''.join(outputList)Admittedly, I doubt this is news to Kovid... but I imagine there are more than one such pitfalls in Python where the obvious way is suboptimal for processing-heavy code. Might something as such act as a bottle-neck in some of your conversion scripts, Kovid? - Ahi | |
|   |   | 
|  08-31-2009, 03:02 PM | #9 | 
| creator of calibre            Posts: 45,594 Karma: 28548962 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			Ah everyone's favorite debate. If you want to get into it seriously, I suggest being more precise. Whey you say calibre is "slow" waht do you mean? 1) Some people complain that the GUI is slow. Other people say taht the GUI is fast enough. 2) Some people claim it is slow with large databases, other people say it is fast enough. 3) Some people say it is slow with conversion, other people... 4) Some people say it is slow with news download, ... 5) Startup times are slow Now as the person that knows the most about this (and yes, ahi I am well aware of those and lots of other pitfalls in string processing), let me shed some light on easch of these problems 1) Almost all the actual GUI rendering happens in compiled (C++) code 2) Calibre actually stores all the metadata in memory, so this is not a database/python/string processing problem. Though I do admit that the performance here can be optimized. In fact, that's on my agenda as aprt of database independence refactoring for 0.7 3) The biggest bottleneck in conversion is parsing of CSS. calibre uses a CSS library that uses regular expression to parse CSS (the regular expression are again in compiled code). This library's performance can be improved, but I am not going to sit down and re-write it. 4) This is slow because the browser simulation library that calibre uses is not multi-threaded, so all actual downloads happen in a single thread. Until I get around to fixing that library,... 5) This is actually one place where using python hurts. But as far as I am concerned, it's worth it. | 
|   |   | 
|  08-31-2009, 03:16 PM | #10 | |
| Wizard            Posts: 1,790 Karma: 507333 Join Date: May 2009 Device: none | Quote: 
 I could be wrong... perhaps I'll write down some specific filesize/processing type/length of time data later this week... it seems to me I can quite heavily process several (5+, 10+) MB RTF files with Python in the same amount of time it takes to convert an under 2 MB file from one reflow format to another... which is basically an HTML-ish to HTML-ish type conversion. No? If you are aware of the various pitfalls, doubtless there is good reason why things take as long as they do... but it always felt oddly (though not disruptively) slow to me. - Ahi | |
|   |   | 
|  08-31-2009, 03:18 PM | #11 | 
| creator of calibre            Posts: 45,594 Karma: 28548962 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			A good point about RTF, I forgot to mention that. RTF conversion is slow in calibre, but that's largely because of a rather badly designed RTF processing library calibre uses. I've always been meaning to swap it out for another, but never get around it. Any volunteers    | 
|   |   | 
|  08-31-2009, 05:38 PM | #12 | 
| Guru            Posts: 974 Karma: 4999999 Join Date: Mar 2009 Location: Rosario, Argentina Device: SONY PRS-T2, Kindle Paperwhite 11th gen | |
|   |   | 
|  08-31-2009, 06:42 PM | #13 | 
| Wizard            Posts: 3,442 Karma: 300001 Join Date: Sep 2006 Location: Belgium Device: PRS-500/505/700, Kindle, Cybook Gen3, Words Gear | 
			
			Kovid, have you done any profiling of Calibre? From my experience bottlenecks often happen in places where you least expect them...
		 | 
|   |   | 
|  08-31-2009, 10:12 PM | #14 | 
| creator of calibre            Posts: 45,594 Karma: 28548962 Join Date: Oct 2006 Location: Mumbai, India Device: Various | |
|   |   | 
|  09-01-2009, 02:53 AM | #15 | |
| eBook Enthusiast            Posts: 85,559 Karma: 93980341 Join Date: Nov 2006 Location: UK Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6 | Quote: 
 Conversion speed from Mobi to ePub seems to be remarkably inconsistent. All my books are created via the same route: BD -> HTML -> Mobi Creator -> Calibre, and yet sometimes I have two books which are pretty much the same size, and not obviously of different "complexities", and yet one will convert in 15 seconds and the other will take 15 minutes. Not a problem - just an observation  . | |
|   |   | 
|  | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Programming language code snippets in ebooks? | Connochaetes | Writers' Corner | 7 | 10-18-2010 02:43 PM | 
| Computer programming books | JoshLessard | Amazon Kindle | 6 | 08-08-2010 06:08 PM | 
| PRS-500 500 Programming | MarzKrishna | Sony Reader Dev Corner | 1 | 12-17-2009 08:43 PM | 
| Free Programming Resources | hacker | Deals and Resources (No Self-Promotion or Affiliate Links) | 0 | 07-16-2005 11:24 AM |