|  01-11-2014, 08:44 PM | #31 | 
| Software Developer            Posts: 190 Karma: 89000 Join Date: Jan 2014 Location: Germany Device: PocketBook Touch Lux 3 | 
			
			Well, I would at least still like to know the following: You don't have to answer if you don't like, but with the experience and the working environment you're in, I would indeed consider some hints about it valuable. Last edited by skreutzer; 01-11-2014 at 08:47 PM. | 
|   |   | 
|  01-12-2014, 02:35 AM | #32 | |
| Wizard            Posts: 4,520 Karma: 121692313 Join Date: Oct 2009 Location: Heemskerk, NL Device: PRS-T1, Kobo Touch, Kobo Aura | Quote: 
 Just look at writer software out there. There are various programs out there (both commercial and open source). They are aimed to supporting and helping writers and their creative process. Only a very small portion of writers are using it, altough most of them know it is out there (I think) and available. If writers are not even considering using software geared towards them, how would convince them to use this? In real life I work in the integration field of software applications, I know quite well of the issues of different formats and mapping, especially with XML. I have professionally architected a framework to simplify the process to a great extent. I know the benefits of what you suggest. I just don't see it coming for this area without great, great issues. Without Word support, you can forget about it. Since you are ethically prohibited to use it, there is no way you can support it. | |
|   |   | 
|  01-12-2014, 04:51 AM | #33 | 
| Grand Sorcerer            Posts: 5,763 Karma: 24088559 Join Date: Dec 2010 Device: Kindle PW2 | 
			
			@skreutzer: Two other Open Source projects, recently announced ePub support: AbiWord 3.0 Calligra Author 2.6 But, as is all too common in the Linux world, both projects have not released any Windows binaries thus excluding about 90% of all potential users. It'd be great, if you could create Windows builds for these projects. You could also either join the development team of these projects to further your ideas or create a customized version that encourages using styles instead of direct formatting. However, before you proceed any further, it couldn't hurt to ask authors: - what tool they actually use for writing books (Word, Jutoh, AWP, Scrivener etc), - why they don't use Sigil, - what authoring features they'd like to see implemented etc. (You can create a poll in the Writer's Corner forum.) Last edited by Doitsu; 01-12-2014 at 07:54 AM. | 
|   |   | 
|  01-12-2014, 09:17 AM | #34 | 
| Member  Posts: 12 Karma: 10 Join Date: Dec 2011 Device: Sony PRS-T1 | 
			
			Well as I understand that release note about Abiword 3.0 it shall be available for Windows. In case of Calligra I agree, because their releases are quite rarely for Windows. | 
|   |   | 
|  01-12-2014, 10:48 AM | #35 | |||||
| Software Developer            Posts: 190 Karma: 89000 Join Date: Jan 2014 Location: Germany Device: PocketBook Touch Lux 3 | Quote: 
 Quote: 
 By the benefits? Quote: 
 Quote: 
 In AbiWord 2.8.2, it seems there's no direct way to edit a style template. However, since direct formatting can be disabled and XHTML output is present, it could be used as front end for automated processing workflows. Looks interesting to me, I will look into it. Thanks for the hint! Quote: 
 What I'm looking for is some kind of “special purpose”, which would be used by a formatting guy if the writer doesn't want to do the semantic markup himself, or by the writer after he has completed the entire text. It should be possible to apply style templates on the flow (and direct formatting needs to be prevented), but I'm convinced that writing and formatting are two separate steps. | |||||
|   |   | 
|  01-12-2014, 12:37 PM | #36 | 
| Grand Sorcerer            Posts: 28,869 Karma: 207000000 Join Date: Jan 2010 Device: Nexus 7, Kindle Fire HD | 
			
			Mixing ideology with functionality is almost always a recipe for disaster--whether in open or proprietary projects. I, myself, am not really interested in coding projects that can't (relatively) easily be compiled and/or run on the three major platforms. That's why I've always appreciated Sigil so much.
		 Last edited by DiapDealer; 01-12-2014 at 12:43 PM. | 
|   |   | 
|  01-12-2014, 01:28 PM | #37 | |||
| Grand Sorcerer            Posts: 5,763 Karma: 24088559 Join Date: Dec 2010 Device: Kindle PW2 | Quote: 
 Quote: 
 Quote: 
 | |||
|   |   | 
|  01-12-2014, 02:59 PM | #38 | 
| Grand Sorcerer            Posts: 28,869 Karma: 207000000 Join Date: Jan 2010 Device: Nexus 7, Kindle Fire HD | 
			
			Just to be clear... I'm talking about standalone projects. Obviously addons/plugins and the like are exempt. I'm thinking of Toxaris here (who I wouldn't want to offend with my previous post). Clearly his Word addon can't be cross-platform; and that doesn't change its value (to me and others). My point is: write code that will compile on as many platforms as possible. Test on as many platforms as possible and provide as many binaries as possible (along with the source) for those who have no interest in compiling. Let the user worry about whether or not it all fits their personal ideological preferences. That's my definition of "open." Not excluding anyone, for any reason. Last edited by DiapDealer; 01-12-2014 at 04:20 PM. | 
|   |   | 
|  01-12-2014, 03:01 PM | #39 | |
| Ex-Helpdesk Junkie            Posts: 19,421 Karma: 85400180 Join Date: Nov 2012 Location: The Beaten Path, USA, Roundworld, This Side of Infinity Device: Kindle Touch fw5.3.7 (Wifi only) | Quote: 
   | |
|   |   | 
|  01-12-2014, 03:38 PM | #40 | ||
| Bookmaker & Cat Slave            Posts: 11,503 Karma: 158448243 Join Date: Apr 2010 Location: Phoenix, AZ Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2 | Quote: 
 And when you say, "For the topic of my initial question, I would really like to hear how you clean the input from authors in terms of structure, and if/how the added structural information is used for a later automated processing," I'm not really sure what you're asking me. Are you asking me, generally, what do we do, or proprietarily, what do we do? Generally, we do what everyone else does, I presume, that's competent. The first step is an "either or." 
 It's not rocket science, it's just repetitive tedium. The problem is, as I see it, no two authors make the SAME mistake in the SAME way each time. Sometimes, errata is "MSONormal center bold" for a header; sometimes it's "MSONormal 18pt Bold" for that same header. Sometimes, we get files that are simply inexplicable, as to how styling that's in there doesn't show up in Styles, and doesn't show up in exported HTML (very common with any Pages-->Word "conversions" output). It would all be swell and good if authors had 5 styles to pick from, used 'em, and that was that. You could automate the process and Bob's yer uncle. But that's not what they are accustomed to, and that's not what they want. As Tox points out better than I could in his post, there are SCADS of tools out there already, that would work better (from a conversion standpoint), that authors already don't want to use. {shrug}. I just think that you are expecting right-brainers to somehow magically see the advantages of working in a left-brained environment, and my experience, for what it's worth, is that that ain't ever gonna happen. Not only are they utterly disinterested in what is going on behind the scenes, they don't WANT to know, don't CARE to know, and somehow, think it makes them less creative if they understand the "how." This is my experience. Feel free to ignore it. However, of all the phone calls I take, I cannot tell you how many tell me either, that 'I'm not good with computers," or, even worse, "I'm really very tecchie but I need help with this," the latter of which means that when the time comes for that person to download a file from a browser interface, the s**t will hit the fan. Nor do they know where their downloads folder is, or how to drag-and-drop. That's what that last sentence means. I'm not disparaging them, but your idea just seems to utterly ignore the reality of a writer's inner world. That's how I interpret what you've said thus far. Sort of, "well, I'm making this tool for myself (which is fine; that doesn't faze me), and for writers, and if they don't want to learn it, the hell with them." And if that's the entire gist--that you'll make it for yourself, and if anyone else wants to use it, they can--then great. But if you're asking everyone here for input and assistance and feedback, for a tool to be used widely, that would, purportedly, make OUR jobs easier, then you need to ALSO be open to the fact that maybe some of us might have a little more experience in the real-world environment in which you expect this to function. Just an idea. Quote: 
 Hitch | ||
|   |   | 
|  01-12-2014, 04:04 PM | #41 | |
| Wizard            Posts: 4,520 Karma: 121692313 Join Date: Oct 2009 Location: Heemskerk, NL Device: PRS-T1, Kobo Touch, Kobo Aura | Quote: 
 | |
|   |   | 
|  01-12-2014, 10:28 PM | #42 | |
| temp. out of service            Posts: 2,818 Karma: 24285242 Join Date: May 2010 Location: Duisburg (DE) Device: PB 623 | Quote: 
  That's exactly the point where you didn't listen to Hitch. I've met enough people being just of the cut Hitch speaks of: learn resistant homo habilis or lernresistentes Gewohnheitstier if you prefer. The only authors potentially willing to learn about good tools might be scientific ones because they heavily need a stable toolset for their work. Their work turns to shitpaper w/o a good reference apparatus so a lot of them happily switched from text editors aimed at authors (like NotaBene) to *TeX. As for making people switch OSs because of software... Why? Have you considered that they might use more than one program? Where do you help if your tool isn't all systems available and so a potential switcher cannot try and use it while waiting for an alternative for a different piece of software s/he uses too? If you truly want to promote free software you should do so. If you want to pull people to Linux... Don't. Linux isn't for people wanting a free windows. Linux is for people wanting Linux - the other ones won't switch their thinking about how an OS should operate. P.S.: If Linus has no problem with offering his stuff for multiple OSes nonfree ones included: http://subsurface.hohndel.org/de Why should you? More Catholic than the Pope eh? | |
|   |   | 
|  01-13-2014, 10:56 AM | #43 | |
| Guru            Posts: 698 Karma: 150000 Join Date: Feb 2010 Device: none | Quote: 
   | |
|   |   | 
|  01-13-2014, 11:41 AM | #44 | |
| Guru            Posts: 698 Karma: 150000 Join Date: Feb 2010 Device: none | Quote: 
 I work for a small-press "traditional" publisher, publishing in both print and e-book (usually both for a given title, but sometimes only one or the other). Typically our authors supply manuscripts as Word documents, and they are unaware of the proper use of styles. They do, however, often have a definite idea of how they want their manuscript typeset, and as you would expect in a WYSIWYG enviornment, they achieve the desired results by hook or crook. For example, one passage is poetry, indented with tabs; another is a blockquote; and a third is supposed to be a letter, so it's in a script font. Paragraph indentation is achieved via tabs and/or spaces. Vertical spacing is done with empty paragraphs. There may or may not be page breaks at the end of chapters. Once a manuscript is accepted, a manuscript passes through one or more editors, and back and forth to the author and publisher, until it is finally ready for prime time. Everyone uses Word and its native format. Wouldn't it be wonderful if they all used a collaborative, web-based process? Not. Going. To. Happen. Once the manuscript has seen edits, it comes to me. My goal is to "clean it up" by replacing the direct formatting and hook-or-crook styling with a uniform standardized set of paragraph and character House styles, while preserving the intent of the author's formatting in so far as possible. If the book is destined for print, I pull it into InDesign, and do the cleanup there. If there won't be a print version, or if we need a quick ebook as an ARC, I use LibreOffice, save as an .odt file and export that to epub via Writer2xhtml. The actual cleanup issues were well described by Hitch, though we don't get some of the more egregious cases -- manuscripts like that would never be accepted in the first place. Some things can be handled automatically: removing whitespace at the beginning of a paragraph, replacing direct formatting (e.g. italics, bold) with the appropriate character style, imposing house styles in place of whatever styles the author did or didn't use (usually the "defalult" style). Treatment of chapter headers, styling the first paragraph, and some other idiosyncratic details, however, require case-by-case intervention. Sometimes you just have to look at it to decide what the author was trying to do. After epub export from either InDesign or LibreOffice, it goes to Sigil for chapter splitting, adding the cover, adding lots of metadata to the .opf file, and generation of TOC's (typically both an .ncx and an inline TOC.xhtml). A modified version is also prepared for use with kindlegen to make the Amazon Kindle version. Going forward, we use the InDesign file (if there is one) as the master file to collect all of the inevitable corrections that only magically appear after the book is in print, and when necessary generate updated ebooks from it. HTH Albert | |
|   |   | 
|  01-13-2014, 07:06 PM | #45 | |
| Bookmaker & Cat Slave            Posts: 11,503 Karma: 158448243 Join Date: Apr 2010 Location: Phoenix, AZ Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2 | 
			
			I would say that, in all likelihood, 95% of what Albert and I (I = "we" at my company) do is identical.  The process is the process. There's just no silver bullet.  God knows, I wish there were.  But as long as we have individual authors using "hook or crook" to get manuscripts to do what they think they want it to look like, we'll have different ad hoc styling (inline styling) problems to solve.  I don't have the luxury that my buddy Albert has--no editors to help look for that type of garbage--so that's why we get the "more egregious" cases. Here is an ACTUAL sample of what we deal with; I have a woman that's been emailing me for the last 6-12 months. She had a book in print, in Europe, as she is. She had a "friend" do the scanning of the print book, in France. The resulting scan had all the usual horrors--page numbers midst-page, section breaks, varying margins, columns set (yes), you-name-it. She couldn't even begin to work on it. The friend didn't use Abbyy; used a desktop scanner and some type of older scanning software. I took pity on her (yes: I'm an idiot) and cleaned it up a bit, doing nothing more than tagging the italics and bold, removing the 5 gajillion styles, conforming the fonts, margins and other settings, removing all the section breaks. However, it had a ZILLION broken paragraphs, which of course she never saw. When I fixed the margins, she could of course now see them (doesn't know from a pilcrow). I'd sent it back to her for proofing, and for fixing the broken paras. Here is, I s**t thee not, text from an actual email from LAST NIGHT, after I painstakingly took screenshots, sent her links to a tutorial, etc., about cleaning up the broken paragraphs: Quote: 
 Hitch | |
|   |   | 
|  | 
| Tags | 
| sigil, wysiwym, xml | 
| Thread Tools | Search this Thread | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Marvin as a cloud front-end | taguntumi | Marvin | 9 | 11-22-2013 08:21 PM | 
| [Old Thread] Web Front end | DezmondFinney | Development | 24 | 12-18-2012 08:53 AM | 
| soPDF GUI Front-End | Nathan Campos | 37 | 11-04-2011 07:45 PM | |
| Web front end | DezmondFinney | Development | 7 | 08-10-2011 09:51 AM | 
| Hacking the front-end | DezmondFinney | Development | 18 | 08-05-2011 03:22 AM |