05-02-2011, 11:02 AM | #16 |
Resident Curmudgeon
Posts: 73,962
Karma: 128903250
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
It's not the process. It;s that the OP is telling you to go look for publisher pirated PDF to convert. V5.0 is not any version of Acrobat or any other software. v5.0 is the version number given to publisher originals as served by the pirates. Why is this a hard concept to grasp?
|
05-02-2011, 08:19 PM | #17 |
Addict
Posts: 351
Karma: 70000
Join Date: Jul 2010
Location: Australia
Device: ADE, iPad
|
Do you mean "Publisher" as in Book publisher or Microsoft Publisher...
I work for Book publishers and I've never heard the term you are referring to, this is why As you say the "Concept is hard to Grasp". |
Advert | |
|
05-02-2011, 08:31 PM | #18 |
Resident Curmudgeon
Posts: 73,962
Karma: 128903250
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
I mean the publishers putting out eBooks. You may not have heard of this because you've not gone to the dark side and looked around. I have and the OP is saying to start with a stolen publisher produced PDF.
|
05-02-2011, 11:43 PM | #19 |
Addict
Posts: 351
Karma: 70000
Join Date: Jul 2010
Location: Australia
Device: ADE, iPad
|
Humm...ok I'll take your word for it.
Maybe if the OP were to omit that reference. Because the rest of the stuff he wrote is helpful. |
05-03-2011, 09:36 AM | #20 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
I have to agree with JSWolf. The OP is absolutly talking about pirated PDF's. Otherwise the guide is nice, but not great. A lot of search and replaces could end up in undesired results...
|
Advert | |
|
05-03-2011, 10:16 AM | #21 |
Resident Curmudgeon
Posts: 73,962
Karma: 128903250
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Yes, the OP should remove the piracy references. And I think the fact that the mods didn't pick up on this means we need new mods who know about eBook piracy who would pick up on this. Heck, I even reported it and still nothing.
|
05-04-2011, 02:32 PM | #22 |
Connoisseur
Posts: 61
Karma: 12096
Join Date: Sep 2010
Location: Tasmania
Device: Sony PRS 650
|
Style
Styles from Word to Sigil
Method: Mark styles with pseudo-tags and save as text Unicode (UTF-8). Then open in Sigil and convert pseudo-tags to proper CSS tags. Realize that there are Character styles applied to a few words in a paragraph and Paragraph styles applied to whole paragraphs. When you transfer a PDF file to Word Character formatting is preserved (fonts, sizes, style, weight), but Paragraph formatting (indents, centering, spacing) is lost; that's if there ever was any in the PDF. I suspect not! ------------- IN WORD ------------- Note: in this section Wildcards can be left unchecked. Using tags for Character styles Italic style Put i_ before italic word(s) and _i after. Find/Replace: Code:
Find: blank More > Format > Font > italic Repl: i_^&_i Code:
Find: blank More > Format > Font > bold Repl: b_^&_b If you have tried tagging 'bold' and it has tagged the chapter headings you would need to modify the expressions given below ***. You could create your own tags for other character styles. Using pseudo-tags for chapter headings with Paragraph styles Let's suppose that you've managed to convert all the PDF chapter headings into Word's 'Heading 2' paragraph style. How? See *** at end of this post. These heading styles will need to be tagged before you convert the lot to a text file. Here's how: Code:
Find: blank More > Format > Style > Heading 2 Repl: h2_^& You can create your own tags for other Headings (h1_, h3_) if you've used these headings and their numbers warrant a global Find/Replace. If any of your lines have indents, the indent could be replaced with multiple spaces in the text file. To avoid this: Format menu > Paragraph Set all the indents to 'none' or zero. Now it's time to save your file and you'll be saving it as a text file with special encoding. File menu > Save As Give the file a name. Use the drop down menu of 'Save as type:' and select 'Plain Text (*.txt)'. A dialogue, 'File Conversion' pops up. Click 'Other encoding:' Select 'Unicode (UTF-8)' and click 'OK'. ----------- IN SIGIL ----------- Open the file in Sigil Add a stylesheet. (Use the one below until you've developed your own.) Copy into notepad and save as 'stylesheet.css'. Code:
@namespace h "http://www.w3.org/1999/xhtml"; @page { margin-top: 12pt; margin-bottom: 1pt } body { margin-left: 1%; margin-right: 1% } p { margin: 0; text-indent: 1em; font-family: "Times New Roman",serif; } h1, h2 { margin: 0; text-align: center; } .italic { font-style: italic; } .bold { font-weight: bold } .image { margin-top: 1em; margin-bottom: 1em; margin-left: 1.2em; text-align: center; max-height: 100% } blockquote { font-family: Arial, sans-serif; font-style: italic; } Select 'Add existing items...' and locate the file 'stylesheet.css' you've just created. Double click 'Section0001.xhtml' to open it. Go into Code View. Between the <head> </head> tags put this link to your stylesheet. Code:
<link href="../Styles/stylesheet.css" rel="stylesheet" type="text/css" /> All these Find/Replaces are done in Code View. CTRL H for Find/Replace Check use Regex. For Character styles use a Find/Replace like this: Code:
Find: i_([^_]*)_i Repl: <span class="italic">\1</span> Find: b_([^_]*)_b Repl: <span class="bold">\1</span> Always Find/Replace the inner tags first. For Paragraph styles use this type of Find/Replace to replace any 'h(digit)_' headings with <h1><h2> etc: Code:
Find: <p>h(\d)_([^</]*)</p> Repl: <h\1>\2</h\1> (You don't have to do this in Word. I've written a post in the Sigil section named 'Regex' which explains how to do this in Sigil.) Code:
https://www.mobileread.com/forums/showthread.php?t=130763 There are two ways to do this. The first method relies on the transferred PDF chapter headings having a distinguishing character style. The second uses Find/Replace based on text content. Using STYLE to alter Chapter headings to <h2> Format menu > Styles and Formatting Click in one of the chapter headings in the document. Find the style in the Formatting panel by looking for a blue outline box. Move the cursor to the right-hand side end of the box. A drop down menu arrow appears. Click the arrow and choose 'Select All x Instance(s)'. The nearer the x is to the number of chapters the better, but don' forget 'Prologue' and 'Epilogue'. With these multiple selections all you need to do is click 'Heading 2' once. Done. If there are headings like 'Part One', 'Part Two' you can use the same method replacing with 'Heading 1'. Using Find/Replace to alter Chapter headings to <h2> For these searches you must check 'Use wildcards'. We will find chapter headings and bracket them with the pseudo-tags h2_ ... _h2 1) The line starts with the word 'Chapter', 'CHAPTER' or 'chapter': Code:
Find: (^13)([Cc][Hh][Aa][Pp][Tt][!^13]@)(^13) Repl: \1h2_\2_h2\3 Code:
Find: (^13)([0-9]{1,2})(^13) Repl: \1h2_\2_h2\3 3) Chapter is headed by NUMBERS in WORDs (possibly hyphenated) only (example 'Forty-five'): Code:
Find: (^13)([A-z-]@)(^13) Repl: \1h2_\2_h2\3 In Sigil: At this stage you have a single HTML file. If you have created <h2> tags for Chapters you can use the following to split on chapter headings <h2>: In Code View. Code:
Find: (<h2) Repl: <hr class="sigilChapterBreak" />\1 Now press F6 |
Tags |
conversion, edit, epub, pdf, word |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Looking for an Epub workshop/class in SF Bay Area | SamL | Workshop | 3 | 04-27-2011 11:21 AM |
Digital Public Library Workshop | Giggleton | News | 0 | 03-01-2011 11:25 AM |
Steampunk Workshop - ya gotta see this | kennyc | Lounge | 0 | 12-29-2010 05:25 PM |
Other Fiction Audoux, Marguerite: Marie Claire’s Workshop, v.1, 3 September 2008. | Patricia | Kindle Books (offline) | 0 | 09-03-2008 12:12 AM |