09-03-2011, 08:06 PM   #3
therealjoeblow
Zealot
therealjoeblow reads XML... blindfolded
Posts: 106
Karma: 52102
Join Date: Jun 2010
Device: Samsung Android Tablet w/Moon+ Pro Reader
Quote:
Originally Posted by jackie_w View Post
As long as your book uses double-quotes, rather than single, you might find some of them with a simple Notepad++ search:

Using regular expression search mode
Code:
“[^”]+</p>
Thanks - that works with proper slanted quotes, but it doesn't really help with regular plain " style quotes, which exist in a lot of the files I'm having issues with. When I replace “[^”] in the regex with "[^"], it ends up catching every class="calibre#" attribute in the <p> tags.

The only way I can think of to make that work is some intelligent search routine that keeps count of the " characters it finds in each paragraph while ignoring the ones in the class attributes.
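
For what it's worth, here is a minimal sketch of that counting idea in Python. It is only an illustration: it assumes each paragraph sits in its own <p>...</p> element and that quotes are literal " characters rather than &quot; entities, and the function name is made up.

```python
import re

TAG = re.compile(r"<[^>]*>")  # any tag, including its attributes
PARA = re.compile(r"<p\b[^>]*>.*?</p>", re.DOTALL)  # one <p>...</p> element

def find_unbalanced_paragraphs(html):
    """Return paragraphs whose visible text holds an odd number of " marks."""
    suspects = []
    for match in PARA.finditer(html):
        paragraph = match.group(0)
        # Drop the tags first so class="calibre#" is never counted.
        text = TAG.sub("", paragraph)
        if text.count('"') % 2 == 1:
            suspects.append(paragraph)
    return suspects
```

Dialogue that legitimately runs over several paragraphs will also be flagged, so the results would still need a manual look.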

Maybe there's a way with regex, but I'm really not that great at deciphering its syntax for something this complex. (I use it all the time to find broken paragraphs that start with lower case letters, using <p class="calibre#">[a-z].)
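
One possible pure-regex approach, sketched in Python so it can be checked: let the opening <p ...> tag swallow its own attribute quotes, then require the remaining text to contain quotes in pairs plus one left over. This is a guess at a workable pattern, not a tested Notepad++ recipe, and it assumes the paragraph has no inline tags such as <i> or <span>. The names are made up for illustration.

```python
import re

# <p\b[^>]*>           the opening tag; its class="..." quotes are consumed here
# (?:[^"<]*"[^"<]*")*  any number of complete "..." pairs in the text
# [^"<]*"[^"<]*</p>    exactly one more " before the closing tag
ODD_QUOTES = re.compile(r'<p\b[^>]*>(?:[^"<]*"[^"<]*")*[^"<]*"[^"<]*</p>')

def has_odd_quotes(paragraph):
    """True if the paragraph text has an odd number of " characters."""
    return ODD_QUOTES.search(paragraph) is not None
```

Because [^"<] stops at the first <, a paragraph with inline markup simply fails to match, so it would be missed rather than reported by mistake.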

The REAL Joe