|  04-05-2017, 03:18 PM | #1 | 
| Wizard            Posts: 2,306 Karma: 13057279 Join Date: Jul 2012 Device: Kobo Forma, Nook | 
				
				Potential Enhancement: Links Report
			 
			
Current Functionality

Open the attached sample EPUB with Sigil's Tools > Reports > Links. Double-click on any link:

If you click on any link in Chapter 1, Sigil will jump to the very top of Chapter1.xhtml.
If you click on any link in Chapter 2, Sigil will jump to the very top of Chapter2.xhtml.

(Not very helpful.)

Expected Functionality

Jumping to the link's exact location in the HTML file (see the way Calibre's Editor Tools > Reports > Links handles it). If you click on "Example Link #3", Calibre jumps directly to the <a> containing "Example Link #3".

Why This Would Be Helpful

I was working on a book with 100+ hyperlinks (a collection of online articles). While Sigil's Links Report is very helpful for spotting general errors (typos, broken links, errors), I had to manually find/search each individual link if there was a problem:

Code:
<p>As the fantastic W<a href="http://www.wikipedia.org/">ikipedi</a>a article said.</p>   | 
|   |   | 
|  04-05-2017, 04:05 PM | #2 | 
| Sigil Developer            Posts: 9,070 Karma: 6361556 Join Date: Nov 2009 Device: many | 
			
			I should be able to modify that report to keep track of the line number of the file where the link is located, and then at least move you to the right line in that file without too much difficulty.  The only issue is where in that particular line (the column number) it will be. For normal-length lines in Code View it should be easy to see the link itself. Unfortunately, the current signal design only passes the line number, with no column or offset info. Let me see if I can get at least the right line of the file to come up. Would that help? Last edited by KevinH; 04-05-2017 at 04:12 PM. | 
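Editor's sketch: the idea of recording where each link sits in a file can be illustrated in a few lines of Python. This is not Sigil's code, only a rough illustration using the standard library's `html.parser`. The class name `LinkLocator`, the helper `locate_links`, and the sample markup are all made up for the example.

```python
from html.parser import HTMLParser


class LinkLocator(HTMLParser):
    """Collect (line, column, href) for every <a href="..."> start tag."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href is not None:
            # getpos() reports the position of the tag being handled:
            # the line number is 1-based, the column offset is 0-based.
            line, col = self.getpos()
            self.links.append((line, col, href))


def locate_links(source):
    """Return the positions of all links found in an (X)HTML string."""
    parser = LinkLocator()
    parser.feed(source)
    parser.close()
    return parser.links


# Hypothetical sample file; the link is on the third line.
sample = (
    "<html><body>\n"
    "<p>Intro</p>\n"
    '<p>As the fantastic W<a href="http://www.wikipedia.org/">ikipedi</a>a article said.</p>\n'
    "</body></html>\n"
)

for line, col, href in locate_links(sample):
    print(f"line {line}, column {col}: {href}")
```

A report that stored the line (and, if the signal allowed it, the column) alongside each row could then jump straight to the link instead of the top of the file.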
|   |   | 
|  04-05-2017, 06:10 PM | #3 | 
| null operator (he/him)            Posts: 22,004 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | 
			
			I occasionally edit texts with a lot of links to endnotes, references etc.  I never did 'get' what Sigil's Links Report was supposed to do for me, but when Kovid produced his Links report I 'got' it immediately.   I'd have to do a side-by-side comparison to say anything further as it's been a while since I've used either. But I do seem to remember I had the impression that Sigil's Links Report was a work-in-progress. Pssst - I also like calibre's word list, handy for spotting inconsistent spelling, especially of proper nouns, abbreviations, and acronyms. Had a paper from Yale the other day with at least three FIBs that should have been FBIs. Deliberate?  Don't think so, there were a couple of other valid 'wrong' words that were obvious typos or ignorance (e.g. effect that should have been affect), otherwise it was as dry as moon-dust. BR | 
|   |   | 
|  04-05-2017, 06:35 PM | #4 | 
| Sigil Developer            Posts: 9,070 Karma: 6361556 Join Date: Nov 2009 Device: many | 
			
			So is that a "it would help"?
		 | 
|   |   | 
|  04-05-2017, 07:55 PM | #5 | 
| Sigil Developer            Posts: 9,070 Karma: 6361556 Join Date: Nov 2009 Device: many | 
			
			BTW:  I just pushed this change to master.  It seems to work quite well with your test epub. Thanks for your request and test epub to illustrate exactly what you needed. If moving to the line is not enough, then a more major reworking of Report signals will be needed sometime in the future. Hope this helps. | 
|   |   | 
|  04-05-2017, 10:17 PM | #6 | 
| null operator (he/him)            Posts: 22,004 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | 
			
			What would help (assuming it's possible) is a facility to filter by 'link purpose' - i.e. endnotes, references, index entry, toc entry, image reference, etc.  My primary concerns are endnotes and references; if the book has an extensive index they tend to get swamped by the index entries.  The endnote, reference and index entries in some of my texts span multiple xhtml files.  I have never seen anything in 'Target Text', 'Target's Target File', 'Target's Target ID' or 'Match', including in books where all the links seem to work as I would expect them to work. That observation is where my 'work-in-progress' comment originates. It would also help if the window had max and min buttons available, and if the window remembered the column sizes last set. NB: I didn't mean to imply that the calibre Links report is without its problems too, only that it seems to be closer to what I want. BR Last edited by BetterRed; 04-05-2017 at 11:31 PM. | 
|   |   | 
|  04-05-2017, 11:45 PM | #7 | 
| Sigil Developer            Posts: 9,070 Karma: 6361556 Join Date: Nov 2009 Device: many | 
			
			A link's purpose (endnote, index, etc.) can't actually be determined without additional semantic epub:type info, and that would only be in epub3.  I can try to determine why the target info is empty.  It should not be.  The provided testcase has all external links so it did not test that. Sigil also doesn't need to remember column sizes, as they are auto-resized to hold the required info in each column. Sigil should also remember any window sizes set. I will look to see why the target info is not set for internal link targets in the report. With the target file set and sortable you should be able to group/identify index entries from the filename. Possibly endnotes too, if they are stored in a recognizable filename or names. | 
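Editor's sketch: grouping links by `epub:type` (where the markup carries it) and by target file could look roughly like the Python below. It is an illustration only, not how Sigil builds its report. `LinkClassifier`, `classify_links`, and the "unknown" bucket are assumptions made for the example; `noteref` is the EPUB 3 semantic value commonly used on note references.

```python
from collections import defaultdict
from html.parser import HTMLParser
from urllib.parse import urlsplit


class LinkClassifier(HTMLParser):
    """Bucket links by their epub:type attribute, if they have one."""

    def __init__(self):
        super().__init__()
        self.by_purpose = defaultdict(list)

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attr = dict(attrs)
        href = attr.get("href")
        if href is None:
            return
        # Without epub:type (e.g. in an EPUB 2 book) the purpose is unknown.
        purpose = attr.get("epub:type") or "unknown"
        parts = urlsplit(href)
        # External links have a host; internal ones only have a file path.
        target_file = parts.netloc or parts.path
        self.by_purpose[purpose].append((target_file, href))


def classify_links(source):
    """Return {purpose: [(target_file, href), ...]} for one XHTML string."""
    parser = LinkClassifier()
    parser.feed(source)
    parser.close()
    return dict(parser.by_purpose)
```

For links that land in the "unknown" bucket, sorting on the `target_file` element gives the filename-based grouping described above.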
|   |   | 
|  04-06-2017, 12:01 AM | #8 | 
| Sigil Developer            Posts: 9,070 Karma: 6361556 Join Date: Nov 2009 Device: many | 
			
			I checked the code and 'Target Text', 'Target's Target File', 'Target's Target ID' or 'Match' are only filled in when the target of a link is itself an anchor tag with a link someplace else (or back). So that appears to work as designed. I will test this tomorrow and verify that. | 
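Editor's sketch: the rule described above (target columns are filled only when the link's target is itself an `<a>` carrying its own `href`) can be illustrated in Python as follows. This is a guess at the shape of the logic, not Sigil's implementation. `AnchorIndex`, `target_info`, and the `files` mapping of filename to source text are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urlsplit


class AnchorIndex(HTMLParser):
    """Map the id of each <a id="..."> to that element's href (or None)."""

    def __init__(self):
        super().__init__()
        self.anchors = {}

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "a" and "id" in attr:
            self.anchors[attr["id"]] = attr.get("href")


def target_info(link_href, files):
    """Return the target's own target, or None if the columns stay empty.

    `files` maps a filename to its XHTML source (an assumption made for
    this sketch).
    """
    parts = urlsplit(link_href)
    source = files.get(parts.path)
    if source is None or not parts.fragment:
        return None
    index = AnchorIndex()
    index.feed(source)
    index.close()
    # None here means: no such <a id>, or the anchor has no href of its own.
    onward = index.anchors.get(parts.fragment)
    if onward is None:
        return None
    onward_parts = urlsplit(onward)
    return {"target_file": onward_parts.path, "target_id": onward_parts.fragment}
```

Under this reading, a link to a plain `<p id="...">` leaves the columns empty, while a link to an endnote anchor that links back to the text fills them in.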
|   |   | 
|  04-06-2017, 02:42 AM | #9 | 
| null operator (he/him)            Posts: 22,004 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | 
			
			@KevinH  - re filtering by link purpose, I had a feeling that might not be easy, except perhaps via semantic tags, but then I run into the problem that endnotes, references and indexes often span multiple files, and I would prefer not to merge them.  How about including/excluding by file? Then I could exclude the files containing index links.   The problem with auto-sizing the columns is that they seem to auto-size to the widest value, so when they contain web addresses they can be a mile wide, e.g. Code: http://www.bundesfinanzministerium.de/Content/DE/Standardartikel/hemen/Internationales_Finanzmarkt/Finanzmarktpolitik/2011-02-15-gutachten-bankenbeteiligung-anlage.pdf?__blob=publicationFile&v=3 The Links Report is not a major issue for me, I'm happy to take whatever is on offer, and I'm totally agnostic regarding which tools I use, whatever gets the job done at the time... and if it means using two tools, so be it. BR | 
|   |   | 
|  04-06-2017, 03:12 AM | #10 | |||
| Wizard            Posts: 2,306 Karma: 13057279 Join Date: Jul 2012 Device: Kobo Forma, Nook | Quote: 
 The real thing that was making me pull my hair out was Sigil just jumping to the top of the HTML files.  Quote: 
 See attached EPUB for all these examples: Example #1: Take your FBI example, maybe they link throughout the book as: Spoiler: 
 Spaces very easily creep their way into <a> when using the mouse + WYSIWYG editors. These things stand out like a sore thumb when sorted. :P And a typo (fib.gov) can also easily sneak into the href (this is very hard to spot just looking through the Code View). Example #2: In the case of this specific book I was working on, there was an article that referenced (Author Year) with a link to the specific book/paper: Spoiler: 
 After sorting, you could see it show up in the list as: Spoiler: 
 Example #3: It is also very helpful when you do things like changing footnotes from <sup>##</sup> -> [##]. If I sort and see a giant list of: Spoiler: 
 I can go take a closer look. That single digit number should not be there!  Example #4: The "Target Exists?" column is also fantastic for catching missing footnotes: Spoiler: 
 you can sort "Target Exists?" and see "no" in the column, then you can investigate. Example #5: You can also sort all the links by "Target File" and take a very close look at external websites. For example, if there are lots of YouTube links, you may want to check to see if those still exist, what the titles are, etc. (This particular article pointed to a YouTube embed link instead of just pointing directly to the normal YouTube video.) This sort of stuff is EASILY spotted in the Report. Side Note: Calibre's Links Report also lets you click on the links right there and open them up in your browser. Pretty helpful. Side Note #2: I don't know when it was added, but Calibre's Tools > External Links > Check External Links is absolutely fantastic for catching HTTP 403/404 errors. When you import/work from a collection of web articles, it helps because you can then go hunting for the dead links and replace them with newer/working ones (for example, there may be broken Facebook/Twitter links, or a site has updated and hasn't redirected links properly). Quote: 
 Those columns are used if you do something like link to a separate Endnotes file: Spoiler: 
 Last edited by Tex2002ans; 04-06-2017 at 03:20 AM. | |||
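Editor's sketch: the "sort the report and look for oddities" workflow from Example #1 can be approximated in Python. The code below collects each link's text and href, sorts the rows, and flags link text with stray leading or trailing whitespace. It is a rough illustration, not either editor's code. `LinkTextCollector` and `suspicious_rows` are made-up names, nested `<a>` elements are ignored, and any markup fed to it in a test would be invented sample data.

```python
from html.parser import HTMLParser


class LinkTextCollector(HTMLParser):
    """Collect (link text, href) for every <a href="...">...</a>."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished (text, href) pairs
        self._href = None   # href of the <a> currently open, if any
        self._chunks = []   # text pieces seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href is not None:
                self._href = href
                self._chunks = []

    def handle_data(self, data):
        if self._href is not None:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.rows.append(("".join(self._chunks), self._href))
            self._href = None


def suspicious_rows(source):
    """Return sorted (text, href) rows whose text has stray whitespace."""
    collector = LinkTextCollector()
    collector.feed(source)
    collector.close()
    rows = sorted(collector.rows, key=lambda row: (row[1], row[0]))
    return [(text, href) for text, href in rows if text != text.strip()]
```

Sorting by href first also puts near-duplicates such as a mistyped domain next to the correct one, which is the same effect as sorting the report's columns by hand.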
|   |   | 