|
|
#1 |
|
Junior Member
![]() Posts: 7
Karma: 10
Join Date: Jun 2025
Device: None
|
Multiple XML into 1 PDF?
Hello, I've been trying host of tools to achieve this but have always failed, although something tells me ebook-convert may have the key - although I'm not quite sure how to formulate the command
![]() Here are the XML files I want to convert: https://github.com/PerseusDL/lexica As far as I understand ebook-convert doesn't actually support XML, but for me rendering it as HTML is more than enough (in fact opening it in Firefox displays everything properly, all I need to do is 'stitch' the files together into one PDF file. Any suggestions for this particular case? (Here, after "gh repo clone https://github.com/PerseusDL/lexica", I navigated to the folders containing the XML files and tried appending a '.html' extension before passing all of the files at the same time as an argument with the command 'ebook-convert "*.html" .pdf', but ebook-convert wasn't able to parse them like Firefox did...
|
|
|
|
|
|
#2 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,615
Karma: 28549044
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
|
|
|
|
| Advert | |
|
|
|
|
#3 |
|
Junior Member
![]() Posts: 7
Karma: 10
Join Date: Jun 2025
Device: None
|
Ok, perfect, I've got this far for now:
Code:
bash-5.2$ ls
README.md grc.lsj.perseus-eng22.html
grc.lsj.perseus-eng1.html grc.lsj.perseus-eng23.html
grc.lsj.perseus-eng10.html grc.lsj.perseus-eng24.html
grc.lsj.perseus-eng11.html grc.lsj.perseus-eng25.html
grc.lsj.perseus-eng12.html grc.lsj.perseus-eng26.html
grc.lsj.perseus-eng13.html grc.lsj.perseus-eng27.html
grc.lsj.perseus-eng14.html grc.lsj.perseus-eng3.html
grc.lsj.perseus-eng15.html grc.lsj.perseus-eng4.html
grc.lsj.perseus-eng16.html grc.lsj.perseus-eng5.html
grc.lsj.perseus-eng17.html grc.lsj.perseus-eng6.html
grc.lsj.perseus-eng18.html grc.lsj.perseus-eng7.html
grc.lsj.perseus-eng19.html grc.lsj.perseus-eng8.html
grc.lsj.perseus-eng2.html grc.lsj.perseus-eng9.html
grc.lsj.perseus-eng20.html test.html
grc.lsj.perseus-eng21.html
bash-5.2$ cat test.html
<html>
<body>
<h1>Table of Contents</h1>
<p style="text-indent:0pt">
<a href="grc.lsj.perseus-eng1.html">First File</a><br/>
<a href="grc.lsj.perseus-eng2.html">First File</a><br/>
<a href="grc.lsj.perseus-eng3.html">First File</a><br/>
<a href="grc.lsj.perseus-eng4.html">First File</a><br/>
<a href="grc.lsj.perseus-eng5.html">First File</a><br/>
<a href="grc.lsj.perseus-eng6.html">First File</a><br/>
<a href="grc.lsj.perseus-eng7.html">First File</a><br/>
<a href="grc.lsj.perseus-eng8.html">First File</a><br/>
<a href="grc.lsj.perseus-eng9.html">First File</a><br/>
<a href="grc.lsj.perseus-eng10.html">First File</a><br/>
<a href="grc.lsj.perseus-eng11.html">First File</a><br/>
<a href="grc.lsj.perseus-eng12.html">First File</a><br/>
<a href="grc.lsj.perseus-eng13.html">First File</a><br/>
<a href="grc.lsj.perseus-eng14.html">First File</a><br/>
<a href="grc.lsj.perseus-eng15.html">First File</a><br/>
<a href="grc.lsj.perseus-eng16.html">First File</a><br/>
<a href="grc.lsj.perseus-eng17.html">First File</a><br/>
<a href="grc.lsj.perseus-eng18.html">First File</a><br/>
<a href="grc.lsj.perseus-eng19.html">First File</a><br/>
<a href="grc.lsj.perseus-eng20.html">First File</a><br/>
<a href="grc.lsj.perseus-eng21.html">First File</a><br/>
<a href="grc.lsj.perseus-eng22.html">First File</a><br/>
<a href="grc.lsj.perseus-eng23.html">First File</a><br/>
<a href="grc.lsj.perseus-eng24.html">First File</a><br/>
<a href="grc.lsj.perseus-eng25.html">First File</a><br/>
<a href="grc.lsj.perseus-eng26.html">First File</a><br/>
<a href="grc.lsj.perseus-eng27.html">First File</a><br/>
</p>
</body>
</html>
|
|
|
|
|
|
#4 |
|
Junior Member
![]() Posts: 7
Karma: 10
Join Date: Jun 2025
Device: None
|
This is the error message my browser would give when trying to open one of the files:
Your file couldn’t be accessed It may have been moved, edited, or deleted. ERR_FILE_NOT_FOUND |
|
|
|
|
|
#5 |
|
Junior Member
![]() Posts: 7
Karma: 10
Join Date: Jun 2025
Device: None
|
Here's one last issue I was having with the files (once these two problems are sorted I'll be able to successfully render in PDF
If I take this file for instance, which is supposed to show Greek characters, it's instead just displaying plain UTF-8. I wonder if I can somehow tell Calibre to display it like it says to do inside (what was originally) the XML file... Here is an example file to illustrate what I mean: https://raw.githubusercontent.com/Pe...rseus-eng1.xml |
|
|
|
| Advert | |
|
|
|
|
#6 |
|
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 48,260
Karma: 174315444
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
For what it may be worth, UTF-8 includes the Hellenic character set.
|
|
|
|
![]() |
| Tags |
| batch, html, pdf, render, xml |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| How to change wrong xml:lang in multiple files? | LucyOne | Conversion | 17 | 07-29-2021 07:14 AM |
| xml:lang empty (pdf to epub) | fxp33 | Conversion | 3 | 05-08-2015 12:40 AM |
| PDF to XML Conversion of Business Documents | citizen994 | Other formats | 3 | 11-18-2013 12:46 AM |
| PDF Conversion to XML | citizen994 | Other formats | 1 | 02-03-2012 01:46 AM |
| How to convert PDF to XML? | Ambar | Other formats | 3 | 01-12-2012 01:48 PM |