![]() |
#1 |
Junior Member
![]() Posts: 7
Karma: 10
Join Date: Jun 2025
Device: None
|
Multiple XML into 1 PDF?
Hello, I've been trying host of tools to achieve this but have always failed, although something tells me ebook-convert may have the key - although I'm not quite sure how to formulate the command
![]() Here are the XML files I want to convert: https://github.com/PerseusDL/lexica As far as I understand ebook-convert doesn't actually support XML, but for me rendering it as HTML is more than enough (in fact opening it in Firefox displays everything properly, all I need to do is 'stitch' the files together into one PDF file. Any suggestions for this particular case? (Here, after "gh repo clone https://github.com/PerseusDL/lexica", I navigated to the folders containing the XML files and tried appending a '.html' extension before passing all of the files at the same time as an argument with the command 'ebook-convert "*.html" .pdf', but ebook-convert wasn't able to parse them like Firefox did... ![]() |
![]() |
![]() |
![]() |
#2 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,318
Karma: 27111242
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
|
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Junior Member
![]() Posts: 7
Karma: 10
Join Date: Jun 2025
Device: None
|
Ok, perfect, I've got this far for now:
Code:
bash-5.2$ ls README.md grc.lsj.perseus-eng22.html grc.lsj.perseus-eng1.html grc.lsj.perseus-eng23.html grc.lsj.perseus-eng10.html grc.lsj.perseus-eng24.html grc.lsj.perseus-eng11.html grc.lsj.perseus-eng25.html grc.lsj.perseus-eng12.html grc.lsj.perseus-eng26.html grc.lsj.perseus-eng13.html grc.lsj.perseus-eng27.html grc.lsj.perseus-eng14.html grc.lsj.perseus-eng3.html grc.lsj.perseus-eng15.html grc.lsj.perseus-eng4.html grc.lsj.perseus-eng16.html grc.lsj.perseus-eng5.html grc.lsj.perseus-eng17.html grc.lsj.perseus-eng6.html grc.lsj.perseus-eng18.html grc.lsj.perseus-eng7.html grc.lsj.perseus-eng19.html grc.lsj.perseus-eng8.html grc.lsj.perseus-eng2.html grc.lsj.perseus-eng9.html grc.lsj.perseus-eng20.html test.html grc.lsj.perseus-eng21.html bash-5.2$ cat test.html <html> <body> <h1>Table of Contents</h1> <p style="text-indent:0pt"> <a href="grc.lsj.perseus-eng1.html">First File</a><br/> <a href="grc.lsj.perseus-eng2.html">First File</a><br/> <a href="grc.lsj.perseus-eng3.html">First File</a><br/> <a href="grc.lsj.perseus-eng4.html">First File</a><br/> <a href="grc.lsj.perseus-eng5.html">First File</a><br/> <a href="grc.lsj.perseus-eng6.html">First File</a><br/> <a href="grc.lsj.perseus-eng7.html">First File</a><br/> <a href="grc.lsj.perseus-eng8.html">First File</a><br/> <a href="grc.lsj.perseus-eng9.html">First File</a><br/> <a href="grc.lsj.perseus-eng10.html">First File</a><br/> <a href="grc.lsj.perseus-eng11.html">First File</a><br/> <a href="grc.lsj.perseus-eng12.html">First File</a><br/> <a href="grc.lsj.perseus-eng13.html">First File</a><br/> <a href="grc.lsj.perseus-eng14.html">First File</a><br/> <a href="grc.lsj.perseus-eng15.html">First File</a><br/> <a href="grc.lsj.perseus-eng16.html">First File</a><br/> <a href="grc.lsj.perseus-eng17.html">First File</a><br/> <a href="grc.lsj.perseus-eng18.html">First File</a><br/> <a href="grc.lsj.perseus-eng19.html">First File</a><br/> <a href="grc.lsj.perseus-eng20.html">First File</a><br/> <a href="grc.lsj.perseus-eng21.html">First File</a><br/> <a href="grc.lsj.perseus-eng22.html">First File</a><br/> <a href="grc.lsj.perseus-eng23.html">First File</a><br/> <a href="grc.lsj.perseus-eng24.html">First File</a><br/> <a href="grc.lsj.perseus-eng25.html">First File</a><br/> <a href="grc.lsj.perseus-eng26.html">First File</a><br/> <a href="grc.lsj.perseus-eng27.html">First File</a><br/> </p> </body> </html> |
![]() |
![]() |
![]() |
#4 |
Junior Member
![]() Posts: 7
Karma: 10
Join Date: Jun 2025
Device: None
|
This is the error message my browser would give when trying to open one of the files:
Your file couldn’t be accessed It may have been moved, edited, or deleted. ERR_FILE_NOT_FOUND |
![]() |
![]() |
![]() |
#5 |
Junior Member
![]() Posts: 7
Karma: 10
Join Date: Jun 2025
Device: None
|
Here's one last issue I was having with the files (once these two problems are sorted I'll be able to successfully render in PDF
![]() If I take this file for instance, which is supposed to show Greek characters, it's instead just displaying plain UTF-8. I wonder if I can somehow tell Calibre to display it like it says to do inside (what was originally) the XML file... Here is an example file to illustrate what I mean: https://raw.githubusercontent.com/Pe...rseus-eng1.xml |
![]() |
![]() |
Advert | |
|
![]() |
#6 |
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,946
Karma: 168961900
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
For what it may be worth, UTF-8 includes the Hellenic character set.
|
![]() |
![]() |
![]() |
Tags |
batch, html, pdf, render, xml |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
How to change wrong xml:lang in multiple files? | LucyOne | Conversion | 17 | 07-29-2021 06:14 AM |
xml:lang empty (pdf to epub) | fxp33 | Conversion | 3 | 05-07-2015 11:40 PM |
PDF to XML Conversion of Business Documents | citizen994 | Other formats | 3 | 11-17-2013 11:46 PM |
PDF Conversion to XML | citizen994 | Other formats | 1 | 02-03-2012 12:46 AM |
How to convert PDF to XML? | Ambar | Other formats | 3 | 01-12-2012 12:48 PM |