You could also use a little python, with something that will run
the scripts on the page, then print it to .pdf.
This Example uses a web browsers engine, so it should be ok to post
it here, since it uses a web browser to print, the same as if you
loaded the page into a web browser and printed it.
Python3, PyQt5, QtWebEngine
Code:
#! /usr/bin/env python
import sys
from PyQt5.QtCore import QUrl, pyqtSignal
from PyQt5.QtWidgets import QApplication
from PyQt5.QtWebEngineWidgets import QWebEngineView, QWebEngineProfile
agent = ('Mozilla/5.0 (Windows NT 10.0; WOW64; rv:62.0)'
' Gecko/20100101 Firefox/62.0')
class PdfPrint():
def __init__(self, url, out_file):
super(PdfPrint, self).__init__()
self.agent = QWebEngineProfile()
self.agent.defaultProfile().setHttpUserAgent(agent)
def print_pdf():
self.printer.show()
self.printer.page().printToPdf(out_file)
self.printer = QWebEngineView()
self.printer.load(QUrl(url))
self.printer.loadFinished.connect(print_pdf)
if __name__ == '__main__':
app = QApplication([])
url = ('https://hobbydocbox.com/docview/78/78187251'
'/#file=/storage/78/78187251/78187251.pdf')
out_file = "MyFile.pdf"
PdfPrint(url, out_file)
sys.exit(app.exec_())
I was able to get it with that.(little screen cap)
Code:
curl https://ptpb.pw/H67f -o MyFile.jpg
The pages for that .pdf are delivered in blobs if that helps you.
You'll need to scroll down slowly to get them all in cache before
you try and print it.
Otherwise you are going to have to deal with blobs. I can list
them, but what are you going to do with them? Way easier
to print them after a browser has rendered them.
https://hobbydocbox.com/6ef349df-c39...b-7ea050939620
https://hobbydocbox.com/037f89b8-f55...2-ed93025169ab
https://hobbydocbox.com/739016b6-65d...f-a3a2bd1120f7
etc.
And as last resort, you could take a screenshot of every page
with scrot or imagemagic
Focused window
scrot -u
import -screen out.png
I can print a .pdf of that url/pdf here multiple ways. With webengine and Palemoon. It's a huge thing made of image files. Good luck.