Quote:
Quote:
I have tried one thing which works half-ways there. Using the add-on FlashGot, together with wget. It downloaded only the photos I wanted. The problem is that it took three times the time compared to IDM and seemed to use a lot of resources and also it was strenuous on the system hard drive, because of writing temp files or whatever. And most of all, there was no way to repeat the process, checking for and only downloading new photos and skip the ones already downloaded. At least not without cross-checking manually. I just tried FlashGot with the browser built-in downloader and it got the files faster than anything! There still is the issue(?) of not checking automatically for new files, but since it was that quick, I could as well just trust the speed of the downloader enough to manually repeat the process at whatever pace. I did not imagine FlashGot being that fast and good, after having used it now and then, long ago, since I first got it, which was some years ago... (Perhaps this also means that I can finally ditch IDM (Internet Download Manager), which is a terrible program, full of crappy code and it's not even free.) We haven't figured out how to get wget to do this on its own, but maybe that isn't necessary. I will mark this as solved then, I guess. |
For anyone following this thread, that page could be parsed with a little python and something that follows scripts to make a list.
Then download them. Simple example with a few url's: Code:
agent="Mozilla/5.0 (Windows NT 10.0; WOW64; rv:56.0) Gecko/20100101 Firefox/56.0" Edit: syntax |
Code:
list=( |
Sorry @pan64, went too fast. No quotes should be around $list
Code:
for i in $list; do echo "curl -A "$agent" "$i" -o "${i##*/}""; done Code:
list=( |
Quote:
or did you mean to first download the page with your browser (with javascript enabled), then parse it with python? |
Quote:
|
Quote:
Quote:
|
@ondoho
Quote:
Then dump the page requests, or specific requests to terminal or file, parse it etc. For example this python3 code snippet Code:
import sys, os, subprocess, re, time, calendar Quote:
Code:
wget -i file.txt There have been quite a few posts on LQ lately on this topic, and there seems to be a lack of interest with new members to learn how to script anything. If I can't click a button in Ubuntu then well...I'm just lost. So I posted as much as possible without actually posting a script that will hack a website. Scripting is Linux, it's one of the most powerful aspects of *nix. I can see that some of them don't like it, or are unable. Well then find a gui program like the OP did. And of course, the OP's aren't the only ones reading these threads. New members read these threads and don't participate, but pick up things, just like I did. @pan64 Quote:
Thanks. |
Quote:
|
I am still here, reading. :)
Nice to see I caught some interest. I'm totally oblivious to making scripts, however and wouldn't know where to start even. Let alone what type of script and what "language", etc. So I'm guessing this isn't beginner stuff. I'm quite intrigued by this actually, and I'd love to see a solution to it, so I marked the thread as unsolved. Go ahead and brainstorm! :D |
Quote:
i should've known that python has a library for everything. still, I'm impressed. |
All times are GMT -5. The time now is 11:23 AM. |