LinuxQuestions.org
12-17-2010, 04:37 PM   #1
stf92
Senior Member
 
Registered: Apr 2007
Location: Buenos Aires.
Distribution: Slackware
Posts: 3,125

Rep: Reputation: 46
On the web browser and File>Save As>Web Page, HTML only.


Kernel 2.6.21.5, Slackware 12.0
Firefox 2.0
Xfce 4.4

Hi:
In the web browser, I do File>Save As and choose 'Web page, HTML only'. If the HTML file is foo.html, this omits the files that 'Web page, complete' would put into foo_files/ (the .png's, etc.). When you later run the file manager and click on foo.html, you have the illusion that foo.html contains the same information as if you had selected 'Web page, complete'. This is because the missing little files (.png, etc.) are automatically transferred through the network. You do not really have them on disk.

The main thing here is that I'm saving disk space and have what I need in a single file. When I want to read it, I disconnect from the network, thus saving time. I'm only interested in the text (let's say). The browser immediately shows me the text.


THE PURPOSE OF THIS POST: lately, when I do what I have just described, the browser keeps trying to connect. Why? Maybe this only happens with some web pages. I'm sure it used to produce the page instantly.

THE INCONVENIENCE: after downloading, I want to check whether the file is legible. But with the browser still trying to connect, that is impossible. Any hint will be welcome.
 
12-17-2010, 05:45 PM   #2
Tinkster
Moderator
 
Registered: Apr 2002
Location: in a fallen world
Distribution: slackware by choice, others too :} ... android.
Posts: 22,978
Blog Entries: 11

Rep: Reputation: 879
Chances are that (sometimes) the "little files" are still in your browser's
cache, and it doesn't need to go and fetch them from the net. At other
times (depending on the size of cache you set) they may have been removed
from the browser's cache, and it will try to satisfy the links in your
saved HTML file by getting the files from the net. Which won't work too
well when you're disconnected ;}
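
To see which links the browser would have to satisfy from the net, something like the following might help. It is only a rough sketch: the function name list_remote_refs is made up for illustration, it assumes a grep that supports -E and -o (GNU grep does), and it only catches double-quoted, lowercase src= and href= attributes that point at http:// or https:// URLs.

```shell
# list_remote_refs FILE   (hypothetical helper, not a standard tool)
# Print each distinct double-quoted src= or href= attribute in FILE
# whose value is an http:// or https:// URL, one per line.
list_remote_refs() {
    grep -Eo '(src|href)="https?://[^"]*"' "$1" | sort -u
}
```

Calling it as `list_remote_refs foo.html` prints one line per distinct remote reference. If it prints nothing, the saved file probably has nothing of this kind left to fetch.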


Cheers,
Tink
 
12-17-2010, 06:45 PM   #3
stf92
Original Poster
Thanks for your reply. I understood well, I think. Now let me give you some reasons for this bizarre procedure (disconnection).

There is the file as I'll see it five years from now, when the links will most probably have vanished. And there is the file as I am seeing it now, after loading it from disk, while the links can still be satisfied. How do I know I've got a decent-looking file? The only way seems to be disconnecting from the internet and waiting till the browser gives up.

To be honest, I only recently discovered the work that the OS or the internet does behind my back. I hope I didn't bore you too much. Regards.
 
12-17-2010, 08:47 PM   #4
Tinkster
You *could* just use sed to remove hrefs to external sites
from the files you saved...
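
A rough sketch of what that could look like. The function name strip_remote_refs is made up for illustration, and it assumes a sed that understands -E (recent GNU and BSD seds do; older GNU seds spell it -r). It only handles double-quoted, lowercase src= and href= attributes that sit on the same line as the preceding whitespace, so treat it as a starting point and not a complete HTML cleaner.

```shell
# strip_remote_refs FILE   (hypothetical helper, not a standard tool)
# Print FILE with every double-quoted src= or href= attribute that points
# at an http:// or https:// URL removed, leaving the browser nothing to
# fetch for those tags. Relative references (e.g. foo_files/...) are kept.
strip_remote_refs() {
    sed -E 's/[[:space:]]+(src|href)="https?:\/\/[^"]*"//g' "$1"
}
```

For example, `strip_remote_refs foo.html > foo-offline.html` writes a cleaned copy (foo-offline.html is just a placeholder name) and leaves the original untouched. Note that this also removes ordinary links to other sites, not only images and stylesheets; restrict the pattern to src if you want to keep those links clickable.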



Cheers,
Tink
 
12-18-2010, 04:21 AM   #5
stf92
Original Poster
Oh, I see. Well, thanks.
 
  


All times are GMT -5.