LinuxQuestions.org
Download your favorite Linux distribution at LQ ISO.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 09-08-2007, 09:45 AM   #1
Libertes
Member
 
Registered: Jan 2007
Posts: 86

Rep: Reputation: 16
Question WGET question


My problem with wget is here

T is element of Rational Numbers
(T is assumed to increase 1 hour or 1 day does not matter it is cron job)
Every K, L, M and N are element of Natural Numbers
and each K,L,M and N increas as T increases
(or they remains the same as T increases -ie processingFile[L] only changes with new
software version or according to code changes -)

T->Time
K->Site Category or directories
L->Name of different php files that does process or work(ie index.php or showTopic.php)
M->Different queryString to identify different forms of same content
N->contentIdentifier number

a dynamic site in the form of

www.dynamicSite.com/directory[K]/processingFile[L].php?query[M]=contentIdentifier[N]

should be transformed to a static offline format

www.offlineSite.com/whateverIndex[K,L,M,N].html or

www.offlineSite.com/directory[K,L,M,N]/index.html

with every new T change i want to run wget to add ONLY NEW CONTENT
content to offlineSite

How can i make it with wget?

(Sorry to write such a way i am a simple mathematician
who is not good at using daily life words -not smug -)

Last edited by Libertes; 09-08-2007 at 11:39 AM. Reason: Cleared the title
 
Old 09-08-2007, 10:53 AM   #2
zaichik
Member
 
Registered: May 2004
Location: Iowa USA
Distribution: CentOS
Posts: 419

Rep: Reputation: 30
I don't understand how K, L, M, and N would get their values. Perhaps K and L have a base value, and as new directories are added to the online site, the range of possible values for K will increase. And for each value of K, the same can be said of L. So perhaps the first time, the run would like this:
Code:
www.dynamicSite.com/directory1/processingFile1.php?query1=contentIdentifier1
And then a test for dynamicSite.com/directory1/processingFile2.php shows that it doesn't exist. Then we see if dynamicSite.com/directory2 exists. It doesn't, so we stop. The next run is:
Code:
www.dynamicSite.com/directory1/processingFile1.php?query2=contentIdentifier2
(or not?), followed by checks for directory1/processingFile2.php and directory2. When these are found to exist, they are added to the regular runs.

Is this how these variables get their values?
 
Old 09-08-2007, 11:14 AM   #3
Libertes
Member
 
Registered: Jan 2007
Posts: 86

Original Poster
Rep: Reputation: 16
yes u are right. base values can be 0 to mention the 1st content of the site.
www.dynamicSite.com/directory1/processingFile1.php?query1=contentIdentifier1
shows one content
and while nmbers change a new content comes to browser.
as time changes these content does not change
ie
http://www.linuxquestions.org/questions/newreply.php?do=newreply&noquote=1&p=2885585

will show this thread after many years.

all of K L M N changes by time. This means with a time, new content is
added to dynamicSite. To make things clear i did not add multiplication factors to K L M N according to time.(multiplication factors have random increments
(ie; at one new time unit people can add from zero new content to infinite new content to dynamicSite but this is not important for me, i only need a code that can understand the "newly added content" and add/transform those new content to offlineSite.com)

I found famous httrack software. It converts to .html and with a gui user can convert only newly added content to offlineSite.

I want exactly what httrack does with wget.

Of course i ll add a cron job that runs at time intervals i want to obtain
every newly added content.

hope to be clear.
i tried the advanced codes also at gnu.org's wget pages and codes at linuxquestions.org. But they fail to understand new added content.

Last edited by Libertes; 09-08-2007 at 11:17 AM.
 
Old 09-09-2007, 10:08 AM   #4
Libertes
Member
 
Registered: Jan 2007
Posts: 86

Original Poster
Rep: Reputation: 16
I found some wget commands
this one seems fine but not enough
wget -m -k -K -E www.dynamicSite.com
i am not sure why it spoils character sets at my computer?
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
wget question JockVSJock Linux - Newbie 2 06-02-2005 12:47 PM
wget question Jestrik Linux - Software 4 05-01-2005 07:41 PM
Question about wget sdouble Linux - Software 2 06-16-2004 11:36 AM
wget question ziggie216 Linux - Software 1 10-12-2003 01:10 PM
wget question ziggie216 Linux - Software 0 10-12-2003 12:02 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 06:37 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration