LinuxQuestions.org
Download your favorite Linux distribution at LQ ISO.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Slackware
User Name
Password
Slackware This Forum is for the discussion of Slackware Linux.

Notices


Reply
  Search this Thread
Old 04-08-2005, 04:25 AM   #1
acidjuice
Member
 
Registered: Jan 2005
Location: a tiny spot on the iceberg
Distribution: Slackware 10.1 (dropline 2.10, kernel 2.6.11.6)
Posts: 320

Rep: Reputation: 30
software interest level poll and feedback


what would you think of this: have you ever experienced a 650 MB corrupted download (with MD5 sums not matching). what if someone told you that you could download something like 5 MB and fix your download.

would you like such an utility? if you mantain a repository, would you consider adding 5 MB for every 650 MB file out there?

feedbacks welcomed.

aj.
 
Old 04-08-2005, 04:46 AM   #2
gbonvehi
Senior Member
 
Registered: Jun 2004
Location: Argentina (SR, LP)
Distribution: Slackware
Posts: 3,145

Rep: Reputation: 53
That's where P2P is good for, every "piece" of the "thing" you download is checked against corruption

Just to add another bad experience: Downloaded Red-Hat mmm 6? I don't remember, it was on year 1999 using dial-up... as you may guess my download was corrupted! And here where I live, you've to pay for local calls... it was like a month 6 hours per day.. not good :/

As you may guess, it would be nice to have this kind of utility

Unfortunatly there's no magic file since it's imposible to know which part is corrupted unless you check it enterily against the original. Maybe you could download "partial" checksums which could be compared against parts of the file, and if one fails, download that part (like p2p networks.. )

Last edited by gbonvehi; 04-08-2005 at 04:57 AM.
 
Old 04-08-2005, 04:51 AM   #3
acidjuice
Member
 
Registered: Jan 2005
Location: a tiny spot on the iceberg
Distribution: Slackware 10.1 (dropline 2.10, kernel 2.6.11.6)
Posts: 320

Original Poster
Rep: Reputation: 30
Quote:
Originally posted by gbonvehi
Unfortunatly there's no magic file since it's imposible to know which part is corrupted unless you check it enterily against the original.
actually there is. it's a very interesting algorithm which is being used currently today especially on sound and data transmission.

i would know how to do this: i'm just asking if you think if this could be interesting
 
Old 04-08-2005, 07:04 AM   #4
Nobber
Member
 
Registered: Jun 2002
Location: Nova Scotia
Distribution: Debian (home), Kubuntu 7.04 (work)
Posts: 265

Rep: Reputation: 30
If what you say is true, then all you'd need to reconstruct a "broken" 650MB download is a 5MB file. What if I just pretended to download the 650MB, and then used your magic 5MB file to turn a completely empty file into the required 650MB one? All currently existing file compression algorithms would be redundant!

But this isn't true, and cannot be true.

So there's a catch.

What's the catch?
 
Old 04-08-2005, 07:06 AM   #5
priller
Member
 
Registered: Jan 2004
Distribution: Debian etch
Posts: 202

Rep: Reputation: 30
You can build a correct file if you have more that 1 corupt download but I cant see how a 5 mb file will know which part is corrupted.
 
Old 04-08-2005, 07:44 AM   #6
acidjuice
Member
 
Registered: Jan 2005
Location: a tiny spot on the iceberg
Distribution: Slackware 10.1 (dropline 2.10, kernel 2.6.11.6)
Posts: 320

Original Poster
Rep: Reputation: 30
i invented nothing new, i'm talking about the Reed-Solomon algorithm.

basically with this algorithm (used to create the famous PAR files) you can rebuid parts of the missing data within specific limits.

you cannot of course rebuilt 650 MB from 5MB. but, if you miss, for instance, 2 MB of the 650 MB file (and in bad MD5 sums is often much less than that, just a bunch of bytes), it is enough to have a 2MB PAR file to fill that empty.

and i confirm, this is true whatever 2 MB you are missing in the 650 MB file.

well, still noone answers my question would this be of interest? this IS possible. i just need to know if it does interest anyone.

Last edited by acidjuice; 04-08-2005 at 07:58 AM.
 
Old 04-08-2005, 08:07 AM   #7
amwink
Member
 
Registered: Mar 2005
Location: Connecticut/USA
Distribution: Ubuntu 9.04
Posts: 68

Rep: Reputation: 15
So, I'll be the first to answer. As an end-user (currently my computer is not a repository for such big files), I'm interested.

However the last time I had a corrupted file was when downloading an old version of Mandrake via dial-up, some years ago. As the broadband gets more and more popular around the world, I think files up to 600-700MB are no longer a problem. But if you were able (if not yet) to implement this algorithm for files as big as DVD images, then I think your software will be a success.
 
Old 04-08-2005, 08:21 AM   #8
acidjuice
Member
 
Registered: Jan 2005
Location: a tiny spot on the iceberg
Distribution: Slackware 10.1 (dropline 2.10, kernel 2.6.11.6)
Posts: 320

Original Poster
Rep: Reputation: 30
thanks amwink,

the idea is to have ano size limit, or set it up quite high.

it is no easy task, i'm currently developing it. if someone is interested in contributing just email me, however this is not for the immediate tomorrow.
 
Old 04-08-2005, 09:16 AM   #9
Nad0xFF
Member
 
Registered: Apr 2005
Location: Russia, Saint-Petersburg
Distribution: Slackware 10
Posts: 109

Rep: Reputation: 15
I'm now writing such prog as a part of one large program. I think, there must be pattern recognition, addtionally to partial checksums. (may be there'll appera one byte and shift all file - so all checksums'll be inequal. But pattern recognition will recognize this)
 
Old 04-08-2005, 09:16 AM   #10
Nobber
Member
 
Registered: Jun 2002
Location: Nova Scotia
Distribution: Debian (home), Kubuntu 7.04 (work)
Posts: 265

Rep: Reputation: 30
OK, I think I understand now.

So how much error-correcting code would need to be added to a 650MB download in order to make it correctible by RS?
 
Old 04-08-2005, 09:24 AM   #11
Nad0xFF
Member
 
Registered: Apr 2005
Location: Russia, Saint-Petersburg
Distribution: Slackware 10
Posts: 109

Rep: Reputation: 15
There is large theory about that codes. AFAIK, RS code is used for protect bytes. (I don't know good)
 
Old 04-08-2005, 09:28 AM   #12
acidjuice
Member
 
Registered: Jan 2005
Location: a tiny spot on the iceberg
Distribution: Slackware 10.1 (dropline 2.10, kernel 2.6.11.6)
Posts: 320

Original Poster
Rep: Reputation: 30
it's up to human judgement.

i believe that if you perform a full download (which is then found corrupted) you might be missing only some bytes. in this case, a 2 MB file, for instance, is largely enough to cover all errors that the corrupted file has.

Nad0xFF: exactly what are you coding? i want to avoid reinventing the well and duplicating efforts.
 
Old 04-08-2005, 09:44 AM   #13
Nad0xFF
Member
 
Registered: Apr 2005
Location: Russia, Saint-Petersburg
Distribution: Slackware 10
Posts: 109

Rep: Reputation: 15
That is part of large solution for my work - find different files very fast and make them "equal"

Acidjuice: You should learn a little about error correcting codes in general, there is very good articles which explain basics very good
 
Old 04-08-2005, 09:53 AM   #14
acidjuice
Member
 
Registered: Jan 2005
Location: a tiny spot on the iceberg
Distribution: Slackware 10.1 (dropline 2.10, kernel 2.6.11.6)
Posts: 320

Original Poster
Rep: Reputation: 30
Quote:
Originally posted by Nad0xFF
You should learn a little about error correcting codes in general, there is very good articles which explain basics very good
any links welcomed.
 
Old 04-08-2005, 09:57 AM   #15
Nad0xFF
Member
 
Registered: Apr 2005
Location: Russia, Saint-Petersburg
Distribution: Slackware 10
Posts: 109

Rep: Reputation: 15
Sorry, I saw it only on Russian, but now found IMHO good articles:
http://mathworld.wolfram.com/Error-CorrectingCode.html
http://guest.engelschall.com/~sb/hamming/
http://www.mdstud.chalmers.se/~md7sh...in/node21.html


----
http://www.eccpage.com/
----



http://en.wikipedia.org/wiki/Error-correcting_code
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Out of interest... gwejones Linux - General 1 09-13-2005 04:45 AM
8139 card doesn't grok 100basetx at software level? nothingmuch Linux - Networking 1 01-22-2004 06:25 AM
How to map Red Hat RPM patch level => software version? sancho5 Linux - Software 2 09-20-2003 03:55 PM
Software-Suggestion: Feedback wanted! seneca Linux - General 10 09-02-2003 05:23 PM
POLL: The worst Linux GNU / GPL Software moeminhtun General 57 02-28-2003 09:15 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Slackware

All times are GMT -5. The time now is 01:48 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration