LinuxQuestions.org
10-20-2012, 11:10 PM   #1
yaximik
Member
 
Registered: Nov 2010
Posts: 90

Rep: Reputation: 0
decompression error - help!


Hello,

I got a foo.fastq.gz file as output from a sequencer (a Windows 7 machine). It is 1.9 GB. After transferring it to a RHEL55 box I tried gunzip. With every option I tried, it exited with an error about halfway through and deleted the temporary file, which had reached about 5.4 GB:

Code:
[yaximik@G5NNJN1 MiSeq]$ gunzip -t SC2T252P15_S1_L001_R1_001.fastq.gz
gunzip: SC2T252P15_S1_L001_R1_001.fastq.gz: invalid compressed data--crc error

gunzip: SC2T252P15_S1_L001_R1_001.fastq.gz: invalid compressed data--length error
[yaximik@G5NNJN1 MiSeq]$ gunzip -q SC2T252P15_S1_L001_R1_001.fastq.gz

gunzip: SC2T252P15_S1_L001_R1_001.fastq.gz: invalid compressed data--crc error

gunzip: SC2T252P15_S1_L001_R1_001.fastq.gz: invalid compressed data--length error
[yaximik@G5NNJN1 MiSeq]$ gunzip -dq SC2T252P15_S1_L001_R1_001.fastq.gz

gunzip: SC2T252P15_S1_L001_R1_001.fastq.gz: invalid compressed data--crc error

gunzip: SC2T252P15_S1_L001_R1_001.fastq.gz: invalid compressed data--length error
Using the GUI (Archive Manager) I saw that both errors are caused by one particular line entry, which is not shown in stdout. Finally, I was able to salvage everything up to this line entry using

Code:
[yaximik@G5NNJN1 MiSeq]$ gunzip -c SC2T252P15_S1_L001_R1_001.fastq.gz > ./SC2T252P15_S1_L001_R1_001.fastq

gunzip: SC2T252P15_S1_L001_R1_001.fastq.gz: invalid compressed data--crc error

gunzip: SC2T252P15_S1_L001_R1_001.fastq.gz: invalid compressed data--length error
which still exited with an error, but I got the 5.2 GB foo.fastq text file.
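A salvaged stream like this can stop in the middle of a record. A minimal sketch for trimming it back to whole records, assuming the usual four-lines-per-read FASTQ layout (the function name `trim_fastq` and the file names in the usage comment are made up for illustration):

```shell
#!/bin/sh
# Keep only complete 4-line FASTQ records from a salvaged file.
# Assumption: every record is exactly 4 lines (header, sequence, '+', quality).
trim_fastq() {
    in=$1
    out=$2
    # wc -l counts newline characters, so an unterminated last line is ignored.
    total=$(wc -l < "$in")
    # Round down to the nearest multiple of 4.
    keep=$(( total / 4 * 4 ))
    head -n "$keep" "$in" > "$out"
}

# Hypothetical usage:
# trim_fastq salvaged.fastq trimmed.fastq
```

This only drops a trailing partial record; it does not check that the reads before the cut are intact.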

Here is the file content output:

Code:
[yaximik@G5NNJN1 MiSeq]$ gunzip -cl SC2T252P15_S1_L001_R1_001.fastq.gz >./content.txt
[yaximik@G5NNJN1 MiSeq]$ cat content.txt
         compressed        uncompressed  ratio uncompressed_name
         2086259746          1331149161 -56.7% SC2T252P15_S1_L001_R1_001.fastq
[yaximik@G5NNJN1 MiSeq]$
Is anything wrong with the content?
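A note on the listing above: the gzip format keeps the uncompressed size in a 32-bit trailer field, i.e. modulo 2^32 (RFC 1952), so `gunzip -l` under-reports anything of 4 GiB or more and can show a negative ratio. A rough check, assuming the counter wrapped exactly once (an assumption, not something the listing proves):

```shell
#!/bin/sh
# Size reported by `gunzip -l` in the listing (stored modulo 2^32).
reported=1331149161
# Assumption: the real size wrapped the 32-bit counter exactly once.
wrap=4294967296   # 2^32
estimated=$(( reported + wrap ))
echo "estimated uncompressed bytes: $estimated"
# Same figure in GiB, for comparison with the size of the salvaged file.
awk -v b="$estimated" 'BEGIN { printf "estimated GiB: %.2f\n", b / (1024 * 1024 * 1024) }'
```

That comes to roughly 5.24 GiB, in line with the 5.2 GB salvaged file, so the odd size and ratio in the listing are probably not a sign of damage by themselves.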

But the main question I am asking for help with: is there a utility that would let me edit the original foo.fastq.gz file, remove the offending line, and re-extract the healthy data, which is too valuable to lose?
 
10-20-2012, 11:58 PM   #2
malekmustaq
Senior Member
 
Registered: Dec 2008
Location: root
Distribution: Slackware & BSD
Posts: 1,669

Rep: Reputation: 498
Try using force:

Code:
gunzip -f SC2T252P15_S1_L001_R1_001.fastq.gz
If you want to extract files to and from an archive, use "cpio".

Hope that helps.

Good luck.
 
  

