LinuxQuestions.org
Old 07-26-2007, 04:07 AM   #1
hazmatt20
Member
 
Registered: Jan 2006
Distribution: FC5, Ubuntu
Posts: 126

Rep: Reputation: 15
System Locks During RAID5 Resync


For the purpose of full disclosure, I have 9 drives in 2 RAID5 arrays (3 in /dev/md0 and 6 in /dev/md1) that are combined with LVM. md1 works fine, but I'm having trouble with md0. When I tried to write to a folder on the array, I got an error saying it was mounted read-only, and when I checked the array, only 1 of the 3 drives was up. When I rebooted, 2 of the 3 came up. I re-added the missing drive and kept an eye on the resync progress. After maybe 10%, the system locked up. Each time I rebooted, the array would come up degraded with 2 drives. After adding the third, it would lock up, usually with no error, although one time I started getting buffer I/O errors on one drive, but it didn't happen again. After freaking out a bit, I realized that if I don't add the third drive, everything seems fine, albeit with degraded status.

So, I'm backing up some stuff now, but what should I do about this? Before suggesting anything drastic, note that the entire array is 2.7 TB with 1.6 TB of that used.
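
For anyone following along, the degraded state described above can be read out of /proc/mdstat. The snippet below is only a sketch: the function name degraded_arrays is made up for illustration, and it assumes the usual mdstat layout in which each array's status line carries an [expected/active] pair such as [3/2].

```shell
#!/bin/sh
# Hypothetical helper (not part of mdadm): list md arrays that are running
# with fewer active members than they were defined with.
# Usage: degraded_arrays [FILE]    (FILE defaults to /proc/mdstat)
degraded_arrays() {
    awk '
        /^md[0-9]+ :/ { name = $1 }                 # remember the current array
        match($0, /\[[0-9]+\/[0-9]+\]/) {           # status pair, e.g. [3/2]
            split(substr($0, RSTART + 1, RLENGTH - 2), n, "/")
            if (n[2] + 0 < n[1] + 0)                # active < expected
                print name, n[2] "/" n[1]
        }
    ' "${1:-/proc/mdstat}"
}
```

Called with no argument it reads the live /proc/mdstat and prints one line per degraded array (array name, then active/expected members). Running mdadm --detail /dev/md0 gives the same information in more detail, including which member is missing.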
 
Old 07-26-2007, 11:50 PM   #2
macemoneta
Senior Member
 
Registered: Jan 2005
Location: Manalapan, NJ
Distribution: Fedora x86 and x86_64, Debian PPC and ARM, Android
Posts: 4,593
Blog Entries: 2

Rep: Reputation: 344
Have you tried formatting the drive (with bad block remap) before re-adding it? If this is a production system, just replace the drive.
 
Old 07-28-2007, 12:47 AM   #3
hazmatt20
Member
 
Registered: Jan 2006
Distribution: FC5, Ubuntu
Posts: 126

Original Poster
Rep: Reputation: 15
Well, I don't want to format it just yet, but I did run badblocks on the drive (with the non-destructive read-write test), and no bad blocks were found. The next step will be to scan the drive with the utility from Seagate. I think I'll check all three while I'm at it.
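
For reference, the non-destructive read-write mode mentioned above is selected with badblocks -n. The wrapper below is a sketch only: scan_cmd is a hypothetical name, /dev/sdX is a placeholder, and the function just prints the command so that nothing touches a disk until the line is copied out and run as root on a device that is not mounted.

```shell
#!/bin/sh
# Hypothetical helper: print (not run) a non-destructive badblocks scan.
#   -n  non-destructive read-write test (existing data is preserved)
#   -s  show progress
#   -v  verbose, report each bad block found
scan_cmd() {
    dev=${1:?usage: scan_cmd /dev/sdX}
    printf 'badblocks -n -s -v %s\n' "$dev"
}
```

For example, scan_cmd /dev/sdX prints the command line to review before running it by hand.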
 
Old 07-28-2007, 01:05 AM   #4
macemoneta
Senior Member
 
Registered: Jan 2005
Location: Manalapan, NJ
Distribution: Fedora x86 and x86_64, Debian PPC and ARM, Android
Posts: 4,593
Blog Entries: 2

Rep: Reputation: 344
Quote:
Originally Posted by hazmatt20
Well, I don't want to format it just yet, but I did run badblocks on the drive (with the non-destructive read-write test), and no bad blocks were found. The next step will be to scan the drive with the utility from Seagate. I think I'll check all three while I'm at it.
You realize the drive gets wiped (all blocks re-written) every time you re-add it to the array?
 
Old 07-28-2007, 02:52 AM   #5
hazmatt20
Member
 
Registered: Jan 2006
Distribution: FC5, Ubuntu
Posts: 126

Original Poster
Rep: Reputation: 15
Huh. I guess I must have been thinking of assemble. I haven't slept much the last few days. Well, since the drive has been wiped anyway, how do you format it like you said? I couldn't find the flag, but like I said, I'm kinda tired.
 
Old 07-28-2007, 11:31 AM   #6
macemoneta
Senior Member
 
Registered: Jan 2005
Location: Manalapan, NJ
Distribution: Fedora x86 and x86_64, Debian PPC and ARM, Android
Posts: 4,593
Blog Entries: 2

Rep: Reputation: 344
It depends on the filesystem, but generally the double "-c" option triggers the read/write test and re-allocation of bad blocks. For example, to create an Ext3 filesystem:

Code:
# -c -c: read-write bad-block scan; -j: add an ext3 journal; -m 0: reserve no blocks for root
mke2fs -c -c -j -m 0 /dev/xyz1
 
Old 07-28-2007, 12:38 PM   #7
Matir
LQ Guru
 
Registered: Nov 2004
Location: San Jose, CA
Distribution: Debian, Arch
Posts: 8,507

Rep: Reputation: 128
How do you have 2.7TB in RAID 5 on 3 drives? Just curious. Seems that would take 1.35TB individual drives.

If you have 2 good drives, you should be able to wipe the other (as mentioned above) and rebuild from those, assuming they're still good. I'd run smartmontools against them to check their hardware SMART status.
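
The 1.35 TB estimate above follows from how RAID5 spends one member's worth of space on parity, so usable capacity is (members - 1) x member size. A small sketch of that arithmetic, with illustrative function names and sizes in whole GB:

```shell
#!/bin/sh
# RAID5 keeps one member's worth of parity, so usable = (members - 1) * size.
# Both function names are illustrative only.
raid5_usable_gb() {            # args: member count, member size in GB
    echo $(( ($1 - 1) * $2 ))
}
raid5_member_gb() {            # args: member count, wanted usable size in GB
    echo $(( $2 / ($1 - 1) ))
}

raid5_member_gb 3 2700         # three drives, 2.7 TB usable: prints 1350
raid5_usable_gb 6 400          # six 400 GB drives: prints 2000
```

The second call uses the six 400 GB drives mentioned elsewhere in the thread for the other array, which alone would account for about 2 TB of usable space.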
 
Old 07-28-2007, 12:44 PM   #8
hazmatt20
Member
 
Registered: Jan 2006
Distribution: FC5, Ubuntu
Posts: 126

Original Poster
Rep: Reputation: 15
Quote:
Originally Posted by Matir
How do you have 2.7TB in RAID 5 on 3 drives? Just curious. Seems that would take 1.35TB individual drives.

If you have 2 good drives, you should be able to wipe the other (as mentioned above) and rebuild from those, assuming they're still good. I'd run smartmontools against them to check their hardware SMART status.
You missed the first part about 2 arrays over LVM. The other array has six 400 GB drives. I also think that the last time I tried smartmontools, the drives were reported as not SMART capable.

The only thing I don't understand is why that format would solve anything, since the file system is going to be destroyed anyway when the array is rebuilt. I don't think you can even run mkfs on a drive set up as Linux raid autodetect.
 
Old 07-29-2007, 12:21 PM   #9
Matir
LQ Guru
 
Registered: Nov 2004
Location: San Jose, CA
Distribution: Debian, Arch
Posts: 8,507

Rep: Reputation: 128
If you have drives that are >100GB, they should ALL be SMART capable. All ATA-6 drives are required to be SMART capable by the spec...
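
One way to check what a drive actually reports is smartctl -i from smartmontools, which prints "SMART support is:" lines. The helper below is a sketch under that assumption: smart_state is a made-up name, and it only classifies output piped into it.

```shell
#!/bin/sh
# Hypothetical helper: read `smartctl -i` output on stdin and report whether
# the drive advertises SMART. Assumes the usual "SMART support is:" lines.
smart_state() {
    awk -F': *' '
        /^SMART support is:/ {
            if ($2 ~ /^Available/) avail = 1   # drive has the feature
            if ($2 ~ /^Enabled/)   on = 1      # feature is switched on
        }
        END {
            if (!avail)  print "unsupported"
            else if (on) print "enabled"
            else         print "disabled"
        }
    '
}
```

Typical use would be smartctl -i /dev/sdX | smart_state (as root, with /dev/sdX as a placeholder). If the answer is "disabled", smartctl -s on /dev/sdX turns SMART on, and some BIOSes also have a setting for it.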
 
Old 08-01-2007, 04:26 PM   #10
hazmatt20
Member
 
Registered: Jan 2006
Distribution: FC5, Ubuntu
Posts: 126

Original Poster
Rep: Reputation: 15
Well, when I tried to run the long SMART test on the drives, one of the (working) drives dropped out of the array and started giving me buffer I/O errors. But when I booted from a Seagate tools CD (all it had was SMART), I ran a long test on all three drives with no problem. I'm running fsck right now, but it'll take a damn long time. I'm thinking about trying to reduce the size of the LVM volume group, but I'm not sure how to remove /dev/md0 from it, let alone how to make sure the data is all on /dev/md1 first so it isn't lost.
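
On the LVM question, the usual tools are pvmove (migrate the allocated extents off a physical volume) and vgreduce (drop the emptied physical volume from the volume group). The sketch below only prints the commands for review. evacuate_plan is a hypothetical name and vg0 is a placeholder for whatever vgs reports as the real volume group.

```shell
#!/bin/sh
# Hypothetical helper: print (not run) the usual LVM steps for emptying a
# physical volume and dropping it from its volume group.
# "vg0" in the usage note is a placeholder; the real name comes from `vgs`.
evacuate_plan() {
    vg=${1:?usage: evacuate_plan VG PV}
    pv=${2:?usage: evacuate_plan VG PV}
    cat <<EOF
pvs -o pv_name,vg_name,pv_size,pv_free
pvmove $pv
vgreduce $vg $pv
EOF
}
```

For example, evacuate_plan vg0 /dev/md0 prints the three steps in order. pvmove can only succeed if the remaining physical volumes have enough free extents, so the pvs output needs checking first. If the logical volume currently spans both arrays, the filesystem and the logical volume would have to be shrunk before the move, which is outside this sketch.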
 
Old 08-01-2007, 07:53 PM   #11
hazmatt20
Member
 
Registered: Jan 2006
Distribution: FC5, Ubuntu
Posts: 126

Original Poster
Rep: Reputation: 15
Ok, so after running fsck, I was able to add the third drive to the array. Everything was back to normal, but I wanted to make sure it would be fine after rebooting. Unfortunately, after I rebooted, it came up degraded with the third drive listed as removed. I'm adding it to the array again, but what do I need to do to get it to start correctly?
 
  


All times are GMT -5.
