Linux - ServerThis forum is for the discussion of Linux Software used in a server related context.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
- hdc4 was marked as fault. That happen before but I always just resycnd that one. The hdd was ok, there only seem to be a loss on communication from time to time
- removed hdc4 from the array
- added hdc4 to the array
- some dma_intr: error=0x40 occured during the sync
- hdc4 was marked as faulty
- hdg1 automatically set to spared disk <--- this was no spare disk
- 2/4 active disk in RAID 5 -> not working
I have no idea about how to recover from this error. The data actually should be somehow consistent over the 4 / at least the 3 devices except hdc4.
Any help is highly appreciated!!! (I currently have no physical access to the computer, unfortunately)
Distribution: Mac OS X Leopard 10.6.2, Windows 2003 Server/Vista/7/XP/2000/NT/98, Ubuntux64, CentOS4.8/5.4
Posts: 2,986
Rep:
Unfortunately if more than 1 drive breaks in a RAID5, you have to revert to your backups and replace the faulty drives. You might be able to get some files back, but I wouldn't be surprised if they were corrupt or incomplete. Check your hard drive and make sure there is nothing wrong with them. Are they S.M.A.R.T. enabled? You might be able to run some SMART tests remotely.
Just curious but what distro are you using? I had a hell of a bad time with Ubuntu and RAID5. Drives kept failing when I knew for a fact that my drives were good. I later dumped Ubuntu and went to CentOS and my RAID5 has been running flawlessly. I don't trust Ubuntu anymore. I also had another issue with Ubuntu, but I'll save that for later.
Distribution: Mac OS X Leopard 10.6.2, Windows 2003 Server/Vista/7/XP/2000/NT/98, Ubuntux64, CentOS4.8/5.4
Posts: 2,986
Rep:
Wait until you get back, or until someone with more knowledge of fixing RAID can help you. Try not to mess with the system and mdadm too much because you don't want to permanently ruin your RAID5. There could be hope!
Hi Micro, in principal I totally agree with you. However, it'll take some month until I get physical access to the computer and I was hoping to find the expert on RAID here.
I'm aware of the risk, but might take the dare.
What I do know:
- I do have the RAID configuration file, so I know exactly how it's build.
- I know which device is out of sync / faulty
What I'm trying to figure out:
- I don't know why the one active disk was marked as spare disk. Anyhow, marking the disk as spare probably only effects the superblock not the data itself, right?
- If I now write the new superblock with my former configuration, the spare disk should be set back to active.
- Just writing the superblock does not trigger a sync of the disks, right?
- I should be able to mark the faulty disk as faulty before starting the array, so the data out of sync won't mess with the good data.
=> everything should be ok, except that the RAID is running degraded until I can exchange the other disk.
Can anyone agree / disagree on that? Any input appreciated.
Distribution: Mac OS X Leopard 10.6.2, Windows 2003 Server/Vista/7/XP/2000/NT/98, Ubuntux64, CentOS4.8/5.4
Posts: 2,986
Rep:
Quote:
Originally Posted by Ossah
=> everything should be ok, except that the RAID is running degraded until I can exchange the other disk.
Can anyone agree / disagree on that? Any input appreciated.
Don't quote me, but I think having only 2 disks active out of a 4 disk RAID5 means that your RAID is totally broken and unusable. I wouldn't even attempt to write anything for fear of causing more damage. If you could only convert that spare to a live disk again ...
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.