Linux - SoftwareThis forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
I have a 12-disk RAID-6 array setup on commodity hardware. It's been running fine for a few weeks until yesterday when one of the disks failed. I suspected a faulty cable, so I replaced it. While I was doing that, I noticed that I had put the cables in the "wrong" order when installing, so I swapped them around since I wanted to know which disk was connected to which interface. I thought the RAID would use the disk's UUIDs only and not really care which port they were on. When I brought the array back up, it found 10 disks and one spare (the faulty one) with one disk out of the array. I tried adding the lone disk and let it run for a while. Next I looked at the reconstruction, it was counting up time remaining. Re-tried, same thing. Around here, my old Windows roots took over and made me reboot. I guess I thought the kernel was confused and wanted to re-read the disks or something. When it came back up it found 8 disks, two spares, no missing. I went to bed.
Today, I swapped the two disks whose cables were swapped and tried again, now it finds 10 of the disks, all spares:
I figured I'd try pushing a little harder to see what happened:
Code:
root@baloo:~# mdadm --assemble --force /dev/md0
mdadm: forcing event count in /dev/sdd1(7) from 68545 upto 68574
mdadm: Cannot open /dev/sdj1: Device or resource busy
dmsetup table was clean, so I thought maybe sdj1 needed an even harder nudge and zeroed it's superblock:
Code:
root@baloo:~# mdadm --misc --zero-superblock /dev/sdj1
root@baloo:~# mdadm --assemble --force /dev/md0
mdadm: clearing FAULTY flag for device 7 in /dev/md0 for /dev/sdd1
mdadm: SET_ARRAY_INFO failed for /dev/md0: Device or resource busy
Oh dear. (sdd would be the previously faulty disk, that may or may not be a cable error.) Right about now, I finally realize that I am trying very hard to dig myself out of a hole. So, let's see were we're at right now:
Code:
root@baloo:~# mdadm --assemble --force /dev/md0 --update=summaries --verbose
mdadm: looking for devices for /dev/md0
mdadm: no RAID superblock on /dev/sdj1
mdadm: /dev/sdj1 has wrong raid level.
mdadm: /dev/dm-3 is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/dm-2 is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/dm-1 is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/dm-0 is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sdm is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sdl is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sdk is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: no RAID superblock on /dev/sdj1
mdadm: /dev/sdj1 has wrong raid level.
mdadm: /dev/sdj is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sdi is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sdh is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sdg is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sdf is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sde1 is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sde is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sdd is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sdc is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sdb is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sda5 is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sda2 is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sda1 is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sda is not one of /dev/sdd1,/dev/sdh1,/dev/sdi1,/dev/sdj1,/dev/sdl1,/dev/sdb1,/dev/sdc1,/dev/sdf1,/dev/sdg1,/dev/sdg1,/dev/sdm1,/dev/sdk1,/dev/sdn1
mdadm: /dev/sdb1 is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sdc1 is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sdf1 is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdg1 is identified as a member of /dev/md0, slot 5.
mdadm: /dev/sdg1 is identified as a member of /dev/md0, slot 5.
mdadm: /dev/sdm1 is identified as a member of /dev/md0, slot 13.
mdadm: /dev/sdk1 is identified as a member of /dev/md0, slot 9.
mdadm: /dev/sdd1 is identified as a member of /dev/md0, slot 7.
mdadm: /dev/sdh1 is identified as a member of /dev/md0, slot 8.
mdadm: /dev/sdi1 is identified as a member of /dev/md0, slot 6.
mdadm: /dev/sdl1 is identified as a member of /dev/md0, slot 12.
Segmentation fault
What I would like to do is force an assembly of all disks, without risking a re-sync since I'm pretty sure at least 11 of the 12 disks have good data. Any ideas?
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.