RAID 1 / partition failed!
Hello friends,
I'm currently running CentOS 5.9 with software RAID 1. I have two 1 TB drives with four RAID partitions. Last week the / partition of the RAID array failed; the other RAID partitions still show as up in mdstat.

# cat /proc/mdstat

    Personalities : [raid1]
    md0 : active raid1 sdb1[1] sda1[0]
          305088 blocks [2/2] [UU]

    md3 : active raid1 sda3[0] sdb3[1]
          209712384 blocks [2/2] [UU]

    md2 : active raid1 sdb5[1] sda5[0]
          10482304 blocks [2/2] [UU]

    md1 : active raid1 sdb2[1] sda2[2](F)
          734001728 blocks [2/1] [_U]

mdadm returns:

    [root@server1 ~]# mdadm --detail /dev/md1
    /dev/md1:
            Version : 0.90
      Creation Time : Mon Jan 1 07:00:31 2001
         Raid Level : raid1
         Array Size : 734001728 (700.00 GiB 751.62 GB)
      Used Dev Size : 734001728 (700.00 GiB 751.62 GB)
       Raid Devices : 2
      Total Devices : 2
    Preferred Minor : 1
        Persistence : Superblock is persistent

        Update Time : Sat Aug 31 16:42:02 2013
              State : active, degraded
     Active Devices : 1
    Working Devices : 1
     Failed Devices : 1
      Spare Devices : 0

               UUID : 5d09a10b:aa7d25c1:af274a2b:6a0673f5
             Events : 0.1825873

        Number   Major   Minor   RaidDevice State
           0       0        0        0      removed
           1       8       18        1      active sync   /dev/sdb2

           2       8        2        -      faulty spare   /dev/sda2

The following are the system logs for md1:

    Aug 25 04:22:01 server1 kernel: md: delaying resync of md1 until md0 has finished resync (they share one or more physical units)
    Aug 25 04:22:05 server1 kernel: md: delaying resync of md1 until md3 has finished resync (they share one or more physical units)
    Aug 25 04:26:00 server1 kernel: md: delaying resync of md1 until md3 has finished resync (they share one or more physical units)
    Aug 25 05:34:06 server1 kernel: md: syncing RAID array md1
    Aug 25 05:34:06 server1 kernel: md: md1: sync done.
    Aug 18 04:22:01 server1 kernel: md: delaying resync of md1 until md0 has finished resync (they share one or more physical units)
    Aug 18 04:22:04 server1 kernel: md: delaying resync of md1 until md3 has finished resync (they share one or more physical units)
    Aug 18 04:25:33 server1 kernel: md: delaying resync of md1 until md3 has finished resync (they share one or more physical units)
    Aug 18 05:26:29 server1 kernel: md: syncing RAID array md1
    Aug 18 06:16:41 server1 kernel: INFO: task md1_resync:32315 blocked for more than 120 seconds.
    Aug 18 06:16:41 server1 kernel: md1_resync D ffff81000103f1a0 0 32315 111 2579 (L-TLB)
    Aug 18 06:16:42 server1 kernel: md: md1: sync done.
    Aug 11 04:22:01 server1 kernel: md: delaying resync of md1 until md0 has finished resync (they share one or more physical units)
    Aug 11 04:22:05 server1 kernel: md: delaying resync of md1 until md3 has finished resync (they share one or more physical units)
    Aug 11 04:26:11 server1 kernel: md: delaying resync of md1 until md3 has finished resync (they share one or more physical units)
    Aug 11 05:35:35 server1 kernel: md: syncing RAID array md1
    Aug 11 09:23:16 server1 kernel: md: md1: sync done.
    Aug 4 04:22:02 server1 kernel: md: delaying resync of md1 until md0 has finished resync (they share one or more physical units)
    Aug 4 04:22:06 server1 kernel: md: delaying resync of md1 until md3 has finished resync (they share one or more physical units)
    Aug 4 04:26:04 server1 kernel: md: delaying resync of md1 until md3 has finished resync (they share one or more physical units)
    Aug 4 05:39:09 server1 kernel: md: syncing RAID array md1
    Aug 4 09:40:37 server1 kernel: md: md1: sync done.
    Jul 28 04:22:01 server1 kernel: md: delaying resync of md1 until md0 has finished resync (they share one or more physical units)
    Jul 28 04:22:04 server1 kernel: md: delaying resync of md1 until md3 has finished resync (they share one or more physical units)
    Jul 28 04:26:10 server1 kernel: md: delaying resync of md1 until md3 has finished resync (they share one or more physical units)
    Jul 28 05:25:59 server1 kernel: md: syncing RAID array md1
    Jul 28 08:57:35 server1 kernel: md: md1: sync done.

The log above does not show that the sda drive has failed. smartctl shows no issues with either drive, so I'm assuming the drive is good and just out of sync. How do I fix this?

Thanks in advance.
It seems to be sda2 that is faulty, judging from the output above:

    md1 : active raid1 sdb2[1] sda2[2](F)

If you are completely sure that the disk is OK, you could try removing the device from the array and adding it again.
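
In case it helps, here is a minimal sketch of that remove-and-add step. The array and partition names are taken from the mdstat output in this thread. The `RUN` dry-run switch is my own hypothetical addition, so by default the script only prints the commands instead of touching the array.

```shell
#!/bin/sh
# Sketch: remove a faulty member from an md array and add it back.
# ARRAY and MEMBER default to the names shown in this thread's mdstat output.
# RUN is a hypothetical dry-run switch (not an mdadm feature): while it is
# "echo", every command is printed rather than executed.
ARRAY=${ARRAY-/dev/md1}
MEMBER=${MEMBER-/dev/sda2}
RUN=${RUN-echo}

readd_member() {
    # sda2 is already marked (F), so no separate --fail step is needed.
    $RUN mdadm --manage "$ARRAY" --remove "$MEMBER"
    # Adding the partition back should start a resync from the good mirror.
    $RUN mdadm --manage "$ARRAY" --add "$MEMBER"
    # Check the resync progress (repeat as needed).
    $RUN cat /proc/mdstat
}

readd_member
```

To execute the commands for real, run the script as root with an empty `RUN`, for example `RUN= sh readd.sh` (the script name is only a placeholder). Without a write-intent bitmap the array will most likely resync the whole 700 GiB partition, so expect it to take a while.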
Use smartctl to check the hardware. If the hardware looks OK, re-add the faulty partition; otherwise ... you can guess what comes next.
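
A rough sketch of what such a smartctl check could look like, assuming the suspect disk is /dev/sda as in the output earlier in the thread. The `RUN` dry-run switch is a hypothetical helper of mine, so the script only prints the commands unless you clear it.

```shell
#!/bin/sh
# Sketch: basic SMART checks on the disk behind the faulty partition.
# DISK defaults to /dev/sda, the drive holding sda2 in this thread.
# RUN is a hypothetical dry-run switch (not a smartctl feature): while it is
# "echo", every command is printed rather than executed.
DISK=${DISK-/dev/sda}
RUN=${RUN-echo}

check_disk() {
    # Overall health self-assessment (PASSED / FAILED).
    $RUN smartctl -H "$DISK"
    # Vendor attributes, e.g. reallocated and pending sector counts.
    $RUN smartctl -A "$DISK"
    # Start an extended self-test; it runs in the background on the drive.
    $RUN smartctl -t long "$DISK"
    # Self-test log: only meaningful once the long test has finished,
    # so in practice run this line again later.
    $RUN smartctl -l selftest "$DISK"
}

check_disk
```

Run it as root with `RUN= sh smartcheck.sh` to execute the checks (again, the script name is just a placeholder). A passing `-H` result alone is not proof that the disk is healthy, which is why the attribute table and the long self-test are included.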