LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Hardware (http://www.linuxquestions.org/questions/linux-hardware-18/)
-   -   "[sdX] Sense Key : Recovered Error..." messages every 30mn (http://www.linuxquestions.org/questions/linux-hardware-18/%5Bsdx%5D-sense-key-recovered-error-messages-every-30mn-811657/)

jf.argentino 06-02-2010 04:18 AM

"[sdX] Sense Key : Recovered Error..." messages every 30mn
 
Hello,

I'm worried about these kernel messages that occur every half an hour, following just the part for my sdc hd, but each burst contains messages relative to every hd I have in my computer:
Code:

May 31 19:09:23 localhost kernel: sd 8:0:2:0: [sdc] Sense Key : Recovered Error [current] [descriptor]
May 31 19:09:23 localhost kernel: Descriptor sense data with sense descriptors (in hex):
May 31 19:09:23 localhost kernel:        72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
May 31 19:09:23 localhost kernel:        00 00 00 00 00 00
May 31 19:09:23 localhost kernel: sd 8:0:2:0: [sdc] Add. Sense: ATA pass through information available

smartctl outputs look OK, I'm doing a "e2fsck -c -c -f -k -y" on every hd right now, i'm waiting the result in something like 12 hours...

After a couple of hour of googling, the only interesting pointer I've found is there, thanks to grepping the message in the kernel src dir... but I think it will take me a long time to understand this, so if somebody out there can tell me if I have to worry about these messages or not...

My configuration is:
Dell precision T7400 with SCSI storage controller LSI Logic / Symbios Logic SAS1068E PCI-Express FUSION-MPT SAS (rev 08)
sda: WD caviar blue sata WD5000AAKS
sdb: Hitachi Deskstar P7K500
sdc: Seagate Barracuda 7200.12
sdb and sdc are in software RAID mirroring
FEDORA 12 x86_64 up-to-date (kernel 2.6.32.12-115.fc12.x86_64)

Thank you

jf.argentino 06-03-2010 05:23 AM

If "e2fsck -c -c -f -k -y" does not explicitly report any bad block, I hope that this means that there's no badblock? Because "e2fsck" just said something like "FILE SYSTEM MODIFIED" as a conclusion (I didn't record the result).
But now smartctl on sdc give me:
Code:

ID# ATTRIBUTE_NAME          FLAG    VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate    0x000f  119  099  006    Pre-fail  Always      -      215141192
  3 Spin_Up_Time            0x0003  097  097  000    Pre-fail  Always      -      0
  4 Start_Stop_Count        0x0032  100  100  020    Old_age  Always      -      112
  5 Reallocated_Sector_Ct  0x0033  100  100  036    Pre-fail  Always      -      0
  7 Seek_Error_Rate        0x000f  068  060  030    Pre-fail  Always      -      6641092
  9 Power_On_Hours          0x0032  099  099  000    Old_age  Always      -      1125
 10 Spin_Retry_Count        0x0013  100  100  097    Pre-fail  Always      -      0
 12 Power_Cycle_Count      0x0032  100  100  020    Old_age  Always      -      56
183 Runtime_Bad_Block      0x0032  100  100  000    Old_age  Always      -      0
184 End-to-End_Error        0x0032  100  100  099    Old_age  Always      -      0
187 Reported_Uncorrect      0x0032  100  100  000    Old_age  Always      -      0
188 Command_Timeout        0x0032  100  100  000    Old_age  Always      -      0
189 High_Fly_Writes        0x003a  100  100  000    Old_age  Always      -      0
190 Airflow_Temperature_Cel 0x0022  062  058  045    Old_age  Always      -      38 (Lifetime Min/Max 36/42)
194 Temperature_Celsius    0x0022  038  042  000    Old_age  Always      -      38 (0 17 0 0)
195 Hardware_ECC_Recovered  0x001a  044  022  000    Old_age  Always      -      215141192
197 Current_Pending_Sector  0x0012  100  100  000    Old_age  Always      -      0
198 Offline_Uncorrectable  0x0010  100  100  000    Old_age  Offline      -      0
199 UDMA_CRC_Error_Count    0x003e  200  200  000    Old_age  Always      -      0
240 Head_Flying_Hours      0x0000  100  253  000    Old_age  Offline      -      31134217929983
241 Total_LBAs_Written      0x0000  100  253  000    Old_age  Offline      -      2310341095
242 Total_LBAs_Read        0x0000  100  253  000    Old_age  Offline      -      2386421352

...
Code:

Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error      00%      1105        -
# 4  Short offline      Completed without error      00%        0        -

smartctl outputs on others hd look fine.

More over, I can see kernel these output:
Code:

ata_id[2892]: HDIO_GET_IDENTITY failed for '/dev/sdc'
ata_id[2891]: HDIO_GET_IDENTITY failed for '/dev/sda'
ata_id[2894]: HDIO_GET_IDENTITY failed for '/dev/sdb'

So, do I have to change sdc ?


All times are GMT -5. The time now is 12:08 AM.