Hello,
Setup:
- HP ProBook 4530s, Intel i5-2430M, 8 GB RAM, latest BIOS update from HP
- OCZ Petrol 128GB, firmware 3.15 (latest)
- Slackware64 13.37 / Kernel 3.2.13 SMP updated from slackware-current
- SATA controller:
Code:
bash-4.2# lspci -vvnn | grep -i sata
00:1f.2 SATA controller [0106]: Intel Corporation 6 Series Chipset Family 6 port SATA AHCI Controller [8086:1c03] (rev 04) (prog-if 01 [AHCI 1.0])
Capabilities: [a8] SATA HBA v1.0 BAR4 Offset=00000004
- cheap 2nd HDD caddy bought on eBay; it replaces the optical drive and gives you a 2nd HDD bay
- fstab entry (I know it's not optimized or anything, but I need it to work first):
Code:
/dev/sdb1 /mnt/tmp ext4 defaults 0 1
Problem:
The SSD fails at seemingly random moments, but always within minutes of being put to use.
dmesg says:
Code:
bash-4.2# dmesg | grep sdb
[ 5.802539] sd 1:0:0:0: [sdb] 250069680 512-byte logical blocks: (128 GB/119 GiB)
[ 5.811790] sd 1:0:0:0: [sdb] Write Protect is off
[ 5.820821] sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
[ 5.820835] sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 5.830309] sdb: sdb1 sdb2
[ 5.839558] sd 1:0:0:0: [sdb] Attached SCSI disk
[ 21.649850] EXT4-fs (sdb1): mounted filesystem with ordered data mode. Opts: (null)
[ 98.908073] EXT4-fs (sdb1): re-mounted. Opts: commit=0
[ 514.269809] EXT4-fs (sdb1): mounted filesystem with ordered data mode. Opts: (null)
[ 788.011675] sd 1:0:0:0: [sdb] Unhandled error code
[ 788.011679] sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
[ 788.011687] sd 1:0:0:0: [sdb] CDB: cdb[0]=0x28: 28 00 02 54 29 80 00 00 20 00
[ 788.011705] end_request: I/O error, dev sdb, sector 39070080
[ 788.011715] Buffer I/O error on device sdb2, logical block 0
[ 788.011735] Buffer I/O error on device sdb2, logical block 1
[ 788.011740] Buffer I/O error on device sdb2, logical block 2
[ 788.011745] Buffer I/O error on device sdb2, logical block 3
[ 1707.381442] sd 1:0:0:0: [sdb] Unhandled error code
[ 1707.381452] sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
[ 1707.381460] sd 1:0:0:0: [sdb] CDB: cdb[0]=0x2a: 2a 00 01 04 06 5f 00 00 30 00
[ 1707.381479] end_request: I/O error, dev sdb, sector 17040991
[ 1707.381566] Aborting journal on device sdb1-8.
[ 1707.381626] sd 1:0:0:0: [sdb] Unhandled error code
[ 1707.381631] sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
[ 1707.381639] sd 1:0:0:0: [sdb] CDB: cdb[0]=0x2a: 2a 00 01 04 00 3f 00 00 08 00
[ 1707.381656] end_request: I/O error, dev sdb, sector 17039423
[ 1707.381664] Buffer I/O error on device sdb1, logical block 2129920
[ 1707.381669] lost page write due to I/O error on sdb1
[ 1707.381686] JBD2: I/O error detected when updating journal superblock for sdb1-8.
[ 1713.502972] sd 1:0:0:0: [sdb] Unhandled error code
[ 1713.502981] sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
[ 1713.502989] sd 1:0:0:0: [sdb] CDB: cdb[0]=0x2a: 2a 00 00 00 00 3f 00 00 08 00
[ 1713.503007] end_request: I/O error, dev sdb, sector 63
[ 1713.503015] Buffer I/O error on device sdb1, logical block 0
[ 1713.503019] lost page write due to I/O error on sdb1
[ 1713.503036] EXT4-fs error (device sdb1): ext4_journal_start_sb:327: Detected aborted journal
[ 1713.503044] EXT4-fs (sdb1): Remounting filesystem read-only
[ 1713.503065] EXT4-fs (sdb1): previous I/O error to superblock detected
[ 1713.503093] sd 1:0:0:0: [sdb] Unhandled error code
[ 1713.503099] sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
[ 1713.503108] sd 1:0:0:0: [sdb] CDB: cdb[0]=0x2a: 2a 00 00 00 00 3f 00 00 08 00
[ 1713.503135] end_request: I/O error, dev sdb, sector 63
[ 1713.503142] Buffer I/O error on device sdb1, logical block 0
[ 1713.503148] lost page write due to I/O error on sdb1
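The hex strings in the dmesg output above can be decoded by hand, which helps to see what was being read or written when each error hit. Below is a minimal sketch in POSIX shell. The function names (decode_cdb10, host_byte_name, buffer_block_to_sector) are made up for illustration. The host byte names are the ones from the kernel's SCSI headers as far as I know. The partition start (sector 63) and the 4096-byte block size are assumptions inferred from the log itself, where sector 63 is reported as logical block 0 of sdb1.

```shell
# Decode a 10-byte READ(10)/WRITE(10) CDB as printed by the kernel.
# Layout: byte 0 = opcode, bytes 2-5 = LBA (big-endian), bytes 7-8 = block count.
# The body runs in a subshell "( ... )" so it does not clobber the caller's variables.
decode_cdb10() (
    set -- $1                      # split the hex bytes into $1..$10
    case "$1" in
        28)    op="READ(10)" ;;
        2a|2A) op="WRITE(10)" ;;
        *)     op="opcode 0x$1" ;;
    esac
    echo "$op lba=$(( 0x$3$4$5$6 )) blocks=$(( 0x$8$9 ))"
)

# Name a few SCSI host byte values (short, deliberately incomplete table).
host_byte_name() {
    case "$1" in
        0x00) echo DID_OK ;;
        0x01) echo DID_NO_CONNECT ;;
        0x03) echo DID_TIME_OUT ;;
        0x04) echo DID_BAD_TARGET ;;
        0x07) echo DID_ERROR ;;
        *)    echo "not in this table ($1)" ;;
    esac
}

# Map a "logical block" from a Buffer I/O error line to an absolute disk sector.
# $1 = partition start sector (assumed), $2 = logical block, $3 = block size in bytes (assumed).
buffer_block_to_sector() {
    echo $(( $1 + $2 * ($3 / 512) ))
}

decode_cdb10 "28 00 02 54 29 80 00 00 20 00"   # READ(10) lba=39070080 blocks=32
decode_cdb10 "2a 00 01 04 00 3f 00 00 08 00"   # WRITE(10) lba=17039423 blocks=8
host_byte_name 0x04                            # DID_BAD_TARGET
buffer_block_to_sector 63 2129920 4096         # 17039423
```

The decoded LBAs match the "end_request: I/O error ... sector" lines, so the failing commands are ordinary reads and writes spread across both partitions rather than one bad spot on the disk.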
/var/log/messages says:
Code:
Jun 6 08:50:12 darkstar kernel: [ 514.269809] EXT4-fs (sdb1): mounted filesystem with ordered data mode. Opts: (null)
Jun 6 08:53:46 darkstar kernel: [ 727.923611] ata2: hard resetting link
Jun 6 08:53:56 darkstar kernel: [ 737.954145] ata2: hard resetting link
Jun 6 08:54:06 darkstar kernel: [ 747.985221] ata2: hard resetting link
Jun 6 08:54:41 darkstar kernel: [ 782.971167] ata2: hard resetting link
Jun 6 08:54:46 darkstar kernel: [ 788.011634] ata2: EH complete
Jun 6 08:54:46 darkstar kernel: [ 788.011675] sd 1:0:0:0: [sdb] Unhandled error code
Jun 6 08:54:46 darkstar kernel: [ 788.011679] sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
Jun 6 08:54:46 darkstar kernel: [ 788.011687] sd 1:0:0:0: [sdb] CDB: cdb[0]=0x28: 28 00 02 54 29 80 00 00 20 00
Jun 6 09:10:07 darkstar kernel: [ 1707.381442] sd 1:0:0:0: [sdb] Unhandled error code
Jun 6 09:10:07 darkstar kernel: [ 1707.381452] sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
Jun 6 09:10:07 darkstar kernel: [ 1707.381460] sd 1:0:0:0: [sdb] CDB: cdb[0]=0x2a: 2a 00 01 04 06 5f 00 00 30 00
Jun 6 09:10:07 darkstar kernel: [ 1707.381626] sd 1:0:0:0: [sdb] Unhandled error code
Jun 6 09:10:07 darkstar kernel: [ 1707.381631] sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
Jun 6 09:10:07 darkstar kernel: [ 1707.381639] sd 1:0:0:0: [sdb] CDB: cdb[0]=0x2a: 2a 00 01 04 00 3f 00 00 08 00
Jun 6 09:10:13 darkstar kernel: [ 1713.502972] sd 1:0:0:0: [sdb] Unhandled error code
Jun 6 09:10:13 darkstar kernel: [ 1713.502981] sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
Jun 6 09:10:13 darkstar kernel: [ 1713.502989] sd 1:0:0:0: [sdb] CDB: cdb[0]=0x2a: 2a 00 00 00 00 3f 00 00 08 00
Jun 6 09:10:13 darkstar kernel: [ 1713.503093] sd 1:0:0:0: [sdb] Unhandled error code
Jun 6 09:10:13 darkstar kernel: [ 1713.503099] sd 1:0:0:0: [sdb] Result: hostbyte=0x04 driverbyte=0x00
Jun 6 09:10:13 darkstar kernel: [ 1713.503108] sd 1:0:0:0: [sdb] CDB: cdb[0]=0x2a: 2a 00 00 00 00 3f 00 00 08 00
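To see how long the controller spent resetting the link before giving up, the ata2 lines can be summarized with a small awk filter. This is only a sketch: summarize_resets is a made-up helper name, and it assumes the kernel timestamp format shown above ("[ 727.923611]") and the exact message texts "hard resetting link" and "EH complete" from this log.

```shell
# summarize_resets PORT: read syslog lines on stdin and, for each error-handling
# episode on PORT (e.g. ata2), print the number of hard resets and the time from
# the first reset until "EH complete".
summarize_resets() (
    # LC_ALL=C keeps the decimal point a "." regardless of the system locale.
    LC_ALL=C awk -v port="$1" '
        # Extract the kernel uptime stamp from "... [ 727.923611] ...".
        function stamp(line,   s) {
            s = line
            sub(/^.*\[ */, "", s)
            sub(/\].*$/, "", s)
            return s + 0
        }
        index($0, " " port ": hard resetting link") {
            if (!n++) first = stamp($0)     # remember when the episode began
        }
        index($0, " " port ": EH complete") && n {
            printf "%s: %d hard resets, %.1f s until EH complete\n", port, n, stamp($0) - first
            n = 0                           # start counting a new episode
        }
    '
)

# Usage (path as on this Slackware box):
# summarize_resets ata2 < /var/log/messages
```

For the excerpt above this reports 4 hard resets over roughly 60 seconds, ending at the same timestamp as the first "Unhandled error code".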
1st attempt
Partitioned with cfdisk, no problems.
Formatted ext4, no problems.
Copied the contents of the root partition to it (approx. 15 GB), no problems.
Chrooted into it and started "mc", fail.
2nd attempt
Rebooted after the 1st attempt, unplugged and replugged the caddy.
Chrooted, no problems.
Opened "mc", edited lilo.conf, no problems.
Ran "lilo", fail.
3rd attempt
Rebooted after the 2nd attempt, unplugged and replugged the caddy.
Created and deleted files and folders on the mounted SSD, failed after several minutes.
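The create/delete test from the 3rd attempt can be sketched roughly as the loop below, which makes the failure easier to reproduce and time. The function name churn, the 1 MiB file size and the round count are illustrative assumptions, not the exact steps that were run by hand.

```shell
# churn DIR ROUNDS: create a directory with a 1 MiB file under DIR, flush it
# to disk, then delete it again. Stops at the first failed operation and
# reports the round it failed in.
churn() (
    dir=$1
    rounds=$2
    i=1
    while [ "$i" -le "$rounds" ]; do
        d="$dir/churn.$i"
        mkdir "$d" &&
        dd if=/dev/zero of="$d/blob" bs=4096 count=256 2>/dev/null &&
        sync &&
        rm -r "$d" || { echo "failed in round $i"; exit 1; }
        i=$(( i + 1 ))
    done
    echo "completed $rounds rounds"
)

# Usage against the mount point from the fstab entry:
# time churn /mnt/tmp 10000
```

The sync after each write matters here: without it the files may never leave the page cache, and the loop would not touch the drive at all.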
The above is from today's fiddling with it. What else I did:
- plugged the SSD into my Windows machine, all fine
- bought a 2nd caddy from a different vendor, same errors
- tried to boot directly from it, same errors
- upgraded laptop BIOS, same errors
- updated SSD firmware, same errors
- plugged a spare laptop HDD into the caddy, all fine
Now I'm really frustrated, as I've spent countless hours and roughly 150 euros on the drive and caddies trying to improve my work laptop, and now I'm facing the daunting prospect of plugging it into the Windows machine.