Hi specialists crew,
first of all I really hope, that I'm in the right section of the forum. Otherwise could you correct me, and then thanks and sorry for that
I have a strange problem to which I have not found any comments or solutions anywhere (google). So I really hope you can help me with this.
I have a netbook (Acer AO531h) on which I am running Debian Squeeze:
uname -r
Then I have a Raidsonic USB external case for two SATA Disks IB-2221StU-B. There are two Western Digital 640GB (I can get the types if of concern).
fdisk -l
Code:
...
Disk /dev/sdd: 640.1 GB, 640135028736 bytes
255 heads, 63 sectors/track, 77825 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0xdedc1dd2
Device Boot Start End Blocks Id System
/dev/sdd1 1 77825 625129281 fd Linux raid autodetect
Disk /dev/sde: 640.1 GB, 640135028736 bytes
255 heads, 63 sectors/track, 77825 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x28c2a7e0
Device Boot Start End Blocks Id System
/dev/sde1 1 77825 625129281 fd Linux raid autodetect
They are already formatted as raid1 and filled with data (on Truecrypt drive). Then I start it for example with:
mdadm --assemble --scan
Here is my problem:
As soon as I am using the drive (more or less excessively) for a long time (for example the resync which it will do once) it will suddenly change those two disks from /dev/sdb and /dev/sdc to /dev/sdd and /dev/sde, somehow similar to what happens when disconnecting and reconnecting the USB cable. That happens always.
dmesg
Code:
...
[203215.968347] usb 1-3: USB disconnect, device number 3
[203220.472076] usb 1-3: new high-speed USB device number 4 using ehci_hcd
[203220.606104] usb 1-3: New USB device found, idVendor=152d, idProduct=2336
[203220.606122] usb 1-3: New USB device strings: Mfr=1, Product=2, SerialNumber=5
[203220.606134] usb 1-3: Product: JM20336 SATA, USB Combo
[203220.606145] usb 1-3: Manufacturer: JMicron
[203220.606155] usb 1-3: SerialNumber: 231303819888
[203220.612514] scsi5 : usb-storage 1-3:1.0
[203221.613192] scsi 5:0:0:0: Direct-Access SAMSUNG HM641JI PQ: 0 ANSI: 2 CCS
[203221.614048] scsi 5:0:0:1: Direct-Access SAMSUNG HM641JI PQ: 0 ANSI: 2 CCS
[203221.618728] sd 5:0:0:0: [sdd] 1250263728 512-byte logical blocks: (640 GB/596 GiB)
[203221.620507] sd 5:0:0:0: [sdd] Write Protect is off
[203221.620525] sd 5:0:0:0: [sdd] Mode Sense: 00 38 00 00
[203221.621397] sd 5:0:0:0: [sdd] Asking for cache data failed
[203221.626812] sd 5:0:0:0: [sdd] Assuming drive cache: write through
[203221.635895] sd 5:0:0:0: [sdd] Asking for cache data failed
[203221.641351] sd 5:0:0:0: [sdd] Assuming drive cache: write through
[203221.648751] sd 5:0:0:1: [sde] 1250263728 512-byte logical blocks: (640 GB/596 GiB)
[203221.649499] sd 5:0:0:1: [sde] Write Protect is off
[203221.649508] sd 5:0:0:1: [sde] Mode Sense: 00 38 00 00
[203221.650236] sd 5:0:0:1: [sde] Asking for cache data failed
[203221.652893] sd 5:0:0:1: [sde] Assuming drive cache: write through
[203221.665246] sd 5:0:0:1: [sde] Asking for cache data failed
[203221.667743] sd 5:0:0:1: [sde] Assuming drive cache: write through
[203221.694442] sdd: sdd1
[203221.718702] sde: sde1
[203221.723249] sd 5:0:0:0: [sdd] Asking for cache data failed
[203221.725808] sd 5:0:0:0: [sdd] Assuming drive cache: write through
[203221.728348] sd 5:0:0:0: [sdd] Attached SCSI disk
[203221.730135] sd 5:0:0:1: [sde] Asking for cache data failed
[203221.732711] sd 5:0:0:1: [sde] Assuming drive cache: write through
[203221.735246] sd 5:0:0:1: [sde] Attached SCSI disk
[234740.313666] md: super_written gets error=-19, uptodate=0
[234740.313686] md/raid1:md127: Disk failure on sdc1, disabling device.
[234740.313692] md/raid1:md127: Operation continuing on 1 devices.
[234740.324080] md: super_written gets error=-19, uptodate=0
[234740.324119] md: super_written gets error=-19, uptodate=0
[234740.324652] md127: detected capacity change from 640131195904 to 0
[234740.324664] md: md127 stopped.
[234740.324676] md: unbind<sdc1>
[234740.324687] md: export_rdev(sdc1)
[234740.324733] md: unbind<sdb1>
[234740.324740] md: export_rdev(sdb1)
[242244.667642] md: md127 stopped.
[242244.674492] md: bind<sdd1>
[242244.675111] md: bind<sde1>
[242244.685188] bio: create slab <bio-1> at 1
[242244.685313] md/raid1:md127: not clean -- starting background reconstruction
[242244.685320] md/raid1:md127: active with 2 out of 2 mirrors
[242244.685380] md127: detected capacity change from 0 to 640131195904
[242244.688881] md127: unknown partition table
[242456.519693] device-mapper: uevent: version 1.0.3
[242456.520681] device-mapper: ioctl: 4.22.0-ioctl (2011-10-19) initialised: dm-devel@redhat.com
[242457.215405] kjournald starting. Commit interval 5 seconds
[242457.223354] md: resync of RAID array md127
[242457.223361] md: minimum _guaranteed_ speed: 1000 KB/sec/disk.
[242457.223366] md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for resync.
[242457.223374] md: using 128k window, over a total of 625128121k.
[242457.557077] EXT3-fs (dm-0): using internal journal
[242457.557088] EXT3-fs (dm-0): mounted filesystem with ordered data mode
[255610.708712] usb 1-3: USB disconnect, device number 4
[255610.715218] scsi 5:0:0:0: [sdd] killing request
[255610.715343] scsi 5:0:0:0: [sdd] Unhandled error code
[255610.715356] scsi 5:0:0:0: [sdd] Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK
[255610.715372] scsi 5:0:0:0: [sdd] CDB: Write(10): 2a 00 16 6c f4 3f 00 00 80 00
[255610.715407] end_request: I/O error, dev sdd, sector 376239167
[255610.720577] md/raid1:md127: Disk failure on sdd1, disabling device.
[255610.720584] md/raid1:md127: Operation continuing on 1 devices.
[255610.731953] md: md127: resync done.
[255610.738297] scsi 5:0:0:1: [sde] killing request
[255610.740087] scsi 5:0:0:1: [sde] Unhandled error code
[255610.740102] scsi 5:0:0:1: [sde] Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK
[255610.740120] scsi 5:0:0:1: [sde] CDB: Read(10): 28 00 16 6c f9 3f 00 00 80 00
[255610.740155] end_request: I/O error, dev sde, sector 376240447
[255610.745379] scsi 5:0:0:1: [sde] Unhandled error code
[255610.745391] scsi 5:0:0:1: [sde] Result: hostbyte=DID_ERROR driverbyte=DRIVER_OK
[255610.745406] scsi 5:0:0:1: [sde] CDB: Read(10): 28 20 16 6c f8 bf 00 00 80 00
[255610.745438] end_request: I/O error, dev sde, sector 376240319
[255610.763076] md: super_written gets error=-19, uptodate=0
[255610.763199] md/raid1:md127: sde: unrecoverable I/O read error for block 376238464
[255610.766346] md/raid1:md127: sde: unrecoverable I/O read error for block 376238592
..
[255610.816756] md/raid1:md127: sde: unrecoverable I/O read error for block 376238336
[255610.818343] md/raid1:md127: sde: unrecoverable I/O read error for block 376238208
[255610.820158] md: checkpointing resync of md127.
[255610.820476] md: super_written gets error=-19, uptodate=0
[255610.820566] md: super_written gets error=-19, uptodate=0
[255610.820578] RAID1 conf printout:
[255610.820582] --- wd:1 rd:2
[255610.820587] disk 0, wo:0, o:1, dev:sde1
[255610.820592] disk 1, wo:1, o:0, dev:sdd1
[255610.820596] RAID1 conf printout:
[255610.820600] --- wd:1 rd:2
[255610.820605] disk 0, wo:0, o:1, dev:sde1
[255610.820669] md: resync of RAID array md127
[255610.820674] md: minimum _guaranteed_ speed: 1000 KB/sec/disk.
[255610.820679] md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for resync.
[255610.820686] md: using 128k window, over a total of 625128121k.
[255610.820690] md: resuming resync of md127 from checkpoint.
[255610.820925] md: md127: resync done.
[255610.821144] md: super_written gets error=-19, uptodate=0
[255610.821170] md: super_written gets error=-19, uptodate=0
[255615.316079] usb 1-3: new high-speed USB device number 5 using ehci_hcd
...
[255615.450094] usb 1-3: SerialNumber: 231303819888
[255615.456232] scsi6 : usb-storage 1-3:1.0
[255616.457262] scsi 6:0:0:0: Direct-Access SAMSUNG HM641JI PQ: 0 ANSI: 2 CCS
...
[255616.565886] sd 6:0:0:1: [sdc] Attached SCSI disk
[276288.397664] Buffer I/O error on device md127, logical block 625128120
[276288.399147] Buffer I/O error on device md127, logical block 625128120
[276288.403454] Buffer I/O error on device dm-0, logical block 156237791
[276288.405114] Buffer I/O error on device dm-0, logical block 156237791
[276319.412362] md: md127 still in use.
[276326.708533] Buffer I/O error on device dm-0, logical block 78053890
[276326.709954] lost page write due to I/O error on dm-0
[276326.710169] JBD: I/O error detected when updating journal superblock for dm-0.
[276326.711920] Buffer I/O error on device dm-0, logical block 78053890
[276326.713548] lost page write due to I/O error on dm-0
[276326.714231] JBD: I/O error detected when updating journal superblock for dm-0.
[276326.716691] Buffer I/O error on device dm-0, logical block 0
[276326.718446] lost page write due to I/O error on dm-0
[276326.718667] EXT3-fs (dm-0): I/O error while writing superblock
[276326.920095] md: super_written gets error=-19, uptodate=0
[276329.754428] md127: detected capacity change from 640131195904 to 0
[276329.754447] md: md127 stopped.
[276329.754469] md: unbind<sde1>
[276329.754489] md: export_rdev(sde1)
[276329.754595] md: unbind<sdd1>
[276329.754612] md: export_rdev(sdd1)
[299631.926450] usb 1-3: USB disconnect, device number 5
...
[299636.554089] usb 1-3: SerialNumber: 231303819888
[299636.560563] scsi7 : usb-storage 1-3:1.0
[299637.561256] scsi 7:0:0:0: Direct-Access SAMSUNG HM641JI PQ: 0 ANSI: 2 CCS
...
[299637.675215] sd 7:0:0:0: [sdb] Attached SCSI disk
[409091.300147] usb 1-3: USB disconnect, device number 6
...
[409095.942149] usb 1-3: SerialNumber: 231303819888
[409095.948409] scsi8 : usb-storage 1-3:1.0
[409096.949188] scsi 8:0:0:0: Direct-Access SAMSUNG HM641JI PQ: 0 ANSI: 2 CCS
...
[409097.053901] sd 8:0:0:1: [sdc] Attached SCSI disk
[438316.471846] md: md127 stopped.
[438316.727902] md: bind<sdb1>
[438316.728531] md: bind<sdc1>
[438316.734336] bio: create slab <bio-1> at 1
[438316.734460] md/raid1:md127: not clean -- starting background reconstruction
[438316.734479] md/raid1:md127: active with 2 out of 2 mirrors
[438316.734536] md127: detected capacity change from 0 to 640131195904
[438316.742900] md127: unknown partition table
[438336.976356] md: resync of RAID array md127
[438336.976363] md: minimum _guaranteed_ speed: 1000 KB/sec/disk.
[438336.976368] md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for resync.
[438336.976376] md: using 128k window, over a total of 625128121k.
[438337.464682] kjournald starting. Commit interval 5 seconds
...
[458218.280712] usb 1-3: USB disconnect, device number 7
[458218.287406] scsi 8:0:0:0: [sdb] killing request
[458218.287448] scsi 8:0:0:0: [sdb] Unhandled error code
[458218.287460] scsi 8:0:0:0: [sdb] Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK
[458218.287476] scsi 8:0:0:0: [sdb] CDB: Write(10): 2a 00 1a 31 3f bf 00 00 80 00
[458218.287511] end_request: I/O error, dev sdb, sector 439435199
[458218.293066] md/raid1:md127: Disk failure on sdb1, disabling device.
[458218.293073] md/raid1:md127: Operation continuing on 1 devices.
[458218.304118] md: md127: resync done.
[458218.307840] scsi 8:0:0:1: [sdc] killing request
[458218.308057] scsi 8:0:0:1: [sdc] Unhandled error code
[458218.308064] scsi 8:0:0:1: [sdc] Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK
[458218.308073] scsi 8:0:0:1: [sdc] CDB: Read(10): 28 00 1a 31 40 bf 00 00 80 00
[458218.308090] end_request: I/O error, dev sdc, sector 439435455
[458218.312045] scsi 8:0:0:1: [sdc] Unhandled error code
[458218.312053] scsi 8:0:0:1: [sdc] Result: hostbyte=DID_ERROR driverbyte=DRIVER_OK
[458218.312062] scsi 8:0:0:1: [sdc] CDB: Read(10): 28 20 1a 31 40 3f 00 00 80 00
[458218.312080] end_request: I/O error, dev sdc, sector 439435327
[458218.332073] md: super_written gets error=-19, uptodate=0
[458218.332099] md/raid1:md127: sdc: unrecoverable I/O read error for block 439433472
[458218.336357] md/raid1:md127: sdc: unrecoverable I/O read error for block 439433600
...
lspci
Code:
00:00.0 Host bridge: Intel Corporation Mobile 945GME Express Memory Controller Hub (rev 03)
00:02.0 VGA compatible controller: Intel Corporation Mobile 945GME Express Integrated Graphics Controller (rev 03)
00:02.1 Display controller: Intel Corporation Mobile 945GM/GMS/GME, 943/940GML Express Integrated Graphics Controller (rev 03)
00:1b.0 Audio device: Intel Corporation N10/ICH 7 Family High Definition Audio Controller (rev 02)
00:1c.0 PCI bridge: Intel Corporation N10/ICH 7 Family PCI Express Port 1 (rev 02)
00:1c.2 PCI bridge: Intel Corporation N10/ICH 7 Family PCI Express Port 3 (rev 02)
00:1c.3 PCI bridge: Intel Corporation N10/ICH 7 Family PCI Express Port 4 (rev 02)
00:1d.0 USB Controller: Intel Corporation N10/ICH 7 Family USB UHCI Controller #1 (rev 02)
00:1d.1 USB Controller: Intel Corporation N10/ICH 7 Family USB UHCI Controller #2 (rev 02)
00:1d.2 USB Controller: Intel Corporation N10/ICH 7 Family USB UHCI Controller #3 (rev 02)
00:1d.3 USB Controller: Intel Corporation N10/ICH 7 Family USB UHCI Controller #4 (rev 02)
00:1d.7 USB Controller: Intel Corporation N10/ICH 7 Family USB2 EHCI Controller (rev 02)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev e2)
00:1f.0 ISA bridge: Intel Corporation 82801GBM (ICH7-M) LPC Interface Bridge (rev 02)
00:1f.2 SATA controller: Intel Corporation 82801GBM/GHM (ICH7 Family) SATA AHCI Controller (rev 02)
00:1f.3 SMBus: Intel Corporation N10/ICH 7 Family SMBus Controller (rev 02)
01:00.0 Ethernet controller: Atheros Communications AR8121/AR8113/AR8114 Gigabit or Fast Ethernet (rev b0)
02:00.0 Network controller: Atheros Communications Inc. AR9285 Wireless Network Adapter (PCI-Express) (rev 01)
lsusb -v
Code:
Bus 005 .. 002 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
Device Descriptor:
bLength 18
bDescriptorType 1
bcdUSB 1.10
bDeviceClass 9 Hub
bDeviceSubClass 0 Unused
bDeviceProtocol 0 Full speed (or root) hub
bMaxPacketSize0 64
idVendor 0x1d6b Linux Foundation
idProduct 0x0001 1.1 root hub
bcdDevice 3.02
iManufacturer 3 Linux 3.2.0-0.bpo.3-486 uhci_hcd
iProduct 2 UHCI Host Controller
iSerial 1 0000:00:1d.0
bNumConfigurations 1
Configuration Descriptor:
bLength 9
bDescriptorType 2
wTotalLength 25
bNumInterfaces 1
bConfigurationValue 1
iConfiguration 0
bmAttributes 0xe0
Self Powered
Remote Wakeup
MaxPower 0mA
Interface Descriptor:
bLength 9
bDescriptorType 4
bInterfaceNumber 0
bAlternateSetting 0
bNumEndpoints 1
bInterfaceClass 9 Hub
bInterfaceSubClass 0 Unused
bInterfaceProtocol 0 Full speed (or root) hub
iInterface 0
Endpoint Descriptor:
bLength 7
bDescriptorType 5
bEndpointAddress 0x81 EP 1 IN
bmAttributes 3
Transfer Type Interrupt
Synch Type None
Usage Type Data
wMaxPacketSize 0x0002 1x 2 bytes
bInterval 255
Hub Descriptor:
bLength 9
bDescriptorType 41
nNbrPorts 2
wHubCharacteristic 0x000a
No power switching (usb 1.0)
Per-port overcurrent protection
bPwrOn2PwrGood 1 * 2 milli seconds
bHubContrCurrent 0 milli Ampere
DeviceRemovable 0x00
PortPwrCtrlMask 0xff
Hub Port Status:
Port 1: 0000.0100 power
Port 2: 0000.0100 power
Device Status: 0x0001
Self Powered
Bus 001 Device 003: ID 152d:2336 JMicron Technology Corp. / JMicron USA Technology Corp. Hard Disk Drive
Device Descriptor:
bLength 18
bDescriptorType 1
bcdUSB 2.00
bDeviceClass 0 (Defined at Interface level)
bDeviceSubClass 0
bDeviceProtocol 0
bMaxPacketSize0 64
idVendor 0x152d JMicron Technology Corp. / JMicron USA Technology Corp.
idProduct 0x2336 Hard Disk Drive
bcdDevice 1.00
iManufacturer 1 JMicron
iProduct 2 JM20336 SATA, USB Combo
iSerial 5 231303819888
bNumConfigurations 1
Configuration Descriptor:
bLength 9
bDescriptorType 2
wTotalLength 32
bNumInterfaces 1
bConfigurationValue 1
iConfiguration 4 USB Mass Storage
bmAttributes 0xc0
Self Powered
MaxPower 2mA
Interface Descriptor:
bLength 9
bDescriptorType 4
bInterfaceNumber 0
bAlternateSetting 0
bNumEndpoints 2
bInterfaceClass 8 Mass Storage
bInterfaceSubClass 6 SCSI
bInterfaceProtocol 80 Bulk (Zip)
iInterface 6 Bulk-In, Bulk-Out Interface
Endpoint Descriptor:
bLength 7
bDescriptorType 5
bEndpointAddress 0x81 EP 1 IN
bmAttributes 2
Transfer Type Bulk
Synch Type None
Usage Type Data
wMaxPacketSize 0x0200 1x 512 bytes
bInterval 0
Endpoint Descriptor:
bLength 7
bDescriptorType 5
bEndpointAddress 0x02 EP 2 OUT
bmAttributes 2
Transfer Type Bulk
Synch Type None
Usage Type Data
wMaxPacketSize 0x0200 1x 512 bytes
bInterval 0
Device Qualifier (for other device speed):
bLength 10
bDescriptorType 6
bcdUSB 2.00
bDeviceClass 0 (Defined at Interface level)
bDeviceSubClass 0
bDeviceProtocol 0
bMaxPacketSize0 64
bNumConfigurations 1
can't get debug descriptor: Connection timed out
Device Status: 0x0001
Self Powered
...
Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
Device Descriptor:
bLength 18
bDescriptorType 1
bcdUSB 2.00
bDeviceClass 9 Hub
bDeviceSubClass 0 Unused
bDeviceProtocol 0 Full speed (or root) hub
bMaxPacketSize0 64
idVendor 0x1d6b Linux Foundation
idProduct 0x0002 2.0 root hub
bcdDevice 3.02
iManufacturer 3 Linux 3.2.0-0.bpo.3-486 ehci_hcd
iProduct 2 EHCI Host Controller
iSerial 1 0000:00:1d.7
bNumConfigurations 1
Configuration Descriptor:
bLength 9
bDescriptorType 2
wTotalLength 25
bNumInterfaces 1
bConfigurationValue 1
iConfiguration 0
bmAttributes 0xe0
Self Powered
Remote Wakeup
MaxPower 0mA
Interface Descriptor:
bLength 9
bDescriptorType 4
bInterfaceNumber 0
bAlternateSetting 0
bNumEndpoints 1
bInterfaceClass 9 Hub
bInterfaceSubClass 0 Unused
bInterfaceProtocol 0 Full speed (or root) hub
iInterface 0
Endpoint Descriptor:
bLength 7
bDescriptorType 5
bEndpointAddress 0x81 EP 1 IN
bmAttributes 3
Transfer Type Interrupt
Synch Type None
Usage Type Data
wMaxPacketSize 0x0004 1x 4 bytes
bInterval 12
Hub Descriptor:
bLength 11
bDescriptorType 41
nNbrPorts 8
wHubCharacteristic 0x000a
No power switching (usb 1.0)
Per-port overcurrent protection
bPwrOn2PwrGood 10 * 2 milli seconds
bHubContrCurrent 0 milli Ampere
DeviceRemovable 0x00 0x00
PortPwrCtrlMask 0xff 0xff
Hub Port Status:
Port 1: 0000.0503 highspeed power enable connect
Port 2: 0000.0100 power
Port 3: 0000.0100 power
Port 4: 0000.0100 power
Port 5: 0000.0503 highspeed power enable connect
Port 6: 0000.0100 power
Port 7: 0000.0100 power
Port 8: 0000.0100 power
Device Status: 0x0001
Self Powered
I don't know exactly how to read this things, thats why you get such a dump, sorry very much for that..
I am only running mdadm, apache, php, samba and truecrypt, not really anything else.
What I found out until now:
- it's not the harddisk sleep or idle mode since it happens while using it.
- it's probably not the external case, since it workes without issues on other computers (Debian Squeeze, any kernel).
- it's probably the netbook, since before I had another Debian Squeeze in VMware Player 3 on Windows XP host and I had exactly the same problem.
- it has nothing to do with Truecrypt since it also happens when not mounted, but testing for example with badblocks (array or single drives).
- it has probably nothing to do with CPU load, since it was always quite idle.
- there is also no problem if I do the same tests on a normal external single-harddrive or flash drive hooked via USB.
Suspects:
- Netbook chipset, chipset of external case
Is there anything I can do about this?
Thank you very much in advance!