LinuxQuestions.org
Review your favorite Linux distribution.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware
User Name
Password
Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?

Notices


Reply
  Search this Thread
Old 05-23-2006, 02:14 AM   #1
Clemente
Member
 
Registered: Aug 2003
Distribution: Debian, Ubuntu
Posts: 188

Rep: Reputation: 30
SATA drives suddenly malfunctional


Hi,

I got some trouble with two sata drives connected to a winfast mainboard (unknown model). The system is running debian sarge with kernel 2.6.8-11-amd64-k8.

As mentioned, I connected two 200GB sata drives (sda, sdb), organized as software raid1. All went fine fpr several months and reboots. Suddenly, the system didn't recognize the drives correctly.
While booting, it hangs several minutes. After coming up, sdb simply isn't available. Any access to /dev/sda (fdisk or mount) results in a unkillable process (output at bottom of posting).

Dmesg shows some lines, that look suspect to me:
Code:
ACPI: PCI interrupt 0000:00:05.0[A] -> GSI 17 (level, low) -> IRQ 17
ata1: SATA max UDMA/133 cmd 0xE900 ctl 0xEA02 bmdma 0xED00 irq 17
ata2: SATA max UDMA/133 cmd 0xEB00 ctl 0xEC02 bmdma 0xED08 irq 17
ata1: dev 0 cfg 49:2f00 82:7c6b 83:7f09 84:4673 85:7c69 86:3e01 87:4663 88:007f
ata1: dev 0 ATA, max UDMA/133, 398297088 sectors: lba48
ata1: dev 0 configured for UDMA/133
scsi0 : sata_sis
ata2: no device found (phy stat 00000000)
scsi1 : sata_sis
  Vendor: ATA       Model: Maxtor 6L200M0    Rev: BANC
  Type:   Direct-Access                      ANSI SCSI revision: 05
SCSI device sda: 398297088 512-byte hdwr sectors (203928 MB)
SCSI device sda: drive cache: write back
 /dev/scsi/host0/bus0/target0/lun0:<3>ata1: command 0x25 timeout, stat 0x50 host _stat 0x24
 p1
Attached scsi disk sda at scsi0, channel 0, id 0, lun 0
Does the acpi line or this irp 17 thing mean something in relation to the drive problem?

Thanks a lot,
Clemente

--

Little more dmesg output:
Code:
ACPI: PCI interrupt 0000:00:03.3[D] -> GSI 23 (level, low) -> IRQ 23
ehci_hcd 0000:00:03.3: Silicon Integrated Systems [SiS] USB 2.0 Controller
ehci_hcd 0000:00:03.3: irq 23, pci mem ffffff0000274000
ehci_hcd 0000:00:03.3: new USB bus registered, assigned bus number 4
PCI: cache line size of 64 is not supported by device 0000:00:03.3
ehci_hcd 0000:00:03.3: USB 2.0 enabled, EHCI 1.00, driver 2004-May-10
hub 4-0:1.0: USB hub found
hub 4-0:1.0: 8 ports detected
ACPI: PCI interrupt 0000:00:05.0[A] -> GSI 17 (level, low) -> IRQ 17
ata1: SATA max UDMA/133 cmd 0xE900 ctl 0xEA02 bmdma 0xED00 irq 17
ata2: SATA max UDMA/133 cmd 0xEB00 ctl 0xEC02 bmdma 0xED08 irq 17
ata1: dev 0 cfg 49:2f00 82:7c6b 83:7f09 84:4673 85:7c69 86:3e01 87:4663 88:007f
ata1: dev 0 ATA, max UDMA/133, 398297088 sectors: lba48
ata1: dev 0 configured for UDMA/133
scsi0 : sata_sis
ata2: no device found (phy stat 00000000)
scsi1 : sata_sis
  Vendor: ATA       Model: Maxtor 6L200M0    Rev: BANC
  Type:   Direct-Access                      ANSI SCSI revision: 05
SCSI device sda: 398297088 512-byte hdwr sectors (203928 MB)
SCSI device sda: drive cache: write back
 /dev/scsi/host0/bus0/target0/lun0:<3>ata1: command 0x25 timeout, stat 0x50 host _stat 0x24
 p1
Attached scsi disk sda at scsi0, channel 0, id 0, lun 0
eth0: Media Link On 100mbps full-duplex
NET: Registered protocol family 10
Disabled Privacy Extensions on device ffffffff80338ce0(lo)

Unkillable Process:
Code:
root@server1 : ~ : 09:04
>ps aux
root      2579  0.0  0.2  8312 2604 ?        S    May16   0:00 /usr/sbin/smbd -D
root     24034  0.0  0.0  1888  632 ?        D    May22   0:00 fdisk -l
root     24067  0.0  0.0  1756  724 ?        Ss   May22   0:00 /usr/sbin/cron

Last edited by Clemente; 05-23-2006 at 02:15 AM.
 
Old 05-24-2006, 12:01 AM   #2
WhatsHisName
Senior Member
 
Registered: Oct 2003
Location: /earth/usa/nj (UTC-5)
Distribution: RHEL, AltimaLinux, Rocky
Posts: 1,151

Rep: Reputation: 46
It sounds more like a hardware problem than something to do with the OS.

You should download the drive manufacturer’s diagnostic utility and test both drives with it.

If a drive fails the testing, then move it to another system and test it again. If it fails again, then the solution is fairly obvious. Replace it.

If it passes in the second system, then look for things like a failing power supply or a malfunctioning controller in the original system.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
2 hard drives, XP on my main sata drives, 10.2 on my IDE LILO doesnt show on boot up Dachy Slackware 14 01-03-2008 07:01 AM
SATA Drives Sfisher961 Linux From Scratch 1 10-05-2005 07:50 PM
SATA drives kaplan71 Red Hat 1 06-27-2005 04:41 PM
Suddenly no installation of any OS is detecting my hard drives. Happend after reboot. brynjarh Linux - Hardware 2 09-22-2004 04:39 PM
SATA Drives smace Linux - Newbie 2 04-13-2004 09:50 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware

All times are GMT -5. The time now is 08:21 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration