Ubuntu 10.04 - JMicron JMB363 SATA controller - kernel freeze
Linux - HardwareThis forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
Ubuntu 10.04 - JMicron JMB363 SATA controller - kernel freeze
Hi everybody,
I'm struggling with a quite annoying problem.
Let me explain my configuration quickly first.
OS: Ubuntu 10.04
kernel: 2.6.32-30-generic
CPU: Intel(R) Pentium(R) D CPU 3.20GHz
MB: Asus P5VD2-X
GPU: GeForce 7300 (PCI-E 16x)
2 SATA ports 1.5Gbps work perfectly. Never had a single issue.
1 SATA 3.0Gbps controlled by a JMicron JMB363 SATA controller that freezes the computer under heavy load (copying/rsyncing GBs of files).
If I leave the computer idle or do basic tasks instead, it works fine.
Sometimes just before crashing/freezing leaves some errors in the messages and kern.log logs.
Messages like:
ata8.00: exception Emask 0x33 SAct 0xf SErr 0x0 action 0xe frozen
ata8.00: irq_stat 0xffffffff, unknown FIS 00000000 00000000 00000000 00000000, host bus
ata8.00: failed command: READ FPDMA QUEUED
ata8.00: cmd 60/60:00:49:21:3d/00:00:01:00:00/40 tag 0 ncq 49152 in
res 40/00:04:49:21:3d/00:00:01:00:00/40 Emask 0x32 (host bus error)
ata8.00: status: { DRDY }
ata8.00: failed command: READ FPDMA QUEUED
ata8.00: cmd 60/80:08:81:21:3d/00:00:01:00:00/40 tag 1 ncq 65536 in
res 40/00:04:49:21:3d/00:00:01:00:00/40 Emask 0x32 (host bus error)
ata8.00: status: { DRDY }
ata8.00: failed command: READ FPDMA QUEUED
ata8.00: cmd 60/80:10:81:20:3d/00:00:01:00:00/40 tag 2 ncq 65536 in
res 40/00:04:49:21:3d/00:00:01:00:00/40 Emask 0x32 (host bus error)
ata8.00: status: { DRDY }
ata8.00: failed command: READ FPDMA QUEUED
ata8.00: cmd 60/38:18:11:21:3d/00:00:01:00:00/40 tag 3 ncq 28672 in
res 40/00:04:49:21:3d/00:00:01:00:00/40 Emask 0x32 (host bus error)
ata8.00: status: { DRDY }
ata8: hard resetting link
ata8: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
ata8.00: configured for UDMA/133
ata8: EH complete
I've been trying to sort this problem for quite a while now. At the beginning I didn't know the problem was the Jmicron SATA port/controller. I switched disks/cables a few times. The problem was always on the same port. That's why I'm sure is this Jmicron the issue.
I tried changing the settings in the BIOS putting the Jmicron controller in ahci mode (rather than IDE mode) but nothing changed.
I have to say something more about my configuration.
The system is running on Linux Software Raid (mdadm).
Raid1 for the boot partition and Raid5 for the rest.
At the moment the RAID array is in degradeted mode since connecting the 3rd hard drive (the one on the Jmicron SATA port) would limit the things I can do, freezing the system.
Last, very important I guess, piece of information worth mentioning.
When I've installed this Ubuntu (10.04) I've done it on a single disk (NO RAID) on one of my "normal" (1.5Gbps) SATA port (SO NO JMICRON EITHER).
When I've decided to put a RAID system in place I've copied the files manually and it worked fine. I have done it before.
Now the problem here might be that, having installed the system with no Jmicron controller in use the Ubuntu installation didn't load the kernel modules needed.
Now, after having moved the OS to work on other hard drives (one ow which on a Jmicron controller) those modules are still not loaded.
Or maybe some others are loaded and are causing conflicts..
I'm not sure all of this makes sense. That's why I'm here hoping someone could help me to shed some light on it. :-)
I have a Gigabyte mainboard with exactly the same controller, running with ubuntu 10.04 but never had difficulties with it.
But I got the same error messages regarding a harddisk on a different controller, and it seems to me that that harddisk is defective.
Please post the output after a restart from dmesg | grep ata8
First of all, here is the command "dmesg | grep ata8" after having restarted:
[ 0.940333] ata8: PATA max UDMA/133 cmd 0x170 ctl 0x376 bmdma 0xe408 irq 15
I also run this other for a general overview of the disks in case is needed:
"dmesg | grep ata[0-9]"
[ 0.890884] ata1: PATA max UDMA/100 cmd 0xcc00 ctl 0xc800 bmdma 0xbc00 irq 29
[ 0.890888] ata2: PATA max UDMA/100 cmd 0xc400 ctl 0xc000 bmdma 0xbc08 irq 29
[ 0.914381] ata3: SATA max UDMA/133 cmd 0xfc00 ctl 0xf800 bmdma 0xec00 irq 21
[ 0.914385] ata4: SATA max UDMA/133 cmd 0xf400 ctl 0xf000 bmdma 0xec08 irq 21
[ 0.936855] ata5: SATA max UDMA/133 abar m8192@0xdfefe000 port 0xdfefe100 irq 28
[ 0.936861] ata6: SATA max UDMA/133 abar m8192@0xdfefe000 port 0xdfefe180 irq 28
[ 0.940329] ata7: PATA max UDMA/133 cmd 0x1f0 ctl 0x3f6 bmdma 0xe400 irq 14
[ 0.940333] ata8: PATA max UDMA/133 cmd 0x170 ctl 0x376 bmdma 0xe408 irq 15
[ 1.104460] ata7.01: ATAPI: HL-DT-STDVD-RAM GH22NP20, 1.01, max UDMA/66
[ 1.120370] ata7.01: configured for UDMA/66
[ 1.128019] ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[ 1.256029] ata5: SATA link down (SStatus 0 SControl 300)
[ 1.292372] ata3.00: ATA-8: SAMSUNG HD204UI, 1AQ10001, max UDMA/133
[ 1.292377] ata3.00: 3907029168 sectors, multi 16: LBA48 NCQ (depth 0/32)
[ 1.300390] ata3.00: configured for UDMA/133
[ 1.420031] ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 1.426182] ata6.00: ATA-8: SAMSUNG HD204UI, 1AQ10001, max UDMA/133
[ 1.426189] ata6.00: 3907029168 sectors, multi 0: LBA48 NCQ (depth 31/32), AA
[ 1.432430] ata6.00: configured for UDMA/133
[ 1.504026] ata4: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[ 1.668364] ata4.00: ATA-8: SAMSUNG HD204UI, 1AQ10001, max UDMA/133
[ 1.668370] ata4.00: 3907029168 sectors, multi 16: LBA48 NCQ (depth 0/32)
[ 1.676394] ata4.00: configured for UDMA/133
I haven't applied the path yet but I will soon-ish.
Anyway I'm not sure how related it is, so in the meantime, I'm doing some other tests to better understand when this issue happens.
I tried writing/reading data to/from the disk (using the Jmicron controller) and I've been surprised to notice that the computer seems to experience the problem when is READING data ONLY.
I copied 1TB+ to the disk (write action) and everything went fine, I read the data (trying to copy it to another hard disk) and once it crashed after about 10/20GBs without logging any error, whilst the second time it started logging errors in the kern.log log so I have interrupted it immediately.
I copied 1TB+ to the disk (write action) and everything went fine, I read the data (trying to copy it to another hard disk) and once it crashed after about 10/20GBs without logging any error, whilst the second time it started logging errors in the kern.log log so I have interrupted it immediately.
Did your system freeze again? I ask this because I'm not certain about the cause of your system freezing; it's something usually caused by a memory failure. You won't find that in the logging.
Also I did not see any alarming lines in your latest dmesg.
yes, if I READ (NOT IF I WRITE) data from that disk (connected trought the JMICRON controller) the system either freezes or logs the errors shown above and freezes.
To me it sounds like the same problem, also because when I use the system without touching that SATA port it's all super stable.
I tried to do some "manual" stress tests and even when the load average got to 6 with CPU idle time to 0% the system was still fine and responding.
Anyway, if it was a memory failure as you say, how do I debug it?
I've had a gigabyte board with similar problems and have had a really hard time with it. I eventually just disabled the JMicron controller and used the other one, which luckily had 4 ports, plenty for what I need.
There may be a way around this, so can you post the output of:
Thanks Sjonnie48, I'll give memtest a try. I'm sure one more test does not hurt.. :-)
Hi H_TeXMeX_H,
unfortunately my motherboard has only 2 SATA (and two IDEs) plus another from the JMIcron controller.
I know this controller supports SATA multiplier so, in the future, (if I can make it work properly) I might get one of those and extend my computer life of another few years... :-)
"lspci -k":
00:00.0 Host bridge: VIA Technologies, Inc. P4M890 Host Bridge
Kernel driver in use: agpgart-via
Kernel modules: via-agp
00:00.1 Host bridge: VIA Technologies, Inc. P4M890 Host Bridge
00:00.2 Host bridge: VIA Technologies, Inc. P4M890 Host Bridge
00:00.3 Host bridge: VIA Technologies, Inc. P4M890 Host Bridge
00:00.4 Host bridge: VIA Technologies, Inc. P4M890 Host Bridge
00:00.5 PIC: VIA Technologies, Inc. P4M890 I/O APIC Interrupt Controller
00:00.6 Host bridge: VIA Technologies, Inc. P4M890 Security Device
00:00.7 Host bridge: VIA Technologies, Inc. P4M890 Host Bridge
00:01.0 PCI bridge: VIA Technologies, Inc. VT8237/VX700 PCI Bridge
Kernel modules: shpchp
00:02.0 PCI bridge: VIA Technologies, Inc. P4M890 PCI to PCI Bridge Controller
Kernel driver in use: pcieport
Kernel modules: shpchp
00:03.0 PCI bridge: VIA Technologies, Inc. P4M890 PCI to PCI Bridge Controller
Kernel driver in use: pcieport
Kernel modules: shpchp
00:0f.0 IDE interface: VIA Technologies, Inc. Device 5337 (rev 80)
Kernel driver in use: sata_via
Kernel modules: sata_via
00:0f.1 IDE interface: VIA Technologies, Inc. VT82C586A/B/VT82C686/A/B/VT823x/A/C PIPC Bus Master IDE (rev 07)
Kernel driver in use: pata_via
Kernel modules: pata_via
00:10.0 USB Controller: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller (rev a0)
Kernel driver in use: uhci_hcd
00:10.1 USB Controller: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller (rev a0)
Kernel driver in use: uhci_hcd
00:10.2 USB Controller: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller (rev a0)
Kernel driver in use: uhci_hcd
00:10.3 USB Controller: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller (rev a0)
Kernel driver in use: uhci_hcd
00:10.4 USB Controller: VIA Technologies, Inc. USB 2.0 (rev 86)
Kernel driver in use: ehci_hcd
00:11.0 ISA bridge: VIA Technologies, Inc. VT8237A PCI to ISA Bridge
Kernel modules: i2c-viapro
00:11.7 Host bridge: VIA Technologies, Inc. VT8251 Ultra VLINK Controller
00:13.0 Host bridge: VIA Technologies, Inc. VT8237A Host Bridge
00:13.1 PCI bridge: VIA Technologies, Inc. VT8237A PCI to PCI Bridge
02:00.0 VGA compatible controller: nVidia Corporation G72 [GeForce 7300 LE] (rev a1)
Kernel driver in use: nvidia
Kernel modules: nvidia-current, nvidiafb, nouveau
03:00.0 SATA controller: JMicron Technology Corp. JMB362/JMB363 Serial ATA Controller (rev 02)
Kernel driver in use: ahci
Kernel modules: ahci
03:00.1 IDE interface: JMicron Technology Corp. JMB362/JMB363 Serial ATA Controller (rev 02)
Kernel driver in use: pata_jmicron
Kernel modules: pata_jmicron
04:07.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8110SC/8169SC Gigabit Ethernet (rev 10)
Kernel driver in use: r8169
Kernel modules: r8169
80:01.0 Audio device: VIA Technologies, Inc. VT1708/A [Azalia HDAC] (VIA High Definition Audio Controller) (rev 10)
Kernel driver in use: HDA Intel
Kernel modules: snd-hda-intel
One thing I just remembered, look on the HDD plugged into the JMicron SATA, and use the jumpers to limit the speed to 1.5 Gbps. This may solve the problem.
If it doesn't, try putting the VIA controller in AHCI mode.
Do you have any IDE drives plugged in ? It seems like you do.
Unfortunately I haven't found how to set the SAMSUNG HD204UI to 1.5Gbps with jumpers.
I read somewhere that it should be done via firmware... :-/ I'll have to read a bit more about it...
I'd probably try the firmware patch first...
There is no option in the BIOS to set the VIA controller to AHCI mode.
The only options related are:
- SATA controller: Enabled (options: Disabled, Enabled)
- SATA controller mode: IDE (options: IDE, RAID)
and another for the JMicron controller:
- JMicron RAID controller: IDE (options: Disabled, IDE, RAID, AHCI)
I tried setting the JMicron controller to AHCI but nothing changed. Same issue...
No, I don't have any IDE disks plugged in.
There are three 2TB SAMSUNG connected to SATA1, SATA2 and SATA_RAID1 (JMicron) and one 1TB external connected via USB.
Maybe you have seen the USB one...
That's strange because there's 'sata_via' for the SATA and 'pata_via' for the IDE. I don't see why pata_via is being used here, maybe it should be blacklisted.
This evening I set the disk on the JMicron controller to SATA 150. (Samsung has a nice tool to do that).
Unfortunately it didn't help. Same kind of errors:
Yeah, I think it's a problem with interrupts. Notice that ata7 and ata8 and on IRQ 28, so even tho there's no device at ata7, interrupts are being generated that give errors referring to it.
Try putting it in SATA mode. If that doesn't work, all I can say is to disable the JMicron controller and find another solution. That's the only solution that worked for me. It's just a bad controller.
Maybe you could try a new kernel version, maybe they have some workarounds for this issue.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.