LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Slackware (https://www.linuxquestions.org/questions/slackware-14/)
-   -   --current, randomly timed kernel oops on bootup of two test boxen (https://www.linuxquestions.org/questions/slackware-14/current-randomly-timed-kernel-oops-on-bootup-of-two-test-boxen-852843/)

Old_Fogie 12-28-2010 11:36 AM

--current, randomly timed kernel oops on bootup of two test boxen
 
Hello all,

I have two boxen running --current. Over the last two weeks (give or take) they both will randomly just lockup during the boot process.

I'd say this all began around the time I had this issue, which is still randomly occuring:

http://www.linuxquestions.org/questi...-drive-851900/


I don't know how to duplicate it so it happens every boot. But an un-scientific guess is, that I have these lockups 1 out of every five boot ups.

The problem does happen when the pc's are:

1. powered off, and booting up,
2. reboot issued.

But it's not *every* time.

I'm using an initrd for jfs filesystem on both machines.

I have no logs, as the boot process comes to life, I see the message where it's trying to bring up floppy drive, it sits there for about 2 to 3 seconds, and then a bunch of errors are spewed to the screen.

The oops lines are all references sata in one way or another.

Now one pc is indeed using a sata as the "/"

The other pc uses a pata drive for "/" but there is sata ports on that motherboard, but they are not in use at all for that pc.

Since the pc's are locking up prior to the drives being mounted, I don't think anything is able to get logged yet. I don't see anything in my logs, so I don't know what I may be able to post here.

One pc uses:
IDE interface: Intel Corporation 82801FB/FW (ICH6/ICH6W) SATA Controller (rev 03)

And the other uses:
an asus a7n8x-e deluxe nforce 2 motherboard that doesnt even list the sata controllers as there is nothing plugged into them.

Any thoughts greatly appreciated.

EdGr 12-28-2010 12:17 PM

This morning, my normally unflappable Dell Inspiron E1505 greeted me with a kernel panic very early in the boot. No logs. I powered it off and rebooted a dozen times and saw the kernel panic exactly one more time.

I'm running -current with a custom kernel (that differs from the generic kernel in only 9 options).

Once booted, the machine runs fine.

I'm not sure if this is the same problem that you are seeing. Because the problem is infrequent and random, it will be hard to debug.

ETA: Here is a screen shot http://www.edgrochowski.com/photos-l...c-12-28-10.png

Ed

Old_Fogie 12-28-2010 02:16 PM

Hi Edgr I cannot access the picture, the server is throwing error '500'.

EdGr 12-28-2010 02:22 PM

Quote:

Originally Posted by Old_Fogie (Post 4205763)
Hi Edgr I cannot access the picture, the server is throwing error '500'.

Fixed. Please try now.
Ed

aaazen 12-28-2010 03:52 PM

--current, randomly timed kernel oops on bootup of two test boxen
 
I am also getting this error randomly on an Intel D510MO system.

But I set up a serial port/null modem to another machine running minicom to capture the whole thing.

Apparently there is a 30,000 byte limit on each post so this is broken into two...

Quote:


[ 0.000000] Linux version 2.6.35.7-smp (root@midas) (gcc version 4.5.1 (GCC) ) #2 SMP Mon Oct 11 14:52:09 CDT 2010
[ 0.000000] BIOS-provided physical RAM map:
[ 0.000000] BIOS-e820: 0000000000000000 - 000000000008f000 (usable)
[ 0.000000] BIOS-e820: 000000000008f000 - 0000000000090000 (reserved)
[ 0.000000] BIOS-e820: 0000000000090000 - 000000000009ec00 (usable)
[ 0.000000] BIOS-e820: 000000000009ec00 - 00000000000a0000 (reserved)
[ 0.000000] BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)
[ 0.000000] BIOS-e820: 0000000000100000 - 00000000cee98000 (usable)
[ 0.000000] BIOS-e820: 00000000cee98000 - 00000000ceebf000 (reserved)
[ 0.000000] BIOS-e820: 00000000ceebf000 - 00000000cef48000 (usable)
[ 0.000000] BIOS-e820: 00000000cef48000 - 00000000cefbf000 (ACPI NVS)
[ 0.000000] BIOS-e820: 00000000cefbf000 - 00000000ceff1000 (usable)
[ 0.000000] BIOS-e820: 00000000ceff1000 - 00000000cefff000 (ACPI data)
[ 0.000000] BIOS-e820: 00000000cefff000 - 00000000cf000000 (usable)
[ 0.000000] BIOS-e820: 00000000cf000000 - 00000000d0000000 (reserved)
[ 0.000000] BIOS-e820: 00000000f8000000 - 00000000fc000000 (reserved)
[ 0.000000] BIOS-e820: 00000000fff00000 - 0000000100000000 (reserved)
[ 0.000000] BIOS-e820: 0000000100000000 - 0000000130000000 (usable)
[ 0.000000] Notice: NX (Execute Disable) protection cannot be enabled: non-PAE kernel!
[ 0.000000] DMI 2.5 present.
[ 0.000000] last_pfn = 0xcf000 max_arch_pfn = 0x100000
[ 0.000000] x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
[ 0.000000] found SMP MP-table at [c00fbe50] fbe50
[ 0.000000] init_memory_mapping: 0000000000000000-00000000377fe000
[ 0.000000] ACPI: RSDP 000f2110 00024 (v02 INTEL )
[ 0.000000] ACPI: XSDT ceffe120 0004C (v01 INTEL MOPNV10N 000001F5 01000013)
[ 0.000000] ACPI: FACP ceffd000 000F4 (v03 INTEL MOPNV10N 000001F5 MSFT 0100000D)
[ 0.000000] ACPI: DSDT ceff9000 039A7 (v01 INTEL MOPNV10N 000001F5 MSFT 0100000D)
[ 0.000000] ACPI: FACS cef87000 00040
[ 0.000000] ACPI: APIC ceff8000 00084 (v02 INTEL MOPNV10N 000001F5 MSFT 0100000D)
[ 0.000000] ACPI: MCFG ceff7000 0003C (v01 INTEL MOPNV10N 000001F5 MSFT 0100000D)
[ 0.000000] ACPI: HPET ceff6000 00038 (v01 INTEL MOPNV10N 000001F5 MSFT 0100000D)
[ 0.000000] ACPI: SSDT ceff2000 0377C (v01 INTEL SSDT PM 000001F5 MSFT 0100000D)
[ 0.000000] 2424MB HIGHMEM available.
[ 0.000000] 887MB LOWMEM available.
[ 0.000000] mapped low ram: 0 - 377fe000
[ 0.000000] low ram: 0 - 377fe000
[ 0.000000] node 0 low ram: 00000000 - 377fe000
[ 0.000000] node 0 bootmap 00008000 - 0000ef00
[ 0.000000] (11/32 early reservations) ==> bootmem [0000000000 - 00377fe000]
[ 0.000000] #0 [0000001000 - 0000002000] EX TRAMPOLINE ==> [0000001000 - 0000002000]
[ 0.000000] #1 [0001000000 - 0001e28b84] TEXT DATA BSS ==> [0001000000 - 0001e28b84]
[ 0.000000] #2 [0001e29000 - 0001e31162] BRK ==> [0001e29000 - 0001e31162]
[ 0.000000] #3 [000009ec00 - 00000fbe50] BIOS reserved ==> [000009ec00 - 00000fbe50]
[ 0.000000] #4 [00000fbe50 - 00000fbe60] MP-table mpf ==> [00000fbe50 - 00000fbe60]
[ 0.000000] #5 [00000fbea0 - 0000100000] BIOS reserved ==> [00000fbea0 - 0000100000]
[ 0.000000] #6 [00000fbe60 - 00000fbea0] MP-table mpc ==> [00000fbe60 - 00000fbea0]
[ 0.000000] #7 [0000002000 - 0000003000] TRAMPOLINE ==> [0000002000 - 0000003000]
[ 0.000000] #8 [0000003000 - 0000007000] ACPI WAKEUP ==> [0000003000 - 0000007000]
[ 0.000000] #9 [0000007000 - 0000008000] PGTABLE ==> [0000007000 - 0000008000]
[ 0.000000] #10 [0000008000 - 000000f000] BOOTMAP ==> [0000008000 - 000000f000]
[ 0.000000] Zone PFN ranges:
[ 0.000000] DMA 0x00000001 -> 0x00001000
[ 0.000000] Normal 0x00001000 -> 0x000377fe
[ 0.000000] HighMem 0x000377fe -> 0x000cf000
[ 0.000000] Movable zone start PFN for each node
[ 0.000000] early_node_map[6] active PFN ranges
[ 0.000000] 0: 0x00000001 -> 0x0000008f
[ 0.000000] 0: 0x00000090 -> 0x0000009e
[ 0.000000] 0: 0x00000100 -> 0x000cee98
[ 0.000000] 0: 0x000ceebf -> 0x000cef48
[ 0.000000] 0: 0x000cefbf -> 0x000ceff1
[ 0.000000] 0: 0x000cefff -> 0x000cf000
[ 0.000000] Using APIC driver default
[ 0.000000] ACPI: PM-Timer IO Port: 0x408
[ 0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
[ 0.000000] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] disabled)
[ 0.000000] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x02] enabled)
[ 0.000000] ACPI: LAPIC (acpi_id[0x04] lapic_id[0x03] disabled)
[ 0.000000] ACPI: LAPIC_NMI (acpi_id[0x01] high level lint[0x1])
[ 0.000000] ACPI: LAPIC_NMI (acpi_id[0x02] high level lint[0x1])
[ 0.000000] ACPI: LAPIC_NMI (acpi_id[0x03] high level lint[0x1])
[ 0.000000] ACPI: LAPIC_NMI (acpi_id[0x04] high level lint[0x1])
[ 0.000000] ACPI: IOAPIC (id[0x08] address[0xfec00000] gsi_base[0])
[ 0.000000] IOAPIC[0]: apic_id 8, version 32, address 0xfec00000, GSI 0-23
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
[ 0.000000] Using ACPI (MADT) for SMP configuration information
[ 0.000000] ACPI: HPET id: 0x8086a201 base: 0xfed00000
[ 0.000000] SMP: Allowing 4 CPUs, 2 hotplug CPUs
[ 0.000000] PM: Registered nosave memory: 000000000008f000 - 0000000000090000
[ 0.000000] PM: Registered nosave memory: 000000000009e000 - 000000000009f000
[ 0.000000] PM: Registered nosave memory: 000000000009f000 - 00000000000a0000
[ 0.000000] PM: Registered nosave memory: 00000000000a0000 - 00000000000e0000
[ 0.000000] PM: Registered nosave memory: 00000000000e0000 - 0000000000100000
[ 0.000000] Allocating PCI resources starting at d0000000 (gap: d0000000:28000000)
[ 0.000000] setup_percpu: NR_CPUS:32 nr_cpumask_bits:32 nr_cpu_ids:4 nr_node_ids:1
[ 0.000000] PERCPU: Embedded 14 pages/cpu @c3c00000 s34752 r0 d22592 u1048576
[ 0.000000] pcpu-alloc: s34752 r0 d22592 u1048576 alloc=1*4194304
[ 0.000000] pcpu-alloc: [0] 0 1 2 3
[ 0.000000] Built 1 zonelists in Zone order, mobility grouping on. Total pages: 840975
[ 0.000000] Kernel command line: BOOT_IMAGE=linux-35-7 ro root=807 console=tty0 console=ttyS0,38400
[ 0.000000] PID hash table entries: 4096 (order: 2, 16384 bytes)
[ 0.000000] Dentry cache hash table entries: 131072 (order: 7, 524288 bytes)
[ 0.000000] Inode-cache hash table entries: 65536 (order: 6, 262144 bytes)
[ 0.000000] Enabling fast FPU save and restore... done.
[ 0.000000] Enabling unmasked SIMD FPU exception support... done.
[ 0.000000] Initializing CPU#0
[ 0.000000] Initializing HighMem for node 0 (000377fe:000cf000)
[ 0.000000] Memory: 3348252k/3391488k available (9749k kernel code, 42148k reserved, 3152k data, 596k init, 2481496k highmem)
[ 0.000000] virtual kernel memory layout:
[ 0.000000] fixmap : 0xffd37000 - 0xfffff000 (2848 kB)
[ 0.000000] pkmap : 0xff800000 - 0xffc00000 (4096 kB)
[ 0.000000] vmalloc : 0xf7ffe000 - 0xff7fe000 ( 120 MB)
[ 0.000000] lowmem : 0xc0000000 - 0xf77fe000 ( 887 MB)
[ 0.000000] .init : 0xc1c9a000 - 0xc1d2f000 ( 596 kB)
[ 0.000000] .data : 0xc19854a9 - 0xc1c9961c (3152 kB)
[ 0.000000] .text : 0xc1000000 - 0xc19854a9 (9749 kB)
[ 0.000000] Checking if this processor honours the WP bit even in supervisor mode...Ok.
[ 0.000000] SLUB: Genslabs=13, HWalign=64, Order=0-3, MinObjects=0, CPUs=4, Nodes=1
[ 0.000000] Hierarchical RCU implementation.
[ 0.000000] RCU-based detection of stalled CPUs is disabled.
[ 0.000000] Verbose stalled-CPUs detection is disabled.
[ 0.000000] NR_IRQS:1280
[ 0.000000] Extended CMOS year: 2000
[ 0.000000] Console: colour dummy device 80x25
[ 0.000000] console [tty0] enabled
[ 0.000000] console [ttyS0] enabled
[ 0.000000] Fast TSC calibration using PIT
[ 0.000000] Detected 1662.935 MHz processor.
[ 0.002008] Calibrating delay loop (skipped), value calculated using timer frequency.. 3325.87 BogoMIPS (lpj=1662935)
[ 0.004006] pid_max: default: 32768 minimum: 301
[ 0.005047] Security Framework initialized
[ 0.006021] Mount-cache hash table entries: 512
[ 0.008214] CPU: Physical Processor ID: 0
[ 0.009007] CPU: Processor Core ID: 0
[ 0.010008] mce: CPU supports 5 MCE banks
[ 0.011012] CPU0: Thermal monitoring enabled (TM1)
[ 0.012007] using mwait in idle threads.
[ 0.013014] Performance Events: PEBS fmt0+, Atom events, Intel PMU driver.
[ 0.017009] ... version: 3
[ 0.018005] ... bit width: 40
[ 0.019005] ... generic registers: 2
[ 0.020005] ... value mask: 000000ffffffffff
[ 0.021005] ... max period: 000000007fffffff
[ 0.022005] ... fixed-purpose events: 3
[ 0.023005] ... event mask: 0000000700000003
[ 0.025083] ACPI: Core revision 20100428
[ 0.042016] ftrace: converting mcount calls to 0f 1f 44 00 00
[ 0.043009] ftrace: allocating 31719 entries in 63 pages
[ 0.045080] Enabling APIC mode: Flat. Using 1 I/O APICs
[ 0.046449] ..TIMER: vector=0x30 apic1=0 pin1=2 apic2=-1 pin2=-1
[ 0.057822] CPU0: Intel(R) Atom(TM) CPU D510 @ 1.66GHz stepping 0a
[ 0.060999] Booting Node 0, Processors #1
[ 0.002999] Initializing CPU#1
[ 0.134018] Brought up 2 CPUs
[ 0.135006] Total of 2 processors activated (6651.04 BogoMIPS).
[ 0.136348] devtmpfs: initialized
[ 0.138227] xor: automatically using best checksumming function: pIII_sse
[ 0.163865] pIII_sse : 5092.000 MB/sec
[ 0.176004] xor: using function: pIII_sse (5092.000 MB/sec)
[ 0.193011] atomic64 test passed for i586+ platform with CX8 and with SSE
[ 0.213082] NET: Registered protocol family 16
[ 0.227079] ACPI FADT declares the system doesn't support PCIe ASPM, so disable it
[ 0.250006] ACPI: bus type pci registered
[ 0.262096] PCI: MMCONFIG for domain 0000 [bus 00-3f] at [mem 0xf8000000-0xfbffffff] (base 0xf8000000)
[ 0.290008] PCI: MMCONFIG at [mem 0xf8000000-0xfbffffff] reserved in E820
[ 0.310003] PCI: Using MMCONFIG for extended config space
[ 0.326004] PCI: Using configuration type 1 for base access
[ 0.343057] mtrr: your CPUs had inconsistent variable MTRR settings
[ 0.362004] mtrr: probably your BIOS does not setup all CPUs.
[ 0.379003] mtrr: corrected configuration.
[ 0.407095] bio: create slab <bio-0> at 0
[ 0.432140] ACPI: BIOS _OSI(Linux) query ignored
[ 0.448118] ACPI: Interpreter enabled
[ 0.459007] ACPI: (supports S0 S1 S3 S4 S5)
[ 0.471833] ACPI: Using IOAPIC for interrupt routing
[ 0.499310] ACPI: No dock devices found.
[ 0.512010] PCI: Using host bridge windows from ACPI; if necessary, use "pci=nocrs" and report a bug
[ 0.540380] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-3f])
[ 0.560123] pci_root PNP0A03:00: host bridge window [io 0x0000-0x0cf7]
[ 0.580007] pci_root PNP0A03:00: host bridge window [io 0x0d00-0xffff]
[ 0.600009] pci_root PNP0A03:00: host bridge window [mem 0x000a0000-0x000bffff]
[ 0.622006] pci_root PNP0A03:00: host bridge window [mem 0xd0000000-0xf7ffffff]
[ 0.647329] pci 0000:00:1c.0: PCI bridge to [bus 01-01]
[ 0.663099] pci 0000:00:1c.1: PCI bridge to [bus 02-02]
[ 0.678098] pci 0000:00:1c.2: PCI bridge to [bus 03-03]
[ 0.694095] pci 0000:00:1c.3: PCI bridge to [bus 04-04]
[ 0.710122] pci 0000:00:1e.0: PCI bridge to [bus 05-05] (subtractive decode)
[ 0.742184] ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 5 7 9 10 *11 12)
[ 0.762774] ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 5 7 9 10 11 12) *0, disabled.
[ 0.786200] ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 5 7 *9 10 11 12)
[ 0.806770] ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 5 7 9 10 *11 12)
[ 0.827139] ACPI: PCI Interrupt Link [LNKE] (IRQs 3 4 5 7 9 10 11 12) *0, disabled.
[ 0.850764] ACPI: PCI Interrupt Link [LNKF] (IRQs 3 4 5 7 9 10 11 12) *0, disabled.
[ 0.874568] ACPI: PCI Interrupt Link [LNKG] (IRQs 3 4 5 7 *9 10 11 12)
[ 0.894769] ACPI: PCI Interrupt Link [LNKH] (IRQs 3 4 5 7 9 *10 11 12)
[ 0.915120] HEST: Table is not found!
[ 0.926102] vgaarb: device added: PCI:0000:00:02.0,decodes=io+mem,owns=io+mem,locks=none
[ 0.950019] vgaarb: loaded
[ 0.959059] SCSI subsystem initialized
[ 0.970077] usbcore: registered new interface driver usbfs
[ 0.987081] usbcore: registered new interface driver hub
[ 1.003058] usbcore: registered new device driver usb
[ 1.035177] raid6: int32x1 175 MB/s
[ 1.063073] raid6: int32x2 238 MB/s
[ 1.090887] raid6: int32x4 269 MB/s
[ 1.119032] raid6: int32x8 304 MB/s
[ 1.146939] raid6: mmxx1 375 MB/s
[ 1.174881] raid6: mmxx2 734 MB/s
[ 1.202902] raid6: sse1x1 304 MB/s
[ 1.230890] raid6: sse1x2 570 MB/s
[ 1.258866] raid6: sse2x1 597 MB/s
[ 1.286906] raid6: sse2x2 1132 MB/s
[ 1.298003] raid6: using algorithm sse2x2 (1132 MB/s)
[ 1.313021] PCI: Using ACPI for IRQ routing
[ 1.326145] HPET: 3 timers in total, 0 timers will be used for per-cpu timer
[ 1.347011] hpet0: at MMIO 0xfed00000, IRQs 2, 8, 0
[ 1.362005] hpet0: 3 comparators, 64-bit 14.318180 MHz counter
[ 1.382033] Switching to clocksource tsc
[ 1.399271] pnp: PnP ACPI init
[ 1.408471] ACPI: bus type pnp registered
[ 1.425399] pnp: PnP ACPI: found 14 devices
[ 1.437942] ACPI: ACPI bus type pnp unregistered
[ 1.451811] system 00:01: [mem 0xf8000000-0xfbffffff] has been reserved
[ 1.471629] system 00:01: [mem 0xfed14000-0xfed17fff] has been reserved
[ 1.491435] system 00:01: [mem 0xfed18000-0xfed18fff] has been reserved
[ 1.511245] system 00:01: [mem 0xfed19000-0xfed19fff] has been reserved
[ 1.534230] system 00:01: [mem 0xfed1c000-0xfed1ffff] has been reserved
[ 1.554063] system 00:01: [mem 0x000c0000-0x000dffff] could not be reserved
[ 1.578056] system 00:01: [mem 0x000e0000-0x000fffff] could not be reserved
[ 1.598903] system 00:01: [mem 0xfff00000-0xffffffff] has been reserved
[ 1.621897] system 00:06: [io 0x0500-0x053f] has been reserved
[ 1.639642] system 00:06: [io 0x0400-0x047f] has been reserved
[ 1.657367] system 00:06: [io 0x0680-0x06ff] has been reserved
[ 1.678272] system 00:06: [io 0x0295-0x0296] has been reserved
[ 1.738541] pci 0000:01:00.0: no compatible bridge window for [mem 0xfffe0000-0xffffffff pref]
[ 1.764411] pci 0000:00:1c.0: BAR 8: assigned [mem 0xe0300000-0xe06fffff]
[ 1.784753] pci 0000:00:1c.1: BAR 8: assigned [mem 0xe0700000-0xe08fffff]
[ 1.805107] pci 0000:00:1c.1: BAR 9: assigned [mem 0xe0900000-0xe0afffff 64bit pref]
[ 1.828296] pci 0000:00:1c.2: BAR 8: assigned [mem 0xe0b00000-0xe0cfffff]
[ 1.848649] pci 0000:00:1c.2: BAR 9: assigned [mem 0xe0d00000-0xe0efffff 64bit pref]
[ 1.871837] pci 0000:00:1c.3: BAR 8: assigned [mem 0xe0f00000-0xe10fffff]
[ 1.892167] pci 0000:00:1c.3: BAR 9: assigned [mem 0xe1100000-0xe12fffff 64bit pref]
[ 1.915382] pci 0000:00:1c.1: BAR 7: assigned [io 0x3000-0x3fff]
[ 1.933659] pci 0000:00:1c.2: BAR 7: assigned [io 0x4000-0x4fff]
[ 1.951932] pci 0000:00:1c.3: BAR 7: assigned [io 0x5000-0x5fff]
[ 1.970181] pci 0000:01:00.0: BAR 6: assigned [mem 0xe0020000-0xe003ffff pref]
[ 1.991810] pci 0000:00:1c.0: PCI bridge to [bus 01-01]
[ 2.007464] pci 0000:00:1c.0: bridge window [io 0x1000-0x1fff]
[ 2.025736] pci 0000:00:1c.0: bridge window [mem 0xe0300000-0xe06fffff]
[ 2.046091] pci 0000:00:1c.0: bridge window [mem 0xe0000000-0xe00fffff 64bit pref]
[ 2.069309] pci 0000:00:1c.1: PCI bridge to [bus 02-02]
[ 2.084978] pci 0000:00:1c.1: bridge window [io 0x3000-0x3fff]
[ 2.103231] pci 0000:00:1c.1: bridge window [mem 0xe0700000-0xe08fffff]
[ 2.123584] pci 0000:00:1c.1: bridge window [mem 0xe0900000-0xe0afffff 64bit pref]
[ 2.146802] pci 0000:00:1c.2: PCI bridge to [bus 03-03]
[ 2.162470] pci 0000:00:1c.2: bridge window [io 0x4000-0x4fff]
[ 2.180724] pci 0000:00:1c.2: bridge window [mem 0xe0b00000-0xe0cfffff]
[ 2.201079] pci 0000:00:1c.2: bridge window [mem 0xe0d00000-0xe0efffff 64bit pref]
[ 2.224297] pci 0000:00:1c.3: PCI bridge to [bus 04-04]
[ 2.239965] pci 0000:00:1c.3: bridge window [io 0x5000-0x5fff]
[ 2.258217] pci 0000:00:1c.3: bridge window [mem 0xe0f00000-0xe10fffff]
[ 2.278546] pci 0000:00:1c.3: bridge window [mem 0xe1100000-0xe12fffff 64bit pref]
[ 2.301739] pci 0000:00:1e.0: PCI bridge to [bus 05-05]
[ 2.317406] pci 0000:00:1e.0: bridge window [io disabled]
[ 2.334360] pci 0000:00:1e.0: bridge window [mem disabled]
[ 2.351307] pci 0000:00:1e.0: bridge window [mem pref disabled]
[ 2.369582] pci 0000:00:1c.0: PCI INT A -> GSI 17 (level, low) -> IRQ 17
[ 2.389671] pci 0000:00:1c.1: enabling device (0000 -> 0003)
[ 2.406631] pci 0000:00:1c.1: PCI INT B -> GSI 16 (level, low) -> IRQ 16
[ 2.426740] pci 0000:00:1c.2: enabling device (0000 -> 0003)
[ 2.443702] pci 0000:00:1c.2: PCI INT C -> GSI 18 (level, low) -> IRQ 18
[ 2.463814] pci 0000:00:1c.3: enabling device (0000 -> 0003)
[ 2.480774] pci 0000:00:1c.3: PCI INT D -> GSI 19 (level, low) -> IRQ 19
[ 2.501063] NET: Registered protocol family 2
[ 2.514200] IP route cache hash table entries: 32768 (order: 5, 131072 bytes)
[ 2.535779] TCP established hash table entries: 131072 (order: 8, 1048576 bytes)
[ 2.562145] TCP bind hash table entries: 65536 (order: 7, 524288 bytes)
[ 2.582407] TCP: Hash tables configured (established 131072 bind 65536)
[ 2.605302] TCP reno registered
[ 2.614729] UDP hash table entries: 512 (order: 2, 16384 bytes)
[ 2.632494] UDP-Lite hash table entries: 512 (order: 2, 16384 bytes)
[ 2.654747] NET: Registered protocol family 1
[ 2.667984] RPC: Registered udp transport module.
[ 2.682082] RPC: Registered tcp transport module.
[ 2.699290] RPC: Registered tcp NFSv4.1 backchannel transport module.
[ 2.735491] audit: initializing netlink socket (disabled)
[ 2.751751] type=2000 audit(1293532264.750:1): initialized
[ 2.768513] highmem bounce pool size: 64 pages
[ 2.788170] VFS: Disk quotas dquot_6.5.2
[ 2.800167] Dquot-cache hash table entries: 1024 (order 0, 4096 bytes)
[ 2.820399] DLM (built Oct 11 2010 14:46:35) installed
[ 2.838323] Installing knfsd (copyright (C) 1996 okir@monad.swb.de).
[ 2.858039] NTFS driver 2.1.29 [Flags: R/W].
[ 2.871208] ROMFS MTD (C) 2007 Red Hat, Inc.
[ 2.884544] JFS: nTxBlock = 8192, nTxLock = 65536
[ 2.902216] SGI XFS with ACLs, security attributes, large block/inode numbers, no debug enabled
[ 2.929857] SGI XFS Quota Management subsystem
[ 2.943214] OCFS2 1.5.0
[ 2.950998] ocfs2: Registered cluster interface o2cb
[ 2.966067] ocfs2: Registered cluster interface user
[ 2.980975] OCFS2 DLMFS 1.5.0
[ 2.990095] OCFS2 User DLM kernel interface loaded
[ 3.004467] OCFS2 Node Manager 1.5.0
[ 3.015397] OCFS2 DLM 1.5.0
[ 3.024535] Btrfs loaded
[ 3.033052] Slow work thread pool: Starting up
[ 3.046457] Slow work thread pool: Ready
[ 3.058243] GFS2 (built Oct 11 2010 14:47:37) installed
[ 3.073908] msgmni has been set to 1692
[ 3.085740] alg: No test for cipher_null (cipher_null-generic)
[ 3.103280] alg: No test for ecb(cipher_null) (ecb-cipher_null)
[ 3.121074] alg: No test for digest_null (digest_null-generic)
[ 3.138600] alg: No test for compress_null (compress_null-generic)
[ 3.159263] alg: No test for fcrypt (fcrypt-generic)
[ 3.177855] alg: No test for stdrng (krng)
[ 3.190196] async_tx: api initialized (async)
[ 3.203422] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 254)
[ 3.225590] io scheduler noop registered
[ 3.237358] io scheduler deadline registered
[ 3.250200] io scheduler cfq registered (default)
[ 3.266403] pci_hotplug: PCI Hot Plug PCI Core version: 0.5
[ 3.284983] vesafb: framebuffer at 0xd0000000, mapped to 0xf8580000, using 3072k, total 8128k
[ 3.310594] vesafb: mode is 1024x768x16, linelength=2048, pages=4
[ 3.328863] vesafb: scrolling: redraw
[ 3.339832] vesafb: Truecolor: size=0:5:6:5, shift=0:11:5:0
[ 3.376638] Console: switching to colour frame buffer device 128x48
[ 3.413894] fb0: VESA VGA frame buffer device

continued on next posting...

aaazen 12-28-2010 03:55 PM

--current, randomly timed kernel oops on bootup of two test boxen
 
Here is the second part of my last posting:

Quote:

[ 3.427339] ERST: Table is not found!
[ 3.438491] isapnp: Scanning for PnP cards...
[ 3.764390] isapnp: No Plug & Play device found
[ 3.785395] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled
ÿ[ 4.048284] serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
[ 4.311283] serial8250: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A
[ 4.330427] 00:0a: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
[ 4.347693] 00:0b: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A
[ 7.376028] floppy0: no floppy controllers found
[ 7.394236] brd: module loaded
[ 7.405601] loop: module loaded
[ 7.415138] Compaq SMART2 Driver (v 2.6.0)
[ 7.427722] HP CISS Driver (v 3.6.20)
[ 7.440183] Loading iSCSI transport class v2.0-870.
[ 7.455998] fnic: Cisco FCoE HBA Driver, ver 1.4.0.145
[ 7.475227] Adaptec aacraid driver 1.1-5[26400]-ms
[ 7.490100] aic94xx: Adaptec aic94xx SAS/SATA driver version 1.0.3 loaded
[ 7.517375] scsi: <fdomain> Detection failed (no card)
[ 7.539366] sym53c416.c: Version 1.0.0-ac
[ 7.557970] qlogicfas: no cards were found, please specify I/O address and IRQ using iobase= and irq= options
[ 7.587658] QLogic Fibre Channel HBA Driver: 8.03.02-k2
[ 7.617755] iscsi: registered transport (qla4xxx)
[ 7.638899] QLogic iSCSI HBA Driver
[ 7.656248] Emulex LightPulse Fibre Channel SCSI driver 8.3.12
[ 7.680724] Copyright(c) 2004-2009 Emulex. All rights reserved.
[ 7.705893] Brocade BFA FC/FCOE SCSI driver - version: 2.1.2.1
[ 7.752022] Failed initialization of WD-7000 SCSI card!
[ 7.862177] DC390: clustering now enabled by default. If you get problems load
[ 7.891139] with "disable_clustering=1" and report to maintainers
[ 7.918801] megaraid cmm: 2.20.2.7 (Release Date: Sun Jul 16 00:01:03 EST 2006)
[ 7.948276] megaraid: 2.20.5.1 (Release Date: Thu Nov 16 15:32:35 EST 2006)
[ 7.976625] megasas: 00.00.04.17.1-rc1 Thu. Oct. 29, 11:41:51 PST 2009
[ 8.003718] mpt2sas version 05.100.00.02 loaded
[ 8.025053] GDT-HA: Storage RAID Controller Driver. Version: 3.05
[ 8.051187] 3ware Storage Controller device driver for Linux v1.26.02.003.
[ 8.079538] 3ware 9000 Storage Controller device driver for Linux v2.26.02.014.
[ 8.109220] LSI 3ware SAS/SATA-RAID Controller device driver for Linux v3.26.02.000.
[ 8.140240] ipr: IBM Power RAID SCSI Device Driver version: 2.5.0 (February 11, 2010)
[ 8.171549] RocketRAID 3xxx/4xxx Controller driver v1.6 (090910)
[ 8.197372] stex: Promise SuperTrak EX Driver version: 4.6.0000.4
[ 8.223769] st: Version 20081215, fixed bufsize 32768, s/g segs 256
[ 8.251468] ata_piix 0000:00:1f.2: PCI INT B -> GSI 19 (level, low) -> IRQ 19
[ 8.280723] ata_piix 0000:00:1f.2: MAP [ P0 P2 P1 P3 ]
[ 8.455150] scsi2 : ata_piix
[ 8.471869] scsi3 : ata_piix
[ 8.488621] ata1: SATA max UDMA/133 cmd 0x20b8 ctl 0x20cc bmdma 0x20a0 irq 19
[ 8.517829] ata2: SATA max UDMA/133 cmd 0x20b0 ctl 0x20c8 bmdma 0x20a8 irq 19
[ 8.552357] scsi4 : pata_legacy
[ 8.569711] ata3: PATA max PIO4 cmd 0x1f0 ctl 0x3f6 irq 14
[ 8.701355] ata1.00: ATAPI: ATAPI iHAS324 Y, BL1W, max UDMA/100
[ 8.713266] ata2.00: ATA-8: WDC WD10EADS-00M2B0, 01.00A01, max UDMA/133
[ 8.713274] ata2.00: 1953525168 sectors, multi 16: LBA48 NCQ (depth 0/32)
[ 8.784236] ata2.00: configured for UDMA/133
[ 8.790371] ata1.00: configured for UDMA/100
[ 8.827252] scsi 2:0:0:0: CD-ROM ATAPI iHAS324 Y BL1W PQ: 0 ANSI: 5
[ 8.864740] sr0: scsi3-mmc drive: 48x/48x writer dvd-ram cd/rw xa/form2 cdda tray
[ 8.895420] Uniform CD-ROM driver Revision: 3.20
[ 8.918218] scsi 3:0:0:0: Direct-Access ATA WDC WD10EADS-00M 01.0 PQ: 0 ANSI: 5
[ 8.951431] sd 3:0:0:0: [sda] 1953525168 512-byte logical blocks: (1.00 TB/931 GiB)
[ 8.983049] sd 3:0:0:0: [sda] Write Protect is off
[ 9.005789] sd 3:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 9.041695] sda: sda1 sda2 sda3 sda4 < sda5 sda6 sda7 sda8 sda9 sda10 sda11 >
[ 9.155989] sda2: <openbsd: sda12 sda13bad subpartition - ignored
[ 9.182994] bad subpartition - ignored
[ 9.202465] bad subpartition - ignored
[ 9.221824] bad subpartition - ignored
[ 9.241051] bad subpartition - ignored
[ 9.260091] bad subpartition - ignored
[ 9.278981] bad subpartition - ignored
[ 9.297695] bad subpartition - ignored
[ 9.316223] >
[ 9.343720] sd 3:0:0:0: [sda] Attached SCSI disk
[ 9.365240] scsi5 : pata_legacy
[ 9.381860] ata4: PATA max PIO4 cmd 0x170 ctl 0x376 irq 15
[ 9.556390] I2O subsystem v1.325
[ 9.572988] i2o: max drivers = 8
[ 9.589800] I2O Configuration OSM v1.323
[ 9.608377] I2O Bus Adapter OSM v1.317
[ 9.626287] I2O Block Device OSM v1.325
[ 9.644575] I2O SCSI Peripheral OSM v1.316
[ 9.663261] I2O ProcFS OSM v1.316
[ 9.679478] Fusion MPT base driver 3.04.15
[ 9.697885] Copyright (c) 1999-2008 LSI Corporation
[ 9.718661] Fusion MPT SPI Host driver 3.04.15
[ 9.738145] Fusion MPT FC Host driver 3.04.15
[ 9.757304] Fusion MPT SAS Host driver 3.04.15
[ 9.776681] Fusion MPT misc device (ioctl) driver 3.04.15
[ 9.799116] mptctl: Registered with Fusion MPT base driver
[ 9.821627] mptctl: /dev/mptctl @ (major,minor=10,220)
[ 9.843193] Fusion MPT LAN driver 3.04.15
[ 9.861474] ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
[ 9.887289] ehci_hcd 0000:00:1d.7: PCI INT A -> GSI 23 (level, low) -> IRQ 23
[ 9.914931] ehci_hcd 0000:00:1d.7: EHCI Host Controller
[ 9.936929] ehci_hcd 0000:00:1d.7: new USB bus registered, assigned bus number 1
[ 9.965335] ehci_hcd 0000:00:1d.7: using broken periodic workaround
[ 9.990286] ehci_hcd 0000:00:1d.7: debug port 1
[ 10.013948] ehci_hcd 0000:00:1d.7: irq 23, io mem 0xe0284400
[ 10.046021] ehci_hcd 0000:00:1d.7: USB 2.0 started, EHCI 1.00
[ 10.069213] usb usb1: New USB device found, idVendor=1d6b, idProduct=0002
[ 10.095563] usb usb1: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 10.123136] usb usb1: Product: EHCI Host Controller
[ 10.143571] usb usb1: Manufacturer: Linux 2.6.35.7-smp ehci_hcd
[ 10.167134] usb usb1: SerialNumber: 0000:00:1d.7
[ 10.186994] hub 1-0:1.0: USB hub found
[ 10.203750] hub 1-0:1.0: 8 ports detected
[ 10.221529] 116x: driver isp116x-hcd, 03 Nov 2005
[ 10.241069] driver isp1362-hcd, 2005-04-04
[ 10.258853] ohci_hcd: USB 1.1 'Open' Host Controller (OHCI) Driver
[ 10.283187] uhci_hcd: USB Universal Host Controller Interface driver
[ 10.308170] uhci_hcd 0000:00:1d.0: PCI INT A -> GSI 23 (level, low) -> IRQ 23
[ 10.335670] uhci_hcd 0000:00:1d.0: UHCI Host Controller
[ 10.357579] uhci_hcd 0000:00:1d.0: new USB bus registered, assigned bus number 2
[ 10.386075] uhci_hcd 0000:00:1d.0: irq 23, io base 0x00002080
[ 10.409686] usb usb2: New USB device found, idVendor=1d6b, idProduct=0001
[ 10.436451] usb usb2: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 10.464542] usb usb2: Product: UHCI Host Controller
[ 10.485584] usb usb2: Manufacturer: Linux 2.6.35.7-smp uhci_hcd
[ 10.509760] usb usb2: SerialNumber: 0000:00:1d.0
[ 10.530385] hub 2-0:1.0: USB hub found
[ 10.548108] hub 2-0:1.0: 2 ports detected
[ 10.566772] uhci_hcd 0000:00:1d.1: PCI INT B -> GSI 19 (level, low) -> IRQ 19
[ 10.594821] uhci_hcd 0000:00:1d.1: UHCI Host Controller
[ 10.617268] uhci_hcd 0000:00:1d.1: new USB bus registered, assigned bus number 3
[ 10.646230] uhci_hcd 0000:00:1d.1: irq 19, io base 0x00002060
[ 10.670287] usb usb3: New USB device found, idVendor=1d6b, idProduct=0001
[ 10.697455] usb usb3: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 10.725946] usb usb3: Product: UHCI Host Controller
[ 10.747428] usb usb3: Manufacturer: Linux 2.6.35.7-smp uhci_hcd
[ 10.772140] usb usb3: SerialNumber: 0000:00:1d.1
[ 10.793317] hub 3-0:1.0: USB hub found
[ 10.811454] hub 3-0:1.0: 2 ports detected
[ 10.830534] uhci_hcd 0000:00:1d.2: PCI INT C -> GSI 18 (level, low) -> IRQ 18
[ 10.858920] uhci_hcd 0000:00:1d.2: UHCI Host Controller
[ 10.881569] uhci_hcd 0000:00:1d.2: new USB bus registered, assigned bus number 4
[ 10.910657] uhci_hcd 0000:00:1d.2: irq 18, io base 0x00002040
[ 10.934797] usb usb4: New USB device found, idVendor=1d6b, idProduct=0001
[ 10.962057] usb usb4: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 10.990622] usb usb4: Product: UHCI Host Controller
[ 11.012168] usb usb4: Manufacturer: Linux 2.6.35.7-smp uhci_hcd
[ 11.036921] usb usb4: SerialNumber: 0000:00:1d.2
[ 11.058158] hub 4-0:1.0: USB hub found
[ 11.076312] hub 4-0:1.0: 2 ports detected
[ 11.095382] uhci_hcd 0000:00:1d.3: PCI INT D -> GSI 16 (level, low) -> IRQ 16
[ 11.123730] uhci_hcd 0000:00:1d.3: UHCI Host Controller
[ 11.146415] uhci_hcd 0000:00:1d.3: new USB bus registered, assigned bus number 5
[ 11.175592] uhci_hcd 0000:00:1d.3: irq 16, io base 0x00002020
[ 11.199829] usb usb5: New USB device found, idVendor=1d6b, idProduct=0001
[ 11.227279] usb usb5: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 11.256041] usb usb5: Product: UHCI Host Controller
[ 11.277774] usb usb5: Manufacturer: Linux 2.6.35.7-smp uhci_hcd
[ 11.302622] usb usb5: SerialNumber: 0000:00:1d.3
[ 11.323856] hub 5-0:1.0: USB hub found
[ 11.342041] hub 5-0:1.0: 2 ports detected
[ 11.361397] sl811: driver sl811-hcd, 19 May 2005
[ 11.382232] r8a66597_hcd: driver r8a66597_hcd, 2009-05-26
[ 11.405783] Initializing USB Mass Storage driver...
[ 11.427441] usbcore: registered new interface driver usb-storage
[ 11.452403] USB Mass Storage support registered.
[ 11.473254] usbcore: registered new interface driver ums-alauda
[ 11.498040] usbcore: registered new interface driver ums-cypress
[ 11.523011] usbcore: registered new interface driver ums-datafab
[ 11.547865] usbcore: registered new interface driver ums-freecom
[ 11.572696] usbcore: registered new interface driver ums-isd200
[ 11.597128] usbcore: registered new interface driver ums-jumpshot
[ 11.621981] usbcore: registered new interface driver ums-karma
[ 11.645927] usbcore: registered new interface driver ums-onetouch
[ 11.670580] usbcore: registered new interface driver ums-sddr09
[ 11.694536] usbcore: registered new interface driver ums-sddr55
[ 11.718417] usbcore: registered new interface driver ums-usbat
[ 11.741966] PNP: PS/2 Controller [PNP0303:PS2K,PNP0f03:PS2M] at 0x60,0x64 irq 1,12
[ 11.772896] serio: i8042 KBD port at 0x60,0x64 irq 1
[ 11.793782] serio: i8042 AUX port at 0x60,0x64 irq 12
[ 11.815210] mice: PS/2 mouse device common for all mice
[ 11.836992] md: linear personality registered for level -1
[ 11.859397] md: raid0 personality registered for level 0
[ 11.881271] md: raid1 personality registered for level 1
[ 11.903035] md: raid10 personality registered for level 10
[ 11.925307] md: raid6 personality registered for level 6
[ 11.925479] input: AT Translated Set 2 keyboard as /devices/platform/i8042/serio0/input/input0
[ 11.978815] md: raid5 personality registered for level 5
[ 12.000686] md: raid4 personality registered for level 4
[ 12.022428] md: multipath personality registered for level -4
[ 12.045980] device-mapper: uevent: version 1.0.3
[ 12.066125] device-mapper: ioctl: 4.17.0-ioctl (2010-03-05) initialised: dm-devel@redhat.com
[ 12.097947] cpuidle: using governor ladder
[ 12.116315] cpuidle: using governor menu
[ 12.134192] TCP cubic registered
[ 12.149759] Initializing XFRM netlink socket
[ 12.168466] NET: Registered protocol family 17
[ 12.187643] Using IPI No-Shortcut mode
[ 12.205111] registered taskstats version 1
[ 12.223778] md: Waiting for all devices to be available before autodetect
[ 12.249991] md: If you don't use raid, use raid=noautodetect
[ 12.273252] md: Autodetecting RAID arrays.
[ 12.291380] md: Scanned 0 and added 0 devices.
[ 12.310575] md: autorun ...
[ 12.324762] md: ... autorun DONE.
[ 12.356306] UDF-fs: No partition found (1)
[ 12.400971] VFS: Mounted root (jfs filesystem) readonly on device 8:7.
[ 12.443315] devtmpfs: mounted
[ 12.457991] Freeing unused kernel memory: 596k freed
[ 12.478988] Write protecting the kernel text: 9752k
[ 12.499548] Write protecting the kernel read-only data: 2508k
INIT: version 2.86 booting
proc on /proc type proc (rw)
sysfs on /sys type sysfs (rw)
[ 13.355523] udev[1140]: starting version 165
Triggering udev events: /sbin/udevadm trigger --action=add
[ 13.851622] sr 2:0:0:0: Attached scsi generic sg0 type 5
[ 13.873986] sd 3:0:0:0: Attached scsi generic sg1 type 0
[ 13.960782] Linux agpgart interface v0.103
[ 13.996180] BUG: unable to handle kernel paging request at fffb2000
[ 13.997003] IP: [<c14c27d8>] ioread32_rep+0x38/0x50
[ 13.997003] *pde = 01d3a067 *pte = 00000000
[ 13.997003] Oops: 0002 [#1] SMP
[ 13.997003] last sysfs file: /sys/bus/serio/drivers/psmouse/uevent
[ 13.997003] Modules linked in: snd_pcm psmouse agpgart snd_timer output evdev sg snd soundcore snd_page_alloc serio_raw
[ 13.997003]
[ 13.997003] Pid: 1182, comm: modprobe Not tainted 2.6.35.7-smp #2 D510MO/
[ 13.997003] EIP: 0060:[<c14c27d8>] EFLAGS: 00010006 CPU: 0
[ 13.997003] EIP is at ioread32_rep+0x38/0x50
[ 13.997003] EAX: 000020b8 EBX: 00000200 ECX: 00000031 EDX: 000020b8
[ 13.997003] ESI: fffb1ec4 EDI: fffb2000 EBP: f6b73b30 ESP: f6b73b2c
[ 13.997003] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
[ 13.997003] Process modprobe (pid: 1182, ti=f6b72000 task=f715e9a0 task.ti=f6b72000)
[ 13.997003] Stack:
[ 13.997003] 000120b8 f6b73b54 c179c371 f6910000 f6911444 f69100b8 00000000 f69100b8
[ 13.997003] <0> 00000000 c1c70380 f6b73b78 c179c171 00000000 fffb1000 00000046 00000ec4
[ 13.997003] <0> f69100b8 f69100b8 00000058 f6b73b88 c179c209 f6910000 f69100b8 f6b73be4
[ 13.997003] Call Trace:
[ 13.997003] [<c179c371>] ? ata_sff_data_xfer32+0x61/0x110
[ 13.997003] [<c179c171>] ? ata_pio_sector+0x101/0x130
[ 13.997003] [<c179c209>] ? ata_pio_sectors+0x69/0xa0
[ 13.997003] [<c179ceaa>] ? ata_sff_hsm_move+0x19a/0x760
[ 13.997003] [<c103183b>] ? task_tick_fair+0x3b/0x100
[ 13.997003] [<c179d63e>] ? __ata_sff_port_intr+0x9e/0x100
[ 13.997003] [<c179d6ce>] ? ata_bmdma_port_intr+0x2e/0x100
[ 13.997003] [<c179bc30>] ? ata_bmdma_error_handler+0x0/0x120
[ 13.997003] [<c179de9d>] ? ata_bmdma_interrupt+0xcd/0x1d0
[ 13.997003] [<c1084744>] ? handle_IRQ_event+0x44/0x150
[ 13.997003] [<c1044090>] ? __do_softirq+0xe0/0x1a0
[ 13.997003] [<c10868e3>] ? handle_fasteoi_irq+0x63/0xd0
[ 13.997003] [<c100559d>] ? handle_irq+0x1d/0x30
[ 13.997003] [<c198493c>] ? do_IRQ+0x4c/0xc0
[ 13.997003] [<c104445c>] ? irq_exit+0x3c/0x80
[ 13.997003] [<c1984a0b>] ? smp_apic_timer_interrupt+0x5b/0x8a
[ 13.997003] [<c1003570>] ? common_interrupt+0x30/0x38
[ 13.997003] [<c198007b>] ? migration_call+0x19b/0x28f
[ 13.997003] [<c106da97>] ? find_symbol_in_section+0x17/0xe0
[ 13.997003] [<c10b007b>] ? sync_page+0x2b/0x50
[ 13.997003] [<c106e24c>] ? each_symbol_in_section.clone.34+0x4c/0x80
[ 13.997003] [<c106da80>] ? find_symbol_in_section+0x0/0xe0
[ 13.997003] [<c106ea7f>] ? each_symbol+0x3f/0x1d0
[ 13.997003] [<c106da80>] ? find_symbol_in_section+0x0/0xe0
[ 13.997003] [<c105e162>] ? sched_clock_local+0xb2/0x1a0
[ 13.997003] [<c105e162>] ? sched_clock_local+0xb2/0x1a0
[ 13.997003] [<c106ec3d>] ? find_symbol+0x2d/0x60
[ 13.997003] [<c1030101>] ? __free_domain_allocs.clone.117+0x21/0x40
[ 13.997003] [<c1982939>] ? mutex_lock+0x19/0x40
[ 13.997003] [<c106f328>] ? resolve_symbol.clone.28+0x48/0xa0
[ 13.997003] [<c1070374>] ? load_module+0xcd4/0x1f00
[ 13.997003] [<c146f784>] ? security_file_permission+0x14/0x20
[ 13.997003] [<c10715e8>] ? sys_init_module+0x48/0x1e0
[ 13.997003] [<c10e0432>] ? sys_write+0x42/0x70
[ 13.997003] [<c198404c>] ? syscall_call+0x7/0xb
[ 13.997003] Code: 01 78 0f 8b 38 83 e9 01 89 3a 83 c2 04 83 f9 ff 75 f1 5f 5d c3 8d b6 00 00 00 00 3d 00 00 01 00 76 11 25 ff ff 00 00 89 d7 89 c2 <f3> 6d 5f 5d c3 8d 76 00 ba b6 34 b2 c1 e8 36 fe ff ff 5f 5d c3
[ 13.997003] EIP: [<c14c27d8>] ioread32_rep+0x38/0x50 SS:ESP 0068:f6b73b2c
[ 13.997003] CR2: 00000000fffb2000
[ 13.997003] ---[ end trace 73354c9f0f8fa49e ]---
[ 13.997003] Kernel panic - not syncing: Fatal exception in interrupt
[ 13.997003] Pid: 1182, comm: modprobe Tainted: G D 2.6.35.7-smp #2
[ 13.997003] Call Trace:
[ 13.997003] [<c198174b>] ? printk+0x1d/0x1f
[ 13.997003] [<c19816c4>] panic+0x5a/0xc4
[ 13.997003] [<c10068bf>] oops_end+0x8f/0x90
[ 13.997003] [<c1024fcc>] no_context+0xbc/0x150
[ 13.997003] [<c14bdfb4>] ? __const_udelay+0x34/0x40
[ 13.997003] [<c10250e5>] __bad_area_nosemaphore+0x85/0x130
[ 13.997003] [<c105e162>] ? sched_clock_local+0xb2/0x1a0
[ 13.997003] [<c1025420>] ? do_page_fault+0x0/0x370
[ 13.997003] [<c10251a7>] bad_area_nosemaphore+0x17/0x20
[ 13.997003] [<c10256f2>] do_page_fault+0x2d2/0x370
[ 13.997003] [<c10322bd>] ? enqueue_task_fair+0x14d/0x180
[ 13.997003] [<c1030360>] ? enqueue_task+0x50/0xa0
[ 13.997003] [<c1039016>] ? try_to_wake_up+0x96/0x2c0
[ 13.997003] [<c1025420>] ? do_page_fault+0x0/0x370
[ 13.997003] [<c198471b>] error_code+0x73/0x78
[ 13.997003] [<c14c27d8>] ? ioread32_rep+0x38/0x50
[ 13.997003] [<c179c371>] ata_sff_data_xfer32+0x61/0x110
[ 13.997003] [<c179c171>] ata_pio_sector+0x101/0x130
[ 13.997003] [<c179c209>] ata_pio_sectors+0x69/0xa0
[ 13.997003] [<c179ceaa>] ata_sff_hsm_move+0x19a/0x760
[ 13.997003] [<c103183b>] ? task_tick_fair+0x3b/0x100
[ 13.997003] [<c179d63e>] __ata_sff_port_intr+0x9e/0x100
[ 13.997003] [<c179d6ce>] ata_bmdma_port_intr+0x2e/0x100
[ 13.997003] [<c179bc30>] ? ata_bmdma_error_handler+0x0/0x120
[ 13.997003] [<c179de9d>] ata_bmdma_interrupt+0xcd/0x1d0
[ 13.997003] [<c1084744>] handle_IRQ_event+0x44/0x150
[ 13.997003] [<c1044090>] ? __do_softirq+0xe0/0x1a0
[ 13.997003] [<c10868e3>] handle_fasteoi_irq+0x63/0xd0
[ 13.997003] [<c100559d>] handle_irq+0x1d/0x30
[ 13.997003] [<c198493c>] do_IRQ+0x4c/0xc0
[ 13.997003] [<c104445c>] ? irq_exit+0x3c/0x80
[ 13.997003] [<c1984a0b>] ? smp_apic_timer_interrupt+0x5b/0x8a
[ 13.997003] [<c1003570>] common_interrupt+0x30/0x38
[ 13.997003] [<c198007b>] ? migration_call+0x19b/0x28f
[ 13.997003] [<c106da97>] ? find_symbol_in_section+0x17/0xe0
[ 13.997003] [<c10b007b>] ? sync_page+0x2b/0x50
[ 13.997003] [<c106e24c>] each_symbol_in_section.clone.34+0x4c/0x80
[ 13.997003] [<c106da80>] ? find_symbol_in_section+0x0/0xe0
[ 13.997003] [<c106ea7f>] each_symbol+0x3f/0x1d0
[ 13.997003] [<c106da80>] ? find_symbol_in_section+0x0/0xe0
[ 13.997003] [<c105e162>] ? sched_clock_local+0xb2/0x1a0
[ 13.997003] [<c105e162>] ? sched_clock_local+0xb2/0x1a0
[ 13.997003] [<c106ec3d>] find_symbol+0x2d/0x60
[ 13.997003] [<c1030101>] ? __free_domain_allocs.clone.117+0x21/0x40
[ 13.997003] [<c1982939>] ? mutex_lock+0x19/0x40
[ 13.997003] [<c106f328>] resolve_symbol.clone.28+0x48/0xa0
[ 13.997003] [<c1070374>] load_module+0xcd4/0x1f00
[ 13.997003] [<c146f784>] ? security_file_permission+0x14/0x20
[ 13.997003] [<c10715e8>] sys_init_module+0x48/0x1e0
[ 13.997003] [<c10e0432>] ? sys_write+0x42/0x70
[ 13.997003] [<c198404c>] syscall_call+0x7/0xb


smoooth103 12-28-2010 05:19 PM

I've also had two or three random hard freezes while running X11/KDE since either the Dec 19th or December 24th 32bit current update. I couldn't ctrl-alt-backspace to kill X or anything. I am wondering if it may be the dbus, udev,acpid, or maybe even phonon. It could be hardware but I doubt it...

aaazen 12-28-2010 09:12 PM

EdGr my crash looks much like your screen shot picture.

I am running Slackware current upgraded from 13.1 and the random crashes started when I upgraded the kernel.

The old kernel 2.6.33.4 was rock solid and never crashed like this.

The main similarity is the where the kernel appears to stop in the

ioread32_rep routine which is inside of lib/iomap.c

The key line to look for in a crash is:

EIP: [<c14c27d8>] ioread32_rep+0x38/0x50 SS:ESP 0068:f6b73b2c

EdGr 12-28-2010 09:22 PM

I think we're looking at the same bug, comet.berkeley.

I have re-booted my machines enough times today to have observed quite a few kernel panics. The panics occur only on my two 32-bit machines (my Slackware64 machine does not appear to have the bug).

There are several failure signatures. I think you are right about ioread32_rep being a common denominator.

Like smoooth103, I have just started seeing this bug after the Dec 19 or Dec 24 updates. My guess is the problem is either in udev-165 or acpid-2.0.7.
Ed

hitest 12-28-2010 09:54 PM

Quote:

Originally Posted by EdGr (Post 4205651)

I did a clean install of 32 bit Slackware-current using the December 24th build and I am also getting random kernel panics on the unit. Your screen shot looks like my unit on random boots or reboots. Odd.

Old_Fogie 12-28-2010 10:10 PM

Hello all thanks for the responses

@comet: that's impressive how you were able to get minicom to do that.

@Edgr: I cannot confirm that "ioread.." is my issue at this time, as I use "vga=normal" in my lilo. I'll change my lilo to use vga=791 and see if I get that ioread to display or not.

Edgr, I guestimate this issue arrived for me around the time udev was issued.

Worse comes to worse, I'll try and take a photo of my screen when it's crashed and upload it.

EdGr 12-29-2010 09:41 AM

I think I may have found what tickles the kernel panic.

I disabled the following change in udev-165:

http://git.kernel.org/?p=linux/hotpl...5cd45f44132d73

My edits were:

Code:

diff -rpE x/udev/udev-165/extras/ata_id/ata_id.c z/udev//udev-165/extras/ata_id/ata_id.c
*** x/udev/udev-165/extras/ata_id/ata_id.c      2010-11-09 16:30:53.000000000 -0800
--- z/udev//udev-165/extras/ata_id/ata_id.c    2010-12-29 05:49:32.000000000 -0800
*************** static int disk_identify (struct udev *u
*** 397,408 ****
--- 397,412 ----
 
        /* SPC-4, section 6.4.2: Standard INQUIRY data */
        peripheral_device_type = inquiry_buf[0] & 0x1f;
+
+        /* this code appears to be buggy
        if (peripheral_device_type == 0x05)
          {
            is_packet_device = 1;
            ret = disk_identify_packet_device_command(fd, out_identify, 512);
            goto check_nul_bytes;
          }
+        end buggy code */
+
        if (peripheral_device_type != 0x00) {
                ret = -1;
                errno = EIO;

I installed the new ata_id in /lib/udev

With this change, I haven't seen a crash in two dozen reboots.
Ed

EdGr 12-29-2010 11:40 AM

I can explain two more observations.

I just now saw the bug when ejecting the DVD at the end of a clean Slackware installation. I previously didn't notice this because I run with HAL disabled in normal use. Old_Fogie - you were right in that problems with ejecting DVDs and booting are the same.

Also, this bug occurs with IDE drives, which explains why my 64-bit machine with SATA drives works.
Ed

Old_Fogie 12-29-2010 08:51 PM

Hi Edgr, Yes this does happen with both ide (pata) and sata drives here for me. I just had another crash a moment ago, and tho I changed to vga=791 in lilo, I still cannot see if the "ioread.." is my issue or not. I'm at a complete loss here :(

EdGr 12-29-2010 09:31 PM

Hi Old_Fogie. I think ata_id is run if the machine has any IDE drives, and that can trigger the kernel panic.

If you are comfortable with recompiling udev-165, the patch that I listed above successfully prevents the kernel panic. It doesn't fix the underlying bug - it just removes the condition that tickles it. Only one program needs to be recompiled and re-installed: ata_id, a small helper program that is part of udev. The rest of udev doesn't need to be changed. I haven't noticed any problems with the patched ata_id.

Another user reported success rolling back to udev-164:

http://groups.google.com/group/alt.o...0948f4838772a#

He appears to have kept the Slackware udev-164-i486-3 package around.

If this is too much work, it may be best to wait for Pat to issue an update. I emailed Pat and David Z. at Redhat this morning. I haven't received a reply - people may be on vacation.

(I'm actually on vacation too.)
Ed


All times are GMT -5. The time now is 12:47 PM.