LinuxQuestions.org
Visit Jeremy's Blog.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Server
User Name
Password
Linux - Server This forum is for the discussion of Linux Software used in a server related context.

Notices


Reply
  Search this Thread
Old 07-01-2007, 04:06 PM   #1
Jubalint
Member
 
Registered: Mar 2004
Distribution: Debian
Posts: 35

Rep: Reputation: 15
Server Crashing


I'm running a Debian Etch server that is crashing once every few days. It's located in a data center and I'm having trouble diagnosing what the problem is.

I'm inexperienced in using Linux and am not sure how to go about trying to figure out the reason for the crash.

What happens is the system just is either hanging or crashing and not rebooting. I have to talk to the data center and get them to reboot the server manually.

I've looked in the /var/log/messages file but I'm not seeing anything strange and not sure where else to look.

I run a LAMP server managed by ISPConfig, ProFTPd, a SOCKS5 server (antinat) and a CS 1.6 game server. But even when the socks5 and 1.6 servers are off it will crash.

I'd appreciate any suggestions on how to diagnose this crash or what you think might be the problem. Thanks very much for the help .

Here is my syslog file if that helps:
Code:
Jul  1 06:47:02 onesimplehost syslogd 1.4.1#18: restart.
Jul  1 06:58:00 onesimplehost -- MARK --
Jul  1 07:18:01 onesimplehost -- MARK --
Jul  1 07:38:01 onesimplehost -- MARK --
Jul  1 07:58:01 onesimplehost -- MARK --
Jul  1 08:18:01 onesimplehost -- MARK --
Jul  1 08:38:01 onesimplehost -- MARK --
Jul  1 08:58:01 onesimplehost -- MARK --
Jul  1 09:18:02 onesimplehost -- MARK --
Jul  1 09:38:02 onesimplehost -- MARK --
Jul  1 09:58:02 onesimplehost -- MARK --
Jul  1 10:18:02 onesimplehost -- MARK --
Jul  1 10:38:03 onesimplehost -- MARK --
Jul  1 10:58:03 onesimplehost -- MARK --
Jul  1 11:18:03 onesimplehost -- MARK --
Jul  1 11:38:03 onesimplehost -- MARK --
Jul  1 11:58:03 onesimplehost -- MARK --
Jul  1 12:18:03 onesimplehost -- MARK --
Jul  1 12:38:04 onesimplehost -- MARK --
Jul  1 12:58:04 onesimplehost -- MARK --
Jul  1 13:18:04 onesimplehost -- MARK --
Jul  1 13:38:04 onesimplehost -- MARK --
Jul  1 16:46:22 onesimplehost syslogd 1.4.1#18: restart.
Jul  1 16:46:22 onesimplehost kernel: klogd 1.4.1#18, log source = /proc/kmsg started.
Jul  1 16:46:22 onesimplehost kernel: Linux version 2.6.18-4-486 (Debian 2.6.18.dfsg.1-12etch1) (dannf@debian.org) (gcc version 4.1.2 20061115 (prerelease) (Debian 4.1.1-21)) #1 Wed Apr 18 09:13:09 UTC 2007
Jul  1 16:46:22 onesimplehost kernel: BIOS-provided physical RAM map:
Jul  1 16:46:22 onesimplehost kernel:  BIOS-e820: 0000000000000000 - 000000000009f800 (usable)
Jul  1 16:46:22 onesimplehost kernel:  BIOS-e820: 000000000009f800 - 00000000000a0000 (reserved)
Jul  1 16:46:22 onesimplehost kernel:  BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
Jul  1 16:46:22 onesimplehost kernel:  BIOS-e820: 0000000000100000 - 000000003bff0000 (usable)
Jul  1 16:46:22 onesimplehost kernel:  BIOS-e820: 000000003bff0000 - 000000003bff3000 (ACPI NVS)
Jul  1 16:46:22 onesimplehost kernel:  BIOS-e820: 000000003bff3000 - 000000003c000000 (ACPI data)
Jul  1 16:46:22 onesimplehost kernel:  BIOS-e820: 00000000fec00000 - 00000000fec01000 (reserved)
Jul  1 16:46:22 onesimplehost kernel:  BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
Jul  1 16:46:22 onesimplehost kernel:  BIOS-e820: 00000000ffff0000 - 0000000100000000 (reserved)
Jul  1 16:46:22 onesimplehost kernel: Warning only 896MB will be used.
Jul  1 16:46:22 onesimplehost kernel: Use a HIGHMEM enabled kernel.
Jul  1 16:46:22 onesimplehost kernel: 896MB LOWMEM available.
Jul  1 16:46:22 onesimplehost kernel: found SMP MP-table at 000f53a0
Jul  1 16:46:22 onesimplehost kernel: DMI 2.3 present.
Jul  1 16:46:22 onesimplehost kernel: ACPI: PM-Timer IO Port: 0x4008
Jul  1 16:46:22 onesimplehost kernel: ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
Jul  1 16:46:22 onesimplehost kernel: Processor #0 15:12 APIC version 16
Jul  1 16:46:22 onesimplehost kernel: ACPI: LAPIC_NMI (acpi_id[0x00] high edge lint[0x1])
Jul  1 16:46:22 onesimplehost kernel: ACPI: IOAPIC (id[0x02] address[0xfec00000] gsi_base[0])
Jul  1 16:46:22 onesimplehost kernel: IOAPIC[0]: apic_id 2, version 3, address 0xfec00000, GSI 0-23
Jul  1 16:46:22 onesimplehost kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
Jul  1 16:46:22 onesimplehost kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 low level)
Jul  1 16:46:22 onesimplehost kernel: Enabling APIC mode:  Flat.  Using 1 I/O APICs
Jul  1 16:46:22 onesimplehost kernel: Using ACPI (MADT) for SMP configuration information
Jul  1 16:46:22 onesimplehost kernel: Allocating PCI resources starting at 40000000 (gap: 3c000000:c2c00000)
Jul  1 16:46:22 onesimplehost kernel: Detected 1999.832 MHz processor.
Jul  1 16:46:22 onesimplehost kernel: Built 1 zonelists.  Total pages: 229376
Jul  1 16:46:22 onesimplehost kernel: Kernel command line: root=/dev/hdb3 ro 
Jul  1 16:46:22 onesimplehost kernel: Enabling fast FPU save and restore... done.
Jul  1 16:46:22 onesimplehost kernel: Enabling unmasked SIMD FPU exception support... done.
Jul  1 16:46:22 onesimplehost kernel: Initializing CPU#0
Jul  1 16:46:22 onesimplehost kernel: PID hash table entries: 4096 (order: 12, 16384 bytes)
Jul  1 16:46:22 onesimplehost kernel: Console: colour VGA+ 80x25
Jul  1 16:46:22 onesimplehost kernel: Dentry cache hash table entries: 131072 (order: 7, 524288 bytes)
Jul  1 16:46:22 onesimplehost kernel: Inode-cache hash table entries: 65536 (order: 6, 262144 bytes)
Jul  1 16:46:22 onesimplehost kernel: Memory: 902096k/917504k available (1502k kernel code, 14824k reserved, 601k data, 256k init, 0k highmem)
Jul  1 16:46:22 onesimplehost kernel: Checking if this processor honours the WP bit even in supervisor mode... Ok.
Jul  1 16:46:22 onesimplehost kernel: Calibrating delay using timer specific routine.. 4003.90 BogoMIPS (lpj=8007809)
Jul  1 16:46:22 onesimplehost kernel: Security Framework v1.0.0 initialized
Jul  1 16:46:22 onesimplehost kernel: SELinux:  Disabled at boot.
Jul  1 16:46:22 onesimplehost kernel: Capability LSM initialized
Jul  1 16:46:22 onesimplehost kernel: Mount-cache hash table entries: 512
Jul  1 16:46:22 onesimplehost kernel: CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
Jul  1 16:46:22 onesimplehost kernel: CPU: L2 Cache: 512K (64 bytes/line)
Jul  1 16:46:22 onesimplehost kernel: Compat vDSO mapped to ffffe000.
Jul  1 16:46:22 onesimplehost kernel: CPU: AMD Athlon(tm) 64 Processor 3000+ stepping 00
Jul  1 16:46:22 onesimplehost kernel: Checking 'hlt' instruction... OK.
Jul  1 16:46:22 onesimplehost kernel: ACPI: Core revision 20060707
Jul  1 16:46:22 onesimplehost kernel: ENABLING IO-APIC IRQs
Jul  1 16:46:22 onesimplehost kernel: ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=0 pin2=0
Jul  1 16:46:22 onesimplehost kernel: checking if image is initramfs... it is
Jul  1 16:46:22 onesimplehost kernel: Freeing initrd memory: 4243k freed
Jul  1 16:46:22 onesimplehost kernel: NET: Registered protocol family 16
Jul  1 16:46:22 onesimplehost kernel: EISA bus registered
Jul  1 16:46:22 onesimplehost kernel: ACPI: bus type pci registered
Jul  1 16:46:22 onesimplehost kernel: PCI: PCI BIOS revision 2.10 entry at 0xfb8d0, last bus=1
Jul  1 16:46:22 onesimplehost kernel: PCI: Using configuration type 1
Jul  1 16:46:22 onesimplehost kernel: Setting up standard PCI resources
Jul  1 16:46:22 onesimplehost kernel: ACPI: Interpreter enabled
Jul  1 16:46:22 onesimplehost kernel: ACPI: Using IOAPIC for interrupt routing
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Root Bridge [PCI0] (0000:00)
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 6 7 *10 11 12)
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 6 7 10 *11 12)
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 6 7 10 11 12) *5
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 6 7 10 11 12) *0, disabled.
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [LNKE] (IRQs 3 4 6 7 10 11 12) *0, disabled.
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [LNKF] (IRQs 3 4 6 7 10 11 12) *0, disabled.
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [LNK0] (IRQs 3 4 6 7 10 11 12) *0, disabled.
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [LNK1] (IRQs 3 4 6 7 10 11 12) *0, disabled.
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [ALKA] (IRQs *20)
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [ALKB] (IRQs *21)
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [ALKC] (IRQs *22)
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [ALKD] (IRQs *23)
Jul  1 16:46:22 onesimplehost kernel: Linux Plug and Play Support v0.97 (c) Adam Belay
Jul  1 16:46:22 onesimplehost kernel: pnp: PnP ACPI init
Jul  1 16:46:22 onesimplehost kernel: pnp: PnP ACPI: found 11 devices
Jul  1 16:46:22 onesimplehost kernel: PnPBIOS: Disabled by ACPI PNP
Jul  1 16:46:22 onesimplehost kernel: PCI: Using ACPI for IRQ routing
Jul  1 16:46:22 onesimplehost kernel: PCI: If a device doesn't work, try "pci=routeirq".  If it helps, post a report
Jul  1 16:46:22 onesimplehost kernel: pnp: 00:02: ioport range 0x4000-0x407f could not be reserved
Jul  1 16:46:22 onesimplehost kernel: pnp: 00:02: ioport range 0x5000-0x500f has been reserved
Jul  1 16:46:22 onesimplehost kernel: PCI: Bridge: 0000:00:01.0
Jul  1 16:46:22 onesimplehost kernel:   IO window: disabled.
Jul  1 16:46:22 onesimplehost kernel:   MEM window: ec000000-edffffff
Jul  1 16:46:22 onesimplehost kernel:   PREFETCH window: e8000000-ebffffff
Jul  1 16:46:22 onesimplehost kernel: NET: Registered protocol family 2
Jul  1 16:46:22 onesimplehost kernel: IP route cache hash table entries: 32768 (order: 5, 131072 bytes)
Jul  1 16:46:22 onesimplehost kernel: TCP established hash table entries: 131072 (order: 7, 524288 bytes)
Jul  1 16:46:22 onesimplehost kernel: TCP bind hash table entries: 65536 (order: 6, 262144 bytes)
Jul  1 16:46:22 onesimplehost kernel: TCP: Hash tables configured (established 131072 bind 65536)
Jul  1 16:46:22 onesimplehost kernel: TCP reno registered
Jul  1 16:46:22 onesimplehost kernel: audit: initializing netlink socket (disabled)
Jul  1 16:46:22 onesimplehost kernel: audit(1183322734.804:1): initialized
Jul  1 16:46:22 onesimplehost kernel: VFS: Disk quotas dquot_6.5.1
Jul  1 16:46:22 onesimplehost kernel: Dquot-cache hash table entries: 1024 (order 0, 4096 bytes)
Jul  1 16:46:22 onesimplehost kernel: Initializing Cryptographic API
Jul  1 16:46:22 onesimplehost kernel: io scheduler noop registered
Jul  1 16:46:22 onesimplehost kernel: io scheduler anticipatory registered
Jul  1 16:46:22 onesimplehost kernel: io scheduler deadline registered
Jul  1 16:46:22 onesimplehost kernel: io scheduler cfq registered (default)
Jul  1 16:46:22 onesimplehost kernel: isapnp: Scanning for PnP cards...
Jul  1 16:46:22 onesimplehost kernel: isapnp: No Plug & Play device found
Jul  1 16:46:22 onesimplehost kernel: Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing enabled
Jul  1 16:46:22 onesimplehost kernel: serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
Jul  1 16:46:22 onesimplehost kernel: 00:09: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
Jul  1 16:46:22 onesimplehost kernel: RAMDISK driver initialized: 16 RAM disks of 8192K size 1024 blocksize
Jul  1 16:46:22 onesimplehost kernel: PNP: No PS/2 controller found. Probing ports directly.
Jul  1 16:46:22 onesimplehost kernel: serio: i8042 AUX port at 0x60,0x64 irq 12
Jul  1 16:46:22 onesimplehost kernel: serio: i8042 KBD port at 0x60,0x64 irq 1
Jul  1 16:46:22 onesimplehost kernel: mice: PS/2 mouse device common for all mice
Jul  1 16:46:22 onesimplehost kernel: EISA: Probing bus 0 at eisa.0
Jul  1 16:46:22 onesimplehost kernel: Cannot allocate resource for EISA slot 4
Jul  1 16:46:22 onesimplehost kernel: Cannot allocate resource for EISA slot 5
Jul  1 16:46:22 onesimplehost kernel: EISA: Detected 0 cards.
Jul  1 16:46:22 onesimplehost kernel: TCP bic registered
Jul  1 16:46:22 onesimplehost kernel: NET: Registered protocol family 1
Jul  1 16:46:22 onesimplehost kernel: NET: Registered protocol family 17
Jul  1 16:46:22 onesimplehost kernel: NET: Registered protocol family 8
Jul  1 16:46:22 onesimplehost kernel: NET: Registered protocol family 20
Jul  1 16:46:22 onesimplehost kernel: Using IPI Shortcut mode
Jul  1 16:46:22 onesimplehost kernel: ACPI: (supports S0 S3 S4 S5)
Jul  1 16:46:22 onesimplehost kernel: Freeing unused kernel memory: 256k freed
Jul  1 16:46:22 onesimplehost kernel: Time: tsc clocksource has been installed.
Jul  1 16:46:22 onesimplehost kernel: ACPI: Thermal Zone [THRM] (40 C)
Jul  1 16:46:22 onesimplehost kernel: Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
Jul  1 16:46:22 onesimplehost kernel: ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
Jul  1 16:46:22 onesimplehost kernel: SCSI subsystem initialized
Jul  1 16:46:22 onesimplehost kernel: VP_IDE: IDE controller at PCI slot 0000:00:0f.1
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [ALKA] enabled at IRQ 20
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt 0000:00:0f.1[A] -> Link [ALKA] -> GSI 20 (level, low) -> IRQ 169
Jul  1 16:46:22 onesimplehost kernel: PCI: VIA IRQ fixup for 0000:00:0f.1, from 255 to 9
Jul  1 16:46:22 onesimplehost kernel: VP_IDE: chipset revision 6
Jul  1 16:46:22 onesimplehost kernel: VP_IDE: not 100%% native mode: will probe irqs later
Jul  1 16:46:22 onesimplehost kernel: VP_IDE: VIA vt8237 (rev 00) IDE UDMA133 controller on pci0000:00:0f.1
Jul  1 16:46:22 onesimplehost kernel:     ide0: BM-DMA at 0xc800-0xc807, BIOS settings: hda:pio, hdb:pio
Jul  1 16:46:22 onesimplehost kernel:     ide1: BM-DMA at 0xc808-0xc80f, BIOS settings: hdc:pio, hdd:pio
Jul  1 16:46:22 onesimplehost kernel: usbcore: registered new driver usbfs
Jul  1 16:46:22 onesimplehost kernel: usbcore: registered new driver hub
Jul  1 16:46:22 onesimplehost kernel: USB Universal Host Controller Interface driver v3.0
Jul  1 16:46:22 onesimplehost kernel: via-rhine.c:v1.10-LK1.4.1 July-24-2006 Written by Donald Becker
Jul  1 16:46:22 onesimplehost kernel: hdb: HDS728080PLAT20, ATA DISK drive
Jul  1 16:46:22 onesimplehost kernel: ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [ALKB] enabled at IRQ 21
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt 0000:00:10.0[A] -> Link [ALKB] -> GSI 21 (level, low) -> IRQ 177
Jul  1 16:46:22 onesimplehost kernel: PCI: VIA IRQ fixup for 0000:00:10.0, from 10 to 1
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.0: UHCI Host Controller
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.0: new USB bus registered, assigned bus number 1
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.0: irq 177, io base 0x0000cc00
Jul  1 16:46:22 onesimplehost kernel: usb usb1: configuration #1 chosen from 1 choice
Jul  1 16:46:22 onesimplehost kernel: hub 1-0:1.0: USB hub found
Jul  1 16:46:22 onesimplehost kernel: hub 1-0:1.0: 2 ports detected
Jul  1 16:46:22 onesimplehost kernel: hdb: max request size: 512KiB
Jul  1 16:46:22 onesimplehost kernel: hdb: 160836480 sectors (82348 MB) w/1719KiB Cache, CHS=16383/255/63, UDMA(133)
Jul  1 16:46:22 onesimplehost kernel: hdb: cache flushes supported
Jul  1 16:46:22 onesimplehost kernel:  hdb: hdb1 hdb2 hdb3
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt 0000:00:10.1[A] -> Link [ALKB] -> GSI 21 (level, low) -> IRQ 177
Jul  1 16:46:22 onesimplehost kernel: PCI: VIA IRQ fixup for 0000:00:10.1, from 10 to 1
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.1: UHCI Host Controller
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.1: new USB bus registered, assigned bus number 2
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.1: irq 177, io base 0x0000d000
Jul  1 16:46:22 onesimplehost kernel: usb usb2: configuration #1 chosen from 1 choice
Jul  1 16:46:22 onesimplehost kernel: hub 2-0:1.0: USB hub found
Jul  1 16:46:22 onesimplehost kernel: hub 2-0:1.0: 2 ports detected
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt 0000:00:10.2[B] -> Link [ALKB] -> GSI 21 (level, low) -> IRQ 177
Jul  1 16:46:22 onesimplehost kernel: PCI: VIA IRQ fixup for 0000:00:10.2, from 11 to 1
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.2: UHCI Host Controller
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.2: new USB bus registered, assigned bus number 3
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.2: irq 177, io base 0x0000d400
Jul  1 16:46:22 onesimplehost kernel: usb usb3: configuration #1 chosen from 1 choice
Jul  1 16:46:22 onesimplehost kernel: hub 3-0:1.0: USB hub found
Jul  1 16:46:22 onesimplehost kernel: hub 3-0:1.0: 2 ports detected
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt 0000:00:10.3[B] -> Link [ALKB] -> GSI 21 (level, low) -> IRQ 177
Jul  1 16:46:22 onesimplehost kernel: PCI: VIA IRQ fixup for 0000:00:10.3, from 11 to 1
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.3: UHCI Host Controller
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.3: new USB bus registered, assigned bus number 4
Jul  1 16:46:22 onesimplehost kernel: uhci_hcd 0000:00:10.3: irq 177, io base 0x0000d800
Jul  1 16:46:22 onesimplehost kernel: usb usb4: configuration #1 chosen from 1 choice
Jul  1 16:46:22 onesimplehost kernel: hub 4-0:1.0: USB hub found
Jul  1 16:46:22 onesimplehost kernel: hub 4-0:1.0: 2 ports detected
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt 0000:00:0f.0[B] -> Link [ALKA] -> GSI 20 (level, low) -> IRQ 169
Jul  1 16:46:22 onesimplehost kernel: sata_via 0000:00:0f.0: routed to hard irq line 11
Jul  1 16:46:22 onesimplehost kernel: ata1: SATA max UDMA/133 cmd 0xB000 ctl 0xB402 bmdma 0xC000 irq 169
Jul  1 16:46:22 onesimplehost kernel: ata2: SATA max UDMA/133 cmd 0xB800 ctl 0xBC02 bmdma 0xC008 irq 169
Jul  1 16:46:22 onesimplehost kernel: scsi0 : sata_via
Jul  1 16:46:22 onesimplehost kernel: ata1: SATA link down 1.5 Gbps (SStatus 0 SControl 300)
Jul  1 16:46:22 onesimplehost kernel: ATA: abnormal status 0x7F on port 0xB007
Jul  1 16:46:22 onesimplehost kernel: scsi1 : sata_via
Jul  1 16:46:22 onesimplehost kernel: ata2: SATA link down 1.5 Gbps (SStatus 0 SControl 300)
Jul  1 16:46:22 onesimplehost kernel: ATA: abnormal status 0x7F on port 0xB807
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt 0000:00:10.4[C] -> Link [ALKB] -> GSI 21 (level, low) -> IRQ 177
Jul  1 16:46:22 onesimplehost kernel: PCI: VIA IRQ fixup for 0000:00:10.4, from 5 to 1
Jul  1 16:46:22 onesimplehost kernel: ehci_hcd 0000:00:10.4: EHCI Host Controller
Jul  1 16:46:22 onesimplehost kernel: ehci_hcd 0000:00:10.4: new USB bus registered, assigned bus number 5
Jul  1 16:46:22 onesimplehost kernel: ehci_hcd 0000:00:10.4: irq 177, io mem 0xee000000
Jul  1 16:46:22 onesimplehost kernel: ehci_hcd 0000:00:10.4: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004
Jul  1 16:46:22 onesimplehost kernel: usb usb5: configuration #1 chosen from 1 choice
Jul  1 16:46:22 onesimplehost kernel: hub 5-0:1.0: USB hub found
Jul  1 16:46:22 onesimplehost kernel: hub 5-0:1.0: 8 ports detected
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [ALKD] enabled at IRQ 23
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt 0000:00:12.0[A] -> Link [ALKD] -> GSI 23 (level, low) -> IRQ 185
Jul  1 16:46:22 onesimplehost kernel: eth0: VIA Rhine II at 0x1e400, 00:11:5b:e6:88:cb, IRQ 185.
Jul  1 16:46:22 onesimplehost kernel: eth0: MII PHY found at address 1, status 0x7869 advertising 05e1 Link 4061.
Jul  1 16:46:22 onesimplehost kernel: Attempting manual resume
Jul  1 16:46:22 onesimplehost kernel: EXT3-fs: INFO: recovery required on readonly filesystem.
Jul  1 16:46:22 onesimplehost kernel: EXT3-fs: write access will be enabled during recovery.
Jul  1 16:46:22 onesimplehost kernel: kjournald starting.  Commit interval 5 seconds
Jul  1 16:46:22 onesimplehost kernel: EXT3-fs: hdb3: orphan cleanup on readonly fs
Jul  1 16:46:22 onesimplehost kernel: EXT3-fs: hdb3: 6 orphan inodes deleted
Jul  1 16:46:22 onesimplehost kernel: EXT3-fs: recovery complete.
Jul  1 16:46:22 onesimplehost kernel: EXT3-fs: mounted filesystem with ordered data mode.
Jul  1 16:46:22 onesimplehost kernel: Linux agpgart interface v0.101 (c) Dave Jones
Jul  1 16:46:22 onesimplehost kernel: agpgart: Detected AGP bridge 0
Jul  1 16:46:22 onesimplehost kernel: agpgart: AGP aperture is 128M @ 0xe0000000
Jul  1 16:46:22 onesimplehost kernel: input: PC Speaker as /class/input/input0
Jul  1 16:46:22 onesimplehost kernel: FDC 0 is a post-1991 82077
Jul  1 16:46:22 onesimplehost kernel: Real Time Clock Driver v1.12ac
Jul  1 16:46:22 onesimplehost kernel: Via 686a/8233/8235 audio driver 1.9.1-ac4-2.5
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt Link [ALKC] enabled at IRQ 22
Jul  1 16:46:22 onesimplehost kernel: ACPI: PCI Interrupt 0000:00:11.5[C] -> Link [ALKC] -> GSI 22 (level, low) -> IRQ 193
Jul  1 16:46:22 onesimplehost kernel: via82cxxx: Six channel audio available
Jul  1 16:46:22 onesimplehost kernel: ac97_codec: AC97  codec, id: ALG96 (Unknown)
Jul  1 16:46:22 onesimplehost kernel: via82cxxx: Codec rate locked at 48Khz
Jul  1 16:46:22 onesimplehost kernel: via82cxxx: board #1 at 0xDC00, IRQ 193
Jul  1 16:46:22 onesimplehost kernel: parport: PnPBIOS parport detected.
Jul  1 16:46:22 onesimplehost kernel: parport0: PC-style at 0x378 (0x778), irq 7, dma 3 [PCSPP,TRISTATE,COMPAT,ECP,DMA]
Jul  1 16:46:22 onesimplehost kernel: pci_hotplug: PCI Hot Plug PCI Core version: 0.5
Jul  1 16:46:22 onesimplehost kernel: shpchp: Standard Hot Plug PCI Controller Driver version: 0.4
Jul  1 16:46:22 onesimplehost kernel: Adding 1951888k swap on /dev/hdb2.  Priority:-1 extents:1 across:1951888k
Jul  1 16:46:22 onesimplehost kernel: EXT3 FS on hdb3, internal journal
Jul  1 16:46:22 onesimplehost kernel: loop: loaded (max 8 devices)
Jul  1 16:46:22 onesimplehost kernel: device-mapper: ioctl: 4.7.0-ioctl (2006-06-24) initialised: dm-devel@redhat.com
Jul  1 16:46:22 onesimplehost kernel: kjournald starting.  Commit interval 5 seconds
Jul  1 16:46:22 onesimplehost kernel: EXT3 FS on hdb1, internal journal
Jul  1 16:46:22 onesimplehost kernel: EXT3-fs: mounted filesystem with ordered data mode.
Jul  1 16:46:22 onesimplehost kernel: eth0: link up, 10Mbps, full-duplex, lpa 0x4061
Jul  1 16:46:22 onesimplehost kernel: NET: Registered protocol family 10
Jul  1 16:46:22 onesimplehost kernel: lo: Disabled Privacy Extensions
Jul  1 16:46:22 onesimplehost kernel: IPv6 over IPv4 tunneling driver
Jul  1 16:46:26 onesimplehost kernel: ACPI: Power Button (FF) [PWRF]
Jul  1 16:46:26 onesimplehost kernel: ACPI: Power Button (CM) [PWRB]
 
Old 07-01-2007, 04:21 PM   #2
ilikejam
Senior Member
 
Registered: Aug 2003
Location: Glasgow
Distribution: Fedora / Solaris
Posts: 3,109

Rep: Reputation: 96
Hi.

Nothing of particular interest in the syslog.

I'm thinking either a heat issue (or maybe not if it's in a real datacenter with decent HVAC), a marginal power supply, or the RAM is dodgy. I'm leaning towards RAM, as I've had a machines just crash hard with no errors from RAM issues before.

Can you run 'sensors'? That should hopefully show the CPU and board temperatures, and maybe also the voltage values for the various rails.

Dave
 
Old 07-02-2007, 01:00 PM   #3
Jubalint
Member
 
Registered: Mar 2004
Distribution: Debian
Posts: 35

Original Poster
Rep: Reputation: 15
This is what sensors gives me:

it87-isa-0290
Adapter: ISA adapter
VCore 1: +1.44 V (min = +4.08 V, max = +4.08 V) ALARM
VCore 2: +2.51 V (min = +4.08 V, max = +4.08 V) ALARM
+3.3V: +3.25 V (min = +4.08 V, max = +4.08 V) ALARM
+5V: +5.00 V (min = +6.85 V, max = +6.85 V) ALARM
+12V: +11.84 V (min = +16.32 V, max = +16.32 V) ALARM
-12V: -14.60 V (min = +3.93 V, max = +3.93 V) ALARM
-5V: -8.58 V (min = +4.03 V, max = +4.03 V) ALARM
Stdby: +4.70 V (min = +6.85 V, max = +6.85 V) ALARM
VBat: +4.08 V
fan1: 3443 RPM (min = 0 RPM, div = 8)
fan2: 0 RPM (min = 1318 RPM, div = 8)
fan3: 0 RPM (min = 0 RPM, div = 8)
M/B Temp: +40°C (low = +127°C, high = +78°C) sensor = thermistor
CPU Temp: +91°C (low = +127°C, high = +78°C) sensor = diode ALARM
Temp3: +49°C (low = +127°C, high = +78°C) sensor = thermistor
 
Old 07-03-2007, 02:45 AM   #4
ilikejam
Senior Member
 
Registered: Aug 2003
Location: Glasgow
Distribution: Fedora / Solaris
Posts: 3,109

Rep: Reputation: 96
If that CPU temperature reading is correct, then you've got a cooling issue, and a pretty serious one at that. Bear in mind, though, that the output from 'sensors' is sometimes off by a multiple of 2, so it might not be. On the other hand, the motherboard temp looks sane...

Dave
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Server keeps crashing yepp Linux - Enterprise 7 11-08-2005 07:39 AM
Linux Server keeps Crashing chrisellis Linux - General 2 06-25-2004 09:59 PM
X-Server keeps crashing no matter what! hari_seldon99 Linux - Software 5 01-31-2004 05:27 PM
MY server keeps crashing and I don't know why... Electrode Linux - General 6 07-06-2003 10:53 AM
x server crashing often... rooman Slackware 6 12-11-2002 02:39 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Server

All times are GMT -5. The time now is 08:22 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration