LinuxQuestions.org
Help answer threads with 0 replies.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Server
User Name
Password
Linux - Server This forum is for the discussion of Linux Software used in a server related context.

Notices


Reply
  Search this Thread
Old 05-29-2007, 03:01 AM   #1
spayce
LQ Newbie
 
Registered: May 2007
Posts: 2

Rep: Reputation: 0
Urgent: RHEL4 64bit keeps crashing


I am a total linux Noob but my server keeps crashing, looks like hard drive error, but I dont have any SMART errors, and the array seems ok.

Looking for some help, this keeps on happening to my RHEL4 64bit Hugemem machine running vmware and it takes down the server.

I am running a Supermicro box with Dual Quad Core processors and 16GB of mem,

There are two arrays One Mirror set for the system and a RAID 10 for the VM volume.

The machine halted, no console, no ssh, only responds to ping, I see this in all the virtual terminals

ext3-fs error (device dm-0) in start_transaction:Journal has aborted

and after several hours of several days it happens again.


here is a section of /var/log/messages

it is interesting to note All the had and Ide errors and how at 12:32:12 last night the log just stops and then it picks up again at 22:39 when I rebooted it. Also after the reboot, the exact same errors start occurring again.


Looking for next step suggestions.

May 28 00:32:10 vmserver01 kernel: hda: packet command error: status=0x51 { DriveReady SeekComplete Error }

May 28 00:32:10 vmserver01 kernel: hda: packet command error: error=0x54

May 28 00:32:10 vmserver01 kernel: ide: failed opcode was 100

May 28 00:32:10 vmserver01 kernel: ATAPI device hda:

May 28 00:32:10 vmserver01 kernel: Error: Illegal request -- (Sense key=0x05)

May 28 00:32:10 vmserver01 kernel: Cannot read medium - incompatible format -- (asc=0x30, ascq=0x02)

May 28 00:32:10 vmserver01 kernel: The failed "Read Subchannel" packet command was:

May 28 00:32:10 vmserver01 kernel: "42 02 40 01 00 00 00 00 10 00 00 00 00 00 00 00 "

May 28 00:32:11 vmserver01 kernel: hda: packet command error: status=0x51 { DriveReady SeekComplete Error }

May 28 00:32:11 vmserver01 kernel: hda: packet command error: error=0x54

May 28 00:32:11 vmserver01 kernel: ide: failed opcode was 100

May 28 00:32:11 vmserver01 kernel: ATAPI device hda:

May 28 00:32:11 vmserver01 kernel: Error: Illegal request -- (Sense key=0x05)

May 28 00:32:11 vmserver01 kernel: Cannot read medium - incompatible format -- (asc=0x30, ascq=0x02)

May 28 00:32:11 vmserver01 kernel: The failed "Read Subchannel" packet command was:

May 28 00:32:11 vmserver01 kernel: "42 02 40 01 00 00 00 00 10 00 00 00 00 00 00 00 "

May 28 00:32:12 vmserver01 kernel: hda: packet command error: status=0x51 { DriveReady SeekComplete Error }

May 28 00:32:12 vmserver01 kernel: hda: packet command error: error=0x54

May 28 00:32:12 vmserver01 kernel: ide: failed opcode was 100

May 28 00:32:12 vmserver01 kernel: ATAPI device hda:

May 28 00:32:12 vmserver01 kernel: Error: Illegal request -- (Sense key=0x05)

May 28 00:32:12 vmserver01 kernel: Cannot read medium - incompatible format -- (asc=0x30, ascq=0x02)

May 28 00:32:12 vmserver01 kernel: The failed "Read Subchannel" packet command was:

May 28 00:32:12 vmserver01 kernel: "42 02 40 01 00 00 00 00 10 00 00 00 00 00 00 00 "

May 28 22:39:38 vmserver01 syslogd 1.4.1: restart.

May 28 22:39:38 vmserver01 syslog: syslogd startup succeeded

May 28 22:39:38 vmserver01 kernel: klogd 1.4.1, log source = /proc/kmsg started.

May 28 22:39:38 vmserver01 syslog: klogd startup succeeded

May 28 22:39:38 vmserver01 kernel: Linux version 2.6.9-5.ELhugemem (bhcompile@decompose.build.redhat.com) (gcc version 3.4.3 20041212 (Red Hat 3.4.3-9.EL4)) #1 SMP Wed Jan 5 19:38:36 EST 2005
 
Old 05-29-2007, 11:15 PM   #2
twantrd
Senior Member
 
Registered: Nov 2002
Location: CA
Distribution: redhat 7.3
Posts: 1,440

Rep: Reputation: 52
Well, from looking at the logs, yes, it appears to be a hard-drive error however, you stated that hda is part of a raid container. I'm assuming hda is raid 1. Therefore, if there was a hard-drive error, no big deal as it's part of a mirror anyways. Perhaps the controller is faulty/dying. Does the server come with any diagnostic tools (or on a CD like how dell has it) that you can run and perform a system component health check?

It would probably also help if you had another identical system and install RHEL on it and see if it mimics the same behavior. However, I understand if you don't.

-twantrd
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Can't install amarok 64bit du eto missing provider of libpq.so.4()(64bit) Ossah Linux - Software 1 04-21-2007 09:23 PM
urgent..upgrade from RHEL3 to RHEL4 kumarnine Linux - Enterprise 1 10-30-2006 08:12 AM
iSCSI initiator grief- 64bit RHEL4 RedHatCat Linux - Software 3 02-20-2006 10:03 AM
64bit Eval Issues...switched to 64bit OSS and WOW RedShirt SUSE / openSUSE 6 01-23-2006 09:07 PM
Nautilus Crashing after full install of FC2 64bit Hkrboy27 Linux - Software 0 10-06-2004 01:25 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Server

All times are GMT -5. The time now is 07:11 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration