LinuxQuestions.org
Review your favorite Linux distribution.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware
User Name
Password
Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?

Notices


Reply
  Search this Thread
Old 04-25-2004, 11:13 PM   #1
KendersPlace
Member
 
Registered: Feb 2003
Location: Phoenix, AZ - USA
Distribution: RedHat 8, Micro$haft
Posts: 33

Rep: Reputation: 15
Angry Total loss of power on HD seek ???


A customer's machine is running RH 8.0. It is basically a database server running mysql. It is a very stripped down installation - database, security, and web.

The drive went out, sent it for warranty replacement - a new drive (checked mfr date, it is indeed new, not the repaired old drive) arrives back.

Hard drive: Western Digital 250GB, 7200rpm, 8mb buffer "SE" edition.


SINCE REPLACING THE DRIVE....

Loaded up fresh install of RH on the new drive from the RH install CD's. About the 3rd time the box booted up - it would hit the partition the mysql tables are on during the boot process (listing out on the screen) and would suddenly drop all power when it hit this one partition - as if someone had pulled the power cable out of the wall.

Pressing the power button or reset button again had no effect - the box was totally dead. I flipped the switch on the POWER SUPPLY on the back of the box off and on, unplugged and re-plugged the power cable, then hit the front power button, and it started right back up with a fresh boot.

Out of the next 4 or 5 boots - each time it would hit this partition, the same thing happened. Finally, on the 6th or so boot (I was trying to see what was on the screen just before it died each time) it started up just fine. Worked perfectly.

Now, about every week or so, seemingly randomly - the box will just drop power again - same symptom, but happens after it's been running for a while. Have to cycle the power supply again, and it starts back up - a week later - another loss of power. IT ALWAYS SEEMS TO HAPPEN RIGHT AFTER KICKING OFF A LARGE SQL QUERY.

I replaced the power supply (was a 300w, replaced with brand new 400w). The exact same thing is still happening.


The question:

I've never seen anything like this, and have built 3 IDENTICAL machines for 3 different customers running IDENTICAL datbases, all have been fine for over a year, now this problem with this one box - SINCE replacing the hard drive.

Because this seems to happen during access only to one SQL partition - is it possible that the hard drive is somehow shorting out and tripping the power supply? This doesn't seem like a motherboard problem - seems to be directly related to this one partition on the hard drive.

Is there anything else that could cause this? Should I send this drive also back to Western Digital for replacement??


Box Specs:
Athlon XP 2000+
128 MB RAM
Single hard drive (noted above)
FIC motherboard w/ onboard video and NIC. (onboard sound and other periphrials disabled via Bios).

That's the only thing in this box. A processor, ram, motherboard, and a hard drive. No floppy, no CD, no anything - strictly a network accessible database server.


Thanks much in advance, sorry for long post - wanted to include all the details!

-K
 
Old 04-26-2004, 08:03 AM   #2
kilgoretrout
Senior Member
 
Registered: Oct 2003
Posts: 2,865

Rep: Reputation: 343Reputation: 343Reputation: 343Reputation: 343
Download the diagnostic utility from the WD website and thoroughly check the drive. You'll need to do that to rma the drive anyway. If the drive checks out OK, I'd suspect an overheating problem. Check to make sure your fans are working properly and the heatsink is properly mounted. Also try swapping out the drive cable.
 
Old 04-26-2004, 03:02 PM   #3
J.W.
LQ Veteran
 
Registered: Mar 2003
Location: Boise, ID
Distribution: Mint
Posts: 6,642

Rep: Reputation: 86
The only variable seems to be with that one HD, so Yes, I'd agree with kigoretrout that you need to return it. Based on your description, it definitely sounds defective.

The only other possibility that I could think of would be if you were OC'ing the Athlon. Since the behavior only seems to manifest itself when executing CPU-intensive and RAM-intensive queries, then OC'ing could introduce some instability leading to a hung state. Along these lines, if you are using multiple sticks of RAM, are they all the same speed, and is the RAM speed matched to the CPU? -- J.W.
 
Old 04-27-2004, 01:12 AM   #4
KendersPlace
Member
 
Registered: Feb 2003
Location: Phoenix, AZ - USA
Distribution: RedHat 8, Micro$haft
Posts: 33

Original Poster
Rep: Reputation: 15
Thanks for the replies.

1. The CPU is not overclocked.
2. The RAM is a single stick, PC2100.
3. RAM and CPU speeds are matched as far as I know. I always clear the CMOS with the jumper when building a new box, and I didn't change anything in the BIOS as far as RAM speed, so it should be default.

(The box is still at customer location - picking it up in a couple days).

The diagnostic utility from W.D. only runs under windows - the drive can be swapped to a W box, but was hoping they put out something basic and low-level I could just run under L. Guess not.

I hadn't considered the ribbon cable. That is possible as I had to fight with the drive mounting bracket and may well have damaged the cable. I'll try swapping that out w/ another EIDE cable first and if it happens again I guess it goes back.


Thanks again.
-K
 
Old 04-27-2004, 02:03 AM   #5
J.W.
LQ Veteran
 
Registered: Mar 2003
Location: Boise, ID
Distribution: Mint
Posts: 6,642

Rep: Reputation: 86
There may be some underlying issue with the RAM. It may be useful to run the diagnostics from here: http://memtest86.com/ -- J.W.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
total font loss mayhem in kde3.4.2 27B-6 Slackware 2 09-15-2005 07:50 PM
Squid don't start after power loss sveinxp Linux - Networking 4 08-31-2004 03:55 PM
RedHat Problem after a power loss jocast Linux - Software 21 06-12-2004 09:38 AM
Power loss - disk health check - how to force? kalahari875 Mandriva 2 05-27-2004 07:12 AM
Power loss, now no email etc Bake-SaleNet Linux - Networking 2 02-01-2004 06:26 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware

All times are GMT -5. The time now is 04:42 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration