LinuxQuestions.org
Review your favorite Linux distribution.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - General
User Name
Password
Linux - General This Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.

Notices


Reply
  Search this Thread
Old 04-22-2010, 12:02 AM   #1
wigglytoes
LQ Newbie
 
Registered: Feb 2009
Posts: 17

Rep: Reputation: 2
The tortoise and the hare - 2 systems - Wildly different performance


I'm stumped.

I have two Linux boxes. They run the same application, and have done so admirably for a few years. Lately one has slowed way down. (It's about 20 times slower running my application)

The tortoise has 8 CPUs, Two Quad Core Xeon processors on a Supermicro MB. 8 GB of RAM and a RAID 5 disk array powered by a 3Ware 9550SX RAID card.

The hare has 8 CPU's. Four Dual core AMD processors on a Supermicro MB. 8 GB of RAM and a RAID 5 disk array powered by a 3Ware 9550SX RAID card.

Yesterday, I tried to run a job on the tortoise, and it ran for 20 hours. I copied all the data to the hare, and it ran in 1 hour.

The hare is running Centos 2.6.18-164.11.1.el5
The Tortoise is running CentOS 2.6.18-164.15.1.el5

There is nothing odd in /var/log/messages
I have NOT tried to run memtest. (I always have trouble
with that anyway, as I have multiple CPUs and more than 4GB RAM)
There are no messages in the error logs for the RAID cards.
Running a 'cp' command on a big file on the tortoise and watching iostat shows that I'm transferring at about 60 MB/sec. (This stays the same whether I copy between the RAID filesystem and a system disk, or any combination of those two. And it stays the same if I use my application to do the copying.)
It doesn't look like a disk problem.

Running mpstat shows that my application caused the CPUs to spend most of their time in IOWait.

vmstat does not show massive swapping (or any)

I can't explain this, or even how to figure out what's going on.
Is there any way it could be the network? Both machines are on the same network. They do not use the network for anything, it's just there. I've just noticed that over my many years of head banging over computers, is that when one computer on a network slows down, they all seem to. Even when the network isn't used. (I know that's impossible, but then if man were meant to fly God would have given him wings, or a stronger velocity fart expulsion.)
 
Old 04-22-2010, 02:26 AM   #2
catkin
LQ 5k Club
 
Registered: Dec 2008
Location: Tamil Nadu, India
Distribution: Debian
Posts: 8,578
Blog Entries: 31

Rep: Reputation: 1208Reputation: 1208Reputation: 1208Reputation: 1208Reputation: 1208Reputation: 1208Reputation: 1208Reputation: 1208Reputation: 1208
Quote:
Originally Posted by wigglytoes View Post
They do not use the network for anything, it's just there.
2+ hours and no reply ... time for the ignoramuses to try and help out ...

Have you used a network traffic monitor such as tcpdump to confirm that? IDK, but could IOWait time be waiting for network response? Alternatively, could IOWait time be waiting for file locks to clear?
 
Old 04-26-2010, 04:48 AM   #3
peter1234
Member
 
Registered: Apr 2009
Posts: 42

Rep: Reputation: 2
Code:
2+ hours and no reply ... time for the ignoramuses to try and help out ...
4 days and 1 reply ... time for 2nd ignoramuses to join in ...

Maybe the 'tortoise' decided to take it slow and steady

Ok jokes aside,

Software side I cann't suggest much other than full reinstall (I don't think that is what you are looking for and it might not help, leave reinstall as lastresort).

chk for rootkits and things like that.

chk
Code:
Running a 'cp' command on a big file on the tortoise and watching iostat shows that I'm transferring at about 60 MB/sec.
Run the same 'cp' command on a big file on 'hare' and see transfer rate.


Hardware side chk:
-dust build up, faulty fans, overheating parts
-PSU(s)
-cache RAM in 3Ware 9550SX cards ok (if there any)
-remove 'tortoise' ram modules one at(or in pairs) a time and if that help
-remove 'tortoise' ram modules one at time and test it on different pc
-see if HDDs in 'tortoise' hotter than normal
-google and see if HDDs models in 'tortoise' have any known problems.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Tortoise GIT Issue? your_shadow03 Linux - Newbie 0 02-24-2009 01:21 AM
LXer: Tortoise and Hare: How Linux Can Leap Ahead of Mac OS X and Windows LXer Syndicated Linux News 0 06-11-2008 07:20 PM
LXer: Promise Technology Introduces High-Performance SCSI-to-SATA RAID 6 Storage Systems LXer Syndicated Linux News 0 09-22-2006 07:03 PM
LXer: Performance tuning UNIX systems LXer Syndicated Linux News 0 04-11-2006 04:03 AM
.xsession-errors growing wildly classicalcraig Linux - Newbie 7 10-26-2004 04:44 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - General

All times are GMT -5. The time now is 08:27 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration