LinuxQuestions.org
Welcome to the most active Linux Forum on the web.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Server
User Name
Password
Linux - Server This forum is for the discussion of Linux Software used in a server related context.

Notices


Reply
  Search this Thread
Old 05-19-2011, 03:38 PM   #1
thllgo
Member
 
Registered: Sep 2003
Location: Laurel MD
Posts: 296

Rep: Reputation: 32
mpstat output interpret and process info


Hello,

I've got a system that is maxing out CPU. Running "mpstat -P ALL 3 5" shows the following.

Quote:
08:30:21 PM CPU %user %nice %sys %iowait %irq %soft %steal %idle intr/s
08:30:24 PM all 100.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 1001.00
08:30:24 PM 0 100.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 1001.00
08:30:24 PM 1 99.67 0.00 0.00 0.00 0.00 0.00 0.00 0.33 0.00
08:30:24 PM 2 100.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
08:30:24 PM 3 100.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
All 5 groups show the same. My first question is why would CPU0 be the only one with intr/s and the others do not?
Info. The OS is RHEL 5.4 running as a VM on ESXi 4.1. Memory doesn't appear to be an issue, the system has 8 GB and its only using about 1.5 GB.

Second question, I'm positive the process that is the problem is the tomcat process. Does anyone know a good way to see whats happening with a specific process?
 
Old 06-15-2011, 05:04 PM   #2
SteveK1979
Member
 
Registered: Feb 2004
Location: UK
Distribution: RHEL, Ubuntu, Solaris 11, NetBSD, OpenBSD
Posts: 225

Rep: Reputation: 43
Hi,

Quote:
Second question, I'm positive the process that is the problem is the tomcat process. Does anyone know a good way to see whats happening with a specific process?
strace is your friend! It should show you what the tomcat process is up to at a fairly low level.

As to the interrupts only being handled by one cpu, I would have to guess this is a vaguery of the fact it's a vm. I checked a VM that has quad cpus allocated and I see exactly the same behavoir.

Cheers,
Steve
 
1 members found this post helpful.
Old 06-16-2011, 09:29 AM   #3
thllgo
Member
 
Registered: Sep 2003
Location: Laurel MD
Posts: 296

Original Poster
Rep: Reputation: 32
Thank you,
strace is my new little friend.

I'm going to have to check with VMware. I'm wondering if the behavior with the lack of distribution between CPUs has a fix.
 
Old 06-16-2011, 03:09 PM   #4
SteveK1979
Member
 
Registered: Feb 2004
Location: UK
Distribution: RHEL, Ubuntu, Solaris 11, NetBSD, OpenBSD
Posts: 225

Rep: Reputation: 43
Hi,

What makes you think this is actually a problem that needs fixing? There are quite a few things that can influence this too. It can be influenced by HyperThreading as only one of the 'threads' can be interrupted since there is only one cpu in reality.

I'm quite interested in your answer - we also have a large number of linux VMs on VMware.

Cheers,
Steve
 
Old 06-16-2011, 04:10 PM   #5
thllgo
Member
 
Registered: Sep 2003
Location: Laurel MD
Posts: 296

Original Poster
Rep: Reputation: 32
The only difference between this and our other system is this system is running a newer version of our software that is wrapped around tomcat. Its not tomcats fault its something we are doing. We have a base VM that we copy and test our SW on. At the end of the test we fix our SW, destroy the VM and install the new version of the SW on a new VM. This way we have a clean system for each new version of the SW. This time however the system idle never drops bellow about 85%, frequently reaching 100%. Without feeding the system any data it is consistently using way too much CPU. Past tests show an idle system only around 5%.
 
Old 07-07-2011, 03:37 PM   #6
SteveK1979
Member
 
Registered: Feb 2004
Location: UK
Distribution: RHEL, Ubuntu, Solaris 11, NetBSD, OpenBSD
Posts: 225

Rep: Reputation: 43
Hi,

Did you ever get a response from VMware for this? Out of curiosity is irqbalence running on these systems?

Cheers,
Steve
 
Old 07-12-2011, 09:51 AM   #7
thllgo
Member
 
Registered: Sep 2003
Location: Laurel MD
Posts: 296

Original Poster
Rep: Reputation: 32
We did find the issue. One of the developers while fixing one problem created another that put one section of the app. into a CPU sucking loop. , but we found it and fixed it. I was sure the issue was our app.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
How to interpret Papi output kashyapa Linux - Software 0 07-30-2010 03:59 PM
Interpret info contained in /var/log/mcelog? robotsari Linux - Software 1 07-28-2010 02:41 PM
Description of mpstat output Sapfeer Linux - Newbie 4 02-22-2010 11:24 PM
interpret tcpdump output powah Linux - Networking 3 09-22-2008 07:22 PM
Wine debugger output: how do I interpret this? trashbird1240 Linux - Software 2 02-02-2007 01:03 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Server

All times are GMT -5. The time now is 01:50 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration