Quote:
Originally Posted by MensaWater
Most likely because whatever made your CPU go to 100% prevent sar from being able to run.
When you say you rebooted do you mean you did a warm reboot or did you have to power off the system because a warm boot couldn't be done?
It happens on occasion that so many resources are used the system will be totally unresponsive requiring a hard boot (power cycle). In such occasions the same thing that prevents you from doing other tasks will prevent system jobs from running.
Another possibility is that the filesystem was full and rebooting cleaned up some temporary file that had filled it up.
Nothing in the output you put here shows any huge decrease of CPU idle by 2 PM. Presumably at next 2:10 PM sample whatever happened that sent your CPU to 100% had already occurred preventing that sample. Look at the rest of your sadd to see if you can determine other issues (e.g. memory or file table) growth that was increasing rapidly. The CPU going to a 100% may have been a symptom of the other issue.
|
Yes, I had to power off the system because the VM was not responsible. I also checked network, memory, fd and so on, everything was normal before the incident, it happened suddenly.
my server did not run on a physical machine but a VM managed by VMWare, maybe it is VM issue.
By the way, would you tell me how to check how much memory that Linux kernel is using? is it equal to a sum up of all output of /proc/slabinfo?