Linux - Server: This forum is for the discussion of Linux software used in a server-related context.
Since some days, my Ubuntu Linux Home Server is experiencing extreme slowdown.
Here is what I know:
- Rebooting doesn't help
- No unusual dmesg / log output
- Very high (8-22) load average
- High (75%+) CPU "wa" (I/O wait) usage
- Very slow response times and a sluggish "feel"
- I haven't changed anything that could have slowed it down
I suspect some kind of hardware semi-failure. Can you help me troubleshoot it?
There isn't necessarily anything wrong with the drive. You may just have too many processes waiting on disk access and performing a lot of writes, or you could have several processes all waiting on the same file. Try using ps and lsof to see which files the waiting processes are trying to access.
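If ps and lsof are awkward to combine by hand, the same information can be read straight from /proc. The following is only a rough sketch, assuming a standard Linux /proc layout; the function names (`parse_stat`, `blocked_processes`, `open_files`) are made up for this example. It lists processes in uninterruptible sleep (state D, usually waiting on disk) and the files they have open.

```python
import os

def parse_stat(stat_text):
    """Return (pid, comm, state) from the contents of /proc/<pid>/stat."""
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split on the LAST closing parenthesis.
    left, right = stat_text.rsplit(")", 1)
    pid_str, comm = left.split("(", 1)
    state = right.split()[0]
    return int(pid_str), comm, state

def blocked_processes(proc_root="/proc"):
    """List (pid, comm) for processes in uninterruptible sleep (state D)."""
    found = []
    try:
        entries = os.listdir(proc_root)
    except OSError:
        return found
    for entry in entries:
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "stat")) as fh:
                pid, comm, state = parse_stat(fh.read())
        except (OSError, ValueError):
            continue  # process exited while we were looking
        if state == "D":
            found.append((pid, comm))
    return sorted(found)

def open_files(pid, proc_root="/proc"):
    """Resolve the fd symlinks of a process (needs root for other users' processes)."""
    fd_dir = os.path.join(proc_root, str(pid), "fd")
    paths = []
    try:
        for fd in os.listdir(fd_dir):
            try:
                paths.append(os.readlink(os.path.join(fd_dir, fd)))
            except OSError:
                pass  # fd was closed in the meantime
    except OSError:
        pass  # process gone or permission denied
    return paths
```

Run as root, something like `for pid, comm in blocked_processes(): print(pid, comm, open_files(pid))` gives a snapshot. A process only shows up if it happens to be blocked at that instant, so it is worth repeating the call a few times while the machine is slow.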
Actually, your problem is entirely related to disk I/O. Linux includes processes blocked on disk access (uninterruptible sleep, state D) in its load average. So when you have high I/O wait you will also see high load numbers even though your CPU might very well be idle! I suspect there is something wrong with your RAID array. Was there a disk failure, and is the array being rebuilt or something? If the answer is no, then you have to identify which process is causing the high I/O.
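To put a number on the "wa" figure without watching top, the aggregate counters in /proc/stat can be sampled twice and compared. This is a minimal sketch, assuming the usual field order on the `cpu` line (user, nice, system, idle, iowait, irq, softirq, steal, guest, guest_nice); the helper names are invented for illustration.

```python
import time

def read_cpu_times(path="/proc/stat"):
    """Return the aggregate 'cpu' counters from /proc/stat as a list of ints."""
    with open(path) as fh:
        for line in fh:
            if line.startswith("cpu "):
                return [int(v) for v in line.split()[1:]]
    raise RuntimeError("no aggregate cpu line found")

def iowait_percent(before, after):
    """Share of elapsed CPU time spent in iowait between two samples."""
    deltas = [b - a for a, b in zip(before, after)]
    # guest and guest_nice are already counted inside user and nice,
    # so only the first eight fields are summed for the total.
    total = sum(deltas[:8])
    if total <= 0:
        return 0.0
    return 100.0 * deltas[4] / total  # index 4 is iowait

def sample_iowait(interval=5.0):
    """Take two samples `interval` seconds apart and return the iowait share."""
    before = read_cpu_times()
    time.sleep(interval)
    return iowait_percent(before, read_cpu_times())
```

Calling `sample_iowait()` every minute or so and logging the result would show whether the 75%+ figure is constant or comes in bursts, which may help tell a failing disk apart from a periodic job.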
One way to check would be to run top and sort by the TIME+ column in descending order. That should tell you which processes have accumulated the most CPU time since they started.
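CPU time does not say much about disk traffic, so as a complement here is a hedged sketch that ranks processes by the cumulative `read_bytes` and `write_bytes` counters in /proc/<pid>/io. It assumes the kernel exposes that file (per-task I/O accounting enabled) and that it runs as root, since other users' io files are not readable otherwise. `parse_io` and `top_io` are illustrative names only.

```python
import os

def parse_io(text):
    """Parse the 'key: value' lines of /proc/<pid>/io into a dict of ints."""
    counters = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            counters[key.strip()] = int(value)
    return counters

def top_io(limit=5, proc_root="/proc"):
    """Return up to `limit` (bytes, pid) pairs, largest cumulative disk traffic first."""
    rows = []
    try:
        entries = os.listdir(proc_root)
    except OSError:
        return rows
    for entry in entries:
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "io")) as fh:
                io = parse_io(fh.read())
        except (OSError, ValueError):
            continue  # process exited, or file not readable without root
        total = io.get("read_bytes", 0) + io.get("write_bytes", 0)
        rows.append((total, int(entry)))
    return sorted(rows, reverse=True)[:limit]
```

The counters are totals since each process started, so a long-lived process like qemu will naturally rank high. Calling `top_io()` twice a few seconds apart and comparing the numbers gives a better idea of who is hitting the disk right now.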
The only heavy I/O process is qemu, but it has always been there and it isn't consuming that much disk bandwidth.
Even terminating it doesn't change things much.
Disk I/O is also very slow after forcing a hard reboot: it took a long time even to replay the filesystem journal, when no processes were running at all.
Here is a top sorted by time. Note that at the time of this "top" the server was running really FAST -_-'
I have an idea. Boot the server off a Linux installation/live CD and run the badblocks program on each of the drives. On modern drives the read-only test takes about 1.5-2 hours. If the program runs really slowly, you know something is wrong with your drives or the disk controller. If it runs normally but reports bad blocks, then your drives need to be replaced.
BTW, you could run badblocks without rebooting your server, i.e. from within your currently installed OS. But I want you to use a CD in order to rule out the current filesystem as a factor in the slowdown.
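As a quick sanity check before committing to the full badblocks run, a plain sequential read with timing can already show whether a drive is abnormally slow. The sketch below is not a replacement for badblocks, just a rough read-only probe under a few assumptions: it is run as root, the path you pass is the raw device (the `/dev/sdX` in the note is a placeholder, not a real name from this thread), and `timed_read` is a name invented for this example.

```python
import time

def timed_read(path, chunk_size=1024 * 1024, max_bytes=None):
    """Read `path` sequentially; return (bytes_covered, seconds, error_offsets)."""
    offset = 0
    errors = []
    start = time.monotonic()
    # buffering=0 avoids Python-level buffering; the kernel page cache still applies.
    with open(path, "rb", buffering=0) as fh:
        while max_bytes is None or offset < max_bytes:
            try:
                chunk = fh.read(chunk_size)
            except OSError:
                # Unreadable region: remember where it was and skip past it.
                errors.append(offset)
                offset += chunk_size
                fh.seek(offset)
                continue
            if not chunk:
                break  # end of device or file
            offset += len(chunk)
    return offset, time.monotonic() - start, errors
```

For example, `covered, secs, errs = timed_read("/dev/sdX", max_bytes=2 * 1024**3)` reads the first 2 GiB, and `covered / secs` gives bytes per second. Data that is already in the page cache will read unrealistically fast, so this is most meaningful right after booting the live CD. Any entries in `errs` are offsets where the read failed and would be a strong hint to go on with badblocks and SMART checks.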
Would you mind telling me the RAM and swap memory of your system?
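In case it is useful for collecting those numbers, here is a small sketch that pulls them out of /proc/meminfo. It assumes a reasonably recent kernel (the `MemAvailable` field is missing on old ones, in which case it comes back as `None`), and the function names are only illustrative.

```python
def parse_meminfo(text):
    """Map each /proc/meminfo key to its integer value.

    Most values are in kB; a few (such as HugePages_Total) are plain counts.
    """
    info = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        fields = rest.split()
        if sep and fields:
            info[key.strip()] = int(fields[0])
    return info

def memory_summary(path="/proc/meminfo"):
    """Return total/available RAM and total/free swap as reported by the kernel."""
    with open(path) as fh:
        info = parse_meminfo(fh.read())
    wanted = ("MemTotal", "MemAvailable", "SwapTotal", "SwapFree")
    return {key: info.get(key) for key in wanted}
```

`print(memory_summary())` gives the four figures in kB. If `SwapFree` is far below `SwapTotal` while the machine is slow, heavy swapping could be part of the I/O wait.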
Quote:
I have an idea. Boot the server off a Linux installation/live CD and run the badblocks program on each of the drives. On modern drives the read-only test takes about 1.5-2 hours. If the program runs really slowly, you know something is wrong with your drives or the disk controller. If it runs normally but reports bad blocks, then your drives need to be replaced.
Nice idea. I'll do it tonight. Ty
Quote:
Would you mind telling me the RAM and swap memory of your system?
Quote:
Since some days, my Ubuntu Linux Home Server is experiencing extreme slowdown.
Maybe we could try looking at other stuff in parallel?
- I noticed you saying "since some days". So what happened since the machine last ran OK? Any system updates or reconfiguration? New users? Anything else we should know?
- Can you install Atop, reboot the machine to a sane state and have Atop store system and process activity for at least 24 hours? (I like Atop because it's easy to replay the binary log, provided a reasonable interval is used.)
- You stated the logs don't show any anomalies, but you didn't say what you looked with. If it was a case of cursory visual inspection, I suggest using Logwatch instead. It's helpful for finding leads you might have overlooked in the log files it knows about.