How do you deal with server fault where it hangs but doesn't get stopped?
Linux - ServerThis forum is for the discussion of Linux Software used in a server related context.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
How do you deal with server fault where it hangs but doesn't get stopped?
We've some servers in linux and those servers get hang(stuck) but not stopped. So, how can I deal with those servers. It's not clear what's the cause of this stuckness. Any guidance will be appreciated.
Care to provide more information? Like every bit of where you're at, what you've checked, what logs say, what's just happening before, what should happen but isn't etc? This is a server, so you're some class of sysadmin, right?
I don't fancy outlining all the possible reasons why things might get stuck.
Man, server hang-ups are always a headache. I've faced similar issues in the past. If it's not giving you any error messages, it's kinda like flying blind. Have you tried looking at the logs just before the hang occurs? Could help pinpoint if it's a software issue or something else.
Another route is to use monitoring tools to see if it's a resource bottleneck. If all else fails, sometimes a hard reboot might be the only way to get it back online, but I know that's not ideal. Would love to hear how others are tackling this issue.
Man, server hang-ups are always a headache. I've faced similar issues in the past. If it's not giving you any error messages, it's kinda like flying blind. Have you tried looking at the logs just before the hang occurs? Could help pinpoint if it's a software issue or something else.
Another route is to use monitoring tools to see if it's a resource bottleneck. If all else fails, sometimes a hard reboot might be the only way to get it back online, but I know that's not ideal. Would love to hear how others are tackling this issue.
You're a spammer; your spam link in your only other post has been reported. And if you actually read the post here, you'd see they were already asked about logs and to provide more details.
Distribution: Mint 20.1 on workstation, Debian 11 on servers
Posts: 1,338
Rep:
I would find a way to confirm it's not just the network hanging. Reason I say that is I have a server that will randomly lose the entire network stack, but the server itself is still running. My "fix" is a script that runs every 15 minutes that tries to ping a bunch of local servers and then google. If every single one of those fails it will reboot the server. It's an automation server so I can tell it's still running because the automation stuff still happens, it's just that I lose network connectivity to it. It's a really weird issue since it happens maybe once or twice a year only so it's very hard to troubleshoot.
If the whole server is completely hung and it's doing this often enough I would maybe hookup a monitor to it and run something like "dmesg -w" and wait to see if it catches anything in the last moments before it hangs. That's assuming it's not crashing to the point that it loses display.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.