SlackwareThis Forum is for the discussion of Slackware Linux.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
Sometime on Wednesday afternoon, Nov. 23rd, my Linux Server ceased function properly, but did not shutdown or reboot. I was no longer able to ssh into it, nor was it resolving DNS (it is the LAN DNS server). The mail port (25) was still open and it was receiving email, but not delivering any. Port 80 was still open. When I finally got to it physically on the following Saturday the console was continuously looping with the 1st line showing "BUG: unable to handle kernel paging request at ffff88030e28fe40". I made a screen-shot of the console - attached.
Some further lines down the output there are the lines:
The "3.10.17" number is the Linux kernel version. "Gigabyte Technology" is the motherboard manufacturer and the "297X-UD5H" is the motherboard product name. "06/17/2014" is the AMI BIOS date, "F8" is the BIOS version number. Not sure any of that is helpful. What might "sendmail Tainted" mean?
Well, this was a custom built machine with components, including mother board, recommended. There was no indication on the MB box that it was specifically for gaming.
What, for example, would you suggest as a Server Grade Motherboard?
You may be right on the non-ECC memory, but how do you know that? Because of the motherboard model?
The issue that Darth Vader is (rudely) implying is that server hardware tends to be more robust, and that usually includes using ECC (error-correcting code) memory. Long story short, it is possible for a neutron from a cosmic ray to flip a memory bit from 0 to 1 or 1 to 0. This is not common, because neutrinos rarely interact with particles on Earth. There are about a trillion neutrinos passing through your hand every second, but collisions only occur once every few years (see this fun xkcd what-if for more detailed information on neutrinos). When one of these neutrinos "collide" with a bit in your RAM, it can have no RAMifications (heh) on the computer or it can bring it to its knees. It all depends on what was contained in that bit of memory. ECC memory is designed to notice this and correct it.
Unfortunately, as far as I know, there's no way to know whether this issue was caused by some random neutrino or some other hardware/software glitch. Many times, when this occurs due to a neutrino, a quick reboot and you're back to normal and won't experience it again. There's really not much you can do to protect yourself from those types of situations without getting server-grade hardware that uses ECC memory. If this issue starts happening frequently, it could be a sign your hardware is having issues and would be unrelated to neutrinos.
Personally, as long as a machine isn't considered critical, you probably don't need to spend the extra money for server-grade components. I have been happily using my normal desktop rig as a 24/7 server for many years (both within my LAN and out to the internet), but if my machine goes down, only me, my wife, and a few friends would be affected. I also have an htpc running kodi that runs 24/7 that can act as a media server to my mobile phone. I haven't had issues with each, and, to me, neither computer is worth the cost of server-grade hardware based on what they provide. If you're running a major business, you could lose a lot of money in a quick period of time if your desktop turned server goes out, so it's typically beneficial to pay the extra money for server-grade hardware when that computer will be providing a critical function (which would be up to you and your company to decide if it is worth it).
TL;DNR: Don't worry about it unless it happens frequently. Then start testing your hardware because there might be a bad component.
My best guess would be that some piece of your hardware had a hiccup. Like bassmadrigal said you probably don't need to worry about it unless it starts happening more frequently. You could run diagnostic tests, like a memtest, stressing the CPU, checking the SMART status of your HDD, test your power supply, etc. But you will need some downtime to do all that.
Personally, I think that it is the MoBo. Gigabyte IMO is junk. I have a friend that cannot give his Gigabyte laptop away. I have had nothing but issues with their MoBo's. But that is purely a personal opinion and speculation.
My favorite right now is a old ASUS M2NPV-VM MoBo that I have running my personal web facing server. This thing is a workhorse. No ECC-RAM or anthing "server-grade" in it. And it's pretty minimal, MoBo, HDD, PSU, DVD, NIC.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.