SlackwareThis Forum is for the discussion of Slackware Linux.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
I am seeing random boot failures on an old Dell Inspiron E1505 laptop with 32-bit Slackware-current. The laptop occasionally crashes after printing "Decompressing Linux" and before printing "Parsing ELF". The crash will repeat until I power-cycle the laptop. I am running a custom 4.9.52 kernel which is essentially the same as Slackware's generic kernel except that it doesn't require an initrd.
I tried adding the "nokaslr" kernel parameter. This didn't cure the crash.
This problem seems specific to 32-bit Linux and the E1505 laptop. 64-bit Slackware-current runs fine on all my 64-bit machines.
Ed
Presuming you've no initrd, does it boot fine with the slackware-huge kernel? If not, we're looking like a memory thing. Get memtest86 and run it. If so, we're probably looking at a kernel thing. I'm running on kernel version 4.9.45-dec5, because 4.93.45-dec1-4 were bummers.
I didn't notice any problems with the Slackware huge kernel, but I booted it only during installation. The boot failure on the custom kernel occurs maybe once every five boots. It occurs so early that there aren't any clues left for debugging.
Ed
I note that success or failure of booting is determined at power-on. This is looking like an initialization problem. The x86 boot code in /usr/src/linux/arch/x86/boot/compressed has changed recently. I bet there hasn't been much testing on 32-bit x86 hardware, which by now is over a decade old.
Ed
What is the GPU in your E1505? I am asking because apparently that model may use an nVidia GeForce Go 7300. I have had some random boot failures after switching server layouts in X with an nVidia GeForce 7300 LE using the nouveau driver. I did not see this when I was using the proprietary nVidia driver. A second reboot seems to be the fix for me.
The GPU is an ATI Radeon X1300. I use the open-source radeon driver. Coincidentally, the Radeon X1300 also has an initialization problem - every few boots, the display shows pixel artifacts after I start X Windows. This GPU has always had that problem. I don't believe it is related to the boot failure, which occurs in the kernel's bootloader.
Ed
I tried two more custom kernels, neither of which eliminated the crash.
* The first changed CONFIG_KERNEL_LZMA to CONFIG_KERNEL_GZIP.
* The second changed CONFIG_CC_STACKPROTECTOR_REGULAR to CONFIG_CC_STACKPROTECTOR_NONE.
The 32-bit kernel being built without CONFIG_RELOCATABLE avoids a lot of potential problems.
Sometimes the failure will occur slightly after "Decompressing Linux". The last message printed is "Booting the kernel". After a failure, powering the laptop off and then on (within two seconds) will result in a successful boot.
Ed
I asked about the slackware-huge kernel back in post #2 and I'd still like to know the results from that.
If I were you, I'd open 2 consoles and run 'make menuconfig' on your own config, while viewing the slackware-huge config in a pager. In the section between and including "General Setup" and "Processor Type and Features" I would make them as alike as possible. Obviously, you don't need support for things you don't have, but make the rest alike and try it. Your problem is most likely in there. If it boots, try your 'improvements' a few at a time until it pukes on you. The real weight in the huge kernel is in compiled-in drivers for everything conceivable, not in the section I outlined.
The custom kernels have worked on all versions of 64-bit Slackware, and on all but the most recent versions of 32-bit Slackware. I believe I am encountering a bug, possibly specific to the Dell Inspiron E1505.
Ed
The kernel does a lot of memory testing at the stage you're dying at, so you are probably right.
Your diff is confusing. I'm guessing you're compiling in drivers that the generic kernel has as modules. The best way to show a diff is to use diff -u and show the command line. That way, you get file1 marked with '-' and file2 with '+'. You will need EXT4_FS compiled in to get going if you have ext4 disks. But those changes shouldn't otherwise affect matters.
The modules you need are the modules required to boot; filesystem, motherboard chipset, video, other disk drivers (e.g. sata, raid if applicable). It can then load the usb modules itself.
Yes, the important configuration change is to build-in the ext4 filesystem.
Depopulating either of the two DIMMs did not eliminate the boot crash. However, I didn't see the kernel panic with only one DIMM installed.
At this point, I no longer trust the E1505 hardware. This machine is at the last stage of my PC waterfall, being used only as a music and video player. The next stage would be the recycler.
Ed
1. The extra dimm is 'The straw that broke the camel's back' in your box that trips something from "Just working" to "Just not working" or "Borderline."
2. That something could conceivably be the power supply (dying or overloaded).
If 1 is the case, you might well get out of it by slowing the bus speed down, and try this if you want to hang onto this piece of kit until the bitter and inevitable end. I gather you don't, but you can clear
a. The dimms - probably.
b. The kernel.
and we can stop posting here, and you can mark this solved.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.