Directory has become inaccessible
I was downloading mail from all my mailboxes when the email application crashed just after downloading mail from one of the mailboxes.
Actually, the email application froze so I had to kill it. I ran it again and checked the mailboxes again, and the email application froze again. And again. And again. Upon investigation, I found that the directory where the messages of that mailbox are stored is inaccessible: Code:
# /home/luc/Mail> ls -ls gmail1 Note the #, i.e. it is inaccessible even as root. SpaceFM (file manager) can open gmail1, but not gmail2. It says it's opening, but it takes forever and I give up. But right-clicking those directories and selecting Properties seems to work: gmail1: 540MB, 11664 files, 14 folders gmail2: 463MB, 6249 files, 7 folders I also tried PCManFM and had the same results. What is happening to that directory and how can I fix it? TIA |
Anything interesting in "dmesg" after "ls" gets killed?
|
Quote:
Code:
[35335.564211] BUG: unable to handle kernel paging request at 000000000003f8ca |
Uh, wow. I'd certainly say that counts as interesting...
I'd definitely run a memtest after seeing this, and not try to do any more debugging until I've actually ruled out bad RAM. |
What kind of test would you run and, assuming memory is bad, why does it affect that directory only?
|
https://www.memtest86.com/
This was the standard test for faulty RAM last time I checked, which admittedly was a while ago. |
assuming memory is bad, why does it affect that directory only?
|
Quote:
If it turns out that your memory is good, then "why does that it affect that directory only" becomes extremely interesting. |
You also need to scan the hard drive for bad sectors/bad blocks. That's definitely another potential cause, and it's more directly relevant to "why is this directory the only one affected?"
I'll just take the liberty of posting one link: https://www.tecmint.com/check-linux-...rs-bad-blocks/ This is much more likely to be a hardware issue than to be a kernel bug (which, really, is the alternative explanation). |
Thanks. I am very familiar with badblocks. I don't like it because I once bought a new hard disk and decided to check it with badblocks which accused about thirty-odd bad blocks. I had the disk returned/replaced and the new one had many bad blocks too! The vendor refused to replace it again and I sucked it up, but ended up using that hard disk for more than ten years without a single problem. I still have it and it works. I just hardly ever use it anymore because I bought much larger disks and outgrew it.
Now, I just rebooted and the directory is working normally. The mail application isn't freezing anymore either, all the messages are there. Nothing turns up in dmesg either. The problem seems to be gone. I should just point out that the machine froze when I issued the reboot command and I had to hard reset it. I ran memtest and no error was detect. Then I finally logged in again and everything seems normal. I have no idea what happened. |
With a bad disk block you should get an I/O error message. But you got a paging error, that has to do with virtual memory, for example bad RAM.
Do you have a swapfile? Was it manipulated while in use by the kernel? Code:
free -m |
Quote:
|
Quote:
|
Then I suspect a bug in the kernel, most likely the driver for your disk.
Look for updates! Last but not least, there are possible hardware faults like a distortion on a power line, leading to random corruptions... Once I met a series of hard disks with faulty embedded SRAM cache. Very nasty, all types of hangings, malfunctions, corruptions occurred. Finally we detected a bit flip in a corrupted data file. Contacted the vendor: the bit flip error was already suspected and examined. We got new disks :) |
There are a lot of hardware issues that could have caused this. Overheating is another possibility.
|
All times are GMT -5. The time now is 12:04 PM. |