LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - General (https://www.linuxquestions.org/questions/linux-general-1/)
-   -   My Desktop PC occasionally freezes yet the panel clock seconds keep ticking. (https://www.linuxquestions.org/questions/linux-general-1/my-desktop-pc-occasionally-freezes-yet-the-panel-clock-seconds-keep-ticking-4175675482/)

linustalman 05-18-2020 01:48 PM

My Desktop PC occasionally freezes yet the panel clock seconds keep ticking.
 
Hi.

My 2013 machine makes a sound like it's about to shut off. It did power off once about 3 weeks ago for no apparent reason.

It freezes for up to 30 seconds and then comes back. Some days this never happens. Other days like today, it can occur multiple times.

In GNOME Disks Utility, the HDD looks ok. I tested the RAM with memtest86+ and it's was ok. Could it be a PSU issue? Is there a way in Debian to find out? Is '/var/log/syslog' the best place to look?

I'm using the Nouveau drivers and kernel 5.4.0-0.bpo.4-amd64. I'm not sure if this is a hardware or software issue.

Thanks.

dc.901 05-18-2020 01:52 PM

I would also look at sensors (lm_sensors package): https://packages.debian.org/sid/lm-sensors
And, some BIOS have area to show hardware events (even shows when system powers off/on).

ondoho 05-18-2020 02:02 PM

Quote:

Originally Posted by linustalman (Post 6124680)
My 2013 machine makes a sound like it's about to shut off.

My machines make no sound at all when they're about to shut off.
Do you mean the sound it makes when it actually shuts off?
Then that's probably what's happening.

sevendogsbsd 05-18-2020 02:19 PM

I like the lm_sensors idea - maybe it is a heat issue? How long since you've cleaned out dust bunnies? Or is this a laptop and you can't...

jefro 05-18-2020 03:59 PM

If the clock keeps time then the system is really working. The Desktop if having an issue. I'd start with top htop or other system metric to see more.

Beep might mean some keyboard buffer full?

linustalman 05-20-2020 04:16 AM

When it freezes for a short while, the sound that the PC makes sounds like it comes from the DVD/CD drive.

linustalman 05-20-2020 04:19 AM

Quote:

Originally Posted by dc.901 (Post 6124685)
I would also look at sensors (lm_sensors package): https://packages.debian.org/sid/lm-sensors
And, some BIOS have area to show hardware events (even shows when system powers off/on).

Hi dc.901.

Here's the output for:

Code:

sensors
Code:

acpitz-acpi-0
Adapter: ACPI interface
temp1:        +27.8°C  (crit = +106.0°C)
temp2:        +29.8°C  (crit = +106.0°C)

coretemp-isa-0000
Adapter: ISA adapter
Package id 0:  +43.0°C  (high = +85.0°C, crit = +105.0°C)
Core 0:        +43.0°C  (high = +85.0°C, crit = +105.0°C)
Core 1:        +40.0°C  (high = +85.0°C, crit = +105.0°C)
Core 2:        +37.0°C  (high = +85.0°C, crit = +105.0°C)
Core 3:        +41.0°C  (high = +85.0°C, crit = +105.0°C)

nouveau-pci-0100
Adapter: PCI adapter
GPU core:    +0.97 V  (min =  +0.84 V, max =  +1.16 V)
temp1:        +30.0°C  (high = +95.0°C, hyst =  +3.0°C)
                      (crit = +105.0°C, hyst =  +5.0°C)
                      (emerg = +135.0°C, hyst =  +5.0°C)


linustalman 05-20-2020 04:21 AM

Quote:

Originally Posted by ondoho (Post 6124689)
My machines make no sound at all when they're about to shut off.
Do you mean the sound it makes when it actually shuts off?
Then that's probably what's happening.

Hi ondoho. No, it's not the same sound.

linustalman 05-20-2020 04:22 AM

Quote:

Originally Posted by sevendogsbsd (Post 6124697)
I like the lm_sensors idea - maybe it is a heat issue? How long since you've cleaned out dust bunnies? Or is this a laptop and you can't...

Hi sevendogsbsd. It's a Desktop PC. I cleaned it out in March/April.

linustalman 05-20-2020 04:22 AM

Quote:

Originally Posted by jefro (Post 6124725)
If the clock keeps time then the system is really working. The Desktop if having an issue. I'd start with top htop or other system metric to see more.

Beep might mean some keyboard buffer full?

Hi jefro. There's no beep.

sevendogsbsd 05-20-2020 06:23 AM

Temps look OK to me so probably not heat related.

dc.901 05-20-2020 06:34 AM

Take a look at dmidecode output; it can be a lot, so this guide will help:
https://www.cyberciti.biz/tips/query...nd-prompt.html

For powersupply; compare the output below with actual specs of PSU, do they match:
dmidecode -t 39

And see if there is IPMI device:
dmidecode -t 38

There is system event log, but it may not be easy to read:
dmidecode -t 15

So, if there is IPMI device, it will be much easier to read hardware layer system event log, but I will be surprised if it is there on a desktop.

linustalman 05-20-2020 09:07 AM

Quote:

Originally Posted by dc.901 (Post 6125285)
Take a look at dmidecode output; it can be a lot, so this guide will help:
https://www.cyberciti.biz/tips/query...nd-prompt.html

For powersupply; compare the output below with actual specs of PSU, do they match:
dmidecode -t 39

And see if there is IPMI device:
dmidecode -t 38

There is system event log, but it may not be easy to read:
dmidecode -t 15

So, if there is IPMI device, it will be much easier to read hardware layer system event log, but I will be surprised if it is there on a desktop.

Code:

sudo  dmidecode -t 39
Code:

# dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 2.7 present.

Handle 0x0056, DMI type 39, 22 bytes
System Power Supply
        Power Unit Group: 1
        Location: To Be Filled By O.E.M.
        Name: To Be Filled By O.E.M.
        Manufacturer: To Be Filled By O.E.M.
        Serial Number: To Be Filled By O.E.M.
        Asset Tag: To Be Filled By O.E.M.
        Model Part Number: To Be Filled By O.E.M.
        Revision: To Be Filled By O.E.M.
        Max Power Capacity: Unknown
        Status: Present, OK
        Type: Switching
        Input Voltage Range Switching: Auto-switch
        Plugged: Yes
        Hot Replaceable: No
        Input Voltage Probe Handle: 0x0052
        Cooling Device Handle: 0x0054
        Input Current Probe Handle: 0x0055

Code:

sudo dmidecode -t 38
Code:

# dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 2.7 present.


Code:

sudo dmidecode -t 15
Code:

# dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 2.7 present.


linustalman 05-21-2020 01:43 AM

I disconnected the DVD/CD drive cables to see if that would stop the freezes but they still happen. I think the sound could be from the PSU or HDD - sounds a bit mechanical.

linustalman 05-21-2020 06:16 AM

I've noticed lines with kernel in syslog around the time of the freezes. I wonder if I should revert to the original non-backport kernel? Could the backport kernel (5.4.0-0.bpo.4-amd64) be the issue?

allend 05-21-2020 08:12 AM

In my experience, this can be a symptom of a failing HDD.

linustalman 05-21-2020 08:21 AM

1 Attachment(s)
Quote:

Originally Posted by allend (Post 6125671)
In my experience, this can be a symptom of a failing HDD.

Hi allend. In GNOME Disks, it seems ok. See attached image.

linustalman 05-22-2020 06:27 AM

I'm currently running this:

Code:

sudo badblocks -v /dev/sda5 > badsectors.txt
Checking blocks 0 to 1953263615
Checking for bad blocks (read-only test):

Anyone know roughly how long should it take on a 2TB SATA 7200 HDD?

linustalman 05-30-2020 12:05 PM

I'd say it's the HDD alright but cannot confirm.

linustalman 05-31-2020 03:35 AM

Some days my PC freezes a few times a day - once even during bootup recently. Yesterday, there were no freezes. As of typing this, 1h45m uptime - no freezes either today.

linustalman 05-31-2020 08:47 AM

Quote:

Originally Posted by allend (Post 6125671)
In my experience, this can be a symptom of a failing HDD.

I reckon you're right there, allend.

linustalman 06-01-2020 01:39 PM

The freezes usually happen when the system is under a heavy load.

linustalman 06-22-2020 01:16 AM

No freeze for a couple of days now.

jmccue 07-03-2020 07:45 AM

If the seconds are still going and you have another PC/Laptop. Try and ssh to the 'hung' machine and see if it is active. I think it is still active and once in you can poke around.

My guess, the video card is having an issues.

dugan 07-03-2020 12:42 PM

Next time this happens, ssh into it when it's frozen. Check the usual logs (/var/log/messages, /var/log/Xorg.0.log or ~/.local/share/xorg/Xorg.0.log, dmesg -t, any journalctl stuff that might be relevant, etc).

In my experiences, these freezes usually mean that the desktop is stuck on a READ operation that's taking an unexpectedly long time. So it can be a failing hard drive. I've also seen it in cases where the home directories are network mounts (I assume that's not your setup?), and I've personally seen Flash lock up X like this (when I forced the "experimental" 3D acceleration on).

linustalman 08-16-2020 06:51 AM

It's been ages since the PC froze. 😕

linustalman 10-24-2020 02:47 AM

2 Attachment(s)
Yestereve, I saw 16 bad sectors in GNOME Disks.

Edit: Today, it says '8' [see attached image]. 🤔

Another edit: It now mentions no bad sectors [see 2nd attached image]. 😕

linustalman 10-31-2020 04:04 AM

Today, I heard my PC beep. Then I noticed that it was a read-only file system. I did a reboot and was met with a console. I ran 'fsck path_to_my_luks_partition' and exited to reboot. All is ok again. However, I've the feeling this is a striking warning.

linustalman 11-01-2020 03:15 AM

GNOME Disks now says 8 bad sectors.

beachboy2 11-01-2020 03:54 AM

linustalman,

Install gsmartcontrol:

Code:

sudo apt install gsmartcontrol
Right click on drive > Perform test > Extended self-test (30 minutes).

Read the output and if items #5 or # 197 are not zero, then you have a problem with that drive.

linustalman 11-01-2020 08:43 AM

2 Attachment(s)
Quote:

Originally Posted by beachboy2 (Post 6180653)
linustalman,

Install gsmartcontrol:

Code:

sudo apt install gsmartcontrol
Right click on drive > Perform test > Extended self-test (30 minutes).

Read the output and if items #5 or # 197 are not zero, then you have a problem with that drive.

Hi beachboy2.

#197 had 8 - just as GNOME Disks said. Please find attached 2 screenshots re GSmartControl.

Thanks.

beachboy2 11-01-2020 09:11 AM

linustalman,

As I expected, your drive is about to fail.

Get your data off it pronto and then replace it.

linustalman 11-01-2020 09:13 AM

Quote:

Originally Posted by beachboy2 (Post 6180725)
linustalman,

As I expected, your drive is about to fail.

Get your data off it pronto and then replace it.

Thank you, beachboy2. I will do so. 👍🏻

rnturn 11-05-2020 03:30 PM

Quote:

Originally Posted by linustalman (Post 6126011)
I'm currently running this:

Code:

sudo badblocks -v /dev/sda5 > badsectors.txt
Checking blocks 0 to 1953263615
Checking for bad blocks (read-only test):

Anyone know roughly how long should it take on a 2TB SATA 7200 HDD?

Not exactly but it can take quite a while. If you think you need to do this, kick it off and find something else to do for a few hours.

Checking a drive of that size takes long enough that I wound up creating a cron job that runs in the wee hours on Sundays to handle fscking the disks in an external USB drive cabinet containing three 2TB drives (it hits one drive per week). This has kept me from being blind-sided by a forced fsck should I be forced to reboot in the middle of the day.

Cheers...

ondoho 11-06-2020 12:51 AM

^ yes but isn't badblocks much more time-consuming than a (standard, default, quick) fsck?
Anyhow, my first reaction when seeing 2TB on spinning rust: start the process, come back tomorrrow...

also, I think the process can be further slowed down by bottlenecks in RAM and CPU speed

linustalman 11-08-2020 02:39 AM

4 Attachment(s)
Hello again, ondoho.

---

One day no bad sectors are detected in GNOME Disks, then 16, then none, then 112, now none again. That's very odd.

ondoho 11-08-2020 06:36 AM

^ Oh, I think it has been very clear for a while that your storage is dying.
Posting screenshots & asking questions won't change that.

linustalman 11-08-2020 07:46 AM

Quote:

Originally Posted by ondoho (Post 6183262)
^ Oh, I think it has been very clear for a while that your storage is dying.
Posting screenshots & asking questions won't change that.

Can you answer my query from post #36?

linustalman 11-14-2020 12:50 PM

1 Attachment(s)
I finally moved to a new HDD - another 2TB Seagate. No freezes so far, so it was likely the old HDD was the only cause.

linustalman 12-12-2020 01:48 AM

Bad sectors must have been the issue with the old install.


All times are GMT -5. The time now is 03:35 AM.