LinuxQuestions.org
Share your knowledge at the LQ Wiki.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Newbie
User Name
Password
Linux - Newbie This Linux forum is for members that are new to Linux.
Just starting out and have a question? If it is not in the man pages or the how-to's this is the place!

Notices


Reply
  Search this Thread
Old 12-31-2021, 09:17 PM   #1
PsychoHermit
Member
 
Registered: Aug 2021
Location: Carson City Nevada
Distribution: Debian Testing
Posts: 136

Rep: Reputation: Disabled
Hard drive failing


Hi Folks,

it looks like my hard drive is on it's last legs. I didn't want to spend the money but it looks like I get to upgrade to an SSD.
Code:
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   100   099   062    Pre-fail  Always       -       0
  2 Throughput_Performance  0x0025   100   100   040    Pre-fail  Offline      -       0
  3 Spin_Up_Time            0x0023   168   100   033    Pre-fail  Always       -       1
  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       3201
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002f   100   100   067    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0025   100   100   040    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0032   080   080   000    Old_age   Always       -       8950
 10 Spin_Retry_Count        0x0033   100   100   060    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       3200
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0033   100   100   097    Pre-fail  Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       592709156864
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       81606148102
190 Airflow_Temperature_Cel 0x0022   070   049   045    Old_age   Always       -       30 (Min/Max 20/31)
191 G-Sense_Error_Rate      0x0032   082   082   000    Old_age   Always       -       4689
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       851981
193 Load_Cycle_Count        0x0032   059   059   000    Old_age   Always       -       419500
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0036   100   100   000    Old_age   Always       -       0
223 Load_Retry_Count        0x002a   100   100   000    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged
--glenn
 
Old 12-31-2021, 10:22 PM   #2
rknichols
Senior Member
 
Registered: Aug 2009
Distribution: Rocky Linux
Posts: 4,776

Rep: Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212
I don't see anything wrong in those attributes. The Load_Cycle_Count and Power-Off_Retract_Count are awfully high, but so far they don't seem to have hurt anything. Having the drive power cycling every 90 seconds or so (851981 times in just 8950 hours) seems like a configuration problem.
 
Old 12-31-2021, 10:39 PM   #3
computersavvy
Senior Member
 
Registered: Aug 2016
Posts: 3,342

Rep: Reputation: 1484Reputation: 1484Reputation: 1484Reputation: 1484Reputation: 1484Reputation: 1484Reputation: 1484Reputation: 1484Reputation: 1484Reputation: 1484
Smartctl is usually able to report a lot more if you were to use
Code:
sudo smartctl -a /dev/sdX
Unless you are certain it is failing (i.e. getting a lot of corruption that requires using fsck or similar) I would dig deeper before you just automatically replace it.

With that said, the output of the smartctl command above is a better judge of the status than the little bit you posted here.

Also remember that a backup is always recommended just in case of catastrophic failure.

Last edited by computersavvy; 12-31-2021 at 10:40 PM.
 
Old 12-31-2021, 11:01 PM   #4
syg00
LQ Veteran
 
Registered: Aug 2003
Location: Australia
Distribution: Lots ...
Posts: 21,120

Rep: Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120
Well, I certainly wouldn't be happy with those numbers for 187, 188.
 
Old 12-31-2021, 11:14 PM   #5
mrmazda
LQ Guru
 
Registered: Aug 2016
Location: SE USA
Distribution: openSUSE 24/7; Debian, Knoppix, Mageia, Fedora, others
Posts: 5,799
Blog Entries: 1

Rep: Reputation: 2066Reputation: 2066Reputation: 2066Reputation: 2066Reputation: 2066Reputation: 2066Reputation: 2066Reputation: 2066Reputation: 2066Reputation: 2066Reputation: 2066
Quote:
Originally Posted by PsychoHermit View Post
Hi Folks,

it looks like my hard drive is on it's last legs. I didn't want to spend the money but it looks like I get to upgrade to an SSD.
Code:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always       -       0
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   100   100   000    Old_age   Always       -       0
From these I see no reason to expect soon failure, especially with such low power on hours. Don't be afraid to update to SSD anyway. Unless your PC is running the disk on an old SATA-1 controller, upgrading to SSD should be worth it for the speed increase. This HDD looks like a great candidate for backing up your SSD.
 
Old 01-01-2022, 03:49 AM   #6
lvm_
Member
 
Registered: Jul 2020
Posts: 912

Rep: Reputation: 314Reputation: 314Reputation: 314Reputation: 314
Quote:
Originally Posted by syg00 View Post
Well, I certainly wouldn't be happy with those numbers for 187, 188.
Nothing wrong with those - you should check normalised, not raw values. It is probably a seagate drive - they always have insane raw values. And if it is, it should be replaced even if it looks perfectly healthy as this one does.
 
1 members found this post helpful.
Old 01-01-2022, 08:40 AM   #7
rknichols
Senior Member
 
Registered: Aug 2009
Distribution: Rocky Linux
Posts: 4,776

Rep: Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212Reputation: 2212
Quote:
Originally Posted by syg00 View Post
Well, I certainly wouldn't be happy with those numbers for 187, 188.
Convert those values to hex and look at the low-order bits. Seagate drives (I'm guessing this is a Seagate hybrid drive) typically have some raw values for which the low-order bits are the actual exception count and the higher-order bits are the number of operations.
 
1 members found this post helpful.
Old 01-01-2022, 10:59 AM   #8
smallpond
Senior Member
 
Registered: Feb 2011
Location: Massachusetts, USA
Distribution: Fedora
Posts: 4,138

Rep: Reputation: 1263Reputation: 1263Reputation: 1263Reputation: 1263Reputation: 1263Reputation: 1263Reputation: 1263Reputation: 1263Reputation: 1263
Unless you're really into parsing low level details, you can just ask smartctl for its evaluation with -H

Code:
sudo smartctl  -H /dev/nvme0n1
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.15.4-101.fc34.x86_64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
 
Old 01-01-2022, 11:26 AM   #9
rclark
Member
 
Registered: Jul 2008
Location: Montana USA
Distribution: KUbuntu, Fedora (KDE), PI OS
Posts: 476

Rep: Reputation: 178Reputation: 178
Looks ok. But hey ... any excuse to spend $100 on an SSD sounds good to me . My spinning rust storage devices are now only for backups. All our laptops, desktops, and a data server our running on SSDs.
 
Old 01-01-2022, 04:55 PM   #10
syg00
LQ Veteran
 
Registered: Aug 2003
Location: Australia
Distribution: Lots ...
Posts: 21,120

Rep: Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120Reputation: 4120
Never too late to learn - thanks for the education.
 
Old 01-01-2022, 08:23 PM   #11
PsychoHermit
Member
 
Registered: Aug 2021
Location: Carson City Nevada
Distribution: Debian Testing
Posts: 136

Original Poster
Rep: Reputation: Disabled
I guess I will hold off replacing the drive and see what develops. It may keep working for quite some time.

Thanks,
--glenn
 
Old 01-03-2022, 09:33 AM   #12
sundialsvcs
LQ Guru
 
Registered: Feb 2004
Location: SE Tennessee, USA
Distribution: Gentoo, LFS
Posts: 10,642
Blog Entries: 4

Rep: Reputation: 3933Reputation: 3933Reputation: 3933Reputation: 3933Reputation: 3933Reputation: 3933Reputation: 3933Reputation: 3933Reputation: 3933Reputation: 3933Reputation: 3933
If you think that a drive might be headed for failure, get rid of the damned thing.

SSD hard drives, both internal and external, are insanely-big and no longer expensive. I have several external drives attached to all of my computers, for continuous backups and other purposes.

If you are using LVM = Logical Volume Management, as you should be, you can actually migrate all of the data off the failing drive and onto the new one automagically ... and without downtime.

Last edited by sundialsvcs; 01-03-2022 at 09:35 AM.
 
1 members found this post helpful.
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Hard driving failing then working then failing again : SMART says it is OK BW-userx Linux - Hardware 4 10-20-2016 07:14 PM
I want to copy my hard drive to a larger hard drive and boot from the new drive. lpmorgan1 Ubuntu 8 09-07-2009 11:26 PM
my hard drive or my optical drive is failing fatblueduck Linux - Hardware 7 01-28-2006 08:30 PM
hard drive failing, how can I create an image of my drive? oily_rags SUSE / openSUSE 6 07-07-2005 02:19 PM
X failing to load; AddScreen/InitScreen failing for driver 0 weblion Linux - Software 1 08-01-2002 06:14 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Newbie

All times are GMT -5. The time now is 06:09 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration