LinuxQuestions.org
Help answer threads with 0 replies.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware
User Name
Password
Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?

Notices


Reply
  Search this Thread
Old 11-23-2017, 05:53 AM   #1
KnutBluetooth
LQ Newbie
 
Registered: Jul 2011
Distribution: Arch Linux
Posts: 24

Rep: Reputation: Disabled
Weird problems with SSD


I recently built myself a new router. I'm using a 60GB Drevo X1 SSD for the OS (Arch). /var, /home and everything else that needs to be written to regularly is on a btrfs RAID1 array with regular HDDs. I've got this weird problem that after a week or two, the SSD stops working. I can see I/O errors (print_req_error: I/O error, dev sda) in the journal after I reboot the router. It seems the system can't access the SSD anymore for some reason. If I push the reset button, the BIOS won't find the SSD anymore. But when I totally it power off and restart it, then the SSD is found again and everything works as expected for a week or two. There doesn't seem to be anything wrong with the smartctl -x output. The HDDs in the btrfs RAID1 array don't have this problem. Not sure what I should try before ordering a new SATA cable.
 
Old 11-23-2017, 08:51 AM   #2
business_kid
LQ Guru
 
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 16,475

Rep: Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354
Have you tried unmounting it, & remounting it? If it's running off a module in some initrd maybe unload that before reloading.
 
Old 11-23-2017, 10:16 AM   #3
KnutBluetooth
LQ Newbie
 
Registered: Jul 2011
Distribution: Arch Linux
Posts: 24

Original Poster
Rep: Reputation: Disabled
I can't do that. The OS is on this drive. At that point, when it locks up, anything that isn't in memory or on the RAID array can't be read anymore. I can't even log into it from anywhere since that would mean running /bin/login.
 
Old 11-23-2017, 11:18 AM   #4
business_kid
LQ Guru
 
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 16,475

Rep: Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354Reputation: 2354
Then Your three fingered salute is probably the way to go until you ready replacement hardware, which I presume you will do.

On a side note, I'm probably old-fashioned but I like to have essential modules for booting (hard disk, motherboard chipset, and filesystem) compiled into the kernel, so an initrd is not needed. It's just one more set of hoops you don't have to jump through.
 
Old 11-23-2017, 11:43 AM   #5
KnutBluetooth
LQ Newbie
 
Registered: Jul 2011
Distribution: Arch Linux
Posts: 24

Original Poster
Rep: Reputation: Disabled
I'm definitely not ready yet to get any replacement hardware as all the hardware is brand new. I'm probably going to get some more SATA cables and try plugging the SSD into another SATA port to see how that goes if nothing else works. I guess you're thinking that a kernel module (libata and ahci here) dies after a while and isn't reloaded for some reason? But I don't see how that wouldn't also kill access to the btrfs RAID1 array. I know it's still working cause systemd-journald is still happily writing to /var on it and I can still access nginx and NFS shares that serve stuff from it. So I highly doubt that's the problem here. At first I was thinking that the drive might have a problem with fstrim or discard. But fstrim is set up to run every week and I didn't get the problem for over a week. So I don't think it's that. I tried also removing the discard flag in fstab for the UEFI vfat /boot partition. But that didn't help either. What I'd like to know is how to enable more debugging info to be sent to systemd-journald so I'll get more info on what's happening the next time it locks up.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
very weird situations with SSD drives tripialos Linux - Hardware 3 08-03-2013 08:53 AM
Weird SSD Issue SiriusStarr Linux - Hardware 9 06-30-2011 06:40 PM
Problems with a new SSD frode Linux - Hardware 8 04-10-2011 01:40 PM
weird, weird problems with logitech precision USB gamepad ikataii Linux - Hardware 4 10-14-2005 04:31 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware

All times are GMT -5. The time now is 06:24 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration