Linux - General This Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place. |
Notices |
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
Are you new to LinuxQuestions.org? Visit the following links:
Site Howto |
Site FAQ |
Sitemap |
Register Now
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
|
|
|
10-03-2014, 01:20 AM
|
#1
|
Member
Registered: Mar 2009
Distribution: CentOS 6.5 / 7
Posts: 119
Rep:
|
CentOS7 fails to boot: "Failed to mount /sysroot"
This was a fully functioning server up until this afternoon. Got a strange error about something tieing up the processor while in a ssh session. Could not get any commands though. Had my GF shut it down (power off) as there was nothing I could do remotely.
When I turned it back on it looked like it was booting normally but then I received an error message "Failed to mount /sysroot" and I got dropped into an emergency terminal. There is a log file it created. Tried to cat it but way too big. just see the trailing end where it starts failing on /sysroot. Can't retrieve the log to show here (yet) cause I left my rescue USB at work (not my brightest moment). I will have the logs to post tomorrow evening
I am going to work on it more in depth tomorrow, however if you guys could give me some ideas of what could be going on and where to look I would be thankful. I have never seen an error like that before.
|
|
|
10-03-2014, 07:13 PM
|
#2
|
Senior Member
Registered: Mar 2012
Distribution: Red Hat
Posts: 1,604
|
This could potentially be a problem with the RAM.
This sounds like it is using intird to boot the system which attempts to mount /sysroot to your RAM and then look for the fstab as well as other steps. I am not an expert, nor do I want to be, on systemd but it sounds like that is where the hangup is happening.
If you cant get any other diagnostic info might be worth it to use just a single stick of known-good RAM and attempt to boot.
And my personal rant:
RHEL 6 FOR LIFE! BOYCOTT SYSTEMD!
|
|
|
10-03-2014, 07:34 PM
|
#3
|
Member
Registered: Mar 2009
Distribution: CentOS 6.5 / 7
Posts: 119
Original Poster
Rep:
|
Quote:
Originally Posted by Kustom42
This could potentially be a problem with the RAM.
This sounds like it is using intird to boot the system which attempts to mount /sysroot to your RAM and then look for the fstab as well as other steps. I am not an expert, nor do I want to be, on systemd but it sounds like that is where the hangup is happening.
If you cant get any other diagnostic info might be worth it to use just a single stick of known-good RAM and attempt to boot.
And my personal rant:
RHEL 6 FOR LIFE! BOYCOTT SYSTEMD!
|
thanks for the advice. I attempted to boot with a stick from another pc: same issue
I am getting the logs now. Might take me a bit. Haven't figured out how I am going to do this yet. Tempted to take the whole drive out and boot it in a VM to rule out any hardware issues. Anyway, rescue live being loaded onto a usb stick now...
EDIT: I have ruled Hardware Fault in RAM. SystemRescueCD (or USB in this case) is running completely from the RAM with no issues
EDIT: Having an issue mounting the BTRFS partition from inside SRCD. The exact error I am getting is: "couldn't mount because of unsupported optional features (40)"
Last edited by thealmightyos; 10-03-2014 at 08:33 PM.
|
|
|
10-03-2014, 07:53 PM
|
#4
|
LQ Guru
Registered: Dec 2011
Distribution: Slackware, Debian 12, Devuan & MX Linux
Posts: 9,528
|
If your suspicious that the RAM is bad run Memtest overnight and see if it passes.
http://www.memtest.org/
|
|
|
10-03-2014, 09:12 PM
|
#5
|
Member
Registered: Mar 2009
Distribution: CentOS 6.5 / 7
Posts: 119
Original Poster
Rep:
|
Ram seems ok, but I do plan on running memtest tonight. Thank you for reminding me I don't have to download the whole UBCD for memtest lol.
I was finally able to grab that log file. I am posting it here as I am reading it. Have not gone over it all yet.
EDIT: It almost seems like it is having issues mounting that BTRFS partition. Any utilities that can check that type of file-system for errors? (asking this pre-google-foo)
EDIT: Post-google-foo: https://btrfs.wiki.kernel.org/index....ge/btrfs-check
EDIT: Excerpt from the log:
Code:
[ 2.544512] Apollo kernel: ------------[ cut here ]------------
[ 2.544553] Apollo kernel: WARNING: at fs/btrfs/inode.c:847 cow_file_range+0x438/0x450 [btrfs]()
[ 2.544555] Apollo kernel: Modules linked in: btrfs xor zlib_deflate raid6_pq libcrc32c sd_mod sr_mod crc_t10dif cdrom crct10dif_common ata_generic pata_acpi radeon ata_piix i2c_algo_bit drm_kms_helper ttm libata drm r8169 mii i2c_core
[ 2.544582] Apollo kernel: CPU: 1 PID: 271 Comm: mount Not tainted 3.10.0-123.8.1.el7.x86_64 #1
[ 2.544583] Apollo kernel: Hardware name: MSI MS-7236/MS-7236, BIOS V12.3 07/17/2008
[ 2.544585] Apollo kernel: 0000000000000000 00000000a01e708f ffff880036b71190 ffffffff815e237b
[ 2.544588] Apollo kernel: ffff880036b711c8 ffffffff8105dee1 ffff880036b71510 ffff880036b714fc
[ 2.544590] Apollo kernel: 0000000000000000 000000000003ffff ffff88003cbc01e0 ffff880036b711d8
[ 2.544593] Apollo kernel: Call Trace:
[ 2.544600] Apollo kernel: [<ffffffff815e237b>] dump_stack+0x19/0x1b
[ 2.544604] Apollo kernel: [<ffffffff8105dee1>] warn_slowpath_common+0x61/0x80
[ 2.544607] Apollo kernel: [<ffffffff8105e00a>] warn_slowpath_null+0x1a/0x20
[ 2.544618] Apollo kernel: [<ffffffffa02cf3d8>] cow_file_range+0x438/0x450 [btrfs]
[ 2.544630] Apollo kernel: [<ffffffffa02e2f59>] ? release_extent_buffer+0xa9/0xd0 [btrfs]
[ 2.544643] Apollo kernel: [<ffffffffa02e8e2f>] ? free_extent_buffer+0x4f/0xa0 [btrfs]
[ 2.544654] Apollo kernel: [<ffffffffa02cf74c>] run_delalloc_nocow+0x35c/0xa30 [btrfs]
[ 2.544665] Apollo kernel: [<ffffffffa02d0160>] run_delalloc_range+0x340/0x3a0 [btrfs]
[ 2.544677] Apollo kernel: [<ffffffffa02e5749>] ? find_lock_delalloc_range.constprop.43+0x1c9/0x1f0 [btrfs]
[ 2.544688] Apollo kernel: [<ffffffffa02defb8>] ? btrfs_get_token_64+0x68/0x100 [btrfs]
[ 2.544700] Apollo kernel: [<ffffffffa02e69b4>] __extent_writepage+0x324/0x780 [btrfs]
[ 2.544704] Apollo kernel: [<ffffffff81141e81>] ? find_get_pages_tag+0xe1/0x1a0
[ 2.544716] Apollo kernel: [<ffffffffa02e70a2>] extent_write_cache_pages.isra.30.constprop.48+0x292/0x410 [btrfs]
[ 2.544728] Apollo kernel: [<ffffffffa02e84fc>] extent_writepages+0x5c/0x90 [btrfs]
[ 2.544739] Apollo kernel: [<ffffffffa02ccff0>] ? btrfs_submit_direct+0x6c0/0x6c0 [btrfs]
[ 2.544750] Apollo kernel: [<ffffffffa02ca588>] btrfs_writepages+0x28/0x30 [btrfs]
[ 2.544753] Apollo kernel: [<ffffffff8114d65e>] do_writepages+0x1e/0x40
[ 2.544756] Apollo kernel: [<ffffffff81142ba5>] __filemap_fdatawrite_range+0x65/0x80
[ 2.544758] Apollo kernel: [<ffffffff81142c83>] filemap_fdatawrite_range+0x13/0x20
[ 2.544769] Apollo kernel: [<ffffffffa02e1cd9>] btrfs_wait_ordered_range+0x49/0x140 [btrfs]
[ 2.544781] Apollo kernel: [<ffffffffa0308b52>] __btrfs_write_out_cache+0x6a2/0x8c0 [btrfs]
[ 2.544794] Apollo kernel: [<ffffffffa03090bd>] btrfs_write_out_cache+0x8d/0xe0 [btrfs]
[ 2.544803] Apollo kernel: [<ffffffffa02b7eeb>] btrfs_write_dirty_block_groups+0x55b/0x650 [btrfs]
[ 2.544814] Apollo kernel: [<ffffffffa03351a7>] commit_cowonly_roots+0x15a/0x230 [btrfs]
[ 2.544824] Apollo kernel: [<ffffffffa02c824e>] btrfs_commit_transaction+0x44e/0x9d0 [btrfs]
[ 2.544836] Apollo kernel: [<ffffffffa02e8e2f>] ? free_extent_buffer+0x4f/0xa0 [btrfs]
[ 2.544848] Apollo kernel: [<ffffffffa030692d>] btrfs_recover_log_trees+0x3ed/0x4c0 [btrfs]
[ 2.544860] Apollo kernel: [<ffffffffa0304260>] ? replay_one_extent+0x6d0/0x6d0 [btrfs]
[ 2.544870] Apollo kernel: [<ffffffffa02c58dd>] open_ctree+0x17ed/0x1fb0 [btrfs]
[ 2.544878] Apollo kernel: [<ffffffffa029b9de>] btrfs_mount+0x63e/0x800 [btrfs]
[ 2.544881] Apollo kernel: [<ffffffff81149ec3>] ? free_pages+0x13/0x20
[ 2.544885] Apollo kernel: [<ffffffff8125860d>] ? selinux_sb_copy_data+0x14d/0x220
[ 2.544888] Apollo kernel: [<ffffffff811b3a59>] mount_fs+0x39/0x1b0
[ 2.544891] Apollo kernel: [<ffffffff811ce88f>] vfs_kern_mount+0x5f/0xf0
[ 2.544899] Apollo kernel: [<ffffffffa029b529>] btrfs_mount+0x189/0x800 [btrfs]
[ 2.544901] Apollo kernel: [<ffffffff81149ec3>] ? free_pages+0x13/0x20
[ 2.544904] Apollo kernel: [<ffffffff8125860d>] ? selinux_sb_copy_data+0x14d/0x220
[ 2.544906] Apollo kernel: [<ffffffff811b3a59>] mount_fs+0x39/0x1b0
[ 2.544908] Apollo kernel: [<ffffffff811ce88f>] vfs_kern_mount+0x5f/0xf0
[ 2.544911] Apollo kernel: [<ffffffff811d0c9e>] do_mount+0x24e/0xa30
[ 2.544913] Apollo kernel: [<ffffffff8114626e>] ? __get_free_pages+0xe/0x50
[ 2.544915] Apollo kernel: [<ffffffff811d1516>] SyS_mount+0x96/0xf0
[ 2.544919] Apollo kernel: [<ffffffff815f2a59>] system_call_fastpath+0x16/0x1b
[ 2.544920] Apollo kernel: ---[ end trace d3cbc53aeb04b87c ]---
Last edited by thealmightyos; 10-03-2014 at 11:21 PM.
|
|
|
10-04-2014, 02:42 PM
|
#7
|
Member
Registered: Mar 2009
Distribution: CentOS 6.5 / 7
Posts: 119
Original Poster
Rep:
|
RAM Confirmed OK: 22 passes: 0 Errors
Working the BTRFS repair angle as I am unable to mount that partition by any means. Will also use the script. Will that script run fine from a rescue USB?
There was some activity between the database and a program my co-worker is currently writing when this all started. I am allowing him limited access to the database while he learns Java. He has been working on it for well over a month without incident so I did not think it was a possible cause. Could some communication between his program and MariaDB cause this?
|
|
|
10-04-2014, 08:22 PM
|
#8
|
LQ Guru
Registered: Dec 2011
Distribution: Slackware, Debian 12, Devuan & MX Linux
Posts: 9,528
|
Quote:
Originally Posted by thealmightyos
RAM Confirmed OK: 22 passes: 0 Errors
Working the BTRFS repair angle as I am unable to mount that partition by any means. Will also use the script. Will that script run fine from a rescue USB?
There was some activity between the database and a program my co-worker is currently writing when this all started. I am allowing him limited access to the database while he learns Java. He has been working on it for well over a month without incident so I did not think it was a possible cause. Could some communication between his program and MariaDB cause this?
|
I'm not sure if that script will run from a rescue USB.
When I ran that script it placed the .txt file in my Downloads directory.
I have never come across a program interfering with a data base but anything is possible. What program was your co worker using?
How old is the HDD?
I hope that btrfs check with the repair option works for you.
Was the partition changed or modified in any way? Resized?
Last edited by Ztcoracat; 10-04-2014 at 10:14 PM.
|
|
|
10-04-2014, 08:41 PM
|
#9
|
LQ Guru
Registered: Dec 2011
Distribution: Slackware, Debian 12, Devuan & MX Linux
Posts: 9,528
|
I looked up the 2 warning in the output and I found this report.
Code:
2.544604] Apollo kernel: [<ffffffff8105dee1>] warn_slowpath_common+0x61/0x80
[ 2.544607] Apollo kernel: [<ffffffff8105e00a>] warn_slowpath_null+0x1a/0x20
http://bugs.centos.org/view.php?id=5347
Sorry, I have never seen those warnings before.
|
|
|
10-04-2014, 11:16 PM
|
#10
|
Member
Registered: Mar 2009
Distribution: CentOS 6.5 / 7
Posts: 119
Original Poster
Rep:
|
Quote:
Originally Posted by Ztcoracat
I'm not sure if that script will run from a rescue USB.
When I ran that script it placed the .txt file in my Downloads directory.
I have never come across a program interfering with a data base but anything is possible. What program was your co worker using?
How old is the HDD?
I hope that btrfs check with the repair option works for you.
Was the partition changed or modified in any way? Resized?
|
Hard drive is old, but did pass SMART tests. How old, I can not say without pulling it out and looking for a mfg date. The rig was not mine originally.
The program is of his own design. He is learning Java and wanted to learn how to access and write to a database. Very basic.
I really hope it works too. Have not had a chance to work on it today. I can only hope I get some time tomorrow.
Partition was not modified. I leave that sort of thing alone
|
|
|
10-05-2014, 07:22 PM
|
#11
|
LQ Guru
Registered: Dec 2011
Distribution: Slackware, Debian 12, Devuan & MX Linux
Posts: 9,528
|
Quote:
Originally Posted by thealmightyos
Hard drive is old, but did pass SMART tests. How old, I can not say without pulling it out and looking for a mfg date. The rig was not mine originally.
The program is of his own design. He is learning Java and wanted to learn how to access and write to a database. Very basic.
I really hope it works too. Have not had a chance to work on it today. I can only hope I get some time tomorrow.
Partition was not modified. I leave that sort of thing alone
|
Since the partition wasn't changed that rules out one of my suspicions of why your not able to mount the file system. But that's a good thing.
-:-When you have the time to go through the BTRFS repair let me know how it goes.-:-
|
|
|
10-05-2014, 09:26 PM
|
#12
|
Member
Registered: Mar 2009
Distribution: CentOS 6.5 / 7
Posts: 119
Original Poster
Rep:
|
Quote:
Originally Posted by Ztcoracat
Since the partition wasn't changed that rules out one of my suspicions of why your not able to mount the file system. But that's a good thing.
-:-When you have the time to go through the BTRFS repair let me know how it goes.-:-
|
It is the filesystem. When I attempted "btrfs check --repair /dev/sda3" I got many "Checksum Verify Failed ######## wanted ####### got #######" and then "csum failed".
Leaving it alone to see if it does anything else.
I might just have to reinstall. And if that is the case I need to plan ahead better and figure out some redundancies. Never seen a FS just crap out like that and then can not be repaired. In fact, this is the first time I have heard of btrfs.
EDIT: yep, so it did eventually finish but it didn't "repair" anything. Just had a long list of blocks that did not match what it expected. Will try with other options like scrub from here: https://btrfs.wiki.kernel.org/index.php/Manpage/btrfs
Last edited by thealmightyos; 10-05-2014 at 10:01 PM.
|
|
|
10-06-2014, 12:12 AM
|
#13
|
Member
Registered: Mar 2009
Distribution: CentOS 6.5 / 7
Posts: 119
Original Poster
Rep:
|
Starting a new post because I am going in a new direction.
Data does not appear to be recoverable. All that was lost was some hard work I put into the configuration so it was not a major deal. In fact, this just might be a blessing disguise. I originally installed and configured this server just to see what the new CentOS7 could do kept using it after as I really did like the OS (sorry Kustom42, looks like that puts me in the systemd camp). What I should have done was install it with specific options and configure it with specific goals in mind. Specifically, I should have told my friend to use a developmental environment for his java experimenting. I am not pointing fingers but I am fairly certain he was messing with that tool of his when this whole thing started. Also, I wanted to tighten down security so I can only access the server via ssh and ftp with a cert and key.
What I am getting at is that this gives me the excuse to start over and plan my approach rather then going ad hoc and reacting to situations. This might be a single home server but I know I can do better.
If I can not figure out how to recover the filesystem in by this time tomorrow I will close this thread as "Solved: FS Corruption" and move on to reinstallation
Last edited by thealmightyos; 10-06-2014 at 12:28 AM.
|
|
|
10-06-2014, 12:23 AM
|
#14
|
LQ Guru
Registered: Dec 2011
Distribution: Slackware, Debian 12, Devuan & MX Linux
Posts: 9,528
|
Quote:
Originally Posted by thealmightyos
It is the filesystem. When I attempted "btrfs check --repair /dev/sda3" I got many "Checksum Verify Failed ######## wanted ####### got #######" and then "csum failed".
Leaving it alone to see if it does anything else.
I might just have to reinstall. And if that is the case I need to plan ahead better and figure out some redundancies. Never seen a FS just crap out like that and then can not be repaired. In fact, this is the first time I have heard of btrfs.
EDIT: yep, so it did eventually finish but it didn't "repair" anything. Just had a long list of blocks that did not match what it expected. Will try with other options like scrub from here: https://btrfs.wiki.kernel.org/index.php/Manpage/btrfs
|
If I had to guess some files where open when you weren't able to do anything remotely when you had GF shut the system down.
I lost 30 days worth of files once because I had to partake in a not so graceful shutdown.
The system locked up on me and there was nothing I could do. The freeze damaged my files and I couldn't reboot-
Unfortunately, I had to reinstall the OS.
Any chance you have a CentOS Live CD?
With it you can Rescue Installed System-
http://isoredirect.centos.org/centos/6/isos/x86_64/
http://www.standalone-sysadmin.com/b...S-bootmenu.png
I asked because I wasn't sure if the Live USB has that option or not.
Sorry your going through this, I've been there and I know it's not fun at all-
I downloaded the CentOS Bible 'PDF' if I find anything worthy of repairing your fs I'll post what I find.
Hope the instructions on the Manpage of btrfs 'scrub' works for you.
|
|
|
10-06-2014, 12:36 AM
|
#15
|
LQ Guru
Registered: Dec 2011
Distribution: Slackware, Debian 12, Devuan & MX Linux
Posts: 9,528
|
Quote:
Originally Posted by thealmightyos
Starting a new post because I am going in a new direction.
Data does not appear to be recoverable. All that was lost was some hard work I put into the configuration so it was not a major deal. In fact, this just might be a blessing disguise. I originally installed and configured this server just to see what the new CentOS7 could do kept using it after as I really did like the OS (sorry Ztcoracat, looks like that puts me in the systemd camp). What I should have done was install it with specific options and configure it with specific goals in mind. Specifically, I should have told my friend to use a developmental environment for his java experimenting. I am not pointing fingers but I am fairly certain he was messing with that tool of his when this whole thing started. Also, I wanted to tighten down security so I can only access the server via ssh and ftp with a cert and key.
What I am getting at is that this gives me the excuse to start over and plan my approach rather then going ad hoc and reacting to situations. This might be a single home server but I know I can do better.
If I can not figure out how to recover the filesystem in by this time tomorrow I will close this thread as "Solved: FS Corruption" and move on to reinstallation
|
I totally get it-
-::-Blessings in disguise can be a good teacher and in some cases a lesson well learned.-::-
Running a program in the improper environment most likely creates undesired effects.
Tightening Security is always a plus when your running a enterprise.
|
|
|
All times are GMT -5. The time now is 08:42 AM.
|
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.
|
Latest Threads
LQ News
|
|