LinuxQuestions.org
Register a domain and help support LQ
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software > Linux - Kernel
User Name
Password
Linux - Kernel This forum is for all discussion relating to the Linux kernel.

Notices

Reply
 
Search this Thread
Old 12-09-2012, 06:35 PM   #1
Skaperen
Senior Member
 
Registered: May 2009
Location: WV, USA
Distribution: Slackware, CentOS, Ubuntu, Fedora, Timesys, Linux From Scratch
Posts: 1,777
Blog Entries: 20

Rep: Reputation: 115Reputation: 115
Errors in ext4 on 2.6.32-279.14.1.el6 in CentOS 6.3


I am periodically (about 20% of the time) running into a problem when copying a large number of files (over 87000) from one disk to another. This is an older kernel (2.6.32-279.14.1.el6) in the latest CentOS running under Xen domU on AWS. The source and target filesystems are both ext4. The error message I get from rsync looks like this:
Code:
rsync: stat "/mnt/xvdj/tmp/.CharName.pm.OV0XJM" failed: No such file or directory (2)
rsync: rename "/mnt/xvdj/tmp/.CharName.pm.OV0XJM" -> "root/.cpan/build/Unicode-String-2.09-YxjsfF/lib/Unicode/CharName.pm": No such file or directory (2)
It happens in both 32-bit and 64-bit at almost the same frequency, but 64-bit seems to be slightly more frequent. It happens with different files, but I have seen it happen on the same file a couple times (two each for two files ... and the latest happened in 32-bit once and 64-bit ones).

The breakdown of what I think is happening is this: The rsync program receives a new file from the source (on disk, not network). The --temp-dir= option specifies a "tmp" directory in the target filesystem (allows rsync to rename() the file to its ultimate place). It writes the file in that temporary directory and closes it. Then it stats it and finds it is not there. Then it tries to rename it (not sure why after stat files) to move it to the ultiplate place, and that files. It then proceeds to the next file. Never more than one file is affected. And the AWOL file is not actually present in the "tmp" directory at the end (plausibly a bug in rsync didn't create it, but I'd expect that to be more systemic and fail the same way each time).

When I run rsync under strace it never has this problem. But I've only done that half a dozen times since is slows things down quite a bit (probably why it succeeds). I have mounted the filesystem with "-o sync" and that does not stop the problem ... it happened both times I used "-o sync".

But here is my question:

I had read a few years back that ext4 was not yet "prime time ready" for general use. I believe this was around the time of the 2.6.32 kernel. I had read that the issue was because of ordering issues in ext4 that could cause files to be lost or invisible for a while (in the tiny time fragments of system calls).

Is this what those earlier readings might have been talking about?

Should I switch back to ext3 or ext2 (changing the kernel isn't a good option for this).
 
Old 12-11-2012, 08:47 AM   #2
MensaWater
Guru
 
Registered: May 2005
Location: Atlanta Georgia USA
Distribution: Redhat (RHEL), CentOS, Fedora, Debian, FreeBSD, HP-UX, Solaris, SCO
Posts: 5,937
Blog Entries: 5

Rep: Reputation: 752Reputation: 752Reputation: 752Reputation: 752Reputation: 752Reputation: 752Reputation: 752
CentOS6 is compiled from RHEL6 source. RHEL6 supports ext4 (RHEL5 only had it as a preview technology) so CentOS6 should as well.

You might want to check for your issue at bugs.centos6.org and if not found post it there.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Issues Creating 16 TB Storage on CENTOS 6.2 with Ext4 rog_ski Linux - Virtualization and Cloud 3 01-19-2012 10:32 AM
Disk Quotas not giving warning in ext4 filesystem of CentOS 6.0 32 Bit vikas027 Linux - Server 2 01-01-2012 01:05 AM
EL 5.x v.s. EL6.x | BIND Root Hint Referral Reponse List - DifferenceEl 5.x v.s. EL6. harmandeep Linux - Server 0 10-06-2011 06:20 AM
fsck shows thousands of group descriptor errors on ext4 volume. abejarano Linux - General 0 04-21-2011 01:42 AM
ECMA-279: is this standard obsolete? stf92 Linux - Hardware 2 11-07-2010 04:40 AM


All times are GMT -5. The time now is 08:36 PM.

Main Menu
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration