kernel: journal_get_undo_access: No memory for committed data
Red HatThis forum is for the discussion of Red Hat Linux.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
A full dmesg with all the boot info is at www unseen org/~kevin/tmp/dmesg
The ext3_orphan_cleanup in the above dmesg is due to the last time this
error happened.
A few log lines before the issue:
application vmware-vmx uses obsolete OSS audio interface
/dev/vmnet: open called by PID 20000 (vmware-vmx)
device eth0 entered promiscuous mode
bridge-eth0: enabled promiscuous mode
/dev/vmnet: port on hub 0 successfully opened
/dev/vmmon[20008]: host clock rate change request 0 -> 19
/dev/vmmon[20008]: host clock rate change request 19 -> 83
vortex: IRQ fifo error
I'm pretty sure the "vortex: IRQ fifo error" isnt an issue, that occours a
few random times in the log.
The Vmware thing may be related, the virtual disk is on hdb1 (see error
below), and it does vaguly seem to be related to me starting vmware but
not consistantly enough to say at the mo.
Mem-info:
DMA per-cpu:
cpu 0 hot: low 2, high 6, batch 1
cpu 0 cold: low 0, high 2, batch 1
Normal per-cpu:
cpu 0 hot: low 32, high 96, batch 16
cpu 0 cold: low 0, high 32, batch 16
HighMem per-cpu: empty
HighMem: empty
Swap cache: add 17737, delete 16749, find 656/979, race 0+0
Free swap: 2037900kB
196608 pages of RAM
0 pages of HIGHMEM
2939 reserved pages
230767 pages shared
988 pages swap cached
journal_get_undo_access: No memory for committed data
ext3_try_to_allocate_with_rsv: aborting transaction: Out of memory in
__ext3_journal_get_undo_a ccess
EXT3-fs error (device hdb1) in ext3_new_block: Out of memory
Aborting journal on device hdb1.
ext3_abort called.
EXT3-fs error (device hdb1): ext3_journal_start_sb: Detected aborted
journal
Remounting filesystem read-only
EXT3-fs error (device hdb1) in ext3_ordered_writepage: Out of memory
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
EXT3-fs error (device hdb1) in start_transaction: Journal has aborted
[snip]
I'm getting the exact same error. I'm running VmWare 5.0 on Redhat AS4 on an EXT3 filesystem. The system seems to run fine if I'm only running one guest OS. Once I get two or three gues OS's going, it doesn't seem to last more than 24 hours before I get a stack trace exactly like the one you posted. The very first indication in the logfile that this is happening is:
Jul 28 04:41:07 avalanche kernel: journal_get_undo_access: No memory for committed data
Jul 28 04:41:07 avalanche kernel: ext3_try_to_allocate_with_rsv: aborting transaction: Out of memory in __ext3_journal_get_undo_access
Jul 28 04:41:07 avalanche kernel: EXT3-fs error (device hda7) in ext3_new_block: Out of memory
Jul 28 04:41:07 avalanche kernel: Aborting journal on device hda7.
Jul 28 04:41:07 avalanche kernel: ext3_abort called.
Jul 28 04:41:07 avalanche kernel: EXT3-fs error (device hda7): ext3_journal_start_sb: Detected aborted journal
Jul 28 04:41:07 avalanche kernel: Remounting filesystem read-only
The machine has 1G RAM and 2G swap, both of which are plenty free when this happens (over 600G of used memory is file cache, and swap is only using 8M or so). Note that once it flips the filesystem into read-only above, the only fix is to reboot (attempts to remount rw fail with disk write-protected error).
I've run up to 5 simultaneous VM's on a similarly configured FC3 machine and never encountered this issue.
-----
1) Recommended. Edit /etc/sysctl.conf and set "vm.min_free_kbytes = <nnnn>"
2) "echo <nnnn> > /proc/sys/vm/min_free_kbytes" in /etc/rc.local or some other place in the boot process.
-----
<nnnn> should be 5120 or 10240 (or higher).
I have not had time to verify this yet, but it seemed to work for the problems they were having.
Brilliant, thanks for pointing that out, I made the change above and I have now had a win2k3 instance running in vmware and it has been stable for a week compared to less than 48 hours previously.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.
Advertisement
Oracle Magazine contains technology strategy articles, sample code, tips, Oracle and partner news, how to articles for developers and DBAs, and more. Click Here to receive a complimentary subscription courtesy of LQ.