LinuxQuestions.org


mosmuller 12-02-2011 09:18 AM

mail from cron with error at usr/lib/sa/sa2 -A (Opensuse 10.1)
 
A few days ago the following lines appeared in /var/log/warn:

Nov 24 22:04:14 cluster2 kernel: <0>general protection fault: 0000 [#2849]
Nov 24 22:04:14 cluster2 kernel: Modules linked in: l2cap bluetooth nls_cp850 nls_utf8 smbfs st sr_mod joydev nfs lockd nfs_acl sunrpc af_packet ipt_MASQUERADE xt_pkttype ipt_TCPMSS ipt_LOG xt_limit edd snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device button battery ac ip6t_REJECT xt_tcpudp ipt_REJECT xt_state iptable_mangle iptable_nat ip_nat iptable_filter ip6table_mangle ip_conntrack nfnetlink ip_tables ip6table_filter ip6_tables x_tables ipv6 apparmor aamatch_pcre loop dm_mod i2c_i801 shpchp intel_agp parport_pc i2c_core pci_hotplug ide_cd cdrom agpgart parport ehci_hcd i8xx_tco uhci_hcd snd_intel8x0 snd_ac97_codec snd_ac97_bus usbcore ohci1394 ieee1394 snd_pcm snd_timer sk98lin snd e1000 soundcore snd_page_alloc ext3 jbd raid1 sg fan thermal processor ata_piix libata piix sd_mod scsi_mod ide_disk ide_core
Nov 24 22:04:14 cluster2 kernel: EIP: 0060:[<08048684>] Tainted: G U VLI
Nov 24 22:04:14 cluster2 kernel: EFLAGS: 00010206 (2.6.16.13-4-smp #1)
Nov 24 22:04:15 cluster2 kernel: <0>general protection fault: 0000 [#2850]
Nov 24 22:04:15 cluster2 kernel: Modules linked in: l2cap bluetooth nls_cp850 nls_utf8 smbfs st sr_mod joydev nfs lockd nfs_acl sunrpc af_packet ipt_MASQUERADE xt_pkttype ipt_TCPMSS ipt_LOG xt_limit edd snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device button battery ac ip6t_REJECT xt_tcpudp ipt_REJECT xt_state iptable_mangle iptable_nat ip_nat iptable_filter ip6table_mangle ip_conntrack nfnetlink ip_tables ip6table_filter ip6_tables x_tables ipv6 apparmor aamatch_pcre loop dm_mod i2c_i801 shpchp intel_agp parport_pc i2c_core pci_hotplug ide_cd cdrom agpgart parport ehci_hcd i8xx_tco uhci_hcd snd_intel8x0 snd_ac97_codec snd_ac97_bus usbcore ohci1394 ieee1394 snd_pcm snd_timer sk98lin snd e1000 soundcore snd_page_alloc ext3 jbd raid1 sg fan thermal processor ata_piix libata piix sd_mod scsi_mod ide_disk ide_core
Nov 24 22:04:15 cluster2 kernel: EIP: 0060:[<08048684>] Tainted: G U VLI
Nov 24 22:04:15 cluster2 kernel: EFLAGS: 00010206 (2.6.16.13-4-smp #1)

Since then I have been receiving mail alerts from cron like this one:

Subject: usr/lib/sa/sa2 -A

Body:
find: /var/log/sa: No such file or directory
find: /var/log/sa: No such file or directory

However, I can verify that /var/log/sa does exist. No new sar.YYYY_MM_DD files have been created since then, only sa.YYYY_MM_DD files.

What could have happened?

I rebooted the server and there are no more errors like those in /var/log/warn, but I keep receiving the mail alerts from cron and the problem with sa2 persists.

Thanks for any answer that could help me to resolve this issue.

Mario.

MensaWater 12-02-2011 09:46 AM

First do "df -h /var/log/sa" to see what filesystem it is on (likely root [/] or /var).

Second run "ls -l /var/log/sa" and see what you get.

Third run "find /var/log/sa" and see what you get.

It could be that the filesystem is mounted read-only (which should not stop ls or find from working but would stop new files from being created in it). More likely the filesystem is corrupted and you need to run an fsck against it. This should be done with the filesystem unmounted: if it is / or /var you'll likely need to boot to single-user mode to do the fsck. If, however, /var/log/sa is a separate filesystem you should be able to unmount it and do the fsck.

WARNING: If this is / or /var you want to make sure you have a good backup before doing the fsck. If the corruption is caused by a disk failure the disk could be made worse by the fsck.
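The three checks above, plus the read-only question, can be collected into one small script. This is only a sketch: `mount_mode` and `check_dir` are hypothetical helper names, and it assumes a Linux system where /proc/mounts is readable and the mount point contains no spaces.

```shell
#!/bin/sh
# Hypothetical helper: print "ro", "rw", or "unknown" for the filesystem
# that holds the given path.
mount_mode() {
    target=$1
    [ -r /proc/mounts ] || { echo unknown; return 1; }
    # df -P uses the portable layout; the mount point is the last field of line 2.
    mnt=$(df -P "$target" 2>/dev/null | awk 'NR==2 {print $NF}')
    [ -n "$mnt" ] || { echo unknown; return 1; }
    # /proc/mounts fields: device mountpoint fstype options ...
    awk -v m="$mnt" '
        $2 == m {
            n = split($4, o, ",")
            for (i = 1; i <= n; i++)
                if (o[i] == "ro" || o[i] == "rw") mode = o[i]
        }
        END { print (mode != "" ? mode : "unknown") }
    ' /proc/mounts
}

# Hypothetical helper: run the df / ls / find checks on one directory.
check_dir() {
    dir=$1
    echo "== df -h $dir";  df -h "$dir"
    echo "== ls -ld $dir"; ls -ld "$dir"
    echo "== find $dir (first entries)"; find "$dir" -maxdepth 1 | head -n 5
    echo "== mount mode: $(mount_mode "$dir")"
}
```

Running `check_dir /var/log/sa` would then show in one place whether ls and find disagree and whether the filesystem is mounted ro or rw.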

mosmuller 12-02-2011 12:23 PM

Thanks MensaWater, here is what I get at each step you suggested:

Step 1: It is on [/]
Step 2: I can see the list of all files in /var/log/sa
Step 3: I get "no such file or directory" with the find command

The last backup I have is from Nov 23rd, 2011, because cron stopped working after the issue I mentioned and no daily backups could be made.

Should I run the fsck you suggested?

Regards.

MensaWater 12-02-2011 02:42 PM

Odd that find is failing but ls -l is working.

Do "which find" to find out what directory your find command is in.

Do "file $(which find)" to find out if the command is actually a binary. It is possible that your find command is corrupted or is actually a wrapper script that is having issues.

Try doing "find /tmp" and see if it works there.
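The checks on the find command itself could be scripted roughly as follows. This is a sketch, not a definitive procedure: `inspect_cmd` is a hypothetical helper name, and the `file` utility is only called if it is installed.

```shell
#!/bin/sh
# Hypothetical helper: show what a command name resolves to and what kind
# of file it is.
inspect_cmd() {
    name=$1
    # command -v is the POSIX equivalent of "which"
    path=$(command -v "$name") || { echo "$name: not found in PATH"; return 1; }
    echo "path: $path"
    ls -l "$path"
    # file(1) tells an ELF binary apart from a wrapper script
    if command -v file >/dev/null 2>&1; then
        file "$path"
    fi
}
```

For this thread the call would be `inspect_cmd find`. On an RPM-based system such as openSUSE, and assuming find was installed from the findutils package, `rpm -V findutils` can additionally compare the installed files against the package database.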

mosmuller 12-03-2011 10:02 AM

OK, here is what I get:

- with "which find":

file $(which find) to fi
/usr/bin/find: ELF 32-bit LSB executable, Intel 80386, version 1 (SYSV), for GNU/Linux 2.6.4, dynamically linked (uses shared libs), for GNU/Linux 2.6.4, stripped
to: cannot open `to' (No such file or directory)
fi: cannot open `fi' (No such file or directory)

- with "find /tmp":

find /tmp
/tmp
find: /tmp: No such file or directory

mosmuller 12-13-2011 08:11 AM

How should I proceed with this issue?

I have a mirror. I guess it would be better to test the mirror disk first, to verify that it is good, rather than running the fsck.

What is the best way to start with the mirror disk?

Thank you and best regards
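As a starting point for the mirror question, here is a minimal sketch. It assumes the mirror is a Linux software RAID (md) array, which the raid1 module in the kernel log suggests but does not prove. `degraded_arrays` is a hypothetical helper name. It reads /proc/mdstat (or a saved copy passed as an argument) and prints every array whose member map contains "_", which marks a missing member.

```shell
#!/bin/sh
# Hypothetical helper: list md arrays with a missing member.
# A healthy two-disk mirror shows [UU]; a degraded one shows [U_] or [_U].
degraded_arrays() {
    awk '
        /^md[0-9]+ *:/ { name = $1 }          # remember the current array name
        /\[[U_]+\]/ {                          # status line ends with the member map
            map = $NF
            if (name != "" && index(map, "_") > 0) print name, map
        }
    ' "${1:-/proc/mdstat}"
}
```

If `degraded_arrays` prints nothing, both halves of every md mirror are reported as active. That says nothing about filesystem consistency, so it does not replace the fsck discussed above.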

mosmuller 12-20-2011 11:55 AM

find command does not run anymore
 
Following up on my problem, I found out that the find command was no longer in /usr/bin. I restored it from backup, but it does not run properly. How can I get it running again?
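A restored binary can fail for simple reasons such as a lost executable bit or a shared library that cannot be resolved. The sketch below checks for both. `check_executable` is a hypothetical helper name, and the ldd step is skipped when ldd is not installed.

```shell
#!/bin/sh
# Hypothetical helper: basic sanity checks on a restored binary.
check_executable() {
    bin=$1
    [ -e "$bin" ] || { echo "$bin: missing"; return 1; }
    [ -x "$bin" ] || { echo "$bin: not executable"; ls -l "$bin"; return 1; }
    if command -v ldd >/dev/null 2>&1; then
        # ldd prints "not found" next to any library it cannot resolve
        missing=$(ldd "$bin" 2>/dev/null | grep 'not found' || true)
        if [ -n "$missing" ]; then
            echo "$bin: unresolved libraries:"
            echo "$missing"
            return 1
        fi
    fi
    echo "$bin: looks runnable"
}
```

Here the call would be `check_executable /usr/bin/find`. If the permissions were lost in the restore, `chmod 755 /usr/bin/find` is the usual fix. Reinstalling the package that owns the file is another option, assuming that package is findutils.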


All times are GMT -5.