LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Hardware (https://www.linuxquestions.org/questions/linux-hardware-18/)
-   -   Centos 6.3 crashes(Freezes) without slowing down first (https://www.linuxquestions.org/questions/linux-hardware-18/centos-6-3-crashes-freezes-without-slowing-down-first-4175444099/)

JBJ1962 01-04-2013 01:57 AM

Centos 6.3 crashes(Freezes) without slowing down first
 
Kernel version is 2.6.32-279.19.1.el6.x86_64

It crashes after 5 , 10 , 15 minutes I dont know sometimes after 2 hours, sometimes after 1 or 2 days working fine, I have no idea...
It doesn't slow down and then crash, it just stop responding to mouse and KB and that's how I know it is frozen again and then I force shutdown :( Often even if RAM usage is 1% ... no processes :D
Sometime it happens EVEN IF I DONT RUN OR DO ANYTHING, EVEN IF I JUST TURN IT ON AND LEAVE IT BE !!!! Sometimes it works for 2 3 days perfectly and then Again Faulty BAD DAY :((( Without any pattern !! or Rule , I can't understand!

Please share your ideas and experience

this is the last symptoms seen using Tail -f /var/log/messages , Error and Kern

[root@Workstation-B]# tail -f /var/log/error
Jan 4 13:39:41 Workstation-B mcelog: mcelog read: No such device
Jan 4 13:39:42 Workstation-B abrtd: Init complete, entering main loop
Jan 4 13:39:43 Workstation-B libvirtd: Could not find keytab file: /etc/libvirt/krb5.tab: No such file or directory]
Jan 4 13:56:57 Workstation-B automount[2450]: lookup_read_master: lookup(nisplus): couldn't locate nis+ table auto.master
Jan 4 13:56:57 Workstation-B mcelog: failed to prefill DIMM database from DMI data
Jan 4 13:56:57 Workstation-B mcelog: mcelog read: No such device
Jan 4 13:56:58 Workstation-B abrtd: Init complete, entering main loop
Jan 4 13:57:00 Workstation-B libvirtd: Could not find keytab file: /etc/libvirt/krb5.tab: No such file or directory
Jan 4 14:03:21 Workstation-B pulseaudio[3470]: pid.c: Daemon already running.
(CRASH!!!!)

[root@Workstation-B]# tail -f /var/log/messages
Jan 4 13:57:01 Workstation-B kernel: Ebtables v2.0 registered
Jan 4 13:57:03 Workstation-B kernel: lo: Disabled Privacy Extensions
Jan 4 13:57:08 Workstation-B polkitd[3052]: started daemon version 0.96 using a
Jan 4 13:57:08 Workstation-B rtkit-daemon[3064]: Sucessfully made thread 3062 o
Jan 4 13:57:08 Workstation-B rtkit-daemon[3064]: Sucessfully made thread 3067 o
Jan 4 13:57:08 Workstation-B rtkit-daemon[3064]: Sucessfully made thread 3069 o
Jan 4 13:57:08 Workstation-B rtkit-daemon[3064]: Sucessfully made thread 3070 o
Jan 4 13:57:09 Workstation-B kernel: hda-intel: IRQ timing workaround is activa
Jan 4 13:57:09 Workstation-B gdm-simple-greeter[3050]: Gtk-WARNING: gtkwidget.c
Jan 4 13:57:10 Workstation-B gdm-simple-greeter[3050]: WARNING: Unable to parse
Jan 4 14:03:20 Workstation-B kernel: fuse init (API version 7.13)
Jan 4 14:03:20 Workstation-B seahorse-daemon[3403]: DNS-SD initialization faile
Jan 4 14:03:20 Workstation-B seahorse-daemon[3403]: init gpgme version 1.1.8
Jan 4 14:03:20 Workstation-B rtkit-daemon[3064]: Sucessfully made thread 3426 o
Jan 4 14:03:20 Workstation-B pulseaudio[3426]: pid.c: Stale PID file, overwriti
Jan 4 14:03:20 Workstation-B rtkit-daemon[3064]: Sucessfully made thread 3431 o
Jan 4 14:03:20 Workstation-B rtkit-daemon[3064]: Sucessfully made thread 3453 o
Jan 4 14:03:20 Workstation-B rtkit-daemon[3064]: Sucessfully made thread 3455 o
Jan 4 14:03:21 Workstation-B rtkit-daemon[3064]: Sucessfully made thread 3470 o
Jan 4 14:03:21 Workstation-B pulseaudio[3470]: pid.c: Daemon already running.
Jan 4 14:03:36 Workstation-B pulseaudio[3426]: ratelimit.c: 1 events suppressed
Jan 4 14:09:48 Workstation-B kernel: radeon 0000:03:00.0: IH ring buffer overflow
(CRASH!!!)

Another Time Frame (Also Before Crash)
Jan 4 15:21:47 Workstation-B kernel: hda-intel: IRQ timing workaround is activated for card #0. Suggest a bigger bdl_pos_adj.
Jan 4 15:21:47 Workstation-B gdm-simple-greeter[3041]: Gtk-WARNING: gtkwidget.c:5460: widget not within a GtkWindow
Jan 4 15:21:48 Workstation-B gdm-simple-greeter[3041]: WARNING: Unable to parse history: (null) 54#012
Jan 4 15:27:19 Workstation-B gdm-simple-greeter[3041]: WARNING: Failed to send buffer
Jan 4 15:27:19 Workstation-B pam: gdm-password[3119]: WARNING: unable to log session
Jan 4 15:27:19 Workstation-B kernel: type=1400 audit(1357284439.625:4): avc: denied { read } for pid=3119 comm="gdm-session-wor" name="root" dev=dm-0 ino=131073 scontext=system_u:system_r:xdm_t:s0-s0:c0.c1023 tcontext=system_u:objec$
Jan 4 15:27:20 Workstation-B kernel: fuse init (API version 7.13)
Jan 4 15:27:20 Workstation-B seahorse-daemon[3215]: DNS-SD initialization failed: Daemon not running
Jan 4 15:27:20 Workstation-B seahorse-daemon[3215]: init gpgme version 1.1.8
Jan 4 15:27:21 Workstation-B pulseaudio[3277]: pid.c: Stale PID file, overwriting.
Jan 4 15:27:57 Workstation-B pulseaudio[3277]: ratelimit.c: 9 events suppressed

(CRASH !!!! this time here) this 2 bold lines happen a lot and usually they are the last things in the log before the crash.(not necessarily appear right before the crash)

top - 14:11:37 up 15 min, 5 users, load average: 3.48, 1.15, 0.46
Tasks: 375 total, 3 running, 370 sleeping, 0 stopped, 2 zombie
Cpu(s): 0.0%us, 9.1%sy, 0.0%ni, 90.9%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st
Mem: 99062384k total, 1718336k used, 97344048k free, 78124k buffers
Swap: 101302264k total, 0k used, 101302264k free, 227964k cached

PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
3277 jane 20 0 15284 1468 944 R 0.3 0.0 0:01.34 top -c
1 root 20 0 19360 1556 1248 S 0.0 0.0 0:02.42 /sbin/init
2 root 20 0 0 0 0 S 0.0 0.0 0:00.00 [kthreadd]


Jan 4 14:28:35 Workstation-B kernel: lo: Disabled Privacy Extensions
Jan 4 14:28:39 Workstation-B polkitd[3046]: started daemon version 0.96 using authority implementation `local' version `0.96'
Jan 4 14:28:40 Workstation-B rtkit-daemon[3058]: Sucessfully made thread 3056 of process 3056 (/usr/bin/pulseaudio) owned by '42' high priority at nice level -11.
Jan 4 14:28:40 Workstation-B rtkit-daemon[3058]: Sucessfully made thread 3061 of process 3056 (/usr/bin/pulseaudio) owned by '42' RT at priority 5.
Jan 4 14:28:40 Workstation-B rtkit-daemon[3058]: Sucessfully made thread 3063 of process 3056 (/usr/bin/pulseaudio) owned by '42' RT at priority 5.
Jan 4 14:28:40 Workstation-B rtkit-daemon[3058]: Sucessfully made thread 3064 of process 3056 (/usr/bin/pulseaudio) owned by '42' RT at priority 5.
Jan 4 14:28:41 Workstation-B gdm-simple-greeter[3044]: Gtk-WARNING: gtkwidget.c:5460: widget not within a GtkWindow
Jan 4 14:28:41 Workstation-B kernel: hda-intel: IRQ timing workaround is activated for card #0. Suggest a bigger bdl_pos_adj.
Jan 4 14:28:41 Workstation-B gdm-simple-greeter[3044]: WARNING: Unable to parse history: (null) 54#012


df -h output just in case
[root@Workstation-B log]# df -h
Filesystem Size Used Avail Use% Mounted on
/dev/mapper/vg_workstationb-lv_root
50G 30G 18G 64% /
tmpfs 48G 264K 48G 1% /dev/shm
/dev/sdb1 485M 146M 314M 32% /boot
/dev/mapper/vg_workstationb-lv_home
773G 76G 658G 11% /home

macemoneta 01-04-2013 02:47 AM

This type of problem is typically a hardware issue. Have you added / changed any RAM lately? Are all your fans running with temperatures at reasonable levels (never higher than 70C)?

JBJ1962 01-04-2013 02:57 AM

Quote:

Originally Posted by macemoneta (Post 4862663)
This type of problem is typically a hardware issue. Have you added / changed any RAM lately? Are all your fans running with temperatures at reasonable levels (never higher than 70C)?

No I havent changed RAM lately , About Temperature I don't know its a DELL PRECISION T7500 Workstation and everything physical about it appears to me as OK! but yet again this unpredictable HANGS and Crashes happen, Right now that Im typing this the Workstation is fine and is working its almost 4 hours since the last crash, But IM SURE it might even work for another 1 day or 2 3 days Or it might crash right now, Its not predictable, Thats why Im FRUSTRATED!!! I want it fixed, I dont want to simply reinstall the OS !!!


Dear macemoneta
Thanks a lot 4 Ur quick reply :D Can the problem be by any chance because of the sound card? snd module !!! pulseaudio thing happens a lot in the LOGS !!!

I really Appreciate it if you helped me on this problem, Its bugging me so much :(

Cheers

macemoneta 01-04-2013 11:20 AM

The pulseaudio messages are not fatal. The radeon ring buffer overflow is generally not fatal. I would suggest trying a newer kernel. If a newer kernel isn't available from CentOS, pull one from Fedora - 3.6.11 is the current stable kernel there.

JBJ1962 01-05-2013 09:33 PM

Quote:

Originally Posted by macemoneta (Post 4862982)
The pulseaudio messages are not fatal. The radeon ring buffer overflow is generally not fatal. I would suggest trying a newer kernel. If a newer kernel isn't available from CentOS, pull one from Fedora - 3.6.11 is the current stable kernel there.

If I RUN YUM UPDATE on the workstation wouldn't it update me kernel as well?

I ran "YUM UPDATE" 2 weeks ago as The crashes start to bug me, After that I guess the error regarding libvirtd: Could not find keytab file: /etc/libvirt/krb5.tab: No such file or directory] made its first appearance in my LOGS I guess. The workstation sometimes Works Fine like it fixes itself, but sometimes No matter what It crashes after even 8 or 10 minutes after startup or sometimes takes a little longer to crash ( 1 2 hr) no certain pattern or explanation.
I will try dig into this kernel thing but I really appreciate it if U can let me know How to make sure if I can use Centos kernel or how can I bring it from fedora if centos os N/A.

Tnx A million Macemoneta I hope Ur tips finally solve this thing out :-bd


All times are GMT -5. The time now is 09:07 PM.