LinuxQuestions.org

hbbtstar 02-04-2007 10:30 PM

Linux server running Ubuntu 6.06 freezing after a week or so
 
This is the oddest problem I've ever come across, and I thought that perhaps the people here might know what to do. I recently set up a file server running Linux, and it works perfectly except for one odd flaw: it gradually slows down until, after about a week, it takes forever to do even simple things like transferring files to and from the server. It can't even reboot; I have to power it off and on manually. After that, however, it works fine for another week, give or take. It's a pretty powerful machine, too: a 2.2 GHz P4 with 1.5 GB of RAM, which should be more than enough to handle simple file transfers and 2-3 users at any given time. I thought it might be a hardware problem, but hardware diagnostics came back clean, and another machine with the same specs runs Linux perfectly. The problem is that I have no idea where to begin diagnosing this. What logs should I look at? (I'm going to look at the syslogs tomorrow, but I don't know what to do past that.) Are there monitoring programs that can help me (i.e., can I have top write out a log every now and then to see what sort of load the machine is under)? Any help would be greatly appreciated. Thanks in advance!

Micro420 02-04-2007 10:57 PM

Look in these logs:

/var/log/auth.log
/var/log/syslog
/var/log/messages

netstat -an (to check for H4x0r$)
top
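
For the "log every now and then" part of the question, here is a rough sketch in Python of a sampler that appends one line of load and memory figures each time it runs. It reads the standard /proc/loadavg and /proc/meminfo files; the script name, the log path and the choice of fields (MemFree, SwapFree) are assumptions made for illustration, not anything taken from the server in question.

```python
#!/usr/bin/env python3
"""Append one line of load and memory figures to a log file (meant for cron)."""
import sys
import time


def parse_loadavg(text):
    """Return the 1, 5 and 15 minute load averages from /proc/loadavg content."""
    one, five, fifteen = text.split()[:3]
    return float(one), float(five), float(fifteen)


def parse_meminfo(text):
    """Return /proc/meminfo fields as a dict of integers (kB for most fields)."""
    fields = {}
    for line in text.splitlines():
        name, _, rest = line.partition(":")
        parts = rest.split()
        if parts and parts[0].isdigit():
            fields[name.strip()] = int(parts[0])
    return fields


def format_sample(timestamp, load, mem):
    """Build one log line; fields missing from meminfo are reported as 0."""
    return "%s load=%.2f/%.2f/%.2f memfree_kb=%d swapfree_kb=%d" % (
        timestamp, load[0], load[1], load[2],
        mem.get("MemFree", 0), mem.get("SwapFree", 0))


def main(log_path):
    """Take one sample from /proc and append it to log_path."""
    with open("/proc/loadavg") as f:
        load = parse_loadavg(f.read())
    with open("/proc/meminfo") as f:
        mem = parse_meminfo(f.read())
    stamp = time.strftime("%Y-%m-%d %H:%M:%S")
    with open(log_path, "a") as log:
        log.write(format_sample(stamp, load, mem) + "\n")


if __name__ == "__main__":
    # The log file path is the only argument (hypothetical usage).
    if len(sys.argv) == 2:
        main(sys.argv[1])
    else:
        print("usage: slowdown_watch.py LOGFILE", file=sys.stderr)
```

Run from cron every ten minutes or so (for example with a hypothetical entry such as `*/10 * * * * python3 /usr/local/bin/slowdown_watch.py /var/log/slowdown-watch.log`), this would leave a week-long record to compare against the point where the machine starts to crawl. Steadily shrinking free memory and swap would point toward a leak rather than hardware.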

What services do you have on this computer besides just file sharing? What are you using to file share? Samba? NFS?

You are right that something is wrong with your machine. I run a Samba file server with printing and SSH on a Pentium 3 with 256 MB of RAM and more than 20 users. It's rock stable!

hbbtstar 02-05-2007 12:36 AM

I can check what services are running besides Samba; off the top of my head, I know I'm using winbind. I'll try all those things when I get in tomorrow, as well as get you the output of the log files. Hopefully I'll be able to figure out what's going wrong with it.

hbbtstar 02-05-2007 12:36 PM

Well, I got to the logs and I don't see anything out of the ordinary anywhere. The system does try to use postfix/sendmail, which is odd since I don't have them installed on the machine, but other than that I don't see anything weird in the logs. I've reposted them here for anyone to peruse.

syslog:
Code:

02/05/2007 10:38:08 AM        localhost        anacron[4698]        Job `cron.daily' terminated (mailing output)
02/05/2007 10:38:08 AM        localhost        postfix/sendmail[22675]        fatal: open /etc/postfix/main.cf: No such file or directory
02/05/2007 10:38:08 AM        localhost        syslogd 1.4.1#17ubuntu7        restart.
02/05/2007 10:38:09 AM        localhost        anacron[4698]        Normal exit (1 job run)
02/05/2007 10:38:09 AM        localhost        anacron[4698]        Tried to mail output of job `cron.daily', but mailer process (/usr/sbin/sendmail) exited with ststus 1
02/05/2007 10:45:03 AM        localhost        /USR/SBIN/CRON[22679]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 10:46:14 AM        localhost        gdm[4482]        Couldn't authenticate user
02/05/2007 10:46:32 AM        localhost        gconfd (sourcefileserv-22743)        starting (version 2.14.0), pid 22743 user 'sourcefileserv'
02/05/2007 10:46:33 AM        localhost        gconfd (sourcefileserv-22743)        Resolved address "xml:readonly:/etc/gconf/gconf.xml.defaults" to a read-only configuration source at position 2
02/05/2007 10:46:33 AM        localhost        gconfd (sourcefileserv-22743)        Resolved address "xml:readonly:/etc/gconf/gconf.xml.mandatory" to a read-only configuration source at position 0
02/05/2007 10:46:33 AM        localhost        gconfd (sourcefileserv-22743)        Resolved address "xml:readonly:/var/lib/gconf/debian.defaults" to a read-only configuration source at position 3
02/05/2007 10:46:33 AM        localhost        gconfd (sourcefileserv-22743)        Resolved address "xml:readonly:/var/lib/gconf/defaults" to a read-only configuration source at position 4
02/05/2007 10:46:33 AM        localhost        gconfd (sourcefileserv-22743)        Resolved address "xml:readwrite:/home/sourcefileserv/.gconf" to a writable configuration source at position 1
02/05/2007 10:46:42 AM        localhost        gconfd (sourcefileserv-22743)        Resolved address "xml:readwrite:/home/sourcefileserv/.gconf" to a writable configuration source at position 0
02/05/2007 10:50:06 AM        localhost        gconfd (root-22912)        Resolved address "xml:readonly:/etc/gconf/gconf.xml.defaults" to a read-only configuration source at position 2
02/05/2007 10:50:06 AM        localhost        gconfd (root-22912)        Resolved address "xml:readonly:/etc/gconf/gconf.xml.mandatory" to a read-only configuration source at position 0
02/05/2007 10:50:06 AM        localhost        gconfd (root-22912)        Resolved address "xml:readonly:/var/lib/gconf/debian.defaults" to a read-only configuration source at position 3
02/05/2007 10:50:06 AM        localhost        gconfd (root-22912)        Resolved address "xml:readonly:/var/lib/gconf/defaults" to a read-only configuration source at position 4
02/05/2007 10:50:06 AM        localhost        gconfd (root-22912)        Resolved address "xml:readwrite:/root/.gconf" to a writable configuration source at position 1
02/05/2007 10:50:06 AM        localhost        gconfd (root-22912)        starting (version 2.14.0), pid 22912 user 'root'
02/05/2007 10:50:36 AM        localhost        gconfd (root-22912)        Exiting
02/05/2007 10:50:36 AM        localhost        gconfd (root-22912)        GConf server is not in use, shutting down.
02/05/2007 10:53:07 AM        localhost        crontab[23008]        (sourcefileserv) LIST (sourcefileserv)
02/05/2007 10:55:01 AM        localhost        /USR/SBIN/CRON[23031]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:05:02 AM        localhost        /USR/SBIN/CRON[23038]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:15:01 AM        localhost        /USR/SBIN/CRON[23042]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:17:01 AM        localhost        /USR/SBIN/CRON[23046]        (root) CMD (  run-parts --report /etc/cron.hourly)
02/05/2007 11:25:01 AM        localhost        /USR/SBIN/CRON[23067]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:35:02 AM        localhost        /USR/SBIN/CRON[23070]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:45:01 AM        localhost        /USR/SBIN/CRON[23072]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:55:01 AM        localhost        /USR/SBIN/CRON[23074]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:05:02 PM        localhost        /USR/SBIN/CRON[23076]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:15:01 PM        localhost        /USR/SBIN/CRON[23079]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:17:01 PM        localhost        /USR/SBIN/CRON[23081]        (root) CMD (  run-parts --report /etc/cron.hourly)
02/05/2007 12:25:01 PM        localhost        /USR/SBIN/CRON[23085]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:35:01 PM        localhost        /USR/SBIN/CRON[23087]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:45:01 PM        localhost        /USR/SBIN/CRON[23089]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:55:01 PM        localhost        /USR/SBIN/CRON[23091]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 01:05:01 PM        localhost        /USR/SBIN/CRON[23093]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 01:15:01 PM        localhost        /USR/SBIN/CRON[23095]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 01:17:01 PM        localhost        /USR/SBIN/CRON[23098]        (root) CMD (  run-parts --report /etc/cron.hourly)
02/05/2007 01:22:54 PM        localhost        crontab[23205]        (sourcefileserv) LIST (sourcefileserv)
02/05/2007 01:23:41 PM        localhost        crontab[23212]        (sourcefileserv) REPLACE (sourcefileserv)
02/05/2007 01:23:49 PM        localhost        crontab[23216]        (sourcefileserv) LIST (sourcefileserv)
02/05/2007 01:25:01 PM        localhost        /USR/SBIN/CRON[23243]        (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })

auth.log (only a snippet; this sequence repeats more or less unchanged throughout the whole log, with no discernible deviations):
Code:

02/01/2007 04:06:12 PM        localhost        smbd[5139]        (pam_unix) session opened for user sourcefileserv by (uid=0)
02/01/2007 04:15:02 PM        localhost        CRON[5195]        (pam_unix) session closed for user root
02/01/2007 04:15:02 PM        localhost        CRON[5195]        (pam_unix) session opened for user root by (uid=0)
02/01/2007 04:17:01 PM        localhost        CRON[5197]        (pam_unix) session opened for user root by (uid=0)
02/01/2007 04:17:02 PM        localhost        CRON[5197]        (pam_unix) session closed for user root
02/01/2007 04:25:01 PM        localhost        CRON[5199]        (pam_unix) session closed for user root
02/01/2007 04:25:01 PM        localhost        CRON[5199]        (pam_unix) session opened for user root by (uid=0)
02/01/2007 04:35:01 PM        localhost        CRON[5211]        (pam_unix) session opened for user root by (uid=0)
02/01/2007 04:35:02 PM        localhost        CRON[5211]        (pam_unix) session closed for user root
02/01/2007 04:45:01 PM        localhost        CRON[5213]        (pam_unix) session closed for user root
02/01/2007 04:45:01 PM        localhost        CRON[5213]        (pam_unix) session opened for user root by (uid=0)
02/01/2007 04:55:01 PM        localhost        CRON[5218]        (pam_unix) session closed for user root
02/01/2007 04:55:01 PM        localhost        CRON[5218]        (pam_unix) session opened for user root by (uid=0)
02/01/2007 05:05:01 PM        localhost        CRON[5225]        (pam_unix) session closed for user root
02/01/2007 05:05:01 PM        localhost        CRON[5225]        (pam_unix) session opened for user root by (uid=0)
02/01/2007 05:15:01 PM        localhost        CRON[5232]        (pam_unix) session closed for user root
02/01/2007 05:15:01 PM        localhost        CRON[5232]        (pam_unix) session opened for user root by (uid=0)
02/01/2007 05:17:01 PM        localhost        CRON[5241]        (pam_unix) session opened for user root by (uid=0)
02/01/2007 05:17:02 PM        localhost        CRON[5241]        (pam_unix) session closed for user root
02/01/2007 05:17:25 PM        localhost        smbd[5247]        (pam_unix) session opened for user sourcefileserv by (uid=0)
02/01/2007 05:25:01 PM        localhost        CRON[5248]        (pam_unix) session closed for user root
02/01/2007 05:25:01 PM        localhost        CRON[5248]        (pam_unix) session opened for user root by (uid=0)
02/01/2007 05:25:46 PM        localhost        smbd[5253]        (pam_unix) session opened for user sourcefileserv by (uid=0)
02/01/2007 05:29:36 PM        localhost        smbd[5261]        (pam_unix) session opened for user sourcefileserv by (uid=0)
02/01/2007 05:30:29 PM        localhost        smbd[5261]        (pam_unix) session closed for user sourcefileserv
02/01/2007 05:35:01 PM        localhost        CRON[5262]        (pam_unix) session closed for user root
02/01/2007 05:35:01 PM        localhost        CRON[5262]        (pam_unix) session opened for user root by (uid=0)


exvor 02-05-2007 01:01 PM

Maybe it's a hardware-related issue, but it's hard to be sure. I would try backing up any important data, then reformatting and reinstalling the system. This definitely is a strange problem. My laptop does this from time to time as well, where you just get a hard lock and all you can do is reboot. It only occurred once, and I suspect that it was heat related and that the BIOS was just trying to protect the system from overheating. That might be what is going on here. You could create a temperature script that emails you or writes a log of all the temperature readings, and hope that you get one before it locks up.
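
A minimal sketch of such a temperature logger in Python, under some loud assumptions: it reads the sysfs thermal zones under /sys/class/thermal, which report millidegrees Celsius on current kernels (an older kernel such as the one in Ubuntu 6.06 may expose readings elsewhere, or need lm-sensors instead), and the log path is hypothetical. It only writes a log; the emailing option is left out.

```python
#!/usr/bin/env python3
"""Append the current thermal zone temperatures to a log file (meant for cron)."""
import glob
import os
import sys
import time


def read_zone_temps(base="/sys/class/thermal"):
    """Return {zone_name: degrees_celsius} for every readable thermal zone."""
    temps = {}
    for path in sorted(glob.glob(os.path.join(base, "thermal_zone*", "temp"))):
        zone = os.path.basename(os.path.dirname(path))
        try:
            with open(path) as f:
                # sysfs reports millidegrees Celsius, e.g. 45000 for 45.0 C
                temps[zone] = int(f.read().strip()) / 1000.0
        except (OSError, ValueError):
            continue  # skip zones that cannot be read or parsed
    return temps


def format_temps(timestamp, temps):
    """Build one log line such as '<time> thermal_zone0=45.0C'."""
    readings = " ".join("%s=%.1fC" % (z, t) for z, t in sorted(temps.items()))
    return "%s %s" % (timestamp, readings or "no-readings")


def append_reading(log_path, base="/sys/class/thermal"):
    """Take one reading and append it to log_path, forcing it to disk."""
    stamp = time.strftime("%Y-%m-%d %H:%M:%S")
    with open(log_path, "a") as log:
        log.write(format_temps(stamp, read_zone_temps(base)) + "\n")
        log.flush()
        os.fsync(log.fileno())  # so the last line survives a hard lock


if __name__ == "__main__":
    # The log file path is the only argument (hypothetical usage).
    if len(sys.argv) == 2:
        append_reading(sys.argv[1])
    else:
        print("usage: temp_log.py LOGFILE", file=sys.stderr)
```

Scheduled every minute or two from cron, the last few lines before a lockup would show whether the temperature was climbing. The fsync call is there because a line still sitting in a write buffer is lost when the machine hangs.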

hbbtstar 02-06-2007 01:49 PM

Yeah, I'm going to try reformatting and reinstalling. I hope it's not a hardware related problem.


All times are GMT -5.