Linux server running Ubuntu 6.06 freezing after a week or so
Linux - GeneralThis Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
Linux server running Ubuntu 6.06 freezing after a week or so
This is the oddest problem I've ever come across, and I thought that perhaps the people here might know what to do. I recently set up a file server running Linux, and it works perfectly except for one odd flaw; it gradually slows down until after about a week it takes forever to do even simple things like transfer files on and off the server. It can't even reboot; I have to manually turn it off and on. After that, however, it works fine for another week, give or take. It's a pretty powerful machine, too; 2.2 GHz P4 with 1.5GB of RAM. Should be more than enough to handle simple file transfers and 2-3 users on at any given time. I thought it might be a hardware problem, but hardware diagnostics came back clean and another machine with the same specs runs Linux perfectly. The problem is that I have no idea where to begin diagnosing such a problem; what logs should I look at(I'm going to look at the syslogs tomorrow, but I don't know what to do past that)? Are there monitoring programs that can help me (I.E. can I have top put out a log every now and then to see what sort of stress the machine is being put under)? Any help would be greatly appreciated. Thanks in advance!
What services do you have on this computer besides just file sharing? What are you using to file share? Samba? NFS?
You are right in that your machine is messed up. I run a SAMBA file server with printing and SSH on a Pentium 3, 256MB RAM computer with over 20+ users. It's stable like a rock!
I can check what services are running besides Samba; off the top of my head, I know I'm using winbind. I'll try all those things when I get in tomorrow, as well as get you the output of the log files. Hopefully I'll be able to figure out what's going wrong with it.
Well, I got to the logs and I don't see anything out of the ordinary anywhere. It does try to use postfix/sendmail, which is odd since I don't have them installed on the machine, but other than that I don't see anything weird with the logs. I've reposted them here for anyone's perusal pleasure.
syslog:
Code:
02/05/2007 10:38:08 AM localhost anacron[4698] Job `cron.daily' terminated (mailing output)
02/05/2007 10:38:08 AM localhost postfix/sendmail[22675] fatal: open /etc/postfix/main.cf: No such file or directory
02/05/2007 10:38:08 AM localhost syslogd 1.4.1#17ubuntu7 restart.
02/05/2007 10:38:09 AM localhost anacron[4698] Normal exit (1 job run)
02/05/2007 10:38:09 AM localhost anacron[4698] Tried to mail output of job `cron.daily', but mailer process (/usr/sbin/sendmail) exited with ststus 1
02/05/2007 10:45:03 AM localhost /USR/SBIN/CRON[22679] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 10:46:14 AM localhost gdm[4482] Couldn't authenticate user
02/05/2007 10:46:32 AM localhost gconfd (sourcefileserv-22743) starting (version 2.14.0), pid 22743 user 'sourcefileserv'
02/05/2007 10:46:33 AM localhost gconfd (sourcefileserv-22743) Resolved address "xml:readonly:/etc/gconf/gconf.xml.defaults" to a read-only configuration source at position 2
02/05/2007 10:46:33 AM localhost gconfd (sourcefileserv-22743) Resolved address "xml:readonly:/etc/gconf/gconf.xml.mandatory" to a read-only configuration source at position 0
02/05/2007 10:46:33 AM localhost gconfd (sourcefileserv-22743) Resolved address "xml:readonly:/var/lib/gconf/debian.defaults" to a read-only configuration source at position 3
02/05/2007 10:46:33 AM localhost gconfd (sourcefileserv-22743) Resolved address "xml:readonly:/var/lib/gconf/defaults" to a read-only configuration source at position 4
02/05/2007 10:46:33 AM localhost gconfd (sourcefileserv-22743) Resolved address "xml:readwrite:/home/sourcefileserv/.gconf" to a writable configuration source at position 1
02/05/2007 10:46:42 AM localhost gconfd (sourcefileserv-22743) Resolved address "xml:readwrite:/home/sourcefileserv/.gconf" to a writable configuration source at position 0
02/05/2007 10:50:06 AM localhost gconfd (root-22912) Resolved address "xml:readonly:/etc/gconf/gconf.xml.defaults" to a read-only configuration source at position 2
02/05/2007 10:50:06 AM localhost gconfd (root-22912) Resolved address "xml:readonly:/etc/gconf/gconf.xml.mandatory" to a read-only configuration source at position 0
02/05/2007 10:50:06 AM localhost gconfd (root-22912) Resolved address "xml:readonly:/var/lib/gconf/debian.defaults" to a read-only configuration source at position 3
02/05/2007 10:50:06 AM localhost gconfd (root-22912) Resolved address "xml:readonly:/var/lib/gconf/defaults" to a read-only configuration source at position 4
02/05/2007 10:50:06 AM localhost gconfd (root-22912) Resolved address "xml:readwrite:/root/.gconf" to a writable configuration source at position 1
02/05/2007 10:50:06 AM localhost gconfd (root-22912) starting (version 2.14.0), pid 22912 user 'root'
02/05/2007 10:50:36 AM localhost gconfd (root-22912) Exiting
02/05/2007 10:50:36 AM localhost gconfd (root-22912) GConf server is not in use, shutting down.
02/05/2007 10:53:07 AM localhost crontab[23008] (sourcefileserv) LIST (sourcefileserv)
02/05/2007 10:55:01 AM localhost /USR/SBIN/CRON[23031] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:05:02 AM localhost /USR/SBIN/CRON[23038] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:15:01 AM localhost /USR/SBIN/CRON[23042] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:17:01 AM localhost /USR/SBIN/CRON[23046] (root) CMD ( run-parts --report /etc/cron.hourly)
02/05/2007 11:25:01 AM localhost /USR/SBIN/CRON[23067] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:35:02 AM localhost /USR/SBIN/CRON[23070] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:45:01 AM localhost /USR/SBIN/CRON[23072] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 11:55:01 AM localhost /USR/SBIN/CRON[23074] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:05:02 PM localhost /USR/SBIN/CRON[23076] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:15:01 PM localhost /USR/SBIN/CRON[23079] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:17:01 PM localhost /USR/SBIN/CRON[23081] (root) CMD ( run-parts --report /etc/cron.hourly)
02/05/2007 12:25:01 PM localhost /USR/SBIN/CRON[23085] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:35:01 PM localhost /USR/SBIN/CRON[23087] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:45:01 PM localhost /USR/SBIN/CRON[23089] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 12:55:01 PM localhost /USR/SBIN/CRON[23091] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 01:05:01 PM localhost /USR/SBIN/CRON[23093] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 01:15:01 PM localhost /USR/SBIN/CRON[23095] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
02/05/2007 01:17:01 PM localhost /USR/SBIN/CRON[23098] (root) CMD ( run-parts --report /etc/cron.hourly)
02/05/2007 01:22:54 PM localhost crontab[23205] (sourcefileserv) LIST (sourcefileserv)
02/05/2007 01:23:41 PM localhost crontab[23212] (sourcefileserv) REPLACE (sourcefileserv)
02/05/2007 01:23:49 PM localhost crontab[23216] (sourcefileserv) LIST (sourcefileserv)
02/05/2007 01:25:01 PM localhost /USR/SBIN/CRON[23243] (root) CMD ([ -x /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1 $SA1_OPTIONS 1 1 ; })
authlog: (only a snippet, this sequence is more or less the same throughout the whole logs with no discernible deviations)
Code:
02/01/2007 04:06:12 PM localhost smbd[5139] (pam_unix) session opened for user sourcefileserv by (uid=0)
02/01/2007 04:15:02 PM localhost CRON[5195] (pam_unix) session closed for user root
02/01/2007 04:15:02 PM localhost CRON[5195] (pam_unix) session opened for user root by (uid=0)
02/01/2007 04:17:01 PM localhost CRON[5197] (pam_unix) session opened for user root by (uid=0)
02/01/2007 04:17:02 PM localhost CRON[5197] (pam_unix) session closed for user root
02/01/2007 04:25:01 PM localhost CRON[5199] (pam_unix) session closed for user root
02/01/2007 04:25:01 PM localhost CRON[5199] (pam_unix) session opened for user root by (uid=0)
02/01/2007 04:35:01 PM localhost CRON[5211] (pam_unix) session opened for user root by (uid=0)
02/01/2007 04:35:02 PM localhost CRON[5211] (pam_unix) session closed for user root
02/01/2007 04:45:01 PM localhost CRON[5213] (pam_unix) session closed for user root
02/01/2007 04:45:01 PM localhost CRON[5213] (pam_unix) session opened for user root by (uid=0)
02/01/2007 04:55:01 PM localhost CRON[5218] (pam_unix) session closed for user root
02/01/2007 04:55:01 PM localhost CRON[5218] (pam_unix) session opened for user root by (uid=0)
02/01/2007 05:05:01 PM localhost CRON[5225] (pam_unix) session closed for user root
02/01/2007 05:05:01 PM localhost CRON[5225] (pam_unix) session opened for user root by (uid=0)
02/01/2007 05:15:01 PM localhost CRON[5232] (pam_unix) session closed for user root
02/01/2007 05:15:01 PM localhost CRON[5232] (pam_unix) session opened for user root by (uid=0)
02/01/2007 05:17:01 PM localhost CRON[5241] (pam_unix) session opened for user root by (uid=0)
02/01/2007 05:17:02 PM localhost CRON[5241] (pam_unix) session closed for user root
02/01/2007 05:17:25 PM localhost smbd[5247] (pam_unix) session opened for user sourcefileserv by (uid=0)
02/01/2007 05:25:01 PM localhost CRON[5248] (pam_unix) session closed for user root
02/01/2007 05:25:01 PM localhost CRON[5248] (pam_unix) session opened for user root by (uid=0)
02/01/2007 05:25:46 PM localhost smbd[5253] (pam_unix) session opened for user sourcefileserv by (uid=0)
02/01/2007 05:29:36 PM localhost smbd[5261] (pam_unix) session opened for user sourcefileserv by (uid=0)
02/01/2007 05:30:29 PM localhost smbd[5261] (pam_unix) session closed for user sourcefileserv
02/01/2007 05:35:01 PM localhost CRON[5262] (pam_unix) session closed for user root
02/01/2007 05:35:01 PM localhost CRON[5262] (pam_unix) session opened for user root by (uid=0)
Maybe hardware related issue but its hard to be sure. I would try backing up any important data and reformat reinstall the system. This def is a strange problem. My laptop does this from time to time as well where you just get a hard lock and all you can do is reboot. It only occurred once and I suspect that it was heat related and the bios was just trying to protect the system from an overheat. That might be what is going on you could create a temp script that emails you or creates a log with all of the temp reading and hope that you get one before it locks up.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.