LinuxQuestions.org
Debian This forum is for the discussion of Debian Linux.

Old 10-10-2006, 09:20 PM   #1
norobro
Member
 
Registered: Feb 2006
Distribution: Debian Sid
Posts: 792

Rep: Reputation: 331
Boinc/Einstein crashing


Hello all.

I’ve been running Einstein/Boinc on Debian Sid for about a year and a half. Recently my work units have been crashing with the following error:
Code:
_SSEgas.c
2006-10-07 22:03:57.7242 [CRITICAL]: At lowest level status code = 0, description: NO LAL ERROR REGISTERED

2006-10-07 22:03:57.7243 [CRITICAL]: APP DEBUG: Application caught signal 6.

2006-10-07 22:03:57.7243 [CRITICAL]: Stack trace of LAL functions in worker thread:
2006-10-07 22:03:57.7243 [CRITICAL]: TestLALDemod at line 80 of file /home/bema/einsteinathome/CFS/EaH_build_release_einstein_S5R1_4.17/extra_sources/lalapps-CVS/src/pulsar/FDS_isolated/CFSLALDemod_SSEgas.c
2006-10-07 22:03:57.7243 [CRITICAL]: At lowest level status code = 0, description: NO LAL ERROR REGISTERED
Einstein Site: Results

I checked the logs and there are no entries at the time that the program crashed.

Also, I posted to the Einstein problem and bug reports page and got the reply that “something on your machine keeps trying to terminate the app.” It’s trying and succeeding!
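
A note on the "caught signal 6" line in the log above: on Linux, signal 6 is SIGABRT, which a process usually raises on itself by calling abort() (for example after a failed assertion), although another process can also send it. As a minimal sketch, the number in such a log line can be translated to a name with Python's standard signal module. The function name signal_name_from_log is made up for illustration, and the result assumes a Linux host.

```python
import re
import signal


def signal_name_from_log(line):
    """Return the signal name mentioned in a 'caught signal N' log line.

    Returns None if the line has no such phrase or the number is not a
    signal known on this platform. (Hypothetical helper, not part of
    Boinc or the Einstein app.)
    """
    match = re.search(r"caught signal (\d+)", line)
    if match is None:
        return None
    try:
        # signal.Signals maps a platform signal number to its enum member.
        return signal.Signals(int(match.group(1))).name
    except ValueError:
        return None


log_line = ("2006-10-07 22:03:57.7243 [CRITICAL]: "
            "APP DEBUG: Application caught signal 6.")
print(signal_name_from_log(log_line))
```

If the name comes back as SIGABRT, it may be worth checking whether the app is aborting on its own before hunting for another process that kills it.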

My system:
Processor: AMD Athlon XP 2700+
Memory: 512 MB
Debian Sid
Kernel: 2.6.18 self compiled.

Thought I’d request help from the experts. Any ideas?

TIA for any help.

Norm
 
Old 10-10-2006, 09:42 PM   #2
Dutch Master
Senior Member
 
Registered: Dec 2005
Posts: 1,686

Rep: Reputation: 124
Well, the Einstein folks gave you a hint: something is killing the app! Next job: figure out what makes the app stop, i.e. which of the programs it depends on quit working (like a chained dependency: app x depends on prog y to run, but process w kills prog y, causing app x to shut down). Figuring that one out is a PITA (pain in the *ss) but probably the only way to get to the bottom of your problem.
 
Old 10-11-2006, 09:00 AM   #3
norobro
Member
 
Registered: Feb 2006
Distribution: Debian Sid
Posts: 792

Original Poster
Rep: Reputation: 331
Dutch,

Any suggestions on how to go about troubleshooting this?

Don't know how familiar you are with Einstein. I'm using a downloaded Boinc binary, and it automatically downloads the Einstein app if there is a new version. The Einstein app is not open source.

In the absence of log entries, should I start killing programs? The last three WUs have each run for four hours before terminating. That makes for a lot of wasted CPU cycles.

I haven't performed a dist-upgrade in about a month, so I'll start with that.

The puzzle to me is that it just started in the last week. The last thing that I have done is compile the 2.6.18 kernel on Sept. 20. If the problem is there, it took over two weeks to show up.

Thanks for the reply!

Norm
 
Old 10-11-2006, 09:40 AM   #4
Dutch Master
Senior Member
 
Registered: Dec 2005
Posts: 1,686

Rep: Reputation: 124
As you already anticipated, I'm not familiar with either Boinc or Einstein. Finding the culprit requires a methodical approach. First, figure out what the dependencies of Einstein and Boinc are. Was the binary a .deb package? If it was, the dependencies should be in the package description. Next, check whether the installed versions of those dependent packages are suitable for Boinc/Einstein. As you've said that Boinc d/l's a new Einstein package automatically, I suspect the app expects a certain package to be installed but you still have an older, not quite compatible version. However, given your remark on how long those WUs run before failing, I don't exclude the possibility that hardware issues (i.e. upcoming failures) are causing these shutdowns.
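
For a downloaded binary that did not come from a .deb, one way to approach the dependency check described above is to ask the dynamic linker what the binary needs. The following is a rough sketch, assuming the standard ldd tool is available; the helper names parse_ldd and check_binary are made up for illustration, and the library names in the sample text are placeholders, not taken from the Einstein app.

```python
import subprocess


def parse_ldd(output):
    """Split ldd output into (resolved, missing).

    resolved maps a library name to the path it was found at;
    missing lists library names that ldd reported as 'not found'.
    """
    resolved, missing = {}, []
    for raw in output.splitlines():
        line = raw.strip()
        if "=>" not in line:
            continue  # loader/vdso lines, or 'not a dynamic executable'
        name, _, target = line.partition("=>")
        name, target = name.strip(), target.strip()
        if target.startswith("not found"):
            missing.append(name)
        elif target and not target.startswith("("):
            # Drop the trailing load address, e.g. ' (0x00007f01)'.
            resolved[name] = target.split(" (")[0]
    return resolved, missing


def check_binary(path):
    """Run ldd on the binary at path and parse what it prints."""
    result = subprocess.run(["ldd", path], capture_output=True,
                            text=True, check=False)
    return parse_ldd(result.stdout)


# Placeholder sample of ldd-style output, for illustration only.
sample = (
    "\tlinux-vdso.so.1 (0x00007ffc)\n"
    "\tlibm.so.6 => /lib/libm.so.6 (0x00007f01)\n"
    "\tlibfoo.so.1 => not found\n"
)
resolved, missing = parse_ldd(sample)
print("resolved:", resolved)
print("missing:", missing)
```

Calling check_binary("/path/to/the/app") with the real location of the binary (the path here is a placeholder) would list anything the linker cannot find. A statically linked binary yields empty results, in which case shared-library versions are unlikely to be the cause.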
 
  



All times are GMT -5. The time now is 11:41 PM.
