LinuxQuestions.org
Help answer threads with 0 replies.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - General
User Name
Password
Linux - General This Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.

Notices


Reply
  Search this Thread
Old 07-02-2010, 02:42 PM   #16
gimpy530
Member
 
Registered: Oct 2007
Posts: 98

Original Poster
Rep: Reputation: 16

borisk, that was a terrific first post. Using that diff I was able to get it to compile and replaced the file "/lib/modules/2.6.31-21-server/kernel/drivers/message/fusion/mptsas.ko" with it. I was able to format both arrays without error and I am currently doing an rsync from my backup. I still don't fully trust this but I'll be keeping an eye on it.

Thanks for your help borisk and H_TeXMeX_H.

*edit*

Well, that didn't last. It did another panic during the rsync. I already created an RMA with LSI on the chance that this is hardware related as when I first used the card I didn't have any real problems, it's just been getting worse. Any suggestions on what I should try? Send it in to replace it or anything else I can try?

Here's a dmesg: http://pastebin.com/TncSqT0J

Last edited by gimpy530; 07-02-2010 at 03:00 PM.
 
Old 07-02-2010, 03:44 PM   #17
H_TeXMeX_H
LQ Guru
 
Registered: Oct 2005
Location: $RANDOM
Distribution: slackware64
Posts: 12,928
Blog Entries: 2

Rep: Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301
The only other thing is: So, if this card was ok when you first started using it, do you remember any change that was made right before this started happening ? Any clue as to what it might be related to, other than these drivers (which I suspect are the cause). Just in case it's something else.
 
Old 07-02-2010, 04:00 PM   #18
gimpy530
Member
 
Registered: Oct 2007
Posts: 98

Original Poster
Rep: Reputation: 16
When I built the server I upgraded everything to the latest version, including the kernel. After that I formatted the arrays and moved the data on to them. I then set up Samba, nfs, rsync, mail, and all the others things I needed on it. Since then no kernel or module changes were made. Back then I saw it kernel panic once in the first week. A few days later, again, a few days later, again. Then it started doing it very often (more than once a day) and I did some testing and discovered that it would no longer even complete a format. Now that I replaced the module with the compiled one it completed the format on both arrays and transferred 60GB of data before it had another panic.
 
Old 07-03-2010, 04:45 AM   #19
H_TeXMeX_H
LQ Guru
 
Registered: Oct 2005
Location: $RANDOM
Distribution: slackware64
Posts: 12,928
Blog Entries: 2

Rep: Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301
So then it has never been stable, just more stable than now, I see. Well, unfortunately I have no more suggestions, if everything else checks out then the driver is probably just very poorly maintained or something, and thus is unstable. Maybe if you post some more logs from '/var/log/syslog' and messages, right before the hard lock, it may give some more clues.
 
Old 07-03-2010, 08:21 AM   #20
gimpy530
Member
 
Registered: Oct 2007
Posts: 98

Original Poster
Rep: Reputation: 16
I removed the card from the server and decided to us software RAID with mdadm until I fix the problem. I created the two arrays and formatted them with no problem. I then started an rsync to put the 2TB of data back on the drives and.....PANIC.

So, this is NOT related to the card as I am using the on board adapter, the same one the sys/boot drives are on. The data drives were already tested and came up clean, I am now doing diagnostics on the sys/boot drives. If that does not come up with any error I will be leaving Ubuntu for Debian and starting over with this server. If Debian has the same problem I'll set the server on fire and quit working in IT.
 
Old 07-03-2010, 08:42 AM   #21
H_TeXMeX_H
LQ Guru
 
Registered: Oct 2005
Location: $RANDOM
Distribution: slackware64
Posts: 12,928
Blog Entries: 2

Rep: Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301
A newer kernel may solve it, or it might be just bad hardware. If it's not the card, then it could be anything. Just keep looking through logs and trying different hardware configurations. I would start with the kernel tho, it could be as simple as that, a bad kernel.

It's a good idea to try different distros and see if it helps.
 
Old 07-19-2010, 12:09 PM   #22
gimpy530
Member
 
Registered: Oct 2007
Posts: 98

Original Poster
Rep: Reputation: 16
I've tried a number of distros and kernels and all have the same problem. Obviously this is a hardware issue, but any guess as to what? If I run the program Folding@Home which uses 100% of the CPU (all cores, it's meant to do that) then I also get a kernel panic. If I simply to 4 endless BASH loops to force each core to have 100% usage it does not panic.

So, certain processes which cause high CPU cause a panic, but not all high CPU processes do this. I have never seen anything like this before.

Any guess on which hardware is causing the problem and how to resolve it? At this point all I can do is throw away several hundred dollars of hardware and start buying new ones and hope it works, but it may have the same problem as this one.
 
Old 09-05-2010, 08:51 AM   #23
H_TeXMeX_H
LQ Guru
 
Registered: Oct 2005
Location: $RANDOM
Distribution: slackware64
Posts: 12,928
Blog Entries: 2

Rep: Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301Reputation: 1301
I was away for a while, but if the problem is still there try blacklisting all modules but 'ahci' and use that for all SATA drives. For example blacklist 'pata_atiixp'.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
RAID and XFS on current filesystem snoopy1 Linux - Software 2 03-26-2009 09:04 AM
Kernel Panic on boot related to software raid 6 mdrum Linux - Newbie 1 11-20-2008 01:41 PM
XFS drops create/delete files to 6.6% of EXT3 (sw raid) and to 0.6% (hw raid) darlingm Linux - Software 3 04-24-2008 10:39 AM
Kernel 2.4.23 compile causes kernel panics on ATA RAID-1 (mirror) array Raptor Ramjet Slackware 3 12-18-2003 01:40 PM
help with setting up xfs on raid(0) array Dr Ecology Linux - General 0 06-16-2003 04:43 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - General

All times are GMT -5. The time now is 05:45 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration