Solaris / OpenSolaris This forum is for the discussion of Solaris and OpenSolaris.
General Sun, SunOS and Sparc related questions also go here. |
| Notices |
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
Are you new to LinuxQuestions.org? Visit the following links:
Site Howto |
Site FAQ |
Sitemap |
Register Now
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
 |
GNU/Linux Basic Guide
This 255-page guide will provide you with the keys to understand the philosophy of free software, teach you how to use and handle it, and give you the tools required to move easily in the world of GNU/Linux. Many users and administrators will be taking their first steps with this GNU/Linux Basic guide and it will show you how to approach and solve the problems you encounter.
Click Here to receive this Complete Guide absolutely free. |
|
 |
07-24-2008, 04:47 AM
|
#1
|
|
Member
Registered: Dec 2005
Location: Madrid
Distribution: Solaris 10, Solaris Express Community Edition
Posts: 547
Rep:
|
How can I analyze why my system hangs during heavy ZFS over USB activity?
Hi.
I have a zpool made up of three 350GB Lacie USB 2.0 disks configured with a single parity RAIDZ. The zpool is 81% full and have 31 ZFS file systems in it. Every file system is snapshotted once a day and I retain one entire week of snapshots (217 snapshot in total). The scrubbing of the zpool is performed once a week and it takes more or less 5 hours. The machine is a Sun Ultra 20 M2 and both scrubbing and snapshotting are performed by cron jobs.
I don't know if it's just a coincidence, but I just upgraded OpenSolaris 2008.05 to build 93. What's happening these days is that during heavy activity, for example during a scrub or a scrub overlapped by snapshot creation, I find the system completely hung and I cannot even connect via ssh to it, and eventually I have to power it down. After some reboots I even noticed that the snapshot operation didn't even complete (I don't know about the scrub because it leaves no evidence, or at least I don't know where).
How can I try to identify the problem? I never observed this behavior before and I would expect the system not to hang, even during a disk failure, which is not even the case.
Any hints?
|
|
|
|
07-25-2008, 12:17 AM
|
#2
|
|
Member
Registered: Apr 2003
Location: Wisconsin
Distribution: Switched to regualr Ubuntu, because I don't like KDE4, at all. Looks like vista on crack.....
Posts: 675
Rep:
|
Any chance your system simply over heated ? The "heavy activity" clue points in that direction. Does OpenSolaris have something like Linux's sensor package ?
David
|
|
|
|
07-25-2008, 12:30 PM
|
#3
|
|
Member
Registered: Dec 2005
Location: Madrid
Distribution: Solaris 10, Solaris Express Community Edition
Posts: 547
Original Poster
Rep:
|
There could be chance... but the workstation works perfectly and the USB disks are external, how could this have a relationship with the OS hang? I mean: if some USB disk overheats, it's still strange that the system hangs!
|
|
|
|
07-25-2008, 03:14 PM
|
#4
|
|
Moderator
Registered: Feb 2004
Location: Outside Paris
Distribution: Solaris10, Solaris 11, Ubuntu, OL
Posts: 9,311
|
If the kernel hangs, you can configure it to generate a crash dump after 50 seconds of inactivity.
Add this line to the /etc/system file:
It will be taken into account after the next boot.
Better to run the system in console (no X11) mode so you'll be able to see system messages.
When you reboot the OS, have a look at /var/crash/<hostname> for dump files.
Last edited by jlliagre; 07-25-2008 at 03:15 PM.
|
|
|
|
07-25-2008, 08:47 PM
|
#5
|
|
Member
Registered: Apr 2003
Location: Wisconsin
Distribution: Switched to regualr Ubuntu, because I don't like KDE4, at all. Looks like vista on crack.....
Posts: 675
Rep:
|
If it's a heat issue, it almost certainly isn't the hard drives, external or not, that would be to blame. It would be the CPU. Too much or too little cpu paste, too much dust on the heat spreader. Improperly mounted heat sink. Could be lots of things, doesn't have to be the hard drives.
David
Oh, you might want to test your ram too.....
Last edited by budword; 07-25-2008 at 08:49 PM.
|
|
|
|
07-29-2008, 06:35 AM
|
#6
|
|
Member
Registered: Dec 2005
Location: Madrid
Distribution: Solaris 10, Solaris Express Community Edition
Posts: 547
Original Poster
Rep:
|
@budword:
Thanks, I'm checking if the Ultra 20 M2 has temperature sensors I can read from the OS but it seems it's not the case.
@jlliagre:
Thanks for the hint indeed.
|
|
|
|
| Thread Tools |
Search this Thread |
|
|
|
Posting Rules
|
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is Off
|
|
|
All times are GMT -5. The time now is 05:15 PM.
|
|
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.
|
Latest Threads
LQ News
|
|