LinuxQuestions.org
Old 12-16-2010, 02:51 AM   #1
mufy
Member
 
Registered: Oct 2004
Location: Kuwait
Distribution: Currently - AIX | Previously - RHEL 4 ES, FC 10
Posts: 206
Blog Entries: 4

Rep: Reputation: 30
KSH script behaving differently on an HACMP cluster node (prod) & a single node (UAT)


I have created a simple menu-driven script for our Operations team to handle basic monitoring and management of our production application from the back end. The script worked fine when tested in the UAT environment, but it behaved oddly once deployed to production.

This is the scenario:

When the operator chooses an option from the menu, he is shown the output and is then prompted to return to the main menu by pressing Ctrl+C. In production, this return does not happen and the program just sits there.

The session becomes unresponsive after that, and I'm forced to terminate it by closing the PuTTY window.

I also tried enabling debug mode (set -x) but still could not find any useful hints as to why.

Any troubleshooting tips would be greatly appreciated as I'm scheduled to have this sorted out by the end of the day :-).
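
For readers trying to picture the setup, here is a minimal sketch of the kind of menu loop described above. It is a hypothetical reconstruction, not the original script: the function names (show_menu, action_for, run_menu), the menu entries, and the APP_LOG variable are all made up for illustration. It sticks to plain POSIX sh syntax, so it should also run under ksh.

```shell
# Hypothetical reconstruction of a menu loop; not the original script.

show_menu() {
    printf '%s\n' "1) Show application status" "2) Follow application log" "q) Quit"
}

# Map a menu choice to an action name (the actions are placeholders).
action_for() {
    case "$1" in
        1) echo "status" ;;
        2) echo "follow_log" ;;
        q|Q) echo "quit" ;;
        *) echo "invalid" ;;
    esac
}

run_menu() {
    # Ctrl+C should return the operator to the menu instead of killing the script.
    trap 'echo; echo "Returning to main menu..."' INT
    while :; do
        show_menu
        printf 'Choice: '
        read -r choice || break          # stop on end of input
        case "$(action_for "$choice")" in
            status) uptime ;;
            # APP_LOG is an assumed variable naming the log file to follow.
            follow_log) tail -f "${APP_LOG:-/dev/null}" ;;
            quit) break ;;
            *) echo "Unknown option: $choice" ;;
        esac
    done
    trap - INT                           # restore default Ctrl+C handling
}
```

Call run_menu from an interactive session to try it. While tail -f is running, Ctrl+C interrupts tail, the trap then fires in the shell, and the loop redraws the menu. That return step is the one that hung in production.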
 
Old 12-16-2010, 03:09 AM   #2
kbp
Senior Member
 
Registered: Aug 2009
Posts: 3,758

Rep: Reputation: 643
Start with comparing the versions between UAT and Prod, then maybe compare the environments for the accounts you're using.
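
One way to make that comparison concrete is to dump the relevant settings to a file on each host and diff the two files. This is only a sketch: snapshot_env and compare_snapshots are hypothetical helper names, and the list of things captured (shell, terminal type, oslevel, stty settings, environment variables) is an assumption about what is likely to matter here.

```shell
# Write a snapshot of settings that commonly differ between hosts.
snapshot_env() {
    out=$1
    {
        echo "shell=${SHELL:-unknown}"
        echo "term=${TERM:-unknown}"
        # oslevel is AIX-specific; skip it quietly on other systems.
        if type oslevel >/dev/null 2>&1; then
            echo "oslevel=$(oslevel -s)"
        fi
        echo "stty:"
        stty -a 2>/dev/null              # prints nothing without a terminal
        echo "env:"
        env | sort
    } > "$out"
}

# Show the differences between two snapshots; exit status 0 means identical.
compare_snapshots() {
    diff "$1" "$2"
}
```

Run snapshot_env /tmp/uat.txt on UAT and snapshot_env /tmp/prod.txt on production under the same account, copy one file across, then run compare_snapshots on the pair. Host-specific values such as the hostname will naturally show up in the diff and can be ignored.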
 
Old 12-16-2010, 03:16 AM   #3
mufy
Member
 
Registered: Oct 2004
Location: Kuwait
Distribution: Currently - AIX | Previously - RHEL 4 ES, FC 10
Posts: 206
Blog Entries: 4

Original Poster
Rep: Reputation: 30
Production OS level
Code:
op58@prapbc1[/home/op58] $oslevel -s
5300-09-01-0847
UAT OS level
Code:
op58@prapbc[/home/op58] $oslevel -s
5300-06-10-0846
The account environments are identical on both systems.
 
Old 12-16-2010, 12:53 PM   #4
kbp
Senior Member
 
Registered: Aug 2009
Posts: 3,758

Rep: Reputation: 643
Well, that's a problem for a start. UAT and Prod should really be identical, and there's not much point looking any further until you resolve this, since application behaviour can change between versions.
 
Old 12-30-2010, 12:05 PM   #5
mufy
Member
 
Registered: Oct 2004
Location: Kuwait
Distribution: Currently - AIX | Previously - RHEL 4 ES, FC 10
Posts: 206
Blog Entries: 4

Original Poster
Rep: Reputation: 30
For those who would like to have a bigger picture, it is available here:
http://www.linkedin.com/groupItem?vi...entID_28666370

Below are parts of the traces I have gathered. I'd like to understand how they correlate.
kioctl(2, 1074295912, 0x2FF19720, 0x00000000) = 0
kioctl(2, 22528, 0x00000000, 0x00000000) = 0
kioctl(2, 21505, 0x2000C0F8, 0x00000000) = 0
kioctl(2, 22528, 0x00000000, 0x00000000) = 0
kioctl(2, 21507, 0x2000C0B8, 0x00000000) Err#82 ERESTART
Received signal #22, SIGTTOU [caught]

root[/home/root] # lsof64 -p 2478194
In while loop:256
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
sh 2478194 root cwd VDIR 10,8 4096 28672 /home (/dev/hd1)
sh 2478194 root 0u VCHR 23,6 0t67759 1335 /dev/pts/6
sh 2478194 root 1u VCHR 23,6 0t67759 1335 /dev/pts/6
sh 2478194 root 2u VCHR 23,6 0t67759 1335 /dev/pts/6
sh 2478194 root 10r VREG 10,5 5773 21707 /usr (/dev/hd2)
sh 2478194 root 62r VREG 10,8 421 28678 /home (/dev/hd1)
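
As an aid to reading the trace, the decimal ioctl request numbers can be split into a group character and a request number with plain shell arithmetic. The sketch below does only that; decode_ioctl is a hypothetical helper, and it assumes the usual layout where bits 8-15 hold the group character and bits 0-7 the request number.

```shell
# Split an ioctl request number into hex form, group character and number.
decode_ioctl() {
    n=$1
    group=$(( (n >> 8) & 255 ))          # bits 8-15: group character
    nr=$(( n & 255 ))                    # bits 0-7: request number
    # Turn the group byte into its ASCII character via an octal escape.
    ch=$(printf "\\$(printf '%03o' "$group")")
    printf '0x%X group=%s nr=%d\n' "$n" "$ch" "$nr"
}

# The four distinct request values that appear in the trace.
for req in 1074295912 22528 21505 21507; do
    decode_ioctl "$req"
done
```

The last two values decode to group T, requests 1 and 3. That matches the conventional SysV termio numbering for reading and then setting terminal attributes (TCGETA and TCSETAW), though this is an inference and should be checked against the AIX headers. If it holds, the trace fits standard job-control behaviour: a process that tries to change terminal settings while it is not in the terminal's foreground process group is sent SIGTTOU, and the call is restarted each time the handler returns. The lsof output confirms that file descriptor 2 is the terminal, /dev/pts/6.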
 
Old 01-03-2011, 02:08 AM   #6
mufy
Member
 
Registered: Oct 2004
Location: Kuwait
Distribution: Currently - AIX | Previously - RHEL 4 ES, FC 10
Posts: 206
Blog Entries: 4

Original Poster
Rep: Reputation: 30
The issue was finally resolved by upgrading sudo from 1.6.9p2 to 1.7.2p2. Never knew signal handlers could be so tricky.
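
For anyone checking for the same mismatch, the installed sudo version can be read from the first line of sudo -V on each node. A small sketch, where sudo_version is a hypothetical helper that assumes the first line has the form "Sudo version X":

```shell
# Extract the version string from `sudo -V` output supplied on stdin.
sudo_version() {
    sed -n '1s/^Sudo version \([^ ]*\).*$/\1/p'
}
```

Run sudo -V | sudo_version on both the UAT and production nodes and compare the results.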
 
  


All times are GMT -5. The time now is 07:15 AM.
