Linux server hangs, how to debug?
We have installed few linux servers , all having same hardware and same OS. The hardware is DL380 Proliant. We have RHEL with the following details
Kernel 2.6.9-34.0.2.ELsmp #1 SMP Fri Jun 30 10:32:04 EDT 2006 x86_64 x86_64 x86_64 GNU/Linux
It has 16GB RAM and 2 Dual core CPUs. We have the below problem with 2 servers. One server has Oracle and other server has a java process.
The server starts perfectly and after some period(some times 1 day, sometimes 3 days, sometimes 3hrs of server startup) of time, it just hangs. Hangs means that Im not able connect(telnet, sqlplus or anytype connections to the server). The exsiting open oracle connections does not return any data.
It would be nice if someone could tell me what areas I should be looking for and how to debug this issue?
NOte: I did some analysis and found that the memory is pegging at 99.5% most of the times and approx 10% of SWAP is used.
|