System Performance issues
Hi All,
I attended many interviews for Linux Admin post. Most of them asked me a common question 'How do you handle System Performance Issues' on a Linux Server. Now how do I answer this? Please help... |
Quote:
And I don't mean this to sound nasty, but if you can't answer those questions....are you sure you're qualified to be a systems admin? |
Thanks for the info...I did answer this question and brought about the points you have mentioned..I was lukin for some more info on this, like we check n/w stats by netstat command. What possible answers do I give?
Yes...I might not be qualified to be linux admin...Thats why I joined this forum to get help n advise from smart people like urself. I come here to enhance my knowledge. Thanks for the info... |
Also, If you have enuf time to help me out, could you give me some possible scenarios that I can work on n get back to u wid answers? Then u cud tell me if this is correct or not.
|
Hi,
Welcome to LQ! 'How to Ask Questions the Smart Way' would be one link you should look at to help us to help you in the future. Abbreviated or text speak will get you into some bad habits. Employers want someone who can provide complete and concise response(s) or speech. Not tech speak or AOL type presentations. You could use <Linux> - Google Search or Search LQ with proper keywords. Look at 'Slackware-Links' for useful links that may aid you. More than just SlackwareŽ links! |
OK you must thank me. ok friends.
Checking Linux Disk Space
ofcourse man. try to decrease your space usage ;) Increasing swap space The interviewer will love you man. you must tell this. they may understand your expertise. Building and installing a new kernel modules update your system with latest kernel patches and drivers Use Compressions files Remove unwanted log files Linux always loves to stick with Apache and Mysql usage Are you happy? enough? |
top will tell you if there's a disk-indexing operation (like strigi or updatedb) slowing things down. These can be disabled, or scheduled to run at less intrusive times.
Building a kernel optimized for either low latency (with preemption) or high throughput (without preemption) should be considered, depending on the requirements. If the disk drives are on a SATA bus, they had better be detected by the kernel as SATA devices, or else I would make that happen. If they're on an IDE bus, then I would use hdparm to tune them. I would be aware of hardware devices that require special procedures. For example, I understand that Western Digital's Caviar Green line of hard drives need to be partitioned in a certain way to get optimal performance. And I'm with Onebuck: the writing style in your top post was appropriate; the deliberate misspellings in your subsequent posts really weren't. |
Quote:
Now, some examples. First, you have to determine what's on that server. Web server? Database? File/print? How many users? All of this plays a part in server load. A database server will naturally be disk-intensive...so if a DB server is slow, start there. Check IO stat..and be aware about what KIND of device. iSCSI might be fine...but if you're saturating that network link, your bottlneck might be there. SAN? Lots to check too, from the qbick to the HBA. Web server? Start with network load....how much saturation is the pipe getting? Check ping times, and how long it takes to do something NON web related, like SSH into the box on that address. If SSH comes right up, but web doesn't...start asking questions like "did someone update the web code lately?" Check CPU load no matter what, and memory load. These days, it's fairly rare to be swapping to disk, so if you are...you've got problems, usually. The whole question isn't about a "right" or "wrong" answer, but how you THINK about the system. You're not just managing a server..you're managing the PEOPLE too. You have to know what they're doing, and how, and take those things into account as well. If your server is chugging along just fine, and one morning it's moving as fast as a glacier uphill...time to ask the users what happened last night. Like "did anyone apply a patch?", "hey, DBA's...I noticed that the DB process was restarted...what's up?", etc. |
Quote:
|
Thanks a lot guys...I didnt know that I would get so much response. Now I am getting some confidence that I can go up that mark which I am very badly striving for. Once again, thanks a lot. Appreciate all your responses
|
Quote:
Some big things to keep in mind:
Anyone who has done the job for any length of time, can tell you horror stories about all three listed above. Always remember that as the sysadmin, YOU are responsible. Set good policies, and balance what the user WANTS, against the good of the system as a whole. They may not like having to change passwords every sixty days, being auto-logged out at night, etc., but stand firm. You'll be held accountable when the system croaks, and if you've taken all the precautions you can, you'll be safe. |
Hi,
I would add to what 'TBone' has said. Patience! You are going to stumble and make mistakes. Hopefully few but when these do occur then investigate, debug and own up to it. Find out what it takes to rectify and adjust things holistically. Treat the systems as if they are your young. Alive and need of nourishment daily. You wouldn't let some stranger discipline your kid unnecessarily would you? Respect the machines and your users but know when to stand the high ground. And when to yield, 'Don't sweat the small stuff... and it's all small stuff'. Again Patience! May the 'bit's flow eventually without errors that will cause undue problems. :jawa: 'bit on',, 'bit off',,'bit !on'...'bit!off'... :hattip: |
Quote:
Firstly, it might be a surprise to hear that the correct is is not 'Use (name of favourite distro) on a model 12345 blade with the Xeon processor option a SAS disk subsystem and 4TB of ram without finding out about the application.' While this might work, in practice, it is usually a good way of being rejected for the job. In general, what is being looked for is a logical approach to the problem, and a reasoned response. So, you don't suggest a solution until you know what the problem is. So you want to include lines like:
It may well be that adding ram or a faster processor or something are often 'cures' for this kind of problem (and if you have a mission-critical app that falls over every day at the peak load time, there may be pressure to get a 'quick fix' and there may even be an argument for that, in certain restricted circumstances), you lose points for just throwing hardware at the problem every time rather than trying to understand and proceeding logically. There may also be a consideration of 're-architecting' the solution (using separate machines for separate tasks, for example). If your answer demonstrates an awareness of what you are doing for the organisation and a competence at that rather than just a way of getting the immediate problem to go away, that should be fine. it is also fine to say things like 'if I can assume that...., then i would proceed in this way, because if the other thing is a factor then this other approach would have to be considered'; that shows an awareness of the wider corporate world than your immediate job role. |
All times are GMT -5. The time now is 02:48 PM. |