Do you use NTOP for troubleshooting on production servers?
Linux - SoftwareThis forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
ntop is a very good product when it's working, but I don't find it stable enough for a production environment. Additionally it's has (had?) annoying limitations such as not saving any data at all across a service restart, again significantly imparing it's production suitability.
What do you actually want to get out of ntop? If it's not the netflow / ipfix style data then there are probably better products. If you only have a few machines, I'd suggest you have a look at netflow analyzer by manageengine, which gives a free netflow server for 5 machines. you can than run a simpler netflow client on each box, ntop comes with one actually, and send the data in to a more robust and usable server. Or run it on each server if you want to. Same time, you can still run ntop in a central location as well...
I am currently use the following tools to monitor Java servlets:
Analyze stack (kill -3 & Thread Dump Analyzer)
Analyze memory dump (jmap & Memory Analyzer)
Analyze cpu usage, memory usage, garbage collection, number of threads running/waiting (YourKit Profiler)
I am still looking for ways to monitor:
Network issues which may cause problems with threads taking 300 seconds on particular times. We have no clue what causes the threads to be so slow sometimes. (happens once of twice a week, the entire web cluster is slow at these moments)
Databases issues which may cause threads to be so slow, same as above.
I thought ntop might be useful for discovering problems for the Network File System, isn't it?
hmm, not really. I mean it could assist in some ways, but not that directly. for something like an oracle connection issue you'll be able to see summary tabular data showing the volume of oracle transactions between each client and the server for periods of time during the day. If you want to retrospectively see what was happening on a given DB connection, i.e. what the network traffic looked like during the point of failure, it won't be able to. Instead if you can justify the disk space, i'd set up a tcpdump / tshark ring buffer to keep a given volume of network traffic on each server so you can go an see the raw traffic, or possibly just the tcp/ip headers to save on space, at any given point within your window.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.