first of all I see you're using rrdtool for plotting, and I'm also guessing you're taking infrequent samples, so you lose on 2 counts! first, rrd 'normalizes' your data, so if you have more samples than will fit on the graph they get combined by whatever consolidation you've set things up to do. second, are those 15-minute buckets? think about it: an 8MB spike averaged over 15 minutes could easily have been a 100MB spike lasting many seconds or even minutes. you can't tell!
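to put some made-up numbers on it (nothing here is from your actual graph, it's just to show how averaging over a 900-second bucket flattens a burst):

    # toy numbers: a 100 MB/s burst lasting 60 s, idle for the rest
    # of a 900 s (15 minute) bucket
    burst_rate_mb_s = 100
    burst_len_s = 60
    bucket_s = 900

    total_mb = burst_rate_mb_s * burst_len_s   # 6000 MB moved during the burst
    avg_rate = total_mb / bucket_s             # what the 15-minute average shows
    print(f"graph shows ~{avg_rate:.1f} MB/s, actual peak was {burst_rate_mb_s} MB/s")
    # graph shows ~6.7 MB/s, actual peak was 100 MB/s

the burst is 15x bigger than anything the averaged graph will ever show you.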
if this is something you really care about resolving you need finer-grained metrics, full stop! if your monitoring tool can't handle that, use either sar or collectl - I prefer collectl.
if you still want rrd graphs for a coarse view, run both! tools like sar and collectl take almost no resources.
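just to show how cheap 1-second sampling is, here's a throwaway python sketch that prints cpu busy % once a second (linux /proc/stat field layout assumed - purely an illustration, not a stand-in for sar or collectl):

    #!/usr/bin/env python3
    # 1-second cpu sampler, linux only. /proc/stat first line assumed to be:
    # cpu  user nice system idle iowait irq softirq steal ...
    import time

    def cpu_times():
        with open("/proc/stat") as f:
            fields = [int(x) for x in f.readline().split()[1:]]
        idle = fields[3] + fields[4]   # idle + iowait
        return idle, sum(fields)

    prev_idle, prev_total = cpu_times()
    while True:
        time.sleep(1)
        idle, total = cpu_times()
        d_total, d_idle = total - prev_total, idle - prev_idle
        busy = 100.0 * (d_total - d_idle) / d_total if d_total else 0.0
        print(f"{time.strftime('%H:%M:%S')} cpu busy {busy:5.1f}%")
        prev_idle, prev_total = idle, total

run that next to your existing monitoring and you'll quickly see bursts that never show up in the 15-minute graphs.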
I also think the 'top' tools are only somewhat useful. they give you a quick look at what's going on right now, but no sense of the longer-term timeline.
so many people sample at 5-10 minute intervals and happily think their system is doing just fine, when in fact if there are spikes they'll never see them OR they'll look very small. for example, you might have short bursts of 100% cpu, disk or network load and never even know it. that's not to say short bursts are necessarily a bad thing, but if you have system errors corresponding to them they could be. w/o data you're just guessing...
-mark