I am facing memory starvation issues on a server and am trying to account for all of its memory, but there is a blank spot in my understanding regarding kernel memory usage. In a sentence: I need to find out what is using kernel memory on my server while seemingly being unaccounted for by any tool. More specifically:
I am running an AWS EC2 t3.medium instance with 4 GB RAM, on Debian GNU/Linux 9.5 (stretch) with kernel 4.9.0-7-amd64.
The problem is that, without any application load on the server, there is a steady increase in memory usage, while at the same time the application memory usage remains stable and the increase shows up as kernel memory usage. However, this kernel memory usage does not match the output of the various diagnostic tools. We need to find what is using the kernel memory.
Here are two charts: the first shows how the total application memory remains stable, and the second, at a closer zoom level, shows how the available memory decreases without any other metric rising (application memory is stable in the first chart as well):
Our application memory usage is around 2.7-2.8 GB, and all the tools agree on it (summing the per-process memory in top or ps aux matches the info in vmstat and /proc/meminfo; a sketch of that cross-check follows the vmstat output below):
vmstat (output of vmstat -s -S M):
Code:
3895 M total memory
3288 M used memory
2830 M active memory
102 M inactive memory
239 M free memory
14 M buffer memory
353 M swap cache
0 M total swap
0 M used swap
0 M free swap
9421426 non-nice user cpu ticks
33553 nice user cpu ticks
6918382 system cpu ticks
77883981 idle cpu ticks
388912 IO-wait cpu ticks
0 IRQ cpu ticks
278102 softirq cpu ticks
2354770 stolen cpu ticks
860794765 pages paged in
53978341 pages paged out
0 pages swapped in
0 pages swapped out
923803588 interrupts
1521737988 CPU context switches
1606336490 boot time
3042098 forks
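To cross-check the application figure against what the kernel reports, summing per-process RSS and comparing it with MemTotal minus MemAvailable is a quick first pass:

Code:
# Sum resident set sizes of all processes, in MB
# (RSS double-counts shared pages, so this slightly overstates)
ps aux --no-headers | awk '{rss += $6} END {printf "process RSS total: %d MB\n", rss/1024}'

# What the kernel considers in use overall, in MB
awk '/^MemTotal|^MemAvailable/ {print $1, int($2/1024), "MB"}' /proc/meminfo

If the gap between (MemTotal - MemAvailable) and the RSS total keeps growing while the RSS total stays flat, the growth is happening outside userspace, which is exactly what the charts show.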
According to slabtop, our accounted-for kernel memory usage is around 160 MB. However, according to smem and free (which shows the available memory being quite a bit less than it should be, everything taken into account), the actual kernel consumption works out far higher.
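For reference, the kernel-side fields that feed such a calculation come straight from /proc/meminfo:

Code:
# Kernel-side memory consumers visible in /proc/meminfo (values in kB)
grep -E '^(Slab|SReclaimable|SUnreclaim|KernelStack|PageTables|VmallocUsed):' /proc/meminfo

Note that VmallocUsed reads 0 on many 4.x kernels (its accounting was disabled for performance reasons), and pages grabbed directly via alloc_pages() do not show up in any of these fields.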
I see no evidence to support your supposition that it is a kernel issue - maybe try kmemleak. Designed for the task, and all you have to do is turn it on.
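If the kernel supports it, kmemleak really is just a couple of debugfs operations. A minimal sketch, assuming the running kernel was built with CONFIG_DEBUG_KMEMLEAK:

Code:
# debugfs is usually mounted already; mount it if not
mount -t debugfs nodev /sys/kernel/debug 2>/dev/null

# Trigger an immediate scan, then read back suspected leaks with stack traces
echo scan > /sys/kernel/debug/kmemleak
cat /sys/kernel/debug/kmemleak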
If you take a look at the pictures I attached as links, you can see that the available memory is continuously decreasing while the other memory metrics remain stable.
Adding up all the other reported memory metrics does not account for the memory consumption reported by free.
Also, the smem output I provided, showing 628 MB of kernel non-cache memory, is the final state: it starts at 180 MB and, without any usage on the (test) server, reaches the 628 MB level overnight.
Also, slabtop, which is supposed to show a number close to the kernel memory consumption, reports far less than smem's non-cache figure.
That is why I suspect kernel memory usage.
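The two figures being compared come from commands like these, with smem -w giving the kernel dynamic memory split and slabtop's header giving the slab totals:

Code:
# System-wide view: kernel dynamic memory split into non-cache / cache
smem -w

# One-shot slab summary; the header lines carry the totals
slabtop -o | head -n 5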
I have also read about the kmemleak option, but unfortunately it is not a single switch to flip: it requires reconfiguring and recompiling the kernel, which is a bit of trouble on these specific machines in our current setup, so I was hoping for a more "compile-free" solution or hint.
I will eventually fall back to this option, though, if no other suggestion comes up.
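For what it's worth, whether the running kernel was built with kmemleak can be checked without touching anything, and /proc/vmallocinfo at least gives a partial compile-free view of kernel allocations by caller:

Code:
# kmemleak needs CONFIG_DEBUG_KMEMLEAK=y at build time
grep KMEMLEAK /boot/config-$(uname -r)

# Compile-free: total vmalloc bytes grouped by allocating call site (run as root)
awk '{sum[$3] += $2} END {for (c in sum) printf "%12d %s\n", sum[c], c}' \
    /proc/vmallocinfo | sort -rn | head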
About 0.5 GB was lost in a day, but then it seems to stabilize once only around 100-200 MB are left as available memory.
However, the memory we are talking about is non-cache, so it cannot be freed on demand.
So we are left with very little available memory on the server, not enough to cover any memory peaks from the running processes.
I have also seen linuxatemyram.com, but unfortunately it mostly talks about caches.
In our case, dropping the caches only frees around 10 MB of RAM.
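For completeness, this is the standard drop (needs root):

Code:
# Flush dirty pages, then drop page cache plus dentries and inodes
sync
echo 3 > /proc/sys/vm/drop_caches

# Compare available memory before and after
free -m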
I have actually found the cause of the problem: it is a TCP communication channel towards a running FluentD daemon.
However, my question is mostly about how I could find where this kernel memory is used with some kind of tool, rather than by experimenting as I already did.
In addition, I have captured some graphs showing that the continuous decrease of available memory in the system (remember, the application memory remains stable) exactly tracks the trend of a TCP memory increase. However, they only match in trend, since the absolute TCP numbers are far lower (800 MB missing from memory vs. a 30 MB increase in TCP).
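For reference, the TCP memory figures in those graphs come from accounting the kernel already exposes; a few useful probes (24224 is FluentD's default forward port, adjust if yours differs):

Code:
# Pages currently charged to TCP ('mem' is in pages, usually 4 kB each)
grep '^TCP:' /proc/net/sockstat

# TCP memory limits in pages: min / pressure / max
sysctl net.ipv4.tcp_mem

# Per-socket buffer details for the FluentD connection
ss -tmp '( dport = :24224 or sport = :24224 )'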
Memory allocation via alloc_pages() without proper book-keeping could lead to such a situation, where the memory usage cannot be tracked from a /proc/meminfo dump.
It is the most reasonable explanation I have found so far...
It actually strikes me that I hadn't found this article before, since I had read everything I could find about /proc/meminfo.
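If untracked alloc_pages() usage really is the culprit, the page_owner facility can attribute pages to their allocating call sites. A sketch, assuming a kernel built with CONFIG_PAGE_OWNER and booted with page_owner=on (a build-time option again, so the same caveat as kmemleak applies):

Code:
# Check for support in the running kernel
grep PAGE_OWNER /boot/config-$(uname -r)

# With page_owner=on on the kernel command line, dump allocation stacks
cat /sys/kernel/debug/page_owner > page_owner_dump.txt

The kernel source tree ships tools/vm/page_owner_sort for aggregating that dump by stack trace.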