Hello,
Ever since I upgraded my Linode to Debian squeeze, I have been seeing regular kernel "page allocation failure" messages such as the following:
Code:
kernel: swapper: page allocation failure. order:4, mode:0x20
kernel: Pid: 0, comm: swapper Not tainted 2.6.38-linode31 #1
kernel: Call Trace:
kernel: [<c01845c0>] ? __alloc_pages_nodemask+0x4f0/0x670
kernel: [<c01aa259>] ? cache_alloc_refill+0x2f9/0x520
kernel: [<c01aa534>] ? __kmalloc+0xb4/0xe0
kernel: [<c04f43be>] ? pskb_expand_head+0x12e/0x200
kernel: [<c010630b>] ? xen_restore_fl_direct_end+0x0/0x1
kernel: [<c01a9666>] ? kmem_cache_free+0x46/0x120
kernel: [<c04f490d>] ? __pskb_pull_tail+0x4d/0x2b0
kernel: [<c05ee17d>] ? packet_rcv_spkt+0xfd/0x140
kernel: [<c04fca7a>] ? dev_hard_start_xmit+0x26a/0x520
kernel: [<c0510792>] ? sch_direct_xmit+0xb2/0x170
kernel: [<c0534cc4>] ? nf_iterate+0x74/0xa0
kernel: [<c0559ca0>] ? ip_finish_output+0x0/0x300
kernel: [<c04fce51>] ? dev_queue_xmit+0x121/0x550
kernel: [<c0559ca0>] ? ip_finish_output+0x0/0x300
kernel: [<c0559dd4>] ? ip_finish_output+0x134/0x300
kernel: [<c055a04a>] ? ip_output+0xaa/0xe0
kernel: [<c0559ca0>] ? ip_finish_output+0x0/0x300
kernel: [<c0558f68>] ? ip_local_out+0x18/0x20
kernel: [<c0559667>] ? ip_queue_xmit+0x137/0x3a0
kernel: [<c0105b37>] ? xen_force_evtchn_callback+0x17/0x30
kernel: [<c0105b37>] ? xen_force_evtchn_callback+0x17/0x30
kernel: [<c056c652>] ? tcp_transmit_skb+0x372/0x7e0
kernel: [<c056ec18>] ? tcp_write_xmit+0x198/0x980
kernel: [<c0105b37>] ? xen_force_evtchn_callback+0x17/0x30
kernel: [<c056f464>] ? __tcp_push_pending_frames+0x24/0x90
kernel: [<c056a3f6>] ? tcp_rcv_established+0x146/0x840
kernel: [<c0571e46>] ? tcp_v4_do_rcv+0xd6/0x230
kernel: [<c01382c6>] ? local_bh_enable+0x16/0x80
kernel: [<c057266c>] ? tcp_v4_rcv+0x6cc/0x7b0
kernel: [<c0554e37>] ? ip_local_deliver_finish+0x97/0x220
kernel: [<c0554da0>] ? ip_local_deliver_finish+0x0/0x220
kernel: [<c05547a6>] ? ip_rcv_finish+0xf6/0x3c0
kernel: [<c04facfd>] ? __netif_receive_skb+0x32d/0x510
kernel: [<c0105b37>] ? xen_force_evtchn_callback+0x17/0x30
kernel: [<c04fbbb7>] ? netif_receive_skb+0x67/0x70
kernel: [<c049ff05>] ? xennet_poll+0x7f5/0xc20
kernel: [<c04fc2fa>] ? net_rx_action+0x9a/0x130
kernel: [<c013810c>] ? __do_softirq+0x7c/0x130
kernel: [<c0138090>] ? __do_softirq+0x0/0x130
kernel: <IRQ> [<c0138005>] ? irq_exit+0x65/0x70
kernel: [<c0439f2d>] ? xen_evtchn_do_upcall+0x1d/0x30
kernel: [<c0109487>] ? xen_do_upcall+0x7/0xc
kernel: [<c01013a7>] ? hypercall_page+0x3a7/0x1010
kernel: [<c0105b8f>] ? xen_safe_halt+0xf/0x20
kernel: [<c010f66f>] ? default_idle+0x2f/0x60
kernel: [<c0107ed2>] ? cpu_idle+0x42/0x70
kernel: [<c07ca8ac>] ? start_kernel+0x2da/0x2df
kernel: [<c07ca410>] ? unknown_bootoption+0x0/0x190
kernel: [<c07cdaa5>] ? xen_start_kernel+0x530/0x538
kernel: Mem-Info:
kernel: DMA per-cpu:
kernel: CPU 0: hi: 0, btch: 1 usd: 0
kernel: CPU 1: hi: 0, btch: 1 usd: 0
kernel: CPU 2: hi: 0, btch: 1 usd: 0
kernel: CPU 3: hi: 0, btch: 1 usd: 0
kernel: Normal per-cpu:
kernel: CPU 0: hi: 186, btch: 31 usd: 42
kernel: CPU 1: hi: 186, btch: 31 usd: 171
kernel: CPU 2: hi: 186, btch: 31 usd: 134
kernel: CPU 3: hi: 186, btch: 31 usd: 184
kernel: active_anon:16222 inactive_anon:27566 isolated_anon:0
kernel: active_file:27915 inactive_file:31023 isolated_file:0
kernel: unevictable:0 dirty:33 writeback:0 unstable:0
kernel: free:9224 slab_reclaimable:9493 slab_unreclaimable:2108
kernel: mapped:7298 shmem:4430 pagetables:611 bounce:0
kernel: DMA free:2084kB min:84kB low:104kB high:124kB active_anon:120kB inactive_anon:316kB active_file:2692kB inactive_file:2804kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15808kB mlocked:0kB dirty:0kB writeback:0kB mapped:580kB shmem:408kB slab_reclaimable:72kB slab_unreclaimable:32kB kernel_stack:8kB pagetables:0kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
kernel: lowmem_reserve[]: 0 500 500 500
kernel: Normal free:34812kB min:2816kB low:3520kB high:4224kB active_anon:64768kB inactive_anon:109948kB active_file:108968kB inactive_file:121288kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:512064kB mlocked:0kB dirty:132kB writeback:0kB mapped:28612kB shmem:17312kB slab_reclaimable:37900kB slab_unreclaimable:8400kB kernel_stack:1144kB pagetables:2444kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
kernel: lowmem_reserve[]: 0 0 0 0
kernel: DMA: 241*4kB 62*8kB 5*16kB 1*32kB 2*64kB 1*128kB 1*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 2084kB
kernel: Normal: 2285*4kB 2697*8kB 225*16kB 17*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 34860kB
kernel: 78061 total pagecache pages
kernel: 14676 pages in swap cache
kernel: Swap cache stats: add 330524, delete 315848, find 3444837/3487305
kernel: Free swap = 423812kB
kernel: Total swap = 524284kB
kernel: 133104 pages RAM
kernel: 0 pages HighMem
kernel: 5637 pages reserved
kernel: 67797 pages shared
kernel: 65096 pages non-shared
As indicated, this is using Linux 2.6.38-linode31.
The process involved is always "swapper", "apache2" or "ssh", and the problem occurs most often during a nightly script that rsyncs to a volume mounted over sshfs+encfs (rsync itself does not report any error).
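For what it is worth, "order:4" in the first line of the trace means a request for 2^4 = 16 contiguous pages, which is 64kB if pages are 4kB (an assumption on my part, though it matches the 4kB granularity in the dump). Below is a rough Python sketch that parses the two per-zone free-list lines from the dump and checks whether a free block of at least that size existed. The function names are my own, and the check ignores watermarks and lowmem_reserve, so it only says whether a large enough block was on the free lists, not whether the kernel was allowed to use it.

```python
import re

PAGE_KB = 4  # assumption: 4 kB pages


def free_blocks(line):
    """Parse a free-list line from the dump into {block_size_kB: count}."""
    # Matches entries such as "2285*4kB"; the trailing "= 34860kB" has no '*'.
    return {int(size): int(count)
            for count, size in re.findall(r"(\d+)\*(\d+)kB", line)}


def total_free_kb(line):
    """Sum of all free blocks in kB, to cross-check the '= NNNkB' total."""
    return sum(size * count for size, count in free_blocks(line).items())


def has_block_for_order(line, order):
    """True if at least one free block is large enough for 2**order pages."""
    need_kb = PAGE_KB << order
    return any(count > 0 and size >= need_kb
               for size, count in free_blocks(line).items())


# Free-list lines copied from the dump above.
dma = ("DMA: 241*4kB 62*8kB 5*16kB 1*32kB 2*64kB 1*128kB 1*256kB "
       "0*512kB 0*1024kB 0*2048kB 0*4096kB = 2084kB")
normal = ("Normal: 2285*4kB 2697*8kB 225*16kB 17*32kB 0*64kB 0*128kB "
          "0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 34860kB")

for name, line in (("DMA", dma), ("Normal", normal)):
    print(name, total_free_kb(line), "kB free,",
          "order-4 block available:", has_block_for_order(line, 4))
```

With the numbers above, the Normal zone has about 34MB free but no free block of 64kB or larger, while the DMA zone still has a few. So the failure may be a matter of fragmentation rather than memory actually running out, but I am not sure I am reading this correctly.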
This is the memory information that may be relevant:
Code:
# free -lm
             total       used       free     shared    buffers     cached
Mem:           497        426         71          0         38        166
Low:           497        426         71
High:            0          0          0
-/+ buffers/cache:        221        276
Swap:          511         77        434
# cat /proc/sys/vm/lowmem_reserve_ratio
256 32 32
# cat /proc/zoneinfo
Node 0, zone DMA
pages free 688
min 21
low 26
high 31
scanned 0
spanned 4080
present 3952
nr_free_pages 688
nr_inactive_anon 48
nr_active_anon 83
nr_inactive_file 578
nr_active_file 584
nr_unevictable 0
nr_mlock 0
nr_anon_pages 18
nr_mapped 150
nr_file_pages 1279
nr_dirty 0
nr_writeback 0
nr_slab_reclaimable 9
nr_slab_unreclaimable 0
nr_page_table_pages 0
nr_kernel_stack 21
nr_unstable 0
nr_bounce 0
nr_vmscan_write 0
nr_writeback_temp 0
nr_isolated_anon 0
nr_isolated_file 0
nr_shmem 102
nr_dirtied 5617
nr_written 5608
nr_anon_transparent_hugepages 0
protection: (0, 500, 500, 500)
pagesets
cpu: 0
count: 0
high: 0
batch: 1
vm stats threshold: 6
cpu: 1
count: 0
high: 0
batch: 1
vm stats threshold: 6
cpu: 2
count: 0
high: 0
batch: 1
vm stats threshold: 6
cpu: 3
count: 0
high: 0
batch: 1
vm stats threshold: 6
all_unreclaimable: 0
start_pfn: 16
inactive_ratio: 1
Node 0, zone Normal
pages free 17443
min 704
low 880
high 1056
scanned 0
spanned 129024
present 128016
nr_free_pages 17443
nr_inactive_anon 33819
nr_active_anon 18066
nr_inactive_file 18962
nr_active_file 27900
nr_unevictable 0
nr_mlock 0
nr_anon_pages 42007
nr_mapped 8023
nr_file_pages 61636
nr_dirty 30
nr_writeback 0
nr_slab_reclaimable 2336
nr_slab_unreclaimable 2213
nr_page_table_pages 919
nr_kernel_stack 171
nr_unstable 0
nr_bounce 0
nr_vmscan_write 232406
nr_writeback_temp 0
nr_isolated_anon 0
nr_isolated_file 0
nr_shmem 4470
nr_dirtied 3852185
nr_written 3961663
nr_anon_transparent_hugepages 0
protection: (0, 0, 0, 0)
pagesets
cpu: 0
count: 184
high: 186
batch: 31
vm stats threshold: 18
cpu: 1
count: 176
high: 186
batch: 31
vm stats threshold: 18
cpu: 2
count: 165
high: 186
batch: 31
vm stats threshold: 18
cpu: 3
count: 164
high: 186
batch: 31
vm stats threshold: 18
all_unreclaimable: 0
start_pfn: 4096
inactive_ratio: 1
Any ideas what this is about?
Thanks,
Bruno