It happened again. After 4 days of uptime the system crashed. In Lish I was able to see this:
Code:
[<c011f3bf>] ? do_page_fault+0x24f/0x3a0
[<c0105c27>] ? xen_force_evtchn_callback+0x17/0x30
[<c0106404>] ? check_events+0x8/0xc
[<c01063fb>] ? xen_restore_fl_direct_reloc+0x4/0x4
[<c011f170>] ? mm_fault_error+0x130/0x130
[<c06bfc66>] ? error_code+0x5a/0x60
[<c012007b>] ? try_preserve_large_page+0x7b/0x340
[<c011f170>] ? mm_fault_error+0x130/0x130
[<c01ab8a8>] ? swap_count_continued+0x158/0x180
[<c01abe22>] ? __swap_duplicate+0xc2/0x160
[<c01abb04>] ? add_swap_count_continuation+0x54/0x130
[<c01abee4>] ? swap_duplicate+0x14/0x40
[<c01a068b>] ? copy_pte_range+0x45b/0x500
[<c0106404>] ? check_events+0x8/0xc
[<c01a08c5>] ? copy_page_range+0x195/0x200
[<c0132756>] ? dup_mmap+0x1c6/0x2c0
[<c0132b88>] ? dup_mm+0xa8/0x130
[<c01335fa>] ? copy_process+0x98a/0xb30
[<c01337ef>] ? do_fork+0x4f/0x280
[<c010f780>] ? sys_clone+0x30/0x40
[<c06c000d>] ? ptregs_clone+0x15/0x48
[<c06bf6f1>] ? syscall_call+0x7/0xb
[<c06b0000>] ? sctp_backlog_rcv+0xf0/0x100
INFO: rcu_sched_state detected stall on CPU 2 (t=60000 jiffies)
INFO: rcu_sched_state detected stall on CPU 1 (t=60000 jiffies)
INFO: rcu_sched_state detected stall on CPU 3 (t=240030 jiffies)
INFO: rcu_sched_state detected stall on CPU 2 (t=240031 jiffies)
INFO: rcu_sched_state detected stall on CPU 1 (t=240031 jiffies)
INFO: rcu_sched_state detected stall on CPU 1 (t=420061 jiffies)
INFO: rcu_sched_state detected stall on CPU 2 (t=420061 jiffies)
INFO: rcu_sched_state detected stall on CPU 1 (t=600091 jiffies)
Lish was unresponsive; I could not type anything there. And as usual: no SSH, no web, nothing.
Ideas?