0000401 | AlmaLinux-8 | kernel | public | 2023-06-06 01:16
Summary: 0000401: Upgrade to 8.8 leads to hung_task_timeout
Description: We upgraded hosts to 8.8 and now get the following kernel message, along with very high load. Downgrading to 8.7 fixed the issue.

kernel: INFO: task consul:9062 blocked for more than 120 seconds.
kernel: Not tainted 4.18.0-477.10.1.el8_8.x86_64 #1
kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
kernel: task:consul state:D stack: 0 pid: 9062 ppid: 9061 flags:0x00000004
kernel: Call Trace:
kernel: __schedule+0x2d1/0x870
kernel: ? flush_tlb_func_common.constprop.9+0x129/0x220
kernel: schedule+0x55/0xf0
kernel: io_schedule+0x12/0x40
kernel: migration_entry_wait_on_locked+0x1ea/0x290
kernel: ? filemap_fdatawait_keep_errors+0x50/0x50
kernel: do_swap_page+0x5b0/0x710
kernel: ? pmd_devmap_trans_unstable+0x2e/0x40
kernel: ? handle_pte_fault+0x5d/0x880
kernel: __handle_mm_fault+0x453/0x6c0
kernel: handle_mm_fault+0xca/0x2a0
kernel: __do_page_fault+0x1f0/0x450
kernel: do_page_fault+0x37/0x130
kernel: ? page_fault+0x8/0x30
kernel: page_fault+0x1e/0x30
kernel: RIP: 0033:0x1aaafb7
kernel: Code: Unable to access opcode bytes at RIP 0x1aaaf8d.
kernel: RSP: 002b:000000c0008c1da8 EFLAGS: 00010212
kernel: RAX: 0000000004f49b40 RBX: 0000000000000009 RCX: 000000c000000300
kernel: RDX: 0000000004f49b88 RSI: 0000000001aaafa0 RDI: 000000c00024b5e8
kernel: RBP: 000000c0008c1df8 R08: 0000000003440118 R09: 0000000000000000
kernel: R10: 0000000000000008 R11: 000000c00024b500 R12: 0000000000000034
kernel: R13: 000000000000000c R14: 0000000000000030 R15: 000000c000307340
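
To check whether other hosts hit the same warning, the log can be scanned for the "blocked for more than" line. This is only a minimal sketch and was not part of the original report: the names `HUNG_TASK_RE` and `find_hung_tasks` are hypothetical, and it assumes the log lines have the same shape as the excerpt above.

```python
import re

# Matches the kernel hung-task warning as it appears in the log above, e.g.
# "kernel: INFO: task consul:9062 blocked for more than 120 seconds."
HUNG_TASK_RE = re.compile(
    r"INFO: task (?P<comm>.+):(?P<pid>\d+) blocked for more than (?P<secs>\d+) seconds"
)


def find_hung_tasks(lines):
    """Return a (command, pid, seconds) tuple for each hung-task warning."""
    hits = []
    for line in lines:
        m = HUNG_TASK_RE.search(line)
        if m:
            hits.append((m.group("comm"), int(m.group("pid")), int(m.group("secs"))))
    return hits


# Sample input copied from the report; in practice the lines would come
# from the journal or /var/log/messages.
sample = [
    "kernel: INFO: task consul:9062 blocked for more than 120 seconds.",
    "kernel: Not tainted 4.18.0-477.10.1.el8_8.x86_64 #1",
]
print(find_hung_tasks(sample))
```

Running it on a log saved from each affected host should show which tasks are stuck in state D and how often.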



2023-06-06 01:16

reporter   ~0000907

For the record, there is a reference to this error on the upstream website.

