View Issue Details

ID: 0000258
Project: AlmaLinux-8
Category: qemu-kvm
View Status: public
Last Update: 2022-06-03 06:29
Reporter: jpbennett
Assigned To:
Priority: high
Severity: crash
Reproducibility: sometimes
Status: new
Resolution: open
Platform: AMD Server
OS: AlmaLinux
OS Version: 8
Summary: 0000258: qemu-kvm process goes to 100%, VM becomes unresponsive, triggered by creating a new VM
Description: Multiple times on two separate servers, qemu-kvm processes have suddenly spiked to 100% CPU usage, and the VMs backed by those processes have become unresponsive to the point of needing a forced power cycle. Multiple VMs enter this state at the exact same moment. In two cases, the VM recovered after several minutes, and in both cases the VM's system clock was set in the future. It seems that the longer the hang goes on, the further into the future the clock jumps; one machine thought it was the year 2217. Dates were correct before the hangs.

These servers were running CentOS 8 before being converted to Alma, and didn't exhibit this bug on CentOS. The hanging VMs have also all been running Alma 8, though that's likely a coincidence, as that's most of my VMs now. I haven't seen anything notable in the host machine's dmesg or logs.
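To put a number on the clock jump described above, here is a minimal sketch that compares a guest's clock against the host's. It assumes you have already collected both values as epoch seconds (for example by running `date +%s` on each side once the guest responds again). The function names, the 60-second tolerance, and the example dates are illustrative assumptions, not values taken from the affected machines.

```python
from datetime import datetime, timezone


def clock_skew_seconds(host_epoch: float, guest_epoch: float) -> float:
    """Return guest minus host; a positive value means the guest is ahead."""
    return guest_epoch - host_epoch


def looks_like_future_jump(host_epoch: float, guest_epoch: float,
                           tolerance_s: float = 60.0) -> bool:
    """Flag a guest whose clock is ahead of the host by more than tolerance_s.

    The 60 second default is an arbitrary assumption meant to ignore
    ordinary drift; tune it for your environment.
    """
    return clock_skew_seconds(host_epoch, guest_epoch) > tolerance_s


if __name__ == "__main__":
    # Illustrative values only: a host time on the report date, and a guest
    # time somewhere in the year 2217 (the exact guest date was not reported).
    host = datetime(2022, 6, 3, 6, 29, tzinfo=timezone.utc).timestamp()
    guest = datetime(2217, 1, 1, tzinfo=timezone.utc).timestamp()
    skew_years = clock_skew_seconds(host, guest) / (365.25 * 24 * 3600)
    print(f"future jump: {looks_like_future_jump(host, guest)}, "
          f"skew of roughly {skew_years:.0f} years")
```

Recording the skew alongside the approximate hang duration for each affected VM would help confirm whether the jump really grows with the length of the hang.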
Steps To Reproduce: Run multiple Alma 8 VMs on a server. Migrate an additional VM to that server using a command like:

    virsh migrate --live --persistent --undefinesource --copy-storage-all --verbose Gitlab qemu+ssh://10.10.3.2/system

During the migration, there is a chance that several of the already-running VMs will hang in the manner described. Oddly, not every VM hangs, nor does it happen on every migration, but when it does occur, all the affected VMs hang simultaneously.

Additionally, this has been reproduced when creating a new VM on a server. One of the other Alma VMs already running on that server entered the same failure state.
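When trying to catch the hang during a migration or VM creation, it can help to log which qemu-kvm processes are pegged. The sketch below parses text in the format produced by `ps -eo pid,pcpu,comm` and lists matching processes above a threshold. The function name, the 95% threshold, and the sample data are assumptions for illustration. Note that `ps` reports CPU usage averaged over the process lifetime, so a sampling tool such as `top` may reflect a sudden spike more quickly.

```python
def find_pegged(ps_output: str, name: str = "qemu-kvm",
                threshold: float = 95.0) -> list[tuple[int, float]]:
    """Return (pid, cpu_percent) for processes named `name` at or above threshold.

    Expects lines shaped like the output of `ps -eo pid,pcpu,comm`.
    """
    pegged = []
    for line in ps_output.splitlines():
        parts = line.split(None, 2)
        if len(parts) != 3:
            continue  # blank or malformed line
        pid, pcpu, comm = parts
        if not pid.isdigit():
            continue  # header row
        try:
            cpu = float(pcpu)
        except ValueError:
            continue
        # A multi-vCPU guest can legitimately exceed 100%, so this is
        # only a rough filter, not proof of a hang.
        if comm.strip() == name and cpu >= threshold:
            pegged.append((int(pid), cpu))
    return pegged


if __name__ == "__main__":
    # Made-up sample; on a real host you might capture the text with
    # subprocess.run(["ps", "-eo", "pid,pcpu,comm"], capture_output=True, text=True).stdout
    sample = (
        "  PID %CPU COMMAND\n"
        " 1234 99.8 qemu-kvm\n"
        " 1240  3.1 qemu-kvm\n"
        " 2000  100 sshd\n"
    )
    print(find_pegged(sample))
```

Running this in a loop with timestamps while a migration is in progress would show whether the affected VMs really spike at the same instant.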
Additional Information: Both servers are SuperMicro builds, one with an AMD Opteron(TM) Processor 6272, the other with an AMD EPYC 7302P 16-Core Processor. I'm using the Opteron_G3 processor type for the hosted VMs, as this allows live migration.
Tags: almalinux8, QEMU-KVM
abrt_hash
URL

Activities

jpbennett

2022-06-03 05:40

reporter   ~0000585

Just reproduced this issue by *rebooting* a running VM: a sister VM jumped to 100% CPU and became unresponsive.

jpbennett

2022-06-03 06:29

reporter   ~0000586

Dmesg from a VM as it hit this bug.
log2.png (85,648 bytes)   
log1.png (80,747 bytes)   

Issue History

Date Modified Username Field Change
2022-06-03 05:25 jpbennett New Issue
2022-06-03 05:25 jpbennett Tag Attached: almalinux8
2022-06-03 05:25 jpbennett Tag Attached: QEMU-KVM
2022-06-03 05:40 jpbennett Note Added: 0000585
2022-06-03 06:29 jpbennett Note Added: 0000586
2022-06-03 06:29 jpbennett File Added: log2.png
2022-06-03 06:29 jpbennett File Added: log1.png