qemu/nvme at staging-9.0 - qemu

History

Klaus Jensen 30e7e3fdd6 hw/nvme: fix handling of over-committed queues If a host chooses to use the SQHD "hint" in the CQE to know if there is room in the submission queue for additional commands, it may result in a situation where there are not enough internal resources (struct NvmeRequest) available to process the command. For a lack of a better term, the host may "over-commit" the device (i.e., it may have more inflight commands than the queue size). For example, assume a queue with N entries. The host submits N commands and all are picked up for processing, advancing the head and emptying the queue. Regardless of which of these N commands complete first, the SQHD field of that CQE will indicate to the host that the queue is empty, which allows the host to issue N commands again. However, if the device has not posted CQEs for all the previous commands yet, the device will have less than N resources available to process the commands, so queue processing is suspended. And here lies an 11 year latent bug. In the absense of any additional tail updates on the submission queue, we never schedule the processing bottom-half again unless we observe a head update on an associated full completion queue. This has been sufficient to handle N-to-1 SQ/CQ setups (in the absense of over-commit of course). Incidentially, that "kick all associated SQs" mechanism can now be killed since we now just schedule queue processing when we return a processing resource to a non-empty submission queue, which happens to cover both edge cases. However, we must retain kicking the CQ if it was previously full. So, apparently, no previous driver tested with hw/nvme has ever used SQHD (e.g., neither the Linux NVMe driver or SPDK uses it). But then OSv shows up with the driver that actually does. I salute you. Fixes: `f3c507adcd` ("NVMe: Initial commit for new storage interface") Cc: qemu-stable@nongnu.org Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2388 Reported-by: Waldemar Kozaczuk <jwkozaczuk@gmail.com> Reviewed-by: Keith Busch <kbusch@kernel.org> Signed-off-by: Klaus Jensen <k.jensen@samsung.com> (cherry picked from commit `9529aa6bb4`) Signed-off-by: Michael Tokarev <mjt@tls.msk.ru>		2024-11-08 20:41:36 +03:00
..
ctrl.c	hw/nvme: fix handling of over-committed queues	2024-11-08 20:41:36 +03:00
dif.c	hw/nvme: fix CRC64 for guard tag	2023-08-08 08:09:38 +02:00
dif.h	hw/nvme: 64-bit pi support	2022-03-03 09:30:21 +01:00
Kconfig	kconfig: Add NVME to s390x machines	2023-09-12 12:07:16 +02:00
meson.build	hw/nvme: Add NVMe NGUID property	2024-03-12 15:48:56 +01:00
nguid.c	hw/nvme: Add NVMe NGUID property	2024-03-12 15:48:56 +01:00
ns.c	hw/nvme: Add NVMe NGUID property	2024-03-12 15:48:56 +01:00
nvme.h	hw/nvme: add machine compatibility parameter to enable msix exclusive bar	2024-03-12 16:05:53 +01:00
subsys.c	hw/nvme: fix verification of number of ruhis	2023-06-28 11:22:17 +02:00
trace-events	hw/nvme: fix compliance issue wrt. iosqes/iocqes	2023-08-07 12:27:24 +02:00
trace.h	hw/nvme: move nvme emulation out of hw/block	2021-05-17 09:19:00 +02:00