qemu/hw at 51254ffb320183a4636635840c23ee0e3a1efffa - qemu

Daniel Henrique Barboza 51254ffb32 spapr_drc.c: introduce unplug_timeout_timer

The LoPAR spec provides no way for the guest kernel to report failure of hotplug/hotunplug events. This wouldn't be bad if those operations were guaranteed to always succeed, but that's far from reality.

What ends up happening is that, in the case of a failed hotunplug, regardless of whether it was a QEMU error or guest misbehavior, the pSeries machine retains the unplug state of the device in the running guest. This state is cleaned up on machine reset, where it is assumed that the state represents a device that is pending unplug, and the device is hotunplugged from the board. Until the reset occurs, any hotunplug operation on the same device is forbidden because there is a pending unplug state.

This behavior has at least one undesirable side effect. A long-standing pending unplug state is, more often than not, the result of a hotunplug error. The user has to deal with it, since retrying the unplug is not allowed, and then on machine reset we remove the device from the guest. This means that we're failing the user twice: we failed to hotunplug when asked, then hotunplugged without notice.

Solutions to this problem range from trying to predict when the hotunplug will fail and forbidding the operation from the QEMU layer, to opening up the IRQ queue to allow for multiple hotunplug attempts, to telling users to 'reboot the machine if something goes wrong'. The first solution is flawed because we can't fully predict guest behavior from QEMU, the second is a trial-and-error remediation that counts on the hope that the unplug will eventually succeed, and the third is ... well.

This patch introduces a crude but effective solution to hotunplug errors in the pSeries machine. For each unplug done, we start a timeout. If a certain amount of time passes, we clean up the hotunplug state from the machine. During the timeout period, any unplug operation on the same device will still be blocked. After that, we assume that the guest failed the operation, and allow the user to try again. If the timeout is too short we'll prevent legitimate hotunplug operations from completing, so we need to overestimate the regular time an unplug operation takes to succeed to account for that.

The true solution for hotunplug errors in the pSeries machine is a PAPR change that allows the guest to warn the platform about them. For now, the work done in this timeout design can be reused for the new PAPR 'abort hcall' in the future, given that in both cases we'll need code to clean up the existing unplug states of the DRCs.

At this moment we're adding the basic wiring of the timer into the DRC. The next patch will use the timer to time out failed CPU hotunplugs.

Signed-off-by: Daniel Henrique Barboza <danielhb413@gmail.com>
Message-Id: <20210222194531.62717-4-danielhb413@gmail.com>
Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
2021-03-10 09:07:09 +11:00
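The timeout behavior described in the commit message can be sketched in isolation. The following is a minimal, self-contained C model, not the code from spapr_drc.c: the `DemoDrc` type, the `demo_*` functions, and the 15000 ms value are all hypothetical names and numbers chosen for illustration, and time is passed in explicitly instead of using QEMU's timer API.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical timeout value; the commit message only says it must
 * overestimate the time a regular unplug takes. */
#define DEMO_UNPLUG_TIMEOUT_MS 15000

/* Hypothetical stand-in for a DRC; this is not QEMU's SpaprDrc. */
typedef struct {
    bool unplug_requested;      /* pending unplug state */
    int64_t unplug_deadline_ms; /* time at which the pending state expires */
} DemoDrc;

/* Request an unplug. Returns false if a previous unplug is still pending,
 * mirroring "any unplug operation on the same device will still be blocked". */
static bool demo_request_unplug(DemoDrc *drc, int64_t now_ms)
{
    assert(drc != NULL);
    if (drc->unplug_requested) {
        return false;
    }
    drc->unplug_requested = true;
    drc->unplug_deadline_ms = now_ms + DEMO_UNPLUG_TIMEOUT_MS;
    return true;
}

/* The guest completed the unplug: clear the pending state. */
static void demo_unplug_done(DemoDrc *drc)
{
    assert(drc != NULL);
    drc->unplug_requested = false;
}

/* Timer expiry check: once the deadline has passed, assume the guest
 * failed the operation and clear the state so the user can retry. */
static void demo_timer_tick(DemoDrc *drc, int64_t now_ms)
{
    assert(drc != NULL);
    if (drc->unplug_requested && now_ms >= drc->unplug_deadline_ms) {
        drc->unplug_requested = false;
    }
}
```

In this model, a second `demo_request_unplug()` call before the deadline is rejected, while a call made after `demo_timer_tick()` has observed the deadline is accepted again. Clearing the same pending state from both `demo_unplug_done()` and the timer path reflects the commit's remark that the cleanup code is needed regardless of what triggers it.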
acpi	acpi/core: always set SCI_EN when SMM isn't supported	2021-02-23 10:58:42 -05:00
adc	hw/adc: Add an ADC module for NPCM7XX	2021-01-12 21:19:02 +00:00
arm	hw/arm/mps2: Update old infocenter.arm.com URLs	2021-03-08 11:54:16 +00:00
audio	qom: Put name parameter before value / visitor parameter	2020-07-10 15:18:08 +02:00
block	qdev: Move softmmu properties to qdev-properties-system.h	2020-12-18 15:20:17 -05:00
char	hw/char/pl011: add a clock input	2020-10-27 11:10:44 +00:00
core	accel/tcg: Precompute curr_cflags into cpu->tcg_cflags	2021-03-06 11:53:57 -08:00
cpu	Use OBJECT_DECLARE_SIMPLE_TYPE when possible	2020-09-18 14:12:32 -04:00
cris	sysbus: Convert to sysbus_realize() etc. with Coccinelle	2020-06-15 22:05:28 +02:00
display	Clean up includes	2020-12-10 17:16:44 +01:00
dma	arm: Update infocenter.arm.com URLs	2021-02-11 11:50:14 +00:00
firmware	machine: Refactor smp-related call chains to pass MachineState	2019-07-05 17:07:36 -03:00
gpio	hw/gpio: Add GPIO model for Nuvoton NPCM7xx	2020-10-27 11:10:32 +00:00
hyperv	Use OBJECT_DECLARE_SIMPLE_TYPE when possible	2020-09-18 14:12:32 -04:00
i2c	hw/i2c: Implement NPCM7XX SMBus Module FIFO Mode	2021-02-16 14:12:54 +00:00
i386	i386/acpi: restore device paths for pre-5.1 vms	2021-03-02 05:40:35 -05:00
ide	nomaintainer: Fix Lesser GPL version number	2020-11-15 17:04:40 +01:00
input	input: tsc2xxx fix.	2020-09-22 21:11:10 +01:00
intc	hw/ppc: Remove unused ppcuic_init()	2021-01-19 10:20:29 +11:00
ipack	Use OBJECT_DECLARE_SIMPLE_TYPE when possible	2020-09-18 14:12:32 -04:00
ipmi	Use OBJECT_DECLARE_SIMPLE_TYPE when possible	2020-09-18 14:12:32 -04:00
isa	vt82c686: Make vt82c686b-pm an abstract base class and add vt8231-pm based on it	2021-02-21 19:42:34 +01:00
kvm	target/i386: always create kvmclock device	2020-09-30 19:11:36 +02:00
lm32	Include qemu-common.h exactly where needed	2019-06-12 13:20:20 +02:00
m68k	hw/m68k/next-cube: Add missing header comment to next-cube.h	2021-01-19 09:11:52 +01:00
mem	acpi: Permit OEM ID and OEM table ID fields to be changed	2021-02-05 08:52:59 -05:00
mips	hw/mips: Add a bootloader helper	2021-02-21 18:41:04 +01:00
misc	hw/arm/mps2: Update old infocenter.arm.com URLs	2021-03-08 11:54:16 +00:00
net	hw/net: Add npcm7xx emc model	2021-03-05 15:17:34 +00:00
nubus	Use OBJECT_DECLARE_SIMPLE_TYPE when possible	2020-09-18 14:12:32 -04:00
nvram	fw_cfg: Refactor extra pci roots addition	2020-12-08 13:48:57 -05:00
pci	vt82c686: Make vt82c686b-pm an abstract base class and add vt8231-pm based on it	2021-02-21 19:42:34 +01:00
pci-bridge	Use OBJECT_DECLARE_SIMPLE_TYPE when possible	2020-09-18 14:12:32 -04:00
pci-host	Pull request	2021-02-10 15:42:20 +00:00
ppc	spapr_drc.c: introduce unplug_timeout_timer	2021-03-10 09:07:09 +11:00
rdma	Use DECLARE_CHECKER macros	2020-09-09 09:27:09 -04:00
remote	multi-process: perform device reset in the remote process	2021-02-10 09:23:28 +00:00
riscv	hw/riscv: sifive_u: Change SIFIVE_U_GEM_IRQ to decimal value	2021-03-04 09:43:29 -05:00
rtc	m48t59: remove legacy m48t59_init() function	2020-10-18 16:21:42 +01:00
rx	Use DECLARE_CHECKER macros	2020-09-09 09:27:09 -04:00
s390x	s390: Recognize confidential-guest-support option	2021-02-08 16:57:38 +11:00
scsi	qemu-sparc queue	2021-03-09 13:50:35 +00:00
sd	Pull request trivial patches 20210220	2021-02-21 12:12:18 +00:00
semihosting	semihosting: Fix Lesser GPL version number	2020-11-15 16:38:03 +01:00
sh4	hw/sh4: Add missing license	2021-03-06 16:18:42 +01:00
southbridge	Use DECLARE_CHECKER macros	2020-09-09 09:27:09 -04:00
sparc	include/hw/sparc/grlib.h: Remove unused set_pil_in_fn typedef	2021-01-06 11:41:37 +00:00
ssi	hw/ssi: Add SiFive SPI controller support	2021-03-04 09:43:29 -05:00
timer	arm: Remove frq properties on CMSDK timer, dualtimer, watchdog, ARMSSE	2021-01-29 15:54:44 +00:00
tricore	Include hw/irq.h a lot less	2019-08-16 13:31:52 +02:00
unicore32	hw/unicore32: restrict hw addr defines to source file	2017-12-18 17:07:02 +03:00
usb	usb: xlnx-usb-subsystem: Add xilinx usb subsystem	2020-12-15 12:04:30 +00:00
vfio	vfio: Change default dirty pages tracking behavior during migration	2020-11-23 10:05:58 -07:00
virtio	display/ui: add a callback to indicate GL state is flushed	2021-02-04 15:58:54 +01:00
watchdog	arm: Remove frq properties on CMSDK timer, dualtimer, watchdog, ARMSSE	2021-01-29 15:54:44 +00:00
xen	xen: remove GNUC check	2020-12-15 12:53:13 -05:00
xtensa	Include hw/irq.h a lot less	2019-08-16 13:31:52 +02:00
boards.h	confidential guest support: Rework the "memory-encryption" property	2021-02-08 16:57:38 +11:00
clock.h	clock: Add new clock_has_source() function	2021-01-29 15:54:42 +00:00
elf_ops.h	elf_ops: correct loading of 32 bit PVH kernel	2021-03-06 11:41:54 +01:00
fw-path-provider.h	Use DECLARE_CHECKER macros	2020-09-09 09:27:09 -04:00
hotplug.h	Use DECLARE_CHECKER macros	2020-09-09 09:27:09 -04:00
hw.h	Include hw/hw.h exactly where needed	2019-08-16 13:31:52 +02:00
ide.h	hw/ide: Move MAX_IDE_DEVS define to hw/ide/internal.h	2020-03-17 12:22:36 -04:00
irq.h	include/hw/irq.h: New function qemu_irq_is_connected()	2020-08-03 17:55:03 +01:00
loader-fit.h	nomaintainer: Fix Lesser GPL version number	2020-11-15 17:04:40 +01:00
loader.h	hw/core/loader: Let load_elf() populate a field with CPU-specific flags	2020-01-29 19:28:52 +01:00
nmi.h	Use DECLARE_CHECKER macros	2020-09-09 09:27:09 -04:00
or-irq.h	Use DECLARE_CHECKER macros	2020-09-09 09:27:09 -04:00
pcmcia.h	Use OBJECT_DECLARE_TYPE when possible	2020-09-18 14:12:32 -04:00
platform-bus.h	nomaintainer: Fix Lesser GPL version number	2020-11-15 17:04:40 +01:00
ptimer.h	ptimer: Add new ptimer_set_period_from_clock() function	2021-01-29 15:54:42 +00:00
qdev-clock.h	hw/qdev-clock: Avoid calling qdev_connect_clock_in after DeviceRealize	2020-08-28 10:02:46 +01:00
qdev-core.h	machine: introduce MachineInitPhase	2020-12-15 12:51:52 -05:00
qdev-dma.h	Supply missing header guards	2019-06-12 13:20:21 +02:00
qdev-properties-system.h	qdev: Reuse DEFINE_PROP in all DEFINE_PROP_* macros	2020-12-18 15:20:17 -05:00
qdev-properties.h	qdev: Rename qdev_get_prop_ptr() to object_field_prop_ptr()	2020-12-18 15:20:18 -05:00
register.h	Use DECLARE_CHECKER macros	2020-09-09 09:27:09 -04:00
registerfields.h	hw/registerfields: Prefix local variables with underscore in macros	2020-05-27 11:23:07 -07:00
resettable.h	Use DECLARE_CHECKER macros	2020-09-09 09:27:09 -04:00
stream.h	hw/core/stream: Rename StreamSlave as StreamSink	2020-12-10 12:15:04 -05:00
sysbus.h	qom: Remove module_obj_name parameter from OBJECT_DECLARE* macros	2020-09-18 14:12:32 -04:00
usb.h	usb: add pcap support.	2021-01-22 14:51:35 +01:00
vmstate-if.h	Use DECLARE_CHECKER macros	2020-09-09 09:27:09 -04:00