qemu/target at 681e338ee60f5cde11360c498e9c0cd100133672 - qemu

History

Peter Maydell 318a0b0556 target/arm: Handle denormals correctly for FMOPA (widening) The FMOPA (widening) SME instruction takes pairs of half-precision floating point values, widens them to single-precision, does a two-way dot product and accumulates the results into a single-precision destination. We don't quite correctly handle the FPCR bits FZ and FZ16 which control flushing of denormal inputs and outputs. This is because at the moment we pass a single float_status value to the helper function, which then uses that configuration for all the fp operations it does. However, because the inputs to this operation are float16 and the outputs are float32 we need to use the fp_status_f16 for the float16 input widening but the normal fp_status for everything else. Otherwise we will apply the flushing control FPCR.FZ16 to the 32-bit output rather than the FPCR.FZ control, and incorrectly flush a denormal output to zero when we should not (or vice-versa). (In commit `207d30b5fd` we tried to fix the FZ handling but didn't get it right, switching from "use FPCR.FZ for everything" to "use FPCR.FZ16 for everything".) (Mjt: it is commit `d5373d7bdb` in stable-7.2) Pass the CPU env to the sme_fmopa_h helper instead of an fp_status pointer, and have the helper pass an extra fp_status into the f16_dotadd() function so that we can use the right status for the right parts of this operation. Cc: qemu-stable@nongnu.org Fixes: `207d30b5fd` ("target/arm: Use FPST_F16 for SME FMOPA (widening)") Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2373 Signed-off-by: Peter Maydell <peter.maydell@linaro.org> Reviewed-by: Richard Henderson <richard.henderson@linaro.org> (cherry picked from commit `55f9f4ee01`) Signed-off-by: Michael Tokarev <mjt@tls.msk.ru> (Mjt: s/tcg_env/cpu_env/ due to missingv 8.1.0-1189-gad75a51e84af "tcg: Rename cpu_env to tcg_env")		2024-08-02 14:01:04 +03:00
..
alpha	accel/tcg: Remove will_exit argument from cpu_restore_state	2022-11-01 08:31:41 +11:00
arm	target/arm: Handle denormals correctly for FMOPA (widening)	2024-08-02 14:01:04 +03:00
avr	target/avr: Convert to tcg_ops restore_state_to_opc	2022-10-26 11:11:28 +10:00
cris	accel/tcg: Remove will_exit argument from cpu_restore_state	2022-11-01 08:31:41 +11:00
hexagon	target/hexagon: Convert to tcg_ops restore_state_to_opc	2022-10-26 11:11:28 +10:00
hppa	target/hppa: Clear psw_n for BE on use_nullify_skip path	2024-04-09 20:09:20 +03:00
i386	i386/cpu: fixup number of addressable IDs for processor cores in the physical package	2024-07-01 09:50:03 +03:00
loongarch	target/loongarch: fix a wrong print in cpu dump	2024-06-07 08:10:49 +03:00
m68k	target/m68k: Fix semihost lseek offset computation	2023-08-03 08:26:26 +03:00
microblaze	accel/tcg: Remove will_exit argument from cpu_restore_state	2022-11-01 08:31:41 +11:00
mips	target/mips: Fix TX79 LQ/SQ opcodes	2023-11-19 21:15:23 +03:00
nios2	target/nios2: Fix semihost lseek offset computation	2023-08-03 08:26:26 +03:00
openrisc	accel/tcg: Remove will_exit argument from cpu_restore_state	2022-11-01 08:31:41 +11:00
ppc	target/ppc: Fix lxv/stxv MSR facility check	2024-02-24 19:29:15 +03:00
riscv	target/riscv: Fix mcycle/minstret increment behavior	2024-01-08 19:24:44 +03:00
rx	target/rx: Use target_ulong for address in LI	2024-07-30 22:28:41 +03:00
s390x	target/s390x: Fix LAE setting a wrong access register	2024-01-13 11:28:02 +03:00
sh4	target/sh4: add missing CHECK_NOT_DELAY_SLOT	2024-04-10 20:40:02 +03:00
sparc	target/sparc: Convert to tcg_ops restore_state_to_opc	2022-10-26 11:11:28 +10:00
tricore	target/tricore: Rename tricore_feature	2023-11-19 21:15:23 +03:00
xtensa	target/xtensa: fix OOB TLB entry access	2024-01-27 18:05:30 +03:00
Kconfig	hw/loongarch: Add support loongson3 virt machine type.	2022-06-06 18:09:03 +00:00
meson.build	target/loongarch: Add target build suport	2022-06-06 18:09:03 +00:00