[for-6.2,00/53] target/arm: MVE slices 3 and 4

Message ID

20210729111512.16541-1-peter.maydell@linaro.org

Headers

From: Peter Maydell <peter.maydell@linaro.org>
To: qemu-arm@nongnu.org,
	qemu-devel@nongnu.org
Subject: [PATCH for-6.2 00/53] target/arm: MVE slices 3 and 4
Date: Thu, 29 Jul 2021 12:14:19 +0100
Message-Id: <20210729111512.16541-1-peter.maydell@linaro.org>

Message

Peter Maydell July 29, 2021, 11:14 a.m. UTC

This patch series provides the third and fourth slices of the MVE
implementation. It gives us complete coverage of all MVE instructions
and brings us to the point where we can actually enable MVE.

In this series:
 * fixes for minor bugs in a couple of the insns already upstream
 * all the remaining integer instructions
 * the remaining loads and stores (scatter-gather and interleaving)
 * the floating point instructions
 * patch enabling MVE for the Cortex-M55

Things still to do:
 * MVE loads/stores should check alignment (this will depend on
   the patchset that RTH just sent out, and I didn't want to
   entangle the two features unnecessarily)
 * gdbstub support (blocked on the gdb folks nailing down what
   the XML for it should be)
 * optimization: many of the insns should have inline versions
   to use when we know we aren't doing any predication

But none of those are blockers for this landing upstream once
we reopen for 6.2.
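
To illustrate the optimization item in the list above, here is a minimal
sketch in plain C of why an unpredicated fast path is attractive. It is
not QEMU code: the names vadd_b_predicated(), vadd_b_unpredicated(),
vadd_b() and VEC_BYTES are hypothetical, the 16-bit mask with one bit per
byte lane is an assumption made for the example, and the choice is made
at run time here, whereas the real optimization would presumably be
decided at translate time.

```c
#include <assert.h>
#include <stdint.h>

#define VEC_BYTES 16 /* assumed vector size for this sketch: 128 bits */

/*
 * Hypothetical predicated byte-wise add: a lane is written only when
 * its bit in 'mask' is set, otherwise the old value of d[] is kept.
 */
static void vadd_b_predicated(uint8_t *d, const uint8_t *n,
                              const uint8_t *m, uint16_t mask)
{
    for (int i = 0; i < VEC_BYTES; i++) {
        if (mask & (1u << i)) {
            d[i] = (uint8_t)(n[i] + m[i]);
        }
    }
}

/*
 * Hypothetical unpredicated version: no per-lane test, so the loop is
 * a plain element-wise add that is easy to inline or vectorize.
 */
static void vadd_b_unpredicated(uint8_t *d, const uint8_t *n,
                                const uint8_t *m)
{
    for (int i = 0; i < VEC_BYTES; i++) {
        d[i] = (uint8_t)(n[i] + m[i]);
    }
}

/* Pick the fast path when every lane is active. */
static void vadd_b(uint8_t *d, const uint8_t *n, const uint8_t *m,
                   uint16_t mask)
{
    assert(d && n && m);
    if (mask == 0xffff) {
        vadd_b_unpredicated(d, n, m);
    } else {
        vadd_b_predicated(d, n, m, mask);
    }
}
```

Calling vadd_b() with a mask of 0x0005 updates only lanes 0 and 2 and
leaves the rest of the destination untouched; a mask of 0xffff takes the
branch-free loop.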

Still to review:
 03, 07, 10, 21, 26, and the new patches 36-53

thanks
-- PMM

Peter Maydell (53):
  target/arm: Note that we handle VMOVL as a special case of VSHLL
  target/arm: Print MVE VPR in CPU dumps
  target/arm: Fix MVE VSLI by 0 and VSRI by <dt>
  target/arm: Fix signed VADDV
  target/arm: Fix mask handling for MVE narrowing operations
  target/arm: Fix 48-bit saturating shifts
  target/arm: Fix MVE 48-bit SQRSHRL for small right shifts
  target/arm: Fix calculation of LTP mask when LR is 0
  target/arm: Factor out mve_eci_mask()
  target/arm: Fix VPT advance when ECI is non-zero
  target/arm: Fix VLDRB/H/W for predicated elements
  target/arm: Implement MVE VMULL (polynomial)
  target/arm: Implement MVE incrementing/decrementing dup insns
  target/arm: Factor out gen_vpst()
  target/arm: Implement MVE integer vector comparisons
  target/arm: Implement MVE integer vector-vs-scalar comparisons
  target/arm: Implement MVE VPSEL
  target/arm: Implement MVE VMLAS
  target/arm: Implement MVE shift-by-scalar
  target/arm: Move 'x' and 'a' bit definitions into vmlaldav formats
  target/arm: Implement MVE integer min/max across vector
  target/arm: Implement MVE VABAV
  target/arm: Implement MVE narrowing moves
  target/arm: Rename MVEGenDualAccOpFn to MVEGenLongDualAccOpFn
  target/arm: Implement MVE VMLADAV and VMLSLDAV
  target/arm: Implement MVE VMLA
  target/arm: Implement MVE saturating doubling multiply accumulates
  target/arm: Implement MVE VQABS, VQNEG
  target/arm: Implement MVE VMAXA, VMINA
  target/arm: Implement MVE VMOV to/from 2 general-purpose registers
  target/arm: Implement MVE VPNOT
  target/arm: Implement MVE VCTP
  target/arm: Implement MVE scatter-gather insns
  target/arm: Implement MVE scatter-gather immediate forms
  target/arm: Implement MVE interleaving loads/stores
  target/arm: Implement MVE VADD (floating-point)
  target/arm: Implement MVE VSUB, VMUL, VABD, VMAXNM, VMINNM
  target/arm: Implement MVE VCADD
  target/arm: Implement MVE VFMA and VFMS
  target/arm: Implement MVE VCMUL and VCMLA
  target/arm: Implement MVE VMAXNMA and VMINNMA
  target/arm: Implement MVE scalar fp insns
  target/arm: Implement MVE fp-with-scalar VFMA, VFMAS
  softfloat: Remove assertion preventing silencing of NaN in default-NaN
    mode
  target/arm: Implement MVE FP max/min across vector
  target/arm: Implement MVE fp vector comparisons
  target/arm: Implement MVE fp scalar comparisons
  target/arm: Implement MVE VCVT between floating and fixed point
  target/arm: Implement MVE VCVT between fp and integer
  target/arm: Implement MVE VCVT with specified rounding mode
  target/arm: Implement MVE VCVT between single and half precision
  target/arm: Implement MVE VRINT insns
  target/arm: Enable MVE in Cortex-M55

 docs/system/arm/emulation.rst  |    1 +
 target/arm/helper-mve.h        |  425 +++++++
 target/arm/translate-a32.h     |    2 +
 target/arm/translate.h         |    6 +
 target/arm/vec_internal.h      |   11 +
 target/arm/mve.decode          |  463 +++++++-
 target/arm/t32.decode          |    1 +
 target/arm/cpu.c               |    3 +
 target/arm/cpu_tcg.c           |    7 +-
 target/arm/mve_helper.c        | 1899 +++++++++++++++++++++++++++++++-
 target/arm/translate-mve.c     | 1154 ++++++++++++++++++-
 target/arm/translate-neon.c    |    6 -
 target/arm/translate-vfp.c     |    2 +-
 target/arm/translate.c         |   33 +
 target/arm/vec_helper.c        |   14 +-
 fpu/softfloat-specialize.c.inc |    1 -
 16 files changed, 3911 insertions(+), 117 deletions(-)

-- 
2.20.1