[v2,098/101] target/arm: Implement MOVAZ for SME2p1

Message ID	20250621235037.74091-99-richard.henderson@linaro.org
State	New
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Subject: [PATCH v2 098/101] target/arm: Implement MOVAZ for SME2p1 Date: Sat, 21 Jun 2025 16:50:34 -0700 Message-ID: <20250621235037.74091-99-richard.henderson@linaro.org> In-Reply-To: <20250621235037.74091-1-richard.henderson@linaro.org> References: <20250621235037.74091-1-richard.henderson@linaro.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2607:f8b0:4864:20::62f; envelope-from=richard.henderson@linaro.org; helo=mail-pl1-x62f.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org
Series	target/arm: Implement FEAT_SME2p1 \| expand [v2,000/101] target/arm: Implement FEAT_SME2p1 [v2,001/101] tcg: Add dbase argument to do_dup_store [v2,002/101] tcg: Add dbase argument to do_dup [v2,003/101] tcg: Add dbase argument to expand_clr [v2,004/101] tcg: Add base arguments to check_overlap_[234] [v2,005/101] tcg: Split out tcg_gen_gvec_2_var [v2,006/101] tcg: Split out tcg_gen_gvec_3_var [v2,007/101] tcg: Split out tcg_gen_gvec_mov_var [v2,008/101] tcg: Split out tcg_gen_gvec_{add,sub}_var [v2,009/101] tcg: Split out tcg_gen_gvec_dup_imm_var [v2,010/101] linux-user/aarch64: Update hwcap bits from 6.14 [v2,011/101] target/arm: Remove CPUARMState.vfp.scratch [v2,012/101] target/arm: Introduce FPST_ZA, FPST_ZA_F16 [v2,013/101] target/arm: Use FPST_ZA for sme_fmopa_[hsd] [v2,014/101] target/arm: Rename zarray to za_state.za [v2,015/101] target/arm: Add isar feature tests for SME2, SVE2p1 [v2,016/101] target/arm: Add ZT0 [v2,017/101] target/arm: Add zt0_excp_el to DisasContext [v2,018/101] target/arm: Implement SME2 ZERO ZT0 [v2,019/101] target/arm: Implement SME2 LDR/STR ZT0 [v2,020/101] target/arm: Implement SME2 MOVT [v2,021/101] target/arm: Split get_tile_rowcol argument tile_index [v2,022/101] target/arm: Rename MOVA for translate [v2,023/101] target/arm: Implement SME2 MOVA to/from tile, multiple registers [v2,024/101] target/arm: Split out get_zarray [v2,025/101] target/arm: Implement SME2 MOVA to/from array, multiple registers [v2,026/101] target/arm: Implement SME2 BMOPA [v2,027/101] target/arm: Implement SME2 SMOPS, UMOPS (2-way) [v2,028/101] target/arm: Introduce gen_gvec_sve2_sqdmulh [v2,029/101] target/arm: Implement SME2 Multiple and Single SVE Destructive [v2,030/101] target/arm: Implement SME2 Multiple Vectors SVE Destructive [v2,031/101] target/arm: Implement SME2 ADD/SUB (array results, multiple and single vector) [v2,032/101] target/arm: Implement SME2 ADD/SUB (array results, multiple vectors) [v2,033/101] target/arm: Pass ZA to helper_sve2_fmlal_zz[zx]w_s [v2,034/101] target/arm: Implement SME2 FMLAL, BFMLAL [v2,035/101] target/arm: Implement SME2 FDOT [v2,036/101] target/arm: Implement SME2 BFDOT [v2,037/101] target/arm: Implement SME2 FVDOT, BFVDOT [v2,038/101] target/arm: Rename helper_gvec_dot_[bh] to _4[bh] [v2,039/101] target/arm: Remove helper_gvec_sudot_idx_4b [v2,040/101] target/arm: Implemement SME2 SDOT, UDOT, USDOT, SUDOT [v2,041/101] target/arm: Rename SVE SDOT and UDOT patterns [v2,042/101] target/arm: Tighten USDOT (vectors) decode [v2,043/101] target/arm: Implement SDOT, UDOT (2-way) for SME2/SVE2p1 [v2,044/101] target/arm: Implement SME2 SVDOT, UVDOT, SUVDOT, USVDOT [v2,045/101] target/arm: Implement SME2 SMLAL, SMLSL, UMLAL, UMLSL [v2,046/101] target/arm: Implement SME2 SMLALL, SMLSLL, UMLALL, UMLSLL [v2,047/101] target/arm: Rename gvec_fml[as]_[hs] with _nf_ infix [v2,048/101] target/arm: Implement SME2 FMLA, FMLS [v2,049/101] target/arm: Implement SME2 BFMLA, BFMLS [v2,050/101] target/arm: Implement SME2 FADD, FSUB, BFADD, BFSUB [v2,051/101] target/arm: Implement SME2 BFCVT, BFCVTN, FCVT, FCVTN [v2,052/101] target/arm: Implement SME2 FCVT (widening), FCVTL [v2,053/101] target/arm: Implement SME2 FCVTZS, FCVTZU [v2,054/101] target/arm: Implement SME2 SCVTF, UCVTF [v2,055/101] target/arm: Implement SME2 FRINTN, FRINTP, FRINTM, FRINTA [v2,056/101] target/arm: Introduce do_[us]sat_[bhs] macros [v2,057/101] target/arm: Use do_[us]sat_[bhs] in sve_helper.c [v2,058/101] target/arm: Implement SME2 SQCVT, UQCVT, SQCVTU [v2,059/101] target/arm: Implement SQCVTN, UQCVTN, SQCVTUN for SME2/SVE2p1 [v2,060/101] target/arm: Implement SME2 SUNPK, UUNPK [v2,061/101] target/arm: Implement SME2 ZIP, UZP (four registers) [v2,062/101] target/arm: Move do_urshr, do_srshr to vec_internal.h [v2,063/101] target/arm: Implement SME2 SQRSHR, UQRSHR, SQRSHRN [v2,064/101] target/arm: Implement SME2 ZIP, UZP (two registers) [v2,065/101] target/arm: Implement SME2 FCLAMP, SCLAMP, UCLAMP [v2,066/101] target/arm: Enable SCLAMP, UCLAMP for SVE2p1 [v2,067/101] target/arm: Implement FCLAMP for SME2, SVE2p1 [v2,068/101] target/arm: Implement SME2 SEL [v2,069/101] target/arm: Implement SME2p1 Multiple Zero [v2,070/101] target/arm: Introduce pred_count_test [v2,071/101] target/arm: Fold predtest_ones into helper_sve_brkns [v2,072/101] target/arm: Split out do_whilel from helper_sve_whilel [v2,073/101] target/arm: Split out do_whileg from helper_sve_whileg [v2,074/101] target/arm: Move scale by esz into helper_sve_while* [v2,075/101] target/arm: Split trans_WHILE to lt and gt [v2,076/101] target/arm: Implement SVE2p1 WHILE (predicate pair) [v2,077/101] target/arm: Implement SVE2p1 WHILE (predicate as counter) [v2,078/101] target/arm: Implement SVE2p1 PTRUE (predicate as counter) [v2,079/101] target/arm: Enable PSEL for SVE2p1 [v2,080/101] target/arm: Implement {ADD, SMIN, SMAX, UMIN, UMAX}QV for SVE2p1 [v2,081/101] target/arm: Implement SVE2p1 PEXT [v2,082/101] target/arm: Implement ANDQV, ORQV, EORQV for SVE2p1 [v2,083/101] target/arm: Implement FADDQV, F{MIN, MAX}{NM}QV for SVE2p1 [v2,084/101] target/arm: Implement BFMLSLB{L, T} for SME2/SVE2p1 [v2,085/101] target/arm: Implement CNTP (predicate as counter) for SME2/SVE2p1 [v2,086/101] target/arm: Implement DUPQ for SME2p1/SVE2p1 [v2,087/101] target/arm: Implement EXTQ for SME2p1/SVE2p1 [v2,088/101] target/arm: Implement PMOV for SME2p1/SVE2p1 [v2,089/101] target/arm: Implement ZIPQ, UZPQ for SME2p1/SVE2p1 [v2,090/101] target/arm: Implement TBLQ, TBXQ for SME2p1/SVE2p1 [v2,091/101] target/arm: Implement SME2 counted predicate register load/store [v2,092/101] target/arm: Split the ST_zpri and ST_zprr patterns [v2,093/101] target/arm: Implement {LD1, ST1}{W, D} (128-bit element) for SVE2p1 [v2,094/101] target/arm: Move ld1qq and st1qq primitives to sve_ldst_internal.h [v2,095/101] target/arm: Implement {LD, ST}[234]Q for SME2p1/SVE2p1 [v2,096/101] target/arm: Implement LD1Q, ST1Q for SVE2p1 [v2,097/101] target/arm: Implement LUTI2, LUTI4 for SME2/SME2p1 [v2,098/101] target/arm: Implement MOVAZ for SME2p1 [v2,099/101] linux-user/aarch64: Set hwcap bits for SME2p1/SVE2p1 [v2,100/101] target/arm: Enable FEAT_SME2p1 on -cpu max [v2,101/101] tests/tcg/aarch64: Add sme2-matmul test case

Message ID

20250621235037.74091-99-richard.henderson@linaro.org

State

New

Headers

Received-SPF: pass (google.com: domain of
 qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as
 permitted sender) client-ip=209.51.188.17;
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Subject: [PATCH v2 098/101] target/arm: Implement MOVAZ for SME2p1
Date: Sat, 21 Jun 2025 16:50:34 -0700
Message-ID: <20250621235037.74091-99-richard.henderson@linaro.org>
In-Reply-To: <20250621235037.74091-1-richard.henderson@linaro.org>
References: <20250621235037.74091-1-richard.henderson@linaro.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2607:f8b0:4864:20::62f;
 envelope-from=richard.henderson@linaro.org; helo=mail-pl1-x62f.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001,
 SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org

Series

target/arm: Implement FEAT_SME2p1 | expand

Commit Message

Richard Henderson June 21, 2025, 11:50 p.m. UTC

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/arm/tcg/helper-sme.h    |  6 ++++
 target/arm/tcg/sme_helper.c    | 60 ++++++++++++++++++++++++++++++++++
 target/arm/tcg/translate-sme.c | 46 +++++++++++++++++++-------
 target/arm/tcg/sme.decode      | 36 ++++++++++++++++++++
 4 files changed, 137 insertions(+), 11 deletions(-)

diff --git a/target/arm/tcg/helper-sme.h b/target/arm/tcg/helper-sme.h
index d86fdcbd83..3e2ae83fe5 100644
--- a/target/arm/tcg/helper-sme.h
+++ b/target/arm/tcg/helper-sme.h
@@ -42,6 +42,12 @@  DEF_HELPER_FLAGS_3(sme2_mova_zc_s, TCG_CALL_NO_RWG, void, ptr, ptr, i32)
 DEF_HELPER_FLAGS_3(sme2_mova_cz_d, TCG_CALL_NO_RWG, void, ptr, ptr, i32)
 DEF_HELPER_FLAGS_3(sme2_mova_zc_d, TCG_CALL_NO_RWG, void, ptr, ptr, i32)
 
+DEF_HELPER_FLAGS_3(sme2p1_movaz_zc_b, TCG_CALL_NO_RWG, void, ptr, ptr, i32)
+DEF_HELPER_FLAGS_3(sme2p1_movaz_zc_h, TCG_CALL_NO_RWG, void, ptr, ptr, i32)
+DEF_HELPER_FLAGS_3(sme2p1_movaz_zc_s, TCG_CALL_NO_RWG, void, ptr, ptr, i32)
+DEF_HELPER_FLAGS_3(sme2p1_movaz_zc_d, TCG_CALL_NO_RWG, void, ptr, ptr, i32)
+DEF_HELPER_FLAGS_3(sme2p1_movaz_zc_q, TCG_CALL_NO_RWG, void, ptr, ptr, i32)
+
 DEF_HELPER_FLAGS_5(sme_ld1b_h, TCG_CALL_NO_WG, void, env, ptr, ptr, tl, i32)
 DEF_HELPER_FLAGS_5(sme_ld1b_v, TCG_CALL_NO_WG, void, env, ptr, ptr, tl, i32)
 DEF_HELPER_FLAGS_5(sme_ld1b_h_mte, TCG_CALL_NO_WG, void, env, ptr, ptr, tl, i32)
diff --git a/target/arm/tcg/sme_helper.c b/target/arm/tcg/sme_helper.c
index 7757085adf..16bdf61f51 100644
--- a/target/arm/tcg/sme_helper.c
+++ b/target/arm/tcg/sme_helper.c
@@ -250,6 +250,66 @@  void HELPER(sme2_mova_zc_d)(void *vdst, void *vsrc, uint32_t desc)
     }
 }
 
+void HELPER(sme2p1_movaz_zc_b)(void *vdst, void *vsrc, uint32_t desc)
+{
+    uint8_t *src = vsrc;
+    uint8_t *dst = vdst;
+    size_t i, n = simd_oprsz(desc);
+
+    for (i = 0; i < n; ++i) {
+        dst[i] = src[tile_vslice_index(i)];
+        src[tile_vslice_index(i)] = 0;
+    }
+}
+
+void HELPER(sme2p1_movaz_zc_h)(void *vdst, void *vsrc, uint32_t desc)
+{
+    uint16_t *src = vsrc;
+    uint16_t *dst = vdst;
+    size_t i, n = simd_oprsz(desc) / 2;
+
+    for (i = 0; i < n; ++i) {
+        dst[i] = src[tile_vslice_index(i)];
+        src[tile_vslice_index(i)] = 0;
+    }
+}
+
+void HELPER(sme2p1_movaz_zc_s)(void *vdst, void *vsrc, uint32_t desc)
+{
+    uint32_t *src = vsrc;
+    uint32_t *dst = vdst;
+    size_t i, n = simd_oprsz(desc) / 4;
+
+    for (i = 0; i < n; ++i) {
+        dst[i] = src[tile_vslice_index(i)];
+        src[tile_vslice_index(i)] = 0;
+    }
+}
+
+void HELPER(sme2p1_movaz_zc_d)(void *vdst, void *vsrc, uint32_t desc)
+{
+    uint64_t *src = vsrc;
+    uint64_t *dst = vdst;
+    size_t i, n = simd_oprsz(desc) / 8;
+
+    for (i = 0; i < n; ++i) {
+        dst[i] = src[tile_vslice_index(i)];
+        src[tile_vslice_index(i)] = 0;
+    }
+}
+
+void HELPER(sme2p1_movaz_zc_q)(void *vdst, void *vsrc, uint32_t desc)
+{
+    Int128 *src = vsrc;
+    Int128 *dst = vdst;
+    size_t i, n = simd_oprsz(desc) / 16;
+
+    for (i = 0; i < n; ++i) {
+        dst[i] = src[tile_vslice_index(i)];
+        memset(&src[tile_vslice_index(i)], 0, 16);
+    }
+}
+
 /*
  * Clear elements in a tile slice comprising len bytes.
  */
diff --git a/target/arm/tcg/translate-sme.c b/target/arm/tcg/translate-sme.c
index 397e328a1b..12d32e3620 100644
--- a/target/arm/tcg/translate-sme.c
+++ b/target/arm/tcg/translate-sme.c
@@ -232,7 +232,8 @@  static bool do_mova_tile(DisasContext *s, arg_mova_p *a, bool to_vec)
 TRANS_FEAT(MOVA_tz, aa64_sme, do_mova_tile, a, false)
 TRANS_FEAT(MOVA_zt, aa64_sme, do_mova_tile, a, true)
 
-static bool do_mova_tile_n(DisasContext *s, arg_mova_t *a, int n, bool to_vec)
+static bool do_mova_tile_n(DisasContext *s, arg_mova_t *a, int n,
+                           bool to_vec, bool zero)
 {
     static gen_helper_gvec_2 * const cz_fns[] = {
         gen_helper_sme2_mova_cz_b, gen_helper_sme2_mova_cz_h,
@@ -242,9 +243,16 @@  static bool do_mova_tile_n(DisasContext *s, arg_mova_t *a, int n, bool to_vec)
         gen_helper_sme2_mova_zc_b, gen_helper_sme2_mova_zc_h,
         gen_helper_sme2_mova_zc_s, gen_helper_sme2_mova_zc_d,
     };
+    static gen_helper_gvec_2 * const zc_z_fns[] = {
+        gen_helper_sme2p1_movaz_zc_b, gen_helper_sme2p1_movaz_zc_h,
+        gen_helper_sme2p1_movaz_zc_s, gen_helper_sme2p1_movaz_zc_d,
+        gen_helper_sme2p1_movaz_zc_q,
+    };
     TCGv_ptr t_za;
     int svl;
 
+    assert(a->esz <= MO_64 + zero);
+
     if (!sme_smza_enabled_check(s)) {
         return true;
     }
@@ -262,7 +270,9 @@  static bool do_mova_tile_n(DisasContext *s, arg_mova_t *a, int n, bool to_vec)
             TCGv_ptr t_zr = vec_full_reg_ptr(s, a->zr * n + i);
             t_za = get_tile_rowcol(s, a->esz, a->rs, a->za,
                                    a->off * n + i, 1, a->v);
-            if (to_vec) {
+            if (zero) {
+                zc_z_fns[a->esz](t_zr, t_za, t_desc);
+            } else if (to_vec) {
                 zc_fns[a->esz](t_zr, t_za, t_desc);
             } else {
                 cz_fns[a->esz](t_za, t_zr, t_desc);
@@ -275,6 +285,9 @@  static bool do_mova_tile_n(DisasContext *s, arg_mova_t *a, int n, bool to_vec)
                                    a->off * n + i, 1, a->v);
             if (to_vec) {
                 tcg_gen_gvec_mov_var(MO_8, tcg_env, o_zr, t_za, 0, svl, svl);
+                if (zero) {
+                    tcg_gen_gvec_dup_imm_var(MO_8, t_za, 0, svl, svl, 0);
+                }
             } else {
                 tcg_gen_gvec_mov_var(MO_8, t_za, 0, tcg_env, o_zr, svl, svl);
             }
@@ -283,12 +296,17 @@  static bool do_mova_tile_n(DisasContext *s, arg_mova_t *a, int n, bool to_vec)
     return true;
 }
 
-TRANS_FEAT(MOVA_tz2, aa64_sme2, do_mova_tile_n, a, 2, false)
-TRANS_FEAT(MOVA_tz4, aa64_sme2, do_mova_tile_n, a, 4, false)
-TRANS_FEAT(MOVA_zt2, aa64_sme2, do_mova_tile_n, a, 2, true)
-TRANS_FEAT(MOVA_zt4, aa64_sme2, do_mova_tile_n, a, 4, true)
+TRANS_FEAT(MOVA_tz2, aa64_sme2, do_mova_tile_n, a, 2, false, false)
+TRANS_FEAT(MOVA_tz4, aa64_sme2, do_mova_tile_n, a, 4, false, false)
+TRANS_FEAT(MOVA_zt2, aa64_sme2, do_mova_tile_n, a, 2, true, false)
+TRANS_FEAT(MOVA_zt4, aa64_sme2, do_mova_tile_n, a, 4, true, false)
 
-static bool do_mova_array_n(DisasContext *s, arg_mova_a *a, int n, bool to_vec)
+TRANS_FEAT(MOVAZ_zt, aa64_sme2p1, do_mova_tile_n, a, 1, true, true)
+TRANS_FEAT(MOVAZ_zt2, aa64_sme2p1, do_mova_tile_n, a, 2, true, true)
+TRANS_FEAT(MOVAZ_zt4, aa64_sme2p1, do_mova_tile_n, a, 4, true, true)
+
+static bool do_mova_array_n(DisasContext *s, arg_mova_a *a, int n,
+                            bool to_vec, bool zero)
 {
     TCGv_ptr t_za;
     int svl;
@@ -306,6 +324,9 @@  static bool do_mova_array_n(DisasContext *s, arg_mova_a *a, int n, bool to_vec)
 
         if (to_vec) {
             tcg_gen_gvec_mov_var(MO_8, tcg_env, o_zr, t_za, o_za, svl, svl);
+            if (zero) {
+                tcg_gen_gvec_dup_imm_var(MO_8, t_za, o_za, svl, svl, 0);
+            }
         } else {
             tcg_gen_gvec_mov_var(MO_8, t_za, o_za, tcg_env, o_zr, svl, svl);
         }
@@ -313,10 +334,13 @@  static bool do_mova_array_n(DisasContext *s, arg_mova_a *a, int n, bool to_vec)
     return true;
 }
 
-TRANS_FEAT(MOVA_az2, aa64_sme2, do_mova_array_n, a, 2, false)
-TRANS_FEAT(MOVA_az4, aa64_sme2, do_mova_array_n, a, 4, false)
-TRANS_FEAT(MOVA_za2, aa64_sme2, do_mova_array_n, a, 2, true)
-TRANS_FEAT(MOVA_za4, aa64_sme2, do_mova_array_n, a, 4, true)
+TRANS_FEAT(MOVA_az2, aa64_sme2, do_mova_array_n, a, 2, false, false)
+TRANS_FEAT(MOVA_az4, aa64_sme2, do_mova_array_n, a, 4, false, false)
+TRANS_FEAT(MOVA_za2, aa64_sme2, do_mova_array_n, a, 2, true, false)
+TRANS_FEAT(MOVA_za4, aa64_sme2, do_mova_array_n, a, 4, true, false)
+
+TRANS_FEAT(MOVAZ_za2, aa64_sme2p1, do_mova_array_n, a, 2, true, true)
+TRANS_FEAT(MOVAZ_za4, aa64_sme2p1, do_mova_array_n, a, 4, true, true)
 
 static bool do_movt(DisasContext *s, arg_MOVT_rzt *a,
                     void (*func)(TCGv_i64, TCGv_ptr, tcg_target_long))
diff --git a/target/arm/tcg/sme.decode b/target/arm/tcg/sme.decode
index 9740d74410..94e8653b89 100644
--- a/target/arm/tcg/sme.decode
+++ b/target/arm/tcg/sme.decode
@@ -100,6 +100,42 @@  MOVA_za2        11000000 00 00011 00 .. 010 00 off:3 zr:4 0  \
 MOVA_za4        11000000 00 00011 00 .. 011 00 off:3 zr:3 00 \
                 &mova_a rv=%mova_rv
 
+### SME Move and Zero
+
+MOVAZ_za2       11000000 00000110 0 .. 01010 off:3 zr:4 0    \
+                &mova_a rv=%mova_rv
+MOVAZ_za4       11000000 00000110 0 .. 01110 off:3 zr:3 00   \
+                &mova_a rv=%mova_rv
+
+MOVAZ_zt        11000000 00 00001 0 v:1 .. 0001 off:4 zr:5    \
+                &mova_t rs=%mova_rs esz=0 za=0
+MOVAZ_zt        11000000 01 00001 0 v:1 .. 0001 za:1 off:3 zr:5    \
+                &mova_t rs=%mova_rs esz=1
+MOVAZ_zt        11000000 10 00001 0 v:1 .. 0001 za:2 off:2 zr:5    \
+                &mova_t rs=%mova_rs esz=2
+MOVAZ_zt        11000000 11 00001 0 v:1 .. 0001 za:3 off:1 zr:5    \
+                &mova_t rs=%mova_rs esz=3
+MOVAZ_zt        11000000 11 00001 1 v:1 .. 0001 za:4 zr:5    \
+                &mova_t rs=%mova_rs esz=4 off=0
+
+MOVAZ_zt2       11000000 00 00011 0 v:1 .. 00010 off:3 zr:4 0 \
+                &mova_t rs=%mova_rs esz=0 za=0
+MOVAZ_zt2       11000000 01 00011 0 v:1 .. 00010 za:1 off:2 zr:4 0 \
+                &mova_t rs=%mova_rs esz=1
+MOVAZ_zt2       11000000 10 00011 0 v:1 .. 00010 za:2 off:1 zr:4 0 \
+                &mova_t rs=%mova_rs esz=2
+MOVAZ_zt2       11000000 11 00011 0 v:1 .. 00010 za:3 zr:4 0 \
+                &mova_t rs=%mova_rs esz=3 off=0
+
+MOVAZ_zt4       11000000 00 00011 0 v:1 .. 001100 off:2 zr:3 00 \
+                &mova_t rs=%mova_rs esz=0 za=0
+MOVAZ_zt4       11000000 01 00011 0 v:1 .. 001100 za:1 off:1 zr:3 00 \
+                &mova_t rs=%mova_rs esz=1
+MOVAZ_zt4       11000000 10 00011 0 v:1 .. 001100 za:2 zr:3 00 \
+                &mova_t rs=%mova_rs esz=2 off=0
+MOVAZ_zt4       11000000 11 00011 0 v:1 .. 00110 za:3 zr:3 00 \
+                &mova_t rs=%mova_rs esz=3 off=0
+
 ### SME Move into/from ZT0
 
 MOVT_rzt        1100 0000 0100 1100 0 off:3 00 11111 rt:5

[v2,098/101] target/arm: Implement MOVAZ for SME2p1

Commit Message

Patch