[PULL,02/80] include/exec/memop: Add MO_ATOM_*

Message ID	20230516194145.1749305-3-richard.henderson@linaro.org
State	Accepted
Commit	37031fefc777a715320f86fc35ee3dd82d9d945e
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Cc: Peter Maydell <peter.maydell@linaro.org> Subject: [PULL 02/80] include/exec/memop: Add MO_ATOM_* Date: Tue, 16 May 2023 12:40:27 -0700 Message-Id: <20230516194145.1749305-3-richard.henderson@linaro.org> In-Reply-To: <20230516194145.1749305-1-richard.henderson@linaro.org> References: <20230516194145.1749305-1-richard.henderson@linaro.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2607:f8b0:4864:20::435; envelope-from=richard.henderson@linaro.org; helo=mail-pf1-x435.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org
Series	[PULL,01/80] tcg/i386: Set P_REXW in tcg_out_addi_ptr \| expand [PULL,01/80] tcg/i386: Set P_REXW in tcg_out_addi_ptr [PULL,02/80] include/exec/memop: Add MO_ATOM_* [PULL,03/80] accel/tcg: Honor atomicity of loads [PULL,04/80] accel/tcg: Honor atomicity of stores [PULL,05/80] tcg: Unify helper_{be,le}_{ld,st}* [PULL,06/80] accel/tcg: Implement helper_{ld,st}_mmu for user-only [PULL,07/80] tcg/tci: Use helper_{ld,st}_mmu for user-only [PULL,08/80] tcg: Add 128-bit guest memory primitives [PULL,09/80] meson: Detect atomic128 support with optimization [PULL,10/80] tcg/i386: Add have_atomic16 [PULL,11/80] tcg/aarch64: Detect have_lse, have_lse2 for linux [PULL,12/80] tcg/aarch64: Detect have_lse, have_lse2 for darwin [PULL,13/80] tcg/i386: Use full load/store helpers in user-only mode [PULL,14/80] tcg/aarch64: Use full load/store helpers in user-only mode [PULL,15/80] tcg/ppc: Use full load/store helpers in user-only mode [PULL,16/80] tcg/loongarch64: Use full load/store helpers in user-only mode [PULL,17/80] tcg/riscv: Use full load/store helpers in user-only mode [PULL,18/80] tcg/arm: Adjust constraints on qemu_ld/st [PULL,19/80] tcg/arm: Use full load/store helpers in user-only mode [PULL,20/80] tcg/mips: Use full load/store helpers in user-only mode [PULL,21/80] tcg/s390x: Use full load/store helpers in user-only mode [PULL,22/80] tcg/sparc64: Allocate %g2 as a third temporary [PULL,23/80] tcg/sparc64: Rename tcg_out_movi_imm13 to tcg_out_movi_s13 [PULL,24/80] target/sparc64: Remove tcg_out_movi_s13 case from tcg_out_movi_imm32 [PULL,25/80] tcg/sparc64: Rename tcg_out_movi_imm32 to tcg_out_movi_u32 [PULL,26/80] tcg/sparc64: Split out tcg_out_movi_s32 [PULL,27/80] tcg/sparc64: Use standard slow path for softmmu [PULL,28/80] accel/tcg: Remove helper_unaligned_{ld,st} [PULL,29/80] tcg/loongarch64: Check the host supports unaligned accesses [PULL,30/80] tcg/loongarch64: Support softmmu unaligned accesses [PULL,31/80] tcg/riscv: Support softmmu unaligned accesses [PULL,32/80] tcg: Introduce tcg_target_has_memory_bswap [PULL,33/80] tcg: Add INDEX_op_qemu_{ld,st}_i128 [PULL,34/80] tcg: Introduce tcg_out_movext3 [PULL,35/80] tcg: Merge tcg_out_helper_load_regs into caller [PULL,36/80] tcg: Support TCG_TYPE_I128 in tcg_out_{ld, st}_helper_{args, ret} [PULL,37/80] tcg: Introduce atom_and_align_for_opc [PULL,38/80] tcg/i386: Use atom_and_align_for_opc [PULL,39/80] tcg/aarch64: Use atom_and_align_for_opc [PULL,40/80] tcg/arm: Use atom_and_align_for_opc [PULL,41/80] tcg/loongarch64: Use atom_and_align_for_opc [PULL,42/80] tcg/mips: Use atom_and_align_for_opc [PULL,43/80] tcg/ppc: Use atom_and_align_for_opc [PULL,44/80] tcg/riscv: Use atom_and_align_for_opc [PULL,45/80] tcg/s390x: Use atom_and_align_for_opc [PULL,46/80] tcg/sparc64: Use atom_and_align_for_opc [PULL,47/80] tcg/i386: Honor 64-bit atomicity in 32-bit mode [PULL,48/80] tcg/i386: Support 128-bit load/store with have_atomic16 [PULL,49/80] tcg/aarch64: Rename temporaries [PULL,50/80] tcg/aarch64: Support 128-bit load/store [PULL,51/80] tcg/ppc: Support 128-bit load/store [PULL,52/80] tcg/s390x: Support 128-bit load/store [PULL,53/80] tcg: Split out memory ops to tcg-op-ldst.c [PULL,54/80] tcg: Widen gen_insn_data to uint64_t [PULL,55/80] accel/tcg: Widen tcg-ldst.h addresses to uint64_t [PULL,56/80] tcg: Widen helper_{ld,st}_i128 addresses to uint64_t [PULL,57/80] tcg: Widen helper_atomic_* addresses to uint64_t [PULL,58/80] tcg: Widen tcg_gen_code pc_start argument to uint64_t [PULL,59/80] accel/tcg: Merge gen_mem_wrapped with plugin_gen_empty_mem_callback [PULL,60/80] accel/tcg: Merge do_gen_mem_cb into caller [PULL,61/80] tcg: Reduce copies for plugin_gen_mem_callbacks [PULL,62/80] accel/tcg: Widen plugin_gen_empty_mem_callback to i64 [PULL,63/80] tcg: Add addr_type to TCGContext [PULL,64/80] tcg: Remove TCGv from tcg_gen_qemu_{ld,st}_* [PULL,65/80] tcg: Remove TCGv from tcg_gen_atomic_* [PULL,66/80] tcg: Split INDEX_op_qemu_{ld, st}* for guest address size [PULL,67/80] tcg/tci: Elimnate TARGET_LONG_BITS, target_ulong [PULL,68/80] tcg/i386: Always enable TCG_TARGET_HAS_extr[lh]_i64_i32 [PULL,69/80] tcg/i386: Conditionalize tcg_out_extu_i32_i64 [PULL,70/80] tcg/i386: Adjust type of tlb_mask [PULL,71/80] tcg/i386: Remove TARGET_LONG_BITS, TCG_TYPE_TL [PULL,72/80] tcg/arm: Remove TARGET_LONG_BITS [PULL,73/80] tcg/aarch64: Remove USE_GUEST_BASE [PULL,74/80] tcg/aarch64: Remove TARGET_LONG_BITS, TCG_TYPE_TL [PULL,75/80] tcg/loongarch64: Remove TARGET_LONG_BITS, TCG_TYPE_TL [PULL,76/80] tcg/mips: Remove TARGET_LONG_BITS, TCG_TYPE_TL [PULL,77/80] tcg: Remove TARGET_LONG_BITS, TCG_TYPE_TL [PULL,78/80] tcg: Add page_bits and page_mask to TCGContext [PULL,79/80] tcg: Add tlb_dyn_max_bits to TCGContext [PULL,80/80] tcg: Split out exec/user/guest-base.h

Message ID

20230516194145.1749305-3-richard.henderson@linaro.org

State

Accepted

Commit

37031fefc777a715320f86fc35ee3dd82d9d945e

Headers

Received-SPF: pass (google.com: domain of
 qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as
 permitted sender) client-ip=209.51.188.17;
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Cc: Peter Maydell <peter.maydell@linaro.org>
Subject: [PULL 02/80] include/exec/memop: Add MO_ATOM_*
Date: Tue, 16 May 2023 12:40:27 -0700
Message-Id: <20230516194145.1749305-3-richard.henderson@linaro.org>
In-Reply-To: <20230516194145.1749305-1-richard.henderson@linaro.org>
References: <20230516194145.1749305-1-richard.henderson@linaro.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2607:f8b0:4864:20::435;
 envelope-from=richard.henderson@linaro.org; helo=mail-pf1-x435.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001,
 T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org

Series

[PULL,01/80] tcg/i386: Set P_REXW in tcg_out_addi_ptr | expand

Commit Message

Richard Henderson May 16, 2023, 7:40 p.m. UTC

This field may be used to describe the precise atomicity requirements
of the guest, which may then be used to constrain the methods by which
it may be emulated by the host.

For instance, the AArch64 LDP (32-bit) instruction changes semantics
with ARMv8.4 LSE2, from

  MO_64 | MO_ATOM_IFALIGN_PAIR
  (64-bits, single-copy atomic only on 4 byte units,
   nonatomic if not aligned by 4),

to

  MO_64 | MO_ATOM_WITHIN16
  (64-bits, single-copy atomic within a 16 byte block)

The former may be implemented with two 4 byte loads, or a single 8 byte
load if that happens to be efficient on the host.  The latter may not
be implemented with two 4 byte loads and may also require a helper when
misaligned.

Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 include/exec/memop.h | 37 +++++++++++++++++++++++++++++++++++++
 tcg/tcg.c            | 27 +++++++++++++++++++++------
 2 files changed, 58 insertions(+), 6 deletions(-)

diff --git a/include/exec/memop.h b/include/exec/memop.h
index 07f5f88188..a86dc6743a 100644
--- a/include/exec/memop.h
+++ b/include/exec/memop.h
@@ -72,6 +72,43 @@  typedef enum MemOp {
     MO_ALIGN_64 = 6 << MO_ASHIFT,
     MO_ALIGN    = MO_AMASK,
 
+    /*
+     * MO_ATOM_* describes the atomicity requirements of the operation:
+     * MO_ATOM_IFALIGN: the operation must be single-copy atomic if it
+     *    is aligned; if unaligned there is no atomicity.
+     * MO_ATOM_IFALIGN_PAIR: the entire operation may be considered to
+     *    be a pair of half-sized operations which are packed together
+     *    for convenience, with single-copy atomicity on each half if
+     *    the half is aligned.
+     *    This is the atomicity e.g. of Arm pre-FEAT_LSE2 LDP.
+     * MO_ATOM_WITHIN16: the operation is single-copy atomic, even if it
+     *    is unaligned, so long as it does not cross a 16-byte boundary;
+     *    if it crosses a 16-byte boundary there is no atomicity.
+     *    This is the atomicity e.g. of Arm FEAT_LSE2 LDR.
+     * MO_ATOM_WITHIN16_PAIR: the entire operation is single-copy atomic,
+     *    if it happens to be within a 16-byte boundary, otherwise it
+     *    devolves to a pair of half-sized MO_ATOM_WITHIN16 operations.
+     *    Depending on alignment, one or both will be single-copy atomic.
+     *    This is the atomicity e.g. of Arm FEAT_LSE2 LDP.
+     * MO_ATOM_SUBALIGN: the operation is single-copy atomic by parts
+     *    by the alignment.  E.g. if the address is 0 mod 4, then each
+     *    4-byte subobject is single-copy atomic.
+     *    This is the atomicity e.g. of IBM Power.
+     * MO_ATOM_NONE: the operation has no atomicity requirements.
+     *
+     * Note the default (i.e. 0) value is single-copy atomic to the
+     * size of the operation, if aligned.  This retains the behaviour
+     * from before this field was introduced.
+     */
+    MO_ATOM_SHIFT         = 8,
+    MO_ATOM_IFALIGN       = 0 << MO_ATOM_SHIFT,
+    MO_ATOM_IFALIGN_PAIR  = 1 << MO_ATOM_SHIFT,
+    MO_ATOM_WITHIN16      = 2 << MO_ATOM_SHIFT,
+    MO_ATOM_WITHIN16_PAIR = 3 << MO_ATOM_SHIFT,
+    MO_ATOM_SUBALIGN      = 4 << MO_ATOM_SHIFT,
+    MO_ATOM_NONE          = 5 << MO_ATOM_SHIFT,
+    MO_ATOM_MASK          = 7 << MO_ATOM_SHIFT,
+
     /* Combinations of the above, for ease of use.  */
     MO_UB    = MO_8,
     MO_UW    = MO_16,
diff --git a/tcg/tcg.c b/tcg/tcg.c
index 1231c8ab4c..f156ca65f5 100644
--- a/tcg/tcg.c
+++ b/tcg/tcg.c
@@ -2195,6 +2195,15 @@  static const char * const alignment_name[(MO_AMASK >> MO_ASHIFT) + 1] = {
     [MO_ALIGN_64 >> MO_ASHIFT] = "al64+",
 };
 
+static const char * const atom_name[(MO_ATOM_MASK >> MO_ATOM_SHIFT) + 1] = {
+    [MO_ATOM_IFALIGN >> MO_ATOM_SHIFT] = "",
+    [MO_ATOM_IFALIGN_PAIR >> MO_ATOM_SHIFT] = "pair+",
+    [MO_ATOM_WITHIN16 >> MO_ATOM_SHIFT] = "w16+",
+    [MO_ATOM_WITHIN16_PAIR >> MO_ATOM_SHIFT] = "w16p+",
+    [MO_ATOM_SUBALIGN >> MO_ATOM_SHIFT] = "sub+",
+    [MO_ATOM_NONE >> MO_ATOM_SHIFT] = "noat+",
+};
+
 static const char bswap_flag_name[][6] = {
     [TCG_BSWAP_IZ] = "iz",
     [TCG_BSWAP_OZ] = "oz",
@@ -2330,17 +2339,23 @@  static void tcg_dump_ops(TCGContext *s, FILE *f, bool have_prefs)
             case INDEX_op_qemu_ld_i64:
             case INDEX_op_qemu_st_i64:
                 {
+                    const char *s_al, *s_op, *s_at;
                     MemOpIdx oi = op->args[k++];
                     MemOp op = get_memop(oi);
                     unsigned ix = get_mmuidx(oi);
 
-                    if (op & ~(MO_AMASK | MO_BSWAP | MO_SSIZE)) {
-                        col += ne_fprintf(f, ",$0x%x,%u", op, ix);
+                    s_al = alignment_name[(op & MO_AMASK) >> MO_ASHIFT];
+                    s_op = ldst_name[op & (MO_BSWAP | MO_SSIZE)];
+                    s_at = atom_name[(op & MO_ATOM_MASK) >> MO_ATOM_SHIFT];
+                    op &= ~(MO_AMASK | MO_BSWAP | MO_SSIZE | MO_ATOM_MASK);
+
+                    /* If all fields are accounted for, print symbolically. */
+                    if (!op && s_al && s_op && s_at) {
+                        col += ne_fprintf(f, ",%s%s%s,%u",
+                                          s_at, s_al, s_op, ix);
                     } else {
-                        const char *s_al, *s_op;
-                        s_al = alignment_name[(op & MO_AMASK) >> MO_ASHIFT];
-                        s_op = ldst_name[op & (MO_BSWAP | MO_SSIZE)];
-                        col += ne_fprintf(f, ",%s%s,%u", s_al, s_op, ix);
+                        op = get_memop(oi);
+                        col += ne_fprintf(f, ",$0x%x,%u", op, ix);
                     }
                     i = 1;
                 }

[PULL,02/80] include/exec/memop: Add MO_ATOM_*

Commit Message

Patch