[v4,069/163] tcg: Merge INDEX_op_muls2_{i32,i64}

Message ID	20250415192515.232910-70-richard.henderson@linaro.org
State	New
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Cc: =?utf-8?q?Philippe_Mathieu-Daud=C3=A9?= <philmd@linaro.org> Subject: [PATCH v4 069/163] tcg: Merge INDEX_op_muls2_{i32,i64} Date: Tue, 15 Apr 2025 12:23:40 -0700 Message-ID: <20250415192515.232910-70-richard.henderson@linaro.org> In-Reply-To: <20250415192515.232910-1-richard.henderson@linaro.org> References: <20250415192515.232910-1-richard.henderson@linaro.org> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2607:f8b0:4864:20::535; envelope-from=richard.henderson@linaro.org; helo=mail-pg1-x535.google.com Precedence: list Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org
Series	tcg: Convert to TCGOutOp structures \| expand [v4,000/163] tcg: Convert to TCGOutOp structures [v4,001/163] tcg: Add all_outop[] [v4,002/163] tcg: Use extract2 for cross-word 64-bit extract on 32-bit host [v4,003/163] tcg: Remove INDEX_op_ext{8,16,32}* [v4,004/163] tcg: Merge INDEX_op_mov_{i32,i64} [v4,005/163] tcg: Convert add to TCGOutOpBinary [v4,006/163] tcg: Merge INDEX_op_add_{i32,i64} [v4,007/163] tcg: Convert and to TCGOutOpBinary [v4,008/163] tcg: Merge INDEX_op_and_{i32,i64} [v4,009/163] tcg/optimize: Fold andc with immediate to and [v4,010/163] tcg/optimize: Emit add r, r, -1 in fold_setcond_tst_pow2 [v4,011/163] tcg: Convert andc to TCGOutOpBinary [v4,012/163] tcg: Merge INDEX_op_andc_{i32,i64} [v4,013/163] tcg: Convert or to TCGOutOpBinary [v4,014/163] tcg: Merge INDEX_op_or_{i32,i64} [v4,015/163] tcg/optimize: Fold orc with immediate to or [v4,016/163] tcg: Convert orc to TCGOutOpBinary [v4,017/163] tcg: Merge INDEX_op_orc_{i32,i64} [v4,018/163] tcg: Convert xor to TCGOutOpBinary [v4,019/163] tcg: Merge INDEX_op_xor_{i32,i64} [v4,020/163] tcg/optimize: Fold eqv with immediate to xor [v4,021/163] tcg: Convert eqv to TCGOutOpBinary [v4,022/163] tcg: Merge INDEX_op_eqv_{i32,i64} [v4,023/163] tcg: Convert nand to TCGOutOpBinary [v4,024/163] tcg: Merge INDEX_op_nand_{i32,i64} [v4,025/163] tcg/loongarch64: Do not accept constant argument to nor [v4,026/163] tcg: Convert nor to TCGOutOpBinary [v4,027/163] tcg: Merge INDEX_op_nor_{i32,i64} [v4,028/163] tcg/arm: Fix constraints for sub [v4,029/163] tcg: Convert sub to TCGOutOpSubtract [v4,030/163] tcg: Merge INDEX_op_sub_{i32,i64} [v4,031/163] tcg: Convert neg to TCGOutOpUnary [v4,032/163] tcg: Merge INDEX_op_neg_{i32,i64} [v4,033/163] tcg: Convert not to TCGOutOpUnary [v4,034/163] tcg: Merge INDEX_op_not_{i32,i64} [v4,035/163] tcg: Convert mul to TCGOutOpBinary [v4,036/163] tcg: Merge INDEX_op_mul_{i32,i64} [v4,037/163] tcg: Convert muluh to TCGOutOpBinary [v4,038/163] tcg: Merge INDEX_op_muluh_{i32,i64} [v4,039/163] tcg: Convert mulsh to TCGOutOpBinary [v4,040/163] tcg: Merge INDEX_op_mulsh_{i32,i64} [v4,041/163] tcg: Convert div to TCGOutOpBinary [v4,042/163] tcg: Merge INDEX_op_div_{i32,i64} [v4,043/163] tcg: Convert divu to TCGOutOpBinary [v4,044/163] tcg: Merge INDEX_op_divu_{i32,i64} [v4,045/163] tcg: Convert div2 to TCGOutOpDivRem [v4,046/163] tcg: Merge INDEX_op_div2_{i32,i64} [v4,047/163] tcg: Convert divu2 to TCGOutOpDivRem [v4,048/163] tcg: Merge INDEX_op_divu2_{i32,i64} [v4,049/163] tcg: Convert rem to TCGOutOpBinary [v4,050/163] tcg: Merge INDEX_op_rem_{i32,i64} [v4,051/163] tcg: Convert remu to TCGOutOpBinary [v4,052/163] tcg: Merge INDEX_op_remu_{i32,i64} [v4,053/163] tcg: Convert shl to TCGOutOpBinary [v4,054/163] tcg: Merge INDEX_op_shl_{i32,i64} [v4,055/163] tcg: Convert shr to TCGOutOpBinary [v4,056/163] tcg: Merge INDEX_op_shr_{i32,i64} [v4,057/163] tcg: Convert sar to TCGOutOpBinary [v4,058/163] tcg: Merge INDEX_op_sar_{i32,i64} [v4,059/163] tcg: Do not require both rotr and rotl from the backend [v4,060/163] tcg: Convert rotl, rotr to TCGOutOpBinary [v4,061/163] tcg: Merge INDEX_op_rot{l,r}_{i32,i64} [v4,062/163] tcg: Convert clz to TCGOutOpBinary [v4,063/163] tcg: Merge INDEX_op_clz_{i32,i64} [v4,064/163] tcg: Convert ctz to TCGOutOpBinary [v4,065/163] tcg: Merge INDEX_op_ctz_{i32,i64} [v4,066/163] tcg: Convert ctpop to TCGOutOpUnary [v4,067/163] tcg: Merge INDEX_op_ctpop_{i32,i64} [v4,068/163] tcg: Convert muls2 to TCGOutOpMul2 [v4,069/163] tcg: Merge INDEX_op_muls2_{i32,i64} [v4,070/163] tcg: Convert mulu2 to TCGOutOpMul2 [v4,071/163] tcg: Merge INDEX_op_mulu2_{i32,i64} [v4,072/163] tcg/loongarch64: Support negsetcond [v4,073/163] tcg/mips: Support negsetcond [v4,074/163] tcg/tci: Support negsetcond [v4,075/163] tcg: Remove TCG_TARGET_HAS_negsetcond_{i32,i64} [v4,076/163] tcg: Convert setcond, negsetcond to TCGOutOpSetcond [v4,077/163] tcg: Merge INDEX_op_{neg}setcond_{i32,i64}` [v4,078/163] tcg: Convert brcond to TCGOutOpBrcond [v4,079/163] tcg: Merge INDEX_op_brcond_{i32,i64} [v4,080/163] tcg: Convert movcond to TCGOutOpMovcond [v4,081/163] tcg: Merge INDEX_op_movcond_{i32,i64} [v4,082/163] tcg/ppc: Drop fallback constant loading in tcg_out_cmp [v4,083/163] tcg/arm: Expand arguments to tcg_out_cmp2 [v4,084/163] tcg/ppc: Expand arguments to tcg_out_cmp2 [v4,085/163] tcg: Convert brcond2_i32 to TCGOutOpBrcond2 [v4,086/163] tcg: Convert setcond2_i32 to TCGOutOpSetcond2 [v4,087/163] tcg: Convert bswap16 to TCGOutOpBswap [v4,088/163] tcg: Merge INDEX_op_bswap16_{i32,i64} [v4,089/163] tcg: Convert bswap32 to TCGOutOpBswap [v4,090/163] tcg: Merge INDEX_op_bswap32_{i32,i64} [v4,091/163] tcg: Convert bswap64 to TCGOutOpUnary [v4,092/163] tcg: Rename INDEX_op_bswap64_i64 to INDEX_op_bswap64 [v4,093/163] tcg: Convert extract to TCGOutOpExtract [v4,094/163] tcg: Merge INDEX_op_extract_{i32,i64} [v4,095/163] tcg: Convert sextract to TCGOutOpExtract [v4,096/163] tcg: Merge INDEX_op_sextract_{i32,i64} [v4,097/163] tcg: Convert ext_i32_i64 to TCGOutOpUnary [v4,098/163] tcg: Convert extu_i32_i64 to TCGOutOpUnary [v4,099/163] tcg: Convert extrl_i64_i32 to TCGOutOpUnary [v4,100/163] tcg: Convert extrh_i64_i32 to TCGOutOpUnary [v4,101/163] tcg: Convert deposit to TCGOutOpDeposit [v4,102/163] tcg/aarch64: Improve deposit [v4,103/163] tcg: Merge INDEX_op_deposit_{i32,i64} [v4,104/163] tcg: Convert extract2 to TCGOutOpExtract2 [v4,105/163] tcg: Merge INDEX_op_extract2_{i32,i64} [v4,106/163] tcg: Expand fallback add2 with 32-bit operations [v4,107/163] tcg: Expand fallback sub2 with 32-bit operations [v4,108/163] tcg: Do not default add2/sub2_i32 for 32-bit hosts [v4,109/163] tcg/mips: Drop support for add2/sub2 [v4,110/163] tcg/riscv: Drop support for add2/sub2 [v4,111/163] tcg: Move i into each for loop in liveness_pass_1 [v4,112/163] tcg: Sink def, nb_iargs, nb_oargs loads in liveness_pass_1 [v4,113/163] tcg: Add add/sub with carry opcodes and infrastructure [v4,114/163] tcg: Add TCGOutOp structures for add/sub carry opcodes [v4,115/163] tcg/optimize: Handle add/sub with carry opcodes [v4,116/163] tcg/optimize: With two const operands, prefer 0 in arg1 [v4,117/163] tcg: Use add carry opcodes to expand add2 [v4,118/163] tcg: Use sub carry opcodes to expand sub2 [v4,119/163] tcg/i386: Honor carry_live in tcg_out_movi [v4,120/163] tcg/i386: Implement add/sub carry opcodes [v4,121/163] tcg/i386: Remove support for add2/sub2 [v4,122/163] tcg/i386: Special case addci r, 0, 0 [v4,123/163] tcg: Add tcg_gen_addcio_{i32,i64,tl} [v4,124/163] target/arm: Use tcg_gen_addcio_* for ADCS [v4,125/163] target/hppa: Use tcg_gen_addcio_i64 [v4,126/163] target/microblaze: Use tcg_gen_addcio_i32 [v4,127/163] target/openrisc: Use tcg_gen_addcio_* for ADDC [v4,128/163] target/ppc: Use tcg_gen_addcio_tl for ADD and SUBF [v4,129/163] target/s390x: Use tcg_gen_addcio_i64 for op_addc64 [v4,130/163] target/sh4: Use tcg_gen_addcio_i32 for addc [v4,131/163] target/sparc: Use tcg_gen_addcio_tl for gen_op_addcc_int [v4,132/163] target/tricore: Use tcg_gen_addcio_i32 for gen_addc_CC [v4,133/163] tcg/aarch64: Implement add/sub carry opcodes [v4,134/163] tcg/aarch64: Remove support for add2/sub2 [v4,135/163] tcg/arm: Implement add/sub carry opcodes [v4,136/163] tcg/arm: Remove support for add2/sub2 [v4,137/163] tcg/ppc: Implement add/sub carry opcodes [v4,138/163] tcg/ppc: Remove support for add2/sub2 [v4,139/163] tcg/s390x: Honor carry_live in tcg_out_movi [v4,140/163] tcg/s390: Add TCG_CT_CONST_N32 [v4,141/163] tcg/s390x: Implement add/sub carry opcodes [v4,142/163] tcg/s390x: Use ADD LOGICAL WITH SIGNED IMMEDIATE [v4,143/163] tcg/s390x: Remove support for add2/sub2 [v4,144/163] tcg/sparc64: Hoist tcg_cond_to_bcond lookup out of tcg_out_movcc [v4,145/163] tcg/sparc64: Implement add/sub carry opcodes [v4,146/163] tcg/sparc64: Remove support for add2/sub2 [v4,147/163] tcg/tci: Implement add/sub carry opcodes [v4,148/163] tcg/tci: Remove support for add2/sub2 [v4,149/163] tcg: Remove add2/sub2 opcodes [v4,150/163] tcg: Formalize tcg_out_mb [v4,151/163] tcg: Formalize tcg_out_br [v4,152/163] tcg: Formalize tcg_out_goto_ptr [v4,153/163] tcg: Assign TCGOP_TYPE in liveness_pass_2 [v4,154/163] tcg: Convert ld to TCGOutOpLoad [v4,155/163] tcg: Merge INDEX_op_ld_{i32,i64} [v4,156/163] tcg: Convert st to TCGOutOpStore [v4,157/163] tcg: Merge INDEX_op_st_{i32,i64} [v4,158/163] tcg: Stash MemOp size in TCGOP_FLAGS [v4,159/163] tcg: Remove INDEX_op_qemu_st8_* [v4,160/163] tcg: Merge INDEX_op_{ld,st}_{i32,i64,i128} [v4,161/163] tcg: Convert qemu_ld{2} to TCGOutOpLoad{2} [v4,162/163] tcg: Convert qemu_st{2} to TCGOutOpLdSt{2} [v4,163/163] tcg: Remove tcg_out_op

Message ID

20250415192515.232910-70-richard.henderson@linaro.org

State

New

Headers

Received-SPF: pass (google.com: domain of
 qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as
 permitted sender) client-ip=209.51.188.17;
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Cc: =?utf-8?q?Philippe_Mathieu-Daud=C3=A9?= <philmd@linaro.org>
Subject: [PATCH v4 069/163] tcg: Merge INDEX_op_muls2_{i32,i64}
Date: Tue, 15 Apr 2025 12:23:40 -0700
Message-ID: <20250415192515.232910-70-richard.henderson@linaro.org>
In-Reply-To: <20250415192515.232910-1-richard.henderson@linaro.org>
References: <20250415192515.232910-1-richard.henderson@linaro.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2607:f8b0:4864:20::535;
 envelope-from=richard.henderson@linaro.org; helo=mail-pg1-x535.google.com
Precedence: list
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org

Series

tcg: Convert to TCGOutOp structures | expand

Commit Message

Richard Henderson April 15, 2025, 7:23 p.m. UTC

Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 include/tcg/tcg-opc.h    |  3 +--
 tcg/optimize.c           | 17 +++++++++--------
 tcg/tcg-op.c             |  8 ++++----
 tcg/tcg.c                |  9 +++------
 tcg/tci.c                |  6 ++----
 docs/devel/tcg-ops.rst   |  2 +-
 tcg/tci/tcg-target.c.inc |  3 +--
 7 files changed, 21 insertions(+), 27 deletions(-)

Comments

Pierrick Bouvier April 15, 2025, 9:17 p.m. UTC | #1

On 4/15/25 12:23, Richard Henderson wrote:
> Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
> ---
>   include/tcg/tcg-opc.h    |  3 +--
>   tcg/optimize.c           | 17 +++++++++--------
>   tcg/tcg-op.c             |  8 ++++----
>   tcg/tcg.c                |  9 +++------
>   tcg/tci.c                |  6 ++----
>   docs/devel/tcg-ops.rst   |  2 +-
>   tcg/tci/tcg-target.c.inc |  3 +--
>   7 files changed, 21 insertions(+), 27 deletions(-)
> 
> diff --git a/include/tcg/tcg-opc.h b/include/tcg/tcg-opc.h
> index f4ccde074b..a45b22ca1a 100644
> --- a/include/tcg/tcg-opc.h
> +++ b/include/tcg/tcg-opc.h
> @@ -51,6 +51,7 @@ DEF(divu, 1, 2, 0, TCG_OPF_INT)
>   DEF(divu2, 2, 3, 0, TCG_OPF_INT)
>   DEF(eqv, 1, 2, 0, TCG_OPF_INT)
>   DEF(mul, 1, 2, 0, TCG_OPF_INT)
> +DEF(muls2, 2, 2, 0, TCG_OPF_INT)
>   DEF(mulsh, 1, 2, 0, TCG_OPF_INT)
>   DEF(muluh, 1, 2, 0, TCG_OPF_INT)
>   DEF(nand, 1, 2, 0, TCG_OPF_INT)
> @@ -92,7 +93,6 @@ DEF(brcond_i32, 0, 2, 2, TCG_OPF_BB_END | TCG_OPF_COND_BRANCH)
>   DEF(add2_i32, 2, 4, 0, 0)
>   DEF(sub2_i32, 2, 4, 0, 0)
>   DEF(mulu2_i32, 2, 2, 0, 0)
> -DEF(muls2_i32, 2, 2, 0, 0)
>   DEF(brcond2_i32, 0, 4, 2, TCG_OPF_BB_END | TCG_OPF_COND_BRANCH)
>   DEF(setcond2_i32, 1, 4, 1, 0)
>   
> @@ -134,7 +134,6 @@ DEF(bswap64_i64, 1, 1, 1, 0)
>   DEF(add2_i64, 2, 4, 0, 0)
>   DEF(sub2_i64, 2, 4, 0, 0)
>   DEF(mulu2_i64, 2, 2, 0, 0)
> -DEF(muls2_i64, 2, 2, 0, 0)
>   
>   #define DATA64_ARGS  (TCG_TARGET_REG_BITS == 64 ? 1 : 2)
>   
> diff --git a/tcg/optimize.c b/tcg/optimize.c
> index 78979623c5..2b0ae4c12d 100644
> --- a/tcg/optimize.c
> +++ b/tcg/optimize.c
> @@ -2062,16 +2062,17 @@ static bool fold_multiply2(OptContext *ctx, TCGOp *op)
>               h = (int32_t)(l >> 32);
>               l = (int32_t)l;
>               break;
> -        case INDEX_op_muls2_i32:
> -            l = (int64_t)(int32_t)a * (int32_t)b;
> -            h = l >> 32;
> -            l = (int32_t)l;
> -            break;
>           case INDEX_op_mulu2_i64:
>               mulu64(&l, &h, a, b);
>               break;
> -        case INDEX_op_muls2_i64:
> -            muls64(&l, &h, a, b);
> +        case INDEX_op_muls2:
> +            if (ctx->type == TCG_TYPE_I32) {
> +                l = (int64_t)(int32_t)a * (int32_t)b;
> +                h = l >> 32;
> +                l = (int32_t)l;
> +            } else {
> +                muls64(&l, &h, a, b);
> +            }
>               break;
>           default:
>               g_assert_not_reached();
> @@ -2961,7 +2962,7 @@ void tcg_optimize(TCGContext *s)
>           case INDEX_op_muluh:
>               done = fold_mul_highpart(&ctx, op);
>               break;
> -        CASE_OP_32_64(muls2):
> +        case INDEX_op_muls2:
>           CASE_OP_32_64(mulu2):
>               done = fold_multiply2(&ctx, op);
>               break;
> diff --git a/tcg/tcg-op.c b/tcg/tcg-op.c
> index 8a0846a8d2..0f48484dfe 100644
> --- a/tcg/tcg-op.c
> +++ b/tcg/tcg-op.c
> @@ -1162,8 +1162,8 @@ void tcg_gen_mulu2_i32(TCGv_i32 rl, TCGv_i32 rh, TCGv_i32 arg1, TCGv_i32 arg2)
>   
>   void tcg_gen_muls2_i32(TCGv_i32 rl, TCGv_i32 rh, TCGv_i32 arg1, TCGv_i32 arg2)
>   {
> -    if (tcg_op_supported(INDEX_op_muls2_i32, TCG_TYPE_I32, 0)) {
> -        tcg_gen_op4_i32(INDEX_op_muls2_i32, rl, rh, arg1, arg2);
> +    if (tcg_op_supported(INDEX_op_muls2, TCG_TYPE_I32, 0)) {
> +        tcg_gen_op4_i32(INDEX_op_muls2, rl, rh, arg1, arg2);
>       } else if (tcg_op_supported(INDEX_op_mulsh, TCG_TYPE_I32, 0)) {
>           TCGv_i32 t = tcg_temp_ebb_new_i32();
>           tcg_gen_op3_i32(INDEX_op_mul, t, arg1, arg2);
> @@ -2880,8 +2880,8 @@ void tcg_gen_mulu2_i64(TCGv_i64 rl, TCGv_i64 rh, TCGv_i64 arg1, TCGv_i64 arg2)
>   
>   void tcg_gen_muls2_i64(TCGv_i64 rl, TCGv_i64 rh, TCGv_i64 arg1, TCGv_i64 arg2)
>   {
> -    if (tcg_op_supported(INDEX_op_muls2_i64, TCG_TYPE_I64, 0)) {
> -        tcg_gen_op4_i64(INDEX_op_muls2_i64, rl, rh, arg1, arg2);
> +    if (tcg_op_supported(INDEX_op_muls2, TCG_TYPE_I64, 0)) {
> +        tcg_gen_op4_i64(INDEX_op_muls2, rl, rh, arg1, arg2);
>       } else if (tcg_op_supported(INDEX_op_mulsh, TCG_TYPE_I64, 0)) {
>           TCGv_i64 t = tcg_temp_ebb_new_i64();
>           tcg_gen_op3_i64(INDEX_op_mul, t, arg1, arg2);
> diff --git a/tcg/tcg.c b/tcg/tcg.c
> index e4b38d9bda..8e6f8c1194 100644
> --- a/tcg/tcg.c
> +++ b/tcg/tcg.c
> @@ -1041,8 +1041,7 @@ static const TCGOutOp * const all_outop[NB_OPS] = {
>       OUTOP(INDEX_op_divu2, TCGOutOpDivRem, outop_divu2),
>       OUTOP(INDEX_op_eqv, TCGOutOpBinary, outop_eqv),
>       OUTOP(INDEX_op_mul, TCGOutOpBinary, outop_mul),
> -    OUTOP(INDEX_op_muls2_i32, TCGOutOpMul2, outop_muls2),
> -    OUTOP(INDEX_op_muls2_i64, TCGOutOpMul2, outop_muls2),
> +    OUTOP(INDEX_op_muls2, TCGOutOpMul2, outop_muls2),
>       OUTOP(INDEX_op_mulsh, TCGOutOpBinary, outop_mulsh),
>       OUTOP(INDEX_op_muluh, TCGOutOpBinary, outop_muluh),
>       OUTOP(INDEX_op_nand, TCGOutOpBinary, outop_nand),
> @@ -4008,8 +4007,7 @@ liveness_pass_1(TCGContext *s)
>               }
>               goto do_not_remove;
>   
> -        case INDEX_op_muls2_i32:
> -        case INDEX_op_muls2_i64:
> +        case INDEX_op_muls2:
>               opc_new = INDEX_op_mul;
>               opc_new2 = INDEX_op_mulsh;
>               goto do_mul2;
> @@ -5474,8 +5472,7 @@ static void tcg_reg_alloc_op(TCGContext *s, const TCGOp *op)
>           }
>           break;
>   
> -    case INDEX_op_muls2_i32:
> -    case INDEX_op_muls2_i64:
> +    case INDEX_op_muls2:
>           {
>               const TCGOutOpMul2 *out =
>                   container_of(all_outop[op->opc], TCGOutOpMul2, base);
> diff --git a/tcg/tci.c b/tcg/tci.c
> index 51cbb5760a..708ded34c7 100644
> --- a/tcg/tci.c
> +++ b/tcg/tci.c
> @@ -581,8 +581,7 @@ uintptr_t QEMU_DISABLE_CFI tcg_qemu_tb_exec(CPUArchState *env,
>               tci_args_rr(insn, &r0, &r1);
>               regs[r0] = ctpop_tr(regs[r1]);
>               break;
> -        case INDEX_op_muls2_i32:
> -        case INDEX_op_muls2_i64:
> +        case INDEX_op_muls2:
>               tci_args_rrrr(insn, &r0, &r1, &r2, &r3);
>   #if TCG_TARGET_REG_BITS == 32
>               tmp64 = (int64_t)(int32_t)regs[r2] * (int32_t)regs[r3];
> @@ -1095,10 +1094,9 @@ int print_insn_tci(bfd_vma addr, disassemble_info *info)
>                              str_r(r3), str_r(r4), str_c(c));
>           break;
>   
> +    case INDEX_op_muls2:
>       case INDEX_op_mulu2_i32:
>       case INDEX_op_mulu2_i64:
> -    case INDEX_op_muls2_i32:
> -    case INDEX_op_muls2_i64:
>           tci_args_rrrr(insn, &r0, &r1, &r2, &r3);
>           info->fprintf_func(info->stream, "%-12s  %s, %s, %s, %s",
>                              op_name, str_r(r0), str_r(r1),
> diff --git a/docs/devel/tcg-ops.rst b/docs/devel/tcg-ops.rst
> index fb7764e3c0..0394767291 100644
> --- a/docs/devel/tcg-ops.rst
> +++ b/docs/devel/tcg-ops.rst
> @@ -604,7 +604,7 @@ Multiword arithmetic support
>        - | Similar to mul, except two unsigned inputs *t1* and *t2* yielding the full
>            double-word product *t0*. The latter is returned in two single-word outputs.
>   
> -   * - muls2_i32/i64 *t0_low*, *t0_high*, *t1*, *t2*
> +   * - muls2 *t0_low*, *t0_high*, *t1*, *t2*
>   
>        - | Similar to mulu2, except the two inputs *t1* and *t2* are signed.
>   
> diff --git a/tcg/tci/tcg-target.c.inc b/tcg/tci/tcg-target.c.inc
> index f568d4edb9..aa3ce929b4 100644
> --- a/tcg/tci/tcg-target.c.inc
> +++ b/tcg/tci/tcg-target.c.inc
> @@ -716,8 +716,7 @@ static TCGConstraintSetIndex cset_mul2(TCGType type, unsigned flags)
>   static void tgen_muls2(TCGContext *s, TCGType type,
>                          TCGReg a0, TCGReg a1, TCGReg a2, TCGReg a3)
>   {
> -    tcg_out_op_rrrr(s, glue(INDEX_op_muls2_i,TCG_TARGET_REG_BITS),
> -                    a0, a1, a2, a3);
> +    tcg_out_op_rrrr(s, INDEX_op_muls2, a0, a1, a2, a3);
>   }
>   
>   static const TCGOutOpMul2 outop_muls2 = {

Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org>

diff --git a/include/tcg/tcg-opc.h b/include/tcg/tcg-opc.h
index f4ccde074b..a45b22ca1a 100644
--- a/include/tcg/tcg-opc.h
+++ b/include/tcg/tcg-opc.h
@@ -51,6 +51,7 @@  DEF(divu, 1, 2, 0, TCG_OPF_INT)
 DEF(divu2, 2, 3, 0, TCG_OPF_INT)
 DEF(eqv, 1, 2, 0, TCG_OPF_INT)
 DEF(mul, 1, 2, 0, TCG_OPF_INT)
+DEF(muls2, 2, 2, 0, TCG_OPF_INT)
 DEF(mulsh, 1, 2, 0, TCG_OPF_INT)
 DEF(muluh, 1, 2, 0, TCG_OPF_INT)
 DEF(nand, 1, 2, 0, TCG_OPF_INT)
@@ -92,7 +93,6 @@  DEF(brcond_i32, 0, 2, 2, TCG_OPF_BB_END | TCG_OPF_COND_BRANCH)
 DEF(add2_i32, 2, 4, 0, 0)
 DEF(sub2_i32, 2, 4, 0, 0)
 DEF(mulu2_i32, 2, 2, 0, 0)
-DEF(muls2_i32, 2, 2, 0, 0)
 DEF(brcond2_i32, 0, 4, 2, TCG_OPF_BB_END | TCG_OPF_COND_BRANCH)
 DEF(setcond2_i32, 1, 4, 1, 0)
 
@@ -134,7 +134,6 @@  DEF(bswap64_i64, 1, 1, 1, 0)
 DEF(add2_i64, 2, 4, 0, 0)
 DEF(sub2_i64, 2, 4, 0, 0)
 DEF(mulu2_i64, 2, 2, 0, 0)
-DEF(muls2_i64, 2, 2, 0, 0)
 
 #define DATA64_ARGS  (TCG_TARGET_REG_BITS == 64 ? 1 : 2)
 
diff --git a/tcg/optimize.c b/tcg/optimize.c
index 78979623c5..2b0ae4c12d 100644
--- a/tcg/optimize.c
+++ b/tcg/optimize.c
@@ -2062,16 +2062,17 @@  static bool fold_multiply2(OptContext *ctx, TCGOp *op)
             h = (int32_t)(l >> 32);
             l = (int32_t)l;
             break;
-        case INDEX_op_muls2_i32:
-            l = (int64_t)(int32_t)a * (int32_t)b;
-            h = l >> 32;
-            l = (int32_t)l;
-            break;
         case INDEX_op_mulu2_i64:
             mulu64(&l, &h, a, b);
             break;
-        case INDEX_op_muls2_i64:
-            muls64(&l, &h, a, b);
+        case INDEX_op_muls2:
+            if (ctx->type == TCG_TYPE_I32) {
+                l = (int64_t)(int32_t)a * (int32_t)b;
+                h = l >> 32;
+                l = (int32_t)l;
+            } else {
+                muls64(&l, &h, a, b);
+            }
             break;
         default:
             g_assert_not_reached();
@@ -2961,7 +2962,7 @@  void tcg_optimize(TCGContext *s)
         case INDEX_op_muluh:
             done = fold_mul_highpart(&ctx, op);
             break;
-        CASE_OP_32_64(muls2):
+        case INDEX_op_muls2:
         CASE_OP_32_64(mulu2):
             done = fold_multiply2(&ctx, op);
             break;
diff --git a/tcg/tcg-op.c b/tcg/tcg-op.c
index 8a0846a8d2..0f48484dfe 100644
--- a/tcg/tcg-op.c
+++ b/tcg/tcg-op.c
@@ -1162,8 +1162,8 @@  void tcg_gen_mulu2_i32(TCGv_i32 rl, TCGv_i32 rh, TCGv_i32 arg1, TCGv_i32 arg2)
 
 void tcg_gen_muls2_i32(TCGv_i32 rl, TCGv_i32 rh, TCGv_i32 arg1, TCGv_i32 arg2)
 {
-    if (tcg_op_supported(INDEX_op_muls2_i32, TCG_TYPE_I32, 0)) {
-        tcg_gen_op4_i32(INDEX_op_muls2_i32, rl, rh, arg1, arg2);
+    if (tcg_op_supported(INDEX_op_muls2, TCG_TYPE_I32, 0)) {
+        tcg_gen_op4_i32(INDEX_op_muls2, rl, rh, arg1, arg2);
     } else if (tcg_op_supported(INDEX_op_mulsh, TCG_TYPE_I32, 0)) {
         TCGv_i32 t = tcg_temp_ebb_new_i32();
         tcg_gen_op3_i32(INDEX_op_mul, t, arg1, arg2);
@@ -2880,8 +2880,8 @@  void tcg_gen_mulu2_i64(TCGv_i64 rl, TCGv_i64 rh, TCGv_i64 arg1, TCGv_i64 arg2)
 
 void tcg_gen_muls2_i64(TCGv_i64 rl, TCGv_i64 rh, TCGv_i64 arg1, TCGv_i64 arg2)
 {
-    if (tcg_op_supported(INDEX_op_muls2_i64, TCG_TYPE_I64, 0)) {
-        tcg_gen_op4_i64(INDEX_op_muls2_i64, rl, rh, arg1, arg2);
+    if (tcg_op_supported(INDEX_op_muls2, TCG_TYPE_I64, 0)) {
+        tcg_gen_op4_i64(INDEX_op_muls2, rl, rh, arg1, arg2);
     } else if (tcg_op_supported(INDEX_op_mulsh, TCG_TYPE_I64, 0)) {
         TCGv_i64 t = tcg_temp_ebb_new_i64();
         tcg_gen_op3_i64(INDEX_op_mul, t, arg1, arg2);
diff --git a/tcg/tcg.c b/tcg/tcg.c
index e4b38d9bda..8e6f8c1194 100644
--- a/tcg/tcg.c
+++ b/tcg/tcg.c
@@ -1041,8 +1041,7 @@  static const TCGOutOp * const all_outop[NB_OPS] = {
     OUTOP(INDEX_op_divu2, TCGOutOpDivRem, outop_divu2),
     OUTOP(INDEX_op_eqv, TCGOutOpBinary, outop_eqv),
     OUTOP(INDEX_op_mul, TCGOutOpBinary, outop_mul),
-    OUTOP(INDEX_op_muls2_i32, TCGOutOpMul2, outop_muls2),
-    OUTOP(INDEX_op_muls2_i64, TCGOutOpMul2, outop_muls2),
+    OUTOP(INDEX_op_muls2, TCGOutOpMul2, outop_muls2),
     OUTOP(INDEX_op_mulsh, TCGOutOpBinary, outop_mulsh),
     OUTOP(INDEX_op_muluh, TCGOutOpBinary, outop_muluh),
     OUTOP(INDEX_op_nand, TCGOutOpBinary, outop_nand),
@@ -4008,8 +4007,7 @@  liveness_pass_1(TCGContext *s)
             }
             goto do_not_remove;
 
-        case INDEX_op_muls2_i32:
-        case INDEX_op_muls2_i64:
+        case INDEX_op_muls2:
             opc_new = INDEX_op_mul;
             opc_new2 = INDEX_op_mulsh;
             goto do_mul2;
@@ -5474,8 +5472,7 @@  static void tcg_reg_alloc_op(TCGContext *s, const TCGOp *op)
         }
         break;
 
-    case INDEX_op_muls2_i32:
-    case INDEX_op_muls2_i64:
+    case INDEX_op_muls2:
         {
             const TCGOutOpMul2 *out =
                 container_of(all_outop[op->opc], TCGOutOpMul2, base);
diff --git a/tcg/tci.c b/tcg/tci.c
index 51cbb5760a..708ded34c7 100644
--- a/tcg/tci.c
+++ b/tcg/tci.c
@@ -581,8 +581,7 @@  uintptr_t QEMU_DISABLE_CFI tcg_qemu_tb_exec(CPUArchState *env,
             tci_args_rr(insn, &r0, &r1);
             regs[r0] = ctpop_tr(regs[r1]);
             break;
-        case INDEX_op_muls2_i32:
-        case INDEX_op_muls2_i64:
+        case INDEX_op_muls2:
             tci_args_rrrr(insn, &r0, &r1, &r2, &r3);
 #if TCG_TARGET_REG_BITS == 32
             tmp64 = (int64_t)(int32_t)regs[r2] * (int32_t)regs[r3];
@@ -1095,10 +1094,9 @@  int print_insn_tci(bfd_vma addr, disassemble_info *info)
                            str_r(r3), str_r(r4), str_c(c));
         break;
 
+    case INDEX_op_muls2:
     case INDEX_op_mulu2_i32:
     case INDEX_op_mulu2_i64:
-    case INDEX_op_muls2_i32:
-    case INDEX_op_muls2_i64:
         tci_args_rrrr(insn, &r0, &r1, &r2, &r3);
         info->fprintf_func(info->stream, "%-12s  %s, %s, %s, %s",
                            op_name, str_r(r0), str_r(r1),
diff --git a/docs/devel/tcg-ops.rst b/docs/devel/tcg-ops.rst
index fb7764e3c0..0394767291 100644
--- a/docs/devel/tcg-ops.rst
+++ b/docs/devel/tcg-ops.rst
@@ -604,7 +604,7 @@  Multiword arithmetic support
      - | Similar to mul, except two unsigned inputs *t1* and *t2* yielding the full
          double-word product *t0*. The latter is returned in two single-word outputs.
 
-   * - muls2_i32/i64 *t0_low*, *t0_high*, *t1*, *t2*
+   * - muls2 *t0_low*, *t0_high*, *t1*, *t2*
 
      - | Similar to mulu2, except the two inputs *t1* and *t2* are signed.
 
diff --git a/tcg/tci/tcg-target.c.inc b/tcg/tci/tcg-target.c.inc
index f568d4edb9..aa3ce929b4 100644
--- a/tcg/tci/tcg-target.c.inc
+++ b/tcg/tci/tcg-target.c.inc
@@ -716,8 +716,7 @@  static TCGConstraintSetIndex cset_mul2(TCGType type, unsigned flags)
 static void tgen_muls2(TCGContext *s, TCGType type,
                        TCGReg a0, TCGReg a1, TCGReg a2, TCGReg a3)
 {
-    tcg_out_op_rrrr(s, glue(INDEX_op_muls2_i,TCG_TARGET_REG_BITS),
-                    a0, a1, a2, a3);
+    tcg_out_op_rrrr(s, INDEX_op_muls2, a0, a1, a2, a3);
 }
 
 static const TCGOutOpMul2 outop_muls2 = {

[v4,069/163] tcg: Merge INDEX_op_muls2_{i32,i64}

Commit Message

Comments

Patch