[v4,11/69] target/arm: Simplify op_smlawx for SMLAW*

Message ID	20190904193059.26202-12-richard.henderson@linaro.org
State	Accepted
Commit	485b607d4f393e0de92c922806a68aef22340c98
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Date: Wed, 4 Sep 2019 12:30:01 -0700 Message-Id: <20190904193059.26202-12-richard.henderson@linaro.org> In-Reply-To: <20190904193059.26202-1-richard.henderson@linaro.org> References: <20190904193059.26202-1-richard.henderson@linaro.org> Subject: [Qemu-devel] [PATCH v4 11/69] target/arm: Simplify op_smlawx for SMLAW* Precedence: list Cc: peter.maydell@linaro.org, qemu-arm@nongnu.org Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>
Series	target/arm: Convert aa32 base isa to decodetree \| expand [v4,00/69] target/arm: Convert aa32 base isa to decodetree [v4,01/69] target/arm: Use store_reg_from_load in thumb2 code [v4,02/69] target/arm: Add stubs for aa32 decodetree [v4,03/69] target/arm: Convert Data Processing (register) [v4,04/69] target/arm: Convert Data Processing (reg-shifted-reg) [v4,05/69] target/arm: Convert Data Processing (immediate) [v4,06/69] target/arm: Convert multiply and multiply accumulate [v4,07/69] target/arm: Simplify UMAAL [v4,08/69] target/arm: Convert Saturating addition and subtraction [v4,09/69] target/arm: Convert Halfword multiply and multiply accumulate [v4,10/69] target/arm: Simplify op_smlaxxx for SMLAL* [v4,11/69] target/arm: Simplify op_smlawx for SMLAW* [v4,12/69] target/arm: Convert MSR (immediate) and hints [v4,13/69] target/arm: Convert MRS/MSR (banked, register) [v4,14/69] target/arm: Convert Cyclic Redundancy Check [v4,15/69] target/arm: Convert BX, BXJ, BLX (register) [v4,16/69] target/arm: Convert CLZ [v4,17/69] target/arm: Convert ERET [v4,18/69] target/arm: Convert the rest of A32 Miscelaneous instructions [v4,19/69] target/arm: Convert T32 ADDW/SUBW [v4,20/69] target/arm: Convert load/store (register, immediate, literal) [v4,21/69] target/arm: Convert Synchronization primitives [v4,22/69] target/arm: Diagnose UNPREDICTABLE ldrex/strex cases [v4,23/69] target/arm: Convert USAD8, USADA8, SBFX, UBFX, BFC, BFI, UDF [v4,24/69] target/arm: Convert Parallel addition and subtraction [v4,25/69] target/arm: Convert packing, unpacking, saturation, and reversal [v4,26/69] target/arm: Convert Signed multiply, signed and unsigned divide [v4,27/69] target/arm: Convert MOVW, MOVT [v4,28/69] target/arm: Convert LDM, STM [v4,29/69] target/arm: Diagnose writeback register in list for LDM for v7 [v4,30/69] target/arm: Diagnose too few registers in list for LDM/STM [v4,31/69] target/arm: Diagnose base == pc for LDM/STM [v4,32/69] target/arm: Convert B, BL, BLX (immediate) [v4,33/69] target/arm: Convert SVC [v4,34/69] target/arm: Convert RFE and SRS [v4,35/69] target/arm: Convert Clear-Exclusive, Barriers [v4,36/69] target/arm: Convert CPS (privileged) [v4,37/69] target/arm: Convert SETEND [v4,38/69] target/arm: Convert PLI, PLD, PLDW [v4,39/69] target/arm: Convert Unallocated memory hint [v4,40/69] target/arm: Convert Table Branch [v4,41/69] target/arm: Convert SG [v4,42/69] target/arm: Convert TT [v4,43/69] target/arm: Simplify disas_thumb2_insn [v4,44/69] target/arm: Simplify disas_arm_insn [v4,45/69] target/arm: Add skeleton for T16 decodetree [v4,46/69] target/arm: Convert T16 data-processing (two low regs) [v4,47/69] target/arm: Convert T16 load/store (register offset) [v4,48/69] target/arm: Convert T16 load/store (immediate offset) [v4,49/69] target/arm: Convert T16 add pc/sp (immediate) [v4,50/69] target/arm: Convert T16 load/store multiple [v4,51/69] target/arm: Convert T16 add/sub (3 low, 2 low and imm) [v4,52/69] target/arm: Convert T16 one low register and immediate [v4,53/69] target/arm: Convert T16 branch and exchange [v4,54/69] target/arm: Convert T16 add, compare, move (two high registers) [v4,55/69] target/arm: Convert T16 adjust sp (immediate) [v4,56/69] target/arm: Convert T16, extract [v4,57/69] target/arm: Convert T16, Change processor state [v4,58/69] target/arm: Convert T16, Reverse bytes [v4,59/69] target/arm: Convert T16, nop hints [v4,60/69] target/arm: Split gen_nop_hint [v4,61/69] target/arm: Convert T16, push and pop [v4,62/69] target/arm: Convert T16, Conditional branches, Supervisor call [v4,63/69] target/arm: Convert T16, Miscellaneous 16-bit instructions [v4,64/69] target/arm: Convert T16, shift immediate [v4,65/69] target/arm: Convert T16, load (literal) [v4,66/69] target/arm: Convert T16, Unconditional branch [v4,67/69] target/arm: Convert T16, long branches [v4,68/69] target/arm: Clean up disas_thumb_insn [v4,69/69] target/arm: Inline gen_bx_im into callers

Message ID

20190904193059.26202-12-richard.henderson@linaro.org

State

Accepted

Commit

485b607d4f393e0de92c922806a68aef22340c98

Headers

Received-SPF: pass (google.com: domain of
	qemu-devel-bounces+patch=linaro.org@nongnu.org designates
	209.51.188.17 as permitted sender) client-ip=209.51.188.17; 
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Date: Wed,  4 Sep 2019 12:30:01 -0700
Message-Id: <20190904193059.26202-12-richard.henderson@linaro.org>
In-Reply-To: <20190904193059.26202-1-richard.henderson@linaro.org>
References: <20190904193059.26202-1-richard.henderson@linaro.org>
Subject: [Qemu-devel] [PATCH v4 11/69] target/arm: Simplify op_smlawx for
	SMLAW*
Precedence: list
Cc: peter.maydell@linaro.org, qemu-arm@nongnu.org
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>

Series

target/arm: Convert aa32 base isa to decodetree | expand

Commit Message

Richard Henderson Sept. 4, 2019, 7:30 p.m. UTC

By shifting the 16-bit input left by 16, we can align the desired
portion of the 48-bit product and use tcg_gen_muls2_i32.

Reviewed-by: Peter Maydell <peter.maydell@linaro.org>

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

---
 target/arm/translate.c | 16 ++++++++--------
 1 file changed, 8 insertions(+), 8 deletions(-)

-- 
2.17.1

diff --git a/target/arm/translate.c b/target/arm/translate.c
index 37aa873e25..71cc96b70e 100644
--- a/target/arm/translate.c
+++ b/target/arm/translate.c
@@ -8242,7 +8242,6 @@  DO_SMLAX(SMLALTT, 2, 1, 1)
 static bool op_smlawx(DisasContext *s, arg_rrrr *a, bool add, bool mt)
 {
     TCGv_i32 t0, t1;
-    TCGv_i64 t64;
 
     if (!ENABLE_ARCH_5TE) {
         return false;
@@ -8250,16 +8249,17 @@  static bool op_smlawx(DisasContext *s, arg_rrrr *a, bool add, bool mt)
 
     t0 = load_reg(s, a->rn);
     t1 = load_reg(s, a->rm);
+    /*
+     * Since the nominal result is product<47:16>, shift the 16-bit
+     * input up by 16 bits, so that the result is at product<63:32>.
+     */
     if (mt) {
-        tcg_gen_sari_i32(t1, t1, 16);
+        tcg_gen_andi_i32(t1, t1, 0xffff0000);
     } else {
-        gen_sxth(t1);
+        tcg_gen_shli_i32(t1, t1, 16);
     }
-    t64 = gen_muls_i64_i32(t0, t1);
-    tcg_gen_shri_i64(t64, t64, 16);
-    t1 = tcg_temp_new_i32();
-    tcg_gen_extrl_i64_i32(t1, t64);
-    tcg_temp_free_i64(t64);
+    tcg_gen_muls2_i32(t0, t1, t0, t1);
+    tcg_temp_free_i32(t0);
     if (add) {
         t0 = load_reg(s, a->ra);
         gen_helper_add_setq(t1, cpu_env, t1, t0);

[v4,11/69] target/arm: Simplify op_smlawx for SMLAW*

Commit Message

Patch