[v3,63/88] target/hppa: Implement HADD

Message ID	20231102013016.369010-64-richard.henderson@linaro.org
State	Superseded
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Cc: deller@gmx.de Subject: [PATCH v3 63/88] target/hppa: Implement HADD Date: Wed, 1 Nov 2023 18:29:51 -0700 Message-Id: <20231102013016.369010-64-richard.henderson@linaro.org> In-Reply-To: <20231102013016.369010-1-richard.henderson@linaro.org> References: <20231102013016.369010-1-richard.henderson@linaro.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2607:f8b0:4864:20::535; envelope-from=richard.henderson@linaro.org; helo=mail-pg1-x535.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org
Series	target/hppa: Implement hppa64 cpu \| expand [v3,00/88] target/hppa: Implement hppa64 cpu [v3,01/88] target/hppa: Include PSW_P in tb flags and mmu index [v3,02/88] target/hppa: Rename hppa_tlb_entry to HPPATLBEntry [v3,03/88] target/hppa: Use IntervalTreeNode in HPPATLBEntry [v3,04/88] target/hppa: Always report one page to tlb_set_page [v3,05/88] target/hppa: Split out hppa_flush_tlb_range [v3,06/88] target/hppa: Populate an interval tree with valid tlb entries [v3,07/88] tcg: Improve expansion of deposit of constant [v3,08/88] tcg: Improve expansion of deposit into a constant [v3,09/88] target/hppa: Remove get_temp [v3,10/88] target/hppa: Remove get_temp_tl [v3,11/88] target/hppa: Remove load_const [v3,12/88] target/hppa: Fix hppa64 case in machine.c [v3,13/88] target/hppa: Fix load in do_load_32 [v3,14/88] target/hppa: Truncate rotate count in trans_shrpw_sar [v3,15/88] target/hppa: Fix trans_ds for hppa64 [v3,16/88] target/hppa: Fix do_add, do_sub for hppa64 [v3,17/88] target/hppa: Fix bb_sar for hppa64 [v3,18/88] target/hppa: Fix extrw and depw with sar for hppa64 [v3,19/88] target/hppa: Introduce TYPE_HPPA64_CPU [v3,20/88] target/hppa: Make HPPA_BTLB_ENTRIES variable [v3,21/88] target/hppa: Implement cpu_list [v3,22/88] target/hppa: Implement hppa_cpu_class_by_name [v3,23/88] target/hppa: Update cpu_hppa_get/put_psw for hppa64 [v3,24/88] target/hppa: Handle absolute addresses for pa2.0 [v3,25/88] target/hppa: Adjust hppa_cpu_dump_state for hppa64 [v3,26/88] target/hppa: Fix hppa64 addressing [v3,27/88] target/hppa: Pass DisasContext to copy_iaoq_entry [v3,28/88] target/hppa: Always use copy_iaoq_entry to set cpu_iaoq_[fb] [v3,29/88] target/hppa: Use copy_iaoq_entry for link in do_ibranch [v3,30/88] target/hppa: Mask inputs in copy_iaoq_entry [v3,31/88] target/hppa: sar register allows only 5 bits on 32-bit CPU [v3,32/88] target/hppa: Pass d to do_cond [v3,33/88] target/hppa: Pass d to do_sub_cond [v3,34/88] target/hppa: Pass d to do_log_cond [v3,35/88] target/hppa: Pass d to do_sed_cond [v3,36/88] target/hppa: Pass d to do_unit_cond [v3,37/88] linux-user/hppa: Fixes for TARGET_ABI32 [v3,38/88] target/hppa: Drop attempted gdbstub support for hppa64 [v3,39/88] target/hppa: Remove TARGET_HPPA64 [v3,40/88] target/hppa: Decode d for logical instructions [v3,41/88] target/hppa: Decode d for unit instructions [v3,42/88] target/hppa: Decode d for cmpclr instructions [v3,43/88] target/hppa: Decode d for add instructions [v3,44/88] target/hppa: Decode d for sub instructions [v3,45/88] target/hppa: Decode d for bb instructions [v3,46/88] target/hppa: Decode d for cmpb instructions [v3,47/88] target/hppa: Decode CMPIB double-word [v3,48/88] target/hppa: Decode ADDB double-word [v3,49/88] target/hppa: Implement LDD, LDCD, LDDA, STD, STDA [v3,50/88] target/hppa: Implement DEPD, DEPDI [v3,51/88] target/hppa: Implement EXTRD [v3,52/88] target/hppa: Implement SHRPD [v3,53/88] target/hppa: Implement CLRBTS, POPBTS, PUSHBTS, PUSHNOM [v3,54/88] target/hppa: Implement STDBY [v3,55/88] target/hppa: Implement IDTLBT, IITLBT [v3,56/88] hw/hppa: Use uint32_t instead of target_ureg [v3,57/88] target/hppa: Remove TARGET_REGISTER_BITS [v3,58/88] target/hppa: Remove most of the TARGET_REGISTER_BITS redirections [v3,59/88] target/hppa: Remove remaining TARGET_REGISTER_BITS redirections [v3,60/88] target/hppa: Adjust vmstate_env for pa2.0 tlb [v3,61/88] target/hppa: Use tcg_temp_new_i64 not tcg_temp_new [v3,62/88] target/hppa: Replace tcg_gen__tl with tcg_gen__i64 [v3,63/88] target/hppa: Implement HADD [v3,64/88] target/hppa: Implement HSUB [v3,65/88] target/hppa: Implement HAVG [v3,66/88] target/hppa: Implement HSHL, HSHR [v3,67/88] target/hppa: Implement HSHLADD, HSHRADD [v3,68/88] target/hppa: Implement MIXH, MIXW [v3,69/88] target/hppa: Implement PERMH [v3,70/88] target/hppa: Fix interruption based on default PSW [v3,71/88] target/hppa: Precompute zero into DisasContext [v3,72/88] target/hppa: Return zero for r0 from load_gpr [v3,73/88] include/hw/elf: Remove truncating signed casts [v3,74/88] hw/hppa: Translate phys addresses for the cpu [v3,75/88] linux-user/hppa: Drop EXCP_DUMP from handled exceptions [v3,76/88] target/hppa: Implement pa2.0 data prefetch instructions [v3,77/88] target/hppa: Add pa2.0 cpu local tlb flushes [v3,78/88] target/hppa: Avoid async_safe_run_on_cpu on uniprocessor system [v3,79/88] target/hppa: Clear upper bits in mtctl for pa1.x [v3,80/88] target/hppa: Add unwind_breg to CPUHPPAState [v3,81/88] target/hppa: Create raise_exception_with_ior [v3,82/88] target/hppa: Update IIAOQ, IIASQ for pa2.0 [v3,83/88] target/hppa: Improve interrupt logging [v3,84/88] hw/pci-host/astro: Map Astro chip into 64-bit I/O memory region [v3,85/88] hw/pci-host/astro: Trigger CPU irq on CPU HPA in high memory [v3,86/88] hw/hppa: Turn on 64-bit CPU for C3700 machine [v3,87/88] hw/hppa: Allow C3700 with 64-bit and B160L with 32-bit CPU only [v3,88/88] hw/hppa: Map PDC ROM and I/O memory area into lower memory

Message ID

20231102013016.369010-64-richard.henderson@linaro.org

State

Superseded

Headers

Received-SPF: pass (google.com: domain of
 qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as
 permitted sender) client-ip=209.51.188.17;
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Cc: deller@gmx.de
Subject: [PATCH v3 63/88] target/hppa: Implement HADD
Date: Wed,  1 Nov 2023 18:29:51 -0700
Message-Id: <20231102013016.369010-64-richard.henderson@linaro.org>
In-Reply-To: <20231102013016.369010-1-richard.henderson@linaro.org>
References: <20231102013016.369010-1-richard.henderson@linaro.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2607:f8b0:4864:20::535;
 envelope-from=richard.henderson@linaro.org; helo=mail-pg1-x535.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001,
 T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org

Series

target/hppa: Implement hppa64 cpu | expand

Commit Message

Richard Henderson Nov. 2, 2023, 1:29 a.m. UTC

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/hppa/helper.h     |  3 +++
 target/hppa/insns.decode |  8 +++++++-
 target/hppa/op_helper.c  | 32 ++++++++++++++++++++++++++++++++
 target/hppa/translate.c  | 37 +++++++++++++++++++++++++++++++++++++
 4 files changed, 79 insertions(+), 1 deletion(-)

diff --git a/target/hppa/helper.h b/target/hppa/helper.h
index 57ea5447b6..b3c961b50d 100644
--- a/target/hppa/helper.h
+++ b/target/hppa/helper.h
@@ -14,6 +14,9 @@  DEF_HELPER_FLAGS_3(stdby_e_parallel, TCG_CALL_NO_WG, void, env, tl, tl)
 
 DEF_HELPER_FLAGS_1(ldc_check, TCG_CALL_NO_RWG, void, tl)
 
+DEF_HELPER_FLAGS_2(hadd_ss, TCG_CALL_NO_RWG, i64, i64, i64)
+DEF_HELPER_FLAGS_2(hadd_us, TCG_CALL_NO_RWG, i64, i64, i64)
+
 DEF_HELPER_FLAGS_4(probe, TCG_CALL_NO_WG, tl, env, tl, i32, i32)
 
 DEF_HELPER_FLAGS_1(loaded_fr0, TCG_CALL_NO_RWG, void, env)
diff --git a/target/hppa/insns.decode b/target/hppa/insns.decode
index 820049b0c5..4bcfc94b1c 100644
--- a/target/hppa/insns.decode
+++ b/target/hppa/insns.decode
@@ -65,6 +65,7 @@ 
 &ldst           t b x disp sp m scale size
 
 &rr_cf_d        t r cf d
+&rrr            t r1 r2
 &rrr_cf         t r1 r2 cf
 &rrr_cf_d       t r1 r2 cf d
 &rrr_cf_d_sh    t r1 r2 cf d sh
@@ -81,6 +82,7 @@ 
 ####
 
 @rr_cf_d        ...... r:5 ..... cf:4 ...... d:1 t:5    &rr_cf_d
+@rrr            ...... r2:5 r1:5 .... ....... t:5       &rrr
 @rrr_cf         ...... r2:5 r1:5 cf:4 ....... t:5       &rrr_cf
 @rrr_cf_d       ...... r2:5 r1:5 cf:4 ...... d:1 t:5    &rrr_cf_d
 @rrr_cf_d_sh    ...... r2:5 r1:5 cf:4 .... sh:2 d:1 t:5 &rrr_cf_d_sh
@@ -208,6 +210,10 @@  subi_tsv        100101 ..... ..... .... 1 ...........   @rri_cf
 
 cmpiclr         100100 ..... ..... .... . ...........   @rri_cf_d
 
+hadd            000010 ..... ..... 00000011 11 0 .....  @rrr
+hadd_ss         000010 ..... ..... 00000011 01 0 .....  @rrr
+hadd_us         000010 ..... ..... 00000011 00 0 .....  @rrr
+
 ####
 # Index Mem
 ####
@@ -429,7 +435,7 @@  fmpyfadd_d      101110 rm1:5 rm2:5 ... 0 1 ..0 0 0 neg:1 t:5    ra3=%rc32
 
 @f0e_f_3        ...... ..... ..... ... .0 110 ..0 .....    \
                 &fclass3 r1=%ra64 r2=%rb64 t=%rt64
-@f0e_d_3        ...... r1:5  r2:5  ... 01 110 000 t:5
+@f0e_d_3        ...... r1:5  r2:5  ... 01 110 000 t:5      &fclass3
 
 # Floating point class 0
 
diff --git a/target/hppa/op_helper.c b/target/hppa/op_helper.c
index 0bccca1e11..a230a3a0c3 100644
--- a/target/hppa/op_helper.c
+++ b/target/hppa/op_helper.c
@@ -377,3 +377,35 @@  target_ulong HELPER(read_interval_timer)(void)
     return qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL) >> 2;
 #endif
 }
+
+uint64_t HELPER(hadd_ss)(uint64_t r1, uint64_t r2)
+{
+    uint64_t ret = 0;
+
+    for (int i = 0; i < 64; i += 16) {
+        int f1 = sextract64(r1, i, 16);
+        int f2 = sextract64(r2, i, 16);
+        int fr = f1 + f2;
+
+        fr = MIN(fr, INT16_MAX);
+        fr = MAX(fr, INT16_MIN);
+        ret = deposit64(ret, i, 16, fr);
+    }
+    return ret;
+}
+
+uint64_t HELPER(hadd_us)(uint64_t r1, uint64_t r2)
+{
+    uint64_t ret = 0;
+
+    for (int i = 0; i < 64; i += 16) {
+        int f1 = extract64(r1, i, 16);
+        int f2 = sextract64(r2, i, 16);
+        int fr = f1 + f2;
+
+        fr = MIN(fr, UINT16_MAX);
+        fr = MAX(fr, 0);
+        ret = deposit64(ret, i, 16, fr);
+    }
+    return ret;
+}
diff --git a/target/hppa/translate.c b/target/hppa/translate.c
index f570b17ecd..f564aea8fb 100644
--- a/target/hppa/translate.c
+++ b/target/hppa/translate.c
@@ -23,6 +23,7 @@ 
 #include "qemu/host-utils.h"
 #include "exec/exec-all.h"
 #include "tcg/tcg-op.h"
+#include "tcg/tcg-op-gvec.h"
 #include "exec/helper-proto.h"
 #include "exec/helper-gen.h"
 #include "exec/translator.h"
@@ -2767,6 +2768,42 @@  static bool trans_cmpiclr(DisasContext *ctx, arg_rri_cf_d *a)
     return nullify_end(ctx);
 }
 
+static bool do_multimedia(DisasContext *ctx, arg_rrr *a,
+                          void (*fn)(TCGv_i64, TCGv_i64, TCGv_i64))
+{
+    TCGv_i64 r1, r2, dest;
+
+    if (!ctx->is_pa20) {
+        return false;
+    }
+
+    nullify_over(ctx);
+
+    r1 = load_gpr(ctx, a->r1);
+    r2 = load_gpr(ctx, a->r2);
+    dest = dest_gpr(ctx, a->t);
+
+    fn(dest, r1, r2);
+    save_gpr(ctx, a->t, dest);
+
+    return nullify_end(ctx);
+}
+
+static bool trans_hadd(DisasContext *ctx, arg_rrr *a)
+{
+    return do_multimedia(ctx, a, tcg_gen_vec_add16_i64);
+}
+
+static bool trans_hadd_ss(DisasContext *ctx, arg_rrr *a)
+{
+    return do_multimedia(ctx, a, gen_helper_hadd_ss);
+}
+
+static bool trans_hadd_us(DisasContext *ctx, arg_rrr *a)
+{
+    return do_multimedia(ctx, a, gen_helper_hadd_us);
+}
+
 static bool trans_ld(DisasContext *ctx, arg_ldst *a)
 {
     if (!ctx->is_pa20 && a->size > MO_32) {

[v3,63/88] target/hppa: Implement HADD

Commit Message

Patch