[v2,29/32] arm/translate-a64: add FP16 FMOV to simd_mod_imm

Message ID 20180208173157.24705-30-alex.bennee@linaro.org
State New
Headers show
Series
  • Add ARMv8.2 half-precision functions
Related show

Commit Message

Alex Bennée Feb. 8, 2018, 5:31 p.m.
Only one half-precision instruction, FMOV (vector, immediate), has been added to this group.

Signed-off-by: Alex Bennée <alex.bennee@linaro.org>


---
v2
  - checkpatch fixes
---
 target/arm/translate-a64.c | 48 ++++++++++++++++++++++++++++++++++++----------
 1 file changed, 38 insertions(+), 10 deletions(-)

-- 
2.15.1

Comments

Richard Henderson Feb. 9, 2018, 6:23 p.m. | #1
On 02/08/2018 09:31 AM, Alex Bennée wrote:
> Only one half-precision instruction has been added to this group.

> 

> Signed-off-by: Alex Bennée <alex.bennee@linaro.org>

> 

> ---

> v2

>   - checkpatch fixes

> ---

>  target/arm/translate-a64.c | 48 ++++++++++++++++++++++++++++++++++++----------

>  1 file changed, 38 insertions(+), 10 deletions(-)

> 

> diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c

> index fa21299061..b209f57d55 100644

> --- a/target/arm/translate-a64.c

> +++ b/target/arm/translate-a64.c

> @@ -6160,6 +6160,8 @@ static void disas_simd_copy(DisasContext *s, uint32_t insn)

>   *   MVNI - move inverted (shifted) imm into register

>   *   ORR  - bitwise OR of (shifted) imm with register

>   *   BIC  - bitwise clear of (shifted) imm with register

> + * With ARMv8.2 we also have:

> + *   FMOV half-precision

>   */

>  static void disas_simd_mod_imm(DisasContext *s, uint32_t insn)

>  {

> @@ -6176,8 +6178,11 @@ static void disas_simd_mod_imm(DisasContext *s, uint32_t insn)

>      int i;

>  

>      if (o2 != 0 || ((cmode == 0xf) && is_neg && !is_q)) {

> -        unallocated_encoding(s);

> -        return;

> +        /* Check for FMOV (vector, immediate) - half-precision */

> +        if (!(arm_dc_feature(s, ARM_FEATURE_V8_FP16) && o2 && cmode == 0xf)) {

> +            unallocated_encoding(s);

> +            return;

> +        }

>      }

>  

>      if (!fp_access_check(s)) {

> @@ -6235,19 +6240,42 @@ static void disas_simd_mod_imm(DisasContext *s, uint32_t insn)

>                      imm |= 0x4000000000000000ULL;

>                  }

>              } else {

> -                imm = (abcdefgh & 0x3f) << 19;

> -                if (abcdefgh & 0x80) {

> -                    imm |= 0x80000000;

> -                }

> -                if (abcdefgh & 0x40) {

> -                    imm |= 0x3e000000;

> +                if (o2) {

> +                    /* FMOV (vector, immediate) - half-precision

> +                     *

> +                     * We don't need fancy immediate expansion, just:

> +                     * imm16 = imm8<7>:NOT(imm8<6>):Replicate(imm8<6>,2):

> +                     *         imm8<5:0>:Zeros(6);

> +                     */

> +                    uint32_t imm8_5_0 = extract32(abcdefgh, 0, 6);

> +                    uint32_t imm8_6 = extract32(abcdefgh, 6, 1);

> +                    uint32_t imm8_7 = extract32(abcdefgh, 7, 1);

> +                    uint32_t imm8_6_rep = imm8_6 << 1 | imm8_6;

> +                    uint32_t imm8_6_not = ~imm8_6;

> +                    imm = deposit64(imm, 6, 6, imm8_5_0);

> +                    imm = deposit64(imm, 12, 2, imm8_6_rep);

> +                    imm = deposit64(imm, 14, 1, imm8_6_not);

> +                    imm = deposit64(imm, 15, 1, imm8_7);

> +                    /* now duplicate across the lanes */

> +                    imm = bitfield_replicate(imm, 16);

>                  } else {

> -                    imm |= 0x40000000;

> +                    imm = (abcdefgh & 0x3f) << 19;

> +                    if (abcdefgh & 0x80) {

> +                        imm |= 0x80000000;

> +                    }

> +                    if (abcdefgh & 0x40) {

> +                        imm |= 0x3e000000;

> +                    } else {

> +                        imm |= 0x40000000;

> +                    }

> +                    imm |= (imm << 32);

>                  }

> -                imm |= (imm << 32);


Please use vfp_expand_imm(MO_16, abcdefgh), which probably didn't exist when
you first wrote this.


r~

Patch

diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
index fa21299061..b209f57d55 100644
--- a/target/arm/translate-a64.c
+++ b/target/arm/translate-a64.c
@@ -6160,6 +6160,8 @@  static void disas_simd_copy(DisasContext *s, uint32_t insn)
  *   MVNI - move inverted (shifted) imm into register
  *   ORR  - bitwise OR of (shifted) imm with register
  *   BIC  - bitwise clear of (shifted) imm with register
+ * With ARMv8.2 we also have:
+ *   FMOV half-precision
  */
 static void disas_simd_mod_imm(DisasContext *s, uint32_t insn)
 {
@@ -6176,8 +6178,11 @@  static void disas_simd_mod_imm(DisasContext *s, uint32_t insn)
     int i;
 
     if (o2 != 0 || ((cmode == 0xf) && is_neg && !is_q)) {
-        unallocated_encoding(s);
-        return;
+        /* Check for FMOV (vector, immediate) - half-precision */
+        if (!(arm_dc_feature(s, ARM_FEATURE_V8_FP16) && o2 && cmode == 0xf)) {
+            unallocated_encoding(s);
+            return;
+        }
     }
 
     if (!fp_access_check(s)) {
@@ -6235,19 +6240,42 @@  static void disas_simd_mod_imm(DisasContext *s, uint32_t insn)
                     imm |= 0x4000000000000000ULL;
                 }
             } else {
-                imm = (abcdefgh & 0x3f) << 19;
-                if (abcdefgh & 0x80) {
-                    imm |= 0x80000000;
-                }
-                if (abcdefgh & 0x40) {
-                    imm |= 0x3e000000;
+                if (o2) {
+                    /* FMOV (vector, immediate) - half-precision
+                     *
+                     * We don't need fancy immediate expansion, just:
+                     * imm16 = imm8<7>:NOT(imm8<6>):Replicate(imm8<6>,2):
+                     *         imm8<5:0>:Zeros(6);
+                     */
+                    uint32_t imm8_5_0 = extract32(abcdefgh, 0, 6);
+                    uint32_t imm8_6 = extract32(abcdefgh, 6, 1);
+                    uint32_t imm8_7 = extract32(abcdefgh, 7, 1);
+                    uint32_t imm8_6_rep = imm8_6 << 1 | imm8_6;
+                    uint32_t imm8_6_not = ~imm8_6;
+                    imm = deposit64(imm, 6, 6, imm8_5_0);
+                    imm = deposit64(imm, 12, 2, imm8_6_rep);
+                    imm = deposit64(imm, 14, 1, imm8_6_not);
+                    imm = deposit64(imm, 15, 1, imm8_7);
+                    /* now duplicate across the lanes */
+                    imm = bitfield_replicate(imm, 16);
                 } else {
-                    imm |= 0x40000000;
+                    imm = (abcdefgh & 0x3f) << 19;
+                    if (abcdefgh & 0x80) {
+                        imm |= 0x80000000;
+                    }
+                    if (abcdefgh & 0x40) {
+                        imm |= 0x3e000000;
+                    } else {
+                        imm |= 0x40000000;
+                    }
+                    imm |= (imm << 32);
                 }
-                imm |= (imm << 32);
             }
         }
         break;
+    default:
+        fprintf(stderr, "%s: cmode_3_1: %x\n", __func__, cmode_3_1);
+        g_assert_not_reached();
     }
 
     if (cmode_3_1 != 7 && is_neg) {