[v3,11/22] fpu/softfloat: define decompose structures

Series: re-factor softfloat and add fp16 functions
[v3,00/22] re-factor softfloat and add fp16 functions
[v3,01/22] fpu/softfloat: implement float16_squash_input_denormal
[v3,02/22] include/fpu/softfloat: remove USE_SOFTFLOAT_STRUCT_TYPES
[v3,03/22] fpu/softfloat-types: new header to prevent excessive re-builds
[v3,04/22] target/*/cpu.h: remove softfloat.h
[v3,05/22] include/fpu/softfloat: implement float16_abs helper
[v3,06/22] include/fpu/softfloat: implement float16_chs helper
[v3,07/22] include/fpu/softfloat: implement float16_set_sign helper
[v3,08/22] include/fpu/softfloat: add some float16 constants
[v3,09/22] fpu/softfloat: improve comments on ARM NaN propagation
[v3,10/22] fpu/softfloat: move the extract functions to the top of the file
[v3,11/22] fpu/softfloat: define decompose structures
[v3,12/22] fpu/softfloat: re-factor add/sub
[v3,13/22] fpu/softfloat: re-factor mul
[v3,14/22] fpu/softfloat: re-factor div
[v3,15/22] fpu/softfloat: re-factor muladd
[v3,16/22] fpu/softfloat: re-factor round_to_int
[v3,17/22] fpu/softfloat: re-factor float to int/uint
[v3,18/22] fpu/softfloat: re-factor int/uint to float
[v3,19/22] fpu/softfloat: re-factor scalbn
[v3,20/22] fpu/softfloat: re-factor minmax
[v3,21/22] fpu/softfloat: re-factor compare
[v3,22/22] fpu/softfloat: re-factor sqrt

Message ID

20180124131315.30567-12-alex.bennee@linaro.org

State

Superseded

Headers

Received-SPF: pass (google.com: domain of
	qemu-devel-bounces+patch=linaro.org@nongnu.org designates
	2001:4830:134:3::11 as permitted sender)
	client-ip=2001:4830:134:3::11; 
From: =?utf-8?q?Alex_Benn=C3=A9e?= <alex.bennee@linaro.org>
To: richard.henderson@linaro.org, peter.maydell@linaro.org, laurent@vivier.eu,
	bharata@linux.vnet.ibm.com, andrew@andrewdutcher.com
Date: Wed, 24 Jan 2018 13:13:04 +0000
Message-Id: <20180124131315.30567-12-alex.bennee@linaro.org>
In-Reply-To: <20180124131315.30567-1-alex.bennee@linaro.org>
References: <20180124131315.30567-1-alex.bennee@linaro.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Subject: [Qemu-devel] [PATCH v3 11/22] fpu/softfloat: define decompose
	structures
Precedence: list
Cc: =?utf-8?q?Alex_Benn=C3=A9e?= <alex.bennee@linaro.org>,
	qemu-devel@nongnu.org, Aurelien Jarno <aurelien@aurel32.net>
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>

Commit Message

Alex Bennée Jan. 24, 2018, 1:13 p.m. UTC

These structures pave the way for generic softfloat helper routines
that will operate on fully decomposed numbers.

Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

---
v3
 - comment box style
 - CamelCase structs
 - hide DECOMPOSED_BINARY_POINT - frac in macro
 - more comments
 - add exp_size, frac_size to FloatFmt
 - compute exp_bias and exp_max from FLOAT_PARAMS
 - remove include bitops (in next patch)
---
 fpu/softfloat.c | 86 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++-
 1 file changed, 85 insertions(+), 1 deletion(-)

-- 
2.15.1

Comments

Philippe Mathieu-Daudé Jan. 24, 2018, 2:22 p.m. UTC | #1

On 01/24/2018 10:13 AM, Alex Bennée wrote:
> These structures pave the way for generic softfloat helper routines
> that will operate on fully decomposed numbers.

I have to say this patch in particular is very elegant (seeing how it
simplifies the later refactors). I suppose you had a long brainstorming
session beforehand...

Total-brain-hours-spent: 141
> Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>



diff --git a/fpu/softfloat.c b/fpu/softfloat.c
index 297e48f5c9..568d555595 100644
--- a/fpu/softfloat.c
+++ b/fpu/softfloat.c
@@ -83,7 +83,6 @@  this code that are retained.
  * target-dependent and needs the TARGET_* macros.
  */
 #include "qemu/osdep.h"
-
 #include "fpu/softfloat.h"
 
 /* We only need stdlib for abort() */
@@ -186,6 +185,91 @@  static inline flag extractFloat64Sign(float64 a)
     return float64_val(a) >> 63;
 }
 
+/*
+ * Classify a floating point number. Everything from float_class_qnan
+ * upward is a NaN, so cls >= float_class_qnan matches any NaN.
+ */
+
+typedef enum __attribute__ ((__packed__)) {
+    float_class_unclassified,
+    float_class_zero,
+    float_class_normal,
+    float_class_inf,
+    float_class_qnan,  /* all NaNs from here */
+    float_class_snan,
+    float_class_dnan,
+    float_class_msnan, /* maybe silenced */
+} FloatClass;
+
+/*
+ * Structure holding all of the decomposed parts of a float. The
+ * exponent is unbiased and the fraction is normalized. All
+ * calculations are done with a 64 bit fraction and then rounded as
+ * appropriate for the final format.
+ *
+ * Thanks to the packed FloatClass a decent compiler should be able to
+ * fit the whole structure into registers and avoid using the stack
+ * for parameter passing.
+ */
+
+typedef struct {
+    uint64_t frac;
+    int32_t  exp;
+    FloatClass cls;
+    bool sign;
+} FloatParts;
+
+#define DECOMPOSED_BINARY_POINT    (64 - 2)
+#define DECOMPOSED_IMPLICIT_BIT    (1ull << DECOMPOSED_BINARY_POINT)
+#define DECOMPOSED_OVERFLOW_BIT    (DECOMPOSED_IMPLICIT_BIT << 1)
+
+/* Structure holding all of the relevant parameters for a format.
+ *   exp_size: the size of the exponent field
+ *   exp_bias: the offset applied to the exponent field
+ *   exp_max: the maximum normalised exponent
+ *   frac_size: the size of the fraction field
+ *   frac_shift: shift to normalise the fraction with DECOMPOSED_BINARY_POINT
+ * The following are computed based on the size of the fraction
+ *   frac_lsb: least significant bit of fraction
+ *   frac_lsbm1: the bit below the least significant bit (for rounding)
+ *   round_mask/roundeven_mask: masks used for rounding
+ */
+typedef struct {
+    int exp_size;
+    int exp_bias;
+    int exp_max;
+    int frac_size;
+    int frac_shift;
+    uint64_t frac_lsb;
+    uint64_t frac_lsbm1;
+    uint64_t round_mask;
+    uint64_t roundeven_mask;
+} FloatFmt;
+
+/* Expand fields based on the size of exponent and fraction */
+#define FLOAT_PARAMS(E, F)                                           \
+    .exp_size       = E,                                             \
+    .exp_bias       = ((1 << E) - 1) >> 1,                           \
+    .exp_max        = (1 << E) - 1,                                  \
+    .frac_size      = F,                                             \
+    .frac_shift     = DECOMPOSED_BINARY_POINT - F,                   \
+    .frac_lsb       = 1ull << (DECOMPOSED_BINARY_POINT - F),         \
+    .frac_lsbm1     = 1ull << ((DECOMPOSED_BINARY_POINT - F) - 1),   \
+    .round_mask     = (1ull << (DECOMPOSED_BINARY_POINT - F)) - 1,   \
+    .roundeven_mask = (2ull << (DECOMPOSED_BINARY_POINT - F)) - 1
+
+static const FloatFmt float16_params = {
+    FLOAT_PARAMS(5, 10)
+};
+
+static const FloatFmt float32_params = {
+    FLOAT_PARAMS(8, 23)
+};
+
+static const FloatFmt float64_params = {
+    FLOAT_PARAMS(11, 52)
+};
+
 /*----------------------------------------------------------------------------
 | Takes a 64-bit fixed-point value `absZ' with binary point between bits 6
 | and 7, and returns the properly rounded 32-bit integer corresponding to the
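
The following is a minimal sketch, not part of the patch, of what FLOAT_PARAMS expands to and how a decomposed value might be built from a raw float32 bit pattern. FloatFmt, FLOAT_PARAMS and the DECOMPOSED_* macros are copied from the hunk above. DemoParts and demo_unpack_normal() are hypothetical names invented for illustration: the sketch handles only normal numbers and omits the FloatClass field, so it should not be read as the unpack routine the later patches in the series add.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Copied from the patch: position of the binary point in the 64-bit fraction. */
#define DECOMPOSED_BINARY_POINT    (64 - 2)
#define DECOMPOSED_IMPLICIT_BIT    (1ull << DECOMPOSED_BINARY_POINT)

/* Copied from the patch: per-format parameters. */
typedef struct {
    int exp_size;
    int exp_bias;
    int exp_max;
    int frac_size;
    int frac_shift;
    uint64_t frac_lsb;
    uint64_t frac_lsbm1;
    uint64_t round_mask;
    uint64_t roundeven_mask;
} FloatFmt;

#define FLOAT_PARAMS(E, F)                                           \
    .exp_size       = E,                                             \
    .exp_bias       = ((1 << E) - 1) >> 1,                           \
    .exp_max        = (1 << E) - 1,                                  \
    .frac_size      = F,                                             \
    .frac_shift     = DECOMPOSED_BINARY_POINT - F,                   \
    .frac_lsb       = 1ull << (DECOMPOSED_BINARY_POINT - F),         \
    .frac_lsbm1     = 1ull << ((DECOMPOSED_BINARY_POINT - F) - 1),   \
    .round_mask     = (1ull << (DECOMPOSED_BINARY_POINT - F)) - 1,   \
    .roundeven_mask = (2ull << (DECOMPOSED_BINARY_POINT - F)) - 1

static const FloatFmt float16_params = { FLOAT_PARAMS(5, 10) };
static const FloatFmt float32_params = { FLOAT_PARAMS(8, 23) };
static const FloatFmt float64_params = { FLOAT_PARAMS(11, 52) };

/* Hypothetical, simplified stand-in for FloatParts (no class field). */
typedef struct {
    uint64_t frac;
    int32_t  exp;
    bool     sign;
} DemoParts;

/*
 * Hypothetical helper: decompose the raw bits of a *normal* float32.
 * Zero, denormal, infinity and NaN inputs are rejected by the assert;
 * a real implementation would classify them instead.
 */
static DemoParts demo_unpack_normal(uint32_t bits, const FloatFmt *fmt)
{
    uint32_t exp_field = (bits >> fmt->frac_size) & ((1u << fmt->exp_size) - 1);
    uint64_t frac_field = bits & ((1ull << fmt->frac_size) - 1);
    DemoParts p;

    assert(exp_field != 0 && exp_field != (uint32_t)fmt->exp_max);

    p.sign = bits >> (fmt->frac_size + fmt->exp_size);
    /* Remove the bias so the exponent is a plain signed power of two. */
    p.exp = (int32_t)exp_field - fmt->exp_bias;
    /* Align the fraction below the binary point and add the implicit 1. */
    p.frac = DECOMPOSED_IMPLICIT_BIT | (frac_field << fmt->frac_shift);
    return p;
}
```

With these definitions, FLOAT_PARAMS(8, 23) gives exp_bias 127, exp_max 255 and frac_shift 39. Passing the bit pattern of 1.0f (0x3F800000) to demo_unpack_normal() yields sign false, exp 0 and frac equal to DECOMPOSED_IMPLICIT_BIT. Because every format is shifted to the same binary point, a generic helper can work on the 64-bit fraction without knowing which format the value came from.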
