From patchwork Sun Feb  4 04:11:12 2018
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
X-Patchwork-Submitter: Richard Henderson <richard.henderson@linaro.org>
X-Patchwork-Id: 126793
Delivered-To: patch@linaro.org
Received: by 10.46.124.24 with SMTP id x24csp923319ljc;
 Sat, 3 Feb 2018 20:12:15 -0800 (PST)
X-Google-Smtp-Source: AH8x225nP+y3BoqLu8pBfWMHNAd/heucxuaoOWkcLV1klm6tHN99LkXc70PUE6J1+focOIqvi1/s
X-Received: by 10.37.32.11 with SMTP id g11mr16156803ybg.480.1517717535082; 
 Sat, 03 Feb 2018 20:12:15 -0800 (PST)
ARC-Seal: i=1; a=rsa-sha256; t=1517717535; cv=none;
 d=google.com; s=arc-20160816;
 b=DG5escdqfCxE/P+VvHVjjIAuSKqskRx2pxj0h7DNIFqLmxMabbhIXmZ1Uhk0Bms3mW
 B4O7xNvLeWdpS9I7XH+gihY/Z5BL2otK9AuTlsL6mYC+SRD9QC2Cc0LWyxIqsvT47sPS
 3MPejZH6Pdyzgrl/9yrvVokgW2cxgMFpirSGlUDijpPFWS0P1+YSqtZizu7U3HDxVq3F
 hAUGi2M/rOOQM+J2A9f7TsVuUh/Gs7r+7lUu306y+I7g0lPa+NtNHuy+TNI06FHxg0pI
 qDKtEiD5Re9u4LhPfV2GFgJRmLM56XWFViuaN1P4t6TwoLbOj4jsK29/4fhpUnor/PXV
 N+Bw==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com;
 s=arc-20160816; 
 h=sender:errors-to:cc:list-subscribe:list-help:list-post:list-archive
 :list-unsubscribe:list-id:precedence:subject
 :content-transfer-encoding:mime-version:message-id:date:to:from
 :dkim-signature:arc-authentication-results;
 bh=DKddWh3zOvBJ7FAiwk7TVgMJhvHsgce9QOzM3LuEa1A=;
 b=UVGRcoaKY9Ef0dgcSY+xIYRfb2mqvnx1CNzwJn1uel+6mbL8e4fsh5/QnLCw3/skkg
 APCeg3qb8fZgSHlqlscUYhNkcuFyNf7WS/oeRd/w26MLDTN9HyXf+PxSKGAcEclyno1z
 EWodOamDyZu2jJkWMVa0cos2nkNkpToGLgNuuxPIh5kMxRjW18jWSkYEoem+v0dPoBGf
 TPptM+Z3/+i3M5mvkE7hjKHdrcolPDxHxeoIOvUuJiTtzgMCzaYgfN+Qn6ZXlEGzivvY
 aAjOIW++M+Jb8+l6e8utoyHr5HE8UJbGFd8dKhf8X+XD/1Ew8MQ4nrtp32mMkYIevJ6w
 8zmw==
ARC-Authentication-Results: i=1; mx.google.com;
 dkim=fail header.i=@linaro.org header.s=google header.b=J7FJmiCf;
 spf=pass (google.com: domain of
 qemu-devel-bounces+patch=linaro.org@nongnu.org designates
 2001:4830:134:3::11 as permitted sender)
 smtp.mailfrom=qemu-devel-bounces+patch=linaro.org@nongnu.org; 
 dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=linaro.org
Return-Path: <qemu-devel-bounces+patch=linaro.org@nongnu.org>
Received: from lists.gnu.org (lists.gnu.org. [2001:4830:134:3::11])
 by mx.google.com with ESMTPS id
 m124si1019527ybb.413.2018.02.03.20.12.14 for <patch@linaro.org>
 (version=TLS1 cipher=AES128-SHA bits=128/128);
 Sat, 03 Feb 2018 20:12:15 -0800 (PST)
Received-SPF: pass (google.com: domain of
 qemu-devel-bounces+patch=linaro.org@nongnu.org designates
 2001:4830:134:3::11 as permitted sender)
 client-ip=2001:4830:134:3::11; 
Authentication-Results: mx.google.com;
 dkim=fail header.i=@linaro.org header.s=google header.b=J7FJmiCf;
 spf=pass (google.com: domain of
 qemu-devel-bounces+patch=linaro.org@nongnu.org designates
 2001:4830:134:3::11 as permitted sender)
 smtp.mailfrom=qemu-devel-bounces+patch=linaro.org@nongnu.org; 
 dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=linaro.org
Received: from localhost ([::1]:58234 helo=lists.gnu.org)
 by lists.gnu.org with esmtp (Exim 4.71)
 (envelope-from <qemu-devel-bounces+patch=linaro.org@nongnu.org>)
 id 1eiBf4-0004V7-Cd
 for patch@linaro.org; Sat, 03 Feb 2018 23:12:14 -0500
Received: from eggs.gnu.org ([2001:4830:134:3::10]:47232)
 by lists.gnu.org with esmtp (Exim 4.71)
 (envelope-from <richard.henderson@linaro.org>) id 1eiBed-0004UC-1Y
 for qemu-devel@nongnu.org; Sat, 03 Feb 2018 23:11:48 -0500
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
 (envelope-from <richard.henderson@linaro.org>) id 1eiBeY-0004oP-2d
 for qemu-devel@nongnu.org; Sat, 03 Feb 2018 23:11:47 -0500
Received: from mail-pg0-x232.google.com ([2607:f8b0:400e:c05::232]:45544)
 by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16)
 (Exim 4.71) (envelope-from <richard.henderson@linaro.org>)
 id 1eiBeX-0004oB-Q1
 for qemu-devel@nongnu.org; Sat, 03 Feb 2018 23:11:41 -0500
Received: by mail-pg0-x232.google.com with SMTP id m136so15978715pga.12
 for <qemu-devel@nongnu.org>; Sat, 03 Feb 2018 20:11:41 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; 
 h=from:to:cc:subject:date:message-id:mime-version
 :content-transfer-encoding;
 bh=DKddWh3zOvBJ7FAiwk7TVgMJhvHsgce9QOzM3LuEa1A=;
 b=J7FJmiCf2PZkZZk7NCe4L/Ylmpxh4OxoEpdha/QVFwcZ+v0JanwqlkEuFuSRfojbFP
 +DX4JH0wAE/GVwBIzf1kBh5Yhr4YqClo+OqLd2KXvTcRa/L9CbxR3aS/SJwcW8D8ZLls
 fR9DeI685+5TrzJlEVuGx4hXhrk7thW3hd6eI=
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20161025;
 h=x-gm-message-state:from:to:cc:subject:date:message-id:mime-version
 :content-transfer-encoding;
 bh=DKddWh3zOvBJ7FAiwk7TVgMJhvHsgce9QOzM3LuEa1A=;
 b=UM4itOkELFwuXayTnE/WTjoOXqDZbDtUHiBBD8CKsp8m93oDrhEy5yJiWq39KWWNGY
 A4MIf3xy6FjQhqkIeA2hvBuhu/SD5lyJwju9s4gJx3PPsiXYt2wihoti+DDwtpoZn73X
 5sjhUNhweBYnuR+6vS4SG9DihgIqyJFP1sib0AsATzYPg6txoHBsJLmlWHo5SspKTA+A
 UPRQHVd0JlxoPPJEdoNliNRNLNKsNoE3m65bABglrK8HAtWYpPJyMKaX8acF9l7IBVLl
 8qoFIeom4ztC6FQ5mgLb53Obz2/DmaXBgoVqDhELx1ME/aBpnSVtdY4AiiWbT0O8iFEt
 csYQ==
X-Gm-Message-State: AKwxytfIY+OulN7oqMqILsvPwqOfA+ZqVe1yN8KMMizRMqlRuJ71Rwr4
 665GCfYXC68ulltZgzO4EF3A8VdDwGM=
X-Received: by 10.98.228.5 with SMTP id r5mr45854004pfh.193.1517717499987;
 Sat, 03 Feb 2018 20:11:39 -0800 (PST)
Received: from cloudburst.twiddle.net (174-21-6-47.tukw.qwest.net.
 [174.21.6.47]) by smtp.gmail.com with ESMTPSA id
 k3sm1399425pgr.12.2018.02.03.20.11.38
 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256);
 Sat, 03 Feb 2018 20:11:38 -0800 (PST)
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Date: Sat,  3 Feb 2018 20:11:12 -0800
Message-Id: <20180204041136.17525-1-richard.henderson@linaro.org>
X-Mailer: git-send-email 2.14.3
MIME-Version: 1.0
X-detected-operating-system: by eggs.gnu.org: Genre and OS details not
 recognized.
X-Received-From: 2607:f8b0:400e:c05::232
Subject: [Qemu-devel] [PATCH 00/24] re-factor and add fp16 using glibc soft-fp
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.21
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <http://lists.nongnu.org/archive/html/qemu-devel/>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Cc: peter.maydell@linaro.org, cota@braap.org, alex.bennee@linaro.org,
 hsp.cat7@gmail.com
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>

As discussed on list, the structure and inline function solution that
Alex and I have been writing from scratch introduces a sizeable
performance regression.  Alex and I have done some work earlier
in the week that improved things some, but not enough.

Which leaves us with a bit of a problem.  The were two existing
code bases that we originally considered:

There's softfloat v3, which would need a large structural reorg in
order to be able to handle multiple float_status contexts.  But when
Alex communicated with upstream they weren't ready to accept patches.

Or there's the code from glibc.  I know Peter didn't like the idea;
debugging this code is fairly painful -- the massive preprocessor
macros mean that you can't step through anything.  But at least we
have a good relationship with glibc, so merging patches back and
forth should be easy.

The result seems to perform slightly better than mainline.
With an aarch64 guest and a i7-8550U host, nbench gives

- FLOATING-POINT INDEX: 3.095
+ FLOATING-POINT INDEX: 3.438

I've also run this through my usual set of aarch64 RISU tests.

Thoughts?


r~


Alex Bennée (9):
  fpu/softfloat: implement float16_squash_input_denormal
  include/fpu/softfloat: remove USE_SOFTFLOAT_STRUCT_TYPES
  fpu/softfloat-types: new header to prevent excessive re-builds
  target/*/cpu.h: remove softfloat.h
  include/fpu/softfloat: implement float16_abs helper
  include/fpu/softfloat: implement float16_chs helper
  include/fpu/softfloat: implement float16_set_sign helper
  include/fpu/softfloat: add some float16 constants
  fpu/softfloat: improve comments on ARM NaN propagation

Richard Henderson (15):
  fpu/soft-fp: Import soft-fp from glibc
  fpu/soft-fp: Adjust soft-fp types
  fpu/soft-fp: Add ties_away and to_odd rounding modes
  fpu/soft-fp: Add arithmetic macros to half.h
  fpu/soft-fp: Adjust _FP_CMP_CHECK_NAN
  fpu: Implement add/sub/mul/div with soft-fp.h
  fpu: Implement float_to_int/uint with soft-fp.h
  fpu: Implement int/uint_to_float with soft-fp.h
  fpu: Implement compares with soft-fp.h
  fpu: Implement min/max with soft-fp.h
  fpu: Implement sqrt with soft-fp.h
  fpu: Implement scalbn with soft-fp.h
  fpu: Implement float_to_float with soft-fp.h
  fpu: Implement muladd with soft-fp.h
  fpu: Implement round_to_int with soft-fp.h

 Makefile.target                 |    5 +
 fpu/double.h                    |  321 +++
 fpu/half.h                      |  180 ++
 fpu/op-1.h                      |  369 +++
 fpu/op-2.h                      |  705 ++++++
 fpu/op-4.h                      |  875 +++++++
 fpu/op-8.h                      |    1 +
 fpu/op-common.h                 | 2154 +++++++++++++++++
 fpu/quad.h                      |  328 +++
 fpu/sfp-machine.h               |  222 ++
 fpu/single.h                    |  197 ++
 fpu/soft-fp-specialize.h        |  254 ++
 fpu/soft-fp.h                   |  379 +++
 fpu/softfloat-specialize.h      |  273 +--
 include/fpu/softfloat-types.h   |  179 ++
 include/fpu/softfloat.h         |  254 +-
 include/qemu/bswap.h            |    2 +-
 target/alpha/cpu.h              |    2 -
 target/arm/cpu.h                |    2 -
 target/hppa/cpu.h               |    1 -
 target/i386/cpu.h               |    4 -
 target/m68k/cpu.h               |    1 -
 target/microblaze/cpu.h         |    2 +-
 target/moxie/cpu.h              |    1 -
 target/nios2/cpu.h              |    1 -
 target/openrisc/cpu.h           |    1 -
 target/ppc/cpu.h                |    1 -
 target/s390x/cpu.h              |    2 -
 target/sh4/cpu.h                |    2 -
 target/sparc/cpu.h              |    2 -
 target/tricore/cpu.h            |    1 -
 target/unicore32/cpu.h          |    1 -
 target/xtensa/cpu.h             |    1 -
 fpu/float128.c                  |   35 +
 fpu/float16.c                   |   43 +
 fpu/float32.c                   |   35 +
 fpu/float64.c                   |   35 +
 fpu/floatconv.c                 |  154 ++
 fpu/floatxx.inc.c               |  541 +++++
 fpu/softfloat.c                 | 5092 +--------------------------------------
 target/arm/cpu.c                |    1 +
 target/arm/helper-a64.c         |    1 +
 target/arm/helper.c             |    1 +
 target/arm/neon_helper.c        |    1 +
 target/hppa/cpu.c               |    1 +
 target/hppa/op_helper.c         |    1 +
 target/i386/fpu_helper.c        |    1 +
 target/m68k/cpu.c               |    2 +-
 target/m68k/fpu_helper.c        |    1 +
 target/m68k/helper.c            |    1 +
 target/m68k/translate.c         |    2 +
 target/microblaze/cpu.c         |    1 +
 target/microblaze/op_helper.c   |    1 +
 target/openrisc/fpu_helper.c    |    1 +
 target/ppc/fpu_helper.c         |    1 +
 target/ppc/int_helper.c         |    1 +
 target/ppc/translate_init.c     |    1 +
 target/s390x/cpu.c              |    1 +
 target/s390x/fpu_helper.c       |    1 +
 target/sh4/cpu.c                |    1 +
 target/sh4/op_helper.c          |    1 +
 target/sparc/fop_helper.c       |    1 +
 target/tricore/fpu_helper.c     |    1 +
 target/tricore/helper.c         |    1 +
 target/unicore32/cpu.c          |    1 +
 target/unicore32/ucf64_helper.c |    1 +
 target/xtensa/op_helper.c       |    1 +
 67 files changed, 7184 insertions(+), 5503 deletions(-)
 create mode 100644 fpu/double.h
 create mode 100644 fpu/half.h
 create mode 100644 fpu/op-1.h
 create mode 100644 fpu/op-2.h
 create mode 100644 fpu/op-4.h
 create mode 100644 fpu/op-8.h
 create mode 100644 fpu/op-common.h
 create mode 100644 fpu/quad.h
 create mode 100644 fpu/sfp-machine.h
 create mode 100644 fpu/single.h
 create mode 100644 fpu/soft-fp-specialize.h
 create mode 100644 fpu/soft-fp.h
 create mode 100644 include/fpu/softfloat-types.h
 create mode 100644 fpu/float128.c
 create mode 100644 fpu/float16.c
 create mode 100644 fpu/float32.c
 create mode 100644 fpu/float64.c
 create mode 100644 fpu/floatconv.c
 create mode 100644 fpu/floatxx.inc.c

-- 
2.14.3