From patchwork Fri Nov 29 13:17:42 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Adhemerval Zanella Netto X-Patchwork-Id: 846137 Delivered-To: patch@linaro.org Received: by 2002:adf:f2c4:0:b0:382:43a8:7b94 with SMTP id d4csp866856wrp; Fri, 29 Nov 2024 05:39:51 -0800 (PST) X-Forwarded-Encrypted: i=3; AJvYcCW+FCNdGsxKmbqvZm8pmIbGS3Qfdb07NuvSPL1MXvVD9V7+2ko8HhH7aYZMCH4mKECXO4lXbQ==@linaro.org X-Google-Smtp-Source: AGHT+IHOCYmulyrSXrAo/0CYDhX9YAbcR2gazESG/Ux5W1lNmdoiQUCVpfa6a6Ra1I/mjAfLrAgG X-Received: by 2002:a05:620a:4807:b0:7b6:62f9:109b with SMTP id af79cd13be357-7b67c437504mr1523705285a.42.1732887590989; Fri, 29 Nov 2024 05:39:50 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1732887590; cv=pass; d=google.com; s=arc-20240605; b=h4cL2epItsj7tt/IbJ1quktS2A2baGMywRm7y8X2i9DR/OVnWIQZCybcUtWTeGs+aY koINofZFx7dR/Np81tU3hnnn2gQHxStTDA6eMwIOOAC9MKY1ILH17KZABDGz/LL1fhnN In2/UihFPt75rRWZX8AXfLeEI9qVnWhvQvMBE0YTSZ1rFn7gzac2l5Kcr7MB7a5Zv8gX 7rHQKjejAmbspabz7p0OuSbpmF0jI2feAODhg5zL6zZZ20etAWQ+MlKg2ZmWwxdQ9OgV fHV+t27XQPX+5/YaDTzE32QNL0Lxpdvf5+3CYgdcNVQh0ilimLLVGK2FybjULC38/OCr OBpg== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20240605; h=errors-to:list-subscribe:list-help:list-post:list-archive :list-unsubscribe:list-id:precedence:content-transfer-encoding :mime-version:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature:dkim-filter:arc-filter:dmarc-filter :delivered-to:dkim-filter; bh=64qVRHyOb+22SiTt3KZgjBYFXldw7oeO2ddz4P62H+U=; fh=sFucH9KQW8Y8eMoQXaNIgycLDa7roysdjTHpHLIprh4=; b=D8X8Fatb9U9ahvrHvu8t6KTIyk9MKcKDDxIf0pPwcS4AfVB04EsSfpguRDX4I4pVKJ tYBKHtwnT8u8RgdFr7CHu+f8vZvZPsbskucWS1ttNcZkDdM5M2iO/Dj9CR82MMxgJqsi ecz3DxD8lCvrZHGoUC8dfCsL9MIAZyXlKhvlWzo3lexXVWFowfOkIUWxKfJ9jESzpvRJ 9xCBuwuV/dxp4PmEIf8ZlC3zj8ecfhaNistu473xGLPrSdTOsFc1AWe99UaeJz04ne8w lTLqsjstfa2+jCob3X1l6wf0ak8GJdYwuGcGM2rreU3OwOBoGz322Ozks2UZTxVw/zlI YtAw==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=m6CTN4Br; arc=pass (i=1); spf=pass (google.com: domain of libc-alpha-bounces~patch=linaro.org@sourceware.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) smtp.mailfrom="libc-alpha-bounces~patch=linaro.org@sourceware.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from server2.sourceware.org (server2.sourceware.org. [2620:52:3:1:0:246e:9693:128c]) by mx.google.com with ESMTPS id af79cd13be357-7b6849d727bsi433468185a.486.2024.11.29.05.39.50 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 29 Nov 2024 05:39:50 -0800 (PST) Received-SPF: pass (google.com: domain of libc-alpha-bounces~patch=linaro.org@sourceware.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) client-ip=2620:52:3:1:0:246e:9693:128c; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=m6CTN4Br; arc=pass (i=1); spf=pass (google.com: domain of libc-alpha-bounces~patch=linaro.org@sourceware.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) smtp.mailfrom="libc-alpha-bounces~patch=linaro.org@sourceware.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 78CC83858CD9 for ; Fri, 29 Nov 2024 13:39:50 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 78CC83858CD9 Authentication-Results: sourceware.org; dkim=pass (2048-bit key, unprotected) header.d=linaro.org header.i=@linaro.org header.a=rsa-sha256 header.s=google header.b=m6CTN4Br X-Original-To: libc-alpha@sourceware.org Delivered-To: libc-alpha@sourceware.org Received: from mail-pl1-x644.google.com (mail-pl1-x644.google.com [IPv6:2607:f8b0:4864:20::644]) by sourceware.org (Postfix) with ESMTPS id 455183858C98 for ; Fri, 29 Nov 2024 13:21:31 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 455183858C98 Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=linaro.org Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=linaro.org ARC-Filter: OpenARC Filter v1.0.0 sourceware.org 455183858C98 Authentication-Results: server2.sourceware.org; arc=none smtp.remote-ip=2607:f8b0:4864:20::644 ARC-Seal: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1732886491; cv=none; b=eFz9zQMI6hqazB/lxPSfaMJFBp+dof6R0TOAUngKxYpktQs4+C0hxaePMh/v5ZEJwZ6VRdIRjeenjFRVTaholXHGj4AEXghBlJALlNq0nK61UVHNI67m5C4utkecIm1n7uMVPhhGL7eBGJHSVqSVd8Iwa/VQWzj2i5wi+rkdQ+8= ARC-Message-Signature: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1732886491; c=relaxed/simple; bh=AgAuR94MKYn8/Vr+2cAxAT2PlAWawqlRdYmiJ0wOTWs=; h=DKIM-Signature:From:To:Subject:Date:Message-ID:MIME-Version; b=EtHRy2vzg5ZP3KwJkj1ONfbk+jDmpyA1T2K5BFAquBK8CCjsz6T1Cdm4Crr/VV8x60TwkRv1J6kTQ4NTWeUD/+zAmhIg4mVLVvt3zyoC2uV48gcwJD8ue0cPUJpFi88mJ81NGRwWZ8tzIMogfE7imQ3JQG1mZRH9zINN65YBDl8= ARC-Authentication-Results: i=1; server2.sourceware.org DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 455183858C98 Received: by mail-pl1-x644.google.com with SMTP id d9443c01a7336-21288ce11d7so16174735ad.2 for ; Fri, 29 Nov 2024 05:21:31 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; t=1732886490; x=1733491290; darn=sourceware.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=64qVRHyOb+22SiTt3KZgjBYFXldw7oeO2ddz4P62H+U=; b=m6CTN4Br+EJj/DHtiFiVcTi2AR5wNiWdxuQ8Xy6VAAJrrHFcr5N3AOr+nqCHMHnIBq kFB6VTmRNN7+N1NY26Cz62WWB/rJmKb1hkWkXmBNYHxqSuuhdCiSqeCm/heiZvbYqrdi lClEZgO7H1E20ukKInXZaB3isApxstjFbRQ2ehlnHiOzJ5TaRskfpsxhQRpOSz9DTryn MeaLNMNAs+XvssxOYRFo95MPZKdGBx1zCWk37dp59oisoa7v5UndHuedKGVHN33VhmiA wQEYADq8gvLa8KseiXt3p+uhO/hVB39hnsUyPURA4p6ix1Aw+SK3DXtllJgp93JbOKsb pyUQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1732886490; x=1733491290; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=64qVRHyOb+22SiTt3KZgjBYFXldw7oeO2ddz4P62H+U=; b=Vl02X9myA9Xjt8y3F5Jqk6da39baDEFbI0maq4jttR5TH6qW7mFktTLmuagrXzi/q0 STAvd10DMs5ih/kHqbVA0qLHBIUUolgaiEyKyOXEuZyMC6jsssTohQlFPYQdqYl0ga4m 0nlrObAyr9h97mUGoF0LIqFCuX7V7IDLSbbeopQdjPy9GNuMJY7QARA3WftnnaMoxf+i HeGbAKr74RJxDw8r2uQvo54bOap4C8AFOuLsBLZc1rWAqB1Kj9QC0LDKdT4OJG2T4e+I Ov2zFKyCgLwqXhZr41WZJrkR3v21mdulCVbpFXeNUIZuQJeivKp6UbdqJeYrp1KMvkVP M7CQ== X-Gm-Message-State: AOJu0YztAWm8N3yXjfOWperZl105SkaJ0Ca/DipBBc+N7KuKtXATlLsB 6fga9TCIhdJe3SfC9hyJTrqq/y3uGdErb2M2mzlYa3TOpRkYgXedSMNYGhl3CbrxgzmScv6RzDj h9zy1krJs X-Gm-Gg: ASbGncvEqiHkkBWR/VHIHzEfX/LGlAJIU82cCfMTb7OxybA9O8BtXlswVa9UPgLLN33 K0QQp7sABHUieEFmWOzhlViWnfFiwAyluZpvwINHre6GfTLvW9KDdSbddyuAUmADMHN01LDfJ8D Cmz4m3/T4BINnazF7dF4N14GdOmGXFV4TGHUfSfPXA2FtdWmeku2I4lN60xwDSYBVTmk0jhlDI4 ulhFMDzQzsRLrGl5/PAMUR8R7dq0M2QPjkdf35Kt9NhbVwOI6lZDiLyX/14Yl0= X-Received: by 2002:a17:903:187:b0:20c:9326:559 with SMTP id d9443c01a7336-21501857d56mr121764785ad.29.1732886489636; Fri, 29 Nov 2024 05:21:29 -0800 (PST) Received: from mandiga.. ([2804:1b3:a7c1:68c8:3143:6603:ad16:715e]) by smtp.gmail.com with ESMTPSA id d9443c01a7336-2153d5f66d5sm14472255ad.201.2024.11.29.05.21.27 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 29 Nov 2024 05:21:29 -0800 (PST) From: Adhemerval Zanella To: libc-alpha@sourceware.org Cc: DJ Delorie , Alexei Sibidanov , Paul Zimmermann Subject: [PATCH 18/23] math: Use atanf from CORE-MATH Date: Fri, 29 Nov 2024 10:17:42 -0300 Message-ID: <20241129132032.476978-19-adhemerval.zanella@linaro.org> X-Mailer: git-send-email 2.43.0 In-Reply-To: <20241129132032.476978-1-adhemerval.zanella@linaro.org> References: <20241129132032.476978-1-adhemerval.zanella@linaro.org> MIME-Version: 1.0 X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.30 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: libc-alpha-bounces~patch=linaro.org@sourceware.org The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows slight better performance to the generic atanf. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 56.8265 53.6842 5.53% x86_64v2 54.8177 53.6842 2.07% x86_64v3 46.2915 48.7034 -5.21% i686 158.3760 108.9560 31.20% aarch64 (Neoverse) 21.687 20.5893 5.06% power10 13.1903 13.5012 -2.36% reciprocal-throughput master patched improvement x86_64 16.6787 16.7601 -0.49% x86_64v2 16.6983 16.7601 -0.37% x86_64v3 16.2268 12.1391 25.19% i686 138.6840 36.0640 74.00% aarch64 (Neoverse) 11.8012 10.3565 12.24% power10 5.3212 4.2894 19.39% Signed-off-by: Alexei Sibidanov Signed-off-by: Paul Zimmermann Signed-off-by: Adhemerval Zanella --- SHARED-FILES | 4 + sysdeps/aarch64/libm-test-ulps | 8 +- sysdeps/alpha/fpu/libm-test-ulps | 4 - sysdeps/arc/fpu/libm-test-ulps | 4 - sysdeps/arc/nofpu/libm-test-ulps | 1 - sysdeps/arm/libm-test-ulps | 8 +- sysdeps/csky/fpu/libm-test-ulps | 4 - sysdeps/csky/nofpu/libm-test-ulps | 4 - sysdeps/hppa/fpu/libm-test-ulps | 4 - sysdeps/i386/fpu/libm-test-ulps | 3 - sysdeps/i386/fpu/s_atanf.S | 30 --- .../i386/i686/fpu/multiarch/libm-test-ulps | 3 - sysdeps/ieee754/flt-32/s_atanf.c | 186 +++++++++--------- sysdeps/loongarch/lp64/libm-test-ulps | 8 +- sysdeps/microblaze/libm-test-ulps | 1 - sysdeps/mips/mips32/libm-test-ulps | 4 - sysdeps/mips/mips64/libm-test-ulps | 4 - sysdeps/or1k/fpu/libm-test-ulps | 4 - sysdeps/or1k/nofpu/libm-test-ulps | 4 - sysdeps/powerpc/fpu/libm-test-ulps | 8 +- sysdeps/powerpc/nofpu/libm-test-ulps | 4 - sysdeps/riscv/nofpu/libm-test-ulps | 4 - sysdeps/riscv/rvd/libm-test-ulps | 4 - sysdeps/s390/fpu/libm-test-ulps | 4 - sysdeps/sh/libm-test-ulps | 2 - sysdeps/sparc/fpu/libm-test-ulps | 4 - sysdeps/x86_64/fpu/libm-test-ulps | 8 +- 27 files changed, 109 insertions(+), 217 deletions(-) delete mode 100644 sysdeps/i386/fpu/s_atanf.S diff --git a/SHARED-FILES b/SHARED-FILES index 18b3244e44..b9627afdfe 100644 --- a/SHARED-FILES +++ b/SHARED-FILES @@ -310,3 +310,7 @@ sysdeps/ieee754/flt-32/s_asinhf.c: (src/binary32/asinh/asinhf.c in CORE-MATH) - The code was adapted to use glibc code style and internal functions to handle errno, overflow, and underflow. +sysdeps/ieee754/flt-32/s_atanf.c: + (src/binary32/atan/atanf.c in CORE-MATH) + - The code was adapted to use glibc code style and internal + functions to handle errno, overflow, and underflow. diff --git a/sysdeps/aarch64/libm-test-ulps b/sysdeps/aarch64/libm-test-ulps index 5e17a4b2c3..44934af245 100644 --- a/sysdeps/aarch64/libm-test-ulps +++ b/sysdeps/aarch64/libm-test-ulps @@ -99,7 +99,6 @@ ldouble: 4 Function: "atan": double: 1 -float: 1 ldouble: 1 Function: "atan2": @@ -135,7 +134,6 @@ float: 1 Function: "atan_downward": double: 1 -float: 2 ldouble: 2 Function: "atan_sve": @@ -144,12 +142,10 @@ float: 1 Function: "atan_towardzero": double: 1 -float: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 ldouble: 2 Function: "atanh": @@ -218,7 +214,7 @@ ldouble: 6 Function: Real part of "cacos_towardzero": double: 3 -float: 2 +float: 3 ldouble: 3 Function: Imaginary part of "cacos_towardzero": @@ -263,7 +259,7 @@ ldouble: 5 Function: Imaginary part of "cacosh_towardzero": double: 3 -float: 2 +float: 3 ldouble: 3 Function: Real part of "cacosh_upward": diff --git a/sysdeps/alpha/fpu/libm-test-ulps b/sysdeps/alpha/fpu/libm-test-ulps index 708299915d..f9c1cf7cf5 100644 --- a/sysdeps/alpha/fpu/libm-test-ulps +++ b/sysdeps/alpha/fpu/libm-test-ulps @@ -67,7 +67,6 @@ ldouble: 4 Function: "atan": double: 1 -float: 1 ldouble: 1 Function: "atan2": @@ -91,17 +90,14 @@ ldouble: 2 Function: "atan_downward": double: 1 -float: 2 ldouble: 2 Function: "atan_towardzero": double: 1 -float: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 ldouble: 2 Function: "atanh": diff --git a/sysdeps/arc/fpu/libm-test-ulps b/sysdeps/arc/fpu/libm-test-ulps index 1c34bd36d6..37b0efae66 100644 --- a/sysdeps/arc/fpu/libm-test-ulps +++ b/sysdeps/arc/fpu/libm-test-ulps @@ -51,7 +51,6 @@ double: 3 Function: "atan": double: 1 -float: 1 Function: "atan2": double: 7 @@ -71,15 +70,12 @@ float: 2 Function: "atan_downward": double: 1 -float: 2 Function: "atan_towardzero": double: 1 -float: 1 Function: "atan_upward": double: 2 -float: 2 Function: "atanh": double: 2 diff --git a/sysdeps/arc/nofpu/libm-test-ulps b/sysdeps/arc/nofpu/libm-test-ulps index 58fc499f53..8d283f0627 100644 --- a/sysdeps/arc/nofpu/libm-test-ulps +++ b/sysdeps/arc/nofpu/libm-test-ulps @@ -15,7 +15,6 @@ double: 2 Function: "atan": double: 1 -float: 1 Function: "atan2": float: 2 diff --git a/sysdeps/arm/libm-test-ulps b/sysdeps/arm/libm-test-ulps index a20cb5bcc3..bb4ee0f2e4 100644 --- a/sysdeps/arm/libm-test-ulps +++ b/sysdeps/arm/libm-test-ulps @@ -51,7 +51,6 @@ double: 3 Function: "atan": double: 1 -float: 1 Function: "atan2": float: 2 @@ -70,15 +69,12 @@ float: 2 Function: "atan_downward": double: 1 -float: 2 Function: "atan_towardzero": double: 1 -float: 1 Function: "atan_upward": double: 1 -float: 2 Function: "atanh": double: 2 @@ -126,7 +122,7 @@ float: 3 Function: Real part of "cacos_towardzero": double: 3 -float: 2 +float: 3 Function: Imaginary part of "cacos_towardzero": double: 5 @@ -162,7 +158,7 @@ float: 3 Function: Imaginary part of "cacosh_towardzero": double: 3 -float: 2 +float: 3 Function: Real part of "cacosh_upward": double: 4 diff --git a/sysdeps/csky/fpu/libm-test-ulps b/sysdeps/csky/fpu/libm-test-ulps index 2b7b5cfc92..9d3fcf693d 100644 --- a/sysdeps/csky/fpu/libm-test-ulps +++ b/sysdeps/csky/fpu/libm-test-ulps @@ -48,7 +48,6 @@ Function: "asinh_upward": double: 3 Function: "atan": -float: 1 Function: "atan2": float: 1 @@ -67,15 +66,12 @@ float: 2 Function: "atan_downward": double: 1 -float: 2 Function: "atan_towardzero": double: 1 -float: 1 Function: "atan_upward": double: 1 -float: 2 Function: "atanh": double: 2 diff --git a/sysdeps/csky/nofpu/libm-test-ulps b/sysdeps/csky/nofpu/libm-test-ulps index 0eb62de8b2..1bab8effc7 100644 --- a/sysdeps/csky/nofpu/libm-test-ulps +++ b/sysdeps/csky/nofpu/libm-test-ulps @@ -48,7 +48,6 @@ Function: "asinh_upward": double: 3 Function: "atan": -float: 1 Function: "atan2": float: 1 @@ -67,15 +66,12 @@ float: 2 Function: "atan_downward": double: 1 -float: 2 Function: "atan_towardzero": double: 1 -float: 1 Function: "atan_upward": double: 1 -float: 2 Function: "atanh": double: 2 diff --git a/sysdeps/hppa/fpu/libm-test-ulps b/sysdeps/hppa/fpu/libm-test-ulps index 40ae1806d4..8de00f442b 100644 --- a/sysdeps/hppa/fpu/libm-test-ulps +++ b/sysdeps/hppa/fpu/libm-test-ulps @@ -51,7 +51,6 @@ double: 3 Function: "atan": double: 1 -float: 1 Function: "atan2": float: 2 @@ -70,15 +69,12 @@ float: 2 Function: "atan_downward": double: 1 -float: 2 Function: "atan_towardzero": double: 1 -float: 1 Function: "atan_upward": double: 1 -float: 2 Function: "atanh": double: 2 diff --git a/sysdeps/i386/fpu/libm-test-ulps b/sysdeps/i386/fpu/libm-test-ulps index d1a20a1a98..31286ea178 100644 --- a/sysdeps/i386/fpu/libm-test-ulps +++ b/sysdeps/i386/fpu/libm-test-ulps @@ -109,19 +109,16 @@ ldouble: 1 Function: "atan_downward": double: 1 -float: 1 float128: 2 ldouble: 1 Function: "atan_towardzero": double: 1 -float: 1 float128: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 1 float128: 2 ldouble: 1 diff --git a/sysdeps/i386/fpu/s_atanf.S b/sysdeps/i386/fpu/s_atanf.S deleted file mode 100644 index 4a8f5e3600..0000000000 --- a/sysdeps/i386/fpu/s_atanf.S +++ /dev/null @@ -1,30 +0,0 @@ -/* - * Public domain. - */ - -#include -#include -#include - -RCSID("$NetBSD: s_atanf.S,v 1.3 1995/05/08 23:51:33 jtc Exp $") - -DEFINE_FLT_MIN - -#ifdef PIC -# define MO(op) op##@GOTOFF(%ecx) -#else -# define MO(op) op -#endif - - .text -ENTRY(__atanf) -#ifdef PIC - LOAD_PIC_REG (cx) -#endif - flds 4(%esp) - fld1 - fpatan - FLT_CHECK_FORCE_UFLOW - ret -END (__atanf) -libm_alias_float (__atan, atan) diff --git a/sysdeps/i386/i686/fpu/multiarch/libm-test-ulps b/sysdeps/i386/i686/fpu/multiarch/libm-test-ulps index 4e65110265..0a872570d1 100644 --- a/sysdeps/i386/i686/fpu/multiarch/libm-test-ulps +++ b/sysdeps/i386/i686/fpu/multiarch/libm-test-ulps @@ -109,19 +109,16 @@ ldouble: 1 Function: "atan_downward": double: 1 -float: 1 float128: 2 ldouble: 1 Function: "atan_towardzero": double: 1 -float: 1 float128: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 1 float128: 2 ldouble: 1 diff --git a/sysdeps/ieee754/flt-32/s_atanf.c b/sysdeps/ieee754/flt-32/s_atanf.c index 3dbf5c5bb7..7a5cf4d5b1 100644 --- a/sysdeps/ieee754/flt-32/s_atanf.c +++ b/sysdeps/ieee754/flt-32/s_atanf.c @@ -1,102 +1,106 @@ -/* s_atanf.c -- float version of s_atan.c. - */ +/* Correctly-rounded arc-tangent of binary32 value. -/* - * ==================================================== - * Copyright (C) 1993 by Sun Microsystems, Inc. All rights reserved. - * - * Developed at SunPro, a Sun Microsystems, Inc. business. - * Permission to use, copy, modify, and distribute this - * software is freely granted, provided that this notice - * is preserved. - * ==================================================== - */ +Copyright (c) 2022-2024 Alexei Sibidanov. -#if defined(LIBM_SCCS) && !defined(lint) -static char rcsid[] = "$NetBSD: s_atanf.c,v 1.4 1995/05/10 20:46:47 jtc Exp $"; -#endif +The original version of this file was copied from the CORE-MATH +project (file src/binary32/atan/atanf.c, revision 01a29dc). -#include -#include -#include -#include -#include +Permission is hereby granted, free of charge, to any person obtaining a copy +of this software and associated documentation files (the "Software"), to deal +in the Software without restriction, including without limitation the rights +to use, copy, modify, merge, publish, distribute, sublicense, and/or sell +copies of the Software, and to permit persons to whom the Software is +furnished to do so, subject to the following conditions: -static const float atanhi[] = { - 4.6364760399e-01, /* atan(0.5)hi 0x3eed6338 */ - 7.8539812565e-01, /* atan(1.0)hi 0x3f490fda */ - 9.8279368877e-01, /* atan(1.5)hi 0x3f7b985e */ - 1.5707962513e+00, /* atan(inf)hi 0x3fc90fda */ -}; +The above copyright notice and this permission notice shall be included in all +copies or substantial portions of the Software. -static const float atanlo[] = { - 5.0121582440e-09, /* atan(0.5)lo 0x31ac3769 */ - 3.7748947079e-08, /* atan(1.0)lo 0x33222168 */ - 3.4473217170e-08, /* atan(1.5)lo 0x33140fb4 */ - 7.5497894159e-08, /* atan(inf)lo 0x33a22168 */ -}; +THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR +IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, +FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE +AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER +LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, +OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE +SOFTWARE. +*/ -static const float aT[] = { - 3.3333334327e-01, /* 0x3eaaaaaa */ - -2.0000000298e-01, /* 0xbe4ccccd */ - 1.4285714924e-01, /* 0x3e124925 */ - -1.1111110449e-01, /* 0xbde38e38 */ - 9.0908870101e-02, /* 0x3dba2e6e */ - -7.6918758452e-02, /* 0xbd9d8795 */ - 6.6610731184e-02, /* 0x3d886b35 */ - -5.8335702866e-02, /* 0xbd6ef16b */ - 4.9768779427e-02, /* 0x3d4bda59 */ - -3.6531571299e-02, /* 0xbd15a221 */ - 1.6285819933e-02, /* 0x3c8569d7 */ -}; - -static const float -one = 1.0, -huge = 1.0e30; +#include +#include +#include +#include "math_config.h" -float __atanf(float x) +float +__atanf (float x) { - float w,s1,s2,z; - int32_t ix,hx,id; - - GET_FLOAT_WORD(hx,x); - ix = hx&0x7fffffff; - if(ix>=0x4c000000) { /* if |x| >= 2^25 */ - if(ix>0x7f800000) - return x+x; /* NaN */ - if(hx>0) return atanhi[3]+atanlo[3]; - else return -atanhi[3]-atanlo[3]; - } if (ix < 0x3ee00000) { /* |x| < 0.4375 */ - if (ix < 0x31000000) { /* |x| < 2^-29 */ - math_check_force_underflow (x); - if(huge+x>one) return x; /* raise inexact */ - } - id = -1; - } else { - x = fabsf(x); - if (ix < 0x3f980000) { /* |x| < 1.1875 */ - if (ix < 0x3f300000) { /* 7/16 <=|x|<11/16 */ - id = 0; x = ((float)2.0*x-one)/((float)2.0+x); - } else { /* 11/16<=|x|< 19/16 */ - id = 1; x = (x-one)/(x+one); - } - } else { - if (ix < 0x401c0000) { /* |x| < 2.4375 */ - id = 2; x = (x-(float)1.5)/(one+(float)1.5*x); - } else { /* 2.4375 <= |x| < 2^66 */ - id = 3; x = -(float)1.0/x; - } - }} - /* end of argument reduction */ - z = x*x; - w = z*z; - /* break sum from i=0 to 10 aT[i]z**(i+1) into odd and even poly */ - s1 = z*(aT[0]+w*(aT[2]+w*(aT[4]+w*(aT[6]+w*(aT[8]+w*aT[10]))))); - s2 = w*(aT[1]+w*(aT[3]+w*(aT[5]+w*(aT[7]+w*aT[9])))); - if (id<0) return x - x*(s1+s2); - else { - z = atanhi[id] - ((x*(s1+s2) - atanlo[id]) - x); - return (hx<0)? -z:z; + const double pi2 = 0x1.921fb54442d18p+0; + uint32_t t = asuint (x); + int e = (t >> 23) & 0xff; + bool gt = e >= 127; + uint32_t ta = t & 0x7fffffff; + if (__glibc_unlikely (ta >= 0x4c700518u)) /* |x| > 0x1.e00a3p+25 */ + { + if (ta > 0x7f800000u) + return x + x; /* nan */ + return copysign (pi2, (double) x); + } + if (__glibc_unlikely (e < 127 - 13)) + { + if (__glibc_unlikely (e < 127 - 25)) + { + if (!(t << 1)) + return x; + return fmaf (-x, fabsf (x), x); } + return fmaf (-0x1.5555555555555p-2f * x, x * x, x); + } + /* now |x| >= 0x1p-13 */ + double z = x; + if (gt) + z = 1 / z; /* gt is non-zero for |x| >= 1 */ + double z2 = z * z; + double z4 = z2 * z2; + double z8 = z4 * z4; + /* polynomials generated using rminimax + (https://gitlab.inria.fr/sfilip/rminimax) with the following command: + ./ratapprox --function="atan(x)" --dom=[0.000122070,1] + --num=[x,x^3,x^5,x^7,x^9,x^11,x^13] --den=[1,x^2,x^4,x^6,x^8,x^10,x^12] + --output=atanf.sollya --log (see output atanf.sollya) The coefficient + cd[0] was slightly reduced from the original value 0x1.51eccde075d67p-2 to + avoid an exceptional case for |x| = 0x1.1ad646p-4 and rounding to nearest. + */ + static const double cn[] = + { + 0x1.51eccde075d67p-2, 0x1.a76bb5637f2f2p-1, 0x1.81e0eed20de88p-1, + 0x1.376c8ca67d11dp-2, 0x1.aec7b69202ac6p-5, 0x1.9561899acc73ep-9, + 0x1.bf9fa5b67e6p-16 + }; + static const double cd[] = + { + 0x1.51eccde075d66p-2, 0x1.dfbdd7b392d28p-1, 0x1p+0, + 0x1.fd22bf0e89b54p-2, 0x1.d91ff8b576282p-4, 0x1.653ea99fc9bbp-7, + 0x1.1e7fcc202340ap-12 + }; + double cn0 = cn[0] + z2 * cn[1]; + double cn2 = cn[2] + z2 * cn[3]; + double cn4 = cn[4] + z2 * cn[5]; + double cn6 = cn[6]; + cn0 += z4 * cn2; + cn4 += z4 * cn6; + cn0 += z8 * cn4; + cn0 *= z; + double cd0 = cd[0] + z2 * cd[1]; + double cd2 = cd[2] + z2 * cd[3]; + double cd4 = cd[4] + z2 * cd[5]; + double cd6 = cd[6]; + cd0 += z4 * cd2; + cd4 += z4 * cd6; + cd0 += z8 * cd4; + double r = cn0 / cd0; + if (!gt) + return r; /* for |x| < 1, (float) r is correctly rounded */ + + /* now |x| >= 1 */ + r = copysign (0x1.0fdaa22168c23p-7, z) - r + copysign (0x1.9p0, z); + return r; } libm_alias_float (__atan, atan) diff --git a/sysdeps/loongarch/lp64/libm-test-ulps b/sysdeps/loongarch/lp64/libm-test-ulps index b24bc582ea..ff1cf6b2e4 100644 --- a/sysdeps/loongarch/lp64/libm-test-ulps +++ b/sysdeps/loongarch/lp64/libm-test-ulps @@ -67,7 +67,6 @@ ldouble: 4 Function: "atan": double: 1 -float: 1 ldouble: 1 Function: "atan2": @@ -91,17 +90,14 @@ ldouble: 2 Function: "atan_downward": double: 1 -float: 2 ldouble: 2 Function: "atan_towardzero": double: 1 -float: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 ldouble: 2 Function: "atanh": @@ -162,7 +158,7 @@ ldouble: 6 Function: Real part of "cacos_towardzero": double: 3 -float: 2 +float: 3 ldouble: 3 Function: Imaginary part of "cacos_towardzero": @@ -207,7 +203,7 @@ ldouble: 5 Function: Imaginary part of "cacosh_towardzero": double: 3 -float: 2 +float: 3 ldouble: 3 Function: Real part of "cacosh_upward": diff --git a/sysdeps/microblaze/libm-test-ulps b/sysdeps/microblaze/libm-test-ulps index b7e73db063..5dce4c8f89 100644 --- a/sysdeps/microblaze/libm-test-ulps +++ b/sysdeps/microblaze/libm-test-ulps @@ -12,7 +12,6 @@ Function: "asinh": double: 1 Function: "atan": -float: 1 Function: "atan2": float: 1 diff --git a/sysdeps/mips/mips32/libm-test-ulps b/sysdeps/mips/mips32/libm-test-ulps index ca4eac5090..9046a17170 100644 --- a/sysdeps/mips/mips32/libm-test-ulps +++ b/sysdeps/mips/mips32/libm-test-ulps @@ -51,7 +51,6 @@ double: 3 Function: "atan": double: 1 -float: 1 Function: "atan2": float: 2 @@ -70,15 +69,12 @@ float: 2 Function: "atan_downward": double: 1 -float: 2 Function: "atan_towardzero": double: 1 -float: 1 Function: "atan_upward": double: 1 -float: 2 Function: "atanh": double: 2 diff --git a/sysdeps/mips/mips64/libm-test-ulps b/sysdeps/mips/mips64/libm-test-ulps index 30e8d46c68..1525e55eb5 100644 --- a/sysdeps/mips/mips64/libm-test-ulps +++ b/sysdeps/mips/mips64/libm-test-ulps @@ -67,7 +67,6 @@ ldouble: 4 Function: "atan": double: 1 -float: 1 ldouble: 1 Function: "atan2": @@ -91,17 +90,14 @@ ldouble: 2 Function: "atan_downward": double: 1 -float: 2 ldouble: 2 Function: "atan_towardzero": double: 1 -float: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 ldouble: 2 Function: "atanh": diff --git a/sysdeps/or1k/fpu/libm-test-ulps b/sysdeps/or1k/fpu/libm-test-ulps index dd972b3063..6edadaed89 100644 --- a/sysdeps/or1k/fpu/libm-test-ulps +++ b/sysdeps/or1k/fpu/libm-test-ulps @@ -51,7 +51,6 @@ double: 3 Function: "atan": double: 1 -float: 1 Function: "atan2": float: 2 @@ -70,15 +69,12 @@ float: 2 Function: "atan_downward": double: 1 -float: 2 Function: "atan_towardzero": double: 1 -float: 1 Function: "atan_upward": double: 1 -float: 2 Function: "atanh": double: 2 diff --git a/sysdeps/or1k/nofpu/libm-test-ulps b/sysdeps/or1k/nofpu/libm-test-ulps index 4263ce7aa5..aff536b890 100644 --- a/sysdeps/or1k/nofpu/libm-test-ulps +++ b/sysdeps/or1k/nofpu/libm-test-ulps @@ -51,7 +51,6 @@ double: 3 Function: "atan": double: 1 -float: 1 Function: "atan2": float: 2 @@ -70,15 +69,12 @@ float: 2 Function: "atan_downward": double: 1 -float: 2 Function: "atan_towardzero": double: 1 -float: 1 Function: "atan_upward": double: 1 -float: 2 Function: "atanh": double: 2 diff --git a/sysdeps/powerpc/fpu/libm-test-ulps b/sysdeps/powerpc/fpu/libm-test-ulps index e5aa59fca1..342054bb72 100644 --- a/sysdeps/powerpc/fpu/libm-test-ulps +++ b/sysdeps/powerpc/fpu/libm-test-ulps @@ -87,7 +87,6 @@ ldouble: 7 Function: "atan": double: 1 -float: 1 float128: 1 ldouble: 1 @@ -116,19 +115,16 @@ ldouble: 3 Function: "atan_downward": double: 1 -float: 2 float128: 2 ldouble: 1 Function: "atan_towardzero": double: 1 -float: 1 float128: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 float128: 2 ldouble: 2 @@ -202,7 +198,7 @@ ldouble: 8 Function: Real part of "cacos_towardzero": double: 3 -float: 2 +float: 3 float128: 3 ldouble: 7 @@ -256,7 +252,7 @@ ldouble: 8 Function: Imaginary part of "cacosh_towardzero": double: 3 -float: 2 +float: 3 float128: 3 ldouble: 7 diff --git a/sysdeps/powerpc/nofpu/libm-test-ulps b/sysdeps/powerpc/nofpu/libm-test-ulps index 939468399c..c7242e5fec 100644 --- a/sysdeps/powerpc/nofpu/libm-test-ulps +++ b/sysdeps/powerpc/nofpu/libm-test-ulps @@ -71,7 +71,6 @@ ldouble: 7 Function: "atan": double: 1 -float: 1 ldouble: 1 Function: "atan2": @@ -95,17 +94,14 @@ ldouble: 3 Function: "atan_downward": double: 1 -float: 2 ldouble: 1 Function: "atan_towardzero": double: 1 -float: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 ldouble: 2 Function: "atanh": diff --git a/sysdeps/riscv/nofpu/libm-test-ulps b/sysdeps/riscv/nofpu/libm-test-ulps index 7c89f7915b..4fa17a3da2 100644 --- a/sysdeps/riscv/nofpu/libm-test-ulps +++ b/sysdeps/riscv/nofpu/libm-test-ulps @@ -67,7 +67,6 @@ ldouble: 4 Function: "atan": double: 1 -float: 1 ldouble: 1 Function: "atan2": @@ -91,17 +90,14 @@ ldouble: 2 Function: "atan_downward": double: 1 -float: 2 ldouble: 2 Function: "atan_towardzero": double: 1 -float: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 ldouble: 2 Function: "atanh": diff --git a/sysdeps/riscv/rvd/libm-test-ulps b/sysdeps/riscv/rvd/libm-test-ulps index e1c1a6aee5..0e3fb96ee5 100644 --- a/sysdeps/riscv/rvd/libm-test-ulps +++ b/sysdeps/riscv/rvd/libm-test-ulps @@ -67,7 +67,6 @@ ldouble: 4 Function: "atan": double: 1 -float: 1 ldouble: 1 Function: "atan2": @@ -91,17 +90,14 @@ ldouble: 2 Function: "atan_downward": double: 1 -float: 2 ldouble: 2 Function: "atan_towardzero": double: 1 -float: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 ldouble: 2 Function: "atanh": diff --git a/sysdeps/s390/fpu/libm-test-ulps b/sysdeps/s390/fpu/libm-test-ulps index f36f0e3f5a..921ff284af 100644 --- a/sysdeps/s390/fpu/libm-test-ulps +++ b/sysdeps/s390/fpu/libm-test-ulps @@ -67,7 +67,6 @@ ldouble: 4 Function: "atan": double: 1 -float: 1 ldouble: 1 Function: "atan2": @@ -91,17 +90,14 @@ ldouble: 2 Function: "atan_downward": double: 1 -float: 2 ldouble: 2 Function: "atan_towardzero": double: 1 -float: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 ldouble: 2 Function: "atanh": diff --git a/sysdeps/sh/libm-test-ulps b/sysdeps/sh/libm-test-ulps index 4bd1ff1f98..b429f42d89 100644 --- a/sysdeps/sh/libm-test-ulps +++ b/sysdeps/sh/libm-test-ulps @@ -24,7 +24,6 @@ Function: "asinh_towardzero": double: 2 Function: "atan": -float: 1 Function: "atan2": float: 1 @@ -35,7 +34,6 @@ float: 2 Function: "atan_towardzero": double: 1 -float: 1 Function: "atanh": double: 2 diff --git a/sysdeps/sparc/fpu/libm-test-ulps b/sysdeps/sparc/fpu/libm-test-ulps index 0cbfc5be76..ee7eea81f9 100644 --- a/sysdeps/sparc/fpu/libm-test-ulps +++ b/sysdeps/sparc/fpu/libm-test-ulps @@ -67,7 +67,6 @@ ldouble: 4 Function: "atan": double: 1 -float: 1 ldouble: 1 Function: "atan2": @@ -91,17 +90,14 @@ ldouble: 2 Function: "atan_downward": double: 1 -float: 2 ldouble: 2 Function: "atan_towardzero": double: 1 -float: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 ldouble: 2 Function: "atanh": diff --git a/sysdeps/x86_64/fpu/libm-test-ulps b/sysdeps/x86_64/fpu/libm-test-ulps index 5f9afc7f6e..1589403c1c 100644 --- a/sysdeps/x86_64/fpu/libm-test-ulps +++ b/sysdeps/x86_64/fpu/libm-test-ulps @@ -160,7 +160,6 @@ float: 1 Function: "atan": double: 1 -float: 1 float128: 1 ldouble: 1 @@ -209,19 +208,16 @@ float: 2 Function: "atan_downward": double: 1 -float: 2 float128: 2 ldouble: 1 Function: "atan_towardzero": double: 1 -float: 1 float128: 1 ldouble: 1 Function: "atan_upward": double: 1 -float: 2 float128: 2 ldouble: 1 @@ -335,7 +331,7 @@ ldouble: 6 Function: Real part of "cacos_towardzero": double: 3 -float: 2 +float: 3 float128: 3 ldouble: 2 @@ -389,7 +385,7 @@ ldouble: 5 Function: Imaginary part of "cacosh_towardzero": double: 3 -float: 2 +float: 3 float128: 3 ldouble: 2