glibc/sysdeps/ieee754
Adhemerval Zanella bccb0648ea math: Use tanf from CORE-MATH
The CORE-MATH implementation is correctly rounded (for any rounding mode)
and performs better than the generic tanf.

The code was adapted to glibc style: it uses the definitions from
math_config.h, drops errno handling, and uses a generic 128-bit
routine on ABIs that lack native 128-bit support.
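
As a rough illustration of what a generic 128-bit fallback can look like,
the sketch below computes a full 64x64 -> 128-bit product from 32-bit
partial products and uses the compiler's native type when it is available.
The names u128_sketch, mul64x64_generic and mul64x64 are hypothetical and
are not taken from the glibc sources; the actual routine in the patch may
differ.

```c
#include <stdint.h>

/* Hypothetical 128-bit value stored as two 64-bit halves.
   This is an illustration, not glibc's actual type.  */
typedef struct { uint64_t lo, hi; } u128_sketch;

/* Portable 64x64 -> 128-bit multiply built from 32-bit partial products.  */
static u128_sketch
mul64x64_generic (uint64_t a, uint64_t b)
{
  uint64_t a_lo = a & 0xffffffffu, a_hi = a >> 32;
  uint64_t b_lo = b & 0xffffffffu, b_hi = b >> 32;

  uint64_t p0 = a_lo * b_lo;   /* contributes to bits 0..63 */
  uint64_t p1 = a_lo * b_hi;   /* contributes to bits 32..95 */
  uint64_t p2 = a_hi * b_lo;   /* contributes to bits 32..95 */
  uint64_t p3 = a_hi * b_hi;   /* contributes to bits 64..127 */

  /* Sum of everything that lands in bits 32..63, plus its carry.
     Three values below 2^32 cannot overflow 64 bits.  */
  uint64_t mid = (p0 >> 32) + (p1 & 0xffffffffu) + (p2 & 0xffffffffu);

  u128_sketch r;
  r.lo = (mid << 32) | (p0 & 0xffffffffu);
  r.hi = p3 + (p1 >> 32) + (p2 >> 32) + (mid >> 32);
  return r;
}

/* Use the native type where the compiler provides it (assumption:
   GCC/Clang define __SIZEOF_INT128__ on such targets).  */
static u128_sketch
mul64x64 (uint64_t a, uint64_t b)
{
#ifdef __SIZEOF_INT128__
  unsigned __int128 p = (unsigned __int128) a * b;
  u128_sketch r = { (uint64_t) p, (uint64_t) (p >> 64) };
  return r;
#else
  return mul64x64_generic (a, b);
#endif
}
```

On a target with native support, both paths should return the same halves
for any pair of inputs, which makes the generic path easy to cross-check.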

Benchtest on x86_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (neoverse1,
gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1):

latency                       master       patched  improvement
x86_64                       82.3961       54.8052       33.49%
x86_64v2                     82.3415       54.8052       33.44%
x86_64v3                     69.3661       50.4864       27.22%
i686                         219.271       45.5396       79.23%
aarch64                      29.2127       19.1951       34.29%
power10                      19.5060       16.2760       16.56%

reciprocal-throughput         master       patched  improvement
x86_64                       28.3976       19.7334       30.51%
x86_64v2                     28.4568       19.7334       30.65%
x86_64v3                     21.1815       16.1811       23.61%
i686                         105.016       15.1426       85.58%
aarch64                      18.1573       10.7681       40.70%
power10                       8.7207        8.7097        0.13%
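
The improvement column is consistent with the relative reduction versus
master, (master - patched) / master, expressed in percent. This reading is
inferred from the figures rather than stated in the commit message. A
minimal C sketch, with the helper name improvement_pct chosen only for
illustration:

```c
/* Relative improvement in percent: how much smaller the patched
   timing is compared to the master timing.  */
static double
improvement_pct (double master, double patched)
{
  return (master - patched) / master * 100.0;
}
```

For the x86_64 latency row, improvement_pct (82.3961, 54.8052) gives
roughly 33.49, matching the table.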

Signed-off-by: Alexei Sibidanov <sibid@uvic.ca>
Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr>
Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Reviewed-by: DJ Delorie <dj@redhat.com>
2024-11-22 10:52:27 -03:00
..
dbl-64               math: Split s_erfF in erff and erfc  2024-11-22 10:52:26 -03:00
float128             math: Split s_erfF in erff and erfc  2024-11-22 10:52:26 -03:00
flt-32               math: Use tanf from CORE-MATH  2024-11-22 10:52:27 -03:00
ldbl-64-128          math: Provide missing math symbols on libc.a (BZ 31781)  2024-05-23 09:36:08 -03:00
ldbl-96              math: Split s_erfF in erff and erfc  2024-11-22 10:52:26 -03:00
ldbl-128             math: Split s_erfF in erff and erfc  2024-11-22 10:52:26 -03:00
ldbl-128ibm          math: Split s_erfF in erff and erfc  2024-11-22 10:52:26 -03:00
ldbl-128ibm-compat   Implement C23 exp2m1, exp10m1  2024-06-17 16:31:49 +00:00
ldbl-opt             Convert to autoconf 2.72 (vanilla release, no distribution patches)  2024-06-17 21:15:28 +02:00
soft-fp              Update copyright dates with scripts/update-copyrights  2024-01-01 10:53:40 -08:00
ieee754.h            Update copyright dates with scripts/update-copyrights  2024-01-01 10:53:40 -08:00
k_standard.c         Use copysign functions not __copysign functions in glibc libm.  2018-09-27 20:04:48 +00:00
k_standardf.c        Update copyright dates with scripts/update-copyrights  2024-01-01 10:53:40 -08:00
k_standardl.c        Update copyright dates with scripts/update-copyrights  2024-01-01 10:53:40 -08:00
libm-alias-finite.h  Update copyright dates with scripts/update-copyrights  2024-01-01 10:53:40 -08:00
Makefile             Avoid -Wno-write-strings for k_standard.c.  2015-02-26 22:50:54 +00:00
s_lib_version.c      Simplify math-svid-compat code.  2017-08-28 15:19:52 +00:00
s_matherr.c          Obsolete matherr, _LIB_VERSION, libieee.a.  2017-08-21 17:45:10 +00:00
s_signgam.c          Remove unnecessary math_private.h includes.  2018-09-28 21:53:33 +00:00