glibc/sysdeps/loongarch
Adhemerval Zanella bccb0648ea math: Use tanf from CORE-MATH
The CORE-MATH implementation is correctly rounded (for any rounding mode)
and shows better performance to the generic tanf.

The code was adapted to glibc style, to use the definition of
math_config.h, to remove errno handling, and to use a generic
128 bit routine for ABIs that do not support it natively.

Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (neoverse1,
gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1):

latency                       master       patched  improvement
x86_64                       82.3961       54.8052       33.49%
x86_64v2                     82.3415       54.8052       33.44%
x86_64v3                     69.3661       50.4864       27.22%
i686                         219.271       45.5396       79.23%
aarch64                      29.2127       19.1951       34.29%
power10                      19.5060       16.2760       16.56%

reciprocal-throughput         master       patched  improvement
x86_64                       28.3976       19.7334       30.51%
x86_64v2                     28.4568       19.7334       30.65%
x86_64v3                     21.1815       16.1811       23.61%
i686                         105.016       15.1426       85.58%
aarch64                      18.1573       10.7681       40.70%
power10                       8.7207        8.7097        0.13%

Signed-off-by: Alexei Sibidanov <sibid@uvic.ca>
Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr>
Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Reviewed-by: DJ Delorie <dj@redhat.com>
2024-11-22 10:52:27 -03:00
..
bits Refer to C23 in place of C2X in glibc 2024-02-01 11:02:01 +00:00
fpu LoongArch: Correct {__ieee754, _}_scalb -> {__ieee754, _}_scalbf 2024-03-12 14:07:27 +08:00
lp64 math: Use tanf from CORE-MATH 2024-11-22 10:52:27 -03:00
nofpu Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
nptl nptl: fix __builtin_thread_pointer detection on LoongArch 2024-11-07 14:08:30 +08:00
sys LoongArch: Ensure sp 16-byte aligned for tlsdesc 2024-06-14 10:14:54 +08:00
__longjmp.S Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
abort-instr.h LoongArch: Linux Syscall Interface 2022-07-26 12:35:12 -03:00
bsd-_setjmp.c
bsd-setjmp.c
configure Convert to autoconf 2.72 (vanilla release, no distribution patches) 2024-06-17 21:15:28 +02:00
configure.ac LoongArch: Fix tst-gnu2-tls2 compiler error 2024-05-21 11:23:03 +08:00
cpu-tunables.c LoongArch: Add glibc.cpu.hwcap support. 2024-04-24 18:22:38 +08:00
dl-audit-check.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
dl-get-cpu-features.c LoongArch: Add glibc.cpu.hwcap support. 2024-04-24 18:22:38 +08:00
dl-irel.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
dl-link.sym LoongArch: Add lasx/lsx support for _dl_runtime_profile. 2023-09-15 09:07:42 +08:00
dl-machine.h LoongArch: Add cfi instructions for _dl_tlsdesc_dynamic 2024-08-09 09:06:17 +08:00
dl-tls.h LoongArch: Add support for TLS Descriptors 2024-05-15 10:31:53 +08:00
dl-tlsdesc-dynamic.h LoongArch: Fix macro redefined warning in tls-desc.S 2024-09-06 15:46:13 +08:00
dl-tlsdesc.h LoongArch: Add cfi instructions for _dl_tlsdesc_dynamic 2024-08-09 09:06:17 +08:00
dl-tlsdesc.S LoongArch: Fix macro redefined warning in tls-desc.S 2024-09-06 15:46:13 +08:00
dl-trampoline.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
dl-trampoline.S Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
dl-tunables.list LoongArch: Add glibc.cpu.hwcap support. 2024-04-24 18:22:38 +08:00
e_sqrtl.c Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
fpu_control.h LoongArch: Use "$fcsr0" instead of "$r0" in _FPU_{GET,SET}CW 2024-05-28 09:17:05 +08:00
hp-timing.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
Implies LoongArch: Build Infrastructure 2022-07-26 12:35:12 -03:00
jmpbuf-offsets.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
jmpbuf-unwind.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
ldsodefs.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
libc-tls.c Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
linkmap.h LoongArch: Add support for TLS Descriptors 2024-05-15 10:31:53 +08:00
machine-gmon.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
Makefile LoongArch: Fix tst-gnu2-tls2 compiler error 2024-05-21 11:23:03 +08:00
math_private.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
math-use-builtins-ffs.h LoongArch: Use builtins for ffs and ffsll 2024-02-05 15:19:41 -03:00
preconfigure LoongArch: Add support for TLS Descriptors 2024-05-15 10:31:53 +08:00
preconfigure.ac LoongArch: Add soft float support. 2022-09-01 09:10:08 +08:00
setjmp.S Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
sfp-machine.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
sotruss-lib.c Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
stackinfo.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
start.S Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
tininess.h LoongArch: Generic <math.h> and soft-fp Routines 2022-07-26 12:35:12 -03:00
tlsdesc.c LoongArch: Add support for TLS Descriptors 2024-05-15 10:31:53 +08:00
tlsdesc.sym LoongArch: Add cfi instructions for _dl_tlsdesc_dynamic 2024-08-09 09:06:17 +08:00
tst-audit.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
tst-gnu2-tls2.h LoongArch: Fix tst-gnu2-tls2 test case 2024-06-26 12:02:07 +08:00
tst-hwcap-tunables.c LoongArch: Add glibc.cpu.hwcap support. 2024-04-24 18:22:38 +08:00