Go to file
Siddhesh Poyarekar 436e4d5b96 [aarch64] Add an ASIMD variant of strlen for falkor
This variant of strlen uses vector loads and operations to reduce the
size of the code and also eliminate the non-ascii fallback.  This
works very well for falkor because of its two vector units and
efficient vector ops.  In the best case it reduces latency of cases in
bench-strlen by 48%, with gains throughout the benchmark.
strlen-walk also sees uniform gains in the 5%-15% range.

Overall the routine appears to work better than the stock one for falkor
regardless of the benchmark, length of string or cache state.

The same cannot be said of a53 and a72 though.  a53 performance was
greatly reduced and for a72 it was a bit of a mixed bag, slightly on the
negative side but I reckon it might be fast in some situations.

	* sysdeps/aarch64/strlen.S (__strlen): Rename to STRLEN.
	[!STRLEN](STRLEN): Set to __strlen.
	* sysdeps/aarch64/multiarch/strlen.c: New file.
	* sysdeps/aarch64/multiarch/strlen_generic.S: Likewise.
	* sysdeps/aarch64/multiarch/strlen_asimd.S: Likewise.
	* sysdeps/aarch64/multiarch/ifunc-impl-list.c
	(__libc_ifunc_impl_list): Add strlen.
	* sysdeps/aarch64/multiarch/Makefile (sysdep_routines): Add
	strlen_generic and strlen_asimd.

Reviewed-By: szabolcs.nagy@arm.com
CC: pinskia@gmail.com
2018-08-15 23:01:33 +05:30
argp Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
assert Fix uninitialized variable in assert_perror (bug 22761) 2018-02-05 11:06:15 +01:00
benchtests [benchtests] Add workload test properties to schema 2018-08-11 18:55:09 +05:30
bits Add <bits/indirect-return.h> 2018-07-24 07:55:47 -07:00
catgets intl/tst-gettext: fix failure with newest msgfmt 2018-02-18 18:16:05 +01:00
ChangeLog.old Add missing reference to bug 21654 2017-10-07 13:14:36 +02:00
conform Fix C11 conformance issues 2018-07-25 12:02:32 -03:00
crypt New configure option --disable-crypt. 2018-06-29 16:53:47 +02:00
csu Build csu/elf-init.c and csu/static-reloc.c with stack protector 2018-07-05 22:57:45 +02:00
ctype Use libc_hidden_* for tolower, toupper (bug 15105). 2018-02-23 13:54:53 +00:00
debug Compile debug/stack_chk_fail_local.c with stack protector 2018-07-05 19:28:35 +02:00
dirent Consolidate alphasort{64} and versionsort{64} implementation 2018-04-23 17:35:16 -03:00
dlfcn libc: Extend __libc_freeres framework (Bug 23329). 2018-06-29 22:39:06 -04:00
elf Rename the glibc.tune namespace to glibc.cpu 2018-08-02 23:49:19 +05:30
gmon Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
gnulib Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
grp Avoid insecure usage of tmpnam in tests. 2018-07-18 21:04:12 +00:00
gshadow Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
hesiod Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
htl hurd: Add missing symbols for proper libc_get/setspecific 2018-08-08 01:55:04 +02:00
hurd hurd: Fix exec usage of mach_setup_thread 2018-08-01 00:10:03 +02:00
iconv Fix s390 -Os iconv build. 2018-03-05 21:46:55 +00:00
iconvdata Fix out-of-bounds access in IBM-1390 converter (bug 23448) 2018-07-24 16:45:46 +02:00
include error, warn, warnx: Use __fxprintf for wide printing [BZ #23519] 2018-08-14 17:54:49 +02:00
inet manual: Revise crypt.texi. 2018-06-29 16:53:37 +02:00
intl intl/tst-gettext: fix failure with newest msgfmt 2018-02-18 18:16:05 +01:00
io Avoid insecure usage of tmpnam in tests. 2018-07-18 21:04:12 +00:00
libio Fix copyright years in recent commits 2018-07-10 11:03:08 +02:00
locale Fix out of bounds access in findidxwc (bug 23442) 2018-07-25 10:50:03 +02:00
localedata Add convenience target 'install-locale-files'. 2018-08-02 15:31:12 -04:00
login Fix Linux fcntl OFD locks for non-LFS architectures (BZ#20251) 2018-06-26 13:22:53 -03:00
mach hurd: Fix exec usage of mach_setup_thread 2018-08-01 00:10:03 +02:00
malloc libc: Extend __libc_freeres framework (Bug 23329). 2018-06-29 22:39:06 -04:00
manual Update install.texi documentation of uses of Perl and Python. 2018-08-13 17:20:46 +00:00
math Improve performance of sincosf 2018-08-10 17:34:39 +01:00
mathvec Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
misc error, warn, warnx: Use __fxprintf for wide printing [BZ #23519] 2018-08-14 17:54:49 +02:00
nis nisplus: Correct pwent parsing issue and resulting build error [BZ #23266] 2018-06-27 21:12:16 +01:00
nptl Fix ISO C threads installed header and HURD assumption 2018-07-25 17:27:45 -03:00
nptl_db nptl_db: Remove stale match_pid' parameter from iterate_thread_list' 2018-03-01 16:10:05 +00:00
nscd nscd: Deallocate existing user names in file parser 2018-08-14 10:51:07 +02:00
nss nss_files: Fix file stream leak in aliases lookup [BZ #23521] 2018-08-14 10:52:06 +02:00
po Update be translations. 2018-08-03 10:13:47 -04:00
posix regex: Gnulib unibyte RRI uses bytes not chars 2018-08-10 14:27:56 -07:00
pwd manual: Revise crypt.texi. 2018-06-29 16:53:37 +02:00
resolv libc: Extend __libc_freeres framework (Bug 23329). 2018-06-29 22:39:06 -04:00
resource resource/tst-getrlimit.c: Add copyright header 2018-01-05 20:34:10 +01:00
rt hurd: Add hurd thread library 2018-04-02 01:44:14 +02:00
scripts Use Linux 4.18 in build-many-glibcs.py. 2018-08-13 16:46:13 +00:00
setjmp x86: Use pad in pthread_unwind_buf to preserve shadow stack register 2018-05-02 06:17:41 -07:00
shadow manual: Revise crypt.texi. 2018-06-29 16:53:37 +02:00
signal Add tst-sigaction.c to test BZ #23069 2018-04-26 22:21:13 +02:00
socket Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
soft-fp Make powerpc-nofpu __sqrtsf2, __sqrtdf2 compat symbols (bug 18473). 2018-06-01 17:25:12 +00:00
stdio-common error, warn, warnx: Use __fxprintf for wide printing [BZ #23519] 2018-08-14 17:54:49 +02:00
stdlib mbstowcs: Remove outdated comment 2018-08-13 14:05:01 +02:00
streams Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
string Simplify and speedup strstr/strcasestr first match 2018-08-03 17:24:12 +01:00
sunrpc libc: Extend __libc_freeres framework (Bug 23329). 2018-06-29 22:39:06 -04:00
support support: Add TEST_NO_SETVBUF 2018-06-26 12:30:50 +02:00
sysdeps [aarch64] Add an ASIMD variant of strlen for falkor 2018-08-15 23:01:33 +05:30
sysvipc Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
termios Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
time Use _STRUCT_TIMESPEC as guard in <bits/types/struct_timespec.h> [BZ #23349] 2018-06-28 13:12:16 +02:00
timezone Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
wcsmbs Add tests for sign of NaN returned by strtod (bug 23007). 2018-06-15 17:36:21 +00:00
wctype Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
.gitattributes Assume __NR_openat is always defined 2016-03-23 23:35:08 +01:00
.gitignore Add *.pyc to .gitignore 2015-05-18 15:26:26 +05:30
abi-tags Remove the bulk of the NaCl port. 2017-05-20 08:09:10 -04:00
aclocal.m4 LIBC_SLIBDIR_RTLDDIR: substitute arguments in single quotes 2018-01-25 17:20:28 +01:00
ChangeLog [aarch64] Add an ASIMD variant of strlen for falkor 2018-08-15 23:01:33 +05:30
config.h.in Switch IDNA implementation to libidn2 [BZ #19728] [BZ #19729] [BZ #22247] 2018-05-23 15:27:24 +02:00
config.make.in New configure option --disable-crypt. 2018-06-29 16:53:47 +02:00
configure x86: Support IBT and SHSTK in Intel CET [BZ #21598] 2018-07-16 14:08:27 -07:00
configure.ac x86: Support IBT and SHSTK in Intel CET [BZ #21598] 2018-07-16 14:08:27 -07:00
COPYING Update to latest versions of GPL-2.0 and LGPL-2.1 2013-09-09 12:52:48 +10:00
COPYING.LIB Update to latest versions of GPL-2.0 and LGPL-2.1 2013-09-09 12:52:48 +10:00
extra-lib.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
gen-locales.mk Improve gen-locales.mk and gen-locale.sh to make test files with @ options work 2018-02-27 17:01:57 +01:00
INSTALL Update install.texi documentation of uses of Perl and Python. 2018-08-13 17:20:46 +00:00
libc-abis libc-abis: Define ABSOLUTE ABI [BZ #19818][BZ #23307] 2018-07-05 18:06:43 +01:00
libof-iterator.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
LICENSES stdio-common/tst-printf.c: Remove part under a non-free license [BZ #23363] 2018-07-03 18:29:16 +02:00
MAINTAINERS Add MAINTAINERS 2017-05-11 13:38:30 -04:00
Makeconfig New configure option --disable-crypt. 2018-06-29 16:53:47 +02:00
Makefile testrun.sh: Implement --tool=strace, --tool=valgrind 2018-07-04 15:30:45 +02:00
Makefile.in New make target to only build benchmark binaries 2016-04-20 10:23:28 +05:30
Makerules Run thread shutdown functions in an explicit order 2018-06-26 15:27:12 +02:00
NEWS Improve performance of sinf and cosf 2018-08-14 10:45:59 +01:00
o-iterator.mk
README Remove tilegx port. 2018-04-27 19:11:24 +00:00
Rules Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
shlib-versions Extend NSS test suite 2017-07-17 15:52:44 -04:00
test-skeleton.c Update copyright dates with scripts/update-copyrights. 2018-01-01 00:32:25 +00:00
version.h Open master branch for glibc 2.29 development 2018-08-01 01:35:02 -04:00

This directory contains the sources of the GNU C Library.
See the file "version.h" for what release version you have.

The GNU C Library is the standard system C library for all GNU systems,
and is an important part of what makes up a GNU system.  It provides the
system API for all programs written in C and C-compatible languages such
as C++ and Objective C; the runtime facilities of other programming
languages use the C library to access the underlying operating system.

In GNU/Linux systems, the C library works with the Linux kernel to
implement the operating system behavior seen by user applications.
In GNU/Hurd systems, it works with a microkernel and Hurd servers.

The GNU C Library implements much of the POSIX.1 functionality in the
GNU/Hurd system, using configurations i[4567]86-*-gnu.

When working with Linux kernels, this version of the GNU C Library
requires Linux kernel version 3.2 or later.

Also note that the shared version of the libgcc_s library must be
installed for the pthread library to work correctly.

The GNU C Library supports these configurations for using Linux kernels:

	aarch64*-*-linux-gnu
	alpha*-*-linux-gnu
	arm-*-linux-gnueabi
	hppa-*-linux-gnu
	i[4567]86-*-linux-gnu
	x86_64-*-linux-gnu	Can build either x86_64 or x32
	ia64-*-linux-gnu
	m68k-*-linux-gnu
	microblaze*-*-linux-gnu
	mips-*-linux-gnu
	mips64-*-linux-gnu
	powerpc-*-linux-gnu	Hardware or software floating point, BE only.
	powerpc64*-*-linux-gnu	Big-endian and little-endian.
	s390-*-linux-gnu
	s390x-*-linux-gnu
	riscv64-*-linux-gnu
	sh[34]-*-linux-gnu
	sparc*-*-linux-gnu
	sparc64*-*-linux-gnu

If you are interested in doing a port, please contact the glibc
maintainers; see http://www.gnu.org/software/libc/ for more
information.

See the file INSTALL to find out how to configure, build, and install
the GNU C Library.  You might also consider reading the WWW pages for
the C library at http://www.gnu.org/software/libc/.

The GNU C Library is (almost) completely documented by the Texinfo manual
found in the `manual/' subdirectory.  The manual is still being updated
and contains some known errors and omissions; we regret that we do not
have the resources to work on the manual as much as we would like.  For
corrections to the manual, please file a bug in the `manual' component,
following the bug-reporting instructions below.  Please be sure to check
the manual in the current development sources to see if your problem has
already been corrected.

Please see http://www.gnu.org/software/libc/bugs.html for bug reporting
information.  We are now using the Bugzilla system to track all bug reports.
This web page gives detailed information on how to report bugs properly.

The GNU C Library is free software.  See the file COPYING.LIB for copying
conditions, and LICENSES for notices about a few contributions that require
these additional notices to be distributed.  License copyright years may be
listed using range notation, e.g., 1996-2015, indicating that every year in
the range, inclusive, is a copyrightable year that would otherwise be listed
individually.