mirror of
https://sourceware.org/git/binutils-gdb.git
synced 2025-01-15 12:34:05 +08:00
252dcdf432
ppc64 ld optimises sequences like the following addis 3,13,wot@tprel@ha lwz 3,wot@tprel@l(3) to nop lwz 3,wot@tprel(13) when "wot" is located near enough to the thread pointer. However, the ABI doesn't require that R_PPC64_TPREL16_HA always be on an addis rt,13,imm instruction, and while ld checked for that on the high-part instruction it didn't disable the optimisation on the low-part instruction. This patch fixes that problem, disabling the tprel optimisation globally if high-part instructions don't pass sanity checks. The optimisation is also enabled for ppc32, where before ld.bfd had the code in the wrong place and ld.gold had it in a block only enabled for ppc64. bfd/ * elf32-ppc.c (ppc_elf_check_relocs): Set has_tls_reloc for high part tprel16 relocs. (ppc_elf_tls_optimize): Sanity check high part tprel16 relocs. Clear do_tls_opt on odd instructions. (ppc_elf_relocate_section): Move TPREL16_HA/LO optimisation later. Don't sanity check them here. * elf64-ppc.c (ppc64_elf_check_relocs): Set has_tls_reloc for high part tprel16 relocs. (ppc64_elf_tls_optimize): Sanity check high part tprel16 relocs. Clear do_tls_opt on odd instructions. (ppc64_elf_relocate_section): Don't sanity check TPREL16_HA. ld/ * testsuite/ld-powerpc/tls32.d: Update for TPREL_HA/LO optimisation. * testsuite/ld-powerpc/tlsexe32.d: Likewise. * testsuite/ld-powerpc/tlsldopt32.d: Likewise. * testsuite/ld-powerpc/tlsmark32.d: Likewise. * testsuite/ld-powerpc/tlsopt4_32.d: Likewise. * testsuite/ld-powerpc/tprel.s, * testsuite/ld-powerpc/tprel.d, * testsuite/ld-powerpc/tprel32.d: New tests. * testsuite/ld-powerpc/tprelbad.s, * testsuite/ld-powerpc/tprelbad.d: New test. * testsuite/ld-powerpc/powerpc.exp: Run them. gold/ * powerpc.cc (Target_powerpc): Add tprel_opt_ and accessors. (Target_powerpc::Scan::local): Sanity check tprel high relocs. (Target_powerpc::Scan::global): Likewise. (Target_powerpc::Relocate::relocate): Control tprel optimisation with tprel_opt_ and enable for 32-bit. |
||
---|---|---|
.. | ||
po | ||
testsuite | ||
aarch64-reloc-property.cc | ||
aarch64-reloc-property.h | ||
aarch64-reloc.def | ||
aarch64.cc | ||
aclocal.m4 | ||
archive.cc | ||
archive.h | ||
arm-reloc-property.cc | ||
arm-reloc-property.h | ||
arm-reloc.def | ||
arm.cc | ||
attributes.cc | ||
attributes.h | ||
binary.cc | ||
binary.h | ||
ChangeLog | ||
ChangeLog-0815 | ||
ChangeLog-2016 | ||
ChangeLog-2017 | ||
ChangeLog-2018 | ||
ChangeLog-2019 | ||
common.cc | ||
common.h | ||
compressed_output.cc | ||
compressed_output.h | ||
config.in | ||
configure | ||
configure.ac | ||
configure.tgt | ||
copy-relocs.cc | ||
copy-relocs.h | ||
cref.cc | ||
cref.h | ||
debug.h | ||
defstd.cc | ||
defstd.h | ||
descriptors.cc | ||
descriptors.h | ||
dirsearch.cc | ||
dirsearch.h | ||
dwarf_reader.cc | ||
dwarf_reader.h | ||
dwp.cc | ||
dwp.h | ||
dynobj.cc | ||
dynobj.h | ||
ehframe.cc | ||
ehframe.h | ||
errors.cc | ||
errors.h | ||
expression.cc | ||
ffsll.c | ||
fileread.cc | ||
fileread.h | ||
freebsd.h | ||
ftruncate.c | ||
gc.cc | ||
gc.h | ||
gdb-index.cc | ||
gdb-index.h | ||
gold-threads.cc | ||
gold-threads.h | ||
gold.cc | ||
gold.h | ||
i386.cc | ||
icf.cc | ||
icf.h | ||
incremental-dump.cc | ||
incremental.cc | ||
incremental.h | ||
int_encoding.cc | ||
int_encoding.h | ||
layout.cc | ||
layout.h | ||
main.cc | ||
Makefile.am | ||
Makefile.in | ||
mapfile.cc | ||
mapfile.h | ||
merge.cc | ||
merge.h | ||
mips.cc | ||
mremap.c | ||
nacl.cc | ||
nacl.h | ||
NEWS | ||
object.cc | ||
object.h | ||
options.cc | ||
options.h | ||
output.cc | ||
output.h | ||
parameters.cc | ||
parameters.h | ||
plugin.cc | ||
plugin.h | ||
powerpc.cc | ||
pread.c | ||
README | ||
readsyms.cc | ||
readsyms.h | ||
reduced_debug_output.cc | ||
reduced_debug_output.h | ||
reloc-types.h | ||
reloc.cc | ||
reloc.h | ||
resolve.cc | ||
s390.cc | ||
script-c.h | ||
script-sections.cc | ||
script-sections.h | ||
script.cc | ||
script.h | ||
sparc.cc | ||
stringpool.cc | ||
stringpool.h | ||
symtab.cc | ||
symtab.h | ||
system.h | ||
target-reloc.h | ||
target-select.cc | ||
target-select.h | ||
target.cc | ||
target.h | ||
tilegx.cc | ||
timer.cc | ||
timer.h | ||
tls.h | ||
TODO | ||
token.h | ||
version.cc | ||
workqueue-internal.h | ||
workqueue-threads.cc | ||
workqueue.cc | ||
workqueue.h | ||
x86_64.cc | ||
yyscript.y |
gold is an ELF linker. It is intended to have complete support for
ELF and to run as fast as possible on modern systems. For normal use
it is a drop-in replacement for the older GNU linker.
gold is part of the GNU binutils. See ../binutils/README for more
general notes, including where to send bug reports.
gold was originally developed at Google, and was contributed to the
Free Software Foundation in March 2008. At Google it was designed by
Ian Lance Taylor, with major contributions by Cary Coutant, Craig
Silverstein, and Andrew Chatham.
The existing GNU linker manual is intended to be accurate
documentation for features which gold supports. gold supports most of
the features of the GNU linker for ELF targets. Notable
omissions--features of the GNU linker not currently supported in
gold--are:
* MRI compatible linker scripts
* cross-reference reports (--cref)
* various other minor options
Notes on the code
=================
These are some notes which may be helpful to people working on the
source code of gold itself.
gold is written in C++. It is a GNU program, and therefore follows
the GNU formatting standards as modified for C++. Source documents in
order of decreasing precedence:
http://www.gnu.org/prep/standards/
http://gcc.gnu.org/onlinedocs/libstdc++/manual/source_code_style.html
http://www.zembu.com/eng/procs/c++style.html
The linker is intended to have complete support for cross-compilation,
while still supporting the normal case of native linking as fast as
possible. In order to do this, many classes are actually templates
whose parameter is the ELF file class (e.g., 32 bits or 64 bits). The
C++ code is the same, but we don't pay the execution time cost of
always using 64-bit integers if the target is 32 bits. Many of these
class templates also have an endianness parameter: true for
big-endian, false for little-endian.
The linker is multi-threaded. The Task class represents a single unit
of work. Task objects are stored on a single Workqueue object. Tasks
communicate via Task_token objects. Task_token objects are only
manipulated while holding the master Workqueue lock. Relatively few
mutexes are used.
Build requirements
==================
The gold source code uses templates heavily. Building it requires a
recent version of g++. g++ 4.0.3 and 4.1.3 are known to work. g++
3.2, 3.4.3, and 4.1.2 are known to fail.
The linker script parser uses features which are only in newer
versions of bison. bison 2.3 is known to work. bison 1.26 is known
to fail. If you are building gold from an official binutils release,
the bison output should already be included.
Copyright (C) 2012-2020 Free Software Foundation, Inc.
Copying and distribution of this file, with or without modification,
are permitted in any medium without royalty provided the copyright
notice and this notice are preserved.