Add support for the SME2 ADD, SUB, FADD and FSUB instructions.
SUB and FSUB have the same form as ADD and FADD, except that
ADD also has a 2-operand accumulating form.
The 64-bit ADD/SUB instructions require FEAT_SME_I16I64 and the
64-bit FADD/FSUB instructions require FEAT_SME_F64F64.
These are the first instructions to have tied register list
operands, as opposed to tied single registers.
The parse_operands change prevents unsuffixed Z registers (width==-1)
from being treated as though they had an Advanced SIMD-style suffix
(.4s etc.). It means that:
Error: expected element type rather than vector type at operand 2 -- `add za.s[w8,0],{z0-z1}'
becomes:
Error: missing type suffix at operand 2 -- `add za.s[w8,0],{z0-z1}'
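For reference, a correctly suffixed form of the same instruction would
be something like:
    add za.s[w8, 0], {z0.s - z1.s}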
SME2 adds lookup table instructions for quantisation. They use
a new lookup table register called ZT0.
LUTI2 takes an unsuffixed SVE vector index of the form Zn[<imm>],
which is the first time that this syntax has been used.
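An illustrative (hypothetical) use of the new index syntax, with the
registers and immediate chosen arbitrarily:
    luti2 z0.b, zt0, z1[0]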
Implementation-wise, the main things to note here are:
- the WHILE* instructions have forms that return a pair of predicate
registers. This is the first time that we've had lists of predicate
registers, and they wrap around after register 15 rather than after
register 31.
- the predicate-as-counter WHILE* instructions have a fourth operand
that specifies the vector length. We can treat this as an enumeration,
except that immediate values aren't allowed.
- PEXT takes an unsuffixed predicate index of the form PN<n>[<imm>].
This is the first instance of a vector/predicate index having
no suffix.
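For illustration, plausible forms of the instructions described above
(registers chosen arbitrarily):
    whilelo {p0.s, p1.s}, x0, x1    // predicate pair result
    whilelo pn8.s, x0, x1, vlx2     // predicate-as-counter, vector length
    pext    p0.s, pn8[1]            // unsuffixed predicate index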
SME2 adds LD1 and ST1 variants for lists of 2 and 4 registers.
The registers can be consecutive or strided. In the strided case,
2-register lists have a stride of 8, starting at register x0xxx.
4-register lists have a stride of 4, starting at register x00xx.
The instructions are predicated on a predicate-as-counter register in
the range pn8-pn15. Although we already had register fields with upper
bounds of 7 and 15, this is the first plain register operand to have a
nonzero lower bound. The patch uses the operand-specific data field
to record the minimum value, rather than having separate inserters
and extractors for each lower bound. This in turn required adding
an extra bit to the field.
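A sketch of plausible strided forms (z0 and pn8 chosen arbitrarily):
    ld1w {z0.s, z8.s}, pn8/z, [x0]               // 2 registers, stride 8
    ld1w {z0.s, z4.s, z8.s, z12.s}, pn8/z, [x0]  // 4 registers, stride 4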
SME2 defines new MOVA instructions for moving multiple registers
to and from ZA. As with SME, the instructions are also available
through MOV aliases.
One notable feature of these instructions (and many other SME2
instructions) is that some register lists must start at a multiple
of the list's size. The patch uses the general error "start register
out of range" when this constraint isn't met, rather than an error
specifically about multiples. This ensures that the error is
consistent between these simple consecutive lists and later
strided lists, for which the requirements aren't a simple multiple.
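For example (illustrative):
    mova {z0.d - z3.d}, za.d[w8, 0, vgx4]   // z0 is a multiple of 4
    mova {z1.d - z4.d}, za.d[w8, 0, vgx4]   // rejected: start register out of range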
SME2 adds a new format for the existing SVE predicate registers:
predicates as counters rather than predicates as masks. In assembly
code, operands that interpret predicates as counters are written
pn<N> rather than p<N>.
This patch adds support for these registers and extends some
existing instructions to support them. Since the new forms
are just a programmer convenience, there's no need to make them
more restrictive than the earlier predicate-as-mask forms.
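For example, where an existing instruction previously took p<N>, the
pn<N> spelling is now also accepted as a convenience:
    ldr pn8, [x0]    // equivalent to ldr p8, [x0]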
Some SME2 instructions operate on a range of consecutive ZA vectors.
This is indicated by syntax such as:
za[<Wv>, <imml>:<immh>]
As with the earlier vgx2 and vgx4 support, we get better error
messages if the parser allows all ZA indices to have a range.
We can then reject invalid cases during constraint checking.
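A hypothetical example of the range syntax (registers chosen arbitrarily):
    fmlal za.s[w8, 0:1], z0.h, z1.h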
Many SME2 instructions operate on groups of 2 or 4 ZA vectors.
This is indicated by adding a "vgx2" or "vgx4" group size to the
ZA index. The group size is optional in assembly but preferred
for disassembly.
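For example (illustrative):
    add za.s[w8, 0, vgx2], {z0.s, z1.s}   // explicit group size
    add za.s[w8, 0], {z0.s, z1.s}         // same instruction, size implied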
There is not a binary distinction between mnemonics that have
group sizes and mnemonics that don't, nor between mnemonics that
take vgx2 and mnemonics that take vgx4. We therefore get better
error messages if we allow any ZA index to have a group size
during parsing, and wait until constraint checking to reject
invalid sizes.
A quirk of the way errors are reported means that if an instruction
is wrong both in its qualifiers and its use of a group size, we'll
print suggested alternative instructions that also have an incorrect
group size. But that's a general property that also applies to
things like out-of-range immediates. It's also not obviously the
wrong thing to do. We need to be relatively confident that we're
looking at the right opcode before reporting detailed operand-specific
errors, so doing qualifier checking first seems reasonable.
SME2 adds various new fields that are similar to
AARCH64_OPND_SME_ZA_array, but are distinguished by the size of
their offset fields. This patch adds _off4 to the name of the
field that we already have.
Until now, binutils has supported register ranges such
as { v0.4s - v3.4s } as an unofficial shorthand for
{ v0.4s, v1.4s, v2.4s, v3.4s }. The SME2 ISA embraces this form
and makes it the preferred disassembly. It also embraces wrapped
lists such as { z31.s - z2.s }, which is something that binutils
didn't previously allow.
The range form was already binutils's preferred disassembly for 3- and
4-register lists. This patch prefers it for 2-register lists too.
The patch also adds support for wrap-around.
SME2 has instructions that accept strided register lists,
such as { z0.s, z4.s, z8.s, z12.s }. The purpose of this
patch is to extend binutils to support such lists.
The parsing code already had (unused) support for strides of 2.
The idea here is instead to accept all strides during parsing
and reject invalid strides during constraint checking.
The SME2 instructions that accept strided operands also have
non-strided forms. The errors about invalid strides therefore
take a bitmask of acceptable strides, which allows multiple
possibilities to be summed up in a single message.
I've tried to update all code that handles register lists.
Some FLD_imm* suffixes used a counting scheme such as FLD_immN,
FLD_immN_2, FLD_immN_3, etc., while others used the lsb as the
suffix. The latter seems more mnemonic, and was a big help
in doing the SME2 work.
Similarly, the _10 suffix on FLD_SME_size_10 was nonobvious.
Presumably it indicated a 2-bit field, but the field actually starts
at bit 22.
Quite a lot of SME2 instructions have an opcode bit that selects
between 32-bit and 64-bit forms of an instruction, with the 32-bit
forms being part of base SME2 and with the 64-bit forms being part
of an optional extension. It's nevertheless useful to have a single
opcode entry for both forms since (a) that matches the ISA definition
and (b) it tends to improve error reporting.
This patch therefore adds a libopcodes function called
aarch64_cpu_supports_inst_p that tests whether the target
supports a particular instruction. In future it will depend
on internal libopcodes routines.
This patch renames the OP_SME_* macros in aarch64-tbl.h so that
they follow the same scheme as the OP_SVE_* ones. It also uses
OP_SVE_ as the prefix, since there is no real distinction between
the SVE and SME uses of qualifiers: a macro defined for one can
be useful for the other too.
If an instruction has invalid qualifiers, GAS would report the
error against the final opcode entry that got to the qualifier-
checking stage. It seems better to report the error against
the opcode entry that had the closest match, just like we
pick the closest match within an opcode entry for the
"did you mean this?" message.
This patch adds the number of invalid operands as an
argument to AARCH64_OPDE_INVALID_VARIANT and then picks the
AARCH64_OPDE_INVALID_VARIANT with the lowest argument.
AARCH64_OPDE_REG_LIST took a single operand that specified the
expected number of registers. However, there are quite a few
SME2 instructions that have both 2-register forms and (separate)
4-register forms. If the user tries to use a 3-register list,
it isn't obvious which opcode entry they meant. Saying that we
expect 2 registers and saying that we expect 4 registers would
both be wrong.
This patch therefore switches the operand to a bitfield. If an
AARCH64_OPDE_REG_LIST is reported against multiple opcode entries,
the patch ORs together the expected lengths.
This has no user-visible effect yet. A later patch adds more error
strings, alongside tests that use them.
SVE register lists were classified as SVE_REG, since there had been
no particular reason to separate them out. However, some SME2
instructions have tied register list operands, and so we need to
distinguish registers and register lists when checking whether two
operands match.
Also, the register list operands used a general error message,
even though we already have a dedicated error code for register
lists that are the wrong length.
libopcodes currently reports out-of-range registers as a general
AARCH64_OPDE_OTHER_ERROR. However, this means that each register
range needs its own hard-coded string, which is a bit cumbersome
if the range is determined programmatically. This patch therefore
adds a dedicated error type for out-of-range errors.
In SME, the vector select register had to be in the range
w12-w15, so it made sense to enforce that during parsing.
However, SME2 adds instructions for which the range is
w8-w11 instead.
This patch therefore moves the range check from the parsing
stage to the constraint-checking stage.
Also, the previous error used a capitalised range W12-W15,
whereas other register range errors used lowercase ranges
like p0-p7. A quick internal poll showed a preference for
the lowercase form, so the patch uses that.
The patch uses "selection register" rather than "vector
select register" so that the terminology extends more
naturally to PSEL.
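For example (illustrative, registers chosen arbitrarily):
    mova z0.s, p0/m, za0h.s[w12, 0]              // SME: selection register w12-w15
    fmla za.s[w8, 0, vgx2], {z0.s, z1.s}, z2.s   // SME2: w8-w11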
This patch moves the range checks on ZA vector select offsets from
gas to libopcodes. Doing the checks there means that the error
messages contain the expected range. It also fits in better
with the error severity scheme, which becomes important later.
(This is because out-of-range indices are treated as more severe than
syntax errors, on the basis that parsing must have succeeded if we get
to the point of checking the completed opcode.)
The patch also adds a new check_za_access function for checking
ZA accesses. That's a bit over the top for one offset check, but the
function becomes more complex with later patches.
sme-9-illegal.s checked for an invalid .q suffix using:
psel p1, p15, p3.q[w15]
but this is doubly invalid because it misses the immediate part
of the index. The patch keeps that test but adds another with
a zero index, so that .q is the only thing wrong.
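That is, roughly:
    psel p1, p15, p3.q[w15]       // invalid suffix and missing immediate
    psel p1, p15, p3.q[w15, 0]    // invalid only because of the .q suffix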
The aarch64-tbl.h change includes neatening up the backslash
positions.
A later patch moves the range checking for ZA vector select
offsets from gas to libopcodes. That in turn requires the
immediate field to be big enough to support all parsed values.
This shouldn't be a particularly size-sensitive structure,
so there should be no memory problems with doing this.
za_tile_vector is also used for indexing ZA as a whole, rather than
just for indexing tiles. The former is more common than the latter
in SME2, so this patch generalises the name to "indexed_za".
The patch also names the associated structure, so that later patches
can reuse it during parsing.
We already treat the ZA tiles ZA0-ZA15 as registers. This patch
does the same for ZA itself. parse_sme_zero_mask can then parse
ZA tiles and ZA in the same way, through parsed_type_reg.
One important effect of going through parsed_type_reg (in general)
is that it allows ZA to take qualifiers. This is necessary for many
SME2 instructions.
However, to support existing unqualified uses of ZA, parse_reg_with_qual
needs to treat the qualifier as optional. Hopefully the net effect is
to give better error messages, since now that SME2 makes "za.<T>"
valid in some contexts, it might be natural to use it (incorrectly)
in ZERO too.
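For example (illustrative):
    zero {za}      // existing unqualified use, still valid
    zero {za.s}    // now parsed, then rejected with a more specific error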
While there, the patch also tweaks the error messages for invalid
ZA tiles, to try to make some cases more specific.
For now, parse_sme_za_array just uses parse_reg, rather than
parse_typed_reg/parse_reg_with_qual. A later patch consolidates
the parsing further.
This patch makes all SME instructions use F_STRICT, so that qualifiers
have to be provided explicitly rather than being inferred from other
operands. The main change is to move the qualifier setting from the
operand-level decoders to the opcode level.
This is one step towards consolidating the ZA parsing code and
extending it to handle SME2.
In the register-index forms of PRFM, the unallocated prefetch opcodes
24-31 have been reused for the encoding of the new RPRFM instruction.
The PRFM opcode space is now capped at 23 for these forms. The other
forms of PRFM are unaffected.
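For example (illustrative):
    prfm  pldl1keep, [x0, x1]    // prefetch opcodes 0-23, still PRFM
    rprfm pldkeep, x1, [x0]      // reuses the former opcode space 24-31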
Most extension flags are named after the associated architectural
FEAT_* flags, but sme-i64 and sme-f64 were exceptions. This patch
adds sme-i16i64 and sme-f64f64 aliases, but keeps the old names too
for compatibility.
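For example, both spellings are now accepted:
    .arch_extension sme-i64       // old name, kept for compatibility
    .arch_extension sme-i16i64    // new FEAT_-style alias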
With VexVVVV only being boolean, the SSE shift-by-immediate instructions
don't need special casing anymore for SSE2AVX handling. Simplify the two
respective templates. (No change to generated tables.)
With the SDM long having dropped the NDS/NDD/DDS concept of identifying
encoding variants, we can finally do away with this concept as well. Of
the few consumers of the attribute, only an assertion was still checking
for a particular value, which we don't really need to retain.
When touching lines anyway, modernize other aspects as well. This often
improves similarity to adjacent lines.
The function has accumulated a number of special cases for no real
reason. Some were necessary because insn attributes (SwapSources in
particular) weren't suitably utilized instead. Note that the addition of
SwapSources actually increases consistency among the templates: Like
others which already have the attribute, these are all insns where the
VEX.VVVV-encoded register comes first (or last when looking at the SDM).
Note that the vexvvvv attribute now carries merely boolean meaning, in
line with the SDM long having dropped the NDS/NDD/DDS concept of
identifying encoding variants. The fallout will be taken care of
subsequently, though, so as not to further clutter the change here.
As to the TILEZERO special case: If more instructions like this
appeared, a new attribute would likely be the way to go. But as long as
it's only a single insn, going from the mnemonic is cheaper.
This reverts commit 92d450c79a.
Accessing these local var structs using a volatile qualified pointer
may indeed read the object, but I don't think changed values are
guaranteed to be written back to the object unless the actual object
is declared volatile. That would probably slow down i386 disassembly
unacceptably.
The following commit added libopcodes styling for m68k:
commit c22ff44927
Date: Tue Feb 14 18:07:19 2023 +0100
opcodes: style m68k disassembler output
but didn't set disassemble_info::created_styled_output in
disassemble.c, which is needed in order for GDB to start using the
libopcodes based styling.
This commit fixes this small oversight. GDB now styles correctly.
The feature isn't universally available on 64-bit CPUs.
Note that in i386-gen.c:isa_dependencies[] I'm only adding it to models
where I'm certain the functionality exists. I'm uncertain about Nocona
and Core in particular.
While MOV to/from segment register as well as selector storing insns
already permit 32- and 64-bit GPR operands, selector loading insns and
ARPL do not. Split templates accordingly.
For shifts (but not ordinary rotates) and other cases where an immediate
describes e.g. a bit count or position, allowing negative operands is at
best confusing. An extreme example would be the two rotate-through-carry
insns, where a negative value would _not_ mean rotating the
corresponding number of bits in the other direction. To refuse such,
give meaning to the combination of Imm8 and Imm8S in templates (so far
these weren't used together anywhere). The issue was with
smallest_imm_type() blindly setting .imm8 for signed numbers determined
to fit in a byte.
VPROT{B,W,D,Q} is a little special: The rotate count there is a signed
quantity, so Imm8 is replaced by Imm8S. Adjust affected testcases
accordingly as well.
Another small adjustment to the testsuite is necessary: AAM and AAD were
never sensible to use with 0xffffff90 operands. This should have been an
error.
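For example (AT&T syntax):
    shl $2, %eax     # still fine
    shl $-2, %eax    # now rejected rather than silently accepted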
Just like we suppress emitting REX.W for e.g. MOV from/to segment
register, there's also no need for it for LAR and LSL - these can only
ever return 32-bit values and hence always zero-extend their results
anyway.
While there also drop the redundant Word from the first operand of
the second template each - this is already implied by Reg16.
In 64-bit mode BT can have REX.W or a data size prefix dropped in
certain cases. Outside of 64-bit mode all 4 insns can have the data
size prefix dropped in certain cases.
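A hypothetical illustration (assuming optimization is enabled, e.g. via -O):
    btq $1, (%rax)   # may be encoded without REX.W, like btl $1, (%rax)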
Before commit:
commit 2438b771ee
Date: Wed Nov 2 15:53:43 2022 +0000
opcodes/mips: use .word/.short for undefined instructions
unknown 32-bit microMIPS instructions were disassembled as a raw
32-bit number with no '.word' directive. The above commit changed
this and added a '.word' directive before the 32-bit number.
It was pointed out on the mailing list that for microMIPS it would be
better to display such 32-bit instructions using a '.short' directive
followed by two 16-bit values.
This commit updates the mips disassembler to do this, and adds a new
test that validates this output.
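For instance, a hypothetical unknown encoding 0xabcd1234 that was
previously shown as:
    .word 0xabcd1234
would now be shown as:
    .short 0xabcd, 0x1234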
The attribute really specifies that the sum of register and memory
operands is 4. Express it like that in most places, while using the 2nd
(apart from XOP) CPU feature flags (FMA4) in reversed operand matching
logic.
With the use in build_modrm_byte() gone, part of an assertion there
also becomes meaningless - simplify that at the same time.
With all uses of the opcode modifier field gone, also drop that.
The few XOP insns which used it wrongly didn't have VexVVVV specified.
With that added, the only further missing piece to use more generic code
elsewhere is SwapSources - see e.g. the BMI2 insns for similar operand
patterns.
With the only users gone, drop the #define as well as the special case
code.
The VPROT* forms with an immediate operand are entirely standard in the
way their ModR/M bytes are built. There's no reason to invoke special
case code. With that the handling of an immediate there can also be
dropped; it was partially bogus anyway, as in its "no memory operands"
portion it ignores the possibility of an immediate operand (which was
okay only because that case was already handled by more generic code).
This really isn't a "modifier" and rather ought to live next to the base
opcode anyway. Use the bits we presently have available to fit in the
field, renaming it to opcode_space. As an intended side effect this
helps readability at the use sites, by shortening the references quite a
bit.
In generated code arrange for human readable output, by using the
SPACE_* constants there rather than raw numbers. This may aid debugging
down the road.
Regenerating BPF target using the maintainer mode emits:
.../opcodes/bpf-opc.c:57:11: error: conversion from ‘long unsigned int’ to ‘unsigned int’ changes value from ‘18446744073709486335’ to ‘4294902015’ [-Werror=overflow]
57 | 64, 64, 0xffffffffffff00ff, { { F (F_IMM32) }, { F (F_OFFSET16) }, { F (F_SRCLE) }, { F (F_OP_CODE) }, { F (F_DSTLE) }, { F (F_OP_SRC) }, { F (F_OP_CLASS) }, { 0 } }
The use of a narrow type to handle the mask in the CGEN instruction
format is causing this error. Additionally, eBPF `call' instructions
constructed by expressions using symbols (BPF_PSEUDO_CALL) emit
annotations in the `src' field of the instruction, which is used to
identify the BPF target endianness.
cpu/
	* bpf.cpu (define-call-insn): Remove `src' field from
	instruction mask.
include/
	* opcode/cgen.h (CGEN_IFMT): Adjust mask bit width.
opcodes/
	* bpf-opc.c: Regenerate.
Insn width granularity being 16 bits, producing byte granular output
isn't very useful. With there being a way to specify otherwise
unknown insns to the assembler, use that same representation (to be
precise: its <length>,<encoding> flavor) for disassembly.
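For example, a hypothetical unknown 4-byte encoding might now be shown as:
    .insn 4, 0x0000107f
rather than as raw bytes.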
Along with the normal JAL alias, the C-extension one should have been
moved as well by 839189bc93 ("RISC-V: re-arrange opcode table for
consistent alias handling"), for the assembler to actually be able to
use it where/when possible.
Since neither this nor any other compressed branch insn was being tested
so far, take the opportunity and introduce a new testcase covering those.
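For example (illustrative, RV32 with the C extension):
    jal ra, target   # can now be assembled to c.jal when in range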
While one of the two actually having been instrumented (i386-gen.c) now
has that instrumentation dropped, there's still no point in honoring
such instrumentation in general (i.e. now for ia64-gen.c only), as that
only leads to a waste of resources.
With CFILES then being merely an alias of LIBOPCODES_CFILES, drop the
former variable altogether.
All glibc malloc() implementations I've checked have a smallest
allocation size worth of 3 pointers, with an increment worth of 2
pointers. Hence mnemonics with multiple templates can be stored more
efficiently when maintaining the shared "name" field only in the actual
hash entry. (To express the shared nature, also convert "name" to a
pointer-to-const.)
While doing the conversion also pull out common code from the involved
if/else construct in expand_templates().
Register names are (including their nul terminators) on average almost 4
bytes long. Otoh no register name is longer than 8 bytes. Hence even for
32-bit builds using a pointer is only slightly more space efficient than
embedding the strings. A level of indirection can be also avoided by
embedding the names as an array of 8 characters directly in the arrays,
and the number of base relocations in libopcodes.so (or PIE builds of
statically linked executables) goes down as well.
To amortize for the otherwise reduced folding of string literals by the
linker, use att_names_seg[] in place of string literals in append_seg()
and OP_ESreg().
Register names are (including their nul terminators) on average almost 4
bytes long. Otoh no register name is longer than 7 bytes. Hence even for
32-bit builds using a pointer is only slightly more space efficient than
embedding the strings. A level of indirection can be also avoided by
embedding the names as an array of 8 characters directly in the struct,
and the number of base relocations in PIE builds of gas goes down as
well.
When generating the mnemonic string table we already set up an
identifier for the following entry in a number of cases. Re-use that on
the next loop iteration rather than re-doing allocation and conversion.
Compact the mnemonic string table such that the tails of longer
mnemonics are re-used for shorter ones, going beyond what compilers
would typically do, but matching what ELF linkers may do when processing
SHF_MERGE|SHF_STRINGS sections. This reduces table size by about 12.5%.
Using full pointers to reference the insn mnemonic strings is not very
efficient. With overall string size presently just slightly over 20k,
even a 16-bit value would suffice. Use "unsigned int" for now, as
there's no good use we could presently make of the otherwise saved 16
bits.
For 64-bit builds this reduces table size by 6.25% (prior to the recent
ISA extension additions it would have been 12.5%), with a similar effect
on cache occupation of table entries accessed. For PIE builds of gas
this also reduces the number of base relocations quite a bit (obviously
independent of bitness).
The newer update-copyright.py fixes file encoding too, removing cr/lf
on binutils/bfdtest2.c and ld/testsuite/ld-cygwin/exe-export.exp, and
embedded cr in binutils/testsuite/binutils-all/ar.exp string match.
TSXLDTRK takes RTM as a prereq. Additionally introduce an umbrella "tsx"
extension option covering both RTM and HLE, paralleling the "abm" one we
already have.
SEV-ES is an extension to SVME. SNP in turn is an extension to SEV-ES,
and yet in turn RMPQUERY is a SNP extension.
Note that cpu_arch[] has no SNP entry, so CPU_ANY_SNP_FLAGS remains
unused (just like CPU_SNP_FLAGS already is).
Like various other features AMX-TILE takes XSAVE as a prereq.
XSAVES, unconditionally using compacted format, in turn effectively
takes XSAVEC as a prereq (an SDM clarification to this effect is in the
works).
Like AVX512-FP16, several other extensions require wider than 16-bit
mask registers. As a result they take AVX512BW as a prereq, not (just)
AVX512F. Which in turn points out wrong expectations in the noavx512-1
testcase.
SSE itself takes FXSR as a prereq. Like AES, PCLMUL, and SHA, both GFNI
and KL take SSE2 as a prereq, for operating on packed integers. And
while correcting KL, also record it as a prereq to WIDEKL.
Getting both forward and reverse ISA dependencies right / consistent has
been a permanent source of mistakes. Reduce what needs specifying
manually to just the direct forward dependencies. Transitive forward
dependencies as well as reverse ones are now derived and hence cannot go
out of sync anymore (at least in the vast majority of cases; there are a
few special cases to still take care of manually). In the course of this
several CPU_ANY_*_FLAGS disappear, requiring adjustment to the
assembler's cpu_arch[].
Note that to retain the correct reverse dependency of AVX512F wrt
AVX512-VP2INTERSECT, the latter has the previously missing AVX512F
prereq added.
Note further that to avoid adding the following undue prereqs:
* ATHLON, K8, and AMDFAM10 gain CMOV and FXSR,
* IAMCU gains 387,
auxiliary table entries (including a colon-separated modifier) are
introduced in addition to the ones resulting from converting the old
table.
To maintain forward-only dependencies between AVX (XOP) and SSE* (SSE4a)
(i.e. "nosse" not disabling AVX), reverse dependency tracking is
artificially suppressed.
As a side effect disabling of SSE or SSE2 will now also disable AES,
PCLMUL, and SHA (respective elements were missing from
CPU_ANY_SSE2_FLAGS).
While originally indeed used for register size checking only, the
attribute has been used for memory operand size checking as well already
for quite a while, with more such uses recently having been added.
Having a "None" field in the vast majority of entries is needlessly
cluttering the overall table. Instead of this being a separate field,
use a representation matching that of Intel SDM and AMD PM for the main
use of the field: Append the value after a / as the separator.
PR gas/29524
Having templates with a suffix explicitly present has always been
quirky. After prior adjustment all that's left to also eliminate the
anomaly from move-with-sign-extend is to consolidate the insn templates
and to make may_need_pass2() cope (plus extend testsuite coverage).
The need for them on the operand-less string insns has gone away with
the removal of maybe_adjust_templates() and associated logic. Since
i386_index_check() needs adjustment then anyway, take the opportunity
and also simplify it, possible again as a result of said removal (plus
the opcode template adjustments done here).
Having templates with a suffix explicitly present has always been
quirky. Introduce a 2nd matching pass in case the 1st one couldn't find
a suitable template _and_ didn't itself already need to trim off a
suffix to find a match at all. This requires error reporting adjustments
(albeit luckily fewer than I was afraid might be necessary), as errors
previously reported during matching now need deferring until after the
2nd pass (because, obviously, we must not emit any error if the 2nd pass
succeeds). While also related to PR gas/29524, it was requested that
move-with-sign-extend be left as broken as it always was.
PR gas/29525
Note that with the dropped CMPSD and MOVSD Intel Syntax string insn
templates taking operands, mixed IsString/non-IsString template groups
(with memory operands) cannot occur anymore. With that
maybe_adjust_templates() becomes unnecessary (and is hence being
removed).
PR gas/29526
Note further that while the additions to the intel16 testcase aren't
really proper Intel syntax, we've been permitting all of those except
for the MOVD variant. The test therefore is to avoid re-introducing such
an inconsistency.
This reverts the disassembler parts of 859aa2c86d ("x86: Allow 16-bit
register source for LAR and LSL"), adjusting testcases as necessary.
That change was itself a partial revert of c9f5b96bda ("x86: correct
handling of LAR and LSL"), without actually saying so. While the earlier
commit was properly agreed upon, the partial revert was not, and hence
should not have been committed. This is even more so that the revert
part of that change wasn't even necessary to address PR gas/29844.
Speed up gas startup by avoiding runtime allocation of the instances of
type "templates". At the same time cut the memory requirement to just
very little over half (not even accounting for any overhead
notes_alloc() may incur) by reusing the "end" slot of a preceding entry
for the "start" slot of the subsequent one.
Now that the table is local to gas, ARRAY_SIZE() can be used to
determine the end of the table. Re-arrange the processing loop in
md_begin() accordingly, at the same time folding the two calls to
notes_alloc() into just one.
Remove the now empty i386-opc.c. To compensate, tie table generation in
opcodes/ to the building of i386-dis.o, despite the file not really
depending on the generated data.
Unlike many other architectures, x86 does not share an opcode table
between assembly and disassembly. Any consumer of libopcodes would only
ever access one of the two. Since gas is the only consumer of the
assembly data, move it there. While doing so mark respective entities
"static" in i386-gen (we may want to do away with i386_regtab_size
altogether).
This also shrinks the number of relocations to be processed for
libopcodes.so by about 30%.
All the xh_mode uses in the table go through %XH, which prints "{bad}"
when EVEX.W=1. This makes the vex.w check unnecessary.
opcodes/ChangeLog:
	* i386-dis.c (OP_E_memory): Remove vex.w check for xh_mode.
This commit adds disassembler styling support for MIPS. After this
commit objdump and GDB will style disassembler output.
This is a pretty straightforward change: we switch to using the
disassemble_info::fprintf_styled_func callback and pass an
appropriate style through as needed. No additional tricks were
needed (compared to, say, i386 or ARM).
Tested by running all of the objdump commands used by the gas
testsuite and manually inspecting the styled output; everything looks
reasonable, though I'm not a MIPS expert, so it is possible that I've
missed some corner cases. Worst case, though, is that something will be
styled incorrectly; the actual content should be unchanged.
All the gas, ld, and binutils tests still pass for me.