Commit Graph

215660 Commits

Author SHA1 Message Date
Torbjörn SVENSSON
b12bc0bd59 testsuite: arm: Use effective-target for thumb2-slow-flash-data* tests
Update test cases to use -mcpu=unset/-march=unset feature introduced in
r15-3606-g7d6c6a0d15c.

gcc/testsuite/ChangeLog:

	* gcc.target/arm/thumb2-slow-flash-data-2.c: Use
	effective-target arm_arch_v7em_hard.
	* gcc.target/arm/thumb2-slow-flash-data-3.c: Likewise.
	* gcc.target/arm/thumb2-slow-flash-data-4.c: Likewise.
	* gcc.target/arm/thumb2-slow-flash-data-5.c: Likewise.

Signed-off-by: Torbjörn SVENSSON <torbjorn.svensson@foss.st.com>
2024-11-21 07:51:48 +01:00
Torbjörn SVENSSON
f55cc57c6e testsuite: arm: Use effective-target for small-multiply-m* tests
Update test cases to use -mcpu=unset/-march=unset feature introduced in
r15-3606-g7d6c6a0d15c.

gcc/testsuite/ChangeLog:

	* gcc.target/arm/small-multiply-m0-1.c: Use effective-target
	arm_arch_v6m and added option "-march=unset".
	* gcc.target/arm/small-multiply-m0-2.c: Likewise.
	* gcc.target/arm/small-multiply-m0-3.c: Likewise.
	* gcc.target/arm/small-multiply-m0plus-1.c: Likewise.
	* gcc.target/arm/small-multiply-m0plus-2.c: Likewise.
	* gcc.target/arm/small-multiply-m0plus-3.c: Likewise.
	* gcc.target/arm/small-multiply-m1-1.c: Likewise.
	* gcc.target/arm/small-multiply-m1-2.c: Likewise.
	* gcc.target/arm/small-multiply-m1-3.c: Likewise.
	* lib/target-supports.exp: Define effective-target
	arm_cpu_cortex_m0_small, arm_cpu_cortex_m0plus_small and
	arm_cpu_cortex_m1_small.

Signed-off-by: Torbjörn SVENSSON <torbjorn.svensson@foss.st.com>
2024-11-21 07:51:42 +01:00
Torbjörn SVENSSON
703839b8bd testsuite: arm: Use effective-target for pure-code/* tests
Update test cases to use -mcpu=unset/-march=unset feature introduced in
r15-3606-g7d6c6a0d15c.

gcc/testsuite/ChangeLog:

	* gcc.target/arm/pure-code/no-literal-pool-m0.c: Use
	effective-target arm_cpu_cortex-m0.
	* gcc.target/arm/pure-code/no-literal-pool-m23.c: Use
	effective-target arm_cpu_cortex-m23.
	* gcc.target/arm/pure-code/pr94538-1.c: Likewise.
	* gcc.target/arm/pure-code/pr109800.c: Use effective-target
	arm_arch_v7em_hard.
	* lib/target-supports.exp: Define effective-target
	arm_cpu_cortex_m0, arm_cpu_cortex_m23 and arm_arch_v7em_hard.

Signed-off-by: Torbjörn SVENSSON <torbjorn.svensson@foss.st.com>
2024-11-21 07:50:30 +01:00
Torbjörn SVENSSON
0380051bba testsuite: arm: Use effective-target for crc_hf_1.c test
Update test cases to use -mcpu=unset/-march=unset feature introduced in
r15-3606-g7d6c6a0d15c.

gcc/testsuite/ChangeLog:

	* gcc.target/arm/acle/crc_hf_1.c: Use effective-target
	arm_arch_v8a_crc_hard.
	* lib/target-supports.exp: Define effective-target
	arm_arch_v8a_crc_hard.

Signed-off-by: Torbjörn SVENSSON <torbjorn.svensson@foss.st.com>
2024-11-21 07:49:11 +01:00
Torbjörn SVENSSON
dc044641a0 testsuite: arm: Use effective-target for pacbti-m-predef* tests
Update test cases to use -mcpu=unset/-march=unset feature introduced in
r15-3606-g7d6c6a0d15c.

gcc/testsuite/ChangeLog:

	* gcc.target/arm/acle/pacbti-m-predef-1.c: Use effective-target
	arm_arch_v8_1m_main.
	* gcc.target/arm/acle/pacbti-m-predef-2.c: Likewise.
	* gcc.target/arm/acle/pacbti-m-predef-3.c: Likewise.
	* gcc.target/arm/acle/pacbti-m-predef-4.c: Likewise.
	* gcc.target/arm/acle/pacbti-m-predef-5.c: Likewise.
	* gcc.target/arm/acle/pacbti-m-predef-6.c: Likewise.
	* gcc.target/arm/acle/pacbti-m-predef-8.c: Likewise.
	* gcc.target/arm/acle/pacbti-m-predef-9.c: Likewise.
	* gcc.target/arm/acle/pacbti-m-predef-10.c: Likewise.

Signed-off-by: Torbjörn SVENSSON <torbjorn.svensson@foss.st.com>
2024-11-21 07:48:39 +01:00
Torbjörn SVENSSON
3ae9d01eb4 testsuite: arm: Use effective-target for bti* and pac* tests
Update test cases to use -mcpu=unset/-march=unset feature introduced in
r15-3606-g7d6c6a0d15c.

gcc/testsuite/ChangeLog:

	* gcc.target/arm/pac-1.c: Use effective-target
	arm_arch_v8_1m_main_pacbti.
	* gcc.target/arm/pac-2.c: Likewise.
	* gcc.target/arm/pac-3.c: Likewise.
	* gcc.target/arm/pac-4.c: Likewise.
	* gcc.target/arm/pac-5.c: Likewise.
	* gcc.target/arm/pac-7.c: Likewise.
	* gcc.target/arm/pac-8.c: Likewise.
	* gcc.target/arm/pac-9.c: Likewise.
	* gcc.target/arm/pac-10.c: Likewise.
	* gcc.target/arm/pac-11.c: Likewise.
	* gcc.target/arm/pac-12.c: Added option "-mcpu=unset".
	* gcc.target/arm/pac-13.c: Likewise.
	* gcc.target/arm/pac-14.c: Likewise.
	* lib/target-supports.exp
	(check_effective_target_arm_pacbti_hw): Likewise.
	* gcc.target/arm/pac-6.c: Use effective-target
	arm_arch_v8_1m_main.
	* gcc.target/arm/pac-15.c: Use effective-target
	arm_arch_v8_1m_main_pacbti and added option "-mcpu=unset".

Signed-off-by: Torbjörn SVENSSON <torbjorn.svensson@foss.st.com>
Co-authored-by: Yvan ROUX <yvan.roux@foss.st.com>
2024-11-21 07:45:50 +01:00
GCC Administrator
cf261dd522 Daily bump. 2024-11-21 00:20:27 +00:00
Lewis Hyatt
81c29232b6
tree-cfg: Fix call to next_discriminator_for_locus()
While testing future 64-bit location_t support, I ran into an
-fcompare-debug issue that was traced back here. Despite the name,
next_discriminator_for_locus() is meant to take an integer line number
argument, not a location_t. There is one call site which has been passing a
location_t instead. For the most part that is harmless, although in case
there are two CALL stmts on the same line with different location_t, it may
fail to generate a unique discriminator where it should. If/when location_t
changes to be 64-bit, however, it will produce an -fcompare-debug
failure. Fix it by passing the line number rather than the location_t.

I am not aware of a testcase that demonstrates any observable wrong
behavior, but the file debug/pr53466.C is an example where the discriminator
assignment is indeed different before and after this change.

gcc/ChangeLog:

	* tree-cfg.cc (assign_discriminators): Fix incorrect value passed to
	next_discriminator_for_locus().
2024-11-20 18:08:57 -05:00
Gaius Mulley
26f3efccaa PR modula2/117703: libgm2 soname bumps for GCC 15
Bump libgm2 version ready for the gcc-15 release.

libgm2/ChangeLog:

	PR modula2/117703
	* configure: Regenerate.
	* configure.ac (libtool_VERSION): Bump to 20:0:0.

Signed-off-by: Gaius Mulley <gaiusmod2@gmail.com>
2024-11-20 22:17:30 +00:00
Harald Anlauf
3c130e410a Fortran: fix checking of protected variables in submodules [PR83135]
When a symbol was use-associated in the ancestor of a submodule, a
PROTECTED attribute was ignored in the submodule or its descendants.
Find the real ancestor of symbols when used in a variable definition
context in a submodule.

	PR fortran/83135

gcc/fortran/ChangeLog:

	* expr.cc (sym_is_from_ancestor): New helper function.
	(gfc_check_vardef_context): Refine checking of PROTECTED attribute
	of symbols that are indirectly use-associated in a submodule.

gcc/testsuite/ChangeLog:

	* gfortran.dg/protected_10.f90: New test.
2024-11-20 23:07:47 +01:00
Joseph Myers
d5cebf7e44 c: Diagnose compound literal for empty array [PR114266]
As reported in bug 114266, GCC fails to pedwarn for a compound
literal, whose type is an array of unknown size, initialized with an
empty initializer.  This case is disallowed by C23 (which doesn't have
zero-size objects); the case of a named object is diagnosed as
expected, but not that for compound literals.  (Before C23, the
pedwarn for empty initializers sufficed.)  Add a check for this
specific case with a pedwarn.

Bootstrapped with no regressions for x86_64-pc-linux-gnu.

	PR c/114266

gcc/c/
	* c-decl.cc (build_compound_literal): Diagnose array of unknown
	size with empty initializer for C23.

gcc/testsuite/
	* gcc.dg/c23-empty-init-4.c: New test.
2024-11-20 21:29:48 +00:00
Antoni Boucher
cf544af03a libgccjit: Add support for setting the comment ident
gcc/jit/ChangeLog:

	* docs/topics/compatibility.rst (LIBGCCJIT_ABI_34): New ABI tag.
	* docs/topics/contexts.rst: Document gcc_jit_context_set_output_ident.
	* jit-playback.cc (set_output_ident): New method.
	* jit-playback.h (set_output_ident): New method.
	* jit-recording.cc (recording::context::set_output_ident,
	recording::output_ident::output_ident,
	recording::output_ident::~output_ident,
	recording::output_ident::replay_into,
	recording::output_ident::make_debug_string,
	recording::output_ident::write_reproducer): New methods.
	* jit-recording.h (class output_ident): New class.
	* libgccjit.cc (gcc_jit_context_set_output_ident): New function.
	* libgccjit.h (gcc_jit_context_set_output_ident): New function.
	* libgccjit.map: New function.

gcc/testsuite/ChangeLog:

	* jit.dg/all-non-failing-tests.h: New test.
	* jit.dg/test-output-ident.c: New test.
2024-11-20 16:10:59 -05:00
Antoni Boucher
d8cf8917ed libgccjit: Add support for creating temporary variables
gcc/jit/ChangeLog:

	* docs/topics/compatibility.rst (LIBGCCJIT_ABI_33): New ABI tag.
	* docs/topics/functions.rst: Document gcc_jit_function_new_temp.
	* jit-playback.cc (new_local): Add support for temporary
	variables.
	* jit-recording.cc (recording::function::new_temp): New method.
	(recording::local::write_reproducer): Support temporary
	variables.
	* jit-recording.h (new_temp): New method.
	* libgccjit.cc (gcc_jit_function_new_temp): New function.
	* libgccjit.h (gcc_jit_function_new_temp): New function.
	* libgccjit.map: New function.

gcc/testsuite/ChangeLog:

	* jit.dg/all-non-failing-tests.h: Mention test-temp.c.
	* jit.dg/test-temp.c: New test.
2024-01-18 17:54:59 -05:00
Vladimir N. Makarov
56fc6a6d9e [PR116587][LRA]: Fix last chance reload pseudo allocation
On i686 PR116587 test compilation resulted in LRA failure to find
registers for a reload insn pseudo.  The insn requires 6 regs for 4
reload insn pseudos where two of them require 2 regs each.  But we
have only 5 free regs as sp is a fixed reg, bp is fixed because of
-fno-omit-frame-pointer, bx is assigned to pic_offset_table_pseudo
because of -fPIC.  LRA spills pic_offset_table_pseudo as the last
chance approach to allocate registers to the reload pseudo.  Although
it makes 2 free registers for the unallocated reload pseudo requiring
also 2 regs, the pseudo still can not be allocated as the 2 free regs
are disjoint.  The patch spills all pseudos conflicting with the
unallocated reload pseudo including already allocated reload insn
pseudos, then standard LRA code allocates spilled pseudos requiring
more one register first and avoid situation of the disjoint regs for
reload pseudos requiring more one reg.

gcc/ChangeLog:

	PR target/116587
	* lra-assigns.cc (find_all_spills_for): Consider all pseudos whose
	classes intersect given pseudo class.

gcc/testsuite/ChangeLog:

	PR target/116587
	* gcc.target/i386/pr116587.c: New test.
2024-11-20 14:30:33 -05:00
Antoni Boucher
87f0136fa4 libgccjit: Add support for machine-dependent builtins
gcc/jit/ChangeLog:
	PR jit/108762
	* docs/topics/compatibility.rst (LIBGCCJIT_ABI_32): New ABI tag.
	* docs/topics/functions.rst: Add documentation for the function
	gcc_jit_context_get_target_builtin_function.
	* dummy-frontend.cc: Include headers target.h, jit-recording.h,
	print-tree.h, unordered_map and string, new variables (target_builtins,
	target_function_types, and target_builtins_ctxt), new function
	(tree_type_to_jit_type).
	* jit-builtins.cc: Specify that the function types are not from
	target builtins.
	* jit-playback.cc: New argument is_target_builtin to new_function.
	* jit-playback.h: New argument is_target_builtin to
	new_function.
	* jit-recording.cc: New argument is_target_builtin to
	new_function_type, function_type constructor and function
	constructor, new function
	(get_target_builtin_function).
	* jit-recording.h: Include headers string and unordered_map, new
	variable target_function_types, new argument is_target_builtin
	to new_function_type, function_type and function, new functions
	(get_target_builtin_function, copy).
	* libgccjit.cc: New function
	(gcc_jit_context_get_target_builtin_function).
	* libgccjit.h: New function
	(gcc_jit_context_get_target_builtin_function).
	* libgccjit.map: New functions
	(gcc_jit_context_get_target_builtin_function).

gcc/testsuite:
	PR jit/108762
	* jit.dg/all-non-failing-tests.h: New test test-target-builtins.c.
	* jit.dg/test-target-builtins.c: New test.
2024-11-20 14:03:57 -05:00
Andrew Pinski
beab0a3ecb aarch64: Fix aarch64 after moving to C23
This fixes a few aarch64 specific testcases after the move to default to GNU C23.
For the SME testcases, the GNU C23 cases as `()` changing to mean `(void)` instead
of a non-prototype declaration; the non-prototype declaration merging was confusing
some of the time so the updated way is the expected way even for that.
For pic-*.c `-Wno-old-style-definition` was added not to warn about old style definitions.
For pr113573.c, I added `-std=gnu17` since I was not sure if `(...)` with C23 would invoke
the same issue.

tested for aarch64-linux-gnu.

	PR testsuite/117680
gcc/testsuite/ChangeLog:

	* gcc.target/aarch64/pic-constantpool1.c: Add -Wno-old-style-definition.
	* gcc.target/aarch64/pic-symrefplus.c: Likewise.
	* gcc.target/aarch64/pr113573.c: Add `-std=gnu17`
	* gcc.target/aarch64/sme/streaming_mode_1.c: Correct testcase.
	* gcc.target/aarch64/sme/za_state_1.c: Likewise.
	* gcc.target/aarch64/sme/za_state_2.c: Likewise.

Signed-off-by: Andrew Pinski <quic_apinski@quicinc.com>
2024-11-20 10:59:48 -08:00
Andrew Pinski
e74f3eb189 rtl-reader: Disable reuse_rtx support for generator building
reuse_rtx is not documented nor the format to use it is ever documented.
So it should not be supported for the .md files.

This also fixes the problem if an invalid index is supplied for reuse_rtx,
instead of ICEing, put out a real error message.  Note since this code
still uses atoi, an invalid index can still be used in some cases but that is
recorded as part of PR 44574.

Note I did a grep of the sources to make sure that this was only used for
the read rtl in the GCC rather than while reading in .md files.

Bootstrapped and tested on x86_64-linux-gnu.

gcc/ChangeLog:

	* read-md.h (class rtx_reader): Don't include m_reuse_rtx_by_id
	when GENERATOR_FILE is defined.
	* read-rtl.cc (rtx_reader::read_rtx_code): Disable reuse_rtx
	support when GENERATOR_FILE is defined.

Signed-off-by: Andrew Pinski <quic_apinski@quicinc.com>
2024-11-20 10:07:38 -08:00
Edwin Lu
342eb518bd RISC-V: testsuite: restrict big endian test to non vector
RISC-V vector currently does not support big endian so the postcommit
was getting the sorry, not implemented error on vector targets. Restrict
the testcase to non-vector targets

gcc/testsuite/ChangeLog:

	* gcc.target/riscv/pr117595.c: Restrict to non vector targets.

Signed-off-by: Edwin Lu <ewlu@rivosinc.com>
2024-11-20 09:49:53 -08:00
Richard Biener
f5bd88b5e8 tree-optimization/117709 - bogus offset for gather load
When diverting to VMAT_GATHER_SCATTER we fail to zero *poffset
which was previously set if a load was classified as
VMAT_CONTIGUOUS_REVERSE.  The following refactors
get_group_load_store_type a bit to avoid this but this all needs
some serious TLC.

	PR tree-optimization/117709
	* tree-vect-stmts.cc (get_group_load_store_type): Only
	set *poffset when we end up with VMAT_CONTIGUOUS_DOWN
	or VMAT_CONTIGUOUS_REVERSE.
2024-11-20 18:38:14 +01:00
Richard Biener
2383ed144b tree-optimization/117698 - SLP vectorization and alignment
When SLP vectorizing we fail to mark the general alignment check
as irrelevant when using VMAT_STRIDED_SLP (the implementation checks
for itself) and when VMAT_INVARIANT the override isn't effective.

This results in extra FAILs on sparc which the following fixes.

	PR tree-optimization/117698
	* tree-vect-stmts.cc (get_group_load_store_type): Properly
	disregard alignment for VMAT_STRIDED_SLP and VMAT_INVARIANT.
	(vectorizable_load): Adjust guard for dumping whether we
	vectorize and unaligned access.
	(vectorizable_store): Likewise.
2024-11-20 18:38:14 +01:00
Antoni Boucher
16cf1c010d libgccjit: Allow comparing aligned int types
gcc/jit/ChangeLog:

	* jit-common.h: Add forward declaration of memento_of_get_aligned.
	* jit-recording.h (type::is_same_type_as): Compare integer
	types.
	(dyn_cast_aligned_type): New method.
	(type::is_aligned, memento_of_get_aligned::is_same_type_as,
	memento_of_get_aligned::is_aligned): new methods.

gcc/testsuite/ChangeLog:

	* jit.dg/test-types.c: Add checks comparing aligned types.
2024-11-20 11:01:35 -05:00
Antoni Boucher
ede14092bc libgccjit: Add option to allow special characters in function names
gcc/jit/ChangeLog:

	* docs/topics/contexts.rst: Add documentation for new option.
	* jit-recording.cc (recording::context::get_str_option): New
	method.
	* jit-recording.h (get_str_option): New method.
	* libgccjit.cc (gcc_jit_context_new_function): Allow special
	characters in function names.
	* libgccjit.h (enum gcc_jit_str_option): New option.

gcc/testsuite/ChangeLog:

	* jit.dg/test-special-chars.c: New test.
2024-11-20 10:46:45 -05:00
Antoni Boucher
452abe143e libgccjit: Add vector permutation and vector access operations
gcc/jit/ChangeLog:
	PR jit/112602
	* docs/topics/compatibility.rst (LIBGCCJIT_ABI_31): New ABI tag.
	* docs/topics/expressions.rst: Document
	gcc_jit_context_new_rvalue_vector_perm and
	gcc_jit_context_new_vector_access.
	* jit-playback.cc (playback::context::new_rvalue_vector_perm,
	common_mark_addressable_vec,
	gnu_vector_type_p,
	lvalue_p,
	convert_vector_to_array_for_subscript,
	new_vector_access): new functions.
	* jit-playback.h (new_rvalue_vector_perm, new_vector_access):
	New functions.
	* jit-recording.cc (recording::context::new_rvalue_vector_perm,
	recording::context::new_vector_access,
	memento_of_new_rvalue_vector_perm,
	recording::memento_of_new_rvalue_vector_perm::replay_into,
	recording::memento_of_new_rvalue_vector_perm::visit_children,
	recording::memento_of_new_rvalue_vector_perm::make_debug_string,
	recording::memento_of_new_rvalue_vector_perm::write_reproducer,
	recording::vector_access::replay_into,
	recording::vector_access::visit_children,
	recording::vector_access::make_debug_string,
	recording::vector_access::write_reproducer): New methods.
	* jit-recording.h (class memento_of_new_rvalue_vector_perm,
	class vector_access): New classes.
	* libgccjit.cc (gcc_jit_context_new_vector_access,
	gcc_jit_context_new_rvalue_vector_perm): New functions.
	* libgccjit.h (gcc_jit_context_new_rvalue_vector_perm,
	gcc_jit_context_new_vector_access): New functions.
	* libgccjit.map: New functions.

gcc/testsuite/ChangeLog:
	PR jit/112602
	* jit.dg/all-non-failing-tests.h: New test test-vector-perm.c.
	* jit.dg/test-vector-perm.c: New test.
2024-11-20 10:39:24 -05:00
Paul-Antoine Arras
377eff7c38 OpenMP: common C/C++ testcases for dispatch + adjust_args
gcc/testsuite/ChangeLog:

	* c-c++-common/gomp/declare-variant-2.c: Adjust dg-error directives.
	* c-c++-common/gomp/adjust-args-1.c: New test.
	* c-c++-common/gomp/adjust-args-2.c: New test.
	* c-c++-common/gomp/declare-variant-dup-match-clause.c: New test.
	* c-c++-common/gomp/dispatch-1.c: New test.
	* c-c++-common/gomp/dispatch-2.c: New test.
	* c-c++-common/gomp/dispatch-3.c: New test.
	* c-c++-common/gomp/dispatch-4.c: New test.
	* c-c++-common/gomp/dispatch-5.c: New test.
	* c-c++-common/gomp/dispatch-6.c: New test.
	* c-c++-common/gomp/dispatch-7.c: New test.
	* c-c++-common/gomp/dispatch-8.c: New test.
	* c-c++-common/gomp/dispatch-9.c: New test.
	* c-c++-common/gomp/dispatch-10.c: New test.

libgomp/ChangeLog:

	* testsuite/libgomp.c-c++-common/dispatch-1.c: New test.
	* testsuite/libgomp.c-c++-common/dispatch-2.c: New test.
2024-11-20 15:31:22 +01:00
Paul-Antoine Arras
ed49709acd OpenMP: C++ front-end support for dispatch + adjust_args
This patch adds C++ support for the `dispatch` construct and the `adjust_args`
clause. It relies on the c-family bits comprised in the corresponding C front
end patch for pragmas and attributes.

Additional C/C++ common testcases are provided in a subsequent patch in the
series.

gcc/cp/ChangeLog:

	* decl.cc (omp_declare_variant_finalize_one): Set adjust_args
	need_device_ptr attribute.
	* parser.cc (cp_parser_direct_declarator): Update call to
	cp_parser_late_return_type_opt.
	(cp_parser_late_return_type_opt): Add 'tree parms' parameter. Update
	call to cp_parser_late_parsing_omp_declare_simd.
	(cp_parser_omp_clause_name): Handle nocontext and novariants clauses.
	(cp_parser_omp_clause_novariants): New function.
	(cp_parser_omp_clause_nocontext): Likewise.
	(cp_parser_omp_all_clauses): Handle PRAGMA_OMP_CLAUSE_NOVARIANTS and
	PRAGMA_OMP_CLAUSE_NOCONTEXT.
	(cp_parser_omp_dispatch_body): New function, inspired from
	cp_parser_assignment_expression and cp_parser_postfix_expression.
	(OMP_DISPATCH_CLAUSE_MASK): Define.
	(cp_parser_omp_dispatch): New function.
	(cp_finish_omp_declare_variant): Add parameter. Handle adjust_args
	clause.
	(cp_parser_late_parsing_omp_declare_simd): Add parameter. Update calls
	to cp_finish_omp_declare_variant and cp_finish_omp_declare_variant.
	(cp_parser_omp_construct): Handle PRAGMA_OMP_DISPATCH.
	(cp_parser_pragma): Likewise.
	* semantics.cc (finish_omp_clauses): Handle OMP_CLAUSE_NOCONTEXT and
	OMP_CLAUSE_NOVARIANTS.
	* pt.cc (tsubst_omp_clauses): Handle OMP_CLAUSE_NOCONTEXT and
	OMP_CLAUSE_NOVARIANTS.
	(tsubst_stmt): Handle OMP_DISPATCH.
	(tsubst_expr): Handle IFN_GOMP_DISPATCH.

gcc/testsuite/ChangeLog:

	* g++.dg/gomp/adjust-args-1.C: New test.
	* g++.dg/gomp/adjust-args-2.C: New test.
	* g++.dg/gomp/adjust-args-3.C: New test.
	* g++.dg/gomp/dispatch-1.C: New test.
	* g++.dg/gomp/dispatch-2.C: New test.
	* g++.dg/gomp/dispatch-3.C: New test.
	* g++.dg/gomp/dispatch-4.C: New test.
	* g++.dg/gomp/dispatch-5.C: New test.
	* g++.dg/gomp/dispatch-6.C: New test.
	* g++.dg/gomp/dispatch-7.C: New test.
2024-11-20 15:31:22 +01:00
Paul-Antoine Arras
d7d8d9dae9 OpenMP: C front-end support for dispatch + adjust_args
This patch adds support to the C front-end to parse the `dispatch` construct and
the `adjust_args` clause. It also includes some common C/C++ bits for pragmas
and attributes.

Additional common C/C++ testcases are in a later patch in the series.

gcc/c-family/ChangeLog:

	* c-attribs.cc (c_common_gnu_attributes): Add attribute for adjust_args
	need_device_ptr.
	* c-omp.cc (c_omp_directives): Uncomment dispatch.
	* c-pragma.cc (omp_pragmas): Add dispatch.
	* c-pragma.h (enum pragma_kind): Add PRAGMA_OMP_DISPATCH.
	(enum pragma_omp_clause): Add PRAGMA_OMP_CLAUSE_NOCONTEXT and
	PRAGMA_OMP_CLAUSE_NOVARIANTS.

gcc/c/ChangeLog:

	* c-parser.cc (c_parser_omp_dispatch): New function.
	(c_parser_omp_clause_name): Handle nocontext and novariants clauses.
	(c_parser_omp_clause_novariants): New function.
	(c_parser_omp_clause_nocontext): Likewise.
	(c_parser_omp_all_clauses): Handle nocontext and novariants clauses.
	(c_parser_omp_dispatch_body): New function adapted from
	c_parser_expr_no_commas.
	(OMP_DISPATCH_CLAUSE_MASK): Define.
	(c_parser_omp_dispatch): New function.
	(c_finish_omp_declare_variant): Parse adjust_args.
	(c_parser_omp_construct): Handle PRAGMA_OMP_DISPATCH.
	* c-typeck.cc (c_finish_omp_clauses): Handle OMP_CLAUSE_NOVARIANTS and
	OMP_CLAUSE_NOCONTEXT.

gcc/testsuite/ChangeLog:

	* gcc.dg/gomp/adjust-args-1.c: New test.
	* gcc.dg/gomp/dispatch-1.c: New test.
	* gcc.dg/gomp/dispatch-2.c: New test.
	* gcc.dg/gomp/dispatch-3.c: New test.
	* gcc.dg/gomp/dispatch-4.c: New test.
	* gcc.dg/gomp/dispatch-5.c: New test.
2024-11-20 15:31:22 +01:00
Paul-Antoine Arras
084ea8ad58 OpenMP: middle-end support for dispatch + adjust_args
This patch adds middle-end support for the `dispatch` construct and the
`adjust_args` clause. The heavy lifting is done in `gimplify_omp_dispatch` and
`gimplify_call_expr` respectively. For `adjust_args`, this mostly consists in
emitting a call to `omp_get_mapped_ptr` for the adequate device.

For dispatch, the following steps are performed:

* Handle the device clause, if any: set the default-device ICV at the top of the
dispatch region and restore its previous value at the end.

* Handle novariants and nocontext clauses, if any. Evaluate compile-time
constants and select a variant, if possible. Otherwise, emit code to handle all
possible cases at run time.

gcc/ChangeLog:

	* builtins.cc (builtin_fnspec): Handle BUILT_IN_OMP_GET_MAPPED_PTR.
	* gimple-low.cc (lower_stmt): Handle GIMPLE_OMP_DISPATCH.
	* gimple-pretty-print.cc (dump_gimple_omp_dispatch): New function.
	(pp_gimple_stmt_1): Handle GIMPLE_OMP_DISPATCH.
	* gimple-walk.cc (walk_gimple_stmt): Likewise.
	* gimple.cc (gimple_build_omp_dispatch): New function.
	(gimple_copy): Handle GIMPLE_OMP_DISPATCH.
	* gimple.def (GIMPLE_OMP_DISPATCH): Define.
	* gimple.h (gimple_build_omp_dispatch): Declare.
	(gimple_has_substatements): Handle GIMPLE_OMP_DISPATCH.
	(gimple_omp_dispatch_clauses): New function.
	(gimple_omp_dispatch_clauses_ptr): Likewise.
	(gimple_omp_dispatch_set_clauses): Likewise.
	(gimple_return_set_retval): Handle GIMPLE_OMP_DISPATCH.
	* gimplify.cc (enum omp_region_type): Add ORT_DISPATCH.
	(struct gimplify_omp_ctx): Add in_call_args.
	(gimplify_call_expr): Handle need_device_ptr arguments.
	(is_gimple_stmt): Handle OMP_DISPATCH.
	(gimplify_scan_omp_clauses): Handle OMP_CLAUSE_DEVICE in a dispatch
	construct. Handle OMP_CLAUSE_NOVARIANTS and OMP_CLAUSE_NOCONTEXT.
	(omp_has_novariants): New function.
	(omp_has_nocontext): Likewise.
	(omp_construct_selector_matches): Handle OMP_DISPATCH with nocontext
	clause.
	(find_ifn_gomp_dispatch): New function.
	(gimplify_omp_dispatch): Likewise.
	(gimplify_expr): Handle OMP_DISPATCH.
	* gimplify.h (omp_has_novariants): Declare.
	* internal-fn.cc (expand_GOMP_DISPATCH): New function.
	* internal-fn.def (GOMP_DISPATCH): Define.
	* omp-builtins.def (BUILT_IN_OMP_GET_MAPPED_PTR): Define.
	(BUILT_IN_OMP_GET_DEFAULT_DEVICE): Define.
	(BUILT_IN_OMP_SET_DEFAULT_DEVICE): Define.
	* omp-general.cc (omp_construct_traits_to_codes): Add OMP_DISPATCH.
	(struct omp_ts_info): Add dispatch.
	(omp_resolve_declare_variant): Handle novariants. Adjust
	DECL_ASSEMBLER_NAME.
	* omp-low.cc (scan_omp_1_stmt): Handle GIMPLE_OMP_DISPATCH.
	(lower_omp_dispatch): New function.
	(lower_omp_1): Call it.
	* tree-inline.cc (remap_gimple_stmt): Handle GIMPLE_OMP_DISPATCH.
	(estimate_num_insns): Handle GIMPLE_OMP_DISPATCH.
2024-11-20 15:31:22 +01:00
Paul-Antoine Arras
0e15f1df4a OpenMP: dispatch + adjust_args tree data structures and front-end interfaces
This patch introduces the OMP_DISPATCH tree node, as well as two new clauses
`nocontext` and `novariants`. It defines/exposes interfaces that will be
used in subsequent patches that add front-end and middle-end support, but
nothing generates these nodes yet.

gcc/ChangeLog:

	* builtin-types.def (BT_FN_PTR_CONST_PTR_INT): New.
	* omp-selectors.h (enum omp_ts_code): Add OMP_TRAIT_CONSTRUCT_DISPATCH.
	* tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_NOVARIANTS and
	OMP_CLAUSE_NOCONTEXT.
	* tree-pretty-print.cc (dump_omp_clause): Handle OMP_CLAUSE_NOVARIANTS
	and OMP_CLAUSE_NOCONTEXT.
	(dump_generic_node): Handle OMP_DISPATCH.
	* tree.cc (omp_clause_num_ops): Add OMP_CLAUSE_NOVARIANTS and
	OMP_CLAUSE_NOCONTEXT.
	(omp_clause_code_name): Add "novariants" and "nocontext".
	* tree.def (OMP_DISPATCH): New.
	* tree.h (OMP_DISPATCH_BODY): New macro.
	(OMP_DISPATCH_CLAUSES): New macro.
	(OMP_CLAUSE_NOVARIANTS_EXPR): New macro.
	(OMP_CLAUSE_NOCONTEXT_EXPR): New macro.

gcc/fortran/ChangeLog:

	* types.def (BT_FN_PTR_CONST_PTR_INT): Declare.
2024-11-20 15:31:22 +01:00
Antoni Boucher
fa720364e4 libgccjit: Fix float playback for cross-compilation
gcc/jit/ChangeLog:
	PR jit/113343
	* jit-playback.cc (new_rvalue_from_const): Fix to have the
	correct value when cross-compiling.
2024-11-20 09:08:21 -05:00
Eric Botcazou
30c2678cc6 Enable symbolic backtraces on more Linux and BSD ports (continued)
gcc/ada
	PR ada/117538
	PR ada/117708
	* Makefile.rtl (GNU Hurd): Add $(TRASYM_DWARF_UNIX_PAIRS).
	(x86-64 kfreebsd): Likewise.
	(aarch64 FreeBSD): Likewise.
	(x86-64 DragonFly): Likewise.
	(S390 Linux): Likewise and add Linux version of s-tsmona.adb.
	(Mips Linux): Likewise.
	(SPARC Linux): Likewise.
	(HP/PA Linux): Linux.
	(M68K Linux): Likewise.
	(SH4 Linux): Likewise.
	(Alpha Linux): Likewise.
	(RISC-V Linux): Likewise.
2024-11-20 15:07:46 +01:00
Antoni Boucher
efdc80a8bf libgccjit: Add type checks in gcc_jit_block_add_assignment_op
gcc/jit/ChangeLog:

	* libgccjit.cc (RETURN_IF_FAIL_PRINTF3): New macro.
	(gcc_jit_block_add_assignment_op): Add numeric checks.

gcc/testsuite/ChangeLog:

	* jit.dg/test-error-bad-assignment-op.c: New test.
2024-11-20 09:00:23 -05:00
Antoni Boucher
04b67ac1e8 libgccjit: Support signed char flag
gcc/jit/ChangeLog:

	* dummy-frontend.cc (jit_langhook_init): Send flag_signed_char
	argument to build_common_tree_nodes.

gcc/testsuite/ChangeLog:

	* jit.dg/all-non-failing-tests.h: Add test-signed-char.c.
	* jit.dg/test-signed-char.c: New test.
2024-11-20 08:49:09 -05:00
Richard Sandiford
70ec3d20bf aarch64: Add support for SME2p1
This patch adds support for FEAT_SME2p1.  There are two sets of
new instructions: MOVAZ to read from ZA and zero the source data,
and new forms of ZERO.  All of them require streaming mode.

MOVAZ can't reuse the existing UNSPEC_SME_READ* patterns because
of the write to ZA.  I did wonder about trying to use a define_subst,
but it seemed a bit too awkward.

gcc/
	* config/aarch64/aarch64-option-extensions.def (sme2p1): New extension.
	* doc/invoke.texi: Document it.
	* config/aarch64/aarch64.h (TARGET_STREAMING_SME2p1): New macro.
	* config/aarch64/aarch64-c.cc (aarch64_update_cpp_builtins):
	Conditionally define __ARM_FEATURE_SME2p1.
	* config/aarch64/iterators.md (UNSPEC_SME_READZ, UNSPEC_SME_READZ_HOR)
	(UNSPEC_SME_READZ_VER): New unspecs.
	(optab, hv): Handle them.
	(SME_READZ_HV): New int iterator.
	* config/aarch64/aarch64-sme.md
	(UNSPEC_SME_ZERO_SLICES): New unspec.
	(@aarch64_sme_<SME_READZ_HV:optab><v_int_container><mode>)
	(*aarch64_sme_<SME_READZ_HV:optab><v_int_container><mode>_plus)
	(@aarch64_sme_<SME_READZ_HV:optab><VNx1TI_ONLY:mode><SVE_FULL:mode>)
	(@aarch64_sme_<SME_READZ_HV:optab><SVE_FULLx24:mode><mode>)
	(*aarch64_sme_<SME_READZ_HV:optab><SVE_FULLx24:mode><mode>_plus)
	(@aarch64_sme_readz<mode>, *aarch64_sme_readz<mode>_plus)
	(@aarch64_sme_zero_za_slices<mode>): New patterns.
	(*aarch64_sme_zero_za_slices<mode>_plus): Likewise.
	* config/aarch64/aarch64-sve-builtins-shapes.h
	(inherent_za_slice): Declare.
	* config/aarch64/aarch64-sve-builtins-shapes.cc
	(inherent_za_slice_def, inherent_za_slice): New shape.
	* config/aarch64/aarch64-sve-builtins-sme.h (svreadz_za)
	(svreadz_hor_za, svreadz_ver_za): Declare.
	* config/aarch64/aarch64-sve-builtins-sme.cc
	(svread_za_slice_base): New class, split out from...
	(svread_za_impl): ...here.
	(svreadz_za_impl, svreadz_za_tile_impl): New type aliases.
	(zero_slices_mode): New function.
	(svzero_za_impl::expand): Handle the slice forms.
	(svreadz_za, svreadz_hor_za, svreadz_ver_za): New functions.
	* config/aarch64/aarch64-sve-builtins-sme.def: Add the SME2p1
	instructions.

gcc/testsuite/
	* lib/target-supports.exp: Test the assembler for sve-b16b16 support.
	* gcc.target/aarch64/pragma_cpp_predefs_4.c: Add tests for
	__ARM_FEATURE_SME2p1.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za128.c: New test.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za16.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za16_vg2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za16_vg4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za32.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za32_vg2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za32_vg4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za64.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za64_vg2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za64_vg4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za8.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za8_vg2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_hor_za8_vg4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za16.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za16_vg2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za16_vg4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za32.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za32_vg2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za32_vg4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za64.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za64_vg2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za64_vg4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za8.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za8_vg2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_ver_za8_vg4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_za16_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_za16_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_za32_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_za32_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_za64_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_za64_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_za8_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/readz_za8_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/zero_za64_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/zero_za64_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/zero_za64_vg2x1.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/zero_za64_vg2x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/zero_za64_vg2x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/zero_za64_vg4x1.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/zero_za64_vg4x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/zero_za64_vg4x4.c: Likewise.
2024-11-20 13:27:42 +00:00
Richard Sandiford
cdacb32bd5 aarch64: Add support for SME_B16B16
This patch adds support for the SME_B16B16 extension.  It follows
similar lines to the SME_F16F16 extension added earlier.

gcc/
	* config/aarch64/aarch64-option-extensions.def
	(sme-b16b16): New extension.
	* doc/invoke.texi: Document it.
	* config/aarch64/aarch64.h (TARGET_STREAMING_SME_B16B16): New macro.
	* config/aarch64/aarch64-c.cc (aarch64_update_cpp_builtins):
	Conditionally define __ARM_FEATURE_SME_B16B16.
	* config/aarch64/aarch64-sve-builtins-sme.def: Add SME_B16B16 forms
	of existing intrinsics.
	* config/aarch64/aarch64-sme.md
	(@aarch64_sme_<SME_BINARY_SLICE_HSDF:optab><mode>)
	(*aarch64_sme_<SME_BINARY_SLICE_HSDF:optab><mode>_plus)
	(@aarch64_sme_<SME_FP_TERNARY_SLICE:optab><mode><mode>)
	(*aarch64_sme_<SME_FP_TERNARY_SLICE:optab><mode><mode>_plus)
	(@aarch64_sme_single_<SME_FP_TERNARY_SLICE:optab><mode><mode>)
	(*aarch64_sme_single_<SME_FP_TERNARY_SLICE:optab><mode><mode>_plus)
	(@aarch64_sme_lane_<SME_FP_TERNARY_SLICE:optab><mode><mode>)
	(*aarch64_sme_lane_<SME_FP_TERNARY_SLICE:optab><mode><mode>)
	(@aarch64_sme_<SME_FP_MOP:optab><mode><mode>): Extend to BF16 modes.
	* config/aarch64/aarch64-sve-builtins.cc (TYPES_za_h_bfloat): New
	type macro.
	* config/aarch64/iterators.md (SME_ZA_HSDFx24): Add BF16 modes.
	(SME_MOP_HSDF): Likewise.

gcc/testsuite/
	* lib/target-supports.exp: Test the assembler for sve-b16b16 support.
	* gcc.target/aarch64/pragma_cpp_predefs_4.c: Add tests for
	__ARM_FEATURE_SME_B16B16.
	* gcc.target/aarch64/sme2/acle-asm/add_za16_bf16_vg1x2.c: New test.
	* gcc.target/aarch64/sme2/acle-asm/add_za16_bf16_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mla_lane_za16_bf16_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mla_lane_za16_bf16_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mla_za16_bf16_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mla_za16_bf16_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mls_lane_za16_bf16_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mls_lane_za16_bf16_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mls_za16_bf16_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mls_za16_bf16_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mopa_za16_bf16.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mops_za16_bf16.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/sub_za16_bf16_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/sub_za16_bf16_vg1x4.c: Likewise.
2024-11-20 13:27:41 +00:00
Richard Sandiford
e6751e1deb aarch64: Add support for SME_F16F16
This patch adds support for the SME_F16F16 extension.  The extension
adds two new instructions to convert from a single vector of f16s
to two vectors of f32s.  It also adds f16 variants of existing SME
ZA instructions.

gcc/
	* config/aarch64/aarch64-option-extensions.def
	(sme-f16f16): New extension.
	* doc/invoke.texi: Document it.  Also document that sme-i16i64 and
	sme-f64f64 enable SME.
	* config/aarch64/aarch64.h (TARGET_STREAMING_SME_F16F16): New macro.
	* config/aarch64/aarch64-c.cc (aarch64_update_cpp_builtins):
	Conditionally define __ARM_FEATURE_SME_F16F16.
	* config/aarch64/aarch64-sve-builtins-sve2.def (svcvt, svcvtl): Add
	new SME_F16F16 intrinsics.
	* config/aarch64/aarch64-sve-builtins-sme.def: Add SME_F16F16 forms
	of existing intrinsics.
	* config/aarch64/aarch64-sve-builtins.cc (TYPES_h_float)
	(TYPES_cvt_f32_f16, TYPES_za_h_float): New type macros.
	* config/aarch64/aarch64-sve-builtins-base.cc
	(svcvt_impl::expand): Add sext_optab as another possibility.
	* config/aarch64/aarch64-sve-builtins-sve2.h (svcvtl): Declare.
	* config/aarch64/aarch64-sve-builtins-sve2.cc (svcvtl_impl): New class.
	(svcvtl): New function.
	* config/aarch64/iterators.md (VNx8SF_ONLY): New mode iterator.
	(SME_ZA_SDFx24): Replace with...
	(SME_ZA_HSDFx24): ...this.
	(SME_MOP_SDF): Replace with...
	(SME_MOP_HSDF): ...this.
	(SME_BINARY_SLICE_SDF): Replace with...
	(SME_BINARY_SLICE_HSDF): ...this.
	* config/aarch64/aarch64-sve2.md (extendvnx8hfvnx8sf2)
	(@aarch64_sve_cvtl<mode>): New patterns.
	* config/aarch64/aarch64-sme.md
	(@aarch64_sme_<SME_BINARY_SLICE_SDF:optab><mode>): Extend to...
	(@aarch64_sme_<SME_BINARY_SLICE_HSDF:optab><mode>): ...this.
	(*aarch64_sme_<SME_BINARY_SLICE_SDF:optab><mode>_plus): Extend to...
	(*aarch64_sme_<SME_BINARY_SLICE_HSDF:optab><mode>_plus): ...this.
	(@aarch64_sme_<SME_FP_TERNARY_SLICE:optab><mode><mode>): Extend to
	HF modes.
	(*aarch64_sme_<SME_FP_TERNARY_SLICE:optab><mode><mode>_plus)
	(@aarch64_sme_single_<SME_FP_TERNARY_SLICE:optab><mode><mode>)
	(*aarch64_sme_single_<SME_FP_TERNARY_SLICE:optab><mode><mode>_plus)
	(@aarch64_sme_lane_<SME_FP_TERNARY_SLICE:optab><mode><mode>)
	(*aarch64_sme_lane_<SME_FP_TERNARY_SLICE:optab><mode><mode>)
	(@aarch64_sme_<SME_FP_MOP:optab><mode><mode>): Likewise.

gcc/testsuite/
	* lib/target-supports.exp: Test the assembler for sve-f16f16 support.
	* gcc.target/aarch64/pragma_cpp_predefs_4.c: Add tests for
	__ARM_FEATURE_SME_F16F16.  Also extend the existing SME tests.
	* gcc.target/aarch64/sve/acle/asm/test_sve_acle.h
	(TEST_X2_WIDE): New macro
	* gcc.target/aarch64/sme2/acle-asm/add_za16_f16_vg1x2.c: New test.
	* gcc.target/aarch64/sme2/acle-asm/add_za16_f16_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/cvt_f32_f16_x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/cvtl_f32_f16_x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mla_lane_za16_f16_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mla_lane_za16_f16_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mla_za16_f16_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mla_za16_f16_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mls_lane_za16_f16_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mls_lane_za16_f16_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mls_za16_f16_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mls_za16_f16_vg1x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mopa_za16_f16.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/mops_za16_f16.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/sub_za16_f16_vg1x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/sub_za16_f16_vg1x4.c: Likewise.
2024-11-20 13:27:41 +00:00
Richard Sandiford
648fc673e6 aarch64: Add support for SVE_B16B16
This patch adds support for the SVE_B16B16 extension, which provides
non-widening BF16 versions of existing instructions.

Mostly it's just a simple extension of iterators.  The main
complications are:

(1) The new instructions have no immediate forms.  This is easy to
    handle for the cond_* patterns (the ones that have an explicit
    else value) since those are already divided into register and
    non-register versions.  All we need to do is tighten the predicates.

    However, the @aarch64_pred_<optab><mode> patterns handle the
    immediates directly.  Rather than complicate them further,
    it seemed best to add a single @aarch64_pred_<optab><mode> for
    all BF16 arithmetic.

(2) There is no BFSUBR, so the usual method of handling reversed
    operands breaks down.  The patch deals with this using some
    new attributes that together disable the "BFSUBR" alternative.

(3) Similarly, there are no BFMAD or BFMSB instructions, so we need
    to disable those forms in the BFMLA and BFMLS patterns.

The patch includes support for generic bf16 vectors too.

It would be possible to use these instructions for scalars, as with
the recent FLOGB patch, but that's left as future work.

gcc/
	* config/aarch64/aarch64-option-extensions.def
	(sve-b16b16): New extension.
	* doc/invoke.texi: Document it.
	* config/aarch64/aarch64.h (TARGET_SME_B16B16, TARGET_SVE2_OR_SME2)
	(TARGET_SSVE_B16B16): New macros.
	* config/aarch64/aarch64-c.cc (aarch64_update_cpp_builtins):
	Conditionally define __ARM_FEATURE_SVE_B16B16
	* config/aarch64/aarch64-sve-builtins-sve2.def: Add AARCH64_FL_SVE2
	to the SVE2p1 requirements.  Add SVE_B16B16 forms of existing
	intrinsics.
	* config/aarch64/aarch64-sve-builtins.cc (type_suffixes): Treat
	bfloat as a floating-point type.
	(TYPES_h_bfloat): New macro.
	* config/aarch64/aarch64.md (is_bf16, is_rev, supports_bf16_rev)
	(mode_enabled): New attributes.
	(enabled): Test mode_enabled.
	* config/aarch64/iterators.md (SVE_FULL_F_BF): New mode iterator.
	(SVE_CLAMP_F): Likewise.
	(SVE_Fx24): Add BF16 modes when TARGET_SSVE_B16B16.
	(sve_lane_con): Handle BF16 modes.
	(b): Handle SF and DF modes.
	(is_bf16): New mode attribute.
	(supports_bf16, supports_bf16_rev): New int attributes.
	* config/aarch64/predicates.md
	(aarch64_sve_float_maxmin_immediate): Reject BF16 modes.
	* config/aarch64/aarch64-sve.md
	(*post_ra_<sve_fp_op><mode>3): Add BF16 support, and likewise
	for the associated define_split.
	(<optab:SVE_COND_FP_BINARY_OPTAB><mode>): Add BF16 support.
	(@cond_<optab:SVE_COND_FP_BINARY><mode>): Likewise.
	(*cond_<optab:SVE_COND_FP_BINARY><mode>_2_relaxed): Likewise.
	(*cond_<optab:SVE_COND_FP_BINARY><mode>_2_strict): Likewise.
	(*cond_<optab:SVE_COND_FP_BINARY><mode>_3_relaxed): Likewise.
	(*cond_<optab:SVE_COND_FP_BINARY><mode>_3_strict): Likewise.
	(*cond_<optab:SVE_COND_FP_BINARY><mode>_any_relaxed): Likewise.
	(*cond_<optab:SVE_COND_FP_BINARY><mode>_any_strict): Likewise.
	(@aarch64_mul_lane_<mode>): Likewise.
	(<optab:SVE_COND_FP_TERNARY><mode>): Likewise.
	(@aarch64_pred_<optab:SVE_COND_FP_TERNARY><mode>): Likewise.
	(@cond_<optab:SVE_COND_FP_TERNARY><mode>): Likewise.
	(*cond_<optab:SVE_COND_FP_TERNARY><mode>_4_relaxed): Likewise.
	(*cond_<optab:SVE_COND_FP_TERNARY><mode>_4_strict): Likewise.
	(*cond_<optab:SVE_COND_FP_TERNARY><mode>_any_relaxed): Likewise.
	(*cond_<optab:SVE_COND_FP_TERNARY><mode>_any_strict): Likewise.
	(@aarch64_<optab:SVE_FP_TERNARY_LANE>_lane_<mode>): Likewise.
	* config/aarch64/aarch64-sve2.md
	(@aarch64_pred_<optab:SVE_COND_FP_BINARY><mode>): Define BF16 version.
	(@aarch64_sve_fclamp<mode>): Add BF16 support.
	(*aarch64_sve_fclamp<mode>_x): Likewise.
	(*aarch64_sve_<maxmin_uns_op><SVE_Fx24:mode>): Likewise.
	(*aarch64_sve_single_<maxmin_uns_op><SVE_Fx24:mode>): Likewise.
	* config/aarch64/aarch64.cc (aarch64_sve_float_arith_immediate_p)
	(aarch64_sve_float_mul_immediate_p): Return false for BF16 modes.

gcc/testsuite/
	* lib/target-supports.exp: Test the assembler for sve-b16b16 support.
	* gcc.target/aarch64/pragma_cpp_predefs_4.c: Test the new B16B16
	macros.
	* gcc.target/aarch64/sve/fmad_1.c: Test bfloat16 too.
	* gcc.target/aarch64/sve/fmla_1.c: Likewise.
	* gcc.target/aarch64/sve/fmls_1.c: Likewise.
	* gcc.target/aarch64/sve/fmsb_1.c: Likewise.
	* gcc.target/aarch64/sve/cond_mla_9.c: New test.
	* gcc.target/aarch64/sme2/acle-asm/clamp_bf16_x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/clamp_bf16_x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/max_bf16_x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/max_bf16_x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/maxnm_bf16_x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/maxnm_bf16_x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/min_bf16_x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/min_bf16_x4.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/minnm_bf16_x2.c: Likewise.
	* gcc.target/aarch64/sme2/acle-asm/minnm_bf16_x4.c: Likewise.
	* gcc.target/aarch64/sve/bf16_arith_1.c: Likewise.
	* gcc.target/aarch64/sve/bf16_arith_1.h: Likewise.
	* gcc.target/aarch64/sve/bf16_arith_2.c: Likewise.
	* gcc.target/aarch64/sve/bf16_arith_3.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/add_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/clamp_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/max_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/maxnm_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/min_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/minnm_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/mla_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/mla_lane_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/mls_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/mls_lane_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/mul_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/mul_lane_bf16.c: Likewise.
	* gcc.target/aarch64/sve2/acle/asm/sub_bf16.c: Likewise.
2024-11-20 13:27:40 +00:00
Richard Sandiford
164fbe01d5 aarch64: Fix the choice of unspec in two SME patterns
@aarch64_sme_write<mode> and *aarch64_sme_write<mode>_plus
were using UNSPEC_SME_READ instead of UNSPEC_SME_WRITE.

gcc/
	* config/aarch64/aarch64-sme.md (@aarch64_sme_write<mode>)
	(*aarch64_sme_write<mode>_plus): Use UNSPEC_SME_WRITE instead
	of UNSPEC_SME_READ.
2024-11-20 13:27:40 +00:00
Richard Sandiford
3e582f8c79 aarch64: Rename some SME iterators
This patch just renames the iterators SME_READ and SME_WRITE to
SME_READ_HV and SME_WRITE_HV, to distinguish them from other forms
of ZA read and write.

gcc/
	* config/aarch64/iterators.md (SME_READ): Rename to...
	(SME_READ_HV): ...this.
	(SME_WRITE): Rename to...
	(SME_WRITE_HV): ...this.
	* config/aarch64/aarch64-sme.md: Update accordingly.
2024-11-20 13:27:39 +00:00
Richard Sandiford
3381213780 aarch64: Refactor SVE predicated-to-unpredicated splits
There are separate patterns for predicated FADD, FSUB, and FMUL.
Previously they each had their own in-built split to convert the
instruction to unpredicated form where appropriate.  However, it's
more convenient for later patches if we use a single separate split
instead.

gcc/
	* config/aarch64/iterators.md (SVE_COND_FP): New code attribute.
	* config/aarch64/aarch64-sve.md: Use a single define_split to
	handle the conversion of predicated FADD, FSUB, and FMUL into
	unpredicated forms.
2024-11-20 13:27:39 +00:00
Richard Sandiford
12c60ff1ff aarch64: Rework sme_2mode_function insns
Many of the SME ZA intrinsics have two type suffixes: one for ZA
and one for the vectors.  The ZA suffix only conveys an element
size, while the vector suffix conveys both an element type and
an element size.  Internally, the ZA suffix maps to an integer mode;
e.g. za32 maps to VNx4SI.

For SME2, it was relatively convenient to use the modes associated
with both suffixes directly.  For example, the (non-widening) FMLA
intrinsics used SME_ZA_SDF_I to iterate over the possible ZA modes,
used SME_ZA_SDFx24 to iterate over the possible vector tuple modes,
and used a C++ condition to make sure that the element sizes agree.

However, for later patches it's more convenient to rely only on
the vector mode in cases where the ZA and vector element sizes
are the same.  This means splitting the widening MOPA/S patterns
from the non-widening ones, but otherwise it's not a big change.

gcc/
	* config/aarch64/iterators.md (SME_ZA_SDF_I): Delete.
	(SME_MOP_HSDF): Replace with...
	(SME_MOP_SDF): ...this.
	* config/aarch64/aarch64-sme.md: Change the non-widening FMLA and
	FMLS patterns so that both mode parameters are the same, rather than
	using both SME_ZA_SDF_I and SME_ZA_SDFx24 and checking that their
	element sizes are the same.  Split the FMOPA and FMOPS patterns
	into separate non-widening and widening forms, then update the
	non-widening forms in a similar way to FMLA and FMLS.
	* config/aarch64/aarch64-sve-builtins-functions.h
	(sme_2mode_function_t::expand): If the two type suffixes have the same
	element size, use the vector tuple mode for both mode parameters.
2024-11-20 13:27:38 +00:00
Mikael Morin
237380cdec fortran: Evaluate once BACK argument of MINLOC/MAXLOC with DIM [PR90608]
Evaluate the BACK argument of MINLOC/MAXLOC once before the
scalarization loops in the case where the DIM argument is present.

This is a follow-up to r15-1994-ga55d24b3cf7f4d07492bb8e6fcee557175b47ea3
which added knowledge of BACK to the scalarizer, to
r15-2701-ga10436a8404ad2f0cc5aa4d6a0cc850abe5ef49e which removed it to
handle it out of scalarization instead, and to more immediate previous
patches that added inlining support for MINLOC/MAXLOC with DIM.  The
inlining support for MINLOC/MAXLOC with DIM introduced nested loops, which
made the evaluation of BACK (removed from the scalarizer knowledge by the
forementionned commit) wrapped in a loop, so possibly executed more than
once.  This change adds BACK to the scalarization chain if MINLOC/MAXLOC
will use nested loops, so that it is evaluated by the scalarizer only once
before the outermost loop in that case.

	PR fortran/90608

gcc/fortran/ChangeLog:

	* trans-intrinsic.cc
	(walk_inline_intrinsic_minmaxloc): Add a scalar element for BACK as
	first item of the chain if BACK is present and there will be nested
	loops.
	(gfc_conv_intrinsic_minmaxloc): Evaluate BACK using an inherited
	scalarization chain if there is a nested loop.

gcc/testsuite/ChangeLog:

	* gfortran.dg/maxloc_8.f90: New test.
	* gfortran.dg/minloc_9.f90: New test.
2024-11-20 13:59:51 +01:00
Uros Bizjak
086ee8d086 i386: Remove workaround for Solaris ld 64-bit TLS IE limitation
As detailed in PR target/43309, the Solaris linker initially took the
64-bit x86 TLS IE code sequence literally, assuming that the spec only
allowed %rax as target register.

A workaround has been in place for more than a decade, but is no longer
necessary.  The bug had already been fixed for Solaris 11.1, while trunk
requires Solaris 11.4.

Uros pointed this out and suggested the attached patch.

Bootstrapped without regressions on i386-pc-solaris2.11.

2024-10-15  Uros Bizjak  <ubizjak@gmail.com>

	gcc:
	* config/i386/i386.cc (legitimize_tls_address)
	<TLS_MODEL_INITIAL_EXEC>: Remove 64-bit Solaris ld workaround.
	* config/i386/i386.md (UNSPEC_TLS_IE_SUN): Remove.
	(tls_initial_exec_64_sun): Remove.
2024-11-20 12:57:25 +01:00
Rainer Orth
bf57f9b599 testsuite: i386: Fix gcc.target/i386/pr117232-1.c etc. with Solaris as
Two tests FAIL on Solaris/x86 with the native assembler:

FAIL: gcc.target/i386/pr117232-1.c scan-assembler-times (?n)cmovn?c 7
FAIL: gcc.target/i386/pr117232-apx-1.c scan-assembler-times (?n)cmovn?c 7

The problem is that as expects a slightly different insn syntax, e.g.

	cmovl.nc	%esi, %eax

instead of

	cmovnc	%esi, %eax

This patch allows for both forms.

Tested on i386-pc-solaris2.11 (as and gas) and x86_64-pc-linux-gnu.

2024-11-15  Rainer Orth  <ro@CeBiTec.Uni-Bielefeld.DE>

	gcc/testsuite:
	* gcc.target/i386/pr117232-1.c (scan-assembler-times): Allow for
	cmovl.nc etc.
	* gcc.target/i386/pr117232-apx-1.c: Likewise.
2024-11-20 12:54:22 +01:00
Pan Li
fa18217f60 RISC-V: Refine the rtl dump expand check for vector SAT_ADD
This patch would like to remove the unnecessary option for the
vector SAT_ADD testcases at first.  And the different optimization
option like O2 and O3 will be passed to the test files for rtl
expand dump check.  If there are different dump check times for
different optimization options, the target no-opts and/or any-opts
will be leveraged for the dg-final check.

The below test suites are passed for this patch.
* The rv64gcv fully regression test.

It is test only patch and obvious up to a point, will commit it
directly if no comments in next 48H.

gcc/testsuite/ChangeLog:

	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-1-s16.c: Remove
	the unnecessary option and refine the rtl IFN dump check.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-1-s32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-1-s64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-1-s8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-2-s16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-2-s32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-2-s64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-2-s8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-3-s16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-3-s32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-3-s64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-3-s8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-4-s16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-4-s32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-4-s64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-4-s8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-1-s16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-1-s32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-1-s64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-1-s8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-2-s16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-2-s32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-2-s64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-2-s8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-3-s16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-3-s32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-3-s64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-3-s8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-4-s16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-4-s32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-4-s64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-4-s8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-1-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-1-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-1-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-1-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-2-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-2-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-2-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-2-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-3-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-3-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-3-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-3-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-4-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-4-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-4-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-4-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-5-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-5-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-5-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-5-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-6-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-6-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-6-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-6-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-7-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-7-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-7-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-7-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-8-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-8-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-8-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-8-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-1-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-1-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-1-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-1-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-2-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-2-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-2-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-2-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-3-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-3-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-3-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-3-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-4-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-4-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-4-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-4-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-5-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-5-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-5-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-5-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-6-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-6-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-6-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-6-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-7-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-7-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-7-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-7-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-8-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-8-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-8-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-8-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-1-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-1-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-1-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-1-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-2-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-2-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-2-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-2-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-3-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-3-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-3-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-3-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-4-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-4-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-4-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-4-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-1-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-1-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-1-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-1-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-2-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-2-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-2-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-2-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-3-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-3-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-3-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-3-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-4-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-4-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-4-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-4-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-1-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-1-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-1-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-1-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-2-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-2-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-2-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-2-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-3-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-3-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-3-u64.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-3-u8.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-4-u16.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-4-u32.c: Ditto.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-4-u8.c: Ditto.

Signed-off-by: Pan Li <pan2.li@intel.com>
2024-11-20 19:27:16 +08:00
Pan Li
b95d585fd4 RISC-V: Introduce riscv/rvv/autovec/sat folder to rvv.exp testsuite
After we move vector SAT_ADD testcases into a isolated folder, aka
riscv/rvv/autovec/sat.  We would like to add the folder as one of
the test items of the rvv.exp testsuite.

The below test suites are passed for this patch.
* The rv64gcv fully regression test.

It is test only patch and obvious up to a point, will commit it
directly if no comments in next 48H.

gcc/testsuite/ChangeLog:

	* gcc.target/riscv/rvv/rvv.exp: Add the vector sat folder to
	the rvv.exp testsuite.

Signed-off-by: Pan Li <pan2.li@intel.com>
2024-11-20 19:27:09 +08:00
Pan Li
42f1a08e41 RISC-V: Rearrange the test files for vector SAT_ADD [NFC]
The test files of scalar SAT_TRUNC only has numbers as the suffix.
Rearrange the file name to -{form number}-{target-type}.  For example,
test form 3 for uint32_t SAT_TRUNC will have -3-u32.c for asm check and
-run-3-u32.c for the run test.

Meanwhile, all related test files moved to riscv/rvv/autovec/sat/.

The below test suites are passed for this patch.
* The rv64gcv fully regression test.

It is test only patch and obvious up to a point, will commit it
directly if no comments in next 48H.

gcc/testsuite/ChangeLog:

	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-2.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-1-s16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-3.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-1-s32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-4.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-1-s64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-1.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-1-s8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-6.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-2-s16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-7.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-2-s32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-8.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-2-s64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-5.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-2-s8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-10.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-3-s16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-11.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-3-s32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-12.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-3-s64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-9.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-3-s8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-14.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-4-s16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-15.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-4-s32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-16.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-4-s64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-13.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-4-s8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-2.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-1-s16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-3.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-1-s32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-4.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-1-s64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-1.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-1-s8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-6.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-2-s16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-7.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-2-s32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-8.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-2-s64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-5.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-2-s8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-10.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-3-s16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-11.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-3-s32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-12.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-3-s64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-9.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-3-s8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-14.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-4-s16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-15.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-4-s32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-16.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-4-s64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_s_add-run-13.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_s_add-run-4-s8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-2.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-1-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-3.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-1-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-4.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-1-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-1.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-1-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-6.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-2-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-7.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-2-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-8.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-2-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-5.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-2-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-10.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-3-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-11.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-3-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-12.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-3-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-9.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-3-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-14.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-4-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-15.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-4-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-16.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-4-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-13.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-4-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-18.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-5-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-19.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-5-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-20.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-5-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-17.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-5-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-22.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-6-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-23.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-6-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-24.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-6-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-21.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-6-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-26.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-7-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-27.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-7-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-28.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-7-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-25.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-7-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-30.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-8-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-31.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-8-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-32.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-8-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-29.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-8-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-2.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-1-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-3.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-1-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-4.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-1-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-1.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-1-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-6.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-2-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-7.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-2-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-8.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-2-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-5.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-2-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-10.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-3-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-11.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-3-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-12.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-3-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-9.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-3-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-14.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-4-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-15.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-4-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-16.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-4-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-13.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-4-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-18.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-5-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-19.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-5-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-20.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-5-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-17.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-5-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-22.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-6-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-23.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-6-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-24.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-6-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-21.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-6-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-26.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-7-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-27.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-7-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-28.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-7-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-25.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-7-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-30.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-8-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-31.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-8-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-32.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-8-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add-run-29.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add-run-8-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-2.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-1-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-3.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-1-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-4.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-1-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-1.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-1-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-6.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-2-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-7.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-2-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-8.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-2-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-5.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-2-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-10.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-3-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-11.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-3-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-12.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-3-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-9.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-3-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-14.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-4-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-15.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-4-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-16.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-4-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-13.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-4-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-2.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-1-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-3.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-1-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-4.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-1-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-1.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-1-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-6.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-2-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-7.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-2-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-8.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-2-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-5.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-2-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-10.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-3-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-11.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-3-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-12.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-3-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-9.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-3-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-14.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-4-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-15.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-4-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-16.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-4-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm-run-13.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm-run-4-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-2.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-1-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-3.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-1-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-4.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-1-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-1.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-1-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-6.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-2-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-7.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-2-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-8.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-2-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-5.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-2-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-10.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-3-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-11.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-3-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-12.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-3-u64.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-9.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-3-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-14.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-4-u16.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-15.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-4-u32.c: ...here.
	* gcc.target/riscv/rvv/autovec/binop/vec_sat_u_add_imm_reconcile-13.c: Move to...
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_u_add_imm_reconcile-4-u8.c: ...here.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_arith.h: New test.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_binary_vvv_run.h: New test.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_binary_vvx_run.h: New test.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_binary_vx_run.h: New test.
	* gcc.target/riscv/rvv/autovec/sat/vec_sat_data.h: New test.

Signed-off-by: Pan Li <pan2.li@intel.com>
2024-11-20 19:27:02 +08:00
Richard Biener
ff5a14abeb tree-optimization/117574 - bougs niter lt-to-ne
When trying to change a IV from IV0 < IV1 to IV0' != IV1' we apply
fancy adjustments to the may_be_zero condition we compute rather
than using the obvious IV0->base >= IV1->base expression (to be
able to use > instead of >=?).  This doesn't seem to go well.

	PR tree-optimization/117574
	* tree-ssa-loop-niter.cc (number_of_iterations_lt_to_ne):
	Use the obvious may_be_zero condition.

	* gcc.dg/torture/pr117574-1.c: New testcase.
2024-11-20 11:24:26 +01:00
Richard Sandiford
ed6d0867fd Extend expand_absneg_bit to vector modes
Expand can implement NEG and ABS of scalar floating-point modes
by using logic ops to manipulate the sign bit.  This patch extends
that approach to vectors, since it fits relatively easily into the
same structure.

The motivating use case was to inline bf16 NEG and ABS operations
for AArch64.  The patch includes tests for that.

get_absneg_bit_mode required a new opt_mode constructor, so that
opt_mode<T> can be constructed from opt_mode<U> if T is no less
general than U.

gcc/
	* machmode.h (opt_mode::opt_mode): New overload.
	* optabs-query.h (get_absneg_bit_mode): Declare.
	* optabs-query.cc (get_absneg_bit_mode): New function, split
	out from expand_absneg_bit.
	(can_open_code_p): Use get_absneg_bit_mode.
	* optabs.cc (expand_absneg_bit): Likewise.  Take an outer and inner
	mode, rather than just one.  Handle vector modes.
	(expand_unop, expand_abs_nojump): Update calls accordingly.
	Handle vector modes.

gcc/testsuite/
	* gcc.target/aarch64/abs_bf_1.c: New test.
	* gcc.target/aarch64/neg_bf_1.c: Likewise.
	* gcc.target/aarch64/neg_bf_2.c: Likewise.
2024-11-20 10:04:46 +00:00
Richard Sandiford
0abb5fa523 Use can_implement_p in the vectoriser
This patch goes through the tree-vect-* code and mechanically replaces
all tests of optab_handler against CODE_FOR_nothing with calls to the
new helper functions.

gcc/
	* tree-vect-data-refs.cc (vect_supportable_dr_alignment): Use
	can_implement_p instead of optab_handler.
	* tree-vect-generic.cc (add_rshift, expand_vector_divmod): Likewise.
	(optimize_vector_constructor, type_for_widest_vector_mode): Likewise.
	(lower_vec_perm, expand_vector_operations_1): Likewise.
	* tree-vect-loop.cc (have_whole_vector_shift): Likewise.
	* tree-vect-patterns.cc (vect_recog_rotate_pattern): Likewise.
	(target_has_vecop_for_code, vect_recog_mult_pattern): Likewise.
	(vect_recog_divmod_pattern): Likewise.
	* tree-vect-stmts.cc (vect_supportable_shift, vectorizable_shift)
	(scan_store_can_perm_p, check_scan_store, vectorizable_store)
	(vectorizable_load, vectorizable_condition): Likewise.
	(vectorizable_comparison_1): Likewise.
2024-11-20 10:04:45 +00:00
Richard Sandiford
485ab50c20 Add helpers to test whether an optab can be implemented
The vectoriser and vector lowering passes both had tests of the form:

    if (op
	&& (optab_handler (op, compute_mode) != CODE_FOR_nothing
	    || optab_libfunc (op, compute_mode)))
      ...success...
    if (code == MULT_HIGHPART_EXPR
	&& can_mult_highpart_p (compute_mode,
				TYPE_UNSIGNED (compute_type)))
      ...success...

This patch adds helper routines for this kind of test, so that it's
easier to handle other optab alternatives in a similar way.

gcc/
	* optabs-query.cc (can_open_code_p, can_implement_p): Declare.
	* optabs-query.h (can_open_code_p, can_implement_p): New functions.
	* optabs-tree.cc (target_supports_op_p): Use can_implement_p.
	* tree-vect-stmts.cc (vectorizable_operation): Likewise.
	* tree-vect-generic.cc (get_compute_type): Likewise.  Remove code
	parameter.
	(expand_vector_scalar_condition, expand_vector_conversion)
	(expand_vector_operations_1): Update calls accordingly.
2024-11-20 10:04:45 +00:00