mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2024-11-27 12:14:10 +08:00

Author	SHA1	Message	Date
Christian Gmeiner	5a15b36a64	etnaviv: nir: Enforce stricter swizzle for virtual scalar x register This change enforces stricter swizzle behavior for the virtual scalar x register, addressing a regression encountered in piglit's spec@glsl-1.10@execution@derivatives@glsl-derivs-abs-sign test. The regression occurred after switching to derivative intrinsics. CC: mesa-stable Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31034>	2024-10-16 12:32:07 +00:00
Danylo Piliaiev	7b09fc98fb	nir/opt_16b_tex_image: Sign extension should matter for texel buffer txf Texel buffer could be arbitrary large, so the assumption being made in the following comment is wrong: "Zero-extension (u16) and sign-extension (i16) have the same behavior here - txf returns 0 if bit 15 is set because it's out of bounds and the higher bits don't matter." Sign extension should matter for GLSL_SAMPLER_DIM_BUF. This fixes the case of doing texelFetch with u16 offset: uniform itextureBuffer s1; uint16_t offset = some_ssbo.offset; value = texelFetch(s1, offset).x; If the offset is higher than s16 optimization incorrectly left it as 16b. In spirv the above glsl is translated into: %22 = OpLoad %ushort %21 %23 = OpUConvert %uint %22 %24 = OpBitcast %int %23 %26 = OpImageFetch %v4int %16 %24 Cc: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31664>	2024-10-16 10:10:00 +00:00
Martin Krastev	5f7f33f323	svga/ci: disable vmware farm Disable farm for DUT maintenance. Signed-off-by: Martin Krastev <martin.krastev@broadcom.com>	2024-10-16 11:38:48 +03:00
Benjamin Lee	49e9cd5211	panvk: don't dirty preloaded tiles for writeback Tiles that are only touched by the preload shader don't need to be written back, because their contents will be the same. The comment suggests that this was the original intent, and that the value of clean_fragment_write was just swapped by accident. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31652>	2024-10-16 07:00:29 +00:00
Timothy Arceri	aa7c59e02c	nir/glsl: set deref cast mode for blocks during function inlining More cast fixes this time for UBO and SSBO. Which were missing testing previously. Fixes: `d681cf96fb` ("nir/glsl: set deref cast mode during function inlining") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11587 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31668>	2024-10-16 06:25:57 +00:00
Leder, Brendan Steve	bd6f3e30f2	amd/vpelib: Fix output_ctx gamma curve cache bug [Why] Reprogramming gamma curve unnecessary and wastes power/time. [How] Bool array checked instead of bool inside said array, add index. [Testing] Tested with corresponding test cases. Reviewed-by: Roy Chan <Roy.Chan@amd.com> Reviewed-by: Krunoslav Kovac <Krunoslav.Kovac@amd.com> Acked-by: Chih-Wei Chien <Chih-Wei.Chien@amd.com> Signed-off-by: Brendan Leder <breleder@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31605>	2024-10-16 02:30:17 +00:00
Jude Shih	259b44d97b	amd/vpelib: Restructure CDC FE/BE [Why] CDC has two separate functionalities : backend and frontend. split them into two for better resource management. Reviewed-by: Roy Chan <Roy.Chan@amd.com> Acked-by: Chih-Wei Chien <Chih-Wei.Chien@amd.com> Signed-off-by: Jude Shih <shenshih@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31605>	2024-10-16 02:30:17 +00:00
Roy Chan	0d7f51724b	amd/vpelib: Add documentation Reviewed-by: Navid Assadian <navid.assadian@amd.com> Acked-by: Chih-Wei Chien <Chih-Wei.Chien@amd.com> Signed-off-by: Roy Chan <roy.chan@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31605>	2024-10-16 02:30:17 +00:00
Brendan	5826ef5e44	amd/vpelib: Visual confirm on input [Why] The gap pointer should advance if it is used. Not a functional bug fix but a nice-to-have change. [How] Advancing the pointer once the memory is used Reviewed-by: Jesse Agate <Jesse.Agate@amd.com> Acked-by: Chih-Wei Chien <Chih-Wei.Chien@amd.com> Signed-off-by: Brendan <brendanSteve.leder@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31605>	2024-10-16 02:30:17 +00:00
Assadian, Navid	a0d2f074cd	amd/vpelib: Add mirror & rotation capability check function for vpe1.1 Reviewed-by: Roy Chan <Roy.Chan@amd.com> Acked-by: Chih-Wei Chien <Chih-Wei.Chien@amd.com> Signed-off-by: Navid Assadian <navid.assadian@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31605>	2024-10-16 02:30:16 +00:00
Brendan	af01d7a181	amd/vpelib: Update configs to be per-pipe specific [Why] config settings should be per pipe in concept [How] update the framework to store configs per pipe Reviewed-by: Roy Chan <Roy.Chan@amd.com> Acked-by: Chih-Wei Chien <Chih-Wei.Chien@amd.com> Signed-off-by: Brendan <brendanSteve.leder@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31605>	2024-10-16 02:30:16 +00:00
Assadian, Navid	6a68af7d21	amd/vpelib: Add better rotation and mirror capability check Add a general rotation and mirror support check support. Reviewed-by: Roy Chan <Roy.Chan@amd.com> Acked-by: Chih-Wei Chien <Chih-Wei.Chien@amd.com> Signed-off-by: Navid Assadian <navid.assadian@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31605>	2024-10-16 02:30:16 +00:00
Evan	3ef8e6a6ae	amd/vpelib: Luma AND Color Keyer Full Support [New] - Added new vpe_stream params for color keying - Added new struct in dpp to capture keying params - Added new capability for color keying - Added keying support in vpe1.0 [Updated] - Updated capability check - Updated Luma and Color Keying functions to better implement the entire feature - Updated resource to map stream params -> dpp keyer param Reviewed-by: Roy Chan <Roy.Chan@amd.com> Acked-by: Chih-Wei Chien <Chih-Wei.Chien@amd.com> Signed-off-by: Evan <evan.damphousse@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31605>	2024-10-16 02:30:16 +00:00
Assadian, Navid	338760d9b5	amd/vpelib: Document public API structures Doxygen style inline comments are added to the public API functions and structures. Reviewed-by: Roy Chan <Roy.Chan@amd.com> Acked-by: Chih-Wei Chien <Chih-Wei.Chien@amd.com> Signed-off-by: Navid Assadian <navid.assadian@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31605>	2024-10-16 02:30:16 +00:00
Brendan	ded1a2b3f0	amd/vpelib: Multiple instance support in caching framework Generalize the caching to work with multiple instances of objects. Change some static functions to public functions to maximize function re-use possibilities. Reviewed-by: Roy Chan <Roy.Chan@amd.com> Acked-by: Chih-Wei Chien <Chih-Wei.Chien@amd.com> Signed-off-by: Brendan <brendanSteve.leder@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31605>	2024-10-16 02:30:16 +00:00
Evan	7a293a812a	amd/vpelib: Color Keyer Implementation [New] - New color keyer function. Take input from VPE_STREAM and FORMAT, program - Added color keyer call in Front end - Hard code Alpha Keyer Range to be entire range - Added vpe_is_yuv() to determine if input format is any YUV format. Combines existing functions - Set Per-pixel Alpha in vpelib when stream.enable_luma_key=1 [Updated] - Updated the macros to include the luma_key field - Updated vpe10 and resource calls to have mathching function calls - removed unused data structure for keying mode since hard code keyer mode to 0x3 in real world - Updated Luma Key enum to properly represent reg spec [Removed] - Removed unused alpha keyer struct. Opted to take in directly from stream_ctx Reviewed-by: Brendan Steve Leder <brendanSteve.leder@amd.com> Acked-by: Chih-Wei Chien <Chih-Wei.Chien@amd.com> Signed-off-by: Evan <evan.damphousse@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31605>	2024-10-16 02:30:16 +00:00
Marek Olšák	2272db2ac6	radeonsi: set the valid buffer range for bindless image buffers This was missing. Cc: mesa-stable Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31651>	2024-10-16 00:27:56 +00:00
Marek Olšák	88f057eb53	gallium/u_threaded: fix crash in tc_create_image_handle due to resource == NULL Fixes: `3df9d8ed80` - gallium/u_threaded: implement pipelined partial buffer uploads using CPU storage Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12020 Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31651>	2024-10-16 00:27:56 +00:00
Paulo Zanoni	4d60f905b3	anv/trtt: extract anv_trtt_first_bind_init_queue() The code that initializes each queue got big enough that the repetitive error handling is getting ugly and it could benefit from being on its own function. v2: Rebase, try to improve the comments. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	8280a6fee9	anv/trtt: convert anv_trtt_bind arrays to util_dynarray Since the L2 bug fix we've been overestimating l3l2_binds by a lot in most of the cases: almost every single call to anv_sparse_bind_trtt ends up using either 0 or 1 elements for l3l2_binds, with occasionally something using 512 or more. By switching to util_dynarray we can guarantee the best of every case: - l1_binds will remain a stack array for the vast majority of the calls - even more than before, since STACK_ARRAY was limited to 8 elements and now we do 32 - l1 will be properly dimensioned without the need for reallocs - l3l2_binds will be completely empty most of the times and only trigger allocations when necessary Here's the top 10 most common results of anv_sparse_bind_trtt() for a trace of Assassin's Creed: Valhalla. The first column is how many times we had that case while running the trace. After this patch, all these cases will proceed without any memory allocations. 168 trtt_binds: num_vm_binds:04 l3l2:0000 l1:0004 344 trtt_binds: num_vm_binds:01 l3l2:0000 l1:0004 420 trtt_binds: num_vm_binds:01 l3l2:0000 l1:0012 422 trtt_binds: num_vm_binds:04 l3l2:0000 l1:0008 479 trtt_binds: num_vm_binds:01 l3l2:0000 l1:0024 560 trtt_binds: num_vm_binds:03 l3l2:0000 l1:0003 1005 trtt_binds: num_vm_binds:01 l3l2:0000 l1:0002 1024 trtt_binds: num_vm_binds:02 l3l2:0000 l1:0004 2145 trtt_binds: num_vm_binds:02 l3l2:0000 l1:0002 3735 trtt_binds: num_vm_binds:01 l3l2:0000 l1:0001 Only 70 out of total 11340 calls to anv_sparse_bind_trtt() contained l3l2 elements. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	5839b9d083	anv/trtt: remove TODO comment regarding the reloc list We use 2MB page table BOs, as defined by ANV_TRTT_PAGE_TABLE_BO_SIZE. Each BO is enough to hold 512 pages, since each one has 4096 bytes. Each L1 page can fit 1024 entries of 64kb size, which means our 512 pages should be able to fit a little less than 32gb of sparse resource memory, since we also need some L2 pages and an L3 page. I don't see any real world application using more than a single BO. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	e047365d76	anv/trtt: delay batch garbage collection Move it past the (n_l3l2_binds == 0 && n_l1_binds == 0) check so we don't end up trying to do garbage collection more often than we submit batches. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	acb759c44f	anv/trtt: decrement trtt->timeline_val when submission fails Otherwise code such as anv_sparse_trtt_garbage_collect_batches() may end up stuck waiting forever on a timeline of a submission that failed. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	0802bbd486	anv/trtt: don't submit empty batches when there are no binds to do The application can submit bind operations where it simply resets state that is already in our page tables, so there's nothing to do. Before commit `7da5b1caef` ("anv: move trtt submissions over to the anv_async_submit") we would simply return and not submit any batches when this happened, but the commit reorganized things in a way where we started submitting empty batches instead. Fix this by simply jumping out when we detect this case. Because of this, rename the "error" labels to "out" as they can now happen on a happy case. It should be noted that an alternative to this implementation would be to move all the handling of 'submit' to after the n_lX_binds check, but this would put all the initialization inside the trtt->mutex, creating extra contention even when we have stuff to bind. Since the "there's nothing to bind" check is now rare (after we stopped doing NULL binds during resource creation), it is probably better to reduce lock contention in the common case at the expense of a little more CPU in the rare case. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	aea9ac47d2	anv/trtt: fix error handling when adding binds We're missing a check for 'result' in the middle of a loop and we have an unnecessary check for 'result' after the loop. Fixes: `7da5b1caef` ("anv: move trtt submissions over to the anv_async_submit") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	fd43c4a973	anv/trtt: unset trtt->l3_addr if initialization fails There is a lot that can go wrong during initialization after we assign trtt->l3_addr, and we use its value to check if trtt is initialized. If an initialization fails after l3_addr is already assigned, the next bind will attempt to use the leftover values from the failed initialization attempt and will likely cause all sorts of random errors. So when we fail, just set l3_addr back to 0, causing the next bind to attempt to initialize everything again. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	c832ed6bd2	anv/trtt: ensure all L3 entries are NULL-bound during init Since everything is always NULL-bound at init and we always bind things back to NULL in anv_free_sparse_bindings(), this means we don't need to do NULL bindings during anv_init_sparse_bindings(), saving us a bunch of time, espcially since we don't track L1 entries so we may end up submitting TR-TT batches just to write zeroes on top of zeroes. v2: Don't unnecessarily check for uses_relocs (Lionel). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	3845d0d393	anv/trtt: inline anv_trtt_init_queues_state() - There's only one caller. - The caller is rather small. - We want to introduce initialization code that's not exactly queue state and reuse the 'submit'. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	6af1d1ae7f	anv/trtt: extract anv_trtt_first_bind_init() In the next commits we're going to move this out of anv_sparse_bind_trtt() and we're also going to add more code to it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	4c366ef67b	anv/trtt: set every entry to NULL when we create an L2 table When we create sparse resources the first thing we do is a NULL bind on them, as the Vulkan spec mandates certain behavior even for unbound sparse resources. We do this with the minimal effort possible: if we can get away with marking an L2 pointer as NULL in the L3 table, we just do it and return, instead of going all the way to creating L1 tables and marking all the final entries as NULL. The strategy we were using had a bug that could lead to previously created NULL entries not being marked as NULL anymore. Let's give an example: (before proceeding, keep in mind that a NULL entry in the L3 and L2 tables has bit 1 set, it does not have the value 0) - Create a 64mb buffer that uses an entire L1 table (needs to be properly aligned), which triggers a NULL bind. - Our algorithm will just set the L3 entry (pointing to the L2 table) as NULL. - Create a 64kb buffer that uses the same L2 table (but a different L1 table). - The NULL bind triggered won't do anything as the L2 table is already NULL. - Bind the first buffer to actual memory. This will end up creating the L2 table and the L1 table. The only entry we will set in the L2 table will be the one pointing to the L1 table. All the other values will be 0 (so they won't have neither the NULL or Invalid bits set: access to them will lead to page faults). - Try to use the second buffer, which is still unbound. It was relying on the fact that its L2 table pointer was NULL, but now it's not anymore, so the page walker will fetch the L1 entries in the L2 table and they will all be zero instead of having the NULL bit set. The fix is pretty simple: whenever we create a new L2 table, set every entry to NULL (except the one we're about to set to non-NULL). This preserves behavior for every other NULL resource relying on the L3 entry being set to NULL. We don't need to do this for the L1 table because its entries are different and instead of having bits to signal NULL entries we have a special TR-TT register that we can set that gets compared to check if an entry is NULL, and we conveniently program it to 0: see ANV_TRTT_L1_NULL_TILE_VAL. I am not aware of any real workloads that are triggering this behavior, I found this issue while investigating something else, running a custom sparse program in our pre-silicon environment, and it told us about the page faults. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:30 +00:00
M Henning	537ada2308	nak: Phi coalescing via biased register coloring Reduces code size by -29.08% on shaderdb + nvk-fossils-foss Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31498>	2024-10-15 22:29:11 +00:00
Dylan Baker	38f7ae5288	release: push 24.3 out two weeks I've had a couple of requests to push the release out 1-2 weeks. There have been various reasons for this, but the best one (IMHO) is that this is the week directly after XDC, and many people will be jetlagged and/or suffering from the post-XDC flu. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31637>	2024-10-15 14:59:50 -07:00
Karol Herbst	ff2c4e8f11	zink: add CL CTS result Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31614>	2024-10-15 21:07:07 +00:00
Juston Li	0c9ee0f2b9	android: look for debug/vendor prefixed options Properties from the vendor partition must use a "vendor." prefix from Android T+. Meanwhile the "debug." prefix can be used for local overrides. The order of precedence thus becomes: 1. getenv 2. debug.mesa.* 3. vendor.mesa.* 4. mesa.* (as a fallback for older versions) Signed-off-by: Juston Li <justonli@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31584>	2024-10-15 20:22:17 +00:00
Kenneth Graunke	4cb67cb07a	intel/brw: Use whole 512-bit registers in constant combining on Xe2 Xe2 increased the register size from 256-bits to 512-bits. So we can store 32 16-bit values in a register, rather than 16 values. Prior to this patch, we hadn't updated the pass, so the second half of each of our registers was unused. Backport-to: 24.2 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Kenneth Graunke	d9e5022650	intel/brw: Delete more Gfx8 code from brw_fs_combine_constants These platforms are supported by elk, not brw. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Kenneth Graunke	dea61b7399	intel/brw: Fix register and builder size in emit_barrier() for Xe2 We were manually allocating 1 REG_SIZE for the barrier payload, which is only half a register on Xe2. This should eventually get allocated to a whole register anyway, but it's awkward in the meantime. Also, we were zero-initializing the header using group(8, 0) which only initialized half the register. The rest of the fields are Reserved MBZ, so they're likely unused and unread anyway - but it's better to zero-initialize them so we don't get random undefined, miserable-to-debug behavior. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Kenneth Graunke	7c9eb8b289	intel/brw: Make a ubld temporary in emit_barrier() Saves typing .exec_all() in a lot of places. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Kenneth Graunke	a9d9488788	intel/brw: Delete Gfx7-8 code from emit_barrier() Those are supported by elk, not brw. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Kenneth Graunke	c747c1e1f4	intel/brw: Fix spill/fill count for load/store_scratch in SIMD32 Honestly, I don't know what I was thinking - we are emitting a single spill/fill message here, but were counting it as 2 spill/fills in SIMD32 shaders. So our eventual shader stat reporting would subtract the number of spills and fills from send_count, and get a negative number, wrapping around to just shy of UINT32_MAX. That's way too many sends. This is especially noticable on Xe2 which often uses SIMD32 shaders. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Pavel Ondračka	58d6906f8c	r300/ci: update ci expectations after piglit uprev Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31663>	2024-10-15 17:43:00 +00:00
Faith Ekstrand	03a393d6ca	nak: Handle annotations in legalization Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31665>	2024-10-15 17:13:27 +00:00
Faith Ekstrand	36d9d11882	nak: Remove annotations before calc_instr_deps() Otherwise the annotations might throw off latency information which needs exact instruction counts. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31665>	2024-10-15 17:13:27 +00:00
Aleksi Sapon	9e769a0620	lavapipe: enable alpha-to-coverage dithering This is a common feature on hardware, both Nvidia and Apple GPUs have it always enabled. On OpenGL this can be controlled using NV_alpha_to_coverage_dither_control, but as far as I can tell there is no extension on Vulkan. Metal also has this feature without a control. Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31373>	2024-10-15 16:17:40 +00:00
Aleksi Sapon	ad4635d6ef	llvmpipe: implement alpha-to-coverage dithering Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31373>	2024-10-15 16:17:40 +00:00
Danylo Piliaiev	6d6d5b869c	freedreno/cffdec: Add option to dump bindless descriptors cffdump --bindless would dump bindless descriptors. We don't know what exactly is in the descriptors, so we dump all interpretations for each of them. Example: set[1]: UBO[0]: { BASE_LO = 0x23806420 } { BASE_HI = 0xc \| SIZE = 0x2 } STORAGE/TEXEL/IMAGE[0]: { TILE_MODE = TILE6_LINEAR \| SWIZ_X = A6XX_TEX_Z \| SWIZ_Y = A6XX_TEX_X \| SWIZ_Z = A6XX_TEX_Y \| SWIZ_W = A6XX_TEX_W \| MIPLVLS = 0 \| SAMPLES = MSAA_ONE \| FMT = FMT6_R8_G8B8_2PLANE_420_UNORM \| SWAP = WZYX } { WIDTH = 12 \| HEIGHT = 8 } { STRUCTSIZETEXELS = 1024 \| STARTOFFSETTEXELS = 0 \| PITCHALIGN = 1 \| PITCH = 128 \| TYPE = A6XX_TEX_2D } { ARRAY_PITCH = 4096 \| MIN_LAYERSZ = 0 } { BASE_LO = 0xa5000 } { BASE_HI = 0x1 \| DEPTH = 1 } { MIN_LOD_CLAMP = 0.000000 \| PLANE_PITCH = 128 } { FLAG_LO = 0xa6000 } { FLAG_HI = 0x1 } { FLAG_BUFFER_ARRAY_PITCH = 327680 \| 0xa0000 } { FLAG_BUFFER_PITCH = 64 \| FLAG_BUFFER_LOGW = 0 \| FLAG_BUFFER_LOGH = 0 } { 11 = 0 } { 12 = 0 } { 13 = 0 } { 14 = 0 } { 15 = 0 } SAMPLER[0]: { XY_MAG = A6XX_TEX_NEAREST \| XY_MIN = A6XX_TEX_NEAREST \| WRAP_S = A6XX_TEX_CLAMP_TO_EDGE \| WRAP_T = A6XX_TEX_MIRROR_CLAMP \| WRAP_R = A6XX_TEX_MIRROR_CLAMP \| ANISO = A6XX_TEX_ANISO_2 \| LOD_BIAS = 4.437500 } { COMPARE_FUNC = FUNC_GEQUAL \| MAX_LOD = 4.000000 \| MIN_LOD = 0.000000 } { REDUCTION_MODE = A6XX_REDUCTION_MODE_MIN \| BCOLOR = 0x400080 } { 3 = 0x1 } Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31632>	2024-10-15 15:35:39 +00:00
Danylo Piliaiev	e2e9dd4f21	freedreno/rnndec: Consider array length when finding by reg name Otherwise we get a valid reg base for reg array with OOB index. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31632>	2024-10-15 15:35:39 +00:00
Deborah Brouwer	0007077c11	ci: remove xfail program@build@include-directories Now that build-piglit.sh is no longer removing ‘include_test.h’ this test `program@build@include-directories` is passing which is causing jobs to fail due to this unexpected improvement. Remove this test from expected fails so that the jobs can pass. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31379>	2024-10-15 15:50:47 +01:00
Collabora's Gfx CI Team	68aa78a858	Uprev Piglit to 7ce69da1199d12ed0ddaa251ed489750523798fb `e9ab30aeae...7ce69da119` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31379>	2024-10-15 15:50:47 +01:00
Mike Blumenkrantz	4ac4004816	llvmpipe: expose GL multiview extensions this is a no-op since lavapipe is already doing it Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31590>	2024-10-15 14:01:42 +00:00

... 3 4 5 6 7 ...

196622 Commits