mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2024-12-02 22:54:05 +08:00

Author	SHA1	Message	Date
José Roberto de Souza	660877cf38	iris: Drop I915_EXEC_FENCE types Those are i915_drm.h specific types and should not be in code paths shared by i915 and Xe KMD. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21887>	2023-03-15 02:05:58 +00:00
Mike Blumenkrantz	747c3ddb9d	glthread: align small buffer uploads to 4 bytes some apps (e.g., supertuxkart) use a ton of 4 byte subdata calls, and this halves their memory consumption Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21875>	2023-03-15 01:22:12 +00:00
Mohamed Ahmed	5ada09412f	anv: remove GetBufferMemoryRequirements2() Signed-off-by: Mohamed Ahmed <mohamedahmedegypt2001@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21898>	2023-03-15 00:30:35 +00:00
Mohamed Ahmed	2649ee0724	vulkan/runtime: implement vkGetBufferMemoryRequirements2() Signed-off-by: Mohamed Ahmed <mohamedahmedegypt2001@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21898>	2023-03-15 00:30:35 +00:00
Mohamed Ahmed	10a4412966	vulkan/runtime: move common buffer related entrypoints to vk_buffer.c Signed-off-by: Mohamed Ahmed <mohamedahmedegypt2001@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21898>	2023-03-15 00:30:35 +00:00
Alyssa Rosenzweig	2bab56737c	panfrost: Note glDrawRangeElements underflow Hopefully this helps someone wiring up robustness later on. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21891>	2023-03-14 23:10:01 +00:00
Alyssa Rosenzweig	c832831a6f	panfrost/ci: Remove fbo-mrt-new-bind fail+flake Seems to pass reliably now. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21891>	2023-03-14 23:10:01 +00:00
Alyssa Rosenzweig	179ed2ff60	panfrost/ci: Add some Piglit skips Skip heavyweight crashing tests that have the potential to take down not just themselves but also other Piglit tests running concurrently via piglit-runner (which would otherwise become piglit-runner level flakes). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21891>	2023-03-14 23:10:01 +00:00
Alyssa Rosenzweig	e060513533	panfrost/ci: Identify some Piglit flakes Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21891>	2023-03-14 23:10:01 +00:00
Alyssa Rosenzweig	6788d37a1f	panfrost/ci: Skip draw_buffers_indexed.random.* on Midgard These are (have always been) quite broken. Given that the whole section is already in the flakes.txt, and there's no plan for improving this (I've tried and fails), I'd rather just skip the section and reduce the noise in the #panfrost-ci channel. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21891>	2023-03-14 23:10:01 +00:00
Alyssa Rosenzweig	a0e9f9278d	panfrost: Handle null textures robustly This is really dumb. But this fixes arb_shader_language_420pack-active-sampler-conflict on v7 which otherwise dereferences a null pointer trying to access the nonexistant texture arrays, or DATA_INVALID_FAULTs if you give it a texture array filled with zeroes. But it seems happy if you bind in null textures. This is dumb but less faults in Piglit is good for reducing flakes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21891>	2023-03-14 23:10:01 +00:00
Alyssa Rosenzweig	b8b6bb18f5	panfrost: Defeature 24-bit textures mesa/st doesn't like to use 24-bit textures, preferring RGBX over true RGB even for texture views where this isn't valid. Given how silly true RGB is in practice, I'd rather drop support and fix texture views than go against the grain and risk more issues down the line since nobody else in tree is testing these paths and apps really shouldn't be caring. Fixes page faults in arb_texture_view-rendering-formats_gles3 which tries to sample an R8G8B8_UINT texture with a R8G8B8X8_UNORM view in one subcase. That test is now passing reliably. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21891>	2023-03-14 23:10:01 +00:00
Alyssa Rosenzweig	7dda731a38	panfrost: Assert that we don't see unsupported vertex formats Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21891>	2023-03-14 23:10:00 +00:00
Alyssa Rosenzweig	589a0fe865	panfrost: Identify "Base vertex offset" signedness This is signed, not unsigned. We were already passing negatives and silently relying on 2's complement and C to do the right thing. But that's silly. We should just, actually do the right thing. Found while struggling to debug primitive-restart-draw-mode. v2: Update the other architectures too, including a decode_csf.c change for the v10 incarnation of this v4-era field. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> [v1] Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21891>	2023-03-14 23:10:00 +00:00
Alyssa Rosenzweig	90e78f6008	pan/bi: Ignore signedness in vertex fetch We just want a bit-exact transfer for integers. Using .auto32 accomplishes this without any clamping shenanigans. Fixes gl-3.0-vertexattribipointer. Note we can't use .auto32 unconditionally, since reading a uint vertex as float is supposed to convert (or something like that, gl-2.0-vertexattribpointer tests the bad case at any rate). Fixes: `482cc273af` ("pan/bi: Implement load attribute with the builder") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21891>	2023-03-14 23:10:00 +00:00
Alyssa Rosenzweig	62497d4860	util/prim_convert: Don't set index_bounds_valid draw->index_bounds_valid tells drivers that the values of min_index/max_index are set correctly and can be used e.g. to allocate memory for varyings. If set incorrectly, the GL promises badness. But, with primconvert, we go mucking with index buffers and then never update the bounds. So it doesn't matter if the original index bounds were valid, we can't promise the original bounds are still valid. If we were trying to optimize CPU overhead, we could try to preserve the new min/max index but seeing as only older Mali cares about this flag, and if you're using primconvert you're already screwed, I'm not too inclined to go rework primconvert. Fixes* page faults in primitive-restart-draw-mode on Mali-G52 for GL_QUAD_STRIPS and GL_POLYGON, which hit the primconvert path. The full dmesg splat looks like: [ 5438.811727] panfrost ffe40000.gpu: Unhandled Page fault in AS0 at VA 0x000000100A16BAC0 Reason: TODO raw fault status: 0x25002C1 decoded fault status: SLAVE FAULT exception type 0xC1: TRANSLATION_FAULT_1 access type 0x2: READ source id 0x250 Notice that a high bit is randomly set in the address, this is trying to read a varying from the actual varying buffer in the vicinity of 0xa16bac0. What's actually happening is that we're trying to read index #0 despite promising the driver a minimum index of 2, causing an integer underflow as we try to read index -2, or as the hardware sees, 4294967294. As long as we stop lying to panfrost about the bounds being correct, panfrost is able to calculate the real (post-primconverted) bounds on its own, fixing the test. * Alternatively, maybe Panfrost should just ignore this bit, in which I don't know why we have it in Gallium, since it's probably not conformant to fault on out-of-range glDrawRangeElements. Fixes: `72ff53098c` ("gallium: add pipe_draw_info::index_bounds_valid") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21891>	2023-03-14 23:10:00 +00:00
Mike Blumenkrantz	2409ddb5db	zink: fix copy box iteration Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21907>	2023-03-14 21:25:55 +00:00
Mike Blumenkrantz	7d41b8fe4e	tu: don't set startup debug on debug builds this is incredibly annoying on normal linux systems Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21809>	2023-03-14 19:48:24 +00:00
Jarred Davies	1115a29025	pvr: Fix segfaults when pDepthStencilAttachment is NULL depth_stencil_attachment has been changed from a pointer to the attachment idx to just the attachment idx, as this avoids the driver having to check for NULL when comparing attachments indexes with depth_stencil_attachment. Anyplace that relies on depth_stencil_attachment being a valid index must already check that depth_stencil_attachment is not VK_ATTACHMENT_UNUSED, so this change avoids having to check both the pointer and the index for the same information. Noticed when running dEQP-VK.api.smoke.triangle Signed-off-by: Jarred Davies <jarred.davies@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21690>	2023-03-14 19:27:27 +00:00
Eric Engestrom	a0bf0adade	ci/broadcom: move rare failure to the flakes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21901>	2023-03-14 18:26:31 +00:00
Yiwei Zhang	179fadb332	venus: make external fence and semaphore export async This also makes vn_QueueSignalReleaseImageANDROID async since it makes use of a queue submit followed by an external fence export internally. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21716>	2023-03-14 18:07:38 +00:00
Yiwei Zhang	a37771b42a	venus: refactor to add vn_sync_payload_external Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21716>	2023-03-14 18:07:38 +00:00
Yiwei Zhang	891af34bca	venus: make common wsi bo submission async Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21716>	2023-03-14 18:07:38 +00:00
Yiwei Zhang	0a3f612ab3	venus: let vn_instance_submit_command track ring seqno Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21716>	2023-03-14 18:07:38 +00:00
Yiwei Zhang	1cb42a629f	venus: make vn_instance_wait_roundtrip asynchronous vn_instance_roundtrip does 2 things: 1. vn_instance_submit_roundtrip - before: encode a cmd to write vq seqno to ring extra field - after: encode a cmd to update vq seqno against a ring - submit the encoded cmd via vq 2. vn_instance_wait_roundtrip - before: wait until ring extra field has the vq seqno - after: let renderer ring thread wait for the vq seqno Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21716>	2023-03-14 18:07:38 +00:00
Yiwei Zhang	9b7a78cac6	venus: switch to use 64bit roundtrip seqno This is to prepare for later async roundtrip waiting while seamlessly compatible with legacy way. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21716>	2023-03-14 18:07:38 +00:00
Yiwei Zhang	932073d3e6	venus: sync to latest protocol for asyncRoundtrip Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21716>	2023-03-14 18:07:38 +00:00
Martin Roukala (né Peres)	10e0c5fd46	ci/b2c: move away from the hand-rolled initscript Up until now, we have been handrolling part of the init-stage2.sh in the b2c command line. Let's stop doing that and instead use the same script as every other HW farms. Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21872>	2023-03-14 17:22:07 +00:00
SoroushIMG	4affc3b361	zink: rename shadow key to zs swizzle No functional change. The shadow shader swizzle pass has been extended to optionally include all z/s textures. Rename the structs/variables to reflect this now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21571>	2023-03-14 17:03:30 +00:00
SoroushIMG	24a2530ed8	zink: workaround undefined swizzle 1 for z/s textures using swizzle 1 with z/s textures returns undefined data on some Imagination hardware. Work around this by using the same shader swizzling used for shadow samplers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21571>	2023-03-14 17:03:30 +00:00
SoroushIMG	2cf117ee39	zink: add depth/stencil needs shader swizzle workaround field Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21571>	2023-03-14 17:03:30 +00:00
SoroushIMG	cc15dbc4f8	zink: extend shadow swizzle pass to all zs textures if needs_zs_shader_swizzle is used, apply constant swizzles to all depth/stencil textures and not just shadow samplers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21571>	2023-03-14 17:03:30 +00:00
SoroushIMG	79557c2747	zink: add needs_zs_shader_swizzle shader key This will be used later, but for now it should always be disabled. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21571>	2023-03-14 17:03:30 +00:00
SoroushIMG	b707cdccf5	zink: minor formatting change that line was becoming too long. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21571>	2023-03-14 17:03:30 +00:00
SoroushIMG	f7257b1c75	zink: track shadow swizzle for all shader stages this will be used later on to enable the pass in all shader stages. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21571>	2023-03-14 17:03:29 +00:00
SoroushIMG	a83e63437f	zink: fix shadow mask change logic when binding sampler views First make sure shadow mask change sets dirty state. Second move shadow mask bit removal to unbind_samplerview which is cleaner and correctly clears the shadow bit when binding buffer texture. Fixes: `5193f4f712` ("zink: add a fs shader key member to indicate depth texturing mode") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21571>	2023-03-14 17:03:29 +00:00
SoroushIMG	5903868f99	zink: fix stale point sprite mode state Fixes: `cf8ca77be1` ("zink: handle point sprite") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21840>	2023-03-14 16:41:48 +00:00
Mike Blumenkrantz	4b4306fe10	zink: super reorder buffer copies usually zink_get_cmdbuf() is enough for reordering operations, but with new technology, it becomes possible to promote even the most stubborn buffers to the unordered cmdbuf first, check the src buffer to ensure that there's no pending writes in the main cmdbuf that would prohibit reordering second, apply a TRANSFER_DST to the dst buffer using the util function to determine whether it can be reordered if both the src and dst can be reordered for their respective regions and read/write usage, then the entire op can be promoted regardless of the unordered_read/unordered_write flags this optimizes out patterns like upload index buffer (offset=0) draw upload index buffer (offset=128) draw upload index buffer (offset=256) draw ... so that the uploads and draws can be separated and batched Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21802>	2023-03-14 16:23:06 +00:00
Mike Blumenkrantz	128d19da5e	zink: rename zink_check_transfer_dst_barrier() Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21802>	2023-03-14 16:23:06 +00:00
Mike Blumenkrantz	e0c53554ae	zink: unify image TRANSFER_DST barrier checks this should be consistent with buffers Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21802>	2023-03-14 16:23:06 +00:00
Mike Blumenkrantz	e55e9014b3	zink: return the unordered state from zink_resource_buffer_transfer_dst_barrier() convenience usage Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21802>	2023-03-14 16:23:06 +00:00
Mike Blumenkrantz	fe6f0692ed	zink: rework zink_resource::valid_buffer_range this is now the valid buffer region for the "main" command buffer, and all transfer ops store their regions in the copy boxes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21802>	2023-03-14 16:23:06 +00:00
Mike Blumenkrantz	8b38c4f43c	lavapipe: beef up LVP_POISON_MEMORY this makes lavapipe behave more like a tiler and completely annihilate any existing data for DONTCARE load/store ops Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21847>	2023-03-14 14:52:24 +00:00
Lionel Landwerlin	d4a2c0fcaa	vulkan/wsi: add a headless swapchain implementation/option I wanted to find slow pieces of code in our Anv driver using our drm-shim stub. The last bit of code still talking to the compositor was the WSI swapchain code and failing because none of the submissions are taking place (because of the stub). This change introduces a new variable MESA_VK_WSI_HEADLESS_SWAPCHAIN which when set turns every swapchain creation into a headless swapchain. This swapchain does not present anything, allowing the application to spin as many frames as possible. Thus helping to identify slow spots in command buffer building path. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6156>	2023-03-14 14:03:31 +00:00
Dave Airlie	4e0d4aab48	anv: fix image height for field pictures. Fixes: `98c58a16ef` ("anv: add initial video decode support for h264.) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21807>	2023-03-14 13:34:53 +00:00
Lionel Landwerlin	56474fae93	intel/fs: fix subgroup invocation read bounds checking nir->info.subgroup_size can be set to an enum : SUBGROUP_SIZE_VARYING = 0 SUBGROUP_SIZE_UNIFORM = 1 SUBGROUP_SIZE_API_CONSTANT = 2 SUBGROUP_SIZE_FULL_SUBGROUPS = 3 So compute the API subgroup size value and compare it to the dispatch size to determine whether we need some bound checking. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9ac192d79d` ("intel/fs: bound subgroup invocation read to dispatch size") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21856>	2023-03-14 12:15:48 +00:00
Daniel Schürmann	f6a36190a1	radv/rt: Fix any_hit scratch variables. We have to make sure not to change call_data locations as well. Fixes: `481f78ab93` ('radv/rt: place any-hit scratch vars after intersection scratch vars') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21876>	2023-03-14 11:57:02 +00:00
Emma Anholt	5bb9ab896c	ci: Re-enable some swrast testing using fd.o's shared runners for now. I'm not planning to stand mesa-swrast back up until we get Kata set up, so turn the testing back on at a reduced fraction on so that venus/llvmpipe/etc. dev can still get some coverage. I haven't turned lavapipe back on, because it is now unstable in memory model / atomics tests. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21880>	2023-03-14 11:31:34 +00:00
Lionel Landwerlin	bf59cfcee1	intel/fs: prevent large vector ops generated by peephole_ffma Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21782>	2023-03-14 10:38:50 +00:00
Lionel Landwerlin	bc08f43991	intel/fs: add MOV source count validation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21782>	2023-03-14 10:38:50 +00:00
Lionel Landwerlin	ed3c2f73db	intel/fs: fixup sources number from opt_algebraic Fixes issues with register_coalesce : fossilize-replay: brw_fs_register_coalesce.cpp:297: bool fs_visitor::register_coalesce(): Assertion `mov[i]->sources == 1' failed. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21782>	2023-03-14 10:38:50 +00:00
Lionel Landwerlin	18bdc71459	intel/fs: fix nir_opt_peephole_ffma max vec assumption There can be larger vec than vec4. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21782>	2023-03-14 10:38:50 +00:00
Lionel Landwerlin	efde1917c9	intel/fs: don't SEND messages as partial writes For instance, to load uniform data with the LSC we usually rely on tranpose messages which have to execute in SIMD1. Those end up being considered as partial writes so within loops their life span spread to the whole loop, increasing register pressure. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21867>	2023-03-14 10:10:32 +00:00
Lionel Landwerlin	adcdc38f3b	anv: more formats for acceleration structure vertices Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21821>	2023-03-14 09:34:27 +00:00
Dave Airlie	cb24faf1a6	anv/video: disable picture id reampping. This isn't needed at the hw level with vulkan Fixes: `98c58a16ef` ("anv: add initial video decode support for h264.") Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21433>	2023-03-14 07:32:00 +00:00
Dave Airlie	f85b2cbe33	anv/video: fix chroma qp to be a integer value. This is just a cleanup to the genxml Fixes: `98c58a16ef` ("anv: add initial video decode support for h264.") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21433>	2023-03-14 07:32:00 +00:00
Mike Blumenkrantz	c28c995645	lavapipe: add command debugging I keep adding this in locally. it's great for debugging Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21814>	2023-03-14 06:16:32 +00:00
Mike Blumenkrantz	e6e1d01be0	lavapipe: set render_condition_enabled=false for vkCmdClearDepthStencilImage this command ignores conditional rendering fixes: dEQP-VK.conditional_rendering.conditional_ignore.clear_condition_host_memory_expect_noop dEQP-VK.conditional_rendering.conditional_ignore.clear_condition_host_memory_secondary_buffer_expect_noop dEQP-VK.conditional_rendering.conditional_ignore.clear_condition_host_memory_secondary_buffer_expect_noop_inverted dEQP-VK.conditional_rendering.conditional_ignore.clear_condition_host_memory_secondary_buffer_inherited_expect_noop dEQP-VK.conditional_rendering.conditional_ignore.clear_condition_local_memory_expect_noop_inverted dEQP-VK.conditional_rendering.conditional_ignore.clear_condition_local_memory_secondary_buffer_expect_noop dEQP-VK.conditional_rendering.conditional_ignore.clear_condition_local_memory_secondary_buffer_expect_noop_inverted dEQP-VK.conditional_rendering.conditional_ignore.clear_condition_local_memory_secondary_buffer_inherited_expect_noop Fixes: `fe53c22294` ("lavapipe: fix only clearing depth or stencil paths.") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21814>	2023-03-14 06:16:32 +00:00
Mike Blumenkrantz	c9e757c61e	lavapipe: fix dynamic depth clamping on pipeline bind with dynamic state, depth_clip_near needs to either be set by * applying the dynamic state * using the pipeline state the previous code always used the pipeline state fixes: dEQP-VK.pipeline.*.extended_dynamic_state.between_pipelines.depth_clamp_enable Fixes: `650880105e` ("vulkan,lavapipe: Use a tri-state enum for depth clip enable") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21814>	2023-03-14 06:16:31 +00:00
Lionel Landwerlin	d8013976c7	anv: export EXT_pipeline_library_group_handles only with RT Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21870>	2023-03-14 02:08:01 +00:00
Eric Engestrom	76b591d8f7	broadcom/ci: no need to skip the tests that swap buffers anymore Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21877>	2023-03-14 01:31:19 +00:00
Mike Blumenkrantz	43facca195	aux/tc: use renderpass tracking to optimize texture_subdata calls if it's known that a renderpass is active and the driver wants to do renderpass optimizing, help out by not forcing a sync and instead doing what the driver would do: create a staging buffer and copy it to the image this requires that the driver already handles buffer -> image copies with resource_copy_region Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21801>	2023-03-14 00:53:28 +00:00
Konstantin Seurer	ecf29228d0	radv/sqtt: Skip dumping pipeline libraries They don't have any shaders which can lead to crashes when dumping them. Fixes: `2e04aeb` ("radv: capture RT pipelines from the SQTT layer") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21852>	2023-03-13 20:05:49 +00:00
Mark Collins	715adcb884	tu: fix tu_GetInstanceProcAddr not handling null instance It is legal to pass in nullptr as an instance into vkGetInstanceProcAddr when resolving any global addresses, this wasn't handled correctly and an illegal access to a member of a null struct was made. Signed-off-by: Mark Collins <mark@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21827>	2023-03-13 19:31:33 +00:00
Mark Collins	9c808043f3	tu: KGSL backend rewrite This commit rewrites the KGSL backend to utilize vk common wherever possible to bring the codebase in line with DRM while implicitly fixing minor API bugs that may have occurred as a result of manually implementing VK functions. As a part of moving to vk common, KGSL sync is now implemented atop vk common sync and vastly expanded in terms of functionality such as: * Import/Export of sync FDs - A required capability for properly supporting the Android WSI and as these functions were stubbed when a presentation operation used semaphores, it would cause a leak of FDs that were imported due to the expectation that the driver would close them. As well as causing UB around due to ignoring the imported FD or not exporting a valid FD. * Supporting pre-signalled fences - Vulkan allows fences to be created in a signalled state which was stubbed prior and can lead to UB. * Timeline semaphore support - As a result of utilizing vk common as the backbone for synchronization, its timeline semaphore emulation has been utilized to provide support for them without needing kernel support. (Note: On newer versions of KGSL, timeline semaphores can be implemented natively rather than using emulation as they support wait-before-signal) Fixes freezes due to semaphore usage with presentation on: * Genshin Impact * Skyline Emulator Signed-off-by: Mark Collins <mark@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21651>	2023-03-13 18:59:50 +00:00
Pierre-Eric Pelloux-Prayer	88989379b1	Revert "driconf: add a workaround for plasmashell freezing" This reverts commit `41eb491fb6`. The underlying issue was fixed by the previous commit. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20887>	2023-03-13 18:28:15 +00:00
Pierre-Eric Pelloux-Prayer	a98e4195f5	yegl/wayland: fix glthread deadlocks We need to make sure that glthread is idle before using wl_* functions or they might be used from 2 threads at the same time. Thanks to @deltib for the investigation of this issue. Fixes: `58f90fd03f` ("egl/wayland: fix glthread crashes") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7624 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8136 Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20887>	2023-03-13 18:28:15 +00:00
Daniel Stone	95e8be29a7	ci/panfrost: Add texturesize flake seen in the wild Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20887>	2023-03-13 18:28:15 +00:00
Rob Clark	ea3e9d541f	freedreno/a6xx: Simplify iova emit Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:24 +00:00
Danylo Piliaiev	5ca3481b5d	freedreno/register: Define chip enum values Otherwise it cannot be used in templates Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:24 +00:00
Rob Clark	6b2c1b00ff	freedreno/registers: Define rest of CP_REG_WRITE Enough that we can use OUT_PKT() to emit it, which will be needed when we use it to write regs that are different btwn a6xx and a7xx. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:24 +00:00
Rob Clark	6dd5b4ca5f	freedreno/registers: Fix nameless fields Originally if we had an anonymous field (ie. field declared as part of the register definition itself) the name in the generated field struct would include the gen prefix (ie. .a6xx_rb_stencil_buffer_pitch), but this doesn't work for variants because the variant regs would have different gen prefixes. Fix this by using reg name instead of the full_name. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:24 +00:00
Rob Clark	dc43237d1a	freedreno/registers: Add c++ magic for register variants For regs with multiple variants, generate a template'ized function to pack the reg value. If the template param is known at compile time (which is the expected usage) this will optimize to the same thing as the "traditional" reg packing. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:24 +00:00
Rob Clark	d58af7b5c7	freedreno/registers: Split out regpair builder helper We are going to want to re-use this in the next commit. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:24 +00:00
Rob Clark	d54edcfc72	freedreno/registers: Track varset Track varset and assert that variants refer to a valid varset enum value. This adds a bit of extra sanity checking, but becomes more useful in the next commit. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:23 +00:00
Rob Clark	f011189642	freedreno/registers: Start adding stuff for a7xx Start adding the bits needed for userspace. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:23 +00:00
Rob Clark	b90d4a0701	freedreno/decode: Start adding a7xx support Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:23 +00:00
Rob Clark	dd6e7041ab	freedreno/registers: Start adding a7xx pipe/control regs Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:23 +00:00
Rob Clark	56f9371f7e	freedreno/registers: Merge a6xx and a7xx regs They have more similarities than differences, so merge them and use "variant" attribute as needed to manage differences. Note initially using "variant" conservatively when it comes to regs known on a7xx but not a6xx. It could be that they exist also on later versions of a6xx as well, for example. For ex, LPAC related regs/bits likely existed on later a6xx (eg. a660 family) but BV stuff is not. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:23 +00:00
Rob Clark	684931166d	freedreno/registers: Add prefix="variant" To merge a7xx and a6xx regs, using variant property to manage the differences, we'll want regs/etc to be named according to the first generation it is use rather than the domain name. Add a new prefix type to accomplish this. By default, if no variant property, things will still be named based on domain (ie. REG_A6XX_...), and things that have variant="A6XX" will also end up as they currently are (since the chip enum matches domain name), but things that have variant="A7XX" will end up as REG_A7XX_... Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:23 +00:00
Rob Clark	fadf76b938	freedreno/registers: Fix designator order C++ is picky about order matching for some reason. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:23 +00:00
Rob Clark	4a528e8f5f	freedreno/a6xx: Convert to c++ Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:23 +00:00
Rob Clark	ce336097f1	freedreno/a6xx: Fix designator initializer order Clang seems more relaxed about this, allowing C99 style initializers without requiring ordering. But unfortunately g++ is more picky :-/ TODO this doesn't completely fix everything with g++, namely sparse array initialization.. for ir3 driver-params, I think we can convert these to structs. But there are still one or two others to deal with. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:23 +00:00
Rob Clark	96ca37b9af	freedreno/a6xx: Add missing "inline" Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	af2f0c3d9b	freedreno/a6xx: Rework texture_clear fallback C++ is more picky about a goto jumping over variable initialization, even if unused after the goto label (presumably because of destructors that can be called after a variable goes out of scope). Since there is only a single fallback path, get rid of the goto. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	f921b7c09b	freedreno: c++-proofing Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	05958fa6c9	freedreno: Un-inline buffer-mask enum Also, fix obsolete comment. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	37a036500a	freedreno/ir3: Add missing driver params Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	5eed59cc87	freedreno/ir3+tu: Calculate subgroup size in ir3 TBD if the size changes for a7xx, but at least let's have it in one place instead of duplicating in turnip and gallium. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	c449e63809	freedreno/ir3: c++-proof the headers Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	bff0ff5ae3	freedreno/ir3: Don't use negative opc for meta instructions Stricter compilers complain about this, ie: error: left operand of shift expression ‘(-1 << 7)’ is negative [-fpermissive] Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	7c7761574e	freedreno/ir3: Un-inline enums It seems to be a thing that c++ dislikes Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	69947b284e	freedreno: Quiet c++ warning about designated initializers And various other things that c++ is more strict about. Perhaps we re-instate a few of the more reasonable warnings over time. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	64e93ca9a1	freedreno/registers: Add regs for a690 New regs needed on kernel side. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	44d0365a4d	freedreno/registers: Schema validation for gen_header.py Lets catch issues at build time, and not relying on someone remembering to run the unit tests. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:22 +00:00
Rob Clark	963729af2a	freedreno: Nerf strict-aliasing warning for all of gcc Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21846>	2023-03-13 17:31:21 +00:00
Samuel Pitoiset	4d03bf0f9d	radv: allow to cache optimized (LTO) pipelines with GPL This should be working now, except PS epilogs that are still not added to the cache. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21834>	2023-03-13 13:35:24 +00:00
Samuel Pitoiset	532d63993f	radv: keep track of the retained NIR shaders sha1 for LTO pipelines Otherwise the per pipeline cache key doesn't consider shaders at all when they are imported from libs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21834>	2023-03-13 13:35:24 +00:00
Samuel Pitoiset	fbc7e8f3df	radv: determine if a graphics pipeline needs a noop FS earlier Also introduce a helper. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21834>	2023-03-13 13:35:24 +00:00
Samuel Pitoiset	86ab8c33ed	radv: fix the error code when the driver fails to create a PS epilog It would have been returned VK_SUCCESS. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21834>	2023-03-13 13:35:24 +00:00
Daniel Schürmann	481f78ab93	radv/rt: place any-hit scratch vars after intersection scratch vars If both, any-hit and intersection shader, use scratch vars, it could happen that they end up in the same location and overwrite each other. Found by inspection. Fixes: `c3d82a9622` ('radv: Add pass to lower anyhit shader into an intersection shader.') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21863>	2023-03-13 11:45:26 +00:00
Jordan Justen	48ff68820e	intel/dev: Enable MTL PCI ids Ref: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/include/drm/i915_pciids.h?h=v6.0-rc4#n736 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18481>	2023-03-13 10:17:51 +00:00
Mike Blumenkrantz	e28b982db8	radv: avoid a huge memset in radv_graphics_pipeline_compile() this has a noticeable impact on pipeline creation Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20947>	2023-03-13 08:11:10 +01:00
Samuel Pitoiset	1c286db14e	radv: zero-initialize radv_shader_info earlier for graphics pipeline This should allow us to remove a big memset when compiling a graphics pipeline. This is mostly for imported NIR stages which don't go through radv_pipeline_stage_init(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20947>	2023-03-13 08:11:10 +01:00
Samuel Pitoiset	67635bb3e3	radv: zero-initialize radv_shader_args right before declaring them This should allow us to remove a big memset when compiling a graphics pipeline. This is mostly for imported NIR stages which don't go through radv_pipeline_stage_init(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20947>	2023-03-13 08:11:10 +01:00
Mike Blumenkrantz	c505f892d4	radv: delete radv_graphics_pipeline_compile() asserts validation should catch these by now Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20947>	2023-03-13 08:11:10 +01:00
Vinson Lee	29c6a09887	pps: Fix build errors. In file included from ../src/tool/pps/pps_device.cc:10: ../src/tool/pps/pps_device.h:23:11: error: ‘uint32_t’ does not name a type 23 \| static uint32_t device_count(); \| ^~~~~~~~ In file included from ../src/tool/pps/pps_counter.cc:10: ../src/tool/pps/pps_counter.h:22:4: error: ‘uint32_t’ does not name a type 22 \| uint32_t id; \| ^~~~~~~~ Fixes: `1cc72b2aef` ("pps: Gfx-pps v0.3.0") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8186 Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21714>	2023-03-13 01:22:46 +00:00
Marek Olšák	c455ea6144	glthread: qualify the *cmd unmarshal parameter with restrict This seems like a logical thing to do. Clearly the memory can't be accessed with any other pointer. Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21777>	2023-03-12 17:56:18 -04:00
Marek Olšák	862b00b795	mesa: put dispatch table initialization into one place We have 3 new/changed functions with this commit: 1. _mesa_alloc_dispatch_tables creates all dispatch tables that are not created on demand and sets them to nop. This operates on gl_dispatch, so it's reusable (e.g. glthread will want to use it) 2. _mesa_free_dispatch_tables frees everything 3. _mesa_initialize_dispatch_tables initializes gl_dispatch for GL (not glthread) Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21777>	2023-03-12 17:56:16 -04:00
Marek Olšák	dae902e11e	mesa: rename CurrentClientDispatch to GLApi I like this more. The name self-documents itself. It's always equal to the dispatch set in glapi. GLAPI is a definition, so can't use that. Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21777>	2023-03-12 17:56:15 -04:00
Marek Olšák	6b22642e21	mesa: move ctx->Table -> ctx->Dispatch.Table except Client & MarshalExec There is a new struct gl_dispatch, which I'd like to reuse in glthread. This allows building code around gl_dispatch that can be shared between mesa and glthread. This is only refactoring. Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21777>	2023-03-12 17:56:11 -04:00
Marek Olšák	ef0e327d9f	glapi: inline the meson list files_mapi_util so that people can easily tell where these files are used by searching for the file names in the meson files. Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21777>	2023-03-12 17:56:10 -04:00
Marek Olšák	eed145004b	glapi: move files specific to shared-glapi into the shared-glapi subdirectory Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21777>	2023-03-12 17:56:03 -04:00
David Heidelberg	7cf7d497e7	ci/clover: disable the jobs Prepare for Clover removal; don't waste resources on Clover anymore. Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21865>	2023-03-12 20:50:14 +01:00
Daniel Schürmann	3d4f6a00b8	aco/spill: allow for disconnected CFG Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20853>	2023-03-12 18:07:18 +00:00
Daniel Schürmann	caec48529b	aco/insert_exec_mask: allow for disconnected CFG Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20853>	2023-03-12 18:07:18 +00:00
Daniel Schürmann	7f7a70778f	aco/dead_code_analysis: don't add artificial uses to p_startpgm Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20853>	2023-03-12 18:07:18 +00:00
Daniel Schürmann	fb99bc5f30	aco/value_numbering: clear hashmap between disconnected CFGs There is no dominance-relationship between two disconnected CFGs, thus no CSE is possible. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20853>	2023-03-12 18:07:18 +00:00
Daniel Schürmann	678aef9f06	aco/dominance: set immediate dominator for any BB without predecessors Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20853>	2023-03-12 18:07:18 +00:00
Kai Wasserbäch	bb2db56ffe	fix: gallivm: fix LLVM #include of Host.h, moved to TargetParser Upstream moved Host.h from Support to TargetParser in LLVM 17. This shouldn't lead to a FTBFS, since there is a forwarding include left behind. Sadly the added deprecation warning #pragma is invalid and thus causes a build failure right away. But since we would have to follow the move anyway in the future, just do it right away. Reference: `d768bf994f` Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Closes: #8275 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21263>	2023-03-12 14:02:23 +00:00
Konstantin Seurer	e3aa058317	radv/rt: Properly handle pNext of pipeline library stages Fixes dEQP-VK.pipeline.pipeline_library.graphics_library.misc.non_graphics.shader_module_info_rt_lib. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21707>	2023-03-12 13:18:15 +00:00
Konstantin Seurer	ef5cba56a0	vulkan: Add vk_shader_module_init This will be used for allocating shader modules using ralloc by RADV. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21707>	2023-03-12 13:18:15 +00:00
Konstantin Seurer	0fc8335ccb	radv/rt: Use vk_pipeline_hash_shader_stage for RT stages Fixes dEQP-VK.pipeline.pipeline_library.graphics_library.misc.non_graphics.shader_module_info_rt. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21707>	2023-03-12 13:18:15 +00:00
David Heidelberg	2b00eaaedc	ci/iris: update apl and glk expectations, after enabling Wayland support After enabling the Wayland platform for x86_64, multiple new tests were triggered, some of which timed out. Also wayland-dEQP-EGL.functional.negative_api.create_pixmap_surface now pass. Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21786>	2023-03-12 00:11:09 +00:00
Alyssa Rosenzweig	45554a957a	agx: Lower discard late Fixes regression with Dolphin's ubershaders. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21855>	2023-03-11 23:34:56 +00:00
Mike Blumenkrantz	c04a7c9267	zink: ignore renderdoc if ZINK_RENDERDOC isn't in use this otherwise has some weird side effects Fixes: `48a0478126` ("zink: add renderdoc handling") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21816>	2023-03-11 22:10:38 +00:00
Alyssa Rosenzweig	7e908878c1	ail: Restructure generated tests Currently, the generated tests consist of some boilerplate, generated test cases, and at the very end the actual test. This is bad for readability, because the actual code is all the way at the bottom. It's also bad for clang-format linting: even though the test cases are /* clang-format off */, they still take an exceptionally long time to parse when linting. I suspect this is a clang-format bug, but it's easy enough to workaround. To solve these issues, restructure so that the test cases are in separate files (containing the actual data), but the manually written test functions are consolidated into a new family of generated layout tests. This is probably cleaner. Parallel clang-format linting is now 10x faster on the M1, which means it's now practical to lint in my "publish branch" hook. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21854>	2023-03-11 20:45:42 +00:00
José Roberto de Souza	43e21702f6	anv: Integrate gem vm bind and unbind kmd backend functions Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21698>	2023-03-11 17:56:01 +00:00
José Roberto de Souza	37fa2fa30e	anv: Add gem VM bind and unbind to backend Not using it yet, that will be done in the next patch. Xe only supports submission using VM. For i915 the backend functions are just a noop. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21698>	2023-03-11 17:56:01 +00:00
José Roberto de Souza	324d22d684	anv: Implement gem close and mmap for Xe backend Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21698>	2023-03-11 17:56:01 +00:00
José Roberto de Souza	149e945ad4	anv: Implement Xe functions to create and destroy VM Also using the vm_id to create gem buffers. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21698>	2023-03-11 17:56:01 +00:00
José Roberto de Souza	d5f767edf9	anv: Implement gem_create for Xe backend Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21698>	2023-03-11 17:56:01 +00:00
Isabella Basso	59fea8af3a	nir/algebraic: remove duplicate bool conversion lowerings While [1] added some boolean conversion lowering patterns, those were already dealt with on [2]. [1] - `b86305bb` ("nir/algebraic: collapse conversion opcodes (many patterns)") [2] - `d7e0d47b` ("nir/algebraic: nir: Add a bunch of b2[if] optimizations") Fixes: `b86305bb` ("nir/algebraic: collapse conversion opcodes (many patterns)") Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:38 +00:00
Isabella Basso	a553d3cd29	nir/algebraic: make patterns for float conversion lowerings imprecise As noted on [1], lowering patterns of the form floatS -> floatB -> floatS ==> floatS cannot require precision since this may cause flush denorming. [1] `3f779013` ("nir: Add an algebraic optimization for float->double->float") Fixes: `b86305bb` ("nir/algebraic: collapse conversion opcodes (many patterns)") Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Isabella Basso	79c94ef52e	nir/algebraic: extend lowering patterns for conversions on smaller bit sizes Conversions on smaller bit sizes should also be collapsed when composed. This also adds more patterns on the intS -> intB -> floatB ==> intS -> floatB lowering so as to deal with any int size C > B instead of a fixed intB. Closes: #7776 Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Isabella Basso	a27bcd63d0	nir/algebraic: extend mediump patterns Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Suggested-by: Italo Nicola <italonicola@collabora.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Isabella Basso	b3685f3ba7	nir/algebraic: insert patterns inside optimizations list Some patterns were outside the list of optimizations. Fixes: `b86305bb` ("nir/algebraic: collapse conversion opcodes (many patterns)") Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Alyssa Rosenzweig	2ba48eea88	nir/lower_point_size: Use shader_instructions_pass Sleepy code deletion mood. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21750>	2023-03-11 16:42:36 +00:00
Alyssa Rosenzweig	933b5c76f6	agx: Switch to scoped_barrier Rather than ingesting separate control and memory barriers, ingest only the combined and optimized scoped_barrier intrinsic. For barriers originating from GLSL, this makes it easier to ensure correctness. For barriers originating from SPIR-V, this is required for translation at all, as spirv_to_nir knows only scoped barriers. So this gets us closer to Vulkan and OpenCL. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21752>	2023-03-11 16:20:06 +00:00
David Heidelberg	84767a5160	ci/lava: every LAVA job doesn't want to run gles2 deqp, drop it Very annoying when adding new job and not getting failure due to missing `DEQP_VER: ` Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21702>	2023-03-11 14:48:20 +00:00
David Heidelberg	8cdbb894ca	ci/panfrost: correct the job name, as it runs on gles2 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21702>	2023-03-11 14:48:20 +00:00
David Heidelberg	e3660c2820	ci/amd: move skqp and va jobs on raven from XOrg to the XWayland Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21702>	2023-03-11 14:48:20 +00:00
David Heidelberg	1e262f129b	ci: add and utilize dalboz devices New 10 devices - asus-CM1400CXA-dalboz hosted on Collabora farm. 1x Move VA-API tests to the dalboz (more resources). One timeout dropped. 9x Run VKCTS on dalboz. Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21702>	2023-03-11 14:48:20 +00:00
Sil Vilerino	3067bda0f3	d3d12: Fix video decode for interlaced streams with reference only textures required Fixes: `d8206f6286` ("d3d12: Add video decode implementation of pipe_video_codec") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21832>	2023-03-11 14:31:32 +00:00
Alyssa Rosenzweig	b768a254f7	agx: Use nir_lower_mem_access_bit_sizes Lowers away 64-bit loads, which we'll create in the sysval lowering for dynamically indexed UBOs/VBOs. The lowering generates pack_64_2x32 instructions, so lower those too. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21674>	2023-03-11 14:15:50 +00:00
Alyssa Rosenzweig	8a53050d7d	agx: Implement extract_[ui]16 Instead of lowering to bitwise ops. Yet another way of subdividing in NIR. Probably insignificant but makes it easy to check that the pass ordering from the previous pass is right. It does let us get much better codegen for unpacksnorm2x16, whatever that's worth. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21674>	2023-03-11 14:15:50 +00:00
Alyssa Rosenzweig	706815488e	agx: Fix subdivision coalescing As intended. We can't CSE with partial null destinations in the way, so we shouldn't eliminate dead destinations until after CSE has run. But we should still eliminate dead instructions to ensure CSE doesn't move things around needlessly, hurting register pressure. Noticed while debugging live range splitting. No GLES3.0 shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21674>	2023-03-11 14:15:50 +00:00
Alyssa Rosenzweig	5ea9c2e634	agx: Make partial DCE optional Our dead code elimination pass does two things: 1. delete instructions that are entirely unnecessary 2. delete unnecessary destinations of necessary instructions To deal with pass ordering issues, we sometimes want to do #1 without #2. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21674>	2023-03-11 14:15:50 +00:00
Alyssa Rosenzweig	16f8bfb042	agx: Don't set lower_pack_split We should handle nir_op_unpack_32_2x16_split_* natively, since we can generate better code with agx_subdivide (coalescing the ops away) than the bitshift lowering. That said, we do need some extra instructions for the floating point conversions. No shader-db changes (which makes sense because we're targetting the GLES3.0 shader-db, which doesn't have the packing GLSL functions). The real motivation of this change isn't optimizing some GLSL pack functions, though, it's avoiding a code regression from using NIR's memory bit size lowering in a future MR. That lowering will turn things like "load i16vec4" into "load i32vec2 + unpack_32_2x16", so we need to be able to coalesce that unpack. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21674>	2023-03-11 14:15:50 +00:00
Daniel Stone	50378f59a7	ci: Actually run Piglit on LAVA At some point in a refactoring long ago, our 'Piglit' runs on arm64 started actually being dEQP-GLES2 runs. Oh dear. Surprisingly, there are a number of expectation changes; added every fail I saw from a long overnight stress test. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21851>	2023-03-11 11:58:30 +00:00
Alyssa Rosenzweig	b190d08a8a	pan/mdg: Remove reference to removed macro This will soon be more confusing than helpful. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	cc16e7322f	panfrost: Remove MALI_POSITIVE macro Now unused. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	131845eb84	panfrost: Inline the last MALI_POSITIVE use Big shrug on this one. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	14eb964e59	panfrost: Remove FBD tag enum from XML This was a hack to avoid modelling the full data structure. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	67cbbf9417	panfrost: Use framebuffer pointer XML Rather than manipulating the raw pointers. This is cleaner. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	1a5546293c	panfrost: Add XML for framebuffer pointers We shouldn't have to open-code these. They are real data structures, model them as such in the architecture XML files. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	35985be275	panfrost: Handle fixed-point packing in GenXML Minimum/maximum LOD and LOD bias are unsigned and signed fixed point formats respectively. They are not unsigned integers. Introduce fixed-point types into our GenXML and use them in the XML, rather than packing in sidebands. This makes the XML more correct and fixes pretty-printing of texture and sampler descriptors. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	17c55e0d12	panfrost: Don't use DECODE_FIXED16 for sample position Strictly this is a signed fixed-point, anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	e0752673be	docs/panfrost: Move description of instancing Connor Abbott wrote a nice explanation of how instance divisors work on Mali. Let's add it to the driver docs instead of letting it languish in a forgotten header file. This is mostly pasted from the existing header in tree, with a few local changes applied. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	07b43d6231	panfrost: Remove some unused definitions Nowadays, formats are defined with GenXML, not the old panfrost-job.h, so most of the format #defines in panfrost-job.h are unused. That said, a few are still in use as a backdoor for compressed format queries to avoid a GenXML dependency. That's not great but cleaning that up isn't the subject of this MR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Felix DeGrood	341f1011a6	intel/perf: Hide extended metrics by default XE architecture enables many more metrics, perhaps too many for the average user. Reduce reported metrics to smaller subset, known as non-extended metrics, by default. Can re-enable extended metrics with env var INTEL_EXTENDED_METRICS=1 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21841>	2023-03-11 05:05:06 +00:00
Alyssa Rosenzweig	6b22a02f90	asahi,agx: Implement buffer textures with gnarly NIR Implement buffer textures in full generality. There are a few issues here: * OpenGL requires buffer textures support a minimum size of 65536 elements, however 1D textures in AGX are (at most) 8192 elements. * OpenGL 4.0 (and OpenGL ES) require buffer textures to support the "RGB32" texture formats. These are 3 packed channels of 32-bits each. In general, non-power-of-two texel sizes are problematic. AGX does not support any such formats and we rely on the GL frontend to lower to a padded format (RGBX) if necessary. Such a lowering cannot work for buffer textures, however, so we need to find a way to implement RGB32 buffer textures. We solve these issues in the follow way: * Use 2D texture descriptors for buffer textures, with a large fixed power-of-two size along one axis. Then large texel indices may be accessed at a small vec2 texel coordinate, and since the fixed dimension is a power-of-two, that vector may be recovered by simply shifting and masking. This effectively avoids size restriction. We do need to clamp texel indices to the buffer size to avoid faulting on OOB reads, since we may read past the end of the buffer (if the app binds a non-page-aligned offset into the buffer). * Use a general purpose memory load for RGB32 buffer textures. Lower the texture load instruction to a memory load from the buffer and some address arithmetic. There's no format conversion needed for RGB32, other than maybe filling in a format-appropriate alpha, so this is straightforward. Again, we need to clamp the texel index for robustness with OOB reads. Each of these solutions brings its own problem. * Using 2D textures instead of 1D requires physically rounding up the buffer size when packing the descriptor, so we can no longer implement textureSize() by reading off the texture descriptor like normal. * We don't know at compile-time whether a given texture load will read from an RGB32 buffer texture or not, so we need to emit code for both. In Vulkan, we can't key the shader to this property, either, since it's descriptor set state and not pipeline state. And each of these problems in turn brings its own solution: * The texture descriptor is linear, so the "compression buffer address" field is ignored by the hardware. We stash the real buffer size there so that textureSize becomes a load from the texture descriptor like usual, without requiring a sideband (which would complicate bindless textures). * If we determine a texture descriptor contains RGB32 data, then it will never be interpreted by the hardware and hence does not need to be a valid texture descriptor. So, we extend the hardware's format enum to contain a software-defined RGB32 format enum. Then, when lowering texture buffer loads, we either read it as a typed RGB32 memory load or as a texture load depending on the value of the format field in the texture descriptor. All of this is accomplished with a big NIR pass generating a pile of strange looking code. But it should be good enough in practice for this silly feature. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21672>	2023-03-11 02:26:31 +00:00
Alyssa Rosenzweig	826649ba19	asahi, agx: Implement dummy samplers In NIR, texelFetch (txf) does not use a sampler, but in AGX, it does -- even though the contents of the sampler are semantically irrelevant. Rather than requiring the state tracker to bind a sampler anyway (indicated for texture buffers with PIPE_CAP_TEXTURE_BUFFER_SAMPLER), just add a dummy sampler ourselves if txf is used and there are otherwise no samplers. This is helpful because PIPE_CAP_TEXTURE_BUFFER_SAMPLER isn't honoured by Rusticl or seemingly mesa/st's PBO code, and after implementing this dummy sampler workaround in Panfrost for Rusticl, I realized this CAP is silly and shouldn't exist in the first place. (And I regret pushing for its reinclusion.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21672>	2023-03-11 02:26:31 +00:00
Guilherme Gallo	256e7888fd	ci: Fix release build use for performance jobs This commit ensures that we are using mesa release builds in performance jobs. To achieve that, some modifications were made on top of https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21492. - Append the `BUILDTYPE` variable into the S3 artifact name (MINIO_ARTIFACT_NAME environment variable) to allow for better artifact management. - The ./artifacts directory has been added to the list of artifact directories for build-common. This ensures that the debian-release and debian-arm64-release jobs are the only ones necessary for running performance jobs. These jobs only produce artifacts via prepare-artifacts.sh when we are under performance workflow. - Make lava-submit.sh behave similar to baremetal jobs regarding MINIO_ARTIFACT_NAME variable. For example, users can now easily differentiate between mesa-arm64.tar.zstd and mesa-arm64-release.tar.zstd by looking inside the `Downloading artifacts from s3` Gitlab section. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21804>	2023-03-10 21:40:23 +00:00
José Roberto de Souza	91a129b44a	iris: Move i915 submit_batch() to i915 backend No changes in behavior intented here. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21700>	2023-03-10 20:13:56 +00:00
José Roberto de Souza	21d5034edb	iris: Add batch_check_for_reset() to kmd backend Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21700>	2023-03-10 20:13:56 +00:00
José Roberto de Souza	e0ce31d7cf	iris: Add gem_mmap() to kmd backend Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21700>	2023-03-10 20:13:56 +00:00
José Roberto de Souza	757e2dd692	intel/perf: Disable it for Xe KMD Xe still don't have support for performance metrics. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21773>	2023-03-10 19:41:14 +00:00
José Roberto de Souza	266d961fdc	iris: Don't mark protected bo as reusable The check in alloc_bo_from_cache() was skiping any try to get a bo from cache but after use a protected bo was still being put in some cache bucket and could be used for cases that don't require a protected bo. Using a protected bo in cases that don't require it can have performance implications. So here returning NULL when trying to get a cache bucket for a protected bo, this will cause bo->real.reusable to be set to false avoiding the bo to be reused. Fixes: `9402ac8023` ("iris: handle protected BO creation") Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21824>	2023-03-10 18:59:59 +00:00
Alyssa Rosenzweig	e61d6540e6	asahi: Don't allow linear depth/stencil buffers We don't have a way to tell the ZLS hardware to use linear buffers, so if a buffer could be used for depth/stencil, we have to twiddle. This isn't a problem in practice, since depth/stencil buffers can't be shared across processes or mapped directly as linear. Fixes faults in depthstencil-render-miplevels, which was picking linear for one buffer because of a STAGING bind flag. But that won't work :-) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21753>	2023-03-10 18:29:52 +00:00
Ian Romanick	0cadc3830f	nir/lower_int64: Optionally lower ufind_msb using uadd_sat v2: Fix inverted condition for applying the optimization. Noticed by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	831f9d3f61	nir/algebraic: Optimize some ifind_msb to ufind_msb On Intel platforms, the uclz lowering if ufind_msb is either one instruction better (Gfx7 and newer) or two instructions better (all older platforms) than the ifind_msb implementations. On platforms that use lower_find_msb_to_reverse, there should be no difference. All Haswell and newer Intel platforms had similar results. (Ice Lake shown) total instructions in shared programs: 19938662 -> 19938634 (<.01%) instructions in affected programs: 850 -> 822 (-3.29%) helped: 2 / HURT: 0 total cycles in shared programs: 858467067 -> 858465538 (<.01%) cycles in affected programs: 10080 -> 8551 (-15.17%) helped: 2 / HURT: 0 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	db6d1edc1b	nir: Restrict ufind_msb and ufind_msb_rev to 32- or 64-bit sources `4d802df3aa` loosened the type restrictions on these opcodes to enable support for 64-bit ballot operations. In doing so, it enabled 8-bit and 16-bit sizes as well. It's impossible to get these sizes through GLSL or SPIR-V. None of the lowering in nir_opt_algebraic can handle non-32-bit sizes. Almost no drivers can handle non-32-bit sizes. It doesn't seem possible to enforce anything other than "one bit size" or "all bit sizes" in nir_opcodes.py. The only way it seems possible to enforce this is in nir_validate. This is not ideal, but it be what it be. v2: Remove restriction on find_lsb. It is acutally possible to get this via GLSL by doing findLSB() on a lowp value. findMSB declares its parameter as highp, so that path is still impossible. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	2d6f48f6ef	nir/algebraic: Do not generate 8- or 16-bit find_msb The next commit will add validation to restrict this instruction (and others) to only 32-bit or 64-bit sources. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	2119ab7319	nir/builder: Do not generate 8- or 16-bit find_msb Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	28311f9d02	nir: intel/compiler: Move ufind_msb lowering to NIR Fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Cycles in all programs: 9098346105 -> 9098333765 (-0.0%) Cycles helped: 6 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	a4052e70ea	nir/algebraic: Only lower ufind_msb with 32-bit sources The 31-ufind_msb_rev(x) lowering only produces the correct result for 32-bit sources. ufind_msb_rev can also have 64-bit sources, and most platforms are expected to lower this to 32-bit instructions with extra logic operations. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	08ca862ef8	intel/compiler: Tighter src and dest size bounds checking for some opcodes Enforce the sizes listed in the Skylake PRM: BFREV: source types: D destination types: D CBIT: source types: UB, UW, UD destination types: UD FBH: source types: D, UD destination types: UD FBL: source types: UD destination types: UD LZD: source types: D, UD destination types: UD v2: Update BFREV commit message documentation. Suggested by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	0cc7bf63b7	nir: intel/compiler: Move ifind_msb lowering to NIR Unlike ufind_msb, ifind_msb is only defined in NIR for 32-bit values, so no @32 annotation is required. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	15c6c859cf	intel/compiler: Lower find_lsb in NIR No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	66840b98e4	nir: ifind_msb_rev can only have int32 sources Just like ifind_msb. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
antonino	29be4e9e9b	zink: fix stipple pattern in oblique lines Stipple lines now appear correctly when they are oblique. Previously the number of steps of the stipple counter between two vertices was calculated as the euclidian distance between them in screen space, however the length occupied by pixel along a line is only `1` for lines that are either vertical or horizontal and will be anywhere between `1` and `sqrt(2)` for other cases. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21290>	2023-03-10 14:52:01 +00:00
Alyssa Rosenzweig	ee6785309e	agx: Handle indirect texture/samplers Get the texture/sampler index from the texture/sampler_offset source (which is an offset from 0 thanks to the lower_index_to_offset lowering) and feed it in as corresponding 16-bit texture instruction sources. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21704>	2023-03-10 14:14:42 +00:00
Alyssa Rosenzweig	e12bf97153	agx: Pack indirect texture/sampler handles For indirect indexing into the binding table. Note this does not handle packing the bindless forms, since that's a bit more involved. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21704>	2023-03-10 14:14:42 +00:00
Mike Blumenkrantz	e5b29e6735	Revert "Revert "ci: disable mesa-swrast runner jobs"" This reverts commit `7ae0d9d2e8`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21836>	2023-03-10 12:37:56 +00:00
Eric Engestrom	e29772f134	v3dv: split out broadcom_shader_stage_to_gl() calls to improve readability This is an inline function with a compile-constant switch, so I expect the compiler wouldn't produce any better code like this, but for humans it's easier to read when function calls are not embedded into other function calls. Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21835>	2023-03-10 10:38:43 +00:00
Eric Engestrom	f5d3d1e7ed	meson: inline gtest_test_protocol now that it's always 'gtest' Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21485>	2023-03-10 07:20:29 +00:00
Sagar Ghuge	9a34b2ab0e	intel/compiler: Add swsb_stall debug option When enabled, on gfx12 plus, we will add the sync nop instruction after each instruction to make sure that current instruction depends on the previous instruction explicitly. This option will help us to get a hint if something is missing or broken in software scoreboard pass. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21797>	2023-03-10 06:55:39 +00:00
Alyssa Rosenzweig	cdf63e6dce	agx: Fix clang-formatting Not sure how this one slipped in. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21828>	2023-03-10 06:33:01 +00:00
Emma Anholt	7ae0d9d2e8	Revert "ci: disable mesa-swrast runner jobs" This reverts commit `aef0f3efdf`. We've got a new set of runners now (mesa-swrast-4, 5, and 7 because counting is hard) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21826>	2023-03-10 04:04:59 +00:00
Lionel Landwerlin	5aec829f97	iris: trace frames with u_trace Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21648>	2023-03-10 00:36:41 +00:00
Kenneth Graunke	dfe652fb03	intel/eu: Simplify brw_F32TO16 and brw_F16TO32 Now that we aren't using them on Gfx8+ we can drop a lot of cruft. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21783>	2023-03-09 23:26:17 +00:00
Kenneth Graunke	c590a3eadf	intel/fs: Move packHalf2x16 handling to lower_pack() This mainly lets the software scoreboarding pass correctly mark the instructions, without needing to resort to fragile manual handling in the generator. We can also make small improvements. On Gfx 8LP-12.0, we no longer have the restrictions about DWord alignment, so we can simply write each half into its intended location, rather than writing it to the low DWord and then shifting it in place. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21783>	2023-03-09 23:26:17 +00:00
Kenneth Graunke	f5e5705c91	intel/fs: Use F32TO16/F16TO32 helpers in fquantize16 handling I originally thought that we were intentionally emitting the legacy opcodes here to make them opaque to the optimizer, so that it wouldn't eliminate the explicit type conversions, as they're actually required to do the quantization. But...we don't actually optimize those away currently anyway. So...go ahead and use the helpers for consistency. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21783>	2023-03-09 23:26:17 +00:00
Kenneth Graunke	44c6ccb197	Revert "intel/fs: Fix inferred_sync_pipe for F16TO32 opcodes" With the previous patch, we no longer need to special case this, as we emit a MOV with an HF source, rather than F16TO32 with an UW source, on all platforms that need scoreboarding. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21783>	2023-03-09 23:26:17 +00:00
Kenneth Graunke	309ec3725a	intel/fs: Use new F16TO32 helpers for unpack_half_split_* opcodes This gets us a MOV at the IR level on Gfx8+ which should be more optimizable than F16TO32. It also removes confusion about which pipe which the instruction will run on. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21783>	2023-03-09 23:26:17 +00:00
Kenneth Graunke	78bf53904e	intel/fs: Delete a TODO about using brw_F32TO16. We can just use the new builder helpers to get the optimization advantages of a MOV on Gfx8+ while also getting the necessary F32TO16 on Gfx7.x and yet not worry too hard about it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21783>	2023-03-09 23:26:17 +00:00
Kenneth Graunke	966995d911	intel/fs: Add builder helpers for F32TO16/F16TO32 that work on Gfx7.x These take care of emitting the F32TO16/F16TO32 instructions on Gfx7.x but otherwise just emit a type converting MOV on Gfx8+. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21783>	2023-03-09 23:26:17 +00:00
Kenneth Graunke	3864049184	intel/fs: Fix inferred_sync_pipe for F16TO32 opcodes For converting half-float to float, we currently emit BRW_OPCODE_F16TO32 with a UW source, to match legacy Gfx7 behavior. In the generator, this becomes a MOV with a HF source on Gfx8+. Unfortunately, this UW source confuses the scoreboarding pass into thinking it's an integer source, leading to incorrect SWSB annotations on Alchemist. We should ultimately fix the IR to stop being so...legacy...here, but this is the simplest fix for stable branches. Fixes misrendering in Elden Ring and likely Sekiro: Shadows Die Twice. Cc: mesa-stable Tested-by: Chuansheng Liu <chuansheng.liu@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> References: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8018 References: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8375 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21783>	2023-03-09 23:26:17 +00:00
Mark Janes	4978db6b9e	intel: use generated workaround helpers for Wa_1409600907 Wa_1409600907 was enabled for gen12+. It should not be applied for platforms after gen12.0. Use generated helpers to ensure application to all relevant platforms. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21743>	2023-03-09 22:56:51 +00:00
Chia-I Wu	5691b10b0f	radv: set RADEON_FLAG_GTT_WC for external mem on vram We used to set RADEON_FLAG_GTT_WC when wsi_info is set. This changes it to set the flag for any external mem on vram, extending the logic for apps using external memory directly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21803>	2023-03-09 22:21:09 +00:00
Daniel Stone	ae893089e9	ci/radv: Lower stoney CTS load CTS runs on stoney are currently taking ~20min to complete, which seems to have begun with the upgrade to CTS 1.3.5.0. This is a bit too long in and of itself, but it means that - assuming zero contention - a job that has to be retried because the machine hung can take 40 minutes. Aim to drop this to 15min turnaround by lowering the overall fraction from 1/8th of the CTS to 1/11th. As the jobs we run have been reshuffled, this adds a lot more expected fails. As most of them categorise easily into patterns, group the failures together in the file. Non-strict wide lines has passed since we last ran it; the other failures all group into existing classes seen for a long time. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21791>	2023-03-09 19:34:58 +00:00
Daniel Stone	f07c69d8b6	ci/zink: Add flake seen in the wild Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21820>	2023-03-09 19:15:13 +00:00
David Heidelberg	aef0f3efdf	ci: disable mesa-swrast runner jobs Temporarily. Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21815>	2023-03-09 18:01:09 +00:00
Daniel Stone	6f1aa8cfc1	ci/fdno: Add a618 Vulkan flakes It looks like descriptors are generically a bit broken, which takes out a massive number of tests periodically. The pipeline-library tests also have some unknowable issues. cf. #8219 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21740>	2023-03-09 14:47:57 +00:00
Georg Lehmann	13ff4a5f64	aco: use bitfield_array for temporary neg/abs/opsel Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21766>	2023-03-09 14:15:14 +00:00
Georg Lehmann	d0eebb0e8b	aco: access neg/abs as int in usesModifiers Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21766>	2023-03-09 14:15:14 +00:00
Georg Lehmann	828aff2a2d	aco: use array indexing for opsel/opsel_lo/opsel_hi Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21766>	2023-03-09 14:15:13 +00:00
Georg Lehmann	a47c3f84fb	aco: use integer access for neg_lo/neg_hi Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21766>	2023-03-09 14:15:13 +00:00
Georg Lehmann	60cd3ba39f	aco: copy abs/neg with assignment Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21766>	2023-03-09 14:15:13 +00:00
Tapani Pälli	5fdbc4a23e	intel/isl: disable TILE64 for YCRCB formats Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21723>	2023-03-09 13:50:39 +00:00
Daniel Stone	fad9c69e42	ci/radv: Drop raven quick_shader load It currently takes ~21 minutes to complete. That's not quick. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21793>	2023-03-09 09:56:31 +00:00
Eric Engestrom	a19739f1b0	v3dv/ci: add a test to the known failures New test since the 1.3.5 update, and running it on older mesa it would have always failed, so it's not a regression -> let's just mark it as a known failure Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21792>	2023-03-09 09:19:21 +00:00
Emma Anholt	ec513270e3	zink: Pass the cmdbuf to the end of the marker, too. Otherwise the end wanders off to some unrelated cmdbuf. Fixes: `271ebdd735` ("zink: pass cmdbuf to debug marker begin") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21799>	2023-03-09 06:53:37 +00:00
David Heidelberg	11a4e10fe2	ci/zink: fixup the zink-lvp job Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8501 Fixes: `4cc0cec473` ("ci: implement unified sections") Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21796>	2023-03-09 06:27:39 +00:00
antonino	27c8d6ca7b	drirc: set `zink_emulate_point_smooth` for Quake II Quake II uses GL_POINT_SMOOTH to render particles. Zink currently requires `zink_emulate_point_smooth` to support that feature. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21731>	2023-03-09 04:38:24 +00:00
antonino	ffe36abf7c	zink: handle point_smooth emulation Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21731>	2023-03-09 04:38:24 +00:00
antonino	3a59b2a670	nir: handle output beeing written to deref in `nir_lower_point_smooth` Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21731>	2023-03-09 04:38:24 +00:00
antonino	4b07182c8c	zink/nir_to_spirv: add support for `nir_intrinsic_load_point_coord` Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21731>	2023-03-09 04:38:24 +00:00
antonino	e121b6d9eb	zink: add `lower_point_smooth` to `zink_fs_key` Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21731>	2023-03-09 04:38:24 +00:00
antonino	c32a5b8d04	zink: add `zink_emulate_point_smooth` driconf Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21731>	2023-03-09 04:38:24 +00:00
antonino	e280d6a7c9	zink: fix line smooth lowering Fixes: `80285db9ef` ("zink: lower smooth-lines if not supported") Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21731>	2023-03-09 04:38:24 +00:00
Eric Engestrom	c28f144c81	osmesa: add exported symbols check Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/1308>	2023-03-09 02:55:49 +00:00
Emma Anholt	8b75b72613	anv+hasvk: Use driconf to disable 16-bit for zink. The HW can technically execute 16-bit operations, but the restrictions on 16-bit ALU ops are so great that it ends up not being a win for GLES-on-Vulkan to lower mediump to 16-bit operations, at least with the current state of the Intel compiler. This brings zink-on-anv in line with iris and angle-on-anv for mediump behavior (ANGLE uses RelaxedPrecision, which we ignore). Perf on some angle traces on my brya (ADL) and i9-9900K (CFL): ADL zink pubg_mobile_battle_royale: +13.4574% +/- 5.2046% (n=5) CFL zink pubg_mobile_battle_royale: +29.5332% +/- 0.646585% (n=6) ADL zink aztec_ruins_high: +5.78027% +/- 4.80645% (n=4) CFL zink aztec_ruins_high: -1.10641% +/- 0.140562% (n=12) ADL zink trex_200: +5.86956% +/- 2.09633% (n=10) CFL zink trex_200: +9.72136% +/- 0.749261% (n=10) Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21775>	2023-03-09 02:27:01 +00:00
Daniel Stone	daa1468b54	intel/isl: Don't scream FINISHME into logs for 3D vs. CCS This would probably be a nice optimisation to have, but it really does make the CTS logs awful: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/37692447 Just demote this isl_finishme() to a comment; given it's been unfinished since 2019, we can probably live without it. Fixes: `126c9562d9` ("isl: Redefine the CCS layout for Gen12") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21788>	2023-03-09 01:36:54 +00:00
Daniel Stone	df7b40d002	ci/anv: Temporarily halve TGL testing load Our TGL machines are currently slightly oversubscribed (max. 17 jobs in a pipeline on 15 DUTs). They're also currently suffering from thermally-induced GPU throttling (being investigated), and a thundering-herd network load effect: as all 15 jobs start at once, we end up saturating one of our network links. The combination of all three of these things means that TGL is often our long pole in CI runs. Until we can ameliorate the two issues constraining throughput (and a third where an unreliable hardware UART sometimes kills jobs when it shouldn't), halve the workload so we at least have some breathing room to absorb them. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21790>	2023-03-09 01:07:36 +00:00
Lionel Landwerlin	b801724352	util: allow align64() to do alignments >= 4Gb Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21757>	2023-03-08 23:32:37 +00:00
Lionel Landwerlin	9a058f6b4c	radv: use 1ull for alignment computations Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21757>	2023-03-08 23:32:37 +00:00
Lionel Landwerlin	11bc2bde83	anv: force MEDIA_INTERFACE_DESCRIPTOR_LOAD reemit after 3D->GPGPU switch Seems to fix a hang in the following titles : - Age of Empire 4 - Monster Hunter Rise where the HW is hung on a PIPE_CONTROL after a GPGPU_WALKER but no MEDIA_INTERFACE_DESCRIPTOR_LOAD was emitted since the switch from 3D to GPGPU. This would happen in the following case : vkCmdBindPipeline(COMPUTE, cs_pipeline); vkCmdDispatch(...); vkCmdBindPipeline(GRAPHICS, gfx_pipeline); vkCmdDraw(...); vkCmdDispatch(...); Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17247>	2023-03-08 23:09:36 +00:00
Konstantin Seurer	d17bf881ea	radv/rt: Fix updating stack_size if the shader uses scratch src_vars contains the stack_size of the shader that is about to get inlined. Fixes: `7fadee9b70` ('radv/rt: only reserve stack_sizes after rt_case insertion') Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21768>	2023-03-08 22:17:00 +00:00
Daniel Stone	3af675dfc1	ci/radv: Skip vkCreateInstance memory-fail test This has been failing a bit ever since CTS 1.3.5.0. Skip it for now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21789>	2023-03-08 21:35:27 +00:00
Georg Lehmann	0614c2e8bd	aco: don't reallocate fma{mk,ak,_mix} instruction Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21762>	2023-03-08 18:42:21 +00:00
Georg Lehmann	a4873071e6	aco/optimizer: don't reallocate instruction when converting to VOP3 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21762>	2023-03-08 18:42:21 +00:00
Mike Blumenkrantz	7413ce7e0d	lavapipe: break out main shader lowering into separate function Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21778>	2023-03-08 18:25:01 +00:00
Mike Blumenkrantz	f2765cd6d6	lavapipe: move uniform inline functions to shader struct Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21778>	2023-03-08 18:25:01 +00:00
Mike Blumenkrantz	7718d7f31a	lavapipe: rename inline uniform function params Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21778>	2023-03-08 18:25:01 +00:00
Mike Blumenkrantz	990fa82c61	lavapipe: move xfb init to shader struct Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21778>	2023-03-08 18:25:01 +00:00
Mike Blumenkrantz	b221f6c128	lavapipe: more small shader struct usage tweaks Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21778>	2023-03-08 18:25:01 +00:00
Mike Blumenkrantz	a0c9609e59	lavapipe: pass shader struct and layout to scan_pipeline_info() Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21778>	2023-03-08 18:25:01 +00:00
Mike Blumenkrantz	6e5fe71599	lavapipe: split out shader struct members into their own struct kinda gross but simplifies some code Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21778>	2023-03-08 18:25:01 +00:00
Mike Blumenkrantz	2af3476639	lavapipe: split out spirv compile of shaders Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21778>	2023-03-08 18:25:01 +00:00
Mike Blumenkrantz	bf1b4ed54e	vulkan/wsi: fix crash in failed swapchain creation for wayland this otherwise calls wsi_wl_swapchain_chain_free() before the wsi pointer has been set ref #6578 cc: mesa-stable Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21563>	2023-03-08 17:33:00 +00:00
Daniel Schürmann	41ae2d0725	radv/rt: use terminate() when returning from raygen shaders Q2RTX stats: Totals from 7 (0.01% of 134913) affected shaders: CodeSize: 204712 -> 204744 (+0.02%); split: -0.06%, +0.07% Instrs: 37526 -> 37522 (-0.01%); split: -0.07%, +0.06% Latency: 950563 -> 956024 (+0.57%) InvThroughput: 187915 -> 188977 (+0.57%) Copies: 4829 -> 4763 (-1.37%) Branches: 1570 -> 1583 (+0.83%) PreSGPRs: 407 -> 400 (-1.72%) PreVGPRs: 614 -> 617 (+0.49%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21736>	2023-03-08 16:59:41 +00:00
Daniel Schürmann	cd1e5b1858	aco: fix NIR infinite loops The previous solution breaks potential loop header phis. Move the dummy-break to the bottom of the loop. Fixes: dEQP-VK.reconvergence.subgroup_uniform_control_flow_ballot.* Fixes: `a9c4a31d8d` ('aco: handle NIR loops without breaks') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21736>	2023-03-08 16:59:41 +00:00
Daniel Schürmann	3073810397	nir/gather_info: allow terminate() in non-PS RADV will use terminate() to end ray-tracing shaders. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21736>	2023-03-08 16:59:41 +00:00
Samuel Pitoiset	842b8f14f4	radv: move device memory related code to radv_device_memory.c radv_device.c is getting too big. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21767>	2023-03-08 16:21:10 +00:00
Samuel Pitoiset	4316a64e27	radv: move buffer related code to radv_buffer.c radv_device.c is getting too big and this follows the Vulkan common runtime infrastructure. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21767>	2023-03-08 16:21:10 +00:00
Samuel Pitoiset	17c5a91028	radv: move event related code to radv_event.c radv_device.c is getting too big. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21767>	2023-03-08 16:21:10 +00:00
Samuel Pitoiset	4de305cb8a	radv: move sampler related code to radv_sampler.c radv_device.c is getting too big and this follows the Vulkan common runtime infrastructure. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21767>	2023-03-08 16:21:10 +00:00
Samuel Pitoiset	7a157b3a4c	radv: move queue related code to radv_queue.c radv_device.c is getting too big and this follows the Vulkan common runtime infrastructure. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21767>	2023-03-08 16:21:10 +00:00
Samuel Pitoiset	4e5db63482	radv: move physical device related code to radv_physical_device.c radv_device.c is getting too big and this follows the Vulkan common runtime infrastructure. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21767>	2023-03-08 16:21:10 +00:00
Samuel Pitoiset	06fa90e14e	radv: move instance related code to radv_instance.c radv_device.c is getting too big and this follows the Vulkan common runtime infrastructure. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21767>	2023-03-08 16:21:10 +00:00
Rhys Perry	98cb7e0108	nir: add nir_lower_alu_width_test.fdot_order Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20812>	2023-03-08 14:38:26 +00:00
Rhys Perry	50f7e21481	nir: make fdph lowering match fdot Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20812>	2023-03-08 14:38:26 +00:00
Rhys Perry	3668da7c83	nir: use xyzw order for precise fdot Fixes flickering grass in Immortals Fenyx Rising. fossil-db (gfx1100): Totals from 13969 (10.38% of 134574) affected shaders: MaxWaves: 442794 -> 442878 (+0.02%) Instrs: 4861105 -> 4901408 (+0.83%); split: -0.02%, +0.85% CodeSize: 24316100 -> 24396272 (+0.33%); split: -0.03%, +0.35% VGPRs: 446256 -> 445572 (-0.15%); split: -0.20%, +0.05% Latency: 28122456 -> 28162233 (+0.14%); split: -0.10%, +0.24% InvThroughput: 2899673 -> 2904323 (+0.16%); split: -0.07%, +0.23% VClause: 119599 -> 119631 (+0.03%); split: -0.07%, +0.09% SClause: 186636 -> 186265 (-0.20%); split: -0.23%, +0.03% Copies: 301370 -> 300386 (-0.33%); split: -0.75%, +0.42% Branches: 85066 -> 85047 (-0.02%); split: -0.02%, +0.00% PreSGPRs: 436167 -> 436137 (-0.01%) PreVGPRs: 329715 -> 329809 (+0.03%); split: -0.01%, +0.04% fossil-db (gfx1100, RADV_DEBUG=invariantgeom): Totals from 43116 (32.04% of 134574) affected shaders: MaxWaves: 1332938 -> 1333012 (+0.01%); split: +0.01%, -0.00% Instrs: 16424513 -> 16658021 (+1.42%); split: -0.06%, +1.48% CodeSize: 81258868 -> 81827860 (+0.70%); split: -0.07%, +0.77% VGPRs: 1720368 -> 1719648 (-0.04%); split: -0.19%, +0.15% SpillSGPRs: 1670 -> 1600 (-4.19%); split: -5.27%, +1.08% Latency: 82063766 -> 82425418 (+0.44%); split: -0.23%, +0.67% InvThroughput: 9665803 -> 9727810 (+0.64%); split: -0.09%, +0.73% VClause: 449662 -> 451099 (+0.32%); split: -0.32%, +0.64% SClause: 498841 -> 498639 (-0.04%); split: -0.24%, +0.20% Copies: 1001020 -> 1000770 (-0.02%); split: -1.20%, +1.17% Branches: 237580 -> 239637 (+0.87%); split: -0.01%, +0.88% PreSGPRs: 1198167 -> 1198024 (-0.01%); split: -0.01%, +0.00% PreVGPRs: 1225202 -> 1225035 (-0.01%); split: -0.06%, +0.05% fossil-db (navi10): Totals from 13969 (10.38% of 134563) affected shaders: MaxWaves: 474386 -> 474508 (+0.03%); split: +0.05%, -0.03% Instrs: 3740895 -> 3771566 (+0.82%); split: -0.00%, +0.82% CodeSize: 19426592 -> 19459916 (+0.17%); split: -0.00%, +0.18% VGPRs: 389916 -> 389852 (-0.02%); split: -0.09%, +0.07% Latency: 25452927 -> 25502482 (+0.19%); split: -0.14%, +0.34% InvThroughput: 3880807 -> 3923144 (+1.09%); split: -0.07%, +1.16% VClause: 66835 -> 66712 (-0.18%); split: -0.38%, +0.20% SClause: 178805 -> 178802 (-0.00%); split: -0.01%, +0.01% Copies: 167601 -> 167625 (+0.01%); split: -0.54%, +0.56% Branches: 83788 -> 83784 (-0.00%) PreSGPRs: 388229 -> 388216 (-0.00%) PreVGPRs: 342984 -> 343062 (+0.02%); split: -0.01%, +0.03% fossil-db (navi10, RADV_DEBUG=invariantgeom): Totals from 43116 (32.04% of 134563) affected shaders: MaxWaves: 1260184 -> 1256414 (-0.30%); split: +0.10%, -0.40% Instrs: 12804951 -> 12983628 (+1.40%); split: -0.01%, +1.41% CodeSize: 65813224 -> 66137852 (+0.49%); split: -0.03%, +0.52% VGPRs: 1556396 -> 1561340 (+0.32%); split: -0.09%, +0.41% SpillSGPRs: 1377 -> 1395 (+1.31%) Latency: 76095867 -> 76355111 (+0.34%); split: -0.32%, +0.66% InvThroughput: 13546863 -> 13788789 (+1.79%); split: -0.05%, +1.84% VClause: 310910 -> 311283 (+0.12%); split: -0.63%, +0.75% SClause: 474878 -> 474941 (+0.01%); split: -0.09%, +0.10% Copies: 639367 -> 637610 (-0.27%); split: -1.03%, +0.76% Branches: 240178 -> 240185 (+0.00%); split: -0.00%, +0.00% PreSGPRs: 1056594 -> 1056590 (-0.00%); split: -0.00%, +0.00% PreVGPRs: 1247950 -> 1247798 (-0.01%); split: -0.05%, +0.04% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7920 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20812>	2023-03-08 14:38:26 +00:00
Mike Blumenkrantz	6ee5337d94	aux/tc: fix rp info resizing clobbering current info the recording rp_info may be a pointer to a member of the array being reallocated, so test for this and re-set it to avoid invalid memory access found with this caselist: KHR-GL46.texture_gather.offset-gather-unorm-2darray KHR-GL46.texture_view.view_sampling cc: mesa-stable Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21729>	2023-03-08 14:10:01 +00:00
Lionel Landwerlin	10057d19f2	anv: report max register pressure in pipeline properties Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21756>	2023-03-08 13:37:07 +00:00
Lionel Landwerlin	09cdb77a92	intel/fs: report max register pressure in shader stats Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21756>	2023-03-08 13:37:07 +00:00
Lionel Landwerlin	8dd960e056	anv/iris: report counter symbols with debug option v2: rename to INTEL_DEBUG=perf-symbol-names Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17672>	2023-03-08 12:45:43 +00:00
Samuel Pitoiset	e6735409ee	radv: disable DCC with signedness reinterpretation on GFX11 All formats should be compatible on GFX11 but for some weird reasons DCC with signedness reinterpretation doesn't work as expected, like R8_UINT<->R8_SINT. Note that RadeonSI also has issues with this. This might be a hardware bug on RDNA3. This fixes DCC issues with Cyberpunk and A Plague Tale: Requiem. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8020 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8371 Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21755>	2023-03-08 11:53:25 +00:00
Pierre-Eric Pelloux-Prayer	79ab787a8f	radeonsi: fix fast depth_clear_value/stencil_clear_value We need to update the when promoting from non-TC-compatible to TC-compatible or we'll get incorrect values in the buffer. Fixes: `9defe8aca9` ("radeonsi: implement fast Z/S clears using clear_buffer on HTILE") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8418 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21726>	2023-03-08 10:56:21 +00:00
Pierre-Eric Pelloux-Prayer	b75acbf88f	radeonsi: don't use PKT3_SET_SH_REG_INDEX on gfx9 and older Fixes: `ccaaf8fe04` ("amd: massively simplify how info->spi_cu_en is applied") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8464 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21726>	2023-03-08 10:56:21 +00:00
Pierre-Eric Pelloux-Prayer	49913fa418	radeonsi/test: update test results Depends on https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/779 to fix glx-make-current GLX errors. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21726>	2023-03-08 10:56:21 +00:00
Pierre-Eric Pelloux-Prayer	9eb05801ad	radeonsi/test: use gbm-skips.txt Use shared skips file to avoid running tests that can't pass on gbm. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21726>	2023-03-08 10:56:21 +00:00
Samuel Pitoiset	f88dbb27d4	radv: enable VK_KHR_fragment_shading_rate on GFX11 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20333>	2023-03-08 10:30:48 +00:00
Samuel Pitoiset	1fb8e0eff2	radv: advertise attachmentFragmentShadingRate on GFX11 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20333>	2023-03-08 10:30:48 +00:00
Samuel Pitoiset	d1e724b952	radv: do not emit PA_SC_VRS_OVERRIDE_CNTL from the pipeline on GFX11 PA_SC_VRS_OVERRIDE_CNTL is emitted when a framebuffer is bound because it controls the VRS surface enable bit. Though, if a pipeline is bound after the framebuffer is emitted, it can override the state. Remove it completely since VRS for flat shading and RADV_FORCE_VRS are disabled. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20333>	2023-03-08 10:30:48 +00:00
Samuel Pitoiset	c186420b26	radv: add support for VRS attachment on GFX11 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20333>	2023-03-08 10:30:48 +00:00
Samuel Pitoiset	31d699106d	ac/surface: add RADEON_SURF_VRS_RATE for selecting swizzle mode on GFX11 On GFX11, VRS rate images can't use linear tiling and the swizzle mode must be either SW_Z or SW_R. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20333>	2023-03-08 10:30:48 +00:00
Samuel Pitoiset	ce4a1b1c3c	radv: move disabling DCC for VRS rate images in radv_get_surface_flags() On GFX11, the VRS rate image needs a specific swizzle mode and a new flag will be added here. gned-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20333>	2023-03-08 10:30:48 +00:00
Lionel Landwerlin	e8793f2a86	anv: enable VK_EXT_pipeline_library_group_handles A noop for us. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20929>	2023-03-08 08:59:52 +00:00
Marek Olšák	461ccb00e1	radeonsi: increase NGG workgroup size to 256 for VS/TES with streamout and GS NGG streamout performance is limited by the workgroup size, so make it as large as possible. Since this uses si_get_max_workgroup_size() to set the NGG workgroup size, the side effect is that all GS is also getting an increase to 256, which is OK. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Marek Olšák	43fd552872	radeonsi: allow using 64K LDS for NGG to allow larger workgroups This should help with NGG streamout performance, which is limited by the workgroup size (it should be as large as possible). Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Marek Olšák	e01d505291	radeonsi: other cosmetic changes in si_state_shaders.cpp VS_W32_EN has no effect on Gfx11, but we better not set it. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Marek Olšák	ef965d5681	radeonsi: reorganize si_shader_ps To make branching based on gfx_level nicer and the code in a logical order. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Marek Olšák	c9d297fc77	radeonsi: reorganize si_shader_ngg To make branching based on gfx_level nicer and the code in a logical order. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Marek Olšák	1664aad43c	radeonsi: reorganize si_shader_hs To make branching based on gfx_level nicer and the code in a logical order. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Marek Olšák	b3459eae7a	radeonsi: reindent si_shader_ls, si_shader_es, si_shader_gs, si_shader_vs Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Marek Olšák	7e0ed2c4f0	radeonsi: set pm4.atom.emit in si_get_shader_pm4_state except gfx10_shader_ngg, which isn't as trivial Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Marek Olšák	4c1475fc1c	radeonsi: lower nir_texop_sampler_descriptor_amd Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Marek Olšák	1417ced72c	radeonsi: separate nir_texop_descriptor_amd lowering This moves the code to a separate branch to make it less intertwined with the rest to allow sampler descriptor lowering later. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Marek Olšák	54ebd90739	radeonsi: merge si_emit_initial_compute_regs with si_init_cs_preamble_state It's better to set all immutable registers in one place. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Marek Olšák	ddded6fbb5	radeonsi: emulate VGT_ESGS_RING_ITEMSIZE in the shader on gfx9-11 The hardware uses the register to premultiply GS vertex indices in input VGPRs. This changes the behavior as follows: - VGT_ESGS_RING_ITEMSIZE is always 1 on gfx9-11, set in the preamble. - The value is passed to the shader via current_gs_state (vs_state_bits). - The shader does the multiplication. The reason is that VGT_ESGS_RING_ITEMSIZE will be removed in the future. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Timur Kristóf	fb819fdb13	ac/nir: clear nir_var_shader_out from TCS barriers Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21403>	2023-03-08 07:29:09 +00:00
Timur Kristóf	87de5b2b9e	aco: Don't include headers from radv. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21696>	2023-03-08 04:39:18 +00:00
Timur Kristóf	a0141c6308	aco, radv: Don't use radv_shader_args in aco. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21696>	2023-03-08 04:39:18 +00:00
Timur Kristóf	e9793331db	aco, radv: Move PS epilog and VS prolog args to their info structs. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21696>	2023-03-08 04:39:18 +00:00
Timur Kristóf	84a2cea596	aco, radv: Rename aco__key to aco__info. The naming of aco__key didn't make sense because they were never actually used as cache keys, only radv__key are used as cache keys. Rename the aco structs to aco_*_info instead. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21696>	2023-03-08 04:39:18 +00:00
Qiang Yu	91e68db0e1	aco, radv: Move is_trap_handler_shader to aco info. v2 by Timur Kristóf: - Rebase this patch on latest main. Signed-off-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21696>	2023-03-08 04:39:18 +00:00
Qiang Yu	978220c99a	aco, radv: Add load_grid_size_from_user_sgpr to aco options. v2 by Timur Kristóf: - Rebase this patch. Signed-off-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21696>	2023-03-08 04:39:18 +00:00
Timur Kristóf	3058ab6090	aco: Generalize vs_inputs to args_pending_vmem. Handle arguments that need a waitcnt without relying on RADV specific VS input information. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21696>	2023-03-08 04:39:18 +00:00
Timur Kristóf	1583bea9da	radv: Set pending_vmem on dynamic VS input args. These are loaded from VMEM and need a waitcnt before use. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21696>	2023-03-08 04:39:18 +00:00
Timur Kristóf	1a7b5979df	ac: Add pending_vmem field to args. This is to indicate when an argument was loaded from VMEM and needs a waitcnt before it can be used. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21696>	2023-03-08 04:39:18 +00:00
Rob Clark	d5376c3feb	freedreno: Promote non-drawing batches to sysmem Sometimes we can end up with a sequence where we need to flush a batch with no clears and no draws (for ex, to get a fence). Promote these to sysmem. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21747>	2023-03-08 04:10:45 +00:00
Mike Blumenkrantz	aaed609e57	zink: hook up buffer TRANSFER_DST barrier optimizing this should massively optimize e.g., incremental index buffer overwrites ref #8358 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21779>	2023-03-08 03:50:33 +00:00
Mike Blumenkrantz	fe469a7618	zink: add a driver workaround to disable copy box optimizations turnip is nonconformant regarding cache access (see noted issue), meaning that any attempt to omit barriers breaks things qcom proprietary may also be affected Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21779>	2023-03-08 03:50:33 +00:00
Mike Blumenkrantz	46f98da188	zink: add a mechanism to trigger copy box resets from batch state reset the resource isn't available during batch state reset, so a new flag is needed to force a reset the next time the copy boxes would be used Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21779>	2023-03-08 03:50:33 +00:00
Mike Blumenkrantz	aaca91eb79	zink: add a mechanism for managing TRANSFER_DST buffer barriers this enables successive or unrelated transfer writes to avoid triggering barriers, and ensuing reads of those writes should trigger their own barriers Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21779>	2023-03-08 03:50:33 +00:00
Mike Blumenkrantz	54f3c589d5	zink: track the last write access for resources this enables some optimization Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21779>	2023-03-08 03:50:33 +00:00
SureshGuttula	30a89323ad	radeonsi: Add support for DPB resize This patch will add support for dpb resize when low to high resolution change/ svc use-cases. With DPB tier1 type,vp9 svc decoder use cases are failed. This Change will fix this[VCN1/VCN2]. Signed-off-by: SureshGuttula <suresh.guttula@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21548>	2023-03-08 02:19:58 +00:00

... 4 5 6 7 8 ...

155849 Commits