mirror of
https://gitlab.freedesktop.org/mesa/mesa.git
synced 2024-11-27 12:14:10 +08:00
5613 lines
264 KiB
ReStructuredText
5613 lines
264 KiB
ReStructuredText
Mesa 23.2.1 Release Notes / 2023-09-28
|
||
======================================
|
||
|
||
Mesa 23.2.1 is a new development release. People who are concerned
|
||
with stability and reliability should stick with a previous release or
|
||
wait for Mesa 23.2.2.
|
||
|
||
Mesa 23.2.1 is an unusual first stable release due to the accidentl tagging of
|
||
23.2.0 durring the rc cycle.
|
||
|
||
Mesa 23.2.1 implements the OpenGL 4.6 API, but the version reported by
|
||
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
|
||
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
|
||
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
|
||
4.6 is **only** available if requested at context creation.
|
||
Compatibility contexts may report a lower version depending on each driver.
|
||
|
||
Mesa 23.2.1 implements the Vulkan 1.3 API, but the version reported by
|
||
the apiVersion property of the VkPhysicalDeviceProperties struct
|
||
depends on the particular driver being used.
|
||
|
||
SHA256 checksum
|
||
---------------
|
||
|
||
::
|
||
|
||
64de0616fc2d801f929ab1ac2a4f16b3e2783c4309a724c8a259b20df8bbc1cc mesa-23.2.1.tar.xz
|
||
|
||
|
||
|
||
New features
|
||
------------
|
||
|
||
- VK_EXT_attachment_feedback_loop_dynamic_state on RADV
|
||
|
||
- extendedDynamicState3SampleLocationsEnable on RADV
|
||
|
||
- VK_EXT_dynamic_rendering_unused_attachments on RADV
|
||
|
||
- VK_EXT_mesh_shader on lavapipe
|
||
|
||
- OpenGL 3.1 on Asahi
|
||
|
||
- OpenGL ES 3.0 on Asahi
|
||
|
||
- VK_KHR_fragment_shader_barycentric on RADV/GFX10.3+
|
||
|
||
- VK_KHR_ray_tracing_pipeline on RADV/GFX10.3+
|
||
|
||
- VK_EXT_depth_bias_control on RADV
|
||
|
||
- VK_EXT_fragment_shader_interlock on RADV/GFX9+
|
||
|
||
- VK_EXT_pipeline_robustness on RADV
|
||
|
||
|
||
|
||
Bug fixes
|
||
---------
|
||
|
||
- intel: State cache invalidation after BLORP binding table setup ought to be unnecessary on ICL.
|
||
- RadeonSI: glClear() causes clear texture for some frames on RX580
|
||
- shader_test causing a crash in compiler
|
||
- Crash in st_ReadPixels
|
||
- [ANV] [DG2/A770] The Spirit and The Mouse, miscellaneous issues with Mesa Git
|
||
- Penumbra: Overture hangs on new game loading screen
|
||
- radv: Regression from 266b2cfe5bf3feda16747c50c1638fb5a0426958
|
||
- h264 encoding picture showed randomly repeated frames.
|
||
- [Google][Rex][anv] GLES dEQP test fails in anv when run via ANGLE-on-Venus on ChromeOS ARCVM.
|
||
- VAAPI on VCN: bad stream may crash whole gfx system
|
||
- aco: Assertion when compiling CP2077 shader
|
||
- [RADV] Dead by Daylight memory leak (shader-related?) on 23.1.6
|
||
- gpu hang on DG2 when running KHR-GLES31.core.texture_cube_map_array.image_op_tess*
|
||
- KHR-GLES31.core.texture_cube_map_array.image_op_tessellation_evaluation_sh fail on GFX12+
|
||
- wsi: deadlocks when DISPLAY is changed
|
||
- VAAPI: AMDGPU crash on RX 6900 XT on corrupted video
|
||
- [RADV] red and pink tinted shadows in Overwatch 2 on 7900 XTX
|
||
- blorp regression on dg2
|
||
- radv: commit 81641b01555faa4dd1dfc7de2513ad8d63e77ab7 leaded to artifacts in Quake II RTX
|
||
- [radv] Colors are distorted in Cyberpunk 2077 with ray tracing enabled
|
||
- Forza Horizon 5 stuttering since mesa 23.1.4 / 9b008673 revert as a FIX
|
||
- glCopyTexSubImage2D is very slow on Intel
|
||
- NVE4 (GeForce 710) fails to get vdpau in mesa git
|
||
- nouveau prevents hardware acceleration with Chromium (Wayland)
|
||
- Corrupt text rendering in Blender
|
||
- DRI2 gallium frontend is using bad format type
|
||
- Incorrect vlVaCreateBuffer/vlVaMapBuffer behavior for buffer type VAEncCodedBufferType in Gallium
|
||
- ci: do not download perfetto on-fly in build jobs
|
||
- Shared Memory Leak With Qt OpenGL Applications
|
||
- OpenGL, SIGSEGV when program pipeline objects has separated vertex shader progam and separated fragment shader progam with in/out
|
||
- 975a8ecc881873744d851ab0ef45ad7698eaa0ef "frontends/va: use resources instead of views" cause radeonsi can't play video.
|
||
- Rusticl,radeonsi: ac_rtld error(2): too much LDS
|
||
- aco, radv Rage 2 menu corruption - bisected
|
||
- radv, aco: World War Z character texture regression on 7900xtx
|
||
- lavapipe/llvmpipe: regressions since descriptor rewrite
|
||
- Building llvmpipe with LP_USE_TEXTURE_CACHE set fails since 23.2.0-rc1: error C2039: dynamic_state is not member of lp_build_sampler_soa in lp_tex_sample.c
|
||
- [anv] Death Stranding crashes
|
||
- Can no longer build Clover without llvmspirvlib
|
||
- Baldurs Gate 3 (DX11) - Graphical corruption on RDNA3 (ACO regression)
|
||
- intel: Deathloop and other DX12 games fail assert(validated) with invalid SEL instruction
|
||
- gpu hangs on dg2 with mesh shading enabled on vkcts
|
||
- GTF-GL46.gtf21.GL.build.CorrectFull_vert regressed on intel platforms
|
||
- radeonsi: Deadlock when creating a new GL context in parallel with linking a shader on another GL context
|
||
- robustness2 raygen tests intermittently fail in Intel Mesa CI
|
||
- glthread: huge performance regression
|
||
- DirectX games do not launch on Intel HD Graphics 4000 (IVB GT2) [bisected]
|
||
- [Vega 64] Newer Mesa-git revisions past 283be8ac3b8610a77b28ebe9e44b946b979f0381 crash the system when accessing hardware accelerated apps
|
||
- Docs: Imagination driver not have documentation in https://docs.mesa3d.org/
|
||
- Unigine Heaven broken on Navi 21 since https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22846
|
||
- [bisected] amdgpu graphics acceleration causing system crashes on 22f3bcfb5a33 or later
|
||
- anv: GPU hangs on MSAA tests with Angle
|
||
- AMD OpenGL texture corruption and crashing regression in java app
|
||
- The filenumber part of the #line preprocessor directive is ignored for multiline error messages
|
||
- r300: channel merging missed case for mad + mov
|
||
- radv: incorrect RTE rounding in corner cases
|
||
- Confidential issue #4103
|
||
- r600 regression
|
||
- clc: compiler_test gets built even if unit tests are disabled via -Dbuild-tests=false
|
||
- x11 swapchain fails to check for DRI3 PixmapFromBuffers error
|
||
- msys2: build fails with error: conflicting types for '_glapi_add_dispatch'
|
||
- [bisected][anv] newly enabled test (dEQP-VK.api.info.format_properties.g8_b8r8_2plane_420_unorm) failing
|
||
- deqp gles3.1 gpu hangs on DG2 A380 when running zink
|
||
- [amd/drm-shim] build issue on ppc64
|
||
- radeonsi: bogus advertisement for encode/decode support for 10 bit h264 video
|
||
- radeon: Basphemous graphical glitch
|
||
- aztec ruins gl benchmark slow to compile shaders on intel
|
||
- anv: false cacheline flushing or insufficient buffer alignment on at least ADL
|
||
- macOS build error timespec
|
||
- intel: Borderlands 2 misrendering with ZINK with OpTerminateInvocation instruction
|
||
- gallium: Error path in st_create_context_priv leads to segfault
|
||
- [Vulkan][TGL] vkCmdCopyQueryPoolResults failed to write buffer after compute shader write with overlap
|
||
- r600: GPU hang on The Long Dark on R600/R700
|
||
- Add iris and crocus to features.txt
|
||
- r600: Segfault on glxgears and almost every OpenGL applications on RV770 (regression)
|
||
- R9 280 - Broken font rendering in Godot Engine (GLES2) - Radeonsi
|
||
- radeonsi gcn1 regression
|
||
- ANV crashes on init on 32 bit builds
|
||
- eglCreateImageKHR should throw a error when called with anything but EGL_NO_CONTEXT
|
||
- virgl: Stack overflow in virgl_bind_sampler_states on hosts with more than 32 samplers
|
||
- [TGL] regression with r11_g11_b10 formats when running through virgl
|
||
- anv: incorrect vkGetPhysicalDeviceImageFormatProperties2KHR success
|
||
- r300: reconstruct ARR in shaders from wined3d
|
||
- ci: enable pre-merge testing for Zink/RADV
|
||
- rusticl: segmentation fault when enabling for llvmpipe and zink with the driver set to zink
|
||
- unify load_ubo_dxil and load_ubo_vec4
|
||
- Move \`lower_loads_and_stores_to_dxil` bit-size lowering logic to \`nir_lower_mem_access_bit_sizes`
|
||
- Intel drivers fail to link with -Dxmlconfig=disabled
|
||
- mesa: Remove dynamic dispatch stubs
|
||
- radv: regression UE5.2 nanite d3d12 vertex explosion
|
||
- [ANV/DG2] nvpro-samples/vk_raytracing_tutorial_KHR/ray_tracing_reflections crash
|
||
- validate_intrinsic_instr: Assertion \`dst' failed.
|
||
- anv: fails to build on aarch64
|
||
- radv: crash/freeze/assert with raytracing and Elden Ring 1.09
|
||
- Elden Ring freeze when summoning cooperator with Raytracing
|
||
- gc_alloc_size returns unaligned allocations
|
||
- Rusticl OpenCL: panicked at 'called \`Option::unwrap()` on a \`None` value' && void SPIRVKernelParser::applyDecoration(uint32_t, const spv_parsed_instruction_t*): Assertion \`c.first == id' failed.
|
||
- Using a \`NULL` pointer as \`bitmap` in \`glBitmap` leads to \`GL_OUT_OF_MEMORY` while creating display-list ("glNewList -> glBitmap")
|
||
- Bitwise and with constant 31 removed on width argument to BitFieldSExtract, causing incorrect result on RADV ACO
|
||
- Rusticl OpenCL: Simple SYCL / DPC++ program hangs indefinitely at rusticl::api::event::wait_for_events()
|
||
- radeonsi: Metro Last Light Redux graphical glitches
|
||
- radv: VK_KHR_fragment_shader_barycentric support
|
||
- freedreno/a6xx: assert(state->view_rsc_seqno[i] == seqno) failed with sway
|
||
- radv: Trackmania 2020 crashing on mesa-git
|
||
- radv crashes when using vertex format VK_FORMAT_B10G11R11_UFLOAT_PACK32
|
||
- changes in commit e4b6a0a82457b3ef40c5857412e20bc344ff302c leads to GPU hang
|
||
- radv,aco: In the game Rise of the Tomb Raider on RDNA 3 GPUs appeared artifacts after commit 290c3d360e5a6f5226c062d6a9267629adb1060e
|
||
- CI: Linux CI jobs naming
|
||
- docs.mesa3d.org contrast is bad in dark mode
|
||
- iris now requires Linux v5.3
|
||
- Wolfenstein II: The New Colossus vsync off crash
|
||
- Surfaceless mode ES2.0 number of vertices limitation
|
||
- freedreno: firefox crashes on video playback
|
||
- radv: unaligned vertex input regression
|
||
- rusticl build error: error[E0308]: mismatched types on some archs
|
||
- GetInternalformativ with GL_TEXTURE_2D and GL_CLEAR_TEXTURE incrrectly returns GL_INVALID_ENUM
|
||
- radeonsi: texturing is broken on R9.270x since eaf98b14220d8cbc186d67a929254acc3e7de41a
|
||
- aco: KHR-GL46.shader_image_load_store.basic-allTargets-atomic asserts on Bonaire
|
||
- Firefox / VA-API / H.264 decoding artifacts on AMD RX 6600 / Fedora 37
|
||
- AMD/RX 6600 - VA-API video output is corrupted if decoded surfaces are exported by vaExportSurfaceHandle and then quickly returned to ffmpeg/va-api decoder and reused
|
||
- Pixel Game Maker MV - Elfin Force
|
||
- Anvil - Vulkan CTS tests fail if has_context_isolation set to false
|
||
- anv: binding table pool leak or overly cached
|
||
- [regression/bisected] Bone wireframes are no longer rendered correctly in Blender with RadeonSi/Vega
|
||
- [nine, radeonsi] Texture missing in Heroes of Might and Magic 5
|
||
- [REGRESSION] Crash in \`loader_dri3_wait_gl()` due to \`dri3_front_buffer(draw) == NULL`
|
||
- r600: Visual glitches on The Long Dark with the NIR backend
|
||
- some piglit tests seg-fault if -Dgles1=disabled is set
|
||
- anv: Tom Clancy's Rainbow Six Siege [DX11: Image Corruption(FIXED)/Vulkan: crash on lauch]
|
||
- glSpecializeShaderARB works on SPIR-V compiled with shader compilers ca. 2021, but fails for SPIR-V generated with current compilers
|
||
- ANV: Vulkan driver regression in clearing Depth/Stencil
|
||
- radv: Sample rate shading broken in AC:Valhalla
|
||
- FTBFS: gallivm: src/gallium/auxiliary/gallivm/lp_bld_init.c:45:10: fatal error: llvm-c/Transforms/Scalar.h: No such file or directory (Legacy Pass Manager removed in LLVM 17)
|
||
- intel: workaround mechanism initialized before device revision (stepping) is available
|
||
- radv/rt: crash compiling Unity Enemies Demo RT pipelines
|
||
- gfxbench4/5 crashing on android
|
||
- mesa 23.0.3 build failure
|
||
- gallium-aux msan failure on Debian bookworm
|
||
- radv: Battlefield {1,5} hangs on RX 7900 XTX
|
||
- radv: graphical artifacts in MSFS running via DXVK on RX7900XT
|
||
- vulkancts regressions on bdw
|
||
- [BDW] intel/blorp: MCS partial resolve produces unexpected result
|
||
- Compile failure v23.0.0 - error: implicit declaration of function
|
||
- ci/radv: Stop setting MESA_SPIRV_LOG_LEVEL
|
||
- "frontends/va: report min width and min height values if available" broke VA-API tests on amd-raven
|
||
- [radeonsi] flickering debug chunk border lines in Minecraft
|
||
- nine: Lower alpha test in shader key? or require PIPE_CAP_ALPHA_TEST?
|
||
- radv, radeonsi: Rogue Legacy 2 alpha-to-coverage rendering issues
|
||
- [r600, TURKS] R600: Unsupported instruction: vec1 32 ssa_1 = intrinsic image_samples (ssa_0) on spec@arb_shader_texture_image_samples@compiler@fs-image-samples.frag (23.1.0-rc4)
|
||
- anv: Overwatch 2 hangs GPU with GPL enabled
|
||
- Penumbra: Overture ingame enviroment not displaying on Proton version
|
||
- Penumbra: Overture ingame enviroment not displaying on Proton version
|
||
- nir: 'base' may be used uninitialized
|
||
- vulkan/device_select: no way to select between GPUs of the same model due to bugs
|
||
- radv: 7900 XTX hair flickering/rendering issues in VaM
|
||
- radv: cache crashing
|
||
- nouveau: Regression in arb_transform_feedback_overflow_query-basic from multithreading patches
|
||
- regression in aco,ac/llvm,radv,radeonsi: handle ps bc optimization in nir for radv
|
||
- radeonsi: vaapi: \`width >2880 && width % 64 != 0` results in wrong width in h265 stream
|
||
- [regression] iris: unable to use driver as secondary GPU (primary AMD GPU)
|
||
- iris: steam doesn't render on dg2
|
||
- [llvm 16+] [microsoft-clc] opencl-c-base.h does not exist
|
||
- Vulkancts clipping / tesselation tests trigger gpu hang on DG2
|
||
- Swaped fields in picture in vlc and mythtv if hw accel is on
|
||
- radeonsi: glGetGraphicsResetStatusEXT keeps returning GL_INNOCENT_CONTEXT_RESET after a GPU reset
|
||
- WGL: Assert assigns dwThreadId variable
|
||
- Intel/anv: Modifier problems running gamescope embedded
|
||
- R600: drop TGSI code path
|
||
- r600_shader.c:193 r600_pipe_shader_create - translation from TGSI failed !
|
||
- nine regression with r600 (bisected)
|
||
- [ACO] [RADV] Flickering squares in some areas in The Last of Us Part 1 (with workaround)
|
||
- radv: Jedi Fallen Order flickering & blocky plants
|
||
- qemu 7.2.0-rc4 with sdl output crashes with assert
|
||
- nouveau: NV50 (NVAC) broken in latest master
|
||
- [dozen]: [vkcube] force closing on WSL2
|
||
- rusticl failed to build with rust-bindgen 0.65.0
|
||
- nine: strange color or transparency of trees when called DrawIndexedPrimitive?
|
||
- Regression, Bisected: glsl: Delete the lower_tess_level pass breaks r600 tesselation
|
||
- vkcts-navi21-valve failing often with GCVM_L2_PROTECTION_FAULT_STATUS:0x00X00830
|
||
- ci/radv: Remove vkctx-navi21-llvm-valve job?
|
||
- Deep Rock Galactic GPU freeze (AMD, DX11 DXVK Proton)
|
||
- radv: Resident Evil 4 Chainsaw Demo GPU hang with Navi 24
|
||
- radv: Gotham Knights GPU hang with Navi 24
|
||
- SPIR-V error "Invalid back or cross-edge in the CFG"
|
||
- SPIR-V parsing FAILED: Loop breaks can only break out of the inner most nested loop level
|
||
- ci: a618 traces performance broken
|
||
- aco: s_load_dword with negative soffset cause GPU hang
|
||
- d3d12: Attempting to display a framebuffer through GDI with low bpc produces on-screen corruption
|
||
- piglit.spec.ext_image_dma_buf_import.ext_image_dma_buf_import crash shutting down
|
||
- overlay layer: unable to launch titles on steam
|
||
- radv/zink: spec@ext_texture_integer@multisample-fast-clear gl_ext_texture_integer
|
||
- ci: a530-gl with 6.3 kernel
|
||
- a530: hangs with newer firmware version on db820c (apq8096)
|
||
- tu: debug marker support
|
||
- VAAPI: Wrong H.264 playback on RX 6900 XT and RX 6700 XT (all Sienna?)
|
||
- radv: possibly not setting state dirty bits correctly
|
||
- RADV: VRS attachment not working in specific scenario
|
||
- VAAPI/AMD: videos less than 64 pixels in width or height are decoded to black
|
||
- d3d12: DirectX doesn't support seperate stencil functions for front and back face
|
||
|
||
|
||
Changes
|
||
-------
|
||
|
||
Adam Jackson (1):
|
||
|
||
- egl: Clear EGL_WINDOW_BIT for non-double-buffered EGLConfigs
|
||
|
||
Alan Previn (2):
|
||
|
||
- drm-uapi: bump headers (except AMD)
|
||
- iris: Add GET_PARAM for protected context capability support
|
||
|
||
Alejandro Piñeiro (12):
|
||
|
||
- v3dv/pipeline: don't prepack up early-z configuration
|
||
- v3d: use more an auxiliar devinfo
|
||
- v3d: remove v3d_create_texture_shader_state_bo
|
||
- v3d: remove v3d_tfu_supports_tex_format
|
||
- v3d: remove v3d_get_internal_type_bpp_for_output_format
|
||
- broadcom/compiler: return NULL if we fail to register allocate
|
||
- v3d: assert if v3d_compile returns NULL
|
||
- broadcom/compiler: disable tmu pipelining when needed
|
||
- broadcom/compiler: clarify use of QFILE_VPM
|
||
- v3dv: refactor copy_image_to_buffer_blit
|
||
- v3dv: add a linear images to buffer copy codepath
|
||
- v3dv/device: update conformanceVersion
|
||
|
||
Aleksey Komarov (2):
|
||
|
||
- pan/va: Fix MUX.v2i16 and MUX.v4i8 description
|
||
- pan/va: fix typo in IADD_IMM.i32 description
|
||
|
||
Alex Denes (1):
|
||
|
||
- virgl: link VA driver with build-id
|
||
|
||
Alexander von Gluck IV (1):
|
||
|
||
- egl/haiku: Fix potential crash if double buffering is disabled
|
||
|
||
Alyssa Rosenzweig (289):
|
||
|
||
- gallium: Add u_default_get_sample_position
|
||
- zink: Use u_default_get_sample_position
|
||
- panfrost: Use u_default_get_sample_position
|
||
- freedreno: Use u_default_get_sample_position
|
||
- d3d12: Use u_default_get_sample_position
|
||
- nir: Add more system values for lowering XFB
|
||
- pan/bi: Don't set has_fsub
|
||
- asahi: Fix disk cache disable with AGX_MESA_DEBUG
|
||
- asahi: Minify width/height in create_surface
|
||
- asahi: Don't use depth/stencil staging blits
|
||
- asahi: Identify XML for barycentric coordinates
|
||
- asahi: Track write to separate stencil
|
||
- agx: Handle splits of uniforms
|
||
- agx: Fix abs/neg propagation into fcmpsel
|
||
- agx/lower_zs_emit: Fix progress returning
|
||
- agx: Handle linear 2D array textureSize()
|
||
- asahi: Explicitly ban MSAA, compression with linear
|
||
- asahi: Use 2D array staging resources for cube/3D
|
||
- asahi: Compress more texture targets
|
||
- agx: Remove bogus assert
|
||
- asahi: Use u_default_get_sample_position
|
||
- agx: Defeature fsub
|
||
- asahi: Use device_load shift for VBO loads
|
||
- agx: Fix packing for iadd with shift
|
||
- asahi: Rename no colour output to tag write disable
|
||
- asahi: Copy resources if needed to shadow
|
||
- agx: Don't wait at the end of the shader
|
||
- asahi: Bind staging resources as RENDER_TARGET
|
||
- agx/lower_address: Add helper to match multiplies
|
||
- agx/lower_address: Match multiplies, not only shifts
|
||
- agx: Ensure load_frag_coord has the right sizes
|
||
- agx: Rework z/s emit
|
||
- agx: Validate that collect sources are the same size
|
||
- agx: Lower I/O to scalar later
|
||
- asahi: Shrink disk cache size of push ranges
|
||
- asahi: Bump MAX_PUSH_RANGES to the worst-case
|
||
- asahi: Implement transform feedback
|
||
- asahi: Fix depth load/store flags
|
||
- nir: Add nir_alu_src_as_uint helper
|
||
- pan/bi: Use nir_alu_src_as_uint
|
||
- agx: Use nir_alu_src_as_uint
|
||
- nir: Model AGX-specific multiply-shift-add
|
||
- agx: Handle imadshl_agx, imsubshl_agx
|
||
- agx: Fix packing of imsub instructions
|
||
- agx: Optimize multiplies
|
||
- zink: Always set a blend state for shader-db
|
||
- ail: Handle larger block sizes
|
||
- nir: Allow adding descriptions to ALU opcodes
|
||
- nir: Make ALU descriptions machine-readable
|
||
- docs: Include ALU opcode descriptions
|
||
- nir: Add nir_foreach_phi(_safe) macro
|
||
- nir: Use nir_foreach_phi(_safe)
|
||
- dxil: Use nir_foreach_phi_safe
|
||
- ac/llvm: Use nir_foreach_phi
|
||
- nir: Use nir_block_last_phi_instr more
|
||
- nir: Add unified atomics
|
||
- nir: Add pass to lower atomics to unified
|
||
- agx: Use unified atomics
|
||
- pan/bi: Use unified atomics
|
||
- pan/mdg: Fix icky formatting
|
||
- pan/mdg: Use unified atomics
|
||
- gallivm: Use unified atomics
|
||
- ntt: Use unified atomics
|
||
- ac/llvm: Don't handle atomic derefs
|
||
- ac/llvm: Use unified atomics
|
||
- aco,radv: Use unified atomics
|
||
- zink: Use unified atomics
|
||
- ir3: Use unified atomics
|
||
- nir: Handle unified atomics in simple cases
|
||
- nir/lower_task_shader: Handle unified atomics
|
||
- nir/lower_io: Handle unified atomics
|
||
- nir/lower_ssbo: Handle unified atomics
|
||
- nir/opt_uniform_atomics: Handle unified atomics
|
||
- nir/validate: Handle unified atomics
|
||
- radv: Constify radv_device_supports_etc
|
||
- radv: Use common GetPhysicalDeviceFeatures2
|
||
- r600: Use unified atomics
|
||
- lvp: Use common GetPhysicalDeviceFeatures2
|
||
- tu: Use common GetPhysicalDeviceFeatures2
|
||
- agx: Lower legacy atomics sooner
|
||
- pan/mdg: Lower legacy atomics earlier
|
||
- panvk: Lower legacy atomics earlier
|
||
- tu: Lower legacy atomics earlier
|
||
- v3dv: Lower legacy atomics earlier
|
||
- lavapipe: Lower legacy atomics sooner
|
||
- glsl/nir: Produce unified atomics
|
||
- nir/lower_atomics_to_ssbo: Produce unified atomics
|
||
- nir/lower_printf: Produce unified atomic
|
||
- mesa/st: Produce unified atomics
|
||
- vtn: Produce unified atomics
|
||
- intel: Produce unified atomics
|
||
- ac: Produce unified atomic
|
||
- treewide: Stop lowering legacy atomics
|
||
- nir: Drop nir_lower_legacy_atomics
|
||
- ntt: Stop handling legacy atomics
|
||
- nir: Drop legacy atomics in simple cases
|
||
- nir/lower_io: Drop legacy atomics
|
||
- nir/lower_task_shader: Drop legacy atomics
|
||
- nir/validate: Drop legacy atomics
|
||
- nir/opt_load_store_vectorize: Reclaim ATOMIC
|
||
- nir/opt_uniform_atomics: Drop legacy atomics
|
||
- nir: Remove legacy atomics
|
||
- nir: Drop unused name from nir_ssa_dest_init
|
||
- nir: Drop unused argument from nir_ssa_dest_init_for_type
|
||
- nir: Remove stale TODOs
|
||
- nir: Fix incorrect comment
|
||
- util: Add common hex dump utility
|
||
- asahi: Use common hexdump utility
|
||
- pan/decode: Use common hexdump
|
||
- CODEOWNERS: Update panfrost
|
||
- gallium: Drop Asahi-as-a-swrast hack
|
||
- asahi: Drop Asahi-as-a-swrast hack
|
||
- nir: Document extra image source
|
||
- nir: Add image_texel_address intrinsics
|
||
- nir: Add pass to lower image atomics
|
||
- pan/bi: Fix atomic exchange on Valhall
|
||
- pan/bi: Use nir_lower_image_atomics_to_global
|
||
- pan/mdg: Use nir_lower_image_atomics_to_global
|
||
- gallium: Add pipe_image_view::single_layer_view
|
||
- mesa/st: Set pipe_shader_image::single_layer_view
|
||
- dxil: Rely on scoped_barrier
|
||
- treewide: Avoid nir_lower_regs_to_ssa calls
|
||
- nir/opt_barriers: Add a default callback
|
||
- agx: Use common combine_all_barriers callback
|
||
- nir: Drop stale comments
|
||
- zink: Switch to scoped barriers
|
||
- panfrost/ci: Skip Piglit tests known to crash
|
||
- panfrost/ci: Skip hanging test
|
||
- nir: Add intrinsics for multisampling on AGX
|
||
- nir/builder: Add nir_replicate helper
|
||
- treewide: Use nir_replicate
|
||
- pan/lower_framebuffer: Use nir_replicate
|
||
- radv/query: Use nir_trim_vector
|
||
- intel/blorp: Use nir_trim_vector
|
||
- nir/print: Print locations for geometry shader inputs
|
||
- gallium: Add util_image_to_sampler_view helper
|
||
- panfrost: Use util_pipe_image_to_sampler_view
|
||
- nir: Add and use nir_tex_src_ssa
|
||
- treewide: Use nir_tex_src_for_ssa
|
||
- treewide: Use nir_trim_vector more
|
||
- agx: Set support_16bit_alu
|
||
- agx: Constant fold when optimizing int64
|
||
- agx: Use textures_used, not num_textures
|
||
- asahi: Add passes to lower MSAA
|
||
- asahi: Add passes to lower sample intrinsics
|
||
- asahi: Add alpha-to-coverage (and alpha-to-one) lowering
|
||
- agx: Assert that sample shading is lowered
|
||
- asahi: Set uses_sample_shading for background program
|
||
- asahi: Plumb API sample mask into shaders
|
||
- asahi: Plumb ppp_multisamplectl into shaders
|
||
- agx: Model both sources of sample_mask
|
||
- agx: Plumb in nir_intrinsic_load_sample_mask_in
|
||
- agx: Handle sample_mask_agx
|
||
- agx: Enable tag writes when sample mask written
|
||
- agx: Lower discard in NIR
|
||
- asahi,agx: Call lower_discard_zs_emit in the driver
|
||
- agx: Split iter and iterproj instructions
|
||
- agx: Model interpolation for iter instructions
|
||
- agx: Handle centroid and sample interpolation
|
||
- asahi: Lower MSAA
|
||
- asahi: Use nonempty tib for MSAA
|
||
- agx: Emit shader info late
|
||
- asahi: Advertise GL 3.1
|
||
- agx: Stop bit-inexact conversion propagation
|
||
- asahi: Add ASAHI_MESA_DEBUG=nowc flag
|
||
- asahi: Extract transition_resource helper
|
||
- asahi: Decompress writable images
|
||
- asahi: Decompress with format reinterpretation
|
||
- asahi: Remove stale comments
|
||
- pan/mdg: Drop lower_locals_to_regs call
|
||
- lima: Drop lower_locals_to_regs call
|
||
- ir2: Drop lower_locals_to_regs call
|
||
- nir: Add AGX atomic intrinsics
|
||
- agx: Refactor expressions in agx_nir_lower_address
|
||
- agx: Fold addressing math into atomics
|
||
- nir/builder: Add steal_tex_src helper
|
||
- nir/lower_tex: Use nir_steal_tex_src
|
||
- agx: Use common nir_steal_tex_src
|
||
- nir: Add interleave_agx instruction
|
||
- vtn: Handle atomic counter semantics
|
||
- ir3: Drop reference to unsupported intrinsic
|
||
- ttn: Emit scoped barriers when needed
|
||
- ntt: Use scoped barriers
|
||
- ac/llvm: Drop memory_barrier_buffer impl
|
||
- glsl: Assume use_scoped_barrier
|
||
- vtn: Assume use_scoped_barrier
|
||
- nir: Assume use_scoped_barrier
|
||
- ttn: Assume use_scoped_barrier
|
||
- treewide: Remove use_scoped_barrier
|
||
- nir/tests: Use scoped barriers internally
|
||
- nir: Remove handling for non-scoped barriers
|
||
- radeonsi: Scan for scoped barriers
|
||
- nir: Remove non-scoped barriers
|
||
- iris: Don't use STREAMING_LOAD without SSE
|
||
- nir/builder: Add ubitfield_extract_imm helper
|
||
- agx: Implement bitfieldExtract natively
|
||
- asahi: Use bitfield_extract for texture lowering
|
||
- nir: Remove integer and 64-bit modifiers
|
||
- aco: Drop NIR parallel copy handling
|
||
- nir: Add discard_agx intrinsic
|
||
- agx: Update explanation of sample_mask behaviour
|
||
- agx: Fix discards
|
||
- agx: Extract coordinate register size calculation
|
||
- agx: Recollect stored vectors at their use
|
||
- agx: Add loop header? flag
|
||
- agx: Validate predecessor information
|
||
- agx/lower_parallel_copy: Lower 64-bit copies
|
||
- agx: Implement vector live range splitting
|
||
- nir/lower_bool_to_int32: Fix progress reporting
|
||
- nir/lower_locals_to_regs: Add bool bitsize knob
|
||
- gallivm: Use NIR_PASS macros
|
||
- nir: Add pixel_coord, frag_coord_zw intrinsics
|
||
- nir: Add lower_frag_coord_to_pixel_coord pass
|
||
- pan/bi: Use lower_frag_coord_to_pixel_coord
|
||
- agx: Use nir_lower_frag_coord_to_pixel_coord
|
||
- asahi: Use txf for background program
|
||
- nir/lower_blend: Optimize masked out RTs
|
||
- nir: Add nir_builder_create returning nir_builder
|
||
- nir: Use nir_builder_create
|
||
- treewide: Use nir_builder_create more
|
||
- treewide: Remove unused builders
|
||
- nir: Add nir_foreach_function_impl helper
|
||
- nir: Convert to nir_foreach_function_impl
|
||
- nir/validate: Assert txf(_ms) matches dimension
|
||
- nir: Add nir_lower_robust_access pass
|
||
- broadcom/compiler: Use nir_lower_robust_access
|
||
- broadcom/compiler: Remove v3d_nir_lower_robust_access
|
||
- broadcom/compiler: Remove unused #define
|
||
- broadcom/compiler: Use nir_steal_tex_src
|
||
- nir: Add b32fcsel_mdg opcode for Midgard
|
||
- pan/mdg: Optimize b32csel(inot) in NIR
|
||
- pan/mdg: Type CSEL with a NIR pass
|
||
- pan/mdg: Lower isub in common code
|
||
- pan/mdg: Constant fold after algebraic_late
|
||
- pan/mdg: Add is_ssa helper
|
||
- pan/mdg: Fix IR from scheduling conditions
|
||
- pan/mdg: Fix 2-const CSEL at block beginning
|
||
- pan/mdg: Fix temp count calculation
|
||
- pan/mdg: Lower special reads better
|
||
- pan/mdg: Reset predicate.exclude while scheduling
|
||
- pan/mdg: Copy-prop even with swizzle restrictions
|
||
- pan/mdg: Propagate modifiers in the backend
|
||
- nir: Rename load/store_reg -> load/store_register
|
||
- nir: Rename nir_reg_{src,dest} -> nir_register_{src,dest}
|
||
- agx: Add algebraic opt to help with discard lowering
|
||
- agx: Smarten discard_agx -> sample_mask lowering
|
||
- asahi: Strip ? in GenXML
|
||
- asahi: Rename 'Render Target' to 'PBE'
|
||
- asahi: Identify PBE::sRGB flag
|
||
- asahi: Remove ; in perf_debug_ctx
|
||
- agx: Use nir_opt_shrink_stores
|
||
- agx: Use nir_opt_shrink_vectors
|
||
- agx: Assert that barriers are not used in the preamble
|
||
- asahi: Assert we don't transition shared resources
|
||
- asahi: Fix scissor_culls_everything check
|
||
- asahi: Use ralloc harder
|
||
- asahi: Take ownership of compute shader NIR
|
||
- agx: Don't leak ssa_to_reg_out
|
||
- asahi: Use txf_ms for MSAA background programs
|
||
- nir: Fix breaking in nir_foreach_phi(_safe)
|
||
- vulkan: Add vk_index_type_to_bytes helper
|
||
- lavapipe: Use vk_index_type_to_bytes
|
||
- v3dv: Use vk_index_type_to_bytes
|
||
- rogue: Remove commented convert_from_ssa call
|
||
- nir: Add intrinsics for register access
|
||
- nir: Add helpers for walking register uses
|
||
- nir: Add pass for trivializing register access
|
||
- nir: Add legacy data structures & helpers
|
||
- nir: Add new version of lower_regs_to_ssa
|
||
- nir: Produce intrinsics in lower_{phis,ssa_defs}_to_regs
|
||
- nir: Add intrinsics version of locals_to_regs
|
||
- nir: Add lower_vec_to_regs pass
|
||
- gallium: Return SSA values from TTN ALU helpers
|
||
- gallium: Convert TTN to register intrinsics
|
||
- mesa: Simplify ptn_log() a bit
|
||
- mesa: Return SSA defs from PTN ALU helpers
|
||
- mesa: Convert PTN to register intrinsics
|
||
- nir/lower_shader_calls: Convert to register intrinsics
|
||
- nir: Remove nir_lower_regs_to_ssa
|
||
- nir: Remove nir_register-based unit tests
|
||
- gallivm: Switch to reg intrinsics
|
||
- pan/mdg: Ingest new-style registers
|
||
- panfrost: Fix transform feedback on v9
|
||
- panfrost: Lower vertex_id for XFB
|
||
- panfrost: Fix transform feedback on v9 harder
|
||
- nir/trivialize: Handle more RaW hazards
|
||
- nir/lower_blend: Fix 32-bit logicops
|
||
- nir/lower_helper_writes: Consider bindless images
|
||
- nir/passthrough_gs: Fix array size
|
||
|
||
Amber (3):
|
||
|
||
- turnip: fix buffer markers using wrong addresses
|
||
- ir3, freedreno: implement GL_ARB_shader_draw_parameters
|
||
- freedreno: implement GL_ARB_indirect_parameters
|
||
|
||
Andres Calderon Jaramillo (1):
|
||
|
||
- r600: Report multi-plane formats as unsupported
|
||
|
||
Andres Gomez (3):
|
||
|
||
- .mailmap: add an alias for Miguel Casas-Sanchez
|
||
- .mailmap: add an alias for Clayton Craft
|
||
- .mailmap: add an alias for Christian Gmeiner
|
||
|
||
André Almeida (2):
|
||
|
||
- radv: debug: Update decode ring umr command
|
||
- radv: Search for guilty contexts at radv_check_status
|
||
|
||
Antonio Gomes (3):
|
||
|
||
- rusticl: Move nir compilation to Program
|
||
- rusticl: Drop some Kernel data and have a NirKernelBuild ref instead
|
||
- rusticl: Drop Program::kernel_count
|
||
|
||
Asahi Lina (33):
|
||
|
||
- asahi: Identify ZS resolve bits (tentative)
|
||
- asahi: Broadcast Z for all components on texture fetch
|
||
- asahi: Enable 2xMSAA (for deqp)
|
||
- asahi: Add batch state debugging
|
||
- asahi: Fix batch writer tracking for null batches
|
||
- asahi: Clear batch->resolve on agx_batch_init
|
||
- asahi: Assert that freed BOs have no pending writers
|
||
- asahi: Fix batch writer_syncobj cleanup
|
||
- asahi: Implement memory_barrier
|
||
- asahi: Implement create_fence_fd and fence_server_sync
|
||
- asahi: Make framebuffer texture barriers a no-op
|
||
- asahi: Disable tilebuffer write masking optimization
|
||
- asahi: Add missing stdbool include to lib/hexdump.h
|
||
- asahi: Fix check for sprite coord mode in agx_bind_rasterizer_state
|
||
- asahi: Add some more system registers
|
||
- asahi: Partially identify some missing index list stuff
|
||
- asahi: Lazily initialize batch state on first draw
|
||
- asahi: Make bo->writer_syncobj atomic
|
||
- ail: Implement multisampling for compression meta calculation
|
||
- asahi: Use ail_can_compress() in agx_compression_allowed()
|
||
- ail: Add MSAA tests
|
||
- asahi: Use os_dupfd_cloexec() instead of dup()
|
||
- asahi: Fix memory leak in agx_nir_lower_sysvals()
|
||
- asahi: Do not leak meta shader NIR
|
||
- asahi: Revert "Advertise ARB_texture_barrier"
|
||
- asahi: Disable PIPE_CAP_SURFACE_SAMPLE_COUNT
|
||
- asahi: Pass through surface sample count
|
||
- asahi: match_soa: Treat offsets as signed
|
||
- asahi: Identify the separate varying count fields
|
||
- asahi: Gather flat/linear shaded input info from uncompiled FS
|
||
- asahi: Fix type confusion for fragment shader keys
|
||
- asahi: Add flat/linear shaded varyings mask to the VS shader key
|
||
- asahi: Arrange VS varyings in the correct order
|
||
|
||
Axel Davy (17):
|
||
|
||
- frontend/nine: Fix missing clamping of pointsize for ff
|
||
- frontend/nine: Apply writemask to pointsize
|
||
- frontend/nine: fix fog key overflow
|
||
- frontend/nine: fix wfog
|
||
- frontend/nine: Fix num_textures count
|
||
- frontend/nine: Drop max_ps_const_f
|
||
- frontend/nine: Implement alpha test backup support
|
||
- frontend/nine: Implement backup support for pointsize
|
||
- frontend/nine: Improve VS_WINDOW_SPACE_POSITION fallback
|
||
- frontend/nine: Print warning incomplete position_t support
|
||
- frontend/nine: Enforce legacy pow behaviour
|
||
- frontend/nine: Get rid of INTERPOLATE_COLOR
|
||
- frontend/nine: initialize force_color_in_centroid
|
||
- docs/gallium: Clarify PIPE_CAP_CLIP_PLANES
|
||
- frontend/nine: Implement backup support for clip planes
|
||
- frontend/nine: Fix shader cap test for POSITIONT
|
||
- frontend/nine: Add debug driconf var force_features_emulation
|
||
|
||
Bas Nieuwenhuizen (11):
|
||
|
||
- radv: Reserve space for indirect descriptor set address writes.
|
||
- radv: Reserve space in the ACE pre/postambles.
|
||
- radv: Add stricter space checks.
|
||
- radv: Add asserts in radeon_emit{,_array}.
|
||
- radv: Move all the dirty flags from TES binding to TCS binding.
|
||
- amd/drm-shim: Add vangogh entry.
|
||
- amd/drm-shim: Add raphael&mendocino, polaris12 and gfx1100.
|
||
- amd/drm-shim: Update docs for more devices.
|
||
- aco: fix nir_op_vec8/16 with 16-bit elements.
|
||
- aco: Fix some constant patterns in 16-bit vec4 construction with s_pack.
|
||
- nir: Fix 16-component nir_replicate.
|
||
|
||
Benjamin Cheng (1):
|
||
|
||
- radv/video: use app provided hevc scaling list order
|
||
|
||
Benjamin Lee (1):
|
||
|
||
- intel: Fix stack overflow in intel_dump_gpu
|
||
|
||
Billy Laws (1):
|
||
|
||
- wgl: Fix depth/stencil image support when using zink kopper
|
||
|
||
Blisto (1):
|
||
|
||
- driconf: set vk_x11_strict_image_count for Wolfenstein II
|
||
|
||
Boris Brezillon (4):
|
||
|
||
- panfrost: Check blend enabled state in pan_allow_forward_pixel_to_kill()
|
||
- renderonly: Fix potential NULL deref in the error path
|
||
- renderonly: Make sure we reset scanout on error in create_kms_dumb_buffer_for_resource()
|
||
- winsys/panfrost: Make sure we reset scanout on error in create_kms_dumb_buffer_for_resource()
|
||
|
||
Boyuan Zhang (2):
|
||
|
||
- frontends/va: add default intra idr period
|
||
- radeonsi: disable H264HIGH10 profile
|
||
|
||
Brian Paul (5):
|
||
|
||
- llvmpipe: remove lp_setup_alloc_triangle()'s unneeded tri_size param
|
||
- llvmpipe: code clean-ups in llvmpipe_get_query_result_resource()
|
||
- lavapipe: clean-ups in lvp_GetQueryPoolResults()
|
||
- lavapipe: clean-ups in lvp_physical_device_get_format_properties()
|
||
- lavapipe: asst. clean-ups in lvp_execute.c
|
||
|
||
Caio Oliveira (56):
|
||
|
||
- spirv/tests: Add test for single-block loop
|
||
- spirv: Output spirv2nir tool result to stdout
|
||
- spirv: Add --optimize flag to spirv2nir tool
|
||
- spirv: Rework structured control flow handling
|
||
- spirv: Do more on spirv2nir --optimize
|
||
- spirv: Use NIR_PASS for spirv2nir --optimize
|
||
- spirv: Extract vtn_handle_debug_text() helper
|
||
- spirv: Fix gl_spirv_validation when OpLine with strings is present
|
||
- spirv: Improve the 'ID is the wrong kind of value' error messages
|
||
- mesa/spirv: Provide more specific error message for glSpecializeShader()
|
||
- spirv: Validate Dim of OpTypeSampledImage and OpSampledImage
|
||
- spirv: Assert sampler_dim is valid when building nir_tex_instr
|
||
- nir/print: Print 0 when mem_modes or resource_intel have no values
|
||
- nir/print: Do not print raw values
|
||
- spirv: Add workaround for OpImageQueryLevels with Multi-sampled images
|
||
- compiler/types: Make key in subroutine_name more effective
|
||
- r600/sfn: Fix warning about overloads hiding virtual functions
|
||
- spirv: Refactor and rename scope translation helper
|
||
- spirv: Use vtn_translate_scope for OpReadClockKHR
|
||
- intel/compiler: Refactor dump_instruction(s)
|
||
- intel/compiler: Remove unused functions and declarations
|
||
- compiler/types: Be consistent when naming array element/size
|
||
- compiler/types: Tidy up the asserts in get_*_instance functions
|
||
- compiler/types: Use hash table pre-hashed functions for type caching
|
||
- microsoft/clc: Add unreachable() to fix 'may be unitialized' warning
|
||
- compiler: Move from nir_scope to mesa_scope
|
||
- compiler: Add mesa_scope_name() function
|
||
- nir/print: Use mesa_scope_name() function to print scopes
|
||
- intel/compiler: Move brw_kernel.c to the intel_clc target
|
||
- compiler/clc: Rename the internal library from libclc to libmesaclc
|
||
- compiler/clc: Move related NIR passes to the common mesa clc
|
||
- compiler: Move spirv into a module of its own
|
||
- nir/print: Print whether the shader is internal or not
|
||
- intel/compiler: Respect NIR_DEBUG_PRINT_INTERNAL flag
|
||
- meson: Explicitly add "check : false" to a couple instances of run_command
|
||
- vulkan: Add NV suffix to VK_NV_cooperative_matrix feature names
|
||
- vulkan: Update XML and headers to 1.3.255
|
||
- nir: Allow nir_gather_ssa_types() to ignore regs instead of assert
|
||
- nir/print: Improve NIR_PRINT=print_consts by using nir_gather_ssa_types()
|
||
- nir/print: Make NIR_DEBUG=print_consts behavior the default
|
||
- nir: Make a const-friendly way to get the offset_src and arrayed_io_src from intrinsic
|
||
- nir: Extract logic to get dest and srcs types from intrinsic
|
||
- nir/print: Use src_type when printing consts in SSA uses
|
||
- nir/print: Print more representations in load_const
|
||
- nir/print: Use symbols % for SSA and @ for intrinsic
|
||
- nir/print: Use \`bN` instead of \`block_N` for identifying basic blocks
|
||
- nir/print: Use BITSIZExELEMENTS for SSA sizes
|
||
- nir/print: Align instructions around \`=`
|
||
- nir/print: Rename print_tabs() to print_indentation() and use it more
|
||
- nir/print: Don't use comment syntax for deref_cast properties
|
||
- nir/print: Use \`//` for comments
|
||
- nir/print: Use 4-space indentation
|
||
- nir/print: Print div/con annotation first
|
||
- nir/print: Reformat the preds/succs block information
|
||
- meson: Ensure that LLVMSPIRVLib is not required for Clover
|
||
- compiler/types: Use right hash for function types
|
||
|
||
Caleb Cornett (3):
|
||
|
||
- d3d12: Fix Xbox GDK build errors
|
||
- wgl: Add BITMAPV5HEADER to stw_gdishim.h
|
||
- d3d12: Fix Xbox frame scheduling for interval != 1
|
||
|
||
Charmaine Lee (7):
|
||
|
||
- translate: do not clamp element index in generic_run
|
||
- svga: set PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY for VGPU10 device
|
||
- mesa/main: fix distance attenuation calculation in ffvertex
|
||
- svga: fix shader type after ntt
|
||
- svga: fix compute shader type after ntt
|
||
- svga: lower images before ntt
|
||
- svga: set clear_texture to NULL for vgpu9
|
||
|
||
Chia-I Wu (24):
|
||
|
||
- drm-shim: apply file overrides for open
|
||
- amd/drm-shim: add amdgpu drm-shim
|
||
- hasvk: Refactor Android externalFormat handling in CreateYcbcrConversion
|
||
- hasvk/android: Use VkFormat for externalFormat
|
||
- hasvk: Use the common vk_ycbcr_conversion object
|
||
- vulkan: make sure vk_image::format is never UNDEFINED
|
||
- vulkan: make sure vk_image_view::format is never UNDEFINED
|
||
- vulkan: rename vk_image::ahardware_buffer_format
|
||
- vulkan: define inline stubs when android api level < 26
|
||
- vulkan: add vk_ahb_format_to_image_format
|
||
- anv,hasvk,radv: do not fall back to AHARDWAREBUFFER_FORMAT_BLOB
|
||
- vulkan: add vk_image_format_to_ahb_format
|
||
- anv,hasvk: android ahb is not always exportable
|
||
- radv: improve externalMemoryFeatures for android ahb
|
||
- amd/drm-shim: add raven2
|
||
- ac/surface: print tile_swizzle as well
|
||
- radv: do not use a pipe offset for aliased images
|
||
- aco: fix alignment check in emit_load
|
||
- ac, radeonsi: add and use ac_get_ps_iter_mask
|
||
- radv: fix gl_SampleMaskIn for sample shading
|
||
- radv: fix msaa feedback loop without tc-compat cmask
|
||
- radv: fix non-square compressed image copy on gfx9
|
||
- radv: disable calibrated timestamps on raven/raven2
|
||
- ac/surface: limit RADEON_SURF_NO_TEXTURE to color surfaces
|
||
|
||
Christian Gmeiner (31):
|
||
|
||
- etnaviv: Add util_blitter_save_so_targets(..) call
|
||
- etnaviv: nir: improve uniform usage for ALU opc
|
||
- etnaviv: correct number of instructions in dump_shader_info(..)
|
||
- etnaviv: move printing of final shader out of etna_link_shaders(..)
|
||
- etnaviv: nir: do not call nir_lower_idiv(..) unconditionally
|
||
- etnaviv: make wider use of DBG_ENABLED(..)
|
||
- ci: add debian-arm32-asan
|
||
- ci/etnaviv: add asan run
|
||
- etnaviv: Add support for conditional rendering
|
||
- etnaviv: add support for performance warnings
|
||
- mesa/arbprog: fix compile errors
|
||
- etnaviv: remove tgsi remains
|
||
- etnaviv: drop usage of tgsi_swizzle_names
|
||
- etnaviv: remove not used tgsi includes
|
||
- ci/etnaviv: update ci expectation
|
||
- ir3/analyze_ubo_ranges: Move IR3_DBG_NOUBOOPT check
|
||
- etnaviv: nir: call nir_remove_dead_variables(..) before linking setup
|
||
- etnaviv: linker: add fallback lookup to VARYING_SLOT_BFC[n]
|
||
- nir: add helper to clear all pass_flags
|
||
- nir/lower_amul: make use nir_shader_clear_pass_flags(..)
|
||
- etnaviv: make use nir_shader_clear_pass_flags(..)
|
||
- etnaviv: nir: do a late nir_opt_cse run
|
||
- docs: mark OES_texture_half_float done on etnaviv
|
||
- etnaviv: support OES_texture_half_float_linear
|
||
- ci/etnaviv: update ci expectation
|
||
- docs: update etnaviv extensions
|
||
- etnaviv: linker: handle scenario where there are FS inputs without matching VS output
|
||
- etnaviv: linker: clean up etna_link_shader(..)
|
||
- nir: rename intrinsic to have a more generic nameing
|
||
- nir: rename has_txs to has_texture_scaling
|
||
- nir/lower_tex: optimize offset lowering for has_texture_scaling
|
||
|
||
Christopher Snowhill (2):
|
||
|
||
- Corrects log print to produce hexadecimal base output
|
||
- intel: Sync xe_drm.h
|
||
|
||
Collabora's Gfx CI Team (4):
|
||
|
||
- Uprev Piglit to 79a084c56b6dd79f7c3a97b57a72963121ebb1e6
|
||
- Uprev Piglit to 536975d94a40cf76a69fcfa786c2513eccd0c989 https://gitlab.freedesktop.org/mesa/piglit/-/compare/79a084c56b6dd79f7c3a97b57a72963121ebb1e6...536975d94a40cf76a69fcfa786c2513eccd0c989
|
||
- Uprev Piglit to d8c08d123fadb986e9a8a7887b922ff63fcff52e https://gitlab.freedesktop.org/mesa/piglit/-/compare/536975d94a40cf76a69fcfa786c2513eccd0c989...d8c08d123fadb986e9a8a7887b922ff63fcff52e
|
||
- Uprev Piglit to 5036601c43fff63f7be5cd8ad7b319a5c1f6652c
|
||
|
||
Connor Abbott (42):
|
||
|
||
- tu: Don't override depth for GMEM
|
||
- tu: Don't pre-shift depth and stencil pitch
|
||
- freedreno/fdl: Don't pre-shift image view pitch
|
||
- freedreno/fdl: Expose view offset
|
||
- tu: Add 3D GMEM load path
|
||
- tu: Use dirty bit for scissor state
|
||
- tu: Precompute maximum views across all subpasses
|
||
- tu: Merge RB_DEPTH_CNTL and RB_STENCIL_CONTROL drawstates
|
||
- tu: Make dynamic viewport and scissor count more accurate
|
||
- freedreno/a6xx: Document per-view viewport in GRAS_SU_CNTL
|
||
- tu: Parse fragment density map attachment info
|
||
- tu: Implement sampling the fragment density map
|
||
- tu/cs: Add support for CS patching
|
||
- tu: Add core FDM patchpoint infrastructure
|
||
- ir3: Record whether a shader writes gl_ViewportIndex
|
||
- tu: Implement FDM viewport patching
|
||
- tu: Implement FDM scaled loads/stores
|
||
- nir, ir3: Add option to use unscaled FragCoord for input attachments
|
||
- tu, ir3: Handle FDM shader builtins
|
||
- tu/autotune: Always prefer GMEM with fragment density maps
|
||
- tu: Don't allow importing/exporting subsampled images with modifiers
|
||
- tu: Expose VK_EXT_fragment_density_map
|
||
- util/bitset: Add some extra functions
|
||
- vk/graphics_state: Remove vk_subpass_info
|
||
- vk/graphics_state: Add feedback_loop_input_only
|
||
- vk/graphics_state: Add VI_BINDINGS_VALID state
|
||
- vk/graphics_state: Fix some assertions when copying state
|
||
- vk/graphics_state: Add helpers for pre-baking state
|
||
- radv: Fix radv_pipeline_is_blend_enabled
|
||
- vk/graphics_state: Track attachment count as state
|
||
- vulkan: Fix renderpass flags with driver-specific renderpass
|
||
- vk/graphics_state: Don't track each vertex input field
|
||
- tu: Don't use A6XX_PC_PRIMITIVE_CNTL_0::TESS_UPPER_LEFT_DOMAIN_ORIGIN
|
||
- freedreno/a6xx: Fix name of A6XX_PC_PRIMITIVE_CNTL_0::TESS_UPPER_LEFT_DOMAIN_ORIGIN
|
||
- tu: Split pipeline struct into different types
|
||
- tu: Rewrite to use common Vulkan dynamic state
|
||
- tu: Use common dirty tracking for PC_PRIMITIVE_CNTL_0
|
||
- freedreno/regs: Document a7xx CP_FIXED_STRIDE_DRAW_TABLE
|
||
- tu: Fix vk2tu_*_stage flag type
|
||
- vk/graphics_state: Fix copying MS locations pipeline state
|
||
- tu: Fix per-view viewport state propagation
|
||
- tu: Fix assert in FDM state emission
|
||
|
||
Constantine Shablia (3):
|
||
|
||
- anv: move get_features after get_device_extensions (ugly diff)
|
||
- panvk: use common vkGetPhysicalDeviceFeatures2
|
||
- v3dv: use common vkGetPhysicalDeviceFeatures2
|
||
|
||
Constantine Shablya (7):
|
||
|
||
- vulkan: add common implementation of vkGetPhysicalDeviceFeatures2
|
||
- vulkan: introduce supported_features parameter to vk_physical_device_init
|
||
- anv: switch to using the common vkGetPhysicalDeviceFeatures2
|
||
- vulkan: inline vk_get_physical_device_features into vk_common_GetPhysicalDeviceFeatures2
|
||
- vulkan: put interesting code before boring code
|
||
- vulkan: put TEMPLATE_H before TEMPLATE_C
|
||
- vulkan: rename vk_physical_device_features.py to vk_physical_device_features_gen.py
|
||
|
||
Corentin Noël (18):
|
||
|
||
- ci: Uprev crosvm and virglrenderer
|
||
- nir: Propagate the type sampler type change to the used variable.
|
||
- build-crosvm: Use the pkg-config crate 0.3.27
|
||
- util: Use the gcc_struct attribute for packed structures in mingw
|
||
- ci: Bump base tag to rebuild piglit
|
||
- ci: uprev virglrenderer and crosvm
|
||
- gallium: Incorporate the device release in dri_destroy_screen_helper
|
||
- gallium: Rename dri_destroy_screen_helper into dri_release_screen
|
||
- pipe-loader: Document the behavior regarding screen creating failures
|
||
- pipe-loader: Do not destroy the winsys on screen creation failure
|
||
- gallium: Only call dri_init_options when the screen is actually created
|
||
- gallium: Use the common destroy function on screen initialization failure
|
||
- gallium: Rename dri_init_screen_helper into dri_init_screen
|
||
- compiler: Allow the explicit_stride of aoa types to be zero
|
||
- nir/split_64bit_vec3_and_vec4: Use the right number of components
|
||
- ci: Uprev virglrenderer
|
||
- ci: Add locked flag to bindgen-cli installation
|
||
- virgl: Do not expose EXT_texture_mirror_clamp when using a GLES host
|
||
|
||
Daniel Schürmann (60):
|
||
|
||
- radv/rt: fix total stack size computation
|
||
- radv/rt: properly destroy radv_ray_tracing_lib_pipeline on error
|
||
- radv/rt: rename radv_ray_tracing_module -> radv_ray_tracing_group
|
||
- radv/rt: add shader stage indices to radv_ray_tracing_group
|
||
- radv/rt: replace uses of pGroups with radv_ray_tracing_group
|
||
- radv/rt: remove merged VkRayTracingShaderGroupCreateInfoKHR
|
||
- vulkan/pipeline_cache: replace raw data objects on cache insertion of real objects
|
||
- vulkan/pipeline_cache: use vk_pipeline_cache_insert_object() to replace raw data objects
|
||
- radv: add padding to radv_shader_binary_legacy
|
||
- vulkan/pipeline_cache: expose vk_raw_data_cache_object
|
||
- radv/pipeline_cache: add NIR caching capabilities
|
||
- radv/rt: expose radv_parse_rt_stage()
|
||
- radv/rt: introduce struct radv_ray_tracing_stage
|
||
- radv/rt: retain parsed NIR shaders in radv_ray_tracing_lib_pipeline
|
||
- radv/rt: use precompiled stages to create RT shader
|
||
- radv/rt: refactor compute_rt_stack_size() to use radv_ray_tracing_stage information
|
||
- radv/rt: remove merged VkPipelineShaderStageCreateInfo
|
||
- radv/rt: Fix and improve VkPipelineCreationFeedback
|
||
- radv/rt: change base of radv_ray_tracing_lib_pipeline to radv_compute_pipeline
|
||
- radv/rt: unify radv_ray_tracing_lib_pipeline and radv_ray_tracing_pipeline
|
||
- radv/rt: unify radv_rt_pipeline_create() and radv_rt_pipeline_library_create()
|
||
- radv/rt: refactor radv_rt_pipeline_compile()
|
||
- radv/rt: use vk_multialloc for radv_ray_tracing_pipeline
|
||
- radv/rt: store stack_sizes per stage instead of per group
|
||
- vulkan/pipeline_cache: don't log warnings for internal caches
|
||
- vulkan/pipeline_cache: don't log warnings for client-invisible caches
|
||
- radv: add remaining RT shader args for separate compilation
|
||
- nir,amd: add nir_intrinsic_store_[scalar|vector]_arg_amd to overwrite inputs
|
||
- nir: add nir_intrinsic_resume_shader_address_amd
|
||
- aco: implement nir_intrinsic_load_resume_shader_address_amd
|
||
- aco: implement select_program_rt()
|
||
- radv/rt: adjust shared_size when lowering hit_attribs
|
||
- radv/rt: extend radv_pipeline_group_handle with shader VAs
|
||
- radv/shader_info: add RT stages to radv_get_user_data_0()
|
||
- radv/rt: implement radv_nir_lower_rt_abi to lower RT shaders for separate compilation
|
||
- radv/rt: implement radv_rt_nir_to_asm()
|
||
- radv/rt: change RT main shader to MESA_SHADER_INTERSECTION
|
||
- radv/rt: replace pCreateInfo with VkPipelineCreateFlags in rt_variables
|
||
- radv/rt: pass radv_ray_tracing_pipeline to RT shader creation
|
||
- radv/rt: add and use specialized cache search/insert functions
|
||
- radv/rt: reference library shaders during radv_rt_fill_stage_info()
|
||
- radv/rt: don't write cache hit feedback per stage.
|
||
- radv/rt: create compile_rt_prolog() function
|
||
- radv/rt: set up RT shader args for separate compilation
|
||
- radv/rt: adjust lower_rt_instructions() for shader functions [disables RT]
|
||
- aco: adjust RT prolog for shader functions [disables RT]
|
||
- radv/rt: separate shader compilation
|
||
- radv/debug: dump ray tracing shaders in case of a hang
|
||
- radv/rt: use priorities to select the next shader
|
||
- radv/rt: remove now dead code
|
||
- radv: reference pipeline cache object in radv_pipeline
|
||
- aco/assembler: align resume shaders with cache lines
|
||
- aco/assembler: align loops if it reduces the number of cache lines
|
||
- aco/assembler: change prefetch mode on GFX10.3+ during loops if beneficial
|
||
- vulkan/pipeline_cache: add 'skip_disk_cache' option
|
||
- radv/meta: disable disk cache for meta shaders
|
||
- radv: migrate radv_shader hash to BLAKE3
|
||
- amd: move end-of-code marker padding to ACO.
|
||
- amd: Do shader binary alignment for prefetch at memory allocation time.
|
||
- aco/insert_exec_mask: set Exact mode after p_discard_if when necessary
|
||
|
||
Daniel Stone (10):
|
||
|
||
- wsi/wayland: Support VK_KHR_present_wait
|
||
- ci/zink: Disable Freedoom trace on ANV
|
||
- ci: Respect $HTTP_PROXY for ci_run_n_monitor
|
||
- ci: Elaborate causes for job retries
|
||
- ci: Don't retry manual or scheduled jobs
|
||
- ci: Extend a618_vk_full runtime
|
||
- CI: Re-enable freedreno CI
|
||
- ci/fdno: Pause a660 testing
|
||
- Revert "ci/fdno: Pause a660 testing"
|
||
- egl/wayland: Always initialise fd_display_gpu
|
||
|
||
Danylo Piliaiev (42):
|
||
|
||
- freedreno: Early exit in device matching if id doesn't have chip_id
|
||
- ir3/a7xx: NOPs may have some no-op bits set
|
||
- ir3/a7xx: Add new lock/unlock CS instructions
|
||
- ir3/a7xx: Add new form of stg.a/ldg.a addressing
|
||
- ir3/a7xx: Add STSC definition
|
||
- ir3: Document that stc has higher DST upper bound than we defined
|
||
- ir3/a7xx: Document "alias" instruction
|
||
- ir3: documents (ss) flag for cat7 instructions
|
||
- tu: Create drm fd per logical device
|
||
- tu: Move VMA heap to the logical device
|
||
- tu: Re-enable bufferDeviceAddressCaptureReplay
|
||
- freedreno/perfcntrs: Link with libfreedreno_common
|
||
- freedreno: Decouple GPU gen from gpu_id/chip_id
|
||
- freedreno,ir3: Don't call fd_dev_64b more than necessary
|
||
- freedreno/decode: Correctly handle chip_id
|
||
- tu: Add missing dbg reg stomping to tu_CmdBeginRendering
|
||
- tu: Fix zombie VMAs array not initialized when first BOs may be freed
|
||
- freedreno/regs: Print xml validation error if validation fails
|
||
- freedreno/rnn: Fix addvariant being set effectively once
|
||
- freedreno/rnn: Make addvariant work for fields in the same reg
|
||
- freedreno/rnn: Take into account array's variant for regs
|
||
- freedreno/regs: Change a7xx regs to have open range for generation
|
||
- freedreno/regs: More CP commands are the same on a7xx as on a6xx
|
||
- freedreno/regs: Document CP_MEM_TO_SCRATCH_MEM
|
||
- freedreno/regs: Document a7xx CP_MODIFY_TIMESTAMP
|
||
- freedreno/regs: Clarify polling on a7xx for CP_WAIT_REG_MEM/CP_COND_WRITE5
|
||
- freedreno/regs: Add a7xx pseudo-regs to CP_SET_PSEUDO_REG
|
||
- freedreno/regs: a7xx has a new source type CP_REG_TEST
|
||
- freedreno/regs: Add 2 new a7xx modes to CP_COND_REG_EXEC
|
||
- freedreno/regs: Add some new a7xx events
|
||
- freedreno/regs: Add more a7xx regs and reg fields
|
||
- freedreno/regs: Fix a7xx SP_FS_PREFETCH definition
|
||
- freedreno/regs: Generate per-gen reg usage tables
|
||
- freedreno/regs: Define usage for all a6xx/a7xx regs
|
||
- tu: Allow reg stomping of compute related registers
|
||
- tu: Use reg usage tables for stale reg dbg option
|
||
- freedreno/regs: Properly document a7xx CP_EVENT_WRITE, CP_WAIT_TIMESTAMP
|
||
- freedreno/regs: Document a7xx CP_BV_BR_COUNT_OPS
|
||
- freedreno/regs: Rename SP_FS_CTRL_REG0.DIFF_FINE into LODPIXMASK
|
||
- ir3: Fix FS quad ops returning wrong values from helper invocations
|
||
- tu,freedreno: Forbid blit event for R8G8_SRGB due to gpu faults
|
||
- radv: fix unused non-xfb shader outputs not being removed
|
||
|
||
Dave Airlie (134):
|
||
|
||
- radeonsi/ac: move some vcn defines to common
|
||
- radv/video: add missing gfx family
|
||
- radv: set a video decode ip block in physical device.
|
||
- radv/winsys: handle encoder queue padding/submits.
|
||
- radv/video: add a video addr gfx mode
|
||
- radv/video: fix dpb surface programming
|
||
- radv/video: start adding gfx11 vcn decoder
|
||
- lp_jit: use pipe max for the lp_jit texture levels.
|
||
- gallivm: consolidate draw/lp texture type.
|
||
- gallivm: consolidate llvmpipe/draw sampler types.
|
||
- gallium: consolidate jit image types between draw/llvmpipe
|
||
- gallivm: reorder some texture/image members.
|
||
- vulkan/cmd_queue: handle beta extensions.
|
||
- vulkan: write beta extensions into generator scripts.
|
||
- draw: align common members in jit context structs.
|
||
- llvmpipe: refactor fs/cs jit structure members.
|
||
- gallivm: refactor common resources out of contexts
|
||
- gallivm/draw/llvmpipe: consolidate the sampler/image dynamic state fns
|
||
- gallivm: add common code for sample/image tracking.
|
||
- llvmpipe: move to common sampler/image binding code
|
||
- draw: move to use common sampler/image binding code
|
||
- llvmpipe/cs: refactor cs generator args to use an enum
|
||
- gallivm/draw: refactor vertex header jit type out
|
||
- llvmpipe: convert a bunch of shader_type ifs to switches.
|
||
- llvmpipe/cs: start making variant generator less compute specific
|
||
- llvmpipe/cs: support passing a csctx instead of using implicit one
|
||
- lavapipe: add lavapipe specific shader stages define.
|
||
- lvp: explictly skip compute shader stage.
|
||
- gallivm: fix whitespace in get_deref_offset
|
||
- gallivm/nir: refactor the local invocation index calc.
|
||
- lvp: use stage mask
|
||
- lvp: use stage iterator macros instead of explicit loops
|
||
- ci: reenable lavapipe
|
||
- radv/video: add missing space checks for video.
|
||
- radv/video: use correct h264 levels
|
||
- radv/video: fix h264/265 dpb usage.
|
||
- radv/video: add missing offset to the dpb binding.
|
||
- radv/video: rework stream handle generation.
|
||
- radv/video: fix some whitespace.
|
||
- radv/video: add debug flag to enable dpb image array on newer GPUs.
|
||
- radv/video: fix physical device format property count.
|
||
- vk/video: add a common function to get block alignments for profiles
|
||
- radv: align video images internal width/height inside the driver.
|
||
- anv/video: move format properties to outarray.
|
||
- radv/meta: fix uninitialised stack memory usage.
|
||
- gallium: add task/mesh shader query types to stats interface.
|
||
- gallium: expand pipe_grid_info to handle task/mesh.
|
||
- gallium: add a new PIPE_SHADER_MESH_TYPES
|
||
- freedreno: don't report task/mesh.
|
||
- gallium: add task/mesh shader entrypoints in context
|
||
- iris: don't return shader params for task/mesh.
|
||
- crocus: don't report mesh/task limits
|
||
- radeonsi: don't report shader params for task/mesh
|
||
- svga: don't report mesh/task shader limits
|
||
- d3d12: don't report mesh/task limits
|
||
- gallium/cso: add task/mesh shaders to the cso cache
|
||
- gallium/nir/tgsi: add various support for task/mesh bits
|
||
- lavapipe: when in doubt, swizzle the swizzle
|
||
- lavapipe: fix pipeline sanitizing.
|
||
- lavapipe: fix indentation whitespace
|
||
- draw: add mesh shader infrastructure
|
||
- draw: move draw_vertex_info and draw_prim_info to public header.
|
||
- draw: add a mesh primitive assembler.
|
||
- draw: add mesh pipeline middle end.
|
||
- draw: add support for per primitive aos emission
|
||
- gallivm: add support for payload access
|
||
- gallivm/nir: add launch mesh workgroups
|
||
- gallivm/nir: add a mesh interface and vert/prim count setting.
|
||
- gallivm/nir: call task shader lowering.
|
||
- gallivm/nir: add support for mesh shader outputs.
|
||
- llvmpipe: resize arrays to handle mesh shaders.
|
||
- llvmpipe: start adding task/mesh support.
|
||
- llvmpipe: bump dirty tracker to 64-bits.
|
||
- llvmpipe: add dirty bits for mesh and task shaders.
|
||
- llvmpipe: add debug bit for mesh shaders
|
||
- llvmpipe: add query support for task/mesh shaders
|
||
- llvmpipe: bind task/mesh resources and dirty bits
|
||
- gallivm/cs: add payload ptr to the cs thread data.
|
||
- llvmpipe/cs: add task/mesh shader support to compute shader builder.
|
||
- llvmpipe/cs: add multiple stride indirect to fill_grid_info.
|
||
- llvmpipe: add mesh shader drawing.
|
||
- llvmpipe: enable task/mesh shader support.
|
||
- lavapipe: handle some mesh shader stage differences.
|
||
- lavapipe: add mesh query support
|
||
- lavapipe: add support for task/mesh shader stages in various places
|
||
- lavapipe: add execution backends for mesh shader draw apis
|
||
- lavapipe: enable task/mesh shaders.
|
||
- docs: update docs for lavapipe mesh shading
|
||
- llvmpipe: emit fences for barrier.
|
||
- lavapipe: don't remove queue family barriers.
|
||
- gallivm/nir: fix shuffleup tests.
|
||
- draw: rename jit to vs_jit in lots of places.
|
||
- draw/tess: drop unused tgsi bits.
|
||
- gallium/tgsi/draw/softpipe: remodel shader const/buffer bindings.
|
||
- draw: refactor resources to use arrays instead of explicit structs.
|
||
- draw: add a max stage define and use it in a few places
|
||
- draw: repack some members of context.
|
||
- radv/video: convert video format properties to an outarray
|
||
- radv/video: convert session memory requirements to outarray.
|
||
- radv/video: don't supply an 8-bit format for a 10-bit dpb.
|
||
- radv/video: rework h265 reference frame bindings.
|
||
- radv/video: fix hevc st rps programming
|
||
- radv/video: fix hevc scaling lists.
|
||
- lavapipe: ignore another yuv format.
|
||
- radv/video: report bad profile operation if h264 profile isn't supported.
|
||
- radv/video: fix hevc scaling list order.
|
||
- radv/video: program frame number correctly.
|
||
- radv/video: program hevc max dec pic buffering correctly
|
||
- radv/video: restrict the number of IBs on video related queues.
|
||
- ac/radeonsi: add av1 defaults header file from radeonsi
|
||
- radv/video: drop incorrect defines for uapi ones.
|
||
- lavapipe: check sampler pointer before deref
|
||
- draw/gs: handle extra shader outputs in geometry.
|
||
- lavapipe: expose subgroups in mesh/task shaders.
|
||
- gallivm: store thread id in separate values.
|
||
- gallivm: convert block_id to discrete values.
|
||
- gallivm: convert grid_size to discrete values.
|
||
- gallivm: make block_size use discrete values.
|
||
- clc: llvm 17 requires opaque pointers.
|
||
- gallium/va: fix superres av1 decoding.
|
||
- llvmpipe/linear: don't allow linear path for shader output with location frac
|
||
- llvmpipe/linear: refactor linear samplers into templated code.
|
||
- llvmpipe/linear/tgsi: calculate num_texs properly for nir.
|
||
- llvmpipe/linear: add sample routines for swapping r/b channels
|
||
- llvmpipe/linear: add support for sampling when cbuf order is different.
|
||
- llvmpipe/linear: add support for rgba color buffers.
|
||
- ci: update fails for fixed tests due to llvmpipe linear changes.
|
||
- gallivm: fix atomic global temporary storage.
|
||
- llvmpipe: fix fragdata/lastfragdata heuristic a bit more.
|
||
- zink: turn off threaded cpu access if not visible.
|
||
- llvmpipe: enable f16 paths on aarch64.
|
||
- radv: don't emit event code on video queues.
|
||
- spirv: use a pointer sized int type for opencl event_t
|
||
- radv/video: take db alignment into account when allocating images.
|
||
|
||
David (Ming Qiang) Wu (1):
|
||
|
||
- radeonsi/vcn: add an exception of field case for h264 decoding
|
||
|
||
David Heidelberg (129):
|
||
|
||
- ci/amd: 4/5 runners TPad-C13 runners are online, restore most of the tests
|
||
- ci/dxvk: uprev to 2.1
|
||
- ci/amd: update checksums after DXVK 2.1 update
|
||
- ci: bump kernel to the 6.3, support HDK 888 based on sm8350
|
||
- ci/freedreno: do not restrict to 2 cpus on a530
|
||
- ci: drop overriding new a530 firmware due to preemption issues with older kernel
|
||
- ci/freedreno: a530 behaves stable in 6.3
|
||
- ci/freedreno: update a530 flakes, fails and skips
|
||
- ci/freedreno: fix the a530_piglit job and switch to Weston
|
||
- ci: polish deqp-runner a bit
|
||
- ci: uninstall libdrm from the GL and VK containers
|
||
- ci: do not retry on forks to get the upstream kernel and rootfs
|
||
- ci/mold: bump to 1.11.0
|
||
- ci: add Adreno 660 on sm8350 chipset (HDK 888)
|
||
- ci/lava: implement fastboot support
|
||
- ci/lava: add support for HDK 888 firmware
|
||
- ci: add a660 firmware into rootfs
|
||
- pvr: drop unused variable
|
||
- ci/dzn: add flaking test
|
||
- ci/skqp: replace license with SPDX and extract the used branch
|
||
- ci/skqp: update to the Android CTS 12.1_r5 version
|
||
- mesa/main: drop unused variable
|
||
- nir/lower_io_to_vector: initialize base
|
||
- panvk: clear dangling pointers
|
||
- ci: uprev kernel to 6.3.1 with fixed patch for Adreno SMMU
|
||
- util/tests: adjust for new gtest
|
||
- gtest: Update to 1.13.0
|
||
- ci/skqp: handle all warnings printed with clang >= 14
|
||
- panvk: drop path from panvk_physical_device struct
|
||
- venus: drop unused sem_feedback_count from vn_queue_bind_sparse_submit_batch
|
||
- ci/broadcom: skip timeouting ssbo.layout.3_level_array.std430.mat4 on RPi4
|
||
- ci/venus: add recent flakes
|
||
- ci/freedreno: add recent a630 flake
|
||
- ci/v3d: add flaking opengl 1.1@depthstencil-default_fb-drawpixels-float-and-ushort
|
||
- ci/amd: re-enable VA-API testing
|
||
- ci/rules: radeonsi VAAPI rules should include also VA-API targets
|
||
- ci: update libva to 2.18.1
|
||
- ci/gtest: improve the runner script
|
||
- ci/amd: update VA-API expectations
|
||
- ci/amd: add radeonsi-raven-va-full job to cover all VA-API tests
|
||
- ci/gtest-runner: fix results reporting
|
||
- ci/venus: add missing flakes
|
||
- ci/crosvm: update cmdline options
|
||
- docs: update crosvm networking options
|
||
- ci/radv: add another raven flake dEQP-VK.draw.dynamic_rendering.primary_cmd_buff.linear_interpolation
|
||
- ci/v3dv: add often timeouting ssbo.layout.3_level_array.std140.column_major_mat4
|
||
- r300: workaround GCC 12+ warning, declare NULL value as unreachable
|
||
- docs: use meson instead invoking ninja directly
|
||
- ci/freedreno: disable 3 jobs to match our farm 3 devices down
|
||
- ci/freedreno: rename piglit job to respresent the real testing it does
|
||
- ci: move from pkg-config to pkgconf
|
||
- ci: use meson setup and meson install instead of meson and invoking ninja directly
|
||
- ci: bump libdrm from 2.4.110 to 2.4.114 present in Debian 12
|
||
- ci: install stock android-libext4-utils (available in 12, bookworm)
|
||
- ci: bump gfxreconstruct revision up to compatible version with Debian 12
|
||
- ci: libwayland from 1.18 to 1.21 and wayland protocols from 1.24 to 1.31
|
||
- ci: VVL uprev (temporary until new release will be published)
|
||
- ci: bump from Debian 11 (bullseye) to 12 (bookworm)
|
||
- ci/apitrace: install win64 apitrace only on x86_64
|
||
- ci/crosvm: install libelogind0 and sysvinit-core for poweroff functionality
|
||
- ci: add clang-15 and clang++-15 wrapper script
|
||
- ci/skqp: skqp can't live with compiler named clang-15, provide symlink
|
||
- ci: drop gallium-aux test on msan builds, renable freedreno
|
||
- ci/mingw: disable as it's broken
|
||
- ci/venus: add fail after CI uprev to the Debian 12
|
||
- ci/virpipe: add flakes introduced with CI uprev to Debian 12
|
||
- ci/zink: disable flaking anv traces
|
||
- ci: enable shellcheck on whole .gitlab-ci
|
||
- ci: disable bogus GCC warning with -Warray-bounds
|
||
- ci: do not fail when SHA1 impl. produce stringop-overreads warning
|
||
- ci/lavapipe: document subgroups.shuffle.compute.subgroupshuffleup_double_constant crash
|
||
- ci/lavapipe: zink failures
|
||
- ci/llvmpipe: document intel_shader_atomic_float_minmax@execution@ssbo-atomic*
|
||
- bin/ci: mention requirements.txt
|
||
- gitlab: add template for merge requests
|
||
- ci/zink: add KHR-GL46.limits.max_fragment_interpolation_offset flake
|
||
- ci/amd: previously missed raven flake
|
||
- ci/panfrost: add largest possible eglcreatepbuffersurface and then glclear flake
|
||
- gitlab: prefill MR template with first multiline commit message
|
||
- ci: bump Alpine to 3.18
|
||
- ci/ccache: recent ccache changed a output a bit, adapt script
|
||
- ci: rename x86 and amd64 to x86_64, armhf to arm32, and i386 to x86_32
|
||
- ci: use bash arrays in Fedora script + shebang change
|
||
- ci/fedora: re-enable ccache
|
||
- traces: update sir-f720 trace expectations for zink on anv and freedreno
|
||
- ci: missed variable inside the big rename and split ARCH and DEBIAN_ARCH
|
||
- ci: fix KVM module modprobe code
|
||
- ci: explicitely state BUILDTYPE
|
||
- ci: rename S3 artifacts according to scheme mesa-$arch-$config-$buildtype
|
||
- ci: rename MINIO to S3
|
||
- ci: rename MINIO_HOST variable to S3_HOST
|
||
- ci: replace MINIO_RESULTS_UPLOAD with S3_RESULTS_UPLOAD
|
||
- ci: remove BUILD_PATH, always use S3_ARTIFACT_NAME
|
||
- ci/lava: rename rest local MINIO\_ variables to S3\_
|
||
- ci/android: remove the artifact file just as we unpack it
|
||
- ci: valve and freedreno farm is down
|
||
- ci/windows: move microsoft farm rules
|
||
- ci/etnaviv: if farm is down, we expect no manual jobs can be triggered
|
||
- ci/amd: hide vaapi job dependent on Collabora farm when it's down
|
||
- ci/crocus: depend on state of the Anholt farm
|
||
- ci: implement farms handling trough files inside .ci-farms
|
||
- ci/docs: fixup incorrect spacing around console block
|
||
- ci/panfrost: switch panfrost-g52-piglit-gles2 from X to XWayland
|
||
- ci/fastboot: use gzipped Image to avoid compressing on the runner
|
||
- ci/microsoft: uploading artifacts gets stuck currently (retried)
|
||
- ci/microsoft: rename manual rules according to rest introduced rules
|
||
- ci: create manual farm rules
|
||
- ci/traces: guard DXVK and VK behind VK_DRIVER
|
||
- ci/apitrace: include version with LTO enabled
|
||
- ci/traces: print version of apps used for replaying traces
|
||
- ci: when touching farms, never run manual jobs
|
||
- ci/microsoft: partly revert rename from container-rules to manual-rules
|
||
- ci/x86: Build ANGLE for testing layering on VK drivers.
|
||
- ci/amd: switch all possible jobs from X11 to Wayland
|
||
- ci/freedreno: switch a630_{piglit,skqp} and a618_gl to Weston
|
||
- ci/freedreno: re-enable a530 as it's now stable with multiple skips
|
||
- ci/freedreno: document number of a630 devices available
|
||
- ci/freedreno: add KHR-GL46.buffer_storage flakes
|
||
- ci/freedreno: add execution@varying-struct-copy-return-vs flake
|
||
- ci/container: add weston into Vulkan container
|
||
- ci/container: we need to keep the wine inside
|
||
- ci/traces: switch from xvfb to Weston XWayland
|
||
- ci/freedreno: another batch of a530 flakes
|
||
- ci: add quirk for GitLab assuming changes is always true for scheduled runs
|
||
- ci/microsoft: when re-enabling Windows Farm, always run the container
|
||
- ci: disable Material Testers.x86_64_2020.04.08_13.38_frame799.rdc trace
|
||
- ci/amd: fix timeouting radeonsi-raven-va-full job
|
||
- ci: add perfetto into mesa git-cache
|
||
- ci/deqp: really remove the uncompressed results.csv file
|
||
|
||
David Redondo (1):
|
||
|
||
- egl/wayland: fix oob buffer access during buffer_fds clean up
|
||
|
||
David Rosca (7):
|
||
|
||
- radeonsi: Use DIV_ROUND_UP instead of ALIGN_POT
|
||
- frontends/va: Init view_resources array in vlVaPut/GetImage
|
||
- frontends/va: Ignore requested size when creating VAEncCodedBufferType
|
||
- Revert "radeonsi/vcn: add an exception of field case for h264 decoding"
|
||
- frontends/va: Flush after unmapping VAImageBufferType
|
||
- frontends/va: Process VAEncSequenceParameterBufferType first in vaRenderPicture
|
||
- frontends/va: Set default rate control values once when creating encoder
|
||
|
||
Derek Foreman (1):
|
||
|
||
- vulkan/wsi: Allow binding presentation_timing when software rendering
|
||
|
||
Diederik de Haas (1):
|
||
|
||
- treewide: spelling fixes
|
||
|
||
Dmitry Baryshkov (3):
|
||
|
||
- freedreno/registers: updte HDMI registers to include CEC details
|
||
- freedreno/registers: add bitfield for DSI wide bus enablement
|
||
- tu: Pass real size of prime buffers to allocator
|
||
|
||
Dmitry Osipenko (4):
|
||
|
||
- iris/bufmgr: Use intel_ioctl() helper for GEM_SET_TILING
|
||
- intel/dev: Use intel_ioctl() helper for GEM_SET_TILING
|
||
- anv: Use intel_ioctl() helper for GEM_SET_TILING
|
||
- hasvk: Use intel_ioctl() helper for GEM_SET_TILING
|
||
|
||
Dmitry Rogozhkin (1):
|
||
|
||
- meson/vaon12: fix driver file name for mingw build
|
||
|
||
Donald Robson (2):
|
||
|
||
- pvr: Move heap initialisation out of pvr_winsys_helper.
|
||
- pvr: Rename rogue_fw.xml -> rogue_kmd_stream.xml.
|
||
|
||
Dor Askayo (3):
|
||
|
||
- meson: add feature option for use of system Clang headers at runtime
|
||
- ci: Disable "opencl-external-clang-headers" when "microsoft-clc" is enabled
|
||
- nouveau: add exported GEM handles to the global list
|
||
|
||
Dr. David Alan Gilbert (4):
|
||
|
||
- rusticl/screen: Wrap get_timestamp
|
||
- rusticl/device: Stash timestamp availability
|
||
- rusticl/api: Implement get_{device_and\_}host_timer
|
||
- rusticl/api: Wire up CL_DEVICE_PROFILING_TIMER_RESOLUTION
|
||
|
||
Dylan Baker (57):
|
||
|
||
- docs: add release notes for 23.0.1
|
||
- docs: Add sha256 sum for 23.0.1
|
||
- docs: add release notes for 23.0.2
|
||
- docs: Add sha256 sum for 23.0.2
|
||
- docs: add release notes for 23.0.3
|
||
- docs: Add sha256 sum for 23.0.3
|
||
- docs: update calendar for 23.0.1
|
||
- docs: update calendar for 23.0.2
|
||
- docs: update calendar for 23.0.3
|
||
- docs: add release notes for 23.0.4
|
||
- docs: Add sha256 sum for 23.0.4
|
||
- docs: update calendar for 23.0.4
|
||
- intel/tools/error2aub: Fix potential out of bounds read
|
||
- meson: Key whether to build batch decoder on expat
|
||
- bin/pick: fix issue where None for nomination_type could fail
|
||
- bin/pick: use lineboxes to make the UI clearer
|
||
- bin/pick: Add support for adding notes on patches
|
||
- bin/pick-ui: use asyncio.new_event_loop
|
||
- meson: Add back execmem option as a deprecated option
|
||
- VERSION: update to 23.2.0-rc1
|
||
- docs: Update release calendar for 23.2.0-rc1
|
||
- .pick_status.json: Update to 6e87b277bde71e30c98ab9dda7bd2f2017b77ed5
|
||
- .pick_status.json: Update to 27d30fe3c0e71efd90fcfe209d8515b195b0075f
|
||
- .pick_status.json: Update to 3a8aae9e6aa526367523c58dfe5046909776be74
|
||
- .pick_status.json: Update to 59087003c4b7a4f5a6bf207f214a4c3443b9759f
|
||
- ci: mark passing zink and lima tests as expected
|
||
- docs: truncate new_features.txt
|
||
- docs: add release notes for 23.2.0
|
||
- VERSION: update to 23.2.0
|
||
- docs: Update release calendar for 23.2.0
|
||
- docs: Add sha256 sum for 23.2.0
|
||
- Revert incorrect 23.2.0 release
|
||
- VERSION: update to 23.2.0-rc2
|
||
- docs: Update release calendar for 23.2.0-rc2
|
||
- .pick_status.json: Update to e88c0770969f6ae0bfa5bea0f9d99687d257fea1
|
||
- .pick_status.json: Mark d3f26cbbe1a957b76804da44bbf5e30de2bac941 as denominated
|
||
- .pick_status.json: Update to c5a6e88c4e816ded6105b74f101528eb004e0581
|
||
- .pick_status.json: Update to 088c2bbd51a48eb0de1e9fd23c529759585bad59
|
||
- .pick_status.json: Update to 088c2bbd51a48eb0de1e9fd23c529759585bad59
|
||
- VERSION: update to 23.2.0-rc3
|
||
- docs: Extend calendar entries for 23.2 by 2 releases.
|
||
- docs: update calendar for 23.2.0-rc3
|
||
- .pick_status.json: Update to 10e75aae1bddee9795b1ff04ffd656b0da79b5b5
|
||
- .pick_status.json: Updates notes for aebe58458611e0bb585a5bce8e16c1175783f3cc
|
||
- .pick_status.json: Updates notes for f8cb0d8a44afb9c70f38e359ffe0ad57416e66a4
|
||
- Revert "Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR""
|
||
- .pick_status.json: Updates notes for 93b4f200dead198e680991a1e95bf3d3b58f87bd
|
||
- .pick_status.json: Updates notes for 7e246f7f2bde0c859269c4b81505bd0887045e7b
|
||
- .pick_status.json: Updates notes for 9865e5dff49395543da4331a943ba5a03ce6a413
|
||
- .pick_status.json: Update to 1cdc4be14b66108ae0e8069686ac3efe52bef3cb
|
||
- .pick_status.json: Updates notes for b8ea9724fa5ca38620bc0cdc01b7addd05574954
|
||
- .pick_status.json: Updates notes for 68027bd38e134f45d1fe8612c0c31e5379ed7435
|
||
- VERSION: update to 23.2.0-rc4
|
||
- docs: Update release calendar for 23.2.0-rc4
|
||
- .pick_status.json: Mark fa6562b239f00f9f72c988459e252bdee072fd73 as denominated
|
||
- .pick_status.json: Update to f4fecdad724edf8187d22928ed844af7fd84654d
|
||
- zink/ci: mark unexpcted pass as expected
|
||
|
||
Emma Anholt (124):
|
||
|
||
- zink: Avoid infinite loop finding no var in update_so_info.
|
||
- ci/crocus: Update checksum for STK.
|
||
- symbol_table: Store the symbol name in the same allocation as the symbol entry.
|
||
- symbol_table: Don't maintain the HT as we're destroying the table.
|
||
- symbol_table: Don't bother resetting the key on popping scope.
|
||
- symbol_table: Prehash the key on insert, and reuse the entry on shadowing.
|
||
- tu/perfetto: Refactor code out of the macro, to stage_end.
|
||
- tu/perfetto: Clean up an extra token paste to just use the arg being passed.
|
||
- tu/perfetto: Use tu_CmdBeginDebugUtilsLabelEXT as a stage event in perfetto.
|
||
- tu/perfetto: Drop unused arg to send_descriptors().
|
||
- tu/perfetto: s/MRTs/attachment_count/ in traces.
|
||
- anv: Only enable GPL if ANV_GPL=true, or if zink or DXVK are the engine.
|
||
- anv: Refactor repeated pipeline creation feedback output code.
|
||
- ci/lvp: Update sanctuary trace hash.
|
||
- ci/radv: Demote navi21 to manual until recent flakiness resolves.
|
||
- ci/zink+tu: Drop some intermittently failing a630 traces.
|
||
- ci/freedreno: Drop portal-2-v2 trace.
|
||
- ci/radv: Add known flakes for #8817
|
||
- ci: Crank up the yamllint line length limit.
|
||
- ci/freedreno: Demote a530 to manual again.
|
||
- ci: Make a variable for the repeated rootfs directory name.
|
||
- ci: Add the Vulkan validation layer to amd64 rootfs builds.
|
||
- ci/zink: Re-enable traces now that !20319 has landed.
|
||
- ci: Move zink's validation layer setup to deqp-runner.sh.
|
||
- ci/zink: Enable the validation layer on the TGL GL46 run.
|
||
- blob: Don't valgrind assert for defined memory if we aren't writing.
|
||
- util/log: Fix log messages over 1024 characters.
|
||
- ci: Move some timeout xfails to skips.
|
||
- ci/deqp: Update to 1.3.5.1 and pull in additional bugfixes from main.
|
||
- ci/zink: Drop anv/lvp validation exceptions that should be fixed in the CTS.
|
||
- ci/valve: Add a workaround for finding libdrm on navi21s.
|
||
- ci/panfrost: Drop tex3d-maxsize on g52.
|
||
- ci/lima: Skip ppgtt_memory_alignment that flaked a job with the oomkiller.
|
||
- ci/crocus: Note a recent regression.
|
||
- ci/zink: Try to update TGL results for new MSAA behavior.
|
||
- vulkan: Handle alignment failure in the pipeline cache.
|
||
- vulkan: Actually increment the count of objects in GetPipelineCacheData.
|
||
- Revert "ci/zink: Try to update TGL results for new MSAA behavior."
|
||
- ci/zink: Update more xfails for tgl piglit.
|
||
- ci/zink+anv: Test piglit quick_gl pre-merge, dropping a few KHR-GL46 tests.
|
||
- ci/radeonsi: Mark glx-make-current as flaky.
|
||
- ci/radv: Disable flaky heaven d3d9 trace.
|
||
- ci/turnip: Drop an xfail from the full run for a recent fix.
|
||
- ci/turnip: Drop the IUB bug fallout flakes.
|
||
- mesa: Fix debug logging of fp compile compare func.
|
||
- mesa: Fix precompile of GLSL programs with shadow samplers.
|
||
- zink: Explain some of the current pathway for shadow sampling.
|
||
- zink: Fix silly void * type in rewrite_tex_dest.
|
||
- zink: Don't flag legacy_shadow_mask for RED-only reads in the shader.
|
||
- ci: Re-enable some piglit tests that should be fast enough post-uprev.
|
||
- ci/zink+anv: Skip a couple more long tests pre-merge.
|
||
- compiler: Update reference to name_for_stage func.
|
||
- nir: Add helpers for lazy var creation.
|
||
- drm-shim: Avoid assertion fail if someone does close(-1).
|
||
- glsl: Allow invariant flags on sysvals, such as gl_PointCoord.
|
||
- nir/lower_texcoord_replace: Flag SYSTEM_VALUE_POINT_COORD read when we load it.
|
||
- zink: Use PIPE_CAP_FS_POINT_IS_SYSVAL.
|
||
- mesa: Use find_state_var in lower_builtin.
|
||
- nir: Use find_state_var in lower_atomics_to_ssbo.
|
||
- nir,mesa: Add helpers for creating uniform state variables.
|
||
- mesa: Move ATI_fragment_shader fog code emit to a NIR lowering pass.
|
||
- mesa/ARB_fp: Drop an extra enum for fog mode.
|
||
- mesa/ARB_fp: Use the NIR pass for adding fog code instead of ARB instrs.
|
||
- mesa: Move ARB_vp position invariant option handling to NIR.
|
||
- mesa: Drop ARB program helper functions that are no longer used.
|
||
- mesa: Drop unused control flow instructions for ARB programs.
|
||
- mesa: Drop remaining unused ARB program instructions.
|
||
- mesa: Move st_prog_to_nir_postprocess out of prog_to_nir.
|
||
- mesa/ati_fs: Move sampler dim adjustment to a separate NIR pass.
|
||
- mesa/ati_fs: Move NIR translation to ATI_fs compile time.
|
||
- mesa/ati_fs: Move prog->SamplersUsed/TexturesUsed setup to EndFragmentShader.
|
||
- mesa: Use the NIR pass for fixed function fog.
|
||
- mesa/ffvs: Fix mvp_with_dp4 position transformation.
|
||
- mesa: Use shared NIR code for ARB_vp and FF VS position transformation.
|
||
- ci/freedreno: Update minetest hash.
|
||
- Revert "ci: disable anholt's farm"
|
||
- crocus: Fix regression from !20153
|
||
- ci/crocus: Add a missing xfail.
|
||
- ci/turnip: Update full-run xfails.
|
||
- tu: Ignore unused shader stages in pipeline library creation.
|
||
- anv: Drop unused ALL_GRAPHICS_LIB_FLAGS.
|
||
- ci/crocus: Update trace hash for the neverball regression.
|
||
- ci/etnaviv: Update some xfails common between the last 3 nightly runs.
|
||
- v3d: Respect nir_intrinsic_store_output's write_mask.
|
||
- mesa: Emit full output write in st_pbo_create_vs().
|
||
- mesa: Port the pbo.use_gs path to NIR and let it get used on NIR drivers.
|
||
- softpipe: Drop the use_tgsi debug flag.
|
||
- llvmpipe: Drop the LP_DEBUG=tgsi_ir debug option.
|
||
- virgl: Drop the VIRGL_DEBUG=use_tgsi debug var.
|
||
- r600: Drop docs for use_tgsi debug var.
|
||
- r300: Drop RADEON_DEBUG=use_tgsi.
|
||
- nouveau: Delete the NV50_PROG_USE_TGSI env var.
|
||
- svga: Switch to preferring NIR by default.
|
||
- nine: Drop the nir_vs/nir_ps env vars.
|
||
- gallium: Drop PIPE_SHADER_CAP_PREFERRED_IR.
|
||
- mesa/drawtex: Cut out the TGSI semantic translation.
|
||
- svga: Stop asserting that compute params are queried against TGSI.
|
||
- mesa: Always query our compute params against IR_NIR.
|
||
- mesa: Drop TGSI token handling
|
||
- mesa: Simplify st_get_nir_compiler_options().
|
||
- mesa: Drop dead TGSI serialization prototypes.
|
||
- mesa/atifs: Rename the header guard.
|
||
- mapi: clang-format _glapi_add_dispatch().
|
||
- mapi: Delete dynamic stub generation.
|
||
- mesa: Drop the function parameter spec from the remap table.
|
||
- mapi: Clean up mapi_stub struct.
|
||
- mesa: Drop the aliases from the remap table.
|
||
- mapi: Drop the unused_functions table.
|
||
- mapi: Delete execmem support code.
|
||
- intel: Count reads_remaining across all blocks.
|
||
- intel: Allocate the last_grf_write once per scheduler.
|
||
- intel: Reduce cost of resetting last_grf_write.
|
||
- ci/zink: Update current xfails on tgl.
|
||
- ci: Update to vulkan-cts-1.3.5.2 (and pull in some more fixes).
|
||
- ci: Drop skips for some previously-invalid CTS tests.
|
||
- ci: Drop some skips of GL CTS ArraysOfArrays tests.
|
||
- ci/anv: Make anv-manual-rules actually manual on anv-only changes.
|
||
- ci: Clean up .intel-rules definition.
|
||
- ci/amd: Report flakes to #amd-ci on OFTC.
|
||
- ci/anv: Add testing of the GLES CTS using ANGLE on TGL.
|
||
- ci/radv+radeonsi: Fix the combo rules to include core vulkan changes.
|
||
- ci/radv: Add testing of the GLES CTS using ANGLE on stoney.
|
||
- ci/tu: Drop some xfails for !24086
|
||
- disk_cache: Disable the "List" test for RO disk cache.
|
||
|
||
Eric Engestrom (134):
|
||
|
||
- VERSION: bump to 23.2
|
||
- docs: reset new_features.txt
|
||
- v3d: add flake spec@ext_framebuffer_blit@fbo-sys-sub-blit
|
||
- ci: stop removing -x11 suffix for x11 build of deqp-egl
|
||
- ci: add -android suffix for android build of deqp-egl
|
||
- ci: move deqp-egl instead of copying it
|
||
- ci: start documenting which image tags need to be bumped
|
||
- ci: bump tags
|
||
- ci: update shebang to make it more portable
|
||
- broadcom/ci: deduplicate script definition
|
||
- v3dv/ci: drop fixed failure from fails.txt
|
||
- amd: fix buggy usage of unreachable()
|
||
- compiler: fix buggy usage of unreachable()
|
||
- pvr: fix buggy usage of unreachable()
|
||
- vk/util: fix buggy usage of unreachable()
|
||
- util: enforce unreachable()'s argument being a literal string
|
||
- egl: inline driver.GetProcAddress() as it's always _glapi_get_proc_address()
|
||
- ci: rework vulkan validation layer build script
|
||
- v3d: document that \`V3D_DEBUG=shaderdb` is \*not* for shader-db
|
||
- v3d: fix tfu_supports_tex_format() param type, and document why
|
||
- v3d: fix various minor issues in gen_pack_header.py
|
||
- dzn: fix pointer type mismatch
|
||
- ci: bump bin/ci/ deps to support python 3.11
|
||
- ci: drop GENERATE_ENV_SCRIPT
|
||
- ci: stop marking environment variable list as executable
|
||
- ci: replace write + cat with tee
|
||
- ci: disable anholt's farm
|
||
- ci: only execute capture-devcoredump.sh when it's present
|
||
- util/bitset: ensure the sets compared have the same size at compile time
|
||
- docs: add release notes for 23.1.0
|
||
- docs: update calendar for 23.1.0
|
||
- ci/b2c: increase timeout to 5 minutes
|
||
- ci/amd: don't override the b2c timeout in the steamdeck config
|
||
- ci/zink: add new zink-radv-navi10-valve flakes
|
||
- mailmap: update @mupuf's name
|
||
- docs: fix release date of 23.1.0
|
||
- ci/zink: document new zink-radv-navi10-valve failures
|
||
- v3dv: fix align() computation for pixel formats with non-POT block sizes
|
||
- docs: update calendar for 23.1.1
|
||
- docs: add release notes for 23.1.1
|
||
- docs/relnotes: add sha256sum for 23.1.1
|
||
- ci_run_n_monitor: add ability to specify the pipeline to use, instead of auto-detecting it
|
||
- ci/amd: move AMD-specific LD_PRELOAD to AMD config
|
||
- ci/amd: only define AMDGPU_GPU_ID for the duration of the call
|
||
- bin/ci: fix mistakenly hardcoded repo name in get_gitlab_project()
|
||
- ci/intel: reuse iris_file_list instead of copying its definition
|
||
- meson: simplify another "any of" check
|
||
- wsi/display: drop unused parameters from local functions
|
||
- ci: split clang-format list of folders for easier maintenance
|
||
- ci: show diff when clang-format check fails
|
||
- panfrost: fix formatting of a couple of files that were missed
|
||
- panfrost: rename \*.cc files to \*.cpp
|
||
- ci/zink+radv: fix flakes definition
|
||
- ci/zink+radv: mark all spec@arb_copy_image@arb_copy_image-targets* as flaky after getting a bunch more of them
|
||
- ci/zink+radv: document recent regressions
|
||
- ci: color the diff for clang-format
|
||
- meson: enable the clang-format target
|
||
- ci: use meson to run clang-format
|
||
- docs: document clang-format and how to use it
|
||
- docs/calendar: add 23.2 branchpoint and release candidates
|
||
- ci/zink+radv: mark flakes as such
|
||
- ci/radv: fix flakes definition
|
||
- ci/crocus: fix flakes definition
|
||
- ci/zink+anv: fix flakes definition
|
||
- ci/b2c: also detect non-soft GPU hangs with AMDGPU
|
||
- amd/ci: run gl(es) cts & piglit on radeonsi on vangogh
|
||
- ci/radv: update expectations
|
||
- ci/zink+radv: update expectations
|
||
- docs/relnotes/23.1.1: clear "new features"
|
||
- docs: add release notes for 23.1.2
|
||
- docs/relnotes: add sha256sum for 23.1.2
|
||
- docs: update calendar for 23.1.2
|
||
- egl: return correct error for EGL_KHR_image_pixmap
|
||
- clang-format: add explanation for anyone reading .clang-format-include
|
||
- radv,aco: tweaks to get clang-format to print nicer code
|
||
- radv: reformat according to its .clang-format
|
||
- aco: reformat according to its .clang-format
|
||
- ci: enforce formatting for RADV & ACO
|
||
- radv: fix formatting
|
||
- Revert "ci: remove clang-format testing"
|
||
- asahi: drop unnecessary DRM_FORMAT_MOD_{LINEAR,INVALID} fallbacks
|
||
- ci: mark the valve farm as down
|
||
- docs/ci: fix command to disable/re-enable farms
|
||
- docs: add release notes for 23.1.3
|
||
- docs/relnotes: add sha256sum for 23.1.3
|
||
- docs: update calendar for 23.1.3
|
||
- docs/coding-style: add example vim config for clang-format
|
||
- docs/coding-style: add example emacs config for clang-format
|
||
- docs/coding-style: add pre-commit hook fallback for clang-format
|
||
- v3dv: replace boolean and uint with bool and size_t
|
||
- amd/ci: add another dEQP-VK.multiview.renderpass2.multisample.* flake
|
||
- amd/ci: add another dEQP-VK.dynamic_rendering.primary_cmd_buff.basic.* flake
|
||
- ci: split valve farm in two
|
||
- util/disk_cache: fix ~/.cache/ permissions
|
||
- panfrost/ci: drop invalid skips that are already marked as known flakes
|
||
- intel/ci: fix skips definitions
|
||
- etnaviv/ci: fix skips definition
|
||
- zink/ci: fix skips & flakes for zink+radv on vangogh & navi10
|
||
- docs/codingstyle: fix clang-format command
|
||
- vc4/ci: fix skipping of gles3 piglit tests
|
||
- v3dv/ci: fix skipping of vk tests
|
||
- v3dv/ci: skip more tests that are timing out
|
||
- virgl/ci: fix skips definition
|
||
- clang-format: add egl foreach macro
|
||
- clang-format: add wayland foreach macros
|
||
- egl: change a couple of clang-format settings
|
||
- egl: add a few trailing commas
|
||
- egl: protect the formatting in a couple of places
|
||
- egl: prevent clang-format from reordering some headers
|
||
- egl: re-format using clang-format
|
||
- clang-format: enforce formatting of egl
|
||
- add initial .git-blame-ignore-revs
|
||
- ci/zink+radv: document another flake
|
||
- ci/zink+radv: fix flake definition
|
||
- ci: document workflow rules
|
||
- ci: set priority:low tag only on non-Marge pipelines
|
||
- ci: fix .valve-farm-manual-rules
|
||
- ci: split farm rules out of test-source-dep.yml
|
||
- etnaviv/ci: drop duplicate line in etnaviv files list
|
||
- broadcom/ci: add the renderonly folder to things that can affect v3d & vc4
|
||
- meson: clarify description of \`opengl` option
|
||
- meson: clarify what "off-screen rendering" means
|
||
- ci: avoid running hardware jobs if there are already trivial issues
|
||
- ci: avoid running hardware jobs if lint fails - now on LAVA too!
|
||
- ci: avoid running hardware jobs if lint fails - now on Windows too!
|
||
- bin/ci_run_n_monitor: get git sha from pipeline if specified, instead of requiring --rev to match
|
||
- panfrost: upcast uint8/uint16 before shifting them beyond their range
|
||
- vc4: drop duplicate .lower_ldexp
|
||
- zink: fix format in zink_make_{image,texture}_handle_resident()
|
||
- v3dv: fix VK_PIPELINE_ROBUSTNESS_{BUFFER,IMAGE}_BEHAVIOR_DEVICE_DEFAULT_EXT copy/paste typo
|
||
- v3dv: fix copy/pasted type of \`sample`
|
||
- v3dv: fix shader stage name in error message
|
||
- v3d/qpu: fix type of function argument
|
||
- ci/farm-rules: fix missing valve-infra jobs in scheduled pipelines
|
||
|
||
Erico Nunes (6):
|
||
|
||
- Revert "ci: disable lima farm, currently out-of-space, needs to be fixed"
|
||
- lima: fix stringop-overflow warning
|
||
- lima/ci: temporarily disable deqp-egl tests due to timeouts
|
||
- ci: temporarily disable lima farm
|
||
- ci: restore lima farm
|
||
- lima: fix plbu block stride calculation
|
||
|
||
Erik Faye-Lund (144):
|
||
|
||
- nir: remove nir_state_slot::swizzle
|
||
- glsl: remove ir_state_slot::swizzle
|
||
- docs: renderpass -> render pass
|
||
- docs: statechanges -> state changes
|
||
- docs: backfacing -> back-facing
|
||
- docs: codepath -> code-path
|
||
- docs: did't -> didn't
|
||
- docs: cma -> CMA
|
||
- docs: Anv -> ANV
|
||
- docs: perfetto -> Perfetto
|
||
- docs: use correct tick for "doesn't"
|
||
- docs: vlan -> VLAN
|
||
- docs: toplevel -> top-level
|
||
- docs: correct spelling of "source"
|
||
- docs: correct spelling of "tagged"
|
||
- docs: correct spelling of "frame"
|
||
- docs: sort extensions
|
||
- docs: add custom html theme
|
||
- docs: add bootstrap extension
|
||
- docs: translate admonitions into bootstrap alerts
|
||
- docs: remove support for old sphinx-versions
|
||
- docs: use custom html theme
|
||
- nir: clean up white-space in deref-printing
|
||
- mesa/main: clean up white-space in ffvertex_prog.c
|
||
- mesa/main: drop disasm-code from ffvertex_prog.c
|
||
- mesa/main: allow passing nir-shaders to st_program_string_notify
|
||
- mesa/main: make ffvertex output nir
|
||
- nir: fix constant-folding of 64-bit fpow
|
||
- docs: fix edit-links
|
||
- mesa/main: drop use_legacy_math_rules
|
||
- llvmpipe: fixup refactor copypasta
|
||
- docs: fixup About Mesa3D.org link
|
||
- docs/tgsi: fix up indent
|
||
- docs/tgsi: fix bad latex
|
||
- docs/tgsi: fixup bad latex
|
||
- docs/tgsi: wrap overly long lines
|
||
- docs/tgsi: use math-notations for conditionals
|
||
- docs/tgsi: do not use math-block for non-latex
|
||
- docs/tgsi: fixup latex for TEX and TEX2
|
||
- docs/tgsi: use \\ll and \\gg for left and right shift
|
||
- aux/draw: check for lines when setting clipping-mode
|
||
- zink: fix bad indent
|
||
- zink: clean up tcs_vertices_out_word handling
|
||
- zink: do not open-code memcpy
|
||
- aco: use c++17
|
||
- meson: remove needless c++17-overrides
|
||
- mesa/main: clean up white-space in ff_fragment_shader.cpp
|
||
- mesa/st: refactor st_translate_fragment_program
|
||
- mesa/st: allow using nir for ff-fragment shaders
|
||
- compiler/nir: move find_state_var to common code
|
||
- mesa/main: ff-fragshader to nir
|
||
- mesa/main: compile ff_fragment_shader as c-code
|
||
- mesa/program_cache: remove unused shader-cache functions
|
||
- panfrost: expose PIPE_CAP_POLYGON_OFFSET_CLAMP
|
||
- util: mark externally-unused functions as static
|
||
- nir: use more nir_fmul_imm
|
||
- nir: use more nir_fadd_imm
|
||
- nir: fsub -> fadd_imm
|
||
- nir: use more nir_ffma_imm variants
|
||
- nir: add nir_fsub_imm
|
||
- nir: use nir_fsub_imm
|
||
- radeonsi,radv: use nir_format_linear_to_srgb
|
||
- docs: explicitly mark extensions as obsolete
|
||
- docs: mark MESA_multithread_makecurrent as obsolete
|
||
- docs: mark MESA_shader_debug as obsolete
|
||
- docs: mark MESA_swap_frame_usage as obsolete
|
||
- docs: mark MESA_texture_array as obsolete
|
||
- docs: move obsolete extensions to their own list
|
||
- zink: update profiles schema
|
||
- zink: keep gl46_optimal extensions/features sorted
|
||
- zink: compute correct location for line-smooth gs
|
||
- zink: do not lower line-smooth for non-lines
|
||
- docs: increase contrast in dark-theme
|
||
- zink: update profiles schema
|
||
- d3d12, dozen: make sure we pass float to fge
|
||
- nir: use nir_i{ne,eq}_imm helpers
|
||
- nir: generate nir_{cmp}_imm variants
|
||
- nir: use generated immediate comparison helpers
|
||
- nir: add nir_[fui]gt_imm and nir_[fui]le_imm helpers
|
||
- nir: use new immediate comparison helpers
|
||
- mesa/st: use nir_imm_vec4
|
||
- nir: use more imm-helpers
|
||
- nir: isub -> iadd_imm
|
||
- nir: use nir_imm_{true,false}
|
||
- nir: add and use nir_fdiv_imm
|
||
- nir: add and use nir_imod_imm
|
||
- nir: add missed nir_cmp_imm-helpers
|
||
- docs: upgrade bootstrap to 5.3.0
|
||
- cso: use enum for render-conditions
|
||
- draw: use enum for tgsi-semantic
|
||
- draw: use uint32_t instead of uint
|
||
- draw: use enum for primitive-type
|
||
- draw: track vertices and vertex_ptr as byte-pointers
|
||
- draw: use stdint.h types
|
||
- cso: use unsigned instead of uint
|
||
- draw: match type of pipe_draw_start_count_bias::count
|
||
- draw: use unsigned instead of uint
|
||
- aux/indices: use stdint.h types
|
||
- draw/i915: move hwfmt array to i915 specific struct
|
||
- microsoft/compiler: use nir_imm_zero
|
||
- mesa/st: use nir_ineg
|
||
- vulkan: avoid needless constant-folding
|
||
- broadcom/compiler: use imm-helpers
|
||
- v3dv: use imm-helpers
|
||
- pan: use imm-helpers
|
||
- freedreno: use imm-helpers
|
||
- r600/sfn: use imm-helpers
|
||
- d3d12: use imm-helpers
|
||
- radeonsi: use imm-helpers
|
||
- vc4: use imm-helpers
|
||
- intel: use imm-helpers
|
||
- anv: use imm-helpers
|
||
- hasvk: use imm-helpers
|
||
- mesa/st: use imm-helpers
|
||
- amd: use imm-helpers
|
||
- etnaviv: use imm-helpers
|
||
- gallium: use imm-helpers
|
||
- nir: use imm-helpers
|
||
- math: fix indentation in m_matrix.[ch]
|
||
- math: remove unused defines
|
||
- math: drop MAT_[ST][XYZ] defines
|
||
- aux/trace: use stdint.h types
|
||
- pipebuffer: use unsigned instead of uint
|
||
- gallivm: use unsigned instead of uint
|
||
- aux/pp: use unsigned instead of uint
|
||
- aux/util: use enum for render-condition
|
||
- aux/util: match type of pipe_draw_start_count_bias::start/count
|
||
- aux/util: use enum for primitive-type
|
||
- aux/util: use unsigned instead of uint
|
||
- aux/util: use stdint.h types
|
||
- aux/util: uint -> unsigned
|
||
- tgsi: use enum instead of defines
|
||
- tgsi: use stdint.h types
|
||
- tgsi: use enum for tgsi-file type
|
||
- tgsi: use enum for property-name
|
||
- tgsi: use enum for shader-type
|
||
- tgsi: use enum for interpolate-mode
|
||
- tgsi: uint -> uint32_t
|
||
- tgsi: uint -> unsigned
|
||
- nir: constify intrin
|
||
- nir: use nir_intrinsic_get_var
|
||
- radv: do not rely on constant-folding
|
||
- nir: do not needlessly rely on optimizations
|
||
- panfrost: delete stale editorconfig file
|
||
|
||
Faith Ekstrand (16):
|
||
|
||
- nouveau/nir: image_samples/size don't have coordinates
|
||
- vulkan: Document vk_physical_device::supported_features
|
||
- nir/opt_if: Use block_ends_in_jump
|
||
- nir: Add a reg_intrinsics flag to nir_convert_from_ssa
|
||
- nir/from_ssa: Make additional assumptions in coalescing
|
||
- nir/from_ssa: Support register intrinsics
|
||
- freedreno/ci: Update pixmark piano checksums
|
||
- nv50/ir: Support vector movs
|
||
- nir: Properly handle divergence for load_reg
|
||
- nir/trivialize: Maintain divergence information
|
||
- nir/trivialize: Trivialize cross-block loads
|
||
- Revert "mesa, compiler: Move gl_texture_index to glsl_types.h"
|
||
- Revert "compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h"
|
||
- nir: Handle nir_op_mov properly in opt_shrink_vectors
|
||
- nir: Don't handle nir_op_mov in get_undef_mask in opt_undef
|
||
- nir: Fix metadata in nir_lower_is_helper_invocation
|
||
|
||
Felix DeGrood (19):
|
||
|
||
- anv: disable reset query pools using blorp opt on MTL
|
||
- anv: Add END_OF_PIPE_SYNC reporting to INTEL_DEBUG=pc
|
||
- anv: Add flush reasons to raytracing flushes
|
||
- anv: Add flush reason to NEEDS_END_OF_PIPE_SYNC
|
||
- anv: split INTEL_MEASURE multi events
|
||
- intel: INTEL_MEASURE cpu mode
|
||
- anv: Enable INTEL_MEASURE=cpu
|
||
- iris: Enable INTEL_MEASURE=cpu
|
||
- docs: add INTEL_MEASURE=cpu
|
||
- intel/debug: Control start/stop frame of batch debug
|
||
- anv: Enable INTEL_DEBUG_BATCH_FRAME_START/_STOP
|
||
- iris: Enable INTEL_DEBUG_BATCH_FRAME_START/_STOP
|
||
- docs: Add INTEL_DEBUG_BATCH_FRAME_START/_STOP
|
||
- anv: fix INTEL_MEASURE on MTL
|
||
- anv: re-enable RT data in INTEL_MEASURE
|
||
- intel: refactor INTEL_MEASURE pointer dumping
|
||
- intel: batch consecutive dispatches into implicit renderpasses
|
||
- intel: Secondary CB print primary CB's renderpass
|
||
- anv: override vendorID for Cyberpunk 2077
|
||
|
||
Feng Jiang (3):
|
||
|
||
- frontends/va: Fix memory leak of decrypt_key
|
||
- radeonsi/vcn: Remove unnecessary type conversion
|
||
- virgl/video: Fix out-of-bounds access in fill_mpeg4_picture_desc()
|
||
|
||
Filip Gawin (5):
|
||
|
||
- nine: add fallback for D3DFMT_D16 in d3d9_to_pipe_format_checked
|
||
- glx: fix build with APPLEGL
|
||
- ac/nir: fix slots in clamping legacy colors
|
||
- anv: allow intel_clflush_range only on igpu
|
||
- crocus: Avoid fast-clear with incompatible view
|
||
|
||
Francisco Jerez (3):
|
||
|
||
- anv: Fix calculation of guardband clipping region.
|
||
- intel/gfx12.5: Enable L3 partial write merging for compressible surfaces among other cases.
|
||
- anv: Swap ordering of memory types on non-LLC platforms to work around application bugs.
|
||
|
||
Frank Binns (7):
|
||
|
||
- pvr: add missing explicit check against VK_SUCCESS
|
||
- pvr: use util_dynarray_begin() in more places
|
||
- pvr: replace transfer EOT binary shaders with run-time compiled shaders
|
||
- pvr: fix typo in pvr_rt_get_region_headers_stride_size()
|
||
- pvr: fix array overflow in pvr_device_tile_buffer_ensure_cap()
|
||
- pvr: fix invalid read reported by valgrind
|
||
- pvr: skip setting up SPM consts buffer when no const shared regs are used
|
||
|
||
Friedrich Vock (41):
|
||
|
||
- radv/rmv: Fix creating RT pipelines
|
||
- radv/rmv: Fix import memory
|
||
- radv/rt: Plug some memory leaks during shader creation
|
||
- radv: Don't leak the RT prolog binary
|
||
- radv: Hash pipeline libraries separately
|
||
- radv: Always call si_emit_cache_flush before writing timestamps
|
||
- radv: Add driconf to always drain waves before writing timestamps
|
||
- nir: Rematerialize derefs in use blocks before repairing SSA
|
||
- nir: Remove unnecessary assert in nir_before_src
|
||
- radv: Disable capture/replay handles
|
||
- aco: Lower divergent bool phis iteratively
|
||
- radv: Always flush before writing acceleration structure properties
|
||
- aco: Reset scratch_rsrc on blocks without predecessors
|
||
- aco: Fix live_var_analysis assert
|
||
- aco: Fix assert in insert_exec_mask
|
||
- radv: Add driconf to force wave64 for RT
|
||
- radv: Add RADV_DEBUG=nort
|
||
- radv: Enable ray tracing pipelines by default
|
||
- radv: Add the BOs of all shaders in a RT pipeline
|
||
- radv: Add radv_shader_free_list
|
||
- radv: Move shader arena allocation to a separate function
|
||
- radv: Add option to allocate shaders in replayable VA range
|
||
- radv: Add utilities to serialize and deserialize shader allocation info
|
||
- radv: Add radv_shader_reupload
|
||
- radv: Break up radv_shader_nir_to_asm
|
||
- radv: Split up implementation of radv_shader_create
|
||
- radv: Add support for creating capture/replay shaders
|
||
- radv: Add radv_rt_capture_replay_handle
|
||
- radv/rt: Only compare the non-recursive capture/replay handle
|
||
- radv/rt: Associate capture/replay handles with stages
|
||
- radv/rt: Replay shader allocations according to capture/replay handle
|
||
- radv/rt: Rework radv_GetRayTracingCaptureReplayShaderGroupHandlesKHR
|
||
- radv: Re-enable RT pipeline capture/replay handles
|
||
- meson: Prefix Vulkan "Ray Tracing" summary with "Intel"
|
||
- radv/ci: Skip ray tracing tests on vangogh
|
||
- Revert "radv/rt: Enable RT pipelines on GFX10_3+ excluding vangogh"
|
||
- Revert "Revert "radv: Enable ray tracing pipelines by default""
|
||
- radv/rt: Enable exact on software intersection functions
|
||
- radv/rt: Miss rays that hit the triangle's v edge
|
||
- radv: Handle VK_SUBOPTIMAL_KHR in trace layers
|
||
- nir/load_store_vectorize: Handle intrinsics with constant base
|
||
|
||
Ganesh Belgur Ramachandra (5):
|
||
|
||
- gallium/pipe: Add get_resources() to pipe_video_buffer
|
||
- gallium/vl: implementation for get_resources()
|
||
- nouveau: implementation for get_resources()
|
||
- d3d12: implementation for get_resources()
|
||
- frontends/va: use resources instead of views
|
||
|
||
Georg Lehmann (51):
|
||
|
||
- nir: lower ballot_bit_count_exclusive/inclusive to mbcnt_amd
|
||
- radv: use lower_ballot_bit_count_to_mbcnt_amd
|
||
- aco: Assert that operands have the same byte offset when reassigning split vectors
|
||
- aco: also reassign p_extract_vector post ra
|
||
- aco/vn: compare all valu modifers
|
||
- aco/optimizer: don't use pass_flags for mad idx
|
||
- aco/optimizer: copy pass flags for newly created valu instructions
|
||
- aco/assembler: support VOP3P with DPP
|
||
- aco/builder: support VOP3(P) with dpp
|
||
- aco: add assembler tests for VOP3(P) with DPP
|
||
- aco/ra: convert VOPC_DPP instructions without vcc to VOP3
|
||
- aco: use VOP3+DPP
|
||
- aco: don't apply dpp if the alu instr uses the operand twice
|
||
- aco: emit_wqm on MIMG dst, not operands
|
||
- aco: introduce helper to swap valu operands with modifiers
|
||
- aco/gfx11: use fmamk/fmaak with opsel
|
||
- aco: add withoutVOP3 helper
|
||
- aco/ra: use smaller operand stride for VOP3P with DPP
|
||
- aco/ra: use fmac with DPP/opsel on GFX11
|
||
- aco: add helper function for can_use_input_modifiers
|
||
- aco: use get_operand_size for dpp opt
|
||
- aco: use can_use_input_modifiers helper
|
||
- aco/optimizer: allow DPP to use VOP3 on GFX11
|
||
- util: fix stack dynarray used by multiple tus
|
||
- nir/opt_if: use nir_alu_instr_is_comparison directly
|
||
- aco: cleanup v_cmp_class usage
|
||
- aco: p_start_linear_vgpr doesn't always need exec mask
|
||
- aco/ir: return true in hasRegClass for Operand(reg, rc)
|
||
- aco/statistics: improve v_fma_mix dual issuing detection
|
||
- aco: use v_add_f{16,32} with clamp for fsat
|
||
- aco: use v_fma_mix for f2f32 and f2f16 on gfx11 if wave64
|
||
- aco: make validation work without SSA temps
|
||
- aco: move cfg validation to its own function
|
||
- aco: don't validate p_constaddr_addlo/p_resumeaddr_addlo operands
|
||
- aco: validate ir for prologs and after lower_to_hw_instr
|
||
- aco/opcodes: move v_cndmask_b32 back to the VOP2 list
|
||
- aco: remove v_cvt_pkrtz_f16_f32_e64 when it's actually VOP2
|
||
- aco/opcodes: delete wrong comment copy pasted from NIR
|
||
- aco: use uses helpers for pk_fma opt
|
||
- aco: combine scalar mul+pk_add to pk_fma
|
||
- aco/gfx10+: use v_cndmask with literal for reduction identity
|
||
- nir: add single bit test opcodes
|
||
- nir/lower_bit_size: mask bitz/bitnz src1 like shifts
|
||
- aco: implement nir_op_bitz/bitnz
|
||
- nir/opt_algebraic: combine bitz/bitnz
|
||
- radv: set has_bit_test for aco
|
||
- aco/optimizer: delete s_bitcmp optimization
|
||
- aco/gfx11: fix get_gfx11_true16_mask with v_cmp_class_f16
|
||
- aco: fix non constant 16bit bitnz/bitz
|
||
- aco: fix u2f16 with 32bit input
|
||
- nir/opt_algebraic: remove broken fddx/fddy patterns
|
||
|
||
George Ouzounoudis (1):
|
||
|
||
- radv: small fix for VkDescriptorSetVariableDescriptorCountLayoutSupport
|
||
|
||
Gert Wollny (98):
|
||
|
||
- r600/sfn: Lower tess levels to vectors in TCS
|
||
- r600/sfn: make sure f2u32 is lowered late and correctly for 64 bit floats
|
||
- r600: remove TGSI code path
|
||
- r600/sfn: Add a type for address registers
|
||
- r600/sfn: don't track address registers in live ranges
|
||
- r600/sfn: Handle MOVA_INT in sfn assembler
|
||
- r600/sfn/tests: Cleanup and move some code around
|
||
- r600/sfn: Add address and index registers creation to ValueFactory
|
||
- r600/sfn: Rework query for indirect access in alu instr and opt
|
||
- r600/sfn: don't allow more than one AR per instruction
|
||
- r600: Allow both index registers for all CF types
|
||
- r600/sfn: Prepare uniforms and local arrays for better address handling
|
||
- r600/sfn: handle AR and IDX register in shader from string
|
||
- r600/sfn: add method to update indirect address to all instrution types
|
||
- r600/sfn: Add function to insert op in block
|
||
- r600/sfn: Update resource based instruction index mode check
|
||
- r600/sfn: Be able to track expected AR uses
|
||
- r600/sfn: AR and IDX don't need the write flag, but haev a parent
|
||
- r600/sfn: Add a RW get function of IF predicate access
|
||
- r600/sfn: Add interface to count AR uses in ALU op
|
||
- r600/sfn: Add pass to split addess and index register loads
|
||
- r600/sfn: Add function to check whether a group loads a index register
|
||
- r600/sfn: take address loads into account when scheduling
|
||
- r600/sfn: Add more tests and update to use address splits
|
||
- r600/sfn: Don't copy-propagate indirect access into LDS instr
|
||
- r600/sfn: Add test for multiple index load
|
||
- r600/sfn: set CF force flag always when starting a new block
|
||
- r600/sfn: Start a new ALU CF on index use, not on index emission
|
||
- r600/sfn: Add chip family to shader class
|
||
- r600/sfn: Add handling for R600 indirect access alias handling
|
||
- r600/sfn: Override Array access handling in backend assembler
|
||
- r600/sfn: Fix copy-prop with array access
|
||
- r600/sfn: scheduled instructions are always ready
|
||
- r600/sfn: Add more tests and update to use address splits
|
||
- r600/sfn: print failing block when scheduling fails
|
||
- r600/sfn: Can't use an indirect array access as source to AR load
|
||
- r600/sfn: factor out index loading for non-alu instructions
|
||
- r600/sfn: prepare for emitting AR loads
|
||
- r600/sfn: Tie in address load splitting
|
||
- r600+sfn: Assign ps_conservative_z and switch to NIR defines
|
||
- r600/sfn: assign window_space_position in shader state
|
||
- r600/sfn: Ass support for image_samples
|
||
- r600/sfn: fix cube to array lowering for LOD
|
||
- r600/sfn: Fix iterator use
|
||
- r600/sfn: move kill instruction test to alu instruction
|
||
- r600/sfn: add dependencies for kill instructions
|
||
- r600/sfn: move kill handling fully to scheduling
|
||
- r600/sfn: use correct FS output location if not all outputs are used
|
||
- virgl: Make query result resource as dirty before requesting result
|
||
- virgl: Add support for ARB_pipeline_statistics
|
||
- virgl/ci: uprev virglrenderer
|
||
- docs/features: fix empty line error
|
||
- virgl: Fix IB upload when a start >0 is given
|
||
- virgl: Submit drawid_offset if is not zero
|
||
- virgl: signal support for group vote and draw parameters
|
||
- virgl: enable ARB_gl_spirv
|
||
- features: Update virgl features
|
||
- ci: uprev virglrenderer to include changes needed for GL 4.6 support
|
||
- r600/sfn: assert that group barrier is not emitted in divergent code flow
|
||
- r600/sfn: Switch to scoped barriers
|
||
- util/driconf: pin minImageCount to three for "Path of Exile"
|
||
- r600/sfn: add read instruction for unused but required LDS op results
|
||
- r600/sfn: Don't rewrite TESS_EVAL inner tess level outputs
|
||
- r600/sfn: Add experimental support for load/store_global
|
||
- r600/sfn: Handle store_global when lowering 64 bit ops to vec2
|
||
- r600/sfn: Handle load_global in 64 to vec2 lowering
|
||
- rusticl: compile r600 driver
|
||
- r600: fix handling of use_sb flag
|
||
- r600/sfn: move kill handling to fully scheduling
|
||
- 600/sfn: Trigger use of ACK for some barriers
|
||
- r600: Disable SB if we use the ariable length DOT
|
||
- r600/sfn: Silence warnings "overloaded-virtual"
|
||
- r600/sfn: Downgrade some error message to warning
|
||
- r600: Split tex CF only if written component is read
|
||
- r600/sfn: Don't deref unused group slots
|
||
- r600/sfn: on R600/R700 write a dummy pixel output if there is a gap
|
||
- r600/sfn: Clean up FS member initialization
|
||
- virgl: don't allow vertex input arrays on GLES hosts
|
||
- r600/sfn: Fix typo
|
||
- r600/sfn: drop use of nir source mods
|
||
- r600/sfn: allow source mods for per source with multi-slot ops
|
||
- r600/sfn: add source and dest mod info to opcode table
|
||
- r600/sfn: Implement source mod optimization in backend
|
||
- r600/sfn: Implement fsat for 64 bit ops
|
||
- r600/sfn: Add source mod propagation also to fp64 ops
|
||
- r600/sfn: Don't clear clear group flag on vec4 that comes from TEX or FETCH
|
||
- virgl/ci: Drop duplicate runs
|
||
- ci: Upref virglrenderer
|
||
- r600/sfn: Fix filling FS output gaps
|
||
- r600: Pre-EG - Set wrap texture modes to repeat when seemless cube is used
|
||
- r600/sfn: Be more conservative with AR re-use
|
||
- r600/sfn: Shorten array elements live range
|
||
- r600/sfn: remove debug output leftovers
|
||
- r600/sfn: Fix use of multiple IDX with kcache
|
||
- r600/sfn: Don't try to propagate to vec4 with more than one use
|
||
- r600/sfn: Only switch to other CF if no AR uses are pending
|
||
- r600/sfn: AR loads should depend on all previous non ALU instructions
|
||
- r600/sfn: Take source uses into account when switching channels
|
||
|
||
Giancarlo Devich (5):
|
||
|
||
- d3d12: Update and require DirectX-Headers 1.610.0
|
||
- d3d12: Query device for D3D12_FEATURE_D3D12_OPTIONS14
|
||
- d3d12: Update PSO creation to use CreatePipelineState
|
||
- d3d12: Add ID3D12GraphicsCommandList8 to the context
|
||
- d3d12: Support separate front/back stencils
|
||
|
||
Gregory Mitrano (2):
|
||
|
||
- ac/sqtt: Add RGP Definitions for Mesh Shaders
|
||
- radv/sqtt: Add RGP Markers for Mesh Shaders
|
||
|
||
Guilherme Gallo (29):
|
||
|
||
- ci/lava: Move job definition stuff to another file
|
||
- ci/lava: Extract LAVA proxy and LAVAJob abstractions
|
||
- ci/lava: Use python-fire in job submitter
|
||
- ci/lava: Update LogFollower for better section handling and history
|
||
- ci/lava: Add a simple Structural Logger into submitter
|
||
- bin/ci: Add StructuredLogger to improve log handling
|
||
- ci/lava: Integrate StructuralLogger with AutoSaveDict
|
||
- ci/lava: Force use of UTC timezones
|
||
- ci/lava: Refactor LAVAJobSubmitter and add tests
|
||
- ci/lava: Use f-strings in job definition
|
||
- ci/lava: Skip regression test if LAVA log file is not present
|
||
- ci/freedreno: Fix a618-traces-performance rules
|
||
- ci/lava: Bypass arg list to print_log function
|
||
- ci/lava: Fix last section in job submitter
|
||
- ci: Use absolute paths in init-stage2.sh
|
||
- ci/lava: Add SSH support in rootfs
|
||
- ci/lava: Add SSH job definition
|
||
- ci/lava: Add bridge function for job definition
|
||
- ci/lava: Distinguish test suites in DUT vs Docker
|
||
- ci/lava: Only check for the first section marker
|
||
- ci/lava: Hide JWT block during YAML dump
|
||
- ci/lava: Tweak http-download timeout in SSH based jobs
|
||
- ci/lava: Raise the post test metadata gathering retry count
|
||
- ci/lava: Force LAVA panfrost jobs to use UART
|
||
- dzn: Skip a few deqp tests which are prone to timeout
|
||
- ci/lava: Renable SSH sessions for panfrost jobs
|
||
- ci/lava: Increase Docker action failure_retry counter
|
||
- ci/lava: Add LAVA SSH client container
|
||
- ci/lava: Use an alpine image for SSH client container
|
||
|
||
Hans-Kristian Arntzen (6):
|
||
|
||
- wsi/x11: Fix present ID signal when IDLE comes before COMPLETE.
|
||
- wsi/wayland: Simplify wait logic for present wait.
|
||
- wsi/wayland: Do not assert that all present IDs have been waited on.
|
||
- radv/amdgpu: Report 48-bit VAs in bo logs.
|
||
- Fix DGC bug where indirect count > maxSequencesCount.
|
||
- wsi/x11: Fix potential deadlock in present ID.
|
||
|
||
Harri Nieminen (11):
|
||
|
||
- amd: fix typos
|
||
- amd: fix typos in code
|
||
- r300: fix typos
|
||
- radeonsi: fix typos
|
||
- r600: fix typos
|
||
- r600/sb: fix typo
|
||
- r600/sfn: fix typos
|
||
- r600/sfn: fix typos in code
|
||
- broadcom: fix typos
|
||
- egl: fix typos
|
||
- glx: fix typos
|
||
|
||
Helen Koike (3):
|
||
|
||
- ci: move .microsoft-farm-container-rules to test-source-dep.yml
|
||
- ci: remove unused tag DEBIAN_X86_64_TEST_IMAGE_PATH
|
||
- ci/android: remove strace output from cuttlefish-runner.sh
|
||
|
||
Hyunjun Ko (27):
|
||
|
||
- intel/genxml: fix num bits of some MOCS fields
|
||
- intel/genxml: conform some fields to each other gen.
|
||
- intel/genxml: align some fields on gen9/11/12/125 with media driver.
|
||
- intel/genxml: add a command VD_CONTROL_STATE to gen12/125
|
||
- util/vl: initialize data/end pointers.
|
||
- vulkan/video: add to parse h265 slice.
|
||
- vulkan/video: add h265 reference structures and relevant util functions.
|
||
- anv/image: Add a surface usage bit for video decoding
|
||
- anv/image: allocate mv storage buffers for h265
|
||
- anv/image: allow VK_IMAGE_CREATE_ALIAS_BIT with a private binding.
|
||
- anv: add initial video decode support for h265
|
||
- anv: support P010 format for video 10-bit hevc decoding
|
||
- anv/image: get width/height for each plane of a surface for video decoding.
|
||
- anv: support HEVC 10-bit decoding
|
||
- anv: enable the video h265 decode extension.
|
||
- anv/ci: Add tests for video formats to the failing tests.
|
||
- anv/video: move video requirements to outarray.
|
||
- vulkan/video: adds more conditions for setting loop_filter_across_slices_enable in h265 slice parsing.
|
||
- vulkan/video: move parsing longterm rps in h265 slice parsing.
|
||
- util/rbsp: keep track of removed bits for the emulation prevention three bytes.
|
||
- vulkan/video: consider removed bits when calculating the size of comsumed data.
|
||
- anv/video: fix to set U/V offset correctly.
|
||
- vulkan/video: keep delta weight and offsets of predicted weight tables in h265 slice parsing
|
||
- intel/genxml: changes the type for predicted weight to unsigned.
|
||
- anv: fix to set predicted weight tables correctly.
|
||
- anv/video: fix to support HEVC 10bit on some of 9th gens.
|
||
- anv: Adds a workaround for HEVC decoding on some old platforms.
|
||
|
||
Iago Toral Quiroga (34):
|
||
|
||
- broadcom/compiler: fix v3d_qpu_uses_sfu
|
||
- broadcom/compiler: add a v3d_qpu_instr_is_legacy_sfu helper
|
||
- broadcom/compiler: fix incorrect check for SFU op
|
||
- broadcom/compiler: fix incorrect ALU checks
|
||
- broadcom/compiler: return early for SFU op latency calculation
|
||
- broadcom/compiler: try harder to merge thread switch earlier
|
||
- broadcom/compiler: don't allocate undef to rf0
|
||
- broadcom/compiler: move buffer loads to lower register pressure
|
||
- broadcom/compiler: increase peephole limit to 24 instructions
|
||
- broadcom/compiler: use unified atomics
|
||
- broadcom/compiler: skip jumps in non-uniform if/then when block cost is small
|
||
- v3dv: simplify too small Z viewport scale workaround
|
||
- v3dv: store slice dimensions in pixels
|
||
- v3dv: allow TFU transfers for mip levels other than 0
|
||
- v3dv: align compressed image regions to block size
|
||
- broadcom/compiler: flag use of control barriers
|
||
- broadcom/compiler: use scoped barriers
|
||
- v3d: only warn about bining sync for indirect draw once
|
||
- v3dv: remove bogus viewport code
|
||
- v3dv: simplify scissor setup for negative viewport height
|
||
- broadcom/cle: fix up viewport offset packet definition for V3D 4.1+
|
||
- v3d,v3dv: fix viewport offset for negative viewport center
|
||
- broadcom/compiler: only use last thread switch flag to detect final section
|
||
- nir/lower_tex: copy missing fields when creating copy of tex instruction
|
||
- nir/lower_tex: handle lower_tg4_offsets with lower_tg4_broadcom_swizzle
|
||
- broadcom/compiler: handle textureGatherOffsets
|
||
- v3dv: expose shaderImageGatherExtended
|
||
- v3dv: fix slice size for miplevels >= 2
|
||
- v3dv: don't use the TLB path if we might be copying partial tiles
|
||
- v3dv: use div_round_up for division by block size
|
||
- v3dv: fix blit path for compressed image to buffer copies
|
||
- broadcom: use nir info to keep track of implicit sample shading
|
||
- broadcom/compiler: free defin and defout arrays if they already exist
|
||
- broadcom/compiler: don't leak v3d_compile when finding a new best strategy
|
||
|
||
Ian Romanick (20):
|
||
|
||
- intel/fs: Don't munge source order of 3-src instructions in opt_algebraic
|
||
- intel/fs: Fix handling of W, UW, and HF constants in combine_constants
|
||
- intel/fs: Allow HF const in MAD on Gfx12.5 if all sources are HF
|
||
- nir/algebraic: Fixup iadd3 related patterns
|
||
- intel/fs: Add constant propagation for ADD3
|
||
- intel/eu/validate: Use a single macro define half_float_conversion cases
|
||
- intel/eu/validate: Add Gfx12.5
|
||
- intel/eu/validate: Add some validation of ADD3
|
||
- nir: Add optimization pass to reassociate some bfi instructions
|
||
- intel/fs: Use nir_opt_reassociate_bfi
|
||
- nir/algebraic: Lower some bfi with two constant sources
|
||
- intel/fs: Emit better code for bfi(..., 0)
|
||
- nir/algebraic: Optimize some u2f of bfi
|
||
- nir/algebraic: Simplify various trivial bfi
|
||
- intel/stub_gpu: Don't run program again after using GDB
|
||
- intel/fs: Constant propagate into SHADER_OPCODE_SHUFFLE
|
||
- intel/fs: Add missing newline
|
||
- intel/fs: Always do opt_algebraic after opt_copy_propagation makes progress
|
||
- intel/fs: Constant fold SHL
|
||
- intel/fs: Constant fold OR and AND
|
||
|
||
Ikshwaku Chauhan (2):
|
||
|
||
- radeonsi/gfx11: updated si_is_format_supported
|
||
- radeonsi/gfx11: updated vertex format changes
|
||
|
||
Illia Abernikhin (3):
|
||
|
||
- docs: add iris features to docs/features.txt
|
||
- docs: add crocus features to docs/features.txt
|
||
- docs: remove i965 features from docs/features.txt
|
||
|
||
Illia Polishchuk (6):
|
||
|
||
- glx: add fail check for current context in another thread
|
||
- drirc: add allow_sampled_tex_copy option
|
||
- nir: switch to a normal sampler for ARB program with not depth textures
|
||
- zink, drirc: Add Borderlands 2 workaround to fix spir-v 1.6 translated discard
|
||
- zink: move find_sampler_var from zink to nir core
|
||
- nir: fix invalid sampler search by texture id
|
||
|
||
Italo Nicola (22):
|
||
|
||
- egl: disable partial redraw when gallium hud is active
|
||
- egl: fix comments alignment
|
||
- freedreno: implement clear_render_target and clear_depth_stencil
|
||
- v3d: implement clear_render_target and clear_depth_stencil
|
||
- vc4: implement clear_render_target and clear_depth_stencil
|
||
- d3d12: fix clear_depth_stencil texture deref
|
||
- gallium: implement u_default_clear_texture
|
||
- gallium: use u_default_clear_texture where applicable
|
||
- gallium: rename util_clear_texture to util_clear_texture_sw
|
||
- mesa/st: use fallback path when pipe->clear_texture is not available
|
||
- rusticl: use fallback path when pipe->clear_texture is not available
|
||
- clover: use fallback path when pipe->clear_texture is not available
|
||
- gallium: cleanup util_blitter_clear_render_target
|
||
- gallium: remove PIPE_CAP_CLEAR_TEXTURE
|
||
- lima/ci: add some ARB_clear_texture piglit tests to lima-fails.txt
|
||
- d3d12/ci: add piglit arb_clear_texture-integer fail to CI expectations
|
||
- nir: add options to lower y_vu, yv_yu, yx_xvxu and xy_vxux
|
||
- gallium/st: add support for PIPE_FORMAT_NV21 and PIPE_FORMAT_G8_B8R8_420
|
||
- mesa/main: add PIPE_FORMAT_YVYU and PIPE_FORMAT_R8B8_R8G8
|
||
- mesa/main: add PIPE_FORMAT_VYUY and PIPE_FORMAT_B8R8_G8R8
|
||
- freedreno/ci: add KHR-GL46.buffer_storage.map_persistent_flush to flakes
|
||
- egl: reenable partial redraw with a warning when using gallium hud
|
||
|
||
Iván Briano (24):
|
||
|
||
- anv: Remove dead parameters from copy_fast_clear_dwords
|
||
- anv: make anv_can_fast_clear_color_view more generally available
|
||
- anv: factor out code for ccs_op and mcs_op
|
||
- anv: expose some helper functions
|
||
- anv: support fast color clears on vkCmdClearAttachments
|
||
- anv: put EXT_mesh_shader behind an environment variable
|
||
- anv: enable graphics pipeline libraries by default
|
||
- hasvk: avoid assert due to unsupported format
|
||
- anv: enable the GPL feature based on whether the extension is supported
|
||
- vulkan/wsi: fix double free on error condition
|
||
- anv: do not explode on 32 bit builds
|
||
- anv: update conformanceVersion
|
||
- anv: flush data cache before emitting availability
|
||
- anv: ensure CFE_STATE is emitted for ray tracing pipelines
|
||
- iris: ensure mesh is disabled on context init
|
||
- anv: ensure mesh is disabled on context init
|
||
- anv: implement Wa_14019750404
|
||
- blorp: fix hangs with mesh enabled
|
||
- anv: use a simpler MUE layout for fast linked libraries
|
||
- anv: track what kind of pipeline a fragment shader may be used with
|
||
- intel/fs: read viewport and layer from the FS payload
|
||
- intel/fs: handle URB setup for fast linked mesh pipelines
|
||
- anv: enable VK_EXT_mesh_shader where supported
|
||
- intel/fs: use ffsll so we don't explode on 32 bits
|
||
|
||
James Glanville (7):
|
||
|
||
- pvr: Improve support for image clears
|
||
- pvr: Fix vtxin special var allocation count
|
||
- pvr: Fix image to buffer copies
|
||
- pvr: Fix incorrect PBE packmode for S8_UINT
|
||
- pvr: Adjust clear's region clip words
|
||
- pvr: Fix seg fault on unused ds attachment
|
||
- pvr: Fix deferred_control_stream_flags
|
||
|
||
James Knight (1):
|
||
|
||
- meson: ensure i915 Gallium driver includes Intel sources
|
||
|
||
Janne Grunau (4):
|
||
|
||
- asahi: Fix typo in debug/error message helper macro
|
||
- asahi: Free low VA BOs correctly
|
||
- st/mesa: Set gl_config.floatMode based on color_format
|
||
- asahi,agx: Fix stack buffer overflow in agx_link_varyings_vs_fs
|
||
|
||
Jarred Davies (3):
|
||
|
||
- pvr: Don't ralloc build context from compiler
|
||
- pvr: Use vk_device's enabled features struct
|
||
- pvr: Reduce free list initial size when multiple devices are created
|
||
|
||
Jesse Natalie (133):
|
||
|
||
- d3d12: Remove #if D3D12_SDK_VERSION blocks now that 610 is required
|
||
- microsoft/clc: Remove #if D3D12_SDK_VERSION blocks now that 610 is required
|
||
- dzn: Remove #if D3D12_SDK_VERSION blocks now that 610 is required
|
||
- util: Delete Offset() macro from u_memory.h
|
||
- d3d12: Respect buffer offsets for sampler views
|
||
- d3d12: Support blit texture uploads
|
||
- spirv2dxil: Lower quad ops in non-fragment/compute stages
|
||
- dzn: Remove driconf for quad ops in vertex stages
|
||
- dzn: Add physical device arg to format lookup
|
||
- dzn: Support dynamic depth bias via command list instead of PSO
|
||
- dzn: Use narrow quadrilateral lines when supported
|
||
- dzn: Support aniso-with-point-mip samplers
|
||
- dzn: Align-up heap sizes when allocating memory
|
||
- ci/windows: Update Agility SDK to 1.610.2
|
||
- dzn: Use unrestricted copy alignments when available
|
||
- dzn: Handle opaque BC1
|
||
- dzn: Handle depth bias for point fill mode emulation
|
||
- dzn: Re-design custom buffer descriptors
|
||
- ci/dzn: Run almost the full CTS
|
||
- dzn: Expose core VK1.1 extensions that aren't optional
|
||
- dzn: Expose core VK1.2 extensions that aren't optional
|
||
- meson: Don't use masm with VS backend
|
||
- spirv2dxil: Mark SSBO reads for bindless as CAN_REORDER
|
||
- microsoft/compiler: Unroll loops in opt passes
|
||
- dzn: Fix UBO descriptors pointing to the end of the buffer
|
||
- dzn: Hook up subgroup size to compute shader compilation
|
||
- dzn: Ensure sample-rate shading is factored into nir hash
|
||
- dzn: Use the nir hash as an input to the dxil hash
|
||
- dzn: Ensure subgroup size control is factored into pipeline hash
|
||
- dzn: Ensure bindless is factored into pipeline/nir hash
|
||
- dzn: Augment blit resolve to support min/max/sample-zero modes
|
||
- dzn: Support all available depth/stencil resolve modes
|
||
- dzn: Support separate depth/stencil resolves via blits
|
||
- dzn: Delete queue-level event waits
|
||
- ci/windows: Pick up WARP 1.0.6 NuGet with lots of dzn fixes
|
||
- dzn: Use A4B4G4R4 instead of B4G4R4A4 when available
|
||
- spirv2dxil: Lower large temps to scratch
|
||
- microsoft/compiler: Avoid integer divides by 0
|
||
- dzn: Run nir_opt_remove_phis before nir_lower_returns
|
||
- dzn/ci: Remove 'exclude' for graphicsfuzz cases
|
||
- microsoft/compiler: Allocate space for I/O and viewID dependency tables before instruction processing
|
||
- microsoft/compiler: Do basic I/O analysis for dependency tables
|
||
- spirv2dxil: Support int64 and doubles
|
||
- d3d12: Convert from D3D shader model to Mesa shader model earlier
|
||
- dzn: Enable 64-bit ints and floats
|
||
- microsoft/compiler: Take inputs from callers before providing nir options
|
||
- microsoft/compiler: Enable packed dot product intrinsics for SM6.4+
|
||
- dzn: Enable KHR_shader_integer_dot_product
|
||
- nir_lower_system_values: Add ASSERTED to assert-only variable
|
||
- nir: Load/store atomic op indices when lowering image intrinsics
|
||
- microsoft/compiler: Remove alu type info from store_dest()
|
||
- microsoft/compiler: Duplicate some SSA values to simplify SSA typing
|
||
- microsoft/compiler: Back-propagate type requirement information
|
||
- dxil: Use unified atomics
|
||
- vulkan: Win32 sync import/export support
|
||
- dzn: Don't zero an output struct that can have pNext
|
||
- dzn: Finish implementing KHR_synchronization2
|
||
- dzn: Dedicated resource cleanup
|
||
- dzn: External Win32 memory extension
|
||
- dzn: External Fd memory extension
|
||
- dzn: Hook up win32 semaphore import/export
|
||
- dzn: Hook up fd semaphore import/export
|
||
- docs: Update list of extensions implemented by dzn
|
||
- glsl: Delete dead intrinsics
|
||
- microsoft/compiler: Better and simpler bitcast reduction
|
||
- dzn: Add a no-bindless debug flag
|
||
- dzn: Fix inverted assert
|
||
- dzn: Partial revert of 8887852d
|
||
- dzn: Don't expose copy queues
|
||
- dzn: Fix src/dest confusion for some non-bindless descriptor copies
|
||
- wsi/win32: Handle acquiring an image while one is already acquired
|
||
- nir_lower_returns: Optimize phis before beginning the pass
|
||
- nir: Add undef phi srcs when adding successors
|
||
- radv: Don't run opt_remove_phis before lower_returns
|
||
- dxil: Don't run opt_remove_phis before lower_returns
|
||
- ci/windows: Update WARP to 1.0.7
|
||
- microsoft/compiler: Enable emitting type info for textures with <4 comps
|
||
- microsoft/compiler: Add a pass to assign image formats based on number of components
|
||
- spirv2dxil: Assign formats to image vars before lowering to bindless
|
||
- microsoft/compiler: Use image formats to determine texture types
|
||
- ci/windows: Update WARP to 1.0.7.1
|
||
- nir_opt_algebraic: Don't shrink 64-bit bitwise ops if pack_split is going to be lowered
|
||
- nir: Add preserve_mediump as a shader compiler option
|
||
- microsoft/compiler: Always set support_16bit_alu
|
||
- microsoft/compiler: Handle mediump
|
||
- spirv2dxil: Enable mediump
|
||
- dzn: Don't lower away mediump
|
||
- microsoft/compiler: Fix the int->uint pass for arrayed I/O
|
||
- microsoft/compiler: Fix usage of type var in semantic asserts
|
||
- microsoft/compiler: Viewport/layer as input to GS/HS needs to set feature bit
|
||
- d3d12: Support PIPE_CAP_VS_LAYER_VIEWPORT
|
||
- dzn: Don't create D3D objects for secondary command buffers
|
||
- dzn: Fix incremental binding of VBs
|
||
- d3d12: Fully initialize UAV desc for null SSBOs
|
||
- dzn: Don't support VK R4G4B4A4_UNORM_PACK16 unless we have B4G4R4A4
|
||
- nir_opt_constant_folding: Fix nir_deref_path leak
|
||
- nir: Add is_null_constant to nir_constant
|
||
- vtn: Set is_null_constant
|
||
- nir_split_struct_vars: Support more modes and constant initializers
|
||
- nir: Allow atomics as non-complex uses for var-splitting passes
|
||
- nir_lower_ubo_vec4: Delete an invalid assert
|
||
- nir_lower_mem_access_bit_sizes: Add a bit_size input to the callback
|
||
- nir_lower_mem_access_bit_sizes: Move options into a struct
|
||
- nir_lower_mem_access_bit_sizes: Support unaligned stores via a pair of atomics
|
||
- nir: Fix constant expression for unpack_64_4x16
|
||
- nir: Optimize unpacking 16 bit values that were originally packed
|
||
- microsoft/clc: Try harder to optimize memcpys before lowering them
|
||
- microsoft/clc: Fix progress reporting for some lowering
|
||
- microsoft/compiler: Support vec/struct const vals
|
||
- microsoft/compiler: Improvements to constant -> shader_temp pass used for CL
|
||
- microsoft/compiler: Add some more lowering passes for derefs
|
||
- microsoft/compiler: Emit const accesses as load_deref
|
||
- microsoft/compiler: Use mem_constant instead of shader_temp for consts
|
||
- microsoft/compiler: Un-lower shared/scratch to derefs
|
||
- spirv2dxil: Don't lower shared/temp to explicit I/O
|
||
- microsoft/compiler: Support load_ubo_vec4
|
||
- dxil: Don't generate load_ubo_dxil directly
|
||
- dxil: Delete load_ubo_dxil intrinsic
|
||
- microsoft/compiler: Don't lower bit sizes for movs
|
||
- microsoft/compiler: Don't over-align raw buffer load/store intrinsics
|
||
- dxil: Remove custom SSBO lowering
|
||
- nir_lower_returns: Mark assert-only var as ASSERTED
|
||
- dzn: Ignore export access parameters
|
||
- dzn: Inline D3D12 device creation in physical device creation
|
||
- dzn: Use common GetPhysicalDeviceFeatures2
|
||
- dzn: Remove dynamic check for block-compressed support
|
||
- dzn: Fix multisample counts in device limits
|
||
- dzn: Align placed footprints used when copying linear <-> optimal for BC formats
|
||
- dzn: VK_EXT_external_memory_host
|
||
- radv: Fix label name
|
||
- microsoft/clc: Fix usage of nir_builder_at
|
||
- ci/windows: Re-enable Windows builds
|
||
- d3d12: Fix indexing of local_reference_state
|
||
|
||
Jiadong Zhu (1):
|
||
|
||
- ac: enable SHADOW_GLOBAL_CONFIG for preemptible ib
|
||
|
||
Jianxun Zhang (8):
|
||
|
||
- iris: Fix memory alignment when importing dmabuf (GFX12.5)
|
||
- include/uapi: Update drm_fourcc.h from drm kernel
|
||
- intel/isl: Add MTL RC CCS modifier into modifier info
|
||
- iris: Support I915_FORMAT_MOD_4_TILED_MTL_RC_CCS modifier
|
||
- intel/isl: Add MTL RC CCS CC modifier into modifier info
|
||
- iris: Support I915_FORMAT_MOD_4_TILED_MTL_RC_CCS_CC modifier
|
||
- intel/isl: Add MTL MC CCS modifier into modifier info
|
||
- iris: Support MTL modifier MC_CCS
|
||
|
||
Jonathan Gray (1):
|
||
|
||
- intel/dev: remove dg2 0x5698 pci id
|
||
|
||
Jordan Justen (26):
|
||
|
||
- intel/compiler/gfx12.5+: Lower 64-bit cluster_broadcast with 32-bit ops
|
||
- mesa/main: Exit early when trying to create an unsupported context API
|
||
- iris: Flush untyped dataport cache when HDC flush is requested on compute
|
||
- iris: Flush untyped dataport cache DC flush is requested on compute
|
||
- anv: Clear untyped dataport cache flush bit if not in GPGPU mode
|
||
- anv: Flush untyped dataport cache when HDC flush is requested on compute
|
||
- anv: Flush untyped dataport cache DC flush is requested on compute
|
||
- intel/devinfo: Add has_set_pat_uapi
|
||
- intel/devinfo: Define PAT indices used on MTL
|
||
- iris/bufmgr: Add iris_pat_index_for_bo_flags()
|
||
- iris/bufmgr: Skip bucket allocation if not using writeback cache PAT index
|
||
- iris: Map aux-map with WC on MTL+ (has_set_pat_uapi)
|
||
- drm-uapi/i915_drm.h: Update from drm-next (2023-06-09)
|
||
- iris: Use set PAT extension on BO creation for MTL
|
||
- anv: Use set PAT extension on BO creation for MTL
|
||
- intel/devinfo/i915: Set has_set_pat_uapi for MTL+
|
||
- intel/genxml: Add COMPCS0 aux-table registers
|
||
- anv: Program compute aux-map base address during queue init
|
||
- anv: Use correct CCS0 aux-map register offset in pipe flush
|
||
- isl: Add ISL_SURF_USAGE_STREAM_OUT_BIT
|
||
- anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS
|
||
- isl/dev: Add uncached MOCS value
|
||
- isl: Set MOCS to uncached for MTL stream-out
|
||
- intel/dev: Use RPL-U name on RPL-U devices
|
||
- intel/dev: Add more RPL PCI IDs
|
||
- intel/dev: Update device string for MTL PCI ID 0x7d55
|
||
|
||
Joshua Ashton (8):
|
||
|
||
- radv: Do not enable robustness for push constants with robustBufferAccess2
|
||
- radv: Refactor buffer robustness to an enum
|
||
- radv: Rename radv_nir_compiler_options::robust_buffer_access to robust_buffer_access_llvm
|
||
- radv: Split and move buffer robustness to shader key
|
||
- radv: Rename radv_required_subgroup_info to radv_shader_stage_key
|
||
- radv: Implement VK_EXT_pipeline_robustness
|
||
- radv: Advertise VK_EXT_pipeline_robustness
|
||
- radv: Remove unused pipeline param from radv_generate_pipeline_key
|
||
|
||
Joshua Watt (2):
|
||
|
||
- drm-shim: Set file type in readdir()
|
||
- drm-shim: Use anonymous file for file override
|
||
|
||
José Fonseca (2):
|
||
|
||
- wgl: Fix unintentional assignment on assert.
|
||
- wgl: Remove needless \`if (1) { ... }`.
|
||
|
||
José Roberto de Souza (29):
|
||
|
||
- iris: Move i915 batch destroy logic to iris_i915_destroy_batch()
|
||
- iris: Initialize batch screen in iris_init_batch()
|
||
- iris: Move iris_batch i915 specific variables to union
|
||
- iris: Create, destroy and replace Xe engines
|
||
- iris: Implement batch_check_for_reset() in Xe kmd backend
|
||
- iris: Set priority to Xe engines
|
||
- iris: Fix close of exported bos
|
||
- intel/common: Add gt_id to intel_engine_class
|
||
- iris: Implement batch_submit() in Xe kmd backend
|
||
- iris: Fix vm bind of imported bos from other GPUs
|
||
- build: Add Iris and ANV to ARM's auto-generated drivers
|
||
- anv: Take into consideration physical device max heap size to set maxStorageBufferRange
|
||
- iris: Allow shared scanout buffer to be placed in smem as well
|
||
- iris: Add a function to return allocated bo mmap mode
|
||
- iris: Add function to return mmap mode for userptr bos
|
||
- iris: Add function to return mmap mode for aux map
|
||
- anv: Set memory types supported by Xe KMD
|
||
- anv: Fix ANV_BO_ALLOC_NO_LOCAL_MEM flag
|
||
- anv: Nuke ANV_BO_ALLOC_WRITE_COMBINE
|
||
- iris: Fix return of xe_batch_submit() when exec fails
|
||
- iris: Replace aperture_bytes by sram size in iris_resource_create_for_image() for PIPE_USAGE_STAGING
|
||
- intel: Fix support of kernel versions without DRM_I915_QUERY_ENGINE_INFO
|
||
- iris: Attach a dma-buf to bo flink
|
||
- iris: Implement external object implicit syncronization for Xe kmd
|
||
- anv: Fix compute maximum number of threads value
|
||
- anv: Fix some mismatches of canonical and regular addresses around anv_bo_vma_alloc_or_close()
|
||
- anv: Drop unnecessary intel_canonical_address() call around anv_address_physical()
|
||
- anv: Drop unnecessary intel_canonical_address() calls around bo->offset
|
||
- iris: Convert slab address to canonical
|
||
|
||
Juan A. Suarez Romero (34):
|
||
|
||
- v3d: set depth compare function correctly
|
||
- v3d: use primitive type to get stream output offset
|
||
- v3d/ci: annotate failure
|
||
- v3dv/ci: rename waiver test
|
||
- v3d: add support for ARB_texture_cube_map_array
|
||
- v3d/ci: enable glsl 1.30 and 1.40 piglit tests
|
||
- v3d: apply 1D texture miplevel alignment in arrays
|
||
- v3d/ci: update neverball-v2 trace reference
|
||
- vc4/ci: skip unsupported test versions
|
||
- vc4/ci: disable VC4 jobs
|
||
- v3d: add per hw-version caller macro
|
||
- v3d: upgrade V3D 4.1 to 4.2 version
|
||
- v3d: apply proper clamping when setting up RT
|
||
- v3d/ci: annotate failures
|
||
- vc4/ci: re-enable VC4 testing
|
||
- v3d: delay offset/counter values with primitive restart
|
||
- v3d/ci: run GPU piglit profile
|
||
- v3d/ci: make traces test mandatory
|
||
- v3d: enable NIR compact arrays
|
||
- vc4: set blit mask correctly
|
||
- vc4: call blit paths in chain
|
||
- vc4: allow tile-based blit for Z/S
|
||
- vc4: add specific stencil blit path
|
||
- v3d/v3dv/ci: adjust job fractions
|
||
- v3dv/vc4/ci: update expected results
|
||
- v3d/ci: update traces
|
||
- v3d: Z/S blit require Z/S formats
|
||
- broadcom/ci: update expected results
|
||
- v3d: handle samplerExternalOES
|
||
- broadcom/ci: update expected results
|
||
- gallium/util: fix color clamp for alpha-only formats
|
||
- v3d: clear alpha-only as red-only
|
||
- vc4/v3d/ci: update expected results
|
||
- v3d/ci: add new flake
|
||
|
||
Julia Tatz (7):
|
||
|
||
- zink: Implement PIPE_CAP_OPENCL_INTEGER_FUNCTIONS and PIPE_CAP_INTEGER_MULTIPLY_32X16.
|
||
- zink: Implement PIPE_CAP_RESOURCE_FROM_USER_MEMORY
|
||
- zink: fix layout(local_size_variable) for vk1.3+
|
||
- zink/ci: update expected results
|
||
- aux/trace: fix (u)int dump
|
||
- gallium/dri: fix dri2_from_names
|
||
- aux/trace: fix set_hw_atomic_buffers method name
|
||
|
||
Julia Zhang (1):
|
||
|
||
- virgl: remove check of VIRGL_CAP_V2_UNTYPED_RESOURCE
|
||
|
||
Julian Hagemeister (1):
|
||
|
||
- Gallium: Fix shared memory segment leak
|
||
|
||
Juston Li (24):
|
||
|
||
- venus: use pipelineCacheUUID for shader cache id
|
||
- venus: filter out queue familes with exclusive sparse binding support
|
||
- venus: add helper function support for VkBindSparseInfo
|
||
- venus: add back sparse binding support
|
||
- venus: enable sparse binding features
|
||
- venus: enable sparse binding properties
|
||
- venus: sync to latest protocol header from v1.3.252
|
||
- venus: sync protocol for multiple extensions for zink
|
||
- venus: enable VK_EXT_non_seamless_cube_map
|
||
- venus: enable VK_EXT_dynamic_rendering_unused_attachments
|
||
- venus: enable VK_KHR_shader_clock
|
||
- venus: enable VK_EXT_border_color_swizzle
|
||
- venus: enable VK_EXT_fragment_shader_interlock
|
||
- venus: enable VK_EXT_shader_subgroup_ballot
|
||
- venus: enable VK_EXT_color_write_enable
|
||
- docs: venus: update extension support
|
||
- radv: fix incorrect size for primitives generated query
|
||
- venus: factor out flush barrier cmd
|
||
- venus: expose vn_feedback_buffer_create()
|
||
- venus: add query pool feedback cmds
|
||
- venus: track viewMask
|
||
- venus: track render pass
|
||
- venus: batch query feedback and defer until after render pass
|
||
- venus: use feedback for vkGetQueryPoolResults
|
||
|
||
Karmjit Mahil (44):
|
||
|
||
- pvr: Add missing includes in pvr_common.h
|
||
- pvr: Implement vkCmdUpdateBuffer().
|
||
- pvr: Implement simple internal format v2 transfer paths.
|
||
- pvr: Add deferred RTA clears for cores without gs_rta_support.
|
||
- pvr: Finish pvr_perform_start_of_render_attachment_clear().
|
||
- pvr: Collect vertex input data and fill info struct.
|
||
- pvr: Fix a comment in the PDS code
|
||
- pvr: Fix typo in PDS function name
|
||
- pvr: Add handling for missing entries in pvr_setup_vertex_buffers()
|
||
- pvr: Handle special built-in variable loading in vertex shader
|
||
- pvr: Add PVR_DW_TO_BYTES()
|
||
- pvr: Fix pvr_csb_bake() list return.
|
||
- pvr: Change push_constants_shader_stages to type pvr_stage_allocation
|
||
- pvr: Fix static assert check
|
||
- pvr: Fix unaligned VDMCTRL_PDS_STATE1 data address
|
||
- pvr: Don't advertise S8_UINT support
|
||
- pvr: Fix cs corruption in pvr_pack_clear_vdm_state()
|
||
- pvr: Add missing NULL checks in some vkDestroy...() functions
|
||
- pvr: Use original binding numbers instead of reassigning
|
||
- pvr: Remove custom status in command buffer
|
||
- pvr: Fix missing invalidation of the command buffer
|
||
- pvr: Fix possible allocation of 0 size
|
||
- pvr: Fix vk_free() in vkCreateRenderPass2() error path
|
||
- pvr: Use the suballocator for queries
|
||
- pvr: Add pvrsrvkm sync prim set bridge call
|
||
- pvr: Move pvrsrv sync prim code into new pvr_srv_sync_prim.{c,h}
|
||
- pvr: Use idalloc as the allocator for sync prims
|
||
- pvr: Handle barrier load and store flags.
|
||
- pvr: Fix typo causing seg faults copying immutable samplers
|
||
- pvr: Fix draw indirect page faults due to missing index list buffer
|
||
- pvr: Rename temps_count to pds_temps_count
|
||
- pvr: Fix PDS temps allocation on fragment stage
|
||
- pvr: HWRT creation simplifications.
|
||
- pvr: Dedup a check with pvr_is_render_area_tile_aligned()
|
||
- pvr: Remove outdated finishme
|
||
- pvr: Fix seg fault on empty descriptor set
|
||
- pvr: Fix dynamic offset patching
|
||
- pvr: Fix csb control stream extension
|
||
- pvr: Fix missing BITFIELD_BIT for winsys frag job flag
|
||
- pvr: Change winsys flag defines to bitfields
|
||
- pvr: Setup ZLS depth and stencil load/store separately
|
||
- docs: Add inital PowerVR driver documentation
|
||
- pvr: Fix \`for` loop itarator usage
|
||
- pvr: Fix dynamic desc offset storage
|
||
|
||
Karol Herbst (140):
|
||
|
||
- rusticl: rework CLVec helper function to calculate bounds
|
||
- rusticl/mem: fix Mem::copy_rect
|
||
- rusticl/mem: replace buffer_offset_size with CLVec::calc_offset_size
|
||
- gallium: correctly name the flags of svm_migrate
|
||
- rusticl/context: add helper to get the max mem alloc size for all devices
|
||
- rusticl/memory: Rework mapping of memory located in system RAM
|
||
- rusticl/mem: add get_parent helper
|
||
- rusticl: add support for fine-grained system SVM
|
||
- nv50/ir: ignore CL system values
|
||
- nouveau: allow to enable SVM without having to enable CL
|
||
- nouveau: nouveau_copy_buffer can deal with user_ptrs just fine
|
||
- rusticl/event: drop work item before updating status
|
||
- rusticl: add create_pipe_box to better deal with pipe_box restrictions
|
||
- rusticl/mem: more region and origin validation
|
||
- radeonsi: lower mul_high
|
||
- ac/llvm: support shifts on 16 bit vec2
|
||
- rusticl: don't set size_t-is-usize for >=bindgen-0.65
|
||
- rusticl/device: improve advertisement of fp64 support
|
||
- rusticl/platform: make the initialization more explicit
|
||
- rusticl/platform: extract env variable parsing from Platform::init
|
||
- rusticl/platform: add RUSTICL_FEATURES boilerplate
|
||
- rusticl/device: allow enablement of fp64 via RUSTICL_FEATURES
|
||
- rusticl/program: rework dynamic Program state
|
||
- rusticl/program: use if let to get rid of an unwrap in build
|
||
- clc: free kernel args in clc_free_kernels_info
|
||
- rusticl/nir: finish blob after serializing
|
||
- nvc0: do not randomly emit fences.
|
||
- nv50/ir: Use unified atomics
|
||
- Reviewed-by: Nora Allen <blackcatgames@protonmail.com>rusticl/platform: make the extension array a static
|
||
- rusticl/device: use PLATFORM_EXTENSIONS as a template for filling extensions
|
||
- rusticl/platform: advertise byte_addressable_store
|
||
- rusticl/device: split add_ext in fill_extensions
|
||
- rusticl: explicitly state supported SPIR-V extensions
|
||
- rusticl/platform: generate extension constants via macro
|
||
- rusticl/spirv: skip printing info messages
|
||
- rusticl/device: limit MAX_PARAMETER_SIZE to 32k
|
||
- rusticl/device: set preferred vector size of doubles if fp64 is enabled
|
||
- nv50/ir: convert to scoped_barrier
|
||
- doc/rusticl: add Rust Update Policy
|
||
- rusticl: bump rust req to 1.60
|
||
- rusticl/event: flush queues from dependencies
|
||
- ci: add and use clippy for rusticl
|
||
- rusticl: fix clippy errors on image_slice_pitch change to usize
|
||
- clc: relax spec constant validation
|
||
- rusticl: add proc macro module for generating API stubs
|
||
- rusticl/icd: make release return nothing
|
||
- rusticl/icd: use new proc macros
|
||
- ac/llvm: support vec2 on b2i16
|
||
- ac/llvm: replace MESA_SHADER_COMPUTE checks with gl_shader_stage_is_compute
|
||
- ac/llvm: set +cumode for radeonsi
|
||
- lp: align memory for long16 CL types
|
||
- rusticl/icd: fix ReferenceCountedAPIPointer::from_ptr for NULL pointers
|
||
- rusticl/api: remove some repr(C)
|
||
- rusticl/event: ensure even status is updated in order
|
||
- docs: improve OpenCL features
|
||
- rusticl/queue: overhaul of the queue+event handling
|
||
- rusticl: enforce using unsafe blocks in unsafe functions
|
||
- nv50/ir: use override
|
||
- nv50/ir: resolve -Woverloaded-virtual=1 warnings
|
||
- clc: add commment to clc_optional_features to ensure no padding exists
|
||
- rusticl/spirv: Key optional clc features when caching.
|
||
- clc: static assert that clc_optional_features has no padding
|
||
- nouveau: eliminate busy waiting on fences
|
||
- rusticl/device: add intel usm queries DPCPP cares about
|
||
- rusticl/device: sort cl_device_info queries
|
||
- rusticl/version: use cl_version instead of cl_uint and provide a From impl
|
||
- rusticl: advertize cl_khr_extended_versioning
|
||
- docs/cl: fix whitespace issues and add missing entries
|
||
- rusticl: advertize cl_khr_spirv_no_integer_wrap_decoration
|
||
- docs/cl: improve reporting of image features
|
||
- rusticl/mem: cache the pipe_format
|
||
- rusticl/mem: fix validation of packed image formats
|
||
- rusticl/format: pass order and type to rusticl_image_format directly
|
||
- rusticl/format: extract CL format to pipe format mapping into const function
|
||
- rusticl/format: extract required format checks into const functions
|
||
- rusticl/format: drop req_for_3d_image_write_ext
|
||
- rusticl/format: add required format table for CL2.0
|
||
- rusticl/format: document cl to pipe format mapping
|
||
- rusticl/format: move format table generation into a macro
|
||
- rusticl/format: enable all trivial to support optional image formats
|
||
- clc: fix SPIRVMessageConsumer for NULL src
|
||
- clc: allow passing custom validator options
|
||
- rusticl/program: pass our max param size along to the spirv validator
|
||
- compiler/types: fix size of padded OpenCL Structs
|
||
- rusticl/device: rename doubles to fp64 and long to int64
|
||
- rusticl: experimental support for cl_khr_fp16
|
||
- rusticl: add ld_args_gc_sections
|
||
- rusticl: specify which symbols to export
|
||
- rusticl: stop linking with libgalliumvl
|
||
- rusticl/device: create helper context before loading libclc
|
||
- nir/load_libclc: run some opt passes for everybody
|
||
- docs: document CLC_DEBUG
|
||
- rusticl/program: add debugging for OpenCL C compilation
|
||
- rusticl/program: add debugging option to disable SPIR-V validation
|
||
- nvc0: fix printing shaders
|
||
- nv50/ir/nir: set numBarriers if we emit an OP_BAR
|
||
- rusticl: structurize and reorder mesa binding args
|
||
- rusticl: generate bindings for build-id stuff
|
||
- rusticl/meson: extract common bindgen rust args
|
||
- rusticl/mesa: create proper build-id hash for the disk cache
|
||
- rusticl: bump bindgen requirement
|
||
- rusticl/program: skip linking compiled binaries
|
||
- docs/rusticl: mark building section as such
|
||
- docs/rusticl: add Enabling section
|
||
- docs/cl: remove cl_khr_byte_addressable_store from extension list.
|
||
- docs/cl: move vec3 support under OpenCL C 1.1
|
||
- docs/cl: timer sync is implemented
|
||
- docs: add missing get_compute_state_info documentation
|
||
- vtn: more CL subgroups
|
||
- clc: rework optional subgroup feature
|
||
- llvmpipe: report the proper subgroup size
|
||
- gallium: add simd_sizes to pipe_compute_state_object_info
|
||
- gallium: add get_compute_state_subgroup_size
|
||
- gallium: add PIPE_COMPUTE_CAP_MAX_SUBGROUPS
|
||
- iris: implement get_compute_state_subgroup_size
|
||
- rusticl/util: add an Iterator to iterate over set bits in an integer
|
||
- rusticl/util: add div_round_up
|
||
- rusticl/device: rework subgroups to subgroup_sizes
|
||
- gallium: change PIPE_COMPUTE_CAP_SUBGROUP_SIZE to a bitfield of sizes
|
||
- rusticl: deal with compute_param returning 0
|
||
- rusticl: support subgroups
|
||
- nvc0: backport fp helper invocation fix to 2nd gen Maxwell+
|
||
- rusticl/kernel: silence newer clippy warning
|
||
- rusticl: Replace &Arc<Device> with &Device
|
||
- rusticl/device: make it &'static
|
||
- api/icd: drop static lifetime from \`get_ref` return type
|
||
- nvc0: initial Ada enablement
|
||
- rusticl: fix warnings with newer rustc
|
||
- nv50/ir/nir: fix txq emission on MS textures
|
||
- nv50/ir/nir: Fix zero source handling of tex instructions.
|
||
- rusticl/kernel: only handle function_temp memory before lowering printf
|
||
- n50/compute: submit initial compute state in nv50_screen_create
|
||
- nv50: fix code uploads bigger than 0x10000 bytes
|
||
- nouveau: take glsl_type ref unconditionally
|
||
- nv50: limit max code uploads to 0x8000
|
||
- clc: use CLANG_RESOURCE_DIR for clang's resource path
|
||
- zink: fix source type in load/store scratch
|
||
- zink: fix global stores
|
||
- rusticl/disk_cache: fix stack corruption
|
||
- rusticl/memory: do not verify pitch for IMAGE1D_BUFFER
|
||
|
||
Kenneth Graunke (17):
|
||
|
||
- intel/compiler: UNDEF comparisons with smaller than 32-bit
|
||
- intel/compiler: UNDEF SubgroupInvocation's register
|
||
- intel/compiler: Fold constants after distributing source modifiers
|
||
- nir: Add a variant of nir_lower_int64 for float conversions only
|
||
- intel/compiler: Postpone most int64 lowering to brw_postprocess_nir
|
||
- nir: Add find_lsb lowering to nir_lower_int64.
|
||
- intel/compiler: Fix 64-bit ufind_msb, find_lsb, and bit_count
|
||
- nir: Assert that we don't shrink bit-sizes in nir_lower_bit_size()
|
||
- intel/compiler: Fix a fallthrough in components_read() for atomics
|
||
- intel/genxml: Drop Tiled Resource Mode fields
|
||
- intel: Initialize FF_MODE2 on all Gfx12 platforms
|
||
- iris: Allocate coherent buffers for resources flagged as persistent/coherent
|
||
- isl: Don't set "Enable Unorm Path in Color Pipe" on Alchemist
|
||
- intel/genxml: Fix gen_sort_tags.py to handle mbz/mbo
|
||
- intel/genxml: Update RENDER_SURFACE_STATE Fields
|
||
- iris: Re-emit 3DSTATE_DS for each primitive (workaround 14019750404)
|
||
- iris: Check prog[] instead of uncompiled[] for BLORP state skipping
|
||
|
||
Kiskae (1):
|
||
|
||
- vulkan/wsi: check for dri3 buffer initialization failure
|
||
|
||
Konrad Dybcio (2):
|
||
|
||
- freedreno: Add some A6/7xx registers
|
||
- freedreno: Partially decode CP_PROTECT_CNTL
|
||
|
||
Konstantin Kharlamov (1):
|
||
|
||
- loader/dri3: temporarily work around a crash when front is NULL
|
||
|
||
Konstantin Seurer (133):
|
||
|
||
- nir/lower_fp16_casts: Fix SSA dominance
|
||
- nir/lower_io: Emit less iadd(x, 0)
|
||
- nir: Make rq_load committed src an index
|
||
- radv: Stop running constant folding during ray query lowering
|
||
- radv/ci: Test ray tracing pipelines
|
||
- gallium/nir: Handle unified atomics in nir_to_tgsi_info
|
||
- nir/inline_uniforms: Handle num_components > 1
|
||
- nir/lower_shader_calls: Remat derefs earlier
|
||
- radv: Stop using radv_get_int_debug_option
|
||
- treewide: Add a .clang-format file
|
||
- amd: Use the Mesa base style
|
||
- asahi: Use the Mesa base style
|
||
- freedreno: Use the Mesa base style
|
||
- d3d12: Use the Mesa base style
|
||
- i915: Use the Mesa base style
|
||
- r600/sfn: Use the Mesa base style
|
||
- panfrost: Use the Mesa base style
|
||
- util/perf: Use the Mesa base style
|
||
- venus: Use the Mesa base style
|
||
- asahi: Reformat using the new style
|
||
- panfrost: Reformat using the new style
|
||
- gallivm: Fix gather/scatter types for newer llvm
|
||
- radv/rt: Fix pipeline libraries
|
||
- gallivm: Fix anisotropic sampling with num_mips=1
|
||
- gallivm: Cast read_first_invocation source to an int
|
||
- llvmpipe: refactor out the pipe->lp_jit structure fillers.
|
||
- llvmpipe: Add lp_storage_image_format_supported
|
||
- llvmpipe: Add lp_storage_render_image_format_supported
|
||
- gallivm: Add lp_build_nir_sample_key
|
||
- gallivm: Add lp_img_op_from_intrinsic
|
||
- gallivm: Handle invalid image format/op combinations
|
||
- gallivm: Zero initialize param structs
|
||
- radv/rt: Do not guard the raygen shader
|
||
- radv/rt: Clear NIR metadata after lowering the ABI
|
||
- aco/rt: Do not initialize the next shader addr
|
||
- radv/ci: Test ray tracing on vkd3d-proton
|
||
- radv/rt: Stop forcing wave32 by setting compute_subgroup_size
|
||
- Revert "radv: Enable ray tracing pipelines by default"
|
||
- radv/rt: Enable RT pipelines on GFX10_3+ excluding vangogh
|
||
- radv: Move the shader type to radv_shader_info
|
||
- radv: Adjust the traversal shader description
|
||
- radv: Use get_shader_from_executable_index for executable properties
|
||
- radv: Implement executable properties for ray tracing stages
|
||
- radv: Use _mesa_shader_stage_to_string for executable name
|
||
- radv/rt: Store the prolog outside the shaders array
|
||
- radv: Call radv_pipeline_init_scratch per shader
|
||
- meson: Add a xcb-keysyms dependency
|
||
- vulkan: Common trace capturing infrastructure
|
||
- radv: Add radv_trace_mode
|
||
- vulkan/wsi/x11: Capture traces using a hotkey
|
||
- radv/rra: Use common trace trigger
|
||
- radv/rgp: Use common trace trigger
|
||
- vulkan/rmv,radv: Use common trace trigger
|
||
- docs: Update envvars used for tracing
|
||
- amd: Use nir\_ instead of nir_build\_ helpers
|
||
- microsoft: Use nir\_ instead of nir_build\_ helpers
|
||
- intel: Use nir\_ instead of nir_build\_ helpers
|
||
- freedreno: Use nir\_ instead of nir_build\_ helpers
|
||
- vtn: Use nir\_ instead of nir_build\_ helpers
|
||
- nir: Use nir\_ instead of nir_build\_ helpers
|
||
- nir/builder_opcodes: Remove nir_build\_ prefixed helpers
|
||
- util: Do not include immintrin.h in half_float.h
|
||
- radv/rt: Fix caching non-recursive stages
|
||
- radv/rt: Hash stages using radv_hash_shaders
|
||
- llvmpipe: Add BDA jit type helpers
|
||
- gallivm: Add missing includes
|
||
- gallivm: Add lp_descriptor struct
|
||
- gallivm: Expose lp_build_sample_soa_code
|
||
- llvmpipe: Add lp_build_sampler_soa_dynamic_state
|
||
- llvmpipe: Add lp_build_image_soa_dynamic_state
|
||
- gallivm: Add LP_IMG_OP_COUNT
|
||
- gallivm: Expose LP_MAX_TEX_FUNC_ARGS
|
||
- llvmpipe: Add LP_TOTAL_IMAGE_OP_COUNT
|
||
- gallivm: Expose lp_build_texel_type
|
||
- gallivm: Propagate vulkan resources
|
||
- gallivm: Clamp the texel buffer size
|
||
- llvmpipe: Pre compile sample functions
|
||
- gallivm: Add a function for loading vulkan descriptors
|
||
- gallivm: Implement vulkan UBOs
|
||
- gallivm: Implement vulkan SSBOs
|
||
- gallivm: Implement vulkan textures
|
||
- gallivm: Implement vulkan images
|
||
- llvmpipe: Disable the linear path when running vulkan
|
||
- lavapipe: Include llvmpipe
|
||
- lavapipe: Lower more texture OPs
|
||
- lavapipe: Make pipeline_lock generic for accessing the queue
|
||
- lavapipe: Rework descriptor handling
|
||
- lavapipe: Lower non uniform access
|
||
- lavapipe: EXT_descriptor_indexing
|
||
- llvmpipe: Use lp_jit_buffer_from_pipe_const in setup
|
||
- lavapipe: Make shader compilation thread safe
|
||
- zink: Increase ZINK_FBFETCH_DESCRIPTOR_SIZE to 280
|
||
- zink/ci: Update lavapipe expectations
|
||
- venus/ci: Update fails
|
||
- lavapipe/ci: Update CI expectations for new extensions
|
||
- llvmpipe/ci: Update expectations
|
||
- nir: Add nir_builder_at
|
||
- radv: Use nir_builder_at
|
||
- asahi: Use nir_builder_at
|
||
- v3d: Use nir_builder_at
|
||
- glsl: Use nir_builder_at
|
||
- nir: Use nir_builder_at
|
||
- spirv: Use nir_builder_at
|
||
- freedreno: Use nir_builder_at
|
||
- gallium,st: Use nir_builder_at
|
||
- crocus: Use nir_builder_at
|
||
- etnaviv: Use nir_builder_at
|
||
- r600: Use nir_builder_at
|
||
- radeonsi: Use nir_builder_at
|
||
- vc4: Use nir_builder_at
|
||
- zink: Use nir_builder_at
|
||
- lavapipe: Use nir_builder_at
|
||
- microsoft: Use nir_builder_at
|
||
- panfrost: Use nir_builder_at
|
||
- intel: Use nir_builder_at
|
||
- nir/opt_dead_cf: Handle if statements ending in a jump correctly
|
||
- nir/builder_opcodes: Do not generate empty intrinsic indices
|
||
- amd: Move ac_hw_stage to its own file
|
||
- gallivm: Fix atomic_global types
|
||
- lavapipe: Set the descriptor count to what vkd3d-proton requires
|
||
- llvmpipe: Allow comparison sampling for float formats
|
||
- llvmpipe: Allocate more dummy sample functions for FORMAT_NONE
|
||
- llvmpipe,lavapipe: Relayout lp_descriptor
|
||
- lavapipe: Always advertise formatless storage image OPs
|
||
- nir/lower_shader_calls: Remat derefs after shader calls
|
||
- nir/opt_dead_cf: Run dead_cf_block while it makes progress
|
||
- nir/opt_dead_cf: Clarify comment
|
||
- draw: Do not restart the primitive_id at 0
|
||
- llvmpipe: Fix compiling with LP_USE_TEXTURE_CACHE
|
||
- llvmpipe: Zero extend vectors in widen_to_simd_width
|
||
- vulkan/wsi/x11: Implement capture hotkey using the keymap
|
||
- radv: Don't use the depth image view for depth bias emission
|
||
- aco/spill: Make sure that offset stays in bounds
|
||
|
||
Kurt Kartaltepe (1):
|
||
|
||
- drirc: Set limit_trig_input_range option for Nier games
|
||
|
||
Leo Liu (6):
|
||
|
||
- radeonsi: create a new context for transcode with multiple video engines
|
||
- radeonsi/vcn: AV1 skip the redundant bs resize
|
||
- radeonsi: Remove redundant vcn_decode from info
|
||
- amd: Add vcn ip version info
|
||
- radeonsi: Use vcn version instead of CHIP family for VCNs
|
||
- radeonsi/vcn: fix the incorrect dt_size
|
||
|
||
Lina Versace (2):
|
||
|
||
- venus: Advertise 1.3 in ICD file
|
||
- venus: Fix detection of push descriptor set
|
||
|
||
LingMan (4):
|
||
|
||
- rusticl: core: stop using cl_prop from the api module
|
||
- rusticl: drop CLProp implementation for String
|
||
- rusticl: drop cl_prop_for_type macro
|
||
- rusticl: fix UB in CLProp machinery
|
||
|
||
Lionel Landwerlin (185):
|
||
|
||
- docs: add missing MESA_VK_WSI_HEADLESS_SWAPCHAIN variable
|
||
- vulkan/runtime: discard unused graphics stages in libraries
|
||
- intel/vec4: force exec_all on float control instruction
|
||
- anv: enable blorp query reset for performance queries
|
||
- vulkan/overlay: deal with unknown pNext structures
|
||
- isl: don't set inconsistent fields for depth when using stencil only
|
||
- anv: introduce a base graphics pipeline object
|
||
- anv: move force shading rate writes checks
|
||
- anv: make input attachments available through bindless
|
||
- anv: move preprocessing of NIR right before compilation
|
||
- anv: add dynamic buffer offsets support with independent sets
|
||
- anv: implement VK_EXT_graphics_pipeline_library
|
||
- anv: Work around the spec question about pipeline feedback vs GPL.
|
||
- isl: fix a number of errors on storage format support on Gfx9/12.5
|
||
- intel/nir: add options to storage image lowering
|
||
- anv: drop lowered storage images code
|
||
- anv: enable shaderStorageImageReadWithoutFormat on Gfx12.5+
|
||
- anv: rework Wa_14017076903 to only apply with occlusion queries
|
||
- intel/tools: add ability to dump out raw kernels data
|
||
- nir/divergence: add missing load_global_constant_* intrinsics
|
||
- anv: fix anv_nir_lower_ubo_loads pass
|
||
- anv: enable shaderUniformBufferArrayNonUniformIndexing
|
||
- intel/fs: fix per vertex input clamping
|
||
- nir/lower_non_uniform_access: add get_ssbo_size handling
|
||
- intel/compiler: make uses_pos_offset a tri-state
|
||
- vulkan: bump headers to 1.3.249
|
||
- spirv: update to latest headers
|
||
- spirv/nir: wire ray interection triangle position fetch
|
||
- intel/nir/rt: use a single load for instance leaf loading
|
||
- intel/nir/rt: wire position fetch intrinsic
|
||
- anv: implement VK_KHR_ray_tracing_position_fetch
|
||
- intel/fs: fix scheduling of HALT instructions
|
||
- anv: remove 48bit address space checks
|
||
- anv: avoid hardcoding instruction VA constant in shaders
|
||
- anv: link anv_bo to its VMA heap
|
||
- anv: make internal address space allocation more dynamic
|
||
- anv: increase instruction heap to 2Gb
|
||
- intel/fs: reduce register usage for relocated constants
|
||
- intel: enable protected context creation along with engines
|
||
- Revert "intel/compiler: make uses_pos_offset a tri-state"
|
||
- anv: fixup workaround 16011411144
|
||
- intel/mi_builder: fixup tests for newer kernel uAPI
|
||
- intel: switch over to unified atomics
|
||
- spirv: fix argument to ray query intrinsic
|
||
- intel/devinfo: printout on stdout
|
||
- intel/devinfo: allow -p to take a pci-id in hexa
|
||
- intel/devinfo: call intel_device_info_init_was only once
|
||
- anv: put private binding BOs into execlists
|
||
- anv: mark images compressed for untracked layout/access
|
||
- gitlab-ci: add capture for i915 error state
|
||
- anv: defer binding table block allocation to when necessary
|
||
- anv: assume context isolation support
|
||
- anv: fix push descriptor deferred surface state packing
|
||
- intel/fs: fix size_read() for LOAD_PAYLOAD
|
||
- anv: move timestamp vfunc initialization to genX code
|
||
- anv: use COMPUTE_WALKER post sync field to track compute work
|
||
- iris: use COMPUTE_WALKER post sync field to track compute work
|
||
- intel/fs: make tcs input_vertices dynamic
|
||
- anv: implement EDS2.extendedDynamicState2PatchControlPoints
|
||
- iris: rework Wa_14017076903 to only apply with occlusion queries
|
||
- intel: add alignment helper for aux map
|
||
- iris: add a comment about aux-tt alignment requirements
|
||
- anv: update aux-tt alignment requirements for MTL
|
||
- intel: reduce minimum memory alignment on Gfx12.5
|
||
- anv: further reduce pool alignments
|
||
- anv: opportunistically align VMA to 2Mb
|
||
- anv: update internal address space to have 4Gb of dynamic state
|
||
- anv: fix push range for descriptor offsets
|
||
- intel/fs: reuse descriptor helper
|
||
- intel/fs: lower get_buffer_size like other logical sends
|
||
- nir/lower_shader_calls: add ability to force remat of instructions
|
||
- nir: add a new intrinsic to describe resources accessed on intel
|
||
- nir: teach nir_chase_binding about resource_intel
|
||
- nir/opt_gcm: allow resource_intel to be moved anywhere
|
||
- intel/fs: add a pass to move resource_intel closer to user
|
||
- intel/fs: teach ubo range analysis pass about resource_intel
|
||
- intel/fs: keep track of new resource_intel information
|
||
- intel/fs: enable SSBO accesses through the bindless heap
|
||
- intel/fs: enable UBO accesses through bindless heap
|
||
- intel/fs: enable get_buffer_size on bindless heap
|
||
- intel/fs: enable extended bindless surface offset
|
||
- intel/fs: enable bindless sampler state offsets
|
||
- intel/fs: enable uniform block accesses through bindless heap
|
||
- intel/fs: try to rematerialize surface computation code
|
||
- anv: remove unused define
|
||
- anv: fix null descriptor handling with A64 messages
|
||
- anv: remove incorrect ifdef
|
||
- anv: bail flush_gfx_state when not gfx push constant is dirty
|
||
- anv: track pipeline in anv_cmd_pipeline_state
|
||
- anv: move pipeline active_stages to common structure
|
||
- anv: increase workaround BO so that we can hold a full 4Kb page of 0s
|
||
- anv: toggle extended bindless surface state on Gfx12.5+
|
||
- docs/anv: some binding table explanations
|
||
- anv: add an option for using indirect descriptors
|
||
- anv: introduce a new descriptor set layout type
|
||
- anv: create a pool for indirect descriptors
|
||
- anv: reduce push constant size for descriptor sets
|
||
- anv: new structure to hold surface states
|
||
- anv: add a pass to partially lower resource_intel
|
||
- nir: expose a couple of address format add helpers
|
||
- anv: bound load descriptor mem better
|
||
- anv: prepare image/buffer views for non indirect descriptors
|
||
- anv: add support for direct descriptor in allocation/writes
|
||
- anv: add helpers to build pipeline bindings
|
||
- anv: handle null surface in the binding table with direct descriptors
|
||
- anv: factor out dynamic buffer bti emission
|
||
- anv: implement binding table emission for direct descriptors
|
||
- anv: simplify ycbcr bti computations
|
||
- anv: track descriptor data size
|
||
- anv: add direct descriptor support to apply_layout
|
||
- anv: bring back the max number of sets to 8
|
||
- anv: descriptor binding for direct descriptors
|
||
- anv: ensure descriptor addresses are used with bindless stages
|
||
- anv: enable direct descriptors on platforms with extended bindless offset
|
||
- anv: add support for VK_EXT_dynamic_rendering_unused_attachments
|
||
- anv: remove unused functions
|
||
- intel/fs: fix a couple of descriptor mistakes
|
||
- intel/stub_gpu: add an option to launch valgrind
|
||
- intel/fs: fix pull-constant-load prior to gfx7
|
||
- anv: allow binding tables allocations on compute only queues
|
||
- intel/nir: switch ray query state tracking to local variables uint16_t
|
||
- anv: add query tracepoints
|
||
- anv: deal with unsupported VkImageFormatListCreateInfo::pViewFormats
|
||
- anv: report max simd width only once for fragment shaders
|
||
- anv: always report all pipeline stats regardless of stages
|
||
- anv: only disable mesh when enabled at the VkDevice level
|
||
- anv: disable mesh/task for generated draws
|
||
- anv: fix incorrect batch for 3DSTATE_CONSTANT_ALL emission
|
||
- anv: limit ANV_PIPE_RENDER_TARGET_BUFFER_WRITES to blorp operations using 3D
|
||
- anv: factor out generation kernel dispatch into helper
|
||
- anv: add support for simple internal compute shaders
|
||
- anv: generalize internal kernel concept
|
||
- anv: add shaders for copying query results
|
||
- intel/ds: add query count in query tracepoints
|
||
- anv: enable CmdCopyQueryPoolResults to use shader for copies
|
||
- intel/fs: fix bindless/shared surface mistake
|
||
- intel/fs: print identation for control flow
|
||
- intel/fs: avoid reusing the VGRF for uniform load_ubo
|
||
- nir: add a new ubo uniform loading intrinsic for intel
|
||
- intel/fs: make use of load_ubo_uniform_block_intel
|
||
- nir: add a load_global_constant uniform intel variant
|
||
- intel/fs: handle load_global_constant_uniform_block_intel
|
||
- anv: avoid private buffer allocations in vkGetDeviceImageMemoryRequirementsKHR
|
||
- anv: add missing query clear flush for acceleration structure queries
|
||
- anv: track buffer writes from shaders for query results writes
|
||
- anv: change the way we clear pending query bits
|
||
- anv: fix pending query bits for compute only command buffers
|
||
- anv: tracking query buffer writes & query clears separately
|
||
- anv: switch copy query results with shaders from semaphore waits to flushes
|
||
- vulkan: registry/headers bump to 1.3.254
|
||
- vulkan/runtime: add support for EXT_depth_bias_control
|
||
- anv: add VK_EXT_depth_bias_control support
|
||
- isl: assert on gfx6 condition that should not be met
|
||
- isl: assert on gfx7 condition that should not be met
|
||
- isl: assert on gfx8 condition that should not be met
|
||
- isl: add surface creation reporting mechanism
|
||
- anv: align buffers to a cache line
|
||
- anv: fix utrace batch allocation
|
||
- genxml: enable decoding on compute engine
|
||
- intel/aubinator_error_decode: add ccs support
|
||
- anv: look into batch bo reloc list looking for BOs to decode
|
||
- anv: implement storage image depth query using descriptor buffer read
|
||
- Revert "isl: Set Depth to array len for 3D storage images"
|
||
- docs/features: update anv entries
|
||
- intel/fs: disable coarse pixel shader with interpolater messages at sample
|
||
- nir/opt_shrink_vectors: enable sparse intrinsics shrinking
|
||
- docs/features: add more missing extensions
|
||
- docs/features: add hasvk entries
|
||
- zink: update profile vulkan version requirements
|
||
- zink: drop linear D32_SFLOAT_S8_UINT requirement
|
||
- anv: fix utrace signaling with Xe
|
||
- intel/fs: fix missing predicate on SEL instruction
|
||
- intel/fs: don't try to rebuild sequences of non ssa values
|
||
- anv: fix 3DSTATE_RASTER::APIMode field setting
|
||
- hasvk: fix null descriptor handling with A64 messages
|
||
- anv: don't try to access dynamic buffers from surface states
|
||
- intel/compiler: disable per-sample interpolation modes with non-per-sample dispatch
|
||
- anv: add missing ISL storage usage
|
||
- intel/nir: rerun lower_tex if it lowers something
|
||
- hasvk: add state cache invalidation back before fast clears
|
||
- anv: fix utrace timestamp buffer copies
|
||
- intel: don't assume Linux minor dev node
|
||
- blorp: switch blorp_update_clear_color to early return
|
||
- blorp: update and move fast clear PIPE_CONTROLs to drivers
|
||
- iris: ensure stalling pipe control before fast clear
|
||
|
||
Liviu Prodea (1):
|
||
|
||
- microsoft/clc: Don't build compiler test if build-tests is false
|
||
|
||
Lone_Wolf (3):
|
||
|
||
- compiler/clc: Fix embedded clang headers (microsoft-clc) for LLVM 16+
|
||
- clc: Add clangASTMatchers to fix static llvm build of microsoft-clc with LLVM 16+
|
||
- clc: Add clang frontendhlsl module to fix build of microsoft-clc with llvm 16+
|
||
|
||
Luc Ma (1):
|
||
|
||
- meson: keep Mako version checking in accord with build msg
|
||
|
||
Luca Bacci (1):
|
||
|
||
- Add checks for NULL dxil_validator
|
||
|
||
Luca Weiss (1):
|
||
|
||
- freedreno: Enable A506
|
||
|
||
Lucas Fryzek (6):
|
||
|
||
- broadcom: Add support for VK_FORMAT_A2R10G10B10_UNORM_PACK32
|
||
- broadcom: Fix slice memory allocation logic for compressed textures
|
||
- v3d: Add support for ASTC texture compression
|
||
- v3dv: Update texture padding logic to match v3d changes
|
||
- mailmap: Add Lucas Fryzek to mailmap
|
||
- gallium: Remove \`PIPE_CAP_RGB_OVERRIDE_DST_ALPHA_BLEND`
|
||
|
||
Lucas Stach (16):
|
||
|
||
- etnaviv: update derived state after forced commandstream flush
|
||
- etnaviv: don't flush implicit flush resources when forced
|
||
- etnaviv: rs: flush TS cache before making configuration changes
|
||
- etnaviv: rs: unconditionally flush color and depth cache before using RS
|
||
- etnaviv: optimize transfer flushes
|
||
- etnaviv: query: move sample counter manipulation into query providers
|
||
- etnaviv: query: reset sample count on begin_query
|
||
- etnaviv: query: remove incorrect comment
|
||
- etnaviv: query: correct max number of perfmon samples
|
||
- etnaviv: query: correct max number of occlusion query samples
|
||
- etnaviv: query: optimize context flushes
|
||
- mesa/st: discard whole resource when mapping drawpixels texture
|
||
- etnaviv: only emit sampler config for changed samplers
|
||
- etnaviv: move resource level dimension members to make comments line up
|
||
- etnaviv: rs: fix multisampled blits
|
||
- etnaviv: blt: fix multisampled blits
|
||
|
||
Luigi Santivetti (13):
|
||
|
||
- pvr: use PVR_DW_TO_BYTES for stream_link_space calculation
|
||
- pvr: add GUARD_SIZE_DEFAULT for CDM and VDM control stream links 1 and 2
|
||
- pvr: fixup stack overflow in {start,end}_sub_cmd
|
||
- pvr: introduce suballocator for internal allocations
|
||
- pvr: switch pvr_gpu_upload_* to use pvr_bo_suballoc
|
||
- pvr: switch pvr_cmd_buffer_alloc_mem to use pvr_bo_suballoc
|
||
- pvr: switch pvr_descriptor_set_create to use pvr_bo_suballoc
|
||
- pvr: switch pvr_clear to use pvr_bo_suballoc
|
||
- pvr: switch pvr_spm to use pvr_bo_suballoc
|
||
- pvr: fixup assert in pvr_cmd_buffer_alloc_mem
|
||
- pvr: fix division by block size in blit
|
||
- pvr: fixup transfer primary sub-command list
|
||
- pvr: do not claim support for ASTC texture compression
|
||
|
||
Luna Nova (5):
|
||
|
||
- device_select_layer: fix inverted strcmp in device_select_find_dri_prime_tag_default (v1)
|
||
- device_select_layer: apply DRI_PRIME even if default device is > 1 to match opengl behavior
|
||
- device_select_layer: pick a default device before applying DRI_PRIME
|
||
- device_select_layer: add MESA_VK_DEVICE_SELECT_DEBUG which logs why default selection was made
|
||
- device_select_layer: log selectable devices if MESA_VK_DEVICE_SELECT_DEBUG or DRI_PRIME_DEBUG are set
|
||
|
||
Lynne (4):
|
||
|
||
- radv/video: reject general unsupported video formats
|
||
- radv/video: reject non-8bit H264
|
||
- radv/video: reject unsupported hevc profiles and bit depths
|
||
- anv_video: reject decoding of unsupported profiles and formats
|
||
|
||
M Henning (12):
|
||
|
||
- nvc0: Use nir in nvc0_program_init_tcp_empty
|
||
- nvc0: Use nir in nvc0_blitter_make_vp
|
||
- nv50,nvc0: Use nir in nv50_blitter_make_fp
|
||
- nv50,nvc0: Stop advertising TGSI by default
|
||
- nv50,nvc0: Use ttn for tgsi shaders by default
|
||
- gallium: Add pipe_shader_state_from_nir
|
||
- nouveau/codegen: Check nir_dest_num_components
|
||
- nv50/codegen: Set lower_uniforms_to_ubo
|
||
- nouveau/nir: Set isSigned on all atomic_imax/imin
|
||
- nv50,nvc0: Free nir from blitter fp shader
|
||
- nvc0: Free blitter->vp
|
||
- nv50: Fix return type of nv50_blit_is_array
|
||
|
||
Marcin Ślusarz (17):
|
||
|
||
- intel: split URB space between task and mesh proportionally to entry sizes
|
||
- anv: move nir_shader_gather_info to anv_pipeline_nir_preprocess
|
||
- intel/tools: decode ACTHD printed by newer kernels
|
||
- nir: extract try_lower_id_to_index_1d
|
||
- nir: use wg id to wg idx shortcut if two dims of num_workgroups are 1
|
||
- nir: use constant components of num_workgroups in wg id to wg idx lowering
|
||
- nir: lower num_workgroups to constants
|
||
- intel/compiler: pass num_workgroups from task to mesh shaders
|
||
- nir: add cheap shortcut for wg id to wg idx lowering
|
||
- anv,intel/compiler: enable shortcut in wg id to wg idx lowering on >= gfx12.5
|
||
- intel/compiler: simplify reading of gl_NumWorkGroups in task/mesh
|
||
- anv: fix how NULL buffer_view is handled in anv_descriptor_set_write_buffer_view
|
||
- anv: pass anv_surface_state using a pointer
|
||
- anv: limit stack usage for anv_surface_state
|
||
- intel/compiler/mesh: compactify MUE layout
|
||
- intel/compiler,anv: put some vertex and primitive data in headers
|
||
- intel/compiler: load debug mesh compaction options once
|
||
|
||
Marek Olšák (169):
|
||
|
||
- nir: fix 2 bugs in nir_create_passthrough_tcs
|
||
- nir: lower load_barycentric_at_offset in lower_wpos_ytransform
|
||
- nir: assign IO bases in nir_lower_io_passes
|
||
- nir: skip nir_lower_io_passes for compute shaders
|
||
- nir: extend nir_opt_fragdepth to handle lowered IO
|
||
- nir: handle more opcodes in nir_lower_io_to_scalar
|
||
- nir: handle all varying slots in gl_varying_slot_name_for_stage
|
||
- nir: don't remove dead IO variables in nir_lower_io_passes for st_link_nir
|
||
- nir: rework nir_lower_color_inputs to work with lowered IO intrinsics
|
||
- nir: return a status from nir_remove_varying whether it removed the instruction
|
||
- nir: remove an obsolete comment from nir_gather_xfb_info_from_intrinsics
|
||
- nir: add next_stage parameter to nir_slot_is_sysval_output to return better info
|
||
- nir: add next_stage parameter to nir_remove_varying
|
||
- nir: set uses_wide_subgroup_intrinsics for all shader stages
|
||
- venus: fix the RHEL8 build by using syscall for gettid
|
||
- nir: rename ACCESS_STREAM_CACHE_POLICY -> ACCESS_NON_TEMPORAL and document
|
||
- nir: add/update comments for gl_access_qualifier
|
||
- ac/surface: don't expose modifiers with DCC retiling if radeon_info forbids it
|
||
- ac/gpu_info: disable display DCC on Raphael and Mendocino to improve power usage
|
||
- radeon: add radeon_info parameter into radeon_winsys::surface_init
|
||
- radeonsi: do AMD_DEBUG=nodisplaydcc differently to also remove modifiers
|
||
- aco: don't treat ACCESS_NON_READABLE as ACCESS_COHERENT
|
||
- ac/llvm: don't treat ACCESS_NON_READABLE as ACCESS_COHERENT
|
||
- ac/llvm: rewrite and unify how GLC, DLC, SLC are set
|
||
- nir/lower_io: don't renumber VS inputs when not called from a linker
|
||
- ac/surface: fix address calculation for large images by using uint64_t
|
||
- radv: fix sparse image address calculation for large images by using uint64_t
|
||
- radv: fix SDMA image address calculation for large images by using uint64_t
|
||
- radeonsi: fix SDMA image address calculation for large images by using uint64_t
|
||
- radeonsi: fix image address calculation for large images by using uint64_t
|
||
- radeonsi: fix sparse image address calculation for large images by using uint64_t
|
||
- radeonsi: fix image size calculation in fast clear
|
||
- ac/surface: clean up and move the PIPE_CONFIG helper to ac_surface.c
|
||
- ac/surface: define LINEAR_PITCH_ALIGNMENT
|
||
- ac/surface: validate overridden pitch for all chips
|
||
- ac/surface: fix overridden linear pitch for CPU access
|
||
- ac/surface: add ac_surf_config::is_array
|
||
- amd/registers: update pitch definitions in descriptors
|
||
- mesa: fix a VBO buffer reference leak in _mesa_bind_vertex_buffer
|
||
- ac,radeonsi,winsyses: switch to SPDX-License-Identifier: MIT
|
||
- winsys/radeon: set has_image_opcodes to unbreak gfx6-7
|
||
- winsys/radeon: fix the scratch buffer on gfx6-7
|
||
- winsys/radeon: set more radeon_info fields
|
||
- ac/gpu_info: give has_msaa_sample_loc_bug a more accurate name
|
||
- ac/surface: move CB format translation helpers here
|
||
- ac/surface: move determing ADDR_FMT_* into a helper function
|
||
- ac/llvm: clean up translation of nir_intrinsic_load_invocation_id
|
||
- ac/llvm: clean up visit_load_local_invocation_index and visit_load_subgroup_id
|
||
- ac/llvm: use LLVM 0/1 constants from ac_llvm_context instead of LLVMConstInt
|
||
- radeonsi/gfx11: fix alpha-to-coverage with blending
|
||
- radeonsi: reorder code in si_texture_create_object as preparation for the future
|
||
- radeonsi: cosmetic changes in si_shader.h
|
||
- radeonsi: remove the gl_SampleMask FS output if MSAA is disabled
|
||
- radeonsi: don't enable WGP_MODE because of high cost of workgroup mem coherency
|
||
- radeonsi: move emitting draws states out of si_emit_all_states
|
||
- radeonsi/gfx11: use DISABLE_FOR_AUTO_INDEX to disable non-indexed prim restart
|
||
- radeonsi: reduce the supported compute grid size
|
||
- radeonsi: update test results and flakes
|
||
- radeonsi: re-enable fp16_rtz for compute blits to fix PBO tests on gfx11
|
||
- amd/addrlib: switch the license to the SPDX identifier MIT
|
||
- amd/addrlib: add ADDR_FMT_BG_RG_16_16_16_16
|
||
- ac/surface: fix is_linear for stencil-only surfaces
|
||
- ac/nir: handle DEPTH as PITCH in ac_nir_lower_resinfo
|
||
- radeonsi: implement setting a custom pitch to any multiple of 256B on gfx10.3+
|
||
- radv: implement setting a custom pitch to any multiple of 256B on gfx10.3+
|
||
- ac/surface: relax custom pitch requirements to any multiple of 256B on gfx10.3+
|
||
- ac/surface: fix R32G3B32 image format regression for gfx6-8
|
||
- ac/nir/ngg: always use load_initial_edgeflags_amd, choose the value in drivers
|
||
- amd: add radeon_info* into ac_llvm_context and radv_nir_compiler_options
|
||
- radeonsi: define si_shader_io_get_unique_index() values as SI_UNIQUE_SLOT_*
|
||
- radeonsi: remove gl_BackColor VS outputs on demand if color_two_side is disabled
|
||
- radeonsi: export non-zero edgeflags for GS and tess
|
||
- radeonsi/gfx11: extend DB_Z_INFO.NUM_SAMPLES programming to > GFX11
|
||
- radeonsi: print shader-db stats with AMD_DEBUG=vs,ps,stats
|
||
- radeonsi: use nir_lower_alu_to_scalar correctly
|
||
- radeonsi: remove a useless depth texture function call in a fast color clear
|
||
- radeonsi: add a gfx11 version of si_decompress_textures, add assertions < GFX11
|
||
- radeonsi: remove RADEON_FLAG_MALL_NOALLOC due to no use
|
||
- radeonsi: completely rewrite how VGT_SHADER_STAGES_EN is set
|
||
- radeonsi: unduplicate si_translate_format_to_hw
|
||
- radeonsi: decompress DCC for SDMA if we're really going to use SDMA
|
||
- radeonsi: increase SDMA gfx9+ limits
|
||
- radeonsi: split tracked_regs masks into context registers and other registers
|
||
- radeonsi: reorder and comment tracked registers
|
||
- radeonsi: move PA_CL_NGG_CNTL emission into rasterizer state
|
||
- radeonsi: always set sample locations even for 1x MSAA for simplicity
|
||
- radeonsi: adjust 16x EQAA sample locs to make PA_SU_PRIM_FILTER_CNTL immutable
|
||
- radeonsi: move PA_SU_SMALL_PRIM_FILTER_CNTL to the preamble when possible
|
||
- radeonsi: merge si_emit_msaa_sample_locs with si_emit_sample_locations
|
||
- radeonsi: rename the msaa_sample_locs state to sample locations
|
||
- radeonsi: optimize no-op primitive restart index changes thanks to index masking
|
||
- radeonsi: don't program COMPUTE_MAX_WAVE_ID (GDS register) on gfx6
|
||
- radeonsi: add helpers to create and clone a sized pm4 state
|
||
- radeonsi: add a separate gfx10_init_gfx_preamble_state function
|
||
- radeonsi: don't set registers set by CLEAR_STATE in the preamble for gfx10-11
|
||
- radeonsi: add a separate cdna_init_compute_preamble_state function
|
||
- radeonsi/ci: add gfx6 failures
|
||
- radeonsi: re-indent gfx10_create_sh_query_result_cs
|
||
- radeonsi: don't use SET_SH_REG_INDEX on gfx7-9
|
||
- radeonsi: don't use SET_SH_REG_INDEX if the kernel doesn't use CU reservation
|
||
- amd: remove unused PKT0 definitions
|
||
- treewide: use uint64_t / (u)intptr_t in image address calculations
|
||
- amd: drop support for LLVM 11
|
||
- amd: drop support for LLVM 12
|
||
- amd: drop support for LLVM 13
|
||
- amd: drop support for LLVM 14
|
||
- mesa: fix glBitmap in display lists when width <= 0 || height <= 0
|
||
- gallium/hud: append results to files instead of overwriting them
|
||
- radeonsi: don't convert L8A8 to R8A8 when blitting via compute to fix gfx7
|
||
- amd: update SET_*_REG_PAIRS* documentation and remove radeon_info options
|
||
- amd: improve the IB parser, parse more packets
|
||
- amd: rename mid_command_buffer_preemption_enabled -> register_shadowing_required
|
||
- amd: increase the attribute ring size on gfx1103_r1
|
||
- amd: don't set PA_RATE_CNTL because it has no effect
|
||
- amd: fix GPU cache sizes retrieved from the kernel
|
||
- amd: remove non-shadowed register tables
|
||
- amd: remove ac_check_shadowed_regs
|
||
- amd: add a new helper that prints all non-shadowed regs
|
||
- amd: update shadowed register tables for gfx11
|
||
- amd: skip redundant PKT3_NUM_INSTANCES even with register shadowing
|
||
- amd: skip redundant INDEX_TYPE even with register shadowing
|
||
- radeonsi: set register_shadowing_enabled if AMD_DEBUG=shadowregs is set
|
||
- radeonsi/ci: add glx@glx-visuals-stencil to skips because it gets stuck often
|
||
- radeonsi: fix RB+ and gfx11 issues with framebuffer state
|
||
- radeonsi: change si_emit_derived_tess_state into a state atom
|
||
- radeonsi: shrink the last field of tcs_offchip_layout due to LDS limit
|
||
- radeonsi: don't do PFP_SYNC_ME before CP DMA and compute blits
|
||
- radeonsi: don't needlessly invalidate L0/L1 caches at the beginning of IBs
|
||
- radeonsi: add more variables into si_pm4_state and rework how it's created
|
||
- radeonsi: remove sscreen parameter from si_pm4_set_reg_idx3
|
||
- radeonsi: set non-graphics uconfig registers first in the preamble
|
||
- radeonsi: handle demoted si_pm4_set_reg_idx3 as si_pm4_set_reg
|
||
- radeonsi: eliminate redundant compute SH register changes
|
||
- radeonsi: handle VGT_GS_OUT_PRIM_TYPE like a tracked register
|
||
- radeonsi: handle VGT_LS_HS_CONFIG like a tracker register
|
||
- radeonsi: handle GE_CNTL and IA_MULTI_VGT_PARAM as a tracked register
|
||
- radeonsi: remove gfx10 NGG streamout
|
||
- ci: remove clang-format testing
|
||
- intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR
|
||
- radeonsi: move TCS.gl_PatchVerticesIn into the tcs_offchip_layout SGPR
|
||
- radeonsi: replace tcs_out_lds_layout with nearly identical tes_offchip_addr
|
||
- radeonsi: move the only tcs_out_lds_offsets field to vs_state_bits
|
||
- radeonsi: eliminate redundant TCS user data and RSRC2 register changes
|
||
- radeonsi/gfx11: use SET_*_REG_PAIRS_PACKED packets for pm4 states
|
||
- radeonsi: determine si_pm4_state::reg_va_low_idx automatically
|
||
- radeonsi: keep pipeline statistics disabled when they are not used
|
||
- radeonsi: don't do BREAK_BATCH for context regs with only 1 context per batch
|
||
- radeonsi: use si_pm4_create_sized for the shadowing preamble
|
||
- radeonsi: remove radeon_winsys::cs_set_preamble
|
||
- radeonsi: remove uses_reg_shadowing parameter from si_init_gfx_preamble_state
|
||
- radeonsi/gfx11: fix GLCTS with register shadowing by keeping the CS preamble
|
||
- radeonsi/gfx11: enable register shadowing by default
|
||
- radeonsi: reorder compute code to prepare for packed SET_SH_REG packets
|
||
- radeonsi/gfx11: use SET_SH_REG_PAIRS_PACKED for gfx by buffering reg writes
|
||
- radeonsi/gfx11: use SET_SH_REG_PAIRS_PACKED for compute by buffering reg writes
|
||
- radeonsi: clean up query functions, make them static, remove forward decls
|
||
- radeonsi: declare compiler[] and nir_options as pointers to reduce #includes
|
||
- radeonsi: clean up #includes
|
||
- Revert "egl: return correct error for EGL_KHR_image_pixmap"
|
||
- vbo: correctly restore _VaryingInputs for display list fast path
|
||
- radeonsi/gfx11: only use SET_*_PAIRS* packets on dGPUs
|
||
- radeonsi: fix gfx9 regression causing GPU hangs
|
||
- radeonsi/gfx11: fix a regression with PAIRS packets due to shader changes
|
||
- Revert "ac/nir/ngg: Follow intrinsic sources when analyzing before culling."
|
||
- glthread: determine global locking once every 64 batches to fix get_time perf
|
||
- mesa: fix 38% decrease in display list performance of Viewperf2020/NX8_StudioAA
|
||
- util/u_queue: fix util_queue_finish deadlock by merging lock and finish_lock
|
||
- radeonsi: fix a CDNA regression breaking compute
|
||
- Revert "ac: don't call ac_query_pci_bus_info from ac_query_gpu_info"
|
||
|
||
Mark Collins (1):
|
||
|
||
- ir3/a7xx: Add definitions for (last) src GPR attribute
|
||
|
||
Mark Janes (8):
|
||
|
||
- intel/dev: update mesa_defs.json from defect database
|
||
- intel/dev: report stepping for TGL systems
|
||
- intel/dev: switch defect identifiers to use lineage numbers
|
||
- isl: use generated workaround helpers for Wa_1806565034
|
||
- iris: convert Wa_14010455700 to use workaround mechanism
|
||
- anv: convert Wa_14010455700 to use workaround mechanism
|
||
- intel: use generated helpers for Wa_1508744258
|
||
- intel/dev: update mesa_defs.json from defect database
|
||
|
||
Martin Roukala (né Peres) (26):
|
||
|
||
- radv/ci: disable the vkcts-navi21-llvm-valve job
|
||
- radv/ci: document all the flakes we hit while I was away
|
||
- ci/b2c: allow not specifying a reboot condition
|
||
- radv/ci: only reboot on hangs for vkcts-navi10-valve
|
||
- zink/ci: document that some tests no longer fail
|
||
- zink/ci: mark 77 multisample-related tests as fixed
|
||
- radv/ci: document another vkcts flake on vega10
|
||
- radv/ci: document a series of recent regressions
|
||
- zink/ci: document recent fixes on RADV
|
||
- zink/ci: document new flakes on RADV
|
||
- radv/ci: document more flakes for navi21
|
||
- radv/ci: switch to b2c v0.9.10
|
||
- ci/b2c: update to mesa-trigger:2023-03-08.1
|
||
- zink/ci: add more QBO-related fails on RADV
|
||
- amd/ci: add another test to the vkcts-vega10 flake list
|
||
- zink/ci: remove spec@nv_shader_atomic_int64@* from the fail lists
|
||
- ci: bring back the valve farm
|
||
- ci/b2c: select the DUT to run on by name
|
||
- radv/ci: use the low-priority runners for vangogh jobs
|
||
- ci/b2c: change the default first-console-activity timeout to 2 minutes
|
||
- zink/ci: add more tests to the flake list of vangogh
|
||
- zink/ci: enable zink-radv-vangogh-valve for pre-merge testing
|
||
- Revert "ci: mark the valve farm as down"
|
||
- amd/ci: temporarily disable some manual jobs that take a long time to run
|
||
- zink/ci: remove 3 tests from the fails list
|
||
- Revert "amd/ci: temporarily disable some manual jobs that take a long time to run"
|
||
|
||
Martin Stransky (1):
|
||
|
||
- llvmpipe: fix UAF in lp_scene_is_resource_referenced.
|
||
|
||
Matt Coster (57):
|
||
|
||
- pvr: Complete pvr_isp_ctrl_stream()
|
||
- pvr: Fully declare support for VK_EXT_private_data
|
||
- pvr: Remove false assumption from pvr_write_draw_indirect_vdm_stream()
|
||
- pvr: Fixup format features
|
||
- pvr: Unmap mapped memory on free
|
||
- pvr: Correctly validate PBE accum format
|
||
- pvr: Actually check for depth load when setting up load op constants
|
||
- pvr: Initialize aspect_mask when creating buffer views
|
||
- pvr: Correctly compile graphics pipelines without a fragment shader
|
||
- pvr: Fix off-by-one in pvr_cmd_buffer_upload_desc_set_table() assert
|
||
- pvr: Remove unneeded assert in pvr_get_hw_clear_color()
|
||
- pvr: Set output_offset correctly in pvr_clear_color_attachment_static()
|
||
- pvr: Return correct pbe_accum_format size for A2B10G10R10_UINT_PACK32
|
||
- pvr: Remove bad assert in pvr_clear_attachments()
|
||
- pvr: Add PVR_DEBUG=vk_desc option to dump descriptor set layouts
|
||
- pvr: Simplify descriptor set layout dump separators
|
||
- pvr: Return VkResult from pvr_winsys_create()
|
||
- pvr: Propagate errors as VkResults from ioctls through winsys
|
||
- pvr: Fix incorrect error return in pvr_ctx_sr_programs_setup()
|
||
- pvr: Fix incorrect error handling in pvr_render_ctx_switch_init()
|
||
- pvr: Squeeze fd handling into winsys layer
|
||
- pvr: Drop pdevice from pvr_physical_device_get_supported_extensions()
|
||
- pvr: Rename primary_{device,fd,path} to display_*
|
||
- pvr: Use common physical device enumeration
|
||
- pvr: Assorted cleanup
|
||
- pvr: Return VkResult from winsys buffer_map operation
|
||
- pvr: Fix allocation scopes in vkCreateRenderPass2() code path
|
||
- pvr: Fix memory leaks on realloc failure in pvr_pipeline.c
|
||
- pvr: Correct error flow in pvr_graphics_pipeline_compile()
|
||
- pvr: Correct error flow in pvr_compute_pipeline_compile()
|
||
- pvr: Use correct surface for deferred RTA clear
|
||
- pvr: Rename shadowing loop variable in pvr_add_deferred_rta_clear()
|
||
- pvr: Do not free deferred pvr_transfer_cmd instances
|
||
- pvr: Fix out of range stream errors for geometry-only jobs on pvrsrvkm
|
||
- pvr: Reorder execution in pvr_cmd_buffer_end_sub_cmd()
|
||
- pvr: Fix page faults in occlusion query tests
|
||
- pvr: Fix rect splitting logic in pvr_unwind_rects()
|
||
- pvr: Use correct pbe format for VK_FORMAT_A8B8G8R8_UNORM_PACK32
|
||
- pvr: Use common vkGetPhysicalDeviceFeatures2() implementation
|
||
- pvr: Fix segfault in pvr_physical_device_init()
|
||
- pvr: Move pvr_get_isp_num_tiles_xy() to rogue_hw_utils.h
|
||
- pvr: Use pvr_sub_cmd_event union members directly
|
||
- pvr: Add wait_on_previous_transfer flag to graphics subcommand
|
||
- pvr: Cleanup in pvr_process_cmd_buffer()
|
||
- pvr: Add pvr_image_view_get_image()
|
||
- pvr: Publicise some static functions from pvr_blit.c
|
||
- pvr: Rename ds_{image,iview} in pvr_gfx_sub_cmd_job_init()
|
||
- pvr: Implement ZLS subtile alignment
|
||
- pvr: Correct calculations in pvr_unwind_rects()
|
||
- pvr: Refactor pvr_unwind_rects()
|
||
- pvr: Allow S8_UINT to be used as a stencil attachment format
|
||
- pvr: Don't overwrite PDS vertex input flags
|
||
- pvr: Declare dependency on idep_mesautil
|
||
- pvr: Add support for sampler border colors
|
||
- pvr: Correctly read dynamic state setup during blend constant setup
|
||
- pvr: Advance entry pointer in pvr_setup_vertex_buffers()
|
||
- pvr: Rename transfer 3D heap to transfer frag heap
|
||
|
||
Matt Turner (13):
|
||
|
||
- intel: Disable shader cache when executing intel_clc during the build
|
||
- u_format: Use memcpy to avoid unaligned accesses
|
||
- meson: Remove reference to removed SWR driver
|
||
- anv: Pipe anv_physical_device to anv_get_image_format_features2
|
||
- anv: Only expose video decode bits with KHR_video_decode_queue
|
||
- intel: Rearrange for next commit
|
||
- intel: Consider with_intel_clc in with_any_intel
|
||
- intel: Only build blorp if drivers are enabled
|
||
- intel: Only build ds if drivers are enabled
|
||
- intel: Only build perf if drivers or tools are enabled
|
||
- intel: Allow using intel_clc from the system
|
||
- intel: Limit Intel Vulkan RT to x86_64
|
||
- Revert "intel/fs: only avoid SIMD32 if strictly inferior in throughput"
|
||
|
||
Matthieu Bouron (1):
|
||
|
||
- lavapipe: honor dst base array layer when resolving color attachments
|
||
|
||
Michael Tretter (2):
|
||
|
||
- panfrost: remove BO from cache before closing GEM
|
||
- kmsro: assert that scanout refcount is larger than 0
|
||
|
||
Michel Dänzer (17):
|
||
|
||
- ci: Explicitly test for meson feature checks in compiler wrapper
|
||
- ci: Use set -e in frontend compiler wrapper scripts.
|
||
- ci: Remove shebang from backend compiler wrapper script
|
||
- ci: Drop executable permissions from backend compiler wrapper script
|
||
- tgsi: Make ureg_DECL_output_masked definition match its declaration
|
||
- llvmpipe: Make lp_build_interp_soa declaration match its definition
|
||
- mesa/st: Make st_convert_image(_from_unit) declaration match definition
|
||
- vulkan: Fix GetPhysicalDeviceSparseImageFormatProperties definition
|
||
- anv/format: Fix GetPhysicalDeviceSparseImageFormatProperties definition
|
||
- vulkan: Fix GetPhysicalDeviceSparseImageFormatProperties definitions
|
||
- svga: Make vmw_svga_winsys_buffer_map definition match declaration
|
||
- svga: Make declaration of emit_input_declaration match definition
|
||
- clover/llvm: Use llvm::DataLayout::getABITypeAlign with LLVM >= 16
|
||
- clover/llvm: Use std::nullopt already with LLVM 16
|
||
- ci: Drop -Wno-error=array-bounds from fedora-release job
|
||
- ci: Upgrade fedora-release job to Fedora 38
|
||
- ci: Enable rusticl in the fedora-release job
|
||
|
||
Michel Zou (4):
|
||
|
||
- vulkan/wsi: fix -Wnarrowing warning
|
||
- vk/entry_points:: fix mingw build
|
||
- mesa/draw: fix -Wformat warning
|
||
- util: reinstate ENUM_PACKED
|
||
|
||
Mihai Preda (1):
|
||
|
||
- nir: update nir->num_inputs, num_outputs in nir_recompute_io_bases()
|
||
|
||
Mike Blumenkrantz (364):
|
||
|
||
- mesa/st/program: don't init xfb info if there are no outputs
|
||
- zink: remove atomics from zink_query
|
||
- zink: pass ctx through query destroy paths
|
||
- zink: always defer query pool deletion
|
||
- zink: remove screen param from zink_prune_query()
|
||
- util/cpu: add big.LITTLE cpu detection
|
||
- driconf: rework glthread enablement
|
||
- glthread: disable by default with fewer than 4 (big) CPUs
|
||
- zink: move memoryTypeIndex selection down in general bo allocation
|
||
- zink: slightly rework memoryTypeIndex selection to pre-determine heap
|
||
- zink: restore BAR allocation failure demotion
|
||
- zink: make general bo allocation more robust by iterating
|
||
- zink: avoid zero-sized memcmp for descriptor layouts
|
||
- iris: use util_framebuffer_get_num_samples when setting ps dispatch samples
|
||
- nir/lower_alpha_test: rzalloc state slots
|
||
- zink: fix non-db bindless texture buffers
|
||
- util/blitter: fix line wrapping on error to avoid giving wrong line number
|
||
- glthread: add newline to env override
|
||
- zink: emit demote cap when using demote
|
||
- zink: only print copy box warning once per resource
|
||
- zink: hook up debug callback
|
||
- zink: use a perf_debug() macro for debug message logging of copy box warning
|
||
- util/debug: move null checks out of debug message macro
|
||
- zink: manually re-set framebuffer after msrtss replicate blit
|
||
- zink: handle 'blitting' flag better in msrtss replication
|
||
- zink: skip msrtss replicate if the attachment will be full-cleared
|
||
- zink: avoid recursion during msrtss blits from flushing clears
|
||
- zink: don't bitcast bool deref loads/stores
|
||
- zink: zink_shader_free -> zink_gfx_shader_free
|
||
- zink: split out generic shader destruction for reuse
|
||
- zink: always wait on precompile fence at start of zink_gfx_shader_free()
|
||
- zink: call zink_shader_free for compute shaders
|
||
- zink: add a util function for printing shaders
|
||
- zink: don't create separate shader dsls if there are no bindings
|
||
- drisw: don't leak the winsys
|
||
- zink: check for extendedDynamicState3DepthClipNegativeOneToOne for ds3 support
|
||
- mesa/st: try to block multisampled texsubimage from doing cpu writes
|
||
- mesa: fix ms fallback texture creation
|
||
- draw: fix viewmask iterating
|
||
- zink: use tes to generate tcs
|
||
- zink: hook up EXT_shader_object
|
||
- zink: wrap zink_shader_compile_separate() return
|
||
- zink: wrap return of compile_module()
|
||
- zink: make zink_shader_spirv_compile static
|
||
- zink: more zink_shader_object conversion
|
||
- zink: use zink_shader_object for precompiled separate shaders
|
||
- zink: minor whitespace cleanup
|
||
- zink: move separate shader dsl creation to compiler function
|
||
- zink: add a 'separate' flag to shader module compile to indicate separate shaders
|
||
- zink: run bo lowering passes for separate shader compile with uniform inlining
|
||
- zink: remove redundant compute program batch ref
|
||
- zink: use EXT_shader_object to (re)implement separate shaders
|
||
- zink: add validation exceptions for shader object extension enable
|
||
- zink: don't pin flush queue threads if no threads exist
|
||
- zink: add z32s8 as mandatory GL3.0 profile attachment format
|
||
- zink: add a driver workaround to disable background compiles
|
||
- nir/gs: fix array type copying for passthrough gs
|
||
- zink: fix array copying in pv lowering
|
||
- gallivm: break out native vector width calc for reuse
|
||
- llvmpipe: do late init for llvm builder
|
||
- zink: print the type of shader when dumping
|
||
- zink: use intermediate variable for separate shader descriptor update loop
|
||
- zink: use intermediate variable for separate shader db resize check
|
||
- zink: simplify separate shader prog init a little
|
||
- zink: streamline separate shader descriptor update
|
||
- zink: switch to a regular loop to wait on precompile shader fences
|
||
- zink: move some shader CSO functions around
|
||
- zink: assign separate shader prog stages from ctx->shader_stages
|
||
- zink: use a more standardized loop for initing separate shader program descriptors
|
||
- zink: move separate shader creation to shader CSO creation
|
||
- zink: handle all stages in fixup_io_locations()
|
||
- zink: fix longstanding TODO for generated tcs
|
||
- zink: use EXT_shader_object to implement generic separate shader precompile
|
||
- bump VVL to 1.3.248
|
||
- zink: prune some validation errors from ci
|
||
- zink: break out VkImageViewUsageCreateInfo applying for reuse
|
||
- zink: reapply VkImageViewUsageCreateInfo when rebinding a surface
|
||
- zink: add a workaround for a nir_assign_io_var_locations bug
|
||
- zink: don't run update_so_info if shader has no outputs
|
||
- zink: add ZINK_DEBUG=noshobj to disable EXT_shader_object
|
||
- zink: rename 'separate' param in shader compilation to 'can_shobj'
|
||
- zink: explicitly block sample shading in the GPL precompile path
|
||
- zink: add zink_program::uses_shobj for managing shader object binds
|
||
- zink: use local screen var in zink_gfx_program_update_optimal()
|
||
- zink: deduplicate separable program replacement handling
|
||
- zink: delete redundant conditional
|
||
- zink: use zink_shader_object for zink_shader_module
|
||
- zink: use zink_destroy_shader_module() for compute to deduplicate code
|
||
- zink: store spirv onto zink_shader_object structs
|
||
- zink: allow zink_shader_module to be either a shobj or a mod using a bool
|
||
- zink: avoid accessing zink_gfx_program::modules during pipeline compile
|
||
- zink: add a union to zink_gfx_pipeline_cache_entry for gpl
|
||
- zink: use zink_shader_object for pipeline compiles from zink_gfx_program
|
||
- zink: make zink_shader_spirv_compile public
|
||
- zink: enable EXT_shader_object for generic precompiles
|
||
- draw: fix robust ubo size calc
|
||
- ci: disable all a306/a530/a630 jobs
|
||
- llvmpipe: fix native vector width init
|
||
- zink: update amdpro fails
|
||
- zink: add extendedDynamicState3DepthClipNegativeOneToOne to profile
|
||
- zink: only unset a generated tcs if the bound tcs is the generated one
|
||
- Revert "zink: don't create separate shader dsls if there are no bindings"
|
||
- zink: disable a630 traces
|
||
- zink: set depth dynamic state values unconditionally
|
||
- zink: null some descriptor buffer pointers during destruction
|
||
- zink: sync queries at the end of cmdbufs
|
||
- cso: unbind fb state when unbinding the context
|
||
- i915: use util_copy_framebuffer_state to set fb state
|
||
- i915: use util_unreference_framebuffer_state to unref fb state
|
||
- iris: use util_unreference_framebuffer_state to unref fb state
|
||
- softpipe: use util_unreference_framebuffer_state to unref fb state
|
||
- v3d: use util_unreference_framebuffer_state to unref fb state
|
||
- vc4: use util_unreference_framebuffer_state to unref fb state
|
||
- llvmpipe: use util_unreference_framebuffer_state to unref fb state
|
||
- svga: use util_unreference_framebuffer_state to unref fb state
|
||
- zink: move EXT_shader_object check to another place
|
||
- zink: break out optimal key handling into separate function
|
||
- zink: disable EXT_shader_object if !optimal_keys
|
||
- zink: add ZINK_DEBUG=optimal_keys
|
||
- gallium: pipe_rasterizer_state::point_tri_clip -> point_line_tri_clip
|
||
- aux/draw: guard_band_points_xy -> guard_band_points_lines_xy
|
||
- aux/draw: add guardband clipping for lines
|
||
- zink: don't init mutable resource bit for swapchain images
|
||
- zink: don't init mutable for swapchain src during blit
|
||
- tgsi_to_nir: handle PIPE_CAP_NIR_COMPACT_ARRAYS for clipdistance
|
||
- zink: allow vk 1.2 timelineSemaphore feature if extension isn't supported
|
||
- zink: stringify unsupported prim restart log error
|
||
- zink: delete persistent map tracking
|
||
- zink: add PERSISTENT for db buffer maps
|
||
- zink: delete unnecessary pipeline stage flags from inference
|
||
- zink: use an intermediate variable for binding ssbo slots
|
||
- zink: unbind the ssbo slot being iterated, not the index of the buffer
|
||
- zink: flush INDIRECT_BUFFER mem barrier for compute
|
||
- zink: disable batched unordered barries with ZINK_DEBUG=noreorder
|
||
- zink: block batching of unordered barriers if previous usage was write
|
||
- zink: fix uncached memory readback
|
||
- glsl/lower_samplers_as_deref: apply bindings for unused samplers
|
||
- vulkan/runtime: add VK_DYNAMIC_STATE_ATTACHMENT_FEEDBACK_LOOP_ENABLE_EXT
|
||
- zink: add ZINK_DEBUG=noopt
|
||
- zink: add ZINK_DEBUG=nobgc
|
||
- zink: make mesa_logw separate from perf_debug
|
||
- zink: add perf_debug for "interesting" shader compiles
|
||
- zink: set debug callback on context
|
||
- zink: bind bindless db set when updating separate shader db sets
|
||
- zink: compare desc set to detect bindless vars in separate shaders
|
||
- zink: adjust bindless texel buffer handle before indexing
|
||
- zink: block more flushes during unordered blits
|
||
- zink: also cache swapchain semaphores
|
||
- zink: disable always zs feedback loop on radv
|
||
- zink: add back some anv qbo flakes
|
||
- zink: disable have_EXT_vertex_input_dynamic_state without EDS2
|
||
- zink: disable dynamic state exts if the previous ones aren't present
|
||
- zink: add some ci flakes
|
||
- zink: don't leak swapchain readback semaphores
|
||
- zink: destroy current batch state after all other batch states
|
||
- zink: reorder some native blit code
|
||
- zink: reject blits where src/dst is 3D and dst/src z!=0
|
||
- zink: reorder some image copy code
|
||
- zink: ignore no-op image copies
|
||
- zink: only add feedback loop usage bit if extension is supported
|
||
- lavapipe: EXT_attachment_feedback_loop_layout_dynamic_state
|
||
- zink: slightly simplify bda allocation chaining
|
||
- zink: hook up some memory extensions
|
||
- zink: set higher prio on dedicated memory allocations
|
||
- zink: flag batch usage on swapchain images
|
||
- vulkan/wsi: add feedback loop usage to swapchain caps if supported
|
||
- zink: add feedback loop usage for swapchains
|
||
- vtn: add spirv index to type mismatch error for debugging
|
||
- vtn: print spirv id for type mismatch error
|
||
- vtn: print spirv ids for type mismatch in bcsel
|
||
- vtn: add more info to bitcast bit size error message
|
||
- zink: try update fb resource refs when starting new renderpass
|
||
- zink: add special-casing for (not) reordering certain image barriers
|
||
- zink: use batch usage function for a simple case
|
||
- zink: move zink_batch_state::submit_count to zink_batch_usage
|
||
- zink: move batch usage to substruct on zink_bo objects
|
||
- zink: track/check submit info on resource batch usage
|
||
- zink: disable unordered blits when swapchain images need aqcuire
|
||
- zink: explicitly disable reordering after restricted swapchain readback blits
|
||
- zink: explicitly disable promotion on images that are both unflushed and non-reorderable
|
||
- zink: flag 'has_work' on batch when promoting a cmd
|
||
- lavapipe: more correctly handle null pipeline states
|
||
- anv: more correctly handle null pipeline states
|
||
- vk/graphics_state: handle null pipeline state structs in creation
|
||
- zink: promote flushed clears to unordered cmdbuf when possible
|
||
- zink: also declare int size caps inline with signed int type usage
|
||
- zink: delete unnecessary bitcast in load_shared/scratch
|
||
- zink: use void return for store_dest
|
||
- zink: move get_alu_type() up in file
|
||
- zink: manually memcpy the spirv instruction buffer
|
||
- zink: write out register variables to a separate spirv buffer
|
||
- zink: dynamically emit non-bool register values using local_vars spirv buffer
|
||
- zink: store and use alu types for ntv defs
|
||
- zink: infer types from load_const instrs to avoid more bitcasts
|
||
- lavapipe: bump memory allocation heap to 3GiB
|
||
- lavapipe: report full memory in heap for 64bit processes
|
||
- lavapipe: EXT_memory_budget
|
||
- lavapipe: EXT_memory_priority
|
||
- lavapipe: store memory allocation size onto lvp_device_memory
|
||
- lavapipe: VK_EXT_pageable_device_local_memory
|
||
- zink: don't wait on queue thread if disabled
|
||
- zink: use the per-context track_renderpasses flag in more places
|
||
- zink: don't remove psiz from linked shaders if the consumer reads it
|
||
- zink: don't propagate psiz in quads emulation gs
|
||
- lavapipe: VK_EXT_dynamic_rendering_unused_attachments
|
||
- zink: require EXT_dynamic_rendering_unused_attachments for dynamic rendering
|
||
- zink: explicitly avoid ci errors due to unrecognized extensions in VVL
|
||
- vulkan: reorder vk_cmd_queue_entry
|
||
- vulkan/cmd_queue: allocate cmds based on the size of the cmd
|
||
- vulkan/cmd_queue: expose cmd sizes
|
||
- vulkan: use cmd size array for queued cmd allocations
|
||
- ci: uprev VVL to 1.3.251
|
||
- lavapipe: fix DS3 min sample setting
|
||
- lavapipe: bump max push constant size
|
||
- lavapipe: stop setting patch vertices constantly
|
||
- lavapipe: don't pass indirect info in streamout draws
|
||
- draw: add (disabled) vertex dumping for non-linear emit
|
||
- lavapipe: fix memory budget reporting
|
||
- zink: also disable bg compile for compute with nobgc
|
||
- zink: hook up VK_EXT_attachment_feedback_loop_dynamic_state
|
||
- zink: use dynamic state for feedback loops when available
|
||
- zink: enable EXT_shader_object globally with have_EXT_attachment_feedback_loop_dynamic_state
|
||
- zink: add a ci flake
|
||
- lavapipe: pass list to cmdbuf exec, not cmdbuf
|
||
- lavapipe: add a mapping for BDA
|
||
- lavapipe: add a zeroed buffer that can be bound in place of an index buffer
|
||
- lavapipe: handle index buffers with offsets for indirect draws
|
||
- lavapipe: NV_device_generated_commands
|
||
- zink: combine some rast state draw conditionals
|
||
- zink: don't check prog->shaders when creating gfx pipeline
|
||
- zink: check for cached mem correctly when mapping buffer
|
||
- zink: remove assert for dt in zink_kopper_update
|
||
- zink: stop swizzling conditional render during batch flush
|
||
- zink: update some radv qbo fails
|
||
- radv: tweak gfx pipeline stage binding
|
||
- zink: only try to create srgb mutable images if the vk format is supported
|
||
- vk: make vk_format_map[] public
|
||
- radv: directly use vk_format_map for vertex input
|
||
- lavapipe: use PACKAGE_VERSION for cache uuid in release builds
|
||
- zink: massively shrink qbo size for timestamp queries
|
||
- zink: assert that ntv image creation isn't clobbering existing images
|
||
- zink: add some ntv asserts for ms txf
|
||
- zink: add a dgc debug mode for testing
|
||
- lavapipe: add version uuid to shader binary validation
|
||
- egl/dri2: trigger drawable invalidation from surface queries for zink
|
||
- zink: add some ci flakes
|
||
- zink: break out vk flag unrolling into util function
|
||
- zink: add mem debugging
|
||
- zink: remove redundant conditional in set_sampler_views
|
||
- zink: wrap format mismatch checks for blit/surface
|
||
- zink: add srgb mutable for all resources by default
|
||
- zink: drop dt checks for mutable format init
|
||
- zink: strip format list when disabling mutable during image creation
|
||
- dri3: only invalidate drawables on geometry change if geometry has changed
|
||
- zink: more anv ci flakes
|
||
- aux/trace: add methods for mesh shaders
|
||
- lavapipe: more fixes for sample shading
|
||
- lavapipe: fix shader binary binding with mesh shaders
|
||
- lavapipe: correctly update shader object per-stage push constant sizes
|
||
- zink: add COHERENT requirement for CACHED memory
|
||
- zink: ZINK_HEAP_HOST_VISIBLE_CACHED -> ZINK_HEAP_HOST_VISIBLE_COHERENT_CACHED
|
||
- zink: fix anv ci flake wildcarding
|
||
- aux/pipebuffer: add a return to pb_slabs_reclaim()
|
||
- aux/pipebuffer: add a return to pb_cache_release_all_buffers()
|
||
- zink: only retry bo allocation after reclaim if reclaims actually happened
|
||
- zink: fix ubo array sizing in ntv
|
||
- zink: acquire persistently bound swapchain descriptors before setting usage
|
||
- zink: recache present semaphores
|
||
- zink: always clamp NUM_QUERIES to 500
|
||
- zink: radv vangogh ci updates
|
||
- radv: remove redundant intermediate variable in radv_is_mrt0_dual_src()
|
||
- radv: inline radv_can_enable_dual_src()
|
||
- zink: no-op redundant samplemask changes
|
||
- zink: force inlining for a bunch of functions
|
||
- zink: make invalidate_descriptor_state a ctx hook
|
||
- zink: specialize invalidate_descriptor_state hook for compact mode
|
||
- zink: clean up rp update tracking on dsa bind
|
||
- zink: use local screen var in blend state bind
|
||
- zink: track and apply ds3 states only on change
|
||
- zink: don't update tc info directly from cso binds
|
||
- zink: check sampler views pointer before loop
|
||
- zink: add fastpaths for no-op sampler/view rebinds
|
||
- nir/lower_tex: ignore saturate for txf ops
|
||
- radv: pre-init surface info
|
||
- ci: add a test-dozen-deqp flake
|
||
- lavapipe: handle multiview queries
|
||
- zink: fix assert for inline uniform invalidation with generated gs bound
|
||
- zink: fix unbinding generated gs on real gs bind
|
||
- zink: get new bda when rebinding invalidated buffers
|
||
- lavapipe: create a desc set for immutable sampler layouts
|
||
- lavapipe: split out descriptor stage setting
|
||
- lavapipe: EXT_descriptor_buffer
|
||
- lavapipe: VK_EXT_mutable_descriptor_type
|
||
- llvmpipe: flush/reference fs ubos on bind
|
||
- zink: do initial program unref during program creation
|
||
- zink: fix separate shader program refcounting
|
||
- docs: update lavapipe extensions
|
||
- zink: don't destroy swapchain on initial CreateSwapchainKHR fail
|
||
- aux/trace: fix bindless texture dumping
|
||
- vk/wsi/x11: move surface alpha check from get_caps to creation
|
||
- vk/wsi/x11: handle geometry updating more asynchronously
|
||
- vk/wsi/x11: stop roundtripping on presentation
|
||
- vk/wsi: unify dmabuf exporting
|
||
- vk/wsi: add error logging for syncfile import/export failures
|
||
- zink: fix anv ci flakes (for real this time)
|
||
- zink: fix batch disambiguation on first submit
|
||
- zink: set pipeline dynamic state count after all dynamic states are set
|
||
- zink: be even dumber about buffer refs when replacing storage
|
||
- zink: emit SpvCapabilitySampleMaskPostDepthCoverage with SpvExecutionModePostDepthCoverage
|
||
- zink: fix the fix for separate shader program refcounting
|
||
- kopper: handle pixmap creation failure more gracefully
|
||
- glxsw: check geometry of drawables on creation
|
||
- zink: don't clobber descriptor mode on multiple screen creation
|
||
- nir: fix slot calculations for compact variables with location_frac
|
||
- lavapipe: use the component offset directly for xfb
|
||
- glsl: only explicitly check GS components in PSIZ injection with output variables
|
||
- lavapipe: don't check geometry for fb attachments
|
||
- zink: better handle separate shader dsl creation when no bindings exist
|
||
- zink: force image barriers after dmabuf import
|
||
- zink: use VK_WHOLE_SIZE when binding null db buffer descriptors
|
||
- zink: unset line stipple ds3 state flags when stipple not available
|
||
- nir/lower_io_to_scalar: fix 64bit io splitting
|
||
- nir/linking_helpers: force type matching in does_varying_match
|
||
- zink: add batch refs for transient images
|
||
- zink: fix zs resolve attachment indexing
|
||
- zink: don't add VK_IMAGE_USAGE_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT for transient images
|
||
- zink: don't append msrtss to dynamic render if not supported
|
||
- zink: set msrtss depth resolve mode when enabled
|
||
- zink: add more locking for pipeline cache
|
||
- aux/trace: fix winsys handle dumping
|
||
- zink: generated tcs is on the tes, not the vs
|
||
- llvmpipe: block weird uses of subsampled formats in buffers
|
||
- llvmpipe: fix early depth + alpha2coverage + occlusion query interaction
|
||
- lavapipe: fix resolves where src image has a layer offset
|
||
- lavapipe: block yuv formats from getting blit feature flags
|
||
- zink: explicitly set non-optimal last_vertex_stage shader key on ctx create
|
||
- zink: fix big tcs output io
|
||
- zink: fix crash in lower_pv_mode_gs_store
|
||
- u/draw: skip zero-sized indirect draws
|
||
- nir/zink: fix gs emulation xfb_info sizing
|
||
- vk/graphics: fix CWE handling with DS3
|
||
- Revert "vk/wsi/x11: handle geometry updating more asynchronously"
|
||
- zink: wait on async fence during ctx program removal
|
||
- zink: don't start multiple cache jobs for the same program
|
||
- zink: disable validation
|
||
- zink: be more precise about flagging rp changes around unordered u_blitter
|
||
- zink: fix linear modifier dmabuf imports
|
||
- aux/tc: handle stride mismatch during rp-optimized subdata
|
||
- zink: always add a per-prog ref for gpl libs
|
||
- zink: set is_xfb=false for all i/o variables
|
||
- nir/inline_uniforms: fix oob access with nir_find_inlinable_uniforms
|
||
- aux/tc: fix staging buffer sizing for texture_subdata
|
||
- aux/tc: fix address calc for segmented texture subdata
|
||
- glsl: check for xfb setting xfb info
|
||
- aux/tc: fix renderpass tracking fb state clobber scenario
|
||
- aux/tc: fix rp info handling around tc_sync calls
|
||
- aux/tc: don't use pipe_buffer_create_with_data() for rp-optimized subdata
|
||
- zink: flag db maps as unsynchronized
|
||
- lavapipe: clamp cache uuid size
|
||
- tu: handle unused color attachments without crashing
|
||
- zink: propagate rp_tc_info_updated across unordered blits
|
||
- zink: move swapchain fence to swapchain object
|
||
- zink: avoid UAF on wayland async present with to-be-retired swapchain
|
||
- zink: always trace_screen_unwrap in acquire
|
||
|
||
MouriNaruto (1):
|
||
|
||
- dzn: Fix segmentation fault when Direct3D 12 user mode driver from at least one of GPUs is not available.
|
||
|
||
MrRobbin (1):
|
||
|
||
- zink: Move the workaround before the EDS setting.
|
||
|
||
Mykhailo Skorokhodov (4):
|
||
|
||
- mesa: Implement GL_CLEAR_TEXTURE flag
|
||
- mesa: Fallthrough GL_SRB_DECODE_ARB pname
|
||
- iris: Fix memory size with disabled resizable bar
|
||
- nir: Rematerialize derefs after opt_dead_cf
|
||
|
||
Mykola Piatykop (1):
|
||
|
||
- mesa: Fix use after free.
|
||
|
||
Nanley Chery (28):
|
||
|
||
- iris: Allocate ZEROED BOs for shared resources
|
||
- iris/bufmgr: Add and use zero_bo
|
||
- iris/bufmgr: Handle flat_ccs for BO_ALLOC_ZEROED
|
||
- intel/isl: Bump the MCS halign value for BDW+
|
||
- iris: Add a barrier to iris_mcs_partial_resolve
|
||
- intel: Implement ISL_AUX_OP_AMBIGUATE for MCS
|
||
- iris: Enable MCS init with ISL_AUX_OP_AMBIGUATE
|
||
- anv: Drop the MCS initialization performance warning
|
||
- anv: Enable MCS init with ISL_AUX_OP_AMBIGUATE
|
||
- intel/blorp: Assert an 8bpp fast clear restriction
|
||
- iris: Init CCS_E to COMPRESSED_NO_CLEAR for XeHP
|
||
- intel/blorp: Use the depth copy format more on BDW+
|
||
- intel/blorp: Add depth usage check for copy format
|
||
- intel/blorp: Change condition for CCS_E copy formats
|
||
- intel/blorp: Add and use blorp_copy_get_formats
|
||
- iris: Use known formats for tex_cache_flush_hack
|
||
- iris: Drop a GFX12_CCS_E check in can_fast_clear_color
|
||
- intel: Rename the GFX12_CCS_E aux-usage to FCV_CCS_E
|
||
- iris: Avoid extra CCS_E flushes for aux mode changes
|
||
- iris: Avoid FCV_CCS_E for shader image accesses
|
||
- iris: Assert against FCV_CCS_E for blitter writes
|
||
- intel/blorp: Avoid 32bpc fast clear sampling issue
|
||
- Revert "iris: Add missed tile flush flag"
|
||
- iris: Drop the RT flush for PIPE_BARRIER_TEXTURE
|
||
- iris: Drop GPGPU Tex Invalidate restriction for TGL+
|
||
- isl: Add and use size and alignment calculators
|
||
- anv: Don't support ASTC images with modifiers
|
||
- intel/blorp: Ambiguate after CCS resolves on gfx7-8
|
||
|
||
Oskar Rundgren (20):
|
||
|
||
- pvr: Allow block compressed source blit
|
||
- pvr: Transfer PBE source snorm format should be signed
|
||
- pvr: Transfer PBE gamma is unset
|
||
- pvr: Transfer fix blit with multiple emits
|
||
- pvr: Transfer multiple emits clip rectangle
|
||
- pvr: Add back S8_UINT support
|
||
- pvr: Add PBE packmode for depth stencil formats
|
||
- pvr: Transfer add depth merge support for X8_D24
|
||
- pvr: Transfer add s8_uint support
|
||
- pvr: PBE fix mesa pipe swizzle conversion
|
||
- pvr: Transfer ignore non zero stride for twiddled surface
|
||
- pvr: Transfer block compressed with 3d twiddled layout
|
||
- pvr: Transfer support flipped rectangle mapping
|
||
- pvr: Transfer remove byte unwind workaround
|
||
- pvr: fix texel unwind workaround mappings
|
||
- pvr: Transfer check valid source address mask
|
||
- pvr: Transfer optimisation remove unused features from API
|
||
- pvr: Transfer image to buffer dest rect
|
||
- pvr: Fix transfer image clearing PBE packmodes
|
||
- pvr: add block compressed formats blit support
|
||
|
||
Patrick Lerda (22):
|
||
|
||
- r600: fix refcnt imbalance related to r600_set_vertex_buffers()
|
||
- r600: fix refcnt imbalance related to evergreen_set_shader_images()
|
||
- lima: fix refcnt imbalance related to framebuffer
|
||
- r600/sfn: fix memory leak related to sh_info->arrays
|
||
- aux/draw: fix memory leak related to ureg_get_tokens()
|
||
- crocus: fix refcnt imbalance related to framebuffer
|
||
- crocus: fix refcnt imbalance related to crocus_create_surface()
|
||
- r600: fix refcnt imbalance related to atomic_buffer_state
|
||
- radeonsi: set proper drm_amdgpu_cs_chunk_fence alignment
|
||
- crocus: fix scratch_bos memory leak
|
||
- mesa: fix refcnt imbalance related to egl_image_target_texture()
|
||
- glthread: fix typo related to upload_vertices()
|
||
- mesa: fix refcnt imbalance related to _mesa_delete_semaphore_object()
|
||
- mesa/st: fix refcnt imbalance related to st_feedback_draw_vbo()
|
||
- mesa/st: fix buffer overflow related to set_program_string()
|
||
- r600: fix r600_draw_vbo() buffer overflow
|
||
- nouveau: fix nouveau_heap_destroy() memory leak
|
||
- r600: fix cayman_convert_border_color() swizzle behavior
|
||
- util/blitter: fix util_blitter_clear_buffer() refcnt imbalance
|
||
- util/blitter: revert util_blitter_clear_buffer()
|
||
- radeonsi: fix refcnt imbalance related to util_blitter_save_fragment_constant_buffer_slot()
|
||
- panfrost: fix refcnt imbalance related to blitter
|
||
|
||
Paul Gofman (2):
|
||
|
||
- driconf: add a workaround for Captain Lycop: Invasion of the Heters
|
||
- driconf: add a workaround for Rainbow Six Extraction
|
||
|
||
Paulo Zanoni (9):
|
||
|
||
- iris: Store prime fd of external bos for Xe KMD
|
||
- iris: Add functions to import and export implicit sync state
|
||
- iris: Extend iris_bo_wait_syncobj() to wait on external implicit syncobj
|
||
- iris: Add iris_implicit_sync struct and functions to do implicit synchronization for Xe kmd
|
||
- iris: also avoid isl_memcpy_linear_to_tiled for Tile64
|
||
- intel/isl: tile 64 calculations work with 1D surfaces
|
||
- iris: assert bufmgr->bo_deps_lock is held
|
||
- iris: avoid stack overflow in iris_bo_wait_syncobj()
|
||
- iris: assert(bo->deps) after realloc()
|
||
|
||
Pavel Ondračka (33):
|
||
|
||
- r300: fix unconditional KIL on R300/R400
|
||
- r300: add CI list of known rv370 dEQP failures
|
||
- r300: remove simple duplicate ARL instructions
|
||
- r300: fuse ROUND and ARL to ARR
|
||
- r300: remove nir round lowering
|
||
- r300: enable PIPE_CAP_TGSI_TEXCOORD
|
||
- r300: fail linking instead of using dummy shaders
|
||
- CODEOWNERS: add r300 driver
|
||
- r300: move nir stuff to r300_nir file
|
||
- r300: move the ARL merging pass up in the opt loop
|
||
- r300: move the ROUND+ARL->ARR fusing to main optimization loop
|
||
- r300: optimize the load A0 pattern from wined3d
|
||
- r300: remove duplicate ARRs
|
||
- r300: be more agressive when merging A0 loads
|
||
- r300: remove unused SIN/COS lowering
|
||
- r300: remove unused SSG lowering
|
||
- r300: move CEIL lowering to NIR
|
||
- r300: remove unused FLR lowering
|
||
- r300: remove unused POW lowering
|
||
- r300: remove unused DST lowering
|
||
- r300: remove unused ROUND lowering
|
||
- r300: remove unused LIT lowering
|
||
- r300: remove unused opcodes from r300_tgsi_to_rc
|
||
- nir_opt_algebraic: don't use i32csel without native integer support
|
||
- r300: add partial CMP support on R5xx
|
||
- r300: properly count maximum used register index
|
||
- r300: lower undefs to zero
|
||
- r300: add some early safe bool lowering
|
||
- r300: remove most of backend contant folding
|
||
- r300: disable ntt regalloc for vertex shaders
|
||
- r300: assert that every writer has a reader
|
||
- r300: update RV370 failures
|
||
- r300: don't abort on flow control when using draw for vs
|
||
|
||
Philipp Zabel (1):
|
||
|
||
- etnaviv: fix segfault after compile failure
|
||
|
||
Pierre-Eric Pelloux-Prayer (13):
|
||
|
||
- amd: update amdgpu_drm.h
|
||
- amd: determine info->has_fw_based_shadowing
|
||
- radeonsi: implement fw based mcbp
|
||
- amd: update amdgpu_drm.h
|
||
- radeonsi: stop reporting reset to app once gpu recovery is done
|
||
- winsys/amdgpu: add a helper function to submit a no-op job
|
||
- winsys/amdgpu: use the no-op helper to detect if reset completion
|
||
- mesa: don't share reset status across contexts
|
||
- mesa: remove unused bools
|
||
- llvmpipe: only include old Transform includes when needed
|
||
- Revert "gallium/u_threaded: buffer sharedness tracking"
|
||
- st/mesa: check renderbuffer before using it
|
||
- radeonsi: emit framebuffer state after allocating cmask
|
||
|
||
Qiang Yu (119):
|
||
|
||
- nir: add nir_load_barycentric_optimize_amd intrinsic
|
||
- radeonsi: implement nir_load_barycentric_optimize_amd
|
||
- ac/nir/ps: lower barycentric load when bc_optimize
|
||
- ac/nir/ps: add force lower barycentric load options
|
||
- ac/nir/ps: lower sample mask input when needed
|
||
- ac/llvm,radeonsi: lower ps color load in nir
|
||
- radeonsi: add si_nir_lower_ps_color_input
|
||
- radeonsi: add si_nir_emit_polygon_stipple
|
||
- radeonsi: handle lowered ps in scan_io_usage
|
||
- radeonsi: monolithic ps emit prolog in nir directly
|
||
- radeonsi: restructure mono merged shader build
|
||
- radeonsi: remove separate_prolog parameter
|
||
- radeonsi: add si_mark_divergent_texture_non_uniform
|
||
- ac/llvm,radeonsi: use texture non-uniform flag as waterfall switch
|
||
- nir,ac/llvm,radeonsi: replace nir_load_smem_buffer_amd with nir_load_ubo
|
||
- ac/llvm,radeonsi: lower nir_load_point_coord_maybe_flipped in nir
|
||
- ac,radv: move ps arg compation to common place
|
||
- aco: support 32bit address in nir_load_smem_amd
|
||
- nir: add missing image atomic_inc/dec_wrap intrinsic
|
||
- aco: implement nir_bindless_image_atomic_inc/dec_wrap
|
||
- aco: skip scratch buffer init when its arg is not used
|
||
- aco: fix nir_f2u64 translation
|
||
- nir: add nir_export_dual_src_blend_amd intrinsic
|
||
- aco: move create_fs_dual_src_export_gfx11 above
|
||
- aco: implement nir_export_dual_src_blend_amd
|
||
- ac/nir/ps: use nir_export_dual_src_blend_amd when aco
|
||
- ac/nir/ps: add no_color_export option
|
||
- aco: support nir_export_amd with ps targets
|
||
- aco,radv: lower outputs to exports when nir for monolithic ps
|
||
- ac/llvm: remove output variable declaration for radv ps
|
||
- radv: implement nir_load_barycentric_optimize_amd
|
||
- ac/nir/ps: remove used nir_variable if created
|
||
- aco,ac/llvm,radv,radeonsi: handle ps bc optimization in nir for radv
|
||
- aco,radv: remove unused aco compile options
|
||
- aco,radv: support symbol relocation in aco
|
||
- aco: get scratch addr from symbol for radeonsi
|
||
- aco: allow no export instruction for gfx10+ fs
|
||
- ac/nir/cull: fix line position w culling
|
||
- meson: build radeonsi with aco
|
||
- radeonsi: add aco debug option
|
||
- radeonsi: add use_aco field for struct si_shader
|
||
- radeonsi: add shader info for frag coord and sample pos read
|
||
- radeonsi: add shader info uses_sampleid
|
||
- radeonsi: pack spi ps input fixup to a function
|
||
- radeonsi: init spi ps input shader config when aco
|
||
- radeonsi: add a raw shader binary type
|
||
- ac/binary: pack prefech align code to a function
|
||
- radeonsi: support raw shader binary upload
|
||
- radeonsi: support print raw shader binary
|
||
- radeonsi: remove ps vgpr index save when args init
|
||
- tgsi_to_nir: call nir_lower_int64 when required
|
||
- ac/llvm,radeonsi: lower idiv in nir
|
||
- ac/llvm,radeonsi: lower fsin/fcos in nir
|
||
- ac/llvm,radeonsi: lower txf offset in nir
|
||
- ac/llvm,radeonsi: lower ineg in nir
|
||
- ac/llvm,radeonsi: lower some pack/unpack ops not supported by aco
|
||
- ac/llvm,radeonsi: lower nir_fpow for aco and llvm
|
||
- radeonsi: lower some 64bit ops aco does not support
|
||
- radeonsi: lower vector const to scalar at last for aco
|
||
- radeonsi: add has_non_uniform_tex_access shader info
|
||
- radeonsi: lower non uniform texture access when aco
|
||
- radeonsi: add initial aco compile code
|
||
- radeonsi: add symbols to si_shader_binary
|
||
- radeonsi: resolve aco scratch addr symbols
|
||
- radeonsi: adjust ps args for aco
|
||
- radeonsi: pass use_aco to ac_nir_lower_ps
|
||
- radeonsi: clamp shadow texture reference in nir for aco
|
||
- ac/llvm,radeonsi: enable lower_array_layer_round_even
|
||
- radeonsi: fixup sampler desc for tg4 in nir
|
||
- radeonsi: be able to use aco compiler for mono ps
|
||
- ac/llvm: remove the double frcp special handling
|
||
- radeonsi: fix aco compile for atomic ops
|
||
- ac/llvm: remove redundant nir_lower_legacy_atomics
|
||
- radeonsi: fix uses_instanceid for merged mono shader stage
|
||
- aco: implement two load lds ngg intrininsic for radeonsi
|
||
- aco,radv: remove unused aco_shader_info fields
|
||
- ac/nir/ngg: don't use 8bit alu ops
|
||
- aco: implement load buffer with ACCESS_USES_FORMAT_AMD
|
||
- aco/assembler: handle ds_(add|sub)_gs_reg_rtn encoding
|
||
- aco: use gds reg when ordered xfb counter add
|
||
- aco: implement nir_xfb_counter_sub_amd
|
||
- aco: implement nir_bindless_image_fragment_mask_load_amd
|
||
- aco: use ac_get_image_dim for array check when image intrinsic
|
||
- radeonsi: resolve lds ngg aco symbols
|
||
- radeonsi: add scratch offset vs args explicitly for aco
|
||
- ac/llvm,radeonsi: lower nir_load_gs_vertex_offset_amd in abi
|
||
- ac/llvm,radeonsi: lower nir_load_merged_wave_info_amd in abi
|
||
- ac/llvm,radeonsi: lower load_workgroup_num_input_(vertices|primitives) in abi
|
||
- ac/llvm,radeonsi: lower nir_load_initial_edgeflags_amd in abi
|
||
- ac/llvm,radeonsi: lower nir_load_packed_passthrough_primitive_amd in abi
|
||
- ac/llvm,radeonsi: lower nir_load_ordered_id_amd in abi
|
||
- ac/llvm,radeonsi: lower nir_load_ring_esgs_amd in abi
|
||
- nir,ac/llvm,radeonsi: replace nir_buffer_atomic_add_amd with ssbo atomic
|
||
- radeonsi: fill aco shader info for mono standalone vs
|
||
- radeonsi: calculate needed lds size when upload raw binary for vs
|
||
- radeonsi: use nir_umul_high for fast udiv
|
||
- radeonsi: always use scoped barrier
|
||
- ac/llvm: remove unused barrier implementation
|
||
- radeonsi: enable aco for mono standalone vs
|
||
- aco,radv: remove unused gs aco shader info
|
||
- ac/nir,radv: add 1 dword to LS/HS vertex stride
|
||
- ac/nir,radv: add 1 dword to ES/GS item size
|
||
- radeonsi: add scratch_offset arg for aco tcs
|
||
- radeonsi: lower nir_load_tess_rel_patch_id_amd in abi for aco
|
||
- ac/llvm,radeonsi: lower nir_load_ring_tess_offchip_amd in abi
|
||
- radeonsi: enable aco support for mono standalone tcs
|
||
- radeonsi: add scratch_offset arg for aco tes
|
||
- radeonsi: init tes aco shader info fields
|
||
- radeonsi: update lds size for tes
|
||
- radeonsi: enable aco support for standalone tes
|
||
- radeonsi: add scratch_offset arg for aco gs
|
||
- ac/llvm,radeonsi: lower nir_load_ring_gsvs_amd in abi
|
||
- radeonsi: enable aco for standalone gs
|
||
- radeonsi: enable aco support for gs copy shader
|
||
- radeonsi: add scratch_offset arg for aco cs
|
||
- ac/llvm,radeonsi: lower nir_load_user_data_amd in abi
|
||
- radeonsi: fix crash when AMD_DEBUG=cs,initnir
|
||
- radeonsi: enable aco support for compute shader
|
||
- ac/nir/ngg: fix ngg_gs_clear_primflags crash
|
||
|
||
QwertyChouskie (1):
|
||
|
||
- docs/features.txt(fix): mark VK_EXT_pipeline_robustness as supported on radv
|
||
|
||
Rajnesh Kanwal (9):
|
||
|
||
- pvr: Add support to process transfer and blit cmds
|
||
- pvr: Implement vkCmdCopyBufferToImage API.
|
||
- pvr: Implement vkCmdCopyImage2KHR API.
|
||
- pvr: Implement vkCmdBlitImage API.
|
||
- pvr: Implement vkCmdClearColorImage API.
|
||
- pvr: Implement vkCmdCopyImageToBuffer2 API.
|
||
- pvr: Implement vkCmdFillBuffer API.
|
||
- pvr: Implement vkCmdResolveImage2KHR API.
|
||
- pvr: Implement vkCmdClearDepthStencilImage API.
|
||
|
||
Rhys Perry (92):
|
||
|
||
- nir: add is_gather_implicit_lod
|
||
- vtn: set is_gather_implicit_lod
|
||
- aco: support implicit LOD for nir_texop_tg4
|
||
- ac/llvm: support implicit LOD for nir_texop_tg4
|
||
- aco: remove SMEM_instruction::prevent_overflow
|
||
- aco: use apply_nuw_to_ssa() with load_smem_amd
|
||
- ac/nir/ps: fix null export write mask miss set to 0xf
|
||
- aco: don't move exec reads around exec writes
|
||
- aco: don't move exec writes around exec writes
|
||
- radv: fix bc optimization with POS_W_FLOAT_ENA(1)
|
||
- aco/ra: create M0-affinities for s_sendmsg
|
||
- aco/gfx11: fix VMEM/DS->VALU WaW/RaW hazard
|
||
- amd/drm-shim: move device list to external file
|
||
- amd/drm-shim: add polaris10
|
||
- amd/drm-shim: add vega10
|
||
- amd/drm-shim: add navi10
|
||
- aco: add get_op_fixed_to_def() helper
|
||
- aco: consider how definitions fixed to operands can change register demand
|
||
- nir/fold_16bit_tex_image: skip tex instructions with backend1
|
||
- nir,vtn,aco,ac/llvm: make cube_face_coord_amd more direct
|
||
- ac/nir: add pass for lowering 1d/cube coordinates
|
||
- ac/nir: round layer in ac_nir_lower_tex
|
||
- radv,radeonsi: use ac_nir_lower_tex
|
||
- nir/lower_tex: remove lower_array_layer_round_even
|
||
- ac/nir: add fix_derivs_in_divergent_cf
|
||
- aco: remove unused RegType
|
||
- aco: let p_start_linear_vgpr take an operand
|
||
- aco: add MIMG_instruction::strict_wqm
|
||
- aco: implement strict_wqm_coord_amd
|
||
- aco: implement texture samples with strict WQM coordinates
|
||
- radv: use fix_derivs_in_divergent_cf
|
||
- aco/tests: improve performance of declaration parsing
|
||
- aco/tests: add fix_derivs_in_divergent_cf tests
|
||
- aco: fix update_alu(clear=true) for exports
|
||
- aco: use pass_flags to recover s_delay_alu cycles
|
||
- aco: insert s_delay_alu on the linear CFG
|
||
- aco: improve printing of s_delay_alu
|
||
- radv: allow wave32 for geometry shaders
|
||
- aco: fix has_color_exports=true for mrtz exports
|
||
- aco/tests: add discard export target tests
|
||
- aco: fix ds_sub_gs_reg_rtn validation
|
||
- radv: initialize aco_compiler_options::is_opengl
|
||
- radv: correctly skip vertex loads with packed formats
|
||
- aco: consider position/primitive exports around memory barriers
|
||
- ac/nir: use scoped barriers to finish stores before exports
|
||
- aco: remove memory_barrier_buffer implementation
|
||
- aco: mask bits source of s_bfe
|
||
- aco/tests: test that s_bfe bits is masked
|
||
- util: fix gc_alloc_size alignment
|
||
- util/tests: add gc_alloc_size alignment tests
|
||
- aco: run nir_lower_int64 after nir_opt_uniform_atomics
|
||
- ac: fix PIPE_FORMAT_R11G11B10_FLOAT DST_SEL_W
|
||
- radv: refactor CS subgroup size determination
|
||
- radv: use wave32 for small workgroups
|
||
- aco: don't try to form load+store clauses
|
||
- aco/gfx11: use s_clause with stores
|
||
- aco/gfx11: schedule for VMEM store clauses
|
||
- aco: don't set exec_hi for wave32 scan reductions
|
||
- amd/drm-shim: use fixed-width types
|
||
- nir/peephole_select: allow some invocation broadcast intrinsics
|
||
- aco: include helpers in emit_uniform_{reduce,scan}
|
||
- nir,aco: add INCLUDE_HELPERS index to reduce intrinsic
|
||
- nir/opt_intrinsic: optimize quad vote
|
||
- radv: use nir_opt_intrinsics
|
||
- aco,ac/llvm,ac/nir,vtn: unify cube opcodes
|
||
- nir: split nir_lower_mov64
|
||
- radv: use nir_lower_conv64
|
||
- radv: call nir_lower_int64 later
|
||
- radeonsi: use nir_lower_conv64
|
||
- aco: remove 64-bit integer conversion opcodes
|
||
- ac/llvm: fix AC_TM_CHECK_IR
|
||
- radv: fix radv_get_ballot_bit_size with CS
|
||
- ac/llvm: fix wave32 ac_build_mbcnt_add with 64-bit mask
|
||
- ac/llvm: skip ballot zext for 32-bit dest with wave32-as-wave64
|
||
- radv: add conformant_trunc_coord to cache UUID
|
||
- ac/nir: always round cube array layers
|
||
- nir/unsigned_upper_bound: fix phi(bcsel)
|
||
- nir/opt_dead_cf: remove nodes after a jump earlier
|
||
- aco: insert s_nop before VGPR deallocation
|
||
- radv: workaround WWZ exporting index=1 through location=1
|
||
- radv: correctly skip MRT output NaN fixup for meta shaders
|
||
- aco: summarize register demand after handling branches
|
||
- aco: don't create sendmsg(dealloc_vgprs) if scratch is used
|
||
- radv: disable 64-bit color attachments
|
||
- aco: fix p_bpermute_gfx6 with input at non-zero byte
|
||
- radv: fix 128bpp comp-to-single clears
|
||
- aco/spill: skip p_branch in process_block
|
||
- aco/spill: add all live-in to merge block spill candidates
|
||
- aco/optimizer_postRA: check overwritten_subdword in is_overwritten_since()
|
||
- aco: check logical_phi_info at p_logical_end when eliminating exec writes
|
||
- aco: remove unused p_logical_end check when optimizing branching sequence
|
||
- aco: reset prefetch in the correct block after removing the exit
|
||
|
||
Rob Clark (58):
|
||
|
||
- freedreno/a6xx: Fix valid_format_cast logic for newer a6xx
|
||
- freedreno: Remove unused fd_batch_reset()
|
||
- freedreno: Inline single-caller helpers
|
||
- freedreno: Extra casting to make C++ happy
|
||
- freedreno/registers: C++ struct casting
|
||
- util/log: Add missing "const"
|
||
- freedreno/ir3: More perfetto tracing
|
||
- mesa/nir: Add some perfetto traces
|
||
- freedreno/perfetto: Add shader_id for compute stages
|
||
- freedreno: Add dirty state logging
|
||
- freedreno/a6xx: Pass ring to __ONE_REG()
|
||
- freedreno: Add more tracepoint fields
|
||
- freedreno: Fix resource tracking vs rebind/invalidate
|
||
- freedreno/a6xx: Change a618 tile_align_h back to 32
|
||
- dri/android: Fix MSAA resolve
|
||
- Revert "ci: disable all a306/a530/a630 jobs"
|
||
- freedreno/a6xx: Rework set_bin_size()
|
||
- freedreno/a6xx+: Use template to handle a6xx vs a7xx differences
|
||
- freedreno/batch: Add helper to set fb state
|
||
- freedreno/a6xx: Move LRZ clear to blitter
|
||
- freedreno/a6xx: Add ctx->emit_sysmem()
|
||
- freedreno/a6xx: Simplify per-tile conditional IBs
|
||
- freedreno/a6xx: Switch to batch->cleared
|
||
- freedreno/a6xx: Split tile loads and clears
|
||
- freedreno/a6xx: Introduce batch subpasses
|
||
- freedreno/a6xx: Per-subpass LRZ
|
||
- freedreno/a6xx: New subpass on mid-frame clears
|
||
- freedreno/a6xx: Move LRZ clears to gmem
|
||
- freedreno/a6xx: Actually use LRZ for ms
|
||
- freedreno/a5xx+a6xx: Don't allocate LRZ for z32
|
||
- tu: Move queue deletion to last
|
||
- mesa: Skip update_gl_clamp() if samplers need clamp
|
||
- freedreno/a6xx: Template specialization for draw type
|
||
- freedreno/a6xx: Template specialization for pipeline type
|
||
- freedreno/a6xx: Optimize max_indices calculation
|
||
- freedreno/batch: Move submit bo tracking to batch
|
||
- freedreno/drm: Don't try to export suballoc bo
|
||
- freedreno: Handle export error handling
|
||
- freedreno: Add aux-context support
|
||
- freedreno: Reallocate on unshared export
|
||
- freedreno/a6xx: Clean up open coded flushes
|
||
- freedreno/a6xx: Stop using fd_wfi()
|
||
- freedreno/a6xx: Add missing cap
|
||
- freedreno/a6xx: Fix xfb stream configuration
|
||
- freedreno/a6xx: Remove primitives_relocw()
|
||
- freedreno/a6xx: GL_ARB_transform_feedback_overflow_query
|
||
- freedreno/a6xx: Split primitives and pipeline-stats queries
|
||
- freedreno/a6xx: Handle nested pipeline stats queries
|
||
- freedreno: Handle compute queries
|
||
- freedreno/a6xx: GL_ARB_pipeline_statistics_query
|
||
- freedreno/a6xx: Enable gl46
|
||
- freedreno: Add extra assert
|
||
- freedreno/batch: Add driver-thread assert
|
||
- freedreno/a6xx: Directly invalidate on samp view update
|
||
- freedreno/a6xx: Use idalloc for samp/view seqno's
|
||
- freedreno/fdperf: Use common device info helpers
|
||
- freedreno/drm/virtio: Trigger host side wait boost
|
||
- tu/drm: Add missing error path cleanup
|
||
|
||
Robert Beckett (1):
|
||
|
||
- winsys/panfrost: Fix a scanout resource leak
|
||
|
||
Robert Mader (1):
|
||
|
||
- egl/wayland: wait for compositor to release shm buffers
|
||
|
||
Rohan Garg (41):
|
||
|
||
- anv: use the workaround framework for WA 14013111325
|
||
- hasvk: drop dead code
|
||
- iris: use the workaround framework for WA 14013111325
|
||
- anv: use the common vulkan runtime to do the heavy lifting
|
||
- anv: drop duplicated nir_opt_dce passes
|
||
- intel: infer scalar'ness locally for brw_postprocess_nir
|
||
- intel: drop unused is_scalar function parameter in brw_nir_apply_key
|
||
- intel: update comments about non-existent function parameter
|
||
- intel: infer scalar'ness locally for brw_vectorize_lower_mem_access
|
||
- anv: drop duplicate checks when setting the compressed bit
|
||
- iris: correctly set alignment to next power of two for struct size
|
||
- ac/surface: make sure alignment is a POT
|
||
- freedreno: set alignment to next POT
|
||
- util: fix ROUND_DOWN_TO alignment type
|
||
- util: migrate alignment functions and macros to use ALIGN_POT
|
||
- util: revert back to ALIGN since it moved to util
|
||
- util: move pot functions to use existing macros
|
||
- anv: enable single texel alignment
|
||
- isl: add helper to check if aux usage is CCS_E
|
||
- anv: set aux usage to GFX12_CCS_E if a platform needs WA 14010672564
|
||
- anv: limit non zero fast clear check to GFX12_CCS_E
|
||
- anv: fix incorrect asserts when combining CPS and per sample interpolation
|
||
- hasvk: enable single texel alignment
|
||
- anv: split ANV_PIPE_RENDER_TARGET_BUFFER_WRITES for finer grained flushing
|
||
- anv: move WA 1607854226 to use the WA infrastructure
|
||
- intel/compiler: construct masks instead of using magic values
|
||
- intel/compiler: reuse previously computed bitsize
|
||
- anv: retry batchbuffer submission with i915
|
||
- iris: migrate WA 14013910100 to use the WA framework
|
||
- iris: migrate WA 14016118574 to use the WA framework
|
||
- iris: fix iris for WA 16013000631
|
||
- intel/perf: add perf query support for Intel Raptorlake
|
||
- anv: use the correct GFX_VERx10 macro for WA
|
||
- anv,iris: program the maximum number of threads on compute queue init
|
||
- anv: partially revert 2e8b1f6d
|
||
- anv: drop dead ifdef
|
||
- iris: use the correct WA macros and lineage numbers
|
||
- anv: use the lineage number for WA
|
||
- crocus: fix GFX_VERx10 macro
|
||
- blorp: drop undefined macro
|
||
- iris: migrate preemption streamwout wa to WA infra
|
||
|
||
Roland Scheidegger (2):
|
||
|
||
- llvmpipe: minor cleanups in line rendering code
|
||
- llvmpipe: fix some corner cases with line rendering
|
||
|
||
Romain Failliot (1):
|
||
|
||
- docs(fix): remove last ref to i965 in features.txt
|
||
|
||
Ruijing Dong (19):
|
||
|
||
- radeonsi/vcn: add macros used in av1 encoding
|
||
- radeonsi/vcn: enable 2 pass search center map
|
||
- radeonsi/vcn: enable swizzle mode in encoding ref frames.
|
||
- radeonsi/vcn: merge get_output_format_param function
|
||
- radeonsi/vcn: remove extra zero bytes from bitstream
|
||
- radeonsi/vcn: add av1 dpb variables and cdf table
|
||
- gallium/pipe: add av1 encoding data structure in pipe
|
||
- radeonsi/vcn: add av1 enc data structure
|
||
- radeonsi/vcn: add some av1 encoding function
|
||
- radeonsi/vcn: add av1 encoding ib packages and get_info
|
||
- frontends/va: adding va av1 encoding functions
|
||
- radeonsi/vcn: use PIPE_ENC_FEATURE enum
|
||
- frontends/va: define va av1 encoding caps
|
||
- radeonsi/vcn: correct cropping for hevc case
|
||
- radeonsi/vcn: fix decoding bs buffer alignement issue.
|
||
- gallium/pipe: add interface update_decoder_target
|
||
- radeonsi/vcn: apply update_decoder_target logic
|
||
- frontends/va: remove private member and update target buffer
|
||
- radeonsi/vcn: change max_poc to fixed value for hevc encoder.
|
||
|
||
Ryan Houdek (1):
|
||
|
||
- util: move check for AVX512
|
||
|
||
Ryan Neph (2):
|
||
|
||
- virgl: add debug flag to force synchronous GL shader compilation
|
||
- virgl: check a debug option again at context creation
|
||
|
||
Sagar Ghuge (20):
|
||
|
||
- anv: Factor out code from anv_image_hiz_clear
|
||
- anv: Move and make anv_can_hiz_clear_ds_view non-static
|
||
- anv: Fast clear depth/stencil surface in vkCmdClearAttachments
|
||
- anv: Set CS stall bit during HIZ_CCS_WT surface fast clear
|
||
- iris: Set CS stall bit during HIZ_CCS_WT surface fast clear
|
||
- intel/genxml: Add CCS cache flush field to PIPE_CONTROL
|
||
- intel/genxml: Add Compute/Blitter CCS aux invalidation register
|
||
- anv: Add CCS cache flush bits to anv_pipe_bits
|
||
- anv: Fix AUX-TT invalidation
|
||
- anv: implement recommended flush/wait of AUX-TT invalidation on compute
|
||
- iris: Add CCS cache flush bits
|
||
- iris: Fix AUX-TT invalidation
|
||
- iris: implement recommended flush/wait of AUX-TT invalidation
|
||
- intel/ds: Track CCS cache flush bit
|
||
- iris: Use correct CCS0 aux-map register offset
|
||
- intel/genxml: Fix typo in CCS cache flush enable
|
||
- intel/genxml: Drop incorrect compute aux-inv register entry
|
||
- anv: Drop depth cache flush requirement after depth clear/resolve
|
||
- iris: Drop depth cache flush requirement after depth clear/resolve
|
||
- blorp: Drop unnecessary assertions in blorp_can_hiz_clear_depth
|
||
|
||
Samuel Holland (3):
|
||
|
||
- Android.mk: Allow building only Vulkan drivers
|
||
- Android.mk: Explicitly enable/disable LLVM support
|
||
- Android.mk: Only link LLVM for radeonsi, not amd_vk
|
||
|
||
Samuel Pitoiset (203):
|
||
|
||
- radv: fix detecting FMASK_DECOMPRESS/DCC_DECOMPRESS meta pipelines
|
||
- vulkan: ignore rasterizationSamples when the state is dynamic
|
||
- radv: try to keep HTILE compressed for READ_ONLY_OPTIMAL layout
|
||
- radv: re-emit the guardband state when related PSO are bound
|
||
- radv: tidy up dirtying RBPLUS state in radv_bind_dynamic_state()
|
||
- radv: disable fast-clears with CMASK for 128-bit formats
|
||
- radv: require DRM 3.27
|
||
- radv/amdgpu: remove legacy code path for creating the BO list
|
||
- radv/amdgpu: remove legacy code for querying context status
|
||
- radv: do not allow 1D block-compressed images with (extended) storage on GFX6
|
||
- radv: fix usage flag for 3D compressed 128 bpp images on GFX9
|
||
- radv: wait for occlusion queries in the resolve query shader
|
||
- radv: delay enabling/disabling occlusion queries at draw time
|
||
- radv: track DB_COUNT_CONTROL changes to avoid context rolls
|
||
- radv: emit PIXEL_PIPE_STAT_CONTROL in the gfx preamble for GFX11
|
||
- radv: use gfx_level in radv_flush_occlusion_query_state()
|
||
- radv: update binning settings to work around GPU hangs
|
||
- radv/ci: remove one expected test failure on PITCAIRN
|
||
- radv/amdgpu: fix adding continue preambles and postambles BOs to the list
|
||
- Revert "ci/radv: Demote navi21 to manual until recent flakiness resolves."
|
||
- radv: add the perf counters BO to the preambles BO list
|
||
- radv: do not overallocate the CS array during submissions
|
||
- ac/sqtt: add rgp_sqtt_marker_cb_id definition
|
||
- ac/sqtt: add a helper to get cmdbuf IDs per queue
|
||
- radv: reserve command buffer index for SQTT
|
||
- docs: rename ACO_DEBUG=noscheduling to ACO_DEBUG=nosched
|
||
- docs: add missing ACO_DEBUG=force-waitdeps
|
||
- radv: only enable extendedDynamicState3ConservativeRasterizationMode on GFX9+
|
||
- ac/spm: introduce ac_spm_trace and ac_spm_get_trace()
|
||
- ac/spm: rename ac_spm_trace_data to ac_spm
|
||
- ac/sqtt: add a helper for adding clock calibration records
|
||
- ac/sqtt: add helpers for initializing ac_thread_trace_data
|
||
- ac/sqtt: initialize clock calibration/queue info/queue event records
|
||
- radv/sqtt: sample CPU/GPU clocks before starting the trace
|
||
- radv/sqtt: add support for queue info
|
||
- ac/sqtt: add new bits to rgp_sqtt_marker_barrier_end
|
||
- ac/sqtt: add missing EventUnknown to rgp_sqtt_marker_event_type
|
||
- ac/rgp: update SQTT_FILE_CHUNK_TYPE_API_INFO to minor version 2
|
||
- ac/rgp: update SQTT_FILE_CHUNK_TYPE_ASIC_INFO to minor version 5
|
||
- ac/sqtt: add ac_sqtt_se_is_disabled() helper
|
||
- ac/sqtt: add ac_sqtt_get_trace() helper
|
||
- radv: do not abort when the SQTT buffer resize failed
|
||
- ac/rgp: remove ac_thread_trace_data from ac_thread_trace
|
||
- ac,radv,radeonsi: rename thread_trace to sqtt everywhere
|
||
- ac/nir: fix 8-bit/10-bit PS exports clamping
|
||
- radv: enable RADV_THREAD_TRACE_CACHE_COUNTERS by default
|
||
- radv: fix dynamic depth clamp enable support
|
||
- radv: fix invalid type for usage in radv_get_buffer_memory_requirements()
|
||
- radv: fix fast-clearing images with VK_REMAINING_{ARRAY_LAYERS,MIP_LEVELS}
|
||
- radv: replace radv_get_layerCount by vk_image_subresource_layer_count()
|
||
- radv: replace radv_get_levelCount() by vk_image_subresource_level_count()
|
||
- radv/meta: rename dest to dst
|
||
- radv: disable RB+ blend optimizations on GFX11 when a2c is enabled
|
||
- radv: use vk_image::mip_levels instead of radv_image::info::levels
|
||
- radv: use vk_image::array_layers instead of radv_image::info::array_size
|
||
- radv: use vk_image::samples instead of radv_image::info::storage_samples
|
||
- radv: use vk_image::samples instead of radv_image::info::samples
|
||
- radv: use vk_image::extent instead of radv_image::info::{width,height,depth}
|
||
- radv: remove ac_surf_info from radv_image
|
||
- ac/spm: switch to SPM version 2.0
|
||
- vulkan: Update XML and headers to 1.3.250
|
||
- radv: implement VK_EXT_attachment_feedback_loop_dynamic_state
|
||
- radv: advertise VK_EXT_attachment_feedback_loop_dynamic_state
|
||
- spirv: ignore SpvDecorationInvariant warning on struct members
|
||
- radv/ci: stop setting MESA_SPIRV_LOG_LEVEL
|
||
- radv: reset the emitted VS prolog when a new vertex shader is bound
|
||
- radv: dirty the dynamic vertex input state only when needed
|
||
- radv: re-emit fragment shading rate state when PA_CL_VRS_CNTL changes
|
||
- radv: configure PA_CL_VRS_CNTL entirely from the cmd buffer
|
||
- radv: implement dynamic sample locations enable
|
||
- radv: handle NULL fragment shaders when recording cmdbuf
|
||
- radv: handle NULL fragment shaders when creating graphics pipelines
|
||
- radv: rework the checks for implicit exports with GPL
|
||
- radv: allow to determine NGG settings with a NULL fragment shader
|
||
- radv: stop compiling a noop FS when the application doesn't provide a FS
|
||
- radv: advertise VK_EXT_tooling_info
|
||
- radv: reset the emitted PS epilog when a new fragment shader is bound
|
||
- radv: remove unused pipeline param in radv_generate_ps_epilog_key()
|
||
- radv: stop using the pipeline for determining the null export workaround
|
||
- radv: fix emitting VRS state with a null fragment shader
|
||
- radv: fix resetting VRS if the graphics pipeline doesn't enable it
|
||
- radv: fix a sync issue with primitives generated query and NGG/legacy
|
||
- amd/drm-shim: add navi21
|
||
- amd/drm-shim: add pitcairn
|
||
- amd/drm-shim: add bonaire
|
||
- amd/drm-shim: update README about which file to modify
|
||
- ci: build drm-shim in debian-testing
|
||
- ci,radv: use drm-shim instead of the null winsys for radv-fossils
|
||
- ci: stop using the hang-detection tool for vkd3d-proton
|
||
- ci: rework vkd3d-proton runner and fix detecting failures
|
||
- radv: reserve cmdbuf space in radv_flush_gfx2ace_semaphore()
|
||
- radv: bump the global VRS image size to maximum supported FB dimensions
|
||
- radv: disable IMAGE_USAGE_STORAGE with depth-only and stencil-only formats
|
||
- radv: remove useless check about USAGE_STORAGE for TC-compat HTILE
|
||
- nir: add nir_intrinsic_load_poly_line_smooth_enabled
|
||
- radeonsi: lower nir_intrinsic_load_poly_line_smooth_enabled_amd
|
||
- nir: lower smooth lines conditionally using the new intrinsic
|
||
- radv: track if the smoothLines features is enabled in the device
|
||
- radv: determine if smooth lines can be used in the pipeline key
|
||
- radv: declare a new user SGPR for the dynamic line rasterization mode
|
||
- radv: lower nir_intrinsic_load_poly_line_smooth_enabled_amd
|
||
- radv: add support for smooth lines
|
||
- radv: enable smoothLines
|
||
- radv: apply a bug workaround for smoothing on GFX6
|
||
- radv: do not enable VRS flat shading if the VRS builtin is read
|
||
- zink/ci: update VANGOGH expected list of failures
|
||
- vulkan/pipeline_cache: remove a bogus assert when inserting objects
|
||
- zink/ci: skip arb_texture_buffer_object@texture-buffer-size-clamp* with RADV
|
||
- radv: fix copying 2D to 3D images
|
||
- ci: uprev vkd3d-proton to 2.9
|
||
- amd: fix 64-bit integer color image clears
|
||
- radv: rework configuring VGT_SHADER_STAGES_EN
|
||
- radv/ci: update list of expected failures since Vulkan loader 1.3
|
||
- radv/ci: skip tests that timeout since Vulkan loader 1.3
|
||
- vulkan: Update XML and headers to 1.3.251
|
||
- radv: advertise VK_EXT_dynamic_rendering_unused_attachments
|
||
- aco: remove nir_intrinsic_load_barycentric_at_sample occurences
|
||
- radv/ci: removed expected failures that are skipped now
|
||
- radv/nir: use ac_nir_unpack_arg() for packed shader input user SGPRS
|
||
- radv: introduce SHIFT/MASK for unpacking shader input args
|
||
- radv: regroup fragment shader user SGPRs emission
|
||
- radv: merge all FS user SGPRs into one using packed arguments
|
||
- spirv: add support for SpvCapabilityFragmentBarycentricKHR
|
||
- spirv,nir: add support for BaryCoord{NoPersp}KHR builtins
|
||
- spirv,nir: add support for SpvDecorationPerVertexKHR
|
||
- nir/lower_io: add nir_intrinsic_load_input_vertex to is_input()
|
||
- nir: print locations for per-vertex fragment shader inputs
|
||
- zink/ci: remove useless RADV_PERFTEST=gpl
|
||
- radv: initialize the device cache UUID even if on-disk cache is disabled
|
||
- nir: add nir_intrinsic_load_provoking_vtx_amd
|
||
- radv: add support for nir_intrinsic_load_provoking_vtx_amd
|
||
- radv: track if the rasterization primitive is known at compile time
|
||
- nir: add nir_intrinsic_load_rasterization_primitive_amd
|
||
- radv: add support for nir_intrinsic_load_rasterization_primitive_amd
|
||
- radv: handle per_vertex variables when gathering FS inputs
|
||
- radv: set ROTATE_PC_PTR for custom interpolations
|
||
- radv: configure RSRC1.LOAD_PROVOKING_VTX for the fragment shader
|
||
- radv: add a NIR pass that lower fragment shader barycentric intrinsics
|
||
- radv: gather info about nir_intrinsic_load_sample_positions_amd
|
||
- radv: advertise VK_KHR_fragment_shader_barycentric on GFX10.3+
|
||
- radv: add a helper for emitting a null depth/stencil target
|
||
- radv: reset more DB registers when emitting a null ds target
|
||
- radv: emit DB_RENDER_CONTROL as part of the framebuffer
|
||
- radv: disable HTILE compression only when layouts are compressed
|
||
- radv/ci: update the list of expected failures on STONEY
|
||
- radv: gather info about load_poly_line_smooth_enabled
|
||
- radv: add a helper for forcing VRS 1x1 in some situations
|
||
- radv: do not force VRS 1x1 when smooth lines are enabled
|
||
- radv: fix smooth lines with graphics pipeline library
|
||
- radv: fix re-emitting some dynamic states when the previous FS is NULL
|
||
- radv: fix re-emitting early_z/late_z when the bound PS changes
|
||
- radv: reset some dynamic states when the fragment shader stage is unbound
|
||
- radv: remove unused radv_dgc_token struct
|
||
- radv: add dgc_emit_state() helper
|
||
- radv: add dgc_emit_push_constant() helper
|
||
- radv: add dgc_emit_vertex_buffer() helper
|
||
- radv: add dgc_emit_draw() helper
|
||
- radv: add dgc_emit_draw_indexed() helper
|
||
- radv: add dgc_emit_index_buffer()
|
||
- radv: do not use IB for the GFX preamble with RADV_DEBUG=noibs
|
||
- radv: use IB for the GFX preamble on GFX6
|
||
- radv: reserve space for shadowed regs
|
||
- radv/amdgpu: fix a buffer overflow for submissions with RADV_DEBUG=noibs
|
||
- radv/amdgpu: remove useless assert in radv_amdgpu_winsys_cs_submit_internal()
|
||
- radv/amdgpu: add cs_execute_ib() for executing IBs
|
||
- radv: use cs_execute_ib() for GFX, MBCP and DGC IBs
|
||
- vulkan/runtime: call CmdSetDepthBias2EXT() from CmdSetDepthBias()
|
||
- radv: implement VK_EXT_depth_bias_control
|
||
- radv: advertise VK_EXT_depth_bias_control
|
||
- radv: implement padding cmdbuffer for DGC on GFX6
|
||
- radv: enable NV_device_generated_commands on GFX6
|
||
- radv: reserve more space in CS for SQTT
|
||
- radv/amdgpu: fix dumping cs with RADV_DEBUG=noibs
|
||
- radv/amdgpu: dump all cs with RADV_DEBUG=noibs
|
||
- radv: only dirty the index type when necessary with DGC
|
||
- radv: only dirty the active push constant stages with DGC
|
||
- radv: adjust alignment of the preprocess buffer with DGC
|
||
- radv/amdgpu: use the correct IB size when growing a CS with RADV_DEBUG=noibs
|
||
- radv/amdgpu: rework growing a CS with the chained IB path slightly
|
||
- radv/amdgpu: do not set the IB size when ending a CS with RADV_DEBUG=noibs
|
||
- radv/amdgpu: use the array of IB buffers for the chained IB path
|
||
- radv/amdgpu: use cs_finalize() when growing a CS
|
||
- radv/amdgpu: rename old_ib_buffers to ib_buffers
|
||
- radv/amdgpu: add a helper to get a new IB
|
||
- radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build
|
||
- radv/amdgpu: workaround a kernel bug when replacing sparse mappings
|
||
- radv/amdgpu: add more small helpers for managing CS
|
||
- radv/amdgpu: add support for executing DGC cmdbuf with RADV_DEBUG=noibs
|
||
- radv: allow NV_device_generated_commands with RADV_DEBUG=noibs
|
||
- radv: stop emitting TILE_SURFACE_ENABLE for the ZRANGE_PRECISION workaround
|
||
- radv: inline more values in radv_emit_fb_ds_state()
|
||
- radv: emit PA_SC_SCREEN_SCISSOR_BR with the actual fb extent
|
||
- zink/ci: update list of expected failures for NAVI10
|
||
- zink: fix setting VkShaderCreateInfoEXT::nextStage
|
||
- radv/rt: fix capture/replay support
|
||
- vulkan: ignore VkPipelineColorWriteCreateInfoEXT if the state is dynamic
|
||
- Revert "radv/amdgpu: workaround a kernel bug when replacing sparse mappings"
|
||
- Revert "radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build"
|
||
- radv/amdgpu: fix executing secondaries without IB2
|
||
- radv/amdgpu: do not copy the original chain link for IBs
|
||
- radv: fix emitting SQTT userdata when CAM is needed
|
||
- radv: fix capturing RGP on RDNA3 with more than one Shader Engine
|
||
- radv: set THREAD_TRACE_MARKER_ENABLE for mesh/task draws
|
||
|
||
Sarah Walker (15):
|
||
|
||
- pvr: Support single core transfer queue commands on multicore GPUs
|
||
- pvr: Implement pvr_pbe_setup_modify_defaults()
|
||
- pvr: Complete pvr_modify_command()
|
||
- pvr: Complete pvr_unwind_rects()
|
||
- pvr: Complete pvr_double_stride()
|
||
- pvr: Implement pvr_isp_scan_direction()
|
||
- pvr: Implement pvr_reroute_to_clip()
|
||
- pvr: Support ipf_creq_pf in pvr_isp_ctrl_stream()
|
||
- pvr: Complete pvr_3d_validate_addr()
|
||
- pvr: Support multiple sources per pass in TQ job submission
|
||
- pvr: Complete pvr_generate_custom_mapping()
|
||
- pvr: Fragment register fb_cdc_zls is feature dependent
|
||
- pvr: use pvr_csb_pack() to setup CR_FB_CDC_ZLS
|
||
- pvr: Rename heap reserved area to static data carveout
|
||
- pvr: Merge main and extension command streams
|
||
|
||
Sathishkumar S (5):
|
||
|
||
- util/format: add planar3 r8_g8_b8_unorm pipe format
|
||
- frontends/va: add support for RGBP rt_format
|
||
- radeonsi/vcn: enable RGBP format on gfx940 jpeg
|
||
- radeonsi/vcn: engage all jpeg engines on gfx940 for mjpeg decode
|
||
- frontends/va: return matching drm format for yuyv pipe format
|
||
|
||
Semjon Kravtsenko (1):
|
||
|
||
- glx: Assign unique serial number to GLXBadFBConfig error
|
||
|
||
Sergi Blanch Torne (8):
|
||
|
||
- ci: Move Vulkan CTS patches to their own directory
|
||
- ci: disable Collabora's LAVA lab for maintance
|
||
- Revert "ci: disable Collabora's LAVA lab for maintance"
|
||
- ci: Allow zink-radv jobs to be manual when uprev piglit
|
||
- ci: disable Collabora's LAVA lab for maintance
|
||
- Revert "ci: disable Collabora's LAVA lab for maintance"
|
||
- ci: disable Collabora's LAVA lab for maintance
|
||
- Revert "ci: disable Collabora's LAVA lab for maintance"
|
||
|
||
Shan-Min Chao (1):
|
||
|
||
- tu/kgsl: Fix memory overwrite with vkFlushMappedMemoryRanges when more than 1 range
|
||
|
||
Sil Vilerino (30):
|
||
|
||
- d3d12: Do not fail d3d12_screen creation if D3D12_FEATURE_D3D12_OPTIONS14 not available
|
||
- frontend/va: Support QVBR rate control mode
|
||
- frontend/va: Allow distinction for HRD params sent from app and frontend defaults
|
||
- frontend/va: Allow distinction for Min/MaxQP params sent from app and frontend defaults
|
||
- d3d12: Support QVBR rate control mode
|
||
- d3d12: Support rate control HRD and MaxFrameSize app params
|
||
- d3d12: Support QPMin/QPMax app params
|
||
- d3d12: Support PIPE_VIDEO_CAP_MIN_WIDTH/HEIGHT caps
|
||
- d3d12: Support PIPE_VIDEO_CAP_ENC_QUALITY_LEVEL
|
||
- frontend/va: Add VAProfileH264High10
|
||
- frontend/va: Add H264 decode slice data
|
||
- d3d12: Use frontend H264 decode slice offsets and sizes instead of parsing buffer
|
||
- d3d12: Clean unused code for parsing slices
|
||
- frontends/va: Extend AV1 Encode params
|
||
- d3d12: AV1 Encode
|
||
- CI/windows: Update headers and Agility redist to 1.711.3-preview
|
||
- d3d12: Correct tx_mode_support reporting as specified in libva spec
|
||
- d3d12: Only set reduced_tx_set when supported by D3D12 caps (no libva caps for reduced_tx_set to map to)
|
||
- d3d12: Fix usage of D3D12_VIDEO_ENCODER_RATE_CONTROL_FLAG, was using D3D12_VIDEO_ENCODER_SUPPORT_FLAG wrongly instead
|
||
- frontend/va: Pass surf->fence in PIPE_VIDEO_ENTRYPOINT_ENCODE contexts for driver to wait on input surface pending work
|
||
- frontend/va: Add video processing async fence support
|
||
- d3d12: Video Decode - Implement get_decoder_fence and async queing
|
||
- d3d12: Apply style format to d3d12_video_dec.cpp
|
||
- d3d12: Video Decode - Sync 3D context copy with decode work for texture array case
|
||
- d3d12: Video Encode - GPU wait on input surface fence
|
||
- d3d12: Video Process - Implement get_processor_fence and async queing
|
||
- d3d12: Video Decode - Refactor and style fixes
|
||
- frontend/va: Fix vaSyncSurface and vaQuerySurface status for drivers not implementing get_processor_fence
|
||
- frontend/va: Remove fence_server_sync for surface in vlVaHandleVAProcPipelineParameterBufferType
|
||
- aux/tc: Add ASSERTED to unreferenced release build variable
|
||
|
||
Simon Perretta (5):
|
||
|
||
- pvr: Amend validation when checking multiple supported types
|
||
- pvr: Use movc for reading special registers
|
||
- pvr: Add support for generating transfer fragment programs
|
||
- pvr: Add support for generating transfer EOT programs
|
||
- pvr: Use driver vertex input data in the compiler
|
||
|
||
Simon Ser (4):
|
||
|
||
- wayland: generalize wayland-protocols code generation
|
||
- radv: advertise LINEAR filter support for multiplanar/subsampled
|
||
- vulkan/wsi/wayland: add 16-bit formats
|
||
- Update OpenGL headers
|
||
|
||
SoroushIMG (8):
|
||
|
||
- zink: do not emit line stipple dynamic state when emulating
|
||
- zink: take location_frac into account in lower_line_smooth_gs
|
||
- zink: fix incorrect line mode check for bresenham
|
||
- zink: refcount the correct query pool
|
||
- pvr: fix sync waiting while using pvrsrvkm
|
||
- pvr: fix infinite recursion in pvr_cmd_buffer_{start,end}_sub_cmd
|
||
- pvr: add missing frag to geom dependency for jobs targetting same render target
|
||
- pvr: Fix barrier insertion on merged subpasses
|
||
|
||
SureshGuttula (2):
|
||
|
||
- va/surface : Add Nv12 support for PRIME_2 imports
|
||
- radeonsi/vcn: update luma and chroma size
|
||
|
||
Sviatoslav Peleshko (7):
|
||
|
||
- isl: Check all channels in isl_formats_have_same_bits_per_channel
|
||
- anv: Handle UNDEFINED format in image format list
|
||
- anv: Improve image/view usage bits verification
|
||
- nir/lower_shader_calls: Fix cursor if broken after nir_cf_extract() call
|
||
- glsl: Fix yylloc.source propagation in YYLLOC_DEFAULT
|
||
- dri: Use RGB internal formats for RGBX formats
|
||
- intel/fs: Check if the whole ubo load range is in the push const range
|
||
|
||
Sylvain Munaut (1):
|
||
|
||
- egl/dri2: Add a couple of missing mutex release in error path
|
||
|
||
Tapani Pälli (33):
|
||
|
||
- isl: disable mcs (and mcs+ccs) for color msaa on gfxver 125
|
||
- iris: implement state cache invalidate for Wa_16013063087
|
||
- anv: cleanup bitmask construction for PIPELINE_SELECT
|
||
- anv: implement state cache invalidate for Wa_16013063087
|
||
- isl: fix layout for comparing surf and view properties
|
||
- egl/loader: move crtc resource infrastructure as common helper
|
||
- anv: handle missing astc for gfx125 in CreateImageView
|
||
- mesa: set a type for depth fallback texture
|
||
- intel/dev: provide helper to check if devinfo is ATS-M
|
||
- anv: add required invalidate/flush for Wa_14014427904
|
||
- iris: add required invalidate/flush for Wa_14014427904
|
||
- mesa: validate shader binary format in _mesa_spirv_shader_binary
|
||
- iris: make Wa_16013994831 to use intel_needs_workaround
|
||
- anv: make Wa_16013994831 to use intel_needs_workaround
|
||
- anv: remove BDW specific WA for CS stall enable
|
||
- intel/dev: add parentheses around intel_needs_workaround macro
|
||
- iris: use workaround framework for 1408224581, 14014097488
|
||
- anv: use workaround framework for 1408224581, 14014097488
|
||
- anv: wrap pipe control emission to a set of helper functions
|
||
- anv: implement flush part of emit_apply_pipe_flushes with helper
|
||
- anv: implement invalidate part of emit_apply_pipe_flushes with helper
|
||
- anv: convert genX_query pipe controls to use pc helper
|
||
- anv: change pipe controls in genX_state to use pc helper
|
||
- anv: change pipe control in genX_pipeline to use pc helper
|
||
- anv: change pipe controls in genX_gpu_memcpy to use pc helper
|
||
- anv: change pipe control in indirect draw gen to use pc helper
|
||
- anv: change most pipe controls in gfx8_cmd_buffer to use pc helper
|
||
- anv: convert most pc in genX_cmd_buffer to use pc helper
|
||
- isl: handle DRM_FORMAT_MOD_INVALID in isl_drm_modifier_has_aux
|
||
- intel/compiler: add more validation for acc register usage
|
||
- mesa: fix some TexParameter and SamplerParameter cases
|
||
- iris: avoid issues with undefined clip distance
|
||
- crocus: avoid issues with undefined clip distance
|
||
|
||
Tatsuyuki Ishi (18):
|
||
|
||
- util: Add dedicated hex conversion functions and use it.
|
||
- util: Call mesa_bytes_to_hex directly instead of disk_cache_format_hex_id.
|
||
- util: Add a copy of BLAKE3 hash library.
|
||
- util/blake3: Patch with hidden visibility for asm symbols.
|
||
- util: Add mesa_blake3 wrappers.
|
||
- nir: Fix serializing pointer initializers.
|
||
- radv: Make shader related destruction happen before hw_ctx.
|
||
- radv: Add RGP barrier markers for render pass transition and copy.
|
||
- radv: Guard against misplaced RGP barrier markers.
|
||
- util/blake3: Add blake3_hash typedef.
|
||
- vulkan: Migrate shader module hash to BLAKE3.
|
||
- vulkan/pipeline_cache: Do not consume object passed into remove_object.
|
||
- vulkan/pipeline_cache: Move locking outside of remove_object.
|
||
- vulkan/pipeline_cache: Move cache_object_unref out of header.
|
||
- vulkan/pipeline_cache: Introduce weak reference mode.
|
||
- radv: Enable weak reference cache for device->mem_cache.
|
||
- zink/ci: Add ext_transform_feedback@api-errors to fail list.
|
||
- radv/amdgpu: Do not pass in a BO handle when clearing PRT VA region.
|
||
|
||
Teng, Jin Chung (1):
|
||
|
||
- d3d12: HEVC Encode - Fix num_subregions_per_scanline rounding
|
||
|
||
Thomas H.P. Andersen (30):
|
||
|
||
- nir/nir_lower_wpos_center: Use the nir_shader_instructions_pass() helper
|
||
- nir/nir_lower_wpos_ytransform: Use the nir_shader_instructions_pass() helper
|
||
- nir/nir_lower_viewport_transform: Use the nir_shader_instructions_pass() helper
|
||
- nir/nir_lower_var_copies: Use the nir_shader_instructions_pass() helper
|
||
- nir/nir_lower_uniforms_to_ubo: Use the nir_shader_instructions_pass() helper
|
||
- nir/nir_lower_two_sided_color: Use the nir_shader_instructions_pass() helper
|
||
- nir/nir_lower_to_source_mods: Use the nir_shader_instructions_pass() helper
|
||
- nir/nir_lower_vec3_to_vec4: Use the nir_shader_instructions_pass() helper
|
||
- r600: remove unused code
|
||
- tgsi: delete unused functions
|
||
- aux: remove unused tgsi includes
|
||
- d3d12: remove unused tgsi includes
|
||
- etnaviv: remove unused tgsi includes
|
||
- freedreno: remove unused tgsi includes
|
||
- i915: remove unused tgsi includes
|
||
- llvmpipe: remove unused tgsi includes
|
||
- nouveau: remove unused tgsi includes
|
||
- r300: remove unused tgsi includes
|
||
- r600: remove unused tgsi includes
|
||
- radeonsi: remove unused tgsi includes
|
||
- softpipe: remove unused tgsi includes
|
||
- svga: remove unused tgsi includes
|
||
- v3d: remove unused tgsi includes
|
||
- vc4: remove unused tgsi includes
|
||
- virgl: remove unused tgsi includes
|
||
- zink: remove unused tgsi includes
|
||
- lavapipe: remove unused tgsi includes
|
||
- st: remove unused tgsi includes
|
||
- r600: tgsi cleanup
|
||
- tgsi: remove unused functions and structs
|
||
|
||
Thong Thai (11):
|
||
|
||
- gallium/pipe: add min width and min height video cap enums
|
||
- radeonsi: return min width and min height video cap values
|
||
- frontends/va: report min width and min height values if available
|
||
- mesa/main: rework locale setup/teardown
|
||
- util: check and initialize locale before using it
|
||
- tgsi: use locale independent float and double parsing
|
||
- frontends/va/config: add disable packed headers as valid config
|
||
- frontends/va/context: check min supported resolution when creating
|
||
- frontends/va/config: check for QVBR support when creating
|
||
- frontends/va/context: return error if context_id == 0
|
||
- frontends/va: fix some coverity scan reported issues
|
||
|
||
Tim Pambor (1):
|
||
|
||
- virgl: Fix stack overflow in virgl_bind_sampler_states
|
||
|
||
Timothy Arceri (32):
|
||
|
||
- util: add Pixel Game Maker MV workaround
|
||
- util: add Jamestown+ workaround
|
||
- st/glsl: move linking code to the same st file
|
||
- glsl: call nir_opt_find_array_copies() when linking
|
||
- glsl: port lower_blend_equation_advanced() to nir
|
||
- glsl: call nir version of lower_blend_equation_advanced()
|
||
- glsl: remove old lower_blend_equation_advanced() code
|
||
- glsl: add some more c wrappers for string_to_uint_map
|
||
- mesa: add some new constants
|
||
- glsl: move some compiler code out of st
|
||
- glsl: move lowering linker code out of st
|
||
- glsl: port assign location code for VS inputs or FS outputs
|
||
- glsl: call assign_attribute_or_color_locations() in NIR linker
|
||
- glsl: remove unused buffer objects with packed layout
|
||
- glsl: remove unused system vars
|
||
- glsl: drop the dce of global vars from GLSL IR linker
|
||
- nir/glsl: add nir_var_declared_implicitly enum
|
||
- glsl: move disable_varying_optimizations_for_sso() to NIR linker
|
||
- glsl: remove the always_active_io flag from GLSL IR
|
||
- glsl: inline link_varyings()
|
||
- glsl: set last_vert_prog in the nir linker
|
||
- glsl: drop link_invalidate_variable_locations()
|
||
- glsl: move store_fragdepth_layout() to nir linker
|
||
- glsl: remove glsl ir optimisation loop from linker
|
||
- st/glsl: merge link_shader() into st_link_nir()
|
||
- st/glsl: merge st_link_glsl_to_nir() into st_link_nir()
|
||
- st/glsl: merge st_glsl_to_ir.cpp with st_glsl_to_nir.cpp
|
||
- glsl: remove dead varyings before assigning attr locations
|
||
- glsl: do vs attribute validation in NIR linker
|
||
- glsl: fix validation of ES vertex attribs
|
||
- glsl: fix spirv sso validation
|
||
- util: add radeonsi workaround for Nowhere Patrol
|
||
|
||
Timur Kristóf (61):
|
||
|
||
- radv/amdgpu: Remove unnecessary assertions from chaining.
|
||
- radv: Disallow IB2 on GFX6 when using draw_indirect_multi.
|
||
- radv: Use IB BOs (chaining) by default on GFX6.
|
||
- radv: Chain command buffers on GFX6 in radv_queue.
|
||
- amd: Rename INDIRECT_BUFFER_CIK to just INDIRECT_BUFFER.
|
||
- radv: Simplify IB2 workaround.
|
||
- radv: Remove IB2 workaround from mesh shader draws.
|
||
- radv: Enable IB2 workaround on all indirect draws.
|
||
- radv: Fix dword alignment in SDMA buffer copy.
|
||
- aco: Disallow constant propagation on SOPP and fixed operands.
|
||
- amd: Add and implement sendmsg_amd intrinsic.
|
||
- amd: Add and implement gs_wave_id sysval.
|
||
- amd: Move sendmsg defines to ac_shader_util.
|
||
- ac/llvm: Clarify arguments of ac_build_sendmsg.
|
||
- ac/nir: Use sendmsg in legacy GS lowering.
|
||
- ac/nir: Emit legacy GS DONE signal in NIR.
|
||
- ac/nir/ngg: Use sendmsg in NGG lowering.
|
||
- amd: Cleanup old GS intrinsics code.
|
||
- aco: Don't allow any VALU instruction to write m0.
|
||
- aco: Initialize vcmpx field in get_cmp_info.
|
||
- radv/amdgpu: Remove unused extra BO array.
|
||
- radv/amdgpu: Split radv_amdgpu_get_bo_list to smaller functions.
|
||
- radv/amdgpu: Pass preambles to get_bo_list.
|
||
- radv/amdgpu: Use STACK_ARRAY for IB array to reduce stack usage.
|
||
- radv: Move perf counter CS creation to where it's used.
|
||
- ac: Use const keyword for some function arguments.
|
||
- radv: Use const keyword more.
|
||
- radv: Emit primitive reset index with primitive restart enable.
|
||
- radv: Compute tess info when emitting patch control points.
|
||
- radv: Move ignore forced VRS code to more optimal place.
|
||
- radv: Set last_index_type in radv_before_draw.
|
||
- radv: Slight refactor to late_scissor_emission.
|
||
- radv: Move indirect check from index buffer emission to caller.
|
||
- radv: Move empty dynamic states check to caller.
|
||
- radv: Clear query dirty flags when flushing them.
|
||
- radv: Clarify gang submit terminology.
|
||
- radv: Use RESET_FILTER_CAM for some mesh shading draws.
|
||
- aco: Mark exec write used when it writes other registers.
|
||
- radv: Remove primitive reset index from late scissor workaround.
|
||
- radv: Leave primitive reset index at max on GFX8+.
|
||
- ac: Add ac_hw_stage enum.
|
||
- aco: Use ac_hw_stage instead of aco-specific HWStage.
|
||
- aco: Add hw_stage field to aco_shader_info.
|
||
- radeonsi: Set aco_shader_info::hw_stage
|
||
- radv: Set aco_shader_info::hw_stage
|
||
- aco: Use aco_shader_info::hw_stage instead of guessing.
|
||
- aco: Remove unneeded stage related info fields.
|
||
- ac/nir/ngg: Call nir_convert_to_lcssa before divergence analysis.
|
||
- ac/nir/ngg: Add upper limit to reusable uniforms.
|
||
- ac/nir/ngg: Follow intrinsic sources when analyzing before culling.
|
||
- ac/nir/ngg: Follow tex sources when analyzing before culling.
|
||
- radv: Refactor required subgroup size in pipeline key.
|
||
- radv: Use required subgroup info for graphics shaders.
|
||
- radv: Enable required subgroup size on mesh/task.
|
||
- aco: Add MESA_SHADER_KERNEL to instruction selection setup.
|
||
- aco: Fix subgroup_id intrinsic on GFX10.3+.
|
||
- ac/nir: Add done arg to ac_nir_export_position.
|
||
- ac/nir: Slightly refactor how pos0 exports are added when missing.
|
||
- ac/nir/ngg: Wait for attribute stores before VS/TES/GS pos0 export.
|
||
- ac/nir/ngg: Refactor mesh shader primitive export.
|
||
- ac/nir/ngg: Wait for attribute ring stores in mesh shaders.
|
||
|
||
Tony Wasserka (2):
|
||
|
||
- aco/spill: Use arena allocator for next use distances
|
||
- aco/spill: Use arena allocator for spills
|
||
|
||
Veerabadhran Gopalakrishnan (2):
|
||
|
||
- radeonsi: return kernel queried video capability for HEVC and JPEG
|
||
- radeonsi: return kernel queried video capability for HEVC and JPEG
|
||
|
||
Viktoriia Palianytsia (1):
|
||
|
||
- iris,crocus: Add proper way of assigning num_levels value
|
||
|
||
Vinson Lee (10):
|
||
|
||
- r600/sfn: Initialize BlockScheduler member m_chip_family.
|
||
- freedreno/a6xx: Fix memory leak on error path.
|
||
- nv50: Fix memory leak in error path
|
||
- pvr: Fix signed comparison
|
||
- dzn: Fix qpool->queries_lock double lock
|
||
- tu: Fix missing unlock
|
||
- vulkan/wsi: Remove duplicate NULL check
|
||
- frontends/va: Fix missing unlock
|
||
- r600/sfn: Remove duplicate assignment
|
||
- vk/wsi/x11: Remove dead code
|
||
|
||
Vitaliy Triang3l Kuzmin (27):
|
||
|
||
- lavapipe: Fix vk_instance_init vk_error instance use-after-free
|
||
- radv: Fix vk_instance_init vk_error instance use-after-free
|
||
- radv: Move most of DB_SHADER_CONTROL to PS, more precise GFX11 blend WA
|
||
- docs/amd: Document Primitive Ordered Pixel Shading
|
||
- ac/nir: Support Primitive Ordered Pixel Shading in lower_ps
|
||
- aco: Support pops_exiting_wave_id PhysReg usage
|
||
- ac: Define POPS collision wave ID argument SGPR
|
||
- aco: Add s_wait_event argument bit definitions
|
||
- aco: Add Primitive Ordered Pixel Shading pseudo-instructions
|
||
- aco: Skip waitcnt insertion in the discard early exit block
|
||
- aco: Add Primitive Ordered Pixel Shading scheduling rules
|
||
- aco: Send MSG_ORDERED_PS_DONE where necessary
|
||
- aco: Add Primitive Ordered Pixel Shading waitcnt rules
|
||
- aco: Implement fragment shader interlock intrinsics
|
||
- radeonsi: Remove unconditional POPS_DRAIN_PS_ON_OVERLAP setting
|
||
- radv: Remove unconditional POPS_DRAIN_PS_ON_OVERLAP setting
|
||
- radv: Detect the use of Primitive Ordered Pixel Shading
|
||
- radv: Ensure 1x1 shading rate on GFX10.3 with interlock execution mode
|
||
- radv: Declare POPS collision wave ID shader argument
|
||
- radv: Enable POPS collision wave ID shader argument
|
||
- radv: Enable the null export workaround with POPS
|
||
- radv: Handle Primitive Ordered Pixel Shading in DB_SHADER_CONTROL
|
||
- ac/gpu_info: Check whether the device has the POPS missed overlap bug
|
||
- radv: Apply the POPS missed overlap hardware bug workaround
|
||
- radv: Disable VRS forcing with Primitive Ordered Pixel Shading
|
||
- zink/ci: Add broken fragment shader interlock test to RADV flakes
|
||
- radv: Enable VK_EXT_fragment_shader_interlock
|
||
|
||
Víctor Manuel Jáquez Leal (1):
|
||
|
||
- vulkan: complete the usage flags for video layouts
|
||
|
||
Weibin Wu (1):
|
||
|
||
- winsys/gdi: GDI B5G6R5 display target support
|
||
|
||
Xaver Hugl (1):
|
||
|
||
- vulkan wsi: add support for PresentOptionAsyncMayTear
|
||
|
||
Xi Ruoyao (1):
|
||
|
||
- Revert "glx: Remove pointless GLX_INTEL_swap_event paranoia"
|
||
|
||
Yiwei Zhang (46):
|
||
|
||
- radv: respect VK_QUERY_RESULT_WAIT_BIT in GetQueryPoolResults
|
||
- venus: stop query experimental features
|
||
- venus: adopt venus protocol release
|
||
- meson/ci: promote virtio-experimental to virtio
|
||
- docs: update Virtio-GPU Venus driver page
|
||
- ci: carry venus-protocol 1.0 release patches in virglrenderer
|
||
- ci: uprev virglrenderer to drop venus release patches
|
||
- anv: apply ANV_BO_ALLOC_IMPLICIT_SYNC for external memory
|
||
- pipe-loader: avoid undefined memcpy behavior
|
||
- lvp: avoid accessing member of NULL ptr for global entries
|
||
- venus: bump ring space to 128K
|
||
- docs/venus: update vtest instructions
|
||
- radv: fix radv_emit_userdata_vertex for vertex offset -1
|
||
- venus: silence -Wuninitialized
|
||
- venus: sync to latest protocol from header v1.3.248
|
||
- venus: sync protocol for VK_EXT_image_2d_view_of_3d
|
||
- venus: enable VK_EXT_image_2d_view_of_3d
|
||
- docs/venus: advertise VK_EXT_image_2d_view_of_3d
|
||
- venus: temporarily disable VK_EXT_memory_budget
|
||
- venus: refactor vn_device_memory to track VkMemoryType
|
||
- venus: handle device memory report requests
|
||
- venus: emit device memory report for device memory events
|
||
- venus: enable VK_EXT_device_memory_report
|
||
- docs: update venus VK_EXT_device_memory_report support
|
||
- anv: avoid requiring ordered memory planes for explicit import
|
||
- venus: suballocate feedback slot with feedback buffer alignment
|
||
- venus: refactor ahb buffer mem type bits cache to be lazy
|
||
- venus: refactor buffer cache related bits
|
||
- venus: extend VkBuffer cache to cover concurrent sharing
|
||
- venus: fix a cmd tmp storage leak
|
||
- venus: fix leaks from tracked present src images
|
||
- venus: track pool in cmd and track device in pool
|
||
- venus: cmd to reuse alloc copy from cmd pool
|
||
- venus: refactor vn_cmd_add_query_feedback and miscs
|
||
- venus: cache query batches at cmd pool
|
||
- venus: refactor query batch handling
|
||
- venus: recheck valid bit after acquiring lock to init ahb mem type bits
|
||
- venus: handle query feedback creation failure
|
||
- venus: ensure consistency of query overflow behavior
|
||
- venus: add a missing barrier before copying query feedback
|
||
- turnip: flush cache for dstBuffer in vkCmdCopyQueryPoolResults
|
||
- lvp: avoid reading immutable sampler from desc write info
|
||
- venus: fix a cmd builder render_pass state leak across reset
|
||
- venus: fix cmd state leak across implicit reset
|
||
- venus: fix a device memory report leak
|
||
- vulkan/android: add missing AHARDWAREBUFFER_USAGE_GPU_DATA_BUFFER usage
|
||
|
||
Yogesh Mohan Marimuthu (2):
|
||
|
||
- ac/gpu_info: num_cu = 4 and gfx11 enable dcc with retile
|
||
- ac/gpu_info: rearrange if checks for dcc config
|
||
|
||
Yonggang Luo (121):
|
||
|
||
- loader: Replace usage of mtx_t with simple_mtx_t in loader/loader_dri3_helper.c
|
||
- v3d: Replace usage of mtx_t with simple_mtx_t in v3d_simulator.c
|
||
- vc4: Replace usage of mtx_t with simple_mtx_t in vc4/vc4_simulator.c
|
||
- drm-shim: Replace usage of mtx_t with simple_mtx_t in drm_shim.c
|
||
- drm: Replace usage of mtx_t with simple_mtx_t in virgl/drm/virgl_drm_winsys.c
|
||
- drm: Replace usage of mtx_t with simple_mtx_t in drm/radeon_drm_winsys.c
|
||
- drm: Replace usage of mtx_t with simple_mtx_t in nouveau_drm_winsys.c
|
||
- hud: Replace usage of mtx_t with simple_mtx_t in hud_cpufreq.c
|
||
- hud: Replace usage of mtx_t with simple_mtx_t in hud_diskstat.c
|
||
- hud: Replace usage of mtx_t with simple_mtx_t in hud_nic.c
|
||
- hud: Replace usage of mtx_t with simple_mtx_t in hud_sensors_temp.c
|
||
- xlib: Replace usage of mtx_t with simple_mtx_t in xm_api.c
|
||
- rtasm: Trim trailing spaces and replace tab with 3 space
|
||
- rtasm: Replace usage of mtx_t with simple_mtx_t in rtasm_execmem.c
|
||
- nine: Replace usage of mtx_t with simple_mtx_t in nine_lock.c
|
||
- omx: Replace usage of mtx_t with simple_mtx_t in vid_omx_common.c
|
||
- vdpau: Replace usage of mtx_t with simple_mtx_t in htab.c
|
||
- c11: Remove _MTX_INITIALIZER_NP as it's not used anymore
|
||
- microsoft/compiler: Getting function impl to be consistence with decl in dxil_enums.*
|
||
- compiler: Getting shader_prim to be PACKED that consistence with pipe_prim_type
|
||
- compiler: Add SHADER_PRIM_COUNT to be SHADER_PRIM_MAX + 1
|
||
- compiler: Rename shader_prim to mesa_prim and replace all usage of pipe_prim_type with mesa_prim
|
||
- docs: Update document about pipe_prim_type with mesa_prim
|
||
- util: Replace all usage of PIPE_TIMEOUT_INFINITE with OS_TIMEOUT_INFINITE
|
||
- r300: Replace usage of os_get_process_name with util_get_process_name in r300_chipset.c
|
||
- virgl: Array cmdline on stack should initialized to 0
|
||
- virgl: Replace the usage of os_get_process_name with util_get_process_name
|
||
- compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h
|
||
- compiler: Move can_implicitly_convert_to helper to glsl module from glsl_types.h
|
||
- mesa, compiler: Move gl_texture_index to glsl_types.h
|
||
- compiler: Remove the need include "util/glheader.h" and "util/ralloc.h" in glsl_types.h
|
||
- compiler: Remove redundant struct glsl_type in nir_types.h
|
||
- vulkan: move nir_convert_ycbcr into vulkan runtime
|
||
- util: Remove redundant type cast in function align64
|
||
- util: use uint32_t as the parameter of align function
|
||
- util: Do not use align as variable name
|
||
- compiler: use align instead glsl_align and remove glsl_align
|
||
- panfrost: Replace the usage of PIPE_BIND_* with PAN_BIND_*
|
||
- ac: Replace the usage of pipe_compare_func with compare_func
|
||
- dri: Replace usage of boolean/TRUE/FALSE with bool/true/false
|
||
- freedreno: Fixes error: passing argument 1 of ‘pthread_mutex_unlock’ from incompatible pointer type in tu_pipeline.c
|
||
- wsi: Fixes passing argument 1 of ‘mtx_unlock’ from incompatible pointer type
|
||
- c11: Improve timespec_get to support TIME_MONOTONIC TIME_ACTIVE TIME_THREAD_ACTIVE TIME_MONOTONIC_RAW
|
||
- c11: Improve mtx_timedlock to use timespec_get instead of time(NULL)
|
||
- c11: Implement os_time_get_nano with timespec_get(&ts, TIME_MONOTONIC)
|
||
- zink: Replace the usage of os_get_process_name with util_get_process_name
|
||
- dd: Replace the usage of os_get_process_name with util_get_process_name in dd_draw.c
|
||
- gallium: Remove unused os_process.h in gallium/auxiliary
|
||
- util: Fixes prototype of threads_timespec_compare
|
||
- mapi: Fixes check_table.cpp for DrawArraysInstancedARB and DrawElementsInstancedARB
|
||
- meson: Use consistence disabled/enabled comment for shared-glapi option
|
||
- mapi: Fixes non-constant-expression cannot be narrowed from type 'unsigned long' to 'unsigned int' in initializer list with clang
|
||
- meson: Guard the glsl tests that only working when OpenGL ES2 is enabled
|
||
- draw: Replace usage of boolean/TRUE/FALSE with bool/true/false in draw_pt_vsplit*
|
||
- draw: Replace usage of ubyte/ushort/uint with uint8_t/uint16_t/uint32_t in draw_pt_vsplit.c
|
||
- draw: Update the comment and function name to match the type
|
||
- vtn: Do not assign main_entry_point->impl twice
|
||
- nir: Add function nir_function_set_impl
|
||
- hud: Use bool/true/false to replace boolean/TRUE/FALSE in hud/hud_context.c
|
||
- gallium/draw: Replace the usage of ushort to uint16_t in files that can not found by tools
|
||
- llvmpipe: altivec.h inclusion in -std=c++98..11 causes bool to be redefined
|
||
- treewide: replace usage of boolean to bool
|
||
- treewide: style fixes after replace usage of boolean to bool
|
||
- treewide: Replace the usage of TRUE/FALSE with true/false
|
||
- treewide: Replace the usage of ubyte/ushort with uint8_t/uint16_t
|
||
- treewide: style fixes after replace the usage of ubyte/ushort with uint8_t/uint16_t
|
||
- util: Merge p_compiler.h into src/util/compiler.h
|
||
- util: include "util/compiler.h" instead of "pipe/p_compiler.h"
|
||
- mapi: Fixes compile error with build option "-D shared-glapi=disabled"
|
||
- mapi: Now _glapi_get_dispatch_table_size always equal to sizeof(struct _glapi_table) / sizeof(void \*)
|
||
- mapi: Hide OpenGL functions to be exported when shared-glapi is disabled
|
||
- ci: Testing -D shared-glapi=disabled with debian-clang-release
|
||
- d3d12: Fixes unused-variable compile error
|
||
- compiler: set alignment=1 by default for handling empty struct/interface in glsl_types.cpp
|
||
- util: Add function util_is_power_of_two_nonzero64 in bitscan.h
|
||
- util: use uint32_t instead of unsigned in bitscan.h
|
||
- util: Getting align and align64 consistence with ALIGN
|
||
- util: Replace the usage of redundant u_align_u32 with align and remove u_align_u32
|
||
- util: Do not use align64 over unsigned int in register_allocate.c
|
||
- util: sizeof bucket are always 32bit width, use align instead align64
|
||
- mapi: Style fixes in glapi/glapi_getproc.c
|
||
- mapi: Merge get_static_proc_address into _glapi_get_proc_address
|
||
- mapi: Remove dead struct _glapi_function in glapi/glapi_getproc.c
|
||
- nir: Split macro nir_foreach_function_with_impl out of nir_foreach_function_impl
|
||
- clang-format: Add nir_foreach_function_with_impl into src/.clang-format
|
||
- treewide: Switch to use nir_foreach_function_with_impl when possible
|
||
- clang-format: Add nir_foreach_function_impl into src/.clang-format
|
||
- gallium/auxiliary: Switch to use nir_foreach_function_impl
|
||
- asahi: Use nir_foreach_function_impl instead nir_foreach_function in function agx_nir_lower_zs_emit
|
||
- d3d12: Switch to use nir_foreach_function_impl
|
||
- glsl: Switch to use nir_foreach_function_impl from nir_foreach_function
|
||
- glsl: Remove the extra scope in gl_nir_link_uniforms.c
|
||
- crocus: Switch to use nir_foreach_function_impl
|
||
- intel/compiler: Switch to use nir_foreach_function_impl
|
||
- broadcom: replace redefined ALIGN() macro with common util functions
|
||
- util: Remove redundant defined(_WIN32) in u_string.h
|
||
- util: Remove redundant #if !defined(XF86_LIBC_H) in u_string.h
|
||
- nir: Strip the const modifier on nir_function * in nir_foreach_function_with_impl
|
||
- panfrost: Convert to use nir_foreach_function_with_impl in function midgard_compile_shader_nir
|
||
- panfrost: Convert to use nir_foreach_function_impl when possible
|
||
- mesa: Convert to use nir_foreach_function_impl
|
||
- llvmpipe: Convert to use nir_foreach_function_impl
|
||
- sfn: Convert to use nir_foreach_function_impl
|
||
- sfn: indent fixes after switch to use nir_foreach_function_impl
|
||
- compiler/clc: Switch to use nir_foreach_function_impl in function nir_lower_libclc
|
||
- dxil: Use nir_remove_non_entrypoints
|
||
- nir: Update the comment to call nir_remove_non_entrypoints directly
|
||
- glsl: Use nir_remove_non_entrypoints to simplify the code
|
||
- radv: Use nir_remove_non_entrypoints in radv_shader.c
|
||
- nir: Add nir_foreach_function_safe and use it
|
||
- pvr: Use alignas instead of ALIGN_ATTR and remove ALIGN_ATTR
|
||
- vc4: Convert to use nir_foreach_function_impl when possible
|
||
- v3d: Switch to use nir_foreach_function_impl
|
||
- broadcom: Switch to use nir_foreach_function_impl
|
||
- radeonsi: Use ALIGN_POT instead ALIGN_TO
|
||
- etnaviv: Convert to use nir_foreach_function_impl
|
||
- intel/vulkan: Convert to use nir_foreach_function_impl when possible
|
||
- iris: Convert to use nir_foreach_function_impl
|
||
- treewide: Remove all usage of nir_builder_init with nir_builder_create and nir_builder_at
|
||
- treewide: remove unused nir_builder
|
||
- nir: Remove nir_builder_init, it's not used anymore
|
||
|
||
Zhang Ning (2):
|
||
|
||
- lima: use u_pipe_screen_lookup_or_create in the renderonly path too
|
||
- Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR"
|
||
|
||
Zhang, Jianxun (3):
|
||
|
||
- intel/isl: Fix map between sRGB and linear formats
|
||
- anv: Support 1MB AUX mapping (MTL)
|
||
- anv: Remove alignment to aux ratio on size of main surface
|
||
|
||
antonino (29):
|
||
|
||
- zink: don't emulate edgeflags for patches
|
||
- zink: use correct primitives for passthrough gs with tess
|
||
- zink: add \`single_sample` to fs key
|
||
- zink: add to multisample field to \`zink_gfx_pipeline_state`
|
||
- zink: don't render with multisampling when it is disabled
|
||
- zink/ci: remove xt_framebuffer_multisample-interpolation fail
|
||
- zink: fix pv mode lowring index calculation
|
||
- zink: use ring buffer to preserve last element
|
||
- zink: fix exit condition on pv emulation loop
|
||
- zink: fix line strip offsets in pv mode emulation
|
||
- nir/zink: use sysvals in \`nir_create_passthrough_gs`
|
||
- zink: fix store subsitution in \`lower_pv_mode_gs_store`
|
||
- zink: set when pipeline dirty flag when multisample changes
|
||
- Revert "zink: set when pipeline dirty flag when multisample changes"
|
||
- Revert "zink/ci: remove xt_framebuffer_multisample-interpolation fail"
|
||
- Revert "zink: don't render with multisampling when it is disabled"
|
||
- Revert "zink: add to multisample field to \`zink_gfx_pipeline_state`"
|
||
- Revert "zink: add \`single_sample` to fs key"
|
||
- zink: take location_frac into account in pv emulation
|
||
- nir: use \`nir_variable_clone` in \`nir_create_passthrough_gs`
|
||
- nir: don't create invalid inputs in \`nir_create_passthrough_gs`
|
||
- zink: don't replace non generated gs
|
||
- nir: handle interface blocks in \`copy_vars`
|
||
- zink: handle interface blocks in \`copy_vars`
|
||
- nir: make var arrays large enough in \`nir_create_passthrough_gs`
|
||
- zink: don't create invalid inputs in \`zink_create_quads_emulation_gs`
|
||
- vulkan/wsi: add \`vk_wsi_force_swapchain_to_current_extent` driconf
|
||
- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "The Talos Principle"
|
||
- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "Serious Sam Fusion"
|
||
|
||
i509VCB (1):
|
||
|
||
- docs/asahi: Add hardware glossary
|
||
|
||
lorn10 (1):
|
||
|
||
- docs: Update Clover's env variable documentation
|
||
|
||
nihui (1):
|
||
|
||
- panvk: port panvk_logi to vk_logi
|
||
|
||
norablackcat (24):
|
||
|
||
- rusticl: implement cl_khr_pci_bus_info
|
||
- docs/rusticl: add Contributing section
|
||
- rusticl/types add ::new for cl_dev_idp_accel_props
|
||
- rusticl/api: add integer_dot_product api
|
||
- rusticl/clc add integer_dot_prod feature macros
|
||
- rusticl/kernel: remove nir_lower_pack pass
|
||
- rusticl/device: add cl_khr_integer_dot_product ext
|
||
- rusticl/program: fix clippy cast to the same type
|
||
- rusticl/types: fix clippy new() not returning Self
|
||
- rusticl/screen: implement uuid wrapper funcs
|
||
- rusticl/device: implement cl_khr_device_uuid
|
||
- rusticl/screen: fix driver_uuid on non x86
|
||
- rusticl: add cl_khr_create_command_queue
|
||
- docs/features update opencl extensions add rusticl
|
||
- docs: rusticl envvars list supported drivers
|
||
- rusticl/memory: fix clippy errors
|
||
- gallium: add PIPE_CAP_TIMER_RESOLUTION
|
||
- llvmpipe/screen: add PIPE_CAP_TIMER_RESOLUTION
|
||
- sofpipe/screen: add PIPE_CAP_TIMER_RESOLUTION
|
||
- crocus/screen: add PIPE_CAP_TIMER_RESOLUTION
|
||
- iris/screen: add PIPE_CAP_TIMER_RESOLUTION
|
||
- r600/pipe: add PIPE_CAP_TIMER_RESOLUTION
|
||
- radeonsi/get: add PIPE_CAP_TIMER_RESOLUTION
|
||
- zink/screen: add PIPE_CAP_TIMER_RESOLUTION
|
||
|
||
timmac-qmc (1):
|
||
|
||
- glsl: fix potential crash with DisableUniformArrayResize
|
||
|
||
xurui (6):
|
||
|
||
- zink: Some return values of malloc should be checked
|
||
- zink: Use malloc instead of ralloc
|
||
- zink: Use malloc to allocate libs
|
||
- zink: Add some printfs when initialization fails
|
||
- zink: Free the cdt when an error occurs
|
||
- zink: The result should be assigned a value when returned
|