Mesa 22.2.0 Release Notes / 2022-09-21
======================================

Mesa 22.2.0 is a new development release. People who are concerned
with stability and reliability should stick with a previous release or
wait for Mesa 22.2.1.

Mesa 22.2.0 implements the OpenGL 4.6 API, but the version reported by
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
4.6 is **only** available if requested at context creation.
Compatibility contexts may report a lower version depending on each driver.

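
Because the reported OpenGL version depends on the driver, an application usually checks it at run time. The following is a minimal sketch, not part of Mesa: it assumes the string was already obtained from glGetString(GL_VERSION) and that it follows the layout in the OpenGL specification (``major.minor``, optionally followed by a release number and vendor text, with an "OpenGL ES" prefix on ES contexts). The helper names ``parse_gl_version`` and ``supports`` are hypothetical.

```python
import re

# Hypothetical helper, not a Mesa or OpenGL API.
# Matches an optional "OpenGL ES " / "OpenGL ES-CM " prefix, then "major.minor".
_VERSION_RE = re.compile(r"^(?:OpenGL ES(?:-\w+)? )?(\d+)\.(\d+)")


def parse_gl_version(version_string):
    """Return (major, minor) parsed from a GL_VERSION string."""
    match = _VERSION_RE.match(version_string)
    if match is None:
        raise ValueError(f"unrecognised GL_VERSION string: {version_string!r}")
    return int(match.group(1)), int(match.group(2))


def supports(version_string, major, minor):
    """True if the reported version is at least major.minor."""
    return parse_gl_version(version_string) >= (major, minor)
```

A caller would pass the driver's actual string, for example ``supports(reported, 4, 6)``, and fall back to a lower feature level when the check fails.
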
Mesa 22.2.0 implements the Vulkan 1.3 API, but the version reported by
the apiVersion property of the VkPhysicalDeviceProperties struct
depends on the particular driver being used.

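
The apiVersion value is a packed 32-bit integer. As a sketch of how it can be decoded, the snippet below reproduces the bit layout the Vulkan specification defines for VK_MAKE_API_VERSION (3-bit variant, 7-bit major, 10-bit minor, 12-bit patch) in plain Python. The function names are hypothetical, and the integer is assumed to have been read from VkPhysicalDeviceProperties by some other means.

```python
def make_api_version(variant, major, minor, patch):
    """Pack a version the same way VK_MAKE_API_VERSION does."""
    return (variant << 29) | (major << 22) | (minor << 12) | patch


def decode_api_version(api_version):
    """Split a packed Vulkan apiVersion into its four fields."""
    return {
        "variant": (api_version >> 29) & 0x7,
        "major": (api_version >> 22) & 0x7F,
        "minor": (api_version >> 12) & 0x3FF,
        "patch": api_version & 0xFFF,
    }


def at_least(api_version, major, minor):
    """True if the packed version is at least major.minor."""
    fields = decode_api_version(api_version)
    return (fields["major"], fields["minor"]) >= (major, minor)
```

An application that needs Vulkan 1.3 features could gate them on ``at_least(props_api_version, 1, 3)``, where ``props_api_version`` stands for the value the driver reported.
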
SHA256 checksum
---------------

::

    b1f9c8fd08f2cae3adf83355bef4d2398e8025f44947332880f2d0066bdafa8c mesa-22.2.0.tar.xz


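
One way to check a downloaded tarball against the digest listed above is sketched here with Python's standard hashlib module. The function names and the assumption that the tarball sits in the current directory as ``mesa-22.2.0.tar.xz`` are illustrative only.

```python
import hashlib

# Digest copied from the SHA256 checksum section of these release notes.
EXPECTED_SHA256 = "b1f9c8fd08f2cae3adf83355bef4d2398e8025f44947332880f2d0066bdafa8c"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_hex):
    """Compare a file's digest with an expected hex string (case-insensitive)."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

With the tarball in place, ``matches_checksum("mesa-22.2.0.tar.xz", EXPECTED_SHA256)`` should return ``True`` for an intact download.
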
New features
------------

- WGL_ARB_create_context_robustness

- d3d12 ARB_robust_buffer_access_behavior

- VK_EXT_robustness2 for lavapipe

- VK_EXT_image_2d_view_of_3d on RADV

- zink and d3d12 GL_EXT_memory_object_win32 and GL_EXT_semaphore_win32 support

- vertexAttributeInstanceRateZeroDivisor support for lavapipe

- panfrost Valhall support (conformant OpenGL ES 3.1 on Mali-G57)

- VK_EXT_primitives_generated_query on RADV

- VK_EXT_non_seamless_cube_map on RADV, ANV, lavapipe

- VK_EXT_border_color_swizzle on lavapipe, ANV, turnip, RADV

- VK_EXT_shader_module_identifier on RADV

- variablePointers on lavapipe

- VK_EXT_multisampled_render_to_single_sampled on lavapipe

- VK_EXT_shader_subgroup_vote/ballot on lavapipe

- EGL_KHR_context_flush_control for all drivers

- GLX_ARB_context_flush_control for all drivers

- VK_EXT_attachment_feedback_loop_layout on RADV


Bug fixes
---------

- turnip/a660: dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.subgroup_ballot failure due to "movmsk" reordered with "kill"
- [radeonsi] Graphical corruption in Valve Source-based games
- Updates to 22.2 RCs cause artifacts on nouveau and blank screen on VirtIO
- nouveau: corruption on NVAC after switching to the NIR backend
- zink: GL_TEXTURE_LOD_BIAS vs mipLodBias, and GTF-GL46.gtf21.GL3Tests.texture_lod_bias.texture_lod_bias_all failure on Turnip
- Gallium OpenGL megadriver fails to build if LLVM is enabled but not required by any component built
- Abort due to "Unknown intrinsic: intrinsic store_deref" (firefox, radeonsi)
- turnip: dEQP-VK.subgroups.vote.graphics.subgroupallequal_*int regression on a635
- regression from !17949
- r600/sfn: xonotic shows a big performance regression when using the NIR backend with sb disabled
- r600/sfn: register allocation fails for a number of piglits and deqp tests
- DG2 corruption in gnome desktop from F36
- radeonsi: error allocating sparse buffers.. bug?
- [BISECTED/radeonsi] Flickering triangles with wayfire cube
- ../src/util/u_queue.c:218: util_queue_fence_destroy: Assertion \`fence->signalled' failed for iris when disabled UTIL_FUTEX_SUPPORTED
- turnip: gfxbench aztec ruins crashes on zink
- glsl: textureGatherOffset param broken
- v3dv: android: VTS big regression after 316728a55bc8fe398be1ac2796a22f8c91fb886c
- radeonsi: driver crashes with kwin-effects-cube
- [Turnip] Final Fantasy X: Incorrect hair rendering
- turnip: gfxbench car chase crash on zink
- [r300] - osu! - Shader optimizations cause black and glitchy rendering
- [Bisected] Microsoft CLC: build failure with MSVC and Dshared-llvm=disabled with AMDGPU target available
- anv: invalid returns from vkCreateDescriptorPool
- [Bisected][RadeonSI] Blender crashes on startup
- turnip: depth/stencil regressions
- llvmpipe: assert in KHR-Single-GL45.enhanced_layouts.xfb_all_stages
- Poor performance/utilization with Splitgate
- freedreno / mesa 22.1.3 crashes Firefox Nightly
- [Bisected][RADV] Rendering issues and performance drop with GFX10 on RPCS3
- dzn: Build failure when it's the only driver built - 22.2-devel regression
- d3d12: Unresolved external symbols from Version.lib
- turnip: Geometry shader never invoked if there is no EmitVertex and gl_Position is not written
- Trine 4 causing GPU HANG: ecode 12:1:849f7804, in vkr-ring-7
- turnip: GPU fault on clear
- intel_nullhw.c:179:28: error: passing 'const void \*const' to parameter of type 'void \*' discards qualifiers [-Werror,-Wincompatible-pointer-types-discards-qualifiers]
- macOS undefined symbol _glapi_tls_Context build error
- Build on macOS regressed
- Regression: virgl: The switch to NTT breaks "Total War: Warhammer"
- macOS's LIST_ENTRY macro conflicts with ours
- src/util/disk_cache_os.c:234:21: error: too many arguments provided to function-like macro invocation
- [RadeonSI] Teamfortress 2 crashing frequently with mesa 22.1.1
- Unity game segfaults often
- Alien Isolation segfaulting in radeonsi_dri.so starting with Mesa 22.1.0
- turnip: dEQP-VK.subgroups.ballot_broadcast.framebuffer.* has failures on older deqp-vk
- !17062 broke XTS xts5/Xlib9/XDrawImageString(16) with Xephyr using glamor on llvmpipe
- DG2 full BAR broken by low BAR?
- crocus: SSBOs regressed
- Homerun Clash game triggers assert in compiler
- radv: line rasterization mode affects multisampling
- turnip: caselist fail
- vtn: mediump_16bit_alu and compatibility
- radv: dynamic vertex input failure
- navi21\\radv\\amdgpu Halo Infinite screen corruption and then hang
- ci/freedreno: a618_vk 1/7 timeouts
- Windows OpenGL32.dll without shared-glapi currently broken
- anv: KHR-GL46.tessellation_shader.single.xfb_captures_data_from_correct_stage fails on TGL
- perfetto: crash on context teardown / driver unload
- panfrost shader classification is wrong in some cases
- kopper: the latest awfulness exposed by piglit glx-multi-window-single-context
- android: r600 building error after commit 79ca456b
- Error compiling gallium-nine on i686 using musl libc
- [anv][regression][bisected][bsw,gen9atom] dEQP-VK.memory.mapping.dedicated_alloc failing on bsw and gen9atom
- anv: GTF-GL46.gtf32.GL3Tests.packed_pixels.packed_pixels_pbo failure
- r600: Tesseract does not render properly with the new NIR backend
- radv: some zink cts fails
- anv: ICL hiz issue
- [RADV, bisected] addrlib changes cause Navi CTS regressions
- r300: Segfault in shaders/closed/steam/kerbal-space-program/1017.shader_test
- Get rid of debug_assert()
- nine: docs?
- d3d12: Gallium d3d12 backend performance is slow
- [RADV][bisected] after commit 124848bf9efa1cc5d8a3cdfc654e0c03d57540e3 the games which use vulkan API not starting
- freedreno: skqp instability from GPU hangs
- ci/a618: intermittent network failure
- turnip: EXT_primitives_generated_query broken
- Godot crashes with RX5500m
- intel: Some Alan Wake shader fails validation due to bad register usage in EOT message
- r600/sfn: Add support for fmulz
- radv: failure in image.sample_texture.*_compressed_format*
- radeonsi: Hang on RX 5500 XT on latest mesa git
- blob_align assert triggers
- radv: Enabling support for instance extensions inside vkCreateInstance does not work
- zink: rendering artifacts in Portal 2
- a630: AmnesiaTDD trace gpu faults
- freedreno build issue - FAILED: src/freedreno/decode/crashdec.txt
- segfaults under X11 since the Ubuntu update to 22
- anv: renderdoc explosion on icl
- turnip: EXT_depth_clip_enable broken
- lava: test result parsing fail
- turnip: LRZ fast clear
- [ANV] GTA IV flickering
- [TGL] 25-100% perf impact on SynMark2 with 2GB dynamic pool size
- [Crocus] - Minecraft with Forge and JourneyMap - Graphical glitches
- intel - Minecraft char bar is broken after Mesa upgrade
- [wine] RAGE: texture problems
- "Unsupported modifier, resource creation failed." on stderr
- [RADV/ACO] Star Citizen Flickeriing Lights
- anv: query regressions
- ir3: Infinite loop in scheduler when splitting
- ci/lava: "Listened to connection" spam
- ci: docs build fail
- turnip: 3d copy/clear/... ops should not be counted by occlusion query
- anv: sampleLocationsEnable ignored in VkPipelineSampleLocationsStateCreateInfoEXT
- turnip: wrong border color for Z24S8
- aco_tests fail with llvm-15
- Clang 14 issues compiling src/util/u_cpu_detect.c
- eglExportDMABUFImageQueryMESA crashes on amdgpu gallium/drivers/radeonsi
- crocus: GL_SELECT regressed on hsw
- Random texture flickering in Minecraft
- panfrost: Broken interaction between CRC and tile size
- panfrost: allow_forward_to_be_killed is too conservative
- APU downgraded to OpenGL 3.0 since UBO size change in radeonsi
- [anv, bisected, regression] Pipeline statistics query segfaults
- radv/NIR: Suboptimal codegen for AccessChain on buffer device addresses
- Regression - vah264dec + glimagesink causes green taint on entire video with mesa>=22.0.0
- va: AMD: Wrong buffer size of P010 Luma plane
- 625b352f14b broke glcts on Navi1x
- Use marketing_name for the Vulkan device string in RADV
- Gallium driver crashes due to fd double close in iris driver
- radv: Khronos ray tracing sample issues
- r300: copy propagate does not correctly account for control flow
- r300: Amnesia the Dark Descent slowdown since NIR loop unrolling
- r300: some unused uniforms are not removed with the NIR varying linker (compared to old glsl varying linker)
- Mesa currently FTBFS on x32 architecture
- Runescape randomly crashes. malloc_consolidate(): unaligned fastbin chunk
- tu: viewport assert
- d3d12: point coord regression due to NIR IO changes
- nir_lower_phis_to_regs_block: Assertion \`src->src.is_ssa' failed.
- Redream emulator showing artifacts
- Build failure on sparc
- r600 video playback broken
- r300: hundreds of tests fail since "clear color buffers using color from a constant buffer"
- turnip: shader viewport handling
- Signal 6 (Aborted) when opening GTK3 applications
- radv: Conditional rendering on compute queue
- anv: line failure
- 32 bit build failure
- d3d12 build regression: d3d12video.h: error: expected ‘;’ at end of member declaration
- panfrost midgard - major issues with MelonDS emulator - not able to trace properly too
- Intel/anv: The cloak is flickering in Batman:Arkham Knight
- regression: "NIR based varying linker" breaks some xwayland/glamor applications with etnaviv
- pvr: build failure on archs with unsigned long != uint64_t
- turnip: dEQP-GLES31.functional.copy_image.non_compressed.viewclass_32_bits.* fails with angle
- Nheko misrendering on RK3399 (Mali T860)
- intel_pps_driver.cc:308:74: error: too many arguments to function ‘void intel_perf_query_result_accumulate_fields(intel_perf_query_result*, const intel_perf_query_info*, const void*, const void*, bool)
- radv: no nir validation in radv_link_shaders
- r300: deqp regressions from "glsl: enable the use of the nir based varying linker"
- [crocus] glmark2 -b jellyfish rendering issues
- radeonsi-stoney: Failing jobs during a CI bug
- iris: skqp gles_atlastext flake
- turnip: use-after-free in shader compilation
- anv: EXT_primitives_generated_query fails
- r300: vs predicate logic fails in vs-loop-complex-unroll-with-else-break.shader_test
- [Intel/Vulkan] White blinking artifacts in Wolfenstein Youngblood
- radv: Indirect unaligned dispatch
- [radv] DCC causes artifacts in Senran Kagura Shinovi Versus
- Radeon RX 5700XT crash when using prusa-slicer
- glcts proxy_textures_invalid_samples tests assert on iris
- turnip: gfxbench mh3.1 offscreen crash with ANGLE
- dEQP-GLES31.functional.shaders.builtin_functions.integer.[i|u]mulextended.* fail
- Radv leaks a descriptor set layout
- radeonsi: KHR_debug thread safety
- [radeon, rv370] Running piglit shaders@glsl-vs-raytrace-bug26691 test causes hard lockup & reboot
- clover: Assertion \`NumContainedTys && "Attempting to get element type of opaque pointer"' failed
- wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes causes memory leak in Apex Legends
- v3d: port v3dv workaround for GFXH-1918
- freedreno: fragile location packing can break cull/clip distance
- turnip: dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.generated_args.* failures
- Panfrost: wrong textures and lighting in "Thief" games in wine (not happening when using Software render)
- [Bisected] r600: Big performance regression on some games
- turnip: report corresponding drmFormatModifierPlaneCount for planar format
- piglit ext_image_dma_buf_import.ext_image_dma_buf_import* regression
- Huge amount of anon_inode:sync_file file descriptor created
- enable warnings for GNU extensions we don't want
- Grid Autosport via proton/vulkan: glitches on the grass on amdgpu, fine on amdgpu-pro
- NIR validation failed after nir_opt_if
- Etnaviv: The triangle is lost during linking together triangle strips with degenerate triangles
- intel: integer_mad_hi / integer_mad_sat / integer_mul_hi produce invalid results
- Aperture Desk Job rendering artifacts Intel TGL
- intel: integer_sub_sat produces stack corruption / OpISubBorrow compilation issue
- r300/rv530: "Black rendering" of Baldur's Gate Dark Alliance in Xbox emulator CXBX-R under wine with Gallium Nine and RADEON_DEBUG=use_tgsi
- radv/aco: xfb bug
- '../src/amd/common/ac_surface.c:1575: void ac_copy_dcc_equation(const struct radeon_info \*, ADDR2_COMPUTE_DCCINFO_OUTPUT \*, struct gfx9_meta_equation \*): assertion "dcc->equation.gfx9.num_bits <= ARRAY_SIZE(equation->u.gfx9.bit)" failed'
- Occasional flicker corruption in Rage 2, e.g. after loading, with ACO on RX 5700 XT
- Vulkan::Calling vkWaitForFences Timeout
- turnip: bad gmem rendering with PUBG mobile on angle
- Intel (CHT) - Uplink text rendering bugged out in Mesa 22.0
- gen9atom gpu hang on dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_1
- bad memory managment on panfrost RK3399 - cannot alocate more ram - fury unleashed
- radv,aco: Regression with global atomics with negative offsets


Changes
-------

Adam Jackson (32):

- docs: Note EGL enum allocation for EGL_EXT_present_opaque
- docs: Update supported drivers for 22.x
- glx/kopper: Wire up a way for SwapBuffers to fail
- wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes
- kopper: Grow a swap interval API
- glx/kopper: Enable GLX_EXT_swap_control etc.
- egl/kopper: Hook up eglSwapInterval
- wsi/x11: Hook up KHR_incremental_present
- dri: Inline dri2InvalidateDrawable into its one caller
- dri: Move driCopySubBufferExtension into drisw
- dri: move driQueryRendererIntegerCommon near its one caller
- dri: Move driC{reate,oncat}Configs near their only caller
- dri: Move dri{Get,Index}ConfigAttrib near their vtable
- dri: Remove what's left of utils.[ch]
- glx/dri2: Require DRI2 >= 1.3 for working buffer invalidation
- dri2: Require a loader with working buffer invalidation
- zink: Print the VkResult if vkCreateInstance fails
- glx/dri: Fix DRI drawable release at MakeCurrent time
- mesa: Stub out GL_APPLE_object_purgeable
- glx: Remove the remains of GLX_MESA_multithread_makecurrent
- gallium/dri: Remove the memory of GLX_MESA_multithread_makecurrent
- glx: Remove some can't-happen checks from protocolDestroyDrawable
- glx: Remove unused __glXSetupForCommand from __glXIsDirect
- glx: Remove excess screen parameter from CreateContext
- glx/dri2: Stop tracking the protocol version in the display state
- glx: Fix return values from __glXGetDrawableAttribute
- glx: Fix drawable refcounting for naked Windows
- kopper: Implement {EGL,GLX}_EXT_buffer_age
- wsi/x11: Synchronously check for error from xcb_present_pixmap
- mesa: Remove Purgeable fields from texture/buffer/renderbuffer objects
- llvmpipe: Indentation fix
- gallium/dri2: Enable {GLX_ARB,EGL_KHR}_context_flush_control

Alejandro Piñeiro (22):

- v3dv/pipeline: fix small comment typo
- v3dv/pipeline_cache: adds check to skip searching for a entry
- v3dv/pipeline_cache: add on disk cache hit stats
- v3dv/pipeline: include pipeline layout on the pipeline sha1
- v3dv/bo: unmap any bo that goes to the bo cache
- v3dv/format: no need for GetPhysicalDeviceFormatProperties
- v3dv/format: Add support for VK_KHR_format_feature_flags2
- nir: get res binding using component 0, instead of asssumig an uint
- v3dv/pipeline: call nir_lower_explicit_io after first nir optimization loop
- v3dv/pipeline: expand nir_optimize, drop st_nir_opts
- v3dv/device: add comment on depthClamp feature
- broadcom: add no_merge_jobs debug option
- broadcom: document which debug options are specific for a given driver
- broadcom/compiler: wrap nir_convert_to_lcssa with NIR_PASS_V
- broadcom/compiler: use NIR_PASS for nir_lower_vars_to_ssa at v3d_optimize_nir
- nir: call nir_metadata_preserve at nir_remove_unused_io_vars
- v3dv/pipeline: use nir_shader_instructions_pass when lowering layout
- broadcom/compiler: removed unused function
- broadcom/compiler: make several passes to return a progress
- v3d/v3dv: use NIR_PASS(_
- vc4/v3d: stop adding NORAST when SHADERDB debug option is used
- vc4/v3d: restore calling debug_get_option_vc4/v3d_debug

Alessandro Astone (5):

- kopper: Only include XCB headers on supported platforms
- Android.mk: Cleanup mesa3d-lib
- Android.mk: Make mesa3d-lib work with absolute path meson outputs
- Android.mk: Generate the dummy source in local-generated-sources-dir
- Android.mk: Intermediate output paths may already be absolute

Alexander Kanavin (1):

- swrast_kms: use swkmsDRI2Extension instead of driDRI2Extension

Alexey Bozhenko (2):

- intel/compiler: fix singleton pointer coverity warning
- spirv: fix OpBranchConditional when both branches are the same

Alyssa Rosenzweig (454):

- panfrost: Add 48-bit and 96-bit tiling routines
- panfrost: Remove bpp check for tiling
- panfrost: Only handle 2D in should_linear_convert
- panfrost: Implement tiled 3D resource access
- panfrost: Only tile for 2D dimensions
- panfrost: Allow tiling all texture targets
- panfrost: Assert alignment in tiling routine
- panfrost: Document arguments to tiling routines
- panfrost: Unit test u-interleaved tiling routines
- docs/panfrost: Document u-interleaved tiling
- panfrost: Rework dirty tracking for Valhall
- panfrost: Dirty track batch masks
- panfrost: Rework PAN_GPU_ID mechanism
- docs/panfrost: Document drm-shim usage
- meson: Ignore unused variables in release builds
- pan/bi: Simplify bi_propagate_pass_flags
- panvk: Respect line width
- panvk: Consider primitive restart in index buffer walks
- pan/bi: Implement fquantize2f16
- panvk: Handle empty shaders gracefully
- panvk: Lower shared memory
- util: Add a generic worklist implementation
- nir: Use u_worklist to back nir_block_worklist
- pan/bi: Implement pack_uvec[24]_to_uint
- panfrost: Add texture features enum to v9.xml
- pan/blit: Compile blit shaders without IDVS
- pan/blit: Generalize texture alignment
- pan/blit: Prepare for Valhall port
- pan/blit: Support v9 data structures
- panfrost: Don't allocate storage for PSIZ on Valhall
- panfrost: Compile libpanfrost for v9
- panfrost: Add helper to emit UBOs
- panfrost: Port uniform/UBO logic to Valhall
- panfrost: Don't fix up alpha test on Bifrost
- panfrost: Use common state emit for compute jobs
- panfrost: Adapt compute job emit for Valhall
- panfrost: Specialize rasterizer state for Valhall
- panfrost: Simplify attribute format expression
- panfrost: Specialize vertex elements for Valhall
- panfrost: Specialize ZSA state for Valhall
- panfrost: Specialize shader descriptors for Valhall
- panfrost: Use dirty flags to emit v9 descriptors
- panfrost: Generate Valhall Malloc IDVS jobs
- panfrost: Compile for v9
- panfrost: Kick off v9 support in the driver
- panfrost: Advertise all textures in drm-shim
- panfrost: Split pan_layout.c from pan_texture.c
- panfrost: Always use 64-bit SD with strides
- panfrost: Add afbc_superblock_{size, width, height} helpers
- panfrost: Use panfrost_afbc_superblock_width
- panfrost: Don't pretend to support multiplane AFBC
- panfrost: Extract panfrost_afbc_is_wide helper
- panfrost: Use pan_block_size in layout calculation
- panfrost: Unify panfrost_block_size paths
- panfrost: Unify paths through image_layout_init
- panfrost: Remove unused dev argument
- panfrost: Unit test block size queries
- panfrost: Simplify how image_layout_init is called
- panfrost: Add a test for pan_image_layout_init
- asahi: Workaround broken GLSL compiler
- agx: Implement some shader-db stats
- agx: Note that RA proceeds in dominance-order
- agx: Track write registers more accurately
- agx: Match order for designated initializers
- agx: Use correct types for some IR enums
- agx: Wrap compiler header in extern "C"
- agx: Add unit test infrastructure
- agx: Use pseudo ops for mov/not/and/xor/or
- agx: Implement simple copyprop
- agx: Dynamically allocate agx_instr->src
- agx: Add p_split pseudoinstruction
- agx: Add phi pseudo instruction
- agx: Add a hash table for vector extracts
- agx: Add helper to emit combines
- agx: Add helper to emit splits
- agx: Use a transfer graph for parallel copies
- agx: Lower p_split after RA
- agx: Optimize p_split(kill) specially
- agx: Emit splits for intrinsics
- agx: Use extract helper for tex internally
- agx: Remove else optimization
- agx: Use a dynarray for predecessors
- agx: Add predecessor index helper
- agx: Mark the logical ends of blocks
- agx: Add agx_after_block_logical helper
- agx: Introduce worklist infrastructure
- agx: Adapt liveness analysis for SSA
- agx: Make DCE dumber
- agx: Translate phi nodes
- agx: Copy prop into phi nodes
- agx: Don't lower phis in NIR
- agx: Lower phi nodes to parallel copies
- agx: Split up RA from post-RA lowering
- agx: Always use hash table for extracts
- agx: Unit test parallel copy lowering
- agx: Remove identity moves
- agx: Add validation pass
- agx: Update RA comment
- agx: Remove has_liveness
- agx: Remove nir_register support
- agx: Lower mediump flat shading
- panvk: Ignore point size for !points
- panvk: Use correct point size limits
- panfrost: Only flip point sprites for GL
- panvk: Streamline no shader RSD case
- panvk: Gate rasterization on !discard, not fs req
- panvk: Emit fragment RSDs even with no shader
- panfrost: Remove ancient unused code
- panfrost: Call tgsi_to_nir earlier
- panfrost: Don't subclass pipe_compute_state
- panfrost: Use s->info.stage instead of tgsi stage
- panfrost: Simplify shader key architecture
- panfrost: Refactor variant selection code
- panfrost: Refactor variant rebind code
- panfrost: Lower point sprites on Bifrost
- panfrost: Lower user clip planes
- panfrost: Flip point coords in hardware
- pan/va: Generalize message? check for asm
- pan/va: Add missing TABLE (SFU) instructions
- pan/va: Mark NOT instructions as deprecated
- pan/va: Add VAR_TEX_BUF_GRADIENT instruction
- pan/va: Add non-IDVS VAR_TEX instructions
- panvk: Lower global vars to local
- panvk: Allocate stack for each thread
- pan/bi: Handle texture offset + index
- panvk: Lower image deref atomics
- panvk: Only write depth when depth test enabled
- pan/decode: Print row strides, not line strides
- panvk: Remove unused layout structs
- panvk: Report row_stride in GetImageSubresourceLayout
- panfrost: Add helpers to work with legacy strides
- panfrost: Use row stride for explicit layouts
- panfrost: Rename away from bytes_per_pixel
- panfrost: Use row stride to calculate slice size
- panfrost: Adapt get_param for row strides
- panfrost: Use row_stride even for linear resources
- lima,panfrost: Use row stride for tiling routines
- panfrost: Remove line_stride
- panfrost: Unify row stride and AFBC row stride
- panfrost: Unit test stride calculations
- panfrost: Unit test "from legacy" helper
- pan/bi: Rename bi_block->name to bi_block->index
- pan/bi: Count blocks
- pan/bi: Add u_worklist wrapper macros
- pan/bi: Use bi_worklist in analyze_helper_requirements
- pan/bi: Use bi_worklist for liveness
- pan/bi: Use bi_worklist for post-RA liveness
- pan/bi: Use worklist for scoreboard analysis
- pan/bi: Use a dynarray for predecessors
- nir: Don't set writes_memory for reading XFB
- panfrost: Relax image check
- pan/va: Serialize memory stores
- pan/bi: Align spilled registers on Valhall
- pan/bi: Add option to test spilling
- pan/bi: Assert that blend shaders may not spill
- panfrost: Correct XML for TLS
- panfrost: Use emit_tls
- panfrost: Use packed TLS on Valhall
- panfrost/ci: Smoke test spilling
- panfrost: Don't use VARYING_SLOT_TEX0 internally
- panvk: Don't use VARYING_SLOT_TEX0 internally
- pan/bi: Rework varying linking on Valhall
- panfrost: Increase the maximum tiler heap size
- mesa/st: Fix building tests on macOS
- panvk: Stub pipeline cache using the common code
- panvk: Conform viewport code to Vulkan spec
- panvk: Call nir_opt_trivial_continues
- pan/va: Add some whitespace to Valhall disassembly
- pan/va: Add whitespace after disassembled branches
- agx: Restore Valve copyright header
- pan/bi: Don't analyze td on Valhall
- pan/bi: Emit LEA_TEX on Valhall
- pan/bi: Support image loads on Valhall
- pan/bi: Preserve AXCHG.i32 destination
- pan/bi: Handle shared atomic exchange on Valhall
- pan/bi: Handle shared/scratch on Valhall
- pan/bi: Support atomics on Valhall
- pan/bi: Ensure the end NOP isn't eliminated
- asahi: Fix hangs waiting on the notification queue
- pan/bi: Update bi_count_write_registers for Valhall
- pan/va: Mark more source sizes
- pan/va: Lower split 64-bit sources
- pan/va: Use 64-bit lowering for texturing
- pan/bi: +JUMP can't read same-cycle temp
- pan/bi: Lower phis to scalar
- pan/bi: Use value-based interference with LCRA
- pan/bi: Add source/destination counts
- pan/bi: Add collect and split instructions
- pan/bi: Add bi_before_block cursor
- pan/bi: Lower split/collect before RA
- pan/bi: Fix mov and pack_32_2x16
- pan/bi: Expand MAX_DESTS to 4
- pan/bi: Create COLLECT during isel
- pan/bi: Lift split/collect cache from AGX
- pan/bi: Simplfy BLEND emit
- pan/bi: Emit collect and split
- pan/va: Use split for 64-bit lowering
- pan/va: Don't use bi_word in FAU unit test
- pan/bi: Scalarize bi_lower_swizzle
- pan/bi: Scalarize bi_opt_cse
- pan/bi: Scalarize modifier propagation
- pan/bi: Scalarize copyprop
- pan/bi: Remove bi_word and bi_word_node
- pan/bi: Simplify register precolouring in the IR
- pan/bi: Remove liveness metadata tracking
- pan/bi: Don't propagate discard
- pan/bi: Optimize split of collect
- pan/bi: See through splits for var_tex fusion
- pan/bi: Validate preload constraints are satisfied
- pan/bi: Validate vector widths
- pan/va: Handle 2-src blend in lower_split_src
- pan/va: Remove DISCARD.f32 destination
- pan/va: Use ^ instead of ` to indicate last-use
- asahi: Run lower_blend for all fragment shaders
- asahi: Use z24_in_z32f lowering
- asahi: Validate depth/stencil formats
- asahi: Don't store to unbound render targets
- asahi: Handle uncompressed Z32F depth buffers
- asahi: Pass depth/stencil clear values to firmware
- asahi: Flush on partial clears
- asahi: Handle 3rd deflake address
- asahi: Account for every byte of the cmdbuf
- asahi: Identify more depth pipeline fields
- asahi: Dump "unknown 4a" structure
- asahi: Add XML used for visibility tests
- asahi: Relax Draw "command" check
- asahi: Note unknown field
- asahi: Identify depth bias array
- asahi: Identify "set depth bias index" field
- asahi: Identify depth bias enable bit
- asahi: Add a depth bias structure
- asahi: Implement polygon offset
- asahi: Identify partial render pipeline
- asahi: Sync some names with Project Zero
- asahi: Add segment list pretty printer
- asahi: Split unknown field in segment list header
- asahi: Validate segment list length
- asahi: Compute segment list length
- asahi: Align segment list unknowns with Metal
- asahi: Decode resource allocation responses
- asahi: Decode resource allocation requests
- asahi: Identify suballocation mechanism
- asahi: Identify suballocated size
- asahi: Drop some unknowns
- asahi: Sync some more unknown fields
- asahi: Improve render target dumping
- asahi: Fix pipeline decoding
- asahi: Handle RECT textures as 2D
- asahi: Allocate a larger segment list
- asahi: Guard against encoder overflows
- asahi: Add wrap library
- docs/asahi: Add initial driver docs
- panfrost: Simplify the kmsro create path
- panfrost: Don't allocate SHARED as scanout
- pan/bi: Recoalesce tied operands after spilling
- pan/bi: Schedule for pressure pre-RA
- pan/bi: Allow CSEing LEA_BUF_IMM
- panfrost: Fix crash with GALLIUM_HUD
- panfrost: Extract afbc_stride_blocks helper
- panfrost: Extract pan_afbc_row_stride helper
- panfrost: Fix is_wide return type
- panfrost: Add pan_afbc_tile_size helper
- panfrost: Support tiled AFBC in stride helpers
- panfrost: Align layouts to tiles of superblocks
- panfrost: Add AFBC stride unit tests
- panfrost: Add 3D texture layout unit test
- panfrost: Calculate header_size based on row_stride
- panfrost: Add a tiled 16x16 layout unit test
- panfrost: Add 1x1 layout unit tests
- panfrost: Fix AFBC flags on v6
- panfrost: Put comment in correct #ifdef
- panfrost: Handle AFBC Tiled
- panfrost: Add helper checking tiled AFBC support
- panfrost: Gate tiled AFBC on GPUs that support it
- panfrost: Advertise 16x16 tiled AFBC
- panfrost: Use tiled AFBC textures
- pan/bi: Fuse result types
- pan/bi: Test CMP result_type optimization
- panfrost: Allow FPK on Valhall
- panfrost: Set overdraw_alpha[01] flags
- panfrost: Set allow_rotating_primitives
- pan/bi: Don't allow spilling coverage mask writes
- pan/bi: Add VAR_TEX fusing unit test
- pan/va: Fill in missing src_flat16 enum
- pan/bi: Make LD_VAR w=format instead of w=vecsize
|
||
- pan/bi: Model Valhall source formats
|
||
- pan/bi: Interpolate varyings at 16-bit
|
||
- pan/bi: Extend bi_scoreboard_state for finer tracking
|
||
- pan/bi: Add slot to bi_instr
|
||
- pan/bi: Preserve flow control for non-psiz variant
|
||
- pan/bi: Add ASSERT_SHADER_EQUAL macro
|
||
- pan/bi: Add a trivial ctx->inputs for unit tests
|
||
- pan/bi: Extract bit_block helper
|
||
- pan/bi: Export bi_block_add_successor
|
||
- pan/bi: Export helper termination analysis
|
||
- pan/bi: Print flow control on instructions
|
||
- pan/va: Add pass to insert flow control
|
||
- pan/va: Do scoreboard analysis
|
||
- pan/va: Terminate helper threads
|
||
- pan/va: Add flow control merging pass
|
||
- pan/va: Assign slots roundrobin
|
||
- pan/va: Respect assigned slots
|
||
- pan/va: Unit test flow control insertion
|
||
- pan/va: Unit test flow control merging
|
||
- pan/bi: Use flow control lowering on Valhall
|
||
- pan/mdg: Scalarize with 64-bit sources
|
||
- pan/mdg: Remove uppercase write masks
|
||
- pan/mdg: Use "<<" instead of "lsl"
|
||
- pan/mdg: Only print masked components of swizzle
|
||
- pan/mdg: Only print 2 sources for ALU
|
||
- pan/mdg: Only print 1 source for moves
|
||
- pan/mdg: Distinguish SSA vs reg when printing IR
|
||
- pan/mdg: Print outmods when printing IR
|
||
- panfrost: Fix XML for AFBC header on v9
|
||
- panfrost: Pass arch instead of dev into afbc_format
|
||
- panfrost: Add pan_afbc_compression_mode on Valhall
|
||
- panfrost: Support rendering to AFBC on Valhall
|
||
- panfrost: Handle texturing from AFBC on Valhall
|
||
- panfrost: Allow 2D AFBC on Valhall
|
||
- panvk: Use vk_image_subresource_*_count for clears
|
||
- pan/bi: Respect swizzles in nir_op_pack_64_2x32_split
|
||
- pan/bi: Allow vec6 for collects
|
||
- pan/bi: Align 64-bit register sources
|
||
- pan/bi: Rework Valhall register alignment
|
||
- pan/bi: Fix clper_xor on Mali-G31
|
||
- panfrost/ci: Mark draw_buffers_indexed.* as flakes
|
||
- panfrost: Use C11 static_assert for enums
|
||
- nir: Add transform feedback system values
|
||
- nir: Export nir_io_add_intrinsic_xfb_info
|
||
- pan/bi: Add transform feedback lowering pass
|
||
- pan/bi: Handle transform feedback intrinsics
|
||
- panfrost: Don't allow vertex shaders to have side effects
|
||
- panfrost: Wire up transform feedback sysvals
|
||
- panfrost: Create transform feedback shaders
|
||
- panfrost: Launch transform feedback shaders
|
||
- pan/va: Do not insert NOPs into empty shaders
|
||
- panfrost: Extract panfrost_get_index_buffer helper
|
||
- panfrost: Don't calculate min/max indices on v9
|
||
- pan/bi: Extract MUX to CSEL optimization
|
||
- pan/va: Implement more lanes
|
||
- pan/va: Lower MUX to CSEL where possible
|
||
- pan/va: Add MUX lowering tests
|
||
- pan/bi: Implement b2i with MUX
|
||
- pan/va: Record which instructions are signed
|
||
- pan/va: Try negating small constants when lowering
|
||
- pan/va: Try widening small constants
|
||
- pan/va: Unit test constant lowering pass
|
||
- panfrost: Enable Mali-G57
|
||
- docs: Document Mali-G57 conformance
|
||
- panfrost: Fix inverted indirect dispatch condition
|
||
- pan/bi: Implement load_push_constant
|
||
- pan/bi: Don't reorder push with no_ubo_to_push
|
||
- panfrost: Use push constants for indirect dispatch
|
||
- panfrost: Use push constants for indirect draws
|
||
- panvk: Use push constants for clear
|
||
- panvk: Use push constants for copy shaders
|
||
- panfrost: Deduplicate indirect dispatch structs
|
||
- panfrost: Remove redundant first_tag access
|
||
- panfrost: Remove is_64b assignments
|
||
- panfrost: Add panfrost_query_l2_slices helper
|
||
- pan/perf: Fix performance counters on G57
|
||
- mesa,gallium: Make point coord origin a CAP
|
||
- asahi,agx: Don't set PIPE_CAP_POINT_COORD_ORIGIN_UPPER_LEFT
|
||
- panvk: Drop support for Midgard
|
||
- panfrost: Allow pixels using discard to be killed
|
||
- panfrost: Inline pan_fbd_has_zs_crc_ext
|
||
- panfrost: Disable CRC at <16x16 tile sizes
|
||
- panvk: Don't specialize clear shaders for RT
|
||
- panvk: Remove unused pushmaps
|
||
- panvk: Clear Z/S attachments without a shader
|
||
- panvk: Fix stencil clears of combined ZS images
|
||
- panvk: Simplify depth clear preload condition
|
||
- panvk: Use common CmdBeginRenderPass
|
||
- panfrost: Disable QUADS on Valhall
|
||
- panfrost: Fix batch state changes on Valhall
|
||
- panfrost: Add tristate_is_defined helper
|
||
- panfrost: Implement provoking vertices on Valhall
|
||
- pan/va: Move VA_NUM_GENERAL_SLOTS to common
|
||
- pan/va: Simplify insert flow tests
|
||
- pan/va: Workaround quirk of barrier handling
|
||
- pan/va: Unit test barrier handling
|
||
- pan/bi: Mark bi_postra_liveness_ins as MUST_CHECK
|
||
- pan/bi: Constify bi_is_staging_src argument
|
||
- pan/va: Add header guards to valhall_enums.h
|
||
- pan/va: Move va_flow_is_wait_or_none to common
|
||
- pan/va: Include BLEND for va_swap_12
|
||
- pan/va: Use validate_register_pair for BLEND pack
|
||
- pan/va: Mark last register reads
|
||
- pan/va: Unit test va_mark_last
|
||
- pan/bi: Fix LD_BUFFER.i16 definition
|
||
- pan/bi: Align accesses with packed TLS
|
||
- pan/va: Lower FADD_RSCALE.f32 to FMA_RSCALE.f32
|
||
- pan/va: Implement sample positions FAU packing
|
||
- pan/va: Rename RSCALE to LDEXP
|
||
- pan/bi: Remove FRSCALE from IR
|
||
- pan/bi: Model MKVEC.v2i8
|
||
- pan/bi: Constant fold MKVEC.v2i8
|
||
- pan/va: Pack MKVEC.v2i8 byte lanes
|
||
- pan/va: Replace MKVEC.v4i8 with MKVEC.v2i8
|
||
- pan/bi: Tune lower_vars_to_scratch
|
||
- panfrost: Enable more FP16 caps on Valhall
|
||
- panfrost: Enable CAP_INDIRECT_TEMP_ADDR on Valhall
|
||
- panfrost: Bump ESSL_FEATURE_LEVEL on Valhall
|
||
- v3d: Drop workaround for u_blitter bug
|
||
- agx: Handle loop { if { loop { .. } } }
|
||
- r600g: Remove streamout-based buffer copy path
|
||
- u_blitter: Remove util_blitter_copy_buffer
|
||
- lima,panfrost: Do not ralloc contexts against the screen
|
||
- pan/va: Handle terminal barriers
|
||
- ttn: Set nir->info.separate_shader
|
||
- pan/bi: Fix unpack_32_2x16 definition
|
||
- pan/va: Add missing <roundmode/> to V2F32_TO_V2F16
|
||
- pan/bi: Implement f2f16{_rtz, _rtne}
|
||
- panfrost: Separate core ID range from core count
|
||
- panfrost: Move bifrost_lanes_per_warp to common
|
||
- panfrost: Fix PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS
|
||
- panfrost: Fix PIPE_COMPUTE_CAP_SUBGROUP_SIZE
|
||
- panfrost/ci: Disable 0ad trace on T860
|
||
- panfrost: Clear with a quad to avoid flushing
|
||
- panfrost/ci: Disable T760 jobs
|
||
- nir/opt_shrink_vectors: Round to supported vec size
|
||
- panfrost: Revert provoking vertex assertion
|
||
- panfrost: Only key points to point coord origin
|
||
- pan/va: Dump unencodable instructions
|
||
- pan/va: Use invalid_instruction in more places
|
||
- pan/va: Print instructions with pack assert fails
|
||
- pan/va: Handle BIFROST_MESA_DEBUG=nosb
|
||
- panfrost: Make pan_select_max_tile_size O(1)
|
||
- panfrost: Use implementation-specific tile size
|
||
- panfrost: Protect pandecode by a mutex
|
||
- pan/bi: Require ATEST coverage mask input in R60
|
||
- pan/va: Stall after ATEST
|
||
- panfrost: Fix shader_modifies_coverage on Valhall
|
||
- panfrost: Add decoupled early-ZS helpers
|
||
- panfrost: Unit test early-ZS helpers
|
||
- panfrost: Add zsa->zs_always_passes flag
|
||
- panfrost: Use early-ZS helpers
|
||
- util: Enable list_assert in debugoptimized builds
|
||
- util: Gate simple_mtx_assert_locked on !NDEBUG
|
||
- agx: Port minifloat tests to GTest
|
||
- pan/bi: Assert that we use the correct CLPER
|
||
- pan/bi: Rename CLPER_V6.i32 to CLPER_OLD.i32
|
||
- pan/decode: Fix overrun decoding planes
|
||
- nir/lower_idiv: Be less creative about signs
|
||
- agx: Fix packing of samplers in texture instrs
|
||
- panfrost: Don't segfault on unknown models
|
||
- pan/bi: Don't reorder image loads across stores
|
||
- pan/bi: Don't allow ATEST to take a temporary
|
||
- pan/mdg: Print 3 sources for CSEL
|
||
- pan/bi: Fix dual texturing with uniforms
|
||
- pan/bi: Fix out-of-bounds write in va_lower_split_64bit
|
||
- pan/bi: Consider all dests in helper_block_update
|
||
- agx: Fix float copyprop of neg(neg) case
|
||
- panfrost: Respect buffer offset for OpenCL
|
||
|
||
Andres Calderon Jaramillo (2):
|
||
|
||
- nir: Account for YUV range.
|
||
- mesa/st: Account for YUV color space and range.
|
||
|
||
Andres Gomez (5):
|
||
|
||
- radv/ci: update vkd3d-proton results for AMD's Kabini
|
||
- ci: install a more recent version of Wine in the test-vk image
|
||
- ci: install gfxinfo-mupuf in the test-vk image
|
||
- ci: move b2c artifacts to their own directory
|
||
- ci: add wrapper script for Valve's traces runner
|
||
|
||
Anuj Phogat (2):
|
||
|
||
- include/uapi: Update drm_fourcc.h from kernel
|
||
- isl,iris: Add I915_FORMAT_MOD_4_TILED support for XeHP
|
||
|
||
Arvind Yadav (7):
|
||
|
||
- nir: Add a lowering pass for polygon and line smoothing
|
||
- radeonsi: Add nir_lower_poly_line_smooth pass for polygon and line smoothing
|
||
- nir: add load_point_coord_maybe_flipped intrinsics for point smoothing
|
||
- nir: Add a lowering pass for point smoothing
|
||
- ac/llvm: Add number of all interpolated inputs to ac_shader_abi
|
||
- ac/llvm: Implement nir_intrinsic_load_point_coord_maybe_flipped opcodes
|
||
- radeonsi: Enable nir_lower_point_smooth lowering pass for point smoothing
|
||
|
||
Autumn on Tape (5):
|
||
|
||
- gallivm: add subgroup shuffle support
|
||
- lavapipe: enable subgroup shuffle operations
|
||
- gallivm: use shufflevector for shuffles when index is constant data
|
||
- gallivm: use VPERMPS (x86/AVX2) for 32-bit 8-element shuffles
|
||
- lavapipe: enable subgroup quad operations
|
||
|
||
Axel Davy (6):
|
||
|
||
- frontend/nine: Skip invalid swvp calls
|
||
- frontend/nine: Fix buffer tracking out of bounds
|
||
- frontend/nine: Fix ATOC handling
|
||
- frontend/nine: Fix cso restore bug
|
||
- frontend/nine: Fix shader multi-use crash
|
||
- frontend/nine: Fix ff position_t fallback when w = 0
|
||
|
||
Bas Nieuwenhuizen (39):
|
||
|
||
- vulkan/wsi/x11: Ensure we have the required number of images for acquire.
|
||
- radv: always enable NGG on GFX11
|
||
- radv: Mark all formats as DCC compatible with each other on gfx11.
|
||
- radv: gfx11 register changes.
|
||
- radv: Adjust for new SWIZZLE_ENABLE.
|
||
- radv: Add gfx11 DCC fast clear support.
|
||
- radv: Disable transform feedback for gfx11.
|
||
- radv: update the initialization of SGPR0/1 registers for HS and GS on GFX11
|
||
- radv: Deal with derefs from opaque types in function parameters.
|
||
- radv: Add performance counter reg write.
|
||
- radv: move ac_perfcounters to physical_device.
|
||
- radv: Add pstate locking for perfcounters.
|
||
- radv: Rename SPM functions.
|
||
- radv: Make inhibiting clockgating not static.
|
||
- radv: Add helper BO for perf counters.
|
||
- radv: Add perf counter lock/unlock commandbuffers.
|
||
- radv: Add a list of performance counters.
|
||
- radv: Implement support for querying performance counters.
|
||
- radv: Implement listing performance counters.
|
||
- radv: Expose VK_KHR_performance_query.
|
||
- radv: Add ability to override the build id for the cache.
|
||
- radv: Add a 32bit memory type.
|
||
- radv: Skip setting empty index buffers to avoid hang
|
||
- radv: Expose function to write vertex descriptors for dgc.
|
||
- radv: Always store stride in the vbo descriptor.
|
||
- radv: Require 32bit memory for indirect buffers.
|
||
- radv: Expose helper for base pa_su_sc_mode_cntl.
|
||
- radv: Make radv_get_vgt_index_size non-static.
|
||
- radv: Add helper to write scissors.
|
||
- radv: Add flushing for DGC.
|
||
- radv: Add DGC meta shader.
|
||
- radv: Implement DGC generated command layout structure.
|
||
- radv: Implement DGC cmdbuffer generation.
|
||
- radv: Implement CmdExecuteGeneratedCommandsNV.
|
||
- radv: Add stub for vkCmdBindPipelineShaderGroupNV.
|
||
- radv: Expose VK_NV_device_generated_commands.
|
||
- vulkan/wsi: Take max extent into consideration for modifier selection.
|
||
- amd/common: Don't rely on DCN support checks with modifiers.
|
||
- amd/common: Disable DCC retile modifiers on RDNA1
|
||
|
||
Ben Skeggs (3):
|
||
|
||
- nv50/ir: fix OUT.FINAL on ampere
|
||
- nv50/ir: fix scope of memory ops on ampere
|
||
- nv50/ir: fix ALD/AST where address comes from AL2P
|
||
|
||
Benjamin Cheng (5):
|
||
|
||
- radv: fix memory leak of descriptor set layout
|
||
- radv: refactor image binding into struct
|
||
- radv: consolidate descriptor set buffer counts
|
||
- radv: implement disjoint memory for multiplane images
|
||
- radv: expose disjoint image support
|
||
|
||
Boris Brezillon (159):
|
||
|
||
- dzn: Skip binding entries with zero descriptors
|
||
- dzn: Make sure sampler heaps don't contain more than 2048 samplers
|
||
- microsoft/compiler: Add a dummy SV_SampleIndex when needed
|
||
- microsoft/spirv_to_dxil: Allow forcing per-sample shading
|
||
- dzn: Add support for sampleRateShading
|
||
- dzn: Pass the right type to CreateCommandList() in the reset path
|
||
- dzn: refactor error-handling
|
||
- ci/windows: Add a variable to globally disable jobs using windows runners
|
||
- dzn: Use core helpers to fill physical device features/properties
|
||
- dzn: Report actual device limits
|
||
- d3d12: Move the D3D12ResourceState code to the gallium driver
|
||
- ci/windows: Split the windows-related tests
|
||
- dzn: Add support for EXT_vertex_attribute_divisor
|
||
- dzn: Rename a bunch of dzn_descriptor_set helpers
|
||
- dzn: Pass layout objects to various dzn_descriptor_set_ptr_xxx helpers
|
||
- dzn: Split the write desc helpers in two halves
|
||
- dzn: Add support for KHR_descriptor_update_template
|
||
- microsoft/spirv_to_dxil: Add a draw_id field to vertex_runtime_data
|
||
- dzn: Fix exec_buf_stride calculation
|
||
- dzn: Simplify the triangle_fan_rewrite_index shader
|
||
- dzn: Update the draw_id sysval when issuing indirect draws
|
||
- dzn: Advertise KHR_shader_draw_parameters
|
||
- dzn: Pass an image view to dzn_cmd_buffer_clear_attachment()
|
||
- dzn: Set MAX_RTS to D3D12_SIMULTANEOUS_RENDER_TARGET_COUNT
|
||
- dzn: Move all render-related states to a substruct
|
||
- dzn: Inline dzn_cmd_buffer_clear_attachments()
|
||
- dzn: Add support for dynamic rendering
|
||
- dzn: Get rid of the render pass logic
|
||
- dzn: Allow multiDrawIndirect and drawIndirectFirstInstance
|
||
- dzn: Add KHR_draw_indirect_count support
|
||
- dzn: Add missing VKAPI_{ATTR,CALL} specifiers to BeginCommandBuffer()
|
||
- dzn: Pass the dzn_event pointer to _mesa_hash_table_insert()
|
||
- dzn: Fix the STATIC_ASSERT() in dzn_meta_blits_get_context()
|
||
- vulkan/wsi: Make wsi_win32_surface_get_capabilities() return the current extent
|
||
- dzn: Fix dzn_cmd_buffer_collect_queries()
|
||
- dzn: Use the vk_sync_binary abstraction
|
||
- spirv: Fix windows build
|
||
- dzn: Use the correct ICD path on Win32
|
||
- microsoft/spirv_to_dxil: Fix push_constants type declaration
|
||
- microsoft/spirv_to_dxil: Fix the push_constant UBO size calculation
|
||
- dzn: Emit missing transition barriers for rendering attachments
|
||
- ci/windows: Bump spirv-samples commit-ID
|
||
- microsoft/spirv2dxil: Add a debug option
|
||
- microsoft/compiler: Make sure all SV_Position components are written
|
||
- microsoft/spirv_to_dxil: Extract NIR passes out of spirv_to_dxil()
|
||
- microsoft/spirv_to_dxil: Constify the descriptor binding mapping fields
|
||
- microsoft/compiler: Make input_mappings immune to driver_location holes
|
||
- microsoft/spirv_to_dxil: Remove dead IO vars at the end of dxil_spirv_nir_passes()
|
||
- microsoft/spirv_to_dxil: Add a linking helper
|
||
- microsoft/compiler: Allow vulkan users to handle varyings linking
|
||
- microsoft/spirv_to_dxil: vectorize IOs
|
||
- dzn: Fix maxVertexInputAttributes limit
|
||
- dzn: Handle varying linking explicitly
|
||
- dzn: Fix uninitialized var access in dzn_meta_compile_shader()
|
||
- dzn: Fix vertex input handling
|
||
- microsoft/spirv_to_dxil: Let spirv_to_dxil() handle the linking
|
||
- microsoft/compiler: Drop the auto_link parameter passed to nir_to_dxil()
|
||
- dzn: Switch to ID3D12Device2
|
||
- dzn: Drop an unused argument passed to dzn_graphics_pipeline_translate_vi()
|
||
- dzn: Use CreatePipelineState()
|
||
- dzn: Enable depth-bounds testing
|
||
- microsoft/compiler: Lower fquantize2f16
|
||
- nir: Fix flat new_var assignment in create_new_io_vars()
|
||
- microsoft/compiler: Pick a type that matches interpolation mode for structs
|
||
- microsoft/compiler: Truncate function names when needed
|
||
- vulkan/util: Fill VkPhysicalDeviceIDProperties::deviceNodeMask
|
||
- spirv: Add a dst/src type to image deref loads/stores coming from OpAtomic{Load,Store}
|
||
- microsoft/compiler: Images are no longer represented by uniform vars
|
||
- microsoft/compiler: Set typed_uav_load_additional_formats when appropriate
|
||
- microsoft/compiler: s/assert()/unreachable()/ in emit_intrinsic()
|
||
- microsoft/compiler: Support load_sample_id_no_per_sample
|
||
- dzn: Decorrelate external image info and external image properties
|
||
- dzn: Return empty external properties
|
||
- dzn: Add a dummy GetDeviceGroupPeerMemoryFeatures()
|
||
- dzn: Add a dummy GetImageSparseMemoryRequirements2()
|
||
- dzn: Add dummy {Create,Destroy}SamplerYcbcrConversion() implementations
|
||
- dzn: Check that no export is requested in the memory allocation path
|
||
- dzn: Lower alignment requirements when allocating buffers or single-sample images
|
||
- dzn: Add a dzn_desc_type_has_sampler() helper
|
||
- dzn: Implement GetDescriptorSetLayoutSupport()
|
||
- dzn: Vulkan 1.1 wants maxPerSetDescriptors >= 1024
|
||
- dzn: Prepare support for pipeline variants
|
||
- dzn: Fix primitiveRestart support
|
||
- dzn: Fix triangle-fan emulation
|
||
- dzn: Support dynamic depth testing parameters properly
|
||
- dzn: Support dynamic line width
|
||
- ci/windows: Allow skipping deqp tests if we have to
|
||
- ci/windows: Add dEQP-VK.dynamic_state.* to the test list
|
||
- ci/windows: Skip dEQP-VK.api.command_buffers.record_many_draws*
|
||
- d3d12: Keep access masks up-to-date when removing vars in d3d12_disable_multisampling()
|
||
- microsoft/compiler: Move d3d12_fix_io_uint_type() to dxil_nir.c
|
||
- microsoft/compiler: Fix Layer type
|
||
- microsoft/compiler: Support gl_Layer
|
||
- dzn: Disable rasterization if the last geometry stage doesn't write the position
|
||
- dzn: Enable support for geometry shaders
|
||
- ci/dzn: Move to a .toml description
|
||
- dzn: Fix the 2DMSArray case in dzn_image_get_dsv_desc()
|
||
- dzn: Fix CmdPushConstants()
|
||
- dzn: Clamp depthBiasConstantFactor when doing the float -> int conversion
|
||
- dzn: Transition resource to RENDER_TARGET/DEPTH_WRITE before clears
|
||
- microsoft/compiler: Fix emit_ubo_var()
|
||
- ci/dzn: Copy testlog.{css,xsl} to the result dir
|
||
- nir/serialize: Put dest last in packed_instr::tex
|
||
- dzn: Advertise anisotropic filtering support
|
||
- dzn: Advertise shaderImageGatherExtended support
|
||
- dzn: Enable dynamic indexing on all kinds of descriptors
|
||
- dzn: Enable shader{Clip,Cull}Distance
|
||
- dzn: Enable the depthClamp feature
|
||
- nir/serialize: Support texop >= 16
|
||
- nir/serialize: Silence integer-overflow false positive
|
||
- vulkan: Add a vk_pipeline_hash_shader_stage() helper
|
||
- anv: Use vk_pipeline_hash_shader_stage()
|
||
- v3dv: Fix nir_shader leaks in v3dv_meta_{clear,copy}()
|
||
- v3dv: Use vk_pipeline_hash_shader_stage()
|
||
- tu: Use vk_pipeline_hash_shader_stage()
|
||
- radv: Kill unused fs_m local var in radv_create_shaders()
|
||
- radv: Use vk_pipeline_hash_shader_stage()
|
||
- dzn: Hook-up device-lost detection
|
||
- dzn: Fix indentation
|
||
- dzn: Initialize UUIDs
|
||
- dzn: Drop dzn_pipeline_cache.c
|
||
- dzn: Add missing D3D12_CACHED_PIPELINE_STATE to MAX_COMPUTE_PIPELINE_STATE_STREAM_SIZE
|
||
- dzn: Fix assertion in d3d12_pipeline_state_stream_new_desc()
|
||
- dzn: Use d3d12_compute_pipeline_state_stream_new_desc() in the compute path
|
||
- dzn: Pass the maximum stream size to d3d12_pipeline_state_stream_new_desc()
|
||
- dzn: Drop useless while(link_mask != 0)
|
||
- dzn: Drop unused allocator passed to dzn_graphics_pipeline_create()
|
||
- dzn: Let dzn_pipeline_init() initialize the root signature in the stream
|
||
- dzn: Fix potential nir_shader leak
|
||
- dzn: Use vk_to_mesa_shader_stage()
|
||
- dzn: Save a few indentation levels in graphics_pipeline_compile_shaders()
|
||
- dzn: Don't delegate binding translation to dxil_spirv_nir_passes()
|
||
- dzn: Hash pipeline layout stages
|
||
- dzn: Add a generic cached blob wrapper
|
||
- dzn: Drop unneeded goto statement in dzn_compute_pipeline_create()
|
||
- dzn: Move the compute shader compilation logic to a sub-function
|
||
- dzn: Cache NIR shaders
|
||
- dzn: Calculate a binding translation hash
|
||
- dzn: Cache DXIL shaders
|
||
- dzn: Cache pipeline info
|
||
- ci/dzn: Add pipeline cache tests to the include list
|
||
- dzn: Cache and pack transition barriers
|
||
- dzn: Don't call dzn_cmd_buffer_queue_transition_barriers() prematurely
|
||
- dzn: Rename dzn_cmd_buffer_queue_image_subres_range_transition()
|
||
- dzn: Introduce dzn_cmd_buffer_queue_image_range_state_transition()
|
||
- dzn: Allow shaders to read from depth/stencil attachments
|
||
- dzn: Fix copies of block-based formats
|
||
- dzn: Fix unused var warnings
|
||
- dzn: Fix 'statement with no effect' warning
|
||
- dzn: Fix printf() format in dxgi_get_factory() error message
|
||
- dzn: Fix {back,front}_wr_uses_ref assignment
|
||
- dzn: Fix -Wswitch warnings
|
||
- dzn: Use the vk_descriptor_set_layout base struct
|
||
- dzn: Use the vk_pipeline_layout base struct
|
||
- dzn: Mark transition barriers as executed when we execute barriers
|
||
- dzn: Query D3D12_FEATURE_D3D12_OPTIONS3 features
|
||
- dzn: Provide a helper to check if 2 formats are compatible
|
||
- dzn: Support native image copies when formats are compatible
|
||
- dzn: Take VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT into account
|
||
|
||
Boyuan Zhang (9):
|
||
|
||
- radeonsi/vcn: add vcn 4.0 encode support
|
||
- radeonsi/vcn: add vcn 4.0 encode fw interface version
|
||
- radeonsi/vcn: add encode context ib for vcn 4.0
|
||
- radeonsi/vcn: add session init ib for vcn 4.0
|
||
- radeonsi/vcn: enable vcn 4.0 encode for gfx11 asic
|
||
- radeonsi/vcn: use calculated max hierarchy depth for hevc enc
|
||
- vl: add cabac init idc for h264 encode
|
||
- frontends/va: get entropy mode info for h264 encode
|
||
- radeonsi/vcn/enc: enable cabac entropy mode
|
||
|
||
Brian Paul (101):
|
||
|
||
- gallivm: assorted clean-ups in lp_bld_nir_aos.c
|
||
- gallivm: whitespace fixes in lp_bld_tgsi.h
|
||
- gallivm: asst. clean-ups in lp_bld_sample.h
|
||
- llvmpipe: asst. clean-ups in lp_linear_fastpath.c
|
||
- llvmpipe: asst. clean-ups in lp_setup.c
|
||
- llvmpipe: asst. clean-ups in lp_state_setup.c
|
||
- gallivm: asst. clean-ups in lp_bld_type.[ch]
|
||
- gallivm: cleanup lp_bld_arit.c
|
||
- gallivm: clean-up in lp_bld_nir.h
|
||
- gallivm: add const qualifier in lp_bld_nir_soa.c
|
||
- llvmpipe: asst. clean-up in lp_state_derived.c
|
||
- llvmpipe: asst. clean-ups in lp_rast_linear.c
|
||
- llvmpipe: fix comment typo in lp_linear_fastpath.c
|
||
- llvmpipe: asst. clean-ups in lp_rast.c
|
||
- llvmpipe: asst. clean-ups in lp_linear.c
|
||
- llvmpipe: asst. clean-ups in lp_scene.h
|
||
- llvmpipe: asst. clean-up and add comments in lp_state_fs.h
|
||
- gallivm: clean-up in lp_bld_sample.c
|
||
- llvmpipe: asst. clean-up in lp_setup_context.h
|
||
- gallivm: asst. clean-ups in lp_bld_sample_soa.c
|
||
- gallivm: clean-ups in lp_bld_tgsi.[ch]
|
||
- llvmpipe: asst. clean-ups in lp_state_fs.c
|
||
- llvmpipe: minor clean-ups in lp_jit.h
|
||
- llvmpipe: asst. clean-ups in lp_state_fs_linear_llvm.c
|
||
- llvmpipe: asst. clean-ups in lp_state_fs_linear.c
|
||
- llvmpipe: asst. clean-ups in lp_setup_tri.c, lp_setup_rect.c
|
||
- llvmpipe: whitespace clean-ups in lp_rast.h
|
||
- llvmpipe: minor whitespace/formatting cleanups in lp_state_setup.h
|
||
- llvmpipe: minor clean-ups in lp_state_fs_analysis.c
|
||
- llvmpipe: misc clean-up in lp_rast_tri.c
|
||
- llvmpipe: asst. clean-ups in lp_bld_blend_aos.c
|
||
- llvmpipe: asst. clean-ups in lp_rast_tri_tmp.h
|
||
- llvmpipe: asst. clean-ups in lp_rast_priv.h
|
||
- gallivm: minor clean-ups in lp_bld_sample_soa.c
|
||
- llvmpipe: misc clean-ups in lp_bld_interp.c
|
||
- gallivm: asst. clean-ups in lp_bld_nir.c
|
||
- llvmpipe: fix comment typo in lp_bld_interp.h
|
||
- llvmpipe: asst. clean-ups in lp_linear_sampler.c
|
||
- llvmpipe: asst. clean-ups in lp_tex_sample.c
|
||
- llvmpipe: asst. clean-ups in lp_scene.c
|
||
- llvmpipe: minor clean-up in lp_state_blend.c
|
||
- llvmpipe: use enum blend types instead of unsigned
|
||
- llvmpipe: s/unsigned/enum pipe_compare_func/
|
||
- gallivm: asst. clean-ups in lp_bld_logic.
|
||
- llvmpipe: asst. clean-ups in lp_state_fs.c
|
||
- draw: s/unsigned/enum pipe_prim_type/
|
||
- draw: asst. clean-up in draw_context.h
|
||
- draw: asst. clean-up in draw_gs.h
|
||
- draw: asst. clean-up in draw_pipe.c
|
||
- draw: asst. clean-up in draw_pipe_validate.c
|
||
- draw: asst. clean-up in draw_private.h
|
||
- draw: asst. clean-up in draw_pt.h
|
||
- draw: asst. clean-up in draw_pt.c
|
||
- draw: asst. clean-up in draw_pt_fetch_shader_emit.c
|
||
- draw: asst. clean-up in draw_pt_fetch_shader_pipeline.c
|
||
- draw: asst. clean-up in draw_pt_util.c
|
||
- draw: asst. clean-up in draw_pt_vsplit.c
|
||
- draw: asst. clean-up in draw_vbuf.h
|
||
- gallivm: use enum types instead of unsigned in lp_bld_sample.[ch]
|
||
- llvmpipe: add minor comments in lp_rast.h, lp_setup-rect.c
|
||
- llvmpipe: remove lp_rast_cmd_arg::state field
|
||
- gallivm: s/unsigned/enum pipe_swizzle/
|
||
- llvmpipe: add simple assertion in generate_fragment()
|
||
- util/bitset: add BITSET_SIZE()
|
||
- lavapipe: rework code to compute textures_used, samplers_used
|
||
- llvmpipe: minor code re-org in lp_state_fs_analysis.c
|
||
- nir: add const qualifiers, move some decls in nir_to_tgsi_info.c
|
||
- llvmpipe: replace if/then with switch in llvmpipe_nir_fn_is_linear_compat()
|
||
- gallivm: s/0/LP_BLD_TEX_MODIFIER_NONE/
|
||
- llvmpipe: replace GET_A0() macro w/ inline function
|
||
- llvmpipe: add missing tex_info->texture_unit assignment
|
||
- llvmpipe: fix invalid memory used in lp_fs_linear_run
|
||
- llvmpipe: fix texcoord analysis in llvmpipe_nir_fn_is_linear_compat()
|
||
- llvmpipe: replace LP_RAST_OP\_ #defines with enum type
|
||
- gallivm: increase LP_MAX_TGSI_SHADER_IMAGES from 16 to 32
|
||
- lavapipe: fix logicop, independent blend enable/disable
|
||
- llvmpipe: don't allow texture/resource swizzles on linear path
|
||
- llvmpipe: initialize a local var to fix compiler warning in release build
|
||
- lavapipe: fix incorrect sv[] array size
|
||
- llvmpipe: fix texcoord analysis regression
|
||
- llvmpipe: asst. clean-up in lp_bld_depth.c
|
||
- llvmpipe: minor white-space clean-up in lp_setup.h
|
||
- llvmpipe: asst. clean-ups in lp_setup_rect.c
|
||
- gallivm: asst. clean-ups in lp_bld_const.c
|
||
- llvmpipe: asst. clean-ups in lp_linear_interp.c
|
||
- llvmpipe: minor optimization in lp_linear_init_interp()
|
||
- llvmpipe: asst. clean-ups in lp_state_setup.c
|
||
- llvmpipe: remove unneeded casts
|
||
- llvmpipe: replace 'unsigned' with enum types
|
||
- gallivm: asst. clean-ups in lp_bld_swizzle.c
|
||
- gallivm: simplify shuffling in emit_store_reg()
|
||
- gallivm: fix incorrect memset() in SOA emit_load_const()
|
||
- gallivm: rename 'var' param in emit_store_reg(), emit_store_var().
|
||
- llvmpipe: make opaque field a 1-bit field
|
||
- llvmpipe: minor whitespace clean-up in lp_state_fs.h
|
||
- llvmpipe: pass frontfacing param to jit_function() call
|
||
- llvmpipe: asst. clean-up in lp_rast_linear_fallback.c
|
||
- llvmpipe: remove unneeded casts in lp_rast_linear_fallback.c
|
||
- llvmpipe: consolidate code in lp_rast_linear_fallback.c
|
||
- llvmpipe: tighten up some code in lp_rast_linear_fallback.c
|
||
- llvmpipe: add some const qualifiers in lp_rast_linear_fallback.c
|
||
|
||
Caio Oliveira (1):
|
||
|
||
- intel/compiler: In XeHP prefer <1;1,0> regions before compacting
|
||
|
||
Chad Versace (2):
|
||
|
||
- venus: Refactor vn_fix_graphics_pipeline_create_info
|
||
- venus: Don't encode ignored pTessellationState
|
||
|
||
Charlie Turner (4):
|
||
|
||
- ci: fix shellcheck violations in the test-vk building script
|
||
- ci: add Linux build of apitrace
|
||
- ci: upgrade gfxreconstruct build to v0.9.10
|
||
- ci: upgrade to DXVK v1.10.1
|
||
|
||
Charmaine Lee (20):
|
||
|
||
- gallium/util: Increase the debug_flush map depth to 64
|
||
- draw: fix double free of NIR IR
|
||
- svga: fix shader IR type passed to draw create shader function
|
||
- svga/nir: enable PIPE_CAP_TGSI_TEXCOORD
|
||
- svga: enable PIPE_CAP_TGSI_TEXCOORD for vgpu10 and up
|
||
- svga/tgsi: add translation for DTRUNC
|
||
- svga/nir: enable lower double option
|
||
- svga: workaround for unexpected double swizzle
|
||
- svga: fix clip distance shadow copy writemask
|
||
- svga: add a helper function for common shader creation
|
||
- svga: create svga_shader_info
|
||
- svga: add svga_compile_shader helper function
|
||
- svga: add need_texcoord_semantic to tgsi_add_point_sprite & tgsi_add_aa_point
|
||
- svga: fix aa point
|
||
- svga: fix reference to NULL offset for atomic counter
|
||
- svga: fix shared memory index
|
||
- svga: fix mksstats build
|
||
- mesa/st: fix reference to nir->info after nir_to_tgsi
|
||
- svga: support TGSI_SEMANTIC_TEXCOORD in swtnl draw context
|
||
- svga: fix invalid component access of domain location
|
||
|
||
Chia-I Wu (61):
|
||
|
||
- turnip: fix drm modifier support with planar formats
|
||
- util: Keep quiet NaNs quiet when converting to half float.
|
||
- anv: advertise rectangularLines only for Gen10+
|
||
- turnip: let modifier take precedence over TU_DEBUG=noubwc
|
||
- freedreno/fdperf: make refresh rate configurable
|
||
- freedreno/fdperf: support dumping counters
|
||
- turnip: fix sampledImageIntegerSampleCounts
|
||
- turnip: emit VPC_SO_DISABLE in xfb begin/end
|
||
- ir3: fix mem_ctx for ir3_disasm_info::nir
|
||
- turnip: fix off-by-one in border color bitset
|
||
- turnip: fix tu6_pack_border_color for z24
- turnip: disable UBWC for SNORM formats
- turnip: if-checks autotune debug macros
- turnip: base pipeline draw costs on colorWriteMask
- turnip: include stencil test for drawcall costs
- turnip: estimate render pass costs
- turnip: consider render pass costs in autotune
- ci/turnip: add a link to VkPhysicalDeviceMemoryProperties failure
- turnip: force sample interpolations for sample shading
- turnip: advertise VK_EXT_border_color_swizzle
- freedreno/fdl: use XYZW swap for PIPE_FORMAT_X24S8_UINT
- vulkan/device_select: remove get_pdevice_proc_addr
- freedreno/drm-shim: add a660
- turnip: fix an assertion with drm-shim
- ir3: update ir3_const_state comment
- ir3: copy req_local_mem for MESA_SHADER_KERNEL
- ir3: fix output_loc size
- ir3: remove unused patch_vertices_in
- ir3: increment constoff right after it is assigned
- ir3: fix tess param allocation
- ir3: set UL flag before ir3_lower_subgroups
- turnip: remove shared_consts from tu_compiled_shaders
- turnip: shared_consts and push_consts are mutually exclusive
- turnip: add tu_common.h as the common header
- turnip: remove includes that are already in tu_common.h
- turnip: add tu_drm.h
- turnip: add tu_suballoc.h
- turnip: update tu_cs.h
- turnip: add tu_query.h
- turnip: add tu_image.h
- turnip: add tu_formats.h
- turnip: update tu_descriptor_set.h
- turnip: add tu_shader.h
- turnip: add tu_pipeline.h
- turnip: add tu_clear_blit.h
- turnip: add tu_dynamic_rendering.h
- turnip: add tu_lrz.h
- turnip: add tu_pass.h
- turnip: add tu_wsi.h
- turnip: update tu_autotune.h
- turnip: add tu_device.h
- turnip: add tu_cmd_buffer.h
- turnip: add tu_android.h
- turnip: update tu_util.h
- turnip: move away from tu_private.h
- turnip: remove tu_private.h
- turnip: remove headers from libtu_files
- turnip: use SPDX-License-Identifier
- turnip: fix a use-after-free in autotune
- turnip: lower the queue priority to 1
- ir3: fix predicate splitting in scheduler

Christian Gmeiner (12):

- isaspec: Extend split_bits(..) to accept a bitsize
- isaspec: Handle patterns bigger then 64 bit
- nir: Fix unused-variable compile warnings
- freedreno: Wait for async compile jobs to finish
- ir3: Create at least one thread for async shader compile
- gtest: Fix maybe-uninitialized compiler warning
- r600: Switch to unreachable(..)
- svga: Clear query before usage
- ci: Remove -Wno-error=maybe-uninitialized
- etnaviv: factor out shader screen related init/deint
- etnaviv: async shader compile
- etnaviv: Move nir_shader_compiler_options to compiler

Chuansheng Liu (1):

- iris,anv: correct the max thread number for DG2+

Clément Guérin (1):

- venus: implement VK_EXT_physical_device_drm

Connor Abbott (69):

- tu: Delete unused tu_clear_blit GS handling
- ir3, freedreno, tu: Make ir3_shader_variant store stream_output
- ir3, fd, tu: Copy misc. info from ir3_shader to ir3_shader_variant
- ir3: Use ir3_shader_variant::type more often
- ir3: Allocate disasm_info under variant
- ir3: Add ir3_shader_variant::compiler
- tu: Keep original blit shaders separately
- ir3: Remove ir3_shader_variant::shader
- ir3: Add functions to serialize variants
- ir3: Support disabling the pipeline cache
- ir3: Add ir3_shader_create_variant()
- tu: Hash pipeline layout contents
- tu: Rewrite cache UUID based on radv
- tu: Include turnip debug flags in pipeline cache UUID
- tu: Implement pipeline caching with shared Vulkan cache
- tu: Zero-initialize compute driver key
- tu: Implement VK_EXT_pipeline_creation_feedback
- tu: Implement VK_EXT_pipeline_creation_cache_control
- tu: Convert to sync2 entrypoints
- tu: Expose VK_KHR_synchronization2
- tu: Implement VK_EXT_image_2d_view_of_3d
- tu: Enable shaderTessellationAndGeometryPointSize
- tu: Enable VK_KHR_swapchain_mutable_format
- tu: Fix partial-write LRZ determination
- tu: Implement extendedDynamicState2LogicOp
- tu: Implement VK_EXT_color_write_enable
- tu: Enable uniform texture buffers with NPOT formats
- freedreno/a6xx: Support more 3-component formats
- tu: Fix 3d GMEM store swizzle in texture descriptor
- tu: Add missing WFI to the 3d GMEM store path
- tu: Save/restore \*_BIN_CONTROL in 3d GMEM store path
- ir3/sched: Fix could_sched() determination
- tu: Fix linemode for tessellation with isolines
- ir3: Fix vectorizer condition for SSBOs
- freedreno: Document a650-specific CP_REG_WRITE flag
- freedreno/a6xx: Document border color/swizzle interaction
- freedreno/a6xx: Fix border color swizzling
- freedreno/a6xx: Use fdl format swizzle
- freedreno/fdl: Note border color constraints
- freedreno/a6xx: Fix border color with Z24UINT_S8UINT
- tu: Fix stencil border color with has_z24uint_s8uint
- mesa/st: Fix border color type for stencil sampling
- nv50/ir/ra: Fix copying compound for moves
- freedreno/a6xx: VPC_SO_NCOMP is actually VPC_SO_BUFFER_STRIDE
- tu: Use incoherent CCU write for buffer accesses
- tu: Support resolving D24S8 -> S8
- tu: Fix resolving d32s8 into s8 on fast path
- tu/lrz: Do not use framebuffer when inheriting LRZ
- tu: Treat CP_WAIT_FOR_ME as a cache invalidate
- tu: Rewrite tess factor emission
- tu: Move TU_DONT_CARE_AS_LOAD into attachment_set_ops()
- tu: Disable GMEM for multiview inside tu_render_pass_gmem_config
- tu: Move tu_render_pass definition up
- tu: Remove usage of RenderPassBeginInfo
- tu: Initial support for dynamic rendering
- tu: Remove has_subpass_predication
- tu: Update more state with secondaries
- tu: Split out some state into a separate struct
- util/trace: Fix u_trace_*_iterator with no tracepoints
- tu: Add support for suspending and resuming renderpasses
- tu: Use common vk_image struct
- tu: Use common vk_image_view base struct
- tu: Add debug option to use emulated renderpass support
- tu: Restore formatting of tu_clear_blit.c
- tu: Fix sysmem depth attachment clear flushing
- ir3/spill: Fix extracting from a vector at the end of a block
- tu/lrz: Fix multiple subpass case with secondaries
- tu/lrz: Fix multiple depth attachment case with secondaries
- tu: Don't preload variable-count descriptors

Constantine Shablya (1):

- anv: set image_read_without_format NIR option on Vulkan 1.3

Corentin Noël (5):

- mesa: Make sure to fallback to handling the original choose texture format
- virgl/ci: Update virglrenderer
- virgl/ci: Update virglrenderer
- virgl: Add support for passing the supported number of components from virgl
- virgl: Only progagate the uniform numbers if the numbers are actually right

Cristian Ciocaltea (4):

- virgl/ci: Rename virgl-lava-* jobs to virgl-iris-*
- virgl/ci: Update checksums for some virgl iris traces
- ci: Add CPU frequency adjustment capability
- ci: Limit Intel CPU scaling frequency for performance tests

Daniel Schürmann (36):

- nir/lower_alu_to_scalar: don't set the nir_builder cursor
- aco: remove 'max_waves' and use 'num_waves' to adjust for LDS and workgroup size
- aco: split num_waves adjustment into separate function
- aco: adjust num_waves for LDS before scheduling
- aco/optimizer: prevent any overflow between SGPR and const offset on MUBUF
- aco/ra: fix live-range splits of phi definitions
- aco: don't re-create vectors for load_barycentric_* intrinsics
- aco/ra: improve split_vector register assignment if the operand is not killed
- aco/optimizer: re-combine and copy-propagate p_create_vector(p_split_vector)
- radv: move two NIR passes out of tight optimization loop
- radv: call nir_lower_vars_to_ssa later
- aco: fix spilling of phis without temp operands
- nir/opt_vectorize: add callback for max vectorization width
- nir: introduce new nir_alu_alu_width() with nir_vectorize_cb callback
- nir/opt_vectorize: refactor src rewriting to avoid unnecessary mov instructions
- aco: correctly validate v_fma_mixhi_f16 register assignment
- radv: don't lower vectorized instructions to 32bit
- radv: use callback for nir_lower_to_scalar
- radv: vectorize nir_op_fdiv & nir_op_flrp & nir_op_ineg
- radv: vectorize nir_op_fabs
- radv: fix swizzles after nir_opt_algebraic_late
- radv: reorder some NIR passes
- nir/opt_shrink_vectors: fix re-using of components for vecN
- radv/shader_info: fix load_frag_coord and load_sample_pos read masks
- aco: fix packed 16bit fneg/fsat optimization
- aco: avoid unnecessary copies in emit_wqm()
- aco: Avoid live-range splits in Exact mode
- aco: fix assertion in insert_exec_mask
- spirv: handle phis decorated with RelaxedPrecision
- radv: Only create noop-fs for graphics pipelines
- radv: create RT traversal as separate shader
- radv/rt: use derefs for the traversal stack
- radv/rt: use stage ID as handle for general and closestHit shaders
- radv/rt: Don't load ClosestHit SBT on every hit, but only once after traversal
- radv/rt: fix nir_builder cursor in lower_rt_instructions()
- radv/rt: simplify lower_rt_instructions()

Daniel Stone (10):

- dzn/ci: Don't spam conformance warnings
- CI: Disable Windows jobs
- ci: Allow lima jobs to be globally disabled
- ci: Also disable Windows container builds when down
- ci/windows: Eliminate duplicate script definition
- ci/zink: Add glx-tfp to skips
- ci/lava: Sleep before, not after, API calls
- ci: freedreno farm is down
- CI: Disable Collabora lab
- Revert "CI: Disable Collabora lab"

Danylo Piliaiev (53):

- turnip: Try harder to keep LRZ valid and fix a few edge cases
- freedreno/a6xx: Add UNK fields to CP_REG_TEST and CP_COND_REG_EXEC
- turnip: Skip load/stores for tiles with no geometry
- turnip: Add debug option to print gmem load/store skip stats
- turnip: Fix tu_debug_flags values clashing
- tu: Implement VK_EXT_image_view_min_lod
- tu: Do not flush ccu in clear/blits during renderpass
- pps: Open writable renderer node in DrmDevice::create
- tu: Fix indices of drm_msm_gem_submit_cmd when filling them
- freedreno/a6xx: Add EARLYPREAMBLE flag to all a6xx_sp_xs_ctrl_reg0
- util/u_trace: Add json output
- docs/u_trace: document u_trace usage
- ir3: handle gl_Layer and gl_ViewportIndex when there is TES + GS
- ir3: Force late-z if FS has global store/atomic
- tu: Fix D32S8 resolves into D32 or S8 in sysmem mode
- spirv: Workaround for RelaxedPrecision on OpLogical* in 3DMark
- turnip: Allow nested CP_COND_REG_EXEC
- turnip: Fix out-of-bounds builder->shader_iova access
- tu/autotune: Prevent division by zero
- turnip: copy disasm strings for pipeline executables
- tu: Disable sample counting for 3d blits during occlusion query
- tu: Reset xfb_used at the end of a renderpass
- meson: Define _GNU_SOURCE for android host system
- meson/pps: Check if libdrm exists to compile pps
- meson/tu: Don't compile libdrm paths if KGSL is selected
- tu: Do not expose storage image/buffer features for PACK16 formats
- tu: Use hw binning or sysmem with QUERY_TYPE_PRIMITIVES_GENERATED
- tu: Don't count 3d blits in QUERY_TYPE_PRIMITIVES_GENERATED
- freedreno: Document rest of GRAS_LRZ_CNTL, clarify UNK_25 event
- freedreno: document GRAS_UNKNOWN_810A
- tu: Overhaul LRZ, implement on-GPU dir tracking and LRZ fast-clear
- ir3: Use NIR's info.writes_memory to detect when when to force late-z
- tu,freedreno: Refactored START/STOP events for pipeline stats
- tu: Fix prim gen query and pipeline stats query interaction
- zink: re-enable EXT_primitives_generated_query for Turnip
- util/u_trace: Fix iteration over config_control
- turnip: Refactor tracepoints generation to reduce duplication
- turnip: Add TU_GPU_TRACEPOINT envvar to toggle tracepoints
- freedreno: Add the rest of tracepoints with start/end to perfetto
- freedreno: Refactor tracepoints generation to reduce duplication
- freedreno: Add FD_GPU_TRACEPOINT envvar to toggle tracepoints
- docs/u_trace: Add documentation for \*_GPU_TRACEPOINT envvars
- turnip: Disable LRZ fast-clear for gen1 and gen2
- tu: Do not dereference descriptorSetLayout in push descriptors tmpl
- zink/turnip: Add comments to already triaged failures for tu baseline
- turnip: Simple breadcrumbs implementation to debug hangs
- tu: WFI after PC_TESSFACTOR_ADDR update
- freedreno: WFI after PC_TESSFACTOR_ADDR update
- ir3/ra: Always insert interval for precolored inputs
- ir3: Never remove GS_HEADER_IR3 sysval input
- tu: Disable LRZ write when alpha-to-coverage is enabled
- freedreno: Disable LRZ write when alpha-to-coverage is enabled
- ir3: Prevent reordering movmsk with kill

Dave Airlie (82):

- clc: add simple llvm initialise API
- clc: initialise one more llvm stage
- u_blitter/stencil: take dstbox x/y into accounts for dst fb width
- util/stencil: fix stencil fallback blit shader texture types.
- llvmpipe: add user memory resources to the debug global list.
- gallium/vl: wrap codec support checks in a common function.
- gallium/omx: add video codec supported hook for decode paths.
- meson: add a video codec support option
- gallium/vl: respect the video codecs configure in meson
- vulkan/wsi: keep allocate queue families in image, just don't fill them
- gallivm: move to new pass manager to handle coroutines change.
- radv: precalculate hs offchip parameters.
- radv: precalculate tess ring sizes/offsets.
- radv/ac: introduce a new common function for hs calcs.
- ac: port radeonsi tess factor calculations to common code.
- radeonsi: port tess ring calcs to the common helper.
- zink: avoid extra submits for empty wait cmdbufs.
- zink: don't finish barriers cmd buffer if not used.
- radv: cache the slab bo pointer in the pipeline.
- radv: only dirty line stipple when necessary.
- radv: optimised command buffer reset of vertex bindings.
- aco: move info pointer to a copy.
- aco: move to a minimal aco shader info struct.
- aco/info: reduce the gs ring info to what is needed.
- aco: remove radv specific streamout info
- aco: convert vs and so info over to aco structs.
- aco/info: add some more fields.
- aco/radv: convert to aco shader info at the radv level.
- aco: remove radv vs prolog key from aco internals.
- aco: drop unused radv include
- ac/radv: drop info pointer from the ac and radv shader structs
- aco/radv: drop radv_nir_compiler_options from aco.
- llvmpipe: align scratch size to 64-bit size.
- meson: add build-id to pipe libraries
- vl: fix codec checks to disable properly
- lavapipe: fix depth bias offset flag enables.
- llvmpipe: flush resources for kms swrast path.
- llvmpipe: handle timespec overflow on fence waits.
- nouveau: move codegen into a standalone library.
- nouveau: move codegen to a common higher level directory.
- nouveau/codegen: drop all ubytes from codegen.
- nouveau/codegen: drop gallium headers from the interface.
- llvmpipe: disable alpha test branch if multisample is enabled.
- egl/x11: split large put image requests to avoid server destroy
- egl/x11: add missing put_image cookie cleanups
- wsi/x11: add xcb_put_image support for larger transfers.
- glx/drisw: use xcb instead of X to query connection
- draw/cliptest: add support for guard band + full Z
- zink: workaround depth sampler border color when z24 is z32
- aco: refactor the radv binary builder out of the core aco fn.
- aco/radv: provide a callback from aco shader building to build binary
- aco/radv: provide a vs prolog callback from aco to radv.
- aco: drop radv_shader.h include
- radv: add a dynamic vertex format cache.
- lavapipe: drop unreachable pNext checks.
- lavapipe: enable variablePointers
- kms/dri: add mutex lock around map/unmap
- crocus: fail query begin if upload allocation fails.
- lavapipe: state latest conformance tests passed
- llvmpipe: fix aniso cube map arrays.
- crocus: fix leak in query code.
- llvmpipe: make last_fence a screen/rast object not a context one.
- llvmpipe: keep context list and use to track resource usage.
- gallivm: use progress from subgroup lowering.
- gallivm: fix casting around read invocation.
- llvmpipe: export ARB_shader_ballot
- lavapipe: expose some subgroup extensions
- zink: fixup subgroup vote/ballot enables.
- zink/llvmpipe/ci: skip broken CTS ballot test.
- util: add reallocarray wrapper for win32
- llvmpipe: Allocate scene tiles dynamically
- gallium: add a rasterizer state bit for unrestricted depth values.
- mesa/st: disable unrestricted fragment depth values for GL/GLES
- util/pack_color: clamp depth values outside range for unorm formats.
- llvmpipe/fs: handle unrestricted depth values.
- lavapipe: enable VK_EXT_depth_range_unrestricted.
- util/reallocarray: add errno.h include.
- lavapipe: enable VK_FORMAT_R5G5B5A1_UNORM_PACK16
- gallivm: fix printf hook for cached shaders.
- draw: don't touch info values that aren't valid.
- nir_to_tgsi_info: drop const_buffers_declared
- llvmpipe: finish rendering before flushing frontbuffer resources.

David Heidelberg (58):

- ci: skqp: patch skqp to report also GL results
- ci: skqp: upgrade to skqp based on Android CTS 11.0 (r7)
- ci: skqp: update URL
- ci: intel: Merge anv and iris into src/intel/ci
- ci/iris: Enable SKQP on Tiger Lake boards
- ci: traces: switch to Valve trimmed traces
- ci: traces: re-enable disabled traces which are now trimmed traces
- ci: traces: virgl: reenable previously crashing Godot trace
- ci: uprev piglit 2022-05-10
- ci/iris: skqp: add default GLES rendertests for TGL
- ci/iris: skqp: remove flaking atlastext for TGL
- ci/virgl: fix checksum for valve traces which run on iris
- ci/freedreno: enable ROR and Nheko traces
- ci/i915: add entries for RoR and Nheko traces
- ci/intel: add RoR and Nheko traces and reenable most of Valve traces
- ci/crocus: add RoR and Nheko traces
- ci/llvmpipe: add RoR and Nheko traces
- ci/radeonsi: add RoR and Nheko traces
- ci/virgl: traces: add RoR and Nheko traces
- ci/panfrost: add RoR and Nheko traces
- ci/iris: skqp: remove flaking atlastext for TGL (gl version)
- ci: traces: temporarily disable nheko trace
- ci/virgl: traces: temporarily disable nheko trace
- ci: uprev piglit 2022-06-09
- ci/etnaviv: sort fails.txt
- ci/etnaviv: add recent fails and remove 3 unexpected passes
- ci/etnaviv: disable tests which takes too long
- ci/etnaviv: add recent flakes for gc2000
- ci/etnaviv: gc2000: drop flakes from fails
- ci/freedreno: add Blender, Warzone2100, Freedoom and Unvanquished traces
- ci/freedreno: fix A530 glmark2@ideas:speed=10000 trace
- ci/llvmpipe: add Blender, Warzone2100, Freedoom and Unvanquished traces
- ci/iris: add Blender, Warzone2100, Freedoom and Unvanquished traces
- ci/virgl: add Blender, Warzone2100, Freedoom and Unvanquished traces
- ci/virgl: iris: disable Counter Strike 1.6 trace flaking
- ci/crocus: add Blender, Warzone2100, Freedoom and Unvanquished traces
- ci/panfrost: add Blender, Warzone2100, Freedoom and Unvanquished traces
- ci/iris: disable Blender Diorama trace (incompatible with perf. traces)
- ci/freedreno: update restricted traces to the brotli compressed version
- ci/freedreno: smuggle myself into restricted traces access list
- ci/freedreno: disable non-existing trace
- ci: Uprev apitrace to 790380e05854
- ci: traces: switch to brotli compressed traces
- ci/freedreno: add more restricted traces
- ci/freedreno: temporary disable AmnesiaTDD
- ci/freedreno: disable Stellaris trace
- ci/freedreno: disable SpecOps trace, each run flaky
- Revert "ci/freedreno: temporary disable AmnesiaTDD"
- ci/traces: piglit, be more verbose
- docs: briefly document Gallium Nine
- ci: add script for manage Mesa CI
- ci: Allow disabling the whole of the Igalia farm
- ci: move apitrace instalation outside of x86_test-vk
- ci: wine-apitrace bump to 11.1
- ci/freedreno: 3 pixel change in Raven restricted trace
- nine: char can be unsigned on non-x86 platforms
- mailmap: fix order of emails for Axel Davy
- ci: set reasonable amount of wget retries for each download

Dawn Han (6):

- Add iub features and properties passthrough
- Refactor the descriptor enums to be extensible
- Add iub type to vn_descriptor_type Verified the pNext VkWriteDescriptorSet is not NULL on iub type
- Add the iub binding count tracking
- Refactor and add template support for iub
- venus: enabled iub in physical device

Denis Pauk (1):

- panvk: Return VK_ERROR_INCOMPATIBLE_DRIVER for Midgard

Dmitry Osipenko (1):

- virgl: Fix unmapping of blob resources

Dylan Baker (74):

- VERSION: bump to 22.2-devel for next cycle
- docs: truncate new_features.txt
- docs: add release notes for 22.0.2
- docs: Add sha256sums for 22.0.2
- docs: update calendar and link releases notes for 22.0.2
- docs: update calendar for 22.1.0-rc1
- docs: update calendar for 22.1.0-rc2
- winsys/radeon: remove unused simple_list.h
- r300: remove unused simple_list.h
- gallium/rbug: replace simple_list.h with list.h
- gallium/driver_trace: remove simple_list.h
- util/list.h: add a function to move an item in a list
- gallium/util/u_cache.c: remove trailing whitespace
- gallium/util/u_cache: replace simple_list.h with list.h
- gallium/draw: remove trailing whitespace
- gallium/draw: use list.h instead of simple_list.h
- llvmpipe: cleanup trailing whitespace
- llvmpipe: replace uses of simple_list.h with list.h
- util: remove simple_list
- docs: Extend calendar entries for 22.0 by 1 releases.
- docs: update calendar and link releases notes for 22.0.3
- docs: Extend calendar entries for 22.1 by 1 releases.
- docs: update calendar for 22.1.0-rc3
- docs: update calendar for 22.1.0-rc4
- docs: add release notes for 22.0.3
- docs: Add sh256sum for mesa 20.0.3
- docs: update calendar and link releases notes for 22.1.0
- docs: add release notes for 22.1.0
- relnotes: Add sha256sum and fix minor formatting issues
- docs: Add calendar entries for 22.1 release.
- docs: add release notes for 22.0.4
- docs: add sha256sum to 22.0.4 notes
- docs: Extend calendar entries for 22.0 by 1 releases.
- docs: update calendar and link releases notes for 22.0.4
- docs: add release notes for 22.1.1
- docs: Add sh256sums for 22.1.1
- docs: update calendar and link releases notes for 22.1.1
- docs: add release notes for 22.0.5
- docs: update sha256 for 22.0.5
- docs: update calendar and link releases notes for 22.0.5
- docs: add release notes for 21.3.9
- docs Add sha256 sums for 21.3.9
- docs: link releases notes for 21.3.9
- docs: add release notes for 22.1.2
- docs: add sha256sum to 22.1.2 notes
- docs: update calendar and link releases notes for 22.1.2
- docs: drop 22.1.4 calendar entry.
- docs: Add calendar entries for 22.2 release candidates.
- docs: Extend calendar entries for 22.1 by 3 releases.
- docs: add release notes for 22.1.3
- docs: add sah256sum for mesa 22.1.3
- docs: update calendar and link releases notes for 22.1.3
- docs: Bump 22.2 branchpoint by two weeks
- anv: add gfx version 12.5 flushes to CCS path
- iris|anv: gfx version 12.5 data cache flush is not a workaround
- VERSION: bump 22.3.0-devel
- VERSION: bump for 22.2.0-rc1
- .pick_status.json: Update to 8e6bdb2ed31f27c28d258859ec48e8e67f51ff41
- .pick_status.json: Update to 0a0205f04552ab21971c431a5f8bb7486d664422
- .pick_status.json: Update to 1a3b086b06c6c1ff63a4abe3752ca0a7a854dc77
- .pick_status.json: Update to c67e60ae8fefaeadbe06bf0cbd8ad2ee267a6c0e
- .pick_status.json: Update to 70891edd9728e7dd73115171fdb03622ce6450ba
- VERSION: update to 22.2.0
- Revert "VERSION: update to 22.2.0"
- VERSION: bump to 22.2.0-rc2
- .pick_status.json: Update to a3bf0da1cbd4b10043c80bf44609a3024b5fcc36
- .pick_status.json: Update to 24b9ad7cd5ebc7cfa5d03cf0f243ea4841c971b9
- .pick_status.json: Update to 74fc367127ccf945f4c649dd6ddff955c802e36e
- .pick_status.json: Mark 11ab6087797f805cf158048915c67945613c9a72 as denominated
- VERSION: bump to 22.2.0-rc3
- .pick_status.json: Update to 0c6fbfca0c91ef012e8ab767a317c07f1f6dc5e6
- .pick_status.json: Update to 8eac45b27446cd9b9eaeb147af97fff1e09832cb
- .pick_status.json: Update to baf24dea943202b3a92cad0c9f9648597040955a
- .pick_status.json: Update to 3d4c36a3bcc51ed441b2667d92291bea30ef7449

Eli Schwartz (1):

- meson: add various generated header dependencies as order-only deps

Ella Stanforth (1):

- v3dv: Implement VK_KHR_performance_query

Emil Velikov (1):

- c11: reinstate the original license and authorship

Emma Anholt (232):

- nouveau/nir: Don't try to emit OP_FMA pre-nvc0.
- Revert "ci: Disable Google's lab"
- ci/virgl: Add piglit and GL4.3 testing to the virpipe path.
- nir: Add lowering for fround_even on r300.
- nir: Skip fround_even on already-integral values.
- nir: Avoid generating extra ftruncs for array handling.
- turnip: Add TU_DEBUG=layout for dumping image layouts.
- freedreno/ir3: Make sched nodes before adding deps.
- turnip: Add nir_opt_conditional_discard.
- freedreno/ir3: Call nir_opt_find_array_copies().
- turnip: Ignore TOP/BOTTOM_OF_PIPE bits in subpass src/dst dep flags.
- turnip: Don't disable LRZ in subpasses after the first in the easy case.
- nir_to_tgsi: Add a workaround for virglrenderer TG4.
- nir_to_tgsi: Fix assertion failures handling 64-bit vec3/vec4 ssa undefs.
- nir_to_tgsi: Don't forget to split 64-bit store_per_vertex_output.
- nir_to_tgsi: Avoid swizzling from undefined channels in load_output.
- nir_to_tgsi: Allocate the primid sysval to num_inputs, not num_outputs.
- tgsi: Emit ureg HW_ATOMIC decls in range order.
- tgsi_to_nir: Emit load_ubo_vec4 instead of load_ubo on non-integer HW.
- nine: Disable optional use of TTN when MUL_ZERO_WINS is available.
- r600: Fix up some mis-indentation of blocks.
- ci/r600: Manual run updates.
- r600: Fix reading back from a temp array immediately after writing on RV770.
- r600/sb: Avoid causing an exception when getting the reciprocal of 0u.
- r600: Use nir-to-tgsi instead of TGSI when the NIR debug opt is disabled.
- nouveau/nv50: Print the number of loops in shader-db output.
- ci/nouveau: Add expectations files for GM206.
- ci/nouveau: Add nv92 xfails.
- nouveau/nir: Disable bitfield ops pre-nvc0.
- nouveau/nv50: Enable mesa/st alpha test lowering on nv50 with NIR.
- nouveau/nv50: Set the primid sysval flag if it's in the sysval list, too.
- nouveau/nir: Fix the inverted sense of usesSampleMaskIn.
- nouveau/nir: Fix edgeflag input detection.
- nouveau/nir: Set the input for vertex/instance ID like TGSI does.
- ci/dzn: Cut the runtime by running 1/3 of the enabled tests.
- nir_to_tgsi: Make vec_to_movs avoid unsupported coalescing for 64-bit.
- nir_to_tgsi: Lower FS input array indexing since we don't declare input arrays.
- tgsi: Extend array lengths when merging usage_masks.
- virgl: Switch to nir-to-tgsi by default.
- nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional.
- nir_to_tgsi: Drop the txl(lod=0)->tex nir_lower_tex() fixup.
- nouveau/nir: Move FS output stores to the end of the last block.
- ci/nouveau: Add MESA_GLES_VERSION_OVERRIDE=3.1 baseline state.
- nouveau/nir: Add support for pre-GF100 images and ssbos.
- nouveau/nir: Put the UBO offset indirect into the address reg.
- nouveau: Enable the NIR backend by default.
- util: Add some unit tests of the half-float conversions.
- nir: Eliminate out-of-bounds read/writes in local lowering.
- nir: Don't assert on tg4 offset range.
- svga: Add support for requesting NIR and translating to TGSI.
- ci/iris: Demote APL deqp to manual-only for now.
- ci/iris: Add a bunch of APL and KBL flakes recently.
- mesa/st: Always generate NIR from GLSL, and use nir_to_tgsi for TGSI drivers.
- mesa/st: Remove st_glsl_to_tgsi.
- mesa/st: Remove TGSI shader program variant handling.
- mesa/st: Drop the attrib remapping in st_prepare_vertex_program().
- mesa/st: Remove TGSI-only shader lowering code.
- tgsi_scan: Remove unused input_array_last/output_array_last.
- glsl: Remove UBO reference lowering.
- mesa: Remove unused Mesa IR PROGRAM_* register files.
- gallium/tgsi: Remove tgsi_emulate.
- mesa/st: Remove now unused TGSI paths from disk cache.
- glsl: Retire the non-NIR GLSL linking paths.
- gallium: Remove now-unused shader caps.
- glsl: Remove the unused lower_if_to_cond_assign.
- glsl: Drop the dead MOD_TO_FLOOR path.
- mesa/st: Clean up a bit of st_prepare_vertex_program().
- compiler/glsl: Remove the dead parts of build_program_resource_list().
- glsl: Remove EmitNoLoops and the associated lower_jumps(lower_break=true) code.
- glsl: Remove unused lower_variable_index_to_cond_assign.
- svga: Set lower_bitops for vpu9 screen.
- glsl: Stop lowering ir_quadop_vector.
- nouveau: disable fences when running under drm-shim.
- Revert "ci: remove nouveau from shader-db runs"
- ci/iris: Cut the glk-deqp test coverage in half.
- glsl: Make all drivers take the GLSLOptimizeConservatively path.
- turnip: Add a TU_DEBUG=perf debug option.
- ci/crocus: Manual CI updates after CI was down for a bit.
- ci/crocus: Merge the piglit runs with the deqp runs.
- i915g/ci: Add depth-clear-precision-check xfails like everyone else.
- ci/crosvm: Simplify the CID setup.
- ci/crosvm: Rename VSOCK_TEMP_DIR -> VM_TEMP_DIR.
- ci/crosvm: Terminate the previous crosvm after a deqp-runner timeout.
- freedreno/ir3: Fix 16-bit bit_count.
- spirv: Use nir_vec_scalars() to simplify matrix transpose.
- spirv_to_nir: Cast RelaxedPrecision ALU op dests to mediump.
- turnip: Make RelaxedPrecision-decorated ALU ops 16-bit.
- freedreno/ir3: Add support for 16-bit nir_texop_lod.
- freedreno/ir3: Lower texture instructions used only for f2f16 to 16-bit.
- mesa/st: Only scalarize for doubles lowering if we're lowering doubles.
- glsl,nir: Move i/umulExtended lowering to NIR.
- panfrost: always print the bad ALU op if we're failing to translate.
- ci/turnip: Add missing xfails for a618 full VK run
- nir_lower_mediump: Drop assertion about not containing movs.
- freedreno/ir3: Enable load/store vectorization for SSBO access, too.
- freedreno/ir3: Make the types of tex coord padding match.
- freedreno/ir3: Fix validation of half-precision image store values.
- freedreno/ir3: Fold 16-bit conversions into image load/store src/dsts.
- i915g: Ignore the new SEPARABLE_PROGRAM property.
- ci/i915: Merge the piglit and deqp runs.
- ci/i915: Update manual piglit job expectations.
- turnip: Don't bother creating tile_load/store_cs for sysmem rendering.
- turnip: Refactor a bit of tu6_emit_tile_select().
- turnip: Move binning decisions from FB usage time to FB creation time.
- turnip: Use fb->binning_possible to decide on conditional tile load/stores.
- ci/turnip: Add a little forced touch-testing of XFB with no binning requested.
- turnip: Refactor a bit of repeated code for subpass setup.
|
||
- turnip: Include 3d-based CmdClearAttachments() in binning visibility.
|
||
- turnip: Refactor a bit of subpass attachment processing.
|
||
- turnip: Allow load/store skipping in vkCmdClearAttachments().
|
||
- turnip: Move tile loads back into the draw CS.
|
||
- nouveau/nv30: Make sure fsat is lowered in the VS.
|
||
- gallium+glsl: Remove EmitNoSat/PIPE_CAP_VERTEX_SHADER_SATURATE
|
||
- glsl: Remove stale lower_instructions comments.
- glsl: Move exp/log-to-exp2/log2 lowering to glsl-to-NIR.
- tgsi_exec: Fix inf/nan handling for divide by zero.
- glsl: Drop the div-to-mul-rcp lowering for floats.
- nir: Fix idiv lowering on !NativeIntegers when lower_fdiv is also set.
- glsl: Drop INT_DIV_TO_MUL_RCP lowering.
- ci/crocus: Drop g41's xfail for rgba_half_float_oes.
- ci/iris: Disable skqp until it can be stabilized.
- turnip: Reverse the order of walking pipes or tiles on odd rows.
- mesa/st: Abort the linking on driver link failure.
- .gitignore: Qualify the path for the ignored build directory.
- nir: Rename is_arb_asm to use_legacy_math_rules and document its meaning.
- gallium: Rename MUL_ZERO_WINS to LEGACY_MATH_RULES.
- nir_to_tgsi: Set LEGACY_MATH_RULES cap for use_legacy_math_rules shaders.
- r600/sfn: Implement 0*x=0 behavior for use_legacy_math_rules.
- nouveau/nir: Implement mul_zero_wins behavior for use_legacy_math_rules.
- tgsi_to_nir: Set use_legacy_math_rules for TGSI_PROPERTY_LEGACY_MATH_RULES.
- gallium: Rename PIPE_CAP_TGSI_LEGACY_MATH_RULES to drop "TGSI"
- Revert "nine: Disable optional use of TTN when MUL_ZERO_WINS is available."
- iris: Enable PIPE_CAP_LEGACY_MATH_RULES.
- crocus: Enable PIPE_CAP_LEGACY_MATH_RULES.
- ci/dzn: Skip a test that intermittently times out at a minute.
- ci/iris: Disable blender-demo-cube_diorama on APL.
- ci: Terminate capture-devcoredump before tarring up artifacts.
- turnip: Document some fields about resolves.
- ci/bare-metal: Apply autopep8 to our python scripts.
- ci/bare-metal: Add per-boot-stage timeouts for fastboot and poe.
- ci/bare-metal: Get rid of servo's serial feed threads.
- ci/bare-metal: Add handling of netboot firmwares for servo boards.
- ci/bare-metal: Add test phase timeouts to all boards.
- ci: Remove the unused .test-manual
- ci/vc4: Merge quick_shader in with deqp-gles
- ci/vc4: Turn on deqp-egl testing by default.
- vc4: Propagate txf_ms's dest_type to the lowered txf.
- freedreno/ir3: Disable image/ssbo 16-bit conversion folding pre-a6xx.
- freedreno/a5xx: Set the buffer bit appropriately in XS_CTRL_REG0.
- ci/freedreno: Turn a530 back on by default and update expectations.
- ci/bare-metal: Remove "stage: test" from .baremetal-test.
- ci/bare-metal: Consolidate needs declarations in .baremetal-test-\*.
- ci/bare-metal: Collapse artifacts wget by default.
- ci/traces: GC unused code for DXVK trace replay.
- ci/traces: Drop ZINK_USE_LAVAPIPE ICD override.
- ci/freedreno: Add vulkan+gl integration testing in piglit.
- ci/turnip: Test traces on turnip using zink.
- ci/turnip: Trim the a630 VK run a bit.
- ci: Use "!references" to manage scheduled pipeline rules.
- ci: Pull out farm disables to a reused rule.
- ci: Pull common zink frontend rules to a shared rule list.
- ci/freedreno: Use !references to clean up restricted traces rules.
- ci: Simplify vulkan rules using !references.
- ci: Split core GL from core VK-or-GL rules.
- ci/freedreno: Filter when we run test jobs for VK or GL changes.
- ci/zink: Reuse anv-rules.
- ci/zink: Simplify lavapipe rules setup, and clarify what the rules are for.
- ci: Remove the trailing "when: never"s from rules.
- ir3: Retire the cp postsched pass now that we do RA in SSA.
- ci: Uprev deqp to 1.3.3.0.
- ci/llvmpipe: Skip ssbo.layout.random.all_shared_buffer.36
- ci: Make the retry policy default for all jobs.
- ci: Rename .ci-run-policy rules to .build-rules.
- ci: Drop .build-rules from container jobs.
- ci: Remove .build-rules from core test job definitions.
- ci: Fix non-freedreno performance jobs running during Marge merges.
- ci: Remove GIT_STRATEGY from .test-manual-mr.
- mesa/st: Disable OES_texture_3D for PIPE_CAP_MAX_TEXTURE_3D_LEVELS==0.
- vc4: Disable OES_texture_3D being exposed.
- vc4: Move previous existing 3D xfails up to the group of 3d xfails.
- vc4: Add notes on the remaining dEQP failures.
- turnip: Account for additional_cs_reserve_size for both bin and render.
- turnip: Fix up per-stage additional size accounting.
- turnip: Fix the reservation for vertex inputs.
- turnip: Refactor vertex input setup a little.
- ir3: Make sure to pass the interp_mode through in our load_bary lowering.
- nir+ir3: Rename load_size_ir3 to load_center_rhw_ir3.
- freedreno: Rename the "SIZE" regs for interpolateAtOffset to "CENTERRHW"
- ir3: Clarify what's happening in the interpolateAtOffset() math.
- ir3: Use non-persp interpolation when appropriate for interpolateAtOffset.
- zink: Do the timestamp-to-ns math in a double to have better precision.
- mesa: Fix the error check for VertexAttrib*.
- vc4: Work around a HW bug with 2-vert line loops.
- vc4: Add links to test bug reports.
- kopper: Fix the return value of kopperSetSwapInterval().
- kopper: Use the swap interval that was set at swapchain creation time.
- glx: Unify dri2/dri3 vblank_mode logic.
- kopper: Respect the vblank_mode env var.
- egl: Just include driconf.h instead of redefining its values.
- loader: Just include driconf.h instead of redefining its values.
- ci/freedreno: Mark an occasional flake pass that happens on a530.
- ir3: Fix the no-emitted-vertex condition emission in geom lowering.
- mesa/arbprog: Move the GLSLFragCoordIsSysVal handling to prog_to_nir.
- mesa/arbprog: Use nir_lower_io_to_temporaries.
- mesa/arbprog: Stop doing optimization in the ARB program IR.
- ci/crocus: Update portal 2 trace shas for the recent fix.
- ci/crocus: Disable the blender trace.
- ci/crocus: Drop xfails for the recent image external fix.
- freedreno/ir3: Enable core NIR's 16-bit ALU optimizations.
- intel/fs: Simplify brw_barycentric_mode() args.
- ci/bare-metal: Add timeouts to the shell commands called in fastboot.
- ci/freedreno: Switch a630 to manual/disabled for lab maintenance.
- glsl: Use the same NIR path for shared mem lowering as SPIRV does.
- glsl: Remove optimize_swizzles.
- ci/turnip: Add a bit of spilling-vs-ballot testing on a618.
- ci/freedreno: Add some more known flakes for a630 from our IRC logs.
- ci/turnip: Bump up the a630 full run timeout.
- ci/turnip: Add a couple of missing a630 fails.
- Revert "ci/freedreno: Switch a630 to manual/disabled for lab maintenance."
- nir: Split usub_sat lowering flag from uadd_sat.
- ci/crocus: Fix the commented YML for blender-demo-cube_diorama.
- gallivm: Refactor a bit of UBO/SSBO range checking.
- gallivm: Optimize single-invocation SSBO stores.
- ci/virgl: Make the VM-based virgl tests report to #virgl-ci, too.
- nir: Add an opt pass for phis after if choosing between true/false.
- glsl: Remove opt_conditional_discard().
- nir: Use nir_foreach_phi_src consistently.
- ci/freedreno: Update a630 s8 resolve xfails.
- turnip: Split the tiling config into separate layouts based on CCU usage.
- turnip: Use the GMEM CCU space for attachments when the stores won't.
- zink: Make sure that we keep the existing ici pNext chain on inserts.
- spirv: Mark phis as mediump instead of directly lowering them to 16 bit.

Enrico Galli (6):

- microsoft/spirv_to_dxil: Add pass to lower dynamic accesses on ubo[1]
- microsoft/compiler: Fix for arrays of 1 element
- microsoft/compiler: Add support for unnamed ubos
- microsoft/compiler: Fix when using a shadow sampler more than once
- microsoft/spirv_to_dxil: Fix discard semantics
- microsoft/spirv_to_dxil: Fix discard to demote + return pass

Eric Engestrom (36):

- meson: replace deprecated dep.get_pkgconfig_variable(...) with dep.get_variable(pkgconfig : ...)
- docs: show and link to the new macOS CI
- broadcom/rpi4-skips: drop duplicated lines
- mailmap: update my email to Igalia
- vulkan: add script to detect & replace identifiers that have been renamed
- vulkan/runtime: use updated tokens from vk.xml
- vulkan: use updated tokens from vk.xml
- nir: use updated tokens from vk.xml
- radv: use updated tokens from vk.xml
- v3dv: use updated tokens from vk.xml
- turnip: use updated tokens from vk.xml
- zink: use updated tokens from vk.xml
- lavapipe: use updated tokens from vk.xml
- pvr: use updated tokens from vk.xml
- anv: use updated tokens from vk.xml
- dzn: use updated tokens from vk.xml
- panvk: use updated tokens from vk.xml
- venus: use updated tokens from vk.xml
- zink: fix portability_subset usage after rename from EXTX to KHR
- zink: drop unused VkPhysicalDevicePortabilitySubsetPropertiesKHR
- zink: add missing guards around \`have_{ext}`
- docs/isl: fix typos & formatting
- vk/util: handful of pythonic cleanups
- vk/util: simplify extensions gen code
- util/list: rename LIST_ENTRY() to list_entry()
- nouveau: use existing list_first_entry() macro instead of re-writing it
- docs/features.txt: mark VK_ANDROID_native_buffer as supported on turnip
- docs/features.txt: mark VK_ANDROID_native_buffer as supported on v3dv
- docs/ci: fix Emma's name
- turnip: expose support for VK_EXT_acquire_drm_display
- ci: add \`--print-errorlogs` to \`meson test`
- bin/gen_release_notes.py: bump advertised vulkan version to 1.3
- vk/device-select-layer: fix .sType of VkPhysicalDeviceGroupProperties
- wsi/x11: fix memleak in wsi_x11_connection_create()
- meson: replace manual compiler flags with meson arguments
- broadcom: fix dependencies in static_library() calls

Erico Nunes (11):

- ci: put lima farm back online
- docs/ci: fix literalinclude of caching setup files
- lima/ppir: optimize branch comparison when possible
- lima/ppir: merge branch nodes in simple cases
- egl: Fix DETECT_OS macro usage
- ci: update linux kernel to v5.17
- ci: update docs for linux kernel uprevs
- lima/ci: update piglit ci expectations
- lima/ci: reenable lima-mali450-piglit-gpu:arm64
- lima/ci: Don't skip piglit opengl 1 tests
- ci: Revert "CI: Lima farm is offline"

Erik Faye-Lund (157):

- gallium: rename continue shader-cap
- gallium: rename dround shader-cap
- gallium: rename dfracexp/dldexp shader-cap
- gallium: rename ldexp shader-cap
- vulkan: explicitly cast object-type enum
- meson: turn on -Wno-unused-function project-wide
- nir: introduce and use nir_component_mask
- dzn: fixup indent
- dzn: remove all usage of ComPtr<T>
- dzn: always use ID3D12Device1
- dzn: always use ID3D12GraphicsCommandList1
- dzn: use ID3D10Blob instead of ID3DBlob
- dzn: pass IDXGIAdapter1 to d3d12_create_device
- dzn: port to d3d12 c-api
- dzn: add a bunch of missing struct-keywords
- dzn: drop auto usage
- dzn: drop using references
- dzn: no more reinterpret_cast
- dzn: use define instead of constexpr
- dzn: use c-style for-statement
- dzn: use c-style initialization
- dzn: c-style casts
- dzn: do not set unused default member initializer
- dzn: port code to plain c
- dzn: merge util sources
- dzn: remove needless defines
- dzn: remove some needless casts
- dzn: clean up pointer syntax
- dzn: add space before += operator
- dzn: drop needless enum-casts
- dzn: fixup indent
- dzn: remove stale cpp_args
- nir/lower_int64: do not try to clamp floats to int-range
- vulkan: do not depend on alignof(void)
- lavapipe: quiet non-conformant warning on ci
- panvk: quiet non-conformant warning on ci
- vulkan: use c_msvc_compat_args for shared code
- vulkan: drop empty vulkan_wsi_args
- microsoft/clc: fixup indentation
- microsoft/compiler: fixup indentation
- meson: remove unused defines
- meson: deprecate specifying osmesa-bits
- gallium/xlib: fix stale comment
- mesa: add missing error-path
- util: fix test on msvc
- lavapipe: tolerate NULL UBO and SSBO descriptors
- lavapipe: tolerate NULL image view descriptors
- lavapipe: tolerate NULL image-buffer view descriptors
- lavapipe: tolerate NULL sampler-buffer view descriptors
- lavapipe: add support for null-descriptors from EXT_robustness2
- zink: pass screen to extension getter macro
- zink: initialize dispatch-table before queue-init
- zink: call vk-functions through dispatch-table
- zink: pass screen to zink_create_instance
- zink: dynamically load a few functions
- zink: use run-time linking to loader
- dzn: reset correct list
- vulkan/wsi: unbreak win32-support
- d3d12: add missing cmath includes
- tests/graw: rename shaders from .sh to .txt
- dzn: add a debug-flag to wait for the debugger
- dzn: add debug option to redirect stdout/stderr
- editorconfig: remove scons-ism
- editorconfig: remove html-rule
- editorconfig: remove m4-rule
- editorconfig: remove pl-rule
- radeonsi: port amdgcn_glslc build to meson
- freedreno: remove stale makefile
- intel: remove stale makefile
- meson: modernize win_flex stdint.h logic
- c99_compat.h: remove inline and __func__ shims
- util: add missing c99_compat.h includes
- util: remove needless c99_compat.h includes
- egl: remove needless c99_compat.h includes
- gallium/util: remove needless c99_compat.h includes
- mapi: remove needless c99_compat.h includes
- gallium: remove needless c99_compat.h includes
- include: remove needless c99_compat.h includes
- intel/isl: remove needless c99_compat.h includes
- vulkan/device_select: remove needless c99_compat.h includes
- include: drop c99_math.h
- include: drop c11_compat.h
- pvr: do not use c_msvc_compat_args
- turnip: do not do STATIC_ASSERT on a variable
- freedreno: do not use variable in STATIC_ASSERT
- freedreno: do not do STATIC_ASSERT on variables
- iris: do not do STATIC_ASSERT on variables
- freedreno: degrade STATIC_ASSERT to assert
- v3dv: do not do STATIC_ASSERT on variables
- util: add IS_POT macro
- intel/compiler: use macro for power-of-two check
- broadcom/compiler: use macro for power-of-two check
- util: use static_assert directly
- util: implement STATIC_ASSERT using c++11 / c11 primitives
- meson: add back -Werror=vla to msvc compat flags
- ci/windows: add back build-error detection
- d3d12: move cubemap-lowering to common-code
- microsoft/compiler: mark image-functions as such
- microsoft/compiler: make sampler-lowering optional
- microsoft/spirv_to_dxil: lower cube-images to 2d arrays
- dzn: enable binding_model tests
- dzn: correct assert-condition
- dzn: handle stencil-attachment-optimal layout
- dzn: use a null-rtv to handle no-attachment
- dzn: enable two more exts
- util: sort includes
- llvmpipe: assume c99
- c99_compat.h: assume c99 support
- util: use c11 alignof instead of our own
- glcpp: remove outdated msvc hack
- vulkan/util: assume c11
- util: use c11 alignas instead of rolling our own
- gallium: remove stale comment
- gallium: reorganize includes
- llvmpipe: use c11 alignas instead of PIPE_ALIGN_VAR
- gallium: use c11 alignas instead of PIPE_ALIGN_VAR
- gallium: use c++11 alignas instead of PIPE_ALIGN_VAR
- gallium: remove repeated check
- gallium: remove unused macros
- gallium: clean up PIPE_ALIGN_STACK-definition
- docs/zink: correct requirement for tbos
- dzn: correct order of src_subres and dst_subres args
- docs: set language to english
- dzn: fill in driver name and info
- dzn: fill in float-control details
- dzn: fill in non-uniform-indexing props
- dzn: fill in bindless props
- dzn: fill in depth/stencil resolve props
- dzn: fill in minmax props
- dzn: fill misc props
- dzn: expose VK_KHR_driver_properties
- zink: remove pointless test
- zink: remove pointless assert
- zink: drop pointless comment
- llvmpipe: merge all rasterizer-bind functions
- llvmpipe: consolidate legacy_points and multisample state
- llvmpipe: clean up bresenham implementation
- docs/zink: correct name of extension
- docs/zink: document required VK_KHR_swapchain_mutable_format
- docs/zink: document required VK_EXT_border_color_swizzle
- zink: warn about missing image2DViewOf3D feature
- zink: fix EXT_color_write_enable check
- gallium/hud: do not use texture-rect for font
- llvmpipe: remove unused function
- zink: clean up extension emitting
- zink: set dynamic indexing caps
- zink: more accurately set {Sampled,Image}1D caps
- zink: more accurately set {Sampled,Image}Buffer caps
- zink: more accurately set ImageQuery cap
- docs: fixup link to virgl docs
- zink: add have_D32_SFLOAT_S8_UINT boolean
- zink: do not use VK_FORMAT_D32_SFLOAT_S8_UINT without checking
- zink: type_main -> type_void_func
- zink: add spirv_builder_function_call
- zink: wrap discard in a function
- zink: clamp miplodbias when creating sampler
- docs/zink: document rgtc requirement

Fabrice Fontaine (1):

- src/util/futex.h: fix build on 32-bit architectures using 64-bit time_t

Feng Jiang (1):

- virgl: do not share virgl_screen between different drm_files

Filip Gawin (5):

- r300: Print warning when stubbing derivatives
- r300: keep negation if w is an inline constant
- r300: don't check for unitialized reads when rewriting register
- r300: prefer old not native swizzle in constant folding
- r300: don't read from output transform_r300_vertex_SEQ/SNE

Francisco Jerez (2):

- intel/dev: Compute pixel pipe information based on geometry topology DRM query.
- intel/fs/xehp+: Emit scheduling fence for all NIR barriers on platforms with LSC.

Frank Binns (3):

- pvr: replace p_compiler.h boolean with stdbool
- pvr: shorten error to err in label names
- pvr: Add TI AM62 as a supported device.

Georg Lehmann (74):

- aco: Remove some old comments in aco_opcodes.py.
- aco: Reuse previous -1 result in find_msb to avoid using VOP3.
- nir/lower_mediump: Add a new pass to fold 16bit image load/store.
- aco: Implement image_store d16.
- aco: Implement image_load d16.
- radv: Use nir_fold_16bit_image_load_store_conversions.
- radv: Run copy_prop and dce after folding 16bit sampling/load/store.
- aco: Remove D16 zero components from image stores.
- meson: Tell glslang to be quiet.
- radeonsi: Use nir_fold_16bit_image_load_store_conversions.
- wsi/x11: Don't leak xcb_get_geometry_reply_t.
- nir/opt_algebraic: Fix mask in shift by constant combining.
- nir/opt_algebraic: Simple xor/ishr optimizations.
- radv: Don't retile read-only images.
- nir/opt_algebraic: Optimize Doom Eternal's word extract by LSB.
- aco: Implement uclz.
- aco: Implement usub_sat.
- aco: Implement isub_sat.
- ac/llvm: Implement usub_sat and isub_sat.
- ac/llvm: Implement uclz.
- nir: Add optional lowering for mul_32x16.
- radv: Lower mul_32x16.
- radv: Lower hadd/hradd.
- radv: Lower 8bit isub_sat/usub_sat.
- radv: Enable VK_INTEL_shader_integer_functions2.
- radv, aco: Packed usub_sat/isub_sat.
- zink: Use VK_USE_64_BIT_PTR_DEFINES to check for 64bit platforms.
- radv, aco: Lower txf offset in NIR.
- nir/lower_tex: Add lower_array_layer_round_even option.
- radv, aco: Round texture array layer in NIR.
- vulkan: Spec update to 1.3.217.
- radv: Implement VK_EXT_non_seamless_cube_map.
- aco: Print r128/a16 MIMG bits separately.
- aco: Remove r128_a16 MIMG builder option.
- anv: Implement VK_EXT_non_seamless_cube_map.
- radv: Remove redundant memset in radv_descriptor_set_create.
- radv: Check descriptor pool entry count before allocating a new set.
- radv: Don't check if we need to copy immutable samplers for non push templates.
- aco/ra: Static assert that changing instruction type to VOP2 is valid.
- aco/ra: Move mac encoding optimization to its own function.
- aco/ir: Pad SOP2 and SOPC to the same size as SOPK.
- aco/ra: Optimize some SOP2 instructions with literal to SOPK.
- aco/optimizer: Convert s_add_u32 with literals to s_add_i32 if carry is not used.
- aco/optimizer: Optimize SOPC with literal to SOPK.
- r600/sfn: Add missing std::array include.
- aco: Fix swapping sources in SOPC -> SOPK optimization.
- aco/assembler: Fix s_bitreplicate_b64_b32 on GFX9.
- nir_to_tgsi: Lower uadd_sat/usub_sat.
- zink: Lower uadd_sat/usub_sat.
- r600: Lower uadd_sat/usub_sat.
- ir3: Implement [iu]sub_sat.
- nir/lower_io: Fix array length of buffers larger than INT32_MAX.
- nir/opt_algebraic: Optimize [ui](add|sub)_sat with 0.
- radv,aco: Don't lower and vectorize 16bit iabs.
- radv: Remove ineg from lower_bit_size_callback callback.
- radv, aco: Don't lower 16bit isign.
- ir3: Only run 16bit tex NIR passes on a5xx+.
- ir3: Lower alu to scalar if nir_legalize_16bit_sampler_srcs made progress.
- nir: Rewrite and merge 16bit tex folding pass with 16bit image folding pass.
- nir/lower_mediump: Add an option to only fold if all tex sources can be folded.
- ir3: Stop using nir_legalize_16bit_sampler_srcs.
- aco: Add G16 opcodes.
- aco: Support 16bit sources for texture ops.
- radv: Fold 16bit tex sources.
- aco: Add a note about G16 without A16 to README-ISA.md.
- aco/ir: Generalize (un)ordered_swapped.
- aco/ir: Add integer get_cmp_info.
- aco/ir: Fix swapped nle.
- aco/ir: Add vcmpx opcode to get_cmp_info.
- aco/ir: Add v_cmp_class to get_cmp_info.
- aco/ir: Add swapped opcode for v_cmp_u/v_cmp_o.
- radeonsi: Stop using nir_legalize_16bit_sampler_srcs.
- nir/fold_16bit_tex_image: Default to only_fold_all.
- aco: Force tex operand to have the correct sub dword size before packing.

Gert Wollny (85):

- virgl: Extend integer write out output fix to all non-move integers ops
- r600/sfn: Fix store_shared_r600 write masks
- ntt: Add option to not optimized register allocation
- nir: Add pass to split 64 bit vec3 and vec4 variable access and phis
- ntt: Split 64 bit vec3 and vec4 local variables and phis
- virgl: Clear all indirect flags when writing to a temporary
- nir: Don't optimize to 64 bit fsub if the driver doesn't support it
- ntt: remove dead input variables before lowering FS IO
- r600/sb: Don't optimize float GT and GE
- r600: tune nir options
- r600: Add support for TGSI_OPCODE_ATOMIMIN and IMAX
- virgl: don't move input gl_SampleMaskIn to a temp
- r600: Allow eight bit, three channel formats for vertex buffers
- nir: honor lower_double options for ffloor and ffract
- r600/sb: Don't create three source ops with all kcache values
- r600/sb: Fall back to un-optimized shader if scheduling fails
- virgl: Don't let ntt optimize the register allocation
- tgsi: Don't bother setting the TGSI_PROPERTY_SEPARABLE_PROGRAM
- virgl: Submit the TGSI_PROPERTY_SEPARABLE_PROGRAM when the host understands it
- r600: Add number of ALU groups to statistics
- r600: Make sure that LDS instructions only use bank swizzle 012
- r600: Update nir options
- r600/sfn: rewrite NIR backend
- r600: enable sb by default also for NIR
- r600: Cleanup nir options
- r600/sfn: support nir_op_mulz and legazy math rules
- r600/sfn: Add missing copyright information
- r600/sfn: Don't use assert to test success of operation
- r600: limit loops when trying to merge alu groups
- r600/sfn: Never consider an op with register dest as dead
- r600/sfn: Add support for fdph
- r600/sfn: lower CLIPVERTEX to clip planes
- r600/sfn: emulate pmr::monotonic_buffer_resource if needed
- r600/sfn: clean up some commented out code
- r600/sfn: fix storing shader output array locations
- r600/sfn: lower undef to zero
- r600/sfn: count LDS queue pop reads separately in assembler
- r600/sfn: Fix the kcache failure handling
- r600/sfn: Fix emmision of LDS instructions
- r600/sfn: Fix used of local shared write mask
- r600/sfn: Fix streamout with non-zero component write
- r600: don't dump shader info to files on debug
- r600/sfn: make sure the memory pool is released after shader translation
- r600/sfn: lower the right shader variant
- r600: Delete possible old NIR variant when translating from TGSI
- r600: don't clone the nir shader, just use it
- r600ß/sfn: Don't fall through with VS outputs
- r600/dri-tweak: Disable ARB_shader_image_load_store for "Tomb Raider 2013"
- r600/sb: fall back to unoptimized shader if RA fails
- r600: Don't set TES_EVAL UCP dirty when it is not supported
- virgL: lower fneg and fabs
- virgl: honor host caps in shader cash sha evaluation
- r600/sfn: Factor out fragement shader class for EG
- r600/sfn: Support pre-EG style FS input
- r600/sfn: Prepare StreamOut instruction for pre EG opcodes
- r600/sfn: Handle slot differences between pre-EG and EG
- r600/sfn: Handle chipclass with kcache allocation
- r600/sfn: Handle buffer size query on pre-EG
- r600/sfn: Handle pre-EG GS input fetch
- r600/sfn: Handle primitive input handling for pre-EG
- r600/sfn: Handle non-compressed MS texel fetch
- r600/sfn: Set trans flag for flt_to_int on pre-EG
- r600/sfn: Handle pre-EG buffer fetch
- r600/sfn: Disable FP64 for pre-CEDAR
- r600/sfn: Advertise support for fmulz
- r600/sfn: Enable NIR for pre RG hardware
- r600/sfn: remove leftover debug output
- r600/sfn: Handle R600 class sin/cos
- r600: increase possible stack size in binary code
- r600/sfn: Schedule shift instruction on R600 in t-slot
- r600/sfn: Add GS thread fix just like the TGSI code path
- r600/sfn: Sort FS inputs to make interpolated values come first
- r600/sfn: Fix color outputs when color0 writes all
- r600: Force NOPs when loading AR on R600 class hardware
- r600/sfn: Handle R600 scratch read
- r600: Don't use SB with R600 style scratch reads
- r600/sfn: Handle color0 writes all on R700 like on EG
- r600/sfn: Don't tag mem-ring and stream instructions as exports
- r600/sfn: Don't schedule GDS instructions early
- r600/sfn: Don't scan the whole block for ready instructions
- r600/sfn: Use a heuristic to keep SSBO setup and store close
- r600: Fix reporting TGSI IR support
- r600/sfn: Use a low number for unused target register
- virgl: when reading back wait first, then do the transfer
- r600/sfn: override register ID when it doesn't matter

Glenn Kennard (1):

- nv30: Fix non-scissored clears after a scissor has been set

Greg Depoire--Ferrer (5):

- util/queue: add missing space to comment in util_queue_destroy
- zink: check for error when initializing util_queue
- lavapipe: check for error when initializing lvp_queue
- util/queue: handle thread cration failure in util_queue_adjust_num_threads
- zink: fix crash on dlclose when zink_internal_create_screen failed

Guilherme Gallo (48):

- ci/lava: Encapsulate job data in a class
- ci/lava: Filter log lines from LAVA return
- ci/lava: Return test-suite result as exit code
- ci/lava: Fix LAVA job validation
- ci/lava: Set lava-signal to kmsg
- ci/lava: Always validate the lava job
- ci/lava: Use lava-test-case to run custom scripts in LAVA
- ci/lava: Trap init-stage2.sh background processes
- ci/lava: Fix shebang in deqp-runner.sh
- ci/lava: Parse all test cases from 0_mesa suite
- ci/lava: Let LAVA job submitter run without JWT file
- ci/lava: Reduce LAVA boot phase timeout to 3 minutes
- ci/lava: Cancel the job if the script is interrupted
- ci/lava: Improve exception handling
- ci/lava: Retry when data fetching log RPC call is corrupted
- ci: Fix init-stage2 exit code
- ci: Update trace after CI fix
- ci: Fix tests expectations
- ci/lava: Make job submitter parse the job result
- ci/lava: Fix colored LAVA outputs
- ci/lava: Fix Gitlab Section markers
- ci/lava: Add support for more complex color codes
- ci/lava: Filter out undesired messages
- ci/lava: Improve result parsing regex
- ci/lava: Create Gitlab log sections handler
- ci/lava: Create LogFollower and move logging methods
- ci/lava: Follow job execution via LogFollower
- ci/lava: heartbeat: don't consider kernel message logs
- ci/lava: Wrap job info into a collapsed section
- ci/lava: Wrap job definition dump into a collapsed section
- ci/lava: Add an integration test for LAVA jobs
- ci/lava: Don't print LAVA debug messages
- ci/lava: Highlight kernel messages in bold
- ci/lava: Highlight job retrying message
- ci/lava: Stop printing after the result line
- ci/lava: Flexibilize section marker regexes
- ci/lava: Rename console color names
- ci/lava: Update license header
- ci/lava: Split lava_log into modules
- ci/lava: Detect R8152 issues preemptively and retry
- ci/lava: Make hung job status yellow
- ci/lava: Color red for fatal and yellow for warning
- ci/lava: Add \`slow` pytest marker
- ci/lava: Add canceled job status
- ci/lava: Add delay before calling lava-test-case
- ci/lava: Increase boot timeout
- ci/lava: Only parse result within testcase section
- ci/lava: Remove MINIO_RESULTS_UPLOAD if requested

Hans-Kristian Arntzen (4):

- radv: Fix RTPSO hashing of pGroups.
- vulkan: Update Vulkan XML and headers to 1.3.219.
- vulkan: Add common code for VK_EXT_shader_module_identifier.
- radv: Implement VK_EXT_shader_module_identifier.

Hoe Hao Cheng (4):

- zink/codegen: remember the fields in feats/props structs of extensions
- zink/codegen: do not include compilation structs with extension structs
- zink/codegen: do not automatically consider extensions promoted
- zink/codegen: use the updated identifiers

Hyunjun Ko (15):

- radv: fix assertion for the count of xfb output
- turnip: add an assertion for max descriptor set count.
- ir3: handle intrinsic_load_draw_id when scanning driver constants
- turnip: clamp to 1 for negative bottom-right of viewport.
- turnip: fix wrong size for lrz cs when lrz_track_quirk is enabled.
- freedreno,ir3: rename Z_CLAMP_ENABLE to Z_CLIP_DISABLE
- turnip: keep the depth_clip_disable state at the pipeline builder.
- zink: remove the workaround for depth_clip_enable on turnip
- turnip: Remove an unnecessary assert.
- turnip: expose VK_EXT_shader_module_identifier
- ir3: change maximum size of const files.
- ir3: handle shared consts.
- turnip: clean up unused parameters for user consts.
- turnip: Change to use shared consts for PushConstants
- tu: increase maxPushConstantsSize to 256.

Iago Toral Quiroga (79):

- v3dv: ignore no-op barriers
- v3dv: drop obsolete comment
- v3dv: don't emit final noop job if we don't have anything to signal
- v3dv: document Vulkan requirements for signaling operations
- v3dv: ignore barriers for image layout transitions from undefined layouts
- v3dv: drop default sampler states if not used
- v3dv: flag BCL barriers for all-graphics and all-commands stages
- v3dv: skip binning sync if binning shaders don't access external resources
- v3dv: use wait stage for wait semaphores to decide about binning syncs
- v3dv: check input attachment usage as sampled usage
- v3dv: implement VK_EXT_separate_stencil_usage
- v3dv: trivially implement VK_KHR_separate_depth_stencil_layouts
- docs/features: flag VK_KHR_timeline_semaphore as implemented
- v3dv: expose separateDepthStencilLayouts
- v3dv: implement vkGetPipelineExecutableInternalRepresentationsKHR
- v3dv: implement vkGetPipelineExecutablePropertiesKHR
- v3d/compiler: add more stats to prog_data
- v3dv: implement vkGetPipelineExecutableStatisticsKHR
- v3dv: expose VK_KHR_pipeline_executable_properties
- v3dv: don't leak variant QPU when pipeline compile fails
- v3dv: drop unsused struct
- v3dv: expose VK_KHR_shader_float_controls
- v3dv: expose VK_KHR_spirv_1_4
- v3dv: expose VK_KHR_vulkan_memory_model
- v3dv: fix format usage checks when extended usage flag is set
- broadcom/compiler: don't promote early fragment tests when writing sample mask
- v3dv: consume barriers at the right stages
- v3dv: only clear BCL barrier state if we don't have pending graphics barriers
- v3dv: use an explicit struct type to track barrier state
- v3dv: track sources of barriers
- v3dv: limit sync for barriers to hw queues selected by source mask
- v3dv: don't be too aggressive disabling early Z
- v3dv: use the global RCL EZ disable if we don't have any EZ draws in the job
- broadcom/compiler: disable flags optimization for loop conditions
- v3dv: merge pending secondary barrier state into primary command buffers
- v3dv: handle barriers at the end of a command buffer
- v3dv: fix leak
- broadcom/compiler: fix postponed TMU spills with multiple writes
- broadcom/compiler: don't predicate postponed spills
- v3dv: fix pool descriptor count for inline uniform buffers
- v3dv: fix merge jobs
- nir: fix documentation for uadd_carry and usub_borry opcodes
- v3dv: don't lower uadd_carry and usub_borrow
- v3dv: remove obsolete comment
- nir/serialize: fix missing divergence info after deserialization
- broadcom/compiler: don't try to hide TMU latency at QPU scheduling
- broadcom/compiler: don't over-estimate latency of TMU instructions
- v3dv: fix comment in texel buffer shader copy path
- v3dv: remove unused lowering for nir_intrinsic_load_layer_id
- v3dv: fix comment for point_sprite_mask filed in shader key
- v3dv: clean up get_internal_type_bpp_for_image_aspects
- v3d,v3dv: stop hardcoding various image limits
- v3d: stop using a smaller texture limit in OpenGL
- v3d,v3dv: stop copying and pasting the translate_swizzle helper
- v3dv: only save/restore push constant data for meta operations if needed
- v3dv: move push constant data to the command buffer state
- v3dv: limit upload of indirect push constant data
- v3dv: don't allocate MAX_PUSH_CONSTANTS_SIZE bytes for the push constants UBO
- v3dv: stop tracking push constant buffer references
- v3dv: allocate more push constant buffers if needed
- v3dv: add a dirty state for pending push constants UBO updates
- v3dv: explain why we clear certain state after a draw call
- v3dv: remove duplicate condition
- nir: add nir_address_format_2x32bit_global
- broadcom/compiler: implement 2x32 global intrinsics
- broadcom/compiler: track if a shader uses global intrinsics
- v3dv: implement VK_KHR_buffer_device_address
- v3dv: handle VK_STRUCTURE_TYPE_DRM_FORMAT_MODIFIER_PROPERTIES_LIST_2_EXT
- v3dv: remove assert that poolSizeCount must be larger than 0
- v3dv: fix offset reported by vkGetImageSubresourceLayout
- v3dv: remove unnecesary FIXME comment
- v3d,v3dv: add a common v3d_hw_prim_type helper
- v3dv: only check binning sync for semaphores for the first CL job
- v3dv: explicitly set sampler min/max filter properties to false
- v3dv: fill in Vulkan 1.2 missing device properties
- v3dv: enable missing Vulkan 1.2 features for supported extensions
- v3dv: enable missing mandatory Vulkan 1.2 features
- broadcom/ci: update list of slow tests
- v3dv: expose Vulkan 1.2

Ian Romanick (26):

- glsl: Fix mixed tabs and spaces in lower_mat_op_to_vec.cpp
- glsl: Add flag to disable part of do_vec_index_to_cond_assign
- nir: Use nir_vector_extract to generate code for ir_binop_vector_extract
- glsl: Delete lower_extracts code
- anv: Remove FS executables when applying the null FS optimization
- intel/fs: Add missing synchronization for WaW dependency
- nir: i32csel opcodes should compare with integer zero
- nir/algebraic: Fix NaN-unsafe fcsel patterns
- nir: Add and use algebraic property "is selection"
- nir/range_analysis: Teach range analysis about fdot opcodes
- intel/compiler: Rename vec4 state URB opcodes to have VEC4\_ prefix
- intel/eu: Validate some aspects of URB messages
- intel/compiler: Move logical-send lowering to a separate file
- intel/fs: Add _LOGICAL versions of URB messages
- intel/fs: Lower URB messages to SEND
- intel/fs: Remove non-_LOGICAL URB messages
- intel/vec4: Set lower_usub_sat
- spirv: Fix array length of buffers larger than INT32_MAX.
- intel/fs: Make logical URB write instructions more like other logical instructions
- intel/fs: Make logical URB read instructions more like other logical instructions
- intel/fs: Don't pass flags to lower_urb_read_logical_send or lower_urb_write_logical_send
- intel/fs: Eliminate "masked" and "per slot offset" URB messages
- intel/fs: Use canonical form for "work around" tags
- nir: spirv: Allow 32-bit version of nir_intrinsic_is_sparse_texels_resident
- radeonsi: r600: d3d12: st: Use NIR lowering for tg4 offset arrays instead of GLSL lowering
- glsl: Remove lower_offset_arrays pass

Icecream95 (45):

- clc: Use stringstream for printing spirv errors
- panfrost: Stop overallocating compressed textures
- panfrost: Enable NIR lowering of half float packing
- panfrost: Fix pack_32_2x16 implementation
- pan/bi: Use texture index instead of sampler for message preloading
- pan/va: Improvements to LEA_TEX instructions
- pan/va: Add more transcendental operations
- pan/va: Add SEG_ADD/SEG_SUB operation
- pan/va: Add absneg modifier to V2F32_TO_V2F16
- pan/va: Improve texture instructions
- nir/lower_tex: Copy more fields in lower_tex_to_txd and friends
- pan/mdg: Keep min_bound at 16 when alignment requires it
- pan/mdg: Use MAX2 to set min_alignment
- pan/mdg: Fix mask usage when filling before a spill
- pan/mdg: Fix disassembly of store instructions
- pan/mdg: Return the instruction from mir_insert_instruction_*_scheduled
- pan/mdg: Fix multiple spilt writes in the same bundle
- panfrost: Copy blend constant into variant even when reusing it
- panfrost: Increase the limit for blend shader variants
- panfrost: Move patched_s out of the pan_blitter_views struct
- nir: Add store_combined_output_pan BASE back
- pan/bi: Read base for combined stores
- pan/mdg: Read base for combined stores
- panfrost: Only write depth / stencil once if MRT is used
- pan/bi: Reverse linear constraint bits
- pan/bi: Add nodearray datastructure
- pan/bi: Use nodearrays for linear constraints
- panfrost: Constant stencil value tracking
- panfrost: Stop using sparse_array for batch BOs
- panfrost: Use a macro for checking for a shared bind type
- panfrost: Remove sync arguments from panfrost_batch_submit
- panfrost: Enable QUAD_STRIP and POLYGON on v6
- pan/va: Use the _safe iterator when adding blend shader calls
- pan/decode: fflush buffers after dumping and before aborts
- pan/decode: Use tag bits for resource entry count
- pan/decode: Change indent when decoding resources
- panfrost: Only emit images when they are present
- panfrost: Emit the correct number of attributes
- panfrost: Clear the GENERAL varying buffer field if unused
- panfrost: Correctly calculate prefetch suppresion varying index
- panfrost: Add a debug option for checking overflows on pool uploads
- panfrost: Don't unbind recently bound streamout targets
- panfrost: Allow NULL streamout targets
- nine: Make vdecl_index_map always signed
- nine: Only enable thread_submit by default on x86

Igor Torrente (2):

- venus: Add support to VK_KHR_maintenance4 extension
- venus: Use maintenance4 to get max_size_buffer

Ikshwaku Chauhan (1):

- Revert "radeon: add EFC support to only VCN2.0 devices"

Illia Abernikhin (1):

- vulkan/runtime: repair thread safety in method vk_queue_start_submit_thread()

Indrajit Das (5):

- radeonsi/gfx11: VRS changes
- radeonsi: save the fs constant buffer to the util blitter context
- gallium/u_blitter: clear color buffers using color from a constant buffer
- mesa/st: add nir shader to clear color buffers using constant value
- mesa/st: clear color buffers using color from a constant buffer

Italo Nicola (3):

- mesa/st: call pipe->link_shader even if ir loaded from cache when linking
- tgsi: Add SEPARABLE_PROGRAM property
- virgl: overpropagate precise flags

Iván Briano (6):

- vulkan/wsi/wayland: Fix double free on error condition
- anv: re-alloc push constants after secondary command buffers
- anv: do not get rid of empty/useless fragment shaders
- anv: assert inheritance_info is not NULL
- anv: emit scissors when the pipeline changes
- anv: pipelineStageCreationFeedbackCount is allowed to be 0

James Zhu (5):

- amd: update headers to support decode software ring
- radeonsi/gfx11: update codec support for gfx11
- radeonsi/vcn: add decode software ring support for gfx11
- radeonsi/vcn: add jpeg decode support for gfx11
- amd/common: some ASICs with gfx9 use compute rings for render

Jan Beich (1):

- vulkan/wsi: treat EBADF as missing DMA_BUF_IOCTL_{EXPORT,IMPORT}_SYNC_FILE

Jan Palus (1):

- pvr: use UINT64_C for 64-bit constant

Jason Ekstrand (312):

- clc: Only initialize LLVM once
- clc: Rework logging a bit
- clc: Declare LLVMContexts on the stack
- intel/compiler: Set lower_fisnormal
- nir,microsoft: Move scale_fdiv into a common NIR pass
- nir/opcodes: fisfinite32 should return bool32
- util: Remove util_cpu_detect
- util/blob: Add align helpers
- util/set: Respect found in search_or_add_pre_hashed
- vulkan: Add a common VkPipelineCache implementation
- vulkan/pipeline_cache: Add helpers for storing NIR in the cache
- anv: Rename a fail label in CreateDevice
- anv: Switch to the new common pipeline cache
- anv: Clean up pipeline cache helpers a bit
- nir: Lower all bit sizes of usub_borrow
- vulkan: Set signals[i].stageMask = ALL_COMMANDS for QueueSubmit2 wrapping
- vulkan: Use ALL_COMMANDS_BIT for waits/signals instead of ~0
- spirv: Handle Op*MulExtended for non-32-bit types
- vulkan/log: Allow but warn if called with a NULL object
- vulkan/log: Allow but warn for client-invisible objects
- nir/constant_folding: Break TXB folding into a helper function
- nir: Constant fold sampler/texture offsets
- panvk: Take buffer offsets into account in BindVertexBuffers
- panvk: Eliminate unused vertex attributes
- nir/deref: Add an alu-of-cast optimization
- panvk: Add address/range helpers for panvk_buffer
- vulkan/wsi: Close file descriptors in wsi_destroy_image
- vulkan/wsi: Only use a single fd per wsi_image
- vulkan/wsi: Hang on to file descriptors
- vulkan/wsi: Reset the image fence right before vkQueueSubmit
- vulkan/wsi: Set the right stage flags for semaphore waits
- vulkan/wsi: Add signal_fence/semaphore_for_image helpers
- nir/cf: Return a cursor from nir_cf_extract as well
- nir: Fix constant folding for non-32-bit ifind_msb and clz
- nir: Set image_buffers and msaa_images in lower_samplers_as_deref
- nir: Stop assuming shader_info::textures_used is 32-bit
- shader_info: Make images_used a bitset
- nir: Gather samplers_used separately from textures
- util/bitset: Support larger ranges in BITSET_TEST/CLEAR_RANGE
- shader_info: Bump the number of images and textures supported
- gallium/u_threaded_context: Use PIPE_MAX_SHADER_SAMPLER_VIEWS for sampler_buffers
- mesa/st: Clamp MaxImageUniforms to MAX_IMAGE_UNIFORMS
- gallium: Bump PIPE_MAX_SHADER_IMAGES to 64
- panvk: Call nir_lower_io_arrays_to_elements_no_indirects
- nir/lower_blend: Be more explicit about deref assumptions
- nir/lower_blend: Stop passing the whole options object around
- nir/lower_blend: Support SNORM and integer formats for logic ops
- nir/builder: Add a nir_trim_vector helper
- nir/builder: Add a nir_resize_vector helper
- midgard: Handle FB fetch from non-vec4 output variables.
- nir/lower_blend: Expand or shrink output variables as needed
- panvk: Advertise support for logicOp
- iris: Set BindingTableEntryCount for compute shaders
- nir: Mark negative re-distribution on fadd as imprecise
- clover: Set images/samplers_used when lowering images
- lavapipe: Set shader_info::samplers_used
- gallium: Set shader_info::samplers_used in pstipple_fs
- ttn: Set shader_info::samplers_used
- mesa/st: Set samplers_used in lower_tex_src_plane
- gallium/draw: Properly handle nr_samplers != nr_sampler_views in keys
- llvmpipe: Fill out samplers even if nr_samplers = 0
- lavapipe: Set images_used in lvp_lower_pipeline_layout
- ntt: Don't gather samplers_declared twice
- panvk: Move CreateDescriptorSetLayout to per-arch
- panvk: Break descriptor lowering into its own file
- panvk: Add a buffer to each descriptor set
- panfrost,panvk: Make fixed_sysval_ubo < 0 mean compiler-assigned
- panfrost: Add some sanity checking for sysvals
- bifrost,midgard: Allow providing a fixed sysval layout
- panvk: Get rid of the per-pipeline sysvals BO
- panvk: Use a flat sysvals struct
- panvk: Stop calling lower_uniforms_to_ubo
- panvk: Put the sysval and push const UBOs at fixed indices
- panvk: Interleave UBOs with multiple descriptor sets
- panvk: Put SSBO addresses in the descriptor buffer
- vulkan: Start collecting enabled features in vk_device
- panvk: Enable robustBufferAccess
- panvk: Advertise VK_KHR_variable_pointers
- panvk: Add an elems field to panvk_buffer_view
- pavnk: Pass bind layouts to texture and image descriptor helpers
- panvk: Implement texture/image queries
- FIXUP: Use 16-bit things for texture sizes
- panvk: Re-arrange descriptor set functions
- panvk: Rewrite the write portion of vkUpdateDescriptorSets
- panvk: Set immutable samplers properly up-front
- panvk: Implement descriptor copies properly
- panvk: Drop panvk_descriptor
- panvk: Round FillBuffer sizes down to a multiple of 4
- nir: Preserve metadata if remove_dead_derefs makes no progress
- nir: Add a var set version of lower_indirect_derefs
- mesa/st: Use nir_shader_instructions_pass for st_nir_lower_builtin
- mesa/st: Check deref modes in lower_builtin_instr
- mesa/st: Use lower_indirect_var_derefs in st_nir_lower_builtin
- nir: Put the builder first in lower_input_attachments helpers
- nir: Use nir_shader_instructions_pass in nir_lower_input_attachments
- bifrost: Run nir_lower_global_vars_to_local before nir_lower_vars_to_scratch
- panvk: So more nir_lower_tex before descriptor lowering
- panvk: Lower blending after lower_var_copies
- v3dv: Add a create_image_view helper for internal views
- vulkan: Only be clever about vk_image_view::view_format for normal views
- v3dv: Don't use color aspects for depth/stencil images
- v3dv: Drop the region temporary from blit_shader
- vulkan,v3dv: Add a driver_internal flag to vk_image_view_init/create
- radv: Only use PLANE_0 in meta when actually needed
- radv: Use vk_image as the base for radv_image
- radv: Use vk_image_view as the base for radv_image_view
- glsl: Drop this != NULL assertions
- vulkan/pipeline_cache: Implement deserialize for raw objects
- v3dv: Loosen an assert in copy_buffer_to_image_shader
- vulkan: Fall back to raw data objects when deserializing if ops == NULL
- lavapipe: Use the correct ICD path on Win32
- vulkan,anv,turnip: Add a common CmdBindVertexBuffers wrapper
- radv: Add a sqtt entrypoint for CmdBindVertexBuffers2
- radv: Use the common CmdBindVertexBuffers wrapper
- lavapipe: Use the common BindVertexBuffers wrapper
- nir/algebraic: Add two more pack/unpack rules
- intel/fs: Drop fs_visitor::emit_alpha_to_coverage_workaround()
- intel/fs: Copy color_outputs_valid into wm_prog_data
- anv: Drop alpha_to_coverage from the NULL FS optimization
- anv: Handle the null FS optimization after compiling shaders
- anv: Don't disable the fragment shader if XFB is enabled
- intel: Only set VectorMaskEnable when needed
- vulkan: Add a base struct for buffers
- anv: Use the base vk_buffer struct
- panvk: Use the vk_buffer base struct
- vulkan/runtime: Add min_lod to vk_image_view
- intel/isl: Add isl_view::min_lod_clamp for IVB+
- anv: Implement VK_EXT_image_view_min_lod
- isl: Add some asserts about multisampled surfaces
- nir: Handle register sources in lower_phis_to_regs_block
- nir: Add a correctness note for nir_lower_phis_to_regs_block
- nir: Rename nir_gather_xfb_info to nir_shader_get_xfb_info
- nir: Add a nir_xfb_info to nir_shader
- anv: Use nir_shader_gather_xfb_info
- lavapipe: Use nir_shader_gather_xfb_info
- turnip: Use nir_gather_xfb_info
- radv: Use nir_gather_xfb_info
- nir: Drop nir_shader_get_xfb_info
- glsl/nir: Populate nir_shader::xfb_info after linking varyings
- glsl/nir: Stop leaking varyings_info
- glsl/nir: Stash the xfb_info in the nir_shader when linking XFB
- st,nir: Use nir_shader::xfb_info in nir_lower_io_passes
- nir/glsl: Use rzalloc for nir_xfb_info
- nir: xfb_buffer_info::stride is in bytes
- ttn: Populate the images/textures/samplers_used fields in shader_info
- vulkan,anv: Move the image offset/extent sanitize helpers to common code
- radv: Use the common image offset/extent sanitize helpers
- vulkan,radv: Move vk_format_get_plane_format to common code
- vulkan/format: Add a vk_format_get_aspect_format helper
- vulkan: Add a helper for image<->buffer copies
- anv: Use the common image<->buffer copy helper
- vulkan,radv: Steal some image offset/extent helpers from radv
- radv: Use vk_image_buffer_copy_layout
- panvk: Use vk_image_buffer_copy_layout
- lavapipe: Use vk_image_buffer_copy_layout
- nir/deref: Break out a helper for compare_deref_paths
- nir/deref: Use an index in compare_deref_paths
- nir/deref: Make compare_deref_paths take a stop callback
- nir/deref: Re-arrange variable checks in compare_deref_paths
- nir: Increase nir_variable_data::mode to 16 bits
- nir/vars_tests: Use nir_var_mem_global instead of ssbo
- nir/deref: Handle SSBO array bindings specially
- nir/deref: Handle RESTRICT for SSBO deref bindings
- compiler/types: Don't place members in the previous substruct's end padding
- anv/wsi: Stop resetting semaphores
- vulkan/wsi: Fix a signal_semaphore_with_memory check
- wsi: Always signal semaphores and fences in wsi_common_acquire_next_image
- panvk: Use the common AcquireNextImage implementation
- v3dv: Use the common AcquireNextImage implementation
- radv: Drop create_sync_for_memory
- lavapipe: Use the common AcquireNextImage implementation
- pvr: Use the common AcquireNextImage implementation
- turnip: Use the common AcquireNextImage implementation
- turnip: Use the common QueuePresent implementation
- turnip: Set supported_sync_types before wsi_init
- vulkan/wsi: Use a STACK_ARRAY for stage_flags
- vulkan/wsi: Reset fences earlier in wsi_common_queue_present
- vulkan/wsi: Re-arrange QueueSubmit code in wsi_common_queue_present
- vulkan/wsi: Set memory ownership after signaling fences and semaphores
- vulkan/wsi: Signal semaphores and fences from the dma-buf
- intel/compiler: Use NIR_PASS(_, ...)
- anv: Use NIR_PASS(_, ...)
- anv: Properly clamp attachment layer counts
- vulkan/wsi: Use HAVE_LIBDRM to detect DRM instead of !_WIN32
- vulkan: Add some border color helpers
- panvk: Use the new border color helpers
- lavapipe: Use the new border color helper
- turnip: Use the new border color helpers
- vulkan/nir: Make spirv_data const in vk_spirv_to_nir
- vulkan: Add a vk_pipeline_shader_stage_to_nir helper
- gallium: Add a u_default_clear_buffer helper
- iris: Use u_default_clear_buffer
- panfrost: Use u_default_clear_buffer
- nir/opt_memcpy: lower copies to/from tightly packed types
- nir: Add an options parameter to deref_instr_has_complex_use
- nir/opt_memcpy: Add another case for function_temp
- vulkan/render_pass: Add a better helper for render pass inheritance
- anv: Use CmdBeginRendering for resumes in BeginCommandBuffer when possible
- vulkan/render_pass: Use a special layout for self-dependencies
- vulkan/render_pass: Pass sample locations to barriers
- vulkan/render_pass: Allow for mixed sample counts
- vulkan/render_pass: Support VkAttachmentSampleCountInfoAMD
- vulkan: Fix a comment
- vulkan: Add a base struct for descriptor set layouts
- vulkan: Add a common implementation of pipeline layouts
- lavapipe: Use the vk_descriptor_set_layout base struct
- lavapipe: Use the vk_pipeline_layout base struct
- panvk: Use the vk_descriptor_set_layout base struct
- panvk: Use the vk_pipeline_layout base struct
- vulkan: Depend on vk_pipeline_layout in vk_cmd_enqueue
- anv: Replace an assert() with unreachable()
- util: Re-indent util_sign_extend, comment, and add asserts
- util: Use shifts in util_sign_extend
- util: Use util_sign_extend in fast_idiv_by_const
- mesa/bptc: Use util_sign_extend
- util,nir: Move mask_sign_extend from opt_load_store_vectorize to util
- isl: Use util_sign_extend
- intel/decoder: Use util_mask_sign_extend()
- panfrost,asahi: Use util_sign_extend for unpacking
- nir: Use util_mask_sign_extend when serializing constants
- vulkan/wsi: Pass the size to MapMemory in the SW path
- vulkan/wsi/wayland: Use a single memcpy in the SW path
- vulkan/wsi: Move select_memory_type to common and rework it a bit
- vulkan/wsi: Add a helper for creating CPU images
- vulkan/wsi/x11: Use wsi_configure_cpu_image
- vulkan/wsi/wayland: Use wsi_configure_cpu_image
- vulkan/wsi: Delete SW support from configure_native_image
- vulkan/wsi: Persistently map CPU images
- vulkan/wsi: Compute stride and size in configure_buffer_image
- vulkan/wsi: Align buffer image strides to optimalBufferCopyRowPitchAlignment
- vulkan/wsi: Support tiled CPU images
- vulkan/wsi/x11: Don't leak shm_reply if we don't have dri3 or present
- vulkan/wsi/x11: Only use MIT_SHM if the device supports EXT_external_memory_host
- vulkan/wsi/wayland: Use host pointer import when available
- dozen: Increase optimalBufferCopy*Alignment
- dozen: Stop asking for semaphore/fence signaling
- vulkan/wsi/wayland: Only memcpy if the swapchain is actually software
- vulkan/wsi: Fix structure chaining in wsi_create_buffer_image_mem
- shader_info: Move subgroup_size out of cs and make it an enum
- vulkan: Re-order pipeline hashing
- vulkan: Constify vk_spirv_version
- spirv,vulkan: Set shader_info::subgroup_size
- mesa,glsl,ttn: Set subgroup_size to UNIFORM
- anv: Drop unnecessary parameters to anv_pipeline_compile_cs
- anv: Put a VkPipelineShaderStageCreateInfo* in anv_pipeline_stage
- anv: Use vk_pipeline_shader_stage_to_nir
- vulkan: Add a vk_pipeline_shader_stage_is_null() helper
- anv: Use vk_pipeline_shader_stage_is_null()
- intel,anv,iris,crocus: Drop subgroup size from the shader key
- anv: Implement VK_EXT_shader_module_identifier
- anv: Init supported extensions before WSI
- vulkan/wsi/wayland: Add a BUFFER_TYPE flag
- vulkan/wsi: Add debug variables to force the SW and PRIME buffer blit paths
- intel/blorp: Set uses_sample_shading for MSAA blit shaders
- radv: Set uses_sample_shading for copy shaders
- intel/fs: Use shader_info::fs::uses_sample_shading
- nir: Add a pass for lowering shaders to single-sampled
- intel/fs: Use nir_lower_single_sampled
- glsl/nir: Set uses_sample_shading from glsl_to_nir
- spirv/nir: Set uses_sample_shading from spirv_to_nir
- nir/gather_info: Stop gathering uses_sample_shading
- glsl/nir: Set sample_shading if a FS output ever shows up as an rvalue
- intel/fs: Simplify persample_dispatch
- intel/fs,anv: Rework handling of coarse and sample shading
- panfrost: Simplify sample_shading
- anv: Stop compacting surface state tables
- vulkan: Detect pNext chain loops in vk_foreach_struct()
- anv: Don't use the wrong ARRAY_SIZE
- Revert "vulkan: Detect pNext chain loops in vk_foreach_struct()"
- vulkan: Add a vk_limits.h file for runtime limits
- vulkan: Add an enum for all dynamic graphics states
- vulkan: Add standard sample locations
- vulkan: Add data structures to store all graphics state
- util: Constify __bitset_test_range
- vulkan: Add a common data structure for dynamic states
- vulkan: Add a common implementation of vkCmdSet*
- vulkan: Copy the depth/stencil state optimization code from ANV
- Use vk_foreach_struct_const where needed
- vulkan: Detect pNext chain loops in vk_foreach_struct() (v2)
- anv: Add an instance multiplier to anv_pipeline
- anv: Handle vertex buffer sizes in anv_CmdBindVertexBuffers2
- anv: Stop recording sample locations per-sample-count
- anv: Convert to using vk_graphics_pipeline_state
- anv: Rework setting primitive topology
- anv: Stop merging DEPTH_STENCIL state
- anv: Switch to using common dynamic state tracking
- vulkan/graphics_state: Improve the depth/stencil optimization code
- anv: Do depth/stencil optimization for dynamic depth/stencil
- vulkan: Call lower_clip_cull_distance_arrays in vk_spirv_to_nir
- vulkan/nir: Don't remove dead XFB outputs
- vulkan: Call gather_xfb_info in vk_spirv_to_nir
- lavapipe: Use more Vulkan NIR heleprs
- vulkan: Allow up to 12 pointers in multialloc
- vulkan: Record shader stages in vk_graphics_pipeline_state
- vulkan: Multisample state isn't always included in fragment shader state
- vulkan: Depth/stencil isn't fragment output state but multisample is
- vulkan: Add a fully_dynamic_state_groups() helper
- vulkan: Fix pipeline libraries with dynamic-only VI or FSR state
- vulkan: Input assembly and depth/stencil can also be fully dynamic
- vulkan: Don't assert VkGraphicsPipelineLibraryCreateInfoEXT::Flags == 0
- zink: Mark depth/stencil visual tests as flakes
- lavapipe: Use shader_info::uses_sample_shading
- lavapipe: Use the vk_graphics_pipeline_state
- vulkan: Append subpass structures to VkRenderingInfo last
- vulkan: Include self-dep info in rendering continues
- intel/eu: Don't throw validation errors on float MOV_INDIRECT
- lima: Lower undefs to zero and run DCE after from_ssa
- nir/from_ssa: Ignore undef sources
- vulkan: Always populate vk_render_pass_state::render_pass
- vulkan: Improve the docs for vk_subpass_info
- radv: Use both aspects for depth/stencil blit destinations
- vulkan: Dirty VP_VIEWPORTS/SCISSORS when copying viewports/scissors

Jason Volk (4):

- r600: Fix userspace pointer support for evergreen compute.
- r600: Improve compute memory pool performance; reduce fragmentation.
- r600: Elide downloads for discarded and immutable compute memories.
- radeon: Support shared memory user pointers.

Jeffrey Knockel (1):

- egl: dereference XCB drawable pointers

Jesse Natalie (158):

- d3d12: Don't block DISPLAY based on format for non-Windows
- wgl: Implement WGL_ARB_create_context_robustness
- d3d12: Hook up robustness extensions
- d3d12: Plug context leaks
- d3d12: Plug screen leaks
- d3d12: Attempt screen reset during context create
- d3d12: Add a unit test for context reset recovery
- microsoft/compiler: Unload DXIL validator library \*after* calling Release()
- d3d12: Correctly key off of polygon stipple enable cap
- mesa: Ensure ARB programs end in a newline
- bin: Add a script for filtering/sorting refcount logs
- u_debug_refcnt: Don't loop for initial refcounts if the initial value is huge
- u_debug_symbol: Use correct sizeof for DbgHelp
- u_debug_symbol: Allow re-init of DbgHelp to work around already-initialized issues
- u_debug_stack: Lock around stack dumps to prevent interleaving
- u_primconvert: Refactor to remove recursion from util_primconvert_draw_vbo
- u_primconvert: Handle take_index_buffer_ownership
- d3d12: Use a pipe_reference in d3d12_bo
- d3d12: Fix leaks in map with do-not-wait
- d3d12: Fix a couple over-releases from incorrect take-ownership flag
- d3d12: Destroy empty residency bo set during early-return
- d3d12: Include windows.h on Windows before dxgicommon.h
- d3d12: Don't use VLAs
- d3d12: MSVC warning around operator precedence causing uint32_t==bool
- dzn: Add new parameter to vk_image_view_init call
- d3d12: Fix forward decl for sw_winsys
- microsoft/compiler: Fixup sampler derefs in tex instrs that don't \*need* samplers
- nir_lower_task_shader: Fix return from lower_task_intrin (bool, not void*)
- d3d12: Re-add missed bitmasks to shader key comparison
- CI/d3d12: Add skips and update baselines
- CI/windows: Install Vulkan SDK from LunarG directly instead of Chocolatey
- CI/windows: Delete comment for installing vulkan-runtime
- CI/windows: Disable chocolatey progress printing to shrink log size
- CI/windows: Disable LLVM CMake developer warnings to shrink log size
- CI/windows: Move --fraction for deqp-runner to run command line
- dzn: Update fails/flakes
- CI: Re-enable Windows builds
- nir: Consider PNTC to be a varying
- wgl, d3d12: Handle front buffer writes for double-buffered MSAA surfaces backed by swapchains
- meson: Use a C99 STDC_VERSION for flex
- d3d12: Report PIPE_CAP_ACCELERATED as 0 on WARP
- d3d12: Properly set HS input control point count
- dzn: Init sync types before wsi
- mesa: Use C11 alignas for cross-platform SSE support
- CI: Lima farm is offline
- dzn: Skip another test that times out occasionally
- util/disk_cache: Implement disk_cache_get_function_identifier for Windows
- microsoft/clc: Enable tests that pass on server 2022
- microsoft/clc: Fix test double free in the case of compilation failure
- microsoft/clc: Remove dead image vars
- microsoft/clc: Add a unit test for unused image kernel args
- vulkan_entrypoints_gen: For Windows ARM64EC, prefix symbols with '#'
- dzn: Support Vulkan loader interface v5
- gallium/util: Move u_dl and u_pointer to src/util
- dzn: Annotate sometimes-unused variables.
- dzn: Define COBJMACROS before any includes
- dzn: Use os_time_sleep instead of Win32 Sleep
- dzn: Use u_dl to load D3D12
- dzn: Add an eventfd sync implementation
- dzn: Fix maybe-uninitialized warning
- dzn: Skip dxil validator and some Windows-only debug flags for Linux
- dzn: Use IUnknown instead of IDXGIAdapter1 as the stored adapter
- dzn: Don't hash adapter LUID as part of device UUID
- dzn: Use a custom adapter desc instead of DXGI adapter desc
- dzn: Move DXGI code to a separate file and only build it on Windows
- dzn: Add a DXCore enumeration path
- dzn: Add ABI helpers for D3D12 functions returning structs
- vulkan/wsi/win32: Use the new helpers and persistent map
- dzn: Fix winsys reporting
- dzn: Only support high/normal queue priorities
- dzn: Add for condition to break nested loop
- dzn: Re-order includes in dzn_nir to prevent TRUE redefinition
- dzn: Only add Werror flags that the compiler supports
- dzn: Disable Wunused-value for C++ files
- dzn: Fix comparison order for format enums
- dzn: Re-order dzn_physical_device_desc so = { 0 } doesn't initialize a struct
- dzn: Use fabs on floating point input
- CI: Add dzn to debian vulkan/clang/release builds
- mesa: Support D3D11/D3D12 memory imports
- mapi: Add more EXT_external_objects_win32 functions/enums
- gallium: Add a 'name' field to winsys_handle
- mesa: Implement ImportMemoryWin32NameEXT
- gallium: Add 'name' field to Win32 semaphore import
- driver_noop: Remove infinite recursion from create_fence_win32
- gallium: Add a new fence type with a pipe cap to indicate it can be imported
- mesa: Support importing D3D12 fences as timeline semaphores
- mesa: Implement ImportSemaphoreWin32NameEXT
- gallium, mesa: Support setting timeline semaphore values
- d3d12: Store the rest of the device IDs in the screen
- d3d12: Compute UUIDs required by external objects extension
- d3d12: Add pipe getters for Win32 and base external objects device matching
- d3d12: Support B4G4R4A4 format
- d3d12: Get adapter LUID after device creation
- d3d12: PIPE_BIND_SHARED doesn't mean linear and is always on opened resources
- d3d12: Support creating memory objects
- d3d12: Implement resource_from_memobj
- d3d12: Support opening resources and memobj by name
- d3d12: Support importing fences / timeline semaphores
- d3d12: Implement server signal/wait
- d3d12: Implement fence opening and value setting
- d3d12: Fix up resource import validation
- docs: Update features.txt and new_features.txt
- u_atomic: Implement p_atomic_xchg for Windows
- u_atomic: Fix MSVC p_atomic_add_return
- u_atomic: Add p_atomic_fetch_add which returns the old value
- zink: Use p_atomic_fetch_add
- simple_mtx: Replace GCC sync intrinsics with u_atomic ops
- Fix static glapi on Windows
- dzn: Missed ABI fixes for GetCustomHeapProperties
- dzn: Remove the cast when the SDK version is high enough
- d3d12: Add helpers to build with correct ABI for MinGW
- microsoft/clc: Add helpers to build with correct ABI for MinGW
- subprojects: Point DirectX-Headers at a specific tag
- CI: Update DirectX-Headers dependency for MinGW/Debian
- util: Add a Win32 futex impl
- d3d12: Support clip halfz
- d3d12: Add a blank d3d12_resource_state.h/cpp
- d3d12: Move desired resource state to new files
- d3d12: Move current resource state to new files
- d3d12: Give bos a unique identifier to be used for state tracking
- d3d12: Add a list of contexts alive for the current screen
- d3d12: Hold lock when removing resources from residency list
- d3d12: Notify contexts about deletion of bos
- d3d12: Add a context state tracking structure
- d3d12: Treat depth/stencil as planar for plane count helper
- d3d12: Create/free context state entries
- d3d12: Track a global resource state for non-simultaneous-access resources
- d3d12: Record a state fixup command list when necessary
- d3d12: Swap the remainder of state tracking to new method
- d3d12: Rename bind invalidate options to transition flags
- d3d12: Extract core barrier logic
- d3d12: Optimize transition_subresource_states that covers a whole resource
- d3d12: Add a transition flag indicating that state accumulation is needed
- d3d12: Remove state tracking implementation details from header
- nir_lower_io_to_scalar: Support arrayed (per-vertex) I/O
- microsoft/compiler: Add a max shader model option
- microsoft/compiler: Add a max validator version
- microsoft/compiler: Emit metadata based on long-lived signature data
- microsoft/compiler: Split signature processing into two parts
- microsoft/compiler: Ensure 4-component position writes via NIR
- microsoft/compiler: Lower I/O to scalar
- microsoft/compiler: Support DXIL validator version 1.5
- microsoft/compiler: Compute correct usage masks
- microsoft/compiler: Correctly compute dynamic indexing I/O masks
- microsoft/compiler: DXIL validator 1.6 counts UAVs differently
- microsoft/compiler: Set the barycentrics flag for attribute_at_vertex
- microsoft/compiler: DXIL validator 1.6 uses a new PSV struct version
- microsoft/compiler: DXIL validator 1.6 uses a new PSV resource struct
- microsoft/compiler: Support DXIL validator 1.6
- microsoft/compiler: Support DXIL validator 1.7
- microsoft/compiler: Add a getter for a validator version from DXIL.dll
- microsoft/compiler: Blacklist DXIL validator 1.6 from 20348 SDK
- d3d12: Retrieve validator version
- dzn: Retrieve validator version
- egl/wgl: Delete unused variables/code
- egl/wgl: Fix some awkward sizeof formatting
- microsoft/compiler: Discard shouldn't be marked readnone
- microsoft/compiler: Fix PSV struct when numthreads is 0

Jiang Feng (1):

- virgl: Set res->maybe_busy to true when creating resources

Jonathan Gray (2):

- intel/dev: sync ADL-S pci ids with linux
- intel/dev: add RPL-S pci ids from drm-intel-next

Jonathan Marek (2):

- freedreno/registers: add a7xx registers for drm/msm kernel driver
- freedreno/registers: add missing varset="chip" for new enum values

Jonathan Weinstein (1):

- lavapipe: support instance rate zero for VK_EXT_vertex_attribute_divisor

Jordan Justen (48):

- intel/dev: Add device info for RPL-P
- vulkan/wsi/x11: Use atomic_int with sent_image_count
- drm-uapi/i915_drm.h: Update from drm-next (2022-04-28)
- intel/dev: Read hwconfig from i915
- intel_dev_info: Add --hwconfig command line parameter
- iris: Fix assertion meant to only target the clear-color stride
- nir/divergence: handle more \*_intel intrinsics
- intel/dev: Add INTEL_PLATFORM_DG2_G12
- intel/dev: Add DG2 G12 PCI IDs
- intel/dev: Enable first set of DG2 PCI IDs
- intel: Fix build of mi_builder_tests by including c99_compat.h
- intel: Build mi_builder_test whenever build-tests is set
- drm-uapi/i915_drm.h: Update from drm-next (2022-05-06)
- intel/gem: Add support for I915_ENGINE_CLASS_COMPUTE
- anv: Add support for I915_ENGINE_CLASS_COMPUTE in init_device_state()
- anv: Move STATE_BASE_ADDRESS programming into init_common_queue_state()
- anv, iris: Add support for I915_ENGINE_CLASS_COMPUTE
- anv, iris: Enable compute engine with INTEL_COMPUTE_CLASS=1
- intel/dev: Add devinfo::mem to store i915 regions information
- anv: Use memory info from devinfo
- iris/bufmgr: Use memory info from devinfo
- intel/tools: Print memory info in intel_dev_info
- vulkan/wsi: Disable dma-buf sync file if ENOSYS is returned
- intel/compiler: Don't create vec4 reg-set for gen8+
- intel/dev: Add vram.unmappable.size region info
- intel/tools: Print unmappable region info in intel_dev_info
- intel/dev: Add intel_vram_all_mappable()
- iris/resource: Assert that DG2 CCS buffers don't also try to set BO_ALLOC_SMEM
- iris/resource: Avoid mapping when not needed in iris_resource_init_aux_buf()
- iris/bufmgr: Add all_vram_mappable which is currently always true
- iris/bufmgr: Set mmap_mode to IRIS_MMAP_NONE for lmem in small-BAR mode
- iris: Set clear_color_unknown if the bo is not mappable
- anv/allocator: Add assert and TODO comment for future small BAR uapi
- iris/bufmgr: Add assert and TODO comment for future small BAR uapi
- drm-uapi/i915_drm.h: Update from drm-next (2022-07-22)
- iris/bufmgr: Add I915_GEM_CREATE_EXT_FLAG_NEEDS_CPU_ACCESS for vram mappable buffers
- intel/dev: Use i915 region probed_cpu_visible_size when non-zero
- intel/dev: Determine the amount of free vram using small BAR uapi
- intel/dev: Enable remaining DG2 and ATS-M device IDs
- anv: Fix PHYSICAL_DEVICE_MEMORY_BUDGET_PROPERTIES with large BAR
- intel/dev: Fill in system memory info when using INTEL_DEVID_OVERRIDE
- iris: Drop extra file-descriptor dup in iris_drm_screen_create()
- intel/pci_ids: Update ADL-S strings
- intel/pci_ids: Add 0x468b ADL-S PCI-id
- intel/pci_ids: Update ATS-M device names
- intel/pci_ids: Update DG2 device names
- intel/pci_ids: Add dg2 0x5698 pci-id
- intel/pci_ids: Drop non-upstream dg2 pci-ids

Jose Maria Casanova Crespo (1):

- v3dv/ci: increase fraction to 10 on v3dv ci jobs.

Josh Billingsley (1):

- driconf: add SD Gundam G Generation Cross Rays

Joshua Ashton (1):

- aco: Use movk for AddressHi bits in vertex prolog

José Fonseca (6):

- trace: Fix framebuffer state serialization.
- lavapipe: Prevent mapping buffers beyond their size.
- trace: Dump NIR.
- trace: Parse character data more efficiently.
- trace: Bring state dump up to speed.
- trace: Allow to control nir dumping via an environment variable.

Juan A. Suarez Romero (37):

- ci/v3dv: remove fixed test
- v3d: report the correct unsupported blit format
- v3d: do not tile 1D textures
- v3d: use surface format defined on pipe_blit
- gallium: add hook on getting canonical format
- v3d: define our own canonical supported formats
- v3d: enable GL_ARB_copy_image extension
- gallium: move get_canonical_format hook to pipe_screen
- v3dv: duplicate key on hashtable insert
- v3dv: store device_id on device init
- v3d/simulator: add support for AMD cards
- mesa: unref syncobj after wait_sync
- v3d: enable early Z/S clears
- v3d: disable early-Z on odd frame dimensions
- v3d: allow TFU blitting for single layer textures
- v3d: add hook to get on-disk shader cache
- v3d: add builtin support for white/black clamp-to-border
- v3dv: check BO mapping result
- docs: update extensions for V3D and VC4
- ci: bump VK-GL-CTS to 1.3.2.0
- v3d: fix blending for mixed RT formats
- v3d: expose GL_ARB_draw_buffers_blend
- v3d/ci: Add traces
- v3d: use function to initialize refcount
- v3d: save only required states in blitter
- v3d: release all color buffers on context destroy
- v3d: add ARB_polygon_offset_clamp extension support
- v3dv/ci: Update expected results
- v3d/ci: Add flake test
- v3d/v3dv/ci: update expected results
- v3d/ci: (partially) revert expected list
- v3d/ci: update expected list
- v3dv/ci: Update expected list
- v3d/vc4/ci: Fix typo in skipped list
- v3d/ci: enable ARB_texture_cube_map piglit tests
- vc4: properly restore vc4 debug option
- vc4: store tex sampler in proper register

Kai Wasserbäch (1):

- fix(gallivm): Replace LLVMConstF* with LLVMBuild* methods.

Karmjit Mahil (26):

- pvr: Fix clang-format errors caused by vk outarrays.
- pvr: Add PVR_WORKGROUP_DIMENSIONS define.
- pvr: Add initial implementation of vkCmdDispatch().
- pvr: Add compute update shared kernel in vkCmdDispatch().
- pvr: Add compute update kernel in vkCmdDispatch().
- pvr: Update pvrsrvkm to fw 1.17.
- pvr: Add csb enum helper function header.
- pvr: Write immutable descriptor words in vkAllocateDescriptorSets().
- pvr: Fix incorrect samples to cr_isp_aa_mode conversion.
- pvr: Change compute hard coding infrastructure.
- pvr: Remove unused output_selects from vertex shader state.
- pvr: Add graphics pipeline hard coding infrastructure.
- pvr: Enable 33.15.11.3 (AXE-1-16M).
- pvr: Extend hard coding infra to allow per stage per device coding.
- pvr: Fix physical device limits.
- pvr: Handle vdm degen_cull_enable.
- pvr: Fix off by 1 error in buffer_id for ubo pds program.
- pvr: Rename loop iterator variable.
- pvr: Update pvrsrv build version for fixed size fw.
- pvr: Rename uniform pds program to descriptor pds program.
- pvr: Add static consts support in descriptor pds program.
- pvr: Add static consts to hard coding infra.
- pvr: Add initial vkCmdPipelineBarrier skeleton.
- pvr: Add IDF/WDF program for compute pipeline barrier.
- pvr: Add compute pipeline barrier handling.
- pvr: Fix cdm shared reg usage reported to fw.

Karol Herbst (20):

- nir: add 16 and 64 bit fisnormal lowering
- vtn: clamp SpvOpImageQuerySize dest to 32 bit
- nv50/nir: align tlsspace to 0x10
- llvmpipe: PIPE_COMPUTE_CAP_GRID_DIMENSION is uint64_t
- llvmpipe/fence: make the fence id counter atomic
- iris/cs: take buffer offsets into account for CL
- nir: add a nir_remove_non_entrypoints helper
- ntt: make use of new samplers_used field
- clc: parse localSize and localSizeHint
- clc: drop parsingComplete check
- clc: speed up compilation by not relying on opencl-c.h
- clc: fix compiler features_macro CTS Test
- clc: disable opaque pointers until they are supported
- nv50/ir: recognise Ampere
- nir/gather_info: allow to run it before inlining
- nir/gce: pin call instructions
- nouveau: use the contexts pushbuf and client where possible
- nouveau/mm: make code thread safe
- nv50/ir: fix OP_UNION resolving when used for vector values
- nv50: properly flush the TSC cache on 3D

Ken Johnson (2):

- d3d12: Reduce cost of shader key comparisons
- mesa/main: Correct alloca() use without #include "c99_alloca.h"

Kenneth Graunke (41):

- anv: Fix INTEL_DEBUG=bat on XeHP
- intel/decoder: Fix binding table pointer decoding with large offsets
- iris: Add FLUSH_HDC to PIPE_CONTROL_CACHE_FLUSH_BITS
- Revert "st/mesa: Transcode ASTC to BC7 (BPTC) where possible"
- intel: Drop Wa_1409226450 (stall before instruction cache invalidation)
- intel: Fix analysis invalidation in eliminate_find_live_channel
- intel/compiler: Move spill/fill tracking to the register allocator
- mesa: Split tx_compress_dxtn into per-format functions
- mesa: Avoid temp images in _mesa_texstore_rgb_dxt1 for GL_RGBA source
- mesa: Require temp images in _mesa_texstore_*_dxt* with SkipImages
- mesa: Avoid temp images in _mesa_texstore_*_dxt* for stride = 0
- isl: Add an isl_format_supports_typed_atomics() helper.
- intel/compiler: Fix uncompaction of signed word immediates on Tigerlake
- intel: Set a more useful fake devinfo->gtt_size in no-hw mode
- intel/compiler: Fix A64 header construction with a uniform address
- iris: Update comment about 2GB dynamic state range
- iris: Use PIPE_* defines rather than ones from main/config.h
- crocus: Use PIPE_* defines rather than ones from main/config.h
- intel/compiler: Stop including src/mesa/main/config.h
- intel/compiler: Fix brw_gfx_ver_enum.h to be a proper header file
- intel/compiler: Split 3DPRIM_* defines out to a separate header.
- intel/tools: Stop malloc'ing device info in i965_disasm
- intel/compiler: Move opcode_desc handling to a separate header
- intel/compiler: Introduce a new brw_isa_info structure
- intel/compiler: Remove use of thread_local for opcode tables
- intel/compiler: Convert brw_eu.cpp back to brw_eu.c
- intel/compiler: Handle split-sends in EOT high-register pinning case
- intel/fs: Opportunistically split SEND message payloads
- intel/compiler: Avoid copy propagating large registers into EOT messages
- intel/compiler: Don't set SBID on EOT send messages
- mesa: Mark render-to-texture as unsafe if there's no pipe_resource
- intel/eu: Handle compaction when inserting validation errors
- intel/eu: Fix XeHP register region validation for hstride == 0
- intel/eu: Clarify spec citations for XeHP region restrictions
- intel/eu: Mark header present in URB memory fences on XeHP
- intel/compiler: Lower FIND_[LAST\_]LIVE_CHANNEL in IR on Gfx8+
- intel/compiler: Delete unused Gfx8+ code in brw_find_live_channel()
- iris: Fix PIPE_CAP_UMA
- iris: Use linear for exported resources if we can't convey tiling
- iris: Delete unused iris_screen::aperture_bytes field
- crocus: Fix memory leaks on iris_resource_create failure paths

Konrad Dybcio (1):

- freedreno: Enable A619

Konstantin Seurer (106):

- radv: Fix ray queries with !15854
- radv: Do not discard hits with t=tmax
- radv: Enable rt primitive culling for spirv2nir
- ci: Always install glslangValidator for build jobs
- radv: Use VAs in radv_meta_buffer.c
- radv: Make fill_buffer_shader non-static
- radv: Format radv_acceleration_structure.c
- radv: Add the fuchsia radix sort
- radv: Build acceleration structures using LBVH
- radv: Clean up the accel-struct build shaders
- radv: Fix lowering ignore_ray_intersection
- vulkan: Add a shader module clone helper
- radv: Copy shader modules to avoid use after free
- radv/radix_sort: Add missing entry points
- radv/radix_sort: Make variable names consistent
- radv: Use the entire morton code as sort key
- ac/nir: Do not set cursor in try_extract_additions
- radv: Move RADV_RT_STAGE_BITS to radv_private.h
- radv: Treat rt stages like compute stages
- radv: Use RADV_RT_STAGE_BITS more often
- nir: Add a ray launch size addr intrinsic
- radv: Allow indirect dispatches without bo
- radv: Indirect unaligned dispatches
- radv,aco: Use ray_launch_size_addr
- radv: Implement vkCmdTraceRaysIndirectKHR
- radv: Advertise indirect ray tracing
- radv: Allow radv_fill_buffer to work with VAs only
- radv: Use radv_fill_buffer for accel struct builds
- Revert "radv: Make fill_buffer_shader non-static"
- radv: Add and use radv_bvh_node_type
- radv: Replace nir_vector_extract with nir_channel
- radv/ci: Skip slow/crashing ray tracing tests
- radv: Enable ray queries by default
- radv: Add fail/flake lists for navi22
- radv: Fix handling of primitiveOffset
- radv: Ignore transformOffset if transformData is 0
- vulkan: Use the correct enum in vk_buffer
- radv: Use vk_buffer
- radv: Add and use radv_cp_dma_wait_for_stages
- radv: Some acceleration structure cleanups
- radv: Require an alignment of 64 for accel structs
- radv: Fix the accel struct size calculation
- vulkan: Spec update to 1.3.216
- spirv: Add plumbing for ray_cull_mask
- radv: ray cull mask
- radv: Add accel struct queries for maintenance1
- radv: ACCESS_2_SHADER_BINDING_TABLE_READ_BIT_KHR
- radv: PIPELINE_STAGE_2_ACCELERATION_STRUCTURE_COPY
- nir: Handle ray_launch_size_addr in opt_preamble
- radv: vkCmdTraceRaysIndirect2KHR
- radv: Advertise VK_KHR_ray_tracing_maintenance1
- nir,spirv: Preserve inbounds access information
- nir/nir_lower_io: Optimize 32-bit inbounds access
- radv: Move some rt intrinsics to the top
- radv: Introduce an enum for the any hit status
- radv: Fix the t comparison for triangles
- radv: Refactor lower_rt_derefs
- nir: Add a nir_test_mask helper
- radv: Use nir_test_mask instead of i2b(iand)
- nir: Use nir_test_mask instead of i2b(iand)
- d3d12: Use nir_test_mask instead of i2b(iand)
- intel: Use nir_test_mask instead of i2b(iand)
- microsoft: Use nir_test_mask instead of i2b(iand)
- dozen: Use nir_test_mask instead of i2b(iand)
- radv: Use two bools for ahit_status
- radv: Skip pipeline_no_null_shaders_flag tests
- radv: Fix acceleration structure size queries
- radv: Use a global address for ray_launch_size
- radv: Use a global address for sbt_base
- radv: Only set rt stack size for dynamic stacks
- vulkan: Common vk_format_get_component_bits
- nir: Add a common gen_rect_vertices implementation
- radv: Use nir_gen_rect_vertices
- v3dv: Use nir_gen_rect_vertices
- radeonsi: Set vertex_id_zero_based
- aco: Remove dead nir_intrinsic_load_vertex_id case
- ac/llvm: Remove load_vertex_id handling
- util/format: Use an explicit length for the descs
- util/format: Assert that formats are valid
- radv: Remove format desc null checks
- radv: Use desc->format
- v3dv: Remove format desc null asserts
- turnip: Remove format desc null assert
- gallivm: Remove format desc null checks
- tgsi: Remove format desc null check
- util: Remove format desc null check
- agx: Remove format desc null check
- etnaviv: Remove format desc null check
- llvmpipe: Remove format desc null checks
- panfrost: Remove format desc null check
- r300: Remove format desc null check
- r600: Remove format desc null checks
- radeonsi: Remove format desc null checks
- softpipe: Remove format desc null check
- virgl: Remove format desc null checks
- pvr: Remove format desc null check
- util/format: Remove format desc null checks
- gallium/tests: Remove format desc null checks
- radv: Add meta saving/restoring for predicating
- radv: Use RADV_META_SUSPEND_PREDICATING
- radv: Simplify the meta init fail path
- radv: Fix out of bounds access in RTPSO creation
- vulkan: Fix descriptor set layout allocation scope
- vulkan: Handle descriptor set layout alloc fails
- radv: Use vk_descriptor_set_layout
- radv: Fix stack size calculation with stage ids

Kristian Høgsberg (1):

- util/u_debug: Use 'initialized' instead of 'first'

Kyle Russell (1):

- etnaviv: don't halve max_instructions for GC400

Leandro Ribeiro (1):

- Revert "egl/wayland: deprecate drm_handle_format() and drm_handle_capabilities()"

Lionel Landwerlin (120):

- anv: use shadow surface for stencil input attachment on gfx7
- intel: fix URB programming for GT1s
- intel/fs: require memory fence commit bit on Gfx9
- anv: improve INTEL_DEBUG for submit
- anv/intel: add a new debug flag for stalling after every draw/dispatch
- anv: implement DEBUG_SYNC
- anv: allow getting the address of the beginning of the batch
- intel/fs: tidy up lower of ray queries
- ci/iris: mark another test as flaky
- anv: implement VK_EXT_primitives_generated_query
- nir/divergence: handle load_global_block_intel
- anv: fix INTEL_DEBUG=sync
- intel: fixup number of threads per EU on XeHP
- anv: remove unused enum
- anv: fix acceleration structure descriptor template writes
- anv: skip acceleration structure in binding table emission
- anv: reemit 3DSTATE_STREAMOUT after memcpy
- anv: fix clflush usage on utrace copy batch
- anv: limit clflush usage
- vulkan/runtime: 0-out pipeline cache object
- iris: use new kernel uAPI to compute video memory
- anv: fix invalid utrace memcpy l3 config on gfx < 11
- anv: reset all dynamic state after secondary execution
- anv: add missing logic op set in pipeline dyn state
- Revert "anv: fix dynamic state emission"
- anv: rework sample location
- anv: never emit 3DSTATE_CPS in the pipeline batch
- anv: don't emit 3DSTATE_STREAMOUT in pipeline batch
- anv: don't emit 3DSTATE_WM in pipeline batch
- anv: don't emit 3DSTATE_BLEND_STATE_POINTERS in pipeline batch
- anv: don't emit 3DSTATE_DEPTH_BOUNDS in pipeline batch
- anv: don't emit 3DSTATE_VF_TOPOLOGY in pipeline batch
- anv: remove static_state_mask
- docs: explain state emission in Anv
- intel: fix stall debug option
- nir/cf: return cursor after insertion of cf_list
- nir/lower_shader_calls: don't use nop instructions as cursors
- nir/lower_shader_calls: don't insert code after break/continue
- nir/lower_shader_calls: put inserted instructions into a dummy block
- anv: fix primitives generated queries values
- intel/perf: add support for new opcodes in code generation
- intel/perf: store a copy of devinfo
- intel/perf: add support for dualsubslice count variable
- intel/perf: add support for GtSlice/GtSliceXDualsubsliceY variables
- intel/perf: stop overriding oa_format
- intel/perf: add OA A counter type
- intel/perf: add support new variable counting the number of EUs in slice0-3
- intel/perf: add max vfuncs
- intel/perf: add new layout for Gfx12.5 products
- intel/perf: disable sseu setting on Gfx12.5+
- intel/perf: deal with OA reports timestamp values on DG2
- intel/ds: fix compilation
- u_trace/anv/iris: drop cs argument for recording traces
- intel/disasm: add missing handling of <1;1,0>
- util/debug: add an enable parsing helper
- util/perf: add options to enable/disable tracepoints
- anv: move internal RT shaders to the internal cache
- nir/divergence: handle load_scratch_base_ptr
- nir/divergence: handle load_ray_num_dss_rt_stacks_intel
- anv: don't expose EXT_border_color_swizzle on gfx7
- intel/validator: validate dst/src types against devinfo support
- anv: fix index buffer emission
- vulkan/wsi: fix crash with debug names on swapchain
- intel/dev: add a helper to update memory info
- intel/dev: add a fallback when memory regions are not available
- anv: limit RT writes to number of color outputs
- anv: disable perf queries on non RCS engines
- anv: make apply_pipeline_layout/compute_push_layout visible to NIR debug
- anv: silence border color swizzle debug message
- clc: add new feature options for intel_clc
- intel/clc: enable fp16 & subgroups for GRL
- anv: disable non uniform indexing of UBOs
- anv: remove device features handling now in runtime
- anv: use the right helper to invalidate memory
- intel/fs: make sure memory writes have landed for thread dispatch
- intel/nir: temporarily disable opt_uniform_atomics for RT/CL
- anv/utrace: use a bo pool for utrace buffers
- intel/fs: ray query fix for global address
- nir/serialize: restore ray query variables
- isl: add new helper for format component compatibility
- anv: track if images can be fast clear with non-zero color
- anv: move gfx12+ CCS_E clear color restriction
- anv: deal with isl format swizzles for buffer views
- anv: validate image view lowered storage formats for storage
- anv: add a comment about handling buffer view swizzles on gfx7
- intel: protect against empty invalidate ranges
- anv: skip flush/invalidate faster
- spirv: switch to uint64 for rayquery internal type
- anv: ensure tile flush before streamout writes
- intel/fs: Set NonPerspectiveBarycentricEnable when the interpolator needs it.
- anv: remove local computation of dynamic states
- anv: simplify dynamic buffer count in pipeline layout
- anv: break up anv_pipeline_compile_graphics()
- anv: split graphics nir loading
- anv: rename internal function for consistency
- anv: move CreateGraphicsPipelines to common code
- anv: move CreateComputePipelines to common code
- anv: move CreateRayTracingPipelines to common code
- anv: reorder & document fields of anv_graphics_pipeline
- anv: name non dynamic state fields correctly
- anv: fix primitive topology dynamic state emission on gfx7
- anv: move restart index to gfx state
- anv: remove pipeline setting dynamic state
- intel/dev: deal with i915 unallocated_size on smem
- anv: allocate RT scratch in local memory
- anv: make use of the new smallbar uAPI
- anv: decode init batch with INTEL_DEBUG=bat
- intel/dev: move verification function to a header
- intel/tools/drm-shim: fixup eu_stride for topology
- intel/tools/stub: fixup parsing of --platform=
- intel/dev: add a test verifying that device override works
- intel/dev: remove INTEL_DEVID_OVERRIDE
- anv: fixup PIPE_CONTROL restriction on gfx8
- anv: don't return incorrect error code for vkCreateDescriptorPool
- anv: fix GetPipelineExecutableStatistics for ray tracing pipelines
- anv: fixup assertions on lowered storage formats
- anv: fix assert in memory budget code when extension is not supported
- intel/fs: fix load_scratch intrinsic
- intel/fs: fixup scratch load/store handling on Gfx12.5+
- intel/fs: fixup SEND validation check on overlapping src0/src1

Liviu Prodea (7):

- dzn: Fix incompatible pointer type error affecting MSYS2 MINGW32
|
||
- ci/MSVC: LLD is not really used anywhere
|
||
- ci/MinGW: LLD is not really used anywhere
|
||
- d3d12/dzn/spirv2dxil: Require version library
|
||
- Microsoft clc: strip lib prefix Otherwise OpenCLon12 ICD can't load it
|
||
- meson: Microsoft / maybe Intel CLC need the all-targets workaround just like clover
|
||
- meson: Only draw with llvm depends on native directly Tests, softpipe or AMD drivers don't depend on it directly
|
||
|
||
Louis-Francis Ratté-Boulianne (4):
|
||
|
||
- d3d12: Move d3d12_nir_lower_vs_vertex_conversion() to a common place
|
||
- microsoft/compiler: Add support for more scaled formats
|
||
- dzn: Allow some non-native formats to be used as vertex inputs
|
||
- dzn: Enable dEQP-VK.pipeline.vertex_input.* test cases
|
||
|
||
Lucas Stach (37):
|
||
|
||
- etnaviv: update headers from rnndb
|
||
- etnaviv: set VIVS_GL_VERTEX_ELEMENT_CONFIG depending on prim type
- etnaviv: don't supertile textures if supertiling is disabled via debug option
- etnaviv: fill all minor GPU features from the kernel
- etnaviv: update headers from rnndb
- etnaviv: correct bits per tile and clear value for HALTI5
- etnaviv: use new PE pipe address states on >= HALTI0
- etnaviv: use feature flag to determine which RS states to use
- etnaviv: use feature bit for one const src per instruction limitation
- etnaviv: clean up tiling setup in etna_compile_rs_state
- etnaviv: add support for big tile RS states
- etnaviv: properly set additional DEC400 compression states
- etnaviv: use feature bit to check for big tile support
- etnaviv: add tile size helper
- etnaviv: reinstate accidentally commented assert
- etnaviv: improve single buffer setup
- etnaviv: add linear PE support
- etnaviv: switch depth mode when linear PE is used
- etnaviv: expose real map buffer alignment
- etnaviv: allow mapped buffers during execution
- etnaviv: drm: always use hash to track BO index
- etnaviv: drm: don't cache mmap offset
- etnaviv: drm: make etna_bo_map thread safe
- etnaviv: drm: rename etna_drm_table_lock
- etnaviv: add function to get resource status
- etnaviv: rework resource status tracking (again)
- etnaviv: drop scanout assumption for resources allocated via modifier interface
- kmsro: add 'imx-lcdif' driver support
- etnaviv: move dummy BOs to screen
- etnaviv: add texture descriptor suballocator
- etnaviv: tex_desc: make error handling more consistent
- etnaviv: tex_desc: remove descriptor patch TODO comment
- etnaviv: fix use after free in async shader compile
- etnaviv: remove etna_align_up() function
- etnaviv: move checking for MC2.0 for TS into screen init
- etnaviv: use linear PE rendering only on properly aligned surfaces
- etnaviv: add debug option to disable linear PE feature

M Henning (7):

- nouveau/nir: Allow up to 6 nested joins
- nouveau: Emit cctl to flush L1 cache for atomics
- nouveau: Skip cctl for atomic counters in tgsi
- nouveau/nir: Split fewer 64-bit loads
- ttn: Make ttn_optimize_nir match gl_nir_opts
- nv50: Fix tlsSpace alignment
- nouveau: Drop C++03 compat code

Manas Chaudhary (2):

- panvk: Add PANVK_DEBUG_DUMP dump mappings
- panvk: Allow PAN_BO_INVISIBLE to be mapped with dump flag

Manuel Stoeckl (4):

- dri_interface: add XBGR16161616 format
- gbm: add GBM_FORMAT_[AX]BGR16161616
- dri: fix incorrect fourcc for R16G16B16A16_UNORM format
- dri: add [ax]bgr16161616 to format lookup tables

Marcin Ślusarz (40):

- anv: invalidate all metadata in anv_nir_lower_ubo_loads
- intel/compiler: invalidate all metadata in brw_nir_lower_intersection_shader
- intel/compiler: invalidate metadata in brw_nir_initialize_mue
- compiler: add VARYING_SLOT_CULL_PRIMITIVE
- intel/compiler: handle VARYING_SLOT_CULL_PRIMITIVE in mesh
- intel/compiler: handle gl_Viewport and gl_Layer in FS URB setup
- anv: update task/mesh distribution with the recommended values
- intel/compiler: print name of the unhandled intrinsic
- anv: disable streamout before emitting mesh shading state
- intel/compiler: implement primitive shading rate for mesh
- anv: handle primitive shading rate for mesh
- intel/tools: add support for GEM_CREATE_EXT in intel_dump_gpu
- intel/tools: clean up mmio regs definitions
- intel/tools: drop wrappers around mmio regs macros
- intel/tools/aubinator: add support for 2 "new" subopcodes
- intel/tools/aubinator: list all platforms in help message
- intel/tools: add macros for gfx12+ variant of VCSUNIT0
- nir: add load_mesh_inline_data_intel intrinsic to nir_divergence_analysis
- nir: add load_task_payload intrinsic to nir_divergence_analysis
- intel/tools: fix 32-bit build
- anv: remove redundant code calculating dynamic states mask
- anv: remove invalid copy/pasted comment
- anv: mask out not applicable state flags when setting up mesh pipeline
- intel/dev: fixup max_cs_workgroup_threads after hwconfig is read
- nir/opt_load_store_vectorize: handle task payloads
- intel/compiler: vectorize task payload loads/stores
- nir/lower_task_shader: insert barrier before/after shared memory read/write
- intel/compiler: assert that base is 0 for [load|store]_shared intrins
- intel: fix typos found by codespell
- intel/common: allocate space for at least one task urb
- intel/compiler: adjust task payload offsets as late as possible
- anv: disable injection of primitive shading rate for mesh
- intel/compiler: print shaders after nir_remove_unused_varyings
- nir: convert unused mesh outputs to shared memory
- intel/compiler: emit URB fences for TASK/MESH
- intel/compiler: there are 4 types of fences on gfx >= 12.5
- intel/compiler: insert URB fence before task/mesh termination
- intel/compiler: use NIR_PASS more
- intel/compiler: fix mesh urb write regression
- anv: disable task redistribution

Marek Olšák (311):

- gallium: move drivers/radeon/* into drivers/radeonsi/
- gallium: move radeon_winsys.h into gallium/include/winsys/
- gallium/winsys: move {amdgpu,radeon_drm}_public.h contents into radeon_winsys.h
- gallium/ddebug: implement pipe_vertex_state callbacks
- nir: fix an uninitialized variable valgrind warning in nir_group_loads
- ac/surface: fix an addrlib race condition on gfx9
- radeonsi: set more state bits for disabled binning
- radeonsi: don't compile monolithic PS for dual source blending
- radeonsi: add an SQTT workaround for chips with disabled RBs
- radeonsi: set PA_SC_EDGERULE according to bottom_edge_rule state
- radeonsi: initialize line stipple config registers
- radeonsi: fix ALPHA_IS_ON_MSB determination
- radeonsi: handle an out-of-bounds sampler/image index better
- radeonsi: don't treat skipped GS copy shader compilation as successful
- radeonsi: dump shaders after si_shader_binary_upload to fix printed LDS stat
- radeonsi: ignore AMD_DEBUG=nodcc if the texture is imported
- radeonsi: run nir_io_add_const_offset_to_base for TES/TCS as late optimizations
- radeonsi: scalarize IO instructions
- radeonsi: add pipe_stream_output_info into si_shader_context
- radeonsi: use nir_shader_get_entrypoint in si_nir_scan_shader
- radeonsi: apply key.ge.opt.kill_{outputs,pointsize,clipdistance} in NIR
- radeonsi: gather pipe_stream_output_info from NIR intrinsics
- radeonsi: move most "info" fields from si_shader_selector into si_shader_info
- radeonsi: move si_shader_info::stage into si_shader_selector
- ac: add ac_nir_optimize_outputs, a NIR version of ac_optimize_vs_outputs
- radeonsi: assign param export indices before compilation
- ac/llvm: remove LLVM pass ac_optimize_vs_outputs
- amd: document chips
- ac/surface/tests: generalize and extend gfx10 tests
- ac: add more non-shadowed registers to the lists
- ac/llvm: add AC_WAIT_EXP for ac_build_waitcnt
- ac/llvm: remove immoffset parameter from ac_build_tbuffer_store
- ac/llvm: remove immoffset parameter from ac_build_tbuffer_load
- ac/llvm: remove inst_offset parameter from ac_build_buffer_store_dword
- ac/llvm: use soffset correctly in ac_build_buffer_load
- ac/llvm: remove inst_offset parameter from ac_build_buffer_load
- ac/llvm: remove immoffset parameter from ac_build_tbuffer_load_byte/short
- ac/llvm: remove unused code from ac_llvm_build.c/h
- ac/llvm: don't use tbuffer_store as a fallback for swizzled stores
- ac/llvm: rename tbuffer -> buffer where tbuffer instructions are not generated
- ac,radeonsi: rework how scratch_waves is used and move it to ac_gpu_info.c
- ac/gpu_info: remove old and unused fields from radeon_info
- ac/llvm: remove unused LLVM helpers
- ac/llvm: don't create the target machine if the LLVM processor is unsupported
- radeonsi: remove an obsolete comment about LLVM 14
- radeonsi: fail to create pipe_screen if LLVM doesn't support the GPU
- winsys/amdgpu: fix a mutex deadlock when we fail to create pipe_screen
- radeonsi/ci: add polaris11 baseline
- util: add pipe formats R10G10B10X2_SINT, B10G10R10X2_SINT, G16R16_SINT
- util: change util_format_snorm8_to_sint8 -> snorm_to_sint for radeonsi
- ac/gpu_info: rework how num_se is derived
- ac: remove really_needs_scratch, parameter from ac_parse_shader_binary_config
- ac/llvm: set the correct cache policy for sparse buffer loads
- ac/llvm: don't set ac_dlc in visit_intrinsic
- ac: support GR channel order in ac_choose_spi_color_formats
- ac,radeonsi: report SCALED formats as unsupported by samplers and color buffers
- radeonsi: always print the lower-case GPU name for radeonsi-run-tests.py
- radeonsi: cleanup local includes
- radeonsi: align the tessellation ring address to 2MB
- radeonsi: remove unused si_cs_shader_state::uses_scratch
- radeonsi: simplify setting COMPUTE_PGM_RSRC2.SCRATCH_EN
- radeonsi: code style fix
- radeonsi: add si_can_use_compute_blit
- radeonsi: don't report R64_*INT as a sampler format because it doesn't work
- radeonsi: don't allow broken compute image copies into A8R8_UNORM
- radeonsi: add color buffer support for 2_10_10_10 pipe formats
- radeonsi: add color buffer support for R9G9B9E5_FLOAT on gfx10.3
- radeonsi: implement compute_copy_image between 1D_ARRAY and other texture types
- radeonsi: cosmetic changes in si_compute_copy_image
- radeonsi: assert format support in si_compute_copy_image
- radeonsi: don't expose formats that CB supports, but not shaders
- radeonsi: rename si_test_blit -> si_test_image_copy_region
- radeonsi/test_image_copy: remove whole image copies
- radeonsi/test_image_copy: remove linear and power-of-two size testing
- radeonsi/test_image_copy: remove special SDMA codepaths
- radeonsi/test_image_copy: simplify texture size generation
- radeonsi/test_image_copy: separate generating a random pipe_resource
- radeonsi/test_image_copy: separate printing pipe_resource into a function
- radeonsi/test_image_copy: test all interesting texture targets
- radeonsi/test_image_copy: test copying a block-compressed format
- radeonsi/test_image_copy: test mipmaps
- radeonsi/test_image_copy: test all formats
- radeonsi/ci: update sienna_cichlid results
- radeonsi: handle compressed formats in si_compute_copy_image
- radeonsi: improve the subsampled copy_image calculation
- radeonsi: remove compressed and subsampled gfx copy from resource_copy_region
- radeonsi: remove si_create_sampler_view_custom and related code
- radeonsi: remove si_create_surface_custom
- mesa: consider the sample count when choosing a texture format
- frontend/dri: allow swapped BGR->RGB channel order for MSAA color buffers
- amd/registers: hardcode GC base offsets in the json generator
- amd/registers: add gfx11 to the json generator
- amd/registers: add gfx11.json
- amd/registers: add gfx11-rsrc.json
- amd: enable gfx11 in header generator, fix drivers with renamed gfx6-10 defs
- amd: add gfx11 to packet definitions
- amd: import gfx11 addrlib
- amd: add Mesa-only addrlib changes
- ac: don't align VGPRs to 8 or 16 for gfx11
- ac: implement ac_get_tbuffer_format for gfx11
- ac: scratch buffer register changes for gfx11
- ac: implement register shadowing for gfx11
- ac/llvm: export mrt0 instead of null on gfx11
- ac/llvm: don't set DLC on gfx11 because it means something else there
- ac/llvm: update pknorm and waitcnt for gfx11
- ac/llvm: don't set GLC for stores on gfx11
- ac/gpu_info: set cu_mask correctly for gfx11
- ac/surface: gfx11 changes
- ac/surface: define gfx11 modifiers
- ac/surface: add gfx11 support to modifiers tests
- radeonsi/gfx11: increase the hw screen offset alignment
- radeonsi/gfx11: always allow DCC stores
- radeonsi/gfx11: expect packed threadID VGPRs
- radeonsi/gfx11: enable NGG-only draw paths
- radeonsi/gfx11: enable arbitrary DCC format reinterpretation
- radeonsi/gfx11: use the new TCS WaveID SGPR to compute vs_rel_patch_id
- radeonsi/gfx11: program inst_pref_size for graphics
- radeonsi/gfx11: buffer descriptor changes
- radeonsi/gfx11: add CB deltas
- ac,radeonsi/gfx11: remove FMASK loads
- ac,radeonsi/gfx11: set SWIZZLE_ENABLE correctly
- radeonsi/gfx11: don't set non-existent SPI_SHADER_USER_DATA_VS_x
- radeonsi/gfx11: don't set COMPR for exports, use 0x3 channel mask instead
- radeonsi/gfx11: implement attributes through memory
- radeonsi/gfx11: scattered register deltas
- radeonsi/gfx11: don't set non-existent CP_COHER_START_DELAY
- radeonsi/gfx11: emit SQ_NON_EVENT for tessellation at the end of IBs
- radeonsi/gfx11: don't use FLUSH_AND_INV_DB_META
- radeonsi/gfx11: TF_RING_SIZE changed to a per-SE size
- radeonsi/gfx11: limit MSAA color buffers to the RGBA channel order
- radeonsi/gfx11: update the initialization of SGPR0/1 registers for HS and GS
- radeonsi/gfx11: change LDS allocation granularity for PS
- radeonsi/gfx11: don't count the non-existent scratch_byte_offset SGPR
- radeonsi/gfx11: resolve MSAA using u_blitter
- radeonsi/gfx11: mark streamout as unimplemented for now
- radeonsi/gfx11: limit CP DMA to max 32KB sizes
- radeonsi: inline si_cp_dma_prefetch in si_draw_vbo for lower overhead
- radeonsi/gfx11: add a workaround for CB perf counters
- amd: add chip identification for gfx1100-1103
- radeonsi: remove tautologies when setting CU_EN
- amd: rename ring_type --> amd_ip_type and match the kernel enum values
- amd: rename RING_* enums to AMD_IP_*
- amd: replace num_rings[type] with ip[type].num_queues
- ac/gpu_info: simplify HW IP querying
- ac/gpu_info: remove dead GDS query code
- ac/gpu_info: print all IP versions reported by the kernel
- ac/gpu_info: fix incorrect IP versions reported by the kernel
- radeonsi: report 10_10_10_2 scaled formats as unsupported on gfx6-9 as well
- radeonsi: report more missing/broken texture formats as unsupported on gfx6-9
- ac/llvm: implement memory_barrier_tcs_patch properly to fix LLVM 15
- radeonsi: wait before s_barrier in TCS epilog to fix LLVM 15
- ac/llvm: inline emit_membar
- ac/llvm: replace LLVMBuildFence with ac_build_waitcnt(VSTORE)
- ac/llvm: add gl_shader_stage parameter into ac_build_s_barrier
- ac/llvm: move the gfx6 optimization for TCS barriers into ac_build_s_barrier
- ac/llvm: fix the remaining s_barriers for LLVM 15
- radeonsi: add si_get_tcs_epilog_key
- ac/llvm: skip s_barrier if tess patches don't cross a wave boundary
- radeonsi: inline si_nir_build_llvm
- ac/llvm: remove ac_shader_abi::emit_outputs
- radeonsi: cleanups getting rid of the ambiguous "epilogue" word
- radeonsi: get rid of the ambiguous "prologue" word
- radeonsi: try to group stage-specific code in si_llvm_translate_nir
- ac: replace 5 ac_shader_abi::load_* callbacks with 1 intrinsic_load callback
- amd: change chip_class naming to "enum amd_gfx_level gfx_level"
- radeonsi: rename RADEON_FLAG_UNCACHED -> RADEON_FLAG_GL2_BYPASS
- winsys/amdgpu-radeon: rework how we describe heaps
- winsys/amdgpu: handle RADEON_FLAG_ENCRYPTED as a heap modifier
- radeonsi: use PIPE_RESOURCE_FLAG_UNMAPPABLE and DRIVER_INTERNAL more
- winsys/amdgpu-radeon: clean up code related to heaps
- include: update amdgpu_drm.h
- radeonsi: use the new flag AMDGPU_GEM_CREATE_DISCARDABLE
- radeonsi: use AMDGPU_VM_PAGE_NOALLOC to disable MALL (infinity cache)
- radeonsi: fix a crash when failing to create a context
- radeonsi: print an error when failing to create a context
- radeonsi/gfx11: set BIG_PAGE for the attribute ring
- ac/gpu_info: silence a valgrind warning due to amdgpu_query_hw_ip_info
- radeonsi: remove GFX9_MERGED_NUM_USER_SGPR definition
- ac/gfx11: fix the scratch buffer
- radeonsi/gfx11: fix alpha-to-coverage with stencil or samplemask export
- radeonsi/gfx11: don't insert shader code for GS_PIPELINE_STATS_EMU
- radeonsi/gfx11: fix the value of VGT_GS_OUT_PRIM_TYPE at the beginning of IBs
- Revert "radeonsi/gfx11: limit MSAA color buffers to the RGBA channel order"
- radeonsi/gfx11: fix VM faults due to the attribute ring
- radeonsi/gfx11: optimize attribute stores
- radeonsi: merge all preamble states into one
- radeonsi: move CS preamble emission into the winsys
- winsys/amdgpu: initialize IB_PREAMBLE in advance
- radeonsi: fix resource_copy_region with ETC formats (e.g. for Stoney)
- Revert "frontend/dri: allow swapped BGR->RGB channel order for MSAA color buffers"
- Revert "mesa: consider the sample count when choosing a texture format"
- amd: rename fishes to Navi21, Navi22, Navi23, Navi24, and Rembrandt
- r300,r600,svga: save the FS constant buffer for u_blitter to fix a regression
- st/mesa: remove unused GENERIC input and output from the clear VS
- ac/gpu_info: always retile DCC on gfx10 and newer chips
- mesa: clamp UINT values greater than INT_MAX for glGetInteger
- mesa: change GL_MAX_COMPUTE_WORK_GROUP_COUNT to UINT
- mesa: change GL_MAX_UNIFORM_BLOCK_SIZE to UINT
- mesa: change GL_MAX_TEXTURE_BUFFER_SIZE to UINT
- mesa: change GL_MAX_SHADER_STORAGE_BLOCK_SIZE to UINT
- st/mesa: limit MaxComputeWorkGroupCount to INT_MAX - 1 due to failing tests
- st/mesa: round down MaxUniformBlockSize to a multiple of 4 for piglit
- gallium: rename PIPE_SHADER_CAP_MAX_CONST_BUFFER_SIZE to \*_BUFFER0_*
- gallium: add PIPE_CAP_MAX_CONSTANT_BUFFER_SIZE_UINT for UBOs
- gallium: rename PIPE_CAP_MAX_TEXTURE_BUFFER_SIZE->MAX_TEXEL_BUFFER_ELEMENTS_UINT
- gallium: rename PIPE_CAP_MAX_SHADER_BUFFER_SIZE -> \*_UINT
- radeonsi: increase the max compute LDS size to 64KB for gfx7+
- radeonsi: report correct maximum compute grid sizes
- radeonsi: clamp against MAX_TEXEL_BUFFER_ELEMENTS correctly
- radeonsi: change max TBO/SSBO sizes again and rework max alloc size
- radeonsi: compute PIPE_CAP_MAX_TEXEL_BUFFER_ELEMENTS_UINT correctly
- radeonsi: set the max UBO size same as the max SSBO size
- winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+
- radeonsi: print LDS size in bytes
- radeonsi: don't use info.gs.invocations if it's not GS
- radeonsi: fix uninitialized wait_mem_scratch_tmz
- radeonsi: remove unused dword from wait_mem_scratch
- radeonsi: fix polygon stippling without color and Z outputs (v2)
- radeonsi: restructure PS no-export fixups
- radeonsi: determine DB_SHADER_CONTROL in si_shader_ps
- radeonsi: set INTERPOLATE_COMP_Z to 0 to work around an EQAA bug
- radeonsi: add BREAK_BATCH at the beginning of IBs
- radeonsi: simplify how pipeline statistic offsets are computed
- radeonsi: rework how VS_STATE_BITS are set for VS, TES, and GS
- radeonsi: rename and regroup VS_STATE definitions
- radeonsi: move GS_STATE bits to the end to make space at the beginning
- radeonsi: rework how vs_state_bits is set and unpacked
- radeonsi: fix an NGG streamout hang with monolithic shaders
- radeonsi: fix a crash in gfx10_sh_query_get_result_resource
- radeonsi: unconditionally enable the streamout overflow query with NGG
- radeonsi: inline gfx10_emit_streamout_begin/end
- radeonsi: fix NGG streamout hang by allocating GDS in the right place
- radeonsi: remove streamout code from shaders if no streamout buffers are bound
- winsys/amdgpu: add a kernel GDS management workaround retrying on -ENOMEM
- radeonsi: allocate GDS only once per process
- radeonsi: allocate only 1 GDS OA counter for gfx10 NGG streamout
- ac,radeonsi: don't export null from PS if it has no effect on gfx10+
- ac/gpu_info: clamp gart_size_kb and vram_size_kb to fix buggy kernel driver
- gallium/u_threaded: fix buffer busyness tracking
- ac/llvm: fix tcs_wave_id unpacking on gfx11
- amd/gfx11: add PixelWaitSync packet fields
- radeonsi/gfx11: compile monolithic PS if it writes memory
- radeonsi/gfx11: allocate more space for pipeline statistics
- radeonsi/gfx11: synchronize correctly before setting SPI_ATTRIBUTE_RING_*
- radeonsi/gfx11: don't use memory for waiting for cache flushes
- radeonsi/gfx11: don't allocate unused wait_mem_scratch
- radeonsi/gfx11: drop the ES vertex count requirement
- radeonsi/gfx11: add missing register shadowing code
- radeonsi/gfx11: rework GDS streamout code to single-lane and enable streamout
- amd: update addrlib - trivial changes
- ac/surface: expose all 64K_R_X and 256K_R_X modifiers on gfx11
- radeonsi: fix random PS wave size
- util: remove our redefinition of assert
- Change all debug_assert calls to assert
- util: remove now unused debug_assert
- ttn: set dest_type for TXQ
- gallium: add pipe_blit_info::dst_sample, update u_blitter to write that sample
- ac/gpu_info: rename info fields to num_cu, memory_bus_width, memory_freq_mhz
- ac/gpu_info: remove tabs
- ac/gpu_info: rework and extend device info to be more informative
- ac/gpu_info: use hw_ip::ip_discovery_version to set IP versions
- ac/gpu_info: use the kernel-reported GFX IP version to set gfx_level
- ac/gpu_info: remove vram_size and gtt_size in favor of \*_kb variants
- ac/gpu_info: remove redundant uvd_decode
- ac/gpu_info: remove redundant jpeg_decode
- ac/gpu_info: remove redundant uvd_encode
- ac/gpu_info: remove redundant vce_encode
- ac/gpu_info: remove redundant vcn_encode
- amd: require amdgpu DRM 3.2.0 from April 2016
- gallium/radeon: require radeon DRM 2.45.0 from April 2016
- radeonsi: remove workarounds for radeon DRM < 2.45.0
- ac/gpu_info: use drm_amdgpu_device_info instead of amdgpu_gpu_info
- ac/gpu_info: remove amdgpu_gpu_info parameter from ac_query_gpu_info
- ac/gpu_info: remove unused has_unaligned_shader_loads
- gallium/radeon: require radeon DRM 2.50.0 (kernel 4.12) from July 2017
- ac/gpu_info: require amdgpu DRM 3.15.0 (kernel 4.12) from July 2017
- util/u_threaded: fix a CPU storage crash due to an invalid codepath taken
- glthread: don't ignore glPushAttrib/glPopAttrib when tracking GL_CULL_FACE
- glthread: don't sync on IsEnabled(GL_DEPTH_TEST) by tracking it in glthread
- util/format: add util_format_rgbx_to_rgba helper
- amd/addrlib: fix 3D texture allocation failures on gfx11
- ac/surface: don't forbid 256KB swizzle modes on smaller gfx11 chips
- radeonsi: unify VGT_TESS_DISTRIBUTION programming
- radeonsi: move SI_MAX_VRAM_MAP_SIZE to si_debug_options.h
- radeonsi: call pipe->blit instead of util_blitter_blit after MSAA resolving
- radeonsi: make si_launch_grid_internal static
- radeonsi: add common helper si_launch_grid_internal_images that is more robust
- radeonsi: remove compute-based DCC decompression because it's broken
- radeonsi: don't do image stores with RGBX, L, LA, I, and SRGB formats
- radeonsi: follow shader_info.float_controls_execution_mode (mostly)
- radeonsi: add need_fmask_expand parameter into si_decompress_subresource
- radeonsi: check for 16-bit hw support instead of relying on options.fp16
- radeonsi: move compute-related code from si_blit.c to si_compute_blit.c
- radeonsi: fold async_copy into the preceding conditional in si_blit
- radeonsi: cosmetic changes around do_hardware_msaa_resolve
- radeonsi/gfx11: use correct VGT_TESS_DISTRIBUTION settings
- radeonsi/gfx11: set SAMPLE_MASK_TRACKER_WATERMARK = 15 and clean up
- radeonsi/gfx11: use better PRIM_GRP_SIZE_GFX11 setting
- radeonsi/gfx11: skip code in si_update_shaders that has no effect
- radeonsi/gfx11: rename si_calc_inst_pref_size -> si_get_shader_prefetch_size
- radeonsi/gfx11: enable shader prefetch except for initial chip revisions
- radeonsi/gfx11: use a better workaround for the export conflict bug
- radeonsi: make various blit functions non-static
- radeonsi: allow texture_map to upload only 1 sample for MSAA instead of all
- radeonsi: add a randomized blit test
- radeonsi: move the no-AA small prim precision cull constant into an SGPR
- radeonsi: move small prim precision computation out of si_emit_cull_state
- radeonsi: don't assume that TC_ACTION_ENA invalidates L1 cache on gfx9
- glthread: unbind framebuffers in glDeleteFramebuffers
- glthread: call _mesa_glthread_DeleteBuffers unconditionally
- ac/surface: disallow 256KB swizzle modes on gfx11 APUs

Marek Vasut (3):

- dri: Check buffer height and avoid overflow
- etnaviv: Rename etna_emit_tex() args
- etnaviv: Implement TXD

Mark Collins (1):

- tu: Clamp priority in DRM submitqueue creation

Mark Janes (11):

- intel: parse intel_measure environment without side effects
- intel: relax assertion in INTEL_MEASURE
- anv: fix performance bug in INTEL_MEASURE
- intel: provide a callback to clean up after intel_measure_gather
- iris: provide a callback to INTEL_MEASURE to clean up snapshots
- util: define helpful macros for compiler diagnostic features
- iris: pad all structures used in a shader key
- iris: reorder to minimize padding
- intel/compiler: pad all data structures used by shader cache keys
- intel/compiler: reorder shader cache keys to minimize padding
- intel/dev: drop warning for unhandled hwconfig keys

Martin Roukala (né Peres) (27):

- ci/b2c: fix the generation of the IMAGE_UNDER_TEST variable
- ci/b2c: bump the kernel to 5.17.1
- ci/b2c: update b2c to enable monitoring support
- radv/ci: add a Van Gogh dEQP runner
- ci: add ACO_DEBUG to the list of variables to pass down for testing
- ci/radv: remove amdgpu.gpu_recovery=0 from the kernel cmdline
- radv/ci: reboot the DUTs when hitting a ring timeout
- radv/ci: reduce the execution timeout
- radv/ci: re-enable fossils testing
- radv/ci: enable the experimental support for mesh shaders
- radv/ci: update the list of flakes
- radv/ci: allow polaris10 to retry more times
- radv/ci: bump the timeout of vangogh/renoir
- radv/ci: update the vkcts expectations for the CI machines
- radv/ci: remove an expected fail from all families
- radv/ci: try to guess failure expectations for Fiji/Raven/Navi14/Navi22
- ci/valve: do not extract the install tarball on the runner
- ci/valve: uprev boot2container
- Revert "ci/valve: uprev boot2container"
- radv/ci: test vkd3d on kabini
- ci/radv: move a test from the fail list to the flake list
- ci/radv: add a missing flake
- ci/radv: enable vkcts testing on kabini
- ci/x86: update to llvm 13
- radv/ci: test the llvm backend on navi21
- radv/ci: allow specifying a warning pattern
- radv/ci: do not reboot on soft-recovered hangs, just warn

Matt Coster (20):

- pvr: csbgen: Formatting pass (PEP-8 plus other minor changes)
- pvr: csbgen: Add a missing @staticmethod decorator
- pvr: csbgen: Make some loops more pythonic
- pvr: csbgen: Cleanup imports
- pvr: csbgen: Remove "dim" functionality from Group
- pvr: csbgen: Remove unused function parameters
- pvr: csbgen: Add typing information
- pvr: csbgen: Isolate "main" function with __name__ check
- pvr: csbgen: Replace map() and filter() with list comprehension
- pvr: csbgen: Fix "local variable may be undefined" lints
- pvr: csbgen: Add \*_unpack() functions for all generated struct types
- pvr: debug: Implement PVR_DEBUG variable
- pvr: csbgen: Make all generated enums unambiguous
- pvr: Add a const qualifier to inline pvr_csb_*() helpers
- pvr: Remove pvr_cmd_struct() macro
- pvr: Make pvr_cmd_pack() macro clearly internal
- pvr: Split out unioned structs from struct pvr_sub_cmd
- pvr: Implicitly assert that the correct sub-command type is present
- pvr: Split pvr_dev_addr_t into a separate header
- pvr: Add helper macros for creating pvr_dev_addr_t instances

Matt Turner (3):

- mesa: Drop dead #include "sparc/sparc.h"
- freedreno/ir3: Move the texture array coord fixup to nir
- freedreno/ir3: Pass 16-bit sampler coordinates when possible.

Matti Hamalainen (7):

- gallium/tools: fixes to option handling
- gallium/tools: add option for ignoring junk calls in trace dumper
- gallium/tools: reimplement tracediff completely in Python
- pytracediff: add per-line difference highlighting for blocks
- pytracediff: make -M ("method only") option print arguments for differing calls
- pytracediff: change how 'junk' calls are handled
- pytracediff: implement pager ('less') invocation internally

Max Kellermann (2):

- gallium/u_threaded: add missing reference counts for draw_multi slots
- gallium/u_threaded: fix offset calculation for draw_multi slots

Michael Olbrich (1):

- crocus: export GEM handle with RDWR access rights

Michael Skorokhodov (2):

- glsl: Fix ir_quadop_vector validation
- anv: Update line range

Michel Zou (7):

- gallium: fix unused symbols warnings
- meson: call run_command with check=true
- vulkan/wsi: fix missing unistd include
- zink: fix pointer size conversion warning
- lavapipe: allow null handleTypes
- meson: drop dladdr check on win32
- util/xmlconfig: fix -Wmaybe-uninitialized warning

Mihai Preda (28):

- gallium: add opaque pointers shim for LLVM < 8.0
- gallium: use LLVM opaque pointers in draw_llvm.c
- gallium/llvmpipe: use LLVM opaque pointers in lp_bld_interp.c
- gallium/llvmpipe: use LLVM opaque pointers in lp_bld_depth.c
- gallivm: use LLVM opaque pointers in lp_bld_arit.c
- gallivm: use LLVM opaque pointers in lp_bld_assert.c
- gallivm: use LLVM opaque pointers in lp_bld_format_aos.c
- gallivm: use LLVM opaque pointers in lp_bld_conv.c
- gallivm: use LLVM opaque pointers in lp_bld_coro.c
- gallivm: use LLVM opaque pointers in lp_bld_printf.c
- gallivm: use LLVM opaque pointers in lp_bld_flow.c
- gallivm: use LLVM opaque pointers in lp_bld_intr.c
- gallivm: use LLVM opaque pointers in lp_bld_format_s3tc.c
- gallivm: use LLVM opaque pointers in lp_bld_gather.c
- gallivm: use LLVM opaque pointers in lp_bld_nir_soa.c
- gallivm: use LLVM opaque pointers in lp_bld_sample.c
- gallivm: use LLVM opaque pointers in lp_bld_sample_soa.c
- gallivm: use LLVM opaque pointers in lp_bld_struct.c
- gallivm: use LLVM opaque pointers in lp_bld_tgsi_soa.c
- gallivm: use LLVM opaque pointers in lp_bld_tgsi_aos.c
- gallivm: LLVM opaque pointers small changes
- gallium: refactor a channel loop in draw_llvm.c
- Revert "gallivm: use LLVM opaque pointers in lp_bld_tgsi_soa.c"
- gallium: LLVM-15 contexts use non-opaque pointers
- gallivm: fix a few llvm non-opaque pointers
- gallivm: refactor a bit the cache access in view of LLVM opaque pointers
- gallivm: add lp_build_struct_get() variants that take the LLVM type
- gallivm: push LLVM version guard into assert

Mike Blumenkrantz (634):

- nir/lower_tex: fix rect queries with lower_rect set
- mesa/st: set normalized coords for RECT samplers if rects are unsupported
- zink: unset PIPE_CAP_TEXRECT
- zink: radv baseline fix
- zink: nv ci update
- zink: clamp max shader images to 32
- zink: remove tcs patch slot map
- zink: disable EXT_extended_dynamic_state2 on AMDPRO
- zink: add baseline for amdpro
- zink: reject resource creation if format features don't match attachment
- zink: use mixed zs renderpass for depth read/write
- zink: unconditionally set line width on rasterizer state change
- zink: support restart with PIPE_PRIM_LINES_ADJACENCY if ext is available
- zink: fix extended restart prim types without dynamic state2
- zink: do not create fences at all if timeline semaphores are supported
- zink: require timeline semaphores
- zink: remove batch lock
- zink: rename zink_query::batch_id
- zink: remove driver-based max_fences throttling
- zink: hook up VK_KHR_create_renderpass2
- zink: only use VK_DEPENDENCY_BY_REGION_BIT if sync2 is available
- zink: require renderpass2
- zink: require KHR_imageless_framebuffer
- zink: delete all non-imageless framebuffer code
- zink: remove framebuffer indirection
- zink: rename imageless framebuffer functions
- zink: remove the worst part of basic framebuffer support
- zink: remove ZINK_NO_TIMELINES
- zink: make a kopper debug print into an error
- zink: rework zink_kopper_update() assert
- zink: always bind gfx pipeline at the top of draw
- zink: store min required stride values on the vertex state
- zink: selectively disable dynamic vertex stride
- nir/fold_16bit_sampler_conversions: add a mask for supported sampler dims
- lavapipe: run nir_fold_16bit_sampler_conversions
- zink: set VK_QUERY_RESULT_WAIT_BIT when copying to qbo
- zink: NV_linear_color_attachment
- zink: nv ci update
- zink: fix synchronization when drawing from streamout
- zink: fix xfb counter buffer barriers
- zink: remove xfb_barrier flag
- zink: handle device-local unsynchronized maps
- util/draw: fix map size of indirect buffer in util_draw_indirect_read
- util/draw: handle draw_count=0 when reading indirect parameters
- util/draw: fix indirect draw count readback
- gallium: add PIPE_CAP_MULTI_DRAW_INDIRECT_PARTIAL_STRIDE
- zink: unset PIPE_CAP_MULTI_DRAW_INDIRECT_PARTIAL_STRIDE
- zink: move the kopper present fence to the displaytarget object
- wgl: support GL 4.6
- zink: make unsupported blit formats an error
- zink: make update_framebuffer_state() public
- zink: rework clear_depth_stencil hook
- zink: add a stencil blit fallback path
- zink: fix tcs control barriers for use without vk memory model
- zink: fix semantics mask for compute control barriers
- zink: hook up VK_KHR_vulkan_memory_model
- zink: use vulkan memory model shader semantics for tcs barriers
- zink: always use vk mem model for shaders if available
- zink: fix memory_barrier intrinsic
- zink: fix stride=0 for dynamic state vertex stride checking
- zink: add synchronization for buffer clears
- mesa/st: clamp GL_RENDERBUFFER to GL_TEXTURE_2D for sparse queries
- aux/trace: make get_sparse_texture_virtual_page_size useful
- glsl/nir: set new_style_shadow for sparse tex ops as necessary
- zink: fix group memory barrier emission
- vulkan: bump layer api versions to current vk header version
- vk/cache: unbreak vk_pipeline_cache_create with flags
- kopper: add DISPLAY_TARGET bind for depth buffer
- zink: fix/improve swapchain surface info updating
- zink: fix up swapchain depth buffer geometry during fb update
- drisw: remove dead code
- kopper: always fetch and store drawable info
- kopper: move drawable geometry updating up in function
- kopper: store whether screen has dmabuf support
- kopper: copy a bunch of code for texture_from_pixmap
- zink: add supported present modes to kopper displaytarget
- zink: add a util function for finding swapchain resource
- zink: flag swapchains when updating fails
- zink: handle dead swapchains in acquire
- zink: unset image layout when killing swapchain
- zink: rework swapchain object check for acquires
- zink: fix return for zink_kopper_acquire
- zink: handle swapchain acquire failures more directly
- zink: flag swapchain resources as swapchains
- zink/kopper: add a mechanism for checking swapchain status
- zink: radv ci updates
- ac: remove gallivm header include
- kopper: add some error logging for pixmap->texture failure
- kopper: check for modifiers to use modifier functionality
- zink/kopper: add a function for determining if running on software
- kopper: check whether zink is using sw
- kopper: rename a confusing variable
- kopper: implement texture-from-pixmap for software (non-dmabuf)
- zink: fix dmabuf plane returns
- zink: export fd info for all 2d images
- zink: ignore KMS handle types
- zink: remove drm_fd
- llvmpipe: add a ci flake
- llvmpipe: disable statistic queries during draw if queries are disabled
- llvmpipe: remove misleading query comment
- llvmpipe: disable compute statistics queries if queries are disabled
- wgl: always set alpha on kopper windows
- zink: hook up VK_EXT_primitives_generated_query
- zink: pass query object to get_num_queries()
- zink: pass query object to get_num_query_pools()
- zink: pass query object to get_num_results()
- zink: pass screen param to convert_query_type()
- zink: add and use a function to detected emulated primgen queries
- zink: add a flag to zink_query to trigger rasterizer discard workaround
- zink: use VK_EXT_primitives_generated_query when available
- wgl: don't auto-load zink before software drivers
- lavapipe: lower quad_broadcast intrinsics
- zink: add a ci flake
- kopper: invalidate drawables when resizing textures in place
- kopper: pass the current context to dri_flush
- util/blitter: fix sampler restore with 0 saved samplers
- radv: emit fewer framebuffer registers
- zink: restore conditional ordering for query begin/end
- nir/gather_info: flag fbfetch on subpass image loads
- llvmpipe: never infer early zs tests when fbfetch is active
- zink: split renderpasses for TextureBarrierNV() usage
- zink: scale depth bias by factor of 2
- zink: pass sparse backing page offset to binding function
- zink: pass sparse bind bo offset through for texture binds
- zink: set all usage flags when querying sparse features
- zink: fix multisample conditional in sparse image query
- zink: bump number of image binds that can be batched to 50
- zink: semi-handle 1D sparse texture rewrites for drivers that don't support them
- zink: decouple descriptor templates from layouts
- zink: flatten push descriptor template into normal template array
- zink: implement indirect buffer indexing
- zink: delete nir_lower_dynamic_bo_access
- lavapipe: enqueue pipeline destruction
- zink: fix null buffer/surface formats
- zink: add more image usage for null surfaces
- zink: use descriptor surfaces for notemplates ref updating
- lavapipe: handle null samplerviews
- gallivm: fix ssbo oob reads
- lavapipe: fix CmdPushDescriptorSetWithTemplateKHR with refcounting
- st/draw_feedback: set constant buffer stride
- gallivm/draw: fix oob ubo reads
- llvmpipe: always set ssbo data pointers for draw
- gallivm: fix oob txf swizzling
- lavapipe: set robustBufferAccess2 and robustImageAccess2
- lavapipe: ci updates
- docs: EXT_robustness2 for lavapipe
- zink: ci updates
- kopper: remove unused zink include
- zink: remove unused headers/struct members/linkage
- zink: outdent some code
- zink: add a flag to indicate whether a resource is exportable
- zink: break out resource bind-adding into separate function for reuse
- zink: introduce copy contexts
- zink: create a copy context for the screen on init
- Revert "zink: export fd info for all 2d images"
- zink: always check for fb rebinds when starting renderpass
- zink: unset pipe_resource::next pointers when creating resource copies
- zink: add a bind flag to indicate a resource is exported as a dmabuf
- zink: fix conditional for modifier usage
- zink: force memory exports if dmabuf bind is specified
- zink: add a LINEAR drm modifier if rebinding to add dmabuf export
- zink: rebind resources for export as needed
- zink: create images with modifiers any time there is an import handle
- zink: fix up sparse texture sampling for shadow samplers
- zink: clamp out min_lod operands for explicit lod ops
- zink: fix sparse binding for arrayed textures
- zink: set sparse flag in cubemap lowering
- zink: remove misleading sparse comment
- zink: fix sparse texture depth calcs for arrayed textures
- zink: remove implicit query resets
- zink: remove refs from shader images
- zink: reuse local res pointer in set_shader_images
- zink: simplify dumb update flagging in set_shader_images
- zink: read shader image r/w usage from incoming data struct
- zink: copy incoming shader image struct after doing updates
- zink: stop leaking shader image surfaces
- zink: fix 3d shader image miplevel calc for depth
- zink: directly reuse surface ivci when rebinding
- gallium/noop: implement pipe_screen::create_fence_win32
- aux/trace: screen::create_fence_win32
- zink: add win32 guards for device extensions with win32 in name
- lavapipe: ignore depth/stencil states if attachment is not present
- lavapipe: fix pipeline handling for dynamic render zs state with pipeline library
- spirv: fix barrier scope assert
- zink: manually validate shaders in debug builds
- zink: fix init with MESA_SHADER_CACHE_DISABLE enabled
- zink: use copy context to eliminate dependency on EXT_calibrated_timestamps
- zink: move get_timestamp to screen
- zink: add tu baseline for a630
- zink: add an anv-tgl ci job
- zink: never create a sw context unless sw is requested
- zink: only reject non-blittable resource creation if it has no other binds
- zink: add separate error message for push descriptor set alloc fail
- zink: add extra validation for resource creation
- zink: remove input attachment usage when pruning surface usage
- zink: add extended usage and retry if resource creation fails
- zink: fix surface/bufferview cache comparisons
- zink: force render target usage for sampler-only resources
- zink: clamp 1D_ARRAY sparse textures to 2D as needed
- zink: handle PIPE_BUFFER sparse texture queries
- zink: fix non-dynamic vertex stride update flagging
- zink: add c++ guards for zink_screen.h
- zink: add ZINK_DEBUG=sync
- zink: update radv baseline
- zink: rename a fb surface variables
- zink: always use zink_batch_rp to init renderpass
- zink: use zink_batch_no_rp to end renderpasses
- zink: move renderpass handling to zink_render_pass.c
- zink: use global image rebind counter for dmabuf export
- zink: remove fb rebind check during renderpass begin
- zink: use local variable for getting rt barrier info
- zink: break out sample location updating to separate function
- zink: break up swapchain renderpass update to util function
- zink: remove renderpass state flag for swapchain
- zink: pass rt attrib to zink_render_pass_attachment_get_barrier_info()
- zink: fix indentation
- zink: break out color rt attrib analysis into util function
- zink: break out zs rt attrib analysis into util function
- zink: track whether there's any transient attachments bound
- zink: lift some renderpass mechanics up a level in the api
- zink: split out scissor flagging from framebuffer updating
- zink: use dynamic rendering (most of the time)
- turnip: fix assert for max xfb outputs
- zink: only warn once for SRGB fb without KHR_swapchain_mutable_format
- zink: clamp dynamic render layerCount to 1
- zink: create all descriptor ref templates at once
- zink: add screen-based indexing to descriptor sets
- zink: use descriptor indices in compiler
- zink: use descriptor set index indirection in program init
- zink: use descriptor set index indirection in cache state updates
- zink: flag descriptor sets as changed before updating hashes
- zink: add a "compact" descriptor mode
- zink: use XOR for descriptor hash accumulation
- zink: radv ci updates
- zink: use a zink_render_pass_pipeline_state bit for fbfetch
- zink: add renderpass bits for color/depth r/w
- zink: update radv piglit fails
- vtn: assert that vector shuffle indices are in-bounds
- vtn: assert that composite members have the same bit size as the result
- zink: require draw params ext/cap in ntv if drawid is used
- zink: emit spirv cap for subgroup vote ops
- zink: use correct scope for subgroup vote ops
- zink: fix subgroup vote detection
- zink: lower subgroup width to 1 for unsupported subgroup vote stages
- zink: remove first_frame stalling
- zink: drop wideLines requirement
- zink: drop largePoints requirement
- st/pbo_compute: do pbo readback directly to the buffer object if it exists
- st/pbo_compute: use different calc for non-3d compute buffer sizing
- st/pbo_compute: make compute download conditional in shader slightly more readable
- st/pbo_compute: fix z coords for compute pbos
- zink: only infer modifiers if winsys handle has a stride
- zink: drop requirement for 6 descriptorsets to use bindless
- zink: store the real (non-compact) binding usage for programs
- zink: fix change flagging for compact descriptor cache
- zink: flag all cached descriptors as needing update on program change
- zink: add a #define for descriptor compaction
- zink: make swapchain kill message more descriptive
- st/texture: use base teximage for compute blits
- st/texture: allow compute blits for (some) non-finalized textures
- lavapipe: fix dynamic patchControlPoints
- mesa: throw a log warning any time a fallback texture is used
- zink: fix radv piglit fails
- zink: force queries to start/end out of renderpass
- zink: tu ci updates
- zink: reorder pipeline dynamic state1 substruct
- zink: improve packing for pipeline dynamic state1 info
- zink: break out dynamic state1 pipeline info to struct
- zink: use memcmp for comparing dynamic state1 info
- zink: use dynamic cull mode
- zink: break out pipeline dynamic state2 into struct
- zink: use dynamic patch vertices if available
- zink: expand patch_vertices pipeline key bitsize
- zink: fix framebuffer attachment usage asserts for dummy attachments
- aux/trace: fix sampler view dumping
- aux/trace: implement pipe_screen::is_compute_copy_faster
- zink: disable compute pbos on turnip
- egl/wayland: manually swap backbuffer when using zink
- egl/wayland: skip buffer creation on zink
- kopper: use get_drawable_info path for non-x11 drawables
- ci: disable unit tests
- zink: fix dmabuf plane layout struct scoping
- zink: represent plane offsets using offset from plane 0 vs size of plane
- zink: rename a variable
- zink: handle aux plane imports
- Revert "zink: remove drm_fd"
- zink: add back kms handling
- mesa: improve relocation problem message
- mesa/st: bump param reservation to 20
- format_utils: properly parenthesize macro params
- d3d12: skip time-elapsed piglit tests in ci
- virgl: add some ci flakes
- mesa/st: bump param reservation to 28
- mesa: add statevar for atomic counter offsets
- mesa: conditionally set constants dirty for atomic counter binds
- st/glsl_to_nir: call st_set_prog_affected_state_flags() as late as possible
- mesa: handle atomic counter lowering for drivers with big ssbo offset aligns
- zink: remove ANV depth clip control workaround
- zink: disable EXT_primitives_generated_query on turnip
- zink: track invalidation for image resources
- zink: flag renderpass for change if image resource changes valid state
- zink: rename renderpass attrib value
- zink: split out a dynamic render ternary
- zink: hook up surface invalidation to LOAD_OP_DONT_CARE
- zink: invalidate blit dsts if fully covered
- zink: remove buffer valid range tracking from blit
- mesa/streaming-memcpy: add memcpy fallback
- streaming-memcpy: move to src/util/ and compile unconditionally
- mesa/st: use util_streaming_load_memcpy for compute pbo copying
- zink: break out of zs mixed layout update loop when work is done
- zink: only update layout when doing mixed zs attachment renderpass check
- aux/trace: add pipe_context::link_shader
- aux/tc: handle link_shader
- zink: simplify some compiler bo code
- zink: fix buffer descriptor hashing
- zink: always create descriptor templates
- zink: flag optimization pass after inlining uniforms
- zink: break out nir alu typing in ntv
- zink: run nir_lower_alu_to_scalar in optimizer loop for 64bit lowering
- zink: always scalarize pack/unpack alu ops
- zink: don't lower pack/unpack intrinsics
- zink: run more int64 passes during optimization loop if int64 isn't available
- zink: fix ntv partial stores
- zink: flag all shader i/o variables as !is_xfb
- zink: fix variable locations in manual xfb emission
- zink: if a variable is flagged as being xfb, treat it as manual xfb output
- zink: improve/expand manual xfb emission
- zink: implement nir_op_unpack_64_2x32 in ntv
- zink: rewrite 64bit shader i/o as 32bit
- zink: scalarize when rewriting explicit 64bit io
- zink: avoid adding more 64bit alus during bo rewriting
- zink: fix 32bit bo rewriting
- zink: update tu a630 baseline
- zink: add new flakes
- zink: fix feature requirement
- zink: add feature req for border colors
- zink: more requirement docs
- zink: rewrite atomic ssbo intrinsics as atomic derefs
- zink: run copy_prop_vars during optimization
- zink: warn on missing customBorderColorWithoutFormat
- zink: more lavapipe glcts fails
- zink: cap driver inlining using ssa allocation limit
- zink: use separate pointer for swapchain presents
- zink: keep a count of async presents occuring for a given swapchain
- zink: defer old swapchain destruction
- zink: add implicit sync workaround for non-mesa drivers
- zink: unset res->swapchain upon killing a swapchain
- zink: fix up KILL to a more sensible log message
- lavapipe: VK_EXT_border_color_swizzle
- anv: VK_EXT_border_color_swizzle
- zink: fix dynamic stride conditional in pipeline creation
- zink: fix generated tcs update path
- zink: use the patch_vertices value from the shader key, not the context
- zink: unset generated tcs pointer from tes on deletion
- zink: fix generated tcs deletion
- zink: only add necessary binds during dmabuf export
- zink: avoid uninit values in renderpass state
- zink: don't print VK_EXT_shader_atomic_float warning for nir_intrinsic_image_deref_atomic_exchange
- zink: hook up VK_EXT_non_seamless_cube_map
- zink: use VK_EXT_non_seamless_cube_map when available
- zink: rename nonseamless struct members to emulate_nonseamless
- zink: skip some emulated cube code if the extension is present
- zink: fix cubemap lowering for array samplers
- zink: fix cubemap lowering bit size
- zink: clamp dynamic renderArea to framebuffer size
- zink: check for pending clears to determine write status of zs attachments
- zink: add a function for getting the minimum framebuffer layers
- zink: clamp dynamic fb layers better
- zink: clamp renderpass layers better
- zink: anv ci updates
- zink: use uint64_t for batch_id
- zink: remove thread_queue
- zink: use a dynarray for semaphore wait flags
- zink: add batch api for adding a wait semaphore
- zink: add semaphore handling for sparse binds
- zink: detect sparse-only binding queue
- zink: always bind on the sparse queue
- zink: only enable vtn debugging if validation is active
- zink: tu ci baseline update
- zink: radv baseline updates
- drisw: probe kms if fd is passed
- kmsro: add error message on drm ioctl failure
- zink: rename a variable
- zink: unify actual map calls in buffer mapping
- zink: use only the extents for buffer unmap flushes
- zink: fix buffer transfer flushes with offsets
- zink: store and reuse memory heap flags for buffer placement
- zink: move BAR allocation demotion up the stack
- zink: be a little selective about BAR fallback memory type
- zink: hook up VK_EXT_border_color_swizzle
- zink: use VK_EXT_border_color_swizzle
- zink: print runtime warning on missing EXT_border_color_swizzle
- mesa/st: add PIPE_CAP_DITHERING
- zink: unset PIPE_CAP_DITHERING
- mesa/st: increment num_uniforms for hw accel select shaders
- zink: move draw-time barrier generation down a little
- zink: track vertex buffer bind counts on resources
- zink: rework buffer barrier generation
- zink: track image binds
- zink: add a #define for vk shader bits
- zink: collect gfx stages for all bindings during barrier generation
- zink: don't short-circuit gfx stage finding during barrier generation
- zink: relax zink_resource_buffer_needs_barrier checks
- zink: ci updates
- glx/drisw: store the flush extension to the screen
- glx/drisw: invalidate drawables upon binding context if flush extension exists
- zink: fix dual_src_blend driconf workaround
- nir/lower_point_size: apply point size clamping
- mesa: break out PointSizeIsOne setting to util function
- mesa: rename PointSizeIsOne -> PointSizeIsSet
- mesa: skip pointsize exports if pointsize attenuation is enabled
- mesa: enforce pointsize exports if pointsize is being clamped
- mesa: explicitly disallow multiple pointsize exports from generating
- zink: ci updates
- zink: add flag to indicate if a resource is a dmabuf
- zink: flag dmabufs for foreign queue transition on flush_resource call
- lavapipe: zero out blend info if blend isn't enabled
- lavapipe: copy more pNexts for pipeline creation
- lavapipe: fix renderpass info handling during pipeline creation
- lavapipe: add a pipeline library assert
- lavapipe: skip post-copy pNext checking during pipeline creation for composites
- zink: use the bigger of the variable type and interface type for bo sizing
- zink: track ssbo bind counts
- zink: track overall samplerview bind counts
- zink: fix image bind counting
- zink: track gfx/compute descriptor barrier info
- zink: use tracked barrier info for generated barriers
- zink: remove another zink/tu fail
- zink: fix-ish depth clipping without VK_EXT_depth_clip_enable
- zink: add a turnip driver workaround for EXT_depth_clip_enable
- zink: remove swizzle from fbfetch lowering
- lavapipe: always set point_tri_clip
- zink: fix kopper_acquire return value
- zink: track whether current swapchain has data
- zink: move to AoS for storing swapchain struct data
- zink: move 'acquired' flag onto swapchain struct
- zink: use better determination for handling swapchain acquire submits
- zink: remove broken kopper assert
- zink: remove unnecessary code in kopper readback acquire
- zink: improve no-op acquire detection
- zink: flag batch as having work during frontbuffer flush
- zink: ensure pending present flushes are handled during frontbuffer flush
- zink: delete zink_resource_object::acquire
- zink: no-op pixmap frontbuffer flush calls
- zink: acquire swapchain images on image map
- zink: use kopper_displaytarget type directly
- zink: delete zink_resource_object::dt_has_data
- zink: set PIPE_RESOURCE_FLAG_DONT_MAP_DIRECTLY for device-local resources
- lavapipe: break out part of shader optimization into function
- lavapipe: streamline xfb shader setup
- nir/types: fix glsl_matrix_type_is_row_major() assert
- zink: store context flags
- zink: don't increment screen->num_contexts for copy context
- zink: disable turnip traces temporarily
- lavapipe: don't remove xfb outputs
- zink: enforce viewport depth clamping
- zink: size uniform_0 ubo accurately
- zink: further improve bo sizing
- zink: force unsized array in bo creation using the last block member type
- zink: move/refactor bo_vars struct creation in compiler
- zink: handle bitsizes in get_bo_vars() analysis
- zink: don't run nir_lower_uniforms_to_ubo repeatedly
- zink: pass zink_shader to optimize_nir during final optimize pass
- zink: add an optimizer pass to enforce bo bounds
- zink: use std430 packing by default
- zink: use a std430 interface for bo types
- mesa: remove incomplete texture warning
- mesa/st: make get_sampler_view_format() public
- mesa/st: add PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_FREEDRENO
- zink: init driver workarounds earlier in screen creation
- zink: disable customBorderColorWithoutFormat on turnip
- zink: don't warn for missing customBorderColorWithoutFormat on turnip
- zink: support PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_FREEDRENO
- zink: flush pending clears for fb texture barriers
- mesa: fix SignalSemaphoreEXT behavior
- lavapipe: don't overwrite entire VkFormatProperties3 struct
- lavapipe: don't crash on null xfb buffer pointer
- zink: reject swizzled format blits
- lavapipe: support inlined shader spirv for compute
- zink: allow multiple tex components for depth tg4
- zink: always use 32bit sample ops
- zink: add a compiler pass to match up tex op dest types
- zink: use sampler_bind_count to simplify some code
- zink: break out samplerview layout reset code
- zink: always update sampler descriptor layouts on fb surface unbind
- zink: always use 32bit floats for so output types
- zink: handle bare matrix types in xfb emission
- zink: split xfb block emission from array/matrix handling
- zink: fix xfb array inlining
- zink: use right glsl length getter for ntv partial stores
- zink: bitcast extracted streamout components to uint before creating uvec
- zink: add a compiler pass to split xfb block outputs
- zink: fix xfb emit check in compiler
- mesa: move is_wrap_gl_clamp() to samplerobj.h and deduplicate
- mesa: track which sampler wrap params use GL_CLAMP
- mesa: update GL_CLAMP emulation when binding/unbinding textures
- zink: remove bogus range tracking from texture clear
- zink: fix transient attachment rp assert
- zink: remove format check from clear texture
- zink: fix coverage check for texture clears
- zink: stop using u_blitter for texture clears
- zink: improve zink_clear_depth_stencil check for current attachment
- zink: massively simplify zink_clear_depth_stencil
- zink: remove non-renderpass clear path from zink_clear_texture
- zink: remove u_blitter usage from zink_clear_render_target
- zink: always use storeOp=STORE for depth renderpass
- zink: track a bitmask of fb attachments with mismatched layer counts
- zink: add explicit (awful) handling for fb layer mismatch clears
- zink: clamp color clear values based on format
- zink: lift clearing on fb state change up a level
- zink: remove out-of-renderpass clears
- zink: delete zink_fb_clear_util_unpack_clear_color
- zink: delete srgb tracking for clears
- zink: simplify zink_framebuffer_clear_data union
- zink: inject a 0,0,0,1 clear for RGBX formats
- zink: break out tc/trace context unwrapping
- zink: nv baseline update
- zink: tu a630 baseline update
- mesa/st: add implicit zeroing of clipdistance array
- zink: move descriptor_mode to be a global
- zink: delete ZINK_DESCRIPTOR_MODE_NOFALLBACK
- zink: add an explicit ZINK_DESCRIPTOR_MODE_CACHED
- zink: default to lazy/templated descriptors
- zink: handle max_vertices=0 in geometry shader
- zink: handle null samplerview in get_imageview_for_binding()
- zink: fix crashing on bo allocation failure
- zink: improve oom error message
- zink: call bind_last_vertex_stage() when binding vs
- zink: fix viewport count tracking
- zink: use gfx_barrier for synchronization when binding buffer descriptors
- vulkan/wsi: fix multiple acquires for sw without mit-shm
- llvmpipe: ignore multisample state for alpha_to_one
- zink: don't try to start a renderpass before CmdClearAttachments
- zink: don't flag renderpass change on depth write change as often
- zink: flag renderpass change if renderpass clears change
- zink: do dynamic render implicit clear updates on rp_changed
- zink: unflag ctx->rp_changed after dynamic render update
- zink: remove redundant renderpass update check for clears
- zink: add an assert to verify that deferred clears aren't added in renderpass
- zink: split out rp_changed to be more granular for dynamic render
- zink: break out renderpass attachment load op calc
- zink: break out zs renderpass layout calc
|
||
- zink: break out color rt layout calc
|
||
- zink: reuse rt layout functions to refactor zink_render_pass_attachment_get_barrier_info()
|
||
- zink: add a bits union for zink_render_pass_state comparisons
|
||
- zink: improve granularity of renderpass switching
|
||
- zink: allow no-op renderpass updates in zink_batch_rp()
|
||
- zink: move layout-setting for dynamic render attachments after no-op case
|
||
- zink: handle null samplerview/image rebinds more gracefully
|
||
- zink: scale number of shader cache-get threads
|
||
- zink: invoke descriptor_program_deinit for programs on context destroy
|
||
- vulkan/wsi: return VK_SUBOPTIMAL_KHR for sw/x11 on window resize
|
||
- zink: add env var to abort on device-lost if no reset callback is set
|
||
- util/vbuf: handle multidraws
|
||
- anv: disable dev.i915.perf_stream_paranoid=0 warning
|
||
- lavapipe: propagate xfb info for pipeline library rasterization stages
|
||
- zink: remove deqp fails for lavapipe
|
||
- zink: only flag fbfetch as rp update if fbfetch wasn't already configured
|
||
- zink: assert !rp_changed after renderpass has started
|
||
- zink: avoid crashing when bindless texture updates aren't used
- lavapipe: EXT_non_seamless_cube_map
- lavapipe: accurately set image/ssbo access based on shader usage
- lavapipe: implement extreme uniform inlining
- lavapipe: remove some skips
- zink: add a ci flake
- lavapipe: move vertex input dynamic states into vertex input block
- lavapipe: make binding/layout helper functions static inline
- lavapipe: use correct offset calculation for flagging ssbo/image write access
- lavapipe: use 64bit masks for shader access
- lavapipe: set image access on pipeline bind, not descriptor bind
- llvmpipe: remove weird clamping for format support with sample counts
- llvmpipe: allow resource_copy_region to perform replicate operations
- lavapipe: fix stencil clears
- llvmpipe: add a shortcut for blitter resolves
- vulkan: hook up VK_EXT_multisampled_render_to_single_sampled
- lavapipe: VK_EXT_multisampled_render_to_single_sampled
- lavapipe: advertise VK_EXT_multisampled_render_to_single_sampled
- util/blitter: respect PIPE_TEXTURE_RECT
- lavapipe: fix shader access stage conversion from mesa -> pipe
- zink: stop ORing unordered barrier access
- zink: convert ZINK_DEBUG to enum
- zink: add doc for ZINK_DEBUG=compact
- zink: add ZINK_DEBUG=noreorder
- zink: zink_resource::unordered_barrier -> unordered_exec
- zink: explicitly unset unordered_exec when resource can't be unordered
- zink: rework unordered_exec handling
- zink: add a second resource param to get_cmdbuf() for unordered promotion
- zink: promote image/buffer copy operations to unordered
- zink: move buffer/image functions down in file
- zink: expand unordered_exec
- zink: make get_cmdbuf() public
- zink: enable unordered blit/resolve ops
- virgl: more ci flakes
- d3dadapter: fix sw pipe loading
- nine: set view_mask=0
- zink: fix gfx program cache pruning with generated tcs
- zink: don't flag lazy push constant set dirty on batch change
- zink: combine loops for lazy descriptor program deinit
- zink: handle mutable swapchain images with dmabuf
- zink: store VkFormatFeatureFlags on creation
- zink: use modifier feature flags during surface creation when necessary
- zink: init cache_put program fence on program creation
- zink: force flush clears on fb change if fb geometry changes
- zink: force a new framebuffer for clear_depth_stencil if the clear region is big
- zink: drop mode_changed check from linewidth/depthbias draw updates
- zink: handle unscaled depth bias from nine
- zink: handle !half_pixel_center
- zink: fix return for PIPE_CAP_DEPTH_CLIP_DISABLE
- nine: check return on resource_get_handle
- zink: demote dmabuf tiling to linear if modifiers aren't supported
- zink: use modifier_aspect to check for modifier plane in zink_resource_get_param
- zink: don't add modifiers if EXT_image_drm_format_modifier isn't present
- zink: add all format modifiers when adding for dmabuf export
- zink: don't fixup sparse texops
- vulkan: Update the XML and headers to 1.3.224
- radv: fix return type for meta resolve shaders
- nir/validate: clamp unsized tex dests to 32bit
- mesa: fix blending when using luminance/intensity emulation
- mesa: require render target bind for A/L/I in format selection
- vk/render_pass: don't deref null resolve attachments
- zink: handle nir_intrinsic_sparse_residency_code_and mechanics
- zink: ignore nir_texop_lod for tex dest matching
- tu: fix invalid free on alloc failure
- zink: don't call util_queue_fence_init in zink_screen_get_pipeline_cache()
- zink: don't emit entrypoints for function temp variables
- zink: check the variable mode before taking samplemask path in ntv
- zink: explicitly use unsigned types for bit shifts
- zink: only add srgb mutable for images with modifiers
- zink: don't emit illegal interpolation
- llvmpipe: don't assume pipe_context is always available in flush_frontbuffer
- zink: flag all assigned output slots as mapped
- zink: handle split acquire/present
- st_pbo/compute: fix 1D_ARRAY offsets
- st_pbo/compute: fix 1D coord dimension by pre-trimming vectors

Mike Lothian (1):

- util: Fix invalid usage of alignas in u_cpu_detect.c

Mykhailo Skorokhodov (3):

- anv: Use sampleLocationsEnable for sample locations
- iris: Move Wa_1806527549 and enable by default
- anv: Move Wa_1806527549 and enable by default

Nagappa Koppad, Basanagouda (1):

- iris:Duplicate DRM fd internally instead of reuse.

Nanley Chery (11):

- iris: Return a 64B stride for clear color plane
- intel/isl: Add a score for I915_FORMAT_MOD_4_TILED
- isl,iris: Add DG2 CCS modifier support for XeHP
- mesa: Simplify UNORM8 conversions for sRGB ASTC textures
- mesa: Follow ASTC decode mode extension for RGBA8 output
- iris: Don't leak surface states for compressed resources
- iris: Don't leak compressed resources in iris_create_surface
- iris: Use fill_surface_states for compressed resources
- anv: Make the D16 reg mode single-sampled
- iris: Make the D16 reg mode single-sampled
- iris: Dedent enum iris_depth_reg_mode

Nicolas Caramelli (1):

- egl: Fix EGL_EXT_platform_xcb name string to match the registry

Panagiotis Apostolou (1):

- util: Don't block SIGSEGV for new threads

Patrick Lerda (1):

- panfrost: Fix unwanted valgrind message related to restart_index

Paulo Zanoni (4):

- iris: fix race condition during busy tracking
- intel/compiler: compute int64_options based on devinfo->has_64bit_int
- intel/compiler: split handling of 64 bit floats and ints
- intel/compiler: call ordered_unit() only once at update_inst_scoreboard()

Pavel Asyutchenko (7):

- gallium: add PIPE_CAP_FBFETCH_ZS and expose extension
- glsl: add language support for GL_ARM_shader_framebuffer_fetch_depth_stencil
- nir_to_tgsi: Don't count ZS fbfetch vars as outputs
- llvmpipe: enable per-sample shading when FB fetch is used
- llvmpipe: simplify early/late zs tests selection
- llvmpipe: implement FB fetch for depth/stencil
- llvmpipe: enable PIPE_CAP_FBFETCH_ZS

Pavel Ondračka (13):

- r300: set PIPE_BIND_CONSTANT_BUFFER for const_uploader
- r300: skip draws instead of using a dummy vertex shader
- r300: fix vertex shader control flow in loops
- r300: deduplicate common NIR options
- r300: be less agresive with copy propagate in loops
- r300: don't try to use inline constants instead of constant swizzles
- r300: Update list of RV515 dEQP failures and add some flakes
- r300: merge simple movs with constant swizzles together
- r300: expose PIPE_CAP_LEGACY_MATH_RULES
- r300: only run merge_movs pass on R500
- r300: Keep rc_rename_regs() from overflowing RC_REGISTER_MAX_INDEX
- r300: disable PIPE_CAP_PREFER_IMM_ARRAYS_AS_CONSTBUF
- r300: fix variables detection for paired ALU and TEX instructions in different branches

Pierre-Eric Pelloux-Prayer (81):

- radeonsi: make si_pc_emit_shaders non static
- radeonsi: add helpers to deal with spm counters
- radeonsi: add spm counters setup code
- radeonsi/sqtt: setup spm bo
- radeonsi/sqtt: enable spm counters on gfx10+
- radeonsi: make sure profile_peak is used before using sqtt
- ac/spm: setup write broadcasting correctly
- ac/surface: adjust gfx9.pitch[*] based on surf->blk_w
- radeonsi: remove commutative_blend_add option
- radeonsi: drop assume_no_z_fights option
- drirc: remove i965 entry
- gallium: add PIPE_CAP_ALLOW_DRAW_OUT_OF_ORDER
- radeonsi: enable PIPE_CAP_ALLOW_DRAW_OUT_OF_ORDER
- radeonsi: fix VS kill_outputs handling
- radeonsi: don't always strip NIR debug infos
- radeonsi: generate unique shader name in si_get_nir_shader
- amd: update gfx10_format_table.py for gfx11
- radeonsi/gfx11: add assert in legacy vs path
- radeonsi/gfx11: register changes
- radeonsi/gfx11: image descriptor changes
- radeonsi/tests: allow empty line and comments in csv files
- radeonsi/tests: add a heuristic to pick the baseline
- radeonsi/tests: reformat with black
- u_threaded: clear non-async debug callback correctly
- radeonsi: flush VGT streamout like PAL
- radeonsi: implement pipeline stats workaround
- gallium: add a union to access queries counters
- gallium: always return the correct pipeline stats index
- gallium: deduplicate target -> stats index code
- radeonsi: deduplicate query offsets
- radeonsi: store the pipeline stats index
- radeonsi: implement GL_GEOMETRY_SHADER_PRIMITIVES_EMITTED_ARB in shaders
- radeonsi/test: update sienna_cichlid results
- radeonsi/query: use the qbo correct size
- radeonsi/ngg: reuse the pipeline stats buffer when using atomics
- radeonsi: fix gs_invocation query with NGG
- radeonsi: don't use wave32 for GE on gfx10 if culling is used
- amd: fix ac_build_mbcnt_add in wave32 mode
- util/u_queue: rework UTIL_QUEUE_INIT_SCALE_THREADS to scale faster
- radeonsi: remove unused params from si_shader_select_with_key
- radeonsi: scale the number of shader compiler threads
- radeonsi: store shader variants in an array
- glx: set max values for pbuffer width / height
- radeonsi: fix glTexBuffer max size handling
- radeonsi: wait for PS idle in si_set_framebuffer_state
- radeonsi/tests: add a --slow option
- radeonsi/tests: use a smaller tests-per-group value
- radeonsi: don't use sel->nir in si_check_blend_dst_sampler_noop
- ac: export LLVM c++ ResetCommandLineParser method
- ac: reset LLVM command line parser
- ac/llvm: add use_waterfall_for_divergent_tex_samplers option
- radeonsi: enable use_waterfall_for_divergent_tex_samplers
- st/mesa: use mutex in st_get_texture_sampler_view_from_stobj
- gallium: update winsys_handle documentation
- gallium: add size attribute to winsys_handle
- va/surface: set the correct size in vaExportSurfaceHandle
- radeonsi: set size in si_texture_get_handle
- tradeonsi: fix preamble state producing incorrect packets
- radeonsi: prevent recursion in si_decompress_dcc
- radeonsi: add helper to use si_screen::aux_context
- radeonsi: use helpers to access si_screen::aux_context
- Revert "winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+"
- dri: store internal_format when known
- mesa/st: don't guess the internal format if it's known
- util/blob: use memcpy in read functions
- mesa: skip draws with invalid indices offset
- vulkan/wsi: define pWaitDstStageMask in the blit submission
- ac: use LLVMContextSetOpaquePointers if available
- radeonsi: use LLVMBuildLoad2 where possible
- ac: add per output is_16bit flag to ac_shader_abi
- ac/llvm: handle opaque pointers in visit_store_output
- ac/llvm: use LLVMBuildLoad2 in visit_load
- radeonsi: use LLVMBuildLoad2 in llvm PS
- radeonsi: use LLVMBuildLoad2 for inter-stage outputs loads
- radeonsi: check last_dirty_buf_counter and dirty_tex_counter
- amdgpu/bo: update uses_secure_bos when importing buffers
- nir: add a nir_opt_if_options enum
- radeonsi: prevent u_blitter recursion in si_update_ps_colorbuf0_slot
- radeonsi: use nir_opt_large_constants earlier
- mesa: avoid reading back textures from VRAM
- radeonsi: invalidate L2 when using dcc stores

Qiang Yu (63):

- radeonsi: lower nir_intrinsic_sparse_residency_code_and
- mesa/program: fix nir output reg overflow
- mesa/vbo: remove unused vbo_context->binding
- nir/builder: add load/store array variable helper functions
- mesa: add hardware accelerated select constant
- mesa: add _mesa_bufferobj_get_subdata
- mesa: refine name stack code to prepare for hw select
- mesa: add hw select name stack code path
- mesa/vbo: enclose none-vertex functions with HW_SELECT_MODE
- mapi: add api setup header for hw select mode
- mesa: set CurrentServerDispatch too when glBegin/End
- mesa: add HWSelectModeBeginEnd dispatch table
- mesa: pass select result buffer offset as attribute/varying
- mesa/st: implement hardware accelerated GL_SELECT
- zink: reset PIPE_CAP_ACCELERATED when cpu soft rendering
- virgl: return -1 for PIPE_CAP_ACCELERATED
- mesa: enable HardwareAcceleratedSelect
- ac/llvm: get back nir_intrinsic_load_tess_rel_patch_id_amd
- nir: add nir_intrinsic_load_lshs_vertex_stride_amd
- ac/nir: use nir_intrinsic_load_lshs_vertex_stride_amd
- radeonsi: implement load_lshs_vertex_stride abi
- radeonsi: add tcs_vgpr_only_inputs parameter to si_get_nir_shader
- ac/nir: skip gl_Layer/gl_ViewportIndex write for LS
- radeonsi: replace llvm ls/hs interface lds ops with nir lowered ones
- gallium: disable hardware select for crocus
- ac/llvm: conditionally check wave id in gs sendmsg
- radv: no need to do gs_alloc_req for newer chips in ngg vs/tes
- radeonsi: deserialize nir binary in si_check_blend_dst_sampler_noop
- radeonsi: use si_shader as parameter in si_get_nir_shader
- radeonsi: support multi stage shader state creation in nir shaderlib
- radeonsi: add si_create_passthrough_tcs
- radeonsi: replace llvm based fixed tcs with nir
- radeonsi: implement nir_intrinsic_load_tcs_num_patches_amd
- ac/nir: remove unused parameter in tes input lower
- ac/nir: add nir_intrinsic_load_hs_out_patch_data_offset_amd
- ac/nir: use nir_intrinsic_load_hs_out_patch_data_offset_amd in tess lower
- ac/nir: add pass_tessfactors_by_reg param to hs output lower
- ac/nir: add wave_size parameter to ac_nir_lower_hs_outputs_to_mem
- radeonsi: preload tess offchip ring for tcs
- radeonsi: implement nir_intrinsic_load_ring_tess_offchip(_offset)_amd
- radeonsi: implement nir_intrinsic_load_tess_rel_patch_id_amd for both tcs and tes
- ac/llvm: handle write mask for nir_intrinsic_store_buffer_amd
- ac/nir: add no_input_lds_space param to hs output lower
- radeonsi: replace llvm tcs output with nir lower pass
- radeonsi: enable PIPE_CAP_GLSL_TESS_LEVELS_AS_INPUTS
- radeonsi: set uses_vmem_load_other for nir_intrinsic_load_buffer_amd
- radeonsi: replace llvm tes input load with nir lowering
- ac/llvm: fix code format alignment in visit_load_local_invocation_index
- radeonsi: implement two esgs ring nir intrinsic
- ac/nir: skip gl_ViewportIndex and gl_Layer write in ES
- radeonsi: set lds for gs/es to handle nir shared memory load/store
- ac/nir: remove unused param num_reserved_es_outputs from gs input lower
- ac/nir: change es output lower param to esgs_itemsize
- radeonsi: replace llvm es output with nir lowering
- ac/nir: add triangle_strip_adjacency_fix to gs input lower
- ac/llvm: get back nir_intrinsic_load_gs_vertex_offset_amd
- radeonsi: replace llvm gs input handle with nir lowering
- ac/nir/ngg: fix nogs culling scratch size
- ac/nir/ngg: add a barrier before prim id export
- ac/nir/ngg: Decouple primitive ID store and primitive export.
- nir/lower_gs_intrinsics: fix primitive count for points
- radeonsi: fix tcs_out_lds_offsets arg alignment
- winsys/amdgpu: fix non-page-aligned sparse buffer creation

Rajnesh Kanwal (29):

- pvr: Remove double error reporting in error path.
- pvr: Change phase_rate_change type to bool from uint32_t.
- pvr: Setup common nop shader.
- pvr: Add services winsys transfer context support.
- pvr: Add support to create transfer context and setup required shaders.
- pvr: Add vk_sync support and remove service winsys syncobjs interface.
- pvr: Fix argument comments in render job submission.
- pvr: Pass device info struct in pds generator.
- pvr: Add support for R32G32B32A32_UINT, R32_UINT and R8_UINT formats.
- pvr: Allow signal_sync pointer to be NULL in job submission.
- pvr: Remove vk_format_is_pure_integer and use common helper.
- pvr: Add transfer 3d parameter heap support.
- pvr: Fix ISP_RENDER_MODE_TYPE enum in rogue_cr.xml
- pvr: Restrict argument to dev_info where possible.
- pvr: Fix UTIL_ARCH_BIG_ENDIAN check in vk_format.h.
- pvr: Change ALIGN macro to ALIGN_ATTR.
- pvr: Add services winsys transfer cmd submit interface.
- pvr: Move transfer logic to pvr_job_transfer.[ch]
- pvr: Remove clang-format off comment from vk_format.h
- pvr: Implement vkCmdDraw API.
- pvr: Init and store default sampler in pvr_device.
- pvr: Use common vk_buffer implementation.
- pvr: Implement vkCreateBufferView and vkDestroyBufferView.
- pvr: Move binding related checks in common code.
- pvr: Add support for sampler descriptor.
- pvr: Add support for combined image and sampler descriptor.
- pvr: Add support for image descriptor.
- pvr: Add support for storage and uniform texel descriptor.
- pvr: Add support for input attachment descriptor.

Renato Pereyra (4):

- venus: add support for vk_xwayland_wait_ready
- venus: Add driconf option to enable implicit fencing
- Revert "wsi/x11: Don't leak xcb_get_geometry_reply_t."
- Revert "wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes"

Rhys Perry (94):

- aco: fix disassembly of SMEM with both SGPR and constant offset
- ac/nir: properly handle large global access constant offsets
- nir/gather_info: fix system_value_read for rt/mesh system values
- nir/builder: add nir_{ine,ibfe,ubfe}_imm() helper
- radv: remove unnecessary align_mul/offset initialization
- radv: initialize meta shader options earlier
- radv: use radv_meta_build_nir_vs_generate_vertices() more
- radv: use nir_op_imm helpers
- radv: use inot(a) instead of ine(a, true)/ieq(a, false)
- ac/nir: fix 64-bit NGG GS output stores
- radv: always use nir_lower_io_lower_64bit_to_32
- nir/copy_prop_vars: fix non-vector shader call payloads
- radv: fix clearing of TRUNC_COORD with tg4 and immutable samplers
- radv: consider radix sort shaders to be internal/meta
- aco/ra: fix vgpr_limit
- aco: ensure that definitions fixed to operands have matching regclasses
- radv: add radv_has_shader_buffer_float_minmax
- radv: allow LBVH on GFX6+
- radv: remove subtractions in address calculations
- radv: use SMEM for a few load_global
- aco: split load_sbt_amd result
- aco: skip s_barrier if TCS patches are within subgroup
- ac/nir: skip s_barrier if TCS patches are within subgroup
- aco: fix cmpswap global atomic definition on GFX6
- nir: allow NIR_PASS(_, )
- nir: print file when validation fails
- nir: call nir_metadata_preserve in nir_io_add_const_offset_to_base
- nir: call nir_metadata_preserve in nir_vectorize_tess_levels
- nir: call nir_metadata_preserve in nir_lower_memory_model
- radv: call nir_metadata_preserve in various lowering passes
- radv: add missing NIR_PASS() and switch from NIR_PASS_V()
- radv: validate shaders after linking passes
- aco: fix p_constaddr with a non-zero offset
- aco/tests: add test for p_constaddr with a non-zero offset
- aco: only add/subtract low bits of program addresses
- aco/ra: fix usage of invalidated iterator
- aco: use p_parallelcopy for uniform reduction with zero source
- aco: clarify a portion of do_pack_2x16
- aco/tests: disable regalloc.subdword_alloc.reuse_16bit_operands on GFX11
- aco/tests: fix gfx11 variants printed as gfx12
- aco: disable sdwa on gfx11
- aco: use v_perm_b32 to copy 0xff00/0x00ff/0xff/0x00
- aco/tests: update for GFX11's removal of SDWA
- radv,aco: disable shader cache when ACO debug options are used
- aco: fix SMEM load_global_amd with non-zero offset
- aco: fix SMEM load_global with VGPR address and non-zero offset
- aco: consider fma with multiplication by power-of-two unfused
- ac: use ResetAllOptionOccurrences instead of ResetCommandLineParser
- gitlab: ask for useful titles
- gitlab: ask that reporters don't include long logs in descriptions
- nir/deref: stop assuming coherent accesses of different SSBOs may alias
- aco: update SMEM offset workaround for LLVM 15
- aco: don't skip VS->TCS barrier if TCS output vertices doesn't match input
- aco: cleanup force-waitcnt output
- aco/ra: update register file when updating phi definition
- radv/ci: re-enable vega10 fossils testing
- aco: add and use constantValue16()
- aco: don't accept med3 opcodes in get_minmax_info()
- aco: include _e64 variants of 16-bit min/max in minmax optimizations
- aco: ignore precise flag when optimizing integer clamps
- aco: fix single-alignbyte do_pack_2x16() path with fp inline constants
- aco: fix propagate_constants_vop3p with integer vop3p and 16-bit constants
- aco: don't use opsel to fold constants into dot accumulation sources
- aco: fix redirect combine in propagate_constants_vop3p() with negatives
- aco: try sign-extending or shifting constants in propagate_constants_vop3p
- aco: don't use 32-bit fp inline constants for fp16 vop3p literals
- aco/tests: add vop3p constant combine tests
- nir: rename fsin_r600/fcos_r600 to fsin_amd/fcos_amd
- nir/algebraic: optimize bcsel(c, fsin/cos_amd(a), fsin/cos_amd(b))
- nir: allow 16-bit fsin_amd/fcos_amd
- radv,aco,ac/llvm: use nir_op_f{sin,cos}_amd
- aco: make flat access latency match mtbuf/mubuf/mimg
- aco: include flat-like in vmem clause statistics
- aco: make FLAT_instruction::offset signed
- aco: improve support for scratch_* instructions
- aco: combine additions and constants into scratch load/store
- aco: handle subtractions in parse_base_offset
- aco: refactor VGPR spill/reload lowering
- aco: avoid WAW hazard with BVH MIMG and other VMEM
- aco: include scratch/global in VMEM WAW optimization
- aco: treat flat-like as vmem in some scheduling heuristics
- aco: initialize scratch base registers on GFX9-GFX10.3
- aco: use scratch_* for scratch load/store on GFX9+
- aco: use scratch_* for VGPR spill/reload on GFX9+
- radv: fix vbo_bound_mask indexing
- radv: don't update misaligned_mask in CmdBindVertexBuffers2
- aco: remove unnecessary exp_cnt increments
- aco: fix LdsBranchVmemWARHazard with 2+ branch chains
- aco: fix hash statistic
- radv: remove claimed support for sRGB vertex buffer formats
- radv: fix 16-bit support in radv_lower_vs_input
- aco: fix 16-bit VS inputs
- aco: don't expand vec3 VS input load to vec4 on GFX6
- aco: add SCC clobber in build_cube_select

Ricardo Garcia (1):

- vulkan: allow null descriptor set layouts in pipeline layouts

Riteo (1):

- vulkan/device_select_wayland: fix a memory leak with DRM device handling

Rob Clark (64):

- freedreno: Misc indent fixes
- freedreno/drm/virtio: Appease valgrind
- freedreno/drm: Drop FD_PP_PGTABLE
- freedreno/drm: Close bo handle after bo->destroy()
- freedreno/drm: Move bo common init
- freedreno/drm/virtio: Support ring_idx
- freedreno/drm/virtio: Use userspace IOVA allocation
- freedreno/drm/virtio: Drop blocking in host
- freedreno/drm/virtio: Fix SHAREABLE+MAPPABLE
- freedreno/drm/virtio: Protocol updates
- freedreno/drm/virtio: Pass guest handles to execbuf
- freedreno/drm/virtio: Async ccmd batching
- freedreno/drm/virtio: Split up large uploads
- freedreno/a6xx: A couple more FD_BO_NOMAP
- freedreno: Split out helper for staging blit
- freedreno: Use staging transfer if mmap fails
- freedreno/drm: Fix bos_on_stack calculation
- drm-shim: Add GET_UNIQUE support
- freedreno/drm-shim: Update to latest uapi version
- freedreno/drm-shim: Robustify error handling
- drm-shim: Better mmap offsets
- drm-shim: Add error handling for bo_init()
- freedreno/drm-shim: Better iova handling
- drm-shim: Cleanup on device file close
- freedreno/drm: Use DEBUG_GET_ONCE_OPTION()
- freedreno/drm/virtio: Don't try to mmap imported bo's
- freedreno/devices: Add another SKU
- Revert "ci: freedreno farm is down"
- freedreno: Add pkt4 assert
- freedreno/registers: Small cleanup
- freedreno/a6xx: Split VFD_FETCH[] if needed
- freedreno/a6xx: Initialize VFD_FETCH[n].SIZE to zero
- freedreno/a6xx: assert valid vertex_flags reg
- turnip: assert valid vertex_flag reg
- freedreno/ir3: Don't lower_gs multiple times
- freedreno/ir3: Add copy_vars() helper
- freedreno/ir3: Copy vars if needed on EndPrimitive()
- freedreno/ir3: Add more tess varying slots
- freedreno/a6xx: Fix indentation
- freedreno/a6xx: Fix VS const packet size
- freedreno/ir3: Fix GS clip-plane lowering
- freedreno: rename ir3_emit_driver_params()
- freedreno/a6xx: Handle driver-params in GS/DS
- freedreno/a6xx: Do clip-plane lowering in backend
- freedreno/gmem: Reverse order of alternative tile rows
- util: Fix c++ enum casting pickiness
- libsync: Fix compiler warning
- freedreno/a6xx: Remove fd6_format.[ch]
- freedreno/a6xx: Fix enum tag
- freedreno: Extract common helper macros
- freedreno/ir3: Remove unneeded forward declaration
- freedreno/autotune: Make 'offset' macro "private"
- freedreno/drm: Combine upper and lower 32b of OR val
- freedreno/a6xx: De-open-code CACHE_FLUSH enum
- freedreno: Use enum for primtypes table
- freedreno/ci: Update unit test reference decodes
- freedreno/registers: Whitespace fix for gen_header.py
- freedreno/registers: Allow varset to be specified on enum
- freedreno/registers: Move varset to <enum>
- freedreno/registers/a6xx: Some reg64 conversion
- freedreno/drm: Fix potential bo cache vs export crash
- freedreno/gmem: Fix col0 calc
- llvmpipe: Add some missing locking
- freedreno: We really don't need aligned vbo's

Roman Stratiienko (3):

- Android.mk: Fix c11-related build failures
- v3dv: Enable sync_fd importing/exporting on Android
- v3dv: Limit API version to v1.0 for Android

Ruijing Dong (6):

- radeonsi/vcn: Add support of array_mode for gfx11
- radeonsi/vcn: update av1 decoding to support vcn4
- radeonsi/vcn: prepare for unified queue in vcn4
- radeonsi/vcn: support unified queue in vcn4
- frontends/va: do texture_map when needed
- frontends/va: WA for ffmpeg 10bit encoding crash

Ryan Neph (5):

- venus: update venus-protocol with VK_EXT_image_view_min_lod
- venus: enable VK_EXT_image_view_min_lod
- venus: support VK_KHR_copy_commands2
- venus: refactor VkCommandBufferBeginInfo fixups to function
- venus: add support for VK_KHR_dynamic_rendering

Sagar Ghuge (1):

- anv: Disable storage image compression for possible atomic ops

Sami Kyöstilä (2):

- subprojects: Roll Perfetto to v27.1
- util: Shut down Perfetto before driver unload

Samuel Pitoiset (213):

- radv: exclude PRIMITIVE_{COUNT,INDICES} from the per-vertex output mask
- radv: use shader_info::writes_memory
- radv: use shader_info::gs::active_stream_mask
- radv: use shader_info::inputs_read/outputs_written for FS IO
- radv: use shader_info::outputs_written/per_primitive_outputs for VS outputs
- radv: use shader_info::system_values_read
- radv: fix enabling adjust_frag_coord_z and apply per-pipeline
- radv: add support for independent descriptor set layouts
- radv: add support for VkShaderModuleCreateInfo as pNext
- radv: fix initializing pipeline_key::topology for GFX9 and older
- radv: mark all active stages earlier in the pipeline creation path
- radv: use the hardware primitive topology everywhere
- radv: initialize the vertex input interface state in only one place
- radv: do not support UNIFORM_TEXEL_BUFFER with SRGB
- radv: only apply enable_mrt_output_nan_fixup for 32-bit float MRTs
- radv/ci: update the flakes list for GFX9 chips
- aco: fix load_barycentric_at_{sample,offset} on GFX6-7
- nir: fix marking XFB varyings as always active IO
- nir: mark XFB varyings as unmoveable to prevent them to be remapped
- radv: clarify why STAGE_2_CLEAR_BIT needs to wait for CP DMA to be idle
- radv: remove redundant VK_PIPELINE_STAGE_2_TRANSFER_BIT for CP DMA idle
- radv: add an SQTT workaround for chips with disabled RBs
- radv: fix handling divisor == 0 with dynamic vertex input state
- radv: allow to disable sinking of load inputs for FS via drirc
- radv: enable radv_disable_sinking_load_input_fs for Grid Autosport
- radv: use correct push constants range for internal operations
- radv/ci: update list of expected failures for Bonaire (GFX7)
- radv/ci: remove empty flakes lists for GFX9
- radv: fix the number of generated primitive queries with NGG GS vs legacy
- radv/ci: stop skipping dEQP-VK.synchronization.* on Bonaire
- radv: use 3D views for 3D internal operations on GFX6-8
- radv: re-emit dynamic line stipple state if the primitive topology changed
- radv/radix: handle intentional allocation failures properly
- ac,radeonsi: add has_sqtt_auto_flush_mode_bug
|
||
- radv/sqtt: fix configuring AUTO_FLUSH_MODE on GFX10.3
|
||
- radv: only init acceleration structure if RT is enabled
|
||
- radv: disable instance packing to fix pipeline query stats
|
||
- radv: remove useless check against nir_texop_samples_identical
|
||
- radv/winsys: allow to use RADV_FORCE_FAMILY=gfx1100
|
||
- radv: disable DCC for Senra Kagura Shinovi Versus
|
||
- aco: recognize GFX11 in few places
- aco: do not align VGPRS to 8 or 16 on GFX11
- radv,aco: add support for packed threadID VGPRs on GFX11
- radv,aco: use the new TCS WaveID SGPR to compute vs_rel_patch_id on GFX11
- aco: export MRT0 instead of NULL on GFX11
- aco: do not set DLC for loads on GFX11
- aco: do not set GLC stores on GFX11
- aco: update LDS allocation granularity for PS on GFX11
- aco: update waitcnt on GFX11
- aco: do not set RESOURCE_LEVEL for buffer descriptors on GFX11
- aco: do not set COMPR for exports but use 0x3 channel mask on GFX11
- radv,aco: export alpha-to-coverage via MRTZ on GFX11
- aco: only retrieve the scratch offset when it's declared
- radv,aco: do not implicitly export the primitive ID for mesh shaders
- aco: remove unreachable code about viewport index/layer and mesh shaders
- radv,aco: add a workaround for binding 2D views of a 3D image on GFX9
- radv: implement VK_EXT_image_2d_view_of_3d
- radv: advertise VK_EXT_image_2d_view_of_3d
- radv: do not declare the scratch offset on GFX11
- aco: do not emit the primitive ID twice for NGG VS or TES with GS
- radv: do not align VGPRS to 8 or 16 on GFX11
- radv: use the new format table on GFX11
- radv: update scratch buffer registers on GFX11
- radv: update sampler registers on GFX11
- radv: update buffer descriptor registers on GFX11
- radv: update image descriptor registers on GFX11
- radv: use PIXEL_PIPE_STATE_DUMP event instead of ZPASS_DONE on GFX11
- radv: configure DB_RENDER_CONTROL on GFX11
- radv: do not emit non-existent CP_COHER_START_DELAY on GFX11
- radv: do not emit FLUSH_AND_INV_DB_META on GFX11
- radv: update TF_RING_size to a per-SE size on GFX11
- radv: update VRS registers on GFX11
- radv: apply a workaround for CB perf counters on GFX11
- radv: limit CP DMA to max 32KB sizes on GFX11
- radv: more register changes on GFX11
- radv: do not lower loading TESS/ESGS rings using the ABI for LLVM
- aco: use ac_is_llvm_processor_supported() for checking LLVM asm support
- radv: rename shader compile functions to spirv_to_nir/nir_to_asm
- radv: do not try to dump the NIR of the trap handler shader
- radv: add new pipeline helpers for NIR->ASM compilation
- radv: init states from pTessellationState at only one place
- radv: init states from pViewportState at only one place
- radv: init states from pRasterizationState at only one place
- aco: remove unnecessary intrinsics that are lowered at the ABI level
- radv: determine if shaders use NGG before linking
- radv: export implicit primitive ID in NIR for legacy VS or TES
- radv: constify radv_pipeline in more radv_pipeline_generate_XXX() helpers
- radv: rename radv_pipeline_generate_XXX() to radv_pipeline_emit_XXX()
- radv: remove couple of useless pCreateInfo pointers in the emit path
- radv: remove unused radv_get_conservative_raster_mode() helper
- radv: move gs_table_depth to the physical_device
- radv: cleanup using device/physical_device during pipeline creation
- radv: split radv_pipeline into radv_{graphics,compute,library}pipeline
- radv: introduce radv_graphics_pipeline_info and split existing info structs
- radv/ci: fix fails list for NAVI21
- radv: move HS info and task_num_entries to the physical device
- radv: rename radv_cmd_state::pipeline to graphics_pipeline
- radv: unify radv_pipeline_has_XXX() helpers
- radv: move streamout_shader to radv_graphics_pipeline
- radv: move active_stages to radv_graphics_pipeline
- radv: rename VERT_GRP_SIZE to VERTS_PER_SUBGRP on GFX11
- radv: do not emit more non-existent registers on GFX11
- radv: update framebuffer registers on GFX11
- radv: update VRS rates on GFX11
- radv: use the fragment resolve path by default on GFX11
- radv: report adjusted LDS size for fragment shaders on GFX11
- radv: fix configuring COLOR_INVALID on GFX11
- radv: fix VK_BLEND_FACTOR_CONSTANT_COLOR translation on GFX11
- radv: configure DB_Z_INFO.NUM_SAMPLES correctly on GFX11
- radv: disable VK_AMD_shader_fragment_mask on GFX11
- radv: disable attachmentFragmentShadingRate on GFX11
- radv: use pipeline->slab_bo in more places
- radv: fix writing buffer markers with non-zero memory offset
- radv: copy viewport/scissor when initializing radv_viewport_info
- radv: init states from pMultisampleState at only one place
- radv: init states from pDepthStencilState at only one place
- radv: init states from VkPipelineRenderingCreateInfo at only one place
- radv: use AMD values for computing blend related state
- radv: init states from pColorBlendState at only one place
- radv: init states from VkAttachmentSampleCountInfo at only one place
- radv: init states from VkPipelineDiscardRectangleState at only one place
- radv: init states from VkPipelineFragmentShadingRateState at only one place
- radv: ignore DYNAMIC_STENCIL_OP if stencil test isn't enabled
- radv: reduce radv_tessellation_info::patch_control_points to 8-bit
- radv: reduce radv_input_assembly_info::primitive_topology to 8-bit
- radv: reduce radv_rasterization_info::polygon_mode to 8-bit
- radv: reduce radv_discard_rectangle_info::count to 8-bit
- radv: fix lowering GS intrinsics if NGG is disabled per pipeline
- radv/ci: update list of failures for Pitcairn
- radv: enable radv_zero_vram for Hammerting
- radv/winsys: add get_chip_name() to the null winsys
- radv: report the marketing name as part of the device name
- radv: rename ngg_gs_state to ngg_query_state
- radv: track if primitives generated query features are enabled
- radv: add primitives_generated_query to the graphics pipeline key
- radv: declare the NGG query argument for primitives generated query
- ac/nir/ngg: count the number of generated primitives for VS and TES
- radv: flush the NGG query state when the argument is declared
- radv: add few helpers related to streamout
- radv: implement VK_EXT_primitives_generated_query
- radv: advertise VK_EXT_primitives_generated_query
- radv/ci: rename deqp-XXX jobs to vkcts-XXX
- zink/ci: skip two KHR-GL46 tests that timeout most of the time with RADV
- zink/ci: update list of expected failures with RADV
- radv: remove redundant check when importing vertex input info
- radv: update radv_is_vrs_enabled() to use radv_graphics_pipeline_info
- radv: update the check to determine if dynamic discard rectangle is needed
- radv: update the check to determine if dynamic sample location is needed
- radv: update the check to determine if rasterization is enabled
- radv: remove remaining unused pCreateInfo pointers
- ci: bump the hang-detection tool in the test image for Vulkan
- ci: fix passing down MESA_LOADER_DRIVER_OVERRIDE for zink-anv-tgl
- zink/ci: change the surface type used for dEQP
- ci/valve: split .b2c-test into .b2c-test-vk and .b2c-test-gl
- radv/ci: add GLES/GLCTS testing with Zink on NAVI10
- zink/ci: update list of failures again
- radv/ci: refactor test machines and dEQP rules to reduce copy&paste config
- ci: update vkd3d-proton builder/runner
- ci: print a message when vkd3d-proton returns successfully
- ci: add VKD3D_PROTON_RESULTS to the list of variables to pass down for testing
- radv/ci: add vkd3d-proton testing
- radv: advertise VK_EXT_border_color_swizzle on GFX10+
- ci: uprev vkd3d-proton to 39d07dea2cef34bfb3ed39741f026bc637e3eec4
- radv/amdgpu: make sure to reset the number of BO when there is no ranges
- radv/amdgpu: do not add "global" BO to the virtual BO list
- aco: fix validation of SOP1 instructions without definitions
- radv,aco: rename radv_prolog_binary/radv_shader_prolog to shader_part
- radv/llvm: always emit a null export even if the FS doesn't discard
- radv: disable small primitive culling for user sample locations
- radv: use RADEON_FLAG_VA_UNCACHED for the trace BO
- radv: fix command line for dumping waves with UMR
- radv: dump UMR waves before UMR rings
- radv: do not emit SQTT user data packets on the SDMA queue
- radv/ci: add CI lists for LLVM on NAVI21
- radv: use LOAD_CONTEXT_REG to load the opaque buffer size on GFX10+
- radv: do not abort if SPM isn't supported for the current GPU
- radv: remove old workaround for HTILE layers with F1 2021
- zink/ci: update list of failures for piglit with RADV
- radv: disable DCC for Melty Blood Actress Again Current Code
- radv: always enable VK_EXT_debug_utils
- zink/ci: update list of failures
- radv/ci: update list of failures against CTS 1.3.3.0
- aco: fix load_barycentric_at_sample without MSAA
- radv: fix dumping VS prologs assembly
- radv/ci: enable fossils testing for GFX1100
- radv: remove the radv_report_apu_as_dgpu workaround for Red Dead Redemption 2
- radv: fix wide points/lines by configuring the guardband correctly
- zink/ci: update list of CTS flakes for RADV
- ci: uprev vkd3d-proton to 5b73139f182d86cd58a757e4b5f0d4cfad96d319
- radv,aco: track if a fragment shader needs an epilog
- radv,aco: introduce {radv,aco}_ps_epilog_key
- radv: declare a new user SGPR arg in FS for the epilog PC
- radv: add a function that declares PS epilog shader arguments
- aco: add new pseudo instruction p_jump_to_epilog
- aco: emit p_jump_to_epilog if the main fragment shader has an epilog
- aco: do not abort if the FS doesn't export anything but has an epilog
- aco: prevent adding DONE/VM to the last export if the FS has an epilog
- aco: add aco_postprocess_shader() helper
- radv,aco: rename radv_aco_build_prolog to radv_aco_build_shader_part
- aco: refactor export_fs_mrt_color() for PS epilogs preparation
- aco: add support for compiling PS epilogs
- radv: implement PS epilogs
- radv: disable viewport depth clamping only when necessary
- aco: requires Exact for p_jump_to_epilog
- radv: only force 1x sample for Bresenham lines when pipeline draws lines
- vulkan: add support for VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT
- radv: implement VK_EXT_attachment_feedback_loop_layout
- radv: ignore out-of-order rasterization if stencil write mask is dynamic
- radv: fix gathering XFB info if there are dead outputs
- radv: fix cleaning the meta query state if an error occurred
- radv: re-emit viewports if negative one to one or depth clamp mode changed
- radv: fix bogus assertion with RADV_FORCE_VRS
- radv: fix pipelineStageCreationFeedbackCount when it's 0

Sarah Walker (2):

- pvr: Update for firmware 1.17@6256262
- pvr: Move BRN 44079, 48492 and 66011 code into pvrsrvkm specific directory

Sathishkumar S (2):

- radeon/vcn: skip create and destroy message for jpeg
- radeon/vcn: engage all available jpeg engines

Sebastian Keller (1):

- egl/wayland: Don't try to access modifiers u_vector as dynarray

Sergi Blanch Torne (1):

- ci/crosvm: clean crosvm socket

Sidney Just (19):

- wgl: add a flag to determine if running on zink
- wgl: add zink to the list of auto-loaded drivers
- zink: support VK_KHR_win32_surface
- kopper: add win32 loader interface
- zink: support win32 wsi
- wgl: support kopper
- gallium: support for implementing EXT_external_objects_win32
- mapi: added EXT_external_objects_win32 definitions
- gallium/noop: implement win32 device LUID and node mask getters
- gallium/trace: implement win32 device LUID and node mask getters
- mesa: implement GL_DEVICE_LUID_EXT and GL_DEVICE_NODE_MASK_EXT getters
- mesa: implement import win32 memory and semaphore
- mesa: pipe cap for EXT_memory_object_win32 and EXT_semaphore_win32
- zink: codegen for Win32 external object extensions
- zink: implement win32 fence import
- zink: implement win32 memory handle import
- zink: implement device LUID and node mask
- zink: enable pipe cap for win32 external memory and fences
- docs: updated zink features and release notes

Sil Vilerino (39):

- gallium/vl: Add software winsys and offscreen winsys
- util/u_format: Drop assert that has valid/well-defined behavior
- util/vl_vlc: Support compiling in C++
- st_vdpau: Pass format when opening resource from handle in st_vdpau_resource_from_description
- d3d12: Add .clang_format file
- gallium/va/radeonsi: Using private as a parameter name conflicts with C++ keywords
- gallium va: VaDeriveImage support stride/offset being different for NV12 planes
- gallium: Add values to pipe_video_cap for multi-slice and multi-reference encode
- gallium va: Handle new VA attributes with new pipe video caps
- gallium: Add multiple slice support to pipe_h264_enc_picture_desc
- gallium va: Add support for multiple slices encoding
- gallium radeon/r600/omx/va: Adds support for multiple reference encoding
- gallium: Add MinLumaBiPredSize8x8 to pipe_h264_sps, pic_init_qs_minus26 to pipe_h264_pps
- gallium omx: Fill out MinLumaBiPredSize8x8 and pic_init_qs_minus26
- gallium vdpau: Fill out level_idc and MinLumaBiPredSize8x8
- gallium va: VaHandlePictureParameterBufferH264 fill out new pipe params MinLumaBiPredSize8x8, pic_init_qs_minus26, chroma_format_idc, bit_depth_chroma, bit_depth_chroma_minus8
- d3d12: Add d3d12_promote_to_permanent_residency
- ci: Update x86 debian build to pick up tag v1.602.0-r1 from DirectX-Headers
- d3d12: Add video decode implementation of pipe_video_codec
- d3d12: Add video encode implementation of pipe_video_codec
- d3d12: Add create_video_codec and create_video_buffer entrypoints
- d3d12: Add util video functions to d3d12_format
- d3d12: Improve planar resource support to handle video requirements
- d3d12: Add support for d3d12 video in d3d12_screen
- d3d12: Add GLSL singleton refcounting to screen for offscreen non-GL rendering
- meson: Support d3d12 as a video-supporting driver
- winsys: Do not use Display type when X11 is not present in build
- meson: Update DirectX-Headers dependency to 1.602.0 version with fallback to the wrap
- meson: Add build option for gallium-d3d12-video feature
- d3d12: Align output buffer offset access to D3D12_FEATURE_DATA_VIDEO_ENCODER_RESOURCE_REQUIREMENTS.CompressedBitstreamBufferAccessAlignment
- d3d12: Fix debug_printf format strings for cross platform builds
- d3d12: Video - Remove unused spCopyQueues from enc/dec objects
- pipe/video: Add PIPE_VIDEO_ENTRYPOINT_PROCESSING
- vl: Replace usage of entrypoint UNKNOWN with PROCESSING for VP
- va: Replace usage of entrypoint UNKNOWN with PROCESSING for VP
- gallium/video: Add video post processing interface
- va: Add support for VPP rotation, flip, alpha blend, crop, scaling
- d3d12: Add pipe_video_codec::process_frame implementation
- d3d12: Enable VPP rotation, flip, alpha blend, crop, scaling via pipe_video_codec::process_frame

SoroushIMG (4):

- zink: Fix BO size when it's not aligned to 16 bytes
- zink: Fix spirv stream 0 vertex emit for multistream shaders
- Zink: Fix clear being missed when using emulated draws in zink_blit
- zink: Fix incorrect emission of SPIR-V shift ops

SureshGuttula (4):

- radeonsi: Set display_remote for non-referenced frames
- radeonsi/vcn : update enc->dpb ref_use for index 0
- Revert "radeonsi: Set display_remote for non-referenced frames"
- Revert "radeon: hardcode uvd/vce encoder not_referenced value to false"

Sviatoslav Peleshko (7):

- mesa: flush bitmap caches when changing scissors or window rects state
- anv: workaround apps that assume full subgroups without specifying it
- intel/blorp/gen6: Set BLEND_STATEChange only if emitting the blend state
- anv: Dirty all dynamic state bits when creating command buffer state
- intel/blorp: Dirty depth bounds dynamic state bits after blorp
- intel/nullhw: Use correct macro to fix build regression
- iris: Always initialize shader compilation queue ready fence

TSnake41 (1):

- zink: print result code string on vulkan failure

Tapani Pälli (5):

- isl: disable mcs (and mcs+ccs) for color msaa on DG2
- anv: use anv_cmd_dirty_mask_t type for dynamic state
- intel/fs: setup SEND message descriptor from nir scope
- iris: implement Wa_14015264727 for DG2
- anv: implement Wa_14015264727 for DG2

Tatsuyuki Ishi (9):

- radv: Fix redundant subpass barriers due to erroneous comparison
- radv/ci: skip dEQP-VK.fragment_operations.transient_attachment_bit
- radv/ci: Move transient_attachment_bit from fail to skip list
- radv/ci: skip image.sample_texture.*_compressed_format*
- radv/ci: Move sample_texture.*_compressed_format_* to faillist for gfx<=9
- radv: Fix vkCmdCopyQueryResults -> vkCmdResetPool hazard.
- amd: Revert gfx10 addrlib changes
- radv: Only set pstate for the first hw_ctx.
- radv: Implement radv_flush_before_query_copy to workaround UE Vulkan bugs.

Thomas Debesse (2):

- gallium/clover: LLVM setLangDefaults moved from clangFrontend to clangBasic
- gallium/clover: pass -no-opaque-pointers to Clang

Tiago Koji Castro Shibata (2):

- d3d12: fallback to integrated adapter instead of arbitrary one
- d3d12: add more formats to supported conversions

Timothy Arceri (71):

- glsl/st: vectorise interfaces of SSO shader programs
- nir: fix setting varying from uniform as flat
- nir: fix sorting before assigning varying driver locations
- nir: always set the exact_trip_count_unknown loop terminator property
- nir: remove unreachable loop terminators
- svga: remove duplicate nir compile options
- nir/glsl: add glsl_record_compare() wrapper
- nir/glsl: wrapper contains_{double,integer}()
- nir/glsl: wrapper field_index()
- nir/glsl: wrap component_slots_aligned()
- glsl/mesa: move parse_program_resource_name() to common linker_util code
- glsl: add new build program resource helpers
- nir: add variable data fields required for NIR glsl varying linking
- nir: abort io info gathering if location is not set or is a temp value
- nir: skip lowering io to scalar for must_be_shader_input
- glsl: implement lower_packed_varyings() as a NIR pass
- glsl: add a NIR based varying linker
- glsl: implement opt_dead_builtin_varyings() as a NIR pass
- glsl: implement lower_xfb_varying() as a NIR pass
- glsl: enable the use of the nir based varying linker
- glsl: remove now unused GLSL IR varying linker code
- glsl: simplify finding cursor in varying packing code
- nir: add support for forced sampler indirect loop unrolling
- gallium/drivers: set force_indirect_unrolling_sampler for all required drivers
- glsl: move validation of sampler indirects to the nir linker
- i915g: use a valid setting for force_indirect_unrolling
- softpipe: switch to NIR loop unrolling
- asahi: switch to NIR loop unrolling
- d3d12: switch to NIR loop unrolling
- etnaviv: switch to NIR loop unrolling
- i915g: switch to NIR loop unrolling
- r300: disable GLSL IR loop unrolling
- r600: switch to NIR loop unrolling
- vc4: disable GLSL IR loop unrolling
- nir/i915g/r300/nv30: skip marking varyings as flat in some drivers
- etnaviv: assert if etna_shader_io_file reg overrun
- etnaviv: vectorise io
- nir/gcm: fix pushing instructions into if blocks
- ci: uprev piglit 2022-05-31
- nir: add nir based version of the lower_const_arrays_to_uniforms pass
- glsl/nir: skip adding hidden uniforms to the remap tables
- glsl/nir: allow the nir linker to remove dead uniforms we created
- glsl: move common link time optimisation calls to linker code
- glsl: move gl_nir_link_opts() call out of the st code
- glsl: switch to NIR based implementation of lower_const_arrays_to_uniforms()
- glsl: remove now unused lower_const_arrays_to_uniforms()
- gallivm: disable GLSL IR loop unrolling in LLVMPIPE
- freedreno/ir3: tidy up duplication of common nir options
- freedreno: switch to NIR loop unrolling
- lima: lower all undefs to zero in vs
- lima: fixup nir indirect unroll options to match gallium CAP
- lima: switch to NIR loop unrolling
- nouveau/nv30: disable GLSL IR loop unrolling
- nouveau/nv50: disable GLSL IR loop unrolling
- nouveau/nvc0: disable GLSL IR loop unrolling
- svga: disable GLSL IR loop unrolling
- gallium: remove PIPE_SHADER_CAP_MAX_UNROLL_ITERATIONS_HINT CAP
- glsl: remove the now unused GLSL IR loop unrolling code
- glsl: drop extra optimise swizzles call
- glsl: tidy up link_varyings_and_uniforms()
- glsl: remove never true do_dead_code() parameter
- glsl: inline do_common_optimization() call
- util: add dri config option to disable GL_MAP_UNSYNCHRONIZED_BIT
- util: use force_gl_map_buffer_synchronized workaround with RAGE
- glsl: merge lower_buffer_access with lower_shared_reference
- glsl: drop non-nir path for atan in builtin functions
- st/glsl: fix broken vertex attrib mapping
- nir/loop_unroll: clean up after complex_unroll_single_terminator()
- glsl: correctly track cross slot component packing
- Revert "nir: Preserve offsets in lower_io_to_scalar_early"
- glsl: don't lower precision for textureGatherOffsets

Timur Kristóf (119):

- radv: Minor formatting fix in radv_device.c file.
- radv: Move queue submit sparse bindings to a separate function.
- radv: Move empty queue submit code path to a separate function.
- radv: Move normal (non-empty) queue submit to a separate function.
- radv: Move up early exit for transfer queues in radv_get_preamble_cs.
- radv: Simplify some boolean code in radv_get_preamble_cs.
- radv: Initialize BO pointers when creating preambles.
- radv: Refactor cache flush code for the initial preambles.
- radv: Don't create continue preamble when it's not needed.
- radv: Rename fill_geom_tess_rings to radv_fill_shader_rings.
- radv: Don't use pointers to pointers when updating the preambles.
- radv: Only emit what is appropriate to the queue family for preambles.
- ac/nir: Add ac_nir_load_arg helper for shader arguments.
- radv: Move radv_nggc_settings enum out of radv_cmd_buffer.
- nir: Add upper bound for AMD shader arg intrinsics.
- radv: Lower ABI in NIR for tess/ESGS/NGG shader arguments.
- ac/nir: Remove now-superfluous ac_nir_lower_tess_to_const.
- aco: Remove now-superfluous intrinsics.
- ac/llvm: Remove now-superfluous intrinsics and ABI callbacks.
- radv/amdgpu: Group queue submit info into a structure.
- radv/amdgpu: Initialize semaphore info with a designated initializer.
- radv/amdgpu: Pass new queue submit info structure to internal function.
- radv/amdgpu: Pass correct struct type instead of repeating the cast.
- radv/amdgpu: Use scheduled dependency for submitting to multiple queues.
- ac/nir: Add I/O lowering for task and mesh shaders.
- ac: Add task ring entry shader argument.
- radv: Add radv_pipeline_has_task helper.
- radv: Set user data register for task shaders.
- radv: Set wave size for task shaders.
- radv: Fill task shader info.
- radv: Add task shader arguments.
- radv: Add task ring entry argument for mesh shaders.
- radv: Implement task shader intrinsics in the ABI.
- radv: Enable nir_opt_offsets for task shaders.
- radv: Use I/O lowering for task and mesh shaders.
- radv: Postprocess task shader configuration.
- radv: Allow linking task shaders.
- radv: Lower shared and task_payload variables in task/mesh shaders.
- radv: Print task shader stage name before disasm.
- radv: Fix task shader info.
- radv: Fix loading task shader ring buffer addresses.
- aco: Fix scratch with task shaders.
- radv: Introduce ring info structure for queues, refactor preamble generation.
- radv: Use a ws variable during preamble creation.
- radv: Pass radv_device pointer to some functions.
- radv: Refactor queue state to separate structure.
- radv: Clarify emitting graphics shader pointers.
- nir: Add explicit task payload atomic intrinsics.
- radv: Implement conditional rendering for async compute queue.
- ac: Remove trailing white space.
- aco: Remove trailing whitespace.
- radv: Remove trailing whitespace.
- radv: Disable predication for subpass clear and image clears.
- nir: Add new launch_mesh_workgroups intrinsic.
- nir: Add common task shader lowering to make the backend's job easier.
- radv: Add mesh and task stage names to pipeline executable properties.
- nir: Keep track of cross-invocation mesh shader output access.
- radv, ac/nir: Fix multiview layer export for mesh shaders.
- ac/nir/taskmesh: Use task shader lowering from common NIR code.
- ac/nir/taskmesh: Use 3 dimensional workgroup ID.
- ac/nir: Add remappability to tess and ESGS I/O lowering passes.
- ac: Add RDNA2 task+mesh shader draw packet opcodes.
- ac: Add task shader ring information.
- radv: Implement task shader draw and payload rings.
- nir: Introduce new intrinsics for AMD specific mesh shader task ring.
- ac/nir/ngg: Clean up mesh shader output LDS layout.
- ac/nir/ngg: Use mesh shader scratch ring when outputs don't fit LDS.
- ac/nir/ngg: Use variables for outputs without cross-invocation access.
- radv: Implement mesh shader scratch ring.
- radv: Enable NGG wave ID for mesh shader scratch ring.
- ac/nir/ngg: Add mesh shader workgroup index.
- radv: Lower mesh shader 3D workgroup ID to 1D index.
- ac/nir/taskmesh: Preserve workgroup ID Y and Z when applying firstTask.
- radv: Always use 3D block ID and grid size in task shaders.
- gallium/u_blitter: Fix depth.
- zink: Enable the VK_EXT_depth_clip_enable extension.
- zink: Always enable depth clamping, make depth clipping independent.
- nir/lower_task_shader: don't use base index for shared memory intrinsics
- ac/nir/ngg: Refactor LDS instructions in NGG GS vertex emit and export.
- ac/nir/ngg: Ignore driver location for mesh shader outputs.
- radv: Don't assign driver locations to mesh shader outputs.
- radv: Add CULL_PRIMITIVE to special output mask.
- radv: Use NIR optimization to move discards to the top.
- radv: Only initialize DGC state when DGC is enabled.
- radv: Remove trailing whitespace introduced by DGC commits.
- radv: Refactor predication for compute queues.
- radv: Refactor some CP DMA functions to work with radeon_cmdbuf.
- radv: Refactor radv_emit_userdata_address to work with radeon_cmdbuf.
- radv: Refactor radv_emit_descriptor_pointers to work with radeon_cmdbuf.
- radv: Refactor radv_emit_inline_push_consts to work with radeon_cmdbuf.
- radv: Refactor view index emit to use a per-stage function.
- radv: Move inline push constants to a new function.
- radv: Minor cleanup of radv_queue_submit_normal.
- radv: Allow reusing pipeline compute state emit functions.
- radv: Add dispatch_initiator_task field to radv_device.
- radv: Create internal cmdbuf when a graphics pipeline needs compute.
- radv: Flush descriptors and push constants for task shaders.
- radv: Introduce radv_before_taskmesh_draw.
- radv: Implement mesh shading draw calls with task shaders.
- radv: Synchronization for task shaders.
- radv: Support task shaders in secondary cmd buffers.
- radv: Workaround MEC taskmesh dispatch hang when count buffer has zero.
- radv: Submit internal compute cmdbuf.
- radv: Copy BO list to ACE internal CS.
- radv: Enable task shader feature for NV_mesh_shader.
- ac/nir/ngg: Move primitive ID workgroup barrier to proper place.
- aco: Remove hack for primitive ID export.
- ac/nir/ngg: Create output variable for primitive ID export.
- ac/llvm: Implement load_num_subgroups for NGG shaders.
- ac/llvm: Add LLVM bug workaround to ac_build_mbcnt_add.
- ac/llvm: Use gs_prim_id for NGG VS.
- ac/nir/ngg: Copy comment about LDS layout for NGG GS.
- ac/nir/ngg: Remember proper bit sizes of GS output variables.
- ac/llvm: Implement GDS atomic add NIR intrinsic.
- radv/llvm: Remove incorrect hardcoded workgroup size from NGG GS.
- radv: Properly set LDS size for LLVM NGG shaders.
- radv: Use NIR-based NGG lowering with LLVM.
- aco: Fix p_init_scratch for task shaders.
- nir/gather_info: Clear cross-invocation output mask.

Tomeu Vizoso (4):

- lima/ci: Disable lima-mali450-piglit-gpu:arm64
- panvk/ci: Disable CI for a while
- ci: Disable jobs to the Collabora lab
- Revert "ci: Disable jobs to the Collabora lab"

Vadym Shovkoplias (4):

- anv: Fix geometry flickering issue when compute and 3D passes are combined
- driconf: Add a limit_trig_input_range option
- drirc: Set limit_trig_input_range option for the Wolfenstein Youngblood
- drirc: Set limit_trig_input_range option for glmark2

Vasily Khoruzhick (7):

- lima: Implement texture_barrier
- lima: enable PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT
- lima: Lower nir_op_idiv with other scalar math operations.
- u_transfer_helper: flush temporary transfer first for MSAA
- lima: implement lima-specific blitter
- lima: wire up MSAA 4x support
- lima: fix buffer overallocation for index, vertex and constant buffers

Victor Hermann Chiletto (1):

- radv: always check entry count in descriptor pool when allocating

Viktoriia Palianytsia (1):

- anv: Add workaround for sample mask with multisampling

Vinson Lee (14):

- panfrost: Fix memory leaks on unit test failures.
- zink: Fix memory leak on error path.
- radeonsi: Move NULL check before dereference.
- dri2: Fix memory leak.
- d3d12: Initialize d3d12_video_encoder_bitstream member m_uiOffset.
- amd: Initialize Gfx11Lib members in constructor.
- d3d12: Remove unnecessary NULL check.
- clc: Fix build with llvm-15.
- microsoft/spirv_to_dxil: Fix missing-prototypes errors.
- radv: Remove unnecessary null check.
- microsoft/compiler: Fix assert.
- tu: Check dereferenced value of rop_reads_dst.
- aco: Initialize spill_ctx members in constructor.
- r600/sfn: Initialize TestShaderFromString member m_instr_factory.

Vlad Zahorodnii (2):

- winsys/amdgpu-radeon: Allow specifying context priority
- radeonsi: Add support for EGL_IMG_context_priority

Yevhenii Kolesnikov (1):

- nir: Remove single-source phis before opt_if_loop_last_continue

Yiwei Zhang (51):

- venus: override aspectMask for internal tiling modifier
- venus: use linear modifier for legacy common wsi path
|
||
- venus: fix view format for ahb image
|
||
- venus: refactor to add struct vn_env
|
||
- venus: add env perf options and introduce no_async_set_alloc
|
||
- venus: add VN_PERF option no_async_buffer_create
|
||
- venus: add VN_PERF option no_async_queue_submit
|
||
- venus: refactor vn_cmd_submit
|
||
- venus: flush when batched draw calls reach a threshold
|
||
- venus: disable VK_EXT_extended_dynamic_state2
|
||
- venus: refactor android gralloc pieces
|
||
- venus: cache front_rendering_usage bit at gralloc init
|
||
- venus: update vn_GetSwapchainGrallocUsage2ANDROID for shared present
|
||
- venus: enable ANB shared presentable image prop
|
||
- venus: add VN_RELAX_BASE_SLEEP_US env var for tuning cpu utils
|
||
- venus: log performance envs upon instance creation with debug init
|
||
- venus: extend buffer cache to cover layering usage
|
||
- vulkan: remove the VkPhysicalDeviceMemoryProperties workaround
|
||
- radv: remove VkPhysicalDeviceMemoryProperties2 workaround
|
||
- venus: use common entry point for VkPhysicalDeviceMemoryProperties
|
||
- venus: resolve AHARDWAREBUFFER_FORMAT_Y8Cb8Cr8_420 to external format
|
||
- venus: fix a bug on buffer cache init failure path
|
||
- venus: fix the queue init failure path
|
||
- venus: fix a missing mtx_destroy in vn_device_init
|
||
- venus: adjust device resources init order
|
||
- venus: let device track queue families used for device creation
- venus: add vn_feedback_pool backed by coherent buffer (part 1)
- venus: add vn_feedback_pool backed by coherent buffer (part 2)
- venus: add feedback check/set/reset helpers
- venus: add event feedback
- venus: add NO_FENCE_FEEDBACK perf option and disable sparse resource
- venus: create per queue family command pool for feedback cmd
- venus: renderer to store allow_vk_wait_syncs capset
- venus: add fence feedback
- venus: resolve YV12 AHB to VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM
- venus: use narrow range to match up with mesa EGL
- venus: swizzle the chroma channels for YVU420 to match the VkFormat
- venus: GetPhysicalDeviceSparseImageFormatProperties to match sparse support
- venus: suballocate more for layering
- venus: add more tracepoints for perf analysis
- venus: filter out VK_EXT_physical_device_drm on the driver side
- venus: check dynamic state for raster enablement
- venus: reenable VK_EXT_extended_dynamic_state2
- Revert "venus: suballocate more for layering"
- anv: enable VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM for modifier support
- venus: fix external memory ext filtering
- venus: avoid feedback for external fence
- venus: allow no external memory handle when renderers lacks support
- zink: fix zink_create_fence_fd to properly import
- zink: fix in-fence lifecycle
- venus: ignore pInheritanceInfo if not secondary command buffer

Yogesh Mohan Marimuthu (12):

- radeonsi/gfx11: make flat_scratch changes for graphics
- radeonsi/gfx11: make flat_scratch changes for compute
- radeonsi/gfx11: interp changes for 32bit
- radeonsi/gfx11: interp changes for 16bit
- radeonsi/gfx11: instruction cache line size is 128 bytes
- radeonsi/gfx11: program inst_pref_size for compute
- radeonsi/gfx11: program db render control register
- radeonsi/gfx11: export alpha through mrtz for alpha-to-coverage if mrtz is there
- ac,radeonsi/gfx11: swizzle MRT0/1 for dual source blending
- vulkan/wsi: fix extra free if buffer_blit_queue
- radeonsi: remove tabs from code
- radeonsi: no need to call si_pm4_clear_state() in si_pm4_free_state()

Yogesh Mohanmarimuthu (1):

- radeonsi/gfx11: use PIXEL_PIPE_STATE_DUMP event instead of ZPASS_DONE

Yonggang Luo (119):

- zink: Remove redundant framebuffer_mtx from zink_screen.h
- win32: Fixes 32 bits visual studio module definition files by add script gen_vs_module_defs.py
- win32: Do not use BUILD_GL32, we use def file to export win32 dll symbols.
- misc: Replace \`#ifdef\\t__cplusplus` with \`#ifdef\\s\\s__cplusplus`
- misc: replace \`#ifdef\\s\\s__cplusplus` with \`#ifdef\\s__cplusplus`
- zink: Trim spaces in zink.rst
- zink: Improve zink.rst with links
- st: Improve _mesa_error about GL_OUT_OF_MEMORY in st_cb_texture.c
- llvmpipe: Do not use _Atomic keyword that doesn't support by MSVC
- radv: fixes msvc compiling error
- radv: Fixes compiling error with msvc
- radv: Add losing member hw_ip_version_major and hw_ip_version_minor for win32
- radv: Add macros for win32 that accessed
- nir: Fixes MSVC compiling error about unused variable \`_`
- egl/wgl: On win32, there is no support for EGL_EXT_device and EGL_EXT_platform_device
- util: Fixes test_util_get_process_exec_path on windows host with msys2/mingw
- util: Fixes typo in test_util_get_process_exec_path
- ci/vs2019: Clear CI_COMMIT_MESSAGE and CI_COMMIT_DESCRIPTION for please meson
- radv: Remove usage of \`cnd_monotonic.h`
- d3d12: Fixes d3d12 compiling errors in \`D3D12ResourceState.h` for mingw
- d3d12: Move shared code that include d3d12 headers into d3d12_common.h
- d3d12: Fixes compiling error by include \`<unknwn.h>`
- d3d12: \`#include <windows.h>` instead \`#include <Windows.h>` for building under linux with mingw
- d3d12/wgl/test: Fixes wgl_tests.cpp for d3d12 with mingw
- dzn: Fixes compiling error when build with msys2/mingw
- dzn: Fixes compiling error by include \`<unknwn.h>`
- microsoft/clc: Fixes compiling of microsoft clc with mingw
- docs: Fixes dispatch.rst with __THREAD_INITIAL_EXEC
- meson: Fixes timespec_get detect code
- c11/threads: Remove the detecting code for \`HAVE_TIMESPEC_GET` in threads_win32.h
- c11: Implement c11/time.h with c11/impl/time.c
- c11: Implement timespec_get on win32 properly when not available
- util: Use timespec_get directly, it's always present
- util: Remove usage of designated initializers in timespec_test.cpp
- util: always enable timespec_test.cpp
- util: Remove unused function timespec_passed
- util: Add unittest for timespec_get
- ci/vs2019: There is no meaning to set $env:ErrorActionPreference
- ci/vs2019: switch to powershell 7
- ci/vs2019: Update editorconfig for ps1 files
- ci/vs2019: Give each option its own line in mesa_build.ps1
- ci/vs2019: Rename to vsInstallPath
- ci/vs2019: Do not install subprojects
- ci/vs2019: Ignore cert install output, it's too long
- ci/vs2019: Ignore error when removing directory recursively
- ci/vs2019: Disable progress when install python
- ci/vs2019: Disable progess bar for pip3 install
- ci/vs2019: Use shared VULKAN_SDK_VERSION for install and build vulkan related packages
- ci/vs2019: Add a dedicated script for entering VS dev shell
- ci/vs2019: Split call to C:\\vs_buildtools.exe into multiple lines
- ci/vs2019: Show the total vs2019 install time
- ci/vs2019: Reduce the vs2019 build tools components to be installed
- ci/vs2019: ignore _build and _install directory by write * into .gitignore
- ci/vs2019: Install all build artifacts into C:\\mesa-deps
- ci/vs2019: Split choco install output
- ci/vs2019: Split install of vs2019 and choco into separate Docker image
- ci/vs2019: Remove comment for windows_test_vs2019 that not true
- ci/vs2019: Convert mesa_build.ps1 to using PowerShell 7 && operator
- microsoft/compiler: Include stddef.h before using size_t
- microsoft/compiler: Fix error from double extern
- dxil: Fixes compiling dxcapi.h with mingw.
- dxil: Sync dxil_spirv_shader_stage with gl_shader_stage
- dxil: Fixes warning about comparing different enum value and enum literal
- microsoft/spirv_to_dxil: Delete unused local variable
- microsoft/clc: Delete unused local variables
- microsoft/clc: Add missing void to no-parameter function signature
- microsoft/spirv_to_dxil: Fixes maybe-uninitialized compiling error in dxil_spirv_nir.c
- ci: Building spirv-to-dxil on both mingw and debian/x64
- intel: using C++11 keyword thread_local
- glx: using C++11 keyword thread_local
- util: Define WIN32_LEAN_AND_MEAN before include of \`windows.h` in u_thread.h
- c11: Move the implementation of threads.h into c source code
- c11: Fixes memory leak of pack in thrd_create on win32
- c11: Implement thread_local in c11/threads.h
- util/c11: Update function u_thread_create to be c11 conformance
- meson: Using get_argument_syntax as the \`--compiler_id` option for gen_vs_module_defs.py
- meson: Use cc.get_argument_syntax instead cc.get_id when possible.
- meson/rtti: Using get_argument_syntax to check msvc compatible compiler
- meson/clover: Getting clover can be built with no-rtti llvm in mingw
- ci/vs2019: Upgrade Windows 10 SDK version to 20348
- docs: Update windows build requirement with Visual Studio
- d3d12: Use static_cast instead of dynamic_cast in d3d12_video_enc_h264.cpp
- d3d12: Convert #include <Windows.h> to #include <windows.h> for mingw on linux
- d3d12: Fixes compiling error in d3d12/wgl/d3d12_wgl_framebuffer.cpp with gcc
- microsoft/clc: Fixes narrowing error in clc_compiler_test.cpp with mingw/gcc
- microsoft/clc: Disable clc_compiler_test on non-windows platform
- dzn: Fixes incompatible pointer type error
- ci/x86_build: Getting pushd popd be paired, avoid using cd
- ci: Prepare the container for building all mesa components with mingw under linux
- ci: Trigger the new mingw/linux dockers to be build
- d3d12: Turn d3d12_format.h to include d3d12_common.h
- ci: Building all mesa functional with mingw on debian
- meson: Enable wgl tests on mingw
- CODEOWNERS: Update c11 code owners
- CODEOWNERS: evelikov renamed to xexaxo
- gallium: Move -DHAVE_PIPE_LOADER_DRI and -DHAVE_PIPE_LOADER_KMS to be pre_args
- gallium: Do not call to pipe_loader_sw_probe_kms when not present
- gallium: Rename macros HAVE_PIPE_LOADER*
- gallium: Disable dri2 interface on OSX
- glx: __glX_tls_Context should use __THREAD_INITIAL_EXEC
- util: thread_local doesn't support to be exported on OSX
- glx: Convert tab to space in applegl_glx.c
- dri: Trim trailing space in dri/dri_util.*
- ci: Building osmesa on OSX
- util: Disable tests/sparse_array_test.cpp on MacOS as it's too slow
- mapi: Trim trailing spaces in stub.c and u_current.c
- meson: Remove usage of use_elf_tls
- egl: Remove usage of USE_ELF_TLS macro
- mapi: Remove usage of USE_ELF_TLS
- loader: Remove usage of USE_ELF_TLS in loader.c
- glx: Remove usage of USE_ELF_TLS
- util: Remove usage of USE_ELF_TLS in u_thread.h
- meson: Remove pre_args += '-DUSE_ELF_TLS'
- glx: Remove usage of pipe_tsd_set and pipe_tsd_get
- auxiliary: Remove pipe_tsd
- d3d12: Fixes compile error with mingw/gcc-x64 when static linkage to runtime library
- microsoft/clc: Fixes compiling errors with clang/mingw64 in clc/clc_compiler_test.cpp
- util: Fixes invalid assumption that return non null by function util_format_fetch_rgba_func
- c11: #include <threads.h> when the os/platform provide it

Yurii Kolesnykov (2):

- Remove Travis CI config
- Init macOS GitHub Action

Zack Rusin (4):

- svga: Don't try to build x86/x64 assembly on different arch's
- svga: finish readbacks before mapping resources
- svga: Use direct maps when GB objects are present
- svga: Add support for SVGAv3

Zhang, Jianxun (1):

- iris: Wa_14016820455 for GFX_VERx10 == 12.5

illiliti (1):

- Use proper types for meson objects

jheaff1 (1):

- build(glx): Fix build by adding missing deps

naveen (1):

- github/ci: Set permissions for GitHub actions

newbluemoon (1):

- nine: replace ulimit with sysconf call

prashanth (1):

- vulkan/device_select: force default device if environment variable set

sjfricke (2):

- isl: fix bug where sb.MOCS is not being set
- anv: fix assert to build with shader cache disabled

xperia64 (1):

- wgl: Add driver_zink as a dependency of the wgl frontend

İlhan Atahan (1):

- Add Adreno 616 and 620 to use turnip on these GPU's .