mirror of
https://gitlab.freedesktop.org/mesa/mesa.git
synced 2025-01-10 15:54:58 +08:00
4382 lines
209 KiB
ReStructuredText
4382 lines
209 KiB
ReStructuredText
Mesa 22.0.0 Release Notes / 2022-03-09
|
|
======================================
|
|
|
|
Mesa 22.0.0 is a new development release. People who are concerned
|
|
with stability and reliability should stick with a previous release or
|
|
wait for Mesa 22.0.1.
|
|
|
|
Mesa 22.0.0 implements the OpenGL 4.6 API, but the version reported by
|
|
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
|
|
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
|
|
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
|
|
4.6 is **only** available if requested at context creation.
|
|
Compatibility contexts may report a lower version depending on each driver.
|
|
|
|
Mesa 22.0.0 implements the Vulkan 1.2 API, but the version reported by
|
|
the apiVersion property of the VkPhysicalDeviceProperties struct
|
|
depends on the particular driver being used.
|
|
|
|
SHA256 checksum
|
|
---------------
|
|
|
|
::
|
|
|
|
e6c41928b5b9917485bd67cec22d15e62cad7a358bf4c711a647979987601250 mesa-22.0.0.tar.xz
|
|
|
|
|
|
New features
|
|
------------
|
|
|
|
- lavapipe,radv,anv KHR_dynamic_rendering
|
|
- radv EXT_image_view_min_lod
|
|
- VK_KHR_synchronization2 on RADV.
|
|
- OpenSWR has been moved to the Amber branch
|
|
- radeonsi, zink ARB_sparse_texture
|
|
- d3d12 GLES3.1 (shader storage buffers, images, compute, indirect draw, draw params, ARB_framebuffer_no_attachments, ARB_sample_shading, and GLSL400)
|
|
- radeonsi, zink ARB_sparse_texture2
|
|
- zink EXT_memory_object, EXT_memory_object_fd, EXT_semaphore, EXT_semaphore_fd
|
|
- anv VK_VALVE_mutable_descriptor_type
|
|
- Vulkan 1.3 on RADV,Anv.
|
|
- radeonsi, zink ARB_sparse_texture_clamp
|
|
|
|
|
|
Bug fixes
|
|
---------
|
|
|
|
- anv: Unable to import I915_FORMAT_MOD_Y_TILED surfaces produced by iHD vaapi driver
|
|
- anv doesn't always resolve aux buffers with private bindings on transition to external queue
|
|
- Redraw freeze after upgrade to Xwayland 21.1.3
|
|
- [ANV] Rendering corruption in DOOM Eternal
|
|
- lavapipe: dEQP-VK.spirv_assembly.instruction.compute.float16.arithmetic_3.step fails
|
|
- radv: When using VS+PS only, primitive ID is mixed up after NGG culling
|
|
- [22.0.0_rc{1,2}] radeonsi: broken support for HD7850 (radeon 0000:0b:00.0: ring 0 stalled for more than...)
|
|
- RADV: Texture seams in The Evil Within 2 (regression)
|
|
- ANV: Bad output from TransformFeedback . Regression from Mesa 21. Something to do with VB+XFB -> VB+XFB dependency?
|
|
- radv: CullDistance fail
|
|
- Changing the buffer modifer via Wayland dmabuf-feedback apparently does not reliably reset the buffer age to 0
|
|
- radv: causes hang untl gpu reset on navy_flounder
|
|
- Crocus related crashes when running QT apps
|
|
- Please add missing ray tracing Vulkan extensions to features.txt
|
|
- [anv][bisected][regression][tgl] dEQP-VK.dynamic_rendering.suballocation.multisample_resolve.* failures
|
|
- radv/aco: DXVKs strict float emulation causes black screen/graphical corruption on RDNA1
|
|
- intel: env vars doc out of date
|
|
- [REGRESSION][BISECTED] iris: Qutebrowser/QtWebEngine sporadically flashes the window in white
|
|
- Flickering Intel Uhd 620 Graphics
|
|
- Vulkan Wayland WSI returns empty surface formats
|
|
- [radeon, rs690] piglit spec@glsl-1.10@execution@built-in-functions@fs-atan-float + 10 other 'atan' tests crash (mesa-22dev)
|
|
- [bisected] artifacts in vaapi h.264 playback
|
|
- [bisected] artifacts in vaapi h.264 playback
|
|
- radv: msaa regression
|
|
- [DG2] dEQP-GLES3.functional.shaders.derivate.fwidth* failures
|
|
- FIFO present mode doesn't wait
|
|
- Dota2 lockup wsi/x11: Avoid a class of deadlocks in the WSI queue thread
|
|
- [vulkan][intel] Implement VK_VALVE_mutable_descriptor_type
|
|
- Broken Terraria & Glitches in Forza Horizon 4
|
|
- radv: Q2RTX with cswave32 crashes the GPU
|
|
- CopelliaSim crashes on adding vision sensor to a scene on Ubuntu 20+, runs fine on Ubuntu 18.04
|
|
- radv: renderpass regression
|
|
- Dirt Rally: Flickering glitches on certain foliage going from Mesa 21.2.5 to 21.3.0
|
|
- FrontFacing input is broken on Intel/Vulkan
|
|
- glGetActiveUniform failing with GL_INVALID_VALUE for no reason
|
|
- llvmpipe: Unimplemented get_driver_uuid/get_device_uuid causes segfaults in e.g. Wine
|
|
- All vulkan apps terminate with VK_ERROR_DEVICE_LOST
|
|
- Texture rendering broken on Vega 56, probably due to MR 14171 (tested on Battlefield 1 / vkd3d-proton)
|
|
- Major graphic distortions with RADV in DX11/DX12 with Mesa-git as of January 12th 2022
|
|
- Setting mesa_glthread to true crashes Minecraft when Sodium optimisation mod is enabled.
|
|
- ANV: deviceUUID assumes a single device per system
|
|
- MESA Overlay Vulkan layer version
|
|
- [radeonsi, regression, bisected]: Rendering issues with Factorio
|
|
- Distorted graphics when running Battle.net app under Wine with Radeon hardware
|
|
- [r300g] Battle.net app window content rotated and stretched
|
|
- Freedreno reporting 10 MB of Video memory that's causing issues with WineD3D on Poco F1
|
|
- Panfrost - RK3399 - FACTORIO - glitches everywhere
|
|
- Baldurs Gate 3 (Patch 6) - ribbon-like artifacts on textures
|
|
- mesa >= 21.x (with llvmpipe) will break the game JVGS
|
|
- Game Starsector crashes under certain circumstances with mesa 21.3.+
|
|
- FreeSpace models incorrectly rendered on Polaris cards, causing system freeze
|
|
- ci: Test with Skqp
|
|
- ci: Test with Skqp
|
|
- Metro Exodus (native) fails to launch
|
|
- Poor performance on RX 580 (Polaris20)
|
|
- VAAPI H.264 encoding is slow on Polaris vs. HEVC and Windows AMF
|
|
- v3dv: Missing gem handle reference tracking logic
|
|
- Incomplete evaluation of nested DEFINE macros
|
|
- [r300g, bisected] piglit glsl-fs-discard-04 fails
|
|
- Panfrost G52 Firefox terrible glitches on youtube playback
|
|
- prim restart broken on crocus by prim restart fix for d3d12
|
|
- RADV: Invalid fast clear triggered
|
|
- Assassin's Creed Syndicate crashes with Mesa 21.3.0+ ACO
|
|
- anv: dEQP-VK.graphicsfuzz.spv-stable-pillars-volatile-nontemporal-store fails
|
|
- seg-fault with soft fp64 on iris after register allocation change
|
|
- Taking RGP traces on VANGOGH results in hangs
|
|
- Flickering and blackscreen on Mpv and Clapper (and also low performance on glxgears)
|
|
- intel/vec4: Rework texture handling to not use \`ir_texture_opcode`
|
|
- lavapipe leaks introduced in eb7eccc76f0a02b3a3291adaa39788b157979dc5
|
|
- DXVK SIGBUS with Turnip on Poco F1 at loading to open world.
|
|
- RADV: IsHelperInvocationEXT query is not considered volatile in ACO
|
|
- [GraphicsFuzz] dEQP-VK.graphicsfuzz.stable-binarysearch-tree-nested-if-and-conditional
|
|
- [bisected] Mesa 21.3.x breaks GBM with NVIDIA closed source driver 495.44
|
|
- [DG2] dEQP-GL[45|ES31].functional.shaders.builtin_functions.pack_unpack.packhalf2x16_compute fail
|
|
- Steam reports fatal error with missing OpenGL GLX extension not supported by display since December 6th
|
|
- SNK HEROINES Tag Team Frenzy stuck on white screen
|
|
- Run-time dependency dri found: NO
|
|
- Rise of the Tomb Raider segfault
|
|
- glsl/compiler: suspcious snippet of code (Unused variable)
|
|
- GPU Crash in Yuzu 6600xt 5.15
|
|
- anv: gfxbench 5.0 vulkan fails on android
|
|
- [spirv-fuzz] lower_trivial_continues_block: Assertion \`!first_instr || instr_is_continue(first_instr)' failed.
|
|
- [RADV] Crash in Metro Exodus in Caspain chapter and Sam's Story
|
|
- [nir][radv] more undefined shifts caught by gcc's sanitizer
|
|
- NIR validation failed after nir_copy_prop
|
|
- lima: Corrupted Android-12 UI on Allwinner A64
|
|
- RADV/ACO: Rendering glitches in Forza Horizon 5 windshields
|
|
- NV_pixel_buffer_object extension can be available in a GLES2 context but the associated glGet constants don't work
|
|
- ThreadSanitizer: data races inside libvulkan_radeon.so
|
|
- [radeonsi] Commit b1b491cdbbac1ccb785f5eb1802006f76ec253f5 leads to memory leak
|
|
- dEQP-GLES31.*imulextended* compiling fp64 glsl 4.00 shader
|
|
- anv: vkMapMemory fails "Memory object already mapped" with VA-API (mpv --gpu-api=vulkan --hwdec=vaapi)
|
|
- RADV: crash in radv_GetPhysicalDeviceImageFormatProperties2
|
|
- RADV/ACO: Monster Hunter Rise Demo renders wrong results
|
|
- radv: Odd lack of implicit host memory invalidation
|
|
- Regression/Bisected: Crash in Godot games when steam overlay enabled
|
|
- RADV: IsHelperInvocationEXT query is not considered volatile in ACO
|
|
- ANV: error submitting with the same semaphore for wait and signal - regression?
|
|
- Vulkan does not work when 10-bit colors ( DefaultDepth 30) are enabled under Xorg on AMD/RADV
|
|
- [TGL+] anv: some dEQP-VK.drm_format_modifiers.* fails
|
|
- radv: SQTT generates trace with no data
|
|
- Mesa 21.3rc3 - compile failure
|
|
- iris: subslice assertion failure on some hardware during initialization
|
|
- Final Fantasy V (Old version): Random text characters are not displayed
|
|
- Diagonal rendering artifacts in Tomb Raider
|
|
- vkCmdSetViewport sometimes ignored on Ivy Bridge, Haswell
|
|
- dota2 black squares corruption
|
|
- [hsw][bisected][regression] dEQP-VK.reconvergence.*nesting* failures
|
|
- Piglit.spec.arb_shader_image_load_store.compiler.expression-allowed-1d_vert failed.
|
|
- anv: dEQP-VK.wsi.wayland.<various> failures
|
|
- freedreno: Crash in fd_bo_cache_alloc()
|
|
- radv_android.c: build errors due to commit 49c3a88
|
|
- dEQP-EGL.functional.sharing.gles2.multithread.* regression with Iris
|
|
- [Turnip] Forsaken Remastered rendering issue
|
|
- [radeonsi] Euro Truck Simulator 2: broken mimimap
|
|
- Old Linux Rocket League version failing to start on Poco F1.
|
|
- [regression][bisected] Launching Valheim OpenGL game leads to GPU Hang
|
|
- Android Meson build regression: hardware/system information apps crash on Raspberry Pi 4
|
|
- radv: format properties are broken with modifiers
|
|
- anv: dEQP-VK.graphicsfuzz.cov-multiple-one-iteration-loops-global-counter-write-matrices fails
|
|
|
|
|
|
Changes
|
|
-------
|
|
|
|
Aaron Watry (2):
|
|
|
|
- clover: implement CL_IMAGE_BUFFER
|
|
- clover/image: add dimension property
|
|
|
|
Adam Jackson (37):
|
|
|
|
- mesa/x11: Remove the swrast-classic-based fake libGL
|
|
- mesa: Remove unused _mesa_all_buffers_are_unmapped
|
|
- mesa: Remove unused _mesa_get_render_format
|
|
- mesa: Remove unused _mesa_DrawTexx{,v}
|
|
- mesa: Remove unused _check_*TexGen*OES
|
|
- mesa: Remove unused _mesa_apply_ci_transfer_ops
|
|
- mesa: Remove unused _mesa_compressed_image_address
|
|
- mesa: Remove unused _es_color4ub
|
|
- mesa: Remove unused _es_RenderbufferStorageEXT
|
|
- mesa: Remove unused _es_{,Get}TexGenfv
|
|
- mesa: Remove unused _vbo_current_binding
|
|
- mesa/vbo: Always use buffer objects for storage
|
|
- mesa/program: Dead code cleanup
|
|
- mesa: Remove unused execmem code
|
|
- mesa: Make _mesa_generate_mipmap_level static
|
|
- glapi: Remove remnants of EXT_paletted_texture and the imaging subset
|
|
- docs: Remove no-longer-accurate text about the xlib driver
|
|
- mesa: Remove unused _mesa_initialize_visual
|
|
- dri: Remove unused driContextSetFlags
|
|
- dri: Remove unused driUpdateFramebufferSize
|
|
- dri: Remove unused driGetRendererString
|
|
- glx: Fix GLX_NV_float_buffer fbconfig handling
|
|
- mesa: Remove unused _mesa_allow_light_in_model
|
|
- mesa: Remove unused _mesa_AllocTextureStorage_sw
|
|
- mesa: Remove unused _mesa_bind_texture
|
|
- mesa: Remove unused _mesa_format_fallback_rgbx_to_rgba
|
|
- mesa: Remove unused _mesa_get_linear_format_srgb
|
|
- mesa: Remove unused _mesa_convert_colors
|
|
- mesa: Remove unused _mesa_all_varyings_in_vbos
|
|
- mesa: Remove unused _mesa_delete_nameless_texture
|
|
- mesa/math: Remove unused m_translate.c
|
|
- mesa: Remove unused _mesa_is_alpha_to_coverage_enabled
|
|
- mesa: Remove unused _mesa_is_front_buffer_{draw,read}ing
|
|
- mesa: Remove unused _mesa_set_sampler_{filters,srgb_decode,wrap}
|
|
- mesa: Remove unused src/mesa/x86-64
|
|
- wsi/x11: Avoid a class of deadlocks in the WSI queue thread
|
|
- dri_interface: Remove the remaining DRI1 API definitions
|
|
|
|
Alejandro Piñeiro (20):
|
|
|
|
- v3dv/pipeline: don't clone the nir shader at pipeline_state_create_binning
|
|
- v3d/clif: add support for dumping GS shader state
|
|
- broadcom/compiler: remove unused macro and function definition
|
|
- v3d: remove unused include
|
|
- v3d: remove static v3d_start_binning
|
|
- gallium/u_blitter: clean up texcoords ZW when filling up just XY
|
|
- v3d,v3dv: move TFU register definition to a common header
|
|
- vulkan: move common format helpers to vk_format
|
|
- broadcom/compiler: make shaderdb debug output compatible with shaderdb's report tool
|
|
- meson: bump meson requirement to 0.53.0
|
|
- v3d: support for texture buffer objects
|
|
- v3d: add support for no buffer object bound
|
|
- v3d: restrict formats supported for PIPE_BIND_SHADER_IMAGE
|
|
- v3d: enable ARB_texture_buffer_object and ARB_texture_buffer_range
|
|
- broadcom/compiler: avoid unneeded sint/unorm clamping when lowering stores
|
|
- v3dv: simplify v3dv_debug_ignored_stype
|
|
- vulkan: return default string for undefined enum
|
|
- v3dv: remove unused v3dv_descriptor_map_get_texture_format
|
|
- v3dv: check correct format when load/storing on a depth/stencil buffer
|
|
- vc4/nir_lower_blend: update write mask when we update num components
|
|
|
|
Alex Xu (Hello71) (5):
|
|
|
|
- Use initial-exec TLS for glibc only, enable TLS elsewhere
|
|
- Auto-enable TLSDESC support
|
|
- meson: check for lld split TLSDESC bug (fixes #5665)
|
|
- Fix TSD stubs for non-initial-exec case (fixes #5667).
|
|
- meson: tlsdesc: minor reformatting, add comments
|
|
|
|
Alyssa Rosenzweig (179):
|
|
|
|
- panfrost: Don't allow rendering/texturing 48-bit
|
|
- panfrost: Detect implementations support AFBC
|
|
- panfrost,panvk: Use dev->has_afbc instead of quirks
|
|
- panfrost: Remove unused MIDGARD_NO_AFBC quirk
|
|
- panfrost: Fix gl_FragColor lowering
|
|
- panfrost: Workaround ISSUE_TSIX_2033
|
|
- panfrost: Add internal afbc_formats
|
|
- panfrost: Decompress for incompatible AFBC formats
|
|
- panfrost: Enable AFBC on v7
|
|
- panfrost: Remove ancient TODO
|
|
- panfrost: Remove duplicated #if
|
|
- panfrost: Rename depth bias fields
|
|
- panfrost: Add alpha_zero_nop/one_store predicate
|
|
- panfrost: Test alpha_zero_nop/one_store predicates
|
|
- panfrost: Pass through alpha_zero_nop/one_store
|
|
- panvk: Pass through alpha_zero_nop/one_store flags
|
|
- nir/lower_blend: Use correct clamp for SNORM
|
|
- mesa: Require MRT support for GL3/ES3
|
|
- nir/lower_pntc_ytransform: Support PointCoordIsSysval
|
|
- pan/bi: Fix typo in helper invocation analysis
|
|
- pan/bi: Make bi_index padding explicit
|
|
- pan/bi: Add secondary staging count
|
|
- pan/bi: Add second destination to TEXC
|
|
- pan/bi: Add bifrost_texture_operation_mode enum
|
|
- pan/bi: Add bifrost_dual_texture_operation struct
|
|
- pan/bi: Use BIFROST_TEXTURE_OPERATION_SINGLE enum
|
|
- pan/bi: Support dual texture scheduling
|
|
- pan/bi: Add bi_dual_tex_as_u32 helper
|
|
- pan/bi: Fix up dual texturing registers
|
|
- pan/bi: Add dual texture fusing pass
|
|
- pan/bi: Test dual texture fusing
|
|
- pan/bi: Enable dual texture fusing pass
|
|
- pan/mdg: Remove duplicate compiler option
|
|
- pan/bi: Annotate Valhall instructions with units
|
|
- pan/bi: Add full form of Valhall MUX instruction
|
|
- pan/bi: Add sqrt form of Valhall FREXPM
|
|
- pan/bi: Add Valhall's special FMA_RSCALE instructions
|
|
- pan/bi: Add XML for assembling Valhall image stores
|
|
- pan/bi: Forbid unaligned staging registers on Valhall
|
|
- pan/bi: Confirm IDP unit on Valhall
|
|
- pan/bi: Suppress uniform validation for LD_BUFFER
|
|
- pan/bi: Add XML for LD_BUFFER
|
|
- asahi: Rename PANDECODE->AGXDECODE
|
|
- asahi: Deflake addresses
|
|
- asahi: Allocate special scratch buffers
|
|
- asahi: Fix agx_map_* structures
|
|
- asahi: Remove silly magic numbers
|
|
- asahi: Remove obnoxious workaround
|
|
- asahi: Fix BIND_PIPELINE sizing and alignment
|
|
- panfrost: Collapse 0 parameters in drm-shim
|
|
- panfrost: Handle AFBC_FEATURES in drm-shim
|
|
- panfrost: Add empty tile flags to GenXML
|
|
- panfrost: Only build GPU indirect kernels for v7
|
|
- pan/bi: Link with Valhall disassembler
|
|
- pan/va: Only hex dump when verbosely disassembling
|
|
- panfrost: Add "hex" type to GenXML
|
|
- panfrost: Add XML for Valhall data structures
|
|
- panfrost: Zero initialize disassembler stats
|
|
- panfrost: Don't shadow Mesa's fui()
|
|
- panfrost: Add Valhall support to pandecode
|
|
- pan/va: Improve assembler unit test output
|
|
- pan/va: Disambiguate sign of CSEL instructions
|
|
- pan/va: Add more assembler tests
|
|
- pan/va: Make LD_VAR index more fine-grained
|
|
- pan/va: Add sample/update modes to LD_VAR
|
|
- pan/va: Add table parameter to LD_ATTR_IMM
|
|
- pan/mdg: Fix definition of UBO unpack
|
|
- docs/macos: Update for recent Mesa changes
|
|
- pan/indirect_draw: Don't upload garbage UBO
|
|
- pan/indirect_draw: Split out update_dcd
|
|
- pan/indirect_draw: Support IDVS jobs
|
|
- panfrost: Track preloaded registers
|
|
- panfrost: Set preload descriptor more accurately
|
|
- panfrost: Remove unused shader info bits
|
|
- panfrost: Fix Secondary Shader field
|
|
- panfrost: Treat IDVS jobs as tiler for scoreboarding
|
|
- panfrost: Add IDVS fields to shader_info
|
|
- panfrost: Split out regalloc/preload helpers
|
|
- panfrost: Remove regalloc from v6.xml
|
|
- panfrost: Set secondary_* fields for IDVS
|
|
- panfrost: Extract panfrost_draw_emit_vertex_section
|
|
- panfrost: Emit IDVS jobs
|
|
- panfrost: Extract panfrost_batch_skip_rasterization
|
|
- panfrost: Skip rasterizer discard draws without side effects
|
|
- panfrost: Align instance size for IDVS
|
|
- panfrost: Add panfrost_compile_inputs->no_idvs option
|
|
- pan/bi: Extract bi_finalize_nir
|
|
- pan/bi: Allow UBO pushing to run multiple times
|
|
- pan/bi: Add IDVS mode to bi_context
|
|
- pan/bi: Remove the "wrong" stores in IDVS variants
|
|
- pan/bi: Split out varying store paths
|
|
- pan/bi: Use position shader ST_CVT path
|
|
- pan/bi: Add helper to decide if IDVS should be used
|
|
- pan/bi: Specialize shaders for IDVS
|
|
- pan/va: Remove extra LD_VAR_IMM_F32 source
|
|
- pan/va: Rename LEA_ATTR to LEA_VARY
|
|
- pan/va: Add .signed bit to right shift instructions
|
|
- pan/va: Generalize LD_VAR_IMM_* to support flat varyings
|
|
- pan/bi: Don't call useless NIR passes
|
|
- panfrost: Make pan_merge macro more robust
|
|
- pan/bi: Don't read base for combined stores
|
|
- pan/mdg: Don't read base for combined stores
|
|
- nir: Eliminate store_combined_output_pan BASE
|
|
- panfrost: Simplify blend lowering pass
|
|
- nir: Extend store_combined_output_pan
|
|
- panfrost: Combine dual source blends
|
|
- panfrost: Remove pan_nir_reorder_writeout
|
|
- pan/bi: Use is_staging_src helper
|
|
- pan/bi: Allow an extra staging source
|
|
- pan/bi: Use fused dual source blending
|
|
- pan/bi: Fix load_const of 1-bit booleans
|
|
- gallium/util: Add pixel->blocks box helper
|
|
- lima,panfrost: Correct pixel vs block mismatches
|
|
- v3d: Use u_box_pixels_to_blocks helper
|
|
- vc4: Use u_box_pixels_to_blocks helper
|
|
- pan/decode: Remove hierarchy mask check
|
|
- pan/decode: Include addresses for jobs
|
|
- pan/decode: Track mmaps with a red-black tree
|
|
- pan/decode: Add hexdump helper
|
|
- pan/decode: Add pandecode_dump_mappings
|
|
- pan/decode: Decode Valhall surface descriptor
|
|
- panfrost: Remove FBD pointer on Bifrost XML
|
|
- pan/decode: Don't print Preload twice
|
|
- pan/decode: Disassemble Bifrost quietly
|
|
- pan/bi: Schedule around blend shader register clobbering
|
|
- asahi: Fake more CAPs with dEQP hacks mode
|
|
- asahi: Warn when hacks mode is enabled
|
|
- asahi: Add XML for unknown 0x4a packet
|
|
- asahi: Break out Fragment Parameters word
|
|
- asahi: Rectify confusing XML comment
|
|
- asahi: Route sample mask from shader
|
|
- agx: Add sample_mask instruction
|
|
- agx: Handle discard intrinsics
|
|
- agx: Lower UBO loads to use per-element indexing
|
|
- panfrost: Use u_reduced_prim for primitive checks
|
|
- panfrost: Only cull polygons
|
|
- pan/bi: Pull BLEND precolouring out of per-dest loop
|
|
- pan/decode: Fix missing newlines in error messages
|
|
- panfrost: Remove HAS_SWIZZLES quirk
|
|
- panfrost: Remove MIDGARD_SFBD quirk
|
|
- panfrost: Remove MIDGARD_BROKEN_FP16 quirk
|
|
- panfrost: Remove NO_TILE_ENABLE_MAP quirk
|
|
- panfrost: Remove MIDGARD_{NO_TYPED_BLEND_STORES,MISSING_LOADS}
|
|
- panfrost: Don't set NO_BLEND_PACKS on Bifrost
|
|
- panfrost: Simplify format class selection
|
|
- panfrost: Remove NO_BLEND_PACKS quirk
|
|
- panfrost: Make primary_shader boolean
|
|
- panfrost: Fix v9 "Stencil from shader" bit
|
|
- panfrost: Don't pass quirks to pan_lower_framebuffer
|
|
- panfrost: Centralize our model list
|
|
- panfrost: Replace panfrost_model_name with model->name
|
|
- panfrost: Make the GPU allowlist implicit
|
|
- panfrost: Get performance counters from table
|
|
- pan/bi: Clean up quirks
|
|
- pan/bi: Assume future Valhall is 16-wide warps
|
|
- panfrost: Add Mali-G51 support
|
|
- docs/panfrost: Add new Midgard/Bifrost chips
|
|
- pan/va: Add .absolute bit to BRANCHZI
|
|
- pan/va: Add missing <clamp/> to V2F32_TO_V2F16
|
|
- pan/va: Add missing fields to LD_TILE
|
|
- pan/va: Test LD_TILE assembly
|
|
- pan/va: Fix MKVEC.v2i16 lane select
|
|
- pan/va: Fix lane select for [US]_TO_[USF]32
|
|
- pan/va: Vectorize 8->16-bit conversions
|
|
- pan/va: Add 2-channel 8-bit swizzles for conversions
|
|
- pan/va: Add lots of swizzle assembler tests
|
|
- pan/va: Handle shift lanes in assembler
|
|
- pan/va: Add ARM_shader_framebuffer_fetch asm test
|
|
- pan/bi: Avoid \*FADD.v2f16 hazard in optimizer
|
|
- pan/bi: Avoid \*FADD.v2f16 hazard in scheduler
|
|
- pan/bi: Lower swizzles on CSEL.i32/MUX.i32
|
|
- panvk: Use more reliable assert for UBO pushing
|
|
- pan/bi: Specialize IDVS in NIR
|
|
- panfrost: Fix FD resource_get_handle
|
|
- panfrost: Fix set_sampler_views for big GL
|
|
- panfrost: Handle NULL sampler views
|
|
- panfrost: Handle NULL samplers
|
|
- panfrost: Flush resources when shadowing
|
|
- panfrost: Push twice as many uniforms
|
|
|
|
Andreas Baierl (1):
|
|
|
|
- lima: Fix drawing wide lines
|
|
|
|
Antonio Caggiano (2):
|
|
|
|
- gallium: add a link shader hook
|
|
- virgl: Link shader program
|
|
|
|
Anuj Phogat (7):
|
|
|
|
- genxml/gen125: Update 3DSTATE_TE fields
|
|
- iris: Enable geometry distribution
|
|
- anv: Enable geometry distribution
|
|
- iris: Enable tessellation redistribution
|
|
- anv: Enable tessellation redistribution
|
|
- anv, iris: Add Wa_16011773973 for DG2
|
|
- anv, iris: Add Wa_22011440098 for DG2
|
|
|
|
Bas Nieuwenhuizen (73):
|
|
|
|
- radv: Fix modifier property query.
|
|
- radv: Add bufferDeviceAddressMultiDevice support.
|
|
- amd/rgp: Use VGH clocks for RGP workaround.
|
|
- radv: Disable coherent L2 optimization on cards with noncoherent L2.
|
|
- ci: Bump libdrm for the android image.
|
|
- ci: Add libelf to the Android image.
|
|
- amd/addrlib: Use alternative ADDR_C_ASSERT definition.
|
|
- radv: Remove android build warning.
|
|
- radv: Always use linker script when possible.
|
|
- aco: Remove useless sub-expr.
|
|
- meson: Check arguments before adding.
|
|
- amd/addrlib: Ignore self-assign warnings.
|
|
- util: Add support for clang::fallthrough.
|
|
- ci: Add RADV to Android CI.
|
|
- radv: Fix memory corruption loading RT pipeline cache entries.
|
|
- radv: Add more checking of cache sizes.
|
|
- radv: Don't crash if VkExternalImageFormatProperties isn't provided.
|
|
- radv: Avoid using a new subpass for ds clears.
|
|
- radv: Stop using a subpass for color clears.
|
|
- radv: Remove the skipping of framebuffer emission if we don't have a framebuffer.
|
|
- radv: Set RB+ registers correctly without framebuffer.
|
|
- radv: Don't emit framebuffer state if there is no renderpass active.
|
|
- radv: Add named constants for max framebuffer width/height.
|
|
- radv: Do not use VK_FORMAT_UNDEFINED in meta passes.
|
|
- radv: Support VK_KHR_dynamic_rendering for pipeline creation.
|
|
- radv: Support Begin/EndRendering.
|
|
- radv: Support dynamic rendering inheritance info.
|
|
- radv: Expose the VK_KHR_dynamic_rendering extension.
|
|
- radv/amdgpu: Add a syncobj per queue.
|
|
- radv/winsys: Add queue family param to submit.
|
|
- radv/amdgpu: Add support for submitting 0 commandbuffers.
|
|
- radv: Use the winsys 0 cmdbuffer submission support.
|
|
- radv: Set up ETC2 emulation wiring.
|
|
- radv: Use the correct base format for reintepretation.
|
|
- radv: Add extra plane for decoding ETC images with emulation.
|
|
- radv: Add ETC2 decode shader.
|
|
- radv: Deal with border colors with emulated ETC2.
|
|
- radv: Expose the ETC2 emulation.
|
|
- radv/amdgpu: Only wait on queue_syncobj when needed.
|
|
- radv: Add safety check for RGP traces on VanGogh.
|
|
- radv: Use correct buffer size for query pool result copies.
|
|
- radv: Skip wait timeline ioctl with 0 handles.
|
|
- radv/amdgpu: Use VkResult for wait_timeline_syncobj.
|
|
- vulkan/runtime: Refactor queue submit to take an argument struct.
|
|
- vulkan/runtime: Add sparse bind support.
|
|
- radv: Use dispatch table for QueueWaitIdle in the SQTT layer.
|
|
- radv: Use dispatch table for wsi_display.c
|
|
- radv: Use vulkan runtime for device lost.
|
|
- meson: Bump libdrm_amdgpu version req to 2.4.109.
|
|
- radv: Initialize vk device drm fd.
|
|
- radv: Add function to allow WSI signalling fences/semaphores.
|
|
- radv: Set horizontal sync types.
|
|
- radv: Use vk_command_buffer for preambles.
|
|
- radv: Add new cs_submit2 winsys call.
|
|
- radv: Add new submission path for use by the common sync framework.
|
|
- radv: Delete lots of sync code.
|
|
- radv: Remove syncobj reset mechanism.
|
|
- radv: Rename submit2->submit.
|
|
- radv: Remove dependencies on vk_common entrypoints.
|
|
- radv: Set optimal copy alignment to 1.
|
|
- radv: Add common entrypoint dependency.
|
|
- radv: Use MAX_PUSH_CONSTANTS_SIZE for saved push constants.
|
|
- radv: Use 16-bits to store push constant indices.
|
|
- radv: 256 byte push constants.
|
|
- Revert "nir/algebraic: distribute fmul(fadd(a, b), c) when b and c are constants"
|
|
- util/fossilize_db: Fix double free in error handling.
|
|
- vulkan/wsi/display: Add common implementation of VK_EXT_display_control.
|
|
- anv: Remove VK_EXT_display_control support in favor of common impl.
|
|
- radv: Remove VK_EXT_display_control support in favor of common impl.
|
|
- radv/amdgpu: Fix handling of IB alignment > 4 words.
|
|
- radv/amdgpu: Use aligned sizing for IB buffers.
|
|
- radv: Handle SDMA for padding.
|
|
- radv: Fix preamble argument order.
|
|
|
|
Biju Das (1):
|
|
|
|
- kmsro: Add 'rcar-du' driver support
|
|
|
|
Boris Brezillon (16):
|
|
|
|
- vulkan: Fix weak symbol emulation when compiling with MSVC
|
|
- vulkan: Set unused entrypoints to vk_entrypoint_stub when compiling with MSVC
|
|
- vulkan: Fix entrypoint generation when compiling for x86 with MSVC
|
|
- microsoft/compiler: Fix dxil_nir_create_bare_samplers()
|
|
- gallium/d3d12: Don't use designated initializers
|
|
- d3d12: Fix "use of designated initializers requires at least '/std:c++20'" error
|
|
- microsoft/compiler: Fix sampler/texture array emission
|
|
- microsoft/compiler: Skip images in redirect_texture_derefs()
|
|
- microsoft/compiler: textureLoad() doesn't take a LOD on MS textures
|
|
- microsoft/spirv_to_dxil: Remove dead variables after the struct split pass
|
|
- microsoft/spirv_to_dxil: Allow dumping NIR
|
|
- microsoft/spirv_to_dxil: Support [un]conditional YZ-flip
|
|
- microsoft/spirv_to_dxil: Discard PSIZ accesses
|
|
- microsoft/spirv_to_dxil: Lower atomics to their dxil variants
|
|
- microsoft/spirv_to_dxil: Make sure the SampleMask is a uint
|
|
- microsoft/spirv_to_dxil: Define idep_libspirv_to_dxil
|
|
|
|
Boyuan Zhang (5):
|
|
|
|
- radeon/vcn: update vcn1 enc interface
|
|
- radeon/vcn: update vcn2 enc interface
|
|
- radeon/vcn: remove redundancy for vcn2 enc
|
|
- radeon/vcn: combine encode params func
|
|
- radeon/vcn: combine session init func
|
|
|
|
Caio Oliveira (89):
|
|
|
|
- nir/schedule: Handle nir_intrisic_scoped_barrier
|
|
- nir: Add nir_var_mem_image
|
|
- intel/fs: Consider nir_var_mem_image for TGM fences
|
|
- anv: Use nir_foreach_image_variable
|
|
- spirv: Use nir_var_mem_image
|
|
- intel/compiler: Rename vec4 test fixtures
|
|
- intel/compiler: Build all tests in a single binary
|
|
- spirv: Use a single binary for gtests
|
|
- nir: Use a single binary for gtests
|
|
- util: Consolidate existing gtests in a single binary
|
|
- util: Move tests in single file directories to tests/
|
|
- util: Convert u_atomic_test to use gtest
|
|
- util: Convert blob_test to use gtest
|
|
- util: Convert mesa-sha1_test to use gtest
|
|
- util: Convert rb_tree_test to gtest
|
|
- util: Convert roundeven_test to use gtest
|
|
- util: Convert sparse array multithread test to use gtest
|
|
- util: Move test sources to tests/ directory
|
|
- intel/compiler: Add helpers to select SIMD for compute shaders
|
|
- intel/compiler: Use SIMD selection helpers for CS
|
|
- intel/compiler: Use SIMD selection helpers for variable workgroup size
|
|
- intel/compiler: Don't use SIMD larger than needed for workgroup
|
|
- anv: Get rid of "may be used initialized" warning in anv_QueueSubmit2KHR
|
|
- anv: Make shaders array in anv_graphics_pipeline fit Task/Mesh
|
|
- anv: Process FS last when compiling graphics pipeline
|
|
- intel/compiler: Make brw_nir_populate_wm_prog_data() static
|
|
- intel/compiler: Use gl_shader_stage_uses_workgroup() helpers
|
|
- .mailmap: Simplify my name
|
|
- intel/genxml: Add Mesh Shading structures
|
|
- intel/genxml: Inline the BODY structs into the instructions
|
|
- intel/dev: Add an intel_device_info::has_mesh_shading bit
|
|
- intel/blorp: Add option to emit packets that disable Mesh
|
|
- gtest: Add mesa-gtest-extras.h with array ASSERT/EXPECT macros
|
|
- util: Change blob_test to use macro from mesa-gtest-extras.h
|
|
- pan/bi: Make some headers compilable with C++
|
|
- pan/bi: Use gtest for test-scheduler-predicates
|
|
- pan/bi: Use gtest for test-packing
|
|
- pan/bi: Use gtest for test-pack-formats
|
|
- pan/bi: Use gtest for test-optimizer
|
|
- pan/bi: Use gtest for test-constant-fold
|
|
- pan/bi: Drop unused test helpers
|
|
- util/ra: Add simple test for register set serialization
|
|
- util/ra: Fix deserialization of register sets
|
|
- gtest: Fix output of array ASSERT/EXPECT macros
|
|
- intel: Add INTEL_DEBUG=task,mesh
|
|
- intel/compiler: Properly lower WorkgroupId for Task/Mesh
|
|
- intel/compiler: Handle per-primitive inputs in FS
|
|
- intel/compiler: Don't stage Task/Mesh outputs in registers
|
|
- intel/compiler: Don't lower Mesh/Task I/O to temporaries
|
|
- intel/compiler: Add structs to hold TUE/MUE
|
|
- intel/compiler: Make MUE available when setting up FS URB access
|
|
- intel/compiler: Export brw_nir_lower_simd
|
|
- intel/compiler: Add backend compiler basics for Task/Mesh
|
|
- intel/compiler: Lower Task/Mesh local_invocation_{id,index}
|
|
- intel/compiler: Implement Task Output and Mesh Input
|
|
- intel/compiler: Implement Mesh Output
|
|
- util: Convert cache test to use gtest
|
|
- util: Use ralloc for strings in cache test
|
|
- intel/compiler: Use a struct for brw_compile_tcs parameters
|
|
- intel/compiler: Use a struct for brw_compile_tes parameters
|
|
- intel/compiler: Use a struct for brw_compile_gs parameters
|
|
- intel/compiler: Use a struct for brw_compile_bs parameters
|
|
- nir: Initialize nir_register::divergent
|
|
- meson: Bump version required for gtest protocol
|
|
- Revert "nir: disable a NIR test due to undebuggable & locally unreproducible CI failures"
|
|
- ci/windows: Remove line numbers of SPIR-V errors in spirv2dxil tests
|
|
- spirv: Update headers and metadata to SPIR-V 1.6, revision 1
|
|
- nir: Handle volatile semantics for loading HelperInvocation builtin
|
|
- spirv: Identify non-temporal image operand added in SPIR-V 1.6
|
|
- spirv: Use the incorporated names
|
|
- anv/blorp: Split blorp_exec into a render and compute
|
|
- anv: Refactor dirty masking in cmd_buffer_flush_state
|
|
- anv: Simplify assertions related to graphics stages
|
|
- anv: Add another case to INTEL_DEBUG=pc output
|
|
- anv: Use pending pipe control mechanism in flush_pipeline_select()
|
|
- anv/blorp: Apply pending pipe flushes after PIPELINE_SELECT
|
|
- intel/fs/xehp: Add unit test for handling of RaR deps across multiple pipelines.
|
|
- anv: SPIR-V 1.6 shaders imply ALLOW_VARYING_SUBGROUP_SIZE
|
|
- anv: Fix subgroupSupportedStages physical property
|
|
- intel: Only reserve space for Compute Engine out of URB in Gfx12LP
|
|
- intel/compiler: Have specific mesh handling in calculate_urb_setup()
|
|
- intel/compiler: Merge Per-Primitive attribute handling in Mesh case
|
|
- compiler, intel: Add gl_shader_stage_is_mesh()
|
|
- intel: Add INTEL_URB_DEREF_BLOCK_SIZE_MESH
|
|
- intel/common: Add helper for URB allocation in Mesh pipeline
|
|
- anv: Add boilerplate for VK_NV_mesh_shader
|
|
- anv: Implement Mesh Shading pipeline
|
|
- intel/dev: Enable Mesh Shading for DG2
|
|
- anv: Add experimental support for VK_NV_mesh_shader
|
|
|
|
Caleb Callaway (2):
|
|
|
|
- vulkan/overlay: support Vulkan 1.2
|
|
- vulkan/overlay: revise and reformat README
|
|
|
|
Carsten Haitzler (3):
|
|
|
|
- panfrost: Add GPU G76 to the set of known ids
|
|
- kmsro: Add komeda DPU
|
|
- panfrost: Don't double-free when handling error for unsupported GPU
|
|
|
|
Charles Baker (5):
|
|
|
|
- zink: Enable VK_KHR_image_format_list for VK_KHR_imageless_framebuffer
|
|
- zink: Output PackHalf2x16 to uint not float
|
|
- zink: Avoid redundant cast to uint on PackHalf2x16 result
|
|
- zink: Set vertex binding stride without dynamic state extensions
|
|
- Revert "zink: handle vertex buffer offset overflows"
|
|
|
|
Charles Giessen (10):
|
|
|
|
- radv: Update description of vk_icdNegotiateLoaderICDInterfaceVersion
|
|
- v3dv: Update LoaderICDInterfaceVersion to v4
|
|
- panvk: Export vk_icdGetPhysicalDeviceProcAddr
|
|
- freedreno, tu: Export vk_icdGetPhysicalDeviceProcAddr
|
|
- radv: Update LoaderICDInterfaceVersion to v5
|
|
- panvk: Update LoaderICDInterfaceVersion to v5
|
|
- anv: Update LoaderICDInterfaceVersion to v5
|
|
- lavapipe: Update LoaderICDInterfaceVersion to v5
|
|
- freedreno, tu: Update LoaderICDInterfaceVersion to v5
|
|
- v3dv: Update LoaderICDInterfaceVersion to v5
|
|
|
|
Charmaine Lee (1):
|
|
|
|
- mesa: fix misaligned pointer returned by dlist_alloc
|
|
|
|
Cherser-s (1):
|
|
|
|
- radv: handle VK_DESCRIPTOR_TYPE_SAMPLER in VK_VALVE_mutable_descriptor_type extension
|
|
|
|
Chia-I Wu (23):
|
|
|
|
- venus: fix vn_instance_wait_roundtrip when seqno wraps
|
|
- venus: prefer VIRTGPU_BLOB_MEM_HOST3D for shmems
|
|
- venus: fix vn_buffer_get_max_buffer_size
|
|
- venus: add vn_renderer_util.[ch]
|
|
- venus: add vn_renderer_shmem_pool
|
|
- venus: use vn_renderer_shmem_pool for reply shmems
|
|
- venus: add vn_cs_encoder_storage_type
|
|
- venus: add VN_CS_ENCODER_STORAGE_SHMEM_POOL for VkCommandBuffer
|
|
- venus: cache shmems
|
|
- venus: add some trace points
|
|
- virgl: disable texture uploads with copy transfers
|
|
- vulkan/wsi: add wsi_common_get_image
|
|
- anv,lavapipe,v3dv: use wsi_common_get_image
|
|
- freedreno/drm, turnip: set DRM_RDWR for exported dma-bufs
|
|
- venus: fix VK_KHR_driver_properties
|
|
- venus: format with clang-format
|
|
- venus: remember the memory bound to a swapchain image
|
|
- venus: handle VkBindImageMemorySwapchainInfoKHR
|
|
- vulkan/wsi/x11: fix x11_image_init return value on errors
|
|
- venus: updates to the doc
|
|
- turnip: respect buf->bo_offset in transform feedback
|
|
- glthread: call _mesa_glthread_BindBuffer unconditionally
|
|
- venus: update venus-protocol to 1.3.204
|
|
|
|
Christian Gmeiner (25):
|
|
|
|
- ci/etnaviv: add manual piglit testing
|
|
- ci/bare-metal: armhf: move BM_ROOTFS to generic place
|
|
- ci/etnaviv: armhf: switch to .baremetal-test-armhf
|
|
- ci/etnaviv: no need to force nir anymore
|
|
- ci/bare-metal: add .baremetal-test-arm64
|
|
- ci/bare-metal: switch to common .baremetal-test-arm64
|
|
- panfrost/ci: update piglit fails
|
|
- etnaviv/ci: update piglit fails
|
|
- i915g/ci: update piglit fails
|
|
- iris/ci: update piglit fails
|
|
- ci: Uprev piglit to af1785f31
|
|
- mesa: always support occlusion queries
|
|
- broadcom/ci: use .test-manual-mr
|
|
- vc4: remove not needed lie about PIPE_CAP_OCCLUSION_QUERY
|
|
- i915: remove not needed lie about PIPE_CAP_OCCLUSION_QUERY
|
|
- lima: remove not needed lie about PIPE_CAP_OCCLUSION_QUERY
|
|
- isaspec: Add support for special {:align=} field
|
|
- nir: make lower_sample_tex_compare a common pass
|
|
- nir/nir_lower_tex_shadow: support tex_instr without deref src
|
|
- etnaviv: make use of nir_lower_tex_shadow
|
|
- etnaviv: fix FRONT_AND_BACK culling
|
|
- etnaviv: add multiply_with_8 flag
|
|
- etnaviv: use bytes for read TX data
|
|
- etnaviv: add two new HI related perfmon counter
|
|
- etnaviv: add support for INTEL_blackhole_render
|
|
|
|
Clayton Craft (1):
|
|
|
|
- anv: don't advertise vk conformance on GPUs that aren't conformant
|
|
|
|
Connor Abbott (65):
|
|
|
|
- vk/format, v3dv: Add a vulkan -> pipe swizzle helper
|
|
- freedreno/fdl: Constify fdl6_get_ubwc_blockwidth()
|
|
- freedreno/fdl: Add mip_level to fdl_layout
|
|
- freedreno/fdl: Add fdl6_view
|
|
- tu: Use fdl6_view in tu_image_view and cross-check
|
|
- tu: Switch clear/blit to fdl6_view and cross-check
|
|
- tu: Remove cross-check scaffolding
|
|
- tu/clear_blit: Stop creating a franken-image for staging blits
|
|
- ir3/cse: Support mov instructions
|
|
- ir3: Use stp/ldp base offset for {load,store}_scratch
|
|
- tu/clear_blit: Move around copy_format()/tu6_plane_format()
|
|
- freedreno/a6xx: Rename GRAS_2D_BLIT_INFO
|
|
- tu: Emit GRAS_LRZ_MRT_BUF_INFO_0
|
|
- tu: Always write GRAS_LRZ_MRT_BUF_INFO_0
|
|
- freedreno/a6xx: Emit GRAS_LRZ_MRT_BUF_INFO_0
|
|
- ir3: Emit barriers for images again
|
|
- ir3: Don't emit barriers for make_available/make_visible
|
|
- ir3/spill: Mark root as non-spillable after inserting
|
|
- ir3/spill: Initial implementation of rematerialization
|
|
- tu: Enable subgroupBroadcastDynamicId
|
|
- tu: Add VK_KHR_buffer_device_address stubs
|
|
- tu: Expose Vulkan 1.2
|
|
- util/dag: Make edge data a uintptr_t
|
|
- util/dag: Add dag_add_edge_max_data
|
|
- ir3/sched: Rewrite delay handling
|
|
- ir3/postsched: Fix copy-paste mistake
|
|
- ir3/postsched: Handle sync dependencies better
|
|
- ir3/delay: Ignore earlier definitions to the same register
|
|
- ir3/postsched: Rewrite delay handling
|
|
- ir3/postsched: Only prefer tex/sfu if they are soft-ready
|
|
- ir3: Stop inserting nops during scheduling
|
|
- ir3/ra: Consider reg file size when swapping killed sources
|
|
- ir3/ra: Add missing asserts to ra_push_interval()
|
|
- ir3/spill: Support larger spill slot offset
|
|
- ir3/lower_pcopy: Fix shr.b illegal copy lowering
|
|
- ir3/lower_pcopy: Fix bug with "illegal" copies and swaps
|
|
- ir3/lower_subgroups: Fix potential infinite loop
|
|
- ir3/ra: Fix logic bug in compress_regs_left
|
|
- ir3: Bump type mismatch penalty to 3
|
|
- ir3: Introduce systall metric and new helper functions
|
|
- ir3: Use new (sy)/(ss) stall helpers in the compiler
|
|
- ir3/sched: Rename tex/sfu to sy/ss
|
|
- ir3/postsched: Rename tex/sfu to sy/ss
|
|
- ir3: Use (ss) for instructions writing shared regs
|
|
- ir3, freedreno: Add options struct for ir3_shader_from_nir()
|
|
- ir3: Pass shader to ir3_nir_post_finalize()
|
|
- ir3: Add wavesize control
|
|
- tu, ir3: Support runtime gl_SubgroupSize in FS
|
|
- tu: Implement VK_EXT_subgroup_size_control
|
|
- freedreno/fdl: Fix reinterpreting "size-compatible" formats
|
|
- nir: Reorder ffma and fsub combining
|
|
- freedreno/a6xx: Name texture descriptor bit
|
|
- tu/blit: Don't set CLAMPENABLE in sampler for 3d path
|
|
- tu: Report code size in pipeline statistics
|
|
- tu: Initial link-time optimizations
|
|
- nir/lower_subgroups: Rename lower_shuffle to lower_relative_shuffle
|
|
- nir: Add support for lowering shuffle to a waterfall loop
|
|
- ir3: Fix copy-paste mistakes in ir3_block_remove_physical_predecessor()
|
|
- ir3: Rewrite (jp) insertion
|
|
- ir3/cp: ir3: Prevent propagating shared regs out of loops harder
|
|
- ir3,tu: Enable subgroup shuffles and relative shuffles
|
|
- ir3/spill: Fix simplify_phi_nodes with multiple loop nesting
|
|
- ir3: Use CAN_REORDER instead of NON_WRITEABLE
|
|
- ir3: Don't always set bindless_tex with readonly images
|
|
- ir3/nir: Fix 1d array readonly images
|
|
|
|
Corentin Noël (1):
|
|
|
|
- virgl: Disable cache for VIRGL_BIND_SAMPLER_VIEW
|
|
|
|
Cristian Ciocaltea (12):
|
|
|
|
- ci: Uprev deqp-runner to 0.11.0
|
|
- ci: Support building and installing deqp-runner from source
|
|
- ci: Do not remove cmake
|
|
- ci: Create results folder before starting virgl_test_server
|
|
- virgl/ci: Force crosvm error when exit code file is missing
|
|
- virgl/ci: Prevent static link of virglrenderer inside crosvm
|
|
- virgl/ci: Do not hide crosvm output messages
|
|
- virgl/ci: Fix identification of dEQP binary paths
|
|
- iris/ci: Fix whl dEQP expectations
|
|
- iris/ci: Fix piglit tests expectations on amly
|
|
- panfrost/ci: Fix piglit tests expectations on G52
|
|
- freedreno/ci: Fix dEQP tests expectations on A530
|
|
|
|
Daniel Schürmann (22):
|
|
|
|
- nir/fold_16bit_sampler_conversions: skip sparse residency tex instructions
|
|
- aco: add more D16 load/store instructions to RA and validator
|
|
- aco: workaround GFX9 hardware bug for D16 image instructions
|
|
- aco: implement D16 texture loads
|
|
- radv: use nir_fold_16bit_sampler_conversions()
|
|
- aco/optimizer: fix fneg modifier propagation on VOP3P
|
|
- aco: change fneg for VOP3P to use fmul with +1.0
|
|
- aco/optimizer: propagate and fold inline constants on VOP3P instructions
|
|
- nir/opt_algebraic: lower fneg_hi/lo to fmul
|
|
- aco/ra: fix get_reg_for_operand() in case of stride mismatches
|
|
- aco: don't allow SDWA on VOP3P instructions
|
|
- aco/optimizer: keep instr_mod_labels after applying extract
|
|
- aco/optimizer: apply extract from p_extract_vector
|
|
- aco/optimizer: optimize extract(extract())
|
|
- aco/optimizer: apply extract from subdword p_split_vector
|
|
- aco: use explicit zero-padding for 64bit image loads in expand_vector()
|
|
- aco: use p_create_vector(v2b,v2b) in get_alu_src_vop3p()
|
|
- aco: don't split VOP3P definitions
|
|
- aco: validate VOP3P opsel correctly
|
|
- nir: refactor nir_opt_move
|
|
- nir/opt_if: merge two break statements from both branch legs
|
|
- nir/opt_if: also merge break statements with ones after the branch
|
|
|
|
Daniel Stone (10):
|
|
|
|
- CI: Disable Windows jobs
|
|
- Revert "CI: Disable Windows jobs"
|
|
- ci: Use common build script for libwayland
|
|
- ci: Consistently build Wayland and protocols
|
|
- ci: Upgrade to libdrm 2.4.109
|
|
- zink/ci: Add GL4.6 tessellation flake
|
|
- CI: Don't stream wget directly into bash
|
|
- Revert "gitlab-ci: disable radv-fossils"
|
|
- Revert "ci: disable vs2019 windows build"
|
|
- egl/wayland: Reset buffer age when destroying buffers
|
|
|
|
Danylo Piliaiev (54):
|
|
|
|
- ir3/freedreno: account for component in build_tessfactor_base
|
|
- turnip: add support for dirconf
|
|
- driconf: add vk_dont_care_as_load workaround option
|
|
- turnip: implement vk_dont_care_as_load workaround
|
|
- drirc: Apply vk_dont_care_as_load workaround to Forsaken Remastered
|
|
- nir/lower_amul: do not lower 64bit amul to imul24
|
|
- ir3/freedreno: add 64b undef lowering
|
|
- freedreno/ir3: disallow immediate addr/offset for ldg/ldg.a
|
|
- freedreno/ir3: set proper dst size for {store,load}_{global,shared}_ir3
|
|
- freedreno/ir3: use stg.a/ldg.a only if offset is reg or doesn't fit
|
|
- isaspec: inherite parent's bitset gpu gen requirements
|
|
- nir/serialize: Make more space for intrinsic_op allowing 1024 ops
|
|
- ir3/ra: Check register file upper bound when updating preferred_reg
|
|
- tu: fix rast state allocation size on a6xx gen4
|
|
- freedreno/computerator: Support A660 gpu
|
|
- vulkan/util: Handle depth-only formats in vk_att_ref_stencil_layout
|
|
- ir3: print half-dst/src for ldib.b/stib.b
|
|
- freedreno/ir3: add a6xx global atomics and separate atomic opcodes
|
|
- freedreno/ir3: handle global atomics
|
|
- turnip: implement VK_KHR_buffer_device_address
|
|
- ir3/cp: Prevent setting an address on subgroup macros
|
|
- freedreno,tu: Limit the amount of instructions preloaded into icache
|
|
- ir3: Add gen4 new subgroup instructions
|
|
- ir3: Use getfiberid for SubgroupInvocationID on gen4
|
|
- ir3,turnip: Enable subgroup ops support in all stages on gen4
|
|
- ir3,turnip: Add support for GL_KHR_shader_subgroup_quad
|
|
- turnip: Fix operator precedence in address calculation macros for queries
|
|
- ir3: Be able to reduce register limit for RA when CS has barriers
|
|
- ir3: Assert that we cannot have enough concurrent waves for CS with barrier
|
|
- tu: fix workaround for depth bounds test without depth test
|
|
- ir3: New cat3 instructions
|
|
- nir/algebraic: Separate has_dot_4x8 into has_sdot_4x8 and has_udot_4x8
|
|
- ir3: Make nir compiler options a part of ir3_compiler
|
|
- tu,ir3: Implement VK_KHR_shader_integer_dot_product
|
|
- vulkan/wsi: create a common function to compare drm devices
|
|
- tu: implement wsi hook to decide if we can present directly on device
|
|
- tu: support VK_EXT_primitive_topology_list_restart
|
|
- tu: expose VK_KHR_copy_commands2
|
|
- tu: add reference counting for descriptor set layouts
|
|
- turnip/perfetto: Optimize timestamp synchronization
|
|
- turnip/trace: refactor creation and usage of trace flush data
|
|
- turnip: rename tu_drm_get_timestamp into tu_device_get_gpu_timestamp
|
|
- turnip/trace: process u_trace chunks on queue submission
|
|
- turnip/perfetto: handle gpu timestamps being non-monotonic
|
|
- turnip/trace: Delete unused start/end_resolve tracepoints
|
|
- turnip: Drop references to layout of all sets on pool reset/destruction
|
|
- tu: implement sysmem vs gmem autotuner
|
|
- freedreno: Update gmem/sysmem debug options to be in line with turnip
|
|
- tu: add debug option to force gmem
|
|
- ci/freedreno: properly test sysmem and gmem paths
|
|
- ir3: opt_deref in opt loop to remove unnecessary tex casts
|
|
- turnip: Unconditionaly remove descriptor set from pool's list on free
|
|
- ir3: Limit the maximum imm offset in nir_opt_offset for shared vars
|
|
- turnip: Use LATE_Z when there might be depth/stencil feedback loop
|
|
|
|
Dave Airlie (249):
|
|
|
|
- brw/nir: remove unused function prototypes.
|
|
- crocus: Delete the MI_COPY_MEM_MEM resource_copy_region implementation.
|
|
- llvmpipe: fix userptr for texture resources.
|
|
- clover: use max shader sampler view/images queries for clover.
|
|
- llvmpipe: swizzle image stores for CL BGRA
|
|
- radv: fence->user_ptr and ctx->fence_map are now totally unused.
|
|
- lavapipe: drop EXT_acquire_xlib_display
|
|
- vulkan/wsi: set correct bits for host allocations/exports for images.
|
|
- vulkan/include: import the video codec headers.
|
|
- vulkan: add new image types undef beta define to switch statements.
|
|
- meson: allow building with vulkan beta extensions enabled.
|
|
- llvmpipe: disable 64-bit integer textures.
|
|
- llvmpipe: fix compressed image sizes.
|
|
- intel/genxml: cleanup video xml collisions.
|
|
- intel/genxml: fix some missing address from the 75 xml
|
|
- intel/genxml: align QM field names across gens.
|
|
- intel/genxml: fix Picure->Picture typo
|
|
- intel/genxml: fix gen6 LD->VLD typo.
|
|
- intel/genxml: generate video headers
|
|
- util/vl: move gallium vl_vlc.h and vl_rbsp.h to shared code.
|
|
- mtypes: drop some context pointers that are unused now
|
|
- mesa: drop unused sw extensions init
|
|
- mesa/dd: burn a bunch of legacy driver interfaces down
|
|
- mesa/dd: remove some fbo driver hooks.
|
|
- mesa/dd: remove NewSamplerObject
|
|
- mesa/light: make _mesa_light static do_light.
|
|
- iris/ci: comment out iris-cml-traces-performance due to hw unavailable
|
|
- mesa/query: remove all the mesa queryobj code.
|
|
- mesa/syncobj: drop unused syncobj code.
|
|
- mesa: remove unused buffer object code.
|
|
- mesa/transformfeedback: remove unused transform feedback code
|
|
- mesa/barrier: remove unused barrier functions
|
|
- mesa/externalobject: delete unused functions
|
|
- intel/compiler: drop glsl options from brw_compiler
|
|
- meson: make mesa/tests/glx depend on gallium
|
|
- mesa/dd/st: move get strings pointer out of dd.h
|
|
- mesa/dd/st: direct wire queries/timestamp/condrender.
|
|
- mesa/st: direct call sync object functions
|
|
- mesa/st: move barriers to direct call
|
|
- mesa/st: move transformfeedback to direct calls
|
|
- mesa/st: move external objects to direct calls
|
|
- mesa/crocus/iris/blorp: drop minify macro in favour of u_minify
|
|
- mesa: move _mesa_varying_slot_in_fs to shader_enums
|
|
- anv: include futex.h explicitly in allocator.
|
|
- brw/compiler: drop mtypes.h from compiler
|
|
- intel/compiler: drop shader_info.h from compiler header
|
|
- intel/crocus: push main/macros.h out to the users
|
|
- mesa/st: move rendermode to direct call
|
|
- mesa/st: move Enable to direct call
|
|
- mesa/st: move query memory info to direct call
|
|
- mesa/st: move perf query to direct call
|
|
- mesa/st: move perfomance monitor to direct call
|
|
- mesa/dd: drop purgeable interface
|
|
- mesa/st: move fbo code to direct calling
|
|
- mesa/st: move texture APIs to direct st calls
|
|
- mesa/st: move Clear to new direct call
|
|
- mesa/st: move pixel/bitmap functions to direct call
|
|
- mesa/st: move clear/flush/finish to direct call
|
|
- mesa/st: move some context functions to direct calls
|
|
- mesa/st: move viewport to direct call
|
|
- mesa/st: move copy image sub data to direct call
|
|
- mesa/st: move program calls to direct call
|
|
- mesa/st: replace most of buffer funcs with direct calls.
|
|
- mesa/st: move blit function to direct call
|
|
- mesa/st: convert the non-optional egl image to direct calls
|
|
- mesa/st: convert DrawTex to direct call
|
|
- mesa/st: move msaa functions to direct call
|
|
- mesa/st: move compute to direct call
|
|
- mesa/st: move draw indirect and xfb to direct calls.
|
|
- mesa/st: drop Draw from dd function table.
|
|
- treewide: drop mtypes/macros includes from main
|
|
- crocus: cleanup bo exports for external objects
|
|
- mesa/st: merge st buffer object into GL
|
|
- mesa/st: start moving bufferobject alloc/free/reference to main.
|
|
- mesa: add a pipe_context pointer to gl context
|
|
- mesa: add a pointer to st_config_options to gl_context
|
|
- mesa: add pointer to cso_context to gl_context
|
|
- mesa/st: migrate most of state tracker buffer objects into mesa
|
|
- mesa/st: make static the buffer object funcs that can be
|
|
- mesa/bufferobj: rename bufferobj functions to be more consistent.
|
|
- bufferobj: make sw clear buffer static, move it and rename it
|
|
- bufferobj: inline buffer clearing
|
|
- bufferobj: inline page commitment
|
|
- bufferobj: cleanup subdata copies
|
|
- mesa/st: rename access flag to transfer flag function
|
|
- mesa/bufferobj: move invalidate buffer to optional feature
|
|
- mesa/st: remove st_cb_bufferobjects*
|
|
- mesa: inline mesa_initialize_buffer_object.
|
|
- mesa/st: refactor compute dispatch to fill grid info earlier.
|
|
- mesa/st: migrate compute dispatch to mesa
|
|
- mesa/compute: refactor compute launch to look more like draw
|
|
- mesa/st: move get sample position code to static in mesa
|
|
- mesa/st: move msaa functionality into multisample.c
|
|
- mesa/st: migrate barrier code into mesa
|
|
- mesa/st: move st strings handling into mesa
|
|
- mesa: drop texformat code this isn't used.
|
|
- mesa/st: use has_stencil_export instead of querying screen cap.
|
|
- mesa: drop unused new renderbuffer code.
|
|
- mesa: drop unused _mesa_new_program.
|
|
- mesa/dd: drop unused InvalidateBufferSubData entry.
|
|
- intel/compiler: remove gfx6 gather wa from backend.
|
|
- intel/compiler: don't lower swizzles in backend.
|
|
- intel/compiler: drop unused decleration
|
|
- mesa: remove StripTextureBorder option.
|
|
- mesa/draw: drop the multi draw with indices fallback.
|
|
- mesa/st: move default enabled extensions into mesa.
|
|
- mesa: drop optional tex/tnl maintains mode.
|
|
- mesa/st: merge NewDepthClamp state flag
|
|
- mesa/st: drop the new array driver state bit
|
|
- mesa/st: drop the rasterizer driver flags
|
|
- mesa/st: remove the viewport driver state flags
|
|
- mesa/st: drop NewBlend driver state flags
|
|
- mesa/st: drop new depth/stencil state bits
|
|
- mesa/st: drop poly stipple driver state bit
|
|
- mesa/st: drop new tess state driver bit
|
|
- mesa/st: drop new uniform driver state bit
|
|
- mesa: drop unused transform feedback state driver flags
|
|
- mesa/st: drop ssbo, image and sampler driver state flags bits
|
|
- mesa/st: drop scissor/window rect driver state bits
|
|
- mesa/st: drop clip plane driver state bits
|
|
- mesa/st: drop new framebuffer srgb driver state bit
|
|
- mesa/st: drop multisample mask/locations state drivers bits
|
|
- mesa/st: remove conditionals for driver state bits that are always set.
|
|
- intel/genxml/gen4-5: fix more Raster Operation in BLT to be a uint
|
|
- crocus: fail resource allocation properly.
|
|
- crocus: don't create staging resources > half aperture
|
|
- gallium/mesa: enhance PIPE_CAP_CLIP_PLANES to support override number
|
|
- crocus: set max clip planes to 6 for gen4.
|
|
- mapi: generate correct dispatch for EXT_draw_instanced
|
|
- glsl/linker: cleanup passing gl_context unnecessarily
|
|
- glsl: don't pass gl_context to lower shared references.
|
|
- glsl/linker: don't pass gl_context just for constants in xfb code
|
|
- glsl/linker: move more ctx->Consts to consts.
|
|
- glsl/linker: remove direct gl_context usage in favour of consts/exts/api
|
|
- glsl/linker: get rid of gl_context from link varyings
|
|
- glsl/linker: get rid of gl_context from uniform assign paths
|
|
- glsl/linker: get rid of gl_context from atomic counters paths
|
|
- glsl/linker: remove gl_context from check image resources
|
|
- glsl/linker: remove gl_context usage from more places.
|
|
- glsl/nir/linker: avoid passing gl_context inside gl_nir linker
|
|
- glsl/linker/uniform_blocks: don't pass gl_context around.
|
|
- glsl/linker: drop unused gl_context.
|
|
- glsl/linker: remove a bunch more gl_context references.
|
|
- glsl/nir: don't pass gl_context to the convertor routine.
|
|
- mesa/program: don't use CALLOC_STRUCT for instructions.
|
|
- mesa: rebalance the CALLOC_STRUCT/FREE force.
|
|
- mesa/st/perfmon: rebalance CALLOC_STRUCT/FREE
|
|
- mesa/st: get rid of ST_CALLOC_STRUCT use CALLOC_STRUCT
|
|
- intel/compiler: add clamp_pointside to vs/tcs/tes keys.
|
|
- crocus: only clamp point size on last stage.
|
|
- amd: move vcn decoding regs + structs to a common file.
|
|
- amd: move uvd decode definitions to common place
|
|
- mesa/\*: use an internal enum for tessellation primitive types.
|
|
- mesa/\*: add a shader primitive type to get away from GL types.
|
|
- includes: add windows lean and mean guard.
|
|
- nir: remove gl.h include from nir headers.
|
|
- intel/brw: drop gl header from the brw backend.
|
|
- glsl/parser: extract consts/exts/api out of context at start.
|
|
- mesa: drop unused context parameter to shader program data reference.
|
|
- mtypes: split gl extensions and consts out into a separate header
|
|
- mesa/mtypes: move matrix enums to shader_enums.h
|
|
- glsl: remove some deps on mtypes.h.
|
|
- mesa: move some gl shader types to shader_types.h.
|
|
- glsl: avoid rebuilding builtin functions on mtypes.h changes.
|
|
- glsl: move ast_function.cpp off mtypes.h
|
|
- mesa: move ast_to_hir.cpp off mtypes.h
|
|
- mtypes: move gl_linked_shader and gl_shader_program to new shader_types.h
|
|
- mtypes: move gl_program to shader_types.h
|
|
- mtypes: more gl_active_atomic_buffer to shader_types.h
|
|
- mtypes: move transform feedback internal structs to shader_types.h
|
|
- mtypes: move uniform shader types to shader_types.h
|
|
- mtypes: move bindless image/sampler objects to shader_types.h
|
|
- mtypes: move gl_shader_variable to shader_types.h
|
|
- glsl: move off mtypes.h in lots of places.
|
|
- glsl/fp64: move context.h dependent checks into main.
|
|
- glsl: drop some more context.h/mtypes.h interactions
|
|
- vbo: drop unused mtypes.h
|
|
- docs: update docs for new extension header.
|
|
- mesa: more mtypes.h cleanups
|
|
- mesa: split struct gl_config into it's own header.
|
|
- glsl: drop glheader.h include.
|
|
- mesa/st: merge memoryobjects code from st into mesa
|
|
- mesa/st: merge semaphore objects from st into mesa
|
|
- mesa/st: merge the syncobj code from st into mesa
|
|
- mesa/st: merge queryobj code from st into mesa.
|
|
- mesa/st: merge condrender code from st into mesa.
|
|
- mesa/st: merge st transform feedback object into gl one.
|
|
- mesa/st: merge transform feedback code from st into mesa
|
|
- mesa/st: merge perfmon object from st into mesa
|
|
- mesa/st: merge perfmon counters/groups objects from st into mesa
|
|
- mesa/st: merge perfmon groups init/cleanup from st into mesa
|
|
- mesa/st: move perfmon code from st into mesa
|
|
- mesa/st: cleanup last bits of st perfmon code.
|
|
- mesa/st: merge texture object/image structs into mesa
|
|
- mesa/st: merge texture obj/image alloc/free into mesa
|
|
- intel/perf: use a function to do common allocations
|
|
- meson: start building intel earlier.
|
|
- mesa/st: move intel blackhole noop enable to frontend
|
|
- mesa/st: remove st_context from debug callback
|
|
- mesa/st: migrate debug callback code into mesa
|
|
- mesa/st: drop last user of st_Enable.
|
|
- mesa/st: directly call the uuid get funcs.
|
|
- mesa/st: drop emit string marker device table entry.
|
|
- mesa/st: move pin l3 cache to direct check/call.
|
|
- mesa/dd: drop GetProgramBinaryDriverSHA1
|
|
- mesa/st: drop useless tex parameter calls.
|
|
- mesa/st: move st_TexParameter into mesa
|
|
- mesa/st: drop release all sampler views wrapper
|
|
- mesa/st/vdpau: direct call the vdpau functions.
|
|
- mesa/st: move evaluate depth values into mesa
|
|
- mesa/ctx: store screen pointer in ctx as well
|
|
- mesa/st: inline st_max_shader_compiler_threads
|
|
- mesa/st: move shader completion into mesa
|
|
- mesa/st: move memory query into mesa.
|
|
- mesa/st: drop some bindless wrappers
|
|
- mesa/st: drop lots of perfquery wrappers
|
|
- mesa/st: move perf query test to st_context, drop files.
|
|
- mesa/st: merge st_renderbuffer into gl_renderbuffer.
|
|
- mesa/st: move some renderbuffer code into mesa
|
|
- mesa/st: merge framebuffer objects from st to mesa
|
|
- mesa/st: move render/finish_render texture in to mesa.
|
|
- mesa/st: move validate/discard framebuffer into mesa
|
|
- mesa/st: move st_ReadBuffer functionality into mesa
|
|
- mesa/st: move DrawBufferAllocate into mesa.
|
|
- mesa/st: move st renderbuffer code into mesa renderbuffer
|
|
- mesa/st: move map/unmap renderbuffer code into mesa
|
|
- mesa/st: move st_new_renderbuffer_fb to manager
|
|
- mesa/st: move some fbo helpers around.
|
|
- mesa/st: fixup viewport drawable invalidation
|
|
- mesa/st: migrate blit code into mesa
|
|
- mesa/st: move last of renderbuffer functionality into mesa
|
|
- mesa/st: move renderbuffer format choosing wrapper into mesa.
|
|
- mesa/st: move manager colorbuffer interface to gl_context.
|
|
- mesa/st: move invalidate_on_gl_viewport to ctx
|
|
- mesa/st: move st_fb_orientation into a mesa function
|
|
- mesa: consolidate setting no error state and checking suid.
|
|
- mesa/st: collapse st_program into gl_program object.
|
|
- mesa/st: move new ati fragment shader to mesa
|
|
- mesa/st: move st_vertex_program to gl_vertex_program in mesa
|
|
- mesa/st: move program new/delete into mesa
|
|
- mesa/st: refactor program translation into one file.
|
|
- crocus: find correct relocation target for the bo.
|
|
- crocus: fix leak on gen4/5 stencil fallback blit path.
|
|
- draw/so: don't use pre clip pos if we have a tes either.
|
|
- lavapipe: fix sampler + sampler view leaks.
|
|
- lavapipe: reference gallium fences correctly.
|
|
- vulkan/wsi: handle queue families properly for non-concurrent sharing mode.
|
|
- crocus: change the line width workaround for gfx4/5
|
|
- gallivm/nir: extract a valid texture index according to exec_mask.
|
|
|
|
Derek Foreman (3):
|
|
|
|
- egl/wayland: Properly clear stale buffers on resize
|
|
- panfrost: Support planar formats for scanout
|
|
- panfrost: support PIPE_RESOURCE_PARAM_NPLANES query
|
|
|
|
Dylan Baker (54):
|
|
|
|
- docs: add release notes for 21.2.4
|
|
- docs: add sha256 sum for 21.2.4 release
|
|
- docs: update calendar and link releases notes for 21.2.4
|
|
- meson: use gtest protocol for gtest based tests when possible
|
|
- docs: add release notes for 21.2.5
|
|
- docs: add sha256 sums for 21.2.5
|
|
- docs: update calendar and link releases notes for 21.2.5
|
|
- turnip: don't use mesa/macros.h to get utils/rounding.h
|
|
- docs: add release notes for 21.2.6
|
|
- docs: add sha256 sums for 21.2.6 relnotes
|
|
- docs: update calendar and link releases notes for 21.2.6
|
|
- docs/release-calendar: remove additional 21.2 releases
|
|
- docs: Add calendar entries for 22.0 release candidates.
|
|
- mesa/tests: ensure that util_cpu_detect has been called
|
|
- classic/r100: Delete driver
|
|
- classic/r200: Delete driver
|
|
- classic/nouveau: Remove driver
|
|
- classic/i915: Remove driver
|
|
- classic/i965: Remove driver
|
|
- mesa/dri: remove mega driver stub
|
|
- include/pci_ids: Move PCI ids supported by both i965 and iris to iris
|
|
- mesa/main/tests: remove dispatch sanity
|
|
- mesa: Delete libmesa_classic
|
|
- mesa: Merge libmesa_gallium and libmesa_common
|
|
- mesa: move common/dri into gallium
|
|
- meson: restore dri.pc file
|
|
- CODEOWNERS: remove OpenSWR
|
|
- new_features: Add OpenSWR removal
|
|
- fixup! gallium/swr: Remove driver source
|
|
- docs: move the release for 22.0 out
|
|
- VERSION: bump for 22.0.0-rc1 release
|
|
- .pick_status.json: Update to 0447a2303fb06d6ad1f64e5f079a74bf2cf540da
|
|
- .pick_status.json: Update to 8335fdfeafbe1fd14cb65f9088bbba15d9eb00dc
|
|
- .pick_status.json: Update to 5e9df85b1a4504c5b4162e77e139056dc80accc6
|
|
- VERSION: bump version for 22.0.0-rc2
|
|
- .pick_status.json: Update to 7955df28a6660d8dff77c79c345aa28aa7fa859c
|
|
- .pick_status.json: Update to 22fc53493092a7507c1e2db47b0c8763158d7b2d
|
|
- .pick_status.json: Update to 108e6eaa83eed3eb356f3cce835c5f5e3a836b8e
|
|
- .pick_status.json: Mark b07372312d7053f2ef5c858ceb1fbf9ade5e7c52 as denominated
|
|
- .pick_status.json: Update to 3759a16d8a883355effa435f46867951ce712dbe
|
|
- .pick_status.json: Mark a8418abd748e8e761dda9c3594e29e560833d9ff as denominated
|
|
- .pick_status.json: Update to 22efeec399fb55dcb364dabd65afab137d6f3fc3
|
|
- .pick_status.json: Update to f030b75b7d2c359b90c18ee4ed83fa05265c12e0
|
|
- .pick_status.json: Update to fd3451babd6cded6794561d74c8919576ba1f97d
|
|
- llvmpipe: remove test that passes on 22.0 for some reason from the fail list
|
|
- .pick_status.json: Update to b21e7e1ef7ba2c2cf97f073a8a01d6bc43835622
|
|
- d3d12: mark incorrectly passing tests as expected failures
|
|
- VERSION: bump for 22.0.0-rc3
|
|
- .pick_status.json: Update to 48b3ef625e1909c1be31fbe10adb53734af38eb4
|
|
- .pick_status.json: Update to 48b3ef625e1909c1be31fbe10adb53734af38eb4
|
|
- ci/zink: mark tests fixed by previous commit as fixed
|
|
- ci/zink: mark more expected passes as such
|
|
- Revert "panfrost: Fix set_sampler_views for big GL"
|
|
- .pick_status.json: Update to 342e6f83321a91816358dee82178809b2a8aeeaa
|
|
|
|
Edward O'Callaghan (4):
|
|
|
|
- clover/images: Add array_size to implement CL_IMAGE_ARRAY_SIZE
|
|
- clover: Implement CL_MEM_OBJECT_IMAGE2D_ARRAY
|
|
- clover: Implement CL_MEM_OBJECT_IMAGE1D_BUFFER
|
|
- clover: Implement CL_MEM_OBJECT_IMAGE1D_ARRAY
|
|
|
|
Eleni Maria Stea (1):
|
|
|
|
- dri_drawable: missing header
|
|
|
|
Ella Stanforth (2):
|
|
|
|
- v3dv: Implement VK_KHR_create_renderpass2
|
|
- vulkan: Allow RegisterDisplayEventEXT before first page flip
|
|
|
|
Ella-0 (2):
|
|
|
|
- v3dv: implement VK_EXT_host_query_reset
|
|
- v3dv: enable VK_KHR_swapchain_mutable_format
|
|
|
|
Emma Anholt (204):
|
|
|
|
- ci/lvp: Skip some slow tests under ASan.
|
|
- loader: Avoid enumerating drm devices just to get an fd's PCI ID.
|
|
- freedreno/afuc: Disable the disassembler on 32-bit builds.
|
|
- ci/deqp-runner: Drop SUMMARY_LIMIT env var.
|
|
- ci/deqp-runner: Simplify the --jobs argument setup.
|
|
- ci/deqp-runner: Use new deqp-runner's built-in renderer/version checks.
|
|
- ci/deqp-runner: Drop silly CSV env vars.
|
|
- ci/deqp-runner: Move remaining asan runs to --env LD_PRELOAD=
|
|
- ci/deqp-runner: Drop LD_LIBRARY_PATH=/usr/local for libkms workaround.
|
|
- ci/deqp-runner: Don't start GPU hang detection for making junit results.
|
|
- ci/deqp-runner: Move more non-suite logic under the non-suite 'if'.
|
|
- ci/piglit-runner: Fix funny indentation of the piglit-runner command.
|
|
- turnip: Switch tu6_format_texture() to a pipe_format.
|
|
- turnip: Switch tu6_format_color() to a pipe_format.
|
|
- turnip: Switch format_to_ifmt() to take a pipe_format.
|
|
- util: Move freedreno's snorm-to-unorm to util/, adding remaining cases.
|
|
- turnip: Make copy_format() and tu6_plane_format() return pipe_format
|
|
- gallium/u_blitter: Read MSAA z/s from sampler's .x instead of .y or .z.
|
|
- turnip: Drop the assertion about the temporary bit in sync fd imports.
|
|
- ci/radeonsi: Use a deqp-runner suite suite for stoney.
|
|
- ci/deqp-runner: Rename the deqp-drivername-\*.txt files to drivername-\*.txt
|
|
- ci/piglit-runner: Merge piglit-driver-\*.txt files into driver-\*.txt.
|
|
- ci: Enable testing radeonsi's libva using libva-util unit tests.
|
|
- ci/etnaviv: Fix the dependency for the build artifacts.
|
|
- ci/etnaviv: Add some more deqp flakes I've seen in recent runs.
|
|
- etnaviv: Switch to the NIR compiler by default.
|
|
- i915g: Check for negate/swizzle on TGSI_OPCODE_KILL_IF's src.yzw.
|
|
- i915g: Make sure we consider negates/swizzles on bias/shadow coords.
|
|
- i915g: Improve the explanation for the 1D Y swizzle.
|
|
- ci: Uprev VK-GL-CTS to 1.2.7.2, and pull in piglit while I'm here.
|
|
- freedreno: Fix gmem invalidating the depth or stencil of packed d/s.
|
|
- freedreno/a6xx: Emit a null descriptor for unoccupied IBO slots.
|
|
- freedreno/fdl6: Add an interface for setting up buffer descriptors.
|
|
- turnip: Use the new shared buffer-view descriptor creation function.
|
|
- turnip: Remove buffer-view cross-check code.
|
|
- freedreno/fdl6: Add support for texture swizzles of A/L/I/LA/RGBx.
|
|
- freedreno/a6xx: Use the fdl buffer view setup for img/ssbo descriptors.
|
|
- freedreno: Fix layered rendering to just Z/S and not color.
|
|
- freedreno/a6xx: Fix partial z/s clears with sysmem.
|
|
- freedreno/a5xx: Use the defined names for 2D_BLIT_CNTL regs.
|
|
- freedreno/a618: Mark a flaky test that triggers hangcheck.
|
|
- freedreno/a6xx: Don't try to generate mipmaps for SNORM with our blitter.
|
|
- freedreno/a5xx+: Fix missing LA formats.
|
|
- freedreno/a5xx: Diff reduction in fd5_layout to fd6_layout.
|
|
- freedreno/a6xx: Try to fix drawing to z/s miplevel/layer offsets.
|
|
- freedreno/a5xx: Remove bogus assertion about BO size.
|
|
- freedreno/a5xx: Try to fix drawing to z/s miplevel/layer offsets.
|
|
- freedreno/a5xx: Clean up a little bit of blitter array pitch setup.
|
|
- freedreno/ir3: Fix off-by-one in prefetch safety assert.
|
|
- turnip: Claim 1.2.7.1 CTS conformance.
|
|
- freedreno: Fix the uniform/nonuniform handling for cat5 bindful modes.
|
|
- freedreno: Fix constant-index assumptions in IBO loads.
|
|
- ci/etnaviv: Mark the rest of uniform_api.random as flaky.
|
|
- ci/etnaviv: Add more texturing flakes.
|
|
- ci/bare-metal: Close serial and join serial threads before exit.
|
|
- util: Rename PIPE_FORMAT_G8_B8_R8_420_UNORM.
|
|
- util/format: Add G8_B8R8_420_UNORM to match Vulkan.
|
|
- freedreno/fdl6: Skip redundant setting of TILE_ALL for NV12.
|
|
- freedreno: Set layer_first on (2D) resource imports.
|
|
- freedreno/a6xx: Create a fd6_view at sampler view update time.
|
|
- freedreno/a6xx: Switch to relying on fd6_view for our texture descriptors.
|
|
- freedreno/a6xx: Use fd6_view for non-buffer image descriptors, too.
|
|
- freedreno/a6xx: Clean up sysmem fb read patching using fd6_view.
|
|
- freedreno/a6xx: Drop an unused tile_mode arg.
|
|
- freedreno/a6xx: Inline remaining fd6_tex_const_0() call.
|
|
- mesa/st: Disable NV_copy_depth_to_color on non-doubles-capable HW.
|
|
- freedreno/a6xx: Add some notes about piglit failures.
|
|
- freedreno/ir3: Fix an off-by-one in so->outputs_count safety assert.
|
|
- ci/freedreno: Enable the tes-input/tcs-input tests.
|
|
- freedreno/a6xx: Do sparse setup of the TFB program.
|
|
- ci/freedreno: Add notes explaining the KHR-GL* failures.
|
|
- ci/freedreno: Add a link to the issue for color_depth_attachments.
|
|
- freedreno/fdl: Dump the generated layout when a layout test fails.
|
|
- freedreno: Fix the texturator unit test script.
|
|
- freedreno/cffdump: Fix 64-bit reg decode in script mode.
|
|
- freedreno/fdl: Add support for unit testing 3D texture array strides.
|
|
- freedreno/cffdump: Fix up formatting of texturator unit test script output.
|
|
- freedreno/cffdump: Handle the TILE_ALL flag in unit test generation.
|
|
- freedreno/a6xx: Fix a bunch of 3D texture layout to match blob behavior.
|
|
- freedreno: Stop exposing MSAA image load/store on desktop GL.
|
|
- freedreno/a6xx: Disable sample averaging on z/s or integer blits.
|
|
- freedreno/a6xx: Disable sample averaging on non-ubwc z24s8 MSAA blits.
|
|
- freedreno/a5xx: Define a5xx_2d_surf_info like a6xx has.
|
|
- freedreno/a5xx: Document the sRGB bit on RB_2D_SRC/DST info.
|
|
- freedreno/a5xx: Emit MSAA state for sysmem rendering, too.
|
|
- nir: Un-inline nir_builder_alu_instr_finish_and_insert()
|
|
- nir: Un-inline more of nir_builder.h.
|
|
- freedreno/ir3: Make a shared helper for the tess factor stride.
|
|
- turnip: Move CP_SET_SUBDRAW_SIZE to vkCmdBindPipeline() time.
|
|
- nir: Uninline a bunch of nir.h functions.
|
|
- nir: Make nir_build_alu() variants per 1-4 arg count.
|
|
- freedreno/a6xx: Allocate a fixed-size tess factor BO.
|
|
- freedreno/a6xx: Skip emitting tess BO pointers past the shader's constlen.
|
|
- freedreno/a6xx: Set the tess BO ptrs in the program stateobj.
|
|
- mesa/st: Remove GL_ARB_depth_clamp emulation support.
|
|
- r300: Add deqp expectations for RV515.
|
|
- r300: Turn a comment about presub into an assert.
|
|
- r300: Fix mis-optimization turning -1 - x into 1 - x.
|
|
- nouveau/nir: Use the address reg for indirect scratch access.
|
|
- nir/algebraic: Move some generated-code algebraic opt args into a struct.
|
|
- nir/algebraic: Drop the check for cache == None.
|
|
- nir/algebraic: Replace relocations for nir_search values with a table.
|
|
- nir/algebraic: Remove array-of-cond code
|
|
- nir/algebraic: Move relocations for expression conds to a table.
|
|
- nir/algebraic: Move relocations for variable conds to a table.
|
|
- nir/algebraic: Pack various bitfields in the nir_search_value_union.
|
|
- nir/algebraic: Mark the automaton's filter tables as const.
|
|
- nir/algebraic: Move all the individual transforms to a common table.
|
|
- r300: Move the instruction filter for r500_transform_IF() to the top.
|
|
- r300: Cache the var list in the peephole_mul_omod() loop.
|
|
- r300: Ensure that immediates have matching negate flags too.
|
|
- r300: Also consider ALU condition modifiers for loop DCE.
|
|
- r300: Remove the non_normalized_coords from the shader key.
|
|
- r300: Precompile the FS at shader creation time.
|
|
- r300: Route shader stats output to ARB_debug_output.
|
|
- r300/ci: Update loop expectations
|
|
- loader: Restore i915g support.
|
|
- r300/ci: Add some piglit expectations.
|
|
- nir/nir_to_tgsi: Add support for "if" statements with !native_integers
|
|
- nir_to_tgsi: Make !native_integers front face input match glsl_to_tgsi.
|
|
- r300: Disable loop unrolling on r500.
|
|
- r300: Request NIR shaders from mesa/st and use NIR-to-TGSI.
|
|
- nir_to_tgsi: Enable nir_opt_move.
|
|
- nir/nir_opt_move,sink: Include load_ubo_vec4 as a load_ubo instr.
|
|
- nir_to_tgsi: Set the TGSI Precise flag for exact ALU instructions.
|
|
- r300: Remove some dead compiler code.
|
|
- r300: Remove support for SCS.
|
|
- r300: Remove unused RC_OPCODE_ABS.
|
|
- r300: Remove unused RC_OPCODE_XPD.
|
|
- r300: Remove unused RC_OPCODE_SWZ.
|
|
- r300: Remove unused RC_OPCODE_CLAMP.
|
|
- r300: Remove unused RC_OPCODE_SFL
|
|
- r300: Remove unused RC_OPCODE_DPH
|
|
- ci/freedreno: Add known flakes from the last month.
|
|
- ci/crocus: Add support for manual CI runs on my G41.
|
|
- crocus: Clamp VS point sizes to the HW limits as required.
|
|
- glsl: Delete the vectorization opt pass.
|
|
- glsl: Delete the optimize_redundant_jumps pass.
|
|
- glsl: Remove dead prototype for old do_discard_simplification().
|
|
- glsl: Remove comment about non-existing DFREXP_TO_ARITH
|
|
- glsl: Retire unused modes for lower_64bit_integer_instructions.
|
|
- ci/r300: Add another xfail on the main branch.
|
|
- r300/vs: Allocate temps we see a use as a source, too.
|
|
- r300/vs: Reuse rc_match_bgnloop().
|
|
- r300/vs: Fix flow control processing just after an endloop.
|
|
- ci: Enable a build with MSan.
|
|
- glcpp: Disable the valgrind tests.
|
|
- softpipe: Drop the quad pstipple stage.
|
|
- softpipe: Use the draw module's poly stipple handling, like llvmpipe.
|
|
- softpipe: Drop duplicate decl of softpipe_find_fs_variant
|
|
- ci/crocus: Add manual CI for the new HSW box I have at home.
|
|
- ci: Enable reporting to the flakes IRC channel for i915g and crocus.
|
|
- ci/i915g: Add a couple more recent regressions.
|
|
- nir_to_tgsi: Use the same address reg mappings as GLSL-to-TGSI did.
|
|
- gallium: Delete PIPE_CAP_TGSI_ANY_REG_AS_ADDRESS
|
|
- tgsi/exec: Simplify indirects now that they always use the ADDR file.
|
|
- i915g: Turn off FP16 in the vertex shaders.
|
|
- r300: Use uif() instead of pointer aliasing in program printing.
|
|
- ci/r300: Update xfails from a full dEQP run.
|
|
- r300: Fix omod failing to increase the number of channels stored.
|
|
- nir_to_tgsi: Enable fdot_replicates flag.
|
|
- ci: Shrink container/rootfs sizes.
|
|
- nir_to_tgsi: Fix a bug in TXP detection after backend lowering.
|
|
- ci/llvmpipe: Sort the list of traces.
|
|
- ci/llvmpipe: Add a trace for the game JVGS, which got regressed recently.
|
|
- ci: Add paraview traces to several drivers.
|
|
- freedreno/ir3: Use nir_opt_offset for removing constant adds for shared vars.
|
|
- nir: Apply nir_opt_offsets to nir_intrinsic_load_uniform as well.
|
|
- r300: Drop unused r300_get_stats() call.
|
|
- r300: Add consts (uniforms) count to the shader-db output.
|
|
- nir/lower_dynamic_bo_access: Use copy_inst_indices for our cloned instrs.
|
|
- nir: Add a .base field to nir_load_ubo_vec4.
|
|
- nir/opt_offsets: Disable unsigned wrap checks on non-native-integers HW.
|
|
- nir/opt_offsets: Also apply the max offset to top-level constant folding.
|
|
- nir_to_tgsi: Use nir_opt_offsets for load_ubo_vec4.
|
|
- nir/opt_offsets: Use nir_ssa_scalar to chase offset additions.
|
|
- softpipe: respect !independent_blend_enable for color masks.
|
|
- softpipe: Request that st fix up DST_ALPHA blending for RGB render targets.
|
|
- ci/softpipe: Drop the GS sampling known-flakes.
|
|
- nir/lower_locals_to_regs: Do an ad-hoc copy propagate on our generated MOV.
|
|
- tgsi_exec: Fix shared var stores for >1 real invocation, and overflow checks.
|
|
- softpipe: Improve some local var naming in compute shaders.
|
|
- softpipe: Initialize the CS dispatch mask at machine setup time.
|
|
- softpipe: Dispatch 4 CS invocations per tgsi_exec thread.
|
|
- Revert "ci: freedreno: Update a530 dEQP fail expectation list"
|
|
- ci/llvmpipe: Drop the skip of piglit edgeflag test.
|
|
- ci/llvmpipe,softpipe: Switch piglit testing to piglit-runner.
|
|
- ci/traces: Rename the piglit/run.sh script to piglit-traces.sh.
|
|
- ci/traces: Drop the PIGLIT_PROFILES setting for traces replay.
|
|
- ci/traces: Clean up the failure report message.
|
|
- ci/traces: Drop the baseline file creation for trace results.
|
|
- ci/traces: Drop PIGLIT_REPLAY_UPLOAD_TO_MINIO.
|
|
- ci/traces: Always generate the junit XML.
|
|
- ci/freedreno: Update hashes for closed traces.
|
|
- vulkan: Fix leak of error messages
|
|
- ci: Use a dlclose-disabling preload library for leak checking in Vulkan.
|
|
- ci/freedreno: Switch 2 default a630 VK jobs to being GLES and VK ASan jobs.
|
|
- turnip: Store the computed iova in the tu_buffer.
|
|
- turnip: Store the computed iova in the tu_image.
|
|
- r300: Disable fp16 and int16 in swtcl vertex shaders.
|
|
- r300/r600: Add drm-shim support.
|
|
- freedreno/isaspec: Add missing dep of encode.py/decode.py calls on isa.py
|
|
- intel: Add missing dep of gen_*_header.py on utils.py.
|
|
- i915g: Initialize the rest of the "from_nir" temporary VS struct.
|
|
|
|
Enrico Galli (3):
|
|
|
|
- microsoft/compiler: Shadow tex instructions always use shadow samplers
|
|
- microsoft/spirv_to_dxil: Add DXIL validation to spirv2dxil
|
|
- ci/windows: Add validation tests for spriv_to_dxil
|
|
|
|
Eric Engestrom (33):
|
|
|
|
- VERSION: bump to 21.4
|
|
- docs: reset new_features.txt
|
|
- VERSION: bump to 22.0
|
|
- docs: update calendar for 21.3.0-rc1
|
|
- docs: update calendar for 21.3.0-rc2
|
|
- docs: update calendar for 21.3.0-rc3
|
|
- docs: update calendar for 21.3.0-rc4
|
|
- release-calendar: fix date for next 21.3 rc
|
|
- meson: drop duplicate addition of surfaceless & drm to the list of platforms
|
|
- meson: move \`egl_native_platform` definition inside the \`with_egl` block
|
|
- meson: drop impossible \`if no platform` branch
|
|
- meson: always define \`HAVE_{X11,XCB}_PLATFORM` when it's enabled
|
|
- meson: automatically define \`HAVE_{some}_PLATFORM`
|
|
- docs: update calendar for 21.3.0-rc5
|
|
- docs/submittingpatches: add link to section describing how to make a backport MR
|
|
- docs/submittingpatches: add formatting around the release branches names
|
|
- docs/submittingpatches: mention use of the \`-x` flag of \`git cherry-pick` when backporting a commit
|
|
- docs: update branchpoint instructions
|
|
- docs: add release notes for 21.3.0
|
|
- docs: update calendar and link releases notes for 21.3.0
|
|
- docs: add 21.3.x release schedule
|
|
- docs: add 22.0 branchpoint date for perspective
|
|
- docs: add release notes for 21.3.1
|
|
- docs: update calendar and link releases notes for 21.3.1
|
|
- docs: add release notes for 21.3.2
|
|
- docs: update calendar and link releases notes for 21.3.2
|
|
- docs: add release notes for 21.3.3
|
|
- docs: update calendar and link releases notes for 21.3.3
|
|
- docs: add release notes for 21.3.4
|
|
- docs: update calendar and link releases notes for 21.3.4
|
|
- docs: add release notes for 21.3.5
|
|
- docs: update calendar and link releases notes for 21.3.5
|
|
- docs/release-calendar: add another 21.3.x since 22.0 has been delayed a bit
|
|
|
|
Erico Nunes (5):
|
|
|
|
- ci: temporarily disable lima CI
|
|
- mesa: fix GL_MAX_SAMPLES with GLES2
|
|
- lima/gpir: avoid invalid write in regalloc
|
|
- lima/ppir: initialize slots array for dummy/undef
|
|
- lima/ppir: refactor bitcopy to use unsigned char
|
|
|
|
Erik Faye-Lund (9):
|
|
|
|
- docs: update trademark disclaimer
|
|
- CODEOWNERS: remove ownership of deleted code
|
|
- ci: remove testing of deleted code
|
|
- docs: remove mentions of deleted code
|
|
- docs: remove stale notice about deleted dir
|
|
- ensure csv-files are crlf on disk
|
|
- bin/gen_calendar_entries: fix newlines on windows
|
|
- docs: use http-redirect when possible
|
|
- docs: remove incorrect drivers from extension
|
|
|
|
Felix DeGrood (2):
|
|
|
|
- anv: increase binding table pool size to 64KB
|
|
- pps: increase intel.cfg buffer size
|
|
|
|
Filip Gawin (8):
|
|
|
|
- r300: improve precission of linear interpolation
|
|
- r300: stub derivatives on r300 and r400 hardware
|
|
- nir: assert that variables in optimize_atomic are initialized
|
|
- glsl: fix trivial strict aliasing warning
|
|
- radv: dont call calloc when BVH is empty
|
|
- iris: fix mapping compressed textures
|
|
- r300: fix handling swizzle in transform_source_conflicts
|
|
- r300: replace recursive calls with loops
|
|
|
|
Francisco Jerez (28):
|
|
|
|
- intel/fs/xehp: Teach SWSB pass about the exec pipeline of FS_OPCODE_PACK_HALF_2x16_SPLIT.
|
|
- intel/fs: Add physical fall-through CFG edge for unconditional BREAK instruction.
|
|
- intel/dev: Fix size of device info num_subslices array.
|
|
- intel/dev: Add support for pixel pipe subslice accounting on multi-slice GPUs.
|
|
- intel/dev: Implement DG2 restrictions requiring additional DSSes to be disabled.
|
|
- intel/xehp: Implement XeHP workaround Wa_14013910100.
|
|
- intel/xehp: Implement XeHP workaround Wa_14014148106.
|
|
- intel/xehp: Update 3DSTATE_PS maximum number of threads per PSD.
|
|
- intel/fs: Don't assume packed dispatch for fragment shaders on XeHP.
|
|
- intel/blorp/gfx12+: Drop unnecessary state cache invalidation from binding table setup.
|
|
- intel/genxml: Fix SLICE_HASH_TABLE struct on XeHP.
|
|
- iris: Merge gfx11\_ and gfx12_upload_pixel_hashing_tables() into the same function.
|
|
- intel: Move pixel hashing table computation into common header file.
|
|
- intel: Minimal calculation of pixel hash table for arbitrary number of pixel pipes.
|
|
- intel: Rename intel_compute_pixel_hash_table() to intel_compute_pixel_hash_table_3way().
|
|
- iris: Program pixel hashing tables on XeHP.
|
|
- anv: Program pixel hashing tables on XeHP.
|
|
- intel/xehp: Switch to coarser cross-slice pixel hashing with table permutation.
|
|
- iris/xehp: Implement workaround for 3D texturing+anisotropic filtering.
|
|
- intel/fs/xehp: Merge repeated in-order read dependencies instead of replacement.
|
|
- intel/fs: Move legal exec type calculation into helper function in lower_regioning pass.
|
|
- intel/fs: Teach the lower_regioning pass how to split instructions of unsuported exec type.
|
|
- intel/fs: Take into account region strides during SIMD lowering decision of SHUFFLE.
|
|
- intel/fs: Fix destination suboffset calculations for non-trivial strides in SHUFFLE codegen.
|
|
- intel/fs: Perform 64-bit SHUFFLE lowering in the lower_regioning pass.
|
|
- intel/fs: Perform 64-bit SEL_EXEC lowering in the lower_regioning pass.
|
|
- intel/fs: Honor strided source regions specified by the IR for CLUSTER_BROADCAST.
|
|
- intel/fs: Perform 64-bit CLUSTER_BROADCAST lowering in the lower_regioning pass.
|
|
|
|
Georg Lehmann (8):
|
|
|
|
- meson: Use get_supported_arguments more often.
|
|
- meson: Remove some unnecessary loops.
|
|
- amd/addrlib: Use get_supported_arguments to get compiler args.
|
|
- radv: Increase maxFragmentCombinedOutputResources.
|
|
- vulkan/wsi/wayland: Fix add_wl_shm_format alpha/opaqueness.
|
|
- vulkan/wsi/wayland: Convert missing vulkan formats to shm formats.
|
|
- vulkan/wsi/wayland: Add modifiers for RGB formats.
|
|
- vulkan/wsi/wayland: Fix add_drm_format_modifier aplha/opaqueness.
|
|
|
|
Gert Wollny (7):
|
|
|
|
- virgl: Add driconf tweak to force-enable reading back R8_SRGB textures
|
|
- virgl: obtain supported number of shader sampler views from host
|
|
- ci: pin virglrenderer version
|
|
- virgl: Enable higher compatibility profiles if host supports it
|
|
- util/primconvert: map only index buffer part that is needed
|
|
- nir_lower_io: propagate the "invariant" flag to outputs
|
|
- ntt: Set the output invariant flag according to the semantics
|
|
|
|
Greg V (2):
|
|
|
|
- util: __getProgramName: remove check for ancient FreeBSD versions, simplify ifdefs
|
|
- util: make util_get_process_exec_path work on FreeBSD w/o procfs
|
|
|
|
Guido Günther (6):
|
|
|
|
- etnaviv/drm: Use etna_mesa_debug for debugging messages
|
|
- etnaviv/drm: Add some bo debug output
|
|
- etnaviv/drm: Print gpu model at debug verbosity
|
|
- etnaviv/drm: Use mesa_log* for debugging
|
|
- entaviv/drm: Use same log format as gallium bits
|
|
- etnaviv: Use mesa_log*
|
|
|
|
Guilherme Gallo (14):
|
|
|
|
- ci: Update linux kernel to v5.15
|
|
- iris/ci: Fix traces for amly and deqp list for whl
|
|
- ci/freedreno: Add maxcpus=2 to the kernel cmdline on a530
|
|
- panfrost/ci: update piglit tests expectations on G52
|
|
- ci: Update ci-fairy to version with --token-file support
|
|
- ci: Uprev piglit
|
|
- ci: Use ci-fairy minio login via token file
|
|
- ci: Build skqp on ARM64 images
|
|
- ci: Add a630_skqp jobs
|
|
- ci: skqp: Add documentation on how to maintain skqp jobs
|
|
- ci: Uprev Kernel to v5.16
|
|
- ci: freedreno: Update a530 dEQP fail expectation list
|
|
- ci: Add docs for Linux Kernel uprevs
|
|
- virgl/ci: make crosvm-runner pass variables in a secure way
|
|
|
|
Hamish Arblaster (1):
|
|
|
|
- zink: Fix building on macOS
|
|
|
|
Henry Goffin (1):
|
|
|
|
- intel/compiler/test: Fix build with GCC 7
|
|
|
|
Hoe Hao Cheng (3):
|
|
|
|
- zink/codegen: support platform tags
|
|
- zink/codegen: remove core_since in constructor
|
|
- zink/codegen: remove bogus print statement
|
|
|
|
Hyunjun Ko (10):
|
|
|
|
- turnip: expose VK_KHR_driver_properties
|
|
- anv: Fix to honor the spec to get stencil layout.
|
|
- radv: Fix to honor the spec to get stencil layout.
|
|
- vulkan/util: Move helper functions for depth/stencil images to vk_iamge
|
|
- turnip: Enable VK_KHR_separate_depth_stencil_layouts
|
|
- turnip: Use the new common device lost tracking
|
|
- vulkan: fix typo
|
|
- turnip: Porting to common vulkan implementation for synchronization.
|
|
- turnip: Porting to common implementation for timeline semaphore
|
|
- turnip: fix leaks of submit requests.
|
|
|
|
Iago Toral Quiroga (63):
|
|
|
|
- v3dv: fix TLB buffer to image copy path for 3D images
|
|
- v3dv: enable Vulkan 1.1
|
|
- broadcom/compiler: disallow tsy barrier in thrsw delay slots
|
|
- broadcom/compiler: fix assert that current instruction must be in current block
|
|
- v3dv: refactor TFU jobs
|
|
- broadcom/compiler: rework simultaneous peripheral access checks
|
|
- broadcom/compiler: fix condition encoding bug
|
|
- broadcom/compiler: padding fixes to QPU assembly dumps
|
|
- broadcom/compiler: make opt passes set current block
|
|
- broadcom/compiler: check that sig packing is valid when pipelining ldvary
|
|
- broadcom/compiler: copy packing when converting add to mul
|
|
- v3dv,v3d: don't store swizzle pointer in shader/pipeline keys
|
|
- v3d: use V3D_MAX_DRAW_BUFFERS instead of hardcoded constant
|
|
- v3dv: account for multisampling when computing subpass granularity
|
|
- v3dv: don't use a global constant for default pipeline dynamic state
|
|
- v3d,v3dv: move tile size calculation to a common helper
|
|
- v3dv: fix internal bpp of D/S formats
|
|
- broadcom/compiler: fix early fragment tests setup
|
|
- broadcom/compiler: don't allow RF writes from signals after thrend
|
|
- broadcom/compiler: fix scoreboard locking checks
|
|
- broadcom/compiler: don't move ldvary earlier if current instruction has ldunif
|
|
- broadcom/compiler: allow color TLB writes in last instruction
|
|
- broadcom/compiler: relax restriction on VPM inst in last thread end slot
|
|
- broadcom/compiler: emit passthrough Z write if shader reads Z
|
|
- broadcom/compiler: track passthrough Z writes
|
|
- v3d,v3dv: don't disable EZ for passthrough Z writes
|
|
- broadcom/compiler: improve documentation for Z writes
|
|
- broadcom/compiler: improve thrsw merge
|
|
- v3dv: add a refcount mechanism to BOs
|
|
- v3dv: add swizzle helpers to identify formats wit R/B swap and reverse flags
|
|
- v3dv: handle formats with reverse flag
|
|
- v3dv: implement VK_EXT_4444_formats
|
|
- v3dv: implement double-buffer mode
|
|
- v3d: implement double-buffer mode
|
|
- docs/features: flag VK_KHR_create_renderpass2 as implemented for v3dv
|
|
- broadcom/simulator: handle DRM_V3D_PARAM_SUPPORTS_MULTISYNC_EXT
|
|
- v3dv: implement VK_KHR_driver_properties
|
|
- broadcom/compiler: add lowering pass to scalarize non 32-bit general load/store
|
|
- broadcom/compiler: better document vectorization implications
|
|
- broadcom/compiler: implement TMU general 16-bit load/store
|
|
- broadcom/compiler: lower packing after vectorization
|
|
- broadcom/compiler: support ldunifa with some 16-bit loads
|
|
- broadcom/compiler: use ldunifa with unaligned constant offset
|
|
- broadcom/compiler: implement 32-bit/16-bit conversion opcodes
|
|
- broadcom/compiler: support f32 to f16 RTZ and RTE rounding modes
|
|
- v3dv: support VK_KHR_16_bit_storage
|
|
- broadcom/compiler: support 16-bit uniforms
|
|
- v3dv: expose storagePushConstant16 feature from VK_KHR_16bit_storage
|
|
- broadcom/compiler: support 8-bit general store access
|
|
- broadcom/compiler: handle to/from 8-bit integer conversions
|
|
- broadcom/compiler: support 8-bit loads via ldunifa
|
|
- broadcom/compiler: allow vectorization to larger scalar type
|
|
- broadcom/compiler: update comment on load_uniform fast-path
|
|
- v3dv: support VK_KHR_8bit_storage
|
|
- v3dv: drop signature of undefined function
|
|
- v3dv: implement VK_KHR_imageless_framebuffer
|
|
- v3dv: rework Vulkan 1.2 feature queries
|
|
- v3dv: document why we don't expose VK_EXT_scalar_block_layout
|
|
- v3dv: support resolving depth/stencil attachments
|
|
- v3dv: fallback to blit resolve if render area is not aligned to tile boundaries
|
|
- v3dv: expose VK_KHR_depth_stencil_resolve
|
|
- v3dv: don't submit noop job if there is nothing to wait on or signal
|
|
- broadcom/compiler: fix offset alignment for ldunifa when skipping
|
|
|
|
Ian Romanick (35):
|
|
|
|
- nir/loop_unroll: Always unroll loops that iterate at most once
|
|
- glsl/nir: Don't build soft float64 when it cannot be used
|
|
- nir/constant_folding: Optimize txb with bias of constant zero to tex
|
|
- intel/compiler: Don't predicate a WHILE if there is a CONT
|
|
- intel/compiler: Don't store "scalar stage" bits on Gfx8 or Gfx9
|
|
- intel/stub: Suppress warnings about DRM_I915_QUERY_PERF_CONFIG
|
|
- intel/stub: Implement DRM_I915_QUERY_ENGINE_INFO
|
|
- intel/stub: Implement DRM_I915_QUERY_MEMORY_REGIONS
|
|
- intel/stub: Implement I915_PARAM_HAS_USERPTR_PROBE
|
|
- intel/fs: Use HF as destination type for F32TOF16 in fquantize2f16
|
|
- mesa: OpenGL 1.3 feature GL_ARB_texture_border_clamp is not optional
|
|
- mesa: OpenGL 1.3 feature GL_ARB_texture_cube_map is not optional
|
|
- mesa: OpenGL 1.3 feature GL_ARB_texture_env_combine is not optional
|
|
- mesa: OpenGL 1.3 feature GL_ARB_texture_env_dot3 is not optional
|
|
- mesa: OpenGL 1.3 and OpenGL ES 1.0 are not optional
|
|
- intel/stub: Silence "initialized field overwritten" warning
|
|
- intel/stub: Implement shell versions of DRM_I915_GEM_GET_TILING and DRM_I915_SEM_GET_TILING
|
|
- intel/fs: Fix gl_FrontFacing optimization on Gfx12+
|
|
- ntt: Extend ntt_compile::addr_declared and ntt_compile::addr_reg
|
|
- mesa: OpenGL 1.4 feature GL_ARB_depth_texture is not optional
|
|
- mesa: OpenGL 1.4 feature GL_ARB_texture_env_crossbar is not optional
|
|
- mesa: OpenGL 1.4 feature GL_EXT_blend_color is not optional
|
|
- mesa: OpenGL 1.4 feature GL_EXT_blend_func_separate is not optional
|
|
- mesa: OpenGL 1.4 feature GL_EXT_blend_minmax is not optional
|
|
- mesa: OpenGL 1.4 feature GL_EXT_point_parameters is not optional
|
|
- mesa: OpenGL ES 1.1 is not optional
|
|
- gallivm/nir: Call nir_lower_bool_to_int32 after nir_opt_algebraic_late
|
|
- nir: All set-on-comparison opcodes can take all float types
|
|
- intel/fs: Don't optimize out 1.0*x and -1.0*x
|
|
- spriv: Produce correct result for GLSLstd450Step with NaN
|
|
- spirv: Produce correct result for GLSLstd450Modf with Inf
|
|
- spirv: Produce correct result for GLSLstd450Tanh with NaN
|
|
- nir: Properly handle various exceptional values in frexp
|
|
- nir: Produce correct results for atan with NaN
|
|
- nir: Add missing dependency on nir_opcodes.py
|
|
|
|
Icecream95 (9):
|
|
|
|
- panfrost: Set PIPE_CAP_QUADS_FOLLOW_PROVOKING_VERTEX_CONVENTION
|
|
- pan/bi: Add interference between destinations
|
|
- pan/bi: Check dependencies of both destinations of instructions
|
|
- panfrost: Set dirty state in set_shader_buffers
|
|
- panfrost: Re-emit descriptors after resource shadowing
|
|
- pan/bi: Make disassembler build reproducibly
|
|
- panfrost: Fix ubo_mask calculation
|
|
- pan/bi: Skip psuedo sources in ISA.xml
|
|
- pan/bi: Don't assign slots for the blend second source
|
|
|
|
Ilia Mirkin (50):
|
|
|
|
- freedreno: support lua54
|
|
- meson: build freedreno tools when other parts of freedreno not enabled
|
|
- freedreno: check batch size after the fallback blitter clear
|
|
- freedreno/ir3: fix setting the max tf vertex when there are no outputs
|
|
- freedreno/a4xx: fix min/max/bias lod sampler settings
|
|
- mesa: add missing state to state string computation
|
|
- nv50,nvc0: expose R8/RG8_SRGB formats for texturing
|
|
- mesa: add just a tiny bit of debug info to some _mesa_problem calls
|
|
- freedreno: prefer float immediates when float values are involved
|
|
- freedreno/a[345]xx: add R8/RG8 SRGB formats
|
|
- a5xx: remove astc srgb workaround logic
|
|
- freedreno/ir3: fix image-to-tex flags, remove 3d -> array hack
|
|
- nir: remove double-validation of src component counts
|
|
- freedreno/a4xx: add proper buffer texture support
|
|
- freedreno/a5xx: re-express buffer textures more logically
|
|
- freedreno/a6xx: re-express buffer textures more logically
|
|
- freedreno/a4xx: fix stencil-textured border colors
|
|
- freedreno/a4xx: indicate whether outputs are uint/sint
|
|
- freedreno/a4xx: include guesses from a3xx for some of the constid's
|
|
- freedreno/a4xx: hook up sample mask/id, used to determine helper invocs
|
|
- freedreno/ir3: adjust condition for when to use ldib
|
|
- mesa: check target/format for Tex(ture)StorageMem*
|
|
- mesa: move around current texture object fetching
|
|
- freedreno/a4xx: add missing SNORM formats to help tests pass
|
|
- freedreno/a4xx: add some missing legacy formats to help TBOs
|
|
- freedreno/a5xx: add missing L8A8_UNORM format to support TBOs
|
|
- freedreno/a[345]xx: fix unorm/snorm blend factors when they're "over"
|
|
- nir: always keep the clip distance array size updated
|
|
- nir: apply interpolated input intrinsics setting when lowering clipdist
|
|
- freedreno/ir3: remove unused actual_in counting
|
|
- nir/lower_clip: replace bogus comment about gl_ClipDistance reading in GL
|
|
- nir/lower_clip: location offset goes into offset, not base
|
|
- nir/lower_clip: increment num_inputs/outputs by appropriate amount
|
|
- nir/lower_clip: support clipdist array + no vars
|
|
- freedreno/ir3: indicate that clipdist arrays are in use
|
|
- freedreno/ir3: get the post-lowering clip/cull mask
|
|
- ci: move windowoverlap exclusion to all-skips
|
|
- freedreno/ci: add piglit runs for a306
|
|
- freedreno/ci/a306: add additional skip which hangchecks
|
|
- freedreno/a3xx: add some legacy formats
|
|
- freedreno/ci/a306: separate msaa fails
|
|
- freedreno/ci/a306: split off the f32 blend / texturing failures
|
|
- freedreno/ci/a306: split off snorm blending failures
|
|
- freedreno/a5xx: enable OES_gpu_shader5
|
|
- gitlab-ci: serial close can leave an active read
|
|
- gitlab-ci: detect a3xx gpu hang recovery failure
|
|
- nv50,nvc0: add new caps to list
|
|
- nv50: don't claim support for format-less stores
|
|
- freedreno/ci/a306: add more skips
|
|
- freedreno/ci/a306: increase concurrency
|
|
|
|
Italo Nicola (4):
|
|
|
|
- virgl/vtest: use correct resource stride in flush_frontbuffer
|
|
- virgl/vtest: implement resource_create_front
|
|
- virgl: flush cmd buffer when flushing frontbuffer
|
|
- drisw: do an MSAA resolve when copying the backbuffer
|
|
|
|
Iván Briano (15):
|
|
|
|
- intel/nir: also allow unknown format for getting the size of a storage image
|
|
- anv: Remove unused struct member
|
|
- anv: add functions to set up fake render passes
|
|
- anv: allocate fake render pass on pipeline creation
|
|
- anv: Split out state attachments allocation
|
|
- anv: allocate fake render pass for continuation command buffers
|
|
- anv: Split attachment clearing code into their own functions
|
|
- anv: split end_subpass into more discrete components
|
|
- anv: implement the meat of VK_KHR_dynamic_rendering
|
|
- anv: Enable VK_KHR_dynamic_rendering
|
|
- anv: Handle resolveImageLayout on dynamic rendering
|
|
- anv: Report the right conformance version
|
|
- anv: make the pointer valid before we assign stuff into it
|
|
- anv: only advertise 64b atomic floats if 64b floats are supported
|
|
- intel/compiler: make CLUSTER_BROADCAST always deal with integers
|
|
|
|
Jakob Bornecrantz (1):
|
|
|
|
- vulkan-device-select: Don't leak drmDevicePtr
|
|
|
|
James Jones (1):
|
|
|
|
- gbm: Don't pass default usage flags on ABIs < 1
|
|
|
|
James Park (5):
|
|
|
|
- vulkan, radv: Support backslash in ICD paths
|
|
- aco: Work around MSVC restrict in c99_compat.h
|
|
- ac: Align ADDR_FASTCALL with addrlib
|
|
- radv: Match function definitions to declarations
|
|
- meson: Update libelf wrap for Windows
|
|
|
|
Jan Beich (1):
|
|
|
|
- meson: disable -Werror=thread-safety on FreeBSD
|
|
|
|
Jan Zielinski (3):
|
|
|
|
- gallium/swr: Remove driver source
|
|
- gallium/swr: Remove common code and build options
|
|
- gallium/swr: clean up the documentation after SWR removal from main
|
|
|
|
Jason Ekstrand (158):
|
|
|
|
- vulkan/log: Tweak our handling of a couple error enums
|
|
- vulkan/log: Don't assert on non-client-visible objects
|
|
- vulkan/wsi/win32: Delete the wrapper entrypoints
|
|
- i965: Emit a NULL surface for buffer textures with no buffer
|
|
- nir: Add a nir_foreach_image_variable() iterator
|
|
- i965/uniforms: Handle images as a separate pass
|
|
- amd/llvm/nir: Add support for nir_var_mem_image
|
|
- aco: Add support for nir_var_mem_image
|
|
- glsl/nir_linker: nir_var_mem_image is also a GL uniform
|
|
- nir/gl_nir_lower_samplers_as_deref: Support nir_var_mem_image
|
|
- nir/gl_nir_lower_images: Support nir_var_mem_image
|
|
- st/nir: Assign uniform locations to nir_var_mem_image vars
|
|
- ntt: Separate image and sampler handling
|
|
- llvmpipe: Support image variables living in nir_var_mem_image
|
|
- nir/lower_readonly_images_to_tex: Also rewrite variable modes
|
|
- clover: Insert dummy uniform variables for images
|
|
- clover: Use nir_foreach_image_variable for images
|
|
- zink: Images can live in nir_var_mem_image now
|
|
- ir3: Check for nir_var_mem_image in shared_barrier handling
|
|
- spirv: Assert that OpTypeForwardPointer only points to structs
|
|
- glsl: Use nir_var_mem_image for images
|
|
- ttn: Use nir_var_mem_image
|
|
- st/pbo: Use nir_var_mem_image for images
|
|
- nir: Validate image variable modes
|
|
- aco: Split var_mem_image barrier handling from global/ssbo
|
|
- intel/fs: Stop emitting TGM fences for nir_var_mem_ssbo
|
|
- nir/gl_nir_lower_images: Require nir_var_mem_image
|
|
- ir3: Images are always nir_var_mem_image
|
|
- nir: Drop our attempt at typed-based image mode validation
|
|
- nir: s/nir_var_mem_image/nir_var_image/g
|
|
- nir: Re-arrange the variable modes
|
|
- nir/serialize: Pack deref modes better
|
|
- compiler/types: Combine image and sampler type serialization
|
|
- compiler/types: Unify the guts of get_sampler/image_count
|
|
- compiler/types: Add a texture type
|
|
- nir: Allow texture types
|
|
- nir/deref: Also optimize samplerND -> textureND casts
|
|
- lavapipe: Allow for texture types
|
|
- clover/nir: Don't remove texture variables
|
|
- spirv: Use texture types for sampled images
|
|
- compiler/types: Add a wrap_in_arrays helper
|
|
- anv: Implement VK_EXT_global_priority_query
|
|
- glsl/nir/linker: Also remove image variables
|
|
- vulkan/vk_extensions_gen: Drop support for extra includes
|
|
- vulkan/vk_extensions_gen: Stop including vk_object.h
|
|
- v3dv: Use vk_instance_get_proc_addr_unchecked for WSI
|
|
- lavapipe: Use vk_instance_get_proc_addr_unchecked for WSI
|
|
- vulkan: Rework mako error handling in python generators
|
|
- vulkan: Drop unnecessary [en]coding comments from python generators
|
|
- vulkan/dispatch_table: EntrypointBase doesn't need to derive from object
|
|
- vulkan: Break entrypoint parsing into its own file
|
|
- vulkan: Move trampoline code-gen to its own file
|
|
- vulkan: Move all the common object code to runtime/
|
|
- lavapipe: Don't wrap errors returned from vk_device_init in vk_error
|
|
- intel: Add has_bit6_swizzle to devinfo
|
|
- anv,iris: Advertise a max 3D workgroup size of 1024^3
|
|
- anv/allocator: Add a couple of helpers
|
|
- anv: Pull aperture size from devinfo
|
|
- anv/allocator: Use anv_device_release_bo in anv_block_pool_finish
|
|
- anv: Rename anv_bo::index to exec_obj_index
|
|
- anv: Add get/set_tiling helpers
|
|
- anv: Add a couple more checks in MapMemory
|
|
- anv: Fix FlushMappedMemoryRanges for odd mmap offsets
|
|
- anv: Add helpers in anv_allocator for mapping BOs
|
|
- anv: Always set bindless surface base on SKL+
|
|
- anv: Int64 atomics don't need to depend on softpin
|
|
- anv: Add a anv_use_relocations helper and use it
|
|
- anv: Add a use_relocations physical device bit
|
|
- anv: Stop checking for HAS_EXEC_FENCE
|
|
- anv: Add an anv_bo_is_pinned helper
|
|
- anv: Also disallow CCS_E for multi-LOD images
|
|
- anv: Move device memory maps back to anv_device_memory
|
|
- vulkan: Pull the device lost framework from ANV
|
|
- anv: Use the new common device lost tracking
|
|
- vulkan/device: Add a check_status hook
|
|
- anv: Wire up the new status check
|
|
- vulkan: Add a vk_sync base class
|
|
- vulkan/meson: Re-arrange libvulkan_util deps a bit
|
|
- vulkan/util: Include stdlib.h
|
|
- vulkan/vk_device: Add a drm_fd field
|
|
- vulkan: Add a common vk_drm_syncobj struct
|
|
- c11/threads: Re-align return values for timed waits
|
|
- vulkan: Add an emulated timeline sync type
|
|
- vulkan: Add a common implementation of VkFence
|
|
- vulkan: Add a common implementation of VkSemaphore
|
|
- vulkan: Add common implementations of vkQueueSubmit and vkQueueWaitIdle
|
|
- vulkan/device: Log the timeline mode when lost
|
|
- vulkan/wsi: Drop wsi_common_get_current_time()
|
|
- wsi/display: Rework wsi_fence a bit
|
|
- vulkan/wsi/display: Wrap wsi_display_fence in a vk_sync
|
|
- anv: Delete ANV_SEMAPHORE_TYPE_DUMMY
|
|
- anv: Add a BO sync type
|
|
- anv: Remove the last remnants of in/out fences
|
|
- anv: Use helpers in util/os_time.h in the query code
|
|
- anv: Convert to the common sync and submit framework
|
|
- vulkan: Add an emulated binary vk_sync type
|
|
- anv: Remove unnecessary syncobj wrappers
|
|
- anv: Simplify submit_simple_batch()
|
|
- vulkan,anv: Auto-detect syncobj features
|
|
- vulkan: Add a dummy sync type
|
|
- vulkan/sync: Rework asserts a bit
|
|
- crocus: wm_prog_key::key_alpha_test uses GL enums
|
|
- intel/fs,vec4: Drop uniform compaction and pull constant support
|
|
- intel/fs,vec4: Drop support for shader time
|
|
- intel/blorp: Stop depending on prog_data binding tables
|
|
- intel/fs,vec4: Drop prog_data binding tables
|
|
- intel/compiler: Get rid of wm_prog_key::frag_coord_adds_sample_pos
|
|
- intel/fs: Drop high_quality_derivatives
|
|
- anv: Stop doing too much per-sample shading
|
|
- intel/dev: Add gtt_size to devinfo
|
|
- Revert "anv: Stop doing too much per-sample shading"
|
|
- vulkan/queue: Handle WSI memory signal information
|
|
- anv: Implement vk_device::create_sync_for_memory
|
|
- anv: Drop anv_sync_create_for_bo
|
|
- radeonsi/nir: Check for VARYING_SLOT_PRIMITIVE_ID not SYSTEM_VALUE
|
|
- nir: Add a new sample_pos_or_center system value
|
|
- intel/fs: Return fs_reg directly from builtin setup helpers
|
|
- intel/fs: Rework emit_samplepos_setup()
|
|
- intel/fs: Implement the sample_pos_or_center system value
|
|
- anv,nir: Use sample_pos_or_center in lower_wpos_center
|
|
- anv/pipeline: Get rid of sample_shading_enable
|
|
- intel/fs: Be more conservative in split_virtual_grfs
|
|
- intel/fs: Use OPT() for split_virtual_grfs
|
|
- intel/eu: Don't double-loop as often in brw_set_uip_jip
|
|
- Revert "intel/fs: Do cmod prop again after scheduling"
|
|
- intel/fs: Reset instruction order before re-scheduling
|
|
- intel/fs: Add a NONE scheduling mode
|
|
- vulkan/runtime: Validate instance version on 1.0 implementations
|
|
- anv,radv,v3dv: Move AcquireImageANDROID to common code
|
|
- radv: Move QueueSignalReleaseImageANDROID to common code
|
|
- anv: Use the common QueueSignalReleaseImageANDROID from RADV
|
|
- v3dv: Use the common QueueSignalReleaseImageANDROID from RADV
|
|
- turnip: Use vk_common_AcquireImageANDROID
|
|
- turnip: Use vk_common_QueueSignalReleaseImageANDROID for DRM
|
|
- intel/compiler: Stop using GLuint in brw_compiler.h
|
|
- intel/fs: Use compare_func for wm_prog_key::alpha_test_func
|
|
- spirv,radv: Fix some GL enum comments
|
|
- vulkan/runtime: Implement 1.3 features/properties
|
|
- anv/pass: Don't set first_subpass_layout for stencil-only attachments
|
|
- .mailmap: Switch Jason Ekstrand to @collabora.com
|
|
- anv: Implement 1.3 features/properties
|
|
- anv: Advertise Vulkan 1.3
|
|
- vulkan/wsi: Set MUTABLE_FORMAT_BIT in the prime path
|
|
- vulkan/wsi/drm: Break create_native_image in pieces
|
|
- vulkan/wsi: Add a helper for the configure/create/bind pattern
|
|
- vulkan/wsi/drm: Break create_prime_image in pieces
|
|
- vulkan/wsi/x11: Split image creation
|
|
- vulkan/wsi/wayland: Split image creation
|
|
- vulkan/wsi/display: Split image creation
|
|
- vulkan/wsi/win32: Delete unnecessary copy+paste from DRM
|
|
- vulkan/wsi/win32: Break create_win32_image in pieces
|
|
- vulkan/wsi/drm: Drop wsi_create_native/prime_image
|
|
- wsi/common: Set VK_IMAGE_CREATE_ALIAS_BIT
|
|
- vulkan/wsi: Add image create and bind helpers
|
|
- anv/image: Add some asserts when binding swapchain images
|
|
- anv/image: Call into WSI to create swapchain images
|
|
- anv: Call vk_command_buffer_finish if create fails
|
|
- anv: Don't assume depth/stencil attachments have depth
|
|
|
|
Jason2013 (1):
|
|
|
|
- Fix typo
|
|
|
|
Jesse Natalie (236):
|
|
|
|
- microsoft/clc: Images use nir_var_mem_image
|
|
- util/hash_table: Clear special 0/1 entries for u64 hash table too
|
|
- microsoft/compiler: Use textures for SRVs
|
|
- u_prim_restart: Fix index scanning with start offset
|
|
- d3d12: Don't accumulate timestamp queries
|
|
- u_threaded_context: Support including from C++
|
|
- d3d12: Inherit from threaded_query
|
|
- d3d12: Resources inherit from threaded_resource
|
|
- d3d12: Inherit from threaded_transfer
|
|
- d3d12: Use thread safe slab allocators in transfer_map handling
|
|
- d3d12: Pass explicit context to pre/post draw surface blits
|
|
- d3d12: Hook up threaded context
|
|
- d3d12: Fully init primconvert config
|
|
- d3d12: Fix Linux fence wait return value
|
|
- d3d12: Handle non-infinite wait timeouts > 49.7 days as infinite
|
|
- util/libsync: Fix timeout handling if poll() wakes up early
|
|
- d3d12: Don't wait for \*all* batches when synchronizing a resource
|
|
- d3d12: Don't wait for GPU reads to do CPU reads
|
|
- mesa/main: Fix use of alloca() without #include "c99_alloca.h"
|
|
- clc: Use kernel_arg_type_qual string to add const type qualifier to arg metadata
|
|
- microsoft/clc: Add a test for arg metadata
|
|
- d3d12: Fix incorrect hash table usage
|
|
- meson: Don't override built-in cpp_rtti option, error if it's invalid
|
|
- meson: Allow mismatching RTTI for MSVC
|
|
- android: Add a BOARD CFlags option so build can be customized
|
|
- mesa/main, android: Log errors to logcat
|
|
- android,d3d12: Support using DirectX-Headers dependency from AOSP
|
|
- android: Allow forcing softpipe
|
|
- d3d12: Support BGRA 555 and 565 formats
|
|
- d3d12: Support PIPE_CAP_MIXED_COLOR_DEPTH_BITS
|
|
- d3d12: Support RGBX formats mapped to RGBA
|
|
- microsoft/compiler: Handle GLES external textures
|
|
- gallium, windows: Use HANDLE instead of FD for external objects
|
|
- winsys/d3d12: Populate winsys handle format
|
|
- d3d12: Handle import/export of fd shared handles
|
|
- d3d12: Make format list all use macros
|
|
- d3d12: Generate format table using a macro list
|
|
- d3d12: Generate a pipe format -> typeless mapping table too
|
|
- d3d12: Validate opened D3D12 resource matches pipe template
|
|
- CI/windows: Uprev piglit
|
|
- CI/windows: Upload result.txt as an artifact
|
|
- CI/windows: Move D3D12 test YML to D3D12 driver folder
|
|
- CI/windows: Move SPIRV-to-DXIL test YML to microsoft folder
|
|
- CI/windows: Move reference files to relevant ci subdirectories
|
|
- CI/d3d12: Add a quick_shader run
|
|
- d3d12: Support compat level 330
|
|
- windows: Use TLS context/dispatch with shared-glapi
|
|
- d3d12: Handle depth readback on drivers that require full-resource copies for depth
|
|
- nir: Add an 'external' texture type for parity with samplers
|
|
- d3d12: Force emulation of all YUV formats using per-plane formats
|
|
- d3d12: Handle opening planar resources
|
|
- d3d12: Allow creating planar resources
|
|
- d3d12: Use overall resource format + plane format to get format info
|
|
- microsoft/compiler: Implement inot
|
|
- microsoft/compiler: Remove algebaric pass for inot
|
|
- ci/windows: Remove line numbers from assertions in spirv2dxil tests
|
|
- glapi: Never use dllimport/dllexport for TLS vars on Windows
|
|
- microsoft/compiler: Support lowered io (nir_intrinsic_load_input/store_output)
|
|
- microsoft/compiler: Lower io
|
|
- microsoft/compiler: Delete non-sysval deref load/store code
|
|
- microsoft/compiler: Load synthesized sysvals via lowered io
|
|
- d3d12: Fix NV12 resource importing
|
|
- softpipe: Add a dummy field to sp_fragment_shader_variant_key
|
|
- CI: Trigger Windows build on softpipe changes
|
|
- microsoft/compiler: Emit SSBOs from 0 -> count for GL (non-kernel, non-Vulkan) shaders
|
|
- microsoft/compiler: Hook up uavs-at-every-stage flag
|
|
- microsoft/compiler: Handle write masks in SSBO lowering pass
|
|
- d3d12: Support SSBOs in root signatures
|
|
- d3d12: Always create buffers as UAV-capable
|
|
- d3d12: Support setting SSBOs on the context and turning them into descriptors
|
|
- d3d12: Use DXIL load/store lowering pass
|
|
- d3d12: Set SSBO support caps
|
|
- d3d12: Avoid a debug warning trying to unmap a not-mapped resource
|
|
- d3d12: Replace pipe cap literals with D3D12 defines when available
|
|
- d3d12: Enable cubemap arrays
|
|
- microsoft/compiler: Position should always be no-perspective
|
|
- d3d12: Handle cubemap gather on int cubemaps
|
|
- d3d12: Enable texture gather
|
|
- microsoft/compiler: Fix LOD instruction to return 2 values
|
|
- gallium/aux: Move index offsetting from prim restart to primconvert
|
|
- microsoft/compiler: Change vulkan_environment bool to an enum
|
|
- microsoft/compiler: Put SSBO and image handles in separate arrays
|
|
- microsoft/compiler: Emit GL images in descriptor space 1 with driver_location instead of binding
|
|
- microsoft/compiler: Unify handle retrieval between images and UBO/SSBO
|
|
- microsoft/compiler: Emit SRVs/UAVs as arrays
|
|
- microsoft/compiler: Fix array-of-array handling for derefs of textures/images
|
|
- microsoft/compiler: Handle images as derefs for GL
|
|
- microsoft/compiler: Implement atomic image ops
|
|
- microsoft/compiler: Handle forced early depth
|
|
- microsoft/compiler: Hook up memory/control barriers
|
|
- microsoft/compiler: Fix handling of fp16-in-32bit-val ops to handle high bits
|
|
- d3d12: Shrink 2D array size so that max-layer cube arrays can be created
|
|
- d3d12: Fix format table typeless-ness for A8 and RGBA1010102
|
|
- d3d12: Rename UAV -> SSBO to disambiguate with image UAVs
|
|
- d3d12: Add missed SSBO binding enum value
|
|
- d3d12: Figure out if we can support GL shader images
|
|
- d3d12: Handle format support queries for shader images
|
|
- d3d12: Init null UAVs
|
|
- d3d12: Retrieve shader image dimensions during shader compiles
|
|
- d3d12: Handle images in the root signature
|
|
- d3d12: Handle set_shader_images
|
|
- d3d12: Create textures as UAV-capable when appropriate
|
|
- d3d12: Fill out shader image descriptor tables
|
|
- d3d12: Lower cube images to 2D arrays via existing int cubemap lowering pass
|
|
- d3d12: Handle memory barriers
|
|
- d3d12: Handle bitcasting of shader images
|
|
- d3d12: Set appropriate caps for shader images
|
|
- nir_opt_dead_cf: Remove dead ifs
|
|
- shader_info: tess.spacing needs to be unsigned
|
|
- microsoft/compiler: Move workgroup_size lowering from clc
|
|
- microsoft/compiler: Handle more GL memory barriers
|
|
- d3d12: Limit sampler view count to 32
|
|
- d3d12: Keep state vars last in the per-stage root parameters
|
|
- d3d12: Remove draw_info from selection_context
|
|
- d3d12: Stop trying to set D3D12_DIRTY_SHADER during bindings
|
|
- d3d12: Compile, bind, and cache compute PSOs
|
|
- d3d12: Support compute root signatures
|
|
- d3d12: Hook up compute shader variations
|
|
- d3d12: Implement launch_grid
|
|
- d3d12: Implement num workgroups as a state var
|
|
- d3d12: Handle indirect dispatch
|
|
- d3d12: Run DXIL shared atomic lowering pass
|
|
- d3d12: Enable compute
|
|
- docs: Update d3d12 extension list and new_features.txt
|
|
- d3d12/ci: Skip flaky tex-miplevel-selection and timestamp tests
|
|
- mesa/st: Assert that NIR drivers that support tess use tess levels as inputs
|
|
- d3d12: Export d3d12_get_state_var from d3d12_nir_passes.c
|
|
- d3d12: Fix re-enabling predication after temporary disablement
|
|
- d3d12: Predication fix: re-enable after restarting a batch if needed
|
|
- d3d12: Predication fix: For boolean queries used for predication, D3D12 uses uint64, so clear at least a uint64 in the result
|
|
- d3d12: Declare support for inverted conditional render
|
|
- d3d12: Upgrade first vertex state var into all vertex draw params
|
|
- d3d12: Enable base instance and draw params extensions
|
|
- d3d12: Add a command signature cache for indirect draws
|
|
- d3d12: Handle draw indirect and multi-draw indirect
|
|
- d3d12: Handle indirect twoface draws
|
|
- d3d12: Add a compute transformation to handle indirect draws that need draw params
|
|
- d3d12: Enable draw and multi-draw indirect
|
|
- docs: Update d3d12 features
|
|
- d3d12: When no framebuffer attachments are present, use ForcedSampleCount instead of SampleDesc.Count for MSAA
|
|
- d3d12: When no framebuffer attachments are present, the viewport must be clamped to framebuffer size
|
|
- d3d12: Support ARB_framebuffer_no_attachments
|
|
- docs: Update d3d12 feature list
|
|
- ci/windows: Use 2 container stages
|
|
- microsoft/compiler: Handle variables declared per-sample
|
|
- microsoft/compiler: Handle load_sample_pos_at_id
|
|
- microsoft/compiler: Always have at least one GS active stream
|
|
- microsoft/compiler: Handle 'pull model' explicit interpolation intrinsics
|
|
- microsoft/compiler: Handle textureGatherCmp
|
|
- microsoft/compiler: Handle input coverage
|
|
- microsoft/compiler: Handle tex texture/sampler offset srcs
|
|
- microsoft/compiler: Handle load_invocation_id for GS and HS
|
|
- microsoft/compiler: Emit samplers as array types
|
|
- microsoft/compiler: Handle bitfield_insert
|
|
- microsoft/compiler: Use ibfe/ubfe for bitfield extract instead of lowering to shifts
|
|
- microsoft/compiler: Handle msb/lsb/bfrev
|
|
- microsoft/compiler: Lower helper invocations
|
|
- d3d12: Sample mask output needs to be uint-typed
|
|
- d3d12: Lower load_sample_pos to load_sample_pos_at_id
|
|
- d3d12: Report sample positions
|
|
- d3d12: Modify shaders when MSAA is disabled
|
|
- d3d12: Relax multisampling direct copy requirements
|
|
- d3d12: Temp resources for same-resource copies can be MSAA too
|
|
- d3d12: Report number of GS streams
|
|
- d3d12: Apply GS point sprite lowering to fixed-function point size too
|
|
- d3d12: Run point sprite lowering pass on multi-stream GS when safe
|
|
- d3d12: Support dynamic UBO/SSBO indexing
|
|
- d3d12: When mapping a non-directly-mappable resource for write, readback first
|
|
- d3d12: Set sample-rate shading and GLSL 400 caps
|
|
- docs: Update d3d12 features
|
|
- mesa/st: Lower user clip planes for tess eval too
|
|
- microsoft/compiler: Force integer I/O vars to use flat/constant interpolation
|
|
- microsoft/compiler: Use driver_location instead of location for inter-stage varying index in GL
|
|
- microsoft/compiler: Semantic table should be de-duped for multi-row semantics too
|
|
- microsoft/compiler: Multi-row output semantics need to write multiple never_writes_masks
|
|
- microsoft/compiler: Getting a builtin function with an undeclared signature should be unreachable
|
|
- microsoft/compiler: Add mapping from MESA_SHADER_* to DXIL_*_SHADER for tessellation
|
|
- microsoft/compiler: Fix typo in enum entry
|
|
- microsoft/compiler: Emit statically-indexed resource handles and scratch later
|
|
- microsoft/compiler: Support emitting multiple functions into a DXIL module
|
|
- microsoft/compiler: Emit functions with actual function names
|
|
- microsoft/compiler: Emit all NIR functions into the DXIL module
|
|
- microsoft/compiler: Handle store_per_vertex_output for HS outputs
|
|
- microsoft/compiler: Split hull (tess ctrl) shaders into main and patch constant funcs
|
|
- microsoft/compiler: Delete misleading TODO comments about semantic table
|
|
- microsoft/compiler: Emit HS PSV validation and entrypoint metadata
|
|
- microsoft/compiler: Fix I/O signatures for tess shaders
|
|
- microsoft/compiler: Overlap patch and non-patch varyings so both are separately 0-indexed
|
|
- microsoft/compiler: When sorting patch varyings, adjust location to be in normal varying range
|
|
- microsoft/compiler: Gather patch const signature and handle tess factor in it
|
|
- microsoft/compiler: Add patch constant signature into PSV and as container blob
|
|
- microsoft/compiler: Add a pass for hull and domain shaders to shrink tess level vars
|
|
- microsoft/compiler: For store_output from HS, use storePatchConstant
|
|
- microsoft/compiler: For load_input from DS, use loadPatchConstant
|
|
- microsoft/compiler: Handle load_per_vertex_output as LoadOutputControlPoint
|
|
- microsoft/compiler: Handle load_output in the HS stage as reading a previously written patch constant
|
|
- microsoft/compiler: Handle domain location intrinsic
|
|
- microsoft/compiler: Emit DS PSV validation and entrypoint metadata
|
|
- microsoft/compiler: Primitive ID should only be added as a sysval in geometry shaders
|
|
- microsoft/compiler: Location_frac needs to be included in sort order
|
|
- microsoft/compiler: Handle clip/cull distance as an input to tess shaders
|
|
- d3d12: Enable PIPE_CAP_TGSI_TEXCOORD
|
|
- d3d12: Initial plumbing for tesselation
|
|
- d3d12: Link tesselation control and eval shaders
|
|
- d3d12: Handle patch_vertices and patch topology
|
|
- d3d12: Handle passthrough TCS in the case where eval is bound
|
|
- d3d12: Add a state variable for patch_vertices_in
|
|
- d3d12: Update varying creation logic to handle location_frac
|
|
- d3d12: Handle input clip array size in the shader key
|
|
- d3d12: Set caps for tesselation
|
|
- microsoft/compiler: Fix UAV resource ID counting for static indexed handles
|
|
- d3d12: Fix compute transform for multi-draw indirect with dynamic count + state vars
|
|
- d3d12: Add UAV barriers for UAVs that are being used by compute transforms
|
|
- d3d12: Include SO buffer count as a PSO dirty bit
|
|
- d3d12: Support transform feedback pause/resume
|
|
- d3d12: Move indirect compute to real indirect dispatches
|
|
- d3d12: SO buffer filled size is only 32-bit
|
|
- d3d12: Add a comment for what the existing compute transform does
|
|
- d3d12: Add a couple compute transforms for "fake" SO buffers
|
|
- d3d12: Add a compute transform for draw auto
|
|
- d3d12: Move compute transform state save/restore to compute_transforms.cpp
|
|
- d3d12: Move "fake" SO buffer handling to compute transforms instead of CPU readback
|
|
- d3d12: Implement DrawAuto aka DrawTransformFeedback
|
|
- d3d12: Compute transform UBO0 is actually binding 1
|
|
- d3d12: Rewrite subquery logic
|
|
- d3d12: Switch primitives-generated query to use XFB, GS, and IA data
|
|
- d3d12: ARB_transform_feedback2
|
|
- microsoft/compiler: Correctly support I/O on variables with location_frac
|
|
- microsoft/compiler: Support multiple GS output streams
|
|
- d3d12: Unpack multi-stream varyings
|
|
- d3d12: Fix xfb varying matching for vars with location_frac
|
|
- d3d12: Handle indexed queries
|
|
- d3d12: ARB_transform_feedback3
|
|
- microsoft/compiler: Only prep phis for the current function
|
|
- microsoft/compiler: Only treat tess level location as special if it's a patch constant
|
|
- tc: CPU storage needs to be freed with align_free
|
|
|
|
Jianxun Zhang (7):
|
|
|
|
- intel: provide pci bus and dev info in base device struct
|
|
- intel: use PCI info to compute device uuid
|
|
- anv: remove private pci fields
|
|
- intel: dump PCI info in intel_dev_info
|
|
- intel: remove chipset_id
|
|
- intel: add swizzle flag into driver uuid
|
|
- anv: refactor queue chain
|
|
|
|
Jonathan Gray (6):
|
|
|
|
- util: unbreak non-linux mips64 build
|
|
- util: fix util_cpu_detect_once() build on OpenBSD
|
|
- radv: use MAJOR_IN_SYSMACROS for sysmacros.h include
|
|
- util/u_atomic: fix build on clang archs without 64-bit atomics
|
|
- util: fix build with clang 10 on mips64
|
|
- util: use correct type in sysctl argument
|
|
|
|
Jonathan Marek (3):
|
|
|
|
- freedreno/layout: Fix the UBWC block size for the Y plane
|
|
- turnip: enable UBWC for NV12
|
|
- turnip: use SUBDRAW_SIZE and constant sized tess bos
|
|
|
|
Jordan Crouse (1):
|
|
|
|
- turnip: Update the msm_kgsl.h header with the sanitized 4.19 version
|
|
|
|
Jordan Justen (50):
|
|
|
|
- intel/genxml: Update genxml to support tessellation/geometry distribution
|
|
- intel/dev/test: Assert (verx10 / 10) == ver
|
|
- Revert "iris: Disable I915_FORMAT_MOD_Y_TILED_GEN12* on adl-p/display 13"
|
|
- iris: Use mi_builder in iris_load_indirect_location()
|
|
- intel/genxml/125: Update COMPUTE_WALKER POSTSYNC_DATA struct
|
|
- anv,blorp,iris: Set MOCS for COMPUTE_WALKER post-sync operation
|
|
- intel/dev: Add platform enum with DG2 G10 & G11
|
|
- intel: Add intel_gem_count_engines
|
|
- intel: Add intel_gem_create_context_engines
|
|
- iris: Add iris_init_batches
|
|
- iris/batch: Move kernel context init to iris_init_non_engine_contexts
|
|
- iris/batch: Add exec_flags field
|
|
- iris: Move away from "hw" for some context terminology
|
|
- iris: Destroy all batches with a new iris_destroy_batches() function
|
|
- iris: Make iris_kernel_context_get_priority() public
|
|
- iris/batch: Add support for engines contexts
|
|
- intel/l3: Make DG1 urb-size exception more generic
|
|
- iris: Not all gfx12+ have aux_map_ctx
|
|
- anv: Align buffer VMA to 2MiB for XeHP
|
|
- iris: Align buffer VMA to 2MiB for XeHP
|
|
- nir/lower_tex: Add filter for tex offset lowering
|
|
- intel/compiler: Use nir_lower_tex_options::lower_offset_filter for tg4 on XeHP
|
|
- intel/genxml/12.5: Remove bt-pool enable from 3DSTATE_BINDING_TABLE_POOL_ALLOC
|
|
- anv: Add BINDING_TABLE_POOL_BLOCK_SIZE
|
|
- intel/compiler: Adjust TCS instance-id for dg2+
|
|
- isl: Don't enable HDC:L1 caches on DG2
|
|
- intel: Add device info for DG2
|
|
- intel: Add \*disabled* device ids for DG2
|
|
- intel/devinfo: Adjust L3 banks for DG2
|
|
- iris: Use mi_builder to set 3DPRIM registers for draws
|
|
- iris: Use mi_builder for load/store reg/mem/imm functions
|
|
- intel/dev: Add max_threads_per_psd field to devinfo for gfx8+
|
|
- anv,blorp,crocus,i965,iris: Use devinfo->max_threads_per_psd for gfx8+
|
|
- intel/dev: Add intel_hwconfig_types.h from random post on the internet
|
|
- intel/dev: Add intel_print_hwconfig_table()
|
|
- intel/dev: Print urb size with intel_dev_info
|
|
- intel/dev: Add intel_device_info::apply_hwconfig
|
|
- intel/dev: Set intel_device_info::apply_hwconfig for DG2
|
|
- intel/dev: Apply settings from hwconfig if devinfo::apply_hwconfig is set
|
|
- intel/dev: Recalculate max_cs_threads after applying hwconfig changes
|
|
- intel/gem: Return length from intel_i915_query_alloc
|
|
- intel/dev: Add DG1 PCI id 0x4909
|
|
- intel/dev: Add device ids for ADL-N
|
|
- intel/dev: Add device info for RPL
|
|
- intel/genxml: Extend length of 3DSTATE_WM_HZ_OP for gfx12.5
|
|
- intel/genxml: Extend length of 3DSTATE_DEPTH_BUFFER for gfx12.5
|
|
- isl: Enable compression with Tile4
|
|
- intel/fs: Assert that old pull-const code is not used if devinfo->has_lsc
|
|
- anv: Align GENERAL_STATE_POOL_MIN_ADDRESS to 2MiB
|
|
- anv: Align state pools to 2MiB on XeHP
|
|
|
|
Joshua Ashton (8):
|
|
|
|
- radv: Always inline descriptor writes
|
|
- radv: Split off cmd_buffer variant of descriptor set updates
|
|
- radv: Split off cmd_buffer variant of descriptor set template updates
|
|
- nvc0: Fix uninitialized width/height/depth warning.
|
|
- radv: Refactor S_FIXED to radv_float_to_{s,u}fixed
|
|
- radv: Expose min_lod in \*_make_texture_descriptor
|
|
- vulkan: Update the XML and headers to 1.2.199
|
|
- radv: Implement VK_EXT_image_view_min_lod
|
|
|
|
José Fonseca (4):
|
|
|
|
- d3d10umd: Rename Dxgi.h to DxgiFns.h.
|
|
- d3d10umd: Update for transfer interface changes.
|
|
- d3d10umd: Fix MSVC build.
|
|
- d3d10umd: Update for set_sampler_views take_ownership parameter.
|
|
|
|
Juan A. Suarez Romero (22):
|
|
|
|
- vc4/ci: update expected results
|
|
- v3dv/ci: update expected results
|
|
- broadcom/compiler: handle array of structs in GS/FS inputs
|
|
- broadcom/compiler: apply constant folding on early GS lowering
|
|
- nir: add NIR_DEBUG envvar
|
|
- mesa: allow TEXTURE_BUFFER target for ARB_texture_buffer_range
|
|
- st/pbo: do not use GS for NIR preferred shaders
|
|
- gallium/util: add helper to clamp colors to valid range
|
|
- v3d: clamp clear color
|
|
- gallium: add new PIPE_CAP_IMAGE_STORE_FORMATTED
|
|
- st/pbo: add the image format in the download FS
|
|
- st/pbo: set layer coord for array textures
|
|
- v3d: enable ARB_texture_view
|
|
- tgsi-to-nir: initialize NIR_DEBUG envvar
|
|
- nir: use call_once() to init debug variable
|
|
- broadcom/ci: restructure expected results
|
|
- softpipe: enable PIPE_CAP_IMAGE_STORE_FORMATTED
|
|
- d3d12: enable PIPE_CAP_IMAGE_STORE_FORMATTED
|
|
- mesa/st: do not expose ARB_shader_image_load_store if not fully implemented
|
|
- mesa: fix MAX_GEOMETRY_IMAGE_UNIFORMS check support
|
|
- v3d/doc: do not expose ARB_shader_image_load_store
|
|
- v3d: keep clear color untouched
|
|
|
|
Karol Herbst (5):
|
|
|
|
- spirv: Don't add 0.5 to array indicies for OpImageSampleExplicitLod
|
|
- clover/image: add templated basic_image class to simplify image subclassing
|
|
- clover/format: Full rework on how we declare supported images.
|
|
- clover/formats: pass in cl_mem_flags for better format checking
|
|
- clover/api: fix clGetMemObjectInfo for images
|
|
|
|
Kenneth Graunke (82):
|
|
|
|
- intel: Drop Tigerlake revision 0 workarounds
|
|
- crocus: Replace devinfo->ver[x10] checks with GFX_VER[x10]
|
|
- intel/genxml: Fix Indirect Object Access Upper Bound on Gfx4
|
|
- intel/genxml: Add an "mbz" data type
|
|
- intel/genxml: Drop "Hierarchical Depth Buffer MOCS" field
|
|
- intel/genxml: Change 3DSTATE_CONSTANT_XS::MOCS to be MBZ on Gfx8.
|
|
- isl: Fill in MOCS even for SURFTYPE_NULL surfaces.
|
|
- isl: Fill in MOCS for NULL depth, stencil, and HiZ buffers.
|
|
- blorp: Fill in MOCS even for SURFTYPE_NULL surfaces.
|
|
- blorp: Fill in MOCS for null depth/stencil/HiZ buffers.
|
|
- blorp: Use a non-zero MOCS for disabled constant buffers
|
|
- iris: Drop unnecessary parenthesis
|
|
- iris: Set Bindless Sampler State MOCS
|
|
- iris: Set default MOCS for NULL depth/stencil/HiZ buffers
|
|
- iris: Set MOCS on 3DSTATE_CONSTANT_XS on Gfx9+
|
|
- iris: Set MOCS on 3DSTATE_CONSTANT_ALL packets that disable all buffers
|
|
- iris: Set MOCS on NULL vertex buffers
|
|
- iris: Set MOCS on NULL stream output buffers
|
|
- iris: Fix MOCS for buffer copies
|
|
- anv: Set default MOCS for NULL depth/stencil/HiZ buffers
|
|
- anv: Set MOCS for 3DSTATE_CONSTANT_XS on Gfx7.x as well
|
|
- anv: Set MOCS in 3DSTATE_CONSTANT_XS even if there isn't a buffer.
|
|
- anv: Set MOCS on NULL vertex buffers
|
|
- anv: Set MOCS on NULL stream output buffers
|
|
- crocus: Set MOCS for most state base addresses on pre-Gen8
|
|
- crocus: Tidy the ifdefs for emitting STATE_BASE_ADDRESS
|
|
- crocus: Set MOCS for index buffers on Gen6+
|
|
- crocus: Set MOCS on NULL stream output buffers
|
|
- crocus: Set default MOCS for NULL depth/stencil/HiZ buffers
|
|
- crocus: Set MOCS for push constant buffers where possible
|
|
- crocus: Set MOCS for 3DSTATE_SO_BUFFERS on Gfx7.x too
|
|
- crocus: Fix MOCS for buffer copies.
|
|
- i965: Use ISL for MOCS rather than open coding it everywhere
|
|
- i965: Set default MOCS for NULL depth/stencil/HiZ buffers
|
|
- i965: Set MOCS for push constant buffers on Haswell and Gfx9+
|
|
- i965: Set MOCS on NULL stream output buffers
|
|
- i965: Set MOCS for 3DSTATE_SO_BUFFERS on Gfx7.x too
|
|
- i965: Set MOCS for 3DSTATE_INDEX_BUFFER on Gfx6/7 as well.
|
|
- i965: Fix MOCS for BLORP buffer copies
|
|
- i965: Port STATE_BASE_ADDRESS to genxml and fix bugs
|
|
- i965: Set MOCS for Bindless Surface/Sampler State base addresses
|
|
- intel/genxml: Add an field option for nonzero="true"
|
|
- intel/genxml: Assert that all MOCS fields are non-zero on Gfx7+
|
|
- intel/genxml: Include blitter commands in gen*_pack.h
|
|
- intel/genxml: Allow MI_FLUSH_DW on the blitter
|
|
- intel/genxml: Add XY_BLOCK_COPY_BLT on Tigerlake and later.
|
|
- iris: Fix parameters to iris_copy_region in reallocate_resource_inplace
|
|
- intel/genxml: Simplify prefix handling for field value lists
|
|
- intel/genxml: Collapse leading underscores on prefixed value defines
|
|
- intel/genxml: Fix MI_FLUSH_DW to actually specify the length properly
|
|
- intel/genxml: Fix XY_BLOCK_COPY_BLT destination tiling field type
|
|
- intel/genxml: Decode VALIGN/HALIGN values in XY_BLOCK_COPY_BLT
|
|
- iris: Make a helper function for cross-batch dependency flushing
|
|
- iris: Check for cross-batch flushing whenever a buffer is newly written.
|
|
- iris: Tidy code in iris_use_pinned_bo a bit
|
|
- blorp: Fix compute-blits for rectangles not aligned to the workgroup
|
|
- blorp: Don't try to use the 3D stencil write hardware for compute
|
|
- blorp: Assert that BLORP_BATCH_PREDICATE_ENABLE isn't set for compute
|
|
- blorp: Disallow multisampling for BLORP compute blits and copies.
|
|
- iris: Rename is_render_target to is_dest in a few blit functions
|
|
- isl: Move some genxml surface state helpers into an include file
|
|
- intel/vec4: Use ir_texture_opcode less in emit_texture()
|
|
- intel/vec4: Use nir_texop in emit_texture instead of translating
|
|
- intel/vec4: Inline emit_texture and move helpers to brw_vec4_nir.cpp
|
|
- intel/compiler: Use uppercase enum values in brw_ir_performance.cpp
|
|
- intel/fs: Reuse the same FS input slot for VUE header fields.
|
|
- iris: Use prog_data->inputs rather than shader info in SBE code.
|
|
- iris: Do primitive ID overrides in 3DSTATE_SBE not SBE_SWIZ
|
|
- iris: Directly access BOs rather than using iris_resource_bo(...)
|
|
- intel: Allow copy engine class in intel_gem_create_context_engines()
|
|
- intel/genxml: Add XY_BLOCK_COPY_BLT Color Depth enum values
|
|
- intel/dev: Add a has_flat_ccs flag
|
|
- blorp: Add a blorp_address::local_hint flag
|
|
- isl: Add isl_dev->mocs.blitter_{src,dst} fields
|
|
- blorp: Add support for blorp_copy via XY_BLOCK_COPY_BLT
|
|
- iris: Create an IRIS_BATCH_BLITTER for using the BLT command streamer
|
|
- iris: Only have one blorp_batch_init/finish in iris_copy_region()
|
|
- iris: Set BLORP_BATCH_USE_{COMPUTE,BLITTER} flags for the target batch
|
|
- iris: Implement iris_blorp_exec() for the blitter engine
|
|
- iris: Fix and refactor check for clear color being fully zero
|
|
- iris: Make an iris_foreach_batch macro that skips unsupported batches
|
|
- blorp: Add blorp_measure hooks to the blitter codepaths
|
|
|
|
Khem Raj (1):
|
|
|
|
- v3dv: account for 64bit time_t on 32bit arches
|
|
|
|
Konstantin Seurer (4):
|
|
|
|
- lavapipe: Fixed maxFragmentCombinedOutputResources
|
|
- anv: Fixed maxFragmentCombinedOutputResources
|
|
- turnip: Fixed maxFragmentCombinedOutputResources
|
|
- panvk: Fixed maxFragmentCombinedOutputResources
|
|
|
|
Kostiantyn Lazukin (3):
|
|
|
|
- util/u_trace: Replace Flag with IntEnum to support python3.5
|
|
- util/ra: use adjacency matrix for undirected graph
|
|
- util/ra: Fix numeric overflow during bitset allocation
|
|
|
|
Krunal Patel (2):
|
|
|
|
- frontends/va: use un-padded width/height in ExportSurfaceHandle
|
|
- frontend/va: Setting the size of VADRMPRIMESurfaceDescriptor
|
|
|
|
Leandro Ribeiro (12):
|
|
|
|
- egl: remove unnecessary spaces after types
|
|
- egl/wayland: replace EGL_DRI2_MAX_FORMATS by EGL_DRI2_NUM_FORMATS
|
|
- egl/wayland: deprecate drm_handle_format() and drm_handle_capabilities()
|
|
- egl/wayland: do not try to access memory if allocation failed
|
|
- egl/wayland: move formats and modifiers to a separate struct
|
|
- egl/wayland: remove unused constant EGL_DRI2_NUM_FORMATS
|
|
- loader: add function to get render node from dev_t
|
|
- egl/wayland: add initial dma-buf feedback support
|
|
- egl/wayland: move loader_dri_create_image() calls to separate functions
|
|
- egl/wayland: use surface dma-buf feedback to allocate surface buffers
|
|
- egl/wayland: do not try to bind to wl_drm if not advertised
|
|
- egl/wayland: fix surface dma-buf feedback error exits
|
|
|
|
Lepton Wu (1):
|
|
|
|
- driconf: Fix unhandled tags in static conf
|
|
|
|
Lionel Landwerlin (119):
|
|
|
|
- vulkan/wsi/wayland: don't expose surface formats not fully supported
|
|
- anv: fix push constant lowering with bindless shaders
|
|
- pps: remove counter_ids fields
|
|
- pps: add an intel config file
|
|
- docs: put a list of commands to setup perfetto
|
|
- intel/dev: printout timestamp period
|
|
- intel/pps: provide accurate min sampling period
|
|
- intel/pps: reuse timestamp_frequency from intel_device_info
|
|
- intel/dev: fix HSW GT3 number of subslices in slice1
|
|
- intel/dev: don't forget to set max_eu_per_subslice in generated topology
|
|
- intel/dev: reuse internal functions to set mask
|
|
- intel/dev: fix subslice/eu total computations with some fused configurations
|
|
- intel/perf: fix perf equation subslice mask generation for gfx12+
|
|
- intel/devinfo: use compatible type for ARRAY_SIZE
|
|
- intel/devinfo: fix wrong offset computation
|
|
- intel: remove 2 preproduction pci-id for ADLS
|
|
- intel: move away from booleans to identify platforms
|
|
- intel/dev: also test crocus & i915 pci-ids
|
|
- anv: don't forget to add scratch buffer to BO list
|
|
- anv: fix multiple wait/signal on same binary semaphore
|
|
- anv: don't try to close fd = -1
|
|
- anv: initialize anv_bo_sync base fields
|
|
- intel/fs: fix shader call lowering pass
|
|
- pps: allow drivers to report timestamps in their own time domain
|
|
- intel/perf: add a helper to read timestamp from reports
|
|
- pps: fixup sporadic missing counters
|
|
- intel/ds: drop timestamp correlation code
|
|
- intel/perf: track end timestamp of queries
|
|
- intel/ds: drop unused constructors
|
|
- intel/ds: isolate intel/perf from the pps-producer
|
|
- intel/pps: tweak intel config some more
|
|
- intel/ds: remove verbose messages
|
|
- intel: move timestamp scaling helper to intel/perf
|
|
- anv: fix execbuf syncobjs/syncobj_values array leak
|
|
- util/u_trace: refcount payloads
|
|
- nir/opt_deref: don't try to cast empty structures
|
|
- util/u_trace: add end_of_pipe property to tracepoints
|
|
- util/u_trace/perfetto: add new env variable to enable perfetto
|
|
- intel/nir: preserve access value when duping intrinsic
|
|
- nir/lower_io: include the variable access in the lowered intrinsic
|
|
- nir/print: printout ACCESS_STREAM_CACHE_POLICY
|
|
- nir: add a new access flag to allow access in helper invocations
|
|
- nir: add intrinsics for ray queries
|
|
- nir: track variables representing ray queries
|
|
- nir: add a ray query optimization pass
|
|
- spirv: handle ray query intrinsics
|
|
- intel/debug: reclaim 7 unused bits from classic driver
|
|
- genxml: fix compilation with P/I defines
|
|
- genxml: protect _length defines in genX_bits.h
|
|
- docs/envvars: update after INTEL_DEBUG cleanup
|
|
- util/u_vector: prevent C++ warning on cast from void* to something else
|
|
- vulkan: fix missing handling of WSI memory signal
|
|
- anv,wsi: simplify WSI synchronization
|
|
- gitlab-ci: disable radv-fossils
|
|
- intel/devinfo: adjust subslice array size
|
|
- util/u_trace: protect against reentrant calls
|
|
- anv: don't leave anv_batch fields undefined
|
|
- anv: limit compiler valid color outputs using NIR variables
|
|
- intel/dev: fixup chv workaround
|
|
- ci: disable vs2019 windows build
|
|
- intel/devinfo: printout pixel pipes in info printout
|
|
- intel/devinfo: printout devinfo struct size
|
|
- intel/devinfo: add a helper to check for slice availability
|
|
- intel/devinfo: drop num_eus_per_subslice field
|
|
- anv: fix perf queries
|
|
- intel/dev: extract slice/subslice total computation
|
|
- intel/devinfo: split out l3/pixelpipes counting
|
|
- intel/devinfo: deal with i915 topology query change
|
|
- intel/fs: disable VRS when omask is written
|
|
- blorp: add description & helpers to printout ops
|
|
- isl: add helpers to printout ops
|
|
- anv: expose a couple of emit helper to build utrace buffer copies
|
|
- intel/dev,perf: Use a single timescale function
|
|
- intel/blorp: add measure_end entry point
|
|
- anv: implement u_trace support
|
|
- intel/ds: reuse intel_ioctl()
|
|
- intel/ds: allow user to select metric set at start time
|
|
- intel/ds: don't forget to reset upper dword timestamp read
|
|
- intel/ds: use the right i915_drm.h include location
|
|
- intel/ds: use a per GPU clock ID
|
|
- util/u_process: protect entrypoints for c++
|
|
- anv: add perfetto source
|
|
- pps: enable anv source in example config file
|
|
- tools/pps: limit intel cfg to 250ms of sampling
|
|
- iris: utrace/perfetto support
|
|
- intel/dev: fix ppipe_mask computation
|
|
- anv: fix missing descriptor copy of bufferview/surfacestate content
|
|
- genxml: reduce amount of generated code
|
|
- anv: verify that the format supports multisampling
|
|
- anv: switch a bunch of struct/enum to 1.3 versions
|
|
- relnotes/features: updates for Vulkan 1.3
|
|
- intel/ci: expected failure for 1.3 with older CTS
|
|
- docs: start some documentation on Anv
|
|
- docs/anv: list environment variables
|
|
- docs/anv: add descriptor memory layout
|
|
- anv: tidy long lines in descriptor code
|
|
- docs: update INTEL_DEBUG environment variable documentation
|
|
- intel/tracepoint: simplify tracepoint descriptions
|
|
- util/u_trace: make mako conditional code easier to read
|
|
- util/utrace: make generated code a tiny bit nicer to look at
|
|
- compiler: add VARYING bit for primitive shading rate
|
|
- genxml: gen12.5 changes for CPS
|
|
- genxml: add new 3DSTATE_PS_EXTRA bit
|
|
- intel/dev: details CPS feature support
|
|
- isl: add support for coarse pixel control surfaces
|
|
- isl: disable CPB surface compression
|
|
- nir/builder: add ishl_imm helper
|
|
- intel/compiler: add a new pass to lower shading rate into HW format
|
|
- intel/compiler: add primitive rate output support
|
|
- anv: force primitive shading rate write in last geometry stage
|
|
- anv/pass: rely on precomputed dynamic rendering pass/subpass more
|
|
- anv: Update VK_KHR_fragment_shading_rate for newer HW
|
|
- intel/fs: don't set allow_sample_mask for CS intrinsics
|
|
- intel/nir: fix shader call lowering
|
|
- anv: fix conditional render for vkCmdDrawIndirectByteCountEXT
|
|
- nir: fix lower_memcpy
|
|
- anv/genxml/intel/fs: fix binding shader record entry
|
|
- anv: fix fast clear type value with external images
|
|
- intel/fs: fix total_scratch computation
|
|
|
|
Lorenz Brun (1):
|
|
|
|
- frontends/va: Return error in vaRenderPicture if decoder is NULL
|
|
|
|
Louis-Francis Ratté-Boulianne (7):
|
|
|
|
- microsoft/compiler: add support for load_layer_id
|
|
- microsoft/compiler: Add subpass input types
|
|
- microsoft/compiler: Use SRVs for read-only images
|
|
- microsoft/spirv_to_dxil: check for variables r/w access
|
|
- microsoft/spirv_to_dxil: lower input attachments
|
|
- microsoft/spirv_to_dxil: Lower push constant loads to UBO loads
|
|
- microsoft/spirv_to_dxil: Allow passing a vulkan -> d3d12 binding mapping table
|
|
|
|
Lucas Stach (11):
|
|
|
|
- etnaviv: fix alpha blend with dither on older GPUs
|
|
- etnaviv: initialize vertex attributes on context reset
|
|
- egl/wayland: break double/tripple buffering feedback loops
|
|
- etnaviv: drm: fix size limit in etna_cmd_stream_realloc
|
|
- etnaviv: drm: properly handle reviving BOs via a lookup
|
|
- etnaviv: drm: export BO idle check function
|
|
- etnaviv: drm: rename _etna_bo_del
|
|
- etnaviv: drm: defer destruction of softpin BOs
|
|
- egl/dri2: remove superfluous flush when changing the context
|
|
- egl/dri2: short-circuit dri2_make_current when possible
|
|
- etnaviv: draw: only mark resources as read/written when the state changed
|
|
|
|
M Henning (2):
|
|
|
|
- nouveau/nir: Use natural alignment for scalars
|
|
- nouveau/nir: Lower 64-bit phis
|
|
|
|
Manas Chaudhary (2):
|
|
|
|
- panvk: Add check for null fence
|
|
- panvk: Fix pointer corruption in panvk_add_wait_event_syncobjs
|
|
|
|
Maniraj D (1):
|
|
|
|
- egl: set TSD as NULL after deinit
|
|
|
|
Manuel Stoeckl (2):
|
|
|
|
- gbm: add missing R16 case in gbm_bo_get_bpp
|
|
- gbm: add GBM_FORMAT_GR1616 and RG1616
|
|
|
|
Marcin Ślusarz (41):
|
|
|
|
- iris: fix scratch address patching for TESS_EVAL stage
|
|
- intel: fix INTEL_DEBUG environment variable on 32-bit systems
|
|
- intel/decoder: Dump Task/Mesh shaders
|
|
- spirv: handle SpvOpMemberName
|
|
- intel/compiler: extract brw_nir_load_global_const out of rt code
|
|
- intel/compiler: Get mesh_global_addr from the Inline Parameter for Task/Mesh
|
|
- intel/compiler: Load draw_id from XP0 in Task/Mesh shaders
|
|
- nir: limit lower_clip_cull_distance_arrays input to traditional stages
|
|
- nir/print: simplify printing of IO semantics
|
|
- nir/print: expand printing of io semantics.gs_streams
|
|
- nir/print: compact printing of intrinsic indices
|
|
- nir/print: move print_load_const_instr up
|
|
- nir/print: group hex and float vectors together
|
|
- nir/print: print const value near each use of const ssa variable
|
|
- intel/compiler: disable workaround not applicable to gfx >= 11
|
|
- spirv: handle ViewportMaskNV builtin/cap from SPV_NV_mesh_shader
|
|
- compiler: add new MESH_VIEW_COUNT/MESH_VIEW_INDICES system values
|
|
- spirv: add MeshViewCountNV/MeshViewIndidcesNV builtins from SPV_NV_mesh_shader
|
|
- nir: add load_mesh_view_count and load_mesh_view_indices intrinsics
|
|
- spirv: handle multiview bits of SPV_NV_mesh_shader
|
|
- nir: remove invalid assert affecting per-view variables
|
|
- spirv: mark [Clip|Cull]DistancePerViewNV variables as compact
|
|
- nir: handle per-view clip/cull distances
|
|
- freedreno/rnn: normalize line endings in rules-ng.xsd
|
|
- microsoft/compiler: normalize line endings
|
|
- ci/windows: normalize line endings
|
|
- radv/ci: add line endings exception for files generated with wine
|
|
- Add new rules to .gitattributes
|
|
- intel/compiler: handle gl_[Clip|Cull]Distance in mesh shaders
|
|
- intel/compiler: handle gl_[Clip|Cull]Distance from mesh in fragment shaders
|
|
- intel/compiler: Use Task/Mesh InlineData for the first few push constants
|
|
- anv: Enable conditional rendering in vkCmdDrawMeshTasksNV
|
|
- anv: Add support for non-zero firstTask in vkCmdDrawMeshTasksNV
|
|
- anv: Add support for UBOs, SSBOs and push constants in Mesh pipeline
|
|
- anv: Implement indirect dispatch for Mesh pipeline
|
|
- anv: tell the hardware about gl_[Clip|Cull]Distance in mesh shaders
|
|
- anv: include ClipDistance array in mesh shader per-vertex output
|
|
- anv: Put first few push constants directly into Task/Mesh InlineData
|
|
- intel/compiler: fix array & struct IO lowering in mesh shaders
|
|
- anv: don't set color state when input state was requested
|
|
- intel/compiler: ignore per-primitive attrs when calculating flat input mask
|
|
|
|
Marek Olšák (240):
|
|
|
|
- gallium/util: add some extern "C" guards
|
|
- radeonsi: si_state_shaders.c -> cpp
|
|
- radeonsi: split si_shader_key into ps and ge parts to minimize memcmp overhead
|
|
- radeonsi: don't memcmp inlined uniform values if uniform inlining is disabled
|
|
- radeonsi: don't pass NULL into si_get_nir_shader
|
|
- radeonsi: replace the GS prolog with a monolithic shader variant
|
|
- radeonsi: enable shader culling on Navi1x consumer SKUs as well
|
|
- ac,radeonsi: print a lowercase codename in the renderer string
|
|
- radeonsi: reorder and don't print patch level DRM version in the renderer string
|
|
- ac/llvm: use fmac instead of mul+sub in face culling
|
|
- ac/llvm: add helper ac_build_is_inf_or_nan
|
|
- ac/llvm: accept primitives whose face culling determinant is Inf or NaN
|
|
- gallium,vbo: add PIPE_BIND_VERTEX_STATE for display lists
|
|
- gallium/u_threaded: implement pipelined partial buffer uploads using CPU storage
|
|
- mesa: fix crashes in the no_error path of glUniform
|
|
- radeonsi: add SI_MAX_VRAM_MAP_SIZE definition
|
|
- radeonsi: add an option to use CPU storage uploads for threaded context
|
|
- radeonsi: change bind_history to track usage in each shader stage
|
|
- radeonsi: rebind a buffer only in shader stages where it's been bound
|
|
- radeonsi: don't sync PS or CS before (clear|copy)_buffer based on bind history
|
|
- radeonsi: don't update bind_history for internal buffer clears and copies
|
|
- radeonsi: don't sync before clear_buffer and copy_buffer if the buffer is idle
|
|
- radeonsi: properly destroy buffers on failure
|
|
- winsys/amdgpu: remove force_chaining parameter from cs_check_space
|
|
- winsys/amdgpu: set max_ib_size and max_check_space_size later in cs_check_space
|
|
- radeonsi: don't set inline_uniforms for viewperf because it's enabled by default
|
|
- amd/addrlib: change how the license is formatted to match internal tree
|
|
- amd/addrlib: cosmetic addrlib update
|
|
- mesa: discard draws with count=0 to decrease overhead
|
|
- st/mesa: don't crash when draw indirect buffer has no storage
|
|
- mesa: remove USAGE_ELEMENT_ARRAY_BUFFER because it's unused and adding overhead
|
|
- mesa: move setting USAGE_PIXEL_PACK_BUFFER out of BindBuffer to reduce overhead
|
|
- mesa: remove redundant flagging USAGE_ARRAY_BUFFER
|
|
- mesa: add a no_error path to _mesa_handle_bind_buffer_gen
|
|
- glthread: don't execute display lists if they have no effect
|
|
- glthread: don't sync for glIsEnabled with a few enums
|
|
- glthread: add an option to make glCheckFramebufferStatus a no-op
|
|
- glthread: add a trivial thread-safe way to skip display list execution
|
|
- radeonsi: enable shader culling for indirect draws
|
|
- radeonsi: remove unused parameters in si_emit_draw_packets
|
|
- gallium/radeon: change the BO priority definitions to bits
|
|
- gallium/radeon: remove/merge some BO priorities and remove holes
|
|
- gallium/radeon: remove unused RADEON_DEPENDENCY_START_FENCE
|
|
- gallium/radeon: merge BO read/write usage flags with priority flags
|
|
- winsys/amdgpu: simplify parameter passing and derefs in cs_add_buffer
|
|
- winsys/amdgpu: remove an amdgpu_cs dereference from amdgpu_cs_add_buffer
|
|
- winsys/amdgpu: don't clear RADEON_USAGE_SYNCHRONIZED for last_added_bo_usage
|
|
- winsys/amdgpu: increase the BO hash list size
|
|
- winsys/amdgpu: don't use ip_instance and ring fields of fence and IB structures
|
|
- winsys/amdgpu: move BO fence array updates to the CS thread
|
|
- winsys/amdgpu: optimize looping inefficiencies in add_bo_fence_dependencies
|
|
- radeonsi: don't invoke si_decompress_depth if textures are not dirty at binding
|
|
- mesa: fix locking when destroying/overwriting/adding display lists
|
|
- mesa: remove display list OPCODE_NOP
|
|
- mesa: remove PADDING_64BIT by adding the dlist header into vbo_save_vertex_list
|
|
- vbo: return a GL error earlier in vbo_save_playback_vertex_list_gallium
|
|
- vbo: use int16_t for vbo_save_vertex_list::gallium::private_refcount
|
|
- vbo: restructure vbo_save_vertex_list to get more cache hits
|
|
- mesa: use alloca in search_resource_hash
|
|
- glsl: add gl_resource_name to precompute "name" properties later
|
|
- mesa: don't compute the same strlen up to 3x in _mesa_program_resource_find_name
|
|
- mesa: precompute strlen in gl_resource_name::length and use it
|
|
- mesa: rename locals in _mesa_program_resource_find_name for clarity
|
|
- mesa: preparse [ and [0] in gl_resource_name and use it in shader_query.cpp
|
|
- mesa: handle hash collisions in program resource lookups (e.g. uniforms)
|
|
- mesa: add separate hash tables for each GLSL resource type
|
|
- mesa: skip strlen when hashing strings for ProgramResourceHash
|
|
- radeonsi: print the border color error message only once
|
|
- util: add a util_bitcount variant that selects POPCNT through C++ template arg
|
|
- st/mesa: change st_atom_array.c to cpp
|
|
- st/mesa: use POPCNT in st_update_array if the CPU supports it
|
|
- mesa: change gl_vertex_array_object::NewArrays to bool
|
|
- mesa: add NewVertexBuffers/NewVertexElements flags to indicate state changes
|
|
- cso: add missing parameters into cso_set_vertex_buffers
|
|
- st/mesa: don't update vertex elements when GL doesn't change them
|
|
- driconf: set vblank_mode=0 for viewperf2020
|
|
- gallium/util: fix util_can_blit_via_copy_region with unbound render condition
|
|
- gallium/u_blitter: disable sample shading for all blits
|
|
- gallium/u_blitter: do MSAA copies in 1 pass using sample shading
|
|
- gallium/u_blitter: work around broken sample shading in llvmpipe and zink
|
|
- radeonsi: fix 2 issues with depth_cleared_level_mask
|
|
- radeonsi: add a faster clear path for glClearTexImage
|
|
- radeonsi: rename stencil_cleared_level_mask -> stencil_cleared_level_mask_once
|
|
- radeonsi: allow and finish TC-compatible MSAA HTILE
|
|
- radeonsi: fix a typo preventing a fast depth-stencil clear
|
|
- radeonsi: increase tc_max_cpu_storage_size
|
|
- vbo: utilize structure padding to optimize indirection cold->prims[0].begin
|
|
- driconf: disallow 10-bit pbuffers for viewperf2020/maya due to X errors
|
|
- gallium: rename PIPE_CAPF_MAX_POINT_WIDTH -> MAX_POINT_SIZE
|
|
- gallium: add missing point and line CAPs
|
|
- radeonsi: set correct point and line limits
|
|
- st/mesa: use new point and line CAPs
|
|
- nir: add new SSA instruction scheduler grouping loads into indirection groups
|
|
- radeonsi: enable nir_group_loads for better performance
|
|
- radeonsi: fix shader culling with integer pixel centers
|
|
- radeonsi: fix view culling for wide lines
|
|
- radeonsi: use ac_build_load_to_sgpr in gfx10_emit_ngg_culling_epilogue
|
|
- radeonsi: make si_get_small_prim_cull_info static
|
|
- radeonsi: set PERPENDICULAR_ENDCAP_ENA for wide AA lines
|
|
- radeonsi: set EXTRA_DX_DY_PRECISION for lines where it's supported
|
|
- radeonsi: add si_state_rasterizer::ngg_cull_flags_lines and rename the others
|
|
- ac,radeonsi: cull small lines in the shader using the diamond exit rule
|
|
- radeonsi: unify GFX9_VSGS_NUM_USER_SGPR and GFX9_TESGS_NUM_USER_SGPR
|
|
- radeonsi: add dcc_msaa option to enable DCC for MSAA
|
|
- radeonsi: improve memory instruction tracking
|
|
- radeonsi: remove an incorrect comment at lds_byte0_accept_flag
|
|
- radeonsi: make si_llvm_emit_clipvertex non-static
|
|
- radeonsi: unify how ngg_cull_flags are set
|
|
- radeonsi: cull against clip planes, clipvertex, clip/cull distances in shader
|
|
- radeonsi: inline declare_vs_specific_input_sgprs
|
|
- radeonsi: don't use GS SGPR6 for the small prim cull info
|
|
- glx: add a workaround to glXDestroyWindow for Viewperf2020/Sw
|
|
- radeonsi: separate culling code from VS/TES (to be reused by GS)
|
|
- radeonsi: restructure code that declares merged VS-GS and TES-GS SGPRs
|
|
- radeonsi: add is_gs parameter into si_vs_needs_prolog
|
|
- radeonsi: simplify si_get_vs_key_outputs for GS
|
|
- radeonsi: don't use ctx.stage outside of si_llvm_translate_nir
|
|
- radeonsi: implement shader culling in GS
|
|
- radeonsi: deduplicate min_esverts code in gfx10_ngg_calculate_subgroup_info
|
|
- mesa: don't add attenuation constants if ffvp doesn't use them
|
|
- mesa: add allow_glsl_compat_shaders for shader-db
|
|
- ac/gpu_info: don't fail on amdgpu_query_video_caps_info failures
|
|
- ac/surface: allow gfx6-8 to enter the gfx9 DCC codepath for SI_FORCE_FAMILY
|
|
- mesa: add a more straightforward callback for replacing shaders
|
|
- driconf: enable glthread for all Unigine benchmarks
|
|
- driconf: enable glthread for Minecraft
|
|
- driconf: enable glthread for Basemark GPU
|
|
- radeonsi: don't print uninitialized inlined_uniform_values
|
|
- radeonsi: add wave32 flag into prolog/epilog keys
|
|
- radeonsi: add si_shader::wave_size because it will vary
|
|
- radeonsi: use si_shader::wave_size
|
|
- radeonsi: don't use si_get_wave_size in si_get_ir_cache_key
|
|
- radeonsi: clean up compute_wave_size use in si_compute_blit.c
|
|
- radeonsi: propagate si_shader::wave_size to VGT_SHADER_STAGES
|
|
- radeonsi: don't use compute_wave_size directly
|
|
- radeonsi: centralize wave size computation in si_get_shader_wave_size
|
|
- radeonsi: rename si_get_shader_wave_size and make it non-inline
|
|
- mesa: remove SourceChecksum from shader structures
|
|
- mesa: rename gl_shader::sha1 to disk_cache_sha1
|
|
- mesa: add shader source SHA1s that are propagated up to glCompileShader
|
|
- mesa: add gl_linked_shader::linked_source_sha1
|
|
- mesa: don't compute the same SHA1 twice in glShaderSource
|
|
- util: add SHA1 printing and comparison functions
|
|
- nir: add shader_info::source_sha1, its initialization and printing
|
|
- radeonsi: print source_sha1 as part of shader dumps
|
|
- radeonsi: print the shader stage for shader-db dumps
|
|
- glthread: don't sync for more glGetIntegerv enums for glretrace
|
|
- mesa: use simple_mtx_t for TexMutex (v2)
|
|
- nir: handle more intrinsics in divergence analysis
|
|
- nir: disable a NIR test due to undebuggable & locally unreproducible CI failures
|
|
- nir: serialize divergent fields
|
|
- nir: add nir_has_divergent_loop function
|
|
- glsl: fix setting compiled_source_sha1 without a shader cache
|
|
- radeonsi: add Wave32 heuristics and shader profiles
|
|
- radeonsi: print more stats for shader-db
|
|
- radeonsi: add shader profiles that disable binning
|
|
- radeonsi: unroll loops of up to 128 iterations
|
|
- mesa: include less stuff in dlist.c
|
|
- mesa: inline vbo_initialize_save_dispatch and rename the functions
|
|
- mesa: inline _mesa_install_arrayelt_vtxfmt
|
|
- mesa: inline _mesa_install_eval_vtxfmt
|
|
- mesa: inline _mesa_install_dlist_vtxfmt
|
|
- mesa: move _mesa_initialize_vbo_vtxfmt calls to a common place and inline
|
|
- mesa: inline _vbo_install_exec_vtxfmt
|
|
- mesa: move the ES2 check from vbo_init_tmp.h to install_vtxfmt
|
|
- glapi: rename exec="dynamic" to exec "vtxfmt" to make it self-explanatory
|
|
- mesa: don't set CallList* redundantly in _mesa_initialize_save_table
|
|
- mesa: rename dlist functions to match dispatch function names
|
|
- glapi: rename gl_genexec.py to api_exec_init.py, api_exec.c to api_exec_init.c
|
|
- glapi: move apiexec API condition determination to common code
|
|
- glapi: replace dispatch.h inline functions with macros for faster compilation
|
|
- mesa: add EXT suffix to VertexAttribI*EXT to match glapi name
|
|
- vbo: rename vertex functions to match GL dispatch names
|
|
- vbo: rename ES vertex functions to match GL dispatch names
|
|
- glapi: move reusable glapi printing code to apiexec.py
|
|
- glapi: autogenerate _mesa_initialize_save_table with python
|
|
- glapi: autogenerate api_save.h with save_* function declarations
|
|
- mesa: remove api_exec.h and move its contents into context.h
|
|
- glapi: autogenerate all _mesa_* forward declarations in api_exec_decl.h
|
|
- glapi: autogenerate install_vtxfmt with python
|
|
- mesa: remove GLvertexformat
|
|
- vbo: expose all exec entrypoints for glthread and match api_exec_decl.h names
|
|
- glthread: add nop dispatch
|
|
- glthread: set marshal functions in dispatch only if they exist in the API
|
|
- glthread: inline _mesa_glthread_restore_dispatch and merge disable & destroy
|
|
- glthread: fix restoring the dispatch in destroy when the context is not current
|
|
- glthread: disable glthread if the context is lost
|
|
- mesa: use ctx->GLThread.enabled now that it's correct
|
|
- mesa: rename _ae_ArrayElement -> _mesa_ArrayElement to match glapi
|
|
- mesa: remove COPY_DISPATCH code that doesn't do anything
|
|
- glapi: add missing no_error settings for implemented functions
|
|
- mesa,vbo: make ES wrapper functions static
|
|
- mesa: remove all GL func forward declarations because they are autogenerated
|
|
- mesa: use nop dispatch for ColorTable/Convolution/Histogram
|
|
- radeonsi: add a debug option that disables DCC for all exported buffers
|
|
- driconf: enable glthread for Minecraft-FTB, Stellaris, Battletech
|
|
- ac/gpu_info: set cu_mask correctly for Arcturus
|
|
- ac/gpu_info: add AMD_CU_MASK environment variable to set CU_EN
|
|
- radeonsi: set COMPUTE_DESTINATION_EN_SEn to spi_cu_en
|
|
- radeonsi: program COMPUTE_STATIC_THREAD_MGMT_SE4..7 on Arcturus
|
|
- radeonsi: apply spi_cu_en to CU_EN
|
|
- radv: set COMPUTE_DESTINATION_EN_SEn to spi_cu_en
|
|
- radv: apply spi_cu_en to CU_EN
|
|
- radeonsi: pack si_pm4_state
|
|
- radeonsi: replace SI_PM4_MAX_DW with a max_dw field
|
|
- radeonsi: decrease the size of si_pm4_state::pm4 except for cs_preamble_state
|
|
- amd/registers: work around an assertion in parse_kernel_headers.py
|
|
- ac/llvm: add vindex into ac_build_buffer_store_dword
|
|
- ac/llvm: remove the num_channels parameter from ac_build_buffer_store_dword
|
|
- radeonsi: modifiers can't disable DCC
|
|
- radeonsi: make get_thread_id_in_tg non-static
|
|
- radeonsi: clean up si_export_mrt_color
|
|
- radeonsi: set done=1 for PS exports at the end of si_llvm_build_ps_epilog
|
|
- radeonsi: remove unnecessary code that was used to find the last export
|
|
- radeonsi: export mrtz before color exports
|
|
- radeonsi: simplify compacted_mrt_index in si_export_mrt_color
|
|
- radeonsi: reorder slots for internal buffers, reuse a slot for GS_QUERY_BUF
|
|
- radeonsi: don't bind the ESGS ring twice, handle the difference in the shader
|
|
- radeonsi: remove unused si_shader::prolog2
|
|
- radeonsi: pass sample_coverage VGPR index to the PS prolog instead of guessing
|
|
- radeonsi: move smoothing to the main shader part to remove 1 live VGPR
|
|
- radeonsi: do opt_large_constants & lower_indirect_derefs after uniform inlining
|
|
- radeonsi: use nir->scratch_size instead of ac_count_scratch_private_memory
|
|
- radeonsi: change si_shader_output_values::vertex_stream to a bitmask
|
|
- radeonsi: move si_nir_scan_shader into si_shader_info.c
|
|
- radeonsi: add into the disk cache key whether cached shaders contain LLVM IR
|
|
- radeonsi: move the GS copy shader into shader variants
|
|
- ac: move ac_exp_param.h to ac_nir.h
|
|
- ac/nir: move ac_are_tessfactors_def_in_all_invocs into radeonsi
|
|
- radeonsi: print all streamout info
|
|
- radeonsi: print the number of param exports for shader-db
|
|
- ac/surface: allow displayable DCC with any resolution (e.g. 8K)
|
|
- radeonsi: rename uses_vmem_* flags
|
|
- radeonsi: apply fbfetch/indirect_descriptor to uses_vmem_load_other earlier
|
|
- radeonsi: determine MEM_ORDERED after generating a shader variant
|
|
- winsys/radeon: fix a hang due to introducing spi_cu_en
|
|
- amd: add a workaround for an SQ perf counter bug
|
|
- ac/surface: add more elements to meta equations because HTILE can use them
|
|
- radeonsi: fix register shadowing after the pm4 state size was decreased
|
|
- radeonsi: fix an assertion failure with register shadowing
|
|
|
|
Marius Hillenbrand (3):
|
|
|
|
- util/cpu_detect: Add flag for IBM Z (s390x)
|
|
- llvmpipe: Use lp_build_round_arch on IBM Z (s390x)
|
|
- util/cpu_detect, gallium: use cpu_family CPU_S390X instead of separate flag
|
|
|
|
Markus_included (1):
|
|
|
|
- Fixed you're to your
|
|
|
|
Martin Roukala (né Peres) (1):
|
|
|
|
- radv/ci: mark the dEQP fails related to a missing VKCTS 1.3 as expected
|
|
|
|
Matt Turner (7):
|
|
|
|
- util/format: Add PIPE_FORMAT_Y8_UNORM as an "other" layout format
|
|
- tu: Expose required VK_FORMAT_FEATURE bits for planar YUV formats
|
|
- ir3: Add support for (dis)assembling flat.b
|
|
- freedreno/ir3: Add infrastructure for flat.b
|
|
- freedreno/ir3: Use flat.b to load flat varyings on a6xx
|
|
- freedreno/ir3: Use immediate for flat.b's src1
|
|
- intel/genxml: capitalize decoder mode select properly
|
|
|
|
Matti Hamalainen (3):
|
|
|
|
- aux/trace: print enum names instead of integer values in gallium traces
|
|
- aux/trace: implement missing trace calls
|
|
- aux/trace: cosmetic cleanup
|
|
|
|
Mauro Rossi (3):
|
|
|
|
- android: define cpp_rtti=false because libLLVM is built w/o RTTI
|
|
- Revert "android: define cpp_rtti=false because libLLVM is built w/o RTTI"
|
|
- android: define cpp_rtti=false because libLLVM is built w/o RTTI (v2)
|
|
|
|
Melissa Wen (14):
|
|
|
|
- v3dv: drop unused variable on handle_set_event_cpu_job
|
|
- v3dv: wrap wait semaphores info in v3dv_submit_info_semaphores
|
|
- v3dv: store wait semaphores in event_wait_cpu_job_info
|
|
- drm-uapi/v3d: extend interface for multiple semaphores support
|
|
- v3dv: check multiple semaphores capability
|
|
- v3dv: enable multiple semaphores on cl submission
|
|
- v3dv: enable multiple semaphores for tfu job
|
|
- v3dv: enable multiple semaphores for csd job
|
|
- v3dv: enable GPU jobs to signal multiple semaphores
|
|
- v3dv: track submitted jobs by GPU queue type
|
|
- v3dv: handle wait semaphores in the first job by queue
|
|
- v3dv: process signal semaphores in the very last job
|
|
- v3dv: signal fence when all submitted jobs complete execution
|
|
- v3dv: move sems_info from event_wait job to wait_thread info
|
|
|
|
Michael Tang (2):
|
|
|
|
- microsoft/spirv_to_dxil: turn sysvals into input varyings
|
|
- microsoft/compiler: Use memcpy instead of a union to write dxil_features
|
|
|
|
Michel Dänzer (1):
|
|
|
|
- Revert "wsi/x11: Avoid a class of deadlocks in the WSI queue thread"
|
|
|
|
Michel Zou (8):
|
|
|
|
- vulkan: fix uninitialized variables
|
|
- lavapipe: fix unused variable
|
|
- llvmpipe: Fix Wpointer-to-int-cast
|
|
- meson: check -mtls if has_exe_wrapper
|
|
- meson: correctly detect linker arguments
|
|
- zink: fix -Warray-bounds warning
|
|
- zink: fix unused variable warning
|
|
- vulkan/wsi: drop unused wsi_create_win32_image
|
|
|
|
Mike Blumenkrantz (278):
|
|
|
|
- nir/print: print bindless info as applicable
|
|
- aux/pb: add a tolerance for reclaim failure
|
|
- virgl: remove unused pipebuffer include
|
|
- aux/pb: more correctly check number of reclaims
|
|
- zink: use static array for detecting VK_TIME_DOMAIN_DEVICE_EXT
|
|
- zink: add a read barrier for indirect dispatch
|
|
- zink: fully zero surface creation struct
|
|
- zink: clear descriptor refs on buffer replacement
|
|
- zink: assert compute descriptor key is valid before hashing it
|
|
- zink: don't update lazy descriptor states in hybrid mode
|
|
- zink: move push descriptor updating into lazy-only codepath
|
|
- zink: add an early return for zink_descriptors_update_lazy_masked()
|
|
- zink: move last of lazy descriptor state updating back to lazy-only code
|
|
- lavapipe: stop reading renderpass during pipeline creation
|
|
- lavapipe: remove some unused struct members
|
|
- lavapipe: use framebuffer attachment_count member instead of renderpass
|
|
- lavapipe: remove lvp_subpass_attachment::in_render_loop
|
|
- lavapipe: remove lvp_subpass_attachment::layout
|
|
- lavapipe: add attachment index to lvp_render_pass_attachment
|
|
- lavapipe: remove lvp_subpass::max_sample_count
|
|
- lavapipe: remove lvp_subpass_attachment and use lvp_render_pass_attachment refs
|
|
- lavapipe: remove last VK_ATTACHMENT_UNUSED check
|
|
- lavapipe: store subpass directly to rendering_state
|
|
- lavapipe: simplify some attachment derefs
|
|
- lavapipe: remove lvp_subpass::has_color_att
|
|
- lavapipe: pull layer count from render state during resolve
|
|
- lavapipe: clamp attachment clear rect sizes
|
|
- zink: don't ralloc zink_resource structs
|
|
- zink: align pipe_resource and sampler_view allocations to cachelines
|
|
- zink: use ctx params for program ref/destroy functions
|
|
- zink: fix descriptor interface param for program_deinit
|
|
- zink: remove descriptor pools from hash table on deletion
|
|
- zink: unref descriptor pools in hybrid mode when they explode
|
|
- zink: always invalidate descriptor sets on pool free
|
|
- zink: don't clear descriptor pool cache on context destroy
|
|
- zink: stop leaking descriptor pool references
|
|
- zink: rescue surfaces/bufferviews for cache hits during deletion
|
|
- zink: rename zink_descriptor_layout_key::num_descriptors -> num_bindings
|
|
- zink: eliminate a hole in zink_descriptor_layout_key
|
|
- zink: reduce hashed region of zink_descriptor_layout_key
|
|
- zink: split out descriptor pool sizing into separate struct
|
|
- zink: detect prim type more accurately for tess/gs lines
|
|
- zink: don't break early when applying fb clears
|
|
- zink: only reset zink_resource::so_valid on buffer rebind
|
|
- zink: don't check rebind count outside of buffer/image rebind function
|
|
- zink: add notes about binding points which aren't counted in util funcs
|
|
- zink: stop exporting PIPE_SHADER_CAP_FP16_DERIVATIVES
|
|
- zink: don't add dynamic vertex pipeline states if no attribs are used
|
|
- zink: fix gl_SampleMaskIn spirv generation
|
|
- zink: more accurately update samplemask for fs shader keys
|
|
- nir/lower_samplers_as_deref: rewrite more image intrinsics
|
|
- nir/lower_io_to_scalar: add support for bo and shared io
|
|
- zink: run optimize_nir() only once during compile
|
|
- zink: move bo load offset adjustment to compiler passes
|
|
- zink: run lower_io_to_scalar before rewriting bo access
|
|
- zink: move all 64-32bit shader load rewriting to nir pass
|
|
- zink: move ssbo store offset adjustment to compiler passes
|
|
- zink: move shared intrinsic offset adjustments to compiler passes
|
|
- zink: move all 64-32bit shader store rewriting to nir pass
|
|
- zink: add more glsl base types to get_glsl_basetype()
|
|
- zink: move all shader bo/sharedmem access to compiler passes
|
|
- zink: add better handling for CUBE_COMPATIBLE bit
|
|
- zink: cache bo SpvId array types
|
|
- zink: use align64 for allocation sizes
|
|
- zink: set aspectMask for renderpass2 VkAttachmentReference2 structs
|
|
- zink: always use explicit lod for texture() when legal in non-fragment stages
|
|
- zink: be more permissive for injecting LOD into texture() instructions
|
|
- zink: inject LOD for sampler version of OpImageQuerySize
|
|
- zink: flag renderpass change when toggling fbfetch
|
|
- zink: be more spec-compliant for unnormalizedCoordinates samplers
|
|
- zink: don't clamp cube array surfacess to cubes
|
|
- zink: don't clamp 2D_ARRAY surfaces to 2D
|
|
- zink: error when trying to allocate a bo larger than heap size
|
|
- zink: clamp max buffer sizes to smallest buffer heap size
|
|
- zink: explicitly enable VK_EXT_shader_subgroup_ballot
|
|
- zink: add more int/float types to cast switching in ntv
|
|
- zink: force float dest types on some alu results
|
|
- zink: stop double printing validation messages
|
|
- zink: add SpvCapabilityStorageImageMultisample for multisampled storage images
|
|
- zink: reject all storage multisampling if the feature is unsupported
|
|
- gallium: add PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY
|
|
- zink: set PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY
|
|
- zink: add queue locking
|
|
- build: add sha1_h to llvmpipe build
|
|
- mesa/st: rework psiz lowering
|
|
- mesa/st: lower psiz for shader precompile
|
|
- anv: disable debug logging spam
|
|
- zink: set fbfetch state on lazy batch data when enabling it
|
|
- zink: always use lazy (non-push) updating for fbfetch descriptors
|
|
- vk: update headers for 1.2.197
|
|
- lavapipe: fix cmd queuing for dynamic render
|
|
- lavapipe: VK_KHR_dynamic_rendering
|
|
- features: add dynamic render for lavapipe
|
|
- zink: do a better job conserving locations for packed xfb outputs
|
|
- zink: clamp PIPE_SHADER_CAP_MAX_INPUTS for xfb
|
|
- zink: remove lazy ci job
|
|
- zink: add khr46 to ci
|
|
- aux/primconvert: handle singular incomplete restarts
|
|
- zink: rework cached fbfetch descriptor fallback
|
|
- lavapipe: add some asserts for descriptor dynamic offsets
|
|
- gallivm: handle TGSI SampleId sysval
|
|
- Revert "gallium/u_blitter: work around broken sample shading in llvmpipe and zink"
|
|
- aux/trace: fix PIPE_QUERY_PIPELINE_STATISTICS_SINGLE tracing
|
|
- aux/trace: support pipe_context::get_query_result_resource
|
|
- aux/trace: trace pipe_screen::is_format_supported better
|
|
- aux/trace: fix vertex state tracing
|
|
- aux/trace: add pipe_context::render_condition_mem
|
|
- zink: set new point/line caps
|
|
- zink: update radv ci passes
|
|
- zink: flatten out draw templates a bit
|
|
- zink: declare int/float size caps inline with type usage
|
|
- zink: simplify 64bit vertex attrib lowering
|
|
- zink: add another compiler pass to convert 64bit vertex attribs
|
|
- zink: add 1DShadow sampler handling for drivers (radv) that don't support it
|
|
- zink: radv ci updates for 1dshadow stuff
|
|
- zink: implement cs uniform inlining
|
|
- zink: always inline uniforms when running on a cpu driver
|
|
- zink: ci updates
|
|
- zink: be more consistent about applying module hash for gfx pipeline
|
|
- zink: update gfx pipeline shader module pointer even if the program is unchanged
|
|
- aux/primconvert: break out primconvert internals into util function
|
|
- aux/primconvert: support pipe_context::draw_vertex_state
|
|
- zink: stop running discard_if in generated tcs
|
|
- zink: always add VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT for 3D images
|
|
- move util/indices to core util
|
|
- mesa: convert unsupported primtypes during display list compilation
|
|
- zink: block suballocator caching for swapchain/dmabuf images
|
|
- zink: set suballocator bo size to aligned allocation size
|
|
- zink: implement pipe_context::draw_vertex_state
|
|
- zink: stop using VK_IMAGE_LAYOUT_PREINITIALIZED for dmabuf
|
|
- zink: always set matching resource export type for dmabuf creation
|
|
- dri2: set dimensions on dmabuf import planes
|
|
- zink: implement multiplanar modifier handling
|
|
- zink: add error for bo allocation failure
|
|
- zink: always use slab allocation placement for domains
|
|
- zink: set zink_resource_object::host_visible based on actual bo placement
|
|
- zink: demote BAR allocations to device-local on oom
|
|
- zink: use IMMUTABLE for dummy xfb buffer
|
|
- zink: fix memory availability reporting
|
|
- zink: fail context creation more gracefully
|
|
- zink: clamp to 500 max batch states on nvidia
|
|
- gallium: add pipe_screen::is_compute_copy_faster hook
|
|
- gallium: rename PIPE_CAP_PREFER_BLIT_BASED_TEXTURE_TRANSFER
|
|
- mesa/st: make sampler_type_for_target public
|
|
- mesa/st: make some pbo functions public
|
|
- gallium: implement compute pbo download
|
|
- aux/pb: add a new slab alloc function for reclaiming all bo objects
|
|
- zink: use pb_slab_alloc_reclaimed(reclaim_all) for BAR heap sometimes
|
|
- zink: enable PIPE_TEXTURE_TRANSFER_COMPUTE on non-cpu drivers
|
|
- zink: explicitly init glsl
|
|
- zink: add a compiler pass to scan for shader image use
|
|
- zink: set batch state queue on creation
|
|
- zink: be consistent about waiting on context queue on context destroy
|
|
- mesa/vbo: be more comprehensive for degenerate primitive conversion in dlists
|
|
- radv: fix xfb query copy param ordering
|
|
- zink: add missing assert for 8bit vertex decompose
|
|
- zink: add some wsi instance extensions
|
|
- zink: always unset vertex shader variant key data when changing last vertex stage
|
|
- zink: add extra synchronization for buffer descriptor binds
|
|
- zink: use device-local heap for sparse backing allocations
|
|
- zink: add a better threshold for clamping query pool resets on suspend
|
|
- zink: always set number of timestamp results to 1 for internal qbo
|
|
- zink: fix availability buffer sizing/copying for xfb queries
|
|
- zink: skip readback of qbos with no results
|
|
- Revert "zink: when performing an implicit reset, sync qbos"
|
|
- zink: use even more accurate stride values for query result copies
|
|
- zink: ci updates
|
|
- zink: add flake
|
|
- aux/trace: copy over stream_output_target_offset method from context
|
|
- util/vbuf: fix buffer translation sizing
|
|
- zink: remove SpvMemorySemanticsMakeVisibleMask from nir_intrinsic_memory_barrier
|
|
- zink: add some nv ci results
|
|
- aux/trace: add pipe_context::fence_server_signal tracing
|
|
- zink: update gfx_pipeline_state.vertex_strides when necessary
|
|
- zink: check EXT_image_drm_format_modifier for dmabuf support
|
|
- Revert "zink: update gfx_pipeline_state.vertex_strides when necessary"
|
|
- zink: add VK_KHR_external_semaphore_fd to device exts
|
|
- zink: add VK_KHR_external_memory_capabilities to instance exts
|
|
- zink: add driver/device uuid screen hooks
|
|
- zink: implement GL semaphores
|
|
- zink: implement external memory object resource handling
|
|
- zink: enable EXT_external_objects pipe caps
|
|
- util: add ptralloc
|
|
- lavapipe: replace hard pointer calcs in dynamic render with ptralloc
|
|
- lavapipe: replace hard pointer calcs in push descriptors with ptralloc
|
|
- zink: update nv ci baseline
|
|
- zink: add deqp ci baseline for nv
|
|
- lavapipe: fix ptralloc typo
|
|
- zink: add get_sparse_texture_virtual_page_size hook
|
|
- zink: set up image create bits for sparse textures
|
|
- zink: support sparse texture range commits
|
|
- zink: allow sparse buffers to be suballocated
|
|
- zink: stop allocating such massive staging regions for buffer maps
|
|
- zink: fake sparse R9G9B9E5 support as needed
|
|
- zink: implement sparse shader instructions in ntv
|
|
- zink: always set actual_dest_type for ntv tex instruction emission
|
|
- zink: handle residency return value from sparse texture instructions
|
|
- zink: rename zink_so_info -> zink_shader_info
|
|
- zink: always pass shader info to ntv
|
|
- zink: add nir_intrinsic_image_deref_sparse_load to image scanning in compiler
|
|
- zink: only allocate ntv residency info if it will be used
|
|
- zink: emit sparse residency cap in ntv
|
|
- zink: handle min_lod texture operands
|
|
- zink: batch sparse texture binds
|
|
- zink: handle sparse texture miptail commits
|
|
- zink: enable ARB_sparse_texture pipe caps
|
|
- zink: ARB_sparse_texture2
|
|
- docs: add features/relnotes for zink sparse texture support
|
|
- zink: fix non-modifer dmabuf usage
|
|
- zink: link with vulkan utils
|
|
- zink: hook up planar image format creation
|
|
- vulkan/wsi: add VK_IMAGE_USAGE_INPUT_ATTACHMENT_BIT for swapchain image caps
|
|
- lavapipe: remove unused struct member
|
|
- zink: update nv fails
|
|
- zink: update radv fails list
|
|
- zink: never use SpvOpImageQuerySizeLod for texel buffers
|
|
- zink: add anv (icl) fails
|
|
- zink: handle bogus xfb draws
|
|
- zink: reorder fbfetch flag-setting to avoid null deref
|
|
- zink: return 256 for PIPE_CAP_MIN_MAP_BUFFER_ALIGNMENT
|
|
- anv: silence wsi debug logging
|
|
- zink: unify some context casts in zink_create_sampler_view
|
|
- zink: emit same number of timeline signals as semaphore signals
|
|
- zink: flag has_work when a GL semaphore is signalled
|
|
- zink: allow resource creation without VK_FORMAT_FEATURE_COLOR_ATTACHMENT_BIT
|
|
- zink: use VkImageViewUsageCreateInfo to remove attachment bits
|
|
- zink: always create a separate VkBuffer for storage use
|
|
- zink: use the storage buffer for bufferview creation when format allows
|
|
- zink: flag all buffer resources with PIPE_BIND_SHADER_IMAGE
|
|
- zink: simplify buffer case for zink_resource_object_init_storage()
|
|
- zink: remove tmp buffer rebinds
|
|
- aux/tc: add tc_buffer_write to replace pipe_buffer_write usage
|
|
- zink: fix vertex buffer mask computation for null buffers
|
|
- zink: replace qbo pipe_buffer_write usage with tc_buffer_write
|
|
- zink: replace other pipe_buffer_write usage with pipe_buffer_write_nooverlap
|
|
- zink: make pipe_buffer_write usage trigger compiler errors
|
|
- zink: ci updates
|
|
- zink: clamp tbo creation to maxTexelBufferElements
|
|
- zink: add vertex shader pipeline bit for generated barrier construction
|
|
- zink: print an error when the device is lost
|
|
- zink: fix waiting on current batch id
|
|
- zink: handle swizzled offset/count values for shader bitfield ops
|
|
- zink: enable VK_EXT_shader_atomic_float
|
|
- zink: add warning printf for drivers missing VK_EXT_shader_atomic_float
|
|
- zink: cast image atomic op params/results based on image type
|
|
- zink: use SpvScopeDevice over SpvScopeWorkgroup for atomic shader ops
|
|
- aux/vbuf: use local var for modifying unaligned_vb_mask during update
|
|
- aux/vbuf: move mask-clearing for vbuf updates after buffer scanning
|
|
- aux/vbuf: add fastpath for skipping identical vbuf updates
|
|
- mesa: stop truncating MESA_GLSL=dump
|
|
- zink: ARB_sparse_texture_clamp
|
|
- docs: update features/relnotes for zink sparse texture clamp
|
|
- zink: disable PIPE_SHADER_CAP_FP16_CONST_BUFFERS
|
|
- llvmpipe: disable PIPE_SHADER_CAP_FP16_CONST_BUFFERS
|
|
- zink: add VK_BUFFER_USAGE_CONDITIONAL_RENDERING_BIT_EXT for query binds
|
|
- zink: use scanout obj when returning resource param info
|
|
- zink: fix PIPE_CAP_TGSI_BALLOT export conditional
|
|
- zink: reject invalid draws
|
|
- zink: min/max blit region in coverage functions
|
|
- aux/draw: fix llvm tcs lane vec generation
|
|
- lavapipe: use util_pack_color_union() for generating clear colors
|
|
- zink: ci updates
|
|
- lavapipe: ci updates
|
|
- zink: always set VkPipelineMultisampleStateCreateInfo::pSampleMask
|
|
- zink: always invalidate streamout counter buffer if not resuming
|
|
- zink: ci updates
|
|
- zink: fix cached descriptor set invalidation for array bindings
|
|
- zink: free push descriptor pools on deinit
|
|
- zink: don't free non-fbfetch dsl structs when switching to fbfetch
|
|
- gallivm: avoid division by zero when computing cube face
|
|
- zink: always update shader variants when rebinding a gfx program
|
|
- zink: use a fence for pipeline cache update jobs
|
|
- zink: wait on program cache fences before destroying programs
|
|
- zink: fix descriptor cache pointer array allocation
|
|
- zink: mark fbfetch push sets as non-cached
|
|
- zink: stop leaking descriptor sets
|
|
- zink: invalidate non-punted recycled descriptor sets that are not valid
|
|
- zink: set shader key size to 0 for non-generated tcs
|
|
|
|
Mike Lothian (1):
|
|
|
|
- meson: Fix dri.pc dridriverdir
|
|
|
|
Mykhailo Skorokhodov (3):
|
|
|
|
- iris: Add missed tile flush flag
|
|
- Revert "iris: add tile cache flush to iris_copy_region"
|
|
- nir: Fix read depth for predecessors
|
|
|
|
Nanley Chery (82):
|
|
|
|
- iris: Convert some mod_info checks to asserts
|
|
- iris: Disable the MC_CCS modifier with norbc
|
|
- iris: Set DISABLE_AUX_BIT for AUX_USAGE_NONE modifiers
|
|
- iris: Refactor the assignment to possible_usages
|
|
- intel/isl: Allow creating non-Y-tiled ASTC surfaces
|
|
- iris: Allow GPU-based uploads of ASTC textures
|
|
- anv: Require transfer features for transfer usages
|
|
- anv: Allow transfer-only linear ASTC images
|
|
- anv: Drop code from get_blorp_surf_for_anv_buffer
|
|
- intel/isl: Unify fmt checks in isl_surf_supports_ccs
|
|
- iris: Enable CCS_E on 32-bpc float formats on TGL+
|
|
- iris: Drop redundant iris_resource_disable_aux call
|
|
- intel/isl: Restore CCS_E support for YUYV and UYVY
|
|
- iris: Drop the YCRCB cases in finish_aux_import
|
|
- intel/isl: Drop extra devinfo checks for CCS support
|
|
- intel/isl: Require aux map for some 64K alignment
|
|
- intel/blorp: Modify the SKL+ CCS resolve rectangle
|
|
- intel/blorp: Modify get_fast_clear_rect for XeHP
|
|
- iris: Change a param of iris_resource_init_aux_buf
|
|
- iris: Use the aux BO and surf less during init
|
|
- iris: Move some BO setup to iris_resource_init_aux_buf
|
|
- iris: Simplify iris_get_aux_clear_color_state_size
|
|
- iris: Don't allocate a clear color BO for some Z/S
|
|
- iris: Drop row pitch param from iris_get_ccs_surf
|
|
- iris: Don't allocate and initialize CCS on XeHP
|
|
- iris: Don't assert a NULL aux BO during aux config
|
|
- iris: Modify the comment about zeroing CCS
|
|
- iris: Update the initial CCS state on XeHP
|
|
- iris: Free the local cache bucket in bufmgr_destroy
|
|
- iris: Replace bo->real.local with bo->real.heap
|
|
- iris: Add and use flags_to_heap
|
|
- iris: Use a num_buckets pointer in add_bucket
|
|
- iris: Replace "local" with "heap" in bufmgr fn params
|
|
- iris: Add and use BUCKET_ARRAY_SIZE
|
|
- iris: Add and use bucket_info_for_heap
|
|
- iris: Rework the DEVICE_LOCAL heap
|
|
- iris: Disable the SMEM fallback for CCS on XeHP
|
|
- iris: Use util packing fns in convert_clear_color
|
|
- blorp: Drop multisampled code in blorp_can_hiz_clear_depth
|
|
- intel/isl: Rework HiZ image align calculations
|
|
- intel/isl: Update comment for the XeHP HiZ block
|
|
- intel/isl: Use a new HiZ format on XeHP+
|
|
- intel/isl: Require Y-tiling for depth on gfx4-5
|
|
- intel/isl: Allow HiZ with Tile4/64 surfaces
|
|
- intel/isl: Return false more in isl_surf_get_hiz_surf
|
|
- anv,iris: Flush tile cache after color fast clears
|
|
- anv,iris: Depth stall around color fast clears
|
|
- intel: Rename the PSD bit in PIPE_CONTROL for XeHP
|
|
- anv,iris: PSS Stall Sync around color fast clears
|
|
- anv,iris: Flush HDC before color fast clears
|
|
- anv: Don't fill lowered_storage_image_param on SKL+
|
|
- intel/isl: Don't check pitch in isl_surf_get_mcs_surf
|
|
- intel/isl: Strengthen MCS SINT format restriction
|
|
- Revert "intel/isl: Don't reconfigure aux surfaces for MCS"
|
|
- intel/gen125.xml: Increase Auxiliary Surface Pitch
|
|
- intel/isl: Allow creating MCS in Tile4 memory
|
|
- anv: Drop assert against modifier with aux on gfx12
|
|
- anv: Disable CCS_E for some 8/16bpp copies on TGL+
|
|
- anv: Use ANV_FAST_CLEAR_DEFAULT_VALUE for CCS on TGL+
|
|
- anv: Re-enable CCS_E on TGL+
|
|
- anv: Drop redundant disabling of non-renderable CCS
|
|
- anv: Disable the SMEM fallback for local memory
|
|
- anv: Require the local heap for CCS on XeHP
|
|
- anv: Don't allocate VMA for CCS on XeHP
|
|
- isl: Enable compression with multisampled Tile64
|
|
- iris: Explicitly rely on gallium fallbacks for YUV
|
|
- intel/isl: Support YUV pipe-to-isl format mapping
|
|
- iris: Drop stale media compression import code
|
|
- iris: Use iris_format_for_usage in map_aux_addresses
|
|
- intel: Rename a RenderCompressionFormat field
|
|
- intel/isl: Support the XeHP media compression format
|
|
- iris: Support the XeHP media compression format
|
|
- iris: Refactor a ternary in iris_resource_get_param
|
|
- iris: Pick the right BO in iris_resource_get_param
|
|
- iris: Return non-zero stride for clear color plane
|
|
- intel/isl: Simplify Z-buffer tiling config during emit
|
|
- intel/isl: Fix depth buffer TiledSurface programming
|
|
- intel/isl: Add more PRM text for HiZ/STC requirement
|
|
- iris: Don't fast clear with the view format
|
|
- Revert "anv: Require the local heap for CCS on XeHP"
|
|
- anv: Refactor anv_image_init_from_create_info
|
|
- anv: Disable aux if the explicit modifier lacks it
|
|
|
|
Neha Bhende (11):
|
|
|
|
- st: Fix 64-bit vertex attrib index for TGSI path
|
|
- st: Fix comments in commit be6d584de43966e
|
|
- svga: Add GL43 commands support
|
|
- svga: Add utility to check for GL43 support
|
|
- tgsi: Add hw_atomic_declared in tgsi_info
|
|
- svga: Add support for compute shader, shader buffers and image views
|
|
- svga: shader translation for compute, image views and shader buffers
|
|
- svga: add GL43 resource validation at draw time
|
|
- svga: enable GL43 on SVGA GL43 capable device
|
|
- svga: enable PIPE_CAP_IMAGE_STORE_FORMATTED on gl43 capable device
|
|
- svga: store shared_mem_size in svga_compute_shader instead of svga_context
|
|
|
|
Neil Roberts (1):
|
|
|
|
- v3d: Update prim_counts when prims generated query in flight without TF
|
|
|
|
Nicholas Bishop (1):
|
|
|
|
- mesa/get: allow NV_pixel_buffer_object constants in GLES2
|
|
|
|
Niklas Haas (1):
|
|
|
|
- wsi/x11: support depth 30 visuals
|
|
|
|
Omar Akkila (1):
|
|
|
|
- llvmpipe: page-align memory allocations
|
|
|
|
Paulo Zanoni (12):
|
|
|
|
- iris: also dump bo's imported and exported flags
|
|
- iris: destroy our mutexes a little later
|
|
- iris: fix off-by-one error when clearing stale syncobjs
|
|
- iris: call brw_process_intel_debug_variable() earlier
|
|
- iris: extract iris_hw_context_set_unrecoverable()
|
|
- intel/fs: Assert the GPU supports 64bit ops if present at lower_scoreboard time.
|
|
- iris: improve error checking in functions that call vma_alloc()
|
|
- iris: sprinkle some assertions for bufmgr->lock
|
|
- iris: save some iris_syncobj_reference() calls at update_bo_syncobjs()
|
|
- iris: implement inter-context busy-tracking
|
|
- iris: handle IRIS_MEMZONE_BINDER with a real vma_heap like the others
|
|
- iris: fix register spilling on compute shaders on XeHP
|
|
|
|
Pavel Asyutchenko (6):
|
|
|
|
- llvmpipe: fix wrong assumption on FB fetch shader opacity
|
|
- llvmpipe: fix gl_FragColor and gl_LastFragData[0] combination
|
|
- llvmpipe: protect from doing FB fetch of missing buffers
|
|
- llvmpipe: fix FB fetch with non 32-bit render target formats
|
|
- llvmpipe: remove dead args from load_unswizzled_block
|
|
- llvmpipe: enable PIPE_CAP_FBFETCH_COHERENT
|
|
|
|
Pavel Ondračka (9):
|
|
|
|
- r300: Document the RADEON_DEBUG options
|
|
- r300: Replace RADEON_NO_TCL with RADEON_DEBUG=notcl
|
|
- r300: Remove broken optimization in rc_transform_KILL
|
|
- r300: use point sprite coordinates only when drawing points (v5)
|
|
- r300: fix translate_LRP
|
|
- r300: properly initialize new_vs in r300_draw_init_vertex_shader
|
|
- r300: Disable integers and indirect temporary addressing with swctl
|
|
- r300: Set consistent PIPE_SHADER_CAP_PREFERRED_IR
|
|
- r300: fix transformation of abs modifiers with negate
|
|
|
|
Philipp Zabel (1):
|
|
|
|
- etnaviv: fix emit_if in case the else block ends in a jump
|
|
|
|
Pierre Moreau (1):
|
|
|
|
- nv50/nir: Switch to the common NIR options
|
|
|
|
Pierre-Eric Pelloux-Prayer (57):
|
|
|
|
- radeonsi: use viewport offset in quant_mode determination
|
|
- gallium/dri: replace bool with flag parameter
|
|
- gallium/dri: add createImageFromFds2
|
|
- gallium/dri: let the driver know if the imported image is DRI_PRIME buffer
|
|
- radeon_winsys.h: add a parameter to buffer_from_handle
|
|
- winsys/amdgpu: add uncached flag to the imported DRI_PRIME buffer
|
|
- radeonsi/gfx10.3: enable SDMA for DRI_PRIME copies
|
|
- radeonsi: use PIPE_BIND_DRI_PRIME instead of is_dri_blit_image
|
|
- Revert "gallium: add a is_dri_blit_image bool to pipe_blit_info"
|
|
- glsl/drirc: add an option for gl_ClipVertex / gl_CullDistance checks
|
|
- drirc: enable do_dce_before_clip_cull_analysis for ANSA
|
|
- mesa: don't reset SamplersValidated if nothing changed
|
|
- glsl/nir: mark samplers inside a block as bindless
|
|
- radeonsi: treat nir_intrinsic_load_constant as a VMEM operation
|
|
- ac/surface: use a less strict condition in is_dcc_supported_by_L2
|
|
- radeonsi/sdma: fix bogus assert
|
|
- radeonsi: fix ps_uses_fbfetch value
|
|
- ac/surface: don't validate DCC settings if DCC isn't possible
|
|
- vbo/dlist: free copied.buffer if no vertices were copied
|
|
- glx/drirc: add a force_direct_glx_context option
|
|
- mesa: enable force_direct_glx_context for DiscoveryStudio2020
|
|
- llvmpipe: add missing NIR alu-op handling
|
|
- mesa: print a warning when an extension can't be disabled
|
|
- mesa: don't use dummy_true for some MESA extensions
|
|
- mesa/init: replace call_once with manual implementation
|
|
- drirc: add mesa_extension_override option
|
|
- drirc: add options for BETA CAE Ansa application.
|
|
- mesa: always call _mesa_update_pixel
|
|
- mesa: remove NEW_COPY_TEX_STATE
|
|
- radeonsi/sqtt: fix shader stage values
|
|
- radeonsi/sqtt: fix FINISH_DONE / BUSY usage
|
|
- radeonsi/sqtt: reserve a vmid when sqtt is enabled
|
|
- radeonsi: unreference framebuffer state after use
|
|
- radeonsi/sqtt: increase the default buffer size to 32MB
|
|
- radeonsi: fix fast clear / depth decompression corruption
|
|
- radeonsi: silence a warning
|
|
- radeonsi: use max_zplanes after the last write
|
|
- glx: fix querying GLX_FBCONFIG_ID for Window
|
|
- radeonsi: don't use perp. end caps when line smoothing is on
|
|
- glthread: only log glthread destroy reason when it's not NULL
|
|
- radeonsi/gfx8: use the proper dcc clear size
|
|
- vbo/dlist: fix loopback crash
|
|
- vbo/dlist: add vertices to incomplete primitives
|
|
- amd: add SDMA_NOP_PAD
|
|
- radv: partial sdma support
|
|
- radv: allocate the prime buffer as uncached
|
|
- vulkan/wsi: add use_prime_blit param to wsi_swapchain_init
|
|
- vulkan/wsi: add a private transfer pool to exec the DRI_PRIME blit
|
|
- radv: implement wsi's private transfer queue using SDMA
|
|
- radeonsi/gfx10: fix si_texture_get_offset for mipmapped tex
|
|
- radeonsi,radv: fix usages of surf_pitch
|
|
- radeonsi/tests: update expected results
|
|
- radeonsi/tests: add expected results for vega20
|
|
- driconf: enable vs_position_always_invariant for Dirt Rally
|
|
- mesa: use less temporaries in build_lighting
|
|
- radeonsi: limit loop unrolling for LLVM < 13
|
|
- radeonsi: change rounding mode to round to even
|
|
|
|
Qiang Yu (76):
|
|
|
|
- driconf: add executable_regexp application attribute
|
|
- drirc: add Mari application workaround
|
|
- xmlconfig_test: add unit test for executable_regexp
|
|
- radeonsi: add radeonsi_force_use_fma32 driconf option
|
|
- driconf: support META application
|
|
- glx/dri3: fix glXQueryContext does not return GLX_RENDER_TYPE value
|
|
- loader/dri3: fix piglit egl-copy-buffer test
|
|
- ci: remove egl-copy-buffers from fail list
|
|
- glx: no need to create extra pixmap for pbuffer
|
|
- glx: check drawable type before create drawble
|
|
- glx: add drawable type argument when create drawable
|
|
- loader/dri3: add drawable type set by GLX and EGL
|
|
- loader/dri3: remove unused present capability query
|
|
- loader/dri3: pack window present event setup into a function
|
|
- loader/dri3: setup present event with drawable type info
|
|
- loader/dri3: replace is_pixmap with drawable type
|
|
- loader/dri3: rename dri3_fake_front_buffer
|
|
- loader/dri3: stop doing anything in swap buffer for some drawable
|
|
- loader/dri3: support glx pbuffer swap
|
|
- glx: fix regression for drawable type detection
|
|
- glapi: should not add alias function to static_data.py
|
|
- gallium: add caps for sparse texture support
|
|
- mesa: add ARB_sparse_texture extension
|
|
- mesa: add ARB_sparse_texture constants
|
|
- gallium: add screen get_sparse_texture_virtual_page_size callback
|
|
- gallium: add get_sparse_texture_virtual_page_size for noop/rbug/trace
|
|
- mesa: add ARB_sparse_texture query in glGetInternalformativ
|
|
- mesa: add ARB_sparse_texture texture param set/get
|
|
- mesa/st: add st_GetSparseTextureVirtualPageSize interface
|
|
- mesa: glTexStorage* support sparse texture allocation
|
|
- mesa/st: add st_TexturePageCommitment interface
|
|
- mesa: implement glTexPageCommitmentARB/glTexturePageCommitmentEXT
|
|
- mesa/st: update NumSparseLevels from pipe_resource
|
|
- winsys/radeon: change surface_init flags to 64bit
|
|
- ac/surface: fix prt_first_mip_tail calculation for gfx9+
|
|
- ac/surface: add prt_tile_depth
|
|
- radeonsi: support alloc a sparse texture
|
|
- radeonsi: use staging buffer for sparse texture when transfer map
|
|
- radeonsi: implement get_sparse_texture_virtual_page_size
|
|
- radeonsi: support texture resource commit
|
|
- radeonsi: enable ARB_sparse_texture
|
|
- nir: fix nir_tex_instr hash not count is_sparse field
|
|
- gallium/dd_debug: add get_sparse_texture_virtual_page_size
|
|
- gallium: add PIPE_CAP_QUERY_SPARSE_TEXTURE_RESIDENCY
|
|
- mesa: add ARB_sparse_texture2 extension
|
|
- mesa/main: relax alignment check when ARB_sparse_texture2 available
|
|
- glsl: add ARB_sparse_texture2 extension
|
|
- glsl: ir_texture support sprase texture
|
|
- glsl: add _texture related sparse texture builtin functions
|
|
- glsl: add _textureCubeArrayShadow related sparse texture builtin func
|
|
- glsl: add _texelFetch related sparse texture builtin function
|
|
- glsl: add sparse texture image load builtin functions
|
|
- glsl: add vec5 glsl types
|
|
- glsl/nir: convert sparse ir_texture to nir
|
|
- glsl/nir: convert sparse image load to nir
|
|
- glsl/nir: adjust sparse texture nir_variable
|
|
- glsl: add sparseTexelsResidentARB builtin function
|
|
- glsl/nir: convert is_sparse_texels_resident to nir
|
|
- radeonsi: lower nir_intrinsic_is_sparse_texels_resident
|
|
- mesa/main: allow multi sample sparse texture
|
|
- mesa/main: export _is_multisample_target for external usage
|
|
- gallium: add multi_sample parameter to get_sparse_texture_virtual_page_size
|
|
- radeonsi: enable multi sample sparse texture support
|
|
- radeonsi: enable ARB_sparse_texture2
|
|
- gallium: add PIPE_CAP_CLAMP_SPARSE_TEXTURE_LOD
|
|
- mesa: add ARB_sparse_texture_clamp extension
|
|
- glsl: add ARB_sparse_texture_clamp extension
|
|
- glsl: ir_texture add clamp field
|
|
- glsl: _texture support clamp parameter
|
|
- glsl: _textureCubeArrayShadow support clamp
|
|
- glsl: add ARB_sparse_texture_clamp builtin functions
|
|
- glsl/nir: convert ir_texture->clamp to nir
|
|
- radeonsi: enable ARB_sparse_texture_clamp
|
|
- radeonsi: workaround Specviewperf13 Catia hang on GFX9
|
|
- radeonsi: fix depth stencil multi sample texture blit
|
|
- glx: fix pbuffer refcount init
|
|
|
|
Rafael Antognolli (2):
|
|
|
|
- intel/compiler: Assert that unsupported tg4 offsets were lowered for XeHP
|
|
- intel: Emit 3DSTATE_BINDING_TABLE_POOL_ALLOC for XeHP
|
|
|
|
Renato Pereyra (2):
|
|
|
|
- anv: Enable implicit CCS for external images
|
|
- anv: add helper methods related to enabling CCS for external images
|
|
|
|
Rhys Perry (83):
|
|
|
|
- radv: Use nir_var_mem_image in meta shaders
|
|
- aco: disable mul(cndmask(0, 1, b), a) optimization sometimes
|
|
- aco: simplify emit_stream_output()
|
|
- radv: don't use a separate cache entry for GS copy shaders
|
|
- aco: use std::vector and IDSet in RA validator
|
|
- docs: update radv extensions in features.txt
|
|
- nir/algebraic: optimize Cyberpunk 2077's open-coded bitfieldReverse()
|
|
- nir/lower_system_values: replace local_invocation_id components with zero
|
|
- radv: lower load_local_invocation_index with 1D workgroups
|
|
- aco: optimize load_local_invocation_index with single-wave workgroups
|
|
- aco: consider pseudo-instructions reading exec in needs_exec_mask()
|
|
- aco/spill: use spills_entry instead of spills_exit to kill linear VGPRs
|
|
- radv: make RADV_FORCE_FAMILY case-insensitive
|
|
- radv: fix max_render_backends for Sienna Cichlid null winsys
|
|
- spirv: run nir_copy_prop before nir_rematerialize_derefs_in_use_blocks_impl
|
|
- radv: stop running copy-propagation before nir_opt_deref
|
|
- nir/lower_tex: don't calculate texture_mask for texture_index>=32
|
|
- nir/lower_subgroups: fix left shift of -1
|
|
- aco: properly update use counts if a extract is still used
|
|
- nir/dce: fix DCE of loops with a halt or return instruction in the pre-header
|
|
- nir/cf: fix insertion of loops/ifs after jumps
|
|
- nir/tests: add DCE test for loops following a jump
|
|
- aco: don't create DPP instructions with SGPR operands
|
|
- nir/opt_offsets: fix try_extract_const_addition recursion
|
|
- nir/opt_offsets: remove need to loop try_extract_const_addition
|
|
- nir/algebraic: optimize a*#b & -4
|
|
- radv: add radv_meta_init_shader
|
|
- radv: include RT shaders in RADV_DEBUG=shaders,shaderstats
|
|
- radv: clone shader in radv_shader_compile_to_nir
|
|
- radv: initialize workgroup_size in radv_meta_init_shader
|
|
- aco: don't create unnecessary addition in indirect get_sampler_desc()
|
|
- aco: improve clrx disassembly
|
|
- aco: use more predictable tiebreaker when forming MADs
|
|
- aco: create v_fmamk_f32/v_fmaak_f32 from nir_op_ffma
|
|
- aco: swap operands if necessary to create v_madak/v_fmaak
|
|
- aco: swap multiplication operands if needed to create v_fmac_f32/etc
|
|
- radv,aco: implement nir_op_ffma
|
|
- radv,aco: don't lower some ffma instructions
|
|
- radv: have the null winsys set more fields
|
|
- aco: fix check_vop3_operands() for f16vec2 ffma fneg combine
|
|
- nir/opt_load_store_vectorize: improve ssbo/global alias analysis
|
|
- nir/algebraic: optimize more 64-bit imul with constant source
|
|
- nir/opt_access: infer CAN_REORDER for global access
|
|
- aco: disallow SMEM offsets that are not multiples of 4
|
|
- aco: skip &-4 before SMEM
|
|
- aco: remove SMEM constant/addition combining out of the loop
|
|
- radv: increase maxTaskOutputCount to 65535
|
|
- nir/lower_shader_calls: fix store_scratch write_mask
|
|
- ac/nir: fix store_buffer_amd write_masks
|
|
- nir/uniform_atomics: fix is_atomic_already_optimized without workgroups
|
|
- aco: remove pack_half_2x16(a, 0) optimization
|
|
- radv: add RADV_DEBUG=splitfma
|
|
- radv: set radv_split_fma=true for Proton SotTR
|
|
- nir/unsigned_upper_bound: don't follow 64-bit f2u32()
|
|
- aco: use p_extract for SGPR nir_op_unpack_half_2x16_split_y
|
|
- nir: add nir_op_fmulz and nir_op_ffmaz
|
|
- radv,aco,ac/llvm: implement fmulz and ffmaz
|
|
- aco: consider legacy multiplications in optimizer
|
|
- aco: create v_mac_legacy_f32/v_fmac_legacy_f32
|
|
- nir/algebraic: add ignore_exact() wrapper
|
|
- nir/algebraic: optimize open-coded fmulz/ffmaz
|
|
- nir: add some missing nir_alu_type_get_base_type
|
|
- nir/algebraic: optimize expressions using fmulz/ffmaz
|
|
- nir/builder: set write mask if not provided
|
|
- nir/builder: assume scalar alignment if not provided
|
|
- radv: avoid providing a write_mask to intrinsic builders
|
|
- radv: avoid providing an align_mul to intrinsic builders
|
|
- radv: avoid providing an align_offset to intrinsic builders
|
|
- ac/nir: avoid providing a write_mask to intrinsic builders
|
|
- ac/nir: avoid providing an align_mul to intrinsic builders
|
|
- ac/nir: use shorter builder names
|
|
- radv: fix raytracing with wave32
|
|
- radv: use wave32 for raytracing
|
|
- radv: use 8x4 workgroups for wave32 RT
|
|
- radv: fix optimized MSAA copies with suballocated images
|
|
- aco: fix neg(abs(mul(a, b))) if the mul is not VOP3
|
|
- aco: optimize abs(mul(a, b))
|
|
- aco: preserve pass_flags during format conversions
|
|
- aco: fix v_mac_legacy_f32
|
|
- aco: don't encode src2 for v_writelane_b32_e64
|
|
- radv: fix R_02881C_PA_CL_VS_OUT_CNTL with mixed cull/clip distances
|
|
- anv: Enable nir_opt_access
|
|
- radv: include disable_aniso_single_level and adjust_frag_coord_z in key
|
|
|
|
Rob Clark (63):
|
|
|
|
- freedreno: Fix for large epilogues
|
|
- freedreno/ir3+isa: Cleanup bindless cat5 samp/tex encoding
|
|
- freedreno/isa: Fixes for validation
|
|
- freedreno/ir3/tests: Fix indentation
|
|
- freedreno/ir3/tests: Add additional disasm test vectors
|
|
- freedreno/ir3: Don't lower s2en if samp/tex is too large
|
|
- isaspec: Fix derived field width
|
|
- isaspec: Do not emit duplicate field encodes
|
|
- isaspec: Add bitfield size assertions
|
|
- freedreno/isa: Add immed reg accessors
|
|
- freedreno/ir3/tests: Don't skip encode test if decode fails
|
|
- freedreno/ir3/tests: Add some 8/16b ldg/stg tests
|
|
- freedreno/isa: Fix ldg/stg "halfness"
|
|
- freedreno/computerator: Fix mergedregs
|
|
- freedreno/ir3: 64b intrinsic lowering
|
|
- freedreno/ir3: Lower 64b phis
|
|
- freedreno/ir3: Add load/store_global lowering
|
|
- isaspec: Fix gpu_id for default_options
|
|
- freedreno: Skip built-in shaders for clover
|
|
- freedreno/ir3: Handle MESA_SHADER_KERNEL
|
|
- freedreno: implement set_compute_state()
|
|
- freedreno/ir3: vec8+vec16 support
|
|
- freedreno/ir3: implement load_work_dim intrinsic
|
|
- freedreno/ir3: Add support for load_kernel_input
|
|
- freedreno/ir3: Move lower_idiv_options
|
|
- freedreno: Fix set_global_binding
|
|
- freedreno/ir3: Deal with zero-source instructions
|
|
- freedreno/ir3: 16b bools
|
|
- freedreno/ir3: 8bit fixes
|
|
- freedreno/ir3: Fix load/store_global_ir3 type
|
|
- freedreno/ir3: Fix reg size validation
|
|
- freedreno/ir3: Add wide load/store lowering
|
|
- freedreno/ir3: Add ihadd/uhadd
|
|
- freedreno/ir3: Get req_local_mem from pipe_compute_state
|
|
- freedreno/ir3: Fix validation of subgroup macros
|
|
- freedreno/drm: Add some asserts
|
|
- freedreno/drm: Move suballoc_bo to device
|
|
- freedreno/ir3/print: Show end's outidxs
|
|
- freedreno/ir3: xfb fix for duplicate outputs
|
|
- freedreno: Add PIPE_SHADER_IR_NIR_SERIALIZED support
|
|
- freedreno: caps for clover
|
|
- isaspec: Add prototypes for expr evaluators
|
|
- freedreno/crashdec: Split out mempool decoding
|
|
- freedreno/crashdec: HFI queue decoding
|
|
- freedreno/crashdec: Fallback to chip_id for GPU id
|
|
- freedreno/crashdec: Basing GMU log decoding
|
|
- freedreno/ir3: Handle instr->address when cloning
|
|
- freedreno/computerator: Fix @buf header
|
|
- freedreno/computerator: Mark shader bo for dumping
|
|
- freedreno/ir3: Dump const state with shader disasm
|
|
- clover: Move min image support check
|
|
- freedreno: Report system memory as video memory
|
|
- freedreno: Small dev_id_compare() cleanup
|
|
- freedreno: Rearrange dev_id_compare() logic
|
|
- freedreno: Handle wildcard fuse-id in device matching
|
|
- freedreno: Update chip-ids
|
|
- freedreno/decode: Handle chip-id
|
|
- mesa/st: Lowered ucp should still mark rast state dirty
|
|
- freedreno: Pass shader cache key instead of shader key
|
|
- freedreno: Add FD_DIRTY_RASTERIZER_CLIP_PLANE_ENABLE
|
|
- freedreno/a6xx: Fix clip_mask
|
|
- freedreno/a5xx: Fix clip_mask
|
|
- freedreno: Add missing generated header dependency
|
|
|
|
Rohan Garg (7):
|
|
|
|
- intel/fs: OpImageQueryLod does not support arrayed images as an operand
|
|
- ci: Do not remove libgbm-dev
|
|
- ci: Move common variables out into a separate file
|
|
- ci: Do not remove wget
|
|
- ci/piglit: Start vtest server if driver is set to virpipe
|
|
- anv: Enable VK_VALVE_mutable_descriptor_type
|
|
- docs: Update features and new_features for anv
|
|
|
|
Roland Scheidegger (1):
|
|
|
|
- llvmpipe: adjust rounding for viewport scissoring
|
|
|
|
Roman Gilg (1):
|
|
|
|
- vulkan/wsi/x11: document implementation
|
|
|
|
Roman Stratiienko (6):
|
|
|
|
- android.mk: Add missing variables to the make target
|
|
- v3dv: Fix dEQP-VK.info#instance_extensions test
|
|
- v3dv: Fix V3DV_HAS_SURFACE preprocessor condition
|
|
- v3dv: Hotfix: Rename remaining V3DV_HAS_SURFACE->V3DV_USE_WSI_PLATFORM
|
|
- v3d: Don't force SCANOUT for PIPE_BIND_SHARED requests
|
|
- v3dv: add Android support
|
|
|
|
Ruijing Dong (6):
|
|
|
|
- frontends/omx: preserve omx to keep current mode for avc decoding
|
|
- frontends/va: preparing to disable h264 extension flag in vaapi dec path
|
|
- radeon/vcn: enable dynamic dpb Tier2 support for h264 dec vaapi path
|
|
- radeon/vcn: enable dynamic dpb Tier2 for hevc dec vaapi path
|
|
- frontend/va: Keep surface buf addr before reallocation
|
|
- radeon/vcn: Updating render_pic_list for correction
|
|
|
|
Ryan Houdek (1):
|
|
|
|
- util/xmlconfig: Allow DT_UNKNOWN files
|
|
|
|
Ryan Neph (1):
|
|
|
|
- venus: ignore framebuffer for VkCommandBuffer executed outside of render pass
|
|
|
|
Sagar Ghuge (16):
|
|
|
|
- intel/compiler: Set correct cache policy for A64 byte scattered read
|
|
- intel/genxml: Add new Primitive ID Not Required bit field to 3DSTATE_DS
|
|
- intel/compiler: Track primitive id in domain/evaluation shader
|
|
- anv: Drop hint if primitive id is required or not
|
|
- iris: Drop hint if primitive id is required or not
|
|
- anv: Pass correct aux usage while filling out surface state
|
|
- anv: Enable CCS for storage image formats
|
|
- intel/compiler: Set correct return format for brw_SAMPLE
|
|
- intel/compiler: Don't hardcode padding source type to 32bit
|
|
- intel/compiler: Fix instruction size written calculation
|
|
- intel/compiler: Add helper to support half float payload with padding
|
|
- intel/fs: Define and set correct sampler simd mode
|
|
- intel/compiler: Prepare disasm for 16-bit sampler params
|
|
- anv, iris: Implement Wa_14014890652 for DG2
|
|
- intel/genxml: Add L1 Cache Control bit field
|
|
- intel/genxml: Add Un-Typed Data-Port Cache Flush field to pipe control
|
|
|
|
Sajeesh Sidharthan (1):
|
|
|
|
- frontends/va/av1: handle multiple slice params
|
|
|
|
Samuel Pitoiset (158):
|
|
|
|
- aco: do not return an empty string when disassembly is not supported
|
|
- radv: fix removing PSIZ when it's not emitted by the last VGT stage
|
|
- radv: fix OpImageQuerySamples with non-zero descriptor set
|
|
- radv: do not remove PSIZ for streamout shaders
|
|
- aco: fix invalid IR generated for b2f64 when the dest is a VGPR
|
|
- aco: fix emitting stream outputs when the first component isn't zero
|
|
- radv: add an assertion to prevent GPU hangs when VRS isn't supported
|
|
- ac/rgp: remove useless code related to GFX6-7
|
|
- aco: fix loading 64-bit inputs with fragment shaders
|
|
- radv,aco: decouple shader_info/options from radv_shader_args
|
|
- radv: re-emit prolog inputs when the nontrivial divisors state changed
|
|
- radv: remove old RADV_TRACE_FILE warning
|
|
- radv: fix build errors with Android
|
|
- radv: report error messages when the driver can't be initialized
|
|
- aco: only load streamout buffers if streamout is enabled
|
|
- radv: lower the viewport index to zero when the VGT stage doesn't export it
|
|
- radv: fix invalid wait_dst_stage_mask type
|
|
- ac/nir: remove bogus assertion about the position for culling
|
|
- zink: add CI lists and deqp-suite configuration for RADV
|
|
- radv: move freeing the trigger SQTT file at a better place
|
|
- radv: remove useless checks about GFX7 for SQTT
|
|
- radv: enable SQTT instruction timing by default
|
|
- radv: stop reporting SQTT/RGP support as experimental
|
|
- docs: document RADV_THREAD_TRACE_* envvars
|
|
- radv: rename radv_shader_variant to radv_shader
|
|
- radv: only enable VK_EXT_display_control for vrcompositor (SteamVR)
|
|
- radv: optimize subpass barrier flushes for imageless framebuffers
|
|
- Revert "radv: only enable VK_EXT_display_control for vrcompositor (SteamVR)"
|
|
- radv: remove unused parameter in radv_emit_subpass_barrier()
|
|
- radv/sqtt: reserve a VMID for better profiling
|
|
- radv/sqtt: stop calling radv_cs_add_buffer() for the thread trace BO
|
|
- radv: do not expose buffer features for depth/stencil formats
|
|
- ac/rgp: fix alignment of code object records to follow the RGP spec
|
|
- radv: only emit PGM_LO for the vertex prolog
|
|
- ac/rgp: add support for queue event timings
|
|
- ac/rgp: add support for clock calibration
|
|
- radv/winsys: use same IBs padding as the kernel
|
|
- zink: update the CI lists for RADV
|
|
- radv/sqtt: fix GPU hangs when capturing from the compute queue
|
|
- ac: add initial SPM support
|
|
- ac/rgp: add support for dumping SPM data
|
|
- radv: fix a sync issue on GFX9+ by clearing the upload BO fence
|
|
- ac/spm: fix determing the SPM wire
|
|
- ac/spm: fix determining the counter slot
|
|
- zink: update the CI lists for RADV
|
|
- zink: skip one GLES31 subset to avoid GPU hangs on Navi10
|
|
- nir: fix constant expression of ibitfield_extract
|
|
- radv: simplify re-using cache entries in radv_pipeline_cache_insert_shaders()
|
|
- radv: disable HTILE for D32S8 format and mipmaps on GFX10
|
|
- util/queue: fix a data race detected by TSAN when finishing the queue
|
|
- radv: allow TC-compat CMASK with storage images on GFX10+
|
|
- radv: ignore the descriptor set layout when creating descriptor template
|
|
- radv: convert remaining enums/structs to 1.2 versions
|
|
- ac: change family names to uppercase in ac_get_family_name()
|
|
- radv/winsys: report the real family name instead of OVERRIDDEN
|
|
- radv: fix emitting VBO when vertex input dynamic state is used
|
|
- radv: make radv_break_on_count() a non-static function
|
|
- radv: make radv_copy_buffer() a non-static function
|
|
- radv: implement optimized MSAA copies using FMASK
|
|
- ac/nir: fix left shift of 1 by 31 places detected by UBSAN
|
|
- radv: fix OOB access for inline push constants detected by UBSAN
|
|
- aco: fix right shift of exponent 32 detected by UBSAN
|
|
- radv: implement vkCmdWriteTimestamp2KHR()
|
|
- radv: implement vkCmd{Reset,Set}Event2KHR()
|
|
- radv: implement vkCmdWriteBufferMarker2AMD()
|
|
- radv: implement vkCmdWaitEvents2KHR()/vkCmdPipelineBarrier2KHR()
|
|
- radv: implement vkQueueSubmit2KHR()
|
|
- radv: add support for VkMemoryBarrier2KHR
|
|
- radv: add support for creating device-only events
|
|
- radv: add support for new pipeline stages and access masks
|
|
- radv: add support for VK_IMAGE_LAYOUT_ATTACHMENT_OPTIMAL_KHR
|
|
- radv: switch the remaining stages/access to VK_PIPELINE_STAGE_2/VK_ACCESS_2
|
|
- radv: advertise VK_KHR_synchronization2
|
|
- radv: fix accessing NULL pointers when destroy the VRS image
|
|
- radv: add a workaround to fix a segfault with Metro Exodus (Linux native)
|
|
- radv: configure the number of SGPRs/VGPRs directly from the arguments
|
|
- radv: copy the user SGPRs locations outside of radv_declare_shader_args()
|
|
- radv: constify radv_shader_info in radv_declare_shader_args()
|
|
- radv/llvm: stop trying to eliminate VS outputs
|
|
- radv/llvm: constify radv_shader_info
|
|
- radv: fix resetting the entire vertex input dynamic state
|
|
- radv: remove never reached free() when compiling shaders
|
|
- radv: add a helper function to upload a shader binary
|
|
- radv: upload shader binaries after they are all compiled
|
|
- radv: pass a pointer to a pipeline for the create/insert cache functions
|
|
- radv: upload shader binaries of a pipeline contiguously in memory
|
|
- radv: move a comment at the right place in CmdBindVertexBuffers2EXT()
|
|
- radv: constify radv_vertex_binding in CmdSetVertexInputEXT()
|
|
- radv: constify radv_vs_input_state() in more places
|
|
- radv: fix dynamic rendering with VRS
|
|
- radv: avoid prefixing few VkXXX structures by struct
|
|
- radv/sqtt: always dump pipelines and shaders ISA
|
|
- radv: add few helpers for configuring performance counters
|
|
- radv: add initial SPM support on GFX10+
|
|
- radv: fix resume/suspend render pass with depth/stencil attachment
|
|
- radv: mark GFX10.3 (aka RDNA2) as conformant products with CTS 1.2.7.1
|
|
- Revert "radv: Add bufferDeviceAddressMultiDevice support."
|
|
- radv: do not perform depth/stencil resolves for suspended render pass
|
|
- nir/lower_tex: add lower_lod_zero_width
|
|
- radv: enable lower_lod_zero_width
|
|
- radv: fix dynamic rendering inheritance if the subpass index isn't 0
|
|
- radv: ignore dynamic inheritance if the render pass isn't NULL
|
|
- radv/winsys: remove useless has_sparse_vm_mappings checks
|
|
- radv/winsys: clear the PRT VA range when destroying a virtual BO
|
|
- radv/winsys: stop using reference counting for virtual BOs
|
|
- radv/winsys: update sparse mappings with OP_REPLACE instead of OP_MAP/OP_UNMAP
|
|
- ac/nir: add an option to disable anisotropic filtering for single level images
|
|
- radv,aco: do not disable anisotropy filtering for non-mipmap images
|
|
- radv: fix dynamic rendering global scissor
|
|
- radv: remove unnecessary NULL checks in vkMapMemory()/vkUnmapMemory()
|
|
- radv/winsys: stop zeroing few structs in buffer_from_fd()
|
|
- radv/winsys: remove radv_amdgpu_winsys_bo::is_shared
|
|
- radv: fix restoring subpass during hw/fs color resolves
|
|
- radv: re-enable fast clears for images that support comp-to-single
|
|
- radv: re-apply "Do not access set layout during vkCmdBindDescriptorSets."
|
|
- radv: fix clears with value of "1" and different DCC signedness
|
|
- radv: stop checking buffer size in vkCreateBuffer()
|
|
- radv: print number of levels with RADV_DEBUG=img
|
|
- radv: fix copying mutable descriptors to sampler descriptors
|
|
- radv: add drirc radv_disable_htile_layers and enable it for F1 2021
|
|
- aco: fix a dynamic-stack-buffer-overflow when printing instructions
|
|
- aco: do not print prologs disassembly if no disassembler
|
|
- radv,aco: keep track of the prolog disassembly if necessary
|
|
- radv: save the vertex prolog to the trace BO for debugging
|
|
- radv: dump the VS prolog disassembly to the hang report
|
|
- radv: add UMR markers for the vertex prolog
|
|
- radv: do not use the common entrypoint for the Metro Exodus layer
|
|
- radv: remove remaining dead code related to the old sync code
|
|
- radv/winsys: remove unused syncobj functions
|
|
- radv: stop checking if dynamic states changed
|
|
- radv: reset VRS if the current subpass doesn't have a VRS attachment
|
|
- radv: fix computing the fb size in presence of dynamic VRS attachment
|
|
- radv: only clear VRS_HTILE_ENCODING on GFX10.3+
|
|
- radv: allow to disable anisotropic filtering for single level image with drirc
|
|
- radv: enable radv_disable_aniso_single_level for Battlefield 1 & V
|
|
- radv/winsys: fix zero submit if no timeline semaphore support
|
|
- vulkan/runtime: fix accessing NULL pointers detected by UBSAN
|
|
- radv: fix missing destroy for the overallocation mutex
|
|
- radv: fix computing the number of color samples if no attachments
|
|
- radv: optimize CPU overhead of si_cp_dma_prefetch() slightly
|
|
- radv: disable attachmentFragmentShadingRate for RADV_DEBUG=nohiz
|
|
- radv: fix copying VRS rates to HTILE if the depth/stencil is cleared
|
|
- radv: fix copying VRS rates if the ds attachment uses mips
|
|
- radv/winsys: set GTT_WC flag for CS IBs on GFX6
|
|
- radv: do not restore NULL compute pipelines after meta operations
|
|
- radv: stop checking if pipelines are NULL during draws/dispatches
|
|
- vulkan: Update the XML and headers to 1.3.204
|
|
- radv: implement 1.3 features/properties
|
|
- radv: report textureCompressionASTC_HDR as not supported
|
|
- radv: add a no-op version of vkGetPhysicalDeviceToolPropertiesEXT()
|
|
- radv: switch a bunch of struct/enum to 1.3 versions
|
|
- radv: bump conformance version to 1.3.0.0 for RDNA2
|
|
- radv: advertise Vulkan 1.3
|
|
- radv/ci: mark dEQP-VK.api.version_check.version as expected failure on Stoney
|
|
- radv/winsys: fix missing buffer_make_resident() for the null winsys
|
|
- Revert "radv: re-apply "Do not access set layout during vkCmdBindDescriptorSets.""
|
|
- radv/winsys: fix initializing debug/perftest options if multiple instances
|
|
- radv: enable radv_disable_aniso_single_level for The Evil Within 1&2
|
|
|
|
Scott Anderson (1):
|
|
|
|
- egl/wayland: Remove unused wayland enum
|
|
|
|
Sergii Melikhov (1):
|
|
|
|
- vulkan: Unlock before return.
|
|
|
|
Shmerl (1):
|
|
|
|
- docs/features: Add VK_KHR_acceleration_structure, VK_KHR_pipeline_library, VK_KHR_ray_query, VK_KHR_ray_tracing_pipeline.
|
|
|
|
Silvestrs Timofejevs (2):
|
|
|
|
- egl: introduce a log level getter function
|
|
- egl: add config debug printout
|
|
|
|
Simon McVittie (1):
|
|
|
|
- meson: Try to link all-targets module if Gallium OpenCL is enabled
|
|
|
|
Simon Ser (4):
|
|
|
|
- renderonly: write down usage rules
|
|
- vulkan/wsi/wayland: use enum wl_shm_format
|
|
- vulkan/wsi/wayland: use DRM_FORMAT_INVALID
|
|
- vulkan/wsi/wayland: remove format switch from wl_shm_format_for_vk_format
|
|
|
|
Stefan Brüns (1):
|
|
|
|
- llvmpipe: Add get_{driver,device}_uuid implementations
|
|
|
|
Tapani Pälli (18):
|
|
|
|
- iris: clear bos_written when resetting a batch
|
|
- vulkan/wsi: provide api for drivers to setup syncobj fd
|
|
- radv: setup syncobj fd via wsi_device_setup_syncobj_fd
|
|
- anv: setup syncobj fd via wsi_device_setup_syncobj_fd
|
|
- vulkan/wsi: implement missing wsi_register_device_event
|
|
- anv: allow VK_IMAGE_LAYOUT_UNDEFINED as final layout
|
|
- glsl: fix invariant qualifer usage and matching rule for GLSL 4.20
|
|
- iris: unref syncobjs and free r/w dependencies array for slab entries
|
|
- mesa: free idalloc storage for display lists
|
|
- mesa: free vbo_save_vertex_list store prims
|
|
- mesa: refactor GetProgramiv to use program resource list
|
|
- mesa: move GetProgramInterfaceiv as a shader_query function
|
|
- mesa: change GetProgramiv name length queries to use program resources
|
|
- intel/genxml: add PIPE_CONTROL field for L3 read only cache invalidation
|
|
- anv: invalidate L3 read only cache when VF cache is invalidated
|
|
- iris: invalidate L3 read only cache when VF cache is invalidated
|
|
- iris: fix a leak on surface states
|
|
- mesa/st: always use DXT5 when transcoding ASTC format
|
|
|
|
Tatsuyuki Ishi (3):
|
|
|
|
- aco: support DPP8
|
|
- aco: lower masked swizzle to DPP8
|
|
- radv/sqtt: Add and enable basic EXT_debug_utils support.
|
|
|
|
Thierry Reding (2):
|
|
|
|
- tegra: Use private reference count for sampler views
|
|
- tegra: Use private reference count for resources
|
|
|
|
Thomas H.P. Andersen (49):
|
|
|
|
- nine: remove dead code
|
|
- svga: fix bitwise/logical and mixup
|
|
- meson: drop compatability with < 0.48
|
|
- meson: drop a comment relating to old meson version
|
|
- docs: update the required meson version
|
|
- meson: drop a temp formatting variable
|
|
- ci: clean up debian-clang no-error list
|
|
- gallivm: avoid a self-assign warning
|
|
- ci: debian-clang: drop -Wno-error for self-assign
|
|
- lavapipe: fix implicit-fallthrough warning
|
|
- i915g: fix implicit-fallthrough warning
|
|
- r300: remove a set but not used variable
|
|
- r600: remove a set but not used variable
|
|
- glx: remove a set but not used variable
|
|
- gallium/u_threaded: drop unused function
|
|
- gallium/tgsi_exec: drop unused function
|
|
- draw: drop unused function
|
|
- microsoft/compiler: dxil_nir_opt_alu_deref_srcs: return progress
|
|
- lavapipe: fix string-plus-int warning
|
|
- i915g: avoid left shifting a negative number
|
|
- panvk: use FALLTHROUGH to stop a warning
|
|
- panvk: cast negative value to unint8_t
|
|
- ci: debian-clang: build more drivers
|
|
- tu/clear_blit: use || when working with bools
|
|
- r600/sb: silence a sometimes-uninitialized warning
|
|
- ci: debian-clang: -Wno-error for sometimes-uninitialized
|
|
- zink: malloc/sizeof mismatch
|
|
- xa: fix compile warning for -Wabsolute-value
|
|
- broadcom/compiler: fix compile warning -Wabsolute-value
|
|
- ci: debian-clang: drop -Wno-error=absolute-value
|
|
- replace 0 with NULL for NULL pointers
|
|
- freedreno: silence sometimes-uninitialized warning
|
|
- freedreno: drop dead assignment
|
|
- meson: add check kwarg to run_command
|
|
- ci: clean up debian-android no-error list
|
|
- anv: drop unused label
|
|
- ci: debian-android: drop -Wno-error=unused-label
|
|
- vulkan/vk_extensions_gen: fix -Wextern-initializer warning
|
|
- ci: debian-android: drop -Wno-error=extern-initializer
|
|
- microsoft/compiler: fix -Wbitwise-instead-of-logical warning
|
|
- anv: avoid warning about unused function
|
|
- vc4: drop unused function
|
|
- v3d: avoid warning about unused function
|
|
- v3d: avoid warning about unused function
|
|
- broadcom: drop unused functions
|
|
- panfrost: mark two variables as unused
|
|
- anv: drop a set but unused variable
|
|
- anv: drop a set but unused variable
|
|
- svga: silence -Wsometimes-uninitialized
|
|
|
|
Thomas Wagner (1):
|
|
|
|
- util: use anonymous file for memory fd creation
|
|
|
|
Thong Thai (8):
|
|
|
|
- frontends/va: disable packed header support for h264 encoder
|
|
- frontends/va/enc: hardcode h265 encoder ref pic list size
|
|
- radeon/vcn: increase encoder dpb size
|
|
- frontends/va/enc: allow for frames to be marked as (not) referenced
|
|
- radeon: hardcode uvd/vce encoder not_referenced value to false
|
|
- radeon/vcn: implement encoder dpb management
|
|
- frontends/va/enc: default motion estimation parameters for performance
|
|
- radeonsi: add check for graphics to si_try_normal_clear
|
|
|
|
Timothy Arceri (16):
|
|
|
|
- mesa: fix buffer overrun in SavedObj texture obj array
|
|
- mesa: remove old tnl device driver header files
|
|
- mesa: remove _mesa_ir_link_shader()
|
|
- mesa: make _mesa_associate_uniform_storage() static
|
|
- mesa: remove GLSL IR to Mesa IR code
|
|
- mesa/st: move _mesa_generate_parameters_list_for_uniforms() code to st
|
|
- mesa: move _mesa_ensure_and_associate_uniform_storage() to uniform_query.cpp
|
|
- mesa: tidy up ir_to_mesa.{cpp,h} includes, comments, etc
|
|
- mesa: rename ir_to_mesa.{cpp,h} -> link_program.{cpp,h}
|
|
- mesa: update or remove out of date references to ir_to_mesa
|
|
- doc: update source tree doc to reflect recent classic/swrast deletions
|
|
- util: add dri config option force_compat_shaders
|
|
- util: add workaround for SNK HEROINES Tag Team Frenzy
|
|
- mesa: make struct in gl_program a union and remove FIXME
|
|
- glsl/glcpp: make sure to expand new token after concatenation
|
|
- glsl/st: move st_nir_opts() into gl compiler common code
|
|
|
|
Timur Kristóf (44):
|
|
|
|
- radv: Use MESA_VULKAN_SHADER_STAGES to make room for mesh/task.
|
|
- aco: Fix how p_is_helper interacts with optimizations.
|
|
- nir: Rename nir_get_io_vertex_index_src and include per-primitive I/O.
|
|
- nir: Print Mesh Shader specific info.
|
|
- nir: Fix nir_lower_io with per primitive outputs.
|
|
- nir, spirv: Don't mark NV_mesh_shader primitive indices as per-primitive.
|
|
- nir: Add new option to lower invocation ID from invocation index.
|
|
- nir: Lower cull and clip distance arrays for mesh shaders.
|
|
- nir: Don't compact per-vertex and per-primitive outputs together.
|
|
- nir: Group per-primitive outputs at the end for driver location assign.
|
|
- nir: Fix sorting per-primitive outputs.
|
|
- util: Add util_widen_mask function.
|
|
- aco: Use util_widen_mask.
|
|
- radv: Use util_widen_mask.
|
|
- nir: Print task and mesh shader I/O variable names.
|
|
- aco: Clean up and fix quad group instructions with WQM.
|
|
- aco/optimizer_postRA: Fix combining DPP into VALU.
|
|
- aco/optimizer_postRA: Fix applying VCC to branches.
|
|
- spirv: Allow VRS with mesh shaders.
|
|
- gitlab-ci: Disable radv-fossils again.
|
|
- ac/nir/ngg: Lower NV mesh shaders to NGG semantics.
|
|
- radv: Add radv_pipeline_has_mesh helper.
|
|
- radv: Add mesh shader specific info.
|
|
- radv: Cleanup PS input generation.
|
|
- radv: Cleanup VS output param assignment.
|
|
- radv: Add support for per-primitive mesh shader outputs.
|
|
- radv: Setup shader arguments for mesh shaders.
|
|
- radv: Compile mesh shaders and apply the necessary NIR lowerings.
|
|
- radv: Set output driver locations for mesh shaders.
|
|
- radv: Create mesh shading pipelines.
|
|
- radv: Add support for mesh shading pipelines in the command buffer.
|
|
- aco: Add Mesh and Task shader stages.
|
|
- aco: Update README about NGG and mesh shaders.
|
|
- aco: Add 1D workgroup_id support for mesh shaders.
|
|
- aco: Use the correct outinfo for mesh shaders.
|
|
- aco: Export per-primitive mesh shader output attributes.
|
|
- radv: Implement NV_mesh_shader draw calls.
|
|
- radv: Enable NV_mesh_shader with a perftest flag.
|
|
- radv: Note when a mesh shader writes the primitive shading rate.
|
|
- radv: Lower primitive shading rate for mesh shaders.
|
|
- aco: Emit VRS rate when it's per-primitive.
|
|
- radv: Support VRS for mesh shaders.
|
|
- radv: Disable IB2 on compute queues.
|
|
- ac/nir/ngg: Fix mixed up primitive ID after culling.
|
|
|
|
Tomeu Vizoso (13):
|
|
|
|
- ci: Add support for lazor Chromebooks
|
|
- ci: Let manual LAVA jobs have a longer timeout than others
|
|
- freedreno/ci: Test Turnip on Adreno 618
|
|
- ci: Create symlink to /install early
|
|
- ci: Don't set GALLIVM_PERF in the scripts
|
|
- virgl/ci: Set GALLIVM_PERF=nopt,no_quad_lod
|
|
- ci: Remove syslogd
|
|
- virgl/ci: Run each dEQP instance in its own VM
|
|
- ci: Uprev Crosvm
|
|
- lvp: Free the driver_data pointer for all commands
|
|
- anv/ci: Test with deqp-vk on Tiger Lake
|
|
- anv/tests: Free BO cache and device mutex
|
|
- ci: Rebalance Iris jobs
|
|
|
|
Topi Pohjolainen (6):
|
|
|
|
- intel/compiler: Handle new sampler descriptor fields for 16bit sampler
|
|
- intel/compiler/fs: Add support for 16-bit sampler msg payload
|
|
- intel/compiler: Demote sampler params to 16-bit for CMS/UMS/MCS
|
|
- intel/compiler: Prepare ld2dms_w for 4 mcs components
|
|
- intel/compiler: Add new variant for TXF_CMS_W
|
|
- intel/compiler: Deprecate ld2dms and use ld2dms_w instead
|
|
|
|
Uday Kiran Pichika (2):
|
|
|
|
- iris: enable adaptive sync for IRIS
|
|
- anv: enable adaptive sync for ANV
|
|
|
|
Vadym Shovkoplias (3):
|
|
|
|
- intel/fs: Fix a cmod prop bug when cmod is set to inst that doesn't support it
|
|
- anv: Include viewport size in scissor rectangle
|
|
- glthread: Check out of bounds for MultiDrawElementsBaseVertex cmd
|
|
|
|
Vasily Khoruzhick (14):
|
|
|
|
- gallium: add PIPE_CAP_PREFER_POT_ALIGNED_VARYINGS
|
|
- lima: enable PIPE_CAP_PREFER_POT_ALIGNED_VARYINGS
|
|
- lima: add native txp support
|
|
- lima: add support for 3D textures
|
|
- lima: handle 1D samplers
|
|
- lima: add more wrap modes
|
|
- lima/ppir: check if mul node is a source of add node before inserting
|
|
- lima/ppir: implement gl_FragDepth support
|
|
- lima: fix crash with sparse samplers
|
|
- lima: disasm: use last argument as a filename
|
|
- lima: disasm: call util_cpu_detect() to init CPU caps
|
|
- lima: implement dual source blend
|
|
- lima: use 1 as blend factor for dst_alpha for SRC_ALPHA_SATURATE
|
|
- lima: fix blending with min/max ops
|
|
|
|
Viktoriia Palianytsia (1):
|
|
|
|
- glsl: fix for unused variable in glsl_types.cpp
|
|
|
|
Vinson Lee (17):
|
|
|
|
- anv: Fix assertion.
|
|
- radv: Fix memory leak on error path.
|
|
- clover: Add constructor for image_rd_argument.
|
|
- zink: Remove duplicate variable unsized.
|
|
- intel/compiler: Change selected_simd return type to int.
|
|
- intel/compiler: Initialize SIMDSelectionTest member error.
|
|
- vulkan/wsi: Unlock before return on error path.
|
|
- microsoft/spirv_to_dxil: Fix non-Windows build.
|
|
- virgl: Allocate qdws after virgl_init_context to avoid leak.
|
|
- ac/rgp: Initialize clock_calibration with memset.
|
|
- ir3: Make shift operand 64-bit.
|
|
- panfrost: Avoid double unlock.
|
|
- isaspec: Sort field names to generate deterministic output.
|
|
- radv: Fix memory leak on error path.
|
|
- r600/sfn: Remove unused AluInstruction members.
|
|
- isaspec: Remove duplicate return statement.
|
|
- v3dv: Add missing unlocks on errors.
|
|
|
|
Witold Baryluk (2):
|
|
|
|
- zink: Do not access just freed zink_batch_state
|
|
- zink: Fully initialize VkBufferViewCreateInfo for hashing
|
|
|
|
Xiaohui Gu (1):
|
|
|
|
- iris: Mark a dirty update when vs_needs_sgvs_element value changed
|
|
|
|
Yiwei Zhang (27):
|
|
|
|
- dri_interface: remove obsolete interfaces
|
|
- dri_interface: remove gl header
|
|
- venus: refactor private descriptor_set helpers to be private
|
|
- venus: assign valid memoryTypeIndex of exportable ahb memory for image
|
|
- venus: release queues on device creation failure
|
|
- venus: refactor to add vn_device_init
|
|
- venus: refactor to add vn_buffer_init
|
|
- venus: refactor the ahb buffer mem_type_bits query api
|
|
- venus: add struct vn_buffer_memory_requirements
|
|
- venus: add struct vn_image_memory_requirements
|
|
- venus: add buffer cache init and usage flows
|
|
- venus: implement vn_buffer_get_max_buffer_size
|
|
- venus: implement vn_buffer_cache_get_memory_requirements
|
|
- venus: implement vn_buffer_cache_entries_create
|
|
- venus: refactor vn_device_memory_simple_alloc
|
|
- venus: simplify device memory pool alloc and refcount
|
|
- venus: defer roundtrip waiting to vkFreeMemory time
|
|
- venus: track memory type property flags in vn_device_memory
|
|
- venus: move bo allocation for mappable memory to vn_MapMemory
|
|
- venus: subtract appended header size in vn_CreatePipelineCache
|
|
- venus: VkExternalImageFormatProperties is optional
|
|
- tu: VkExternalImageFormatProperties is optional
|
|
- venus: update some obsolete assumptions described
|
|
- venus: track whether a fence is external
|
|
- Revert "venus: remove vn_ring_wait_all"
|
|
- venus: make vn_QueueSubmit async for native submissions
|
|
- venus: properly destroy deferred ahb image before real image creation
|
|
|
|
Yonggang Luo (2):
|
|
|
|
- vulkan: Open registry XML files as UTF-8
|
|
- win32: Fixes thread local on win32 with clang/mingw (!14062)
|
|
|
|
mwezdeck (1):
|
|
|
|
- virgl/drm: New optimization for uploading textures
|
|
|
|
orbea (1):
|
|
|
|
- build: add sha1_h for lp_texture.c
|
|
|
|
satmandu (1):
|
|
|
|
- Fix compilation on armv7l with gcc 11.2.0
|
|
|
|
shanshengwang (1):
|
|
|
|
- radeon/vce: Limiting max supported refernce frames to 1 for h264 encoding
|