mirror of
https://gitlab.freedesktop.org/mesa/mesa.git
synced 2025-01-07 14:25:08 +08:00
5273 lines
245 KiB
ReStructuredText
5273 lines
245 KiB
ReStructuredText
Mesa 21.2.0 Release Notes / 2021-08-04
|
||
======================================
|
||
|
||
Mesa 21.2.0 is a new development release. People who are concerned
|
||
with stability and reliability should stick with a previous release or
|
||
wait for Mesa 21.2.1.
|
||
|
||
Mesa 21.2.0 implements the OpenGL 4.6 API, but the version reported by
|
||
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
|
||
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
|
||
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
|
||
4.6 is **only** available if requested at context creation.
|
||
Compatibility contexts may report a lower version depending on each driver.
|
||
|
||
Mesa 21.2.0 implements the Vulkan 1.2 API, but the version reported by
|
||
the apiVersion property of the VkPhysicalDeviceProperties struct
|
||
depends on the particular driver being used.
|
||
|
||
SHA256 checksum
|
||
---------------
|
||
|
||
::
|
||
|
||
0cb3c802f4b8e7699b1602c08c29d06a4d532ab5b8f7a64676c4ca6bb8f4d426 mesa-21.2.0.tar.xz
|
||
|
||
|
||
New features
|
||
------------
|
||
|
||
- zink supports GL_ARB_texture_filter_minmax, GL_ARB_shader_clock
|
||
|
||
- VK_EXT_provoking_vertex on RADV.
|
||
|
||
- VK_EXT_extended_dynamic_state2 on RADV.
|
||
|
||
- VK_EXT_global_priority_query on RADV.
|
||
|
||
- VK_EXT_physical_device_drm on RADV.
|
||
|
||
- VK_KHR_shader_subgroup_uniform_control_flow on Intel and RADV.
|
||
|
||
- VK_EXT_color_write_enable on RADV.
|
||
|
||
- 32-bit x86 builds now default disable x87 math and use sse2.
|
||
|
||
- GL ES 3.1 on GT21x hardware.
|
||
|
||
- VK_EXT_acquire_drm_display on RADV and ANV.
|
||
|
||
- VK_EXT_vertex_input_dynamic_state on lavapipe
|
||
|
||
- wideLines on lavapipe
|
||
|
||
- VK_EXT_line_rasterization on lavapipe
|
||
|
||
- VK_EXT_multi_draw on ANV, lavapipe, and RADV
|
||
|
||
- VK_KHR_separate_depth_stencil_layouts on lavapipe
|
||
|
||
- VK_EXT_separate_stencil_usage on lavapipe
|
||
|
||
- VK_EXT_extended_dynamic_state2 on lavapipe
|
||
|
||
- NGG shader based primitive culling is now supported by RADV.
|
||
|
||
- Panfrost supports OpenGL ES 3.1
|
||
|
||
- New Asahi driver for the Apple M1
|
||
|
||
- GL_ARB_sample_locations on zink
|
||
|
||
- GL_ARB_sparse_buffer on zink
|
||
|
||
- GL_ARB_shader_group_vote on zink
|
||
|
||
- DRM format modifiers on zink
|
||
|
||
- freedreno+turnip: Initial support for a6xx gen4 (a660, a635)
|
||
|
||
- None
|
||
|
||
|
||
Bug fixes
|
||
---------
|
||
|
||
- The image is distorted while use iGPU(Intel GPU) rendering and output via dGPU (AMD GPU)
|
||
- lima: regression in plbu scissors cmd
|
||
- freedreno: regression in org.skia.skqp.SkQPRunner#gles_multipicturedraw_*_tiled
|
||
- Incorrect rendering
|
||
- intel/isl: Wrong surface format name in batch
|
||
- [RADV] FSR in Resident Evil: Village looks very pixelated on Polaris
|
||
- 21.2.0rc1 Build Failure - GCC6.3
|
||
- Crash in update_buffers after closing KDE "splash screen" downloader
|
||
- Firefox (wayland) crash in wayland_platform
|
||
- Crash in update_buffers after closing KDE "splash screen" downloader
|
||
- Firefox (wayland) crash in wayland_platform
|
||
- radeonsi: persistent, read-only buffer maps are slow to read
|
||
- substance painter flickering with jagged texture and masks shown black
|
||
- radv: FP16 mode in FidelityFX FSR doesn't look right
|
||
- Regression in Turnip with KGSL and Zink running opengl in proot
|
||
- Validation crash on wlroots after wl_shm appeared
|
||
- [RADV] Blocky corruption in Scarlet Nexus and vkd3d-proton 2.4
|
||
- Use out encoding for float immediates
|
||
- Radeon RX580 and 5700 XT: Reloading ARB assembly shaders causes very glitchy rendering
|
||
- i915g: dEQP-GLES2.functional.fragment_ops.depth_stencil..* failures
|
||
- i915g: dEQP-GLES2.functional.fragment_ops.interaction.basic_shader.* and fragment_ops.random.* failures
|
||
- KHR-GL46.shader_ballot_tests.ShaderBallot* tests fails on the main
|
||
- i965 nir compiler is lowering fdiv twice or initializing struct twice
|
||
- [radv] Textures missing in Doom (2016) w/ any temporal AA setting enabled
|
||
- Drop Android.mk
|
||
- [build error] macros.h:88:26: error: size of unnamed array is negative
|
||
- Game Issue: Nuclear Throne crashes in RadeonSI
|
||
- iris: reduce shader storage buffer object alignment
|
||
- agx_compile.h:29:10: fatal error: asahi/lib/agx_pack.h: No such file or directory
|
||
- radv: VBO range check issues with odd strides and sizes
|
||
- Crash in glLinkProgram while trying to craft the link error
|
||
- i915g: wide point failures
|
||
- Wolfenstein II: The New Colossus - Screen goes black in some cases
|
||
- [radv] [regression] Textures missing in Doom (2016)
|
||
- Shader compilation memory leaks
|
||
- radv: fd leak in Android WSI radv_AcquireImageANDROID
|
||
- SpaceEngine in Steam Proton cannot start with Mesa >=20.3
|
||
- [bisected] KDE plasma menu text renders like stretched strangely [amdgpu]
|
||
- radeonsi: glitches in Euro Truck Simulator 2
|
||
- White box for Webrender Firefox with R600_DEBUG=nir on Evergreen GPU
|
||
- radv_AllocateDescriptorSets: validation on variable description count is too strict
|
||
- Luna Sky Crashes on Launch
|
||
- Mesa crashes on undefined texture behaviour
|
||
- Mesa crashes on undefined texture behaviour
|
||
- cache_test uses uninitialized stack memory
|
||
- nir/opt_load_store_vectorize: check_for_robustness() crashes on derefs
|
||
- [anv] GravityMark (benchmark) crashes on ANV
|
||
- turnip: corrupted geometry after tesselation shaders in GTAV
|
||
- [opengl] We happy few not being rendered correctly
|
||
- anv: dynamic state prim type is hard
|
||
- [iris][bisected] piglit test ...ext_external_objects.vk-image-display-muliple-textures failing after enabled
|
||
- Factorio: GPU hang when opening machine inventory
|
||
- RuneScape on Mesa 21.1.1 (VEGA10) has bad performance and leaks memory
|
||
- LLVM12 breaks atomicCompSwap tests with radeonsi
|
||
- freedreno: tex-miplevel-selection causes a creation of too many BOs without flushes, causing a crash
|
||
- RADV: Resident Evil Village Freezes during a specific cutscene
|
||
- Supraland: flickering black bars on ground
|
||
- u_queue.c:173:7: error: implicit declaration of function 'timespec_get' is invalid in C99
|
||
- radv: front face and cull mode dynamic state can desync
|
||
- radv: GPU hang in Cyberpunk 2077 on Big Navi
|
||
- Cyberpunk 1.22 crashes with amdgpu ring gfx_0.0.0 timeout
|
||
- [amdgpu][renoir][rx5500m]: [drm:0xffffffff8198ad5e] \*ERROR* ring gfx timeout, signaled seq=10952, emitted seq=10954
|
||
- [spirv-fuzz] SPIR-V parsing FAILED: Invalid back or cross-edge in the CFG
|
||
- anv: dEQP-VK.graphicsfuzz.spv-stable-bubblesort-flag-complex-conditionals fails
|
||
- panfrost Mount and Blade: Warband (glitches)
|
||
- anv: dEQP-VK.robustness.robustness2*no_fmt_qual.null_descriptor.samples* fail
|
||
- KHR-GLES31.core.shader_image_load_store.basic-glsl-earlyFragTests may be flakey on RPI4
|
||
- SPIRV AMD Driver compiler memory leak/usage - 8Gb+ to compile single 41Kb SPIRV file, Ubuntu 21.04
|
||
- Regression of !10941: mutter Wayland on bare metal crashes when closing HW accelerated windows
|
||
- radv: ACO miscompiles a specific DIRT 5 shader
|
||
- Copy paste bug in v3dv_cmd_buffer.c
|
||
- Segfault in mtx_unlock/amdgpu_bo_slab_destroy
|
||
- [radeonsi] Incorrect rendering when using glDisable(GL_MULTISAMPLE) with multisample backbuffer
|
||
- virtio(vulkan): Error building on Android/arm32
|
||
- [i965] regression: piglit.spec.arb_depth_buffer_float.fbo-depthstencil-gl_depth32f_stencil8-drawpixels
|
||
- iris: Select memory map cacheability settings at BO allocation time
|
||
- zink: regression for primitive-restart on ANV
|
||
- zink: Unimplemented ALU {un,}pack_half_2x16 in nir_to_spirv
|
||
- venus: dEQP-VK.api.command_buffers.bad_inheritance_info_random test failure
|
||
- [radeonsi] glClearTexImage on 1D array only clears first slice
|
||
- zink: Expected Image Operand ConstOffset to be a const object
|
||
- docs: bullet-lists no longer show any bullets
|
||
- [RADV] - Path of Exile (238960) - Ground decals are missing or corrupted using the Vulkan renderer.
|
||
- [ADL-S / TGL-U / TGL-H] Pixels missing / flickering when render some app on weston
|
||
- [iris][biected] piglit.spec.glsl-1_50.execution.geometry.clip-distance-vs-gs-out
|
||
- panfrost ff9a0000.gpu: Unhandled Page fault in AS1 at VA 0x0000000009801200
|
||
- black stripes in X/Xwayland applications under panfrost/midgard
|
||
- radv: significant overhead from radv_pipeline_has_ngg()
|
||
- mesa-21.1.1/src/gallium/drivers/vc4/vc4_resource.c:790: reading from uninitialised array
|
||
- [iris,i965][hsw,ivb,snb,bdw][bisected][regression] wflinfo crashes
|
||
- [iris,i965][hsw,ivb,snb,bdw][bisected][regression] wflinfo crashes
|
||
- The image is distorted while use iGPU(Intel GPU) rendering and output via dGPU (AMD GPU)
|
||
- [radv] Revert !7207 once BG3 is fixed.
|
||
- piglit test 'spec.glsl-1_30.execution.range_analysis_fsat_of_nan' failure
|
||
- iris: Rework iris_transfer_mapto better use iris_has_color_unresolved
|
||
- ir3_cf breaks nir_op_fquantize2f16
|
||
- [i915g] implicit fallthrough
|
||
- Add asahi to CI's build
|
||
- RADV - Regression - CPU side 'hang' in metro exodus
|
||
- Assassin's Creed Odyssey crash on launch
|
||
- Metro Exodus not starting under xwayland
|
||
- [i915g] PIPE_SHADER_CAP_MAX_HW_ATOMIC_COUNTERS is Unknown cap 38
|
||
- Elite Dangerous: Odyssey alpha crashes GPU on launch
|
||
- glmark2-es2 -b terrain crashes since Bifrost FP16
|
||
- [iris][bisected][regression] dEQP-GLES31.functional.texture.multisample.* tests crash on multiple platforms
|
||
- gallium: tc regression
|
||
- panfrost: Look into invalidate_resource()
|
||
- [ivb,hsw][i965][bisected] dEQP-GLES3.functional.shaders.texture_functions.texture.sampler2darrayshadow_vertex failing
|
||
- ci: Explicitly test shader caching
|
||
- radv: RoTR fails on Raven APU
|
||
- Tungsten Graphics links in Gallium docs
|
||
- Obs Studio Broken on Latest Mesa Git(Regression)(Bisected)
|
||
- Graphics corruption and GPU hang with RADV/LLVM
|
||
- old kernels (4.19) support in radv
|
||
- Elite Dangerous: Odyssey alpha crashes GPU on launch
|
||
- CSGO: Some default variables can cause problems with trust mode
|
||
- freedreno: dEQP-GLES3.functional.fence_sync.client_wait_sync_finish flakes
|
||
- glxclient.h:56:10: fatal error: 'loader.h' file not found
|
||
- mesa git started to break wine + UnrealTournament.exe (old dx6 game)
|
||
- SuperTuxKart artifacting on RK3399
|
||
- [amdgpu]: Golf With Your Friends (431240): ERROR Waiting for fences timed out
|
||
- don't flush for each blit/grid
|
||
- No sRGB capable visuals/fbconfigs reported in glx
|
||
- Documentation request: AMD debug variables
|
||
- docs: versions is out-of-date
|
||
- Strange results when trying to read from VK_FORMAT_R64_SFLOAT in compute shader
|
||
- anv: dEQP-VK.binding_model.buffer_device_address.set3.depth3.basessbo.convertcheck* slow
|
||
- Iris doesn't support INTEL_performance_query anymore
|
||
- [iris][bisected][regression] dEQP-GLES3.functional.texture.specification.teximage2d* failures
|
||
- RADV: TRUNC_COORD breaks gather operations
|
||
- [RADV] corruption in avatar after dying in Heroes of the Storm
|
||
- drm-shim build error with glibc 2.33
|
||
- Metro Exodus crashing due to memory overflow
|
||
- Sauerbraten shader rendering broken on RV530 (r300g)
|
||
- texture glitches on CS:GO on Tiger Lake
|
||
- Incorrect texture blitting/mapping when running Unigine-Heaven 4.0 on ADL-S / TGL-H, TGL-U
|
||
- Build fail due to "parameter name omitted" on Gallium Nine
|
||
- v3dv: DiligentEngine fail to run with SRGB swapchain
|
||
- Non-DRI builds broken by recent cleanups in Mesa core
|
||
- Cinnamon core dump after installing latest oibaf mesa build (165a69d2)
|
||
- yuv sampler lowering regression
|
||
- clover llvm build failure ‘class llvm::VectorType’ has no member named ‘getNumElements’;
|
||
- Possible corruption for apps using multiple Z-buffers on TGL
|
||
|
||
|
||
Changes
|
||
-------
|
||
|
||
Aaron Liu (1):
|
||
|
||
- amd: add Yellow Carp support
|
||
|
||
Abel García Dorta (2):
|
||
|
||
- i915g: add HW atomic counters as unsupported
|
||
- i915g: fix implicit fallthrough
|
||
|
||
Adam Jackson (53):
|
||
|
||
- Revert "glx: Lift sending the MakeCurrent request to top-level code"
|
||
- gallium/xlib: Fix for recent gl_config changes
|
||
- glx/drisw: Enable GLX_ARB_create_context_no_error
|
||
- glx: Remove silly __glXGetGLVersion() indirection
|
||
- glx: Remove some truly ancient unused code
|
||
- glx: Remove major/minor version tracking from extension table
|
||
- glx: Mark GLX_{ATI_pixel_format_float,NV_float_buffer} as supported
|
||
- glx: Remove some non-functional GL extension from the table
|
||
- glx: Generalize __glXGetStringFromTable a little
|
||
- glx: Remove redundant client_support field from extension table
|
||
- glx: Enable pure-client-library extensions explicitly
|
||
- glx: Fold client_gl_only array into its one real user
|
||
- glx: Remove some ancient backwards-compatibility typedefs
|
||
- zink: Learn about VK_KHR_swapchain
|
||
- zink: Fix format query for minmax reduction support
|
||
- gallium: Fix PIPE_BIND_SAMPLER_REDUCTION_MINMAX definition to be unique
|
||
- dri: Try harder to infer the drawable fbconfig if needed
|
||
- glx: Return the right GLX opcode in synthetic MakeCurrent errors
|
||
- glx: s/dri_message/glx_message/
|
||
- glx: Add and use DebugMessageF convenience macro
|
||
- glx: Convert undocumented LIBGL_DIAGNOSTIC to LIBGL_DEBUG=verbose
|
||
- glx: Remove unused debugging printfs
|
||
- glx: Implement GLX_EXT_no_config_context
|
||
- glx: Stop pretending the GLX major number isn't 1
|
||
- glx: Stop force-enabling extensions "implied" by GLX 1.3
|
||
- glx: Simplify glXIsDirect
|
||
- glx: Remove warn-once-ery around GLX 1.3 functions
|
||
- glx: Remove unused opcode argument to __glX{Get,QueryServer}String
|
||
- glx: Stash a copy of the XExtCodes in the glx_display
|
||
- glx: Simplify some overuse of GetGLXScreenConfigs
|
||
- glx: Move server GLX vendor and version strings to glx_screen
|
||
- glx: s/Display \*/struct glx_display \*/ over internal API
|
||
- glx: Remove some dead declarations from glxclient.h
|
||
- Revert "glx: s/Display \*/struct glx_display \*/ over internal API"
|
||
- include: Remove unused i810_pci_ids.h
|
||
- i915c: Add a symlink for i830_dri.so
|
||
- mesa: Remove unused _mesa_{create,destroy}_visual
|
||
- mesa: Ignore the depth buffer when computing framebuffer floatness
|
||
- gallium: Reset attachments to ST_ATTACHMENT_INVALID when revalidating
|
||
- format/fxt1: Clean up fxt1_variance's argument list
|
||
- mesa: s/malloc/calloc/ to silence a warning
|
||
- gallium: Remove unused st_visual::render_buffer
|
||
- gallivm: Fix a signature mismatch warning
|
||
- zink/ntv: Don't call free() on ralloc'd memory
|
||
- gallium/dri: Remove unused dri_drawable::drisw_surface
|
||
- drisw: Don't bzero displaytarget pixels
|
||
- intel: properly constify isl_format_layouts
|
||
- classic/xlib: Fix the build after !9817
|
||
- swrast: Fix a warning from gcc 11
|
||
- loader/dri3: Properly initialize the XFIXES extension
|
||
- loader/dri3: Don't churn through xfixes regions in SwapBuffers
|
||
- vl/dri3: Don't leak regions on the X server
|
||
- meson: Make prefer-{crocus,iris} always take effect
|
||
|
||
Alejandro Piñeiro (30):
|
||
|
||
- v3dv/debug: print correct stage name
|
||
- v3dv/debug: use gl stage when checking debug flag
|
||
- v3dv/pipeline: track descriptor maps per stage, not per pipeline
|
||
- v3dv: remove custom icd json generation
|
||
- v3dv: move extensions table to v3dv_device
|
||
- v3dv: don't use typedef enum with broadcom stages
|
||
- v3dv: remove unused v3dv_zs_buffer_from_vk_format
|
||
- broadcom/compiler: use proper type field for atomic operations
|
||
- v3d/simulator: capture hub interrupts
|
||
- v3d/simulator: add a cache flush mode enum
|
||
- v3d/simulator: wait for cache flushes
|
||
- v3d/simulator: use the proper register when waiting on a CSD submit
|
||
- v3d/simulator: use BFC/RFC registers to wait for bin/render to complete
|
||
- broadcom/common: move v3d_tiling to common
|
||
- v3d/simulator: hw mem is now an v3d_size_t, typedef to uint32_t
|
||
- v3d/simulator: get rid of has_gca wrapper
|
||
- v3dv: rename v3dv_pack for v3dvx_pack
|
||
- v3dv/cmd_buffer: add helper job_emit_binning_prolog
|
||
- v3dv/cmd_buffer: move cl_emit calls for Draw methods to helpers
|
||
- v3dv: start to move and wrap hw-version code with v3dv_queue
|
||
- v3dv: split v3dv_pipeline hw version dependant code to a new source file
|
||
- v3dv: split v3dv_image hw version dependant code to a new source file
|
||
- v3dv: split v3dv_format hw version dependant code to a new source file
|
||
- v3dv: split v3dv_device hw version dependant code to a new source file
|
||
- v3dv: move several hw version dependant code to their own (v3dvx) source file
|
||
- v3dv: split v3dv_descriptor hw version dependant to a new source file.
|
||
- v3dv: don't use cl_packet_length for prepacked data
|
||
- v3dv: remove gen-dependant includes from v3dv_private
|
||
- v3dv/build: meson infrastructure for multi-hw-version support
|
||
- v3dv/format: expose properly that some formats are not filterable
|
||
|
||
Alexander Monakov (2):
|
||
|
||
- freedreno/drm-shim: pretend to offer DRM 1.6.0
|
||
- freedreno/drm-shim: keep GEM buffers page-aligned
|
||
|
||
Alexander Shi (1):
|
||
|
||
- mesa: texparam: Add a clamping macro to handle out-of-range floats returned as integers.
|
||
|
||
Alexey Nurmukhametov (1):
|
||
|
||
- tu/kgsl: Fix file descriptor double close
|
||
|
||
Alyssa Rosenzweig (668):
|
||
|
||
- nir: Update some comments referring to imov
|
||
- panfrost: Don't allow_forward_pixel_to_kill for Z/S blit
|
||
- panfrost: Set allow_forward_pixel_to_be_killed for blit
|
||
- panfrost: Set clean_fragment_write for blits
|
||
- panfrost: Invert blend_reads_dest logic
|
||
- panfrost: Don't allow FPK if a RT is missing
|
||
- panfrost: Allow FPK when there are no side effects
|
||
- panfrost: Keep Bifrost blendable -> pixel in table
|
||
- panfrost: Specialize blendable formats for sRGB
|
||
- panfrost: Simplify format_to_bifrost_blend prototype
|
||
- panfrost: Drop blendable format accessor
|
||
- panfrost: Always pick dithered tb formats
|
||
- panfrost: Remove padded unorm blendable formats
|
||
- docs/macos: Explain Apple GLX versus OSMesa on macOS
|
||
- nir/lower_fragcolor: Use shader_instructions_pass
|
||
- nir/lower_fragcolor: Handle fp16 outputs
|
||
- panfrost: Fix formats converting uninit from AFBC
|
||
- nir/lower_fragcolor: Fix driver_location assignment
|
||
- nir/lower_fragcolor: Take max cbufs as argument
|
||
- d3d12: Switch to nir_lower_fragcolor
|
||
- util/bitset: Add BITSET_COUNT helper
|
||
- nir: Add fsin_agx opcode
|
||
- asahi: Stub command-line compiler for AGX G13B
|
||
- agx: Add opcode descriptions as Python
|
||
- agx: Generate opcode list
|
||
- agx: Generate runtime-accessible opcode table
|
||
- agx: Generate builder routines
|
||
- agx: Stub NIR backend compiler
|
||
- agx: Remap varyings to match AGX ABI
|
||
- agx: Stub control flow walking
|
||
- agx: Stub NIR instruction iteration
|
||
- agx: Stub emit_intrinsic
|
||
- agx: Implement load_const as mov
|
||
- agx: Implement direct st_vary
|
||
- agx: Add agx_alu_src_index helper for emit_alu
|
||
- agx: Implement vec2/vec3/vec4 ops
|
||
- agx: Implement fragment_out
|
||
- agx: Add instruction printing
|
||
- agx: Add a trivial register allocator
|
||
- agx: Add instruction packing
|
||
- agx: Add packing for memory loads/stores
|
||
- agx: Add st_vary(_final) instruction packing
|
||
- agx: Terminate programs with stop and traps
|
||
- agx: Implement ld_vary
|
||
- agx: Implement simple floating point ops
|
||
- agx: Implement fsin/fcos
|
||
- agx: Add 8-bit AGX minifloat routines
|
||
- agx: Add minifloat tests
|
||
- agx: Implement native float->int conversions
|
||
- agx: Implement native int->float conversions
|
||
- agx: Add bitwise operations
|
||
- agx: Add iadd/imad integer arithmetic
|
||
- agx: Add saturated integer add/subtract support
|
||
- agx: Add 32-bit bitwise shifts
|
||
- agx: Add forward optimizing pass for fmov
|
||
- agx: Add dead code eliminator
|
||
- agx: Propagate fmov backwards as well
|
||
- agx: Propagate immediates
|
||
- agx: Implement limited case of i2i16/i2i32 as iadd
|
||
- agx: Add sysval management helper
|
||
- agx: Implement load_ubo/kernel_input
|
||
- agx: Set flag on last st_vary instruction
|
||
- agx: Lower load_attr to device memory accesses
|
||
- agx: Implement vertex_id
|
||
- agx: Add agx_tex_dim helper
|
||
- agx: Emit texture ops
|
||
- agx: Pack texture ops
|
||
- agx: Add min/max support
|
||
- agx: Support 1-bit booleans
|
||
- agx: Implement b2f
|
||
- agx: Add b2i implementation
|
||
- agx: Pack cmpsel
|
||
- agx: Support bcsel
|
||
- asahi: Add hexdump utility
|
||
- asahi: Add command buffer XML definitions
|
||
- asahi: Add allocation data structure
|
||
- asahi: Add a GenXML fork
|
||
- asahi: Add (clean room) IOKit uABI header
|
||
- asahi: Add command buffer decode helpers
|
||
- asahi: Add tiling routines
|
||
- asahi: Add device abstraction
|
||
- asahi: Add pool data structure
|
||
- asahi: Add uniform upload routines
|
||
- asahi: Add some magic IOGPU routines
|
||
- asahi: Add vertex formats table
|
||
- asahi: Add Gallium driver
|
||
- nir/opcodes: Reword confusing comment
|
||
- pan/bi: Add missing sr_count to pseudo-atomics
|
||
- pan/bi: Don't reference uninit source in ATOM_C1
|
||
- pan/bi: Add simple constant folding pass
|
||
- pan/bi: Don't reference nir_lower_mediump_outputs
|
||
- pan/bi: Simplify Python expression
|
||
- pan/bi: Union modifiers from across variants
|
||
- pan/bi: Support 16-bit load_interpolated_input
|
||
- pan/bi: Emit int CSEL instead of float by default
|
||
- pan/bi: Implement vectorized f32_to_f16
|
||
- pan/bi: Fix 16-bit fsat
|
||
- pan/bi: Improve assert for vector size errors
|
||
- pan/bi: Implement vectorized int downcasts
|
||
- pan/bi: Fix loads and stores smaller than 32 bits
|
||
- pan/bi: Lower swizzles on CLPER
|
||
- pan/bi: Add and use bi_negzero helper
|
||
- pan/bi: Don't schedule clamps to +FADD.v2f16
|
||
- pan/bi: Workaround \*V2F32_TO_V2F16 erratum
|
||
- panfrost: Don't unroll loops in GLSL
|
||
- panfrost: Remove old dEQP workaround
|
||
- pan/bi: Track dual-src blend type
|
||
- pan/bi: Handle different sizes of LD_TILE
|
||
- pan/bi: Add single-component 8-bit mkvec lowering
|
||
- pan/bi: Handle swizzles in i2i8
|
||
- pan/bi: Lower 8-bit fragment input
|
||
- panfrost: Make comment less confusing
|
||
- panfrost: Support alpha_to_one
|
||
- panfrost: Minor cleanup of blend CSO
|
||
- panfrost: Don't clobber RT0 if RTn is disabled
|
||
- pan/lower_blend: Clean up type size handling
|
||
- pan/lower_blend: Use NIR helpers
|
||
- pan/lower_blend: Rename is_bifrost->scalar
|
||
- panfrost/blend: Fix outdated comments
|
||
- panfrost/blend: Workaround a v7 implementation-detail
|
||
- panfrost/blend: Distribute to_c_factor
|
||
- panfrost/blend: Prepare for lower_fragcolor
|
||
- panfrost: Call nir_lower_fragcolor based on key
|
||
- panfrost: Assume lower_fragcolor has been called
|
||
- panfrost/lower_framebufffer: Don't use i2imp
|
||
- pan/blend: Emit explicit conversions for all types
|
||
- panfrost: Key blend shaders to the input types
|
||
- pan/mdg: Hide units behind MIDGARD_MESA_DEBUG=verbose
|
||
- pan/mdg: More concise RMU name
|
||
- pan/mdg: Don't print zero shifts
|
||
- pan/mdg: Suppress most attribute tables
|
||
- pan/mdg: Don't print explicit .rte
|
||
- pan/mdg: Don't print mem addr brackets
|
||
- pan/mdg: Reduced printed parens
|
||
- pan/mdg: Don't print zero
|
||
- pan/bi: Add imm_uintN helper
|
||
- pan/bi: Handle integer min/max ourselves
|
||
- pan/bi: Handle ineg
|
||
- pan/bi: Handle b2f ourselves
|
||
- pan/bi: Handle b2i8/16
|
||
- pan/bi: Track scalarness of 16-bit ALU
|
||
- pan/bi: Don't swizzle scalars
|
||
- pan/bi: Switch to 1-bit bools
|
||
- pan/bi: Use nir_lower_to_bit_size
|
||
- pan/mdg: Use _output_ type for outmod printing
|
||
- pan/mdg: Remove midgard_opt_copy_prop_reg
|
||
- pan/mdg: Enable nir_opt_{move, sink}
|
||
- panfrost/blend: Inline blend constants
|
||
- pan/mdg: Model blend shader interference
|
||
- panfrost: Fix typo handling blend types
|
||
- pan/bi: Change swizzled scalars to identity
|
||
- pan/bi: Adapt branching for 1-bit bools
|
||
- pan/bi: Handle make_vec with 1-bit bools
|
||
- pan/bi: Temporarily switch back to 0/~0 bools
|
||
- pan/bi: Enable NIR vectorization
|
||
- pan/bi: Fix int<-->float size converts
|
||
- pan/bi: Copyprop constants
|
||
- pan/bi: Garbage collect bifrost_nir.h
|
||
- pan/bi: Enable mediump BLEND lowering
|
||
- panfrost: Enable 16-bit support on Bifrost
|
||
- pan/lower_framebuffer: Fix bitsize mismatch
|
||
- nir: Add blend lowering pass
|
||
- panfrost: Use common blend lowering
|
||
- nir/divergence_anlysis: Add intrinsics for Bifrost
|
||
- pan/bi: Drop load_sampler_lod_parameters_pan
|
||
- pan/bi: Map load_subgroup_invocation to FAU
|
||
- pan/bi: Add "lanes per warp" accessor
|
||
- pan/bi: Add divergent intrinsic lowering pass
|
||
- asahi: Translate blend CSO to lower_blend options
|
||
- asahi: Augment Gallium key with blend state
|
||
- asahi: Call nir_lower_blend with selected key
|
||
- asahi: Garbage collect bind_state
|
||
- asahi: Implement set_blend_color
|
||
- asahi: Add blend constant system value
|
||
- asahi: Call nir_lower_fragcolor
|
||
- asahi: Fix shader key hash function
|
||
- asahi: Pass through "reads tilebuffer?" bit
|
||
- agx: Return agx_instr* from emit_intrinsic
|
||
- agx: Implement blend constant color sysvals
|
||
- agx: Rename blend -> st_tile
|
||
- agx: Add ld_tile opcode
|
||
- agx: Assume lower_fragcolor has been called
|
||
- agx: Condition writeout ops on already being emitted
|
||
- agx: Implement load_output
|
||
- agx: Set reads_tib appropriately
|
||
- panfrost: Drop panfrost_fence in favour of pipe_fence_handle
|
||
- docs: Simplify now that kmsro is autoenabled
|
||
- pan/bi: Add first_vertex to vertex ID
|
||
- panfrost: Track buffers needing resolve
|
||
- panfrost: Set discard based on the resolve set
|
||
- panfrost: Implement framebuffer invalidation
|
||
- panfrost: Hide CAP_INT16 behind is_deqp
|
||
- panfrost: Don't translate compare funcs
|
||
- panfrost: Remove spurious assignment
|
||
- panfrost: Clean up cases for emit_fbd
|
||
- panfrost: Don't upload empty push uniform table
|
||
- pan/mdg: Use smaller LD_UNIFORM instructions
|
||
- ci: Build asahi in meson-gallium job
|
||
- panfrost: Fix major flaw in BO cache
|
||
- panfrost: Drop random #define
|
||
- panfrost: Use natural shader limits
|
||
- panfrost: Make clear which limits are arbitrary
|
||
- panfrost: Garbage collect comment
|
||
- panfrost: Shorten iffy comment
|
||
- pan/mdg: Remove unused midgard_int_alu_op_prefix
|
||
- pan/mdg: Fix output types for scalar fields
|
||
- pan/mdg: Fix spills to TLS
|
||
- pan/mdg: Set lower_uniforms_to_ubo
|
||
- panfrost: Add unowned mode to pan_pool
|
||
- panfrost: Label all BOs in userspace
|
||
- panfrost: Label pools
|
||
- panfrost: Make pool slab size configurable
|
||
- panfrost: Add reference type for unowned pool
|
||
- panfrost: Pool shaders
|
||
- panfrost: Pool texture views
|
||
- panfrost: Reduce blitter pool size
|
||
- panfrost: Fix blending for unbacked MRT
|
||
- panfrost: Fix the reads_dest prototype
|
||
- panfrost: Fix is_opaque prototype
|
||
- panfrost: Fix blend constant fetch prototype
|
||
- panfrost: Fix blend fixed-function prototype
|
||
- panfrost: Fix pan_blend_to_fixed_function_equation prototype
|
||
- panfrost: Move blend properties to CSO create
|
||
- panfrost: Translate fixed-function blend at CSO create
|
||
- panfrost: Garbage collect Gallium blend includes
|
||
- panfrost: Pack blend equations at CSO create time
|
||
- panfrost: Distribute out constant colour code
|
||
- panfrost: Simplify blend_final
|
||
- panfrost: Pass batch to panfrost_get_blend
|
||
- panfrost: Streamline fixed-function get_blend path
|
||
- panfrost: Remove unused dither flag
|
||
- panfrost: Split Bifrost BLEND emit by word
|
||
- panfrost: Precompute bifrost_blend_type_from_nir
|
||
- panfrost: Add draw-time merge helper
|
||
- panfrost: Prepack partial RSD at compile time
|
||
- panfrost: Move depth/stencil/alpha to CSO create
|
||
- panfrost: Preset evaluate_per_sample
|
||
- panfrost: Correct the type of sample_mask
|
||
- panfrost: Fill out the rasterizer CSO
|
||
- panfrost: Move early-z decision earlier
|
||
- panfrost: Streamline the !fs_required case
|
||
- panfrost: Hoist allow_forward_pixel_to_be_killed
|
||
- panfrost: Partially determine FPK state
|
||
- panfrost: Distribute masks for FPK selection
|
||
- panfrost: Pull erratum workaround into own function
|
||
- panfrost: Hoist part of shader_reads_tilebuffer
|
||
- panfrost: Pack draw-time RSD all-at-once
|
||
- panfrost: Move batch_set_requirements to the CSO
|
||
- panfrost: Deduplicate some code from indirect/direct draws
|
||
- panfrost: Pass batch to panfrost_get_index_buffer_bounded
|
||
- panfrost: Remove silly assertion
|
||
- panfrost: Mark job_index > 10000 as unlikely
|
||
- panfrost: Simplify panfrost_bind_sampler_states
|
||
- panfrost: Express viewport in terms of the batch
|
||
- asahi: Set PACKED_STREAM_OUTPUT
|
||
- glsl: Fix subscripted arrays with no XFB packing
|
||
- glsl: Fix packing of matrices for XFB
|
||
- panfrost: Streamline varying linking code
|
||
- panfrost: Define dirty tracking flags
|
||
- panfrost: Add the usual clean/dirty helpers
|
||
- panfrost: Dirty all state when batch is set
|
||
- panfrost: Dirty track RSDs
|
||
- panfrost: Dirty track textures/samplers
|
||
- panfrost: Dirty track viewport descriptor
|
||
- panfrost: Dirty track fragment images
|
||
- panfrost: Add PAN_MESA_DEBUG=dirty option
|
||
- panfrost/ci: Disable GLES2 jobs when we run GLES3
|
||
- panfrost/ci: Disable G72 jobs for now
|
||
- panfrost/ci: Split rules by ISA
|
||
- ci: Condition ppc64-el on specific drivers
|
||
- ci: Condition s390x on specific drivers
|
||
- panfrost: Only link varyings once in good conditions
|
||
- panfrost: Lower max inputs again
|
||
- panfrost: Abort on faults in SYNC mode
|
||
- panfrost: Remove minimal mode
|
||
- panfrost: Increase tiler_heap max allocation to 64MB
|
||
- panfrost/ci: Disable terrain trace
|
||
- panfrost/ci: Remove reference to dated flag
|
||
- panfrost/ci: Run jobs with PAN_MESA_DEBUG=sync
|
||
- panfrost: Add Message Preload descriptor XML
|
||
- panfrost: Add message preload to pan_shader_info
|
||
- panfrost: Inline pan_prepare_shader_descriptor
|
||
- panfrost: Don't take ctx in panfrost_shader_compile
|
||
- panfrost: Expose PIPE_CAP_SHAREABLE_SHADERS
|
||
- asahi: Fix meson.build definition to depend on agx_pack.h
|
||
- agx: Drop cmdline version back to ES3.0
|
||
- agx: Pack ld_var Dx
|
||
- agx: Enable 1-bit load_const
|
||
- agx: Implement boolean mov
|
||
- agx: Track current_block
|
||
- agx: Track block offsets
|
||
- agx: Add nest field to IR
|
||
- agx: Add invert_cond (ccn) to IR
|
||
- agx: Add branch target to IR
|
||
- agx: Add inner loop nesting count field
|
||
- agx: Model control flow instructions
|
||
- agx: Model pop_exec
|
||
- agx: Add push_exec alias
|
||
- agx: Pack control flow instructions
|
||
- agx: Model jump instructions
|
||
- agx: Fix up branch offsets at pack time
|
||
- agx: Implement emit_if the simplest way
|
||
- agx: Optimize out empty else blocks
|
||
- agx: Implement loops in the simplest way
|
||
- agx: Add break/continue support
|
||
- agx: Zero r0l before first use of control flow
|
||
- asahi: Fix scissor descriptor definition
|
||
- asahi: Add "set scissor" command
|
||
- asahi: Add scissor enable bit
|
||
- asahi: Defer viewport pack
|
||
- asahi: Dirty track viewport descriptor
|
||
- asahi: Track scissor states
|
||
- asahi: Mark scissor dirty if rast->scissor changes
|
||
- asahi: Skip draws if the scissor culls everything
|
||
- agx: Add scissor upload BO
|
||
- asahi: Expose PIPE_CAP_CLIP_HALFZ
|
||
- asahi: Add unknown bits seen with the GL driver
|
||
- asahi: Enable depth culling
|
||
- asahi: Update viewport descriptor depth fields
|
||
- asahi: Implement scissors and scissor to viewport
|
||
- asahi: Fix off-by-one in viewport scissoring
|
||
- asahi: Implement wide lines
|
||
- asahi: Determine tiling vs linear for internal textures
|
||
- asahi: Use dt_stride for line_stride where needed
|
||
- asahi: Add layout enum to XML
|
||
- asahi: Translate layouts for texture and RTs
|
||
- asahi: Identify line stride in texture/RT XML
|
||
- asahi: Respect linear strides
|
||
- asahi: Handle linear display targets as well as tiled
|
||
- asahi: Note that "render target" lacks an sRGB bit
|
||
- asahi: Align strides to 16 bytes
|
||
- asahi: Print unknown enum values
|
||
- asahi: Add format enums
|
||
- asahi: Hide pixel formats behind an opaque type
|
||
- asahi: Scaffold format table
|
||
- asahi: Use pixel table in is_format_supported
|
||
- asahi: Respect render target format swizzle
|
||
- asahi: Add ETC2 formats to table
|
||
- asahi: Add "hacks for dEQP" flag
|
||
- asahi: Lift streamout scaffolding from Panfrost
|
||
- asahi: Fake CAPs for ES3 with AGX_MESA_DEBUG=deqp
|
||
- asahi: Flesh out the formats table
|
||
- asahi: Allow half-float vertex buffers
|
||
- asahi: Make data_valid a bitset to save memory
|
||
- asahi: Abort on blit()
|
||
- asahi: Add mipmapping state to the XML
|
||
- asahi: Set levels in texture descriptor
|
||
- asahi: Allocate slices for mipmapping
|
||
- panfrost: Update comment
|
||
- panfrost: Shrink pan_draw_mode return type
|
||
- panfrost: Add draw parameters dirty flags
|
||
- panfrost: Analyze sysval dirty flags
|
||
- panfrost: Dirty track constant buffers
|
||
- panfrost: Don't allocate empty varying buffer
|
||
- panfrost: Dirty track stack sizes
|
||
- panfrost: Write translate_index_size better
|
||
- panfrost: Minor changes to draw_vbo
|
||
- panfrost: Bubble up errors
|
||
- panfrost: Elucidate thread group split field
|
||
- panfrost: Eliminate reserve_* functions
|
||
- panfrost/ci: Report flakes on IRC
|
||
- vc4: Use Rn_UINT instead of In_UINT for index buffers
|
||
- v3d: Use Rn_UINT instead of In_UINT for index buffers
|
||
- etnaviv: Use Rn_UINT instead of In_UINT for index buffers
|
||
- freedreno: Use Rn_UINT instead of In_UINT for index buffers
|
||
- lima: Use Rn_UINT instead of In_UINT for index buffers
|
||
- si: Use Rn_UINT instead of In_UINT for index buffers
|
||
- docs/gallium: Document the index buffer format convention
|
||
- nir: Add nir_intrinsic_load_back_face_agx
|
||
- asahi: Mark special fragment inputs as sysvals
|
||
- agx: Model get_sr
|
||
- agx: Generate enums from Python
|
||
- agx: List sr enum in Python
|
||
- agx: Pack SR immediate
|
||
- agx: Lower front face to back face
|
||
- agx: Handle load_back_face_agx
|
||
- ci: Disable the iris APL jobs
|
||
- nir/lower_fragcolor: Avoid redundant load_output
|
||
- pan/bi: Pull out bi_count_write_registers
|
||
- pan/bi: Use TEXS_2D for rect textures
|
||
- pan/bi: Simplify TEXC codegen for sr_count=0
|
||
- pan/bi: Fix bi_rewrite_passthrough ordering
|
||
- pan/bi: Bundle after RA
|
||
- pan/bi: Add post-RA optimizer
|
||
- pan/bi: Track liveness while scheduling
|
||
- pan/bi: Allow IADD.u32 on FMA as \*IADDC
|
||
- pan/bi: Use explicit affinities in RA
|
||
- pan/bi: Inline spilling in RA
|
||
- pan/bi: Explicit zero reg_live_{in, out} when needed
|
||
- pan/bi: Model interference with preloaded regs
|
||
- pan/bi: Allow move/sink in blend shaders
|
||
- pan/bi: Don't restrict the register file in non-blend shaders
|
||
- pan/bi: Model +BLEND clobbering of r48
|
||
- pan/bi: Handle images in vertex shaders
|
||
- pan/bi: Lower loads with component > 0
|
||
- pan/bi: Lower stores with component != 0
|
||
- pan/bi: Lower 64-bit ints again
|
||
- pan/bi: Emit a dummy ATEST if needed
|
||
- pan/bi: Simplify spill code
|
||
- pan/bi: Track words instead of bytes in RA
|
||
- pan/bi: Don't allocate past the end of the reg file
|
||
- panfrost: Remove AFBC format fixups
|
||
- panfrost: Add missing 'Reverse issue order flag'
|
||
- panfrost: Disable AFBC on v7
|
||
- panfrost: Don't duplicate attribute buffers
|
||
- panfrost: Separate image attribute and buffer emit
|
||
- panfrost: Be explicit in image modifier handling
|
||
- panfrost: Use util_last_bit for images
|
||
- panfrost: Default indirect attributes to 1D type
|
||
- pan/indirect: Factor out is_power_of_two_or_zero
|
||
- pan/indirect_draw: Use unsigned comparisons
|
||
- pan/indirect_draw: Fix 1 instance, nonzero divisor
|
||
- panfrost: Correctly size varyings
|
||
- panfrost: Use varying format from frag shader
|
||
- pan/bi: Force u32 for flat varyings
|
||
- panfrost: Fix vertex image attribute overrun
|
||
- panfrost: Simplify compute_checksum_size formula
|
||
- panfrost: Fix crc_valid condition
|
||
- panfrost: Zero r_dimension for buffer textures
|
||
- panfrost: Add util_draw_indirect() debug path
|
||
- panfrost: Align NPOT divisor records
|
||
- panfrost: Fix src_offset data type
|
||
- panfrost: Make instancing code more obvious
|
||
- panfrost: Assert alignment of indirect records
|
||
- pan/mdg: Use consistent casing in midgard_print
|
||
- pan/mdg: Make -Wswitch happy
|
||
- pan/mdg: Stub memory_barrier{_image}
|
||
- panfrost: Clarify how fs_sidefx works with oq
|
||
- panfrost: Simplify Midgard blend disable
|
||
- panfrost: Don't force early-z with occlusion query
|
||
- panfrost: Respect early-Z force on Midgard
|
||
- pan/mdg: Fix units for SUBSAT
|
||
- pan/mdg: Handle {i,u}{add,sub}_sat
|
||
- pan/mdg: Update r1.w comment
|
||
- pan/mdg: Fix incorrect rewrite in Midgard scheduler
|
||
- panfrost: Mark 16/32_UNORM as non-renderable (v5)
|
||
- panfrost: Don't allocate WLS when not needed
|
||
- pan/mdg: Wire in PAN_SYSVAL_VERTEX_INSTANCE_OFFSETS
|
||
- pan/mdg: Lower away gl_VertexID offset
|
||
- pan/mdg: Use more accurate ld/st reg estimates
|
||
- pan/mdg: Don't skip unit-based checks in choose_instruction
|
||
- pan/mdg: Assert scheduled instructions are reasonable
|
||
- pan/mdg: Insert moves to load/store registers
|
||
- panfrost: Fix dirty state emission
|
||
- panfrost: Emulate indirect draws on Midgard
|
||
- panfrost: Add some missing BGRA formats
|
||
- panfrost: Remove scissor_culls_everything
|
||
- panfrost: Don't set a blend shader for no_colour
|
||
- panfrost: Allocate XFB buffers per-instance
|
||
- panfrost: Fix BUFFER image handling
|
||
- panfrost: Make image buffers robust
|
||
- panfrost: Lower max compute size
|
||
- panfrost: Set PIPE_COMPUTE_CAP_SUBGROUP_SIZE
|
||
- panfrost: Set PIPE_COMPUTE_CAP_MAX_THREADS_PER_BLOCK
|
||
- panfrost: Drop todo on PIPE_COMPUTE_CAP_IMAGES_SUPPORTED
|
||
- panfrost: Don't CRC mipmapped textures
|
||
- panfrost: Reduce pan_image_state indirection
|
||
- pan/indirect_dispatch: Indent NIR blocks
|
||
- pan/indirect_dispatch: Simplify empty command case
|
||
- pan/indirect_dispatch: Distinguish minus-1 defs
|
||
- pan/indirect_dispatch: Expand split expressions
|
||
- pan/indirect_dispatch: Use extracted values
|
||
- panfrost: Use direct dispatch with shared memory
|
||
- panfrost: Don't clobber indirect dispatch fields
|
||
- panfrost: Make data_valid a bitset
|
||
- panfrost: Remove pan_image_state
|
||
- panfrost: Set valid_buffer_range for GPU writes
|
||
- panfrost: Add XML for vertex/instance ID records
|
||
- panfrost: Clean up vertex/instance ID on Midgard
|
||
- panfrost: Flush everything for glMemoryBarrier
|
||
- panfrost: Flush before compute jobs
|
||
- panfrost: Set vertex job_barrier
|
||
- panfrost: Add "Cache Flush" job XML
|
||
- panfrost: Advertise GLES3.1
|
||
- pan/decode: Fix image attribute counting
|
||
- pan/decode: Handle cache flush jobs
|
||
- panfrost/ci: Blank G52 flakes file
|
||
- panfrost/ci: Don't skip SSBO tests on G52
|
||
- panfrost/ci: Do fractional dEQP-GLES31 run on Midgard
|
||
- docs/features: Mark GLES3.1 as done on Panfrost
|
||
- docs/panfrost: Update API versions
|
||
- pan/bi: Include modifier info in opcode table
|
||
- pan/bi: Move bi_word_node to common code
|
||
- pan/bi: Move typesize to common code
|
||
- pan/bi: Track instruction size in opcode table
|
||
- pan/bi: Handle fsat_signed and fclamp_pos
|
||
- pan/bi: Report tuples, not nops, in shader-db
|
||
- pan/bi: Propagate fabs/neg/sat
|
||
- pan/bi: Add back custom algebraic opts
|
||
- pan/bi: Fuse fclamp_pos and fsat_signed
|
||
- pan/bi: Schedule FCMP.v2f16 with abs modifier
|
||
- pan/bi: Fuse abs into FCMP/FMIN/FMAX.v2f16
|
||
- nir: Fix constant folding for irhadd/urhadd
|
||
- agx: Mark components as ASSERTED
|
||
- agx: Add agx_immediate_f helper
|
||
- agx: Add perspective bit to ld_var
|
||
- agx: Update ld_vary encoding mask
|
||
- agx: Add ld_vary_flat opcode
|
||
- asahi: Identify varying descriptor fields
|
||
- agx: Rename remap_varyings -> remap_varyings_vs
|
||
- agx: Implement nir_intrinsic_load_frag_coord
|
||
- agx: Implement ld_vary_flat
|
||
- agx: Rename agx_pack to agx_pack_binary
|
||
- agx: Remap fragment shader varyings explicitly
|
||
- asahi: Unify varying linking code with vertex shaders
|
||
- agx: Pull out agx_write_components
|
||
- agx: Add agx_exit_block helper
|
||
- agx: Add liveness analysis pass
|
||
- agx: Mark sources that kill
|
||
- agx: Count write registers, not components
|
||
- agx: Lift agx_block_add_successor from Panfrost
|
||
- agx: Track logical control flow graph
|
||
- asahi: Wire in tgsi_to_nir
|
||
- asahi: Fix random \*2
|
||
- asahi: Guard for overflow when packing
|
||
- asahi: Always flush when setting framebuffer state
|
||
- asahi: Handle Z16_UNORM textures
|
||
- asahi: Add zsbuf to batch
|
||
- asahi: Save zsbuf ptr
|
||
- asahi: Add internal (renderable) formats to the table
|
||
- asahi: Set fragment key for non-U8NORM render targets
|
||
- asahi: Implement colour buffer reloads
|
||
- asahi: Remove spurious assignment
|
||
- asahi: Remove spurious varying assignment
|
||
- asahi: Generalize varying linking
|
||
- asahi: Add ASAHI_MESA_DEBUG=no16 option
|
||
- agx: Fix 32-bit bitwise shifts
|
||
- agx: Fix LOD_MIN enum
|
||
- agx: Pack LOD descriptors
|
||
- agx: Fix lod_mode shift
|
||
- agx: Legalize LOD sources to be 16-bit only
|
||
- agx: Handle txl
|
||
- asahi: Fail on LOD clamps/bias
|
||
- asahi: Identify texture/sampler count fields
|
||
- asahi: Identify vertex texture/sampler counts
|
||
- asahi: Set vertex texture/sampler counts
|
||
- asahi: Track more Gallium state
|
||
- asahi: Wire in u_blitter
|
||
- asahi: Handle nonzero first_level
|
||
- asahi: Fix meson dependency on packing in compiler
|
||
- asahi: Prepack rasterizer faces
|
||
- asahi: Implement the stencil test
|
||
- asahi: Flush for accesses to Z/S buffer
|
||
- asahi: Comment on an embedded data structure
|
||
- asahi: Skip over holes in the vbufs
|
||
- asahi: Add XML for the attachment structure
|
||
- asahi: Sync attachment magic with asahi demo
|
||
- asahi: Parametrize software "command buffer" size
|
||
- asahi: Identify "command buffer" size field in map
|
||
- asahi: Move IOGPU header to XML
|
||
- asahi: Extend IOGPU header to contain encoder
|
||
- asahi: Use GenXML for main bind fragment
|
||
- asahi: Identify attachment length field
|
||
- asahi: Set data_valid for the depth buffer
|
||
- asahi: Enable primitive restart
|
||
- asahi: Use XML for interpolation packet
|
||
- panfrost: Express dependencies as resources, not BOs
|
||
- panfrost: Wrap occlusion query in pipe_resource
|
||
- panfrost: Split "flush writer" from "flush accessing"
|
||
- panfrost: Eliminate redundant flushes with AFBC
|
||
- panfrost: Add secondary shader XML fields
|
||
- pan/decode: Handle IDVS jobs on Bifrost
|
||
- agx: Don't choke on registers in the optimizer
|
||
- agx: Count read registers as well
|
||
- agx: Assign registers locally
|
||
- agx: Pipe in nir_register
|
||
- agx: Ensure we don't overallocate registers
|
||
- panfrost: Move draw_vbo to pan_cmdstream.c
|
||
- panfrost: Move most CSO creates to pan_cmdstream.c
|
||
- panfrost: Split out prepare_rsd into a vtbl
|
||
- panfrost: Move blend CSO to cmdstream/context
|
||
- panfrost: Don't ralloc panfrost_blend_state
|
||
- panfrost: Move launch_grid to pan_cmdstream
|
||
- panfrost: Move panfrost_emit_tile_map to pan_job
|
||
- panfrost: Use vtable for fragment descriptor functions
|
||
- panfrost: Clean up pan_cmdstream.h
|
||
- panfrost: Move sample accessor to pan_cmdstream
|
||
- panfrost: Remove pan_cmdstream.h
|
||
- panfrost: Remove unused midgard-pack.h includes
|
||
- docs: Update relnotes for panfrost/asahi
|
||
- pan/bi: Improve clause printing
|
||
- pan/bi: Fix skip/lod_mode aliasing with VAR_TEX
|
||
- pan/bi: Add bi_foreach_instr_global_rev_safe helper
|
||
- pan/bi: Pack staging_barrier for the -next- clause
|
||
- pan/bi: Try to hit full occupancy on v7
|
||
- pan/bi: Only spill nodes that could progress in RA
|
||
- pan/bi: Report cycle counts
|
||
- pan/bi: Track LOD mode even for TEXC
|
||
- pan/bi: Analyze helper invocations
|
||
- pan/bi: Fuse LD_VAR+TEXS_2D -> VAR_TEX
|
||
- pan/bi: Add a constant subexpression elimination pass
|
||
- pan/bi: Workaround widen restrictions on +FADD.f32
|
||
- pan/bi: Simplify cube map descriptor generation
|
||
- pan/bi: Comment the fexp2 implementation
|
||
- pan/bi: Factor out exp2/log2 code
|
||
- pan/bi: Don't lower fpow
|
||
- panfrost: Fix FPK enable condition
|
||
- panfrost: Add a performance counter dump utility
|
||
- panfrost: Don't set zs_update_operation in vertex shaders
|
||
- panfrost: Zero depth_source in vertex shaders
|
||
- panfrost: Query tiler features
|
||
- panfrost: Enable more tiler levels if we can
|
||
- panfrost: Generalize pan_blitter's reg count assert
|
||
- panfrost: Set register allocation in the v7 RSD
|
||
- asahi: Move fixed internal shaders to agx_blit.c
|
||
- asahi: Add missing copyright/guards for magic.c/h
|
||
- asahi: Remove unused bo_access property
|
||
- asahi/decode: Only dump mapped allocations
|
||
- asahi: Make track_free safer
|
||
- asahi/decode: Check fewer zeroes after a command buffer
|
||
- asahi: Reserve more space to stop a command buffer
|
||
- asahi: Identify more unknown fields in the memmap
|
||
- asahi/decode: Fix up high word
|
||
- asahi/decode: Handle CULL packets
|
||
- asahi/decode: Fix decoding of draw calls
|
||
- asahi: Allow specifying an encoder ID
|
||
- asahi: Allocate global IDs
|
||
- asahi: Consolidate some magic numbers
|
||
- asahi: Garbage collect senseless cmdbuf struct
|
||
- asahi/decode: Print clear/store pipelines
|
||
- asahi/decode: Print some IOGPU stuff
|
||
- asahi: Set bits in UNK11 needed for points
|
||
- asahi: Set point magic bit in rasterizer
|
||
- asahi: Set bit for psiz
|
||
- asahi: Lower PIPE_CAPF_MAX_POINT_WIDTH to hw limit
|
||
- asahi: Unpack varying descriptors (1x)
|
||
- asahi: Identify triangle/lines vs point varyings
|
||
- asahi: Handle point coordinates
|
||
- agx: Flip point coordinates because OpenGL
|
||
- panfrost: Inline flip_compare_func into pan_encoder.h
|
||
- panfrost: Move panfrost_vertex/instance_id to per-gen
|
||
- panfrost: Inline away pan_pool.c
|
||
- panfrost: Express pack_work_groups more concisely
|
||
- panfrost: Inline away pan_invocation.c
|
||
- panfrost: Assert that injected jobs are for blits
|
||
- panfrost: Inline panfrost_get_z_internal_format
|
||
- panfrost: Move arch-independent pan_format code
|
||
- panvk: Don't use panfrost_bifrost_swizzle
|
||
- panfrost: Remove panfrost_bifrost_swizzle
|
||
- panfrost: Add GenXML macros
|
||
- panfrost: Compile format table multiple times
|
||
- panfrost: Specialize blendable_formats for v6
|
||
- panfrost: Use smaller sizes in blend table
|
||
- panfrost: Give WLS Instances a default
|
||
- panfrost: Pin an architecture for blending
|
||
- panfrost: Use generic delete for ZSA
|
||
- panfrost: Remove reference to mali_blend_equation_packed
|
||
- panfrost: Avoid GenXML enum dependences
|
||
- panfrost: Remove pan_blitter integration
|
||
- panfrost: Init/destroy blitter from per-gen file
|
||
- panfrost: Only access blitter from per-gen
|
||
- pan/bi: Refuse to CSE non-SSA sources
|
||
- pan/bi: Make bi_foreach_instr_in_tuple safer
|
||
- pan/bi: Update ins->link after scheduling
|
||
- pan/bi: Do helper termination analysis on clauses
|
||
- pan/bi: Handle multiple destinations in scheduler
|
||
- pan/bi: Add bi_before_tuple convenience method
|
||
- pan/bi: Handle 4-src instructions in scheduler
|
||
- pan/bi: Calculate dependency graph when bundling
|
||
- pan/bi: Add a bundling heuristic
|
||
- panfrost: Fix format swizzles on G72
|
||
- targets/graw-xlib: Add missing dep_x11
|
||
- pan/mdg: Garbage collect silly quirk
|
||
- asahi: Fix sampler filtering flag
|
||
- agx: Fix mismatched units in load_ubo
|
||
- agx: Plug memory leak in register allocator
|
||
- pan/bi: Restrict swizzles on same cycle temporaries
|
||
- pan/bi: Remove incorrect errata workaround
|
||
|
||
Andres Gomez (25):
|
||
|
||
- ci: Uprev piglit to 9d87cc3d79e ("framework/replay: send backend's subprocess stderr to sys.stderr")
|
||
- ci: Add test which occasionally times out to lavapipe-vk skips
|
||
- ci: add xorg to the x86_test-vk container
|
||
- ci: allow starting xorg for piglit run
|
||
- ci: remove results directory content only with piglit runners
|
||
- ci: make sure we only read the first line from install/VERSION
|
||
- ci: update some radv trace checksums
|
||
- ci: update some radv trace checksums
|
||
- ci: update radv's trace job tag for Raven
|
||
- ci: remove radv's trace job for Polaris10
|
||
- ci: uprev apitrace to 10.0
|
||
- ci: uprev DXVK to 1.8.1
|
||
- ci: add radv's trace job for Navy Flounder
|
||
- ci: include VKD3D-Proton tests into the VK test container
|
||
- ci: add VKD3D-Proton testsuite runner
|
||
- ci: add VKD3D-Proton testsuite job for radv's Navy Flounder
|
||
- ci: disentangle tags for containers and artifacts produced by them
|
||
- ci: remove glslangValidator installation from the VK test container
|
||
- ci: replace glslangValidator with glslang-tools
|
||
- ci: fix the vkd3d-proton runner
|
||
- ci: build the hang-detection tool into x86_test-vk
|
||
- ci: update some radv trace checksums
|
||
- ci: bump x86_test-base tag
|
||
- ci: remove unzip from several containers that don't use it at all
|
||
- ci: use bash with download-git-cache.sh
|
||
|
||
Andrii Simiklit (1):
|
||
|
||
- Remove redundant assignment
|
||
|
||
Antonio Caggiano (15):
|
||
|
||
- panfrost: Fix invalid conversions
|
||
- panfrost: Meson dependency
|
||
- util: Perfetto SDK v15.0
|
||
- pps: Gfx-pps v0.3.0
|
||
- pps: Gfx-pps config tool
|
||
- pps: Documentation
|
||
- intel/perf: Extern C
|
||
- pps: Intel pps driver
|
||
- pps: Intel documentation
|
||
- ci: Add a manual job for tracking the performance of Freedreno
|
||
- panfrost: Counter definitions
|
||
- panfrost: Performance configuration
|
||
- panfrost: Fix pan_pool_ref construction
|
||
- pps: Panfrost pps driver
|
||
- pps: Panfrost documentation
|
||
|
||
Anuj Phogat (39):
|
||
|
||
- intel: Rename files with gen_debug prefix
|
||
- intel: Rename gen_debug prefix to intel_debug
|
||
- intel: Rename GEN_DEBUG prefix to INTEL_DEBUG
|
||
- intel: Rename intel_device_info.c to intel_dev_info.c
|
||
- intel: Rename gen_device prefix in filenames
|
||
- intel: Rename gen_device prefix to intel_device
|
||
- intel: Fix alignment and line wrapping due to gen_device renaming
|
||
- intel: Rename GEN_DEVICE prefix in macros to INTEL_DEVICE
|
||
- intel: Rename gen_get_device prefix to intel_get_device
|
||
- intel: Rename gen_get_aperture_size to intel_get_aperture_size
|
||
- intel: Drop gen prefix in gen_has_get_tiling()
|
||
- intel: Rename gen_context.h to intel_context.h
|
||
- intel: Rename gen_context prefix to intel_context
|
||
- intel: Rename gen_perf prefix in filenames to intel_perf
|
||
- intel: Rename gen_perf prefix to intel_perf in source files
|
||
- intel: Fix alignment and line wrapping due to gen_perf renaming
|
||
- intel: Rename GEN_PERF prefix to INTEL_PERF in build files
|
||
- intel: Rename GEN_PERF prefix to INTEL_PERF in source files
|
||
- intel: Rename gen_{pipeline, oa, counter, hw} to intel_{..}
|
||
- intel: Rename brw_gen_enum.h to brw_gfx_ver_enum.h
|
||
- intel: Rename gen enum to gfx_ver
|
||
- intel: Rename gen keyword in test_eu_validate.cpp
|
||
- intel: Rename gens keyword to gfx_vers
|
||
- intel: Rename index_gen keyword to index_ver
|
||
- intel: Rename eu compact instruction tests
|
||
- intel: Rename gen_{mapped, clflush, invalidate} prefix to intel_{..}
|
||
- intel: Remove devinfo_to_gen() helper function
|
||
- intel: Rename isl_to_gen keyword to isl_encode
|
||
- intel: Rename vk_to_gen keyword to vk_to_intel
|
||
- intel: Rename gen_10 to ver_10
|
||
- intel: Rename calculate_gen_slm_size to intel_calculate_slm_size
|
||
- intel: Rename _gen_{program, part, batch, freq} to _intel_{..}
|
||
- intel: Rename GEN_PART to INTEL_PART
|
||
- intel: Rename {i965, iris, anv, isl}_gen prefix in build files
|
||
- intel: Rename since_gen to since ver
|
||
- intel: Rename _gen keyword to _gfx_ver in few build files
|
||
- intel: Fix GEN_GEN macro checks
|
||
- intel/gfx12+: Add Wa_14013840143
|
||
- intel: Rename GFX 12.5 to XE_HP
|
||
|
||
Axel Davy (1):
|
||
|
||
- st/nine: Fix compilation error on non-x86 platforms
|
||
|
||
Bas Nieuwenhuizen (41):
|
||
|
||
- radv: Fix memory leak on descriptor pool reset with layout_size=0.
|
||
- amd/common: Use cap to test kernel modifier support.
|
||
- radv: Only require DRM 3.23.
|
||
- radeon/vcn: Use the correct pitch for chroma surface.
|
||
- nir: Add load_sbt_amd intrinsic.
|
||
- radv: Add sbt descriptors user SGPR input.
|
||
- aco: Add load_sbt_amd intrinsic implementation.
|
||
- radv: Use global BO list with raytracing.
|
||
- radv: Add support for RT bind point.
|
||
- radv: Add RT pipeline bind.
|
||
- radv: Implement vkCmdTraceRays.
|
||
- radv: Use correct border swizzle on GFX9+.
|
||
- nir: Add bvh64_intersect_ray_amd intrinsic.
|
||
- aco: Implement bvh64_intersect_ray_amd intrinsic.
|
||
- nir/lower_returns: Deal with single-arg phis after if.
|
||
- radv: Don't skip barriers that only change queues.
|
||
- radv: Actually return correct value for read-only DCC compressedness.
|
||
- radv: Allow DCC images to be compressed with foreign queues.
|
||
- gallium/vl: Use format plane count for sampler view creation.
|
||
- gallium/va: Add support for PRIME_2 import.
|
||
- radv: Use the global BO list for acceleration structures.
|
||
- radv: Add initial CPU BVH building.
|
||
- radv: Implement device-side BVH building.
|
||
- radv: Add acceleration structure descriptor set support.
|
||
- radv: Convert lower_intrinsics to a switch statement
|
||
- radv: Implement load_vulkan_descriptor for acceleration structures.
|
||
- radv: Expose formats for acceleration structure.
|
||
- radv: Add rt perftest flag.
|
||
- radv: Enable VK_KHR_acceleration_structure with RADV_PERFTEST=rt.
|
||
- radv: Add -Wpointer-arith.
|
||
- util/fossilize_db: Pull seek into lock.
|
||
- util/fossilize_db: Split out reading the index.
|
||
- util/fossilize_db: Do not lock the fossilize db permanently.
|
||
- util/fossilize_db: Only lock the db file, not the index.
|
||
- nir: Add lowered vendor independent raytracing intrinsics.
|
||
- nir: Add raytracing shader call lowering pass.
|
||
- meson: Bump libdrm for amdgpu to 2.4.107.
|
||
- radv/winsys: Return vulkan errors for buffer creation.
|
||
- radv/winsys: Add support for a fixed VA address for replay.
|
||
- radv: Support address capture and replay.
|
||
- ac/surface: Handle non-retiled displayable DCC correctly for modifiers.
|
||
|
||
Bastian Beranek (1):
|
||
|
||
- glx: Assign unique serial number to GLXBadFBConfig error
|
||
|
||
BillKristiansen (2):
|
||
|
||
- d3d12: Fixes stale context bindings after copy, resolve, and clear
|
||
- d3d12: Sets all SRV descriptors as data-static
|
||
|
||
Billy Laws (1):
|
||
|
||
- meson: Increase Android Platform SDK version limit
|
||
|
||
Boris Brezillon (60):
|
||
|
||
- panfrost: Don't advertise AFBC mods when the format is not supported
|
||
- panfrost: Reserve thread storage descriptor in panfrost_launch_grid()
|
||
- panfrost: Fix RSD emission on Bifrost v6
|
||
- panfrost: Fix indirect draws
|
||
- pan/bi: Don't set the EOS flag if there's at least one successor
|
||
- panfrost: Keep panfrost_batch_reserve_framebuffer() private
|
||
- panfrost: Fix ZS reloading on Bifrost v6
|
||
- pan/midg: Fix 2 memory leaks
|
||
- pan/bi: Expand pseudo instructions when nosched is set
|
||
- pan/midg: Fix midgard_pack_common_store_mask()
|
||
- pan/midg: Make sure the constant offset is in range in mir_match_iadd()
|
||
- panfrost: Make sure pack_work_groups_compute() is passed valid dimensions
|
||
- panfrost: Add helpers to emit indirect dispatch jobs
|
||
- panfrost: Hook-up indirect dispatch support
|
||
- panfrost: Only advertise INDIRECT_DRAW if the kernel supports HEAP BOs
|
||
- ci: Update to a kernel that has the panfrost MMU fixes
|
||
- panfrost/ci: Test GLES 3.1 on Bifrost
|
||
- panfrost/ci: Skip draw_indirect.compute_interop.large.*
|
||
- panfrost/ci: Run the full deqp-gles3 testsuite
|
||
- panfrost: Fix format definitions to match gallium expectations
|
||
- Revert "gallium/util: Fix depth/stencil blit shaders"
|
||
- panfrost: Pass an image view to panfrost_estimate_texture_payload_size()
|
||
- panfrost: Fix blit shader names
|
||
- panfrost: Pack pan_blit_surface fields
|
||
- panfrost: Get rid of the vertex_count arg in pan_preload_emit_varying()
|
||
- panfrost: Make pan_preload_emit_*_sampler() applicable to blits
|
||
- panfrost: Stop assigning ->position in pan_preload_emit_varying()
|
||
- panfrost: Make pan_preload_emit_*_textures() applicable to blits
|
||
- panfrost: Make pan_preload_emit_viewport() applicable to blits
|
||
- panfrost: Rename pan_preload_emit_varying()
|
||
- panfrost: Shrink the number of args passed to prepare_{bifrost,midgard}_rsd()
|
||
- panfrost: Don't select the blit shader fragout type twice
|
||
- panfrost: Stop assuming the viewport will always cover the framebuffer
|
||
- panfrost: Extend pan_blitter to support blit/resolve operations
|
||
- panfrost: Use pan_blit() when PAN_MESA_DEBUG=panblit
|
||
- panfrost: Split the indexed and !indexed indirect draw info structs
|
||
- pan/bi: Add support for gl_{BaseVertex,BaseInstance}
|
||
- pan/bi: Add support for gl_DrawID
|
||
- panfrost: Expose the DRAW_PARAMETERS cap on Bifrost
|
||
- panfrost: Flag indirect draw/dispatch shaders as internal
|
||
- panfrost: Relax the stride check when importing resources
|
||
- panfrost: Try to align scanout resource stride on 64 bytes
|
||
- panfrost: Don't freeze blit batches
|
||
- panfrost: Avoid duplicate entries in access->readers
|
||
- panfrost: Simplify the dependency tracking logic
|
||
- panfrost: Limit the number of active batch to 32
|
||
- ci: Update to a new kernel fixing a bug in the panfrost driver
|
||
- panfrost: Constify the constants pointers passed to pan_blend functions
|
||
- panfrost: Make panfrost_scoreboard_initialize_tiler() return the job pointer
|
||
- pan/midg: Add a flag to dump internal shaders
|
||
- panfrost: Add a Vulkan driver for Midgard/Bifrost GPUs
|
||
- panfrost: Don't add blit context BOs twice
|
||
- panfrost: Pass a memory pool to pan_blit_ctx_init()
|
||
- panfrost: Add alignment info to write-value and cache-flush jobs
|
||
- panfrost: Allocate WRITE_VALUE jobs with panfrost_pool_alloc_desc()
|
||
- panvk: Use the desc alloctor when we can
|
||
- panfrost: Start splitting the panfrost pool logic
|
||
- panvk: Support returning BOs allocated by panvk_pool to a 'free BO' pool
|
||
- panfrost: Replace the batch->bos hashmap by a sparse array
|
||
- panfrost: Do tracking of resources, not BOs
|
||
|
||
Boyuan Zhang (8):
|
||
|
||
- vl: add st_rps_bits for HEVC decode
|
||
- frontends/va: get st_rps_bits from VA pic param hevc
|
||
- frontends/vdpau: disable UseStRpsBits for vdpau hevc
|
||
- radeon/vcn: enable parsing support for st_rps_bits
|
||
- frontends/omx: use pipe buffer map instead of texture map
|
||
- radeon/vcn: move calc_dpb_size into create_decoder
|
||
- radeon/vcn: allocate non-tmz context buffer for VCN2+
|
||
- radeon/vcn: use st_rps_bits only when it's set
|
||
|
||
Caio Marcelo de Oliveira Filho (25):
|
||
|
||
- spirv: Don't replicate patch bool in vtn_variable
|
||
- nir: Remove now unnecessary conditions from emit_load/store helpers
|
||
- intel/compiler: Add common function for CS dispatch info
|
||
- iris: Use brw_cs_get_dispatch_info()
|
||
- anv: Use brw_cs_get_dispatch_info()
|
||
- i965: Use brw_cs_get_dispatch_info()
|
||
- intel/compiler: Remove unused exported functions
|
||
- nir: Move shared_memory_explicit_layout bit into common shader_info
|
||
- intel/compiler: Clarify why VUE is recomputed by FS
|
||
- nir: Rename nir_is_per_vertex_io to nir_is_arrayed_io
|
||
- compiler: Rename local_size to workgroup_size
|
||
- compiler: Rename SYSTEM_VALUE_LOCAL_GROUP_SIZE to SYSTEM_VALUE_WORKGROUP_SIZE
|
||
- nir: Rename nir_intrinsic_load_local_group_size to nir_intrinsic_load_workgroup_size
|
||
- nir: Rename WORK_GROUP (and similar) to WORKGROUP
|
||
- nir: Move zero_initialize_shared_memory into common shader_info
|
||
- nir: Move workgroup_size and workgroup_variable_size into common shader_info
|
||
- anv: Support workgroup memory in other shaders
|
||
- nir/lower_io: Rename vertex_index to array_index in helpers
|
||
- nir/gather_info: Rename per_vertex to is_arrayed
|
||
- spirv: Fix handling of OpBranchConditional with same THEN and ELSE
|
||
- nir/opt_if: Don't split ALU for single block infinite loops
|
||
- nir: Add test to check edge case in Split ALU optimization
|
||
- spirv: Update headers and metadata from latest Khronos commit
|
||
- spirv: Support SPV_KHR_subgroup_uniform_control_flow
|
||
- anv: Advertise VK_KHR_shader_subgroup_uniform_control_flow
|
||
|
||
Carsten Haitzler (Rasterman) (1):
|
||
|
||
- panfrost: Fix Bo imports to not take the process down if fd is invalid
|
||
|
||
Charlie (10):
|
||
|
||
- v3dv: enable KHR_image_format_list
|
||
- v3dv: enable KHR_sampler_mirror_clamp_to_edge
|
||
- v3dv: enable KHR_incremental_present
|
||
- v3dv: enable KHR_uniform_buffer_standard_layout
|
||
- v3dv: clamp srgb render targets
|
||
- v3dv: remove sRGB blending workaround
|
||
- v3dv: add the unswizzled RGBA4444 format
|
||
- v3dv: divide by block size in copy_image_blit
|
||
- v3dv: add ASTC formats to get_compatible_tlb_format
|
||
- v3dv: enable ASTC formats
|
||
|
||
Charlie Birks (1):
|
||
|
||
- v3dv: document two supported extensions
|
||
|
||
Charlie Turner (5):
|
||
|
||
- ci: Remove obsolete reference to DEQP_SKIPS
|
||
- radv: Merge dEQP default skips into all generation-specific skip lists
|
||
- radv: Add a STONEY baseline for dEQP.
|
||
- radv: Provide a toggle to avoid warnings about unsupported devices.
|
||
- spirv_to_nir: Add environment variable to change default log level
|
||
|
||
Charmaine Lee (1):
|
||
|
||
- svga: fix texture rectangle sampling when no sampler view declaration is defined
|
||
|
||
Chia-I Wu (93):
|
||
|
||
- venus: stop using vn_renderer_sync in vn_fence
|
||
- venus: stop using vn_renderer_sync in vn_semaphore
|
||
- venus: stop using vn_renderer_sync in vn_queue
|
||
- venus: remove vn_renderer_sync support from vn_queue_submission
|
||
- venus: remove VN_SYNC_TYPE_SYNC
|
||
- venus: remove vn_queue::sync_queue_index
|
||
- venus: remove vn_ring_wait_all
|
||
- venus: wait on vkQueuePresentKHR
|
||
- venus: remove vn_renderer_info::has_timeline_sync
|
||
- venus: add vn_image_init_memory_requirements
|
||
- venus: add vn_image_create
|
||
- venus: add vn_wsi_create_scanout_image
|
||
- venus: refactor vn_queue_submission_count_semaphores
|
||
- venus: clang-format clean
|
||
- venus: change SpaceBeforeParens style option
|
||
- venus: provide accessors for vn_instance_submit_command
|
||
- venus: update venus-protocol headers to use accessors
|
||
- venus: rename VN_CS_ENCODER_INITIALIZER
|
||
- venus: add vn_renderer_shmem
|
||
- venus: use vn_renderer_shmem
|
||
- venus: add dev->renderer pointer
|
||
- venus: pass vn_renderer in vn_renderer_bo functions
|
||
- venus: move vn_renderer_bo_ops to vn_renderer
|
||
- venus: merge bo create and init ops
|
||
- venus: move some common members to vn_renderer_bo
|
||
- venus: use sparse array to manage vn_renderer_bo
|
||
- venus: make sure gem_handle and vn_renderer_bo are 1:1
|
||
- venus: update venus-protocol for external memory
|
||
- venus: rework external memory capability queries
|
||
- venus: enable external memory support
|
||
- venus: fix render pass without attachments
|
||
- venus: fix dmabuf import mmap_size check
|
||
- venus: fix dmabuf import fail path
|
||
- venus: add VN_MAX_API_VERSION
|
||
- venus: rename vn_instance::renderer_version
|
||
- venus: clarify/fix instance renderer versions
|
||
- venus: clarify/fix device renderer version
|
||
- venus: refactor vn_physical_device_init_extensions
|
||
- venus: avoid strcmp for spec version override
|
||
- venus: refactor vn_physical_device_init_supported_extensions
|
||
- venus: init supported extensions in one place
|
||
- venus: add extension check for ANDROID_native_buffer
|
||
- venus: clean up vn_device_fix_create_info
|
||
- venus: get rid of #ifdef's in vn_CreateImage
|
||
- pps: fix a missing include in Intel pps driver
|
||
- util/u_thread: fix u_thread_setname for long names
|
||
- venus: add struct vn_command_buffer_builder
|
||
- venus: remember cmd buffer level and queue family
|
||
- venus: ignore pInheritanceInfo when we should
|
||
- docs: add basic documentation for venus
|
||
- vulkan/wsi: provide more info in wsi_image_create_info
|
||
- venus: add vn_device_memory_alloc as a helper
|
||
- venus: fix asserts on mem bo
|
||
- venus: fix opaque fd re-import
|
||
- venus: move wsi_image_create_info parsing
|
||
- venus: remember image wsi states
|
||
- venus: handle VN_COMMAND_BUFFER_STATE_INVALID
|
||
- venus: remember render pass PRESENT_SRC attachments
|
||
- venus: remember render pass PRESENT_SRC barriers
|
||
- venus: remember image view image
|
||
- venus: remember framebuffer attachments
|
||
- venus: remember cmd buffer render pass and framebuffer
|
||
- venus: remember cmd buffer fb attachments
|
||
- venus: add vn_image_memory_barrier_has_present_src
|
||
- venus: add vn_cmd_get_image_memory_barriers
|
||
- venus: clean up vn_get_intercepted_barriers
|
||
- venus: undo wsi iamge ownership transfer for Android
|
||
- venus: reland wsi image ownership transfer for Android
|
||
- venus: enable wsi image ownership transfer for common wsi
|
||
- vulkan/util: add vk_default_allocator
|
||
- radv: use vk_default_allocator
|
||
- v3dv: use vk_default_allocator
|
||
- tu: use vk_default_allocator
|
||
- anv: use vk_default_allocator
|
||
- venus: use vk_default_allocator
|
||
- venus: silence compiler warnings
|
||
- venus: query experimental features in one call
|
||
- venus: document the darkest corner of venus
|
||
- venus: move vn_renderer_sync_ops to vn_renderer
|
||
- venus: simplify vn_renderer_sync creation
|
||
- venus: update venus-protocol headers
|
||
- venus: add support for external fence on Android
|
||
- venus: add support for external semaphores on Android
|
||
- venus: clean up vn_physical_device_get_native_extensions
|
||
- venus: fix compatibility with older host drivers
|
||
- venus: be verbose about which physical devices are skipped
|
||
- vulkan/wsi: fix select_memory_type when all MTs are local
|
||
- venus: fix empty submits with BOs
|
||
- egl/surfaceless: try kms_swrast before swrast
|
||
- meson: allow egl_native_platform to be specified
|
||
- venus: clean up vn_AllocateMemory
|
||
- venus: suballocate memory in more cases
|
||
- vulkan/wsi/x11: do not inherit last_present_mode
|
||
|
||
Christian Gmeiner (3):
|
||
|
||
- ci: disable initrd support
|
||
- drm-shim: fix compile with glibc >= 2.33
|
||
- ci: bare-metal: drop webdav support
|
||
|
||
Connor Abbott (133):
|
||
|
||
- ir3: Fix list corruption in legalize_block()
|
||
- ir3: Reduce max const file indirect offset base to 9 bits
|
||
- ir3, tu: Add compiler flag for robust UBO behavior
|
||
- tu: Correctly preserve old push descriptor contents
|
||
- tu: Handle robust UBO behavior for pushed UBO ranges
|
||
- tu: Handle null descriptors
|
||
- tu: Expose VK_EXT_robustness2
|
||
- ir3/parser: Fix oob write with immediates array
|
||
- ir3: Improve cat1 modifier disassembly
|
||
- ir3: Assemble and disassemble swz/gat/sct
|
||
- ir3: Prevent oob writes to inputs/outputs array
|
||
- nir/lower_clip_disable: Fix store writemask
|
||
- ir3, tu: Cleanup indirect i/o lowering
|
||
- freedreno: Don't lower indirects in GLSL IR
|
||
- freedreno/a6xx: Better document SP_GS_PRIM_SIZE
|
||
- freedreno/a6xx: Fix SP_GS_PRIM_SIZE for large sizes
|
||
- tu: Fix SP_GS_PRIM_SIZE for large sizes
|
||
- ir3/postsched: Fix dependencies for a0.x/p0.x
|
||
- ir3/cp: Clone registers for compare-folding optimization
|
||
- ir3/sched: Use correct src index
|
||
- ir3/postsched: Use correct src index
|
||
- ir3/delay: Remove special case for array deps
|
||
- ir3/postsched: Fix ir3_postsched_node::delay calculation
|
||
- ir3/cp_postsched: Fixup SSA use pointer for direct reads
|
||
- ir3: Refactor nir->ir3 block handling
|
||
- ir3: Make predecessors an array
|
||
- ir3: Rework outputs
|
||
- ir3: Don't assume regs[1] exists in ir3_fixup_src_type()
|
||
- nir/lower_phis_to_scalar: Add "lower_all" option
|
||
- ir3/cf: Rewrite pass
|
||
- ir3: Use round-to-nearest-even for fquantize2f16
|
||
- ir3: Call nir_lower_wrmask() again after lowering scratch
|
||
- ir3: Only use per-wave pvtmem layout for compute
|
||
- ir3: Introduce phi and parallelcopy instructions
|
||
- ir3: Add ir3_start_block()
|
||
- ir3: Readd support for translating NIR phi nodes
|
||
- ir3: Prepare for instructions with multiple destinations
|
||
- ir3: Improve register printing for SSA
|
||
- ir3: Add ir3_register::array.base
|
||
- ir3/delay: Fix full->half and half->full delay
|
||
- ir3: Add reg_elems(), reg_elem_size(), and reg_size()
|
||
- ir3: Make branch conditions non-SSA
|
||
- ir3: Rewrite delay calculation
|
||
- ir3/delay: Delete pre-RA repeat handling
|
||
- ir3/postsched: Don't use SSA source information
|
||
- ir3: Remove unused check_src_cond()
|
||
- ir3: Add dominance infrastructure
|
||
- ir3: Add pass to lower arrays to SSA
|
||
- ir3: Expose occupancy calculation functions
|
||
- ir3: Rewrite register allocation
|
||
- ir3/ra: Add a validation pass
|
||
- ir3: Remove right and left copy prop restrictions
|
||
- ir3/sched: Don't schedule collect early
|
||
- ir3/sched: Make collects count against tex/sfu limits
|
||
- ir3/sched: Consider unused destinations when computing live effect
|
||
- ir3: Add simple CSE pass
|
||
- ir3: Insert output collects in the main shader
|
||
- ir3: Copy propagate immed/const to meta instructions
|
||
- ir3: Improve printing of array parallelcopies/phis
|
||
- ir3/ra: Fix array parallelcopy confusion
|
||
- ir3: Make tied sources/destinations part of the IR
|
||
- ir3: Split read-modify-write array dests in two
|
||
- ir3: Update ir3_register::instr when cloning instructions
|
||
- ir3: Validate that ir3_register::instr is correct
|
||
- ir3: Add is_reg_special()
|
||
- ir3: Make ir3_instruction::address a normal register
|
||
- ir3: Split ir3_reg_create() into ir3_{src,dst}_create()
|
||
- ir3: Add separate src/dst count in ir3_instr
|
||
- ir3/legalize: Construct branch properly
|
||
- ir3: Add srcs/dsts arrays to ir3_instruction
|
||
- freedreno/isa: Convert to srcs/dsts
|
||
- freedreno/tests: Convert to srcs/dsts
|
||
- ir3/sched: Convert to srcs/dsts arrays
|
||
- ir3/core: Switch to srcs/dsts arrays
|
||
- ir3/ra: Switch to srcs/dsts arrays
|
||
- ir3/parser: Switch to srcs/dsts arrays
|
||
- ir3/array_to_ssa: Switch to srcs/dsts arrays
|
||
- ir3/legalize: Switch to srcs/dsts arrays
|
||
- ir3/print: Switch to srcs/dsts arrays
|
||
- ir3/validate: Switch to srcs/dsts arrays
|
||
- ir3/opts: Switch to srcs/dsts arrays
|
||
- ir3/frontend: Switch to srcs/dsts arrays
|
||
- ir3: Remove regs array
|
||
- ir3: Remove IR3_REG_DEST
|
||
- ir3/ra: Fix corner case in collect handling
|
||
- freedreno/a6xx: Make SP_XS_PVT_MEM_HW_STACK_OFFSET non-inline
|
||
- freedreno, tu: Set SP_XS_PVT_MEM_HW_STACK_OFFSET
|
||
- freedreno/computerator: Fix local_size typo
|
||
- ir3/sched: Speed up live_effect
|
||
- ir3: Stop creating dummy dest registers
|
||
- ir3: Prepare dest helpers for multi-dst instructions
|
||
- ir3: Add foreach_dst/foreach_dst_n
|
||
- ir3: Support multi-mov instructions
|
||
- ir3/delay: Support multi-mov instructions
|
||
- ir3/postsched: Support multi-mov instructions
|
||
- ir3/legalize: Support multi-mov instructions
|
||
- ir3: Use correct flags for movmsk & multi-mov
|
||
- ir3/validate: Support multi-mov instructions
|
||
- ir3: Print multi-mov instructions
|
||
- ir3: Add min gen for multi-mov instructions
|
||
- ir3/lower_parallelcopy: Use SWZ
|
||
- nir/subgroups: Replace lower_vote_eq_to_ballot with lower_vote_eq
|
||
- nir/subgroups: Support > 1 ballot components
|
||
- nir: Add read_invocation_cond_ir3 intrinsic
|
||
- tu, ir3: Plumb through support for CS subgroup size/id
|
||
- ir3/nir: Call nir_lower_subgroups
|
||
- ir3: Handle shared register liveness correctly
|
||
- ir3: Handle unreachable blocks
|
||
- ir3: Prevent propagating shared regs out of loops
|
||
- ir3: Better valid flags for shared regs
|
||
- ir3: Actually allow shared reg moves to be folded
|
||
- ir3: Fix shared reg delay
|
||
- ir3: Make MOVMSK use repeat
|
||
- ir3: Fix infinite loop in scheduler when splitting
|
||
- ir3/sched: Handle branch condition in split_pred()
|
||
- ir3: Cleanup ir3_legalize jump optimization
|
||
- ir3: Support any/all/getone branches
|
||
- ir3: Add subgroup pseudoinstructions
|
||
- ir3: Handle shared registers in lower_parallelcopy
|
||
- ir3: Implement nir subgroup intrinsics
|
||
- ir3: Fix convergence behavior for loops with continues
|
||
- ir3/legalize: Fix loop convergence behavior
|
||
- tu: Update subgroup properties
|
||
- ir3/nir: Lower indirect references of compact variables
|
||
- ir3: Add missing include to ir3_parser.y
|
||
- ir3: Add ir3_collect() for fixed-size collects
|
||
- ir3/lower_parallelcopy: Don't manually set wrmask
|
||
- ir3: Update .editorconfig and .dir-locals.el
|
||
- ir3: Manually reformat some places
|
||
- freedreno: Add some options to .clang-format
|
||
- ir3: Reformat source with clang-format
|
||
- ir3/print: Manual formatting fixups
|
||
- ir3: Preserve gl_ViewportIndex in the binning shader
|
||
|
||
Corentin Noël (4):
|
||
|
||
- ci: Use the caching proxy for Mesa artifacts
|
||
- ci: Re-enable virgl tesselation shader
|
||
- ci: Bump virglrenderer
|
||
- ci: actually run piglit tests with virgl
|
||
|
||
Daniel Schürmann (33):
|
||
|
||
- aco: fix additional register requirements for spilling
|
||
- aco: relax validation rules for p_reduce dst RegType
|
||
- driconf: set vk_x11_strict_image_count for Metro: Exodus
|
||
- aco/ra: prevent underflow register for p_create_vector operands
|
||
- radv: call nir_copy_prop() after load-store vectorization
|
||
- aco/ra: also prevent overflow register for p_create_vector operands
|
||
- aco: remove condition operand from branch in invert block
|
||
- radv,aco: scalarize all phis via nir_lower_phis_to_scalar()
|
||
- aco: simplify Phi RegClass selection
|
||
- aco/ra: only create phi-affinities for killed operands
|
||
- aco/ra: refactor affinity coalescing
|
||
- aco/ra: refactor register assignment for vector operands
|
||
- amd/ci: add hawaii-specific skip and fail lists
|
||
- aco/ra: handle copies of definition registers
|
||
- aco/ra: handle copies of copies better
|
||
- aco/util: replace DIV_ROUND_UP(n+1,m) by n/m+1
|
||
- aco: reorder and cleanup #includes
|
||
- aco: add missing Licenses and remove Authors from files
|
||
- aco: add 'common/' and 'llvm/' prefix to #includes
|
||
- aco/meson: remove unnecessary dependencies
|
||
- aco: refactor SDWA opcode validation
|
||
- aco: remove (wrong) GCC array-bounds warning
|
||
- util/meson: include inc_gallium
|
||
- aco: add .clang-format file
|
||
- aco: Format.
|
||
- aco/meson: remove inc_gallium from include_directories
|
||
- aco: fix self-intersecting register swaps
|
||
- aco: fix extract_vector optimization
|
||
- aco/isel: avoid unnecessary calls to nir_unsigned_upper_bound()
|
||
- aco/insert_waitcnt: Remove many unnecessary wait_imm.combine()
|
||
- aco/live_var_analysis: change worklist to a single integer
|
||
- aco/optimizer: ensure to not erase high bits when propagating packed constants
|
||
- aco: include <cstddef> in aco_util.h
|
||
|
||
Daniel Stone (69):
|
||
|
||
- CI: Disable Panfrost and radeonsi
|
||
- CI: Disable all Panfrost/AMD/Iris automatic jobs
|
||
- CI: Disable rk3399-gru-kevin jobs for now
|
||
- doc: Gratuituous promotion of Wayland
|
||
- docs: Even more gratutious nitpicks
|
||
- Revert "CI: Disable rk3399-gru-kevin jobs for now"
|
||
- CI: Fix path confusion in OpenCL Piglit execution
|
||
- ci/zink: Skip flaky GLX test
|
||
- ci/radeonsi: Skip flaky glx-swap-copy test
|
||
- ci/windows: Artifact Meson build and test logs
|
||
- ci/windows: Re-enable Windows build
|
||
- ci: Add Piglit gl-1.0-blend-func to everyone's skips
|
||
- ci/lava: Iterate all job results, not just the first
|
||
- ci/lava: Handle proxy download failures
|
||
- ci/lava: Add validate-only mode to job submitter
|
||
- ci/lava: Add --dump-yaml option to submitter
|
||
- ci/bare-metal: Factor out environment to a separate script
|
||
- ci/bare-metal: Don't leak JWT into logs
|
||
- ci/lava: Move LAVA files to lava/
|
||
- ci/lava: Pass JWT separately from environment variables
|
||
- ci/lava: Cosmetic reordering of job init
|
||
- ci/lava: Wrap submission in a shell script
|
||
- ci/lava: Clean up variable naming, document them
|
||
- ci: Make PIPELINE_ARTIFACTS_BASE a common variable
|
||
- ci: Add JOB_ARTIFACTS_BASE variable
|
||
- ci: Use JOB_ARTIFACTS_BASE for Piglit fails
|
||
- ci/lava: Use per-job rootfs overlay for environment
|
||
- ci/panfrost: Remove useless variable
|
||
- ci/lava: Generate job name from lava-submit.sh
|
||
- ci/lava: Remove unused arguments
|
||
- ci/lava: Add explicit fatal-error handler
|
||
- ci/lava: Disable stdout/stderr buffering
|
||
- ci/lava: Dump and artifact YAML again
|
||
- ci/lava: Avoid tee as it ruins exit status
|
||
- ci/piglit: Fix path to uploaded images
|
||
- ci/lava: Always upload Piglit replay images to MinIO
|
||
- ci/lava: Set PIGLIT_NO_WINDOW
|
||
- ci/lava: Explicitly start Xorg for Iris EGL tests
|
||
- ci/bare-metal: Rename BM_KERNEL_MODULES to HWCI_KERNEL_MODULES
|
||
- ci/lava: Use HWCI_KERNEL_MODULES to load modules
|
||
- ci/lava: Rename environment variable script
|
||
- ci/bare-metal: Try harder to do NTP
|
||
- ci/bare-metal: Reorder init so network comes first
|
||
- ci: Move bare-metal init script to common directory
|
||
- ci: Be consistent about install path
|
||
- ci/bare-metal: Consistently set library paths
|
||
- ci/bare-metal: Split init script into two stages
|
||
- ci/bare-metal: Move devcoredump capture to CI common
|
||
- ci/lava: Start using devcoredump captures
|
||
- ci: Consistent pass/fail result output
|
||
- ci: Unify {BM,LAVA}_START_XORG environment
|
||
- ci: Unify {BARE_METAL,LAVA}_TEST_SCRIPT environment
|
||
- ci/bare-metal: Set CPU and GPU governors to max, disable GPU runtime PM
|
||
- ci/lava: Pass MinIO path on the command line
|
||
- ci/lava: Use common stage-2 init
|
||
- ci/lava: Drop bitrotten fastboot support
|
||
- ci/lava: Make kernel image type a normal argument
|
||
- ci/lava: Generate YAML from Python, not Jinja
|
||
- llvmpipe: Add handle export for resource_get_param
|
||
- Revert "ci: disable panfrost t760 jobs"
|
||
- CI: Disable LAVA devices for maintenance
|
||
- Revert "CI: Disable LAVA devices for maintenance"
|
||
- util/disk_cache: Don't leak when cache is empty
|
||
- panfrost/genxml: Decode Bifrost index-driven vertex jobs
|
||
- ci/panfrost: Temporarily disable sun50i/RK3288
|
||
- Revert "ci/panfrost: Temporarily disable sun50i/RK3288"
|
||
- vulkan/wsi/wayland: Initialise wl_shm pointer in VkImage
|
||
- egl/wayland: Error on invalid native window
|
||
- egl/wayland: Allow EGLSurface to outlive wl_egl_window
|
||
|
||
Danylo Piliaiev (36):
|
||
|
||
- nir: add lowering pass for helperInvocationEXT()
|
||
- turnip: implement VK_EXT_shader_demote_to_helper_invocation
|
||
- turnip: implement VK_KHR_shader_terminate_invocation
|
||
- ir3: treat 16b imul as mul.s24
|
||
- turnip: enable shaderInt16
|
||
- ir3: do not double threadsize when exceeding branchstack limit
|
||
- ir3: make possible to specify branchstack up to 64
|
||
- tu: do not corrupt unwritten render targets
|
||
- ir3: do not move varying inputs that depend on unmovable instrs
|
||
- ir3: do not fold cmps from different blocks with non-null address
|
||
- ir3: memory_barrier also controls shared memory access order
|
||
- ir3: update bar/fence bits in accordance to blob
|
||
- turnip: implement VK_KHR_vulkan_memory_model
|
||
- docs: mark off VK_KHR_vulkan_memory_model for turnip
|
||
- turnip,freedreno/a6xx: SP_BLEND_CNTL has per-mrt blend enable bit
|
||
- freedreno/a5xx: SP_BLEND_CNTL has per-mrt blend enable bit
|
||
- turnip: copy all layers specified in vkCmdCopyImage
|
||
- ci/turnip: drop fail annotation for float_control tests
|
||
- ci/turnip: drop fail annotation for image.extend_operands_spirv1p4.*
|
||
- turnip: do not ignore early_fragment_tests
|
||
- turnip: make possible to create read-only bo with tu_bo_init_new
|
||
- turnip: make cmdstream bo's read-only to GPU
|
||
- turnip: place a limit on the growth of BOs
|
||
- freedreno: reduce the upper bound of IB size by one
|
||
- turnip: reset push descriptor set on command buffer reset
|
||
- turnip: emit vb stride dynamic state when it is dirty
|
||
- turnip: fix register_index calculations of xfb outputs
|
||
- turnip: implement VK_EXT_provoking_vertex
|
||
- turnip: do not re-emit same vs params
|
||
- turnip: early exit in tu6_draw_common to save cpu cycles
|
||
- freedreno/computerator: pass iova of buffer to const register
|
||
- freedreno/isa: add uoffset type to print positive-only offsets
|
||
- ir3: add ldg.a,stg.a which allow complex in-place offset calculation
|
||
- glsl: Prohibit implicit conversion of mem parameter in atomicOP functions
|
||
- ir3: add newly found shlg.b16 instruction
|
||
- freedreno: fix wrong tile aligment for 3 CCU gpu
|
||
|
||
Dave Airlie (146):
|
||
|
||
- iris: move get_time into a static in bufmgr code.
|
||
- iris: move target to isl dim translate to inline.
|
||
- lavapipe: add support for non-dri loader on linux
|
||
- llvmpipe: split screen init up.
|
||
- llvmpipe: wrap late screen init with a mutex.
|
||
- llvmpipe: delay late screen creation until context init.
|
||
- lavapipe: fix mipmapped resolves.
|
||
- lavapipe: mark event_storage as volatile
|
||
- intel: move brw_ff_gs_prog_key/data to compiler.
|
||
- intel/compiler: add support for compiling fixed function gs
|
||
- i965: port fixed function geom shader to use compiler paths
|
||
- i965: drop old brw ff gs code.
|
||
- intel/genxml: align gen4/5 xml for store data immediate
|
||
- intel/genxml: rewrite the prefilterop xml to be more consistent.
|
||
- intel/gemxml: move blitter command to render on gen4/5
|
||
- intel/genxml: fix raster op fields on gen4/5
|
||
- intel/decoder: fixup batch decoder for binding tables on gen4/5
|
||
- intel/decoder: add gen4/5 geometry state decode
|
||
- gallivm: handle texture arrays in non-fragment shaders with lod.
|
||
- llvmpipe: fix non-multisampled rendering to multisampled framebuffer
|
||
- llvmpipe: add the interesting bit of cpu detection to the cache.
|
||
- st/nir: always revectorise if scalarising happens.
|
||
- intel/gfx6: move xfb_setup outside the gs compiler into the driver.
|
||
- intel/isl: decrease isl_format_layouts size by 36k
|
||
- intel/isl: convert null surface fill to a struct.
|
||
- intel/isl: add levels and minimum array element to null fill
|
||
- intel/isl: add blend enable flag to gen4/5
|
||
- u_blitter: fix fs used when no color emitted
|
||
- u_blitter: fix stencil blit fallback for crocus.
|
||
- iris: drop unused function declaration
|
||
- nir/edgeflags: update outputs written when lowering edge flags.
|
||
- st/mesa: also disable other int textures
|
||
- intel/decode: handle gen4/5 WM state fragment shaders
|
||
- intel: reorder base program key.
|
||
- intel/compiler: add flag to indicate edge flags vertex input is last
|
||
- crocus: initial gallium driver for Intel gfx 4-7
|
||
- ci: add crocus to the build tests
|
||
- crocus: Don't call SET_TILING for dmabuf imports
|
||
- crocus: Make iris_bo_import_dmabuf take a modifier
|
||
- crocus: introduce main resource configuration helper.
|
||
- crocus: Drop buffer support in resource_from_handle
|
||
- crocus: hook up memory object creation from handle
|
||
- crocus: hook up resource creation from memory object
|
||
- crocus: plumb device/driver UUID generators
|
||
- crocus: enable GL_EXT_memory_object feature on gen7
|
||
- crocus: fix scanout tiling so glamor/modesetting can work.
|
||
- crocus: fixed some missing WM dirtys.
|
||
- crocus: fixup render aux usage function.
|
||
- crocus: disable Z16
|
||
- crocus/gen6: fix depth blit blorp regression.
|
||
- i965: fix regression in pipe control on g45
|
||
- crocus: drop dead gen prototypes.
|
||
- crocus: fixup stray tab
|
||
- crocus: rename genX proto functions to avoid iris conflicts.
|
||
- crocus: fixup workaround_bo to match 965.
|
||
- crocus: convert a bunch of is_haswell into verx10 checks.
|
||
- crocus: refactor blend state code.
|
||
- crocus/gen8: limit some pipe controls to gen7/hsw
|
||
- crocus: limit texture gather workarounds to gen7/hsw
|
||
- crocus/stencil: limit stencil workaround to gen7
|
||
- crocus/query: add gen8 support to queries by extending hsw checks
|
||
- crocus: extend l3 config setup to gen8
|
||
- crocus/gen8: add push constant support (extend hsw)
|
||
- crocus/gen8: extend some compute + state functions to gen8
|
||
- crocus/gen8: extend image support to gen8
|
||
- crocus: extend hsw cut index to gen8
|
||
- crocus/gen8: extend predicate handling to gen8.
|
||
- crocus/gen8: add sampler / border color support for gen8
|
||
- crocus/gen8: add l3 config support
|
||
- crocus/gen8: extending gen7 binding table pointers
|
||
- crocus/gen8: limit vertex buffer workarounds to ivb
|
||
- crocus/gen8: add raw pipe control support for gen8 workarounds
|
||
- crocus/gen8: add support for vertex instancing and index buffers.
|
||
- crocus/gen8: state base address + misc setup state.
|
||
- crocus/gen8: add VF topology support
|
||
- crocus/gen8: add PMA fix from iris
|
||
- crocus/gen8: add streamout support
|
||
- crocus/gen8: add SBE swiz support
|
||
- crocus/gen8: add VF SGVS support.
|
||
- crocus/gen8: add PS blend command support.
|
||
- crocus/gen8: refactor blend state for gen8
|
||
- crocus/gen8: add rasterizer state changes.
|
||
- crocus/gen8: add viewport support
|
||
- crocus/gen8: add depth stencil state support
|
||
- crocus/gen8: port over vs/gs/ds state changes.
|
||
- crocus/gen8: port over ps/wm state changes from iris.
|
||
- crocus/gen8: port over VFE/compute state changes
|
||
- croucs/gen8: handle gfx8 surface aux addr reloc.
|
||
- crocus/gen8: handle sampler differences
|
||
- crocus/gen8: hookup gen8 state generators
|
||
- crocus/gen8: add support for cherryview (env var for bdw)
|
||
- croucs: limit stencil swizzle change to older generations
|
||
- crocus/bufmgr: fix userptr left over fail
|
||
- crocus: Explicitly cast value to uint64_t
|
||
- crocus: free context state properly.
|
||
- crocus: fix vertex buffer leak on screen end.
|
||
- crocus: fix batch state bo leak
|
||
- meson/crocus: add prefer-crocus option.
|
||
- crocus/query: poll the syncobj in the no wait situation
|
||
- intel/genxml: fix gfx6 GS SVB_INDEX encoding
|
||
- crocus/gfx6: fix sampler view first level.
|
||
- crocus: dirty blend state more often.
|
||
- crocus: Avoid replacing backing storage for buffers with no contents
|
||
- crocus/gfx6: always be dirtying gs attachments for xfb
|
||
- crocus: fix another printf specifier.
|
||
- crocus/gen8: add back z16 support for gen8
|
||
- crocus: disable Z16 unorm textures on pre-gen8 as well.
|
||
- gallium/sw: add sw_vk bit to avoid having to futz with env vars for lavapipe
|
||
- zink: drop getenv hacking now that gallium is fixed.
|
||
- iris: make iris_bind_reserve_3d and Wa_1604061319 only check for dirty render bindings
|
||
- crocus: cleanup some deadcode in the gen5 blend emit
|
||
- crocus: expose ARB_blend_func_extended on gen 45/50
|
||
- crocus/gen5: enable support for GL_EXT_gpu_shader4
|
||
- crocus: fix crash on index buffer rebinding.
|
||
- crocus: fixup index buffer dirtying.
|
||
- draw: fix tessellation output vertex size calculation
|
||
- draw/tess: write correct primitive id into vertices
|
||
- crocus: inline the d/s resource handling functions
|
||
- crocus: don't update draw parameters unless needed
|
||
- crocus: optimise bo_unref path a little.
|
||
- crocus: inline group_index<->bti
|
||
- crocus: reorder version checks on indirect xfb
|
||
- crocus: restrict prim_restart on index buffer check to pre-hsw
|
||
- crocus: support rebinding streamout target buffers
|
||
- crocus: use threaded context base classes
|
||
- crocus/tc: init/deinit threaded resource
|
||
- crocus: add unsync transfer pool
|
||
- crocus: enable threaded context support
|
||
- ac: fix win32 build
|
||
- crocus/gen8: fix wrap mode needs border color.
|
||
- crocus: add GL_CLAMP emulation in driver again.
|
||
- vulkan/wsi/wl: add wl_shm support for lavapipe.
|
||
- lavapipe: add the separate depth/stencil layout enable.
|
||
- crocus: use simple_mtx in the bufmgr
|
||
- lvp: fixup multi draw memcpys
|
||
- draw: handle resetting draw_id between instances.
|
||
- softpipe/aniso: move DDQ calculation to after scaling.
|
||
- crocus/gen4-5: fix ff gs emit on VS vue map change.
|
||
- llvmpipe: add support for time elapsed queries.
|
||
- draw/llvmpipe: multiply polygon offset units by 2
|
||
- teximage: return correct desktop GL error for compressedteximage
|
||
- crocus/gen4: restrict memcpy mapping to gen5
|
||
- intel/fs: restrict max push length on older GPUs to a smaller amount
|
||
- crocus/gen45: fix mapping compressed textures
|
||
- intel/genxml: fix raster operation field in blt genxml
|
||
- crocus: add support for set alpha to one with blt.
|
||
|
||
Dmitry Baryshkov (2):
|
||
|
||
- freedreno/regs: split DSI PHY registers to separate xml files.
|
||
- freedreno/regs: split old/not used phy registers to separate DB
|
||
|
||
Drew Davenport (1):
|
||
|
||
- radeonsi: Report multi-plane formats as unsupported
|
||
|
||
Duncan Hopkins (3):
|
||
|
||
- zink: Correct compiler issue with have_moltenvk member having been moved.
|
||
- gallium/dri: Guard DRI driver global variables on MacOS if Zink is enabled.
|
||
- zink: Fix MacOS compiling issues
|
||
|
||
Dylan Baker (27):
|
||
|
||
- meson: OpenMP is supposed to be optional
|
||
- docs: add release notes for 21.0.3
|
||
- docs: update sha256 sum for mesa 21.0.3
|
||
- docs: update calendar and link releases notes for 21.0.3
|
||
- docs: update calendar for 21.1.0-rc1
|
||
- docs: update calendar for 21.1.0-rc2
|
||
- docs: update calendar for 21.1.0-rc3
|
||
- meson/vulkan: fix linkage on windows
|
||
- docs: Add calendar entries for 21.2 release candidates.
|
||
- VERSION: bump for 21.2-rc1
|
||
- .pick_status.json: Update to f40a08d25c91256cd3dff0211b8e10d5bbb3734e
|
||
- .pick_status.json: Update to a62973580b7846f2213cbd2589e9473c26596683
|
||
- .pick_status.json: Update to 27534a49cf3872646cb8ef9371707d74a81b1986
|
||
- VERSION: bump for 21.2-rc2
|
||
- .pick_status.json: Update to b45cddda183230232937387f91d009500b2372c9
|
||
- .pick_status.json: Update to 49908c602ffd2d84063effa7ddd0ee842be41a89
|
||
- VERSION: bump for 21.2.0-rc3
|
||
- .pick_status.json: Update to dff0d9911d176802b54890c796e19f56c50f24e1
|
||
- .pick_status.json: Update to b8e29e89366a5264391dc7c10e778330b7add66a
|
||
- freedreno/ir3: Add build id to the disassembler test
|
||
- .pick_status.json: Mark 8cb795b4772f882024b20c4d4b051b2411dd1a8c as denominated
|
||
- .pick_status.json: Update to 87b0962fef4e447a2ea9c76a611aa20b109a259d
|
||
- .pick_status.json: Update to 842b8c8965327615f4692384a905dd63f1fba63d
|
||
- .pick_status.json: Update to 97be8e42e42f3b739c3de808553094f86ad8879f
|
||
- bin/gen_release_notes: Add basic tests for parsing issues
|
||
- bin/gen_release_notes: Don't consider issues for other projects
|
||
- bin/gen_release_notes: Fix commits with multiple Closes:
|
||
|
||
Eleni Maria Stea (5):
|
||
|
||
- egl: fix in expected type
|
||
- util: replaced ENODATA with ENOATTR for non-Linux systems
|
||
- util: Removed unused statement from FreeBSD build
|
||
- intel: struct bitset is renamed to brw_bitset
|
||
- intel: PAGE_SIZE used in allocators shouldn't be defined on FreeBSD
|
||
|
||
Ella-0 (1):
|
||
|
||
- anv: expose primary node to VK_EXT_physical_device_drm even when VK_KHR_display is not enabled
|
||
|
||
Emil Velikov (1):
|
||
|
||
- gbm: list to stderr all the missing extension
|
||
|
||
Emma Anholt (251):
|
||
|
||
- ci/freedreno: Merge a630 piglit to a single job.
|
||
- freedreno: Fix YUV sampler regression.
|
||
- ci/virgl: Mark a couple of new Crash tests as flakes.
|
||
- ci/freedreno: Skip some precision tests on a530.
|
||
- nir_to_tgsi: Use ARL instead of UARL in the !native_integers case.
|
||
- nir: Generate load_ubo_vec4 directly for !PIPE_CAP_NATIVE_INTEGERS
|
||
- ci/lavapipe: Don't include deqp's shader_cache in the artifacts.
|
||
- ci/lava: Return the run's results/ artifacts from the DUTs.
|
||
- ci/piglit: Always include the HTML summary in a run.
|
||
- ci/lava: Point the shader cache at tmpfs.
|
||
- mesa: Remove dead _mesa_unpack_rgba_block().
|
||
- util: Switch the non-block formats to unpacking rgba rows instead of rects.
|
||
- util/format: Add some NEON intrinsics-based u_format_unpack.
|
||
- panfrost: Enable packed uniforms.
|
||
- zink: Enable PIPE_CAP_PACKED_UNIFORMS.
|
||
- ci: Build deqp-egl targeting x11_egl_glx
|
||
- ci/llvmpipe: Test dEQP-EGL against Xvfb.
|
||
- ci/freedreno: Test dEQP-EGL against Xorg.
|
||
- mapi: Respect MESA_DEBUG=silent for no-op debug output.
|
||
- freedreno: Mark glsl-fs-fogscale as a Fail.
|
||
- freedreno/a6xx: Don't try to do Z-as-RGBA blits for mismatched formats.
|
||
- util: Fix big-endian handling of z/s formats.
|
||
- mesa: Deduplicate _mesa_pack_ubyte_stencil_row()
|
||
- mesa: Deduplicate _mesa_pack_float_z_row().
|
||
- mesa: Deduplicate _mesa_pack_uint_z_row().
|
||
- mesa: Remove dead _mesa_get_pack_float_z_func().
|
||
- msea: Move z24s8-to-z24s8 packing fastpath to swrast.
|
||
- mesa: Move per-pixel Z pack functions to swrast.
|
||
- mesa: Remove dead _mesa_pack_ubyte_rgba_rect().
|
||
- mesa: Replace _mesa_pack_ubyte_rgba_row() with pack_ubyte_rgba_8unorm().
|
||
- ci/radeonsi: Mark a glx_arb_sync_control/timing flake.
|
||
- turnip: Only write the tu_RegisterDeviceEXT() out fence on success.
|
||
- ci: Add missing vulkan dep for freedreno (turnip) and v3dv test jobs.
|
||
- u_format: Fix z32_s8x24 s8 unpacking on big-endian.
|
||
- u_format: Add missing BE swizzles for R8SG8SB8UX8U_NORM
|
||
- ci/freedreno: Mark dEQP-EGL flakes reported on IRC since its introduction.
|
||
- ci/freedreno: Mark new flakes from the go-fast branch.
|
||
- ci/freedreno: Mark another recent piglit flake.
|
||
- ci/freedreno: Fix the recent-a5xx-texture-flakes matches.
|
||
- ci/freedreno: Add another db820c flake that's appeared in the last few months.
|
||
- tgsi: Mark the tgsi_exec_channel and tgsi_double_channel ALIGN16.
|
||
- tunrip: Add support for VK_EXT_separate_stencil_usage.
|
||
- ci/freedreno: Mark a5xx texture gather as flaky.
|
||
- turnip: Demote API version to 1.1.
|
||
- ci/llvmpipe: Add testing of gles3/31/gl.
|
||
- ci/lavapipe: Add fractional NIR stress test job.
|
||
- freedreno/a5xx: Fix up border color pointers.
|
||
- gallium/tgsi_exec: Drop the unused dst_datatypes from dest stores.
|
||
- tgsi_exec: Drop unused destination dimension support.
|
||
- tgsi_exec: Mark the store file default case as unreachable.
|
||
- gallium/tgsi_exec: Simplify bounds checks on the const file.
|
||
- turnip: Switch to the shared vulkan ICD generator.
|
||
- turnip: Move the extension tables to tu_device.c
|
||
- ci/freedreno: Add another daily dose of a530 flakes.
|
||
- turnip: Drop wideLines properties since we don't support wide lines.
|
||
- turnip: Claim 2 discrete queue priorities.
|
||
- freedreno: Update editorconfig and emacs settings for freedreno reformat.
|
||
- ci/turnip: Clean up some stale fail annotations.
|
||
- ci/turnip: Add some links to issues and MRs for some test failures.
|
||
- turnip: Drop fail annotation for driver_properties.
|
||
- ci: Switch to apitraces for glmark2
|
||
- ci/panfrost: Add some more traces to replay.
|
||
- ci/iris: Add some more traces to replay.
|
||
- ci/freedreno: Skip refract on a306 now that it hangchecks sometimes.
|
||
- midgard: Fix type for vertex_builtin_arg() and compute_builtin_arg().
|
||
- ci/freedreno: Skip a test that's taking out the a530 boards.
|
||
- ci/freedreno: Mark two more recent intermittent a530 flakes.
|
||
- ci/deqp: Make DEQP_EXPECTED_RENDERER a required regex for VK like for GLES.
|
||
- ci/intel: Add test jobs for dEQP.
|
||
- vulkan: Avoid stomping array padding in the MemoryProperties wrapper.
|
||
- mesa/st: Only use 16-bit ints or floats in the NIR path.
|
||
- i915g: Disable 3D-pipeline clears.
|
||
- i915g: Switch batchbuffer dumping to mesa_logi().
|
||
- i915g: Fix dumping of the FS in batchbuffers.
|
||
- ci/i915g: Introduce manual testing of i915g using anholt's runner.
|
||
- i915g: Make the FS for compile failures write red instead of DIFFUSE.
|
||
- i915g: Add support for the .Absolute flag on TGSI srcs.
|
||
- i915g: Stop advertising support for indirect addressing in the FS.
|
||
- i915g: Fix writing of undefined depth value if not writing any outputs.
|
||
- i915g: Fix undefined results for TGSI_OPCODE_KILL
|
||
- ci/iris: Switch GLK back to manual testing.
|
||
- ci/freedreno: Clear compswap flake annotation.
|
||
- ci/freedreno: Clear stale validation failure flake annotation.
|
||
- ci/freedreno: Drop a630 flake annotation from the go-fast changes.
|
||
- ci/freedreno: Add a link explaining get_display_plane_capabilities
|
||
- ci/freedreno: Drop VK flake annotations not seen in the last ~year.
|
||
- ci/freedreno: Consolidate ssbo.fragment_binding_array flake annotation.
|
||
- ci/freedreno: Mark a630 glx-visuals-depth/stencil as piglit flakes.
|
||
- ci/freedreno: Also mark waitformsc as flaky.
|
||
- ci/freedreno: Add glx-copy-sub-buffer to flakes on a530 and a630.
|
||
- mesa/st: Fix iris regression with clip distances.
|
||
- ci/freedreno: Add another a630 piglit flake.
|
||
- ci/freedreno: Turn off default a530 quick_gl testing, do full quick_shader.
|
||
- turnip: Reorganize copy_format()'s switch statement.
|
||
- turnip: Make sure that SNORM blits don't clamp ambiguous -1.0 values.
|
||
- Revert "ci: Configure DUTs for max performance"
|
||
- ci: Add known-flake handling for the IRC flake reports
|
||
- ci: Move the flakes channels to OFTC
|
||
- util: Add a helper macro for defining initial-exec variables.
|
||
- android: Fix ELF TLS support.
|
||
- ci/android: Update to building for SDK 29 by default.
|
||
- u_format: Fix some pep8 in u_format_parse.py.
|
||
- u_format: Drop redundant .name init.
|
||
- u_format: Move the BE swizzle computation into Format init.
|
||
- u_format: Use the nice helper for reversing an array.
|
||
- u_format: Assert that array formats don't include BE swizzles.
|
||
- u_format: Define tests for r3g3b2 formats and fix BE swizzles for them.
|
||
- u_format: Fix the BE channel ordering for R5G5B5A1_UINT.
|
||
- u_format: Sanity check the BE channels for all bitmask formats.
|
||
- u_format: Sanity check that BE swizzles are appropriately mapped from LE.
|
||
- u_format: Use the computed BE channels/swizzles for bitmask formats.
|
||
- ci/freedreno: Add some more known flakes from recent marge runs.
|
||
- docs/freedreno: Update for the fanin/fanout -> collect/split rename.
|
||
- docs/freedreno: Rewrite the section on array access.
|
||
- tgsi_exec: Garbage-collect the FAST_MATH path.
|
||
- u_math: Reduce fast-log2 table size from 65k entries back to 256.
|
||
- llvmpipe: Don't call util_init_math().
|
||
- ra: Add a unit test.
|
||
- ra: Document that class index is allocated in order, use that in r300.
|
||
- ra: Use struct ra_class in the public API.
|
||
- ra: Add fast-path support for register classes of contiguous regs.
|
||
- vc4: Use the ra_alloc_contig_reg_class() function to speed up RA.
|
||
- v3d: Use the ra_alloc_contig_reg_class() function to speed up RA.
|
||
- intel/fs: Use ra_alloc_contig_reg_class() to speed up RA.
|
||
- intel/vec4: Use ra_alloc_contig_reg_class() to reduce RA overhead.
|
||
- lima: Use ra_alloc_contig_reg_class().
|
||
- util/ra: Use the conflicting neighbor to skip unavailable registers.
|
||
- ci/i915g: Fix incorrect expectation.
|
||
- i915g: Make sure we don't try to texture from the const file.
|
||
- ci/lava: Finish garbage-collecting the TEST_SUITE variable
|
||
- ci: Update piglit and deqp/piglit-runner.
|
||
- ci/freedreno: Enable running all of piglit_gl for a530's manual test.
|
||
- ci/piglit: Skip WGL on all the Linux runs.
|
||
- ci/fastboot: Add a serial timeout to catch fastboot prompt failure.
|
||
- ci/fastboot: Consistently restart the run on intermittent conditions.
|
||
- ci/iris: Enable piglit testing on AML-Y.
|
||
- ci: Disable Xorg's screensaver entirely.
|
||
- ci/deqp: Drop stress/perf skips lists.
|
||
- ci/deqp: Skip flush_finish on all CI jobs.
|
||
- ci/softpipe: Move the flake to the flakes list.
|
||
- ci: Add a flakes IRC channel for llvmpipe/softpipe.
|
||
- ci/deqp: Skip dEQP-VK.wsi.display.get_display_plane_capabilities
|
||
- ci/piglit: Move the WGL skip to a common skips file.
|
||
- ci/piglit: Skip glx_arb_sync_control@timing.* on all systems.
|
||
- freedreno: Drop batch-cache orphan tracking.
|
||
- freedreno: Make a bunch of the batch cache take ctx as the arg.
|
||
- freedreno: Drop a bit of indirection around the batch cache flush path.
|
||
- freedreno: Be more strict about QUERY_AVAILABLE to simplify the code.
|
||
- freedreno: Fix batch reference handling in flush_resource().
|
||
- freedreno: Move the !MAP_WRITE write batch refcounting to the branch.
|
||
- freedreno: Remove broken back_blit optimization.
|
||
- freedreno: Flush batches upon destroying the ctx.
|
||
- freedreno: Add perf_debug() for our software conditional rendering.
|
||
- freedreno: Move FD_MESA_DEBUG=msgs output to mesa_logi.
|
||
- freedreno/fdl: Give the tiling mode a nice name in debug dumps.
|
||
- freedreno: Add more detailed blit debug in FD_MESA_DEBUG=msgs.
|
||
- freedreno: Skip staging blits from uninitialized resources.
|
||
- freedreno: Add some cheza flakes from the last week.
|
||
- i915: Disable vertex texturing and delete the code.
|
||
- i915: Drop assertion failure about seeing each const decled once.
|
||
- i915g: Allow fragment coord conventions TGSI properties to be set.
|
||
- nir/lower_int_to_float: Make sure the cursor is in the right spot.
|
||
- nir: Do peephole select on other instructions if the limit is ~0.
|
||
- nir_to_tgsi: Fix internal handling of NIR uints for !CAP_INTEGERS
|
||
- nir_to_tgsi: Support integer sysvals on !CAP_INTEGERS hardware.
|
||
- i915g: Handle fragment depth being in OUT[1] not OUT[0].
|
||
- i915g: Switch to using nir-to-tgsi.
|
||
- i915g: Add triangle provoking vertex support.
|
||
- freedreno: Move some driver debug printfs to mesa_logd.
|
||
- freedreno/ir3: Move the assert output to mesa_loge().
|
||
- util/log: Add a streaming printf interface.
|
||
- freedreno/ir3: Use mesa_log_stream() for ir3 disassembly.
|
||
- freedreno/ir3: Move the native code output to mesa_log as well.
|
||
- nir: Add an interface for logging shaders with mesa_log*.
|
||
- freedreno/ir3: Move NIR printing to mesa_log.
|
||
- gallium/draw: Garbage collect draw_set_force_passthrough
|
||
- gallium/draw: Garbage collect draw_pt_fetch_emit.
|
||
- gallium/util: Introduce a helper for finding whole-resource blits.
|
||
- freedreno: Move the rsc-based batch flushing to helper functions.
|
||
- freedreno: Handle full blit discards by invalidating the resource.
|
||
- freedreno: Cooperate with tc to stop checking the BC for resource_busy().
|
||
- ci/llvmpipe: Mark two more multithread program link flakes.
|
||
- i915g: Remove nr_cbufs loop.
|
||
- i915g: Create an i915_surface for our pipe_surfaces.
|
||
- i915g: Compute 3DSTATE_BUF_INFO flags at surface create time.
|
||
- i915g: Move cbuf color swizzle lookup to CSO creation time.
|
||
- i915g: Simplify color write mask setup.
|
||
- i915g: Use the color swizzle to reshuffle the blend const color.
|
||
- i915g: Clear xfails for vertex texturing.
|
||
- freedreno: Fix leak of the screen hash table.
|
||
- i915g: Fix GL_ARB_copy_buffer assertion fails.
|
||
- i915g: Fix bad naming of depth texture formats.
|
||
- i915g: Finish out blend factor overrides for both RGBx and A8.
|
||
- ci/i915g: Skip the piglit glx tests since we're not running X.
|
||
- freedreno/ir3: Report RA failure with mesa_loge().
|
||
- turnip: Link more MRs and issues related to our xfails.
|
||
- turnip: Use vk_startup_errorf() in more startup paths.
|
||
- ci/turnip: Document create_instance_device_intentional_alloc_fail's fail.
|
||
- turnip: Disable buffer texturing on 422 formats.
|
||
- Revert "freedreno: Cooperate with tc to stop checking the BC for resource_busy()."
|
||
- nir: Add a helper for chasing movs with nir_ssa_scalar().
|
||
- turnip: Short-circuit if ladder generation for constant index SSBO/UBOs.
|
||
- i915g: Apply clang-format.
|
||
- i915g: Bake the decls and program together.
|
||
- i915g: Allow use of I915_DEBUG= options on non-DEBUG builds.
|
||
- i915g: Enable dumping of fragment shaders under I915_DEBUG=fs.
|
||
- i915g: Use the normal compile error path for empty FSes.
|
||
- i915g: Log program compile errors to mesa_loge().
|
||
- i915g: Stop translating the fragment program on the first error.
|
||
- i915g: Improve logging of unsupported opcodes.
|
||
- i915g: replace "uint" with normal uint32_t.
|
||
- i915g: Use stdbool.h instead of custom bools.
|
||
- i915g: Remove redundant p->error setting.
|
||
- i915g: Mark program errors on setting up temps, constants, and immediates.
|
||
- i915g: Fix off-by-one in constant count assertion.
|
||
- intel: Early exit from inst_is_in_block().
|
||
- i915g: Finish the uint -> uint32_t conversion.
|
||
- i915g: Add the nice cube map layout comments from i915c.
|
||
- i915g: Fix FS debug dumping for declarations.
|
||
- i915g: Delete redundant i915_hw_sampler_views atom.
|
||
- i915g: Add curly braces for normal mesa style (and helps clang-format)
|
||
- i915g: Set up the cube map texture wrap modes.
|
||
- freedreno: Update comments about PIPE_BUFFER shadowing.
|
||
- freedreno: swap ->valid when shadowing resources.
|
||
- freedreno/a5xx: Make sure to mark blit read/write access in the BC.
|
||
- freedreno: Stop manually marking blit dst buffers as valid.
|
||
- freedreno: Swap needs_ubwc_clear when shadowing.
|
||
- freedreno: Flush the shadowed resource's write batch up front.
|
||
- i915g: Add support for per-vertex point size.
|
||
- i915g: whitespace fixup from the cube map fix.
|
||
- i915g: Force 1D textures to use wrap mode for the Y coordinate.
|
||
- i915g: Make sure the 1D texture Y channel is initialized.
|
||
- anv: Fix unused var warning on release builds from an assertion.
|
||
- nir: Add a nir_instr_remove that recursively removes dead code.
|
||
- nir: Use remove_and_dce for nir_shader_lower_instructions().
|
||
- nir: Free the instructions in a DCE instr removal.
|
||
- i915g: Fix writemasking of SEQ/SNE/SSG.
|
||
- nir_to_tgsi: Run copy prop (and thus dce) after lower_bool_to_float.
|
||
- nir_to_tgsi: Declare immediates as float on non-native-ints hardware.
|
||
- turnip: Fix allocation size for vkCmdUpdateBuffer.
|
||
- i915g: Fix dumping of 3DSTATE_BACKFACE_STENCIL_OPS.
|
||
- i915g: Fix backface stencil when front_ccw is set.
|
||
- ci: Make sure that we build the piglit dmabuf tests.
|
||
- freedreno: Suballocate our long-lived ring objects.
|
||
- freedreno/a6xx: Reduce the size of the config stateobj allocation.
|
||
- freedrneo/a6xx: Reduce the size of the long-lived texture stateobj.
|
||
- freedreno/a6xx: Allocate just enough memory for SO state, only if we do SO.
|
||
- freedreno: Optimize duplicate obj-obj ring relocs.
|
||
- i915g: Fix release build compiler warnings.
|
||
- ci: Enable testing of i915g in the the debian -Werror release build.
|
||
- freedreno: Lock access to msm_pipe for RB object suballocation.
|
||
|
||
Enrico Galli (10):
|
||
|
||
- microsoft/compiler: zero out unused WebGPU system values
|
||
- microsoft/compiler: Remove de-duplication of arbitrary semantic names
|
||
- d3d12, microsoft/compiler: Switching semantic names to TEXCOORD
|
||
- d3d12, microsoft/compiler: Moving driver_location allocation to compiler
|
||
- util: Add simple test for util_qsort_r
|
||
- util: Add qsort_r/s args adapter for MSVC and BSD/macOS
|
||
- nir: Add modes filter to nir_sort_variables
|
||
- microsoft/compiler: Switch io sort to use nir_sort_variables_with_modes
|
||
- microsoft/spirv_to_dxil: Add drive_location assignment
|
||
- microsoft/compiler: Add support for get_ssbo_size to translator
|
||
|
||
Eric Engestrom (15):
|
||
|
||
- VERSION: bump to 21.2.0-devel
|
||
- docs: reset new_features.txt
|
||
- egl/x11: don't forget to exit the attrib list loop
|
||
- docs: add release notes for 21.1.0
|
||
- docs: add release notes for 21.1.1
|
||
- docs: update calendar and link releases notes for 21.1.0
|
||
- docs: update calendar and link releases notes for 21.1.1
|
||
- docs/release-calendar: add the schedule for the 21.1 branch
|
||
- docs: add release notes for 21.1.2
|
||
- docs: update calendar and link releases notes for 21.1.2
|
||
- docs: add release notes for 21.1.3
|
||
- docs: update calendar and link releases notes for 21.1.3
|
||
- docs: add release notes for 21.1.4
|
||
- docs: update calendar and link releases notes for 21.1.4
|
||
- docs/release-calendar: add a few more 21.1 releases
|
||
|
||
Erico Nunes (7):
|
||
|
||
- gallium/hud: create vs_text to match fs_text
|
||
- gallium/hud: extend check for has_srgb
|
||
- docs/lima: add an initial page for Lima
|
||
- lima: enable z16 format
|
||
- lima: add reload command to the command dump
|
||
- meson: kmsro: require dri3 for X11
|
||
- lima: avoid crash with negative viewport values
|
||
|
||
Erik Faye-Lund (193):
|
||
|
||
- zink: fix stencil-export cap emission
|
||
- lavapipe: resolve border-color when creating sampler
|
||
- lavapipe: implement VK_EXT_custom_border_color
|
||
- nir/lower_tex: do not stumble on 16-bit inputs
|
||
- zink: document requirement of VK_EXT_custom_border_color
|
||
- gallivm: handle 16-bit input in i2b32
|
||
- gallivm: run nir_opt_algebraic_late
|
||
- gallivm: add 16-bit integer support
|
||
- zink: do not require vulkan memory model for shader-images
|
||
- docs: write basic meta-documentation
|
||
- zink: do not read outside of array
|
||
- docs: remove out-of-date gles info
|
||
- docs: remove documentation of MESA_CI_VISUAL
|
||
- docs: remove documentation of MESA_PRIVATE_CMAP
|
||
- docs: remove documentation of MESA_HPCR_CLEAR
|
||
- docs: nest cherry-pick example under note
|
||
- docs: use tables instead of pre-formatted text
|
||
- docs: use math notation for example matrices
|
||
- docs: use code-block for console-content
|
||
- docs: use code-block for glsl
|
||
- docs: use code-block for c
|
||
- docs: use code-block for ini
|
||
- zink: only emit extended-formats cap if needed
|
||
- zink: remove memory-model leftovers
|
||
- docs: fixup link to extension
|
||
- docs: fix quoting around a few limits
|
||
- zink: correct image cap checks
|
||
- docs: add missing zink-requirement
|
||
- docs: someome -> someone
|
||
- zink: enable required instance ext
|
||
- zink: make zink_binding private
|
||
- zink: remove stray semicolons
|
||
- zink: fixup bad indentation
|
||
- docs: remove out-of-date versions doc
|
||
- zink: fix shader-image requirements
|
||
- zink: correct an extension-link
|
||
- docs: fixup indentation of radeonsi envvar values
|
||
- docs: document r600 envvars
|
||
- zink: use UINT32_MAX instead of UINT_MAX
|
||
- zink: respect bit-size of dref-result
|
||
- zink: run nir_opt_algebraic_late
|
||
- zink: always lower function-temp derefs
|
||
- zink: support emitting 16-bit int types
|
||
- zink: enable 16-bit int support
|
||
- zink: support emitting 16-bit float types
|
||
- zink: perform fp16 texture-lookups as fp32 and then convert
|
||
- zink: enable 16-bit float support
|
||
- zink/codegen: prefer first definition of prop/feature structs
|
||
- zink: also enable float16 from KHR extension
|
||
- lavapipe: consistently use nir macros
|
||
- docs: update gallium doxygen docs
|
||
- zink: handle matrix-types after vectors
|
||
- zink: cache SpvId for aggregate glsl_types
|
||
- zink: always enable fixed shader-caps
|
||
- zink: do not check for varying output for fragment shaders
|
||
- zink: emit cap early
|
||
- zink: remove needless shader-info from context
|
||
- zink: emit sample-shading cap early
|
||
- zink: emit cap early
|
||
- zink: only emit ImageBuffer cap if needed
|
||
- docs: do not generate redirects on error
|
||
- gallium/u_vbuf: avoid dereferencing NULL pointer
|
||
- freedreno/a5xx: Remove ppgtt hack
|
||
- docs: remove doxygen support
|
||
- zink: remove incorrect border-swizzle assumption
|
||
- lavapipe: emit correct textures_used for texture-arrays
|
||
- zink: do not ask glsl-compiler to unroll
|
||
- lavapipe: fix fsum with swizzle
|
||
- st/mesa: do not take util_logbase2 of a negative size
|
||
- zink: check for error when binding memory
|
||
- gallium: allow to report errors from p_screen::resource_bind_backing
|
||
- lavapipe: report out-of-memory when binding
|
||
- llvmpipe: allow calculating size of overly large texture
|
||
- lavapipe: report allocation-error
|
||
- lavapipe: correct reported number of UBOs
|
||
- translate: reserve more vertex-shader outputs
|
||
- translate: assert that nr_elements is in range
|
||
- ci: Uprev piglit to 3351e8952 ("max-texture-size: report merged results")
|
||
- docs/features: document GL_ARB_ES3_2_compatibility support for zink
|
||
- docs/features: mark a few more extensions as done for zink
|
||
- zink: fix provoking-vertex cap for quads
|
||
- docs: promote #dri-devel on oftc over freenode
|
||
- docs: update link to #zink
|
||
- docs: update location of #panfrost
|
||
- docs: update link to #lima
|
||
- zink: simplify emit_load_const
|
||
- v3d: use helper to simplify things
|
||
- ci: downgrade sphinx to v3.x
|
||
- docs: update another IRC reference
|
||
- docs: update another IRC reference
|
||
- docs: drop clayton from intel-ci notice
|
||
- zink: use actual const for const offset
|
||
- lavapipe: handle cube-array image-views
|
||
- lavapipe: do not interpret cube-compatible as cubemap
|
||
- zink: only mark resources as cube-compatible if supported
|
||
- zink: mark 2d-arrays as cube-compatible
|
||
- zink: implement half-float packing
|
||
- zink: untangle have_EXT_debug_utils and ZINK_DEBUG_VALIDATION
|
||
- zink: add support for string-markers
|
||
- util/prim_restart: revert part of bad fix
|
||
- docs: quote a few defines
|
||
- docs: fix header-levels in envvars.rst
|
||
- docs: use file-role for paths
|
||
- docs: use envvar role for envvars
|
||
- docs: add the doc-comment for fse-vars
|
||
- docs: do not list all gles major versions
|
||
- docs: update list of apis to match website
|
||
- docs: update llvm requirement
|
||
- docs: rename vmware-guest article
|
||
- docs: clean up list of deprecated systems
|
||
- docs: move swrast to deprecated drivers list
|
||
- docs: clean up software-drivers list
|
||
- docs: clean up openswr links
|
||
- docs: split out layered driver to its own list
|
||
- docs: clean up freedreno links
|
||
- docs: add links to documented drivers
|
||
- r600: explicitly advertise index buffer format support
|
||
- zink: limit images we mark as cube-compatible
|
||
- zink: rename spirv_15 bool to spirv_1_4_interfaces
|
||
- zink: allow to specify any spir-v version to nir_to_spirv
|
||
- zink: calculate spir-v version based on vk version
|
||
- zink: only enable vote if we can support it
|
||
- zink: use a macro for spir-v versions
|
||
- st/pbo: use correct type for images and textures
|
||
- docs: update master -> main in edit-links
|
||
- zink/ci: increase piglit and deqp-runner timeouts
|
||
- llvmpipe: fix edge-rule logic for lines
|
||
- llvmpipe: consistently deal with post-rast state
|
||
- llvmpipe: fix multisample lines again
|
||
- llvmpipe: do not always use pixel-rounded coordinates for points
|
||
- zink/ci: re-enable test
|
||
- zink: reject more illegal blits
|
||
- zink: limit non-extension version feature to spirv 1.5
|
||
- zink: use correct type for u_bit_scan
|
||
- zink: do not unmap dt-buffers twice
|
||
- zink: drop paranoid code
|
||
- zink: add missing compiler-dependency
|
||
- zink: drop some more vla usage
|
||
- zink: fix more initializer styles
|
||
- zink: introduce a define for max descriptors per type
|
||
- zink: use max-descriptor define
|
||
- zink: use alloca instead of hard-to-size vlas
|
||
- zink: correct type of flags to flush
|
||
- zink: fixup signedness of subtraction
|
||
- zink: remove unused function
|
||
- zink: drop repeated usage-bit
|
||
- zink: do not check buffer-format for usage-bits
|
||
- docs: remove outdated meson-section
|
||
- docs: remove outdated clarification
|
||
- docs: drop historic meson details
|
||
- docs: use more file-roles
|
||
- docs: use rst captions
|
||
- wgl: remove hard limit on pixelformats
|
||
- zink: drop unused macros
|
||
- zink: remove unused function-pointers
|
||
- zink: unbreak moltenvk code
|
||
- zink: remove unused moltenvk functions
|
||
- zink: do not store moltenvk functions in screen
|
||
- zink: remove some needless moltenvk details
|
||
- libgl-gdi: add missing include
|
||
- iris/ci: disable amly jobs
|
||
- aux/trace: fix bool argument
|
||
- zink: cast pointers to uintptr_t
|
||
- ci/windows: work around meson encoding issues
|
||
- ci/windows: enable msvc builds of zink
|
||
- ci/windows: fix zink msvc build-rules
|
||
- gallium/u_threaded: do not apply start twice
|
||
- ci: fix source-deps for radv on windows
|
||
- zink: hook up line-rasterization ext
|
||
- zink: use bit-allocation for boolean rasterizer-state
|
||
- zink: support line stippling
|
||
- zink: fill in the right line-mode based on state
|
||
- docs: update zink requirements
|
||
- llvmpipe: reject unsupported shader-image formats
|
||
- lavapipe: query formats for shader-image support
|
||
- llvmpipe: only report supported shader-image formats
|
||
- lavapipe: expose more storage-image features
|
||
- lavapipe: do not disable multisampling for smooth lines
|
||
- lavapipe: fix disable_multisample condition
|
||
- gallium: explicitly specify line rasterization mode
|
||
- draw: respect line_rectangular state
|
||
- llvmpipe: respect rectangular_lines
|
||
- lavapipe: re-expose line-rasterization extension
|
||
- lavapipe: expose strict-lines feature
|
||
- zink: implement support for non-planar DRM modifiers
|
||
- zink: remove duplicate format-mapping on little-endian
|
||
- vulkan: do not map zero-sized region of memory
|
||
- vulkan: allocate host-visible memory for swapchain images
|
||
- zink: check for right feature
|
||
- zink: respect line_rectangular state
|
||
- lavapipe: do not assert on more than 32 samplers
|
||
- lavapipe: do not mark unsupported tests as crashing
|
||
- d3d12: split up root parameter update and set
|
||
|
||
Erik Kurzinger (1):
|
||
|
||
- vulkan/device_select: avoid segfault on Wayland if wl_drm is unavailable
|
||
|
||
Ernst Sjöstrand (1):
|
||
|
||
- nv50: Fix use of initializers on older compilers
|
||
|
||
Ezequiel Garcia (2):
|
||
|
||
- panfrost: Add GPU IDs for G52 1-Core-2EE (RK3568/RK3566)
|
||
- panfrost: Rename G52 product ID 0x7402 as G52r1
|
||
|
||
Felix DeGrood (16):
|
||
|
||
- intel: add L3 Bypass Disable to gen xml
|
||
- iris: Cache VB/IB in L3$ for Gen12
|
||
- iris: reduce redundant tile cache flushes
|
||
- intel/blorp: remove tile flush from emit surface state
|
||
- intel/compiler: Use switch for DERIVATIVE_GROUP logic
|
||
- intel/compile: refactor DERIVATIVE_GROUP logic
|
||
- intel/compiler: tileY friendly LID order for CS
|
||
- intel/compiler: balanced tileY/linear friendly LID order for CS
|
||
- anv: Cache VB/IB in L3$ for Gfx12
|
||
- anv: Add debug messages for DEBUG_PIPE_CONTROL
|
||
- anv: Clear all pending stall after pipe flush
|
||
- anv: Remove Tile Cache flush from SBA, Pipe Select
|
||
- anv: remove unnecessary Tile Cache flushes
|
||
- anv: Only flush Tile Cache on VK_ACCESS_HOST_R/W
|
||
- anv: Add ANV_PIPE_HDC_PIPELINE_FLUSH_BIT
|
||
- anv: Replace DC Flush with HDC Pipeline Flush
|
||
|
||
Francisco Jerez (20):
|
||
|
||
- intel/fs: Implement representation of SWSB cross-pipeline synchronization annotations.
|
||
- intel/fs: Add helper functions inferring sync and exec pipeline of an instruction.
|
||
- intel/fs: Represent SWSB in-order dependency addresses as vectors.
|
||
- intel/fs: Calculate SWSB cross-pipeline synchronization information.
|
||
- intel/fs: Use CHV/BXT implementation of 64-bit MOV_INDIRECT on XeHP+.
|
||
- intel/fs: Fix repclear assembly for XeHP+ regioning restrictions.
|
||
- intel/fs: Handle regioning restrictions of split FP/DP pipelines.
|
||
- intel/eu: Teach EU validator about FP/DP pipeline regioning restrictions.
|
||
- intel/compiler: Lower integer division on XeHP.
|
||
- intel/fs: Introduce lowering pass to implement derivatives in terms of quad swizzles.
|
||
- intel/fs: Add more efficient fragment coordinate calculation.
|
||
- iris/gen12: Work around push constant corruption on context switch.
|
||
- iris/gfx12: Invalidate ISP at the end of every batch.
|
||
- intel/fs/xehp: Assert that the compiler is sending all 3 coords for cubemaps.
|
||
- intel/fs: Track single accumulator in scoreboard lowering pass.
|
||
- intel/fs: Implement Wa_22012725308 for cross-pipe accumulator data hazard.
|
||
- intel/fs: Add SWSB dependency annotations for cross-pipeline WaR data hazards on XeHP+.
|
||
- intel/fs: Teach IR about EOT instruction writing the accumulator implicitly on TGL+.
|
||
- intel/fs: Fix synchronization of accumulator-clearing W/A move on TGL+.
|
||
- intel/fs: Implement Wa_14013745556 on TGL+.
|
||
|
||
Georg Lehmann (11):
|
||
|
||
- radv: Fix compatible image handle type for dmabufs.
|
||
- v3dv: use VKAPI_ATTR and VKAPI_CALL.
|
||
- zink: Add a missing VKAPI_ATTR.
|
||
- vulkan: Update the XML and headers to 1.2.180
|
||
- radv: Implement VK_EXT_global_priority_query.
|
||
- ac: Check me_fw_feature for 32bit predication on gfx10.3
|
||
- ac: Enable 32bit predication on gfx10.
|
||
- ac: Enable 32bit predication on gfx9 with fw feature version 52.
|
||
- lavapipe: Use common default allocator.
|
||
- lavapipe: Add a missing VKAPI_ATTR.
|
||
- vulkan/wsi/wayland: Add support for more SRGB formats.
|
||
|
||
Gert Wollny (40):
|
||
|
||
- Revert "r600: don't set an index_bias for indirect draw calls"
|
||
- Revert "r600: Don't advertise support for scaled int16 vertex formats"
|
||
- r600: don't set an index_bias for indirect draw calls
|
||
- virgl: use pipe_draw_info::restart_index only when primitive_restart is enabled
|
||
- r600: update pipe_draw_info::restart_index only when primitive_restart is enabled
|
||
- nir/opt_algebraic: optimizations for add umax/umin with zero
|
||
- nir: Add filter callback for lower_to_scalar to the options
|
||
- gallium: pass lower_to_scalar_filter to lower_to_scalar pass
|
||
- r600/sfn: lower to scalar with filter applied
|
||
- mesa: add an extension MESA_bgra
|
||
- compiler/nir: check whether var is an input in lower_fragcoord_wtrans
|
||
- nir/linker: add option to ignore the IO precisions for better varying packing
|
||
- r600/sfn: Ignore precision when linking
|
||
- r600: don't put INTERP_X and INTERP_Z into one instruction group
|
||
- r600/sfn: Use valid pixel mode only in fragment shaders
|
||
- r600/sfn: Use valid pixel mode for SSBO and Image result fetches
|
||
- r600/sfn: force new CF if fetch through TC would be used in same clause
|
||
- r600/sfn: Lower FS pos input w-transform in NIR
|
||
- r600/sfn: Don't check the faction when searching for the input slot
|
||
- r600/sfn: count only distinct literals per instruction group
|
||
- r600/sfn: Fix Cayman trans ops
|
||
- r600/sfn: Use unified index register code for samplers
|
||
- r600/sfn: Use unified code path for index register load
|
||
- r600/sfn: Fix texture gather for Cayman
|
||
- r600/sfn: Fix ssbo/image atomic swap for Cayman
|
||
- r600/sfn: Fix Cayman SSBO write with more than one value
|
||
- r600/sfn: Fix Geometry shader for Cayman
|
||
- r600/sfn: read number of images from shader info
|
||
- r600/sfn: Fix cube query layer number for indirect access
|
||
- r600/sfn: Add lowering pass to legalize image access
|
||
- r600/sfn: legalize image access on Cayman
|
||
- r600: Enable NIR debug flags also for Cayman
|
||
- r600/sfn: don't designates initializers, since they are c++20
|
||
- r600/sfn: don't read back unused image atomic result values
|
||
- r600/sfn: Drop method for emit_atomic_add, it is handled in generic code
|
||
- r600/sfn: Don't read return values of atomic ops that are not used
|
||
- r600/sfn: Clean up some ALU lowering and move code
|
||
- r600/sfn: Lower offset in TXF instructions
|
||
- virgl: Enable ASTC formats also for 3D textures
|
||
- r600/sfn: initialize all texture lower options
|
||
|
||
Gustavo Padovan (10):
|
||
|
||
- traces-iris: fix expectation for Intel GLK
|
||
- gitlab-ci: enable Intel AML-Y as experimental
|
||
- gitlab-ci: rule anchor for experimental devices as manual in MRs
|
||
- gitlab-ci: enable all 3 intel devices as manual in MR pipelines
|
||
- iris/ci: disable failing gimark test for now
|
||
- iris/ci: enable intel devices automatically in MR pipelines
|
||
- gitlab-ci: add python script to submit lava jobs
|
||
- gitlab-ci: enable testing on Intel Kaby Lake as experimental
|
||
- ci/lava: propely report test failure through sys.exit()
|
||
- ci/lava: do not save lava.yaml in the artifacts
|
||
|
||
Hans-Kristian Arntzen (2):
|
||
|
||
- radv: Take image alignment into account when allocating MUTABLE pool.
|
||
- radv: Allocate buffer list for MUTABLE descriptor types as well.
|
||
|
||
Heinrich Fink (6):
|
||
|
||
- softpipe: add missing sentinel to debug option array
|
||
- llvmpipe: unmap display target of shader image/sampler
|
||
- softpipe: unmap display target of shader sampler
|
||
- llvmpipe: do not leak map of display target in fs setup
|
||
- llvmpipe: do not leak display target mapped ptr in cs setup
|
||
- gbm/dri: Fix leaking bo memory on failure path
|
||
|
||
Hoe Hao Cheng (15):
|
||
|
||
- vulkan/util: generate vk_dispatch_table that combines all dispatch tables
|
||
- nir: define NIR_ALU_MAX_INPUTS
|
||
- zink: remove variable length arrays in ntv
|
||
- zink: introduce vk_dispatch_table
|
||
- zink/codegen: split commands into three groups
|
||
- zink/codegen: add zink_verify_*_extensions()
|
||
- zink: slight refactor of load_device_extensions()
|
||
- zink: use the dispatch tables
|
||
- zink/codegen: allow conditional enabling of instance extensions
|
||
- zink/codegen: clean the constructor of Extension up
|
||
- zink: do not fail when EXT_calibrated_timestamps is unavailable
|
||
- zink: move extension function verification to when it is used
|
||
- zink: zero-init structs with ISO C
|
||
- zink: standardize zero-init code style
|
||
- zink: make codegen compatible with python 3.5
|
||
|
||
Hubert Jasudowicz (1):
|
||
|
||
- docs/egl: Add missing backticks
|
||
|
||
Hyunjun Ko (6):
|
||
|
||
- turnip: prep work for timeline semaphore support
|
||
- turnip: Implement VK_KHR_timeline_semaphore.
|
||
- turnip/kgsl: Fix to build on android.
|
||
- turnip: add missing VKAPI_ATTR/CALL
|
||
- turnip: Copy command buffers to deferred submit request
|
||
- turnip/kgsl: new flag TU_USE_KGSL
|
||
|
||
Iago Toral Quiroga (118):
|
||
|
||
- v3dv: avoid redundant BO job additions for textures and samplers
|
||
- v3dv: avoid redundant BO job additions for UBO/SSBO
|
||
- v3dv: avoid redundant BO job additions for spill / shared BOs
|
||
- v3dv: optimize a few cases of BO job additions
|
||
- v3dv: use a bitfield to implement a quick check for job BO tracking
|
||
- v3dv: fix descriptor set limits
|
||
- v3dv: fix array sizes when tracking BOs during uniform setup
|
||
- v3dv: don't use a dedicated BO for each occlusion query
|
||
- v3dv: fix sRGB blending workaround
|
||
- v3dv: improve dirty descriptor set state tracking
|
||
- v3dv: dirty viewport doesn't affect fragment shaders
|
||
- v3dv: better tracking of dirty push constant state
|
||
- vulkan/wsi: give drivers the option to decide if they need to blit
|
||
- v3dv: implement wsi hook to decide if we can present directly on device
|
||
- compiler/nir: add a divergence analysis option for non-uniform workgroup id
|
||
- v3dv: choose a larger CSD supergroup size if possible
|
||
- broadcom/compiler: track if a shader has control barriers in prog_data
|
||
- v3dv: limit supergroup size in presence of TSY barriers
|
||
- broadcom/common: move CSD supergroup sizing to a common helper
|
||
- v3d: choose a larger CSD supergroup size if possible
|
||
- broadcom/compiler: add a loop unrolling pass
|
||
- v3dv: setup loop unrolling
|
||
- v3d: move NIR compiler options to GL driver
|
||
- broadcom/compiler: add a compiler strategy to disable loop unrolling
|
||
- broadcom/compiler: refactor compile strategies
|
||
- broadcom/compiler: specify maximum thread count in compile strategies
|
||
- v3d: enable NIR loop unrolling
|
||
- v3d: re-enable GLSL loop unrolling
|
||
- broadcom/compiler: change register allocation policy for accumulators
|
||
- broadcom/compiler: move vertex shader output handling to its own function
|
||
- broadcom/compiler: implement non-uniform offset on vertex outputs
|
||
- broadcom/compiler: make vir_VPM_WRITE_indirect handle non-uniform offsets
|
||
- broacom/compiler: enable PIPE_SHADER_CAP_INDIRECT_OUTPUT_ADDR
|
||
- broadcom/compiler: don't use nir_src_is_dynamically_uniform
|
||
- v3dv: don't lower indirect derefs on output variables
|
||
- broadcom/compiler: don't unroll due to indirect indexing of outputs
|
||
- v3d: disable GLSL loop unrolling again
|
||
- broadcom/compiler: clarify PIPE_SHADER_CAP_INDIRECT_INPUT_ADDR setting
|
||
- broadcom/compiler: don't emit TLB loads for components that don't exist
|
||
- broadcom/compiler: consider RT component size when lowering logic ops in Vulkan
|
||
- broadcom/ci: update fail list for v3dv
|
||
- v3d: take TLB blit framebuffer dimensions from smallest surface dimensions
|
||
- v3dv: implement VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_POINT_CLIPPING_PROPERTIES
|
||
- v3dv: fix texture_size()
|
||
- v3dv: allow creating uncompressed views from compressed images and vice versa
|
||
- v3dv: expose VK_KHR_maintenance2
|
||
- v3dv: define V3D_MAX_BUFFER_RANGE
|
||
- v3dv: implement VK_KHR_maintenance3
|
||
- v3dv: implement VK_KHR_bind_memory2
|
||
- v3dv: implement VK_KHR_get_memory_requirements2
|
||
- v3dv: keep track of whether an image may be backed by external memory
|
||
- v3dv: implement VK_KHR_dedicated_allocation
|
||
- v3dv: trivially handle VK_STRUCTURE_TYPE_EXPORT_MEMORY_ALLOCATE_INFO_KHR
|
||
- v3dv: add v3dv_GetImageSparseMemoryRequirements back
|
||
- v3dv: implement vkCmdDispatchBase
|
||
- v3dv: create a helper for image creation
|
||
- v3dv: implement interactions of VK_KHR_device_group with VK_KHR_swapchain
|
||
- v3dv: implement VK_KHR_device_group
|
||
- v3dv: don't keep an open file descriptor for imported fences/semaphores
|
||
- v3dv: implement external semaphore/fence extensions
|
||
- v3dv: increase number of supported SSBOs
|
||
- v3dv: expose KHR_relaxed_block_layout
|
||
- v3dv: document VK_KHR_relaxed_block_layout as implemented
|
||
- v3dv: expose VK_KHR_storage_buffer_storage_class
|
||
- v3dv: refactor descriptor updates
|
||
- v3dv: implement VK_KHR_descriptor_update_template
|
||
- v3dv: fix incorrect render area setup
|
||
- v3dv: expose KHR_variable_pointers
|
||
- v3dv: don't lower vulkan resource index result to scalar
|
||
- v3dv: implement VK_KHR_get_display_properties2
|
||
- v3dv: handle Vulkan 1.1 feature and property queries
|
||
- v3dv: don't support VK_IMAGE_CREATE_BLOCK_TEXEL_VIEW_COMPATIBLE_BIT
|
||
- Revert "v3dv: allow creating uncompressed views from compressed images and vice versa"
|
||
- v3dv: expose VK_KHR_shader_non_semantic_info
|
||
- v3dv: implement VK_EXT_index_type_uint8
|
||
- v3dv: implement vkCmdBlitImage2KHR
|
||
- v3dv: implement vkCmdCopyBuffer2KHR
|
||
- v3dv: implement vkCmdCopyBufferToImage2KHR and vkCmdCopyImageToBuffer2KHR
|
||
- v3dv: implement vkCmdCopyImage2KHR
|
||
- v3dv: implement vkCmdResolveImage2KHR
|
||
- v3dv: expose VK_KHR_copy_commands2
|
||
- v3dv: remove const qualifier for resource pointer in view objects
|
||
- broadcom/compiler: implement nir_intrinsic_load_subgroup_id correctly
|
||
- broadcom/compiler: lower nir_intrinsic_load_num_subgroups
|
||
- broadcom/compiler: add FLAFIRST and FLNAFIRST opcodes
|
||
- broadcom/compiler: implement more subgroup intrinsics
|
||
- broadcom/compiler: add a ntq_emit_cond_to_bool helper
|
||
- broadcom/compiler: add a set_a_flags_for_subgroup helper
|
||
- broadcom/compiler: track if a compute shader uses subgroup functionality
|
||
- broadcom/util: don't use compute supergroup packing with subgroups
|
||
- v3dv: expose correct subgroup size
|
||
- v3dv: expose support for basic subgroup operations
|
||
- broadcom/compiler: use nir_sort_variables_with_modes
|
||
- v3dv: account for dst offset of copy query results operations
|
||
- v3dv: always free pipeline stages after compiling
|
||
- v3dv: extend broadcom stages to include geometry
|
||
- v3dv: define a generic helper to create binning pipeline stages
|
||
- v3dv: add a few more broadcom shader stage helpers
|
||
- broadcom/compiler: track if geometry shaders write gl_PointSize
|
||
- v3dv: add support for geometry shaders to pipelines
|
||
- broadcom/compiler: create a helper for computing VPM config
|
||
- v3dv: emit state packets for geometry shaders
|
||
- v3dv: handle QUNIFORM_FB_LAYERS
|
||
- v3dv: fix copy buffer to image TFU path for 3D images
|
||
- broadcom/compiler: handle compact input arrays for geometry shaders
|
||
- broadcom/compiler: don't ignore constant offset on per-vertex input loads
|
||
- v3dv: implement layered attachment clears
|
||
- v3dv: remove fallback path for vkCmdClearAttachments
|
||
- v3dv: remove deferred vkCmdClearAtachments path
|
||
- broadcom/ci: update expected fails for v3dv after enabling geometry shaders
|
||
- v3dv: expose geometry shaders
|
||
- v3dv: fix push constant range for texel buffer copy pipelines
|
||
- v3dv: implement layered texel buffer copies using a geometry shader
|
||
- v3dv: allow batching texel buffer copies for 3D images
|
||
- v3dv: use defines for push constant offsets used by texel buffer copy shaders
|
||
- v3d: better scissor tracking
|
||
- broadcom/compiler: implement gl_PrimitiveID in FS without a GS
|
||
- v3dv: remove more dead clearing code
|
||
|
||
Ian Romanick (49):
|
||
|
||
- tgsi_exec: Fix NaN behavior of saturate
|
||
- tgsi_exec: Fix NaN behavior of min and max
|
||
- ci: Uprev piglit to b3a9fa345 ("framework/replay: Quote resource names before signing")
|
||
- tgsi_exec: Use C99 functions for min and max instead of open coding
|
||
- gallivm: Fix NaN behavior of min and max
|
||
- gallivm: Use range analysis to generate better fmin and fmax code
|
||
- gallivm: Use GALLIVM_NAN_RETURN_OTHER_SECOND_NONNAN for norm clamping
|
||
- gallivm: Remove unused GALLIVM_NAN_RETURN_NAN
|
||
- nir/algebraic: Remove some optimizations of comparisons with fsat
|
||
- nir/algebraic: Tautology replacements require sources be numbers
|
||
- nir/algebraic: Invert comparisons less often
|
||
- nir/algebraic: Equality comparison inversions require sources be numbers
|
||
- nir/algebraic: Mark some more comparison reductions exact
|
||
- nir/algebraic: Mark some more logic-joined comparison reductions as exact
|
||
- nir/algebraic: Rearrange some logic-joined comparisons and reduce
|
||
- nir/algebraic: Add algebraic opt for float comparisons with identical operands.
|
||
- util/format: Delete trailing whitespace
|
||
- dri: Fix typo before __DRI_IMAGE_COMPONENTS defines
|
||
- egl/dri2: Rely on drm-uapi for DRM_FORMAT defines
|
||
- mesa/st: Always call st_nir_lower_tex_src_plane if samplers were lowered
|
||
- nir/lower_tex: Add support for lowering Y41x formats
|
||
- util/format: Add Y41x formats
|
||
- egl/dri2: Add Y41x formats
|
||
- gallium/dri: Add Y41x formats
|
||
- util/format: Add Y21x formats
|
||
- egl/dri2: Add Y21x formats
|
||
- gallium/dri: Add Y21x formats
|
||
- intel/isl: Add mappings for PIPE_FORMAT_R8G8_R8B8_UNORM and PIPE_FORMAT_G8R8_B8R8_UNORM
|
||
- mesa: Add R8G8_R8B8 and G8R8_B8R8 formats
|
||
- nir/lower_tex: Add support for lowering YUYV formats
|
||
- gallium/dri: Allow use of R8G8_R8B8 for YUYV and G8R8_B8R8 for UYVY
|
||
- iris: Return correct enum names from fmt_swizzle
|
||
- iris: Silence warnings about implicit enum type conversions
|
||
- mesa/st: Don't assert !unify_interfaces in the passthrough edge flags case
|
||
- radeonsi: Use util_cpu_caps to detect number of CPUs
|
||
- util: Zero out all of mask in util_set_thread_affinity
|
||
- util: Change order of PIPE_OS_UNIX code in util_cpu_detect_once
|
||
- util: Trivial cleanup in the BSD code of util_cpu_detect_once
|
||
- util: Fix setting nr_cpus on some BSD variants
|
||
- util: Set util_cpu_caps.num_cpu_mask_bits based on total CPUs in the system
|
||
- util: Use maximum number of CPUs for determining cache topology
|
||
- util: Consider CPU affinity when detecting number of CPUs
|
||
- v3d: ci: Add KHR-GLES31.core.shader_image_load_store.basic-glsl-earlyFragTests to flakes
|
||
- intel/compiler: Add the ability to defer IP updates in backend_instruction::remove
|
||
- intel/compiler: Add cfg_t::adjust_block_ips() method
|
||
- intel/compiler: Update block IPs once in dead_code_eliminate
|
||
- intel/compiler: Update block IPs once in register_coalesce
|
||
- intel/compiler: Update block IPs once in opt_cmod_propagation
|
||
- nir/gcm: Clear out pass_flags before starting
|
||
|
||
Icecream95 (38):
|
||
|
||
- panfrost: Assert staging resource allocation was successful
|
||
- panfrost: Unset shared/scanout binding flags for staging resources
|
||
- pan/bi: Skip nir_opt_move/sink for blend shaders
|
||
- panfrost: Fix shader texture count
|
||
- pan/decode: Allow frame shader DCDs to be in another BO than the FBD
|
||
- pan/decode: Print errors when closing dump file
|
||
- pan/mdg: Fix calculation of available work registers
|
||
- panfrost: Remove incorrect comment
|
||
- panfrost: Fix viewport scissor for preload draws
|
||
- panfrost: Split panfrost_batch_submit to prevent stack overflows
|
||
- pan/bi: Add "lane_dest" modifier
|
||
- pan/bi: Replace lane0 modifier with lane_dest for load instructions
|
||
- panfrost: Make pan_select_crc_rt a non-static function
|
||
- panfrost: Always write reloaded tiles when making CRC data valid
|
||
- pan/bi: Add two tuples to a clause when needed with NOSCHED
|
||
- panfrost: Skip blit shader labelling if the buffer has no space
|
||
- panfrost: Fix polygon list size computations
|
||
- pan/mdg: Try scheduling load/store ops in pairs
|
||
- pan/decode: Flush the dump stream after decoding
|
||
- panfrost: Call abort() when aborting on fault
|
||
- panfrost: Use first_tiler to check if tiling is needed
|
||
- pan/mdg: Add a bundle ID to instructions
|
||
- pan/mdg: Reorder some code in mir_spill_register
|
||
- pan/mdg: Fill from TLS before spilling non-SSA nodes
|
||
- pan/mdg: Fix reading a spilt register in the bundle it's written
|
||
- pan/mdg: Add 16 bytes of padding to the end of shaders
|
||
- panfrost: Don't set dirty_mask for constant buffers
|
||
- pan/bi: Create a mask of UBOs that need to be uploaded
|
||
- pan/mdg: Create a mask of UBOs that need to be uploaded
|
||
- panfrost: Only upload UBOs when needed
|
||
- panfrost: Set bound dimensions to framebuffer size
|
||
- Revert "panfrost: Fix crc_valid condition"
|
||
- panfrost: Always use a fragment shader when alpha test is enabled
|
||
- panfrost: Fix GPU ID for t76x in get_perf_config
|
||
- panfrost: Fix full_threads calculation on v6
|
||
- pan/bi: Create a nop clause when the shader starts with ATEST
|
||
- panfrost: Initialise the blend equation in create_blend_state
|
||
- pan/mdg: Analyze helper termination after scheduling
|
||
|
||
Ilia Mirkin (29):
|
||
|
||
- nv50/ir: offset accesses to shared memory
|
||
- nv50/ir: refine limitation on load/store loading offsets, include atomics
|
||
- nv50/ir: "zero" register does not work with g[] memory
|
||
- nv50/ir: mark ATOM as having 3 arguments
|
||
- nv50/ir: wipe any info about memory when seeing a locking op
|
||
- nv50/ir: optimize shift of 0 bits
|
||
- nv50: pass surface/buffer parameters to shader via aux buffer
|
||
- nv50/ir: add surface op lowering
|
||
- nv50/ir: add lowering for shared atomics
|
||
- nv50: add compute invocations counter
|
||
- nv50: add remapping of buffers/images into unified space
|
||
- nv50: add support for doing membars
|
||
- nv50: add indirect compute support
|
||
- intel: fix MI builder for pre-gen7
|
||
- nv50: fix streamout queries
|
||
- nvc0: fix 3d images
|
||
- vdpau: allow state tracker to report a lower number of macroblocks
|
||
- nouveau: improve video limit reporting
|
||
- st/mesa: avoid enabling image/buffer/compute extensions for weak hardware
|
||
- mesa: relax ES 3.1 compute shader requirements
|
||
- st/mesa: properly encode OES_geometry_shader requirement
|
||
- mesa/get: allow image/buffer/atomic variables to be fetched in es3.1
|
||
- st/mesa: allow hardware to claim ES 3.1 without hw indirect draws
|
||
- nv50: expose images/buffers/compute
|
||
- nv50: expose GL ES 3.1 for nva3+ hardware
|
||
- mesa: always expose NV_image_formats and OES_shader_image_atomic
|
||
- mesa: also flush after compute dispatch when debug flag enabled
|
||
- nv50: use the no-mipmap texture type for 2d ms views
|
||
- st/mesa: always report the max samples as supported
|
||
|
||
Ishi Tatsuyuki (1):
|
||
|
||
- radv: ignore redundant variable descriptor counts (v2)
|
||
|
||
Italo Nicola (28):
|
||
|
||
- pan/mdg: fix midgard writemask encoding for stores
|
||
- util: add util_sign_extend
|
||
- pan/mdg: clean up redundant/unused variables in disassemble.c
|
||
- pan/mdg: rename dest_override to shrink_mode
|
||
- pan/mdg: improve outmod printing
|
||
- pan/mdg: refactor mir_pack_swizzle
|
||
- pan/mdg: add proper expand_mode enum
|
||
- pan/mdg: encode/decode expand_mode properly
|
||
- pan/mdg: add midgard_src_expand_mode validation
|
||
- pan/mdg: improve input modifier printing
|
||
- pan/mdg: improve swizzle decoding
|
||
- pan/mdg: fix/change ALU opcodes descriptions and add some missing ops
|
||
- pan/mdg: stop querying datatype by reading opcode name
|
||
- pan/mdg: print input data type for ALU opcodes
|
||
- pan/mdg: stop using size disambiguation suffixes
|
||
- pan/mdg: fix midgard.h indentation
|
||
- pan/mdg: improve mask decoding
|
||
- pan/mdg: remove register prefixes
|
||
- pan/mdg: print special alu arg outmods
|
||
- pan/mdg: misc cleanups
|
||
- pan/mdg: add helpers for load/store special read regs
|
||
- pan/mdg: improve ldst opcode names and add missing ops
|
||
- pan/mdg: print names of non-work registers
|
||
- pan/mdg: properly encode/decode ldst instructions
|
||
- pan/mdg: improve tex opcode decoding and add missing ops
|
||
- panfrost/ci: Improve coverage for T860
|
||
- virgl: implement EXT_multisampled_render_to_texture
|
||
- panfrost: fix GL_EXT_multisampled_render_to_texture regression
|
||
|
||
Iván Briano (2):
|
||
|
||
- intel/nir: Fix txs for null surfaces
|
||
- anv: fix feature/property/sizes reported for fragment shading rate
|
||
|
||
James Jones (18):
|
||
|
||
- gbm: Remove stat and refcount fields from gbm_device
|
||
- gbm: Inline load_backend function content
|
||
- gbm: Create device directly in find_backend
|
||
- gbm: Consolidate env var and default backend loops
|
||
- gbm: Give getenv backend override its own function
|
||
- gbm: Give gbm_device a reference to its backend
|
||
- gbm: Add gbm_core struct to export code to backends
|
||
- gbm: Move majority of gbmint.h to gbm_backend_abi.h
|
||
- gbm: Version the GBM backend interface
|
||
- gbm: Add backend ABI-check test
|
||
- gbm: Rename backend description list to builtin_backends
|
||
- loader: Factor out driver library loading code
|
||
- meson: Add a GBM backends search path build option
|
||
- gbm: Rename the DRI backend from gbm_dri.so to dri
|
||
- gbm: Put common device creation in a helper function
|
||
- gbm: Support dynamically loading named backends
|
||
- gbm: Load backend based on DRM device driver name
|
||
- loader: Handle failure to load DRI driver library
|
||
|
||
James Park (14):
|
||
|
||
- meson: Fix winflexbison warnings
|
||
- ac/surface: Move drm_fourcc.h to common header
|
||
- radv: Use ac_drm_fourcc.h
|
||
- meson: Add wrap for libelf on Windows
|
||
- meson: Disable libdrm for RADV on Windows
|
||
- meson: Disable MSVC warning 5105
|
||
- amd: Fix warnings around variable sizes
|
||
- radv: Add _WIN32 guard in radv_check_gpu_hangs
|
||
- radv: Fix unused label warning on Windows
|
||
- radv: Add <io.h> on WIndows for missing close()
|
||
- draw/clip: Use NAN to make MSVC happy
|
||
- llvmpipe: Remove stray ## operator for MSVC
|
||
- ci: Update Windows image to build RADV
|
||
- vulkan: Support 32-bit "weak" symbols on MSVC
|
||
|
||
Jan Beich (1):
|
||
|
||
- anv: adjust headers for non-GNU after e9e1e0362b6c
|
||
|
||
Faith Ekstrand (139):
|
||
|
||
- intel/compiler: Don't insert barriers for NULL sources
|
||
- anv: Use the same re-order mode for streamout as for GS
|
||
- vulkan: Update the XML and headers to 1.2.177
|
||
- anv: Implement VK_EXT_provoking_vertex
|
||
- gallium: Add a transcode_astc driconf option
|
||
- intel/isl: There are seven aux states
|
||
- intel/isl: Fix isl_color_value_unpack to match the prototype
|
||
- intel/eu: SVB writes only happen on Gen6
|
||
- intel/fs: Stop using brw_dp_read/write_desc in Gen7+ only code
|
||
- intel/eu: Set message subtype properly for SIMD8 FB fetch
|
||
- intel/fs: Don't use pixel_z for Gen4-5 source_depth_to_render_target
|
||
- intel/nir: Set lower txs with non-zero LOD
|
||
- nir/builder: Move clamp helpers to nir_builder.h
|
||
- anv: Check offset instead of alloc_size for freeing surface states
|
||
- anv: Allow storage on all formats that support typed writes
|
||
- anv: Plumb the shader into push constant helpers
|
||
- anv: Support pushing shader constants
|
||
- anv: Push at most 32 regs for vec4 shaders
|
||
- intel/vec4: Don't spill fp64 registers more than once
|
||
- intel/vec4: Add some asserts to move_push_to_pull
|
||
- intel/vec4: Update nr_params in pack_uniform_registers
|
||
- intel/vec4: Set up push ranges before we emit any code
|
||
- intel/vec4: Add support for masking pushed data
|
||
- intel/vec4: Add support for UBO pushing
|
||
- nir: Add a nir_instr_move helper
|
||
- nir/gather_info: Expose a nir_intrinsic_writes_external_memory helper
|
||
- nir: Add a discard optimization pass
|
||
- intel/fs: Handle non-perspective-correct interpolation on gen4-5
|
||
- intel/nir,i965: Move HW generation check for UBO pushing to i965
|
||
- intel/vec4: Also use MOV_FOR_SCRATCH for swizzle resolves
|
||
- intel/isl: Fix isl_format_is_valid
|
||
- intel/fs/ra: Fix payload node setup for SIMD16 on Gen4-5
|
||
- ttn: Stop manually managing system_values_read
|
||
- anv: Require softpin on Gen8+
|
||
- anv: Make use_softpin compile-time in genX code
|
||
- anv: Handle OOM in the pinned path in anv_reloc_list_add
|
||
- anv: Add a helper to add a BO to the batch list without a reloc
|
||
- anv: Make anv_batch_emit_reloc inline and optimize SKL+
|
||
- anv: Fast-path surface relocs when we have softpin
|
||
- anv: Optimize anv_address_physical when ANV_ALWAYS_SOFTPIN
|
||
- anv/blorp: Optimize addresses/relocations when ANV_ALWAYS_SOFTPIN
|
||
- iris: Use isl_surf_get_image_surf instead of hand-rolling it
|
||
- iris: Move target_to_isl_surf_dim to iris_resource.c
|
||
- intel/isl: Add a isl_surf_get_image_offset_B_tile_el helper
|
||
- intel/blorp: Use isl_surf_get_image_offset_B_tile_el in ccs_ambiguate
|
||
- intel/isl: Make the offset helpers four dimensional
|
||
- intel/isl: Make tile logical extents four dimensional
|
||
- intel/isl: Use a 4D physical total extent for size calculations
|
||
- i965: Use nir_lower_passthrough_edgeflags
|
||
- anv: Agressively no-op Flush/InvalidateMappedMemoryRanges
|
||
- docs: Begin documenting ISL
|
||
- isl: Document more members of isl_surf
|
||
- docs/isl: Document ISL's units
|
||
- docs/isl: Add detailed documentation about isl formats
|
||
- docs/isl: Add detailed documentation about tiling on Intel GPUs
|
||
- docs/isl: Add detailed documentation about CCS compression
|
||
- util: Move the 4x4 matrix inverse function to u_math
|
||
- crocus: Drop extra_aux support
|
||
- nir,amd: Suffix nir_op_cube_face_coord/index with _amd
|
||
- nir,panfrost: Suffix fsat_signed and fclamp_pos with _mali
|
||
- nir,vc4: Suffix a bunch of unorm 4x8 opcodes _vc4
|
||
- vulkan: Update the XML and headers to 1.2.182
|
||
- nir: Require vectorized ALU ops to be all-or-nothing
|
||
- nir,docs: Add docs for NIR ALU instructions
|
||
- nir: Document all the ALU opcodes
|
||
- docs,isl: Document Sandy Bridge HiZ/stencil
|
||
- editorconfig: Use 3-space tabs for .rst
|
||
- docs/nir: Use 3-space tabs
|
||
- docs/isl: Consistently use 3-space tabs
|
||
- spirv: Create acceleration structure and shader record variables
|
||
- anv: Add minimal boilerplate for VK_KHR_acceleration_structure
|
||
- anv: Add stub support for acceleration structures
|
||
- anv: Add support for binding acceleration structures
|
||
- anv: Add minimal boilerplate for VK_KHR_ray_tracing_pipeline
|
||
- anv: Get ready for more pipeline stages
|
||
- anv: Add a ray-tracing pipeline object
|
||
- anv: Add support for binding ray-tracing pipelines
|
||
- anv,iris: Move the SHADER_RELOC enums to brw_compiler.h
|
||
- intel/compiler: Generalize shader relocations a bit
|
||
- intel/compiler: Add a U32 reloc type
|
||
- intel/fs: Add support for compiling bindless shaders with resume shaders
|
||
- intel/rt: Use reloc constants for the resume SBT
|
||
- anv: Disallow UBO pushing for bindless shaders
|
||
- nir/apply_pipeline_layout: Handle bindless shaders
|
||
- anv: Support fetching descriptor addresses from push constants
|
||
- anv: Compile ray-tracing shaders
|
||
- anv: Compile trivial return and trampoline shaders
|
||
- intel/fs: Don't pull CS push constants if uses_inline_data
|
||
- anv: Create and return ray-tracing pipeline SBT handles
|
||
- anv: Compute scratch sizes for ray-tracing pipelines and shader groups
|
||
- anv: Add support for vkCmdSetRayTracingPipelineStackSizeKHR
|
||
- anv: Allow _anv_combine_address with a NULL batch
|
||
- anv: Make anv_address::offset 64-bit
|
||
- anv: Implement vkCmdTraceRays and vkCmdTraceRaysIndirect
|
||
- isl: Assert some iris invariants in isl_surf_get_ccs_surf
|
||
- isl: Take a hiz_or_mcs_surf in isl_surf_supports_ccs
|
||
- isl,iris: Move the extra_aux_surf logic into iris
|
||
- isl,docs: Add a chapter on AUX state tracking
|
||
- docs/isl: Improve the bit[6] swizzling section of the tiling chapter
|
||
- include/drm-uapi: bump headers
|
||
- anv: Claim to be a discrete GPU if has_lmem
|
||
- util: Add an implementation of qsort_r for non-GNU platforms
|
||
- nir: Add a function for sorting variables
|
||
- intel/genxml: Add SURFTYPE_SCRATCH on GFX version 12.5
|
||
- intel/isl: Add support for scratch buffers
|
||
- intel/fs: Implement spilling on XeHP
|
||
- intel/fs: Implement load/store_scratch on XeHP
|
||
- intel/genxml: Add new ScratchSpaceBuffer fields on GFX version 12.5
|
||
- iris: Add a MEMZONE_BINDLESS and uploader
|
||
- iris: Add support for scratch on XeHP
|
||
- anv: Add support for scratch on XeHP
|
||
- intel/genxml: Remove old scratch fields on GFX version 12.5
|
||
- iris/bufmgr: Stop changing mapping modes on buffers
|
||
- intel/devinfo: Add a has_lsc bit
|
||
- intel/compiler: Add LSC to messages brw_ir_performance
|
||
- intel/fs: Lower uniform pull constant load message to LSC dataport
|
||
- docs/isl/tiling: Fix swizzle pattern for X-tiling
|
||
- intel/isl: Pull the uncompressed surface view code from anv
|
||
- intel/blorp: Adjust the compressed copy rectangle before convert_to_single_slice
|
||
- intel/blorp: Use isl_surf_get_uncompressed_surf
|
||
- intel/isl: Add more cases to isl_surf_get_uncompressed_surf
|
||
- iris: Don't leak the surface if uncompressed re-interp fails
|
||
- iris: Use isl_surf_get_uncompressed_surf
|
||
- nir: Drop nir_ssa_def::name and nir_register::name
|
||
- android: Drop the Android.mk build system
|
||
- android: Restore android/Android.mk
|
||
- nir/lower_subgroups: Pad ballot values before bitcasting
|
||
- docs: Add docs for running a local Mesa build
|
||
- mailmap: Update for Emma's new e-mail address
|
||
- Convert a few files to UTF-8
|
||
- mailmap: Add two more lines for Alyssa Rosenzweig
|
||
- glsl: Delete lower_texture_projection
|
||
- anv/allocator: Use list->u64 in free_list_push
|
||
- iris: Re-emit MEDIA_VFE_STATE for variable group size shaders
|
||
- anv: Handle errors properly in anv_i915_query
|
||
- intel: Pull anv_i915_query into common code
|
||
- anv: Use intel_i915_query_alloc for memory regions
|
||
- iris: Use intel_i915_query for meminfo
|
||
- nir/lower_tex: Rework invalid implicit LOD lowering
|
||
|
||
Jeremy Huddleston (2):
|
||
|
||
- libgl-xlib: Set darwin-versions
|
||
- libgl-xlib: Add missing dep_x11 dependency
|
||
|
||
Jeremy Newton (1):
|
||
|
||
- Update libva requirement
|
||
|
||
Jesse Natalie (44):
|
||
|
||
- microsoft/spirv_to_dxil: Lower samplers from deref to index
|
||
- microsoft/spirv_to_dxil: Lower loads/stores to DXIL
|
||
- microsoft/compiler: Support raw SRVs/UAVs through dxil_module_get_res_type
|
||
- microsoft/compiler: Support arrays of UBOs
|
||
- microsoft/compiler: Emit CBVs via variables for Vulkan
|
||
- microsoft/compiler: Emit SSBO variables
|
||
- microsoft/compiler: Split Vulkan resource_index / descriptor processing
|
||
- microsoft/compiler: Better support UBO/SSBO references to descriptors
|
||
- microsoft/compiler: Store nir_shader in the ntd_context
|
||
- microsoft/compiler: Support raw SRVs in addition to typed SRVs
|
||
- microsoft/compiler: Propagate access when lowering SSBO loads
|
||
- microsoft/clc: If local size isn't specified either in the shader or at runtime, set it to (1,1,1)
|
||
- gallium: Define PIPE_ARCH_AARCH64 for MSVC arm64 builds
|
||
- nir: Fix MSVC warning C4334 (32bit shift cast to 64bit)
|
||
- d3d12: Fix MSVC warning C4334 (32bit shift cast to 64bit)
|
||
- microsoft/clc: Fix MSVC unreferenced variable warnings
|
||
- microsoft/clc: Fix undeclared function warning
|
||
- microsoft/compiler: Fix MSVC warning C4334 (32bit shift cast to 64bit)
|
||
- shader_enums: Fix MSVC warning C4334 (32bit shift cast to 64bit)
|
||
- gallium/aux: Fix MSVC warning C4334 (32bit shift cast to 64bit)
|
||
- llvmpipe: Fix MSVC warning C4334 (32bit shift cast to 64bit)
|
||
- xmlconfig: Fix MSVC warning C4334 (32bit shift cast to 64bit)
|
||
- CI: Windows: Bump warning level to W3 (except for zlib)
|
||
- microsoft/compiler: Remove hardcoded limits on numbers of resource arrays
|
||
- microsoft/compiler: Remove assert-only resource size or usage tracking
|
||
- microsoft/compiler: For Vulkan environment, don't create resource handles upfront
|
||
- vtn: Propagate access data that's present on all struct members to the struct itself
|
||
- vtn: Propagate access data from UBO/SSBO/push constant types to variables of that type, not just their pointers
|
||
- nir: Rename nir_lower_cl_images_to_tex, replace 'cl' with 'readonly'
|
||
- nir_lower_readonly_images_to_tex: Support non-CL semantics
|
||
- nir_lower_readonly_images_to_tex: Use nir_shader_lower_instructions
|
||
- microsoft/compiler: Treat read-only SSBOs as SRVs
|
||
- microsoft/spirv_to_dxil: Treat read-only storage images as SRVs
|
||
- d3d12, microsoft/compiler: Use SRV/sampler variable binding data
|
||
- microsoft/compiler: Rewrite sampler splitting pass to be smarter and handle derefs
|
||
- microsoft/compiler: Fix function signature for bufferStore to support overloads
|
||
- microsoft/compiler: Map descriptor set -> binding space
|
||
- microsoft/compiler: Handle unbounded arrays
|
||
- llvmpipe: Fix optimization loop to actually loop
|
||
- nir: Add relaxed 24bit opcodes
|
||
- vtn: Use relaxed 24bit opcodes for CL 24bit math
|
||
- microsoft/compiler: Change behavior for emitting inexpressible barriers
|
||
- nir_lower_readonly_images: Clear variable data when changing the type
|
||
- mesa/main: Check for fbo attachments when importing EGL images to textures
|
||
|
||
John Bates (1):
|
||
|
||
- add execmem build option
|
||
|
||
Jonathan Marek (4):
|
||
|
||
- freedreno/registers: define REG_DSI_CPHY_MODE_CTRL
|
||
- tu: remove workaround for conditional rendering + hw binning
|
||
- freedreno/a6xx: larger gmem_page_align from tile align instead of gpu id
|
||
- freedreno/common: unhardcode CCU color cache offset
|
||
|
||
Jordan Justen (20):
|
||
|
||
- Revert "intel/compiler: Silence unused parameter warning in update_inst_scoreboard"
|
||
- intel/eu: Allow 64-bit registers on XeHP.
|
||
- intel/fs: Disable 3-src immediates on XeHP.
|
||
- intel/fs: End computer shader with message gateway on XeHP.
|
||
- intel/compiler: Lower txd for 3D samplers on XeHP.
|
||
- intel/compiler: Fix INTEL_DEBUG=hex
|
||
- commit_in_branch_test.py: Rename branch master to main
|
||
- bin/pick: Rename master branch to main
|
||
- .gitlab-ci.yml: Use main branch for gitlab ci
|
||
- issue_templates/Bug Report: Rename master branch to main
|
||
- docs/releasing.rst: Rename master branch to main
|
||
- docs: Rename master branch to main
|
||
- mesa: NOTE! Default branch is now main
|
||
- intel/isl: Add Wa_22011186057 to disable CCS on ADL GT2 A0
|
||
- intel/dev: Add device info for ADL GT2
|
||
- intel: Add 2 ADL-S pci-ids
|
||
- intel/gen125.xml: Drop GPGPU_WALKER
|
||
- intel/devinfo: Add has_local_mem
|
||
- iris/bufmgr: Align vma addresses to 64K for local memory
|
||
- intel/dev: Set has_local_mem for DG1
|
||
|
||
Jose Maria Casanova Crespo (5):
|
||
|
||
- v3d: YUV formats at is_dmabuf_modifier_supported are external_only
|
||
- v3d: YUV formats at query_dmabuf_modifiers are external_only
|
||
- v3d: DRM_FORMAT_MOD_BROADCOM_SAND128 only available for NV12 format.
|
||
- ci/v3d: Update piglit expectations.
|
||
- v3d/driconf: Expose non-MSAA texture limits for mutter and gnome-shell
|
||
|
||
Joshua Ashton (5):
|
||
|
||
- radv: Handle unnormalized samplers in YCbCr lowering
|
||
- venus: Fix zero-initialized fd causing apps to hang/crash
|
||
- driconf: Add more workarounds for Teardown
|
||
- llvmpipe: Handle NULL views in llvmpipe_cleanup_stage_sampling
|
||
- lavapipe: Use common Vulkan format helpers
|
||
|
||
Joshua Watt (1):
|
||
|
||
- v3d, vc4: Fix dmabuf import for non-scanout buffers
|
||
|
||
José Fonseca (8):
|
||
|
||
- lavapipe: Fix lvp_execute_cmds' pipe_stream_output_target leak.
|
||
- lavapipe: Fix lvp_pipeline_compile's nir_xfb_info leak.
|
||
- wgl: Remove opengl32.mingw.def.
|
||
- draw: Allocate extra padding for extra shader outputs.
|
||
- draw: Plug leak when combining tessellation with primitive assembly.
|
||
- d3d10umd,d3d10sw: Initial import.
|
||
- d3d10sw: Add a sanity test.
|
||
- d3d10umd: Avoid duplication in CreateDevice.
|
||
|
||
Juan A. Suarez Romero (42):
|
||
|
||
- v3dv: avoid dereferencing null value
|
||
- ci: support KHR-GLES testing
|
||
- ci/v3d: add KHR-GLES test jobs
|
||
- ci/llvmpipe: run KHR-GLES2.* tests
|
||
- ci/softpipe: run KHR-GLESxx tests
|
||
- iris: hook up memory object creation from handle
|
||
- iris: hook up resource creation from memory object
|
||
- iris: enable GL_EXT_memory_object feature
|
||
- ci/broadcom: update expected results
|
||
- ci/vc4: add KHR-GLES2.* job test
|
||
- ci/broadcom: add EGL testing jobs
|
||
- v3dv: check returned values
|
||
- ci/v3d: execute all piglit tests
|
||
- v3dv/pipeline_cache: bail out in case of error
|
||
- ci/v3d: fix typo in job name
|
||
- ci/v3dv: update flakes
|
||
- ci/baremetal: propagate ASAN_OPTIONS to devices
|
||
- ci/broadcom: update expected results
|
||
- v3d: rename header include guards
|
||
- v3d: rename VC5 enums and definitions
|
||
- broadcom/qpu: rename from VC5 to V3D
|
||
- broadcom/simulator: change references to VC5
|
||
- v3dv: rename VC5 to V3D
|
||
- v3dv: check dest bitsize in color blit
|
||
- util/hash_table: do not leak u64 struct key
|
||
- ci/broadcom: update expected results
|
||
- v3d: fix resource leak in error path
|
||
- st/mesa: fix pipe resource leak
|
||
- broadcom/compiler: fix dynamic-stack-buffer-overflow error
|
||
- ci: Update VK-GL-CTS to 1.2.6.1
|
||
- ci/broadcom: update expected results
|
||
- vc4: initialize array
|
||
- ci/v3dv: update expected results
|
||
- ci/broadcom: unset manual jobs
|
||
- ci/v3dv: test v3dv in arm64 environment
|
||
- broadcom/ci: Report flakes on IRC
|
||
- ci/vc4: update piglit failures
|
||
- ci: update VK-GL-CTS to 1.2.6.2
|
||
- broadcom/compiler: emit TMU flush before a jump
|
||
- v3dv: assert job->cmd_buffer is valid
|
||
- broadcom: remove v3dv3 from neon library
|
||
- gallium/hud: initialize query
|
||
|
||
Kai-Heng Feng (1):
|
||
|
||
- iris: Avoid abort() if kernel can't allocate memory
|
||
|
||
Karol Herbst (18):
|
||
|
||
- clover/llvm: handle Fixed vs Scalable vectors explicitly starting with llvm-11
|
||
- util/format: fix value declarations for big endian
|
||
- nv50/query: fix stringop-overflow gcc warning
|
||
- nvc0: fix implicit-fallthrough gcc warning
|
||
- clover/memory: fix data race in buffer subclasses
|
||
- nouveau: fix race in nouveau_screen_get_name
|
||
- nouveau/mm: pass mm_bucket to mm_slab_new
|
||
- nouveau/mm: remove unused nouveau_mm_allocation.next field
|
||
- nv50/ir: when constant folding shl(mul, a) we need to copy muls type
|
||
- nv50/ir: don't optimize shl(mul_hi, a) to mul_hi
|
||
- nv50/ir/ra: fixes upcoming barrier file
|
||
- nv50/ir: add barrier and thread_state files
|
||
- gv100/ir: add support for barrier thread state files for OP_CVT
|
||
- gm107/ir: emit barrier sources for quadon/pop
|
||
- gv100/ir: fix quadop/pop lowering
|
||
- nv50/ir: fix surface lowering when values get shared accross operations
|
||
- nv50/ir/nir: fix smem size for GL
|
||
- nv30: fix emulated vertex index buffers
|
||
|
||
Keith Packard (1):
|
||
|
||
- vulkan/x11: Mark present complete using serial instead of MSC
|
||
|
||
Kenneth Graunke (29):
|
||
|
||
- iris: only flush the render cache for aux changes, not format changes
|
||
- isl: Work around NVIDIA and AMD display pitch requirements
|
||
- i965: Don't advertise Y-tiled modifiers for scanout buffers on Gfx8-
|
||
- iris: Don't advertise Y-tiled modifiers for scanout buffers on Gfx8
|
||
- iris: Replace no_gpu flag with PIPE_MAP_DIRECTLY
|
||
- iris: Promote to MAP_DIRECTLY when required before NULL return
|
||
- iris: Delete a comment suggesting we use tiled staging buffers
|
||
- iris: Make an iris_bo_is_external() helper and use it in a few places
|
||
- iris: Track imported vs. exported status separately
|
||
- iris: Use staging blits for reads from uncached buffers.
|
||
- iris: Use staging blits for transfers involving imported BOs
|
||
- iris: Assert on mapping a tiled buffer without MAP_RAW
|
||
- iris: Drop fallback GEM_MMAP_GTT if GEM_MMAP with I915_MMAP_WC fails
|
||
- iris: Delete GTT mapping support
|
||
- iris: Pick a single mmap mode (WB/WC) at BO allocation time
|
||
- iris: Use bo->mmap_mode in transfer map read check
|
||
- iris: Add a flags argument to iris_bo_alloc()
|
||
- iris: Add an alignment parameter to iris_bo_alloc()
|
||
- iris: Only use SET/GET_TILING when exporting/importing BOs
|
||
- iris: Add a BO_ALLOC_SMEM flag for allocating from system memory
|
||
- anv: Fix dynamic primitive topology for tess on Gfx7.x too
|
||
- iris: Stop calling I915_GEM_SET_CACHING on discrete GPUs
|
||
- iris: Fail BO allocation if we can't enable snooping properly.
|
||
- iris: Delete unused bo->cache_coherent flag
|
||
- vulkan/wsi: Fix prime blits to use system memory for the destination
|
||
- iris: Reduce SSBO alignment requirements from 64B to 4B
|
||
- crocus: Reduce SSBO alignment requirements from 64B to 4B.
|
||
- iris: Force device local memory for u_upload_mgr buffers
|
||
- iris: Use simple_mtx in the bufmgr.
|
||
|
||
Leo Liu (9):
|
||
|
||
- frontends/va: add VASurfaceAttribUsageHint attribute
|
||
- frontends/va: fix multi planes for external memeory type
|
||
- frontends/va: use pipe buffer map instead of texture map
|
||
- radeon/vcn/enc: use surface swizzle mode instead of linear
|
||
- radeonsi: add PIPE_FORMAT_P010 for HEVC Main10 profile to encode param
|
||
- radeonsi: separate video hw info based on HW engine individually
|
||
- frontends/va: use the correct entrypoint to get config attributes
|
||
- frontends/va: include the profile queries for encoder as well
|
||
- frontends/va: use the entrypoint from context instead of the hard-coded one
|
||
|
||
Lepton Wu (3):
|
||
|
||
- virgl: move new added field to the end.
|
||
- Revert "virgl: Cache depth and stencil buffers"
|
||
- gallium: Reset {d,r}Priv in dri_unbind_context
|
||
|
||
Lionel Landwerlin (59):
|
||
|
||
- anv: fix 3DSTATE_MULTISAMPLE emission on gen8+
|
||
- anv: disable baked in pipeline bits from dynamic emission path
|
||
- vulkan/util: cast enums to int64_t in switch
|
||
- spirv: fix uToAccelerationStructure handling
|
||
- spirv: fixup pointer_to/from_ssa with acceleration structures
|
||
- vulkan: bump headers/registry to version 1.2.175
|
||
- anv: drop extension check for dynamic state
|
||
- anv: prepare pipeline for delayed emission of color writes
|
||
- anv: implement VK_EXT_color_write_enable
|
||
- anv: reuse define for number of render target assert
|
||
- vulkan/wsi/display: don't report support if there is no drm fd
|
||
- i965/bufmgr: fix invalid assertion
|
||
- intel/dev: printout correct subslice/dualsubslice name
|
||
- intel/genxml: Add coarse pixel shading instructions
|
||
- intel/decoder: decode CPS_STATE
|
||
- intel/compiler: make sure we keep the lowest dispatch limit
|
||
- intel/compiler: rework message descriptors for render targets
|
||
- intel/compiler: use existing helpers to pull bits of descriptors
|
||
- intel/compiler: handle coarse pixel in render target writes descriptors
|
||
- intel/compiler: add support for fragment shading rate variable
|
||
- intel/compiler: add support for fragment coordinate with coarse pixels
|
||
- intel/compiler: add coarse pixel offset on Gfx12.5+
|
||
- intel/compiler: add restrictions related to coarse pixel shading
|
||
- anv: implement VK_KHR_fragment_shading_rate
|
||
- isl: document format fields
|
||
- intel/fs: use the final destination type for regioning restrictions
|
||
- intel/mi_builder: fix resolve call
|
||
- anv: fix perf query pass with command buffer batching
|
||
- anv: handle spirv parsing failure
|
||
- iris: fix assert to reflect correct limit for encoded size
|
||
- intel/perf: allow opening perf stream with no context filtering
|
||
- intel/perf: allow metric sets to be loaded with on OA reports
|
||
- anv: fixup physical device properties of fragment shading rate
|
||
- intel/fs: make sure shuffle is lowered to supported types
|
||
- intel/perf: update gen9/11 TestOa configs
|
||
- intel/perf: update Gen11 RenderBasic programming
|
||
- intel/perf: update Gen11 RenderBasic programming
|
||
- intel/perf: add EHL availability condition to HDCAndSF counters
|
||
- intel/perf: update Gen9/11 programming for AsyncCompute
|
||
- intel/perf: rename metric descriptions
|
||
- anv: implement VK_EXT_physical_device_drm
|
||
- blorp: add blorp string in shader keys
|
||
- anv: cache raytracing trampoline shader
|
||
- anv: store more RT shader data in pipeline_stage object
|
||
- anv: move trivial return shader to device
|
||
- anv: implement caching for ray tracing pipelines
|
||
- intel/rt: switch to common pass for shader calls lowering
|
||
- nir: drop the btd_resume_intel intrinsic
|
||
- nir: use a more fitting index for btd_stack_push_intel
|
||
- anv: bound checks buffer memory binding in debug builds
|
||
- anv: allocate bigger batches as we grow command buffers
|
||
- intel/perf: use the right popcount for 64bits
|
||
- intel/compiler: Track latency/perf of LSC fences
|
||
- isl: fix mapping of format->stringname
|
||
- loader/dri3: create linear buffer with scanout support
|
||
- nir/lower_shader_calls: adding missing stack offset alignment
|
||
- anv: fix submission batching with perf queries
|
||
- drm-shim: implement stat/fstat when xstat variants are not there
|
||
- intel/disasm: fix missing oword index decoding
|
||
|
||
Lucas Stach (9):
|
||
|
||
- etnaviv: fix vertex sampler setup
|
||
- dri: add loader_dri_create_image helper
|
||
- loader/dri3: convert to loader_dri_create_image
|
||
- loader/dri: hook up createImageWithModifiers2
|
||
- gallium/dri: copy image use in dup_image
|
||
- dri: don't call modifier interfaces when modifiers_count is 0
|
||
- frontend/dri: add EXPLICIT_FLUSH hint in dri2_resource_get_param
|
||
- etnaviv: remove double assigment of surface->texture
|
||
- etnaviv: flush used render buffers on context flush when neccessary
|
||
|
||
Luis Felipe Strano Moraes (2):
|
||
|
||
- meson: print information about layers being built as part of summary
|
||
- overlay_layer: add missing undef
|
||
|
||
Maksim Sisov (2):
|
||
|
||
- iris: export GEM handle with RDWR access rights
|
||
- i965: export GEM handle with RDWR access rights
|
||
|
||
Marcin Ślusarz (28):
|
||
|
||
- intel/tools: remove unused macros
|
||
- intel/batch_decoder: set foreground color of decoded instructions
|
||
- i965: fully populate perf_config before using it to initialize perf_context
|
||
- iris: fully populate perf_config before using it to initialize perf_context
|
||
- intel/perf: move calculation of period_exponent to perf ctx init
|
||
- gallium/u_threaded: implement INTEL_performance_query hooks
|
||
- gallium/u_threaded: offload begin/end_intel_perf_query
|
||
- nir: handle float atomics in nir_gather_info
|
||
- nir: handle float atomics in nir_lower_memory_model
|
||
- intel: simplify is_haswell checks, part 1
|
||
- intel: simplify is_haswell checks, part 2
|
||
- i965: simplify gfx version checks
|
||
- intel/isl: replace format_gen by verx10
|
||
- intel/disasm: decode/describe more send messages
|
||
- intel/disasm: remove useless space after "("
|
||
- iris: fix error message on I915_GEM_[GS]ET_TILING failure
|
||
- intel/decoder: add assert for register size
|
||
- anv: fix potential integer overflows
|
||
- intel/tools: fix left shift overflow on 32-bit
|
||
- intel/tools: fix int-to-pointer/pointer-to-int cast warnings on 32-bit
|
||
- intel/tools: fix invalid type in argument to printf format specifier
|
||
- intel/tools: fix potential memory leaks
|
||
- intel/blorp: initialize BLEND_STATE using braced initializer list
|
||
- intel/fs: use stack for temporary array
|
||
- anv: keep descriptor set's address directly in anv_descriptor_set
|
||
- anv: handle push descriptor sets when they are sent with push constants
|
||
- anv: drop unused argument of anv_descriptor_set_address
|
||
- intel/compiler: document register types
|
||
|
||
Marek Olšák (190):
|
||
|
||
- ci: don't build clover with LLVM 9 on radeonsi because it's unsupported
|
||
- amd: drop support for LLVM 9
|
||
- amd: drop support for LLVM 10
|
||
- amd: remove some references to older LLVM versions in comments
|
||
- amd/registers: fix the kernel header parser with latest headers
|
||
- amd/registers: clean up gfx103.json
|
||
- amd/registers: rename IMG_FORMAT to GFX10_FORMAT to disambiguate the meaning
|
||
- radeonsi: don't decompress DCC for float formats in si_compute_copy_image
|
||
- radeonsi: fix incorrect comments in culling code and NIR lowering
|
||
- radeonsi: fix automatic DCC retiling after DCC clear and DCC decompression
|
||
- radeonsi: fix automatic DCC retiling after compute image stores
|
||
- gallium/util: add easy profiling helpers using TIME_ELAPSED queries
|
||
- Revert "st/pbo: use cso_set_vertex_buffers_and_elements() for st_pbo_draw"
|
||
- Revert "ci/radeonsi: Add expected failures due to #4674 having slipped in"
|
||
- ac/surface: document more meta equation dependencies
|
||
- radeonsi: make the gfx9 DCC MSAA clear shader depend on the number of samples
|
||
- radeonsi: remove the separate DCC optimization for Stoney
|
||
- amd: addrlib update for April
|
||
- gallium: renumber PIPE_MAP_* enums to remove holes
|
||
- gallium: remove 4 bytes from pipe_transfer
|
||
- gallium+(u_threaded,r300,r600,radeonsi): move transfer offset into pipe_transfer
|
||
- util: print CPU caps in release builds too
|
||
- util: fix (re-enable) L3 cache pinning
|
||
- Revert "gallium/u_threaded: align batches and call slots to 16 bytes"
|
||
- gallium/u_threaded: move base_valid_buffer_range to transfer where it belongs
|
||
- gallium/u_threaded: handle sampler views == NULL better
|
||
- gallium/u_threaded: rewrite slot layout to reduce wasted space
|
||
- gallium/u_threaded: don't set resource pointers to NULL after driver calls
|
||
- gallium/u_threaded: fix 32-bit breakage due to incorrect pointer arithmetic
|
||
- gallium/u_threaded: pass last into and return call size from execute callbacks
|
||
- gallium/u_threaded: merge draws in tc_call_draw_single
|
||
- gallium/u_threaded: add callbacks and documentation for resource busy checking
|
||
- gallium/u_threaded: track whether TCS, TES, or GS have ever been used
|
||
- gallium/u_threaded: query shader resource limits
|
||
- gallium/u_threaded: add buffer lists - tracking of buffers referenced by tc
|
||
- gallium/u_threaded: add driver-internal flush tracking for buffer lists
|
||
- gallium/u_threaded: don't invalidate idle buffers
|
||
- gallium/u_threaded: always map idle buffers unsynchronized
|
||
- gallium/u_threaded: don't reference resource in pipe_transfer
|
||
- util: add thread-safe version of idalloc
|
||
- zink: don't set u_resource_vtbl
|
||
- gallium: remove unused u_default_transfer_unmap
|
||
- gallium: remove resource_get_handle from u_resource_vtbl
|
||
- gallium: remove transfer_flush_region from u_resource_vtbl
|
||
- radeonsi: stop using u_resource_vtbl::resource_destroy
|
||
- r300: stop using u_resource_vtbl::resource_destroy
|
||
- nouveau: stop using u_resource_vtbl::resource_destroy
|
||
- i915g: stop using u_resource_vtbl::resource_destroy
|
||
- virgl: stop using u_resource_vtbl::resource_destroy
|
||
- svga: stop using u_resource_vtbl::resource_destroy
|
||
- r600: stop using u_resource_vtbl::resource_destroy
|
||
- gallium: remove u_resource_vtbl::resource_destroy
|
||
- gallium: split transfer_(un)map into buffer_(un)map and texture_(un)map
|
||
- gallium: remove u_resource_vtbl::transfer_(un)map
|
||
- gallium: remove empty structure u_resource_vtbl
|
||
- gallium: remove structure u_resource
|
||
- radeonsi: simplify the NGG culling vertex count heuristic
|
||
- amd: add Beige Goby support
|
||
- amd/registers: don't generate 32-bit register fields
|
||
- amd/registers: regenerate json files without 32-bit register fields
|
||
- amd: fix incorrect addrlib comment for HTILE equations
|
||
- ac/gpu_info: set has_zero_index_buffer_bug for Navi12 too
|
||
- ac/llvm: set target features per function instead of per target machine
|
||
- ac/llvm: expose set_range_metadata to more users
|
||
- ac/llvm: allow ac_build_optimization_barrier with SGPRs, pointers, and metadata
|
||
- ac/llvm: set range metadata on mbcnt and deduplicate get_thread_id
|
||
- ac/llvm: don't draw the primitive for the dummy export workaround for Navi1x
|
||
- winsys/amdgpu: don't hold a mutex while accessing is_shared
|
||
- radeonsi: remove unused SI_IMAGE_ACCESS_AS_BUFFER
|
||
- radeonsi: handle PIPE_CAP_MAX_VERTEX_BUFFERS
|
||
- radeonsi: add a gfx10 bug workaround for NOT_EOP
|
||
- radeonsi: fix a coherency issue when VS memory stores are not visible in PS
|
||
- radeonsi: always use the L2 LRU cache policy for faster clears and copies
|
||
- radeonsi: don't disable L2 caching for staging textures
|
||
- radeonsi: don't use GS fast launch with small instances
|
||
- radeonsi: fix the fast launch vert/prim thread counts if they are trimmed
|
||
- radeonsi: remove a twice duplicated workaround for VERT_GRP_SIZE
|
||
- radeonsi: re-enable fast launch with indexed tri strips because it doesn't hang
|
||
- radeonsi: improve generated culling code by adding optimization barriers
|
||
- radeonsi: change si_resource::alignment to alignment_log2 for better packing
|
||
- radeonsi: remove 8 bytes from si_resource, turn other 4 bytes into padding
|
||
- radeonsi: add a gfx10 hw bug workaround with the barrier before gs_alloc_req
|
||
- radeonsi: add missing threaded_resource_deinit calls in fail paths
|
||
- radeonsi: rewrite the prefix sum computation for shader culling
|
||
- radeonsi: allow changing the NGG subgroup size to 256 but don't change it yet
|
||
- radeonsi: generate buffer_id_unique for u_threaded_context
|
||
- radeonsi: implement threaded context callbacks for resource busy checking
|
||
- radeonsi: disable DFSM on gfx9 by default because it decreases performance a lot
|
||
- radeonsi: remove DFSM after we discovered how bad it is
|
||
- gallium/u_vbuf: add a fast path to skip refcounting for uploaded user buffers
|
||
- mesa: move _mesa_copy_vertex_attrib/buffer functions to their only use
|
||
- mesa: don't call _mesa_set_draw_vao in glPushClientAttrib
|
||
- mesa: optimize glPush/PopClientAttrib for GL_CLIENT_VERTEX_ARRAY_BIT
|
||
- mesa: optimize unreferencing VBOs in glPopClientAttrib
|
||
- mesa: don't call FLUSH_VERTICES in glPopClientAttrib
|
||
- mesa: don't save/restore VAO NumUpdates and IsDynamic to fix update tracking
|
||
- st/mesa: execute glFlush asynchronously if no image has been imported/exported
|
||
- gallium/u_threaded: don't update valid_buffer_range for read-only shader buffers
|
||
- gallium/u_threaded: clear valid buffer range only if it's not bound for write
|
||
- gallium/u_threaded: use tc_drop_resource_reference in call_draw_single_drawid
|
||
- gallium/u_threaded: merge draws faster by merging indexbuf unreferencing
|
||
- radeonsi: check is_buffer once instead of 4 times in si_set_sampler_view_desc
|
||
- radeonsi: use the restrict keyword to set sampler view descriptors faster
|
||
- radeonsi: don't clear register fields in si_set_mutable_tex_desc_fields
|
||
- radeonsi: restructure si_set_sampler_views for faster unbinding trailing slots
|
||
- radeonsi: remove no-op unref in si_set_constant_buffer
|
||
- radeonsi: set desc[3] of all buffer descriptors at context creation
|
||
- radeonsi: move a few functions from si_state_draw.cpp into si_gfx_cs.c
|
||
- radeonsi: compile si_state_draw.cpp for each gfx generation separately
|
||
- radeonsi: remove the chip_class dimension from the draw_vbo array
|
||
- radeonsi: remove -Wstrict-overflow=0 since it doesn't seem to be needed
|
||
- gallium/pb: change alignment to 32 bits
|
||
- shader_enums: change VERT_BIT back to the 32-bit shift
|
||
- glthread: change when glFlush flushes asynchronously
|
||
- st/mesa: fix an incorrect comment in st_context_flush
|
||
- st/mesa: move the st_flush_bitmap_cache call into st_flush
|
||
- mesa: add gallium flush_flags param into ctx->Driver.Flush
|
||
- mesa: move _mesa_notifySwapBuffers into the x11 swrast driver
|
||
- mesa: execute glFlush asynchronously if no image has been imported/exported
|
||
- radeonsi: fix compile failures with SI_PRIM_DISCARD_DEBUG enabled
|
||
- radeonsi: use ac_build_bit_count instead of opencoding it
|
||
- radeonsi: fix incorrect counting of compute_num_verts_rejected
|
||
- radeonsi: fix multi draws for the prim discard CS
|
||
- ac/llvm: add a callback to ac_cull_triangle to generate code in inner-most block
|
||
- radeonsi: move the accepting code into the bbox cull branch in NGG cull code
|
||
- ac/surface/tests: fix RB counts
|
||
- ac/surface: don't set DCC_PIPE_ALIGN modifier bit for gfx10 with 1 RB
|
||
- radeonsi: restructure si_get_vs_vgpr_comp_cnt for readability
|
||
- radeonsi: merge 2 conditional blocks with same condition into 1 in culling code
|
||
- radeonsi: set more precise max_waves in NGG code
|
||
- radeonsi: remove incorrect comment about PA
|
||
- radeonsi: try to keep all VS input loads together for better perf
|
||
- radeonsi: don't compile TES and GS draw_vbo variants for the prim discard CS
|
||
- radeonsi: remove the Z culling option from the primitive discard CS
|
||
- radeonsi: drop gfx7 support from the prim discard CS to simplify code
|
||
- radeonsi: drop support for triangle fans from the prim discard CS
|
||
- radeonsi: skip buffer_atomic_add(ptr, n) when n=0 in the prim discard CS
|
||
- radeonsi: cleanup some primitive discard CS TODOs regarding instancing, etc.
|
||
- ac/llvm: don't set skip-uniform-regions to fix atomic.cmpswap
|
||
- mesa: unreference zombie buffers when creating buffers to lower memory usage
|
||
- radeonsi: document why VBO descriptors in user SGPRs are beneficial
|
||
- radeonsi: if shader culling culls all vertices, cull the primitive exports too
|
||
- radeonsi: remove incorrect comment about hangs in gfx10_ngg_gs_emit_epilogue
|
||
- radeonsi: don't use NGG culling on 1 RB chips
|
||
- ac/gpu_info: adjust the condition for use_late_alloc
|
||
- radeonsi: optimize set_inlinable_constants when they don't change
|
||
- st/mesa: don't track VS sampler views for st_draw_feedback in st_context
|
||
- st/mesa: don't track FS sampler views for bitmap/drawpix in st_context
|
||
- st/mesa: don't memset the sampler view array, don't init trailing slots to NULL
|
||
- st/mesa: sink _mesa_get_samplerobj into st_update_single_texture
|
||
- st/mesa: read Target only once in st_update_single_texture
|
||
- st/mesa: return sview from st_update_single_texture via return value, not param
|
||
- st/mesa: remove the const qualifier for a few st_sampler_view instances
|
||
- st/mesa: sink refcounting from st_get_sampler_views into st_sampler_view.c
|
||
- st/mesa: add a mechanism to bypass atomics when binding sampler views
|
||
- st/mesa: remove the sampler min_lod/max_lod value swap
|
||
- cso: disallow NULL sampler state templates in cso_single_sampler
|
||
- cso: update max_sampler_seen only once in cso_set_samplers
|
||
- cso: don't look up a sampler CSO if the last one is identical
|
||
- mesa: use atomics instead of mutexes for refcounting texture objects
|
||
- mesa: use atomics instead of mutexes for refcounting sampler objects
|
||
- mesa: use atomics instead of mutexes for refcounting renderbuffers
|
||
- mesa: remove mutex locking from a glBindTexture early out path
|
||
- mesa: translate into pipe_sampler_state in GL functions
|
||
- mesa: add LodBias quantization from st/mesa
|
||
- mesa: add IsBorderColorNonZero to skip border color update for st/mesa faster
|
||
- mesa: lower GL_CLAMP in texture and sampler functions instead of st/mesa
|
||
- radeonsi: remove the GDS variants of compute-based primitive discard
|
||
- radeonsi: change how the prim discard CS is enabled and splitting limits
|
||
- radeonsi: fix issues with draw-level splitting for the prim discard CS
|
||
- radeonsi: add optimal multi draws and draw-level splitting for prim discard CS
|
||
- radeonsi: move the accepting code into the bbox cull branch in prim discard CS
|
||
- radeonsi: drop smoothing quality to 4xAA for better performance
|
||
- ac/llvm: don't return a status from ac_cull_triangle because it's unused
|
||
- ac/llvm: rework how negative W affects culling to not call accept_func twice
|
||
- radeonsi: rewrite a confusing comment in si_upload_and_prefetch_VB_descriptors
|
||
- ac/surface/tests: fix the ARM build
|
||
- radeonsi,radv: fix a late alloc deadlock with <= 6 CUs per SA
|
||
- radeonsi: move an incorrectly placed comment about late alloc
|
||
- ac,radeonsi: move late alloc computation into common code and shader states
|
||
- radeonsi: enable uniform inlining by default
|
||
- util/idalloc: change num_elements to units of elements instead of bits
|
||
- util/idalloc: fold the size call into init
|
||
- util/idalloc: reserving an ID that already exists should be no-op
|
||
- util/idalloc: hide or remove unused public functions
|
||
- util/idalloc: add exists and foreach helpers
|
||
- util/idalloc: add util_idalloc_alloc_range
|
||
- radeonsi: don't expose no-attachment MSAA 16x on all 1 RB chips due to issues
|
||
- mesa: fix incorrect comment in draw_gallium_multimode
|
||
- st/mesa: always use PIPE_USAGE_STAGING for GL_MAP_READ_BIT usage
|
||
|
||
Mark Janes (11):
|
||
|
||
- iris: Increase the size of upload buffers
|
||
- iris: Upload constant resources for efficient GPU access
|
||
- iris: Use const_uploader for iris_create_stream_output_target
|
||
- iris: Use const uploader for blorp vertex data
|
||
- iris: Use const uploader for draw parameters
|
||
- iris: Use const uploader for user index data
|
||
- intel/compiler: Add getter helpers for LSC message descriptor fields
|
||
- intel/compiler: Add LSC messages to brw_schedule_instructions
|
||
- intel/fs: Lower DW untyped r/w messages to LSC when available
|
||
- intel/fs: Lower untyped atomic messages to LSC when available
|
||
- intel/fs: Lower A64 untyped r/w messages to LSC when available
|
||
|
||
Martin Krastev (1):
|
||
|
||
- compiler/glsl: Use mutex lock while freeing up mem_ctx
|
||
|
||
Martin Peres (1):
|
||
|
||
- ci: add the dEQP expectations for radv on Renoir
|
||
|
||
Matt Turner (10):
|
||
|
||
- intel/eu: Add instruction compaction support on XeHP.
|
||
- compiler/glsl: Return progress from propagate_invariance()
|
||
- compiler/glsl: Propagate invariant/precise when splitting arrays
|
||
- compiler/glsl: Always propagate_invariance() last
|
||
- freedreno/afuc: Print uintptr_t with PRIxPTR
|
||
- sparc: Avoid some redefinition warnings
|
||
- tu: Provide a toggle to avoid warnings about unsupported devices
|
||
- freedreno/ci: Use TU_IGNORE_CONFORMANCE_WARNING to reduce warnings
|
||
- ci: Unify on MESA_VK_IGNORE_CONFORMANCE_WARNING
|
||
- amd/ci: Use MESA_VK_IGNORE_CONFORMANCE_WARNING to reduce warnings
|
||
|
||
Matti Hamalainen (11):
|
||
|
||
- gallium/tools: clean up tracediff.sh a bit
|
||
- gallium/tools: improve option handling in dump_state.py
|
||
- gallium/tools: implement better suppression of variants
|
||
- gallium/tools: implement 'named' pointers option in dump.py
|
||
- gallium/tools: use left-column output mode of sdiff in tracediff.sh
|
||
- gallium/tools: improve tracediff.sh argument handling
|
||
- gallium/tools: implement "high-level" overview mode option in dump scripts
|
||
- gallium/tools: improve pointer type tracking in parse.py
|
||
- gallium/tools: add option to use Meld for diffing
|
||
- aux/trace: add missing return value to trace output
|
||
- gallium/tools: improve handling of pointer arrays
|
||
|
||
Mauro Rossi (11):
|
||
|
||
- egl/android: include "util/compiler.h" for FALLTHROUGH macro
|
||
- android: panfrost/lib: add pan_cs.c to Makefile.sources
|
||
- android: gallium/radeonsi: add nir include path
|
||
- android: amd/common: add nir include path
|
||
- android: pan/bi: add bi_opt_constant_fold.c to Makefile.sources
|
||
- android: nir: add nir_lower_fragcolor.c to Makefile.sources
|
||
- android: intel/compiler: add brw_compile_ff_gs.c to Makefile.sources
|
||
- android: i965: remove brw_ff_gs_emit.c from Makefile.sources
|
||
- android: ac: add ac_nir_lower_ngg.c to Makefile.sources
|
||
- android: ac: add include src/util path
|
||
- android: aco: add aco_optimizer_postRA.cpp to Makefile.sources
|
||
|
||
Michael Tang (1):
|
||
|
||
- microsoft/compiler: Maintain sorting of resource type in the context
|
||
|
||
Michael Walle (1):
|
||
|
||
- kmsro: Add mali-dp
|
||
|
||
Michel Dänzer (18):
|
||
|
||
- lima/ppir: Cast pointer to uintptr_t instead of uint64_t
|
||
- util: Remove unused Android options_tbl_lock
|
||
- Convert most remaining free-form fall-through comments to FALLTHROUGH
|
||
- Guard FALLTHROUGH annotations after assert()
|
||
- llvmpipe: Drop switch with only default case
|
||
- iris: Drop unneeded default switch case
|
||
- Use explicit break instead of fall-through to break-only case
|
||
- ci: Enable -Werror in clang jobs
|
||
- osmesa: Replace default case FALLTHROUGH annotation by following return
|
||
- ci: Enable -Werror for the remaining GCC build jobs
|
||
- ci: Move -Werror enabling from job definitions to meson build script
|
||
- ci: Add test which occasionally times out to lavapipe-vk skips
|
||
- Fix up leftover "state_trackers" references to "frontends"
|
||
- turnip: Mark local variable ASSERTED
|
||
- ci: Add debian/ prefix to job names for Debian based docker images
|
||
- ci: Rename Debian based build jobs from meson-* to debian-*
|
||
- ci: Add Fedora 34 based x86 build docker image
|
||
- ci: Add Fedora release build job
|
||
|
||
Michel Zou (14):
|
||
|
||
- lavapipe: fix unused variable warning
|
||
- vulkan: fix duplicate win32 def
|
||
- gallium: fix uninitialized variable warning
|
||
- meson: link vulkan_util with link_whole on mingw
|
||
- docs: list more vulkan extensions
|
||
- vulkan/wsi: avoid wsi_x11_check_for_dri3 for sw device
|
||
- zink: fix win32 build
|
||
- swr: fix uninitialized variable warnings
|
||
- llvmpipe: restrict optim bug workaround to gcc 10.x
|
||
- glapi: fix Warray-parameter
|
||
- zink: Drop useless zink_dispatch_table
|
||
- zink: Fix win32 build
|
||
- zink: Fix unused-variable warning
|
||
- meson: dont use missing dumpbin path
|
||
|
||
Miguel Gomez (1):
|
||
|
||
- i965: Prevent invalid framebuffer usage
|
||
|
||
Mike Blumenkrantz (548):
|
||
|
||
- gallium: add PIPE_BIND_SAMPLER_REDUCTION_MINMAX
|
||
- gallium: split PIPE_CAP_SAMPLER_REDUCTION_MINMAX into modes
|
||
- mesa/st: plumb GL_TEXTURE_REDUCTION_MODE_ARB through QueryInternalFormat
|
||
- zink: hook up VK_EXT_sampler_filter_minmax
|
||
- zink: support format queries for VK_EXT_sampler_filter_minmax
|
||
- zink: handle minmax sampler creation for VK_EXT_sampler_filter_minmax
|
||
- zink: export PIPE_CAP_SAMPLER_REDUCTION_MINMAX_ARB
|
||
- docs: update GL_ARB_texture_filter_minmax for zink
|
||
- zink: compare against screen batch id when determining which semaphore to use
|
||
- zink: always copy the nir shader before compiling
|
||
- zink: fix tcs slot map eval for user vars
|
||
- zink: fix tcs input reservation for user vars
|
||
- st/pbo: use cso_set_vertex_buffers_and_elements() for st_pbo_draw
|
||
- zink: merge copy-to-scanout path into non-deferred flush path
|
||
- zink: force scanout sync when mapping scanout resource
|
||
- util/format: add util_format_is_rgbx_or_bgrx
|
||
- zink: use undefined layout for first scanout obj transition
|
||
- Revert "zink: force scanout sync when mapping scanout resource"
|
||
- zink: move scanout sync to end of batch
|
||
- zink: add a flag indicating whether scanout object needs updating
|
||
- zink: move wsi flush info conditional to queue submission
|
||
- zink: directly set batch->state->flush_res from flush_resource hook
|
||
- zink: add clear-on-flush mechanic deeper into flush codepath
|
||
- gallium: when tracing is enabled for threaded drivers, trace the driver thread
|
||
- nir/lower_fragcolor: set outputs_written for fragdata members
|
||
- softpipe: fix render condition checking
|
||
- softpipe: fix streamout queries
|
||
- softpipe: ci updates
|
||
- zink: track persistent resource objects, not resources
|
||
- zink: restore previous semaphore (prev_sem) handling
|
||
- zink: use cached memory for staging resources
|
||
- zink: init timeline semaphore on screen creation, not first batch creation
|
||
- zink: only reset query on suspend if the query has previously been stopped
|
||
- zink: when performing an implicit reset, sync qbos
|
||
- lavapipe: implement VK_EXT_provoking_vertex
|
||
- zink: hook up VK_EXT_provoking_vertex
|
||
- zink: implement VK_EXT_provoking_vertex
|
||
- zink: ci updates
|
||
- zink: update docs
|
||
- nir/gl_lower_buffers: set access for ssbo load/store instrs
|
||
- zink: use non-atomic load/store ops if intrinsic is not actually coherent
|
||
- zink: remove leftover references to flatshading in shader keys
|
||
- zink: hook up VK_KHR_shader_clock
|
||
- zink: add conversion util for nir_scope -> SpvScope
|
||
- zink: add spirv builder for unops with a const operand
|
||
- zink: support nir_intrinsic_shader_clock
|
||
- zink: export PIPE_CAP_TGSI_CLOCK
|
||
- zink: generate spirv 1.5 from ntv when using vk >= 1.2
|
||
- zink: create entrypoints for descriptor variables with spirv 1.5
|
||
- zink: add fastpath for getting default shader variants
|
||
- zink: use first-created shader variant as the default
|
||
- zink: hook up VK_EXT_sample_locations
|
||
- zink: hook up VK_EXT_conservative_rasterization
|
||
- zink: hook up VK_EXT_shader_subgroup_ballot
|
||
- zink: hook up EXT_image_drm_format_modifier
|
||
- docs: mark off GL_ARB_shader_clock for zink
|
||
- gallium: rename pipe_draw_start_count -> pipe_draw_start_count_bias
|
||
- gallium: move pipe_draw_info::index_bias to pipe_draw_start_count_bias
|
||
- mesa/st: rename DrawGalliumComplex -> DrawGalliumMultiMode
|
||
- gallium: split drawid out of pipe_draw_info and as a separate draw_vbo param
|
||
- gallium: remove padding members from pipe_draw_info
|
||
- util/tc: split out drawid-using draws into a separate call
|
||
- iris: fix indirect drawid
|
||
- zink: grab GetPhysicalDeviceMemoryProperties2 from instance
|
||
- zink: hook up VK_EXT_memory_budget
|
||
- zink: support PIPE_CAP_QUERY_MEMORY_INFO
|
||
- zink: minor refactoring of buffer map for read case
|
||
- zink: add a screen util function for handling VkResults
|
||
- zink: use zink_screen_handle_vkresult() for fence and timeline waiting
|
||
- zink: add a ctx function for handling device lost resets
|
||
- zink: use new ctx device lost checker function
|
||
- zink: add a pipe_context::resource_commit hook
|
||
- zink: implement sparse buffer creation/mapping
|
||
- zink: export PIPE_CAP_SPARSE_BUFFER_PAGE_SIZE
|
||
- aux/cso_cache: add handling for save/restore of compute states
|
||
- zink: clamp zs samplers to XXXX swizzle for all non-zero/one swizzles
|
||
- gallium/inlines: remove atomic set from pipe_reference_init()
|
||
- nir: add nir_isub_imm
|
||
- lavapipe: handle buffer sizes better in CmdBindTransformFeedbackBuffersEXT
|
||
- lavapipe: do not read sampler descriptor info during update if layout has immutables
|
||
- lavapipe: set events to the unsignalled state on creation
|
||
- lavapipe: flag renderpasses as having color/zs attachments
|
||
- lavapipe: update more states on null multisample pipeline info
|
||
- lavapipe: zero out the dsa state info and flag for updating on null dsa state
|
||
- lavapipe: zero out the blend state info and flag for updating on null blend state
|
||
- lavapipe: don't unnecessarily flag dsa states for updating
|
||
- lavapipe: ignore tess pipeline info if no tess shaders in pipeline
|
||
- lavapipe: don't access pipeline viewport state when it should be ignored
|
||
- lavapipe: don't access pipeline dsa state when it should be ignored
|
||
- lavapipe: don't access pipeline blend state when it should be ignored
|
||
- zink: split off descriptor layout from descriptor pools
|
||
- zink: unify pipeline layout creation functions
|
||
- zink: abstract descriptor init
|
||
- zink: abstract descriptor usage for programs
|
||
- zink: abstract descriptor pool usage for programs
|
||
- zink: use explicit types during descriptor updates
|
||
- zink: check descriptor layout support before creating it
|
||
- zink: move more vertex state stuff into the hw state
|
||
- zink: split vertex state pipeline hashing into its own value
|
||
- zink: flag pipeline for change more often when vbos change without dynamic state
|
||
- zink: return current pipeline object if state hasn't changed
|
||
- zink: hook up dynamic dsa states
|
||
- zink: start using dynamic front face state
|
||
- util/hash_table: _mesa_hash_table_create_u32_keys()
|
||
- zink: add a pipe_context::clear_buffer hook
|
||
- zink: never use LINEAR for VK_EXT_4444_formats
|
||
- zink: make ZINK_INLINE_UNIFORMS more standardized in function
|
||
- zink: clamp 3D surface viewtype to 2D only in the create_surface hook
|
||
- zink: add a target param to create_ivci()
|
||
- zink: simplify samplerview surface creation
|
||
- zink: only set layer info for samplerviews if there are multiple layers
|
||
- zink: handle in-renderpass clears in fb_clears_apply_internal()
|
||
- zink: break zs clear loop once both bits are set when beginning renderpass
|
||
- zink: add debug assert to verify that zink_clear_framebuffer() is accurate
|
||
- zink: remove compute cruft from resource mapping
|
||
- zink: break out draw dispatch into separate functions
|
||
- zink: fix texture barriers for real this time
|
||
- zink: rework memory_barrier hook again (third time's the charm)
|
||
- ci: skip glsl-uniform-interstage-limits tests for softpipe jobs
|
||
- zink: use DONTCARE renderpass when a new scanout fb attachment is set
|
||
- iris: refcount separate screen objects for resource tracking
|
||
- zink: stop invalidating descriptor sets on pool destroy
|
||
- zink: add context-based descriptor info tracking infrastructure
|
||
- zink: unify resource rebinding
|
||
- zink: track bind counts for descriptors
|
||
- zink: update samplerview descriptor layouts when image binds are set
|
||
- zink: don't track sampler states onto buffer sampler sets
|
||
- zink: track max slot idx for descriptor types
|
||
- zink: track number of tbos in shader data
|
||
- zink: add slot params to zink_context_invalidate_descriptor_state
|
||
- zink: use better iterating for buffer rebinds
|
||
- zink: call invalidate on invalid descriptor sets during recycle
|
||
- zink: make zink_context_update_descriptor_states() static
|
||
- zink: remove screen param from zink_descriptors_update()
|
||
- zink: pop descriptor refs when invalidating sets
|
||
- zink: flush every 100k draws/computes
|
||
- zink: check for a work_count-based stall in zink_maybe_flush_or_stall()
|
||
- zink: always do maybe_flush after draw/compute
|
||
- zink: stop overwriting buffer map pointers for stream uploader
|
||
- zink: fix DrawParameters shader cap usage
|
||
- lavapipe: fix fencing when submitting multiple cmdbufs
|
||
- zink: immediately return false when getting query result if it's not gonna happen
|
||
- util/queue: don't require a fence when adding a job
|
||
- zink: split out base renderpass begin into separate function
|
||
- zink: add a flag for tracking/validating renderpass clears
|
||
- zink: add flags for determining whether to update framebuffer and renderpass
|
||
- zink: emit some barriers out of renderpass where possible
|
||
- nir/builder: add nir_pad_vector and nir_pad_vec4 util functions
|
||
- zink: don't multiply cube array image layers
|
||
- zink: populate images with u_blitter if transfer_dst isn't available
|
||
- zink: add even more validation for linear images before creation
|
||
- util/primconvert: add C++ guards to header
|
||
- aux/trace: support pipe_screen::query_memory_info
|
||
- aux/trace: pipe_screen::query_dmabuf_modifiers
|
||
- aux/trace: pipe_context::is_dmabuf_modifier_supported
|
||
- aux/trace: propagate pipe_screen::transfer_helper pointer
|
||
- aux/trace: pipe_screen::get_dmabuf_modifier_planes
|
||
- aux/trace: trace pipe_screen::resource_create_with_modifiers
|
||
- util/prim_restart: fix util_translate_prim_restart_ib
|
||
- ci: more freedreno flakes
|
||
- aux/vbuf: prevent uint underflow and assert if no vbs are dirty
|
||
- aux/trace: add pipe_context::set_debug_callback hook
|
||
- aux/trace: more effectively unwrap pipe_context params from screen functions
|
||
- aux/trace: trace transfer ops
|
||
- aux/trace: stop dumping transfer data for threaded contexts
|
||
- aux/trace: hook tc methods
|
||
- aux/trace: fix set_inlinable_constants hook
|
||
- aux/trace: fix query handling with tc
|
||
- aux/trace: add a pipe_context::clear_buffer hook
|
||
- aux/trace: dump 'wait' param for get_query_result
|
||
- radeonsi: explicitly return support for all index buffer formats
|
||
- zink: rename ptr_add_usage -> batch_ptr_add_usage
|
||
- zink: make descriptor_layout_get a public util function
|
||
- zink: make a public util function for allocating descriptor sets
|
||
- zink: unify pipeline layout creation and move to descriptor_program_init
|
||
- zink: pass descriptor type to set layout create()
|
||
- zink: replace has_descriptors program member with a util function
|
||
- zink: abstract descriptor functionality and make descriptor structs private
|
||
- zink: improve samplerview update flagging
|
||
- zink: emit descriptor barriers and references during bind
|
||
- zink: add vertex buffer barriers during bind
|
||
- zink: make timeline_wait use only a screen param
|
||
- zink: move timeline_wait() to screen function
|
||
- zink: implement tc idalloc resource id stuff
|
||
- zink: force streamout rebind when mapping a streamout buffer for writing
|
||
- zink: implement a tc is_resource_busy hook
|
||
- zink: call tc_driver_internal_flush_notify() on flush
|
||
- zink: mark some buffer barrier functions inline/static
|
||
- zink: switch to memory barriers instead of actual buffer barriers
|
||
- zink: hook up push descriptor and descriptor template extensions
|
||
- zink: disable push descriptors on amd
|
||
- nir/builder: add nir_mask
|
||
- radv: make radv_pipeline::attrib_ends 32bit
|
||
- radv: set maxVertexInputAttributeOffset to UINT32_MAX
|
||
- zink: remove weird lod hack for texturing
|
||
- zink: ci updates
|
||
- llvmpipe: remove clamping to [0,1] for tri offset
|
||
- lavapipe: moar @optimize
|
||
- llvmpipe: split out scene surface info into separate struct
|
||
- llvmpipe: split out scene surface init into separate function
|
||
- llvmpipe: only dump tgsi shaders if they're actually tgsi shaders
|
||
- llvmpipe: store a screen pointer in resource struct
|
||
- llvmpipe: stop accessing pipe_resource::screen internally
|
||
- lavapipe: skip "pipeline barriers" if they're first or last in a cmdbuf
|
||
- lavapipe: also ignore multiple pipeline barriers in succession
|
||
- gallium/aux: add helper for pre-clamping clear_buffer value to dword
|
||
- zink: clamp clear_buffer values
|
||
- radeonsi: clamp clear_buffer values using new util helper
|
||
- zink: improve unsupported feature warning message
|
||
- aux/trace: avoid deadlock in screen::flush_frontbuffer hook
|
||
- gallivm: fix oob imageLoad with formats that have <4 components
|
||
- llvmpipe: ci updates
|
||
- aux/indices: break out primitive type conversion to separate function
|
||
- aux/indices: break out index size conversion to separate function
|
||
- aux/indices: break out index count conversion into separate function
|
||
- aux/indices: employ Delete The Code methodology
|
||
- lavapipe: add more format mappings for vertex buffer formats
|
||
- zink: reapply resource/surface refs after app flushes
|
||
- zink: reapply program refs automatically
|
||
- zink: remove barriers/refs from descriptor cache
|
||
- zink: mark some draw functions inline
|
||
- zink: only rebind pipelines when necessary
|
||
- zink: handle rebinds for vertex buffers
|
||
- zink: only rebind vertex buffers when necessary
|
||
- zink: only update viewport state when necessary
|
||
- zink: update scissor only when necessary
|
||
- zink: ref vertex buffers during set_vertex_buffers
|
||
- zink: stop using util_set_vertex_buffers_mask()
|
||
- Revert "zink: call tc_driver_internal_flush_notify() on flush"
|
||
- compiler/spirv: expand_to_vec4 -> nir_pad_vec4
|
||
- anv: fix availability for copying timestamp query results
|
||
- zink: add a second descriptor manager
|
||
- zink: unify code for updating res->bind_count values
|
||
- zink: unify more resource bind count tracking code
|
||
- zink: optimize buffer rebinds
|
||
- zink: ci updates
|
||
- aux/trace: dump resource for samplerview and surface
|
||
- aux/draw: if pipe_draw_info::index_bias_varies is not set, ignore index_bias for N>1
|
||
- aux/draw: fix aalines and aapoints for shaders with explicit FragData outputs
|
||
- radv: declare index_va in a single call for indexed draw packet emit
|
||
- radv: explicitly load a desc set layout struct member during set allocate
|
||
- zink: add a util function to create a null surface
|
||
- zink: replace context-based null framebuffer surfaces with internal ones
|
||
- zink: create dummy surface/bufferview for null descriptor use
|
||
- zink: handle null bufferview/imageview descriptors when robustness2 is missing
|
||
- zink: ci updates
|
||
- zink: no-op read access buffer barriers if existing access exists for earlier stage
|
||
- zink: emit fb attachment barriers inline during renderpass start
|
||
- zink: track number of fb attachment binds on resources
|
||
- zink: use VK_IMAGE_LAYOUT_DEPTH_STENCIL_READ_ONLY_OPTIMAL when possible
|
||
- aux/tc: fix ubo unbinding
|
||
- Revert "Revert "zink: call tc_driver_internal_flush_notify() on flush""
|
||
- nouveau: explicitly advertise index buffer format support
|
||
- r300: explicitly advertise index buffer format support
|
||
- d3d12: explicitly advertise index buffer format support
|
||
- zink: explicitly advertise index buffer format support
|
||
- zink: more accurately handle shader layer/viewport caps
|
||
- util/prim_restart: assert the index size at the start of the function
|
||
- util/prim_restart: pre-trim degenerate primitives during draw rewrite
|
||
- util/prim_restart: store index bounds while rewriting draws
|
||
- util/prim_restart: store the total index count when rewriting draws
|
||
- util/prim_restart: update index bounds before draws in util_draw_vbo_without_prim_restart
|
||
- util/prim_restart: simplify util_draw_vbo_without_prim_restart a bit
|
||
- zink: populate maxSampleLocationGridSize for all available sample sizes on init
|
||
- zink: set VK_IMAGE_CREATE_SAMPLE_LOCATIONS_COMPATIBLE_DEPTH_BIT_EXT on zs rts
|
||
- zink; add a pipe_screen::get_sample_pixel_grid hook
|
||
- zink: add a pipe_context::set_sample_locations hook
|
||
- zink: also flag sample_locations_changed if framebuffer samples changes
|
||
- zink: add a util function for populating VkSampleLocationsInfoEXT
|
||
- zink: update vk sample location info during framebuffer setup
|
||
- zink: add a pipe_context::evaluate_depth_buffer hook
|
||
- zink: use dynamic state to apply sample locations during draw
|
||
- zink: export PIPE_CAP_PROGRAMMABLE_SAMPLE_LOCATIONS
|
||
- util/vbuf: fix buffer overrun in attribute conversions
|
||
- zink: fix caching of shader variants with inlined uniforms
|
||
- util/blitter: remove duplicated set_sample_mask calls
|
||
- util/disk_cache: add nocopy variant of disk cache store function
|
||
- zink: use scissor region for discarding clears during blit
|
||
- zink: clamp PIPE_CAP_MAX_VIEWPORTS to PIPE_MAX_VIEWPORTS
|
||
- aux/cso: add flag to disable vbuf
|
||
- aux/cso: split cso_destroy_context into unbind and a destroy functions
|
||
- lavapipe: use cso caching
|
||
- zink: fix typo that's definitely not at all embarrassing or anything like that
|
||
- aux/cso: store flatshade_first state from rasterizer
|
||
- util/primconvert: add function for setting flatshade_first
|
||
- util/vbuf: add flatshade_first to vbuf context and api
|
||
- aux/cso: set flatshade_first onto vbuf when binding rasterizer
|
||
- aux/tc: pass rebind count and rebind bitmask with replace_buffer_storage func
|
||
- util/prim_restart: use more direct conversion for restart index
|
||
- zink: add a function for creating descriptor layouts for push sets
|
||
- zink: split lazy sets based on descriptor type
|
||
- zink: match lazy descriptor set layout in cache mode
|
||
- zink: modernize cached push ubo descriptor updating
|
||
- zink: modernize cached ubo descriptor updating
|
||
- zink: modernize cached ssbo descriptor updating
|
||
- zink: modernize cached image descriptor updating
|
||
- zink: remove sorting for dynamic ubo offset updating
|
||
- zink: move ubo range assert to update_descriptor_state()
|
||
- zink: unify cached descriptor update code
|
||
- zink: run lazy batch descriptor functions in cache mode
|
||
- zink: add is_buffer flag to union zink_descriptor_surface
|
||
- zink: update null sampler/image descriptor surface with is_buffer during hashing
|
||
- zink: move shader image descriptor set refs to underlying type
|
||
- zink: add funcs for descriptor_surface refs
|
||
- zink: move samplerview descset refs to base objects
|
||
- zink: enable templated descriptor updates in cache mode
|
||
- zink: add oob asserts for descriptor set ref setting
|
||
- zink: skip hash updates for descriptor types which aren't used
|
||
- zink: unblock last_set cached descriptor reuse when safe to do so
|
||
- zink: add ZINK_DESCRIPTORS env var to explicitly set a mode
|
||
- zink: remove zink_batch_state::descs_used
|
||
- zink: split batch state work_count into separate vars
|
||
- zink: reorder has_barriers flag in batch state struct
|
||
- zink: optimize zink_tc_fence struct packing
|
||
- zink: move batch usage functions to static inlines
|
||
- zink: remove atomic from batch usage setting
|
||
- zink: make batch_usage_unset take a batch state param
|
||
- zink: unset program batch usage on state reset
|
||
- zink: remove unnecessary conditionals in resource batch tracking
|
||
- zink: make batch_usage_set take a batch state param
|
||
- zink: make batch_usage_matches take a batch state param
|
||
- zink: cache descriptor update templates along with layout
|
||
- zink: track active use counts for descriptor layouts
|
||
- zink: destroy lazy descriptor pools during batch reset when unused
|
||
- zink: slightly refactor program updating during draw
|
||
- zink: remove return types from program update functions during draw
|
||
- zink: simplify zink_program_has_descriptors()
|
||
- zink: mark bind_stage() as inline
|
||
- zink: unify gfx shader create callbacks
|
||
- zink: use u_live_shader_cache
|
||
- zink: remove unnecessary draw checks
|
||
- zink: move batch decl to top of draw_vbo
|
||
- zink: stop sanitizing primitive_restart flag in draw info
|
||
- zink: handle nir_op_pack_64_2x32
|
||
- zink: add update flag for rasterizer state change
|
||
- zink: add update flag for dsa state change
|
||
- zink: split stencil ref changes to separate dirty flag
|
||
- anv: fix dynamic primitive topology for tess
|
||
- zink: update pipe_screen::num_contexts
|
||
- zink: set subdata hook as PIPE_MAP_ONCE
|
||
- zink: move queue init to screen creation
|
||
- util/queue: add a global data pointer for the queue object
|
||
- zink: add a more direct check for rgbx formats in create_sampler_view hook
|
||
- zink: smash dstAlphaBlendFactor to ZERO for RGBX attachments
|
||
- zink: also nope out of any dst alpha blends for rgbx formats
|
||
- zink: support more RGBX formats
|
||
- zink: ci updates
|
||
- zink: mark some functions inline
|
||
- zink: collapse host_visible and non-coherent alignment alloc cases
|
||
- zink: change a bunch of sparse buffer resource checks to host-visible checks
|
||
- zink: avoid caching visible vram allocations
|
||
- zink: key alloc cache on heap index, not heap flags
|
||
- zink: check actual mem props to determine if resource object is coherent
|
||
- zink: use fake buffer barriers for descriptors
|
||
- zink: always defer image descriptor barriers
|
||
- zink: remove duplicated bitflag filtering for inline uniforms
|
||
- zink: remove inlinable_uniforms_dirty_mask
|
||
- radv: move pipe_misaligned and l2_coherent image checks to flags set on init
|
||
- nine: only enable tgsi disk cache if the driver supports it
|
||
- nine: add zink to the build target
|
||
- zink: handle custom border color without matching wrap mode case
|
||
- zink: add a flag for disabling conditional render during blit
|
||
- zink: add more clear hooks
|
||
- zink: clear the fb clears array instead of freeing it on reset
|
||
- zink: support multidraw
|
||
- zink: use multidraw
|
||
- vk/util: add macros for multidraw
|
||
- zink: clear textures directly when possible
|
||
- zink: only update last_finished during batch reset if the batch was used
|
||
- zink: improve tc fence disambiguation
|
||
- zink: add and use fencing functions which take batch usage structs
|
||
- zink: use batch usage api for resource helper function
|
||
- zink: remove no-longer-used resource helper functions
|
||
- zink: queue v3.0
|
||
- zink: apply zink_resource_object::offset for memory flush/invalidates
|
||
- zink: break out offset alignment calculation into helper
|
||
- zink: make init_mem_range() a public function
|
||
- zink: enforce multi-context waiting for unflushed resources on foreign batches
|
||
- zink: move queue submit thread to screen
|
||
- zink: move sparse buffer commit to screen queue
|
||
- zink: move fence reset to submit thread
|
||
- zink: flag scanout updates to batch state, not resource
|
||
- zink: move some end-of-batch stuff to submit thread
|
||
- zink: don't clear batch resources on fence finish
|
||
- ci: disable panfrost t760 jobs
|
||
- aux/draw: add a util function for reading back indirect draw params
|
||
- util/prim_restart: break out draw rewriting into separate function
|
||
- util/primconvert: handle indirect draws
|
||
- util/primconvert: map index buffer before getting index translator function
|
||
- util/primconvert: handle rewriting of prim-restart draws with unsupported primtype
|
||
- util/primconvert: handle multidraws in primconvert
|
||
- gallium: add a pipe cap to rewrite index buffers for draws using a non-fixed restart index
|
||
- gallium: handle automatic 8bit -> 16bit index buffer rewrites
|
||
- gallium: add a pipe cap for performing automatic prim type conversion
|
||
- gallium: add a pipe cap for determining driver support for prim type in restarts
|
||
- zink: export PIPE_CAP_EMULATE_NONFIXED_PRIMITIVE_RESTART
|
||
- zink: export 8bit index buffer support based on extension presence
|
||
- zink: export supported prim types
|
||
- zink: export supported primitive restart types
|
||
- zink: remove primconvert
|
||
- zink: ci updates
|
||
- zink: use depth/stencil-only layouts for depth/stencil-only formats
|
||
- lavapipe: implement multidraw ext
|
||
- zink: break out image descriptor layout into util function
|
||
- zink: split deferring of barriers to image and buffer functions
|
||
- zink: only do deferred image barriers if layout changes
|
||
- zink: use bind counts to more accurately determine image descriptor's exact layout
|
||
- zink: improve automatic layout transitions for sampler+image descriptors
|
||
- zink: only queue deferred descriptor layout change on first bind or change
|
||
- zink: flush pending clears if a resource is bound as a descriptor
|
||
- zink: repack zink_context struct a bit
|
||
- anv: unify some draw state vertex constant emission
|
||
- anv: VK_EXT_multi_draw implementation
|
||
- util/vbuf: always claim support for PATCHES in restart modes
|
||
- util/vbuf: flag fallback_always if any prim types are missing from restart modes
|
||
- zink: add direct conversion from pipe_shader_type->VkPipelineStageFlags
|
||
- zink: split dummy buffer creation and populate
|
||
- zink: try for better buffer allocation heaps
|
||
- zink: don't align device-local buffer memory
|
||
- zink: make mem cache limits dynamically scalable
|
||
- zink: uncap mem caching
|
||
- zink: cache visible vram
|
||
- zink: attempt to handle some resource unmap cases in 32bit envs
|
||
- radv: pre-calc vertex buffer descriptor size on pipeline object
|
||
- lavapipe: hook up some bits for handling dynamic line stipple state
|
||
- lavapipe: implement EXT_vertex_input_dynamic_state
|
||
- zink: avoid unnecessarily rewriting gl_DrawID
|
||
- zink: unify/consolidate some barrier queuing
|
||
- zink: break up ctx descriptor state updating to per-type functions
|
||
- zink: add a ref for flush resource
|
||
- zink: unify fb surface unbinding
|
||
- zink: move line width and depth bias updating into conditional during draw
|
||
- zink: merge some streamout state emission into the same draw conditional
|
||
- zink: rework pipeline cache implementation
|
||
- zink: make prim type a bitfield in pipeline info
|
||
- zink: rename 'template' struct member
|
||
- zink: remove unnecessary return from zink_desc_type_from_vktype()
|
||
- zink: add c++ header guards
|
||
- zink: add more explicit casts to draw code
|
||
- zink: don't add batch tracking during buffer rebinds if refs are dirty
|
||
- zink: remove stencil resource batch tracking
|
||
- zink: split out resource tracking into more incremental functions
|
||
- zink: split batch usage setting from refcounting
|
||
- zink: split samplerview/imageview usage/refcounting calls
|
||
- zink: add resource refs after last descriptor unbind
|
||
- zink: use vkresult helper for map return
|
||
- zink: only flag persistent resource maps for invalidation if they aren't coherent
|
||
- zink: don't add mem allocation offset when copying buf2image
|
||
- zink: use pipe_resource::width0 for clamping ssbo sizes
|
||
- zink: use 0 as the offset when mapping qbos
|
||
- zink: stop screwing up buffer offsets during for maps
|
||
- zink: add a screen function for waiting on a batch id
|
||
- zink: check last_finished before timeline waiting
|
||
- lavapipe: store whether the geometry shader outputs GL_LINES
|
||
- lavapipe: store the geometry shader prim type to render state
|
||
- lavapipe: implement VK_EXT_line_rasterization
|
||
- lavapipe: wideLines support
|
||
- zink: ci updates for wideline fails
|
||
- relnotes: add some line feature updates for lavapipe
|
||
- features: mark off line rasterization for lavapipe
|
||
- features: mark off some zink features
|
||
- features: fix ARB_shader_group_vote -> GL_ARB_shader_group_vote
|
||
- features: add VK_EXT_multi_draw
|
||
- features: mark off EXT_vertex_input_dynamic_state for lavapipe
|
||
- radv: use multidraw iteration for direct draws
|
||
- radv: emit NOT_EOP for multi indexed draws
|
||
- radv: emit drawid for multidraws
|
||
- radv: determine if hardware can emit NOT_EOP before emitting
|
||
- radv: split indexed draw cases based on whether drawid is used
|
||
- radv: add a gfx10 bug workaround for NOT_EOP
|
||
- radv: implement VK_EXT_multi_draw
|
||
- lavapipe: handle null vertex buffers more gracefully
|
||
- util/vbuf: check 3-component 16bit int formats for translation
|
||
- zink: make shader cache local to a single program
|
||
- zink: split up shader cache per-stage
|
||
- zink: set gfx program shaders and generate internal tcs during program creation
|
||
- zink: remove gfx program slot mapping
|
||
- zink: remove shader_id
|
||
- zinK: tweak shader module update -> pipeline combined_dirty conditional
|
||
- lavapipe: implement EXT_separate_stencil_usage
|
||
- lavapipe: implement KHR_separate_depth_stencil_layouts
|
||
- features: more lavapipe extensions
|
||
- relnotes: more lavapipe features
|
||
- zink: add util function for transferring resource refs to batch
|
||
- zink: move resource object ref to batch in invalidate hook
|
||
- zink: move resource object ref to batch in init_storage_object
|
||
- zink: remove refs from buffer rebinds
|
||
- zink: remove refs from vertex buffers
|
||
- zink: remove refs from ubos
|
||
- zink: remove refs from shader buffers
|
||
- zink: remove refs from shader images
|
||
- zink: remove resource refs from samplerviews
|
||
- zink: remove refs from desc ref updating
|
||
- zink: add surface ref during rebind if unflushed usage
|
||
- zink: set new batch usage during surface rebinds
|
||
- zink: remove imageview refs from shader images
|
||
- zink: remove samplerview refs
|
||
- zink: remove fb surface refs
|
||
- zink: remove fb surface resource refs
|
||
- zink: remove some descriptor_refs_dirty checks from resource binding
|
||
- zink: add a per-stage mask for ubo binds
|
||
- zink: add a per-stage bind mask for ssbos
|
||
- zink: make samplerview bind mask apply to buffer resources too
|
||
- zink: make image_bind_count work for buffers
|
||
- zink: remove barriers from buffer rebinds
|
||
- zink: optimize buffer rebinds
|
||
- zink: redo streamout and texture components of memory_barrier hook
|
||
- zink: remove unnecessary stall during device-local map case
|
||
- lavapipe: only apply pipeline state for depth bias if it's enabled
|
||
- lavapipe: implement EXT_extended_dynamic_state2
|
||
- features: EXT_extended_dynamic_state2 for lavapipe
|
||
- relnotes: EXT_extended_dynamic_state2 for lavapipe
|
||
- zink: store the last vertex stage to the context during bind
|
||
- zink: use last_vertex_stage pointer to optimize streamout emission during draw
|
||
- zink: update streamout buffer strides inline
|
||
- zink: move descriptor update closer to start of draw
|
||
- zink: consolidate and optimize index buffer handling during draw
|
||
- features: mark off VK_EXT_multi_draw for radv
|
||
- zink: remove zink_shader_module refcounting
|
||
- zink: flag all shaders for create during gfx program init
|
||
- zink: keep a mask of stages present in a gfx program
|
||
- zink: flag shader modules as default
|
||
- zink: store the default variant hash for a program
|
||
- nir/format_convert: nir_shift -> nir_shift_imm
|
||
- nir/format_convert: add ssa version of uint packing
|
||
- lavapipe: disable line rasterization ext
|
||
- zink: ensure sparse allocations aren't marked host-visible
|
||
- zink: fix mem info query to be more permissive
|
||
- zink: zero out sampler/image descriptor surface info for null descriptor updates
|
||
- zink: ci updates
|
||
- zink: populate modifier props onto screen object during init
|
||
- zink: start storing modifiers to the base resource struct
|
||
- zink: store modifier aspect to resource
|
||
- zink: add a pipe_screen::resource_get_param hook
|
||
- zink: use VkImageDrmFormatModifierListCreateInfoEXT for creating from modifier array
|
||
- zink: explicitly disallow using the modifier image create for non-linear images
|
||
- zink: don't pass modifier count to first image create
|
||
- zink: add fallback for linear modifier use
|
||
- zink: add a pipe_screen::resource_create_with_modifiers hook
|
||
- features: mark off line rasterization for lavapipe
|
||
- relnotes: add some missing zink/lavapipe updates
|
||
- ci: add vulkan files to lavapipe rules
|
||
- ci: only trigger gallium_core_file_list jobs from dri and glx frontend changes
|
||
- zink: simplify modifier ifdefs
|
||
- zink: improve detection for broken drawids
|
||
- lavapipe: increment drawid for multidraws
|
||
- util/foz: stop crashing on destroy if prepare hasn't been called
|
||
- zink: use array size in spirv bo length calculations
|
||
|
||
Nanley Chery (8):
|
||
|
||
- anv: Add clear_supported to anv_layout_to_aux_state
|
||
- anv: Avoid sampling some MCS surfaces with clear
|
||
- iris: Avoid sampling some MCS surfaces with clear
|
||
- isl: Add isl_aux_usage_has_compression
|
||
- iris: Prefer more GPU-based uploads for compression
|
||
- intel: Limit the D16 workarounds to Gfx12.0
|
||
- anv,iris: Port the D16 workaround stalls to BLORP
|
||
- intel/isl: Fix HiZ+CCS comment about ambiguates
|
||
|
||
Neha Bhende (4):
|
||
|
||
- svga: Add target and sampler_return_type info into shader key
|
||
- svga: Use shader_key info to declare resources if TGSI shader is missing it
|
||
- svga: Initialize pipe_shader_state for transform shaders
|
||
- aux/indices: include provoking vertex check in prim type conversion
|
||
|
||
Neil Roberts (1):
|
||
|
||
- kmsro: Fix confusing comma expression
|
||
|
||
Niklas Haas (3):
|
||
|
||
- vulkan/wsi/x11: return VK_SUBOPTIMAL_KHR on mismatched swapchain
|
||
- vulkan/wsi/x11: lower resize events to VK_SUBOPTIMAL_KHR
|
||
- vulkan/wsi/wayland: implement the full format table
|
||
|
||
Olivier Fourdan (1):
|
||
|
||
- radeonsi: Check aux_context on si_destroy_screen()
|
||
|
||
Paul Gofman (1):
|
||
|
||
- util: add force_gl_names_reuse for SWKOTOR.
|
||
|
||
Paul Kocialkowski (1):
|
||
|
||
- lima: Take offset in account when checking BO size
|
||
|
||
Paulo Zanoni (2):
|
||
|
||
- iris: finish converting from drmIoctl to intel_ioctl
|
||
- iris: don't munmap NULL pointers
|
||
|
||
Petr Vaněk (1):
|
||
|
||
- docs/install: remove one extra when
|
||
|
||
Philipp Zabel (1):
|
||
|
||
- etnaviv: fix gbm_bo_get_handle_for_plane for multiplanar images
|
||
|
||
Philippe Normand (1):
|
||
|
||
- i915: Prevent invalid framebuffer usage
|
||
|
||
Pierre Moreau (2):
|
||
|
||
- clover/spirv: Properly size 3-component vector args
|
||
- clover/nir: Set constant buffer pointer size to host
|
||
|
||
Pierre-Eric Pelloux-Prayer (57):
|
||
|
||
- driconf: add workaround for Golf With Friends
|
||
- glx: init __GLXvendorInfo to NULL
|
||
- radeonsi/nir: enable nir_opt_move_discards_to_top pass
|
||
- radeonsi: enable glsl_correct_derivatives_after_discard by default
|
||
- st/mesa: fix clearing of 1D array textures
|
||
- frontend/dri: set PIPE_BIND_PROTECTED later
|
||
- frontend/dri: fix bool/int comparison
|
||
- radeonsi: allow write-only mapping of encrypted textures
|
||
- radeonsi: fix encryption check for buffers
|
||
- radeonsi: dirty msaa_config on rs->multisample_enable change
|
||
- winsys/amdgpu: don't read bo->u.slab.entry after pb_slab_free
|
||
- amdgpu/winsys: remove amdgpu_cs_has_chaining
|
||
- winsys/amdgpu: reduce amdgpu_cs size
|
||
- winsys/amdgpu: use int16 for buffer_indices_hashlist
|
||
- radeonsi: add _once suffix to depth_cleared_level_mask
|
||
- radeonsi: add si_install_draw_wrapper
|
||
- radeonsi: use si_install_draw_wrapper for tmz handling
|
||
- radeonsi/nir: add si_nir_is_output_const_if_tex_is_const
|
||
- radeonsi: use si_nir_is_output_const_if_tex_is_const
|
||
- vbo: delay vbo_exec_vtx_map call
|
||
- radeonsi: delay sample_pos_buffer creation until first use
|
||
- util/u_queue: move function definition up
|
||
- util/u_queue: add UTIL_QUEUE_INIT_SCALE_THREADS flag
|
||
- disk_cache: use UTIL_QUEUE_INIT_SCALE_THREADS
|
||
- radeonsi: skip instance_count==0 draws on <= GFX9
|
||
- radeonsi: disable ngg culling on llvm < 12
|
||
- mesa/shaderapi: change construct_name signature
|
||
- mesa/shaderapi: add an optional shader override mechanism
|
||
- ac/llvm: call the callback in all return paths of ac_cull_triangle
|
||
- radeonsi: fix fb_too_small condition
|
||
- radeonsi/gfx7: always sync pfp/me
|
||
- ac/surface: don't print stencil info if tex has no stencil
|
||
- radeonsi/driconf: add workaround for SpaceEngine
|
||
- glthread: add a last parameter to unmarshal functions
|
||
- glthread: return consumed bytes
|
||
- glthread: use custom marshal/unmarshal for CallList
|
||
- glthread: merge sucessive glCallList
|
||
- dlist: add locked param to _mesa_lookup_list
|
||
- dlist: prelock ctx->Shared->DisplayList before execute_list
|
||
- dlist: remove OPCODE_EXT_0
|
||
- dlist: remove InstSize
|
||
- dlist: unindent code
|
||
- dlist: use an union instead of allocating a 1-sized array
|
||
- dlist: always use merged primitive for drawing
|
||
- dlist: split hot/cold data from vertex_list
|
||
- dlist: use a separate opcode for vbo replay using loopback
|
||
- dlist: use a new OPCODE to avoid loading cold data
|
||
- dlist: increment/check list nesting when handling OPCODE_CALL_LIST(S)
|
||
- dlist: store all dlist in a continuous memory block
|
||
- dlist: remove _mesa_dlist_alloc_aligned
|
||
- dlist: remove unused _mesa_dlist_alloc
|
||
- dlist: skip NOP command at the head of a list
|
||
- mesa: clear shader_info::is_lowered in prog_to_nir
|
||
- mesa: fix bindless uniform samplers update
|
||
- dlist: don't handle unmerged draws as merged
|
||
- gallium/va: don't use key=NULL in hash tables
|
||
- amd/registers: fix fields conflict detection
|
||
|
||
Qiang Yu (1):
|
||
|
||
- st/mesa: fix size miss match for some check
|
||
|
||
Rafael Antognolli (5):
|
||
|
||
- intel/fs: Lower dword integer multiplies on XeHP.
|
||
- iris/bufmgr: Query memory region info.
|
||
- iris/bufmgr: Add new set of buckets for local memory.
|
||
- iris/bufmgr: Add flag to allocate from local memory.
|
||
- iris: Map with WC on non-LLC platforms.
|
||
|
||
Rhys Perry (92):
|
||
|
||
- aco/ra: use original names when renaming loop carried phi operands
|
||
- aco/ra: remove live-in temporary from live_out_per_block when moving it
|
||
- radv: fix barrier in radv_decompress_dcc_compute shader
|
||
- radv: fix clearing DCC-compressed e5b9g9r9 images
|
||
- aco: set TRUNC_COORD=0 for nir_texop_tg4
|
||
- ac/nir: set TRUNC_COORD=0 for nir_texop_tg4
|
||
- aco: remove image parameter from get_sampler_desc()
|
||
- Revert "radeonsi: set TRUNC_COORD=0 for Total War: WARHAMMER to fix it"
|
||
- aco: don't update register demand during RA validation
|
||
- aco: allow SDWA sels smaller than the operand size
|
||
- aco: add and use Program::progress
|
||
- nir/load_store_vectorize: assume CAN_REORDER ops don't alias with stores
|
||
- nir/opt_load_store_vectorize: improve handling of swizzles
|
||
- nir/opt_load_store_vectorize: ignore load_vulkan_descriptor
|
||
- nir/opt_load_store_vectorize: loop internally
|
||
- radv: improve vectorization callback for small bit sizes
|
||
- radv: only set robust_modes if robustBufferAccess2 is enabled
|
||
- radv: disable VK_FORMAT_R64_SFLOAT
|
||
- radv: cleanup LLVM implementation of vulkan_descriptor_index
|
||
- radv: implement vulkan_resource_reindex
|
||
- nir/lower_non_uniform: allow lowering with vec2 handles
|
||
- radv,aco: use nir_address_format_vec2_index_32bit_offset
|
||
- vulkan: fix use-after-free in vk_common_DestroyDebugReportCallbackEXT
|
||
- radv: fix use-after-free upon GS copy shader cache hits
|
||
- radv: fix possible use-after-free when inserting GS copy shader from cache
|
||
- radv,ac/llvm: use a dword alignment for descriptor loads
|
||
- aco: group loads from the same vertex binding into the same clause
|
||
- radv,aco: use per-attribute vertex descriptors for robustness
|
||
- Revert "radv,aco: don't use MUBUF for multi-channel loads on GFX8 with robustness2"
|
||
- radv,aco: compact vertex buffer descriptors
|
||
- ci: remove expected robustness2 fails for Renoir
|
||
- aco/ra: initialize temp_in_scc earlier
|
||
- aco/ra: fix get_reg_for_operand() with no free registers
|
||
- aco/ra: fix get_reg_for_operand() when the blocking var is a vector
|
||
- aco/ra: fix get_reg_for_operand() with vector operands
|
||
- aco/ra: use flags instead of booleans for update_renames()
|
||
- aco: disallow SGPRs on DPP instructions
|
||
- radv: don't allocate DCC predicate if the image doesn't use DCC
|
||
- radv: add radv_absolute_depth_bias
|
||
- radv: workaround incorrect depthBiasConstantFactor by Path of Exile
|
||
- radv: fix formatting of radv_dri_options
|
||
- radv: make attrib_end variable in radv_flush_vertex_descriptors 32-bit
|
||
- aco: do not clause NSA instructions
|
||
- aco/tests: add tests for form_hard_clauses()
|
||
- aco/tests: improve reporting of failed code checks
|
||
- aco: don't create 4 and 5 dword NSA instructions on GFX10
|
||
- aco: don't use nir_block_is_unreachable()
|
||
- nir/unsigned_upper_bound: don't require dominance metadata
|
||
- nir/algebraic: optimize extract of extract
|
||
- nir, nir/algebraic: add byte/word insertion instructions
|
||
- aco: disallow SDWA for instructions with 64-bit definitions/operands
|
||
- aco: add p_extract/p_insert
|
||
- aco: implement nir_op_extract/nir_op_insert
|
||
- aco: use byte/word extract pseudo-instructions
|
||
- ac/llvm: implement byte/word extract/insert instructions
|
||
- radv: use byte/word extract/insert instructions
|
||
- aco: optimize 32-bit extracts and inserts using SDWA
|
||
- aco: make validate_ir() output usable in tests
|
||
- aco: disallow literals with some instruction formats
|
||
- aco/tests: add tests for p_extract/p_insert lowering
|
||
- aco/tests: add SDWA tests
|
||
- aco: use v1b/v2b for ds_read_u8/ds_read_u16
|
||
- radv: improve LDS alignment check for load/store vectorization
|
||
- aco: don't ever widen 8/16-bit sgpr load_shared
|
||
- aco: use ds_read_{u8,u16}_d16
|
||
- aco: fix emit_mbcnt() with a VGPR mask
|
||
- radv: increase maxComputeSharedMemorySize
|
||
- nir/load_store_vectorizer: fix check_for_robustness() with indirect loads
|
||
- nir/opt_load_store_vectorize: check for restrict at the variable
|
||
- nir/opt_load_store_vectorize: only require one variable to be restrict
|
||
- nir: document that ACCESS_RESTRICT is not set at intrinsics
|
||
- radv,aco: use all attributes in a binding to obtain an alignment for fetch
|
||
- aco: adjust the condition for expanding vertex fetch data format
|
||
- aco/ra: use adjust_max_used_regs() in compact_relocate_vars()
|
||
- aco: don't move descriptor loads below buffer loads
|
||
- aco: move VMEM instructions below descriptor loads
|
||
- aco/lower_phis: fix undef_operands initialization with >32 predecessors
|
||
- aco/lower_phis: don't allocate unused temporary ids
|
||
- nir: use a single set during CSE
|
||
- nir/cse: resize the instruction set
|
||
- nir/propagate_invariant: add invariant_prim option
|
||
- radv: allow VK_FORMAT_R8G8_SRGB sampling
|
||
- nir/opt_load_store_vectorize: fix check_for_robustness() with deref access
|
||
- aco/tests: fix 32-bit build
|
||
- docs/envvars: fix RADV_TEX_ANISO
|
||
- aco: remove resource flags
|
||
- aco: handle NIR loops without breaks
|
||
- radv: enable VK_KHR_shader_subgroup_uniform_control_flow
|
||
- radv: don't ever convert num_records to bytes if it's zero
|
||
- radv: adjust num_records when offset>stride
|
||
- radv: use null vertex descriptor if num_records=0
|
||
- aco: don't create v_madmk_f32/v_madak_f32 from v_fma_legacy_f16
|
||
|
||
Rob Clark (157):
|
||
|
||
- freedreno: Add .clang-format
|
||
- freedreno: Some manual reformatting
|
||
- freedreno: Re-indent
|
||
- freedreno: Manual fixups
|
||
- freedreno: Add missing foreach macros and update indentation
|
||
- freedreno/drm: Re-indent
|
||
- freedreno/afuc: Re-indent
|
||
- freedreno/common: Re-indent
|
||
- freedreno/computerator: Re-indent
|
||
- freedreno/decode: Re-indent
|
||
- freedreno/drm-shim: Re-indent
|
||
- freedreno/ir2: Re-indent
|
||
- freedreno/perfcntrs: Re-indent
|
||
- freedreno/fdl: Re-indent
|
||
- ir3: handle 16b op_i2b1
|
||
- ci: Update kernel with a few freedreno related fixes
|
||
- ci: Add timeout for traces jobs
|
||
- freedreno: Small indent fix
|
||
- freedreno: Avoid staging blits with stencil on older gens
|
||
- freedreno: Make sure we actually flush if we need a fence
|
||
- freedreno: Add a couple debug traces
|
||
- freedreno: Allow resource shadowing for TC
|
||
- freedreno/drm: Move submit->primary to base class
|
||
- freedreno/drm: Cleanup bo allocation flags
|
||
- freedreno/drm: Cleanup bo cpu_prep flags
|
||
- freedreno/drm: Add FD_BO_PREP_FLUSH
|
||
- freedreno/drm: Move the growable array helper
|
||
- freedreno/drm: Add locked version fd_{bo,pipe}_del()
|
||
- freedreno/drm: Userspace fences
|
||
- freedreno/drm: Inline the fence-table
|
||
- freedreno/batch: Don't create fences for every batch
|
||
- freedreno: last_fence optimization for TC async flushes
|
||
- freedreno: Move fence struct to header
|
||
- freedreno: Drop unused create_fence() arg
|
||
- freedreno/drm: Reference count submits
|
||
- freedreno: Re-work fd_submit fence interface
|
||
- freedreno/drm: Add pipe tracking for deferred submits
|
||
- freedreno/drm/sp: Split submit prep and finish
|
||
- freedreno/drm/sp: Implement deferred submit merging
|
||
- freedreno: Avoid flushing deferred submits for u_trace
|
||
- freedreno/drm: fd_submit should hold ref to fd_pipe
|
||
- freedreno/drm: pipe should hold reference to device
|
||
- freedreno/drm: Async submit support
|
||
- freedreno/drm: Assume explicit fences if in_fence_fd
|
||
- freedreno/ci: Disable counterstrike trace on a306 for now
|
||
- freedreno/ci: Skip texsubmimage cube_map_array
|
||
- ci: Add DEQP_CASELIST_INV_FILTER
|
||
- freedreno/ci: Isolate dEQP-EGL reset_context tests
|
||
- freedreno: Remove samples-per-tex tracking
|
||
- freedreno/drm: Allow FD_BO_PREP_FLUSH without _NOSYNC
|
||
- freedreno: Flush resources harder
|
||
- freedreno/ci: Mark client_wait_sync_finish as flake
|
||
- freedreno/ci: Update piglit skips/fails
|
||
- freedreno/drm: Initialize control->fence
|
||
- freedreno: Fix TC last_fence optimization
|
||
- freedreno: Consolidate needs_flush and clearing last_fence
|
||
- freedreno/query/acc: Set needs_flush
|
||
- freedreno/tools: Fix async flush vs fdperf/computerator
|
||
- pps: Lower min sampling interval
|
||
- util/perfetto: Add one-time init
|
||
- freedreno: Add freedreno pps driver
|
||
- gallium/aux: Add perfetto support to u_trace
|
||
- freedreno/drm: Add support to query device suspend count
|
||
- freedreno/pps: Detect GPU suspend on newer kernels
|
||
- freedreno: Moar header C++-proofing
|
||
- freedreno: Add perfetto renderpass support
|
||
- pps: Add a more interesting cfg example
|
||
- docs/perfetto: Updates for freedreno and render-stages
|
||
- gallium/u_threaded: Add to_call() helper
|
||
- gallium/u_threaded: Add call logging
|
||
- freedreno/ir3: Don't force RTNE if rounding mode is undefined
|
||
- freedreno/a6xx: Add a few registers
|
||
- freedreno: Rename internal resource_busy
|
||
- freedreno: Implement TC resource_busy
|
||
- freedreno/tu+drm: Extract out pm4 pkt header helpers
|
||
- freedreno: Move pkt parsing helpers to common
|
||
- freedreno/afuc: Split out instruction decode helper
|
||
- freedreno/afuc: Split out utils
|
||
- freedreno/afuc: Clean up special regs
|
||
- freedreno/afuc: Add pipe reg name decoding
|
||
- freedreno/afuc: Add emulator mode to afuc-disasm
|
||
- freedreno/registers: Add a few a6xx regs and notes
|
||
- freedreno/afuc: Extract full gpu-id
|
||
- freedreno/afuc: Split out helpers to parse labels and packet-table
|
||
- freedreno/afuc: Add emulator support to run bootstrap
|
||
- freedreno/ci: Add real packet-table loading for afuc test
|
||
- freedreno/afuc: Use emulator to extract jmptbl
|
||
- freedreno/headergen2: Fix compile warnings with CP_DRAW_INDIRECT_MULTI
|
||
- freedreno/a6xx: Fix mh31 intermittent faults
|
||
- freedreno: Fix typo
|
||
- freedreno: Don't return a flushed batch
|
||
- egl: zero is a valid fd
|
||
- egl+libsync: Add check for valid fence-fd
|
||
- frontend/dri: Fix fence-fd logic
|
||
- freedreno/ir3: Fix use after free
|
||
- Revert "st/mesa: execute glFlush asynchronously if no image has been imported/exported"
|
||
- freedreno: Fix batch flush race condition
|
||
- freedreno: Fix fdperf flush
|
||
- gallium/u_threaded: Missing driver-thread marking
|
||
- freedreno: Add string-marker debug trace
|
||
- freedreno: Move assert
|
||
- freedreno: Add tid to DBG() msgs
|
||
- freedreno: Remove assert
|
||
- freedreno/registers: add A5XX_RBBM_STATUS3 bit
|
||
- freedreno: Add missing valid range tracking for SSBOs/images
|
||
- docs: Update freedreno features
|
||
- freedreno/ci: Sort a630 piglit xfails
|
||
- freedreno/a6xx: Fix r16_snorm blits
|
||
- freedreno/a6xx: Handle non-UBWC surface views
|
||
- freedreno/a6xx: Improve UBWC demotion logic
|
||
- freedreno: Drop obsolete comment
|
||
- freedreno: Don't try staging blit for non-renderable formats
|
||
- freedreno: Add debugging for blitter fallback recursion
|
||
- freedreno: Avoid recursive re-entry of u_blitter
|
||
- freedreno/a6xx: Handle R8G8 sharp edges in validate_format()
|
||
- freedreno/a6xx: Also validate format in blitter path
|
||
- freedreno: Flush batches on shadow/uncompress
|
||
- freedreno: Fallback to sw for copy_image with compressed
|
||
- freedreno: Fix flushes with NULL batch
|
||
- freedreno/blitter: Flush before self-blits
|
||
- freedreno/a6xx: Use UNORM for SNORM copy blits
|
||
- freedreno/a6xx: Handle u/snorm vs u/sint validation
|
||
- freedreno: Fix for multi-draw blits
|
||
- freedreno/a6xx: Flip on copy_image
|
||
- freedreno/a6xx: Skip nv_copy_image tests
|
||
- freedreno: Defer freeing batch->key
|
||
- freedreno/ci: Start longest traces first
|
||
- freedreno/ci: Increase # of jobs for CI runners
|
||
- freedreno/ci: Garbage collect some a630 flakes
|
||
- freedreno/a6xx: Handle fb_read in sysmem path
|
||
- freedreno: Flush if at risk of overflowing bos table
|
||
- turnip: Use drmIoctl()
|
||
- turnip: Fix AcquireImageANDROID() handle type
|
||
- turnip: Add CrOS Gralloc support
|
||
- nir: Add pass to lower phi precision
|
||
- freedreno+ir3: Enable INT16
|
||
- freedreno/a6xx: Fix framebuffer_barrier crash
|
||
- turnip: avoid some UB
|
||
- turnip: Split tu6_emit_xs()
|
||
- freedreno/computerator: Add script to probe FLUT values
|
||
- freedreno/ir3: Add float immed "FLUT" support
|
||
- freedreno: Rename \*_dev_info
|
||
- freedreno: Generate device-info tables at build time
|
||
- freedreno: Convert fd_dev_info to const pointer
|
||
- turnip: Convert fd_dev_info to const pointer
|
||
- freedreno/ir3: Get tess_use_shared from fd_dev_info
|
||
- freedreno/ir3: Get reg_size_vec4 from fd_dev_info
|
||
- turnip: Drop unused vshs_workgroup param
|
||
- turnip: Get storage_16bit from fd_dev_info
|
||
- turnip: Get indirect_draw_wfm_quirk from fd_dev_info
|
||
- turnip: Get has_tex_filter_cubic from fd_dev_info
|
||
- turnip: Get has_sample_locations from fd_dev_info
|
||
- freedreno+turnip: Add has_cp_reg_write
|
||
- freedreno+turnip: Add has_8bpp_ubwc
|
||
- freedreno+turnip: Get device name from device-info table
|
||
- freedreno+turnip: Add a6xx gen4 support
|
||
- freedreno/a6xx: Add missing PC_CCU_INVALIDATE_x
|
||
|
||
Robert Foss (1):
|
||
|
||
- freedreno/regs: add 5nm DSI PHY/PLL regs
|
||
|
||
Robert Tarasov (1):
|
||
|
||
- iris: Check data alignment for copy_mem_mem
|
||
|
||
Rohan Garg (8):
|
||
|
||
- i965: plumb device/driver UUID generators
|
||
- i965: Initial implementation for EXT_memory_object_*
|
||
- i965: Implement semaphore support for EXT_external_objects
|
||
- i965: Implement BufferDataMem
|
||
- i965: fix in fences backend for ext_external_objects edge case
|
||
- i965: Enable EXT_memory_object_* for Gen 7 and above
|
||
- docs: mark external memory and semaphore extensions done for i965
|
||
- ci: Don't artifact rendered images when job succeeds
|
||
|
||
Roland Scheidegger (1):
|
||
|
||
- llvmpipe: fix nir dot products (fsum op)
|
||
|
||
Roman Stratiienko (7):
|
||
|
||
- anv_android: Add missing type
|
||
- meson: egl: Do not build platform_drm for Android
|
||
- android: Add scripts to build using meson
|
||
- nouveau: Don't require RTTI and use it only when enabled
|
||
- egl: android: prepare code for adding more buffer_info getters
|
||
- egl: android: add IMapper@4 metadata API buffer_info getter
|
||
- AOSP: Do not add '-Wl,--gc-sections' to the linker arguments
|
||
|
||
Ryan Houdek (3):
|
||
|
||
- Default enable SSE2 on mesa builds.
|
||
- Switch u_format_test to passed on i386
|
||
- Update release notes with mention that x87 is no longer used on x86
|
||
|
||
Sagar Ghuge (16):
|
||
|
||
- anv: Set correct fast clear value for depth during blorp operation
|
||
- anv: Avoid corrupting indirect depth clear values
|
||
- anv: Query memory region info
|
||
- anv: Wrapper around I915_GEM_CREATE_EXT_MEMORY_REGIONS
|
||
- anv: Allocate BO in appropriate region
|
||
- anv: Allocate scratch and workaround BO in local memory
|
||
- intel/compiler: Define new LSC data port encodings
|
||
- intel/compiler: Add support for LSC fence operations
|
||
- intel/compiler: Add helpers for LSC message descriptors
|
||
- intel/disasm: Disassmeble LSC messages
|
||
- intel/disasm: Disassemble LSC message extended descriptors
|
||
- intel/fs: Lower untyped float atomic messages to LSC when available
|
||
- intel/fs: Lower Byte scattered r/w messages to LSC when available
|
||
- intel/fs: Lower A64 byte scattered r/w messages to LSC dataport
|
||
- intel/fs: Lower A64 atomic messages to LSC dataport
|
||
- intel/fs: Lower varying pull constant load message to LSC dataport
|
||
|
||
Samuel Iglesias Gonsálvez (13):
|
||
|
||
- turnip: move pipeline gras_su and rb{stencil,depth}_cntl_mask initialization
|
||
- turnip: initialize pipeline->rb_{stencil,depth}_cntl always
|
||
- turnip: refactor how LRZ state is calculated
|
||
- turnip/lrz: add support for VK_EXT_extended_dynamic_state
|
||
- turnip: document GRAS_LRZ_CNTL's UNK5 bitfield
|
||
- turnip/lrz: added support for depth bounds test enable
|
||
- turnip: fix typo in tu_CmdBeginRenderPass2()
|
||
- turnip: implement LRZ direction
|
||
- turnip: update LRZ state based on stencil test state
|
||
- turnip: group all geometry constant draw states in one
|
||
- turnip: fix setting dynamic state mask for VK_DYNAMIC_STATE_STENCIL_OP_EXT case
|
||
- turnip: add LRZ early-z support
|
||
- anv: do not dereference VkPipelineMultisampleStateCreateInfo always
|
||
|
||
Samuel Pitoiset (130):
|
||
|
||
- amd: drop support for LLVM 8
|
||
- radv: keep DCC compressed for clears on compute with image stores
|
||
- aco: fix opquantize2f16 on GFX6-7
|
||
- radv: fix fast clearing depth-only or stencil-only aspects with HTILE
|
||
- radv: fix emitting depth bias when beginning a command buffer
|
||
- radv: remove radv_image_iview::bo
|
||
- radv: remove radv_image_iview::multiplane_planes
|
||
- radv: allow concurrent MSAA images to be FMASK compressed
|
||
- radv: fix emitting default depth bounds state on GFX6
|
||
- radv/winsys: remove set but never used use_llvm
|
||
- radv: remove old comment about LLVM <= 8
|
||
- ac: move ac_lower_indirect_derefs() outside of the LLVM dir
|
||
- radv: cleanup LLVM related includes
|
||
- radv: remove RADV_DEBUG=nothreadllvm
|
||
- radv/winsys: fix allocating the number of CS in the sysmem path
|
||
- radv/winsys: fix resetting the number of padded IB words
|
||
- radv: make sure CP DMA is idle before executing secondary command buffers
|
||
- radv: remove warnings about RADV_PERFTEST=aco,llvm
|
||
- radv/llvm: implement the image load DCC bug
|
||
- radv: enable DCC stores with the LLVM backend
|
||
- radv: re-introduce missing skip list for Polaris10
|
||
- radv: fix various CMASK regressions on GFX9
|
||
- radv: add the provoking vertex mode to the pipeline/shader keys
|
||
- radv/llvm: adjust NGG if provoking vertex mode is last
|
||
- aco: adjust NGG if provoking vertex mode is last
|
||
- radv: implement VK_EXT_provoking_vertex
|
||
- radv: enable TC-compat CMASK on GFX8-9
|
||
- radv: fix computation of the number of user SGPRS for NGG GS state
|
||
- radv: check if DCC is enabled when resolving different levels
|
||
- radv: only keep concurrent MSAA images compressed if TC-compat CMASK
|
||
- radv/winsys: add GFX6_MAX_CS_SIZE instead of using a magic value
|
||
- radv/winsys: fix executing huge secondary command buffers on GFX6
|
||
- radv: implement RADV_FORCE_VRS for the LLVM backend
|
||
- util/math: change ROUND_DOWN_TO to return a uint64_t
|
||
- radv: adjust the computation of the total usage of memory used
|
||
- radv: expose 2/3rd of total memory as VRAM and 1/3rd as GTT on APUs
|
||
- radv: fix missing ITERATE_256 for D/S MSAA images that are TC-compat HTILE
|
||
- radv: declare VK_EXT_extended_dynamic_state2 but leave it disabled
|
||
- radv: declare new dynamic states for VK_EXT_extended_dynamic_state2
|
||
- radv: implement dynamic depth bias enable
|
||
- radv: implement dynamic primitive restart enable
|
||
- radv: implement dynamic rasterizer discard enable
|
||
- radv: advertise VK_EXT_extended_dynamic_state2
|
||
- radv: fix extending the dirty bits to 64-bit
|
||
- radv: dump the trap handler shader with RADV_DEBUG=metashaders
|
||
- nir/opt_access: fix getting variables in presence of similar bindings/desc
|
||
- radv: add missing entrypoints for VK_EXT_extended_dynamic_state2
|
||
- radv: enable DCC stores on RDNA2
|
||
- aco: fix derivatives/intrinsics with SGPR sources
|
||
- Revert "radv: Do not access set layout during vkCmdBindDescriptorSets."
|
||
- radv: fix heap indices when computing the budget
|
||
- ac: ac_gpu_info::has_vgt_flush_ngg_legacy_bug
|
||
- radv: fix fast clearing DCC if one level can't be compressed on GFX10+
|
||
- radv: simplify radv_pipeline_has_gs_copy_shader()
|
||
- radv: remove small overhead of radv_pipeline_has_ngg()
|
||
- radv: ignore dynamic blend constants if blend isn't enabled
|
||
- radv: remove an useless TODO for dynamic line width
|
||
- radv: pass an image range to radv_layout_dcc_compressed()
|
||
- radv: remove redundant call to radv_dcc_enabled()
|
||
- radv: only mark DCC as compressed when drawing if layout allows it
|
||
- radv: only init DCC if compressed in the HW resolve path
|
||
- radv: do not decompress DCC for partial resolves if stores are supported
|
||
- radv: use radv_dcc_enabled() for the FB mip flush workaround
|
||
- aco: fix emitting discard when the program just ends
|
||
- radv: stop reporting ACO from the device name
|
||
- radv: remove DFSM
|
||
- util/drirc: make engine_versions an optional field
|
||
- radv: add few new drirc options
|
||
- util/drirc: use application_name_match for the SotTR RADV workaround
|
||
- radv: move all game workarounds to drirc
|
||
- radv: fix missing default state for DB_DFSM_CONTROL
|
||
- radv: fix generating hang reports if mutable descriptors are used
|
||
- radv: enable RADV_DEBUG=invariantgeom for Monster Hunter World
|
||
- ac/rgp: mark SQTT_FILE_CHUNK_TYPE_ISA_DATABASE as deprecated
|
||
- ac/rgp: bump the SQTT file minor version to 5
|
||
- radv: enable RADV_DEBUG=invariantgeom for SotTR DX11/DX12 versions
|
||
- ac: import performance counters from RadeonSI
|
||
- ac: rename ac_dump_thread_trace() to ac_dump_rgp_capture()
|
||
- ac/rgp: fix ac_fill_sqtt_asic_info() name
|
||
- ac: add ac_thread_trace::data
|
||
- radv/winsys: allow to reserve a VMID
|
||
- radv: emit PA_SC_CONSERVATIVE_RASTERIZATION_CNTL only on GFX9+
|
||
- ac/debug: fix color printing PKT3 when count in header is too low
|
||
- aco: fix range checking for SSBO loads/stores with SGPR offset on GFX6-7
|
||
- radv: dump SPIR-V instead of using spirv-dis when generating a hang report
|
||
- aco: fix emitting literal offsets with SMEM on GFX7
|
||
- ci: update list of expected failures for Pitcairn/Oland (RADV)
|
||
- radv: do not launch an IB2 for secondary cmdbuf with INDIRECT_MULTI on GFX7
|
||
- radv/winsys: add a small comment explaining the CHAIN bit
|
||
- ci: add expected list of failures for Bonaire (RADV)
|
||
- radv: fix aligning the image offset by using align64()
|
||
- radv/winsys: adjust some error messages
|
||
- radv/winsys: remove useless errno.h includes
|
||
- radv: fix dynamic rasterizer discard enable state
|
||
- radv: reject binding buffer/image when the device memory is too small
|
||
- radv: always decompress both aspects of a depth/stencil image
|
||
- radv: create only one pipeline for decompressing depth/stencil images
|
||
- radv: fix dynamic culling and depth/stencil related dynamic states
|
||
- ac/perfcounters: remove ac_pc_block_base::num_prelude
|
||
- ac/perfcounters,radeonsi: rework performance counters layout
|
||
- ac/perfcounters: rename num_multi to num_spm_counters
|
||
- ac/perfcounters: add more SPM configuration fields
|
||
- ac/perfcounters: add a GPU block ID to every block definitions
|
||
- radv: implement dynamic logic op
|
||
- radv: advertise extendedDynamicState2LogicOp
|
||
- radv: fix RADV_FORCE_VRS for 2x1 and 1x2
|
||
- radv: fix fd leak in vkAcquireImageANDROID()
|
||
- radv: disable DCC for DOOM 2016 and Wolfenstein II
|
||
- radv: implement VK_EXT_color_write_enable
|
||
- radv: advertise VK_EXT_color_write_enable
|
||
- radv: add support for more HTILE clear codes
|
||
- radv: prevent fast clearing HTILE depth for unrestricted ranges
|
||
- radv: allow more fast clears for depth surfaces without TC-compat HTILE
|
||
- ci: update list of expected failures against CTS 1.2.6.2 for RADV
|
||
- ci: remove few CTS that are now skipped with RADV
|
||
- aco: fix emitting d16 for MIMG instructions on GFX9+
|
||
- aco: fix emitting a16 for MIMG instructions on GFX10+
|
||
- aco: fix shared_atomic_comp_swap if the second source isn't a VGPR
|
||
- radv: fix applying radv_disable_dcc for DOOM and Wolfenstein II
|
||
- aco: use nir_ssa_def_is_unused() to determine if atomic dest is used
|
||
- ac,radv: implement the cs_regalloc_hang HW bug workaround
|
||
- radv: fix applying radv_disable_dcc for DOOM 2016 again
|
||
- radv: lower primitive shading rate in NIR
|
||
- radv: only init the TC-compat ZRANGE metadata for the depth aspect
|
||
- radv: fix bounds checking for zero vertex stride on GFX6-7
|
||
- radv: report APUs as discrete GPUs for Red Dead Redemption 2
|
||
- radv: fix specifying the stencil layout for separate depth/stencil layouts
|
||
- radv: allow unused VkSpecializationMapEntries
|
||
- radv: fix selecting the first active CU when profiling with SQTT
|
||
- radv: fix missing cache flushes when clearing HTILE levels on GFX10+
|
||
|
||
Sergii Melikhov (1):
|
||
|
||
- util/format: Change the pointer offset.
|
||
|
||
Simon Ser (27):
|
||
|
||
- radeon/vcn: handle tiled buffers when decoding
|
||
- util/format: document block depth field
|
||
- ac/surface: use blocksizebits instead of blocksize
|
||
- radeonsi: stop special-casing YUV formats in si_query_dmabuf_modifiers
|
||
- ac/surface: allow non-DCC modifiers for YUV on GFX9+
|
||
- frontends/va: improve surface attribs processing
|
||
- gallium, va: add support for VASurfaceAttribDRMFormatModifiers
|
||
- radeonsi: implement pipe_context.create_video_buffer_with_modifiers
|
||
- radv: stop special-casing multi-planar formats in radv_get_modifier_flags
|
||
- dri: add createImageWithModifiers2 interface
|
||
- gallium/dri: implement createImageWithModifiers2
|
||
- i965: implement createImageWithModifiers2
|
||
- vulkan/wsi/wayland: simplify wl_surface version check
|
||
- docs/envvars: document MESA_VK_WSI_PRESENT_MODE
|
||
- radv: implement VK_EXT_physical_device_drm
|
||
- amd/addrlib: remove Meson debug message()
|
||
- vulkan/wsi: unify format logic in dmabuf_handle_modifier
|
||
- vulkan/wsi: prefer the Wayland linux-dmabuf protocol
|
||
- vulkan/wsi/wayland: remove swapchain wl_drm wrapper
|
||
- vulkan/wsi/wayland: remove unnecessary wl_proxy_set_queue call
|
||
- vulkan/wsi/wayland: fix wsi_wl_image_init error code
|
||
- vulkan/wsi/wayland: handle dmabuf params allocation failure
|
||
- etnaviv: fix renderonly check in etna_resource_alloc
|
||
- etnaviv: fail in get_handle(TYPE_KMS) without a scanout resource
|
||
- freedreno: fail in get_handle(TYPE_KMS) without a scanout resource
|
||
- panfrost: fail in get_handle(TYPE_KMS) without a scanout resource
|
||
- lima: fail in get_handle(TYPE_KMS) without a scanout resource
|
||
|
||
Simon Zeni (4):
|
||
|
||
- vulkan/wsi: add drm_fd param to wsi_display_get_connector
|
||
- vulkan/wsi: Implement VK_EXT_acquire_drm_display
|
||
- radv: Implement VK_EXT_acquire_drm_display
|
||
- anv: Implement VK_EXT_acquire_drm_display
|
||
|
||
Steve Pronovost (1):
|
||
|
||
- d3d12: Add mechanism for D3D12 Adapter Selection
|
||
|
||
Stéphane Marchesin (1):
|
||
|
||
- virgl: resources without any binding can be cached
|
||
|
||
SureshGuttula (3):
|
||
|
||
- frontends/va/picture:Fix wrong reallocation even surface is protected
|
||
- frontends/va: Derive image from interlaced buffers for h26[4/5]encode
|
||
- radeon/vcn: calc_dpb_size should be based on dpb_type
|
||
|
||
Tapani Pälli (21):
|
||
|
||
- anv: do not support image export with stencil aspect set
|
||
- glx: fix compilation error when function name not found
|
||
- glsl: ignore interface precision qualifier on desktop GL
|
||
- glx: revert "Downgrade sRGB-ful fbconfigs"
|
||
- i965: support only color formats with memory objects
|
||
- nir: skip assert check with empty structs
|
||
- isl: require hiz for depth surface in isl_surf_get_ccs_surf
|
||
- anv: require rendering support for blit destination feature
|
||
- mesa: fix error set for glCompressedTexSubImage calls
|
||
- gitlab-ci: enable building of Vulkan tests in Piglit
|
||
- anv: introduce new dynamic states
|
||
- anv: support rasterizer discard dynamic state
|
||
- anv: support depth bias enable dynamic state
|
||
- anv: support primitive restart enable dynamic state
|
||
- anv: centralize vk_to_intel_logic_op array
|
||
- anv: support blending logic op dynamic state
|
||
- anv: toggle on VK_EXT_extended_dynamic_state2
|
||
- docs: add VK_EXT_extended_dynamic_state2 features.txt entry
|
||
- anv: provide dummy vkCmdSetPatchControlPointsEXT
|
||
- iris: take a reference to memobj bo in iris_resource_from_memobj
|
||
- anv: fix emitting dynamic primitive topology
|
||
|
||
Thomas H.P. Andersen (9):
|
||
|
||
- nir: return progress from nir_lower_packing
|
||
- nir/lower_packing: use shader_instructions_pass
|
||
- anv: remove dead code
|
||
- nir/ifind_msb_rev: fix input check
|
||
- zink: remove initialization override
|
||
- lavapipe: remove initialization override
|
||
- broadcom/compiler: use correct flag enum
|
||
- broadcom/compiler: fix add vs. mul
|
||
- nine: Fix assert in tx_src_param
|
||
|
||
Thong Thai (1):
|
||
|
||
- radeon/vcn/enc: Add missing line to HEVC SPS header code
|
||
|
||
Timothy Arceri (23):
|
||
|
||
- mesa: fix incomplete GL_NV_half_float implementation
|
||
- mesa: make _mesa_find_temp_intervals() a static function
|
||
- mesa: fix _mesa_add{_typed}_unnamed_constant() declarations
|
||
- mesa: fix _mesa_add_state_reference() declaration mismatch
|
||
- mesa: fix glShaderSource() error handling
|
||
- util: disable glthread in CSGO
|
||
- glsl: create validate_component_layout_for_type() helper
|
||
- glsl: add missing support for explicit components in interface blocks
|
||
- nir/lower_io_to_vector: fix per vertex io handling for arrays
|
||
- Revert "util: disable glthread in CSGO"
|
||
- util: add work around for the game We Happy Few
|
||
- util/tests: initialise key in cache_test
|
||
- mesa: don't crash on incorrect texture use
|
||
- i965: don't crash on incorrect texture use
|
||
- glsl: force_glsl_version to shaders with no defined version
|
||
- util/driconf: add new ignore_write_to_readonly_var workaround
|
||
- util: add some workarounds for the game Luna Sky
|
||
- util/disk_cache: delete more cache items in one go when full
|
||
- util/radeonsi: add radeonsi workaround for Nuclear Throne
|
||
- glsl: replace some C++ code with C
|
||
- util: add workaround for Full Bore
|
||
- glsl: relax rule on varying matching for shaders older than 4.20
|
||
- intel/compiler: make sure swizzle is applied to if condition
|
||
|
||
Timur Kristóf (74):
|
||
|
||
- aco: Mark VCC clobbered for iadd8 and iadd16 reductions on GFX6-7.
|
||
- radv: Ignore GS copy shader when determining NGG GS wave size.
|
||
- radv: Properly enable Wave32 mode for NGG GS.
|
||
- nir: Support upper bound of subgroup_id/num_subgroups for non-compute.
|
||
- nir: Support upper bound of unsigned bit size conversions.
|
||
- nir: Allow load_primitive_id in VS in nir_divergence_analysis.
|
||
- nir: Add AMD specific intrinsics for merged shaders and NGG.
|
||
- aco: Allow workgroup barrier and shared scope for NGG shaders.
|
||
- aco: Fixup the NIR metadata after sanitize_cf_list.
|
||
- aco: Split ngg_emit_sendmsg_gs_alloc_req from the wave0 check.
|
||
- radv: Fill shader info earlier.
|
||
- radv: Gather NGG info sooner.
|
||
- aco: Implement new NGG specific NIR intrinsics.
|
||
- ac: Add new NIR pass to lower NGG VS/TES.
|
||
- radv: Use new NGG NIR lowering for VS/TES when ACO is used.
|
||
- ac: Add NIR lowering for NGG GS.
|
||
- radv: Use new NIR lowering of NGG GS when ACO is used.
|
||
- aco: Determine whether a few more instructions need exec.
|
||
- aco: Use Operand instead of Temp for the exec mask stack.
|
||
- aco: Remember when exec mask is const, and restore the const then.
|
||
- aco: Don't use s_and_saveexec with branches when exec is constant.
|
||
- aco: Refactor SSA elimination phi info to use vector instead of map.
|
||
- aco: Eliminate useless exec writes in jump threading.
|
||
- aco/insert_exec_mask: Fixed unused variable warning in release build.
|
||
- aco/util: Initialize IDSet::bits_set to zero.
|
||
- gallium/tessellator: Fix uninitialized variable warnings.
|
||
- anv: Fix unused function warnings for memory range checks.
|
||
- gallivm: Fix a few uninitialized variable warnings.
|
||
- nine: Fix uninitialized warning in texture9.c
|
||
- radv/cmd_buffer: Fix warning by initializing instance count.
|
||
- aco: Don't eliminate exec write when it's used by a copy later.
|
||
- aco: Don't DCE instructions that write non-temps, eg. exec.
|
||
- aco: Add Operand(Temp, PhysReg) constructor.
|
||
- aco: New writeout overloads for the test framework.
|
||
- aco: Introduce a new, post-RA optimizer.
|
||
- aco: Use s_cbranch_vccz/nz in post-RA optimization.
|
||
- aco: Eliminate SALU comparison when SCC can be used instead.
|
||
- radv: Remove duplicate code for getting GS info.
|
||
- radv: Don't generate GS copy shader when the pipeline has NGG.
|
||
- radv: Assert that there is no GS copy shader when the pipeline has NGG.
|
||
- aco: Add note about v_alignbyte in the ISA README.
|
||
- nir: Add nir_op_sad_u8x4 which corresponds to AMD's v_sad_u8.
|
||
- aco: Implement nir_op_sad_u8x4.
|
||
- aco: Add validation for v_permlane instructions.
|
||
- nir: Add AMD-specific byte and lane permute intrinsics.
|
||
- aco: Implement byte and lane permute intrinsics.
|
||
- aco: Keep VGPR destinations for uniform shared loads when beneficial.
|
||
- ac/nir: Refactor and optimize the repacking sequence.
|
||
- amd: Add extra source to the mbcnt_amd NIR intrinsic.
|
||
- aco: Use as_vgpr for the second source of mbcnt_amd.
|
||
- ac/nir: Update TCS output barriers with nir_var_mem_shared.
|
||
- aco: Fix checking if load_shared is used by cross lane instructions.
|
||
- radv/llvm: Emit s_barrier at the beginning of NGG non-GS shaders.
|
||
- aco/gfx10: NGG zero output workaround for conservative rasterization.
|
||
- aco/gfx10: Emit barrier at the start of NGG VS and TES.
|
||
- radv: Add last_vgt_api_stage and use it to simplify some code.
|
||
- radv: Move radv_optimize_nir_algebraic to a separate function.
|
||
- radv: Allow enabling vertex grouping, fix NGG info with it disabled.
|
||
- radv: Set parameter cache oversubscription according to the PC lines.
|
||
- nir: Add AMD specific intrinsics for NGG shader based culling.
|
||
- ac/nir: Add a NIR port of ac_llvm_cull.
|
||
- ac/nir: Use a ballot that matches the wave size during NGG lowering.
|
||
- ac/nir: Implement NGG deferred attribute culling in NIR.
|
||
- radv: Expose radv_get_viewport_xform in radv_private.h
|
||
- radv: New shader args for NGG culling settings and viewport.
|
||
- aco: Implement NGG culling related intrinsics.
|
||
- radv: Support NGG culling with new perftest environment variable.
|
||
- radv: Run algebraic optimizations before NGG lowering.
|
||
- ac/nir: Reuse the repacked output positions of culling shaders.
|
||
- ac/nir: Analyze culling shaders to remember which inputs are used when.
|
||
- ac/nir: Reuse uniforms from top part of culling shaders.
|
||
- radv, aco, ac/nir: Tweak position export scheduling for NGG culling.
|
||
- radv: Don't compile NGG culling into shaders that write viewport index.
|
||
- radv: Remove num_viewports from radv_skip_ngg_culling.
|
||
|
||
Tomeu Vizoso (51):
|
||
|
||
- ci: Reenable radeonsi jobs, and extend coverage
|
||
- ci/lava: Build all piglit profiles in LAVA images
|
||
- ci/lava: Update kernel for LAVA to 5.11
|
||
- ci/lava: Start Xorg on request, for Piglit
|
||
- ci: Test RadeonSI with piglit's quick_gl
|
||
- ci: Use a single kernel+rootfs for both baremetal and LAVA jobs
|
||
- ci: Drop hack to disable all modules from defconfig
|
||
- ci/radeonsi: Add expected failures due to #4674 having slipped in
|
||
- panfrost/ci: Enable some dEQP 3.1 tests on Mali T860
|
||
- Revert "CI: Disable Panfrost and radeonsi"
|
||
- panfrost: Don't access members of NULL pointers
|
||
- pan/midgard: Don't emit zero padding
|
||
- ci: Remove the need for an empty Piglit results file
|
||
- Revert "CI: Disable all Panfrost/AMD/Iris automatic jobs"
|
||
- ci: Update kernel to v5.13-rc2
|
||
- panfrost/ci: Test Panfrost on the Mali G72 GPU
|
||
- panfrost/ci: Add one more flake test for G72
|
||
- radv/ci: Test on Stoney on CI
|
||
- ci/lava: Add caching proxies for trace downloads
|
||
- ci/piglit: Use wget instead of ci-fairy to check a file exists
|
||
- ci: Configure DUTs for max performance
|
||
- ci: Uprev piglit to eee7d89611cf "tests: Replay profile frame times"
|
||
- ci: Uprev apitrace to 170424754bb4 "retrace: Get --loop to work without rewinding"
|
||
- radeonsi/ci: Add new Piglit failures
|
||
- ci/freedreno: Add depth32f_stencil8 flakes
|
||
- ci/zink: Add nearest_linear_mirror_l8_pot flake
|
||
- ci/freedreno: Fix name of flake
|
||
- ci/freedreno: Add new flake after "ci: Configure DUTs for max performance"
|
||
- ci/freedreno: Add spec@arb_copy_buffer@dlist flake on a530
|
||
- Partial revert of "ci: Add a manual job for tracking the performance of Freedreno"
|
||
- ci/freedreno: Skip Portal 2 trace on a630, due to flakiness
|
||
- Revert "ci/freedreno: Skip Portal 2 trace on a630, due to flakiness"
|
||
- ci/lava: Disable CPU frequency scaling
|
||
- ci/lava: Switch LAVA jobs to x86 runners
|
||
- ci: Disable windows builds due to runner not being available
|
||
- ci: Build Crosvm in our container
|
||
- ci: Move Kernel build tasks into its own file
|
||
- ci: Store the credentials in /tmp
|
||
- ci: Run tests inside Crosvm
|
||
- iris/ci: Update the checksums for the pixmark-piano trace
|
||
- panfrost/ci: Add some failures that crept in
|
||
- ci/lava: Improve error reporting in lava_job_submitter.py
|
||
- ci/lava: Don't overwrite PIGLIT_REPLAY_EXTRA_ARGS
|
||
- Revert "ci: Disable the iris APL jobs"
|
||
- ci/bare-metal: Add parens around shell command
|
||
- panfrost: Fork pan_pool for Gallium and Vulkan
|
||
- panvk: Add VkCommandPool support
|
||
- panvk: Support calls to CreateDescriptorSetLayout without bindings
|
||
- panvk: Make panvk_queue_transfer_sync more generic
|
||
- panfrost: Specify alignment for the Job Header descriptor
|
||
- panvk: Add vkEvents support
|
||
|
||
Tony Wasserka (18):
|
||
|
||
- radv: Remove assert about pDepthStencilState
|
||
- aco/spill: Fix improper handling of exec phis
|
||
- aco/scheduler: Fix register demand computation for downwards moves
|
||
- aco/scheduler: Fix register demand computation for upwards moves
|
||
- aco/scheduler: Verify register demand invariants in debug mode
|
||
- util: add support for defining bitwise operators on strongly typed enums
|
||
- util: tune signatures of generated enum operators
|
||
- aco/scheduler: Clean up register demand tracking
|
||
- aco/scheduler: Move cursor handling state to dedicated interfaces
|
||
- aco/ra: Fix off-by-one-error in print_regs
|
||
- aco/ra: Clean up print_regs output and support byte-allocated variables
|
||
- aco/ra: Split print_regs by lines of 64 registers
|
||
- aco: Replace Operand literal constructors with factory member functions
|
||
- aco: Remove use of deprecated Operand constructors in test_to_hw_instr.cpp
|
||
- aco: Remove use of deprecated Operand constructors in aco_builder.h
|
||
- aco: Remove use of deprecated Operand constructors
|
||
- aco: Clean up unneeded literal casts
|
||
- aco: Remove deprecated Operand constructors
|
||
|
||
Vasily Khoruzhick (3):
|
||
|
||
- lima: switch resource to linear layout if there's to many full updates
|
||
- lima: implement alpha test
|
||
- lima: handle fp16 vertex formats
|
||
|
||
Ville Syrjälä (2):
|
||
|
||
- i915: Implement __DRI_IMAGE_ATTRIB_OFFSET query
|
||
- i915: Implement __DRI2_FLUSH version 4
|
||
|
||
Vinson Lee (17):
|
||
|
||
- clover: Add constructor for constant_argument.
|
||
- glx: Fix macOS build.
|
||
- nv50/ir: Initialize Graph::Node member tag.
|
||
- nvc0: Remove unnecessary bsp_bo NULL check.
|
||
- nv50/ir: Initialize BuildUtil member tail.
|
||
- nv50/ir: Initialize CodeEmitterNV50 member progType.
|
||
- nv50/ir: Initialize GCRA::RIG_Node members.
|
||
- nvc0/ir: Initialize CodeEmitterGK110 member progType in constructor.
|
||
- nv50/ir: Add ConstantFolding constructor.
|
||
- travis: Download XQuartz from GitHub.
|
||
- v3dv: Fix assert.
|
||
- nvc0/ir: Initialize CodeEmitterNVC0 member progType in constructor.
|
||
- intel/vec4: Add missing break statement.
|
||
- nvc0/ir: Initialize Limits members in constructor.
|
||
- asahi: Fix macOS macro.
|
||
- st/xa: Mark default xa_get_pipe_format case unreachable.
|
||
- asahi: Move assignment after null check.
|
||
|
||
Yevhenii Kolesnikov (3):
|
||
|
||
- intel: fix leaking memory on shader creation
|
||
- glsl: Add operator for .length() method on implicitly-sized arrays
|
||
- glsl: Properly handle .length() of an unsized array
|
||
|
||
Yiwei Zhang (79):
|
||
|
||
- venus: update venus-protocol headers
|
||
- venus: implement dma_buf fd import and properties query
|
||
- venus: cap api version to 1.1 for Android
|
||
- venus: fix virtgpu_bo_init_dmabuf for classic resource
|
||
- venus: close the import memory fd on success
|
||
- venus: force a roundtrip after vn_renderer_bo_create_dmabuf
|
||
- venus: set bo->size to 0 for classic resource
|
||
- venus: update venus-protocol headers
|
||
- venus: implement VK_ANDROID_native_buffer v7
|
||
- venus: use VK_EXT_image_drm_format_modifier
|
||
- venus: update venus-protocol headers
|
||
- venus: enable VK_EXT_queue_family_foreign
|
||
- venus: handle VK_IMAGE_LAYOUT_PRESENT_SRC_KHR transfer
|
||
- venus: handle wsi image queue ownership transfer for Android
|
||
- venus: query extended resource info from gralloc
|
||
- venus: populate VK_ERROR_OUT_OF_HOST_MEMORY if applied
|
||
- virgl: do not use winsys info for guest storage of classic resource
|
||
- venus: fix vkEnumeratePhysicalDeviceGroups
|
||
- venus: stop advertising KHR_driver_properties for Android
|
||
- venus: clean up vn_android api names
|
||
- venus: add AHB format and VkFormat conversion helper functions
|
||
- venus: add vn_android_get_ahb_usage helper function
|
||
- venus: add ahb image and buffer properties query support
|
||
- venus: vn_GetAndroidHardwareBufferPropertiesANDROID (part 1/2)
|
||
- venus: vn_GetAndroidHardwareBufferPropertiesANDROID (part 2/2)
|
||
- anv: fix AHB leak upon exportable allocation
|
||
- radv: fix AHB leak upon exportable allocation
|
||
- gallium/st: add a back buffer fallback for front rendering
|
||
- gallium/dri: implement EGL_KHR_mutable_render_buffer
|
||
- egl/android: check front rendering support for cros gralloc
|
||
- venus: tiny refactor of vn_android_get_gralloc_buffer_info
|
||
- venus: complete the format conversion between AHB and Vulkan
|
||
- venus: fix vn_GetAndroidHardwareBufferPropertiesANDROID
|
||
- venus: fix AHB image format properties query
|
||
- venus: prepare image creation helpers for AHB
|
||
- venus: implement image creation for ahb handle type
|
||
- venus: refactor device memory fd import
|
||
- venus: implement AHB allocation and import (part 1/2)
|
||
- venus: implement AHB allocation and import (part 2/2)
|
||
- venus: implement vn_GetMemoryAndroidHardwareBufferANDROID
|
||
- venus: support AHB external format for sampler YCbCr conversion
|
||
- venus: advertise VK_ANDROID_external_memory_android_hardware_buffer
|
||
- venus: rename dmabuf to dma_buf when it represents a type
|
||
- venus: fix misaligned bo_flags between import and query
|
||
- venus: refactor for property query of dma_buf fd
|
||
- venus: fix mismatched bo mmap_size for export and multiple imports
|
||
- venus: initial AHB support for multi-planar format
|
||
- venus: update to the latest venus protocol
|
||
- venus: support AHB prop query with host dma_buf size
|
||
- venus: refactor gralloc buffer and drm modifier properties query
|
||
- venus: unify VkNativeBufferANDROID and AHardwareBuffer image create info
|
||
- venus: forward the host renderer hardware info
|
||
- egl/android: fix cached buffer slots for EGL Android winsys
|
||
- egl/android: refactor to use the legit vndk/window.h header
|
||
- vulkan: fix back compat with Android Oreo and below
|
||
- egl/android: add aosp_nougat system/window.h back for back compat
|
||
- virgl: forward the host renderer hardware info
|
||
- anv: fix Android WSI VkFence
|
||
- venus: silence a build warning
|
||
- venus: refactor vn_AcquireImageANDROID with globalFencing
|
||
- venus: moves GPU rendering off CPU timeline for Android WSI
|
||
- venus: add debug info for experimental features during init
|
||
- radv: fix build errors after commit 8b7ff784
|
||
- anv: fix build errors after commit 8b7ff78
|
||
- venus: remove workarounds for multi-planar format interop
|
||
- anv: fix some log formats
|
||
- anv: support multi-planar format in add_all_surfaces_explicit_layout
|
||
- anv: enable multi-planar support for drm format modifier
|
||
- venus: properly support GPU_DATA_BUFFER for AHB
|
||
- venus: use the mesa "drm-uapi/drm_fourcc.h" header
|
||
- venus: remove unsupported AHB formats
|
||
- venus: resolve AHB external format with DRM format
|
||
- venus: add more logs for Android WSI debugging
|
||
- venus: prepare vn_CreateBuffer for AHB
|
||
- venus: handle ahb backed VkBuffer creation properly
|
||
- venus: fix AHB VkBuffer memory requirement
|
||
- egl/android: only apply front rendering usage in shared buffer mode
|
||
- egl/android: restore image creation fallback path used by virgl
|
||
- venus: cache ahb backed buffer memory type bits requirement
|
||
|
||
Yogesh Mohanmarimuthu (4):
|
||
|
||
- radv: set RADEON_FLAG_GTT_WC flag for prime memory
|
||
- glx: Keep display fd open for prime
|
||
- glx: create DRI screen for display GPU incase of prime
|
||
- loader: allocate VRAM in display GPU in case of prime
|
||
|
||
Yurii Kolesnykov (1):
|
||
|
||
- c_std=c11 in meson default_options
|
||
|
||
Zhaofeng Li (1):
|
||
|
||
- Add default driver selections for RISC-V
|
||
|
||
Zhu Yuliang (1):
|
||
|
||
- gallium/vl: don't leak fd in vl_dri3_screen_create
|
||
|
||
Zoltán Böszörményi (2):
|
||
|
||
- crocus: Add pipe loader driver
|
||
- crocus: Make the driver loader use PCI IDs for crocus
|
||
|
||
cheyang (1):
|
||
|
||
- virgl:Fix the leak of hw_res used as fence
|
||
|
||
luc (1):
|
||
|
||
- panfrost: Only clear existing color buffers
|