mesa/docs/relnotes/19.1.0.rst


Mesa 19.1.0 Release Notes / June 11, 2019
=========================================

Mesa 19.1.0 is a new development release. People who are concerned with
stability and reliability should stick with a previous release or wait
for Mesa 19.1.1.

Mesa 19.1.0 implements the OpenGL 4.5 API, but the version reported by
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being
used. Some drivers don't support all the features required in OpenGL
4.5. OpenGL 4.5 is **only** available if requested at context creation.
Compatibility contexts may report a lower version depending on each
driver.

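
Because the reported version varies by driver and context type, an application may want to compare it against the version it needs. The following is a minimal sketch, assuming the GL_VERSION string has already been obtained from a live context; the helper names and the sample strings are illustrative and not part of Mesa.

```python
import re


def parse_gl_version(version_string):
    """Return (major, minor) from a GL_VERSION-style string.

    The string starts with "<major>.<minor>", optionally preceded by a
    prefix such as "OpenGL ES", so the first "N.M" pair is used.
    """
    match = re.search(r"(\d+)\.(\d+)", version_string)
    if match is None:
        raise ValueError(f"no version number in {version_string!r}")
    return int(match.group(1)), int(match.group(2))


def supports(version_string, required=(4, 5)):
    """True if the reported version is at least `required`."""
    return parse_gl_version(version_string) >= required


# Illustrative strings only; real values depend on the driver and context.
print(supports("4.5 (Core Profile) Mesa 19.1.0"))
print(supports("3.0 Mesa 19.1.0"))
```

Comparing tuples avoids the pitfalls of comparing version strings lexically, for example "4.10" versus "4.5".
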
SHA256 checksums
----------------

::

   2a6c3af3a803389183168e449c536304cf03e0f82c4c9333077933543b9d02f3 mesa-19.1.0.tar.xz

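
A downloaded tarball can be checked against the checksum above before building. This is a minimal sketch using Python's standard ``hashlib``; the function names are hypothetical, and the file is assumed to sit in the current directory under the name listed above.

```python
import hashlib

# Checksum published in these release notes for mesa-19.1.0.tar.xz.
EXPECTED = "2a6c3af3a803389183168e449c536304cf03e0f82c4c9333077933543b9d02f3"


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash the file in chunks so large tarballs are not read into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED):
    """True if the file's SHA256 digest matches the expected hex string."""
    return sha256_of_file(path) == expected.lower()
```

For example, ``verify("mesa-19.1.0.tar.xz")`` returns ``True`` only when the local file matches the published digest.
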
New features
------------

- GL_ARB_parallel_shader_compile on all drivers.
- GL_EXT_gpu_shader4 on all GL 3.1 drivers.
- GL_EXT_shader_image_load_formatted on radeonsi.
- GL_EXT_texture_buffer_object on all GL 3.1 drivers.
- GL_EXT_texture_compression_s3tc_srgb on Gallium drivers and i965 (ES
  extension).
- GL_NV_compute_shader_derivatives on iris and i965.
- GL_KHR_parallel_shader_compile on all drivers.
- VK_EXT_buffer_device_address on Intel and RADV.
- VK_EXT_depth_clip_enable on Intel and RADV.
- VK_KHR_ycbcr_image_arrays on Intel.
- VK_EXT_inline_uniform_block on Intel and RADV.
- VK_EXT_external_memory_host on Intel.
- VK_EXT_host_query_reset on Intel and RADV.
- VK_KHR_surface_protected_capabilities on Intel and RADV.
- VK_EXT_pipeline_creation_feedback on Intel and RADV.
- VK_KHR_8bit_storage on RADV.
- VK_AMD_gpu_shader_int16 on RADV.
- VK_AMD_gpu_shader_half_float on RADV.
- VK_NV_compute_shader_derivatives on Intel.
- VK_KHR_shader_float16_int8 on Intel and RADV (RADV only supports
  int8).
- VK_KHR_shader_atomic_int64 on Intel.
- VK_EXT_descriptor_indexing on Intel.
- GL_INTEL_conservative_rasterization on iris.
- VK_EXT_memory_budget on Intel.
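
Since several of the extensions listed above are only available on some drivers, applications usually test for them at run time. The sketch below assumes a space-separated extension string, as returned by glGetString(GL_EXTENSIONS) in a compatibility context; the helper name and the sample string are illustrative.

```python
def has_extension(extension_string, name):
    """True if `name` is a whole token in a space-separated extension list.

    Splitting first avoids false positives from substring matches, such as
    a name that is merely a prefix of a longer extension name.
    """
    return name in extension_string.split()


# Illustrative string only; a real one comes from the driver.
sample = "GL_ARB_parallel_shader_compile GL_EXT_gpu_shader4 GL_EXT_texture_buffer_object"
print(has_extension(sample, "GL_EXT_gpu_shader4"))
print(has_extension(sample, "GL_EXT_gpu_shader"))
```

In core profile contexts the names are queried one at a time with glGetStringi, and Vulkan extension names come from vkEnumerateDeviceExtensionProperties, so there the list would be built from those calls rather than split from a single string.
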

Bug fixes
---------

- `Bug 81843 <https://bugs.freedesktop.org/show_bug.cgi?id=81843>`__ -
[SNB IVB HSW] ETC2 textures are not returned as compressed images
- `Bug 99781 <https://bugs.freedesktop.org/show_bug.cgi?id=99781>`__ -
Some Unity games fail assertion on startup in
glXCreateContextAttribsARB
- `Bug 100239 <https://bugs.freedesktop.org/show_bug.cgi?id=100239>`__
- Incorrect rendering in CS:GO
- `Bug 100316 <https://bugs.freedesktop.org/show_bug.cgi?id=100316>`__
- Linking GLSL 1.30 shaders with invariant and deprecated variables
triggers an 'mismatching invariant qualifiers' error
- `Bug 104272 <https://bugs.freedesktop.org/show_bug.cgi?id=104272>`__
- [OpenGL CTS] [HSW]
KHR-GL46.direct_state_access.textures_compressed_subimage assert
fails
- `Bug 104355 <https://bugs.freedesktop.org/show_bug.cgi?id=104355>`__
- Ivy Bridge ignores component mappings in texture views
- `Bug 104602 <https://bugs.freedesktop.org/show_bug.cgi?id=104602>`__
- [apitrace] Graphical artifacts in Civilization VI on RX Vega
- `Bug 107052 <https://bugs.freedesktop.org/show_bug.cgi?id=107052>`__
- [Regression][bisected]. Crookz - The Big Heist Demo can't be
launched despite the "true" flag in "drirc"
- `Bug 107505 <https://bugs.freedesktop.org/show_bug.cgi?id=107505>`__
- [lars]
dEQP-GLES31.functional.geometry_shading.layered#render_with_default_layer_3d
failure
- `Bug 107510 <https://bugs.freedesktop.org/show_bug.cgi?id=107510>`__
- [GEN8+] up to 10% perf drop on several 3D benchmarks
- `Bug 107563 <https://bugs.freedesktop.org/show_bug.cgi?id=107563>`__
- [RADV] Broken rendering in Unity demos
- `Bug 107987 <https://bugs.freedesktop.org/show_bug.cgi?id=107987>`__
- [Debug mesa only]. Crash happens when calling drawArrays
- `Bug 108250 <https://bugs.freedesktop.org/show_bug.cgi?id=108250>`__
- [GLSL] layout-location-struct.shader_test fails to link
- `Bug 108457 <https://bugs.freedesktop.org/show_bug.cgi?id=108457>`__
- [OpenGL CTS]
KHR-GL46.tessellation_shader.single.xfb_captures_data_from_correct_stage
fails
- `Bug 108540 <https://bugs.freedesktop.org/show_bug.cgi?id=108540>`__
- vkAcquireNextImageKHR blocks when timeout=0 in Wayland
- `Bug 108766 <https://bugs.freedesktop.org/show_bug.cgi?id=108766>`__
- Mesa built with meson has RPATH entries
- `Bug 108824 <https://bugs.freedesktop.org/show_bug.cgi?id=108824>`__
- Invalid handling when GL buffer is bound on one context and
invalidated on another
- `Bug 108841 <https://bugs.freedesktop.org/show_bug.cgi?id=108841>`__
- [RADV] SPIRV's control flow attributes do not propagate to LLVM
- `Bug 108879 <https://bugs.freedesktop.org/show_bug.cgi?id=108879>`__
- [CIK] [regression] All opencl apps hangs indefinitely in
si_create_context
- `Bug 108999 <https://bugs.freedesktop.org/show_bug.cgi?id=108999>`__
- Calculating the scissors fields when the y is flipped (0 on top)
can generate negative numbers that will cause assertion failure later
on.
- `Bug 109057 <https://bugs.freedesktop.org/show_bug.cgi?id=109057>`__
- texelFetch from GL_TEXTURE_2D_MULTISAMPLE with integer format fails
- `Bug 109107 <https://bugs.freedesktop.org/show_bug.cgi?id=109107>`__
- gallium/st/va: change va max_profiles when using Radeon VCN
Hardware
- `Bug 109216 <https://bugs.freedesktop.org/show_bug.cgi?id=109216>`__
- 4-27% performance drop in Vulkan benchmarks
- `Bug 109326 <https://bugs.freedesktop.org/show_bug.cgi?id=109326>`__
- mesa: Meson configuration summary should be printed
- `Bug 109328 <https://bugs.freedesktop.org/show_bug.cgi?id=109328>`__
- [BSW BXT GLK] dEQP-VK.subgroups.arithmetic.subgroup regressions
- `Bug 109391 <https://bugs.freedesktop.org/show_bug.cgi?id=109391>`__
- LTO Build fails
- `Bug 109401 <https://bugs.freedesktop.org/show_bug.cgi?id=109401>`__
- [DXVK] Project Cars rendering problems
- `Bug 109404 <https://bugs.freedesktop.org/show_bug.cgi?id=109404>`__
- [ANV] The Witcher 3 shadows flickering
- `Bug 109443 <https://bugs.freedesktop.org/show_bug.cgi?id=109443>`__
- Build failure with MSVC when using Scons >= 3.0.2
- `Bug 109451 <https://bugs.freedesktop.org/show_bug.cgi?id=109451>`__
- [IVB,SNB] LINE_STRIPs following a TRIANGLE_FAN fail to use
primitive restart
- `Bug 109543 <https://bugs.freedesktop.org/show_bug.cgi?id=109543>`__
- After upgrade mesa to 19.0.0~rc1 all vulkan based application stop
working ["vulkan-cube" received SIGSEGV in
radv_pipeline_init_blend_state at
../src/amd/vulkan/radv_pipeline.c:699]
- `Bug 109561 <https://bugs.freedesktop.org/show_bug.cgi?id=109561>`__
- [regression, bisected] code re-factor causing games to stutter or
lock-up system
- `Bug 109573 <https://bugs.freedesktop.org/show_bug.cgi?id=109573>`__
- dEQP-VK.spirv_assembly.instruction.graphics.module.same_module
- `Bug 109575 <https://bugs.freedesktop.org/show_bug.cgi?id=109575>`__
- Mesa-19.0.0-rc1 : Computer Crashes trying to run anything Vulkan
- `Bug 109581 <https://bugs.freedesktop.org/show_bug.cgi?id=109581>`__
- [BISECTED] Nothing is Rendered on Sascha Willem's "subpasses" demo
- `Bug 109594 <https://bugs.freedesktop.org/show_bug.cgi?id=109594>`__
- totem assert failure: totem: src/intel/genxml/gen9_pack.h:72:
\__gen_uint: Assertion \`v <= max' failed.
- `Bug 109597 <https://bugs.freedesktop.org/show_bug.cgi?id=109597>`__
- wreckfest issues with transparent objects & skybox
- `Bug 109601 <https://bugs.freedesktop.org/show_bug.cgi?id=109601>`__
- [Regression] RuneLite GPU rendering broken on 18.3.x
- `Bug 109603 <https://bugs.freedesktop.org/show_bug.cgi?id=109603>`__
- nir_instr_as_deref: Assertion \`parent && parent->type ==
nir_instr_type_deref' failed.
- `Bug 109645 <https://bugs.freedesktop.org/show_bug.cgi?id=109645>`__
- build error on arm64: tegra_screen.c:33:
/usr/include/xf86drm.h:41:10: fatal error: drm.h: No such file or
directory
- `Bug 109646 <https://bugs.freedesktop.org/show_bug.cgi?id=109646>`__
- New video compositor compute shader render glitches mpv
- `Bug 109647 <https://bugs.freedesktop.org/show_bug.cgi?id=109647>`__
- /usr/include/xf86drm.h:40:10: fatal error: drm.h: No such file or
directory
- `Bug 109648 <https://bugs.freedesktop.org/show_bug.cgi?id=109648>`__
- AMD Raven hang during va-api decoding
- `Bug 109659 <https://bugs.freedesktop.org/show_bug.cgi?id=109659>`__
- Missing OpenGL symbols in OSMesa Gallium when building with meson
- `Bug 109698 <https://bugs.freedesktop.org/show_bug.cgi?id=109698>`__
- dri.pc contents invalid when built with meson
- `Bug 109717 <https://bugs.freedesktop.org/show_bug.cgi?id=109717>`__
- [regression] Cull distance tests asserting
- `Bug 109735 <https://bugs.freedesktop.org/show_bug.cgi?id=109735>`__
- [Regression] broken font with mesa_vulkan_overlay
- `Bug 109738 <https://bugs.freedesktop.org/show_bug.cgi?id=109738>`__
- Child of Light shows only a black screen
- `Bug 109739 <https://bugs.freedesktop.org/show_bug.cgi?id=109739>`__
- Mesa build fails when vulkan-overlay-layer option is enabled
- `Bug 109742 <https://bugs.freedesktop.org/show_bug.cgi?id=109742>`__
- vdpau state tracker on nv92 started to hit assert after vl compute
work
- `Bug 109743 <https://bugs.freedesktop.org/show_bug.cgi?id=109743>`__
- Test fails:
piglit.spec.arb_sample_shading.arb_sample_shading-builtin-gl-sample-mask-mrt-alpha
- `Bug 109747 <https://bugs.freedesktop.org/show_bug.cgi?id=109747>`__
- Add framerate to vulkan-overlay-layer
- `Bug 109759 <https://bugs.freedesktop.org/show_bug.cgi?id=109759>`__
- [BISECTED][REGRESSION][IVB, HSW] Font rendering problem in OpenGL
- `Bug 109788 <https://bugs.freedesktop.org/show_bug.cgi?id=109788>`__
- vulkan-overlay-layer: Only installs 64bit version
- `Bug 109810 <https://bugs.freedesktop.org/show_bug.cgi?id=109810>`__
- nir_opt_copy_prop_vars.c:454: error: unknown field ssa specified
in initializer
- `Bug 109929 <https://bugs.freedesktop.org/show_bug.cgi?id=109929>`__
- tgsi_to_nir.c:2111: undefined reference to
\`gl_nir_lower_samplers_as_deref'
- `Bug 109944 <https://bugs.freedesktop.org/show_bug.cgi?id=109944>`__
- [bisected] Android build test fails with: utils.c: error: use of
undeclared identifier 'PACKAGE_VERSION'
- `Bug 109945 <https://bugs.freedesktop.org/show_bug.cgi?id=109945>`__
- pan_assemble.c:51:46: error: passing argument 2 of tgsi_to_nir
from incompatible pointer type [-Werror=incompatible-pointer-types]
- `Bug 109980 <https://bugs.freedesktop.org/show_bug.cgi?id=109980>`__
- [i915 CI][HSW]
spec@arb_fragment_shader_interlock@arb_fragment_shader_interlock-image-load-store
- fail
- `Bug 109984 <https://bugs.freedesktop.org/show_bug.cgi?id=109984>`__
- unhandled VkStructureType
VK_STRUCTURE_TYPE_RENDER_PASS_INPUT_ATTACHMENT_ASPECT_CREATE_INFO
- `Bug 110134 <https://bugs.freedesktop.org/show_bug.cgi?id=110134>`__
- SIGSEGV while playing large hevc video in mpv
- `Bug 110143 <https://bugs.freedesktop.org/show_bug.cgi?id=110143>`__
- Doom 3: BFG Edition - Steam and GOG.com - white flickering screen
- `Bug 110201 <https://bugs.freedesktop.org/show_bug.cgi?id=110201>`__
- [ivb] mesa 19.0.0 breaks rendering in kitty
- `Bug 110211 <https://bugs.freedesktop.org/show_bug.cgi?id=110211>`__
- If DESTDIR is set to an empty string, the dri drivers are not
installed
- `Bug 110216 <https://bugs.freedesktop.org/show_bug.cgi?id=110216>`__
- radv: Segfault when compiling compute shaders from Assassin's Creed
Odyssey (regression, bisected)
- `Bug 110221 <https://bugs.freedesktop.org/show_bug.cgi?id=110221>`__
- build error with meson
- `Bug 110239 <https://bugs.freedesktop.org/show_bug.cgi?id=110239>`__
- Mesa SIGABRT: src/intel/genxml/gen9_pack.h:72: \__gen_uint:
Assertion \`v <= max' failed
- `Bug 110257 <https://bugs.freedesktop.org/show_bug.cgi?id=110257>`__
- Major artifacts in mpeg2 vaapi hw decoding
- `Bug 110259 <https://bugs.freedesktop.org/show_bug.cgi?id=110259>`__
- radv: Sampling depth-stencil image in GENERAL layout returns
nothing but zero (regression, bisected)
- `Bug 110291 <https://bugs.freedesktop.org/show_bug.cgi?id=110291>`__
- Vega 64 GPU hang running Space Engineers
- `Bug 110302 <https://bugs.freedesktop.org/show_bug.cgi?id=110302>`__
- [bisected][regression] piglit egl-create-pbuffer-surface and
egl-gl-colorspace regressions
- `Bug 110305 <https://bugs.freedesktop.org/show_bug.cgi?id=110305>`__
- Iris driver fails ext_packed_depth_stencil-getteximage test
- `Bug 110311 <https://bugs.freedesktop.org/show_bug.cgi?id=110311>`__
- [IVB HSW SNB][regression][bisected] regressions on vec4
deqp/gl{es}cts tests
- `Bug 110349 <https://bugs.freedesktop.org/show_bug.cgi?id=110349>`__
- radv: Dragon Quest XI (DXVK) has a graphical glitch (regression,
bisected)
- `Bug 110353 <https://bugs.freedesktop.org/show_bug.cgi?id=110353>`__
- weird colors seen in valley
- `Bug 110355 <https://bugs.freedesktop.org/show_bug.cgi?id=110355>`__
- radeonsi: GTK elements become invisible in some applications (GIMP,
LibreOffice)
- `Bug 110356 <https://bugs.freedesktop.org/show_bug.cgi?id=110356>`__
- install_megadrivers.py creates new dangling symlink [bisected]
- `Bug 110404 <https://bugs.freedesktop.org/show_bug.cgi?id=110404>`__
- Iris fails piglit.spec.ext_transform_feedback.immediate-reuse test
- `Bug 110422 <https://bugs.freedesktop.org/show_bug.cgi?id=110422>`__
- AMD_DEBUG=forcedma will crash OpenGL aps with SIGFAULT on VegaM
8706G
- `Bug 110441 <https://bugs.freedesktop.org/show_bug.cgi?id=110441>`__
- [llvmpipe] complex-loop-analysis-bug regression
- `Bug 110443 <https://bugs.freedesktop.org/show_bug.cgi?id=110443>`__
- vaapi/vpp: wrong output for non 64-bytes align width (ex: 1200)
- `Bug 110454 <https://bugs.freedesktop.org/show_bug.cgi?id=110454>`__
- [llvmpipe] piglit arb_color_buffer_float-render GL_RGBA8_SNORM
failure with llvm-9
- `Bug 110462 <https://bugs.freedesktop.org/show_bug.cgi?id=110462>`__
- Epic Games Launcher renders nothing with "-opengl" option
- `Bug 110474 <https://bugs.freedesktop.org/show_bug.cgi?id=110474>`__
- [bisected][regression] vk cts fp16 arithmetic failures
- `Bug 110497 <https://bugs.freedesktop.org/show_bug.cgi?id=110497>`__
- [DXVK][Regression][Bisected][SKL] Project Cars 2 crashes with Bug
Splat when loading finishes
- `Bug 110526 <https://bugs.freedesktop.org/show_bug.cgi?id=110526>`__
- [CTS] dEQP-VK.ycbcr.{conversion,format}.\* fail
- `Bug 110530 <https://bugs.freedesktop.org/show_bug.cgi?id=110530>`__
- [CTS] dEQP-VK.ycbcr.format.g8_b8_r8_3plane_420\* reports VM faults
on Vega10
- `Bug 110535 <https://bugs.freedesktop.org/show_bug.cgi?id=110535>`__
- [bisected] [icl] GPU hangs on crucible
func.miptree.r8g8b8a8-unorm.aspect-color.view-2d.levels01.array01.extent-512x512.upload-copy-with-draw
tests
- `Bug 110540 <https://bugs.freedesktop.org/show_bug.cgi?id=110540>`__
- [AMD TAHITI XT] valve artifact broken
- `Bug 110573 <https://bugs.freedesktop.org/show_bug.cgi?id=110573>`__
- Mesa vulkan-radeon 19.0.3 system freeze and visual artifacts (RADV)
- `Bug 110590 <https://bugs.freedesktop.org/show_bug.cgi?id=110590>`__
- [Regression][Bisected] GTAⅣ under wine fails with GLXBadFBConfig
- `Bug 110632 <https://bugs.freedesktop.org/show_bug.cgi?id=110632>`__
- "glx: Fix synthetic error generation in \__glXSendError" broke wine
games on 32-bit
- `Bug 110648 <https://bugs.freedesktop.org/show_bug.cgi?id=110648>`__
- Dota2 will not open using vulkan since 19.0 series
- `Bug 110655 <https://bugs.freedesktop.org/show_bug.cgi?id=110655>`__
- VK_LAYER_MESA_OVERLAY_CONFIG=draw,fps renders sporadically
- `Bug 110698 <https://bugs.freedesktop.org/show_bug.cgi?id=110698>`__
- tu_device.c:900:4: error: initializer element is not constant
- `Bug 110701 <https://bugs.freedesktop.org/show_bug.cgi?id=110701>`__
- GPU faults in Unigine Valley 1.0
- `Bug 110721 <https://bugs.freedesktop.org/show_bug.cgi?id=110721>`__
- graphics corruption on steam client with mesa 19.1.0 rc3 on polaris
- `Bug 110761 <https://bugs.freedesktop.org/show_bug.cgi?id=110761>`__
- Huge problems between Mesa and Electron engine apps
- `Bug 110784 <https://bugs.freedesktop.org/show_bug.cgi?id=110784>`__
- [regression][bisected] Reverting 'expose 0 shader binary formats
for compat profiles for Qt' causes get_program_binary failures on
Iris

Changes
-------

Adam Jackson (1):
- drisw: Try harder to probe whether MIT-SHM works
Albert Pal (1):
- Fix link release notes for 19.0.0.
Alejandro Piñeiro (12):
- blorp: introduce helper method blorp_nir_init_shader
- nir, glsl: move pixel_center_integer/origin_upper_left to
shader_info.fs
- nir/xfb: add component_offset at nir_xfb_info
- nir_types: add glsl_varying_count helper
- nir/xfb: adding varyings on nir_xfb_info and gather_info
- nir/xfb: sort varyings too
- nir_types: add glsl_type_is_struct helper
- nir/xfb: handle arrays and AoA of basic types
- nir/linker: use nir_gather_xfb_info
- nir/linker: fix ARRAY_SIZE query with xfb varyings
- nir/xfb: move varyings info out of nir_xfb_info
- docs: document MESA_GLSL=errors keyword
Alexander von Gluck IV (1):
- haiku: Fix hgl dispatch build. Tested under meson/scons.
Alexandros Frantzis (1):
- virgl: Fake MSAA when max samples is 1
Alok Hota (32):
- swr/rast: update SWR rasterizer shader stats
- gallium/swr: Param defaults for unhandled PIPE_CAPs
- gallium/aux: add PIPE_CAP_MAX_VARYINGS to u_screen
- swr/rast: Convert system memory pointers to gfxptr_t
- swr/rast: Disable use of \__forceinline by default
- swr/rast: Correctly align 64-byte spills/fills
- swr/rast: Flip BitScanReverse index calculation
- swr/rast: Move knob defaults to generated cpp file
- swr/rast: FP consistency between POSH/RENDER pipes
- swr/rast: Refactor scratch space variable names
- swr/rast: convert DWORD->uint32_t, QWORD->uint64_t
- swr/rast: simdlib cleanup, clipper stack space fixes
- swr/rast: Add translation support to streamout
- swr/rast: bypass size limit for non-sampled textures
- swr/rast: Cleanup and generalize gen_archrast
- swr/rast: Add initial SWTag proto definitions
- swr/rast: Add string handling to AR event framework
- swr/rast: Add general SWTag statistics
- swr/rast: Fix autotools and scons codegen
- swr/rast: Remove deprecated 4x2 backend code
- swr/rast: AVX512 support compiled in by default
- swr/rast: enforce use of tile offsets
- swr/rast: add more llvm intrinsics
- swr/rast: update guardband rects at draw setup
- swr/rast: add SWR_STATIC_ASSERT() macro
- swr/rast: add flat shading
- swr/rast: add guards for cpuid on Linux
- swr/rast: early exit on empty triangle mask
- swr/rast: Cleanup and generalize gen_archrast
- swr/rast: Add initial SWTag proto definitions
- swr/rast: Add string handling to AR event framework
- swr/rast: Add general SWTag statistics
Alyssa Rosenzweig (192):
- panfrost: Initial stub for Panfrost driver
- panfrost: Implement Midgard shader toolchain
- meson: Remove panfrost from default driver list
- kmsro: Move DRM entrypoints to shared block
- panfrost: Use u_pipe_screen_get_param_defaults
- panfrost: Check in sources for command stream
- panfrost: Include glue for out-of-tree legacy code
- kmsro: Silence warning if missing
- panfrost: Clean-up one-argument passing quirk
- panfrost: Don't hardcode number of nir_ssa_defs
- panfrost: Add kernel-agnostic resource management
- panfrost: Remove if 0'd dead code
- panfrost: Remove speculative if 0'd format bit code
- panfrost: Elucidate texture op scheduling comment
- panfrost: Specify supported draw modes per-context
- panfrost: Fix build; depend on libdrm
- panfrost: Backport driver to Mali T600/T700
- panfrost: Identify MALI_OCCLUSION_PRECISE bit
- panfrost: Implement PIPE_QUERY_OCCLUSION_COUNTER
- panfrost: Don't align framebuffer dims
- panfrost: Improve logging and patch memory leaks
- panfrost: Fix various leaks unmapping resources
- panfrost: Free imported BOs
- panfrost: Swap order of tiled texture (de)alloc
- panfrost: Cleanup mali_viewport (clipping) code
- panfrost: Preserve w sign in perspective division
- panfrost: Fix clipping region
- panfrost: Stub out separate stencil functions
- panfrost: Add pandecode (command stream debugger)
- panfrost: Implement pantrace (command stream dump)
- panfrost/midgard: Refactor tag lookahead code
- panfrost/midgard: Fix nested/chained if-else
- panfrost: Rectify doubleplusungood extended branch
- panfrost/midgard: Emit extended branches
- panfrost: Dynamically set discard branch targets
- panfrost: Verify and print brx condition in disasm
- panfrost: Use tiler fast path (performance boost)
- panfrost/meson: Remove subdir for nondrm
- panfrost/nondrm: Flag CPU-invisible regions
- panfrost/nondrm: Make COHERENT_LOCAL explicit
- panfrost/nondrm: Split out dump_counters
- panfrost/midgard: Add fround(_even), ftrunc, ffma
- panfrost: Decode render target swizzle/channels
- panfrost: Add RGB565, RGB5A1 texture formats
- panfrost: Identify 4-bit channel texture formats
- panfrost: Expose perf counters in environment
- panfrost/midgard: Allow flt to run on most units
- panfrost: Import job data structures from v3d
- panfrost: Decouple Gallium clear from FBD clear
- panfrost: Cleanup cruft related to clears
- panfrost/midgard: Don't force constant on VLUT
- panfrost: Flush with offscreen rendering
- panfrost/midgard: Promote smul to vmul
- panfrost/midgard: Preview for data hazards
- panfrost: List primitive restart enable bit
- panfrost/drm: Cast pointer to u64 to fix warning
- panfrost: Cleanup needless if in create_bo
- panfrost: Combine has_afbc/tiled in layout enum
- panfrost: Delay color buffer setup
- panfrost: Determine framebuffer format bits late
- panfrost: Allocate dedicated slab for linear BOs
- panfrost: Support linear depth textures
- panfrost: Document "depth-buffer writeback" bit
- panfrost: Identify fragment_extra flags
- util: Add a drm_find_modifier helper
- v3d: Use shared drm_find_modifier util
- vc4: Use shared drm_find_modifier util
- freedreno: Use shared drm_find_modifier util
- panfrost: Break out fragment to SFBD/MFBD files
- panfrost: Remove staging SFBD for pan_context
- panfrost: Remove staging MFBD
- panfrost: Minor comment cleanup (version detection)
- panfrost/mfbd: Implement linear depth buffers
- panfrost/mfbd: Respect per-job depth write flag
- panfrost: Comment spelling fix
- panfrost: Allocate extra data for depth buffer
- panfrost; Disable AFBC for depth buffers
- panfrost: Compute viewport state on the fly
- panfrost/midgard: Implement fpow
- panfrost: Workaround buffer overrun with mip level
- panfrost: Fix primconvert check
- panfrost: Disable PIPE_CAP_TGSI_TEXCOORD
- panfrost/decode: Respect primitive size pointers
- panfrost: Replay more varying buffers
- panfrost: Rewrite varying assembly
- panfrost/midgard: Fix b2f32 swizzle for vectors
- panfrost: Fix viewports
- panfrost: Implement scissor test
- panfrost/midgard: Add fcsel_i opcode
- panfrost/midgard: Schedule ball/bany to vectors
- panfrost/midgard: Add more ball/bany, iabs ops
- panfrost/midgard: Map more bany/ball opcodes
- panfrost/midgard: Lower bool_to_int32
- panfrost/midgard: Lower f2b32 to fne
- panfrost/midgard: Lower i2b32
- panfrost/midgard: Implement b2i; improve b2f/f2b
- panfrost/midgard: Lower source modifiers for ints
- panfrost/midgard: Cleanup midgard_nir_algebraic.py
- panfrost: Stub out ES3 caps/callbacks
- panfrost/midgard: Add ult/ule ops
- panfrost/midgard: Expand fge lowering to more types
- panfrost/midgard: Handle i2b constant
- panfrost/midgard: fpow is a two-part operation
- panfrost: Preliminary work for mipmaps
- panfrost: Fix vertex buffer corruption
- panfrost/midgard: Disassemble \`cube\` texture op
- panfrost/midgard: Add L/S op for writing cubemap coordinates
- panfrost: Preliminary work for cubemaps
- panfrost/decode: Decode all cubemap faces
- panfrost: Include all cubemap faces in bitmap list
- panfrost/midgard: Emit cubemap coordinates
- panfrost: Implement command stream for linear cubemaps
- panfrost: Extend tiling for cubemaps
- panfrost: Implement missing texture formats
- panfrost/decode: Print negative_start
- panfrost: Clean index state between indexed draws
- panfrost: Fix index calculation types and asserts
- panfrost: Implement FIXED formats
- panfrost: Remove support for legacy kernels
- nir: Add "viewport vector" system values
- panfrost: Implement system values
- panfrost: Cleanup some indirection in pan_resource
- panfrost: Respect box->width in tiled stores
- panfrost: Size tiled temp buffers correctly
- panfrost/decode: Add flags for tilebuffer readback
- panfrost: Add tilebuffer load? branch
- panfrost/midgard: Add umin/umax opcodes
- panfrost/midgard: Add ilzcnt op
- panfrost/midgard: Add ibitcount8 op
- panfrost/midgard: Enable lower_find_lsb
- panfrost: Remove "mali_unknown6" nonsense
- panfrost/midgard: Drop dependence on mesa/st
- panfrost: Cleanup indexed draw handling
- nir: Add nir_lower_viewport_transform
- panfrost/midgard: Use shared nir_lower_viewport_transform
- panfrost: Track BO lifetime with jobs and reference counts
- panfrost: Fixup vertex offsets to prevent shadow copy
- panfrost/mdg: Use shared fsign lowering
- panfrost/mdg/disasm: Print raw varying_parameters
- panfrost/midgard: Pipe through varying arrays
- panfrost/midgard: Implement indirect loads of varyings/UBOs
- panfrost/midgard: Respect component of bcsel condition
- panfrost/midgard: Remove useless MIR dump
- panfrost: Respect backwards branches in RA
- panfrost/midgard: Don't try to inline constants on branches
- panfrost/midgard: imul can only run on \*mul
- panfrost: Disable indirect outputs for now
- panfrost: Use actual imov instruction
- panfrost/midgard: Dead code eliminate MIR
- panfrost/midgard: Track loop depth
- panfrost/midgard: Fix off-by-one in successor analysis
- panfrost/midgard: Remove unused mir_next_block
- panfrost/midgard: Update integer op list
- panfrost/midgard: Document sign-extension/zero-extension bits
(vector)
- panfrost/midgard: Set integer mods
- panfrost/midgard: Implement copy propagation
- panfrost/midgard: Optimize MIR in progress loop
- panfrost/midgard: Refactor opcode tables
- panfrost/midgard: Add "op commutes?" property
- panfrost/midgard: Remove assembler
- panfrost/midgard: Reduce fmax(a, 0.0) to fmov.pos
- panfrost/midgard: Extend copy propagation pass
- panfrost/midgard: Optimize csel involving 0
- panfrost/midgard: Copy prop for texture registers
- panfrost/midgard: Identify inand
- panfrost/midgard: Add new bitwise ops
- Revert "panfrost/midgard: Extend copy propagation pass"
- panfrost/midgard: Only copyprop without an outmod
- panfrost/midgard: Fix regressions in -bjellyfish
- panfrost/midgard: Fix tex propogation
- panfrost/midgard: imov workaround
- panfrost: Use fp32 (not fp16) varyings
- panfrost/midgard: Safety check immediate precision degradations
- panfrost: Workaround -bshadow regression
- panfrost: Remove shader dump
- panfrost/decode: Hit MRT blend shader enable bits
- panfrost: Fix blend shader upload
- panfrost/midgard: reg_mode_full -> reg_mode_32, etc
- panfrost/midgard/disasm: Catch mask errors
- panfrost/midgard/disasm: Extend print_reg to 8-bit
- panfrost/midgard/disasm: Fill in .int mod
- panfrost/midgard: Fix crash on unknown op
- panfrost/midgard: Rename ilzcnt8 -> iclz
- panfrost/midgard/disasm: Support 8-bit destination
- panfrost/midgard/disasm: Print 8-bit sources
- panfrost/midgard/disasm: Stub out 64-bit
- panfrost/midgard/disasm: Handle dest_override generalized
- panfrost: Support RGB565 FBOs
- panfrost/midgard: Fix integer selection
- panfrost/midgard: Fix RA when temp_count = 0
- panfrost/midgard: Lower mixed csel (NIR)
- panfrost/midgard: iabs cannot run on mul
Alyssa Ross (1):
- get_reviewer.pl: improve portability
Amit Pundir (1):
- mesa: android: freedreno: build libfreedreno_{drm,ir3} static libs
Andre Heider (5):
- iris: fix build with gallium nine
- iris: improve PIPE_CAP_VIDEO_MEMORY bogus value
- iris: add support for tgsi_to_nir
- st/nine: enable csmt per default on iris
- st/nine: skip position checks in SetCursorPosition()
Andreas Baierl (2):
- nir: add rcp(w) lowering for gl_FragCoord
- lima/ppir: Add gl_FragCoord handling
Andres Gomez (12):
- mesa: INVALID_VALUE for wrong type or format in Clear\*Buffer\*Data
- gitlab-ci: install distro's ninja
- glsl: correctly validate component layout qualifier for dvec{3,4}
- glsl/linker: always validate explicit location among inputs
- glsl/linker: don't fail non static used inputs without matching
outputs
- glsl/linker: simplify xfb_offset vs xfb_stride overflow check
- Revert "glsl: relax input->output validation for SSO programs"
- glsl/linker: location aliasing requires types to have the same width
- docs: drop Andres Gomez from the release cycles
- glsl/linker: always validate explicit locations for first and last
interfaces
- docs/relnotes: add support for VK_KHR_shader_float16_int8
- glsl/linker: check for xfb_offset aliasing
Andrii Simiklit (5):
- i965: consider a 'base level' when calculating width0, height0,
depth0
- i965: re-emit index buffer state on a reset option change.
- util: clean the 24-bit unused field to avoid an issues
- iris: make the TFB result visible to others
- egl: return correct error code for a case req ver < 3 with
forward-compatible
Antia Puentes (1):
- nir/linker: Fix TRANSFORM_FEEDBACK_BUFFER_INDEX
Anuj Phogat (7):
- i965/icl: Add WA_2204188704 to disable pixel shader panic dispatch
- anv/icl: Add WA_2204188704 to disable pixel shader panic dispatch
- intel: Add Elkhart Lake device info
- intel: Add Elkhart Lake PCI-IDs
- iris/icl: Set Enabled Texel Offset Precision Fix bit
- iris/icl: Add WA_2204188704 to disable pixel shader panic dispatch
- intel: Add support for Comet Lake
Axel Davy (49):
- st/nine: Ignore window size if error
- st/nine: Ignore multisample quality level if no ms
- st/nine: Disable depth write when nothing gets updated
- st/nine: Do not advertise support for D15S1 and D24X4S4
- st/nine: Do not advertise CANMANAGERESOURCE
- st/nine: Change a few advertised caps
- Revert "d3dadapter9: Support software renderer on any DRI device"
- st/nine: Fix D3DWindowBuffer_release for old wine nine support
- st/nine: Use FLT_MAX/2 for RCP clamping
- st/nine: Upload managed textures only at draw using them
- st/nine: Upload managed buffers only at draw using them
- st/nine: Fix buffer/texture unbinding in nine_state_clear
- st/nine: Finish if nooverwrite after normal mapping
- st/nine: Always return OK on SetSoftwareVertexProcessing
- st/nine: Enable modifiers on ps 1.X texcoords
- st/nine: Ignore nooverwrite for systemmem
- st/nine: Fix SINCOS input
- st/nine: Optimize surface upload with conversion
- st/nine: Optimize volume upload with conversion
- st/nine: rename \*_conversion to \*_internal
- st/nine: Refactor surface GetSystemMemPointer
- st/nine: Refactor volume GetSystemMemPointer
- st/nine: Support internal compressed format for surfaces
- st/nine: Support internal compressed format for volumes
- st/nine: Add drirc option to use data_internal for dynamic textures
- drirc: Add Gallium nine workaround for Rayman Legends
- st/nine: Recompile optimized shaders based on b/i consts
- st/nine: Control shader constant inlining with drirc
- st/nine: Regroup param->rel tests
- st/nine: Refactor param->rel
- st/nine: Compact nine_ff_get_projected_key
- st/nine: Compact pixel shader key
- st/nine: use helper ureg_DECL_sampler everywhere
- st/nine: Manually upload vs and ps constants
- st/nine: Refactor shader constants ureg_src computation
- st/nine: Make swvp_on imply IS_VS
- st/nine: Refactor ct_ctor
- st/nine: Track constant slots used
- st/nine: Refactor counting of constants
- st/nine: Prepare constant compaction in nine_shader
- st/nine: Propagate const_range to context
- st/nine: Cache constant buffer size
- st/nine: Handle const_ranges in nine_state
- st/nine: Enable computing const_ranges
- st/nine: Use TGSI_SEMANTIC_GENERIC for fog
- st/nine: Optimize a bit writeonly buffers
- st/nine: Throttle rendering similarly for thread_submit
- st/nine: Check discard_delayed_release is set before allocating more
- d3dadapter9: Revert to old throttling limit value
Bart Oldeman (1):
- gallium-xlib: query MIT-SHM before using it.
Bas Nieuwenhuizen (105):
- radv: Only look at pImmutableSamples if the descriptor has a sampler.
- amd/common: Add gep helper for pointer increment.
- amd/common: Implement ptr->int casts in ac_to_integer.
- radv: Fix the shader info pass for not having the variable.
- amd/common: Use correct writemask for shared memory stores.
- amd/common: Fix stores to derefs with unknown variable.
- amd/common: Handle nir_deref_type_ptr_as_array for shared memory.
- amd/common: handle nir_deref_cast for shared memory from integers.
- amd/common: Do not use 32-bit loads for shared memory.
- amd/common: Implement global memory accesses.
- radv: Do not use the bo list for local buffers.
- radv: Implement VK_EXT_buffer_device_address.
- radv: Use correct num formats to detect whether we should use 1.0
or 1.
- radv: Sync ETC2 whitelisted devices.
- radv: Clean up a bunch of compiler warnings.
- radv: Handle clip+cull distances more generally as compact arrays.
- radv: Implement VK_EXT_depth_clip_enable.
- radv: Disable depth clamping even without
EXT_depth_range_unrestricted.
- radv: Fix float16 interpolation set up.
- radv: Allow interpolation on non-float types.
- radv: Interpolate less aggressively.
- turnip: Add driver skeleton (v2)
- turnip: Fix up detection of device.
- turnip: Gather some device info.
- turnip: Remove abort.
- turnip: Fix newly introduced warning.
- turnip: Add buffer allocation & mapping support.
- turnip: Report a memory type and heap.
- turnip: Cargo cult the Intel heap size functionality.
- turnip: Initialize memory type in requirements.
- turnip: Disable more features.
- turnip: Add 630 to the list.
- turnip: Fix bo allocation after we stopped using libdrm_freedreno ...
- turnip: Fix memory mapping.
- turnip: Add image layout calculations.
- turnip: Stop hardcoding the msm version check.
- turnip: move tu_gem.c to tu_drm.c
- turnip: Implement pipe-less param query.
- turnip: Implement some format properties for RGBA8.
- turnip: Remove some radv leftovers.
- turnip: clean up TODO.
- turnip: Implement some UUIDs.
- turnip: Implement a slow bo list
- turnip: Add a command stream.
- turnip: Add msm queue support.
- turnip: Make bo_list functions not static
- turnip: Implement submission.
- turnip: Fill command buffer
- turnip: Shorten primary_cmd_stream name.
- turnip: Add emit functions in a header.
- turnip: Move stream functions to tu_cs.c
- turnip: Add buffer memory binding.
- turnip: Make tu6_emit_event_write shared.
- turnip: Add tu6_rb_fmt_to_ifmt.
- turnip: Implement buffer->buffer DMA copies.
- turnip: Add image->buffer DMA copies.
- turnip: Add buffer->image DMA copies.
- turnip: Add todo for copies.
- turnip: Fix GCC compiles.
- turnip: Deconflict vk_format_table regeneration
- gitlab-ci: Build turnip.
- radeonsi: Remove implicit const cast.
- radv: Allow fast clears with concurrent queue mask for some layouts.
- vulkan/util: Handle enums that are in platform-specific headers.
- vulkan: Update the XML and headers to 1.1.104
- radv: Implement VK_EXT_host_query_reset.
- radv: Use correct image view comparison for fast clears.
- radv: Implement VK_EXT_pipeline_creation_feedback.
- ac/nir: Return frag_coord as integer.
- nir: Add access qualifiers on load_ubo intrinsic.
- radv: Add non-uniform indexing lowering.
- radv: Add bolist RADV_PERFTEST flag.
- ac: Move has_local_buffers disable to radeonsi.
- radv: Use local buffers for the global bo list.
- radv: Support VK_EXT_inline_uniform_block.
- radv: Add support for driconf.
- vulkan/wsi: Add X11 adaptive sync support based on dri options.
- radv: Add adaptive_sync driconfig option and enable it by default.
- radv: Add logic for subsampled format descriptions.
- radv: Add logic for multisample format descriptions.
- radv: Add multiple planes to images.
- radv: Add single plane image views & meta operations.
- radv: Support different source & dest aspects for planar images in
blit2d.
- radv: Add ycbcr conversion structs.
- radv: Add support for image views with multiple planes.
- radv: Allow mixed src/dst aspects in copies.
- ac/nir: Add support for planes.
- radv: Add ycbcr samplers in descriptor set layouts.
- radv: Update descriptor sets for multiple planes.
- radv: Add ycbcr lowering pass.
- radv: Run the new ycbcr lowering pass.
- radv: Add hashing for the ycbcr samplers.
- radv: Add ycbcr format features.
- radv: Add ycbcr subsampled & multiplane formats to csv.
- radv: Enable YCBCR conversion feature.
- radv: Expose VK_EXT_ycbcr_image_arrays.
- radv: Expose Vulkan 1.1 for Android.
- radv: Fix hang with YCBCR array textures.
- radv: Set is_array in lowered ycbcr tex instructions.
- radv: Restrict YUVY formats to 1 layer.
- radv: Disable subsampled formats.
- radv: Implement cosited_even sampling.
- radv: Do not use extra descriptor space for the 3rd plane.
- nir: Actually propagate progress in nir_opt_move_load_ubo.
- radv: Prevent out of bound shift on 32-bit builds.
Benjamin Gordon (1):
- configure.ac/meson.build: Add options for library suffixes
Benjamin Tissoires (1):
- CI: use wayland ci-templates repo to create the base image
Boyan Ding (3):
- gk110/ir: Add rcp f64 implementation
- gk110/ir: Add rsq f64 implementation
- gk110/ir: Use the new rcp/rsq in library
Boyuan Zhang (1):
- st/va: reverse qt matrix back to its original order
Brian Paul (51):
- st/mesa: whitespace/formatting fixes in st_cb_texture.c
- svga: assorted whitespace and formatting fixes
- svga: fix dma.pending > 0 test
- mesa: fix display list corner case assertion
- st/mesa: whitespace fixes in st_sampler_view.c
- st/mesa: line wrapping, whitespace fixes in st_cb_texture.c
- st/mesa: whitespace fixes in st_texture.h
- svga: init fill variable to avoid compiler warning
- svga: silence array out of bounds warning
- st/wgl: init a variable to silence MinGW warning
- gallium/util: whitespace cleanups in u_bitmask.[ch]
- gallium/util: add some const qualifiers in u_bitmask.c
- pipebuffer: use new pb_usage_flags enum type
- pipebuffer: whitespace fixes in pb_buffer.h
- winsys/svga: use new pb_usage_flags enum type
- st/mesa: move, clean-up shader variant key decls/inits
- st/mesa: whitespace, formatting fixes in st_cb_flush.c
- svga: refactor draw_vgpu10() function
- svga: remove SVGA_RELOC_READ flag in SVGA3D_BindGBSurface()
- pipebuffer: s/PB_ALL_USAGE_FLAGS/PB_USAGE_ALL/
- st/mesa: init hash keys with memset(), not designated initializers
- intel/decoders: silence uninitialized variable warnings in
gen_print_batch()
- intel/compiler: silence uninitialized variable warning in
opt_vector_float()
- st/mesa: move utility functions, macros into new st_util.h file
- st/mesa: move around some code in st_context.c
- st/mesa: add/improve sampler view comments
- st/mesa: rename st_texture_release_sampler_view()
- st/mesa: minor refactoring of texture/sampler delete code
- docs: try to improve the Meson documentation (v2)
- drisw: fix incomplete type compilation failure
- gallium/winsys/kms: fix incomplete type compilation failure
- nir: silence a couple new compiler warnings
- docs: separate information for compiler selection and compiler
options
- docs: link to the meson_options.txt file gitlab.freedesktop.org
- st/mesa: implement "zombie" sampler views (v2)
- st/mesa: implement "zombie" shaders list
- st/mesa: stop using pipe_sampler_view_release()
- svga: stop using pipe_sampler_view_release()
- llvmpipe: stop using pipe_sampler_view_release()
- swr: remove call to pipe_sampler_view_release()
- i915g: remove calls to pipe_sampler_view_release()
- gallium/util: remove pipe_sampler_view_release()
- nir: fix a few signed/unsigned comparison warnings
- st/mesa: fix texture deletion context mix-up issues (v2)
- nir: use {0} initializer instead of {} to fix MSVC build
- util: no-op \__builtin_types_compatible_p() for non-GCC compilers
- docs: s/Aptril/April/
- llvmpipe: init some vars to NULL to silence MinGW compiler warnings
- glsl: work around MinGW 7.x compiler bug
- svga: add SVGA_NO_LOGGING env var (v2)
- glsl: fix typo in #warning message
Caio Marcelo de Oliveira Filho (61):
- nir: keep the phi order when splitting blocks
- i965: skip bit6 swizzle detection in Gen8+
- anv: skip bit6 swizzle detection in Gen8+
- isl: assert that Gen8+ don't have bit6_swizzling
- intel/compiler: use 0 as sampler in emit_mcs_fetch
- nir: fix example in opt_peel_loop_initial_if description
- iris: Fix uses of gl_TessLevel\*
- iris: Add support for TCS passthrough
- iris: always include an extra constbuf0 if using UBOs
- nir/copy_prop_vars: don't get confused by array_deref of vectors
- nir/copy_prop_vars: add debug helpers
- nir/copy_prop_vars: keep track of components in copy_entry
- nir/copy_prop_vars: change test helper to get intrinsics
- nir: nir_build_deref_follower accept array derefs of vectors
- nir/copy_prop_vars: add tests for load/store elements of vectors
- nir: fix MSVC build
- st/nir: count num_uniforms for FS builtin shader
- nir/copy_prop_vars: rename/refactor store_to_entry helper
- nir/copy_prop_vars: use NIR_MAX_VEC_COMPONENTS
- nir/copy_prop_vars: handle load/store of vector elements
- nir/copy_prop_vars: add tests for indirect array deref
- nir/copy_prop_vars: prefer using entries from equal derefs
- nir/copy_prop_vars: handle indirect vector elements
- anv: Implement VK_EXT_external_memory_host
- nir: Add a pass to combine store_derefs to same vector
- intel/nir: Combine store_derefs after vectorizing IO
- intel/nir: Combine store_derefs to improve code from SPIR-V
- nir: Handle array-deref-of-vector case in loop analysis
- spirv: Add an execution environment to the options
- intel/compiler: handle GLSL_TYPE_INTERFACE as GLSL_TYPE_STRUCT
- spirv: Use interface type for block and buffer block
- iris: Clean up compiler warnings about unused
- nir: Take if_uses into account when repairing SSA
- mesa: Extension boilerplate for NV_compute_shader_derivatives
- glsl: Remove redundant conditions when asserting in_qualifier
- glsl: Enable derivative builtins for NV_compute_shader_derivatives
- glsl: Enable texture builtins for NV_compute_shader_derivatives
- glsl: Parse and propagate derivative_group to shader_info
- nir/algebraic: Lower CS derivatives to zero when no group defined
- nir: Don't set LOD=0 for compute shader that has derivative group
- intel/fs: Use TEX_LOGICAL whenever implicit lod is supported
- intel/fs: Add support for CS to group invocations in quads
- intel/fs: Don't loop when lowering CS intrinsics
- intel/fs: Use NIR_PASS_V when lowering CS intrinsics
- i965: Advertise NV_compute_shader_derivatives
- gallium: Add PIPE_CAP_COMPUTE_SHADER_DERIVATIVES
- iris: Enable NV_compute_shader_derivatives
- spirv: Add support for DerivativeGroup capabilities
- anv: Implement VK_NV_compute_shader_derivatives
- docs: Add NV_compute_shader_derivatives to 19.1.0 relnotes
- spirv: Add more to_string helpers
- spirv: Tell which opcode or value is unhandled when failing
- spirv: Rename vtn_decoration literals to operands
- spirv: Handle SpvOpDecorateId
- nir: Add option to lower tex to txl when shader don't support
implicit LOD
- intel/fs: Don't handle texop_tex for shaders without implicit LOD
- spirv: Properly handle SpvOpAtomicCompareExchangeWeak
- intel/fs: Assert when brw_fs_nir sees a nir_deref_instr
- anv: Fix limits when VK_EXT_descriptor_indexing is used
- nir: Fix nir_opt_idiv_const when negatives are involved
- nir: Fix clone of nir_variable state slots
Carlos Garnacho (1):
- wayland/egl: Ensure EGL surface is resized on DRI update_buffers()
Chad Versace (17):
- turnip: Drop Makefile.am and Android.mk
- turnip: Fix indentation in function signatures
- turnip: Fix result of vkEnumerate*LayerProperties
- turnip: Fix result of vkEnumerate*ExtensionProperties
- turnip: Use vk_outarray in all relevant public functions
- turnip: Fix a real -Wmaybe-uninitialized
- turnip: Fix indentation
- turnip: Require DRM device version >= 1.3
- turnip: Add TODO for Android logging
- turnip: Use vk_errorf() for initialization error messages
- turnip: Replace fd_bo with tu_bo
- turnip: Add TODO file
- turnip: Fix 'unused' warnings
- turnip: Don't return from tu_stub funcs
- turnip: Annotate vkGetImageSubresourceLayout with tu_stub
- turnip: Fix error behavior for
VkPhysicalDeviceExternalImageFormatInfo
- turnip: Use Vulkan 1.1 names instead of KHR
Charmaine Lee (5):
- svga: add svga shader type in the shader variant
- svga: move host logging to winsys
- st/mesa: purge framebuffers with current context after unbinding
winsys buffers
- mesa: unreference current winsys buffers when unbinding winsys
buffers
- svga: Remove unnecessary check for the pre flush bit for setting
vertex buffers
Chenglei Ren (1):
- anv/android: fix missing dependencies issue during parallel build
Chia-I Wu (78):
- egl: fix KHR_partial_update without EXT_buffer_age
- turnip: add .clang-format
- turnip: use msm_drm.h from inc_freedreno
- turnip: remove unnecessary libfreedreno_drm dep
- turnip: add wrappers around DRM_MSM_GET_PARAM
- turnip: add wrappers around DRM_MSM_SUBMITQUEUE\_\*
- turnip: constify tu_device in tu_gem\_\*
- turnip: preliminary support for tu_QueueWaitIdle
- turnip: run sed and clang-format on tu_cs
- turnip: document tu_cs
- turnip: add tu_cs_add_bo
- turnip: minor cleanup to tu_cs_end
- turnip: update cs->start in tu_cs_end
- turnip: inline tu_cs_check_space
- turnip: add more tu_cs helpers
- turnip: build drm_msm_gem_submit_bo array directly
- turnip: add tu_bo_list_merge
- turnip: add cmdbuf->bo_list to bo_list in queue submit
- turnip: preliminary support for tu_BindImageMemory2
- turnip: preliminary support for tu_image_view_init
- turnip: preliminary support for tu_CmdBeginRenderPass
- turnip: add tu_cs_reserve_space(_assert)
- turnip: emit HW init in tu_BeginCommandBuffer
- turnip: preliminary support for tu_GetRenderAreaGranularity
- turnip: add tu_tiling_config
- turnip: add internal helpers for tu_cs
- turnip: add tu_cs_{reserve,add}_entry
- turnip: specify initial size in tu_cs_init
- turnip: never fail tu_cs_begin/tu_cs_end
- turnip: add tu_cs_sanity_check
- turnip: provide both emit_ib and emit_call
- turnip: add tu_cs_mode
- turnip: add TU_CS_MODE_SUB_STREAM
- turnip: preliminary support for loadOp and storeOp
- turnip: add a more complete format table
- turnip: add functions to import/export prime fd
- turnip: advertise VK_KHR_external_memory_capabilities
- turnip: advertise VK_KHR_external_memory
- turnip: add support for VK_KHR_external_memory_{fd,dma_buf}
- turnip: fix VkClearValue packing
- turnip: preliminary support for fences
- turnip: respect color attachment formats
- turnip: mark IBs for dumping
- turnip: use 32-bit offset in tu_cs_entry
- turnip: more/better asserts for tu_cs
- turnip: add tu_cs_discard_entries
- turnip: tu_cs_emit_array
- turnip: fix tu_cs sub-streams
- turnip: simplify tu_cs sub-streams usage
- turnip: create a less dummy pipeline
- turnip: parse VkPipelineDynamicStateCreateInfo
- turnip: parse VkPipelineInputAssemblyStateCreateInfo
- turnip: parse VkPipelineViewportStateCreateInfo
- turnip: parse VkPipelineRasterizationStateCreateInfo
- turnip: parse VkPipelineDepthStencilStateCreateInfo
- turnip: parse VkPipeline{Multisample,ColorBlend}StateCreateInfo
- turnip: preliminary support for shader modules
- turnip: compile VkPipelineShaderStageCreateInfo
- turnip: parse VkPipelineShaderStageCreateInfo
- turnip: parse VkPipelineVertexInputStateCreateInfo
- turnip: add draw_cs to tu_cmd_buffer
- turnip: preliminary support for draw state binding
- turnip: preliminary support for tu_CmdDraw
- turnip: guard -Dvulkan-driver=freedreno
- turnip: preliminary support for tu_GetImageSubresourceLayout
- turnip: preliminary support for Wayland WSI
- vulkan/wsi: move modifier array into wsi_wl_swapchain
- vulkan/wsi: create wl_drm wrapper as needed
- vulkan/wsi: refactor drm_handle_format
- vulkan/wsi: add wsi_wl_display_drm
- vulkan/wsi: add wsi_wl_display_dmabuf
- vulkan/wsi: make wl_drm optional
- virgl: handle fence_server_sync in winsys
- virgl: hide fence internals from the driver
- virgl: introduce virgl_drm_fence
- virgl: fix fence fd version check
- virgl: clear vertex_array_dirty
- virgl: skip empty cmdbufs
Chris Forbes (3):
- glsl: add scaffolding for EXT_gpu_shader4
- glsl: enable noperspective|flat|centroid for EXT_gpu_shader4
- glsl: enable types for EXT_gpu_shader4
Chris Wilson (19):
- i965: Assert the execobject handles match for this device
- iris: fix import from dri2/3
- iris: IndexFormat = size/2
- iris: Set resource modifier on handle
- iris: Wrap userptr for creating bo
- iris: AMD_pinned_memory
- iris: Record reusability of bo on construction
- iris: fix memzone_for_address since multibinder changes
- iris: Tidy exporting the flink handle
- iris: Fix assigning the output handle for exporting for KMS
- iris: Merge two walks of the exec_bos list
- iris: Tag each submitted batch with a syncobj
- iris: Add fence support using drm_syncobj
- iris: Wire up EGL_IMG_context_priority
- iris: Use PIPE_BUFFER_STAGING for the query objects
- iris: Use coherent allocation for PIPE_RESOURCE_STAGING
- iris: Use streaming loads to read from tiled surfaces
- iris: Push heavy memchecker code to DEBUG
- iris: Adapt to variable ppGTT size
Christian Gmeiner (12):
- etnaviv: rs: mark used src resource as read from
- etnaviv: blt: mark used src resource as read from
- etnaviv: implement ETC2 block patching for HALTI0
- etnaviv: keep track of mapped bo address
- etnaviv: hook-up etc2 patching
- etnaviv: enable ETC2 texture compression support for HALTI0 GPUs
- etnaviv: fix resource usage tracking across different pipe_context's
- etnaviv: fix compile warnings
- st/dri: allow direct UYVY import
- etnaviv: shrink struct etna_3d_state
- nir: add lower_ftrunc
- etnaviv: use the correct uniform dirty bits
Chuck Atkins (1):
- meson: Fix missing glproto dependency for gallium-glx
Connor Abbott (6):
- nir/serialize: Prevent writing uninitialized state_slot data
- nir: Add a stripping pass for improved cacheability
- radeonsi/nir: Use nir stripping pass
- nir/search: Add automaton-based pre-searching
- nir/search: Add debugging code to dump the pattern matched
- nir/algebraic: Don't emit empty initializers for MSVC
Daniel Schürmann (2):
- nir: Define shifts according to SM5 specification.
- nir: Use SM5 properties to optimize shift(a@32, iand(31, b))
Daniel Stone (2):
- panfrost: Properly align stride
- vulkan/wsi/wayland: Respect non-blocking AcquireNextImage
Danylo Piliaiev (13):
- anv: Handle VK_ATTACHMENT_UNUSED in colorAttachment
- radv: Handle VK_ATTACHMENT_UNUSED in CmdClearAttachment
- anv: Fix VK_EXT_transform_feedback working with varyings packed in
PSIZ
- anv: Fix destroying descriptor sets when pool gets reset
- anv: Treat zero size XFB buffer as disabled
- glsl: Cross validate variable's invariance by explicit invariance
only
- i965,iris,anv: Make alpha to coverage work with sample mask
- intel/fs: Make alpha test work with MRT and sample mask
- st/mesa: Fix GL_MAP_COLOR with glDrawPixels GL_COLOR_INDEX
- iris: Fix assert when using vertex attrib without buffer binding
- intel/compiler: Do not reswizzle dst if instruction writes to flag
register
- drirc: Add workaround for Epic Games Launcher
- anv: Do not emulate texture swizzle for INPUT_ATTACHMENT,
STORAGE_IMAGE
Dave Airlie (63):
- virgl: enable elapsed time queries
- virgl: ARB_query_buffer_object support
- docs: update qbo support for virgl
- glsl: glsl to nir fix uninit class member.
- radv/llvm: initialise passes member.
- radv: remove alloc parameter from pipeline init
- iris: fix some hangs around null framebuffers
- iris: fix crash in sparse vertex array
- iris: add initial transform feedback overflow query paths (V3)
- iris: fix cube texture view
- iris: execute compute related query on compute batch.
- iris: iris add load register reg32/64
- iris: add conditional render support
- iris: fix gpu calcs for timestamp queries
- iris/WIP: add broadwell support
- iris: limit gen8 to 8 samples
- iris: setup gen8 caps
- iris: add fs invocations query workaround for broadwell
- iris: handle qbo fragment shader invocation workaround
- st/mesa: add support for lowering fp64/int64 for nir drivers
- softpipe: fix texture view crashes
- nir/spirv: don't use bare types, remove assert in split vars for
testing
- nir/deref: remove casts of casts which are likely redundant (v3)
- softpipe: fix 32-bit bitfield extract
- softpipe: handle 32-bit bitfield inserts
- softpipe: remove shadow_ref assert.
- softpipe: fix integer texture swizzling for 1 vs 1.0f
- nir/split_vars: fixup some more explicit_stride related issues.
- draw: bail instead of assert on instance count (v2)
- draw/gs: fix point size outputs from geometry shader.
- draw/vs: partly fix basevertex/vertex id
- softpipe: fix clears to only clear specified color buffers.
- softpipe/draw: fix vertex id in soft paths.
- softpipe: add indirect store buffer/image unit
- nir/deref: fix struct wrapper casts. (v3)
- nir: use proper array sizing define for vectors
- intel/compiler: use defined size for vector components
- iris: avoid use after free in shader destruction
- ddebug: add compute functions to help hang detection
- draw: add stream member to stats callback
- tgsi: add support for geometry shader streams.
- softpipe: add support for indexed queries.
- draw: add support to tgsi paths for geometry streams. (v2)
- softpipe: add support for vertex streams (v2)
- virgl: add support for missing command buffer binding.
- virgl: add support for ARB_multi_draw_indirect
- virgl: add support for ARB_indirect_parameters
- draw: fix undefined shift of (1 << 31)
- swrast: fix undefined shift of 1 << 31
- llvmpipe: fix undefined shift 1 << 31.
- virgl/drm: cleanup buffer from handle creation (v2)
- virgl/drm: handle flink name better.
- virgl/drm: insert correct handles into the table. (v3)
- intel/compiler: fix uninit non-static variable. (v2)
- nir: fix bit_size in lower indirect derefs.
- r600: reset tex array override even when no view bound
- spirv: fix SpvOpBitSize return value.
- nir: fix lower vars to ssa for larger vector sizes.
- util/tests: add basic unit tests for bitset
- util/bitset: fix bitset range mask calculations.
- kmsro: add \_dri.so to two of the kmsro drivers.
- glsl: init packed in more constructors.
- Revert "mesa: unreference current winsys buffers when unbinding
winsys buffers"
David Riley (3):
- virgl: Store mapped hw resource with transfer object.
- virgl: Allow transfer queue entries to be found and extended.
- virgl: Re-use and extend queue transfers for intersecting buffer
subdatas.
David Shao (1):
- meson: ensure that xmlpool_options.h is generated for gallium targets
that need it
Deepak Rawat (2):
- winsys/drm: Fix out of scope variable usage
- winsys/svga/drm: Fix 32-bit RPCI send message
Dominik Drees (1):
- Add no_aos_sampling GALLIVM_PERF option
Drew Davenport (1):
- util: Don't block SIGSYS for new threads
Dylan Baker (40):
- bump version for 19.0 branch
- docs: Add relnotes stub for 19.1
- gallium: wrap u_screen in extern "C" for c++
- automake: Add --enable-autotools to distcheck flags
- android,autotools,i965: Fix location of float64_glsl.h
- meson: remove build_by_default : true
- meson: fix style in intel/tools
- meson: remove -std=c++11 from intel/tools
- get-pick-list: Add --pretty=medium to the arguments for Cc patches
- meson: Add dependency on genxml to anvil
- meson/iris: Use current coding style
- docs: Add release notes for 19.0.0
- docs: Add SHA256 sums for 19.0.0
- docs: update calendar, add news item, and link release notes for
19.0.0
- bin/install_megadrivers.py: Correctly handle DESTDIR=''
- bin/install_megadrivers.py: Fix regression for set DESTDIR
- docs: Add release notes for 19.0.1
- docs: Add SHA256 sums for mesa 19.0.1
- docs: update calendar, add news item and link release notes for
19.0.1
- meson: Error if LLVM doesn't have rtti when building clover
- meson: Error if LLVM is turned off but clover is turned on
- docs: Add release notes for 19.0.2
- docs: Add sha256 sums for 19.0.2
- docs: update calendar, and news item and link release notes for
19.0.2
- Delete autotools
- docs: drop most autoconf references
- ci: Delete autotools build jobs
- docs: add relnotes for 19.0.3
- docs: Add SHA256 sums for mesa 19.0.3
- docs: update calendar, and news item and link release notes for
19.0.3
- meson: always define libglapi
- glsl: fix general_ir_test with mingw
- meson: switch gles1 and gles2 to auto options
- meson: Make shader-cache a trillean instead of boolean
- meson: make nm binary optional
- util/tests: Use define instead of VLA
- glsl/tests: define ssize_t on windows
- tests/vma: fix build with MSVC
- meson: Don't build glsl cache_test when shader cache is disabled
- meson: Force the use of config-tool for llvm
Eduardo Lima Mitev (5):
- freedreno/a6xx: Silence compiler warnings
- nir: Add ir3-specific version of most SSBO intrinsics
- ir3/nir: Add a new pass 'ir3_nir_lower_io_offsets'
- ir3/compiler: Enable lower_io_offsets pass and handle new SSBO
intrinsics
- ir3/lower_io_offsets: Try propagate SSBO's SHR into a previous shift
instruction
El Christianito (1):
- drirc: add Budgie WM to adaptive-sync blacklist
Eleni Maria Stea (6):
- i965: Faking the ETC2 compression on Gen < 8 GPUs using two miptrees.
- i965: Fixed the CopyImageSubData for ETC2 on Gen < 8
- i965: Enabled the OES_copy_image extension on Gen 7 GPUs
- i965: Removed the field etc_format from the struct intel_mipmap_tree
- i965: fixed clamping in set_scissor_bits when the y is flipped
- radv: consider MESA_VK_VERSION_OVERRIDE when setting the api version
Elie Tournier (3):
- virgl: Add a caps to advertise GLES backend
- virgl: Set PIPE_CAP_DOUBLES when running on GLES This is a lie but no
known app use fp64.
- virgl: Return an error if we use fp64 on top of GLES
Emil Velikov (30):
- vc4: Declare the last cpu pointer as being modified in NEON asm.
- docs: add release notes for 18.3.3
- docs: add sha256 checksums for 18.3.3
- docs: update calendar, add news item and link release notes for
18.3.3
- anv: wire up the state_pool_padding test
- docs: add release notes for 18.3.4
- docs: add sha256 checksums for 18.3.4
- docs: update calendar, add news item and link release notes for
18.3.4
- egl/dri: de-duplicate dri2_load_driver\*
- meson: egl: correctly manage loader/xmlconfig
- loader: use loader_open_device() to handle O_CLOEXEC
- egl/android: bump the number of drmDevices to 64
- docs: mention "Allow commits from members who can merge..."
- egl/sl: split out swrast probe into separate function
- egl/sl: use drmDevice API to enumerate available devices
- egl/sl: use kms_swrast with vgem instead of a random GPU
- docs: add release notes for 18.3.5
- docs: add sha256 checksums for 18.3.5
- docs: update calendar, add news item and link release notes for
18.3.5
- docs: add release notes for 18.3.6
- docs: add sha256 checksums for 18.3.6
- docs: update calendar, add news item and link release notes for
18.3.6
- turnip: drop dead close(master_fd)
- vulkan/wsi: check if the display_fd given is master
- vulkan/wsi: don't use DUMB_CLOSE for normal GEM handles
- llvmpipe: add lp_fence_timedwait() helper
- llvmpipe: correctly handle waiting in llvmpipe_fence_finish
- egl/dri: flesh out and use dri2_create_drawable()
- mapi: add static_date offset to MaxShaderCompilerThreadsKHR
- mapi: correctly handle the full offset table
Emmanuel Gil Peyrot (1):
- docs: make bugs.html easier to find
Eric Anholt (121):
- v3d: Always enable the NEON utile load/store code.
- v3d: Fix a release build set-but-unused compiler warning.
- mesa: Skip partial InvalidateFramebuffer of packed depth/stencil.
- v3d: Fix image_load_store clamping of signed integer stores.
- nir: Move V3D's "the shader was TGSI, ignore FS output types" flag to
NIR.
- v3d: Fix precompile of FRAG_RESULT_DATA1 and higher outputs.
- v3d: Store the actual mask of color buffers present in the key.
- v3d: Fix dumping of shaders with alpha test.
- v3d: Fix pack/unpack of VFPACK operand unpacks.
- v3d: Fix input packing of .l for rounding/fdx/fdy.
- v3d: Fix copy-propagation of input unpacks.
- v3d: Whitespace consistency fix.
- nir: Move panfrost's isign lowering to nir_opt_algebraic.
- v3d: Use the NIR lowering for isign instead of rolling our own.
- intel: Use the NIR lowering for isign.
- freedreno: Use the NIR lowering for isign.
- v3d: Clear the GMP on initialization of the simulator.
- v3d: Sync indirect draws on the last rendering.
- v3d: Use the early_fragment_tests flag for the shader's disable-EZ
field.
- v3d: Fix incorrect flagging of ldtmu as writing r4 on v3d 4.x.
- v3d: Drop a perf note about merging unpack_half_*, which has been
implemented.
- v3d: Drop our hand-lowered nir_op_ffract.
- v3d: Add a helper function for getting a nop register.
- v3d: Refactor bcsel and if condition handling.
- v3d: Do bool-to-cond for discard_if as well.
- v3d: Kill off vir_PF(), which is hard to use right.
- v3d: Fix f2b32 behavior.
- v3d: Fix the check for "is the last thrsw inside control flow"
- v3d: Add a function to describe what the c->execute.file check means.
- v3d: Stop tracking num_inputs for VPM loads.
- v3d: Delay emitting ldvpm on V3D 4.x until it's actually used.
- v3d: Emit a simpler negate for the iabs implementation.
- v3d: Move i2b and f2b support into emit_comparison.
- kmsro: Add the rest of the current set of tinydrm drivers.
- nir: Just return when asked to rewrite uses of an SSA def to itself.
- v3d: Fix vir_is_raw_mov() for input unpacks.
- v3d: Dump the VIR after register spilling if we were forced to.
- v3d: Rematerialize MOVs of uniforms instead of spilling them.
- v3d: Fix build of NEON code with Mesa's cflags not targeting NEON.
- v3d: Restrict live intervals to the blocks reachable from any def.
- v3d: Stop treating exec masking specially.
- nir: Improve printing of load_input/store_output variable names.
- v3d: Translate f2i(fround_even) as FTOIN.
- v3d: Move the stores for fixed function VS output reads into NIR.
- v3d: Fix temporary leaks of temp_registers and when spilling.
- v3d: Do uniform rematerialization spilling before dropping
threadcount
- v3d: Switch implicit uniforms over to being any qinst->uniform != ~0.
- v3d: Add support for vir-to-qpu of ldunif instructions to a temp.
- v3d: Drop the old class bits splitting up the accumulators.
- v3d: Add support for register-allocating a ldunif to a QFILE_TEMP.
- v3d: Use ldunif instructions for uniforms.
- v3d: Eliminate the TLB and TLBU files.
- v3d: Drop the V3D 3.x vpm read dead code elimination.
- v3d: Include a count of register pressure in the RA failure dumps.
- st/dri: Set the PIPE_BIND_SHARED flag on create_image_with_modifiers.
- util: Add a DAG datastructure.
- vc4: Switch over to using the DAG datastructure for QIR scheduling.
- v3d: Reuse list_for_each_entry_rev().
- vc4: Reuse list_for_each_entry_rev().
- v3d: Use the DAG datastructure for QPU instruction scheduling.
- vc4: Switch the post-RA scheduler over to the DAG datastructure.
- v3d: Disable PIPE_CAP_BLIT_BASED_TEXTURE_TRANSFER.
- v3d: Fix leak of the mem_ctx after the DAG refactor.
- v3d: Fix leak of the renderonly struct on screen destruction.
- mesa/st: Make sure that prog_to_nir NIR gets freed.
- mesa/st: Fix leaks of TGSI tokens in VP variants.
- v3d: Always lay out shared tiled buffers with UIF_TOP set.
- v3d: Allow the UIF modifier with renderonly.
- v3d: Expose the dma-buf modifiers query.
- v3d: Rename v3d_tmu_config_data to v3d_unit_data.
- v3d: Move constant offsets to UBO addresses into the main uniform
stream.
- v3d: Upload all of UBO[0] if any indirect load occurs.
- v3d: Remove some dead members of struct v3d_compile.
- egl: Add a 565 pbuffer-only EGL config under X11.
- dri3: Return the current swap interval from glXGetSwapIntervalMESA().
- v3d: Add support for handling OOM signals from the simulator.
- v3d: Bump the maximum texture size to 4k for V3D 4.x.
- v3d: Don't try to use the TFU blit path if a scissor is enabled.
- v3d: Add some more new packets for V3D 4.x.
- st: Lower uniforms in st in the !PIPE_CAP_PACKED_UNIFORMS case as
well.
- vc4: Don't forget to set the range when scalarizing our uniforms.
- vc4: Split UBO0 and UBO1 address uniform handling.
- vc4: Upload CS/VS UBO uniforms together.
- v3d: Add an optimization pass for redundant flags updates.
- nir: Drop comments about the constant_index slots for load/stores.
- nir: Drop remaining references to const_index in favor of the call to
use.
- nir: Add a comment about how intrinsic definitions work.
- v3d: Add and use a define for the number of channels in a QPU
invocation.
- v3d: Drop a note for the future about PIPE_CAP_PACKED_UNIFORMS.
- v3d: Include the number of max temps used in the shader-db output.
- v3d: Replace the old shader-db env var output with the
ARB_debug_output.
- v3d: Add Compute Shader compilation support.
- v3d: Add missing base offset to CS shared memory accesses.
- v3d: Add missing dumping for the spill offset/size uniforms.
- v3d: Detect the correct number of QPUs and use it to fix the spill
size.
- v3d: Use the new lower_to_scratch implementation for indirects on
temps.
- v3d: Only look up the 3rd texture gather offset for non-arrays.
- v3d: Always set up the qregs for CSD payload.
- v3d: Fix an invalid reuse of flags generation from before a thrsw.
- v3d: Fix atomic cmpxchg in shaders on hardware.
- nir: Fix deref offset calculation for structs.
- nir: Use the nir_builder \_imm helpers in setting up deref offsets.
- gallium: Remove the pool pipebuffer manager.
- gallium: Remove the ondemand pipebuffer manager.
- gallium: Remove the "alt" pipebuffer manager interface.
- gallium: Remove the malloc pipebuffer manager.
- st/mesa: Don't set atomic counter size != 0 if MAX_SHADER_BUFFERS ==
0.
- v3d: Disable SSBOs and atomic counters on vertex shaders.
- v3d: Fill in the ignored segment size fields to appease new
simulator.
- v3d: Apply the GFXH-930 workaround to the case where the VS loads
attrs.
- v3d: Assert that we do request the normal texturing return data.
- v3d: Use \_mesa_hash_table_remove_key() where appropriate.
- vc4: Use \_mesa_hash_table_remove_key() where appropriate.
- v3d: Add a note about i/o indirection for future performance work.
- v3d: Don't try to update the shadow texture for separate stencil.
- Revert "v3d: Disable PIPE_CAP_BLIT_BASED_TEXTURE_TRANSFER."
- v3d: Re-add support for memory_barrier_shared.
- v3d: Fix detection of the last ldtmu before a new TMU op.
- v3d: Fix detection of TMU write sequences in register spilling.
- kmsro: Add support for V3D.
- vc4: Fall back to renderonly if the vc4 driver doesn't have v3d.
Eric Engestrom (142):
- wsi/display: add comment
- egl: use coherent variable names
- gitlab-ci: add ubuntu container
- gitlab-ci: add a meson vulkan build
- gitlab-ci: add a make vulkan build
- gitlab-ci: add a scons no-llvm build
- gitlab-ci: add scons llvm 3.5 build
- gitlab-ci: add scons SWR build
- gitlab-ci: add meson loader/classic DRI build
- gitlab-ci: add meson gallium SWR build
- gitlab-ci: add meson gallium RadeonSI build
- gitlab-ci: add meson gallium "other drivers" build
- gitlab-ci: add meson gallium ST Clover (LLVM 5.0) build
- gitlab-ci: add meson gallium ST Clover (LLVM 6.0) build
- gitlab-ci: add meson gallium ST Clover (LLVM 7.0) build
- gitlab-ci: add meson gallium ST "Other" build
- gitlab-ci: add make loaders/classic DRI build
- gitlab-ci: add make Gallium Drivers SWR build
- gitlab-ci: add make Gallium Drivers RadeonSI build
- gitlab-ci: add make Gallium Drivers "Other" build
- gitlab-ci: add make Gallium ST Clover LLVM-3.9 build
- gitlab-ci: add make Gallium ST Clover LLVM-4.0 build
- gitlab-ci: add make Gallium ST Clover LLVM-5.0 build
- gitlab-ci: add make Gallium ST Clover LLVM-6.0 build
- gitlab-ci: add make Gallium ST Clover LLVM-7 build
- gitlab-ci: add make Gallium ST Other build
- travis: remove unused linux code path
- travis: remove unused scons code path
- gitlab-ci: add meson glvnd build
- xvmc: fix string comparison
- xvmc: fix string comparison
- meson: add script to print the options before configuring a builddir
- driconf: drop unused macro
- travis: fix osx make build
- gitlab-ci: workaround docker bug for users with uppercase characters
- wsi: query the ICD's max dimensions instead of hard-coding them
- gitlab-ci: limit ninja to 4 threads max
- drm-uapi/README: remove explicit list of driver names
- drm-uapi: use local files, not system libdrm
- gbm: drop duplicate #defines
- st/dri: drop duplicate #define
- etnaviv: drop duplicate #define
- anv/tests: compile to something sensible in release builds
- util/tests: compile to something sensible in release builds
- gitlab-ci: use ccache to speed up builds
- tegra/meson: add missing dep_libdrm
- tegra/autotools: add missing libdrm cflags
- gitlab-ci: limit the automatic CI to master and MRs
- gitlab-ci: automatically run the CI on pushes to \`ci/\*\` branches
- anv: sort extensions alphabetically
- anv: sort vendors extensions after KHR and EXT
- anv: make sure the extensions stay sorted
- anv: drop unused imports
- anv: use anv_shader_bin_write_to_blob()'s return value
- gitlab-ci: always run the containers build
- dri_interface: add missing #include
- driinfo: add DTD to allow the xml to be validated
- meson/swr: replace hard-coded path with current_build_dir()
- egl/android: replace magic 0=CbCr,1=CrCb with simple enum
- vulkan: use VkBase{In,Out}Structure instead of a custom struct
- driconf: add DTD to allow the drirc xml (00-mesa-defaults.conf) to be
validated
- gitlab-ci: install xmllint to validate 00-mesa-defaults.conf
- anv: simplify chained comparison
- anv: drop unused parameter
- anv: remove spaces around kwargs assignment
- anv: fix typo
- Revert "swr/rast: Archrast codegen updates"
- meson: avoid going back up the tree with include_directories()
- anv: use the platform defines in vk.xml instead of hard-coding them
- radv: use the platform defines in vk.xml instead of hard-coding them
- util: #define PATH_MAX when undefined (eg. Hurd)
- vulkan: import missing file from Khronos
- egl: fix libdrm-less builds
- vulkan: import vk_layer.h from Khronos
- gitlab-ci: drop job prefixes
- meson: fix with_dri2 definition for GNU Hurd
- meson: remove unused include_directories(vulkan)
- vulkan/util: use the platform defines in vk.xml instead of
hard-coding them
- vulkan/overlay: fix missing var rename in previous commit
- meson: don't build libGLES*.so with GLVND
- autotools: don't build libGLES*.so with GLVND
- travis: fix meson build by letting \`auto\` do its job
- travis: drop unused vars
- travis: clean up
- gitlab-ci: only build the default (=latest) and oldest llvm versions
- gitlab-ci: autotools needs to be told which llvm version to use
- r600: cast pointer to expected type
- build: make passing an incorrect pointer type a hard error
- gitlab-ci: fix llvm version (7 doesn't have a ".0")
- hgl/meson: drop unused include directory
- glx/meson: use full include path for dri_interface.h
- android: fix missing backslash for line continuation
- panfrost: fix tgsi_to_nir() call
- panfrost: move #include to fix compilation
- gitlab-ci: add panfrost to the gallium drivers build
- wsi: deduplicate get_current_time() functions between display and x11
- wsi/display: s/#if/#ifdef/ to fix -Wundef
- wsi/wayland: fix pointer casting warning on 32bit
- wsi/x11: use WSI_FROM_HANDLE() instead of pointer casts
- turnip: use the platform defines in vk.xml instead of hard-coding
them
- travis: fix osx meson build
- nir: const \`nir_call_instr::callee\`
- gitlab-ci: add clang build
- gitlab-ci: drop most autotools builds
- util/disk_cache: close fd in the fallback path
- egl: hide entrypoints that shouldn't be exported when using glvnd
- meson: strip rpath from megadrivers
- gallium/hud: fix memory leaks
- gallium/hud: prevent buffer overflow
- gallium/hud: fix rounding error in nic bps computation
- simplify LLVM version string printing
- util/process: document memory leak
- vk/util: remove unneeded array index
- bin: drop unused import from install_megadrivers.py
- meson: remove meson-created megadrivers symlinks
- gitlab-ci: build gallium extra hud
- gitlab-ci: add lima to the build
- delete autotools .gitignore files
- delete autotools input files
- docs: remove unsupported GL function name mangling
- docs: drop autotools python information
- docs: replace autotools instructions with meson equivalent
- docs: use past tense when talking about autotools
- docs: haiku can be built using meson
- egl: fixup autotools-specific wording
- util: add os_read_file() helper
- anv: add support for VK_EXT_memory_budget
- radv: update to use the new features struct names
- turnip: update to use the new features struct names
- gitlab-ci: build vulkan drivers in clang build
- util: move #include out of #if linux
- wsi/wayland: document lack of vkAcquireNextImageKHR timeout support
- egl: hard-code destroy function instead of passing it around as a
pointer
- gitlab-ci: add scons windows build using mingw
- gitlab-ci: merge several meson jobs
- gitlab-ci: meson-gallium-radeonsi was a subset of
meson-gallium-clover-llvm
- gitlab-ci: simplify meson job names
- gitlab-ci: merge meson-glvnd into meson-swr
- travis: fix syntax, and drop unused stuff
- util/os_file: always use the 'grow' mechanism
- meson: expose glapi through osmesa
- util/os_file: actually return the error read() gave us
Erico Nunes (5):
- lima/ppir: support ppir_op_ceil
- nir/algebraic: add lowering for fsign
- lima: enable nir fsign lowering in ppir
- lima/gpir: add limit of max 512 instructions
- lima/ppir: support nir_op_ftrunc
Erik Faye-Lund (79):
- mesa: expose NV_conditional_render on GLES
- st/mesa: remove unused header-file
- swr/codegen: fix autotools build
- virgl: remove unused variables
- virgl: remove unused variable
- virgl: remove unused variable
- virgl: remove unused variable
- virgl: do not allow compressed formats for buffers
- virgl: stricter usage of compressed 3d textures
- virgl: also destroy all read-transfers
- virgl: use debug_printf instead of fprintf
- virgl: unsigned int -> unsigned
- virgl: only warn about unchecked flags
- virgl: do not warn about display-target binding
- virgl: use debug_printf instead of fprintf
- virgl: remove pointless transfer-counter
- virgl: tmp_resource -> templ
- virgl: track full virgl_resource instead of just virgl_hw_res
- virgl: simplify virgl_texture_transfer_unmap logic
- virgl: make unmap queuing a bit more straight-forward
- virgl: check for readback on correct resource
- virgl: wait for the right resource
- virgl: return error if allocating resolve_tmp fails
- virgl: rewrite core of virgl_texture_transfer_map
- virgl: use pipe_box for blit dst-rect
- virgl: support write-back with staged transfers
- virgl: make sure bind is set for non-buffers
- gallium/util: support translating between uint and sint formats
- virgl: get readback-formats from host
- virgl: only blit if resource is read
- virgl: do color-conversion when mapping transfer
- virgl: document potentially failing blit
- mesa/st: remove impossible error-check
- gallium/u_vbuf: support NULL-resources
- i915: support NULL-resources
- nouveau: support NULL-resources
- swr: support NULL-resources
- mesa/st: accept NULL and empty buffer objects
- mesa/st: remove always-false state
- softpipe: setup pixel_offset for all primitive types
- docs: normalize css-indent style
- docs: remove non-existent css attribute
- docs: remove long commented out css
- docs: add missing semicolon
- docs: avoid repeating the font
- docs: avoid repeating the color
- docs: remove spurious newline
- docs: use multiple background-images for header
- docs: simplify css-centering
- docs: do not hard-code header-height
- docs: properly escape '>'
- docs: properly escape ampersand
- docs: remove stray paragraph-close
- docs: use h2 instead of b-tag for headings
- docs: use dl/dd instead of blockquote for freedesktop link
- docs: open list-item before closing it
- docs: close paragraphs before lists
- docs: close lists
- docs: remove stray paragraph-close
- docs: close paragraphs before preformatted text
- docs: start paragraph before closing it
- docs: drop paragraph around preformatted text
- docs: fix incorrectly closed paragraph
- docs: don't pointlessly close and re-start definition lists
- docs: remove stray list-start
- docs: fixup bad paragraphing
- docs: add missing lists
- docs: fix closing of paragraphs
- docs: fixup list-item tags
- docs: fix closing of list-items
- docs: replace empty list with a none-paragraph
- docs: turn faq-index into an ordered list
- docs: drop centered heading for faq
- docs: reorder heading and notice
- meson: lift driver-collection out into parent build-file
- meson: give dri- and gallium-drivers separate vars
- meson: add build-summary
- docs: fixup mistake in contents
- draw: flush when setting stream-out targets
Ernestas Kulik (2):
- vc4: Fix leak in HW queries error path
- v3d: Fix leak in resource setup error path
Francisco Jerez (6):
- intel/dump_gpu: Disambiguate between BOs from different GEM handle
spaces.
- intel/fs: Exclude control sources from execution type and region
alignment calculations.
- intel/fs: Lower integer multiply correctly when destination stride
equals 4.
- intel/fs: Cap dst-aligned region stride to maximum representable
hstride value.
- intel/fs: Implement extended strides greater than 4 for IR source
regions.
- intel/fs: Rely on undocumented unrestricted regioning for 32x16-bit
integer multiply.
Fritz Koenig (4):
- freedreno: pass count to query_dmabuf_modifiers
- freedreno/a6xx: UBWC support
- freedreno: UBWC allocator
- freedreno/a6xx: Enable UBWC modifier
Gert Wollny (35):
- mesa/core: Enable EXT_texture_sRGB_R8 also for desktop GL
- radeonsi: release tokens after creating the shader program
- mesa: release references to image textures when a context is
destroyed
- virgl: Enable mixed color FBO attachments only when the host
supports it
- mesa/core: Enable EXT_depth_clamp for GLES >= 2.0
- nir: Add possibility to not lower to source mod 'abs' for ops with
three sources
- mesa: Expose EXT_texture_query_lod and add support for its use in
shaders
- softpipe: Enable PIPE_CAP_MIXED_COLORBUFFER_FORMATS. It seems
softpipe actually supports this; the change lets further piglit
tests pass without regressions in the gpu test set.
- virgl: Add a caps feature check version
- softpipe: Implement ATOMFADD and enable cap TGSI_ATOMFADD
- virgl: define MAX_VERTEX_STREAMS based on availability of TF3
- softpipe: Use mag texture filter also for clamped lod == 0
- softpipe: Don't use mag filter for gather op
- softpipe: raise number of bits used for X coordinate texture lookup
- softpipe: Add an extra code path for the buffer texel lookup
- softpipe: Enable PIPE_CAP_TEXTURE_BUFFER_OFFSET_ALIGNMENT
- Gallium: Add new CAP that indicates whether IO array definitions can
be shrunk
- virgl: Enable passing arrays as input to fragment shaders
- doc/features: Add a few extensions to the feature matrix
- softpipe: Factor gradient evaluation out of the lambda evaluation
- softpipe: Prepare handling explicit gradients
- softpipe: Pipe gather_comp through from st_tgsi_get_samples
- softpipe: Move selection of shadow values up and clean parameter list
- softpipe: tie in new code path for lod evaluation
- softpipe: keep input lod for explicit derivatives
- softpipe: evaluate the cube faces on a per-sample basis
- softpipe: Factor out evaluation of the source indices
- softpipe: Add a per-input array for interpolator correctors to
machine
- softpipe: Add (fake) support for TGSI_OPCODE_INTERP_SAMPLE
- softpipe: Add support for TGSI_OPCODE_INTERP_OFFSET
- softpipe: Add support for TGSI_OPCODE_INTERP_CENTROID
- softpipe: Increase the GLSL feature level
- doc: Update feature matrix
- softpipe/buffer: load only as many components as the buffer
resource type provides
- Revert "softpipe/buffer: load only as many components as the
buffer resource type provides"
Greg V (3):
- util: emulate futex on FreeBSD using umtx
- gallium/hud: add CPU usage support for FreeBSD
- gallium: enable dmabuf on BSD as well
Grigori Goronzy (1):
- glx: add support for GLX_ARB_create_context_no_error (v3)
Guido Günther (4):
- docs: Fix 19.0.x version numbers
- gallium: ddebug: Add missing fence related wrappers
- gallium/u_dump: util_dump_sampler_view: Dump u.tex.first_level
- gallium: trace: Add missing fence related wrappers
Gurchetan Singh (44):
- mesa/main: Expose EXT_texture_compression_s3tc_srgb
- i965: Set flag for EXT_texture_compression_s3tc_srgb
- st/mesa: expose EXT_texture_compression_s3tc_srgb
- docs: add GL_EXT_texture_compression_s3tc_srgb to release notes
- virgl: add ability to do finer grain dirty tracking
- virgl: use virgl_resource_dirty helper
- virgl: don't mark unclean after a flush
- virgl: track level cleanliness rather than resource cleanliness
- virgl: make alignment smaller when uploading index user buffers
- virgl: unmap uploader at flush time
- virgl: when creating / freeing transfers, pass slab pool directly
- virgl: add protocol for resource transfers
- virgl: use virgl_transfer in inline write
- virgl: limit command length to 16 bits
- virgl: keep track of number of computations
- virgl: pass virgl transfer to virgl_res_needs_flush_wait
- virgl: add extra checks in virgl_res_needs_flush_wait
- virgl: make winsys modifications for encoded transfers
- virgl: add encoder functions for new protocol
- virgl: introduce transfer queue
- virgl: use transfer queue
- virgl: use virgl_transfer_inline_write even less
- virgl/vtest: deprecate protocol version 1
- egl/sl: also allow virtgpu to fallback to kms_swrast
- virgl: use uint16_t mask instead of separate booleans
- configure.ac / meson: depend on libnativewindow when appropriate
- anv: move anv_GetMemoryAndroidHardwareBufferANDROID up a bit
- anv: fix build on Nougat
- egl/android: move droid_image_loader_extension down a bit
- egl/android: move droid_open_device_drm_gralloc down a bit
- egl/android: droid_open_device_drm_gralloc --> droid_open_device
- egl/android: refactor droid_load_driver a bit
- egl/android: plumb swrast option
- egl/android: use swrast option in droid_load_driver
- egl/android: use software rendering when appropriate
- egl/android: choose node type based on swrast and preprocessor flags
- virgl: wait after a flush
- virgl/vtest: execute a transfer_get when flushing the front buffer
- virgl/vtest: add utilities for receiving fds
- virgl/vtest: plumb support for shared memory
- virgl/vtest: receive and handle shared memory fd
- virgl/vtest: modify sending and receiving data for shared memory
- virgl/vtest: wait after issuing a transfer get
- virgl/vtest: bump up protocol version + support encoded transfers
Guttula, Suresh (1):
- st/va: Add support for indirect manner by returning
VA_STATUS_ERROR_OPERATION_FAILED
Hal Gentz (1):
- glx: Fix synthetic error generation in \__glXSendError
Heinrich (1):
- gbm: Improve documentation of BO import
Iago Toral Quiroga (39):
- compiler/nir: add an is_conversion field to nir_op_info
- compiler/nir: add lowering option for 16-bit fmod
- compiler/nir: add lowering for 16-bit flrp
- compiler/nir: add lowering for 16-bit ldexp
- intel/compiler: add a NIR pass to lower conversions
- intel/compiler: split float to 64-bit opcodes from int to 64-bit
- intel/compiler: handle b2i/b2f with other integer conversion opcodes
- intel/compiler: assert restrictions on conversions to half-float
- intel/compiler: lower some 16-bit float operations to 32-bit
- intel/compiler: handle extended math restrictions for half-float
- intel/compiler: implement 16-bit fsign
- intel/compiler: drop unnecessary temporary from 32-bit fsign
implementation
- intel/compiler: add instruction setters for Src1Type and Src2Type.
- intel/compiler: add new half-float register type for 3-src
instructions
- intel/compiler: don't compact 3-src instructions with Src1Type or
Src2Type bits
- intel/compiler: allow half-float on 3-source instructions since gen8
- intel/compiler: set correct precision fields for 3-source float
instructions
- intel/compiler: fix ddx and ddy for 16-bit float
- intel/compiler: fix ddy for half-float in Broadwell
- intel/compiler: workaround for SIMD8 half-float MAD in gen8
- intel/compiler: split is_partial_write() into two variants
- intel/compiler: activate 16-bit bit-size lowerings also for 8-bit
- intel/compiler: rework conversion opcodes
- intel/compiler: ask for an integer type if requesting an 8-bit type
- intel/eu: force stride of 2 on NULL register for Byte instructions
- intel/compiler: generalize the combine constants pass
- intel/compiler: implement is_zero, is_one, is_negative_one for
8-bit/16-bit
- intel/compiler: add a brw_reg_type_is_integer helper
- intel/compiler: fix cmod propagation for non 32-bit types
- intel/compiler: remove inexact algebraic optimizations from the
backend
- intel/compiler: skip MAD algebraic optimization for half-float or
mixed mode
- intel/compiler: implement SIMD16 restrictions for mixed-float
instructions
- intel/compiler: also set F execution type for mixed float mode in BDW
- intel/compiler: validate region restrictions for half-float
conversions
- intel/compiler: validate conversions between 64-bit and 8-bit types
- intel/compiler: validate region restrictions for mixed float mode
- compiler/spirv: move the check for Int8 capability
- anv/pipeline: support Float16 and Int8 SPIR-V capabilities in gen8+
- anv/device: expose VK_KHR_shader_float16_int8 in gen8+
Ian Romanick (55):
- nir: Silence zillions of unused parameter warnings in release builds
- intel/compiler: Silence warning about value that may be used
uninitialized
- nir: Document some fields of nir_loop_terminator
- nir: Refactor code that checks phi nodes in opt_peel_loop_initial_if
- nir: Select phi nodes using prev_block instead of continue_block
- nir: Split ALU instructions in loops that read phis
- nir: Convert a bcsel with only phi node sources to a phi node
- spirv: Add missing break
- nir/algebraic: Convert some f2u to f2i
- nir/algebraic: Simplify comparison with sequential integers starting
with 0
- intel/vec4: Emit constants for some ALU sources as immediate values
- nir/algebraic: Replace i2b used by bcsel or if-statement with
comparison
- intel/fs: Relax type matching rules in cmod propagation from MOV
instructions
- intel/fs: Handle OR source modifiers in algebraic optimization
- intel/fs: Refactor ALU source and destination handling to a separate
function
- intel/fs: Emit logical-not of operands on Gen8+
- intel/fs: Use De Morgan's laws to avoid logical-not of a logic result
on Gen8+
- intel/fs: Emit better code for b2f(inot(a)) and b2i(inot(a))
- nir/algebraic: Replace a bcsel of a b2f sources with a b2f(!(a \|\|
b))
- intel/fs: Generate if instructions with inverted conditions
- nir/algebraic: Replace a-fract(a) with floor(a)
- intel/fs: Don't assert on b2f with a saturate modifier
- nir/algebraic: Optimize away an fsat of a b2f
- intel/compiler: Silence many unused parameter warnings in brw_eu.h
- intel/compiler: Silence unused parameter warning in
brw_interpolation_map.c
- intel/fs: nir_op_extract_i8 extracts a byte, not a word
- intel/fs: Fix extract_u8 of an odd byte from a 64-bit integer
- nir/algebraic: Fix up extract_[iu]8 after loop unrolling
- nir/algebraic: Remove redundant extract_[iu]8 patterns
- nir/algebraic: Add missing 64-bit extract_[iu]8 patterns
- nir/algebraic: Add missing 16-bit extract_[iu]8 patterns
- nir/algebraic: Fix up extract_[iu]8 after loop unrolling
- nir/algebraic: Remove redundant extract_[iu]8 patterns
- nir/algebraic: Add missing 64-bit extract_[iu]8 patterns
- nir/algebraic: Add missing 16-bit extract_[iu]8 patterns
- nir: Add nir_const_value_negative_equal
- nir: Add nir_alu_srcs_negative_equal
- nir: Add partial redundancy elimination for compares
- intel/compiler: Use partial redundancy elimination for compares
- intel/fs: Eliminate dead code first
- intel/fs: Refactor code generation for nir_op_fsign to its own
function
- intel/fs: Add a scale factor to emit_fsign
- intel/fs: Generate better code for fsign multiplied by a value
- nir/algebraic: Recognize open-coded copysign(1.0, a)
- nir/algebraic: Replace a pattern where iand with a Boolean is used as
a bcsel
- nir/algebraic: Fix some 1-bit Boolean weirdness
- nir/algebraic: Strength reduce some compares of x and -x
- intel/fs: Add support for float16 to the fsign optimizations
- glsl: Silence many unused parameter warnings in glsl/ir.h
- intel/compiler: Don't have separate, per-Gen nir_options
- intel/compiler: Lower ffma on Gen4 and Gen5
- intel/fs: Fix D to W conversion in opt_combine_constants
- mesa: Add missing display list support for GL_FOG_COORDINATE_SOURCE
- nir: Saturating integer arithmetic is not associative
- Revert "nir: add late opt to turn inot/b2f combos back to bcsel"
Icenowy Zheng (5):
- lima: add dummy set_sample_mask function
- lima: make lima_context_framebuffer subtype of pipe_framebuffer_state
- lima: implement blit with util_blitter
- lima: lower bool to float when building shaders
- lima: add Android build
Ilia Mirkin (14):
- nv50,nvc0: add explicit settings for recent caps
- nvc0: add support for handling indirect draws with attrib conversion
- nvc0/ir: always use CG mode for loads from atomic-only buffers
- nvc0/ir: fix second tex argument after levelZero optimization
- nvc0: fix 3d images on kepler
- nv50,nvc0: use condition for occlusion queries when already complete
- nvc0: stick zero values for the compute invocation counts
- nvc0: we have 16k-sized framebuffers, fix default scissors
- swr: set PIPE_CAP_MAX_VARYINGS correctly
- mesa: add explicit enable for EXT_float_blend, and error condition
- st/mesa: enable GL_EXT_float_blend when possible
- i965: always enable EXT_float_blend
- nv50: disable compute
- glsl: fix recording of variables for XFB in TCS shaders
Illia Iorin (1):
- mesa/main: Fix multisample texture initialize
James Zhu (12):
- gallium/auxiliary/vl: Move dirty define to header file
- gallium/auxiliary/vl: Split vl_compositor graphic shaders from
vl_compositor API
- gallium/auxiliary/vl: Rename csc_matrix and increase its size.
- gallium/auxiliary/vl: Add compute shader to support video compositor
render
- gallium/auxiliary/vl: Add video compositor compute shader render
- gallium/auxiliary/vl: Fix transparent issue on compute shader with
rgba
- gallium/auxiliary/vl: Increase shader_params size
- gallium/auxiliary/vl: Change grid setting
- gallium/auxiliary/vl: Change weave compute shader implementation
- gallium/auxiliary/vl: Fixed blur issue with weave compute shader
- gallium/auxiliary/vl: Fixed blank issue with compute shader
- gallium/auxiliary/vl: Add barrier/unbind after compute shader launch.
Jan Vesely (2):
- Partially revert "gallium: fix autotools build of pipe_msm.la"
- gallium/aux: Report error if loading of a pipe driver fails.
Jan Zielinski (1):
- swr/rast: fix 32-bit compilation on Linux
Jason Ekstrand (212):
- spirv: Replace vtn_constant_value with vtn_constant_uint
- spirv: Rework handling of spec constant workgroup size built-ins
- spirv: Handle constants and types before execution modes
- spirv: Handle OpExecutionModeId
- spirv: Support LocalSizeId and LocalSizeHintId execution modes
- intel/nir: Add global support to lower_mem_access_bit_sizes
- intel/fs/cse: Split create_copy_instr into three cases
- intel/fs: Properly handle 64-bit types in LOAD_PAYLOAD
- intel/fs: Do the grf127 hack on SIMD8 instructions in SIMD16 mode
- intel/fs: Implement load/store_global with A64 untyped messages
- intel/fs: Use SENDS for A64 writes on gen9+
- intel/fs: Implement nir_intrinsic_global_atomic\_\*
- anv: Implement VK_EXT_buffer_device_address
- relnotes: Add VK_EXT_buffer_device_address
- nir/deref: Drop zero ptr_as_array derefs
- README: Drop the badges from the readme
- intel/fs: Use enumerated array assignments in fb read TXF setup
- nir/deref: Rematerialize parents in
rematerialize_derefs_in_use_blocks
- nir: Silence a couple of warnings in release builds
- anv/blorp: Delete a pointless assert
- anv: Silence some compiler warnings in release builds
- intel/fs: Silence a compiler warning
- intel/fs: Bail in optimize_extract_to_float if we have modifiers
- nir/dead_cf: Inline cf_node_has_side_effects
- nir/dead_cf: Stop relying on liveness analysis
- compiler/types: Add a contains_64bit helper
- nir/xfb: Properly align 64-bit values
- nir: Rewrite lower_clip_cull_distance_arrays to do a lot less
lowering
- nir/xfb: Work in terms of components rather than slots
- nir/xfb: Handle compact arrays in gather_xfb_info
- nir: Fix a compile warning
- nir/lower_clip_cull: Fix an incorrect assert
- iris: Don't lower image formats for write-only images
- iris/compute: Don't increment the grid size offset
- iris/compute: Zero out the last grid size on indirect dispatches
- iris: Configure the L3$ on the compute context
- iris: Don't set constant read lengths at upload time
- iris: Allocate buffer resources separately
- iris: Copy anv's MI_MATH helpers for multiplication and division
- nir/split_vars: Don't compact vectors unnecessarily
- nir/builder: Don't emit no-op swizzles
- intel/eu: Add an EOT parameter to send_indirect_[split]_message
- intel/fs: Add an enum type for logical sampler inst sources
- intel/fs: Re-order logical surface arguments
- intel/fs: Drop the fs_surface_builder
- intel/vec4: Drop dead code for handling typed surface messages
- intel/fs: Get rid of the IMAGE_SIZE opcode
- intel/compiler: Drop unused surface opcodes
- intel/schedule_instructions: Move some comments
- intel/compiler: Re-prefix non-logical surface opcodes with VEC4
- anv: Count surfaces for non-YCbCr images in
GetDescriptorSetLayoutSupport
- spirv: OpImageQueryLod requires a sampler
- intel,nir: Lower TXD with min_lod when the sampler index is not < 16
- anv: Use an actual binding for gl_NumWorkgroups
- anv/pipeline: Drop anv_fill_binding_table
- anv/descriptor_set: Refactor alloc/free of descriptor sets
- anv: Rework arguments to anv_descriptor_set_write\_\*
- anv: Stop allocating buffer views for dynamic buffers
- anv: Count image param entries rather than images
- anv: Clean up descriptor set layouts
- anv: drop add_var_binding from anv_nir_apply_pipeline_layout.c
- anv: Refactor descriptor pushing a bit
- anv: Take references to push descriptor set layouts
- anv: Add a concept of a descriptor buffer
- spirv: Pull offset/stride from the pointer for OpArrayLength
- spirv: Use the generic dereference function for OpArrayLength
- spirv: Use the same types for resource indices as pointers
- anv: Implement VK_EXT_inline_uniform_block
- nir: Expose double and int64 op_to_options_mask helpers
- nir: Teach loop unrolling about 64-bit instruction lowering
- i965: Compile the fp64 program based on nir options
- intel/debug: Add a debug flag to force software fp64
- intel/nir: Drop an unneeded lower_constant_initializers call
- glsl/nir: Add a shared helper for building float64 shaders
- glsl/nir: Inline functions in float64_funcs_to_nir
- nir/inline_functions: Break inlining into a builder helper
- nir/deref: Expose nir_opt_deref_impl
- nir/lower_doubles: Inline functions directly in lower_doubles
- intel/nir: Move 64-bit lowering later
- st/nir: Move 64-bit lowering later
- nir/builder: Emit better code for iadd/imul_imm
- nir/builder: Cast array indices in build_deref_follower
- nir/builder: Add a build_deref_array_imm helper
- intel/nir: Move lower_mem_access_bit_sizes to postprocess_nir
- anv/pipeline: Move lower_explicit_io much later
- nir: Add a pass for lowering IO back to vector when possible
- intel/nir: Vectorize all IO
- anv: Ignore VkRenderPassInputAttachmentAspectCreateInfo
- nir/loop_unroll: Fix out-of-bounds access handling
- glsl/list: Add a list variant of insert_after
- glsl/lower_vector_derefs: Don't use a temporary for TCS outputs
- anv: Stop using VK_TRUE/FALSE
- anv/pass: Flag the need for a RT flush for resolve attachments
- anv: Only set 3DSTATE_PS::VectorMaskEnable on gen8+
- nir/algebraic: Add a couple optimizations for iabs and ishr
- nir/validate: Only require bare types to match for copy_deref
- nir/validate: Allow 32-bit boolean load/store intrinsics
- compiler/types: Add a new is_interface C wrapper
- compiler/types: Add a C wrapper to get full struct field data
- compiler/types: Add helpers to get explicit types for standard
  layouts
- nir/deref: Consider COHERENT decorated var derefs as aliasing
- nir: Rename nir_address_format_vk_index_offset to not be vk
- nir/lower_io: Add a new buffer_array_length intrinsic and lowering
- glsl: Don't lower vector derefs for SSBOs, UBOs, and shared
- glsl/nir: Set explicit types on UBO/SSBO variables
- glsl/nir: Handle unlowered SSBO atomic and array_length intrinsics
- glsl/nir: Add a pass to lower UBO and SSBO access
- i965: Stop setting LowerBufferInterfaceBlocks
- st/mesa: Let NIR lower UBO and SSBO access when we have it
- nir/builder: Add a vector extract helper
- nir: Add a new pass to lower array dereferences on vectors
- intel/nir: Lower array-deref-of-vector UBO and SSBO loads
- anv: Implement VK_EXT_host_query_reset
- anv,radv: Implement VK_KHR_surface_capability_protected
- Revert "nir: const \`nir_call_instr::callee`"
- anv: Bump maxComputeWorkgroupInvocations
- nir: Constant values are per-column not per-component
- anv,radv,turnip: Lower TG4 offsets with nir_lower_tex
- spirv: Drop inline tg4 lowering
- nir/lower_io: Add a bounds-checked 64-bit global address format
- nir: Add a lowering pass for non-uniform resource access
- nir: Add texture sources and intrinsics for bindless
- nir: Add access flags to deref and SSBO atomics
- spirv: Handle the NonUniformEXT decoration
- Revert "anv/radv: release memory allocated by glsl types during
  spirv_to_nir"
- nir: Lock around validation fail shader dumping
- nir/algebraic: Drop some @bool specifiers
- nir/algebraic: Add some logical OR and AND patterns
- vc4: Prefer nir_src_comp_as_uint over nir_src_as_const_value
- nir/search: Search for all combinations of commutative ops
- nir: Get rid of nir_register::is_packed
- nir: Get rid of global registers
- intel/common: Add a MI command builder
- intel/common: Add unit tests for gen_mi_builder
- anv: Use gen_mi_builder for CmdDrawIndirectByteCount
- anv: Use gen_mi_builder for computing resolve predicates
- anv: Use gen_mi_builder for indirect draw parameters
- anv: Use gen_mi_builder for indirect dispatch
- anv: Use gen_mi_builder for conditional rendering
- anv: Use gen_mi_builder for queries
- anv: Move mi_memcpy and mi_memset to gen_mi_builder
- anv/cmd_buffer: Use gen_mi_sub instead of gen_mi_add with a negative
- intel/common: Support bigger right-shifts with mi_builder
- anv/pipeline: Fix MEDIA_VFE_STATE::PerThreadScratchSpace on gen7
- nir: Add a pass for selectively lowering variables to scratch space
- intel/nir: Take a nir_tex_instr and src index in brw_texture_offset
- nir/builder: Add a nir_imm_zero helper
- nir/print: Use nir_src_as_int for array indices
- nir/constant_folding: Get rid of a bit size switch statement
- spirv: Drop some unneeded bit size switch statements
- nir/load_const_to_scalar: Get rid of a bit size switch statement
- nir/validate: Require unused bits of nir_const_value to be zero
- vulkan: Update the XML and headers to 1.1.106
- anv: Update to use the new features struct names
- nir/algebraic: Move the template closer to the render function
- nir/algebraic: Use a cache to avoid re-emitting structs
- intel/mi_builder: Re-order an initializer
- intel/mi_builder: Disable mem_mem tests on IVB
- nir: Drop "struct" from some nir\_\* declarations
- nir: Rework nir_src_as_alu_instr to not take a pointer
- nir: Add a nir_src_as_intrinsic() helper
- anv: Re-sort the GetPhysicalDeviceFeatures2 switch statement
- anv: Drop some unneeded ANV_FROM_HANDLE for physical devices
- intel/fs: Account for live range lengths in spill costs
- anv: Make all VkDeviceMemory BOs resident permanently
- anv: Put image params in the descriptor set buffer on gen8 and
  earlier
- anv: Add a #define for the max binding table size
- anv/pipeline: Sort bindings by most used first
- anv/pipeline: Add skeleton support for spilling to bindless
- nir/lower_io: Expose some explicit I/O lowering helpers
- intel/nir: Re-run int64 lowering in postprocess_nir
- anv: Add a has_a64_buffer_access to anv_physical_device
- anv: Lower some SSBO operations in apply_pipeline_layout
- anv: Implement SSBOs bindings with GPU addresses in the descriptor BO
- anv: Implement VK_KHR_shader_atomic_int64
- intel,nir: Lower TXD with a bindless sampler
- intel/fs: Add support for bindless texture ops
- anv: Count the number of planes in each descriptor binding
- anv: Use write_image_view to initialize immutable samplers
- anv: Pass the plane into lower_tex_deref
- anv: Use bindless textures and samplers
- intel/fs: Add support for bindless image load/store/atomic
- anv: Use bindless handles for images
- anv: Put binding flags in descriptor set layouts
- anv: Implement VK_EXT_descriptor_indexing
- nir: Add helpers for getting the type of an address format
- anv/nir: Add a central helper for figuring out SSBO address formats
- anv: Ignore descriptor binding flags if bindingCount == 0
- anv: Rework the descriptor set layout create loop
- anv,radv: Update release notes for newly implemented extensions
- nir: Use the NIR_SRC_AS\_ macro to define nir_src_as_deref
- anv/descriptor_set: Unlink sets from the pool in set_destroy
- anv/descriptor_set: Destroy sets before pool finalization
- anv/descriptor_set: Only vma_heap_finish if we have a descriptor
  buffer
- anv/descriptor_set: Properly align descriptor buffer to a page
- anv: Better handle 32-byte alignment of descriptor set buffers
- anv/descriptor_set: Don't fully destroy sets in pool destroy/reset
- nir/algebraic: Optimize integer cast-of-cast
- util/bitset: Return an actual bool from test macros
- anv: Stop including POS in FS input limits
- anv,i965: Stop warning about incomplete gen11 support
- nir: Add a SSA type gathering pass
- intel/fs/ra: Only add dest interference to sources that exist
- intel/fs/ra: Stop adding RA interference to too many SENDS nodes
- anv: Emulate texture swizzle in the shader when needed
- anv: Stop forcing bindless for images
- anv: Only consider minSampleShading when sampleShadingEnable is set
- iris: Don't assume UBO indices are constant
- intel/fs,vec4: Use g0 as the header for MFENCE
- intel/fs: Do a stalling MFENCE in endInvocationInterlock()
- nir/dead_cf: Call instructions aren't dead
- nir/propagate_invariant: Don't add NULL vars to the hash table

Jian-Hong Pan (1):

- intel: Fix the description of Coffeelake pci-id 0x3E98

Jiang, Sonny (1):

- va: use a compute shader for the blit

John Stultz (3):

- mesa: android: freedreno: Fix build failure due to path change
- mesa: Makefile.sources: Add
  ir3_nir_lower_load_barycentric_at_sample/offset to Makefile.sources
- mesa: Makefile.sources: Add nir_lower_fb_read.c to Makefile.sources
  list

Jon Turney (1):

- meson: Force '.so' extension for DRI drivers

Jonathan Marek (22):

- nir: add missing vec opcodes in lower_bool_to_float
- freedreno: a2xx: fix fast clear
- freedreno: a2xx: don't write 4th vertex in mem2gmem
- freedreno: a2xx: add use_hw_binning function
- freedreno: a2xx: fix fast clear for some gmem configurations
- freedreno: a2xx: fix mipmapping for NPOT textures
- freedreno: use renderonly path for buffers allocated with modifiers
- freedreno: catch failing fd_blit and fallback to software blit
- mesa: add GL_AMD_compressed_ATC_texture support
- gallium: add ATC format support
- llvmpipe, softpipe: no support for ATC textures
- st/mesa: add ATC support
- freedreno: a3xx: add GL_AMD_compressed_ATC_texture support
- freedreno: a2xx: add GL_AMD_compressed_ATC_texture support
- svga: add new ATC formats to the format conversion table
- freedreno: a2xx: fix builtin blit program compilation
- freedreno: a2xx: disable PIPE_CAP_PACKED_UNIFORMS
- freedreno: a2xx: use nir_lower_io for TGSI shaders
- freedreno: a2xx: enable batch reordering
- freedreno: a2xx: same gmem2mem sequence for all tiles
- nir: improve convert_yuv_to_rgb
- freedreno/ir3: fix input ncomp for vertex shaders

Jordan Justen (22):

- iris: Set num_uniforms in bytes
- iris/compute: Set mask bits on PIPELINE_SELECT
- iris: Add IRIS_DIRTY_CONSTANTS_CS
- iris: Add iris_restore_compute_saved_bos
- iris/compute: Add MEDIA_STATE_FLUSH following WALKER
- iris/compute: Flush compute batches
- iris/compute: Get group counts from grid->grid
- iris/program: Don't try to push ubo ranges for compute
- iris/compute: Wait on compute batch when mapping
- iris/compute: Provide binding table entry for gl_NumWorkGroups
- iris/compute: Flush compute batch on memory-barriers
- iris/compute: Push subgroup-id
- iris/compute: Support indirect compute dispatch
- iris: Emit default L3 config for the render pipeline
- genxml/gen_bits_header.py: Use regex to strip no alphanum chars
- genxml: Remove extra space in gen4/45/5 field name
- iris: Add gitlab-ci build testing
- iris: Always use in-tree i915_drm.h
- nir: Add int64/doubles options into nir_shader_compiler_options
- intel/compiler: Move int64/doubles lowering options
- scons: Generate float64_glsl.h for glsl_to_nir fp64 lowering
- intel/genxml: Support base-16 in value & start fields in
  gen_sort_tags.py

Jose Maria Casanova Crespo (4):

- iris: Enable ARB_shader_draw_parameters support
- glsl: fix typos in comments "transfor" -> "transform"
- glsl: TCS outputs can not be transform feedback candidates on GLES
- iris: setup EdgeFlag Vertex Element when needed.

José Fonseca (1):

- scons: Workaround failures with MSVC when using SCons 3.0.[2-4].

Juan A. Suarez Romero (22):

- anv/cmd_buffer: check for NULL framebuffer
- nir: move ALU instruction before the jump instruction
- nir: remove jump from two merging jump-ending blocks
- genxml: add missing field values for 3DSTATE_SF
- anv: advertise 8 subpixel precision bits
- nir/spirv: return after emitting a branch in block
- anv: destroy descriptor sets when pool gets reset
- nir: deref only for OpTypePointer
- anv: advertise 8 subtexel/mipmap precision bits
- nir/xfb: do not use bare interface type
- meson: Add dependency on genxml to anvil genfiles
- Revert "intel/compiler: split is_partial_write() into two variants"
- spirv: add missing SPV_EXT_descriptor_indexing capabilities
- radv: enable descriptor indexing capabilities
- anv: enable descriptor indexing capabilities
- Update version to 19.1.0-rc1
- Update version to 19.1.0-rc2
- cherry-ignore: radeonsi: update buffer descriptors in all contexts
  after buffer invalidation
- Update version to 19.1.0-rc3
- Update version to 19.1.0-rc4
- Update version to 19.1.0-rc5
- Update version to 19.1.0

Julien Isorce (5):

- gallium: add resource_get_info to pipe_screen
- radeonsi: implement resource_get_info
- st/va: properly set stride and offset in vlVaDeriveImage
- r600: implement resource_get_info
- st/va: check resource_get_info nullity in vlVaDeriveImage

Józef Kucia (3):

- mesa: Fix GL_NUM_DEVICE_UUIDS_EXT
- radv: Fix driverUUID
- radv: clear vertex bindings while resetting command buffer

Karol Herbst (82):

- nvc0/ir: replace cvt instructions with add to improve shader
  performance
- gk104/ir: Use the new rcp/rsq in library
- gm107/ir: add fp64 rcp
- gm107/ir: add fp64 rsq
- gallium: add PIPE_CAP_MAX_VARYINGS
- st/mesa: require RGBA2, RGB4, and RGBA4 to be renderable
- glsl_type: initialize offset and location to -1 for glsl_struct_field
- nir/opt_if: don't mark progress if nothing changes
- clover: update ICD table to support everything up to 2.2
- nir: replace magic numbers with M_PI
- nir/spirv: improve parsing of the memory model
- nir: add support for address bit sized system values
- nir/vtn: add support for SpvBuiltInGlobalLinearId
- nir/spirv: initial handling of OpenCL.std extension opcodes
- prog_to_nir: fix write from vps to FOG
- nvc0: print the shader type when dumping headers
- nv50/ir: move common converter code in base class
- nv50/ir: add lowering helper
- nouveau: add support for nir
- nouveau: fix nir and TGSI shader cache collision
- nv50/ir/nir: run some passes to make the conversion easier
- nv50/ir/nir: track defs and provide easy access functions
- nv50/ir/nir: add nir type helper functions
- nv50/ir/nir: run assignSlots
- nv50/ir/nir: add loadFrom and storeTo helper
- nv50/ir/nir: parse NIR shader info
- nv50/ir/nir: implement nir_load_const_instr
- nv50/ir/nir: add skeleton for nir_intrinsic_instr
- nv50/ir/nir: implement nir_alu_instr handling
- nv50/ir/nir: implement nir_intrinsic_load_uniform
- nv50/ir/nir: implement nir_intrinsic_store_(per_vertex\_)output
- nv50/ir/nir: implement load_(interpolated\_)input/output
- nv50/ir/nir: implement intrinsic_discard(_if)
- nv50/ir/nir: implement loading system values
- nv50/ir/nir: implement nir_ssa_undef_instr
- nv50/ir/nir: implement nir_instr_type_tex
- nv50/ir/nir: add skeleton getOperation for intrinsics
- nv50/ir/nir: implement vote and ballot
- nv50/ir/nir: implement variable indexing
- nv50/ir/nir: implement geometry shader nir_intrinsics
- nv50/ir/nir: implement nir_intrinsic_load_ubo
- nv50/ir/nir: implement ssbo intrinsics
- nv50/ir/nir: implement images
- nv50/ir/nir: add memory barriers
- nv50/ir/nir: implement load_per_vertex_output
- nv50/ir/nir: implement intrinsic shader_clock
- nv50/ir/nir: handle user clip planes for each emitted vertex
- nv50ir/nir: move immediates before use
- glsl: add packed for struct types
- glsl: add cl_size and cl_alignment
- nir/lower_locals_to_regs: cast array index to 32 bit
- nir/spirv: handle kernel function parameters
- nir/spirv: support physical pointers
- nir: add support for gather offsets
- nv50/ir/nir: support gather offsets
- nir/lower_tex: Add support for tg4 offsets lowering
- nir/print: fix printing the image_array intrinsic index
- nir/validate: validate that tex deref sources are actually derefs
- v3d: prefer using nir_src_comp_as_int over nir_src_as_const_value
- panfrost/midgard: use nir_src_is_const and nir_src_as_uint
- glsl/standalone: add GLES3.1 and GLES3.2 compatibility
- nir: move brw_nir_rewrite_image_intrinsic into common code
- glsl_to_nir: handle bindless textures
- glsl/nir: fetch the type for images from the deref instruction
- glsl/nir: add support for lowering bindless images_derefs
- nv50/ir/nir: handle bindless texture
- nv50/ir/nir: add support for bindless images
- nvc0/nir: enable bindless texture
- lima: add bool parameter to type_size function
- amd/nir: some cleanups
- radv: use nir constant helpers
- intel/nir: use nir_src_is_const and nir_src_as_uint
- freedreno/ir3: use nir_src_as_uint in a few places
- lima: use nir_src_as_float
- nir/builder: Move nir_imm_vec2 from blorp into the builder
- nir/loop_analyze: use nir_const_value.b for boolean results, not u32
- spirv: reduce array size in vtn_handle_constant
- nir: make nir_const_value scalar
- vtn: handle bitcast with pointer src/dest
- nir: Add a nir_builder_alu variant which takes an array of components
- nir: Add nir_op_vec helper
- spirv/cl: support vload/vstore

Kasireddy, Vivek (3):

- nir/lower_tex: Add support for XYUV lowering
- dri: Add XYUV8888 format
- i965: Add support for sampling from XYUV images

Kenneth Graunke (872):

- st/mesa: Set pipe_image_view::shader_access in PBO readpixels.
- st/nir: Move varying setup code to a helper function.
- st/nir: Make new helpers for constructing built-in NIR shaders.
- st/mesa: Add a NIR version of the drawpixels/bitmap VS copy shader.
- st/mesa: Add NIR versions of the drawpixels Z/stencil fragment
  shaders.
- st/mesa: Add NIR versions of the clear shaders.
- st/mesa: Add a NIR version of the OES_draw_texture built-in shaders.
- st/mesa: Add NIR versions of the PBO upload/download shaders.
- program: Use u_bit_scan64 in prog_to_nir.
- program: Extend prog_to_nir to handle system values.
- nir: Record info->fs.pixel_center_integer in lower_system_values
- compiler: Mark clip/cull distance arrays as compact before lowering.
- nir: Bail on clip/cull distance lowering if GLSL IR already did it.
- nir: Avoid clip/cull distance lowering multiple times.
- nir: Avoid splitting compact arrays into per-element variables.
- st/nir: Call nir_lower_clip_cull_distance_arrays().
- gallium: Add a PIPE_CAP_NIR_COMPACT_ARRAYS capability bit.
- nouveau: Silence unhandled cap warnings
- st/mesa: Limit GL_MAX_[NATIVE\_]PROGRAM_PARAMETERS_ARB to 2048
- glsl: Allow gl_nir_lower_samplers*() without a gl_shader_program
- glsl: Don't look at sampler uniform storage for internal vars
- i965: Call nir_lower_samplers for ARB programs.
- st/nir: Pull sampler lowering into a helper function.
- st/nir: Lower sampler derefs for builtin shaders.
- st/nir: Use sampler derefs in built-in shaders.
- program: Make prog_to_nir create texture/sampler derefs.
- nir: Use sampler derefs in drawpixels and bitmap lowering.
- nir: Gather texture bitmasks in gl_nir_lower_samplers_as_deref.
- i965: Drop unnecessary 'and' with prog->SamplerUnits
- i965: Use info->textures_used instead of prog->SamplersUsed.
- mesa: Advertise EXT_float_blend in ES 3.0+ contexts.
- anv: Put MOCS in the correct location
- spirv: Eliminate dead input/output variables after translation.
- nir: Don't reassociate add/mul chains containing only constants
- compiler: Make is_64bit(GL_*) helper more broadly available
- mesa: Align doubles to a 64-bit starting boundary, even if packing.
- radeonsi: Go back to using llvm.pow intrinsic for nir_op_fpow
- st/mesa: Copy VP TGSI tokens if they exist, even for NIR shaders.
- nir: Don't forget if-uses in new nir_opt_dead_cf liveness check
- iris: Initial commit of a new 'iris' driver for Intel Gen8+ GPUs.
- iris: viewport state, sort of
- iris: port over batchbuffer updates
- iris: initial render state upload
- iris: packing with valgrind.
- iris: merge pack
- iris: initial gpu state, merges
- iris: RASTER + SF + some CLIP, fix DIRTY vs. NEW
- iris: scissors
- iris: SF_CLIP_VIEWPORT
- iris: Surfaces!
- iris: sampler views
- iris: stipples and vertex elements
- iris: framebuffers
- iris: don't segfault on !old_cso
- iris: fix SF_CL length
- iris: a bit of depth
- iris: some draw info, vbs, sample mask
- iris: fix crash - CSO binding can be NULL (when destroying context)
- iris: COLOR_CALC_STATE
- iris: sampler states
- iris: emit 3DSTATE_SAMPLER_STATE_POINTERS
- iris: basic push constant alloc
- iris: some program code
- iris: linear resources
- iris: maps
- iris: shader debug log
- iris: drop unused field
- iris: make an ice->render_batch field
- iris: disable execbuf for now
- iris: delete iris_pipe.c, shuffle code around
- iris: init the batch!
- iris: fix/rework line stipple
- iris: actually save VBs
- iris: msaa sample count packing problems
- iris: fix prim type
- iris: fix bogus index buffer reference
- iris: draw->restart_index is uninitialized if PR is not enabled
- iris: parse INTEL_DEBUG
- iris: reworks, FS compile pieces
- iris: import program cache code
- iris: do the FS...asserts because we don't lower uniforms yet
- iris: lower io
- iris: make iris_batch target a particular ring
- iris: kill iris_new_batch
- iris: move MAX defines to iris_batch.h
- iris: bit of SBA code
- iris: flag SBA updates when instruction BO changes
- iris: try and have an iris address
- iris: so, sba then.
- iris: reference VB BOs
- iris: VB addresses
- iris: DEBUG=bat
- iris: VB fixes
- iris: actually APPEND commands, not stomp over the top and never incr
- iris: actually flush the commands
- iris: actually advance forward when emitting commands
- iris: initialize dirty bits to ~0ull
- iris: hack to stop crashing on samplers for now
- iris: fix indentation
- iris: fix assert
- iris: fix VBs
- iris: vertex packet fixes
- iris: fix VF instancing length so we don't get garbage in batch
- iris: 3DPRIMITIVE fields
- iris: bind_state -> compute state
- iris: scissor slots
- iris: some shader bits
- iris: promote iris_program_cache_item to iris_compiled_shader
- iris: actually save derived state
- iris: emit shader packets
- iris: convert IRIS_DIRTY\_\* to #defines
- iris: don't forget about TE
- iris: reorganize commands to match brw
- iris: initial gpu state
- iris: WM.
- iris: index buffer BO
- iris: more comes from bits filled in
- iris: drop const from prog data parameters
- iris: softpin some things
- iris: use vtbl to avoid multiple symbols, fix state base address
- iris: fix SBA
- iris: move key pop to state module
- iris: bits of WM key
- iris: shuffle comments
- iris: no NEW_SBA
- iris: rewrite program cache to use u_upload_mgr
- iris: actually destroy the cache
- iris: actually softpin at an address
- iris: actually set KSP offsets
- iris: URB configs.
- iris: dummy constants
- iris: blend state
- iris: alpha testing in PSB
- iris: basic SBE code
- iris: warning fixes
- iris: fix silly unused batch with addr macro
- iris: render targets!
- iris: don't do samplers for disabled stages
- iris: smaller blend state
- iris: actually pin the instruction cache buffers
- iris: compctrl
- iris: more sketchy SBE
- iris: fix dmabuf retval comparisons
- iris: more SF CL VPs
- iris: catastrophic state pointer mistake
- iris: fix extents
- iris: write DISABLES are not write ENABLES...whoops
- iris: sample mask...not 0.
- iris: uniform bits...badly
- iris: warn if execbuf fails
- iris: NOOP pad batches correctly
- iris: decode batches if they fail to submit
- iris: enable a few more formats
- iris: set strides on transfers
- iris: stop adding 9 to our varyings
- iris: bufmgr updates.
- iris: some thinking about binding tables
- iris: Soft-pin the universe
- iris: fix icache memzone
- iris: dump gtt offset in dump_validation_list
- iris: Also set SUPPORTS_48B? Not sure if necessary.
- iris: more uploaders
- iris: rewrite to use memzones and not relocs
- iris: set EXEC_OBJECT_WRITE
- iris: include p_defines.h in iris_bufmgr.h
- iris: binders
- iris: hook up batch decoder
- iris: binder fixes
- iris: decoder fixes
- iris: update vb BO handling now that we have softpin
- iris: validation dumping improvements
- iris: canonicalize addresses.
- iris: delete more trash
- iris: allocate SURFACE_STATEs up front and stop streaming them
- iris: same treatment for sampler views
- iris: assemble SAMPLER_STATE table at bind time
- iris: fix a scissor bug
- iris: SBA once at context creation, not per batch
- iris: TES stash
- iris: isv freeing fixes
- iris: set sampler views
- iris: decoder fixes
- iris: better BT asserts
- iris: increase allocator alignment
- iris: fix index
- iris: port bug fix from i965
- iris: fixes from i965
- iris: fixes
- iris: crazy pipe control code
- iris: bo reuse
- iris: vma fixes - don't free binder address
- iris: vma - fix assert
- iris: better SBE
- iris: fix texturing!
- iris: Move get_command_space to iris_batch.c
- iris: Defines for base addresses rather than numbers everywhere
- iris: pull in newer comments
- iris: copy over i965's cache tracking
- iris: move bo_offset_from_sba
- iris: bits of blorp code
- iris: more blitting code to make readpixels work
- iris: drop bogus binder free
- iris: fix sampler view crashes
- iris: more blorp
- iris: fix blorp prog data crashes
- iris: add INTEL_DEBUG=reemit
- iris: drop the 48b printout, we never use anything else
- iris: hacky flushing for now
- iris: linear staging buffers - fast CPU access...
- iris: make blorp pin the binder
- iris: blorp URB
- iris: no more drawing rectangle in blorp
- iris: assert surf init
- iris: some depth stuff :(
- iris: bump GL version to 4.2
- iris: uniforms for VS
- iris: proper length for VE packet?
- iris: proper # of uniforms
- iris: properly reject formats, fixes RGB32 rendering with texture
  float
- iris: blorp bug fixes
- iris: delete growing code and just die for now
- iris: just turn batch reset_and_clear_caches into reset
- iris: chaining not growing
- iris: caps
- iris: fix batch chaining...
- iris: fix decoding and undo testing code
- iris: Lower the max number of decoded VBO lines
- iris: fix whitespace
- iris: fix 3DSTATE_VERTEX_ELEMENTS length
- iris: more depth stuffs...
- iris: fix VF INSTANCING length
- iris: util_copy_framebuffer_state (ported from Rob's v3d patches)
- iris: transfers
- iris: flush always
- iris: maybe slightly less boats uniforms
- iris: fix constant packet length to match i965
- iris: better ubo handling
- iris: completely rewrite binder
- iris: have more than one const_offset
- iris: make surface states for cbufs
- iris: fill out pull constant buffers
- iris: fix pull bufs that aren't the first user upload
- iris: use u_transfer helpers for now
- iris: better VFI
- iris: fix release builds
- iris: drop assert for now
- iris: disable \__gen_validate_value in release mode
- iris: allow mapped buffers during execution (faster)
- iris: comment about reemitting and flushing
- iris: state cleaning
- iris: untested index buffer upload
- iris: delete some pointless STATIC_ASSERTS
- iris: untested SAMPLER_STATE pin BO fix
- iris: put back the always flush - fixes some things :(
- iris: save pointers to streamed state resources
- iris: fix the validation list on new batches
- iris: flag DIRTY_WM properly
- iris: bindings dirty tracking
- iris: some dirty fixes
- iris: clear dirty
- iris: plug leaks
- iris: more leak fixes
- iris: pc fixes
- iris: remove 4 bytes of padding in iris_compiled_shader
- iris: rzalloc iris_compiled_shader so memcmp works even if padding
  creeps in
- iris: don't leak sampler state table resources
- iris: don't leak keyboxes when searching for an existing program
- iris: indentation
- iris: use pipe resources not direct BOs
- iris: clean up some warnings so I can see through the noise
- iris: print binder utilization in INTEL_DEBUG=submit
- iris: redo VB CSO a bit
- iris: print refcounts in INTEL_DEBUG=submit
- iris: support signed vertex buffer offsets
- iris: fix major refcounting bug with resources
- iris: fix caps so tests run again
- iris: avoid crashing on unbound constant resources
- iris: emit 3DSTATE_SBE_SWIZ
- iris: max VP index
- iris: fix viewport counts and settings
- iris: fix num viewports to be based on programs
- iris: fix VP iteration
- iris: scissor count fixes
- iris: actually init num_viewports
- iris: print second batch size separately
- iris: don't always flush
- iris: Handle batch submission failure "better"
- iris: bad inherited comments
- iris: colorize batchbuffer failures to make them stand out
- iris: iris - fix QWord aligned endings after batch chaining rework
- iris: tidy comments about mirroring modes
- iris: Disable unsupported mirror clamp modes
- iris: fix fragcoord ytransform
- iris: better boxing on maps
- iris: clears
- iris: rework DEBUG_REEMIT
- iris: shader dirty bits
- iris: clear fix
- iris: fall back to u_generate_mipmap
- iris: implement copy image
- iris: lightmodel flat
- iris: maybe-flush before blorp operations
- iris: fix provoking vertex ordering
- iris: larger polygon offset
- iris: TES uniform fixes
- iris: geometry shader support
- iris: don't emit garbage 3DSTATE_VERTEX_BUFFERS when there aren't any
- iris: fix 3DSTATE_VERTEX_ELEMENTS / VF_INSTANCING for 0 elements
- iris: fix GS dispatch mode
- iris: depth clears
- iris: null surface for unbound textures
- iris: state ref tuple
- iris: don't include binder in surface VMA range
- iris: border color memory zone :(
- iris: implement border color, fix other sampler nonsense
- iris: dead pointer
- iris: just malloc one iris_genx_state instead of a bunch of oddball
  pieces
- iris: SBE change stash
- iris: fix zoffset asserts with 2DArray/Cube
- iris: rename map->stride
- iris: actually set cube bit properly
- iris: keep DISCARD_RANGE
- iris: actually handle array layers in blits
- iris: comment out l/a/i/la
- iris: fix clip flagging on fb changes
- iris: fix depth bounds clamp enables
- iris: don't crash on shader perf logs
- iris: slab allocate transfers
- iris: rearrange iris_resource.h
- iris: Implement 3DSTATE_SO_DECL_LIST
- iris: SO buffers
- iris: streamout
- iris: set even if no outputs
- iris: bother setting program_string_id...
- iris: fix SO_DECL_LIST
- iris: actually pin the buffers
- iris: fix sample mask for MSAA-off
- iris: disable 6x MSAA support
- iris: multislice transfer maps
- iris: fix CC_VIEWPORT
- iris: draw indirect support?
- iris: save query type
- iris: bits of multisample program key
- iris: s/hwcso/state/g
- iris: bind state helper function
- iris: NOS mechanics
- iris: record FS NOS
- iris: fix crash
- iris: fix sampler views of TBOs
- iris: fix texture buffer stride
- iris: TES program key inputs
- iris: compile a TCS...don't bother with passthrough yet
- iris: don't emit SO_BUFFERS and SO_DECL_LIST unless streamout is
  enabled
- iris: vertex ID, instance ID
- iris: fix SGVS when there are no valid vertex elements
- iris: fill out MAX_PATCH_VERTICES
- iris: assert about passthrough shaders to make this easier to detect
- iris: fix EmitNoIndirect
- iris: fix Z24
- iris: reemit blend state for alpha test function changes
- iris: point sprite enables
- iris: hack around samples confusion
- iris: fix blorp filters
- iris: expose more things that we already support
- iris: fix msaa flipping filters
- iris: export get_shader_info
- iris: implement set_shader_buffers
- iris: emit binding table for atomic counters and SSBOs
- iris: shorten loop
- iris: unbind compiled shaders if none are present
- iris: fix TBO alignment to match 965
- iris: enable SSBOs
- iris: fix SSBO indexing
- iris: fix for disabling ssbos
- iris: update bindings when changing programs
- iris: drop unused bo parameter
- iris: implement texture/memory barriers
- iris: Don't reserve new binding table section unless things are dirty
- iris: update a todo comment
- iris: BIG OL' HACK for UBO updates
- iris: enable texture gather
- iris: Avoid croaking when trying to create FBO surfaces with bad
  formats
- iris: fix GS output component limit
- iris: drop pipe_shader_state
- iris: fix sample mask
- iris: cube arrays are cubes too
- iris: we don't support textureGatherOffsets, need it lowered
- iris: add minor comments
- iris: comment everything
- iris: sync bugfixes from brw_bufmgr
- iris: remember to set bo->userptr
- iris: rename ring to engine
- iris: simplify batch len qword alignment
- iris: get angry about execbuf failures
- iris: fill out more caps
- iris: depth or stencil fixes
- iris: clear stencil
- iris: actually emit stencil packets
- iris: allow S8 as a stencil format
- iris: WTF transfers
- iris: use u_transfer_helper for depth stencil packing/unpacking
- iris: drop stencil handling now that u_transfer_helper does it
- iris: refcounting, who needs it?
- iris: actually do stencil blits
- iris: say no to more formats
- iris: deal with Marek's new MSAA caps
- iris: we can do multisample Z resolves
- iris: Convert RGBX to RGBA for rendering.
- iris: disallow RGB32 formats too
- iris: Fix tiled memcpy for cubes...and for array slices
- iris: blorp blit multiple slices
- iris: assert depth is 1 in resource_copy_region
- iris: call maybe_flush for each blorp operation
- iris: implement ARB_clear_texture
- iris: last VUE map NOS, handle > 16 FS inputs
- iris: drop dead assignments
- iris: drop pwrite
- iris: port non-bucket alignment bugfix
- iris: don't emit SBE all the time
- iris: rename pipe to base
- iris: Drop bogus sampler state saving
- iris: move iris_shader_state from ice->shaders.state to
  ice->state.shaders
- iris: Move things to iris_shader_state
- iris: Move iris_sampler_view declaration to iris_resource.h
- iris: track depth/stencil writes enabled
- iris: use consistent copyright formatting
- iris: Move cache tracking to iris_resolve.c
- iris: proper cache tracking
- iris: precompute hashes for cache tracking
- iris: Reduce binder alignment from 64 to 32
- iris: reenable R32G32B32 texture buffers
- iris: z_res -> s_res
- iris: implement get_sample_position
- iris: fix line-aa-width
- iris: try to hack around binder issue
- iris: fix sampler state setting
- iris: big old hack for tex-miplevel-selection
- iris: use linear for 1D textures
- iris: handle level/layer in direct maps
- iris: fix crash when binding optional shader for the first time
- iris: Skip primitive ID overrides if the shader wrote a custom value
- iris: fix blend state memcpy
- iris: new caps
- iris: use Eric's new caps helper
- iris: Allow inlining of require/get_command_space
- iris: skip over whole function if dirty == 0
- iris: don't unconditionally emit 3DSTATE_VF / 3DSTATE_VF_TOPOLOGY
- iris: fix constant buffer 0 to be absolute
- iris: set EXEC_OBJECT_CAPTURE on all driver internal buffers
- iris: fix null FB and unbound tex surface state addresses
- iris: Support multiple binder BOs, update Surface State Base Address
- iris: fix SO offset writes for multiple streams
- iris: update comments for multibinder
- iris: move binder pinning outside the dirty == 0 check
- iris: re-pin binding table contents if we didn't re-emit them
- iris: enable ARB_enhanced_layouts
- iris: refactor LRIs in context setup
- iris: initialize "don't suck" bits, as Ben likes to call them
- iris: totally untested icelake support
- iris: refactor program CSO stuff
- iris: silence const warning
- iris: fix context restore of 3DSTATE_CONSTANT ranges
- iris: properly re-pin stencil buffers
- iris: delete bogus comment
- iris: inherit the index buffer properly
- iris: use 0 for TCS passthrough program string ID
- iris: rw_bo for pipe controls
- iris: LRM/SRM/SDI hooks
- iris: initial query code
- iris: gen10+ workarounds and break fix
- iris: results write
- iris: flush batch when asking for result via QBO
- iris: fix random failures via CS stall...but why?
- iris: gpr0 to bool
- iris: play chicken with timer queries for now
- iris: pipeline stats
- iris: primitives generated query support
- iris: drop explicit pinning
- iris: timestamps
- iris: ...and SO prims emitted queries
- iris: glGet timestamps, more correct timestamps
- iris: Need to \| 1 when asking for timestamps
- iris: 36-bit overflow fixes
- iris: early return properly
- iris: better query file comment
- iris: magic number 36 -> #define
- iris: Enable ARB_shader_vote
- iris: just mark snapshots_landed from the CPU
- iris: drop a bunch of pipe_sampler_state stuff we don't need
- iris: vma_free bo->size, not bo_size
- iris: don't mark contains_draw = false when chaining batches
- iris: fix Z32_S8 depth sampling
- iris: stencil texturing
- iris: force persample interp cap
- iris: pipe to scs -> iris_pipe.h
- iris: inline stage_from_pipe to avoid unused warnings
- iris: add gen11 to genX_call
- iris: Allow PIPE_CONTROL with Stall at Scoreboard and RT flush
- iris: rework format translation apis
- iris: Use R/RG instead of I/L/A when sampling
- iris: enable I/L formats
- iris: X32_S8X24 :/
- iris: set the binding table size
- iris: lower storage image derefs
- iris: implement set_shader_images hook
- iris: bother with BTIs
- iris: set image access correctly
- iris: actually set image access
- iris: null for non-existent cbufs
- iris: move images next to textures in binding table
- iris: advertise GL_ARB_shader_texture_image_samples
- iris: Enable fb fetch
- iris: initial compute caps
- iris: yes
- iris: drop dead format //'s
- iris: drop XXX's about swizzling
- iris: little bits of compute basics
- iris: drop XXX that Jordan handled
- iris: drop unnecessary #ifdefs
- iris: leave XXX about unnecessary binding table uploads
- iris: bail if SLM is needed
- iris: fix whitespace
- iris: XXX for compute state tracking :/
- iris: rewrite grid surface handling
- iris: better dirty checking
- iris: don't let render/compute contexts stomp each other's dirty bits
- iris: hack to avoid memorybarriers out the wazoo
- iris: do PIPELINE_SELECT for render engine, add flushes, GLK hacks
- iris: fix SBA flushing by refactoring code
- iris: try and avoid pointless compute submissions
- iris: fix UBOs with bindings that have an offset
- iris: flag CC_VIEWPORT when changing num viewports
- iris: fix SF_CLIP_VIEWPORT array indexing with multiple VPs
- iris: Fix texture buffer / image buffer sizes.
- iris: Clamp UBO and SSBO access to the actual BO size, for safety
- iris: Move snapshots_landed to the front.
- iris: Fix off by one in scissoring, empty scissors, default scissors
- iris: Fall back to 1x1x1 null surface if no framebuffer supplied
- iris: SO_DECL_LIST fix
- iris: Fix refcounting of grid surface
- iris: delete dead code
- iris: fix overhead regression from "don't stomp each other's dirty
  bits"
- iris: allow binding a null vertex buffer
- iris: Flag constants dirty on program changes
- iris: Disable a PIPE_CONTROL workaround on Icelake
- iris: Enable ARB_shader_stencil_export
- iris: Enable A8/A16_UNORM in an inefficient manner
- iris: Drop B5G5R5X1 support
- iris: Use at least 1x1 size for null FB surface state.
- iris: Cross-link iris_batches so they can potentially flush each
  other
- iris: cross batch flushing
- iris: Don't leak the compute batch
- iris: Actually create/destroy HW contexts
- iris: Enable msaa_map transfer helpers
- iris: tidy more warnings
- iris: implement scratch space!
- iris: Fix MSAA smooth points
- iris: Fix TextureBarrier
- iris: Fix multiple RTs with non-independent blending
- iris: partial set_query_active_state
- iris: Print the batch name when decoding
- iris: Clone the NIR
- iris: Defer cbuf0 upload to draw time
- iris: drop unnecessary param[] setup from iris_setup_uniforms
- iris: add param domain defines
- iris: fill out params array with built-ins, like clip planes
- iris: only bother with params if there are any...
- iris: lower user clip planes
- iris: hook up key stuff for clip plane lowering
- iris: fix system value remapping
- iris: dodge backend UCP lowering
- iris: bypass params and do it ourselves
- iris: actually upload clip planes.
- iris: fix num clip plane consts
- iris: fix more uniform setup
- iris: drop iris_setup_push_uniform_range
- iris: enable push constants if we have sysvals but no uniforms
- iris: regather info so we get CLIP_DIST slots, not CLIP_VERTEX
- iris: don't support pull constants.
- iris: don't trip on param asserts
- iris: drop param stuffs
- iris: don't forget to upload CS consts
- iris: fix sysval only binding tables
- iris: only clip lower if there's something to clip against
- iris: leave another TODO
- iris: Fix SourceAlphaBlendFactor
- iris: "Fix" transfer maps of buffers
- iris: Fix independent alpha blending.
- iris: more TODO
- iris: scissored and mirrored blits
- iris: more todo notes
- iris: Fix TCS/TES slot unification
- iris: properly pin stencil buffers
- iris: Fix SLM
- iris: Use iris_use_pinned_bo rather than add_exec_bo directly
- iris: Combine iris_use_pinned_bo and add_exec_bo
- iris: Avoid cross-batch synchronization on read/reads
- iris: Avoid synchronizing due to the workaround BO
- iris: replace vestiges of fence fds with newer exec_fence API
- iris: Drop vestiges of throttling code
- iris: Hang on to the last batch's sync-point, so we can wait on it
- iris: Add wait fences to properly sync between render/compute
- iris: leave a TODO
- iris: flush the compute batch too if border pool is redone
- iris: put render batch first in fence code
- iris: Put batches in an array
- iris: PIPE_CONTROL workarounds for GPGPU mode
- iris: RT flush for memorybarrier with texture bit
- iris: update comment
- iris: Enable ctx->Const.UseSTD430AsDefaultPacking
- iris: Lie about indirects
- iris: Fix buffer -> buffer copy_region
- iris: Fix VIEWPORT/LAYER in stream output info
- iris: Do the 48-bit vertex buffer address invalidation workaround
- iris: drop long dead XXX comment
- iris: Track a binding history for buffer resources
- iris: add iris_flush_and_dirty_for_history
- iris: Flush for history at various moments
- iris: Re-pin even if nothing is dirty
- iris: fix prototype warning
- iris: export iris_upload_shader
- iris: fix comment location
- iris: Use wrappers for create_xs_state rather than a switch statement
- iris: rework program cache interface
- iris: Enable precompiles
- iris: Use program's num textures not the state tracker's bound
- iris: drop pull constant binding table entry
- iris: add assertions about binding table starts
- iris: add an extra BT assert from Chris Wilson
- iris: actually flush for storage images
- iris: fix some SO overflow query bugs and tidy the code a bit
- iris: drop key_size_for_cache
- iris: for BLORP, only use the predicate enable bit when USE_BIT
- iris: check query first
- iris: fix conditional compute, don't stomp predicate for pipelined
  queries
- iris: Rework tiling/modifiers handling
- iris: Fix failed to compile TCS message
- iris: Destroy transfer helper on screen teardown
- iris: Destroy the border color pool
- iris: Unref unbound_tex resource
- iris: Fix IRIS_MEMZONE_COUNT to exclude the border color pool
- iris: Destroy the bufmgr
- iris: Stop leaking iris_uncompiled_shaders like mad
- iris: move some non-buffer case code in a bit
- iris: Don't bother considering if the underlying surface is a cube
- iris: fix alpha channel for RGB BC1 formats
- iris: fix dma buf import strides
- iris: CS stall for stream out -> VB
- iris: make clipper statistics dynamic
- iris: reject all clipping when we can't use streamout render disabled
- iris: omask can kill
- iris: reemit SBE when sprite coord origin changes
- iris: re-pin inherited streamout buffers
- iris: Fix NOS mechanism
- iris: fix overhead regression from flushing for storage images
- iris: fix set_sampler_views to not unbind, be better about bounds
- iris: Fix set_sampler_views with start > 0
- iris: Replace num_textures etc with a bitmask we can scan
- iris: Drop continues in resolve
- iris: Fix clear dimensions
- iris: Clamp viewport extents to the framebuffer dimensions
- iris: Enable guardband clipping
- iris: Fix primitive generated query active flag
- iris: Always do rasterizer discard in clipper
- iris: override alpha to one src1 blend factors
- iris: handle PatchVerticesIn as a system value.
- iris: rewrite set_vertex_buffer and VB handling
- iris: Reorder LRR parameters to have dst first.
- iris: Add \_MI_ALU helpers that don't paste
- iris: Don't bother packing 3DSTATE_SO_BUFFER at create time
- iris: Move iris_stream_output_target def to iris_context.h
- iris: only get space for one offset in stream output targets
- iris: Implement DrawTransformFeedback()
- iris: drop unnecessary genx->streamout field
- iris: Fix for PIPE_CAP_SIGNED_VERTEX_BUFFER_OFFSET
- iris: Fix the prototype for iris_bo_alloc_tiled
- iris: don't print the pointer in INTEL_DEBUG=submit
- iris: Use a surface state fill helper
- iris: Make a alloc_surface_state helper
- iris: whitespace fixes
- iris: Track blend enables, save outbound for resolve code
- iris: always pin the binder...in the compute context, too.
- iris: delete finished comments
- iris: pin and re-pin the scratch BO
- iris: more dead comments
- iris: only mark depth/stencil as writable if writes are actually
  enabled
- iris: better MOCS
- iris: Fix scratch space allocation on Icelake.
- iris: Only resolve inputs for actual shader stages
- iris: Add a more long term TODO about timebase scaling
- iris: Fix compute scratch pinning
- iris: Delete bogus comment about cube array counting.
- iris: Fix framebuffer layer count
- iris: Don't enable push constants just because there are system
  values
- iris: Don't make duplicate system values
- iris: Fill out brw_image_params for storage images on Broadwell
- iris: Fix surface states for Gen8 lowered-to-untype images
- iris: Leave a comment about why Broadwell images are broken
- iris: Implement multi-slice copy_region
- iris: Flush the render cache in flush_and_dirty_for_history
- iris: Handle PIPE_TRANSFER_DISCARD_WHOLE_RESOURCE somewhat
- iris: Don't check other batches for our batch BO
- iris: Drop a dead comment
- iris: Delete genx->bound_vertex_buffers
- iris: Fix Broadwell WaDividePSInvocationCountBy4
- iris: Use new PIPE_STAT_QUERY enums rather than hardcoded numbers.
- iris: Switch to the new PIPELINE_STATISTICS_QUERY_SINGLE capability
- iris: fail to create screen for older unsupported HW
- iris: Allow sample mask of 0
- iris: Don't enable smooth points when point sprites are enabled
- iris: Assert about blits with color masking
- iris: Pay attention to blit masks
- iris: CS stall on VF cache invalidate workarounds
- iris: Fix SO issue with INTEL_DEBUG=reemit, set fewer bits
- iris: Don't whack SO dirty bits when finishing a BLORP op
- iris: Fix memzone_for_address for the surface and binder zones
- iris: Do binder address allocations per-context, not globally.
- iris: Zero the compute predicate when changing the render condition
- iris: Remap stream output indexes back to VARYING_SLOT_*.
- iris: Enable PIPE_CAP_COMPACT_ARRAYS
- iris: Drop comment about ISP_DIS
- iris: Drop dead state_size hash table
- iris: Unreference some more things on state module teardown
- iris: minor tidying
- iris: Fix bug in bound vertex buffer tracking
- iris: Implement ALT mode for ARB_{vertex,fragment}_shader
- iris: Add a timeout_nsec parameter, rename check_syncpt to
  wait_syncpt
- iris: Fix accidental busy-looping in query waits
- iris: Use READ_ONCE and WRITE_ONCE for snapshots_landed
- iris: Make a iris_batch_reference_signal_syncpt helper function.
- iris: Add PIPE_CAP_MAX_VARYINGS
- iris: rework num textures to util_lastbit
- iris: Stop chopping off the first nine characters of the renderer
  string
- iris: Drop XXX about alpha testing
- iris: Set 3DSTATE_WM::ForceThreadDispatchEnable
- iris: Set HasWriteableRT correctly
- iris: Drop XXX about checking for swizzling
- iris: Move create and bind driver hooks to the end of iris_program.c
- iris: Make an IRIS_MAX_MIPLEVELS define
- iris: Simplify iris_get_depth_stencil_resources
- iris: Add missing depth cache flushes
- iris: Always emit at least one BLEND_STATE
- iris: Add iris_resource fields for aux surfaces
- iris: Fill out res->aux.possible_usages
- iris: Fill out SURFACE_STATE entries for each possible aux usage
- iris: create aux surface if needed
- iris: Initial import of resolve code
- iris: blorp using resolve hooks
- iris: add some draw resolve hooks
- iris: actually use the multiple surf states for aux modes
- iris: try to fix copyimage vs copybuffers
- iris: be sure to skip buffers in resolve code
- iris: resolve before transfer maps
- iris: pin the buffers
- iris: store modifier info in res
- iris: Make blit code use actual aux usages
- iris: consider framebuffer parameter for aux usages
- iris: Resolves for compute
- iris: disable aux for external things
- iris: some initial HiZ bits
- iris: don't use hiz for MSAA buffers
- iris: Set program key fields for MCS
- iris: make surface states for CCS_D too
- iris: do flush for buffers still
- iris: Allow disabling aux via INTEL_DEBUG options
- iris: Fix aux usage in render resolve code
- iris: Only resolve compute resources for compute shaders
- iris: Enable auxiliary buffer support
- iris: Enable -msse2 and -mstackrealign
- Revert "iris: Enable auxiliary buffer support"
- vulkan: Fix 32-bit build for the new overlay layer
- mesa: Fix RGBBuffers for renderbuffers with sized internal formats
- iris: Drop RGBX -> RGBA for storage image usages
- iris: Properly allow rendering to RGBX formats.
- i965: Implement threaded GL support.
- tgsi_to_nir: use sampler variables and derefs
- iris: Fix MOCS for blits and clears
- isl: Add a swizzle parameter to isl_buffer_fill_state()
- iris: Plumb through ISL_SWIZZLE_IDENTITY in buffer surface emitters
- iris: Defer uploading sampler state tables until draw time
- iris: Properly support alpha and luminance-alpha formats
- iris: Drop PIPE_CAP_BUFFER_SAMPLER_VIEW_RGBA_ONLY
- iris: Spruce up "are we using this engine?" checks for flushing
- iris: Export a copy_region helper that doesn't flush
- iris: Use copy_region and staging resources to avoid transfer stalls
- Revert MR 369 (Fix extract_i8 and extract_u8 for 64-bit integers)
- iris: Fix backface stencil write condition
- iris: Rework default tessellation level uploads
- iris: Fix TES gl_PatchVerticesIn handling.
- iris: Move depth/stencil flushes so they actually do something
- iris: Refactor depth/stencil buffer pinning into a helper.
- iris: Fix write enable in pinning of depth/stencil resources
- i965: Move some genX infrastructure to genX_boilerplate.h.
- i965: Rename ISP_DIS to INDIRECT_STATE_POINTERS_DISABLE.
- i965: Use genxml for emitting PIPE_CONTROL.
- i965: Reimplement all the PIPE_CONTROL rules.
- intel/fs: Fix opt_peephole_csel to not throw away saturates.
- iris: Don't mutate box in transfer map code
- iris: Don't flush the batch for unsynchronized mappings
- iris: Slightly better bounds on buffer sizes
- gallium: Add PIPE_BARRIER_UPDATE_BUFFER and UPDATE_TEXTURE bits.
- nvc0: Skip new update barrier bits
- nir: Record non-vector/scalar varyings as unmovable when compacting
- iris: Fix util_vma_heap_init size for IRIS_MEMZONE_SHADER
- iris: Skip input resolve handling if bindings haven't changed
- iris: Skip framebuffer resolve tracking if framebuffer isn't dirty
- iris: Skip resolves and flushes altogether if unnecessary
- iris: Fix batch chaining map_next increment.
- iris: Actually advertise some modifiers
- st/nir: Free the GLSL IR after linking.
- st/mesa: Fix blitting from GL_DEPTH_STENCIL to GL_STENCIL_INDEX
- iris: Fix blits with S8_UINT destination
- iris: Print the memzone name when allocating BOs with INTEL_DEBUG=buf
- iris: Save/restore MI_PREDICATE_RESULT, not MI_PREDICATE_DATA.
- iris: Silence unused variable warnings in release mode
- gallium/util: Add const to u_range_intersect
- iris: Actually pin the scratch BO.
- glsl: Set location on structure-split sampler uniform variables
- intel: Emit 3DSTATE_VF_STATISTICS dynamically
- iris: Actually mark blorp_copy_buffer destinations as written.
- iris: Preserve all PIPE_TRANSFER flags in xfer->usage
- iris: Fix FLUSH_EXPLICIT handling with staging buffers.
- iris: Make shader_perf_log print to stderr if INTEL_DEBUG=perf is set
- i965: Move program key debugging to the compiler.
- iris: Print the reason for shader recompiles.
- iris: Move iris_debug_recompile calls before uploading.
- iris: Change vendor and renderer strings
- iris: Add texture cache flushing hacks for blit and
  resource_copy_region
- iris: Be less aggressive at postdraw work skipping
- iris: Add mechanism for iris-specific driconf options
- iris: Enable the dual_color_blend_by_location driconf option.
- iris: Track bound and writable SSBOs
- Revert "glsl: Set location on structure-split sampler uniform
  variables"
- i965: Ignore uniform storage for samplers or images, use binding info
- i965: Tidy bogus indentation left by previous commit
- iris: Mark constants dirty on transfer unmap even if no flushes occur
- iris: Track bound constant buffers
- iris: Rework UBOs and SSBOs to use pipe_shader_buffer
- iris: Rework image views to store pipe_image_view.
- iris: Make a gl_shader_stage -> pipe_shader_stage helper function
- iris: Make memzone_for_address non-static
- iris: Replace buffer backing storage and rebind to update addresses.
- iris: Make a resource_is_busy() helper
- iris: Track valid data range and infer unsynchronized mappings.
- iris: Make some offset math helpers take a const isl_surf pointer
- iris: Fix DrawTransformFeedback math when there's a buffer offset
- iris: Prefer staging blits when destination supports CCS_E.
- iris: Actually put Mesa in GL_RENDERER string
- iris: Split iris_flush_and_dirty_for_history into two helpers.
- iris: Enable GL_AMD_depth_clamp_separate
- iris: Advertise EXT_texture_sRGB_R8 support
- iris: Some tidying for preemption support
- iris: Silence unused function warning
- iris: Fix zeroing of transform feedback offsets in strange cases.
- glsl/list: Add an exec_list_is_singular() helper.
- nir: Add a new nir_cf_list_is_empty_block() helper.
- intel/fs: Don't emit empty ELSE blocks.
- iris: Set XY Clipping correctly.
- iris: Only enable GL_AMD_depth_clamp_separate on Gen9+
- iris: Fix imageBuffer and PBO download.
- iris: Disable dual source blending when shader doesn't handle it
- iris: Resolve textures used by the program, not merely bound textures
- iris: Fix 4GB memory zone heap sizes.
- iris: leave the top 4Gb of the high heap VMA unused
- iris: Force VMA alignment to be a multiple of the page size.
- iris: Delete bucketing allocators
- i965: Fix BRW_MEMZONE_LOW_4G heap size.
- i965: Force VMA alignment to be a multiple of the page size.
- i965: leave the top 4Gb of the high heap VMA unused
- i965: Fix memory leaks in brw_upload_cs_work_groups_surface().
- iris: Use full ways for L3 cache setup on Icelake.
- egl/x11: calloc dri2_surf so it's properly zeroed

Kevin Strasser (1):

- egl/dri: Avoid out of bounds array access

Khaled Emara (1):

- freedreno: PIPE_CAP_SHADER_BUFFER_OFFSET_ALIGNMENT unreachable
  statement

Khem Raj (1):

- winsys/svga/drm: Include sys/types.h

Kishore Kadiyala (1):

- android: static link with libexpat with Android O+

Konstantin Kharlamov (1):

- mapi: work around GCC LTO dropping assembly-defined functions

Kristian Høgsberg (49):

- st/nir: Use src/ relative include path for autotools
- freedreno/a6xx: Emit blitter dst with OUT_RELOCW
- freedreno/a6xx: Use tiling for all resources
- freedreno/a6xx: regen headers
- freedreno/a6xx: Drop render condition check in blitter
- freedreno: Log number of draw for sysmem passes
- freedreno/a6xx: Use the right resource for separate stencil stride
- freedreno/a6xx: Combine emit_blit and fd6_blit
- freedreno: Consolidate u_blitter functions in freedreno_blitter.c
- freedreno: Don't tell the blitter what it can't do
- freedreno/a6xx: Move blit check so as to restore comment
- freedreno/a6xx: Support some depth/stencil blits on blitter
- freedreno/a6xx: Support y-inverted blits
- freedreno/a6xx: Add format argument to fd6_tex_swiz()
- freedreno/a6xx: Fall back to masked RGBA blits for depth/stencil
- freedreno/a6xx: Clean up mixed use of swap and swizzle for texture
  state
- freedreno/a6xx: Update headers
- freedreno/a6xx: Front facing needs UNK3 bit
- freedreno/a6xx: Fix point coord
- .mailmap: Add a few more alises for myself
- freedreno: Update headers
- freedreno/a6xx: Copy stencil as R8_UINT
- freedreno/a6xx: Support MSAA resolve blits on blitter
- freedreno/a6xx: Only output MRT control for used framebuffers
- freedreno/a6xx: Don't zero SO buffer addresses
- freedreno: Fix a couple of warnings
- turnip: Only get bo offset when we need to mmap
- freedreno: Use c_vis_args and no_override_init_args
- freedreno/a6xx: Remove extra parens
- freedreno/ir3: Track whether shader needs derivatives
- freedreno/ir3: Fix operand order for DSX/DSY
- st/glsl_to_nir: Calculate num_uniforms from NumParameterValues
- freedreno/ir3: Enable PIPE_CAP_PACKED_UNIFORMS
- freedreno/ir3: Push UBOs to constant file
- freedreno/ir3: Don't access beyond available regs
- freedreno/ir3: Add workaround for VS samgq
- freedreno/ir3: Mark ir3_context_error() as NORETURN
- freedreno/a2xx: Fix redundant if statement
- freedreno: Use enum values from matching enum
- freedreno/a6xx: Add helper for incrementing regid
- freedreno: Fix format string warning
- .gitignore: Remove autotool artifacts
- tgsi: Mark tgsi_strings_check() unused
- glsl_to_nir: Initialize debug variable
- nir_opcodes.py: Saturate to expression that doesn't overflow
- ralloc: Fully qualify non-virtual destructor call
- egl/dri2: Mark potentially unused 'display' variable with
  MAYBE_UNUSED
- gallium/auxiliary/vl: Fix a couple of warnings
- freedreno/drm: Quiet pointer to u64 conversion warning

Leo Liu (6):

- st/va: fix the incorrect max profiles report
- st/va/vp9: set max reference as default of VP9 reference number
- vl/dri3: remove the wait before getting back buffer
- radeon/vcn: add H.264 constrained baseline support
- radeon/vcn/vp9: search the render target from the whole list
- winsys/amdgpu: add VCN JPEG to no user fence group

Lepton Wu (2):

- virgl: close drm fd when destroying virgl screen.
- virgl: Set bind when creating temp resource.

Lionel Landwerlin (127):

- anv: assert that color attachment are valid
- radv: assert that colorAttachment is valid for CmdClearAttachment
- i965: scale factor changes should trigger recompile
- vulkan: Update the XML and headers to 1.1.101
- anv: implement VK_EXT_depth_clip_enable
- build: move imgui out of src/intel/tools to be reused
- imgui: bump copy
- imgui: make sure our copy of imgui doesn't clash with others in the
  same process
- vulkan: add an overlay layer
- intel: fix urb size for CFL GT1
- anv: add support for INTEL_DEBUG=bat
- Revert "anv: add support for INTEL_DEBUG=bat"
- intel/aub_viewer: printout 48bits addresses
- intel/aub_viewer: silence compiler warning
- intel/aub_viewer: silence more compiler warnings
- vulkan/overlay: fix missing installation of layer
- vulkan/overlay: fix includes
- imgui: update commit
- imgui: update memory editor
- vulkan/overlay: install layer binary in libdir
- intel/compiler: use correct swizzle for replacement
- vulkan/overlay: fix min/max computations
- vulkan/overlay: rework option parsing
- vulkan/overlay: add support for fps output in file
- anv: add support for INTEL_DEBUG=bat
- vulkan: update headers/registry to 1.1.102
- anv: update supported patch version
- radv: set num_components on vulkan_resource_index intrinsic
- vulkan/util: make header available from c++
- vulkan/util: generate instance/device dispatch tables
- vulkan/overlay: drop dependency on validation layer headers
- intel/decoders: add address space indicator to get BOs
- intel/decoders: handle decoding MI_BBS from ring
- intel/decoders: limit number of decoded batchbuffers
- intel/aub_read: reuse defines from gen_context
- intel/aub_write: split comment section from HW setup
- intel/aub_write: write header in init
- intel/aub_write: break execlist write in 2
- intel/aub_write: switch to use i915_drm engine classes
- intel/aub_write: log mmio writes
- intel/aub_write: store the physical page allocator in struct
- intel/aub_write: turn context images arrays into functions
- intel/aub_write: factorize context image/pphwsp/ring creation
- iris: fix decoder call
- iris: fix decode_get_bo callback
- intel/error2aub: build a list of BOs before writing them
- intel/error2aub: identify buffers by engine
- intel/error2aub: strenghten batchbuffer identifier marker
- intel/error2aub: parse other buffer types
- intel/error2aub: annotate buffer with their address space
- intel/error2aub: store engine last ring buffer head/tail pointers
- intel/error2aub: write GGTT buffers into the aub file
- intel/error2aub: add a verbose option
- intel/error2aub: deal with GuC log buffer
- intel/error2aub: support older style engine names
- vulkan: factor out wsi dependencies
- anv: implement VK_EXT_pipeline_creation_feedback
- vulkan/overlay: properly register layer object with loader
- vulkan/overlay: silence validation layer warnings
- vulkan/overlay: check return value of swapchain get images
- vulkan/overlay: improve error reporting
- i965: perf: sklgt2: update a priority for register programming
- i965: perf: sklgt2: update compute metrics config
- i965: perf: sklgt2: update memory write config
- i965: perf: add PMA stall metrics
- i965: perf: chv: fixup counters names
- i965: perf: hsw: drop register programming not needed on HSW
- i965: perf: sklgt2: drop programming of an unused NOA register
- i965: perf: add Icelake metrics
- i965: perf: enable Icelake metrics
- i965: perf: add ring busyness metric for cfl gt2
- i965: perf: update render basic configs for big core gen9/gen10
- anv: implement VK_KHR_swapchain revision 70
- intel: add dependency on genxml generated files
- genxml: add a sorting script
- genxml: sort xml files using new script
- anv: don't use default pipeline cache for hits for
  VK_EXT_pipeline_creation_feedback
- anv: store heap address bounds when initializing physical device
- anv: leave the top 4Gb of the high heap VMA unused
- i965: store device revision in gen_device_info
- i965: extract performance query metrics
- i965: move mdapi data structure to intel/perf
- i965: move OA accumulation code to intel/perf
- i965: move brw_timebase_scale to device info
- i965: move mdapi result data format to intel/perf
- i965: move mdapi guid into intel/perf
- intel/perf: stub gen10/11 missing definitions
- i965: perf: add mdapi pipeline statistics queries on gen10/11
- intel/perf: drop counter size field
- intel/perf: constify accumlator parameter
- iris: implement WaEnableStateCacheRedirectToCS
- i965: implement WaEnableStateCacheRedirectToCS
- anv: implement WaEnableStateCacheRedirectToCS
- anv: fix uninitialized pthread cond clock domain
- intel/devinfo: fix missing num_thread_per_eu on ICL
- intel/devinfo: add basic sanity tests on device database
- anv: limit URB reconfigurations when using blorp
- intel: workaround VS fixed function issue on Gen9 GT1 parts
- anv: fix argument name for vkCmdEndQuery
- i965: fix icelake performance query enabling
- Revert "anv: limit URB reconfigurations when using blorp"
- vulkan/util: generate a helper function to return pNext struct sizes
- vulkan/overlay: update help printout
- vulkan/overlay: record stats in command buffers and accumulate on
  exec/submit
- vulkan/overlay: add pipeline statistic & timestamps support
- vulkan/overlay: add no display option
- vulkan/overlay: add a margin to the size of the window
- vulkan/overlay: record all select metrics into output file
- vulkan/overlay: add a frame counter option
- vulkan/overlay: make overlay size configurable
- vulkan/overlay: make overriden functions static
- vulkan/overlay: add TODO list
- anv: fix crash when application does not provide push constants
- anv: rework queries writes to ensure ordering memory writes
- anv: fix use after free
- anv: Use corresponding type from the vector allocation
- vulkan/overlay: keep allocating draw data until it can be reused
- nir: fix lower_non_uniform_access pass
- vulkan/overlay-layer: fix cast errors
- vulkan/overlay: fix truncating error on 32bit platforms
- nir: lower_non_uniform_access: iterate over instructions safely
- vulkan/overlay: fix timestamp query emission with no pipeline stats
- vulkan: fix build dependency issue with generated files
- anv: fix apply_pipeline_layout pass for arrays of YCbCr descriptors
- nir/lower_non_uniform: safely iterate over blocks
- intel/perf: fix EuThreadsCount value in performance equations
- intel/perf: improve dynamic loading config detection

Lubomir Rintel (3):

- kmsro: Extend to include armada-drm
- gallivm: guess CPU features also on ARM
- gallivm: disable NEON instructions if they are not supported

Lucas Stach (3):

- etnaviv: don't flush own context when updating resource use
- etnaviv: flush all pending contexts when accessing a resource with
  the CPU
- etnaviv: only try to construct scanout resource when on KMS winsys

Marek Olšák (121):

- radeonsi: enable dithered alpha-to-coverage for better quality
- radeonsi: merge & rename texture BO metadata functions
- radeonsi: unify error paths in si_texture_create_object
- winsys/amdgpu: remove amdgpu_drm.h definitions
- r600: add -Wstrict-overflow=0 to meson to silence the warning
- radeonsi: fix a comment typo in si_fine_fence_set
- gallium: allow more PIPE_RESOURCE\_ driver flags
- meson: drop the xcb-xrandr version requirement
- radeonsi: handle render_condition_enable in
si_compute_clear_render_target
- radeonsi: fix crashing performance counters (division by zero)
- radeonsi: initialize textures using DCC to black when possible
- radeonsi: clear allocator_zeroed_memory with SDMA
- radeonsi: make allocator_zeroed_memory unmappable and use bigger
buffers
- radeonsi: don't leak an index buffer if draw_vbo fails
- radeonsi: use local ws variable in si_need_dma_space
- gallium/u_threaded: fix EXPLICIT_FLUSH for flush offsets > 0
- radeonsi: fix EXPLICIT_FLUSH for flush offsets > 0
- winsys/amdgpu: don't drop manually added fence dependencies
- winsys/amdgpu: unify fence list code
- winsys/amdgpu: use a separate fence list for syncobjs
- winsys/amdgpu: remove occurrence of INDIRECT_BUFFER_CONST
- winsys/amdgpu: clean up IB buffer size computation
- winsys/amdgpu: cs_check_space sets the minimum IB size for future IBs
- radeonsi: add AMD_DEBUG env var as an alternative to R600_DEBUG
- radeonsi: use MEM instead of MEM_GRBM in COPY_DATA.DST_SEL
- radeonsi: add driconf option radeonsi_enable_nir
- radeonsi: always enable NIR for Civilization 6 to fix corruption
- driconf: add Civ6Sub executable for Civilization 6
- st/mesa: always unmap the uploader in st_atom_array.c
- gallium/u_threaded: always unmap const_uploader
- gallium/u_upload_mgr: allow use of FLUSH_EXPLICIT with persistent
mappings
- radeonsi: use SDMA for uploading data through const_uploader
- tgsi: don't set tgsi_info::uses_bindless_images for constbufs and hw
atomics
- radeonsi: always use compute rings for clover on CI and newer (v2)
- gallium/u_tests: use a compute-only context to test GCN compute ring
- gallium: add pipe_grid_info::last_block
- omx: clean up enc_LoadImage_common
- omx: add a compute path in enc_LoadImage_common
- radeonsi: fix assertion failure by using the correct type
- mesa: implement ARB/KHR_parallel_shader_compile
- gallium: implement ARB/KHR_parallel_shader_compile
- util/queue: move thread creation into a separate function
- util/queue: add ability to kill a subset of threads
- util/queue: hold a lock when reading num_threads in util_queue_finish
- util/queue: add util_queue_adjust_num_threads
- radeonsi: implement ARB/KHR_parallel_shader_compile callbacks
- radeonsi: don't use PFP_SYNC_ME with compute-only contexts
- docs/relnotes: document parallel_shader_compile changes in 19.1.0,
not 19.0.0
- amd/addrlib: fix uninitialized values for
Addr2ComputeDccAddrFromCoord
- radeonsi/gfx9: add support for PIPE_ALIGNED=0
- radeonsi: add ability to bind images as image buffers
- radeonsi: add support for displayable DCC for 1 RB chips
- radeonsi: add support for displayable DCC for multi-RB chips
- radeonsi: enable displayable DCC on Ravens
- gallium: add writable_bitmask parameter into set_shader_buffers
- glsl: remember which SSBOs are not read-only and pass it to gallium
- radeonsi: set exact shader buffer read/write usage in CS
- tegra: fix the build after the set_shader_buffers change
- radeonsi: fix a crash when unbinding sampler states
- glsl: fix shader_storage_blocks_write_access for SSBO block arrays
- Revert "glsl: fix shader_storage_blocks_write_access for SSBO block
arrays"
- glsl: allow the #extension directive within code blocks for the dri
option
- mesa: don't overwrite existing shader files with
MESA_SHADER_CAPTURE_PATH
- radeonsi: set AC_FUNC_ATTR_READNONE for image opcodes where it was
missing
- ac: use the common helper ac_apply_fmask_to_sample
- ac: fix incorrect bindless atomic code in visit_image_atomic
- radeonsi: enable GL_EXT_shader_image_load_formatted
- nir: optimize gl_SampleMaskIn to gl_HelperInvocation for radeonsi
when possible
- winsys/amdgpu: don't set GTT with GDS & OA placements on APUs
- radeonsi/gfx9: use the correct condition for the DPBB + QUANT_MODE
workaround
- radeonsi: use CP DMA for the null const buffer clear on CIK
- tgsi/scan: add uses_drawid
- ac: add radeon_info::marketing_name, replacing the winsys callback
- ac: add radeon_info::is_pro_graphics
- ac: add ac_get_i1_sgpr_mask
- ac: add REWIND and GDS registers to register headers
- winsys/amdgpu: make IBs writable and expose their address
- winsys/amdgpu: reorder chunks, make BO_HANDLES first, IB and FENCE
last
- winsys/amdgpu: enable chaining for compute IBs
- winsys/amdgpu: clean up and remove nonsensical assertion
- radeonsi: add si_cp_copy_data
- radeonsi: add helper si_get_minimum_num_gfx_cs_dwords
- radeonsi: delay adding BOs at the beginning of IBs until the first
draw
- gallium: document conservative rasterization flags
- st/dri: simplify throttling code
- gallium: replace DRM_CONF_THROTTLE with PIPE_CAP_MAX_FRAMES_IN_FLIGHT
- gallium: replace DRM_CONF_SHARE_FD with PIPE_CAP_DMABUF
- gallium: replace drm_driver_descriptor::configuration with
driconf_xml
- gallium: set PIPE_CAP_MAX_FRAMES_IN_FLIGHT to 2 for all drivers
- gallium: add PIPE_CAP_PREFER_COMPUTE_BLIT_FOR_MULTIMEDIA
- util: fix a compile failure in u_compute.c on windows
- mesa: enable glGet for EXT_gpu_shader4
- glsl: add \`unsigned int\` type for EXT_gpu_shader4
- glsl: apply some 1.30 and other rules to EXT_gpu_shader4 as well
- glsl: add builtin variables for EXT_gpu_shader4
- glsl: add arithmetic builtin functions for EXT_gpu_shader4
- glsl: add texture builtin functions for EXT_gpu_shader4
- glsl: allow "varying out" for fragment shader outputs with
EXT_gpu_shader4
- mesa: expose EXT_texture_buffer_object
- mesa: only allow EXT_gpu_shader4 in the compatibility profile
- st/mesa: expose EXT_gpu_shader4 if GLSL 1.40 is supported
- glsl: handle interactions between EXT_gpu_shader4 and texture
extensions
- radeonsi: add BOs after need_cs_space
- radeonsi/gfx9: set that window_rectangles always roll the context
- radeonsi/gfx9: rework the gfx9 scissor bug workaround (v2)
- radeonsi: remove dirty slot masks from scissor and viewport states
- glsl: fix shader_storage_blocks_write_access for SSBO block arrays
(v2)
- radeonsi: don't ignore PIPE_FLUSH_ASYNC
- mesa: rework error handling in glDrawBuffers
- mesa: fix pbuffers because internally they are front buffers
- st/mesa: don't flush the front buffer if it's a pbuffer
- radeonsi: use new atomic LLVM helpers
- radeonsi: set sampler state and view functions for compute-only
contexts
- st/dri: decrease input lag by syncing sooner in SwapBuffers
- glsl: fix and clean up NV_compute_shader_derivatives support
- st/mesa: fix 2 crashes in st_tgsi_lower_yuv
- radeonsi: remove old_va parameter from si_rebind_buffer by
remembering offsets
- radeonsi: update buffer descriptors in all contexts after buffer
invalidation
- radeonsi: fix a regression in si_rebind_buffer
- u_blitter: don't fail mipmap generation for depth formats containing
stencil
- ac: fix a typo in ac_build_wg_scan_bottom
Mario Kleiner (1):
- drirc: Add sddm-greeter to adaptive_sync blacklist.
Mark Janes (5):
- mesa: properly report the length of truncated log messages
- mesa: rename logging functions to reflect that they format strings
- mesa: add logging function for formatted string
- intel/common: move gen_debug to intel/dev
- intel/tools: Remove redundant definitions of INTEL_DEBUG
Mateusz Krzak (2):
- panfrost: cast bo_handles pointer to uintptr_t first
- panfrost: use os_mmap and os_munmap
Mathias Fröhlich (22):
- st/mesa: Reduce array updates due to current changes.
- mesa: Track buffer object use also for VAO usage.
- st/mesa: Invalidate the gallium array atom only if needed.
- mesa: Implement helper functions to map and unmap a VAO.
- mesa: Factor out \_mesa_array_element.
- mesa: Use \_mesa_array_element in dlist save.
- mesa: Replace \_ae_{,un}map_vbos with \_mesa_vao_{,un}map_arrays
- mesa: Remove \_ae_{,un}map_vbos and dependencies.
- mesa: Use mapping tools in debug prints.
- vbo: Fix basevertex handling in display list compiles.
- vbo: Fix GL_PRIMITIVE_RESTART_FIXED_INDEX in display list compiles.
- mesa: Add assert to \_mesa_primitive_restart_index.
- mesa: Factor out index function that will have multiple use.
- mesa: Use glVertexAttrib*NV functions for fixed function attribs.
- mesa: Implement \_mesa_array_element by walking enabled arrays.
- mesa: Rip out now unused gl_context::aelt_context.
- mesa: Remove the now unused \_NEW_ARRAY state change flag.
- mesa: Constify static const array in api_arrayelt.c
- mesa: Remove the \_glapi_table argument from \_mesa_array_element.
- mesa: Set CurrentSavePrimitive in vbo_save_NotifyBegin.
- mesa: Correct the is_vertex_position decision for dlists.
- mesa: Leave aliasing of vertex and generic0 attribute to the dlist
code.
Matt Turner (7):
- intel/compiler/test: Set devinfo->gen = 7
- intel/compiler: Avoid propagating inequality cmods if types are
different
- intel/compiler/test: Add unit test for mismatched signedness
comparison
- intel/compiler: Add commas on final values of compaction table arrays
- intel/compiler: Use SIMD16 instructions in fs saturate prop unit test
- intel/compiler: Add unit tests for sat prop for different exec sizes
- intel/compiler: Improve fix_3src_operand()
Matthias Lorenz (1):
- vulkan/overlay: Add fps counter
Mauro Rossi (6):
- android: intel/isl: remove redundant building rules
- android: anv: fix generated files dependencies (v2)
- android: anv: fix libexpat shared dependency
- android: nouveau: add support for nir
- android: fix LLVM version string related building errors
- draw: fix building error in draw_gs_init()
Maya Rashish (1):
- configure: fix test portability
Michel Dänzer (19):
- loader/dri3: Use strlen instead of sizeof for creating VRR property
atom
- gitlab-ci: Re-use docker image from the main repo in forked repos
- gitlab-ci: List some longer-running jobs before others of the same
stage
- gitlab-ci: Use 8 CPU cores in autotools job
- gitlab-ci: Make sure clang job actually uses ccache
- gitlab-ci: Only pull/push cache contents in build+test stage jobs
- gitlab-ci: Automatically retry jobs after runner system failure
- gitlab-ci: Run CI pipeline for all branches in the main repository
- gitlab-ci: Use Debian stretch instead of Ubuntu bionic
- gitlab-ci: Use HTTPS for APT repositories
- gitlab-ci: Use Debian packages instead of pip ones for meson and
scons
- gitlab-ci: Install most packages from Debian buster
- gitlab-ci: Remove unneeded (stuff from) APT command lines
- gitlab-ci: Remove unused Debian packages from Docker image
- gitlab-ci: Use clang 8 instead of 7
- gitlab-ci: Drop unused clang 5/6 packages
- gitlab-ci: Do not use subshells for compiling dependencies
- gitlab-ci: Use LLVM 3.4 from Debian jessie for scons-llvm job
- gitlab-ci: Use meson buildtype debug instead of default
debugoptimized
Mike Blumenkrantz (6):
- iris: support INTEL_NO_HW environment variable
- gallium: add pipe cap for inner_coverage conservative raster mode
- st/mesa: indicate intel extension support for inner_coverage based on
cap
- iris: add support for INTEL_conservative_rasterization
- iris: add preemption support on gen9
- iris: enable preemption support for gen10
Nanley Chery (3):
- i965: Rename intel_mipmap_tree::r8stencil\_\* -> ::shadow\_\*
- anv: Fix some depth buffer sampling cases on ICL+
- anv/cmd_buffer: Initialize the clear color struct for CNL+
Nataraj Deshpande (1):
- anv: Fix check for isl_fmt in assert
Neha Bhende (2):
- st/mesa: Fix topogun-1.06-orc-84k-resize.trace crash
- draw: fix memory leak introduced by 7720ce32a
Nicolai Hähnle (9):
- amd/surface: provide firstMipIdInTail for metadata surface
calculations
- radeonsi: add si_debug_options for convenient adding/removing of
options
- util/u_log: flush auto loggers before starting a new page
- ddebug: set thread name
- ddebug: log calls to pipe->flush
- ddebug: dump driver state into a separate file
- ddebug: expose some helper functions as non-inline
- radeonsi: add radeonsi_aux_debug option for aux context debug dumps
- radeonsi: add radeonsi_sync_compile option
Oscar Blumberg (3):
- intel/fs: Fix memory corruption when compiling a CS
- radeonsi: Fix guardband computation for large render targets
- glsl: Fix function return typechecking
Patrick Lerda (1):
- lima/ppir: fix pointer referenced after a free
Patrick Rudolph (1):
- d3dadapter9: Support software renderer on any DRI device
Philipp Zabel (1):
- etnaviv: fill missing offset in etna_resource_get_handle
Pierre Moreau (12):
- include/CL: Update to the latest OpenCL 2.2 headers
- clover: Avoid warnings from new OpenCL headers
- clover: Remove the TGSI backend as unused
- clover: Add a helper for checking if an IR is supported
- clover/api: Rework the validation of devices for building
- clover/api: Fail if trying to build a non-executable binary
- clover: Disallow creating libraries from other libraries
- clover: Validate program and library linking options
- clover: Move device extensions definitions to core/device.cpp
- clover: Move platform extensions definitions to clover/platform.cpp
- clover: Only use devices supporting IR_NATIVE
- clover: Fix indentation issues
Pierre-Eric Pelloux-Prayer (1):
- radeonsi: init sctx->dma_copy before using it
Plamena Manolova (3):
- i965: Disable ARB_fragment_shader_interlock for platforms prior to
GEN9
- isl: Set ClearColorConversionEnable.
- i965: Re-enable fast color clears for GEN11.
Qiang Yu (9):
- u_math: add ushort_to_float/float_to_ushort
- u_dynarray: add util_dynarray_grow_cap
- gallium/u_vbuf: export u_vbuf_get_minmax_index
- drm-uapi: add lima_drm.h
- gallium: add lima driver
- lima/gpir: fix compile fail when two slot node
- lima/gpir: fix alu check miss last store slot
- lima: fix lima_blit with non-zero level source resource
- lima: fix render to non-zero level texture
Rafael Antognolli (45):
- iris: Store internal_format when getting resource from handle.
- iris: Skip msaa16 on gen < 9.
- iris: Flush before hiz_exec.
- iris: Pin HiZ buffers when rendering.
- iris: Avoid leaking if we fail to allocate the aux buffer.
- iris/clear: Pass on render_condition_enabled.
- iris: Skip resolve if there's no context.
- iris: Flag ALL_DIRTY_BINDINGS on aux state change.
- iris: Add resolve on iris_flush_resource.
- iris: Convert RGBX to RGBA always.
- iris: Enable auxiliary buffer support again
- iris: Enable HiZ for multisampled depth surfaces.
- iris: Make intel_hiz_exec public.
- iris: Allocate buffer space for the fast clear color.
- iris: Use the clear depth when emitting 3DSTATE_CLEAR_PARAMS.
- iris: Fast clear depth buffers.
- iris: Add helper to convert fast clear color.
- iris: Add function to update clear color in surface state.
- iris: Bring back check for srgb and fast clear color.
- intel/isl: Add isl_format_has_color_component() function.
- intel/blorp: Make swizzle_color_value public.
- iris: Implement fast clear color.
- iris: Add iris_resolve_conditional_render().
- iris: Stall on the CPU and resolve predication during fast clears.
- iris: Track fast clear color.
- iris: Let blorp update the clear color for us.
- i965/blorp: Remove unused parameter from blorp_surf_for_miptree.
- iris: Only update clear color for gens 8 and 9.
- iris/gen8: Re-emit the SURFACE_STATE if the clear color changed.
- iris: Manually apply fast clear color channel overrides.
- iris: Do not allocate clear_color_bo for gen8.
- iris: Add aux.sampler_usages.
- iris: Enable fast clears on gen8.
- intel/fs: Only propagate saturation if exec_size is the same.
- intel/fs: Move the scalar-region conversion to the generator.
- intel/fs: Add a lowering pass for linear interpolation.
- intel/fs: Remove fs_generator::generate_linterp from gen11+.
- intel/isl: Resize clear color buffer to full cacheline
- intel/genxml: Update MI_ATOMIC genxml definition.
- intel/blorp: Make blorp update the clear color in gen11.
- iris: Do not advertise multisampled image load/store.
- iris: Support sRGB fast clears even if the colorspaces differ.
- iris: Use the linear version of the surface format during fast
clears.
- iris: Update the surface state clear color address when available.
- iris: Enable fast clear colors on gen11.
Ray Zhang (1):
- glx: fix shared memory leak in X11
Rhys Kidd (1):
- iris: Fix assertion in iris_resource_from_handle() tiling usage
Rhys Perry (28):
- nvc0: add compute invocation counter
- radv: bitcast 16-bit outputs to integers
- radv: ensure export arguments are always float
- ac/nir: implement 8-bit nir_load_const_instr
- ac/nir: fix 64-bit nir_op_f2f16_rtz
- ac/nir: make ac_build_clamp work on all bit sizes
- ac/nir: make ac_build_isign work on all bit sizes
- ac/nir: make ac_build_fdiv support 16-bit floats
- ac/nir: implement half-float nir_op_frcp
- ac/nir: implement half-float nir_op_frsq
- ac/nir: implement half-float nir_op_ldexp
- ac/nir: fix 16-bit ssbo stores
- ac/nir: implement 8-bit push constant, ssbo and ubo loads
- ac/nir: implement 8-bit ssbo stores
- ac/nir: add 8-bit types to glsl_base_to_llvm_type
- ac/nir: implement 8-bit conversions
- radv: enable VK_KHR_8bit_storage
- ac/nir: implement 16-bit pack/unpack opcodes
- radv: lower 16-bit flrp
- ac: add 16-bit support to ac_build_ddxy()
- nir,ac/nir: fix cube_face_coord
- gallium: add support for formatted image loads
- mesa, glsl: add support for EXT_shader_image_load_formatted
- st/mesa: add support for EXT_shader_image_load_formatted
- vc4: fix build
- ac,ac/nir: use a better sync scope for shared atomics
- radv: fix set_output_usage_mask() with composite and 64-bit types
- ac/nir: mark some texture intrinsics as convergent
Rob Clark (135):
- freedreno: fix release tarball
- freedreno: more fixing release tarball
- freedreno/a6xx: small compiler warning fix
- freedreno/ir3: fix varying packing vs. tex sharp edge
- freedreno/a6xx: move stream-out emit to helper
- freedreno/a6xx: clean up some open-coded bits
- freedreno/ir3: split out image helpers
- freedreno/ir3: split out a4xx+ instructions
- freedreno/ir3: fix ncomp for \_store_image() src
- freedreno/ir3: add image/ssbo <-> ibo/tex mapping
- freedreno/ir3: add a6xx instruction encoding
- freedreno/ir3: add a6xx+ SSBO/image support
- freedreno/ir3: HIGH reg w/a for a6xx
- freedreno/a6xx: border-color offset helper
- freedreno/a6xx: image/ssbo state emit
- freedreno/a6xx: compute support
- freedreno/a6xx: cache flush harder
- freedreno/a6xx: fix helper_invocation (sampler mask/id)
- freedreno/ir3: handle quirky atomic dst for a6xx
- freedreno/ir3: fix legalize for vecN inputs
- freedreno/ir3: fix crash in compile fail case
- freedreno/a6xx: 3d and cube image fixes
- freedreno: fix crash w/ masked non-SSA dst
- freedreno/ir3: rename put_dst()
- freedreno/ir3/a6xx: fix load_ssbo barrier type.
- freedreno/ir3: sync instr/disasm and add ldib encoding
- freedreno/ir3/a6xx: use ldib for ssbo reads
- freedreno/a6xx: samplerBuffer fixes
- freedreno/a6xx: enable tiled images
- freedreno: fix race condition
- freedreno/ir3: don't hardcode wrmask
- freedreno/a6xx: fix border-color offset
- freedreno/a6xx: cube image fix
- freedreno/a6xx: fix hangs with large shaders
- freedreno/ir3: use nopN encoding when possible
- freedreno/a6xx: fix ssbo alignment
- freedreno/ir3/a6xx: fix non-ssa atomic dst
- freedreno/a6xx: fix DRAW_IDX_INDIRECT max_indicies
- freedreno/a6xx: vertex_id is not \_zero_based
- freedreno/ir3/a6xx: fix atomic shader outputs
- freedreno/ir3: gsampler2DMSArray fixes
- freedreno/ir3: include nopN in expanded instruction count
- freedreno/ir3: add SethiUllman numbering pass
- freedreno/ir3: track register pressure in sched
- freedreno: fix ir3_cmdline build
- freedreno/a6xx: remove astc_srgb workaround
- freedreno/a6xx: refactor fd6_tex_swiz()
- freedreno/a6xx: fix border-color swizzles
- freedreno/a6xx: perfcntrs
- freedreno/ir3: fix ir3_cmdline harder
- freedreno/ir3: turn on [iu]mul_high
- freedreno/a6xx: more bcolor fixes
- freedreno/ir3/cp: fix ldib bug
- freedreno/ir3/a6xx: fix ssbo comp_swap
- freedreno/ir3: better cat6 encoding detection
- freedreno/ir3/ra: fix half-class conflicts
- freedreno/ir3: fix sam.s2en decoding
- freedreno/ir3: fix sam.s2en encoding
- freedreno/ir3: fix regmask for merged regs
- nir: move glsl_type_get_{sampler,image}_count()
- freedreno/ir3: find # of samplers from uniform vars
- freedreno/ir3: enable indirect tex/samp (sam.s2en)
- freedreno/ir3: optimize sam.s2en to sam
- freedreno/ir3: additional lowering
- freedreno/ir3: fix bit_count
- freedreno/ir3: dynamic UBO indexing vs 64b pointers
- freedreno/ir3: rename has_kill to no_earlyz
- freedreno/ir3: disable early-z for SSBO/image writes
- gallium: add PIPE_CAP_ESSL_FEATURE_LEVEL
- mesa/st: use ESSL cap to enable gpu_shader5
- freedreno: add ESSL cap
- docs: update freedreno status
- freedreno/a6xx: small cleanup
- freedreno/ir3: sched fix
- freedreno/ir3: reads/writes to unrelated arrays are not dependent
- freedreno/ir3: align const size to vec4
- nir: print var name for load_interpolated_input too
- nir: add lower_all_io_to_elements
- freedreno/ir3: re-indent comment
- freedreno/ir3: rework varying packing
- freedreno/ir3: add pass to move varying loads
- freedreno/ir3: convert to "new style" frag inputs
- gallium/docs: clarify set_sampler_views (v2)
- iris: fix set_sampler_view
- freedreno/ir3: fix const assert
- freedreno/drm: update for robustness
- freedreno: add robustness support
- compiler: rename SYSTEM_VALUE_VARYING_COORD
- freedreno/ir3: fix rgetpos decoding
- freedreno/ir3: more emit-cat5 fixes
- freedreno/ir3: cleanup instruction builder macros
- freedreno: update generated headers
- freedreno/ir3: lower load_barycentric_at_sample
- freedreno/ir3: lower load_barycentric_at_offset
- freedreno/ir3: remove bogus assert
- freedreno/ir3: rename frag_vcoord -> ij_pixel
- freedreno/a6xx: add VALIDREG/CONDREG helper macros
- freedreno/ir3: fix load_interpolated_input slot
- freedreno: wire up core sample-shading support
- freedreno/ir3: sample-shading support
- freedreno/a6xx: sample-shading support
- docs/features: update GL too
- freedreno/ir3: switch fragcoord to sysval
- freedreno/a6xx: small texture emit cleanup
- freedreno/a6xx: pre-bake UBWC flags in texture-view
- freedreno/ir3: fixes for half reg in/out
- freedreno/ir3: fix shader variants vs UBO analysis
- freedreno/ir3: fix lowered ubo region alignment
- freedreno/ir3: add IR3_SHADER_DEBUG flag to disable ubo lowering
- freedreno/ir3: add some ubo range related asserts
- nir: rework tex instruction printing
- nir: fix lower_wpos_ytransform in load_frag_coord case
- nir: add pass to lower fb reads
- freedreno/drm: expose GMEM_BASE address
- freedreno/ir3: fb read support
- freedreno/a6xx: KHR_blend_equation_advanced support
- freedreno/a6xx: smaller hammer for fb barrier
- docs: mark KHR_blend_equation_advanced done on a6xx
- nir: fix nir tex print harder
- freedreno/ir3: remove assert
- freedreno/a6xx: OUT_RELOC vs OUT_RELOCW fixes
- freedreno: update generated headers
- freedreno/a6xx: UBWC fixes
- freedreno/a6xx: UBWC support for images
- freedreno: mark imported resources as valid
- freedreno/a6xx: buffer resources cannot be compressed
- freedreno: move UBWC color offset to fd_resource_offset()
- freedreno: add ubwc_enabled helper
- freedreno/a6xx: deduplicate a few lines
- freedreno: remove unused forward struct declaration
- freedreno/ir3: fix rasterflat/glxgears
- freedreno/ir3: set more barrier bits
- freedreno/a6xx: fix GPU crash on small render targets
- freedreno/a6xx: fix issues with gallium HUD
- freedreno/a6xx: fix hangs with newer sqe fw
Rob Herring (2):
- kmsro: Add lima renderonly support
- kmsro: Add platform support for exynos and sun4i
Rodrigo Vivi (1):
- intel: Add more PCI Device IDs for Coffee Lake and Ice Lake.
Roland Scheidegger (2):
- gallivm: fix bogus assert in get_indirect_index
- gallivm: fix saturated signed add / sub with llvm 9
Romain Failliot (1):
- docs: changed "Done" to "DONE" in features.txt
Ross Burton (1):
- Revert "meson: drop GLESv1 .so version back to 1.0.0"
Ryan Houdek (1):
- panfrost: Adds Bifrost shader disassembler utility
Sagar Ghuge (10):
- iris: Don't allocate a BO per query object
- nir/glsl: Add another way of doing lower_imul64 for gen8+
- glsl: [u/i]mulExtended optimization for GLSL
- nir/algebraic: Optimize low 32 bit extraction
- spirv: Allow [i/u]mulExtended to use new nir opcode
- iris: Refactor code to share 3DSTATE_URB\_\* packet
- iris: Track last VS URB entry size
- iris: Flag fewer dirty bits in BLORP
- intel/fs: Remove unused condition from opt_algebraic case
- intel/compiler: Fix assertions in brw_alu3
Samuel Iglesias Gonsálvez (4):
- isl: remove the cache line size alignment requirement
- isl: the display engine requires 64B alignment for linear surfaces
- radv: don't overwrite results in vkGetQueryPoolResults() when queries
are not available
- radv: write availability status in vkGetQueryPoolResults() when the data
is not available
Samuel Pitoiset (147):
- radv/winsys: fix hash when adding internal buffers
- radv: fix build
- radv: bail out when no image transitions will be performed
- radv: remove unused radv_render_pass_attachment::view_mask
- radv: remove useless MAYBE_UNUSED in CmdBeginRenderPass()
- radv: add radv_cmd_buffer_begin_subpass() helper
- radv: move subpass image transitions to
radv_cmd_buffer_begin_subpass()
- radv: store the list of attachments for every subpass
- radv: use the new attachments array when starting subpasses
- radv: determine the last subpass id for every attachments
- radv: handle final layouts at end of every subpass and render pass
- radv: move some render pass things to radv_render_pass_compile()
- radv: add radv_render_pass_add_subpass_dep() helper
- radv: track if subpasses have color attachments
- radv: handle subpass dependencies correctly
- radv: accumulate all ingoing external dependencies to the first
subpass
- radv: execute external subpass barriers after ending subpasses
- radv: drop useless checks when resolving subpass color attachments
- radv: do not set preserveAttachments for internal render passes
- radv: don't flush src stages when dstStageMask == BOTTOM_OF_PIPE
- radv: fix compiler issues with GCC 9
- radv: gather more info about push constants
- radv: gather if shaders load dynamic offsets separately
- radv: keep track of the number of remaining user SGPRs
- radv: add support for push constants inlining when possible
- radv: fix using LOAD_CONTEXT_REG with old GFX ME firmwares on GFX8
- radv/winsys: fix BO list creation when RADV_DEBUG=allbos is set
- radv: always export gl_SampleMask when the fragment shader uses it
- ac: make use of ac_build_expand_to_vec4() in visit_image_store()
- radv: use MAX_{VBS,VERTEX_ATTRIBS} when defining max vertex input
limits
- radv: store vertex attribute formats as pipeline keys
- radv: reduce the number of loaded channels for vertex input fetches
- radv: fix radv_fixup_vertex_input_fetches()
- radv: fix invalid element type when filling vertex input default
values
- ac: add ac_build_llvm8_tbuffer_load() helper
- ac: use new LLVM 8 intrinsic when loading 16-bit values
- radv: write the alpha channel of MRT0 when alpha coverage is enabled
- radv: remove unused variable in gather_push_constant_info()
- radv: fix writing the alpha channel of MRT0 when alpha coverage is
enabled
- radv: fix clearing attachments in secondary command buffers
- radv: fix out-of-bounds access when copying descriptors BO list
- radv: don't copy buffer descriptors list for samplers
- radv: use 32_AR instead of 32_ABGR when alpha coverage is required
- radv: allocate enough space in cmdbuf when starting a subpass
- radv: properly align the fence and EOP bug VA on GFX9
- radv: enable lower_mul_2x32_64
- Revert "radv: execute external subpass barriers after ending
subpasses"
- radv: fix pointSizeRange limits
- radv: set the maximum number of IBs per submit to 192
- ac: rework typed buffers loads for LLVM 7
- radv: store more vertex attribute infos as pipeline keys
- radv: use typed buffer loads for vertex input fetches
- ac: add ac_build_{struct,raw}_tbuffer_load() helpers
- ac: use the raw tbuffer version for 16-bit SSBO loads
- radv: always initialize HTILE when the src layout is UNDEFINED
- radv: always load 3 channels for formats that need to be shuffled
- ac: use llvm.amdgcn.fract intrinsic for nir_op_ffract
- radv: fix binding transform feedback buffers
- ac: make use of ac_get_store_intr_attribs() where possible
- ac/nir: set attrib flags for SSBO and image store operations
- ac: add ac_build_buffer_store_format() helper
- ac/nir: remove one useless check in visit_store_ssbo()
- ac/nir: use new LLVM 8 intrinsics for SSBO atomic operations
- ac/nir: use ac_build_buffer_load() for SSBO load operations
- ac/nir: use ac_build_buffer_store_dword() for SSBO store operations
- ac: use new LLVM 8 intrinsics in ac_build_buffer_load()
- ac: add ac_build_{struct,raw}_tbuffer_store() helpers
- ac: use new LLVM 8 intrinsic when storing 16-bit values
- ac: use new LLVM 8 intrinsics in ac_build_buffer_store_dword()
- ac: add various int8 definitions
- ac: add ac_build_tbuffer_load_byte() helper
- ac: add ac_build_tbuffer_store_byte() helper
- radv: add missing initializations since
VK_EXT_pipeline_creation_feedback
- ac: add f16_0 and f16_1 constants
- ac: add 16-bit support to fsign
- ac: add 16-bit support to fract
- ac: fix 16-bit shifts
- ac: fix incorrect argument type for tbuffer.{load,store} with LLVM 7
- nir: use generic float types for frexp_exp and frexp_sig
- spirv,nir: lower frexp_exp/frexp_sig inside a new NIR pass
- nir: add nir_{load,store}_deref_with_access() helpers
- spirv: propagate the access flag for store and load derefs
- ac: use llvm.amdgcn.fmed3 intrinsic for nir_op_fmed3
- ac: add ac_build_frexp_mant() helper and 16-bit/32-bit support
- ac: add ac_build_frex_exp() helper and 16-bit/32-bit support
- radv: do not lower frexp_exp and frexp_sig
- radv: enable VK_AMD_gpu_shader_int16
- radv: skip updating depth/color metadata for conditional rendering
- radv: do not always initialize HTILE in compressed state
- ac: fix return type for llvm.amdgcn.frexp.exp.i32.64
- ac/nir: fix nir_op_b2i16
- ac: fix ac_build_bit_count() for 16-bit integer type
- ac: fix ac_build_bitfield_reverse() for 16-bit integer type
- ac: fix ac_find_lsb() for 16-bit integer type
- ac: fix ac_build_umsb() for 16-bit integer type
- ac/nir: add support for nir_op_b2i8
- ac: add 8-bit support to ac_build_bit_count()
- ac: add 8-bit support to ac_find_lsb()
- ac: add 8-bit support to ac_build_umsb()
- ac: add 8-bit and 64-bit support to ac_build_bitfield_reverse()
- radv: partially enable VK_KHR_shader_float16_int8
- nir: do not pack varying with different types
- ac/nir: fix intrinsic names for atomic operations with LLVM 9+
- radv: fix getting the vertex strides if the bindings aren't
contiguous
- ac/nir: fix nir_op_b2f16
- radv: enable VK_AMD_gpu_shader_half_float
- wsi: allow to override the present mode with MESA_VK_WSI_PRESENT_MODE
- ac/nir: make use of ac_build_imax() where possible
- ac/nir: make use of ac_build_imin() where possible
- ac/nir: make use of ac_build_umin() where possible
- ac: add ac_build_umax() and use it where possible
- ac: add ac_build_ddxy_interp() helper
- ac: add ac_build_load_helper_invocation() helper
- ac/nir: remove useless LLVMGetUndef for nir_op_pack_64_2x32_split
- ac/nir: remove useless integer cast in
adjust_sample_index_using_fmask()
- ac/nir: remove useless integer cast in visit_image_load()
- ac/nir: remove some useless integer casts for ALU operations
- spirv: add SpvCapabilityFloat16 support
- radv: enable VK_KHR_shader_float16_int8
- radv: set ACCESS_NON_READABLE on stores for copy/fill/clear meta
shaders
- radv: enable shaderInt8 on SI and CIK
- radv: sort the shader capabilities alphabetically
- ac/nir: use new LLVM 8 intrinsics for SSBO atomics except cmpswap
- ac/nir: add 64-bit SSBO atomic operations support
- radv: add VK_KHR_shader_atomic_int64 but disable it for now
- ac: add support for more types with struct/raw LLVM intrinsics
- ac: use struct/raw load intrinsics for 8-bit/16-bit int with LLVM 9+
- ac: use struct/raw store intrinsics for 8-bit/16-bit int with LLVM 9+
- ac/nir: only use the new raw/struct image atomic intrinsics with LLVM
9+
- ac/nir: only use the new raw/struct SSBO atomic intrinsics with LLVM
9+
- ac/nir: use the new raw/struct SSBO atomic intrinsics for comp_swap
- radv: add VK_NV_compute_shader_derivatives support
- radv: add missing VEGA20 chip in radv_get_device_name()
- radv: do not need to force emit the TCS regs on Vega20
- radv: fix color conversions for normalized uint/sint formats
- radv: implement a workaround for VK_EXT_conditional_rendering
- ac: tidy up ac_build_llvm8_tbuffer_{load,store}
- radv: set WD_SWITCH_ON_EOP=1 when drawing primitives from a stream
output buffer
- radv: only need to force emit the TCS regs on Vega10 and Raven1
- radv: fix radv_get_aspect_format() for D+S formats
- radv: apply the indexing workaround for atomic buffer operations on
GFX9
- radv: fix setting the number of rectangles when it's dynamic
- radv: add a workaround for Monster Hunter World and LLVM 7&8
- radv: allocate more space in the CS when emitting events
- radv: do not use gfx fast depth clears for layered depth/stencil
images
- radv: fix alpha-to-coverage when there are unused color attachments
- radv: fix setting CB_SHADER_MASK for dual source blending
Sergii Romantsov (4):
- dri: meson: do not prefix user provided dri-drivers-path
- d3d: meson: do not prefix user provided d3d-drivers-path
- i965,iris/blorp: do not blit 0-sizes
- glsl: Fix input/output structure matching across shader stages
Sonny Jiang (1):
- radeonsi: use compute for clear_render_target when possible
Tapani Pälli (42):
- nir: add option to use scaling factor when sampling planes YUV
lowering
- dri: add P010, P012, P016 for 10bit/12bit/16bit YUV420 formats
- intel/compiler: add scale_factors to sampler_prog_key_data
- i965: add P0x formats and propagate required scaling factors
- drirc/i965: add option to disable 565 configs and visuals
- mesa: return NULL if we exceed MaxColorAttachments in
get_fb_attachment
- anv: refactor error handling in anv_shader_bin_write_to_blob()
- iris: add Android build
- nir: initialize value in copy_prop_vars_block
- nir: use nir_variable_create instead of open-coding the logic
- android: add liblog to libmesa_intel_common build
- android: make libbacktrace optional on USE_LIBBACKTRACE
- iris: add libmesa_iris_gen8 library to the build
- util: fix a warning when building against clang7 headers
- anv: retain the is_array state in create_plane_tex_instr_implicit
- anv: toggle on support for VK_EXT_ycbcr_image_arrays
- anv: use anv_gem_munmap in block pool cleanup
- anv: call blob_finish when done with it
- nir: free dead_ctx in case of no progress
- anv: destroy descriptor sets when pool gets destroyed
- anv: release memory allocated by bo_heap when descriptor pool is
destroyed
- anv: release memory allocated by glsl types during spirv_to_nir
- anv: revert "anv: release memory allocated by glsl types during
spirv_to_nir"
- i965: remove scaling factors from P010, P012
- isl: fix automake build when sse41 is not supported
- android: Build fixes for OMR1
- iris: initialize num_cbufs
- iris: mark switch case fallthrough
- anv/radv: release memory allocated by glsl types during spirv_to_nir
- st/mesa: fix compilation warning on storage_flags_to_buffer_flags
- st/mesa: fix warnings about implicit conversion on enumeration type
- spirv: fix a compiler warning
- st/nir: run st_nir_opts after 64bit ops lowering
- iris: move variable to the scope where it is being used
- iris: move iris_flush_resource so we can call it from get_handle
- iris: handle aux properly in iris_resource_get_handle
- egl: setup fds array correctly when exporting dmabuf
- compiler/glsl: handle case where we have multiple users for types
- android/iris: fix driinfo header filename
- nir: use braces around subobject in initializer
- glsl: use empty brace initializer
- anv: expose VK_EXT_queue_family_foreign on Android
Thomas Hellstrom (5):
- winsys/svga: Add an environment variable to force host-backed
operation
- winsys/svga: Enable the transfer_from_buffer GPU command for vgpu10
- svga: Avoid bouncing buffer data in malloced buffers
- winsys/svga: Update the drm interface file
- winsys/svga: Don't abort on EBUSY errors from execbuffer
Timo Aaltonen (1):
- util/os_misc: Add check for PIPE_OS_HURD
Timothy Arceri (72):
- st/glsl_to_nir: remove dead local variables
- ac/radv/radeonsi: add ac_get_num_physical_sgprs() helper
- radv: take LDS into account for compute shader occupancy stats
- util: move BITFIELD macros to util/macros.h
- st/glsl_to_nir: call nir_remove_dead_variables() after lowering local
indirects
- nir: add support for marking used patches when packing varyings
- nir: add glsl_type_is_32bit() helper
- nir: add is_packing_supported_for_type() helper
- nir: rewrite varying component packing
- nir: prehash instruction in nir_instr_set_add_or_rewrite()
- nir: turn ssa check into an assert
- nir: turn an ssa check in nir_search into an assert
- nir: remove simple dead if detection from nir_opt_dead_cf()
- radeonsi/nir: set input_usage_mask properly
- radeonsi/nir: set colors_read properly
- radeonsi/nir: set shader_buffers_declared properly
- st/nir: use NIR for asm programs
- nir: remove non-ssa support from nir_copy_prop()
- nir: clone instruction set rather than removing individual entries
- nir: allow nir_lower_phis_to_scalar() on more src types
- radeonsi: fix query buffer allocation
- glsl: fix shader cache for packed param list
- radeonsi/nir: move si_lower_nir() call into compiler thread
- glsl: rename is_record() -> is_struct()
- glsl: rename get_record_instance() -> get_struct_instance()
- glsl: rename record_location_offset() -> struct_location_offset()
- glsl: rename record_types -> struct_types
- nir: rename glsl_type_is_struct() -> glsl_type_is_struct_or_ifc()
- glsl/freedreno/panfrost: pass gl_context to the standalone compiler
- glsl: use NIR function inlining for drivers that use glsl_to_nir()
- i965: stop calling nir_lower_returns()
- radeonsi/nir: stop calling nir_lower_returns()
- st/glsl: start splitting out common st glsl conversion code
- anv: add support for dumping shader info via VK_EXT_debug_report
- nir: add guess trip count support to loop analysis
- nir: add new partially_unrolled bool to nir_loop
- nir: add partial loop unrolling support
- nir: calculate trip count for more loops
- nir: unroll some loops with a variable limit
- nir: simplify the loop analysis trip count code a little
- nir: add helper to return inversion op of a comparison
- nir: add get_induction_and_limit_vars() helper to loop analysis
- nir: pass nir_op to calculate_iterations()
- nir: find induction/limit vars in iand instructions
- st/glsl_to_nir: fix incorrect array access
- radeonsi/nir: call some more var optimisation passes
- ac/nir_to_llvm: add assert to emit_bcsel()
- nir: only override previous alu during loop analysis if supported
- nir: fix opt_if_loop_last_continue()
- nir: add support for user defined loop control
- spirv: make use of the loop control support in nir
- nir: add support for user defined select control
- spirv: make use of the select control support in nir
- Revert "ac/nir: use new LLVM 8 intrinsics for SSBO atomic operations"
- nir: propagate known constant values into the if-then branch
- Revert "nir: propagate known constant values into the if-then branch"
- nir/radv: remove restrictions on opt_if_loop_last_continue()
- nir: initialise some variables in opt_if_loop_last_continue()
- nir/i965/freedreno/vc4: add a bindless bool to type size functions
- ac/nir_to_llvm: make get_sampler_desc() more generic and pass it the
image intrinsic
- ac/nir_to_llvm: add image bindless support
- nir: fix packing components with arrays
- radeonsi/nir: fix scanning of bindless images
- st/mesa/radeonsi: fix race between destruction of types and shader
compilation
- nir: fix nir_remove_unused_varyings()
- radeonsi/nir: create si_nir_opts() helper
- radeonsi/nir: call radeonsi nir opts before the scan pass
- util/drirc: add workarounds for bugs in Doom 3: BFG
- radeonsi: add config entry for Counter-Strike Global Offensive
- Revert "glx: Fix synthetic error generation in \__glXSendError"
- Revert "st/mesa: expose 0 shader binary formats for compat profiles
for Qt"
- st/glsl: make sure to propagate initialisers to driver storage
Timur Kristóf (19):
- radeonsi/nir: Use uniform location when calculating const_file_max.
- iris: implement clearing render target and depth stencil
- nir: Add ability for shaders to use window space coordinates.
- tgsi_to_nir: Fix the TGSI ARR translation by converting the result to
int.
- tgsi_to_nir: Fix TGSI LIT translation by using flt.
- tgsi_to_nir: Make the TGSI IF translation code more readable.
- tgsi_to_nir: Split to smaller functions.
- nir: Move nir_lower_uniforms_to_ubo to compiler/nir.
- nir: Add multiplier argument to nir_lower_uniforms_to_ubo.
- freedreno: Plumb pipe_screen through to irX_tgsi_to_nir.
- tgsi_to_nir: Produce optimized NIR for a given pipe_screen.
- tgsi_to_nir: Restructure system value loads.
- tgsi_to_nir: Extract ttn_emulate_tgsi_front_face into its own
function.
- tgsi_to_nir: Support FACE and POSITION properly.
- tgsi_to_nir: Improve interpolation modes.
- tgsi_to_nir: Set correct location for uniforms.
- radeonsi/nir: Only set window_space_position for vertex shaders.
- iris: Face should be a system value.
- gallium: fix autotools build of pipe_msm.la
Tobias Klausmann (1):
- vulkan/util: meson build - add wayland client include
Tomasz Figa (1):
- llvmpipe: Always return some fence in flush (v2)
Tomeu Vizoso (19):
- panfrost: Add gem_handle to panfrost_memory and panfrost_bo
- panfrost: Add backend targeting the DRM driver
- panfrost/midgard: Add support for MIDGARD_MESA_DEBUG
- panfrost: Add support for PAN_MESA_DEBUG
- panfrost: Set bo->size[0] in the DRM backend
- panfrost: Set bo->gem_handle when creating a linear BO
- panfrost: Adapt to uapi changes
- panfrost: Fix sscanf format options
- panfrost: Set the GEM handle for AFBC buffers
- panfrost: Also tell the kernel about the checksum_slab
- panfrost: Pass the context BOs to the kernel so they aren't unmapped
while in use
- panfrost: Wait for last job to finish in force_flush_fragment
- panfrost: split asserts in pandecode
- panfrost: Guard against reading past end of buffer
- panfrost/ci: Initial commit
- panfrost/midgard: Skip register allocation if there's no work to do
- panfrost/midgard: Skip liveness analysis for instructions without
dest
- panfrost: Fix two uninitialized accesses in compiler
- panfrost: Only take the fast paths on buffers aligned to block size
Toni Lönnberg (8):
- intel/genxml: Only handle instructions meant for render engine when
generating headers
- intel/genxml: Media instructions and structures for gen6
- intel/genxml: Media instructions and structures for gen7
- intel/genxml: Media instructions and structures for gen7.5
- intel/genxml: Media instructions and structures for gen8
- intel/genxml: Media instructions and structures for gen9
- intel/genxml: Media instructions and structures for gen10
- intel/genxml: Media instructions and structures for gen11
Topi Pohjolainen (2):
- intel/compiler/icl: Use tcs barrier id bits 24:30 instead of 24:27
- intel/compiler/fs/icl: Use dummy masked urb write for tess eval
Vasily Khoruzhick (2):
- lima: use individual tile heap for each GP job.
- lima: add support for depth/stencil fbo attachments and textures
Vinson Lee (5):
- gallium/auxiliary/vl: Fix duplicate symbol build errors.
- nir: Fix anonymous union initialization with older GCC.
- swr: Fix build with llvm-9.0.
- gallium: Fix autotools build with libxatracker.la.
- freedreno: Fix GCC build error.
Vivek Kasireddy (1):
- drm-uapi: Update headers from drm-next
Xavier Bouchoux (1):
- nir/spirv: Fix assert when unsampled OpTypeImage has unknown 'Depth'
Yevhenii Kolesnikov (1):
- i965: Fix allow_higher_compat_version workaround limited by OpenGL
3.0
coypu (1):
- gbm: don't return void
davidbepo (1):
- drirc: add Waterfox to adaptive-sync blacklist
grmat (1):
- drirc: add Spectacle, Falkon to a-sync blacklist
pal1000 (1):
- scons: Compatibility with Scons development version string
suresh guttula (3):
- vl: Add cropping flags for H264
- radeon/vce: Add support for frame_cropping_flag of
VAEncSequenceParameterBufferH264
- st/va/enc: Add support for frame_cropping_flag of
VAEncSequenceParameterBufferH264