Commit Graph

46 Commits

Author SHA1 Message Date
jaxl
fe996fc7a4 2020-6-24 update
New feature and improvement
* Enable vkGetDeviceQueue2
* Make CmdCopyImageGraphics faster
* Add VK_COMPOSITE_ALPHA_PRE_MULTIPLIED_BIT_KHR in supportedCompositeAlpha

Issue fix
* Randr lease display is broken
* Driver crashes when setting basePipelineHandle to an invalid value
* Nullptr crash and some memtracker errors in the GPU profiler
* AMDVLK sets timezone to GMT+0 which affected in-game clock
2020-06-24 17:09:01 +08:00
jaxl
fc576d9b33 2020-5-28 update
New feature and improvement
* Improve memory allocation for systems that have no local invisible memory
* Turn off command buffer prefetch for local memory
* Enable TMZ  (trusted memory zone)  feature
* Use backend optimization for kills

Issue fix
* Vulkan RGP instruction tracing is not working
* LlpcOptions doesn't work when pipeline cache is enabled
* Issues in OpAtomicIDecrement and OpAtomicIIncrement for shader_image_atomic_int64 extension
* Out of bounds reads in ElfWriter
2020-05-28 23:04:46 +08:00
jaxl
c54654496a 2020-5-14 update
New feature and improvement

* Support VK_EXT_pipeline_creation_cache_control extension
* Remove optimization that sinks arithmetic out of "if" blocks
* Remove waTessIncorrectRelativeIndex workaround
* Optimizations for SubgroupShuffleXor() function
* Update api_version to 1.2.139

Issue fix

* External memory CTS tests failure on platform that doesn't have local invisible heap
* Trackmania 2 glitchy graphical issue
2020-05-14 16:48:53 +08:00
jaxl
7b51aa7034 2020-4-30 update
New feature and improvement
* Enable pipeline binary cache by default
* Support VK_MEMORY_PROPERTY_PROTECTED_BIT
* Shadow of Tomb Raider benchmark performance tuning

Issue fix
* ANGLE conformance test functional_transform_feedback crash
* [RGP] incorrect start/stop frame/tag values reported back to SQTT file
* EDID parsing bugs for HDR support
2020-04-30 12:40:45 +08:00
jaxl
c59322aa30 2020-4-16 update
New feature and improvement
*  Enable Renoir support
* Update api version to 1.2.135
* Add limitation on cache directory size
* Add panel setting to disable AMD Vendor extensions

Issue fix
*  The issue that scratch allocations, once multiplied for wave size, could be under the WaveSize granularity
2020-04-16 21:52:55 +08:00
jaxl
d5ddc3fb6f 2020-3-26 update
New feature and improvement
*  Enable Immediate and Mailbox mode for direct display

Issue fix
* SPM issues
2020-03-27 14:14:48 +08:00
jaxl
12df867cd3 2020-3-12 update
New feature and improvement
* Enable flip by default
* Support VK_EXT_post_depth_coverage extension
* Support VK_EXT_texel_buffer_alignment extension
* Support VK_KHR_non_semantic_info extension
* Performance tuning for Thrones of Britannia, Talos principle and Serious sam
* Update api version to 1.2.133

Issue fix
* Corruption in non power of two compressed textures on Gfx9 and Gfx10 when using the CopyImage2DMipLevel shader
2020-03-13 10:58:23 +08:00
jaxl
b81c3a09c9 2020-2-21 update
New feature and improvement
* Update api version to 1.2.131
* Disable VK_IMAGE_CREATE_SPARSE_BINDING_BIT for images with YUV format
* Support barrier for streamout buffer
* Add a new type FmaskOnly for Image MetadataMode which makes color msaa Image only have Cmask/Fmask metadata
* Avoid issue TCC metadata cache invalidation for image without metadata
* Allow compressed copySrc layout for color msaa image that supports MetaDataTexFetch
* Add wave32 support for indirect command generator dispatches
* Disable DCC on mipmap array resource by default and adjust the UseDCC validation logic

Issue fix
* Vk GPA extension not returning correct return code
* Clearing 3D image views (created from 3D PRT images) via clear load ops does not work as expected
* Image corruption during memory defrag copy (Corrupt non power of 2 textures)
* Random failure with XFBCaptureAndVerifyOnHost test cases when using vkDrawIndirectByteCountEXT() draw calls
* A minor logic error causing a whole bunch of extra updates to DCC state metadata inside PAL barriers
* A bug of DCC constant encoding
* CTS 1.2 api.image_clearing.*.clear_depth_stencil_* random failures on Linux for asics before gfx10
2020-02-21 21:03:44 +08:00
jaxl
813f090efb 2020-1-21 update
New feature and improvement
* Update the api version to 1.1.130
* Performance  tuning for Rise of Tomb Raider and DiRT4

Issue fix
* Corruption observed while running vkmark
* Memory leak in llpc
* Wrong result of interpolateAtSample/AtOffset
* CTS failure in dEQP-VK.draw.scissor.16*
2020-01-22 22:25:13 +08:00
jaxl
e6d1928269 2019-12-19 update
New feature and improvement

* Enable VK_KHR_shader_float_controls extension
* Enable  VK_KHR_separate_depth_stencil_layouts extension
* Enable VertexAttributeInstanceRateZeroDivisor
* Remove a workaround for the issue that Elite Dangerous game is too bright
* Update the api version to 1.1.129

Issue fix

* Performance drop issue of vkmark and vkcube when flip is enabled
* Crash triggered by Sparse bind + timeline semaphores
2019-12-23 14:34:37 +08:00
jaxl
887460b07b 2019-12-12 update
New feature and improvement
* Enable VK_EXT_pipeline_creation_feedback extension
* Enable SUBGROUP_CLUSTER_SUPPORT
* Performance optimization for Dawn of War 3

Issue fix

* Few CTS failures for Navi14
* An issue of VK_EXT_vertex_attribute_disivor (divisor == 0)
2019-12-13 11:04:18 +08:00
jaxl
b2600e0866 2019-11-25 update
New feature and improvement

* Add Navi14 support
* Support VK_EXT_pipeline_creation_feedback extension
* Support VK_EXT_shader_demote_to_helper_invocation extension
* Shader performance tuning for TOTALWAR WARHAMMER II and DiRT4
* Update Vulkan headers to 1.1.127

Issue fix

* spirv_assembly.instruction.compute.float_controls.independence_settings.independence_settings  test failures
* dEQP-VK.pipeline.depth_range_unrestricted.*d32_sfloat*  test failure
* dEQP-VK.spirv_assembly.instruction.graphics.float_controls.fp32.input_args.denorm_mmulm_var_preserve_vert test failures
* dEQP-VK.transform_feedback.fuzz.*.all_unordered_and_missing.56  test failures
* dEQP-VK.transform_feedback.fuzz* test failures on Gfx7
2019-11-27 14:21:33 +08:00
jaxl
ff4e60402d 2019-11-7 update
New feature and improvement

* Support VK_KHR_shader_subgroup_extended_types extension
* Support VK_KHR_pipeline_executable_properties extension
* Support VK_KHR_TIMELINE_SEMAPHORE extension
* Support VK_KHR_SHADER_CLOCK extension
* Support VK_KHR_SPIRV_1_4 extension
* Enable computeFullSubgroups support
* Switch vk pipeline cache to PAL pipeline binary cache
* Shader tuning for game Rise of Tomb Raider

Issue fix

* RPCS3 not working or hang
* dEQP-VK.glsl.builtin.precision_fp16_storage*.frem.compute.* test failure
* transform_feedback.simple.xfb_* - clip and cull tests failure
2019-11-08 17:23:46 +08:00
jaxl
aab8cd0393 2019-10-231 update
New feature and improvement

* Add VKI_EXT_HOST_MAPPED_FOREIGN_MEMORY support
* Re-work vkPipelineCache
*  A few cache related optimization for AcqRelBarrier
* Tune shader performance for F1 2017 and the Talos principle
* EXT_vertex_attribute_divisor: Add missing features query and support verification

Issue fix
*  Image corruption presented on screen with multiGPU
*  dEQP-VK.api.image_clearing clear_color_image.2d.linear CTS failure
*  RGP profiling fails to enable starting with GPUOPEN v41
* A case fallthrough bug with VK_AMD_memory_overallocation_behavior at device creation
* CTS transform feedback test failures
2019-10-22 13:45:33 +08:00
jaxl
71e1218989 Change manifest: llvm-project revision. 2019-10-16 17:03:04 +08:00
jaxl
9cbff8e8de Change manifest:switch llvm to llvm-project. 2019-10-16 16:54:46 +08:00
Jacob He
808b6d5603 2019-9-23 update
New feature and improvement
* Support VK_AMD_device_coherent_memory extension
* Support VK_EXT_line_rasterization extension
* Support VK_EXT_calibrated_timestamps extension
* Tune shader performance for Serious Sam Fusion 2017
* Tune shader performance for DawnOfWar3
* Add  lots of missing enabled feature verification for device create

Issue fix
* Memory leak in VK_Semaphore
* Corruption in Elite Dangerous
* Crash when calling vk_EXT_debug_utils extension when using DevDriver

Change-Id: I88453296117dca2f5180267ddc3de51cc0c0b498
2019-09-25 10:42:40 +08:00
jaxl
6dadf94b08 Update manifest: use external cwpack 2019-09-23 14:33:59 +08:00
jaxl
e79226b8a0 2019-8-26 update
New feature and improvement

* Clean up Metrohash dependency, build driver with external single instance of Metrohash
* Update Vulkan headers to 1.1.119
* Expose VK_EXT_subgroup_size_control version 2
* Talos principle performance tuning
* Improve codegen for FMin and FMax
* Remove the workaround for flickering smoke in Three Kingdoms

Issue fix

* HDR test hang
* Segment fault and memory leak in memoryCache
* Semaphore handle leak
* Divergence analysis problem with atomic optimization
* VGT_STRMOUT_BUFFER_CONFIG for multiple xfb buffers in the same stream
2019-08-29 11:01:51 +08:00
jaxl
046d3913a3 Add MetroHash amd-master branch into default.xml 2019-08-23 12:09:46 +08:00
jaxl
ddb9bc059a 2019-8-14 update
New feature and improvement

* Support extension VK_EXT_subgroup_size_control
* Update Vulkan headers to 1.1.116
* Optimize atomic max/min
* Optimize fp16 code generation
* Various atomic optimizations
* Extend atomic optimizer to gfx10

Issue fix

* ICD initialization failure for system with a mix of GFX10 and other GFX versions ASICs
* Crash with AMD GPU disabled
* CTS dEQP-VK.binding_model.descriptorset_random.* - test failure
2019-08-15 14:42:35 +08:00
Your Name
53d1a7c297 2019-7-26 update
New feature and improvement
* Support extension VK_KHR_imageless_framebuffer
* Support extension VK_AMD_shader_core_properties2
* Navi10 performance tuning for Dirt4, Totalwar:WarhammerII, F1 2017

Issue fix
* Vk.api.device_init.create_instance_device_intentional_alloc_fail test failure
* Smoke flickering in game Three Kingdoms
* Incorrect behavior with OpImageSampleExplicitLod + Lod|Offset
2019-07-29 03:24:11 -04:00
Jacob He
4f7b640d4f 2019-7-15 update
New feature and improvement
* Add Navi10 support
* Enable shader writes for alphaToCoverageEnable when attachment is set to VK_ATTACHMENT_UNUSED
* Change reported driver UUID from pal major/minor version (and timestamp on Linux) to AMD-LINUX-DRV

Issue fix
* vkGetShaderInfoAMD cannot get first shader stage disassembly code
2019-07-15 21:54:27 -04:00
Jacob He
bff1e718f0 2019-6-30 update
New feature and improvement
* Add HDR10 support for direct display mode, enable EXT_HDR_METADATA
* Expose VK_EXT_display_surface_counter by default

Issue fix
* MGPU app crashes when disable panel setting useSharedCmdAllocator
* Reading wrong slice from mip 1 when a 2d array of 9 slices, 2 mips is viewed as cubemap
* CTS test failure with dEQP-VK.wsi.*.surface.query_devgroup_present_modes, dEQP-VK.memory_model.*  and dEQP-VK.clipping.user_defined.clip_cull_distance*fragmentshader_read
* Access violation when destroying Image objects
* Crashes with two different ASICs installed
* RGP test failure
2019-07-02 00:13:21 -04:00
Jacob He
193a7883a1 2019-6-6 update
New feature and improvement
* Update SDK headers to 1.1.109
* Enable Scratch Bounds Checking for GFX9
* VK_MEMORY_OVERALLOCATION_BEHAVIOR_ALLOWED_AMD: add support for the attachment image Vulkan memory type
* Optimizations for acquire-release barrier

Issue fix
* F1 2018 hang on gfx9
* Some acquire-release barrier issues
* World War Z - black patches corruption
2019-06-10 00:27:27 -04:00
jacob he
8d085a79d4 2019-5-24 update
New feature and improvement
* Enable VK_EXT_host_query_reset extension
* Enable VK_EXT_separate_stencil_usage  extension
* Enable VK_KHR_uniform_buffer_standard_layout extension
* Reduce optimizations run by llpc
* Support merge ELF binary for per stage cache and enable per-stage-shader cache
* Remove ETC2 support from GFXIP 8.1
* Update api_version to 1.1.108

Issue fix
* DiRT Rally 2.0 corruption
* DiRT 4 flickering
* Block corruption in Witcher3
* DiRT Rally 2.0  hang with ultra settings
* dEQP-VK.binding_model.descriptorset_random.sets32.unifindexed.ubolimithigh.sbolimitlow.imglimithigh.noiub.frag.0 failure
2019-05-27 06:17:26 -04:00
Jacob He
5475d7dfda 2019-5-12 update
New feature and improvement
* LLPC compile time optimization
* Update Vulkan header to 1.1.106
* Enable shader cache to disk for Dawn of War3
* Performance tuning for game Thrones of Britannia
* Add option to use GpuHeapGartUswc for the pipeline uploader
2019-05-13 04:07:27 -04:00
Jacob He
1dd300ae8d 2019-4-21 update
New feature and improvement
* Performance optimization for Total:WarhammerII, Talos principle and Thrones of Britannia

Issue fix
* Corruption in multiple dxvk games
* CPU mapping problem for memory shared cross device
* Thrones of Britannia tearing regression
2019-04-22 03:51:16 -04:00
Jacob He
258c2ba810 2019-4-10 update
New feature and improvement
* Update Vulkan headers to 1.1.105
* Allow shared memory to be CPU visible
* Enable priority regardless of whether VK_EXT_memory_priority is enabled or not for all external queues
* Performance optimization for Total:WarhammerII

Issue fix
* Just Cause3 flickering
* Path of Exile incorrect rendering
2019-04-11 22:46:08 -04:00
shamu
3160bb61cc 2019-3-26 update
New feature and improvement
* Enable VK_EXT_memory_priority extension
* Enable VK_EXT_memory_budget extension
* Enable skip fast clear eliminate optimization by default
* Reduce unnecessary L2 cache actions on GFX9
* Update pipeline ABI metadata note ID to 32 (from 13) to match HSA code objects
* Improve CPU-bound performance in Mad Max

Issue fix
* Abnormal timestamp on transfer queue
* ShaderDbg bugs
* Minor profiling validation and memory leak fixes
* Dead loop in converting a FP16 denorm back to normalized FP32
2019-03-25 22:58:43 -04:00
Jacob He
735d204365 2019-3-14 update
New feature and improvement
* Enable below extensions:
   - VK_KHR_vulkan_memory_model
   - VK_EXT_depth_clip_enable
   - VK_KHR_depth_stencil_resolve
   - VK_KHR_shader_float16_int8
   - VK_EXT_debug_utils extension
   - VK_EXT_transform_feedback
* Fall back to the internal shader cache for the case of VkPipelineCache miss

Issue fix
* dEQP-VK.spirv_assembly.instruction.graphics.float16.derivative_*  and dEQP-VK.spirv_assembly.instruction*.float16.arithmetic_*.atan2* test failure
* A bug in LLPC ShaderCache Merge function which causes Dota2 stuttering and performance drop after recent game update
* Dynamic loop unroll crash
* Rise of the Tomb Rider game crash with gcc 7 build
* Witcher3-dxvk hang after loading screen
2019-03-15 01:02:54 -04:00
Jacob He
c59b9988e4 2019-3-1 update
New feature and improvement
* [TransformFeedback] Fix the register setting for RasterizationStreamSelect capablitity
* [VK_KHR_shader_float16_int8] handling of OpVectorInsertDynamic
* Add supports for min and max stencil resolve using compute pipeline

Issue fix
* Memory leak in CmdBufferLogger
* vkmark corruption observed on Fiji + wayland
* Corruption issue caused by  DPP operand order in atomic optimizer
2019-03-04 03:19:14 -05:00
Jacob He
27ef34e957 2019-2-25 update
New feature and improvement
* [RGP]Add functionality to enable instruction-level trace per-pipeline
* Add debug support to the CmdBufferLogger layer that allows for "single-stepping" of draws/dispatches

Issue fix
* Vulkan CTS OOM test case crashes and a wide range of many other potential PAL_NEW alloc failure crashes
* Broken path of offchip GS
* Transformfeedback multi-stream CTS failure on gfxip 8 and gfxip 9
* PRT issues with SDMA on gfxip 6-8
2019-02-26 21:14:14 +08:00
Jacob He
20a62e872f 2019-2-1 update
New feature and improvement
* Update Vulkan Headers to 1.1.97
* Enable the extensions under development (VK_EXT_DEBUG_UTILS, VK_KHR_SHADER_FLOAT16_INT8,VK_ EXT_TRANSFORM_FEEDBACK) through the environment variable AMDVLK_ENABLE_DEVELOPING_EXT
* Add proper handling of allocation failures

Issue fix
* Support for ECC GPR protection on Vega20
* A memory priority issue
* GFX7 indirect dispatch does not work correctly when submitted to a compute queue
2019-02-01 09:28:33 -05:00
Jacob He
fbfa56c614 Release notes:
2019-1-29 update
New feature and improvement
* Add Vega20 and Raven2 support
* Move VK_EXT_swapchain_colorspace to instance extension
* Report CPU clock speed to RGP

Issue fix
* Bugs in Release/acquire-based barrier
* Top pipelines identified by timeReport.py is mismatched with the dumped pipelines
2019-01-29 21:12:41 -05:00
Jacob He
96f3ad778d Release notes
2019-1-24 update
New feature and improvement
* Enable EXT_INLINE_UNIFORM_BLOCK extension
* Enable EXT_PCI_BUS_INFO extension
* Enable inclusion of llvm-ir in section of ELF output
* Add barrier filtering

Issue fix
* CTS v1.1.2.2 failure in ./deqp-vk -n dEQP-VK.binding_model.descriptorset_random.sets32.noarray.ubolimitlow
* Wrong reported LDS size per threadgroup for GFX7+
2019-01-25 18:47:17 +08:00
Jacob He
db64ac276c 2019-1-15 update
New feature and improvement
* Expose YUV planes, allow applications to implement their own color conversion accessing each YUV planes
* Do not include symbols while building release driver
* Change the default WgpMode from wgp to cu

Issue fix
* Random VM fault caused by that the image descriptor and the fmask descriptor contain the same lower virtual address (as designed) but use different offsets in the suballocation
* dxvk ELEX  corruption issue
* Performance regression introduced by changes which added support for the LOAD_INDEX path for handling pipeline binds
* A number of issues affecting many-core CPUs like Threadripper and Epyc
2019-01-15 21:46:31 -05:00
Jacob He
9b0dacc35f 2019-1-7 update
New feature and improvement
* Update Vulkan Headers to 1.1.96
* Add pipeline optimizer key to pipeline dump file
* Optimization for fully overwritten resolve
* Add atomicOp support for the variable pointer

Issue fix
* A performance regression with the Talos Principle
* A potential access violation
* A RGP regression
* Multi-process failure
2019-01-08 12:58:09 +08:00
Jacob He
63811556d5 2018-12-12 update
New feature and improvement
* Merge tbuffer.store.i32 with tbuffer.store.v2i32 and tbuffer.store.v4i32, up to 5% performance gain for tessellation
* Enable the atomic optimizer
* Update Vulkan headers to 1.1.94
* Enable primitiveUnderestimation capability of VK_EXT_conservative_rasterization

Issue fix
* GPU hang when Vulkan API accesses the stencil aspect of VK_Format_S8_Uint image explicitly
* CTS  dEQP-VK.spirv_assembly.instruction.graphics.16bit_storage.struct_mixed_types.uniform_geom failure
2018-12-12 20:25:00 +08:00
Jacob He
8477e2ece9 Release note:
2018-12-4 update
New feature and improvement
* Enable VK_EXT_scalar_block_layout extension
* Enable VK_KHR_swapchain_mutable_format extension
* Enable on-chip GSVS ring on GFX9, seeing up to 10% performance gain
* Add dccBitsPerPixelThreshold setting
* Re-enable tracing of missing registers in thread trace
* Remove some useless settings

Issue fix
* Undefined reference to vk::OverrideDispatchTable_ND(vk::DispatchTable*)
* Errors with multiple monitors and direct display
* dEQP-VK.api.device_init.create_instance_device_intentional_alloc_fail test failure
* Crash issue on Raven caused by the implementation of VK_AMD_memory_overallocation_behavior extension
* dEQP-VK.glsl.atomic_operations.*_*signed64bit_* failure on gfx7
* ELEX crash with Steam Proton
* CopyTextureRegion failure when copying a MSAA Resource in D32_FLOAT or D16_UNORM to a R32_UINT format
* GFX9: CB_DCC_CONTROL register is programmed incorrectly
* Perf counter instance incorrect calculation
2018-12-04 21:11:01 +08:00
Jacob He
959e071f20 The short commit hash doesn't work for manifest, fix it with the long
hash.
2018-11-30 12:22:53 +08:00
Jacob He
376d2b2314 Release notes:
2018-11-19 update
* Enable sparse support by default
* Enable VK_AMD_memory_overallocation_behavior extension
* Enable degenerate triangles for conservative rasterizations
* Add setting for exiting after compilation failure
* Add setting CsCuEnLimitMask for limiting CUs enabled for compute shaders
* Add the ability to dump debug stack traces
* Set default -log-file-dbgs to "" (meaning stderr): this brings amdllpc into line with other LLVM tools

* Can't clear a mutable format image
* CTS memory.pipeline_barrier.transfer_dst_storage_image tests fail
* MGPU asserts when creating graphics pipeline
* Wrong target device ID passed to PAL
* Dual source bend is being enabled when blending is disabled
* Regression dEQP-VK.wsi.wayland.swapchain.create.min_image_count test crashes
* Dependency on x11/xcb libraries even there is no x11/xcb surface created
* Assert in dEQP-VK.binding_model.descriptorset_random.*
2018-11-20 13:37:46 +08:00
Jacob He
ea01a8d323 Release v-18.3.1 2018-11-12 13:17:49 +08:00
Jacob He
96a0ad8796 Update manifests: Add spvgen 2018-10-23 12:52:05 +08:00
Jacob He
dbb9dda34e Update manifests: LLPC is an independent module now. 2018-07-02 16:49:13 +08:00
Jacob He
425d2cee10 Initial drop of AMDVLK, commit:b6c6f24 2017-12-22 11:34:01 +08:00