Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Merge pull request #12756 from liamwhite/applet-multiprocess-hwc | Narr the Reg | 2024-02-12 | 12 | -33/+35 |
|\ | | | | | general: applet multiprocess | ||||
| * | nvnflinger/gpu: implement blending | Liam | 2024-02-09 | 12 | -33/+35 |
| | | |||||
* | | host_shaders: add vendor workaround for adreno drivers | Liam | 2024-02-10 | 1 | -6/+4 |
|/ | |||||
* | renderer_vulkan: implement layer stack composition | Liam | 2024-01-31 | 7 | -40/+34 |
| | |||||
* | renderer_vulkan: convert FSR to graphics pipeline | Liam | 2024-01-31 | 7 | -28/+37 |
| | |||||
* | renderer_opengl: split out FXAA | Liam | 2024-01-31 | 1 | -2/+2 |
| | |||||
* | Fix typos in video_core | Viktor Szépe | 2024-01-07 | 1 | -9/+9 |
| | |||||
* | Merge pull request #11795 from Squall-Leonhart/D32FToOther | liamwhite | 2023-10-18 | 3 | -1/+17 |
|\ | | | | | [Vulkan]Implement missing copy formats for D32, ARGB8_SRGB and BGRA8_Unorm/SRGB | ||||
| * | Changes based on hardware tests | Squall-Leonhart | 2023-10-16 | 3 | -22/+3 |
| | | | | | | | | | | | | Removes unnecessary d32f to bgra shader and blit functions, update vk_texture_cache to use abgr shader for d32f to BGRA formats updates abgr to d32f shader to comply with hardware tests | ||||
| * | added missing trailing line. | Squall Leonhart | 2023-10-15 | 1 | -1/+1 |
| | | |||||
| * | use texelfetch instead of texturelod | Squall-Leonhart | 2023-10-15 | 2 | -2/+2 |
| | | |||||
| * | Another missing copy connected to Bravely Default II | Squall-Leonhart | 2023-10-15 | 2 | -0/+19 |
| | | | | | | | | adds blit_image_helper.ConvertABGR8ToD32F and fragment shader for performing ABGR and BGRA to D32F copies | ||||
| * | Implement missing formats for Bravely Default 2 | Squall-Leonhart | 2023-10-15 | 2 | -0/+16 |
| | | |||||
* | | Merge pull request #11349 from vonchenplus/buffer_cache_crash | liamwhite | 2023-10-17 | 2 | -8/+8 |
|\ \ | |/ |/| | video_core: Fix moltenvk crash on macos | ||||
| * | video_core: Fix d24r8/s8d24 convert shader build error in moltenvk | Feng Chen | 2023-09-07 | 2 | -8/+8 |
| | | |||||
* | | update shader to confirmed format copy | Squall Leonhart | 2023-10-07 | 1 | -1/+1 |
| | | |||||
* | | lets not convert depth to greyscale since this makes the exhaust and tire smoke light gray/white | Squall-Leonhart | 2023-10-04 | 1 | -1/+1 |
| | | | | | | | | tiresmoke should be a darker gray. | ||||
* | | Implements D32_Float to A8B8G8R8_UNORM format copy | Squall-Leonhart | 2023-10-04 | 2 | -0/+15 |
| | | | | | | | | Corrects some visual issues in games such as Disney SpeedStorm | ||||
* | | host_shaders: More proper handling of x2 MSAA copies | GPUCode | 2023-09-25 | 2 | -4/+10 |
| | | |||||
* | | Query Cache: Fix Prefix Sums | Fernando Sahmkow | 2023-09-23 | 2 | -82/+109 |
| | | |||||
* | | Query Cache: Simplify Prefix Sum compute shader | Fernando Sahmkow | 2023-09-23 | 3 | -65/+227 |
| | | |||||
* | | Query Cache: Implement host side sample counting. | Fernando Sahmkow | 2023-09-23 | 2 | -0/+125 |
| | | |||||
* | | Query Cachge: Fully rework Vulkan's query cache | Fernando Sahmkow | 2023-09-23 | 2 | -0/+21 |
|/ | |||||
* | Merge pull request #11149 from ameerj/astc-perf-prod | liamwhite | 2023-08-21 | 1 | -535/+453 |
|\ | | | | | host_shaders: ASTC compute shader optimizations | ||||
| * | flatten color_values | Ameer J | 2023-08-10 | 1 | -14/+9 |
| | | |||||
| * | flatten encoding_values | Ameer J | 2023-08-10 | 1 | -11/+10 |
| | | |||||
| * | flatten result vector | Ameer J | 2023-08-10 | 1 | -14/+5 |
| | | |||||
| * | GetUnquantizedWeightVector | Ameer J | 2023-08-09 | 1 | -69/+63 |
| | | |||||
| * | Compute Replicate | Ameer J | 2023-08-06 | 1 | -85/+20 |
| | | |||||
| * | minor | Ameer J | 2023-08-06 | 1 | -12/+6 |
| | | |||||
| * | undo uint | Ameer J | 2023-08-06 | 1 | -3/+3 |
| | | |||||
| * | Revert "vulkan dims specialization" | Ameer J | 2023-08-06 | 3 | -103/+16 |
| | | | | | | | | This reverts commit e6243058f2269bd79ac8479d58e55feec2611e9d. | ||||
| * | vulkan dims specialization | ameerj | 2023-08-06 | 3 | -16/+103 |
| | | |||||
| * | small_block opt | Ameer J | 2023-08-06 | 1 | -4/+3 |
| | | |||||
| * | remove TexelWeightParams | Ameer J | 2023-08-06 | 1 | -46/+31 |
| | | |||||
| * | error/void extent funcs | Ameer J | 2023-08-06 | 1 | -48/+43 |
| | | |||||
| * | more packing | Ameer J | 2023-08-06 | 1 | -109/+109 |
| | | |||||
| * | Revert "uint result index" | Ameer J | 2023-08-06 | 1 | -1/+1 |
| | | | | | | | | This reverts commit 0e978786b5a8e7382005d8b1e16cfa12f3eeb775. | ||||
| * | Revert "bfe instead of mod" | Ameer J | 2023-08-06 | 1 | -15/+13 |
| | | | | | | | | This reverts commit 86006a3b09e8a8c17d2ade61be76736a79e3f58a. | ||||
| * | Revert "global endpoints" | Ameer J | 2023-08-06 | 1 | -36/+40 |
| | | | | | | | | This reverts commit d8f5bfd1df2b7469ef6abcee182aa110602d1751. | ||||
| * | global endpoints | Ameer J | 2023-08-06 | 1 | -40/+36 |
| | | |||||
| * | bfe instead of mod | Ameer J | 2023-08-06 | 1 | -13/+15 |
| | | |||||
| * | uint result index | Ameer J | 2023-08-06 | 1 | -1/+1 |
| | | |||||
| * | amd opts | Ameer J | 2023-08-06 | 1 | -16/+13 |
| | | |||||
| * | const, pack result_vector and replicate tables, | Ameer J | 2023-08-06 | 1 | -227/+260 |
| | | | | | | | | undo amd opts | ||||
| * | minor redundancy cleanup | Ameer J | 2023-08-06 | 1 | -12/+2 |
| | | |||||
| * | extractbits robustness | Ameer J | 2023-08-06 | 1 | -5/+8 |
| | | |||||
| * | reuse vectors memory | Ameer J | 2023-08-06 | 1 | -33/+17 |
| | | |||||
| * | EncodingData pack | Ameer J | 2023-08-06 | 1 | -44/+69 |
| | | |||||
| * | flattening | Ameer J | 2023-08-06 | 1 | -43/+44 |
| | | |||||
| * | weights refactor | Ameer J | 2023-08-06 | 1 | -26/+22 |
| | | |||||
| * | params.max_weight | Ameer J | 2023-08-06 | 1 | -5/+2 |
| | | |||||
| * | skip bits | Ameer J | 2023-08-06 | 1 | -9/+14 |
| | | |||||
| * | restrict | Ameer J | 2023-08-06 | 1 | -2/+2 |
| | | |||||
* | | Masked depthstencil clears | Kelebek1 | 2023-08-19 | 2 | -0/+13 |
|/ | |||||
* | OpenGL: Add Local Memory warmup shader | ameerj | 2023-06-26 | 2 | -0/+48 |
| | |||||
* | general: fix spelling mistakes | Liam | 2023-03-12 | 2 | -2/+2 |
| | |||||
* | texture_cache: OpenGL: Implement MSAA uploads and copies | ameerj | 2023-02-11 | 3 | -0/+61 |
| | |||||
* | video_core: Implement vulkan clear specified channel | FengChen | 2023-01-28 | 3 | -0/+26 |
| | |||||
* | video_core/opengl: Add FSR upscaling filter to the OpenGL renderer | Wollnashorn | 2023-01-26 | 4 | -2/+154 |
| | |||||
* | Merge pull request #9556 from vonchenplus/draw_texture | liamwhite | 2023-01-19 | 3 | -4/+11 |
|\ | | | | | video_core: Implement maxwell3d draw texture method | ||||
| * | Address feedback | Feng Chen | 2023-01-16 | 1 | -1/+1 |
| | | |||||
| * | video_core: Implement opengl/vulkan draw_texture | Feng Chen | 2023-01-05 | 3 | -18/+12 |
| | | |||||
| * | video_core: Implement maxwell3d draw texture method | Feng Chen | 2023-01-05 | 1 | -0/+13 |
| | | |||||
* | | vulkan: implement 'turbo mode' clock booster | Liam | 2023-01-05 | 2 | -0/+30 |
|/ | |||||
* | video_core: Implement vulkan QuadStrip topology | FengChen | 2022-12-26 | 1 | -2/+4 |
| | |||||
* | video_core: Integrate SMAA | Liam | 2022-12-08 | 9 | -5/+1564 |
| | | | | | Co-authored-by: goldenx86 <goldenx86@users.noreply.github.com> Co-authored-by: BreadFish64 <breadfish64@users.noreply.github.com> | ||||
* | video_core: Modify astc texture decode error fill value | FengChen | 2022-09-15 | 1 | -1/+1 |
| | |||||
* | chore: make yuzu REUSE compliant | Andrea Pappacoda | 2022-07-27 | 5 | -0/+15 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | [REUSE] is a specification that aims at making file copyright information consistent, so that it can be both human and machine readable. It basically requires that all files have a header containing copyright and licensing information. When this isn't possible, like when dealing with binary assets, generated files or embedded third-party dependencies, it is permitted to insert copyright information in the `.reuse/dep5` file. Oh, and it also requires that all the licenses used in the project are present in the `LICENSES` folder, that's why the diff is so huge. This can be done automatically with `reuse download --all`. The `reuse` tool also contains a handy subcommand that analyzes the project and tells whether or not the project is (still) compliant, `reuse lint`. Following REUSE has a few advantages over the current approach: - Copyright information is easy to access for users / downstream - Files like `dist/license.md` do not need to exist anymore, as `.reuse/dep5` is used instead - `reuse lint` makes it easy to ensure that copyright information of files like binary assets / images is always accurate and up to date To add copyright information of files that didn't have it I looked up who committed what and when, for each file. As yuzu contributors do not have to sign a CLA or similar I couldn't assume that copyright ownership was of the "yuzu Emulator Project", so I used the name and/or email of the commit author instead. [REUSE]: https://reuse.software Follow-up to 01cf05bc75b1e47beb08937439f3ed9339e7b254 | ||||
* | chore: add missing SPDX tags | Andrea Pappacoda | 2022-04-28 | 1 | -21/+2 |
| | | | | Follow-up to 99ceb03a1cfcf35968cab589ea188a8c406cda52 | ||||
* | general: Convert source file copyright comments over to SPDX | Morph | 2022-04-23 | 29 | -87/+58 |
| | | | | | This formats all copyright comments according to SPDX formatting guidelines. Additionally, this resolves the remaining GPLv2 only licensed files by relicensing them to GPLv2.0-or-later. | ||||
* | OpenGL: fix S8D24 to ABGR8 conversions | Liam | 2022-04-07 | 2 | -0/+19 |
| | |||||
* | Address review comments | Liam | 2022-03-16 | 2 | -2/+2 |
| | |||||
* | Vulkan: convert S8D24 <-> ABGR8 | Liam | 2022-03-16 | 2 | -0/+24 |
| | |||||
* | astc_decoder: Combine FastReplicate functions to work around new NV driver bug | ameerj | 2022-01-16 | 1 | -34/+46 |
| | | | | | | The new Nvidia drivers have a bug where the FastReplicateTo6 function produces a lookup into the REPLICATE_TO_8 table rather than the REPLICATE_TO_6 table. This seems to be an optimization gone wrong. Combining the logic of the FastReplicate functions seems to address the bug. | ||||
* | Texture Cache: Correct conversion shaders. | Fernando Sahmkow | 2021-11-22 | 2 | -2/+2 |
| | |||||
* | TextureCache: Simplify blitting of D24S8 formats and fix bugs. | Fernando Sahmkow | 2021-11-22 | 5 | -104/+0 |
| | |||||
* | HostShaders: Fix D24S8 convertion shaders. | Fernando Sahmkow | 2021-11-21 | 6 | -23/+47 |
| | |||||
* | TextureCache: Assure full conversions on depth/stencil write shaders. | Fernando Sahmkow | 2021-11-20 | 3 | -6/+6 |
| | |||||
* | TextureCache: Add R16G16 to D24S8 converter. | Fernando Sahmkow | 2021-11-20 | 2 | -0/+19 |
| | |||||
* | TextureCache: Add B10G11R11 to D24S8 converter. | Fernando Sahmkow | 2021-11-19 | 2 | -0/+20 |
| | |||||
* | TextureCache: Implement additional D24S8 convertions. | Fernando Sahmkow | 2021-11-19 | 3 | -0/+44 |
| | |||||
* | Vulkan: implement D24S8 <-> RGBA8 convertions. | Fernando Sahmkow | 2021-11-19 | 3 | -0/+40 |
| | |||||
* | HostShader: fix Gaussian filter. | FernandoS27 | 2021-11-16 | 1 | -2/+2 |
| | |||||
* | host_shaders: Misc copyright/style changes | ameerj | 2021-11-16 | 4 | -10/+12 |
| | |||||
* | Presentation: Only use FP16 in scaling shaders on supported devices in Vulkan | Marshall Mohror | 2021-11-16 | 9 | -95/+147 |
| | |||||
* | HostShader: Fix gaussian and add attribution. | Fernando Sahmkow | 2021-11-16 | 1 | -23/+19 |
| | |||||
* | Vulkan: Implement FXAA | FernandoS27 | 2021-11-16 | 1 | -1/+1 |
| | |||||
* | OpenGL: Implement FXAA | Marshall Mohror | 2021-11-16 | 3 | -0/+114 |
| | |||||
* | VideoCore: Add gaussian filtering. | FernandoS27 | 2021-11-16 | 2 | -0/+75 |
| | |||||
* | Update scaleforce to use FP16 | Marshall Mohror | 2021-11-16 | 1 | -88/+55 |
| | |||||
* | vulkan: Implement FidelityFX Super Resolution | Marshall Mohror | 2021-11-16 | 4 | -2/+155 |
| | |||||
* | Renderers: Unify post processing filter shaders | ameerj | 2021-11-16 | 5 | -202/+28 |
| | |||||
* | Renderer: Implement Bicubic and ScaleForce filters. | Fernando Sahmkow | 2021-11-16 | 5 | -0/+388 |
| | |||||
* | host_shaders: Remove opengl_copy_bgra.comp | ameerj | 2021-09-17 | 2 | -16/+0 |
| | |||||
* | astc_decoder: Reduce workgroup size | ameerj | 2021-08-01 | 1 | -1/+1 |
| | | | | This reduces the amount of over dispatching when there are odd dimensions (i.e. ASTC 8x5), which rarely evenly divide into 32x32. | ||||
* | astc_decoder: Compute offset swizzles in-shader | ameerj | 2021-08-01 | 1 | -33/+13 |
| | | | | Alleviates the dependency on the swizzle table and a uniform which is constant for all ASTC texture sizes. | ||||
* | astc_decoder: Make use of uvec4 for payload data | ameerj | 2021-08-01 | 1 | -79/+43 |
| | |||||
* | astc_decoder: Simplify Select2DPartition | ameerj | 2021-08-01 | 1 | -38/+19 |
| | |||||
* | astc_decoder: Optimize the use EncodingData | ameerj | 2021-08-01 | 1 | -25/+25 |
| | | | | | | | This buffer was a list of EncodingData structures sorted by their bit length, with some duplication from the cpu decoder implementation. We can take advantage of its sorted property to optimize its usage in the shader. Thanks to wwylele for the optimization idea. | ||||
* | Merge pull request #6459 from lat9nq/ubuntu-fixes | Ameer J | 2021-07-01 | 1 | -1/+4 |
|\ | | | | | cmake: Improve Linux dependency checking for externals | ||||
| * | cmake: Fix find_program usage for 3.15 | lat9nq | 2021-06-13 | 1 | -1/+4 |
| | | | | | | | | | | | | | | | | | | | | yuzu requires CMake 3.15 yet find_program was using REQUIRED, which is only available on 3.18 and later. Instead, we check for "<VAR>-NOTFOUND". In addition, check for additional requirements before building libusb or FFmpeg with autotools. Otherwise, CMake configuration will pass yet compilation will fail. | ||||
* | | astc_decoder.comp: Remove unnecessary LUT SSBOs | ameerj | 2021-06-19 | 1 | -19/+16 |
| | | | | | | | | We can move them to instead be compile time constants within the shader. | ||||
* | | astc: Various robustness enhancements for the gpu decoder | ameerj | 2021-06-19 | 1 | -9/+6 |
| | | | | | | | | | | | | These changes should help in reducing crashes/drivers panics that may occur due to synchronization issues between the shader completion and later access of the decoded texture. | ||||
* | | astc_decoder: Fix LDR CEM1 endpoint calculation | ameerj | 2021-06-16 | 1 | -1/+1 |
|/ | | | | | | | | Per the spec, L1 is clamped to the value 0xff if it is greater than 0xff. An oversight caused us to take the maximum of L1 and 0xff, rather than the minimum. Huge thanks to wwylele for finding this. Co-Authored-By: Weiyi Wang <wwylele@gmail.com> | ||||
* | astc_decoder: Refactor for style and more efficient memory use | ameerj | 2021-03-25 | 1 | -262/+307 |
| | |||||
* | astc_decoder: Reimplement Layers | Rodrigo Locatti | 2021-03-13 | 1 | -18/+15 |
| | | | | Reimplements the approach to decoding layers in the compute shader. Fixes multilayer astc decoding when using Vulkan. | ||||
* | astc_decoder: Fix out of bounds memory access | ameerj | 2021-03-13 | 1 | -2/+10 |
| | | | | resolves a crash with some anamolous textures found in Astral Chain. | ||||
* | renderer_vulkan: Accelerate ASTC decoding | ameerj | 2021-03-13 | 1 | -21/+22 |
| | | | | Co-Authored-By: Rodrigo Locatti <reinuseslisp@airmail.cc> | ||||
* | host_shaders: Modify shader cmake integration to allow for larger shaders | ameerj | 2021-03-13 | 3 | -2/+25 |
| | | | | using a raw string to encapsulate the entire shader code limits us to shaders of size less than 2KB. This change overcomes this limitation. | ||||
* | renderer_opengl: Accelerate ASTC texture decoding with a compute shader | ameerj | 2021-03-13 | 1 | -0/+1288 |
| | | | | | | ASTC texture decoding is currently handled by a CPU decoder for GPU's without native ASTC decoding support (most desktop GPUs). This is the cause for noticeable performance degradation in titles which use the format extensively. This commit adds support to accelerate ASTC decoding using a compute shader on OpenGL for GPUs without native support. | ||||
* | renderer_opengl: Swizzle BGR textures on copy | ameerj | 2021-03-04 | 2 | -0/+16 |
| | | | | | | OpenGL does not natively support BGR internal formats, which causes many BGR textures to render incorrectly, with Red and Blue channels swapped. This commit aims to address this by swizzling the blue and red channels on texture copies when a BGR format is encountered. | ||||
* | video_core: Reimplement the buffer cache | ReinUsesLisp | 2021-02-13 | 3 | -30/+8 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Reimplement the buffer cache using cached bindings and page level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache. - Bindings are cached, allowing to skip work when the game changes few bits between draws. - OpenGL Assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers, instead GL_EXT_memory_object is used to alias sub-buffers within the same allocation. - OpenGL Assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData. - A new OpenGL stream buffer is implemented based on fences for drivers that are not Nvidia's proprietary, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (that some games use a lot). - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecesarry work. This commit adds the necessary infrastructure to use Vulkan object from OpenGL. Overall, it improves performance and fixes some bugs present on the old cache. There are still some edge cases hit by some games that harm performance on some vendors, this are planned to be fixed in later commits. | ||||
* | video_core: host_shaders: Don't pass --quiet to glslangValidator if unavailable | lat9nq | 2021-02-02 | 1 | -1/+19 |
| | | | | | Prevents CMake from calling `glslangValidator` with `--quiet` when it is not available, i.e. on older downstream versions from Ubuntu. | ||||
* | host_shaders/cmake: Pass --quiet to glslang to keep it quiet | ReinUsesLisp | 2021-01-24 | 1 | -1/+1 |
| | | | | Silences noisy builds on toolchains. | ||||
* | host_shaders: Add Vulkan assembler compute shaders | ReinUsesLisp | 2020-12-30 | 4 | -0/+96 |
| | |||||
* | host_shaders: Add helper to blit depth stencil fragment shader | ReinUsesLisp | 2020-12-30 | 2 | -0/+17 |
| | |||||
* | host_shaders: Add texture color blit fragment shader | ReinUsesLisp | 2020-12-30 | 2 | -0/+15 |
| | |||||
* | host_shaders: Add shaders to present to the swapchain | ReinUsesLisp | 2020-12-30 | 3 | -0/+36 |
| | |||||
* | host_shaders: Add shaders to convert between depth and color images | ReinUsesLisp | 2020-12-30 | 3 | -0/+28 |
| | |||||
* | host_shaders: Add compute shader to copy BC4 as RG32UI to RGBA8 | ReinUsesLisp | 2020-12-30 | 2 | -0/+71 |
| | |||||
* | host_shaders: Add shader to render a full screen triangle | ReinUsesLisp | 2020-12-30 | 2 | -0/+30 |
| | |||||
* | host_shaders: Add pitch linear upload compute shader | ReinUsesLisp | 2020-12-30 | 2 | -0/+87 |
| | |||||
* | host_shaders: Add block linear upload compute shaders | ReinUsesLisp | 2020-12-30 | 3 | -0/+249 |
| | |||||
* | host_shaders: Add copyright headers to OpenGL present shaders | ReinUsesLisp | 2020-12-30 | 2 | -0/+8 |
| | |||||
* | video_core/host_shaders: Add support for prebuilt SPIR-V shaders | ReinUsesLisp | 2020-12-30 | 1 | -16/+37 |
| | | | | | Add support for building SPIR-V shaders from GLSL and generating headers to include the text of those same GLSL shaders to consume from OpenGL. | ||||
* | video_core: Fix instances where msbuild always regenerated host shaders | ReinUsesLisp | 2020-09-24 | 2 | -12/+7 |
| | | | | | | When HEADER_GENERATOR was included in the DEPENDS section of custom commands, msbuild assumed this was always modified. Changing this file is not common so we can remove it from there. | ||||
* | video_core/host_shaders: Add CMake integration for string shaders | ReinUsesLisp | 2020-08-24 | 5 | -0/+97 |
Add the necessary CMake code to copy the contents in a string source shader (GLSL or GLASM) to a header file then consumed by video_core files. This allows editting GLSL in its own files without having to maintain them in source files. For now, only OpenGL presentation shaders are moved, but we can add GLASM presentation shaders and static SPIR-V generation through glslangValidator in the future. |