path: root/src/video_core/renderer_opengl/gl_device.h
Commit message | Author | Age | Files | Lines
* general: Convert source file copyright comments over to SPDX (Morph, 2022-04-23; 1 file, -3/+2)
  This formats all copyright comments according to SPDX formatting guidelines. Additionally, this resolves the remaining GPLv2-only licensed files by relicensing them to GPLv2.0-or-later.
* GC: Address Feedback. (Fernando Sahmkow, 2022-03-25; 1 file, -0/+7)
* glsl: Add boolean reference workaround (ameerj, 2021-12-30; 1 file, -0/+5)
* glsl_context_get_set: Add alternative cbuf type for broken drivers (ameerj, 2021-12-30; 1 file, -0/+5)
  Some drivers have a bug when bitwise-converting floating-point cbuf values to uint variables. This adds a workaround for those drivers that makes all cbufs uint and converts them to floating point as needed.
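The workaround above keeps cbuf data as uint and reinterprets it on demand, in the spirit of GLSL's uintBitsToFloat/floatBitsToUint. A minimal host-side sketch of that bit reinterpretation (helper names here are illustrative, not yuzu's actual API):

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Reinterpret a raw uint cbuf value as float, bit for bit, the way
// uintBitsToFloat does in GLSL. memcpy avoids undefined behavior from
// pointer-cast type punning.
inline float utof(std::uint32_t raw) {
    float result;
    std::memcpy(&result, &raw, sizeof(result));
    return result;
}

// The inverse direction, mirroring GLSL's floatBitsToUint.
inline std::uint32_t ftou(float value) {
    std::uint32_t raw;
    std::memcpy(&raw, &value, sizeof(raw));
    return raw;
}
```

The round trip is lossless, which is what makes storing all cbufs as uint viable.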
* structured_control_flow: Conditionally invoke demote reorder pass (ameerj, 2021-08-30; 1 file, -0/+4)
  This is only needed on select drivers when a fragment shader discards/demotes.
* video_core: Enable GL SPIR-V shaders (lat9nq, 2021-07-23; 1 file, -0/+11)
* glasm: Add passthrough geometry shader support (ReinUsesLisp, 2021-07-23; 1 file, -0/+5)
* shader: Unify shader stage types (ReinUsesLisp, 2021-07-23; 1 file, -5/+6)
* shader: Emulate 64-bit integers when not supported (ReinUsesLisp, 2021-07-23; 1 file, -0/+5)
  Useful for mobile and Intel Xe devices.
* glsl: Address rest of feedback (ameerj, 2021-07-23; 1 file, -0/+5)
* glsl: Add stubs for sparse queries and variable aoffi when not supported (ameerj, 2021-07-23; 1 file, -0/+5)
* glsl: Implement VOTE for subgroup size potentially larger (ameerj, 2021-07-23; 1 file, -0/+5)
* glsl: Query GL Device for FP16 extension support (ameerj, 2021-07-23; 1 file, -0/+10)
* glasm: Use ARB_derivative_control conditionally (ReinUsesLisp, 2021-07-23; 1 file, -0/+5)
* glasm: Use storage buffers instead of global memory when possible (ReinUsesLisp, 2021-07-23; 1 file, -1/+5)
* shader: Initial OpenGL implementation (ReinUsesLisp, 2021-07-23; 1 file, -16/+0)
* video_core: Add GPU vendor name to window title bar (ameerj, 2021-06-21; 1 file, -0/+3)
* Implement glDepthRangeIndexeddNV (Kelebek1, 2021-02-24; 1 file, -0/+5)
* renderer_opengl: Remove interop (ReinUsesLisp, 2021-02-13; 1 file, -1/+1)
  Remove unused interop code from the OpenGL backend.
* video_core: Reimplement the buffer cache (ReinUsesLisp, 2021-02-13; 1 file, -5/+3)
  Reimplement the buffer cache using cached bindings and page-level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache.
  - Bindings are cached, allowing to skip work when the game changes few bits between draws.
  - OpenGL assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers; instead, GL_EXT_memory_object is used to alias sub-buffers within the same allocation.
  - OpenGL assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData.
  - A new OpenGL stream buffer is implemented based on fences for drivers other than Nvidia's proprietary one, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (which some games use a lot).
  - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecessary work.
  This commit adds the necessary infrastructure to use Vulkan objects from OpenGL. Overall, it improves performance and fixes some bugs present in the old cache. There are still some edge cases hit by some games that harm performance on some vendors; these are planned to be fixed in later commits.
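The page-level modification tracking mentioned above can be sketched as a bit per page recording whether the CPU has written that page since the GPU copy was last synchronized. This is an illustrative sketch under assumed parameters (4 KiB pages, fixed tracked-region size), not yuzu's actual data structure:

```cpp
#include <bitset>
#include <cassert>
#include <cstdint>

constexpr std::uint64_t PAGE_BITS = 12;    // 4 KiB pages (assumed)
constexpr std::uint64_t NUM_PAGES = 1 << 16;  // tracked region size (assumed)

struct PageTracker {
    std::bitset<NUM_PAGES> dirty;

    // Flag every page overlapped by [addr, addr + size) as modified.
    void MarkRegion(std::uint64_t addr, std::uint64_t size) {
        const std::uint64_t first = addr >> PAGE_BITS;
        const std::uint64_t last = (addr + size - 1) >> PAGE_BITS;
        for (std::uint64_t page = first; page <= last; ++page) {
            dirty.set(page);
        }
    }

    // A binding only needs to be refreshed if one of its pages is dirty.
    bool IsRegionDirty(std::uint64_t addr, std::uint64_t size) const {
        const std::uint64_t first = addr >> PAGE_BITS;
        const std::uint64_t last = (addr + size - 1) >> PAGE_BITS;
        for (std::uint64_t page = first; page <= last; ++page) {
            if (dirty.test(page)) {
                return true;
            }
        }
        return false;
    }
};
```

The payoff is the skip path: when a game changes only a few bits between draws, most bindings hit the clean case and no upload happens.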
* renderer_opengl: Avoid precompiled cache and force NV GL cache directory (ReinUsesLisp, 2021-01-21; 1 file, -0/+5)
  By setting __GL_SHADER_DISK_CACHE_PATH we can force the cache directory to be in yuzu's user directory, stopping commonly distributed malware from deleting our driver shader cache. And by setting __GL_SHADER_DISK_CACHE_SKIP_CLEANUP we can have an unbounded shader cache size. This has only been implemented on Windows, mostly because previous tests didn't seem to work on Linux.
  Disable the precompiled cache on Nvidia's driver. There's no need to hide information the driver already has in its own cache.
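The two environment variables have to be in place before the GL context is created. A minimal sketch of that setup (the cache path is illustrative; the commit describes a Windows-only implementation, which would use _putenv_s rather than the POSIX setenv shown here):

```cpp
#include <cassert>
#include <cstdlib>
#include <cstring>

// Point Nvidia's GL shader disk cache at our own directory so external
// "cache cleaner" tools can't delete it, and disable the driver's size
// cleanup so the cache is unbounded.
void ConfigureNvidiaShaderCache(const char* cache_dir) {
    setenv("__GL_SHADER_DISK_CACHE_PATH", cache_dir, /*overwrite=*/1);
    setenv("__GL_SHADER_DISK_CACHE_SKIP_CLEANUP", "1", /*overwrite=*/1);
}
```

Both variables are read by Nvidia's proprietary driver at context creation, so calling this any later has no effect.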
* gl_texture_cache: Avoid format views on Intel and AMD (ReinUsesLisp, 2021-01-04; 1 file, -0/+5)
  Intel and AMD proprietary drivers are incapable of rendering to texture views with a format different from the original texture's. Avoid creating these at a cache level. This will consume more memory, emulating them with copies.
* video_core: Rewrite the texture cache (ReinUsesLisp, 2020-12-30; 1 file, -4/+9)
  The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage. This commit aims to address those issues.
* Merge pull request #4359 from ReinUsesLisp/clamp-shared (Rodrigo Locatti, 2020-07-21; 1 file, -0/+5)
  renderer_{opengl,vulkan}: Clamp shared memory to host's limit
* renderer_{opengl,vulkan}: Clamp shared memory to host's limit (ReinUsesLisp, 2020-07-16; 1 file, -0/+5)
  This stops shaders from failing to build when they exceed the host's shared memory size limit. An error is logged.
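The clamp itself is a one-liner; the interesting part is remembering whether clamping happened so the error can be logged. A hedged sketch with illustrative names:

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Clamp a shader's declared shared memory to what the host GPU supports.
// The out-parameter tells the caller to log an error when the limit was hit.
std::uint32_t ClampSharedMemory(std::uint32_t requested_bytes,
                                std::uint32_t host_limit_bytes, bool& clamped) {
    clamped = requested_bytes > host_limit_bytes;
    return std::min(requested_bytes, host_limit_bytes);
}
```

A shader built with the clamped size may misbehave, but it compiles, which is strictly better than a hard build failure.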
* async shaders (David Marcec, 2020-07-17; 1 file, -0/+5)
* gl_device: Expose NV_vertex_buffer_unified_memory except on Turing (ReinUsesLisp, 2020-06-24; 1 file, -0/+5)
  Expose NV_vertex_buffer_unified_memory when the driver supports it. This commit adds a function to determine if a GL_RENDERER string corresponds to a Turing GPU. This is required because on Turing GPUs Nvidia's driver crashes when the buffer is marked as resident or on DeleteBuffers. Without a synchronous debug output (single-threaded driver), it's likely that the driver will crash in the first blocking call.
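Classifying a GPU from GL_RENDERER comes down to substring matching against known product-name prefixes. A sketch of that check; the prefix list below is an assumption for illustration, not yuzu's exact list:

```cpp
#include <cassert>
#include <string_view>

// Heuristically decide whether a GL_RENDERER string names a Turing GPU.
// Prefixes are illustrative examples of Turing product lines.
bool IsTuringRenderer(std::string_view renderer) {
    for (std::string_view prefix :
         {"GeForce RTX 20", "GeForce GTX 16", "TITAN RTX", "Quadro RTX"}) {
        if (renderer.find(prefix) != std::string_view::npos) {
            return true;
        }
    }
    return false;
}
```

String matching is fragile (new product names need new entries), but GL offers no better way to identify the architecture behind a driver.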
* gl_device: Check for GL_EXT_texture_shadow_lod (Morph, 2020-06-21; 1 file, -0/+5)
* gl_arb_decompiler: Implement an assembly shader decompiler (ReinUsesLisp, 2020-06-12; 1 file, -0/+5)
  Emit code compatible with NV_gpu_program5. This should emit code compatible with Fermi, but it wasn't tested on that architecture. Pascal has some issues not present on Turing GPUs.
* glsl: Squash constant buffers into a single SSBO when we hit the limit (ReinUsesLisp, 2020-06-01; 1 file, -1/+6)
  Avoids compilation errors at the cost of shader build times and runtime performance when a game hits the limit of uniform buffers we can use.
* gl_device: Enable compute shaders for Intel proprietary drivers (Morph, 2020-05-31; 1 file, -5/+0)
  Previously we were disabling compute shaders on Intel's proprietary driver due to broken compute. This has been fixed in the latest Intel drivers. Re-enable compute for Intel proprietary drivers and remove the check for broken compute.
* renderer_opengl: Add assembly program code paths (ReinUsesLisp, 2020-05-19; 1 file, -0/+5)
  Add code required to use OpenGL assembly programs based on NV_gpu_program5. Decompilation for ARB programs is intended to be added in a follow-up commit. This does **not** include ARB decompilation and it's not in a usable state.
  The intention behind assembly programs is to reduce shader stutter significantly on drivers supporting NV_gpu_program5 (and other required extensions). Currently only Nvidia's proprietary driver supports these extensions.
  Add a UI option, hidden for now, to avoid people enabling this option accidentally.
  This code path has some limitations that OpenGL compatibility doesn't have:
  - NV_shader_storage_buffer_object is limited to 16 entries for a single OpenGL context state (I don't know if this is an intended limitation, a specification issue, or I am missing something). Currently causes issues on The Legend of Zelda: Link's Awakening.
  - NV_parameter_buffer_object can't bind buffers using an offset different from zero. The workaround used is to copy to a temporary buffer (this doesn't happen often, so it's not an issue).
  On the other hand, it has the following advantages:
  - Shaders build a lot faster.
  - We have control over how floating-point rounding is done over individual instructions (SPIR-V on Vulkan can't do this).
  - Operations on shared memory can be unsigned and signed.
  - Transform feedbacks are dynamic state (not yet implemented).
  - Parameter buffers (uniform buffers) are per stage, matching NVN and hardware's behavior.
  - The API to bind and create assembly programs makes sense, unlike ARB_separate_shader_objects.
* gl_device: Detect if ASTC is reported and expose it (ReinUsesLisp, 2020-04-01; 1 file, -0/+5)
* renderer_opengl: Detect Nvidia Nsight as a debugging tool (ReinUsesLisp, 2020-03-16; 1 file, -5/+0)
  Use getenv to detect Nsight.
* gl_device: Add option to check GL_EXT_debug_tool. (bunnei, 2020-03-14; 1 file, -0/+5)
* gl_device: Deduce indexing bug from device instead of heuristic (ReinUsesLisp, 2019-11-25; 1 file, -1/+0)
  The heuristic to detect AMD's driver was not working properly since it also included Intel. Instead of using heuristics to detect it, compare the GL_VENDOR string.
* gl_rasterizer: Disable compute shaders on Intel (ReinUsesLisp, 2019-11-23; 1 file, -0/+5)
  Intel's proprietary driver enters a corrupted state when compute shaders are executed. For now, disable these.
* gl_shader_cache: Remove dynamic BaseBinding specialization (ReinUsesLisp, 2019-11-23; 1 file, -1/+20)
* gl_shader_decompiler: Add safe fallbacks when ARB_shader_ballot is not available (ReinUsesLisp, 2019-11-08; 1 file, -0/+5)
* gl_rasterizer: Upload constant buffers with glNamedBufferSubData (ReinUsesLisp, 2019-11-02; 1 file, -0/+5)
  Nvidia's OpenGL driver maps gl(Named)BufferSubData calls that meet some requirements to a fast path. This path has an extra memcpy but updates the buffer without orphaning or waiting for previous calls. It can be seen as a better model for "push constants" that can upload a whole UBO instead of 256 bytes. The requirements are established here: http://on-demand.gputechconf.com/gtc/2014/presentations/S4379-opengl-44-scene-rendering-techniques.pdf#page=24
  Instead of using the stream buffer, this commit moves constant buffer uploads to calls of glNamedBufferSubData, and from my testing it brings a performance improvement. This is disabled when the vendor is not Nvidia since it brings performance regressions.
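The commit gates the fast path on the vendor string. One way that gate could look, sketched with illustrative names (the commit only states the path is disabled for non-Nvidia vendors, not how the check is written):

```cpp
#include <cassert>
#include <string_view>

enum class UboUploadPath {
    NamedBufferSubData,  // direct glNamedBufferSubData uploads (Nvidia fast path)
    StreamBuffer,        // fall back to the existing stream buffer elsewhere
};

// Pick the constant-buffer upload strategy from the GL_VENDOR string.
UboUploadPath SelectUboUploadPath(std::string_view gl_vendor) {
    return gl_vendor == "NVIDIA Corporation" ? UboUploadPath::NamedBufferSubData
                                             : UboUploadPath::StreamBuffer;
}
```

The exact vendor string ("NVIDIA Corporation") is what Nvidia's proprietary driver reports for glGetString(GL_VENDOR) on desktop.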
* shader/image: Implement SULD and remove irrelevant code (ReinUsesLisp, 2019-09-21; 1 file, -0/+5)
  * Implement SULD as float.
  * Remove conditional declaration of GL_ARB_shader_viewport_layer_array.
* gl_device: Disable precise in fragment shaders on bugged drivers (ReinUsesLisp, 2019-09-04; 1 file, -0/+6)
* shader_ir: Implement VOTE (ReinUsesLisp, 2019-08-21; 1 file, -0/+5)
  Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here: https://developer.nvidia.com/reading-between-threads-shader-intrinsics
  Instead of using portable ARB instructions, I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau), this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated:
  * anyThreadNV(value) -> value
  * allThreadsNV(value) -> value
  * allThreadsEqualNV(value) -> true
  ballotARB, also known as "uint64_t(activeThreadsNV())", emits "VOTE.ANY Rd, PT, PT;" on nouveau's compiler. This doesn't exactly match Nvidia's code, "VOTE.ALL Rd, PT, PT;", which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter, since .ANY, .ALL, and .EQ affect the predicates (set to PT in those cases) and not the registers.
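The warp-size-one fallback in the table above reduces each vote intrinsic to a trivial GLSL expression on non-Nvidia drivers. A sketch of what that emission step might look like in the decompiler (function names are illustrative, not yuzu's actual emitters):

```cpp
#include <cassert>
#include <string>

// With a single-thread warp, "any thread" and "all threads" are just the
// thread's own value, so the emitted GLSL is the input expression verbatim.
std::string EmitVoteAny(const std::string& value) {
    return value;  // anyThreadNV(v) -> v
}

std::string EmitVoteAll(const std::string& value) {
    return value;  // allThreadsNV(v) -> v
}

// One thread always agrees with itself, so equality votes are constant true.
std::string EmitVoteEqual(const std::string&) {
    return "true";  // allThreadsEqualNV(v) -> true
}
```

Because the fallback only rewrites expressions, no extra uniforms or extensions are needed on the stubbed drivers.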
* Merge pull request #2695 from ReinUsesLisp/layer-viewport (Fernando Sahmkow, 2019-07-15; 1 file, -0/+5)
  gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders
* gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders (ReinUsesLisp, 2019-07-08; 1 file, -0/+5)
  This commit implements gl_ViewportIndex and gl_Layer in vertex and geometry shaders. In the case it's used in a vertex shader, it requires ARB_shader_viewport_layer_array. This extension is available on AMD and Nvidia devices (mesa and proprietary drivers), but not available on Intel on any platform. At the moment of writing this description I don't know if this is a hardware limitation or a driver limitation. In the case that ARB_shader_viewport_layer_array is not available, writes to these registers on a vertex shader are ignored, with the appropriate logging.
* gl_device: Query SSBO alignment (ReinUsesLisp, 2019-07-06; 1 file, -0/+5)
* gl_device: Add test to detect broken component indexing (ReinUsesLisp, 2019-05-24; 1 file, -0/+6)
  Component indexing on AMD's proprietary driver is broken. This commit adds a test to detect when we are on a driver that can't successfully manage component indexing. It dispatches a dummy draw with just one vertex shader that writes to an indexed SSBO from the GPU with data sent through uniforms; it then reads that data from the CPU and compares the expected output.
* gl_shader_decompiler: Declare all possible varyings on physical attribute usage (ReinUsesLisp, 2019-05-03; 1 file, -1/+13)
* gl_shader_decompiler: Use variable AOFFI on supported hardware (ReinUsesLisp, 2019-04-14; 1 file, -2/+7)
* gl_device: Implement interface and add uniform offset alignment (ReinUsesLisp, 2019-04-10; 1 file, -0/+25)