anonymous/yuzu - yuzu is the world's most popular, open-source, Nintendo Switch emulator — started by the creators of Citra. It is written in C++ with portability in mind,

	Commit message (Collapse)	Author	Age	Files	Lines
*	externals: update dynarmic, SDL2	Liam	2022-12-04	1	-11/+11
\|
*	Merge pull request #9353 from vonchenplus/draw_indexed	liamwhite	2022-12-03	2	-27/+22
\|\ \| \| \| \|	video_core: Fine tuning the index drawing judgment logic
\| *	video_core: Fine tuning the index drawing judgment logic	Feng Chen	2022-12-01	2	-27/+22
\| \|
* \|	maxwell_3d: Mark shifted value as unsigned	Lioncash	2022-11-29	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Otherwise this is technically creating a signed int result that gets converted. Just a consistency change. While we're in the area, we can mark Samples() as const.
* \|	engines: Remove unnecessary casts	Lioncash	2022-11-29	10	-85/+57
\|/ \| \| \|	In a few cases we have some casts that can be trivially removed.
*	Merge pull request #9288 from vonchenplus/deferred_draw	liamwhite	2022-11-26	2	-61/+63
\|\ \| \| \| \|	video_core: Fine tune maxwell drawing trigger mechanism
\| *	video_core: Optimize maxwell drawing trigger mechanism	FengChen	2022-11-22	2	-61/+63
\| \|
* \|	Merge pull request #9194 from FernandoS27/yfc-fermi2d	liamwhite	2022-11-25	9	-7/+1769
\|\ \ \| \| \| \| \| \|	YFC - Fermi2D: Rework blit engine and add a software blitter.
\| * \|	Fermi2D: Cleanup and address feedback.	Fernando Sahmkow	2022-11-24	3	-8/+150
\| \| \|
\| * \|	GPU: Implement additional render target formats.	Fernando Sahmkow	2022-11-24	1	-4/+102
\| \| \|
\| * \|	MaxwellDMA: Implement BlockLinear to BlockLinear copies.	Fernando Sahmkow	2022-11-24	2	-1/+69
\| \| \|
\| * \|	Fermi2D: Implement Bilinear software filtering and address feedback.	Fernando Sahmkow	2022-11-24	5	-114/+176
\| \| \|
\| * \|	Fermi2D: Rework blit engine and add a software blitter.	Fernando Sahmkow	2022-11-24	6	-4/+1396
\| \|/
* /	GPU: Fix buffer cache issue, engine upload not inlining memory in multiline and pessismistic invalidation.	Fernando Sahmkow	2022-11-24	3	-13/+7
\|/
*	Merge pull request #9252 from liamwhite/radv-superiority	bunnei	2022-11-19	2	-6/+6
\|\ \| \| \| \|	maxwell3d: HLE multi-layer clear macro
\| *	maxwell3d: full HLE for multi-layer clears	Liam	2022-11-17	2	-7/+6
\| \|
\| *	maxwell3d: HLE multi-layer clear macro	Liam	2022-11-17	1	-0/+1
\| \|
* \|	Merge pull request #9229 from Docteh/achy_breaky_heart	Morph	2022-11-18	2	-0/+5
\|\ \ \| \|/ \|/\|	Add break for default cases
\| *	Add break for default cases	Kyle Kienapfel	2022-11-14	2	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Visual Studio has an option to search all files in a solution, so I did a search in there for "default:" looking for any missing break statements. I've left out default statements that return something, and that throw something, even if via ThrowInvalidType. UNREACHABLE leads towards throw R_THROW macro leads towards a return
* \|	Merge pull request #9226 from Kelebek1/regs_regression	bunnei	2022-11-12	2	-4/+17
\|\ \ \| \| \| \| \| \|	[video_core] Fix a couple regs regressions
\| * \|	Fix regs regression with OpenGL two-sided stencil, and re-add data invalidation reg	Kelebek1	2022-11-11	2	-4/+17
\| \|/
* \|	Merge pull request #9204 from vonchenplus/dma_copy_1d_random_crash	liamwhite	2022-11-11	1	-17/+20
\|\ \ \| \|/ \|/\|	video_core: Fix dma copy 1D random crash
\| *	video_core: Fix dma copy 1D random crash	FengChen	2022-11-10	1	-17/+20
\| \|
* \|	ir/texture_pass: Use host_info instead of querying Settings::values (#9176)	Morph	2022-11-11	1	-2/+2
\|/
*	video_core: Fix drawing trigger mechanism regression	FengChen	2022-10-31	1	-32/+25
\|
*	video_core: Fix drawing trigger mechanism regression	FengChen	2022-10-27	2	-61/+70
\|
*	Merge pull request #9112 from vonchenplus/deferred_draw	liamwhite	2022-10-25	2	-188/+143
\|\ \| \| \| \|	video_core: Reimplementing the maxwell drawing trigger mechanism
\| *	video_core: Implement maxwell inline_index method	FengChen	2022-10-22	2	-74/+99
\| \|
\| *	video_coare: Reimplementing the maxwell drawing trigger mechanism	FengChen	2022-10-21	2	-180/+110
\| \|
* \|	Merge pull request #9095 from FernandoS27/meat-good-vegetable-bad	Fernando S	2022-10-22	2	-13/+9
\|\ \ \| \|/ \|/\|	Maxwell3D/Puller: Fix regressions and syncing issues.
\| *	Maxwell3D/Puller: Fix regressions and syncing issues.	Fernando Sahmkow	2022-10-19	2	-13/+9
\| \|
* \|	video_core: implement 1D copies based on VMM 'kind'	FengChen	2022-10-17	2	-56/+73
\| \|
* \|	renderer_(opengl/vulkan): Fix tessellation clockwise parameter	Morph	2022-10-13	1	-2/+2
\| \| \| \| \| \| \| \|	This should be assigned CW only on Triangles_CW rather than not Triangles_CCW, making CCW the default winding order rather than CW.
* \|	Fix stencil func registers, make clip control equivalent to how it was before, but surely wrong.	Kelebek1	2022-10-10	2	-14/+16
\|/
*	Update 3D regs	Kelebek1	2022-10-07	2	-1178/+2997
\|
*	maxwell_dma: remove warnings from implemented functionality	Liam	2022-10-06	1	-2/+0
\|
*	General: address feedback	Fernando Sahmkow	2022-10-06	1	-1/+1
\|
*	general: Format licenses as per SPDX guidelines	Morph	2022-10-06	2	-6/+4
\|
*	NVDRV: Further improvements.	Fernando Sahmkow	2022-10-06	3	-32/+22
\|
*	DMA & InlineToMemory Engines Rework.	bunnei	2022-10-06	7	-53/+127
\|
*	Maxwell3D: Add small_index_2	Fernando Sahmkow	2022-10-06	1	-0/+2
\|
*	VideoCore: Refactor fencing system.	Fernando Sahmkow	2022-10-06	2	-16/+47
\|
*	VideoCore: Refactor syncing.	Fernando Sahmkow	2022-10-06	2	-30/+36
\|
*	VideoCore: Extra Fixes.	Fernando Sahmkow	2022-10-06	1	-1/+1
\|
*	VideoCore: implement channels on gpu caches.	Fernando Sahmkow	2022-10-06	2	-0/+476
\|
*	common: Change semantics of UNREACHABLE to unconditionally crash	Liam	2022-06-14	2	-8/+8
\|
*	Maxwell3D: Fix 3D semaphore counter type 0 handling	Billy Laws	2022-06-02	2	-3/+3
\| \| \| \|	Counter type 0 actually releases the semaphore payload rather than a constant zero as was previously thought. This is required by Skyrim.
*	Merge pull request #8313 from liamwhite/dma-bpp	Morph	2022-05-11	1	-3/+6
\|\ \| \| \| \|	maxwell_dma: fix bytes_per_pixel
\| *	maxwell_dma: use fallback if remapping is enabled	Liam	2022-05-11	1	-3/+6
\| \|
\| *	maxwell_dma: fix bytes per pixel	Liam	2022-05-07	1	-3/+3
\| \|
* \|	video_core/macro: clear code on upload address assignment	Liam	2022-05-10	1	-0/+2
\|/
*	general: Convert source file copyright comments over to SPDX	Morph	2022-04-23	14	-42/+28
\| \| \| \| \|	This formats all copyright comments according to SPDX formatting guidelines. Additionally, this resolves the remaining GPLv2 only licensed files by relicensing them to GPLv2.0-or-later.
*	maxwell3d: add small_index_2 register	Liam	2022-04-14	2	-1/+11
\|
*	video_core: Reduce unused includes	ameerj	2022-03-19	7	-10/+0
\|
*	Merge pull request #8023 from ameerj/kirby-pop-in	Fernando S	2022-03-16	2	-70/+12
\|\ \| \| \| \|	maxwell_3d: Implement a safer CB data upload
\| *	maxwell_3d: Implement a safer CB data upload	ameerj	2022-03-15	2	-70/+12
\| \| \| \| \| \| \| \|	This makes constant buffer uploads safer and more accurate by updating the GPU memory as soon as the CB Data method is invoked. The previous implementation was deferring the updates until a different maxwell 3d method was detected, then writing all CB data at once.
* \|	Maxwell3D: Link to override constant definition in nouveau	byte[]	2022-03-14	1	-0/+2
\| \|
* \|	Maxwell3D: restore original topology when topology overrides are disabled	byte[]	2022-03-14	1	-0/+2
\| \|
* \|	Maxwell3D: Use override constants from nouveau	Liam	2022-03-14	2	-2/+37
\| \| \| \| \| \| \| \|	This fixes some incorrect rendering in Sunshine
* \|	Maxwell3D: Restrict topology override effect to after the register is set	Liam	2022-03-12	2	-1/+5
\| \|
* \|	Maxwell3D: mark index buffers as dirty after updating counts	Liam	2022-03-11	1	-0/+2
\| \|
* \|	Maxwell3D: read small-index draw and primitive topology override registers	Liam	2022-03-11	2	-2/+30
\|/ \| \| \|	This allows Galaxy and Sunshine to render for the first time.
*	MaxwellDMA: Implement semaphore operations	Lody	2022-03-07	2	-1/+21
\|
*	Rasterizer: Refactor inlineToMemory.	Fernando Sahmkow	2022-02-01	2	-2/+2
\|
*	Rasterizer: Implement Inline2Memory Acceleration.	Fernando Sahmkow	2022-01-29	7	-3/+29
\|
*	Inline2Memory: Flush before writting buffer.	Fernando Sahmkow	2022-01-29	1	-0/+1
\|
*	video_core/macro: Remove unused parameter from Execute()	Lioncash	2022-01-25	1	-1/+1
\| \| \| \|	Simplifies the function interface.
*	common/logging: Move Log::Entry declaration to a separate header	ameerj	2021-10-02	1	-0/+1
\| \| \| \|	This reduces the load of requiring to include std::chrono in all files which include log.h
*	maxwell_dma: Minor refactoring	ameerj	2021-09-20	2	-33/+33
\|
*	Fix blend equation enum error	Feng Chen	2021-09-07	1	-4/+4
\|
*	video_core/engine: Consistently initialize rasterizer pointers	Lioncash	2021-07-27	2	-2/+2
\| \| \| \| \|	Ensures all of the engines have consistent and deterministic initialization of the rasterizer pointers.
*	vk_rasterizer: Workaround bug in VK_EXT_vertex_input_dynamic_state	ReinUsesLisp	2021-07-23	1	-4/+0
\| \| \| \| \|	Workaround potential bug on Nvidia's driver where only updating high attributes leaves low attributes out dated.
*	shader: Rework varyings and implement passthrough geometry shaders	ReinUsesLisp	2021-07-23	1	-1/+6
\| \| \| \| \| \|	Put all varyings into a single std::bitset with helpers to access it. Implement passthrough geometry shaders using host's.
*	vk_graphics_pipeline: Implement conservative rendering	ReinUsesLisp	2021-07-23	1	-1/+6
\|
*	shader: Unify shader stage types	ReinUsesLisp	2021-07-23	4	-24/+0
\|
*	vulkan: Use VK_EXT_provoking_vertex when available	ReinUsesLisp	2021-07-23	1	-1/+6
\|
*	DMA: Restrict optimised path for BlockToLinear further.	FernandoS27	2021-07-23	1	-1/+2
\|
*	shader: Primitive Vulkan integration	ReinUsesLisp	2021-07-23	3	-2457/+0
\|
*	shader: Remove old shader management	ReinUsesLisp	2021-07-23	5	-222/+3
\|
*	Buffer cache: Fixes, Clang and Feedback.	Fernando Sahmkow	2021-07-15	1	-0/+5
\|
*	DMAEngine: Revert flushing from Pitch to BlpockLinear.	Fernando Sahmkow	2021-07-14	1	-2/+7
\|
*	DMAEngine: Accelerate BufferClear	Fernando Sahmkow	2021-07-13	2	-2/+6
\|
*	accelerateDMA: Accelerate Buffer Copies.	Fernando Sahmkow	2021-07-11	2	-10/+43
\|
*	Out of bound blit (#6531)	Feng Chen	2021-07-08	1	-2/+20
\| \| \| \| \| \| \| \| \|	* Fix out of bound blit error * Fix code read * Fix ci error Co-authored-by: Feng Chen <chen.feng@gloritysolutions.com>
*	maxwell3d: Add missing return in default SizeInBytes() case	Lioncash	2021-06-23	1	-0/+1
\| \| \| \| \|	We were returning '1' in ComponentCount()'s default case but were neglecting to do the same with SizeInBytes().
*	buffer_cache: Simplify uniform disabling logic	ameerj	2021-06-01	1	-2/+6
\|
*	Merge pull request #6196 from bunnei/asserts-setting	bunnei	2021-04-15	1	-1/+1
\|\ \| \| \| \|	core: settings: Add setting for debug assertions and disable by default.
\| *	common: Move settings to common from core.	bunnei	2021-04-15	1	-1/+1
\| \| \| \| \| \| \| \|	- Removes a dependency on core and input_common from common.
* \|	engine_interface: Add missing virtual destructor	Lioncash	2021-04-12	4	-4/+5
\|/ \| \| \| \| \|	Eliminates a potential bug vector related to inheritance. Plus, we should generally be specifying the destructor as virtual within purely virtual interfaces to begin with.
*	video_core: Reimplement the buffer cache	ReinUsesLisp	2021-02-13	5	-23/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Reimplement the buffer cache using cached bindings and page level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache. - Bindings are cached, allowing to skip work when the game changes few bits between draws. - OpenGL Assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers, instead GL_EXT_memory_object is used to alias sub-buffers within the same allocation. - OpenGL Assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData. - A new OpenGL stream buffer is implemented based on fences for drivers that are not Nvidia's proprietary, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (that some games use a lot). - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecesarry work. This commit adds the necessary infrastructure to use Vulkan object from OpenGL. Overall, it improves performance and fixes some bugs present on the old cache. There are still some edge cases hit by some games that harm performance on some vendors, this are planned to be fixed in later commits.
*	gpu: Report renderer errors with exceptions	ReinUsesLisp	2021-02-13	6	-9/+9
\| \| \| \| \| \|	Instead of using a two step initialization to report errors, initialize the GPU renderer and rasterizer on the constructor and report errors through std::runtime_error.
*	maxwell_3d: Silence array bounds warnings	ReinUsesLisp	2021-01-24	2	-35/+35
\|
*	common/common_funcs: Rename INSERT_UNION_PADDING_{BYTES,WORDS} to _NOINIT	ReinUsesLisp	2021-01-15	5	-119/+119
\| \| \| \|	INSERT_PADDING_BYTES_NOINIT is more descriptive of the underlying behavior.
*	video_core: Rewrite the texture cache	ReinUsesLisp	2020-12-30	7	-249/+377
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage.The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage. This commit aims to address those issues.
*	Merge pull request #5157 from lioncash/array-dirty	bunnei	2020-12-15	1	-34/+33
\|\ \| \| \| \|	maxwell_3d: Remove unused dirty_pointer array
\| *	maxwell_3d: Move member variables to end of class	Lioncash	2020-12-07	1	-31/+32
\| \| \| \| \| \| \| \|	Follows our established coding style.
\| *	maxwell_3d: Resolve -Wdocumentation warning	Lioncash	2020-12-07	1	-1/+1
\| \| \| \| \| \| \| \|	Removes a documentation comment for a non-existent member.
\| *	maxwell_3d: Remove unused dirty_pointer array	Lioncash	2020-12-07	1	-2/+0
\| \| \| \| \| \| \| \|	This is unused and removing it shrinks the structure by 3584 bytes.
* \|	video_core: Remove unnecessary enum class casting in logging messages	Lioncash	2020-12-07	3	-12/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	fmt now automatically prints the numeric value of an enum class member by default, so we don't need to use casts any more. Reduces the line noise a bit.
* \|	maxwell_dma: Rename RenderEnable::Mode::FALSE and TRUE to avoid name conflict	comex	2020-12-05	1	-5/+7
\|/ \| \| \| \| \| \|	On Apple platforms, FALSE and TRUE are defined as macros by <mach/boolean.h>, which is included by various system headers. Note that there appear to be no actual users of the names to fix up.
*	video_core: Resolve more variable shadowing scenarios	Lioncash	2020-12-04	6	-13/+15
\| \| \| \| \| \|	Resolves variable shadowing scenarios up to the end of the OpenGL code to make it nicer to review. The rest will be resolved in a following commit.
*	vk_shader_decompiler: Implement force early fragment tests	ReinUsesLisp	2020-11-26	1	-1/+6
\| \| \| \| \| \| \| \|	Force early fragment tests when the 3D method is enabled. The established pipeline cache takes care of recompiling if needed. This is implemented only on Vulkan to avoid invalidating the shader cache on OpenGL.
*	Merge pull request #4953 from lioncash/shader-shadow	bunnei	2020-11-21	1	-88/+96
\|\ \| \| \| \|	shader_bytecode: Eliminate variable shadowing
\| *	shader_bytecode: Make use of [[nodiscard]] where applicable	Lioncash	2020-11-20	1	-73/+79
\| \| \| \| \| \| \| \|	Ensures that all queried values are made use of.
\| *	shader_bytecode: Eliminate variable shadowing	Lioncash	2020-11-20	1	-15/+17
\| \|
* \|	maxwell_3d: Use insert instead of loop push_back	ReinUsesLisp	2020-11-11	1	-3/+1
\| \| \| \| \| \| \| \| \| \| \| \|	This reduces the overhead of bounds checking on each element. It won't reduce the cost of allocation because usually this vector's capacity is usually large enough to hold whatever we push to it.
* \|	maxwell_3d: Move code to separate functions	ReinUsesLisp	2020-11-11	2	-151/+124
\|/ \| \| \| \|	Deduplicate some code and put it in separate functions so it's easier to understand and profile.
*	shader/arithmetic: Implement FCMP immediate + register variant	ReinUsesLisp	2020-10-28	1	-0/+2
\| \| \| \|	Trivially add the encoding for this.
*	video_core: Enforce -Wclass-memaccess	ReinUsesLisp	2020-10-09	1	-7/+6
\|
*	video_core: Enforce -Wunused-variable and -Wunused-but-set-variable	ReinUsesLisp	2020-10-03	1	-2/+0
\|
*	General: Make use of std::nullopt where applicable	Lioncash	2020-09-22	1	-1/+1
\| \| \| \| \| \| \| \|	Allows some implementations to avoid completely zeroing out the internal buffer of the optional, and instead only set the validity byte within the structure. This also makes it consistent how we return empty optionals.
*	fermi_2d: Make use of designated initializers	Lioncash	2020-09-18	2	-8/+8
\| \| \| \| \|	Same behavior, less repetition. We can also ensure all members of Config are initialized.
*	video_core: Initialize renderer with a GPU	ReinUsesLisp	2020-08-22	6	-45/+63
\| \| \| \| \|	Add an extra step in GPU initialization to be able to initialize render backends with a valid GPU instance.
*	Merge pull request #4519 from lioncash/semi	bunnei	2020-08-16	1	-1/+1
\|\ \| \| \| \|	maxwell_3d: Resolve -Wextra-semi warning
\| *	maxwell_3d: Resolve -Wextra-semi warning	Lioncash	2020-08-14	1	-1/+1
\| \| \| \| \| \| \| \|	Semicolons after a function definition aren't necessary.
* \|	textures/decoders: Fix block linear to pitch copies	ReinUsesLisp	2020-08-11	1	-13/+8
\|/ \| \| \| \| \| \| \| \| \| \| \|	There were two issues with block linear copies. First the swizzling was wrong and this commit reimplements them. The other issue was that these copies are generally used to download render targets from the GPU and yuzu was not downloading them from host GPU memory unless the extreme GPU accuracy setting was selected. This commit enables cached memory reads for all accuracy levels. - Fixes level thumbnails in Super Mario Maker 2.
*	video_core/textures: Add and use SwizzleSliceToVoxel, and minor style changes	ReinUsesLisp	2020-07-10	1	-13/+17
\| \| \| \| \| \| \|	Change GOB sizes from free-functions to constexpr constants. Add SwizzleSliceToVoxel, a function that swizzles a 2D array of pixels into a 3D texture and use it for 3D copies.
*	maxwell_dma: Rename registers to match official docs and reorder	ReinUsesLisp	2020-07-08	2	-287/+355
\| \| \| \| \| \| \| \| \| \|	Rename registers in the MaxwellDMA class to match Nvidia's official documentation. This one can be found here: https://github.com/NVIDIA/open-gpu-doc/blob/master/classes/dma-copy/clb0b5.h While we are at it, reorganize the code in MaxwellDMA to be separated in different functions.
*	Merge pull request #4147 from ReinUsesLisp/hset2-imm	bunnei	2020-06-27	1	-0/+8
\|\ \| \| \| \|	shader/half_set: Implement HSET2_IMM
\| *	shader/half_set: Implement HSET2_IMM	ReinUsesLisp	2020-06-23	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \|	Add HSET2_IMM. Due to the complexity of the encoding avoid using BitField unions and read the relevant bits from the code itself. This is less error prone.
* \|	Addressed issues	David Marcec	2020-06-24	1	-0/+4
\| \|
* \|	Macro HLE support	David Marcec	2020-06-24	2	-1/+5
\|/
*	Merge pull request #4049 from ReinUsesLisp/separate-samplers	bunnei	2020-06-13	5	-2/+13
\|\ \| \| \| \|	shader/texture: Join separate image and sampler pairs offline
\| *	shader/texture: Join separate image and sampler pairs offline	ReinUsesLisp	2020-06-05	5	-2/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Games using D3D idioms can join images and samplers when a shader executes, instead of baking them into a combined sampler image. This is also possible on Vulkan. One approach to this solution would be to use separate samplers on Vulkan and leave this unimplemented on OpenGL, but we can't do this because there's no consistent way of determining which constant buffer holds a sampler and which one an image. We could in theory find the first bit and if it's in the TIC area, it's an image; but this falls apart when an image or sampler handle use an index of zero. The used approach is to track for a LOP.OR operation (this is done at an IR level, not at an ISA level), track again the constant buffers used as source and store this pair. Then, outside of shader execution, join the sample and image pair with a bitwise or operation. This approach won't work on games that truly use separate samplers in a meaningful way. For example, pooling textures in a 2D array and determining at runtime what sampler to use. This invalidates OpenGL's disk shader cache :) - Used mostly by D3D ports to Switch
* \|	texture_cache: Implement rendering to 3D textures	ReinUsesLisp	2020-06-08	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This allows rendering to 3D textures with more than one slice. Applications are allowed to render to more than one slice of a texture using gl_Layer from a VTG shader. This also requires reworking how 3D texture collisions are handled, for now, this commit allows rendering to slices but not to miplevels. When a render target attempts to write to a mipmap, we fallback to the previous implementation (copying or flushing as needed). - Fixes color correction 3D textures on UE4 games (rainbow effects). - Allows Xenoblade games to render to 3D textures directly.
* \|	Merge pull request #4009 from ogniK5377/macro-jit-prod	bunnei	2020-06-04	2	-25/+10
\|\ \ \| \|/ \|/\|	video_core: Implement Macro JIT
\| *	Default init labels and use initializer list for macro engine	David Marcec	2020-06-04	1	-1/+1
\| \|
\| *	Mark parameters as const	David Marcec	2020-06-03	2	-3/+2
\| \|
\| *	Pass by reference instead of copying parameters	David Marcec	2020-06-02	2	-5/+7
\| \|
\| *	Implement macro JIT	David Marcec	2020-05-30	2	-27/+11
\| \|
* \|	Merge pull request #3998 from ReinUsesLisp/init-3d	bunnei	2020-06-01	1	-0/+4
\|\ \ \| \|/ \|/\|	maxwell_3d: Initialize more registers to their expected value
\| *	maxwell_3d: Initialize line widths	ReinUsesLisp	2020-05-27	1	-0/+2
\| \| \| \| \| \| \| \|	Initialize line widths to avoid setting a line width of zero.
\| *	maxwell_3d: Initialize polygon modes	ReinUsesLisp	2020-05-27	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	NVN expects this to be initialized as Fill, otherwise games that never bind a rasterizer state will log an invalid polygon mode.
* \|	maxwell_3d: Reduce severity of logs that can be spammed	ReinUsesLisp	2020-05-28	1	-6/+7
\|/ \| \| \| \|	These logs were killing performance on some games when they were spammed. Reduce them to Debug severity.
*	Merge pull request #3899 from ReinUsesLisp/float-comparisons	bunnei	2020-05-13	1	-12/+16
\|\ \| \| \| \|	shader_ir: Add separate instructions for ordered and unordered comparisons and fix NE on GLSL
\| *	shader_ir: Separate float-point comparisons in ordered and unordered	ReinUsesLisp	2020-05-09	1	-12/+16
\| \| \| \| \| \| \| \| \| \|	This allows us to use native SPIR-V instructions without having to manually check for NAN.
* \|	Merge pull request #3885 from ReinUsesLisp/viewport-swizzles	bunnei	2020-05-08	2	-1/+25
\|\ \ \| \|/ \|/\|	video_core: Implement viewport swizzles with NV_viewport_swizzle
\| *	vk_graphics_pipeline: Implement viewport swizzles with NV_viewport_swizzle	ReinUsesLisp	2020-05-04	1	-0/+1
\| \|
\| *	maxwell_3d: Add viewport swizzles	ReinUsesLisp	2020-05-04	2	-1/+24
\| \|
* \|	Merge pull request #3815 from FernandoS27/command-list-2	bunnei	2020-05-05	11	-56/+122
\|\ \ \| \|/ \|/\|	GPU: More optimizations to GPU Command List Processing and DMA Copy Optimizations
\| *	Clang Format and Documentation.	Fernando Sahmkow	2020-04-28	7	-8/+14
\| \|
\| *	MaxwellDMA: Optimize micro copies.	Fernando Sahmkow	2020-04-28	1	-0/+40
\| \|
\| *	VideoCore/Engines: Refactor Engines CallMethod.	Fernando Sahmkow	2020-04-28	11	-56/+76
\| \|
* \|	Merge pull request #3808 from ReinUsesLisp/wait-for-idle	bunnei	2020-05-03	2	-1/+8
\|\ \ \| \| \| \| \| \|	{maxwell_3d,buffer_cache}: Implement memory barriers using 3D registers
\| * \|	{maxwell_3d,buffer_cache}: Implement memory barriers using 3D registers	ReinUsesLisp	2020-04-28	2	-1/+8
\| \|/ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Drop MemoryBarrier from the buffer cache and use Maxwell3D's register WaitForIdle. To implement this on OpenGL we just call glMemoryBarrier with the necessary bits. Vulkan lacks this synchronization primitive, so we set an event and immediately wait for it. This is not a pretty solution, but it's what Vulkan can do without submitting the current command buffer to the queue (which ends up being more expensive on the CPU).
* \|	Merge pull request #3807 from ReinUsesLisp/fix-depth-clamp	bunnei	2020-04-30	1	-0/+1
\|\ \ \| \| \| \| \| \|	maxwell_3d: Fix depth clamping register
\| * \|	maxwell_3d: Fix depth clamping register	ReinUsesLisp	2020-04-28	1	-0/+1
\| \|/ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Using deko3d as reference: https://github.com/devkitPro/deko3d/blob/4e47ba0013552e592a86ab7a2510d1e7dadf236a/source/maxwell/gpu_3d_state.cpp#L42 We were using bits 3 and 4 to determine depth clamping, but these are the same both enabled and disabled: state->depthClampEnable ? 0x101A : 0x181D The same happens on Nvidia's OpenGL driver, where they do something like this (default capabilities, GL 4.5 compatibility): (state & DEPTH_CLAMP) != 0 ? 0x201a : 0x281c There's always a difference between the first bits in this register, but bit 11 is consistently disabled on both deko3d/NVN and OpenGL. This commit changes yuzu's behaviour to use bit 11 to determine depth clamping. - Fixes depth issues on Super Mario Odyssey's intro.
* \|	Merge pull request #3799 from ReinUsesLisp/iadd-cc	bunnei	2020-04-30	1	-0/+4
\|\ \ \| \|/ \|/\|	shader: Implement P2R CC, IADD Rd.CC and IADD.X
\| *	shader/arithmetic_integer: Implement IADD.X	ReinUsesLisp	2020-04-26	1	-0/+4
\| \| \| \| \| \| \| \| \| \|	IADD.X takes the carry flag and adds it to the result. This is generally used to emulate 64-bit operations with 32-bit registers.
* \|	Merge pull request #3742 from FernandoS27/command-list	bunnei	2020-04-27	10	-0/+117
\|\ \ \| \| \| \| \| \|	Optimize GPU Command Lists and Introduce Fast GPU Time Option
\| * \|	Clang Format.	Fernando Sahmkow	2020-04-23	3	-3/+6
\| \| \|
\| * \|	Maxwell3D: Process Macros on MultiMethod.	Fernando Sahmkow	2020-04-23	1	-25/+47
\| \| \|
\| * \|	DMAPusher: Propagate multimethod writes into the engines.	Fernando Sahmkow	2020-04-23	10	-0/+92
\| \| \|
* \| \|	Merge pull request #3753 from ReinUsesLisp/ac-vulkan	Rodrigo Locatti	2020-04-26	1	-1/+2
\|\ \ \ \| \|_\|/ \|/\| \|	{gl,vk}_rasterizer: Add lazy default buffer maker and use it for empty buffers
\| * \|	gl_rasterizer: Fix buffers without size	ReinUsesLisp	2020-04-22	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	On NVN buffers can be enabled but have no size. According to deko3d and the behavior we see in Animal Crossing: New Horizons these buffers get the special address of 0x1000 and limit themselves to 0xfff. Implement buffers without a size by binding a null buffer to OpenGL without a side. https://github.com/devkitPro/deko3d/blob/1d1930beea093b5a663419e93b0649719a3ca5da/source/maxwell/gpu_3d_vbo.cpp#L62-L63
* \| \|	Merge pull request #3734 from ReinUsesLisp/half-float-mods	bunnei	2020-04-25	1	-2/+0
\|\ \ \ \| \| \| \| \| \| \| \|	decode/arithmetic_half: Fix HADD2 and HMUL2 absolute and negation bits
\| * \| \|	decode/arithmetic_half: Fix HADD2 and HMUL2 absolute and negation bits	ReinUsesLisp	2020-04-23	1	-2/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The encoding for negation and absolute value was wrong. Extracting is now done manually. Similar instructions having different encodings is the rule, not the exception. To keep sanity and readability I preferred to extract the desired bit manually. This is implemented against nxas: https://github.com/ReinUsesLisp/nxas/blob/8dbc38995711cc12206aa370145a3a02665fd989/table.h#L68 That is itself tested against nvdisasm (Nvidia's official disassembler).
* \| \| \|	Fix -Wdeprecated-copy warning.	Markus Wick	2020-04-24	1	-0/+1
\| \|_\|/ \|/\| \|
* \| \|	Merge pull request #3697 from lioncash/declarations	bunnei	2020-04-23	1	-1/+1
\|\ \ \ \| \| \| \| \| \| \| \|	CMakeLists: Enable -Wmissing-declarations on Linux builds
\| * \| \|	General: Resolve warnings related to missing declarations	Lioncash	2020-04-17	1	-1/+1
\| \|/ /
* \| \|	MaxwellDMA: Correct copying on accuracy level.	Fernando Sahmkow	2020-04-22	1	-2/+7
\| \| \|
* \| \|	FenceManager: Manage syncpoints and rename fences to semaphores.	Fernando Sahmkow	2020-04-22	1	-2/+2
\| \| \|
* \| \|	Rasterizer: Document SignalFence & ReleaseFences and setup skeletons on Vulkan.	Fernando Sahmkow	2020-04-22	1	-1/+0
\| \| \|
* \| \|	GPU: Fix rebase errors.	Fernando Sahmkow	2020-04-22	1	-4/+3
\| \| \|
* \| \|	OpenGL: Implement Fencing backend.	Fernando Sahmkow	2020-04-22	2	-15/+5
\| \| \|
* \| \|	GPU: Delay Fences.	Fernando Sahmkow	2020-04-22	2	-1/+13
\| \| \|
* \| \|	GPU: Refactor synchronization on Async GPU	Fernando Sahmkow	2020-04-22	1	-2/+6
\| \| \|
* \| \|	UI: Replasce accurate GPU option for GPU Accuracy Level	Fernando Sahmkow	2020-04-22	1	-1/+1
\| \|/ \|/\|
* \|	Merge pull request #3718 from ReinUsesLisp/better-pipeline-state	Rodrigo Locatti	2020-04-21	1	-1/+1
\|\ \ \| \| \| \| \| \|	fixed_pipeline_state: Pack structure, use memcmp and CityHash on it
\| * \|	fixed_pipeline_state: Pack attribute state	ReinUsesLisp	2020-04-19	1	-1/+1
\| \|/ \| \| \| \| \| \|	Reduce FixedPipelineState's size from 1384 to 664 bytes
* \|	Merge pull request #3695 from ReinUsesLisp/default-attributes	bunnei	2020-04-21	1	-0/+4
\|\ \ \| \|/ \|/\|	maxwell_3d: Initialize format attributes constant as one
\| *	maxwell_3d: Initialize format attributes constant as one	ReinUsesLisp	2020-04-17	1	-0/+4
\| \| \| \| \| \| \| \|	nouveau expects this to be true but it doesn't set it.
* \|	CMakeLists: Specify -Wextra on linux builds	Lioncash	2020-04-16	1	-1/+1
\|/ \| \| \| \| \| \| \| \| \| \|	Allows reporting more cases where logic errors may exist, such as implicit fallthrough cases, etc. We currently ignore unused parameters, since we currently have many cases where this is intentional (virtual interfaces). While we're at it, we can also tidy up any existing code that causes warnings. This also uncovered a few bugs as well.
*	Merge pull request #3612 from ReinUsesLisp/red	Fernando Sahmkow	2020-04-15	1	-0/+8
\|\ \| \| \| \|	shader/memory: Implement RED.E.ADD and minor changes to ATOM
\| *	shader/memory: Implement RED.E.ADD	ReinUsesLisp	2020-04-06	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Implements a reduction operation. It's an atomic operation that doesn't return a value. This commit introduces another primitive because some shading languages might have a primitive for reduction operations.
* \|	Merge pull request #3662 from ReinUsesLisp/constant-attrs	Mat M	2020-04-15	1	-0/+4
\|\ \ \| \| \| \| \| \|	gl_rasterizer: Implement constant vertex attributes
\| * \|	gl_rasterizer: Implement constant vertex attributes	ReinUsesLisp	2020-04-14	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Credits go to gdkchan from Ryujinx for finding constant attributes are used in retail games.
* \| \|	shader/arithmetic: Add FCMP_CR variant	ReinUsesLisp	2020-04-15	1	-2/+4
\|/ / \| \| \| \| \| \|	Adds another variant of FCMP.
* \|	gl_rasterizer: Implement line widths and smooth lines	ReinUsesLisp	2020-04-13	1	-2/+8
\| \| \| \| \| \| \| \| \| \|	Implements "legacy" features from OpenGL present on hardware such as smooth lines and line width.
* \|	Merge pull request #3578 from ReinUsesLisp/vmnmx	Fernando Sahmkow	2020-04-12	1	-1/+56
\|\ \ \| \| \| \| \| \|	shader/video: Partially implement VMNMX
\| * \|	shader/video: Partially implement VMNMX	ReinUsesLisp	2020-04-12	1	-0/+55
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Implements the common usages for VMNMX. Inputs with a different size than 32 bits are not supported and sign mismatches aren't supported either. VMNMX works as follows: It grabs Ra and Rb and applies a maximum/minimum on them (this is defined by .MX), having in mind the input sign. This result can then be saturated. After the intermediate result is calculated, it applies another operation on it using Rc. These operations are merges, accumulations or another min/max pass. This instruction allows to implement with a more flexible approach GCN's min3 and max3 instructions (for instance).
\| * \|	shader_bytecode: Fix I2I_IMM encoding	ReinUsesLisp	2020-03-28	1	-1/+1
\| \| \|
* \| \|	video_core: Add MSAA registers in 3D engine and TIC	ReinUsesLisp	2020-04-12	1	-6/+61
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds the registers used for multisampling. It doesn't implement anything for now.
* \| \|	Merge pull request #3601 from ReinUsesLisp/some-shader-encodings	bunnei	2020-04-09	1	-2/+6
\|\ \ \ \| \| \| \| \| \| \| \|	video_core/shader: Add some instruction and S2R encodings
\| * \| \|	shader_bytecode: Rename MOV_SYS to S2R	ReinUsesLisp	2020-04-04	1	-2/+2
\| \| \| \|
\| * \| \|	shader_bytecode: Add encoding for BAR	ReinUsesLisp	2020-04-04	1	-0/+2
\| \| \| \|
\| * \| \|	shader_bytecode: Add encoding for VOTE.VTG	ReinUsesLisp	2020-04-04	1	-0/+2
\| \| \|/ \| \|/\|
* / \|	shader_decompiler: Remove FragCoord.w hack and change IPA implementation	ReinUsesLisp	2020-04-02	1	-25/+30
\|/ / \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Credits go to gdkchan and Ryujinx. The pull request used for this can be found here: https://github.com/Ryujinx/Ryujinx/pull/1082 yuzu was already using the header for interpolation, but it was missing the FragCoord.w multiplication described in the linked pull request. This commit finally removes the FragCoord.w == 1.0f hack from the shader decompiler. While we are at it, this commit renames some enumerations to match Nvidia's documentation (linked below) and fixes component declaration order in the shader program header (z and w were swapped). https://github.com/NVIDIA/open-gpu-doc/blob/master/Shader-Program-Header/Shader-Program-Header.html
* /	shader_decode: merge GlobalAtomicOp to AtomicOp	namkazy	2020-03-30	1	-13/+1
\|/
*	engines/const_buffer_engine_interface: Store image format type	ReinUsesLisp	2020-03-27	1	-4/+10
\| \| \| \| \| \| \|	This information is required to properly implement SULD.B. It might also be handy for all image operations, since it would allow us to implement them on devices that require the image format to be specified (on desktop, this would be AMD on OpenGL and Intel on OpenGL and Vulkan).
*	Merge pull request #3520 from ReinUsesLisp/legacy-varyings	bunnei	2020-03-26	1	-0/+6
\|\ \| \| \| \|	gl_shader_decompiler: Implement legacy varyings
\| *	shader/shader_ir: Track usage in input attribute and of legacy varyings	ReinUsesLisp	2020-03-16	1	-0/+6
\| \|
* \|	apply replay logic to all writes. remove replay from MacroInterpreter::Send (@fincs)	namkazy	2020-03-22	1	-6/+9
\| \|
* \|	maxwell_3d: change declaration order	namkazy	2020-03-22	1	-1/+3
\| \|
* \|	maxwell_3d: init shadow_state	namkazy	2020-03-22	1	-0/+2
\| \|
* \|	maxwell_3d: this seem more correct.	namkazy	2020-03-22	1	-2/+2
\| \|
* \|	maxwell_3d: update comments for shadow ram usage	namkazy	2020-03-22	2	-1/+5
\| \|
* \|	maxwell_3d: track shadow ram ctrl and hw reg value	Nguyen Dac Nam	2020-03-22	1	-0/+10
\| \|
* \|	maxwell_3d: implement MME shadow RAM	Nguyen Dac Nam	2020-03-22	1	-1/+14
\| \|
* \|	kepler_compute: Remove unused variables	ReinUsesLisp	2020-03-19	1	-8/+0
\| \|
* \|	Merge pull request #3502 from namkazt/patch-3	Rodrigo Locatti	2020-03-16	1	-8/+3
\|\ \ \| \|/ \|/\|	shader_decode: Reimplement BFE instructions
\| *	shader_bytecode: update BFE instructions struct.	Nguyen Dac Nam	2020-03-13	1	-8/+3
\| \|
* \|	maxwell_3d: Add padding words to XFB entries	ReinUsesLisp	2020-03-13	1	-2/+4
\| \| \| \| \| \| \| \| \| \|	Use INSERT_UNION_PADDING_WORDS instead of alignas to ensure a size requirement.
* \|	gl_rasterizer: Implement transform feedback bindings	ReinUsesLisp	2020-03-13	1	-0/+9
\| \|
* \|	Merge branch 'master' into shader-purge	Rodrigo Locatti	2020-03-13	1	-2/+20
\|\ \
\| * \|	gl_rasterizer: Implement polygon modes and fill rectangles	ReinUsesLisp	2020-03-10	1	-2/+20
\| \|/
* \|	engines/maxwell_3d: Add TFB registers and store them in shader registry	ReinUsesLisp	2020-03-09	1	-2/+32
\| \|
* \|	const_buffer_engine_interface: Store component types	ReinUsesLisp	2020-03-09	3	-45/+26
\|/ \| \| \| \|	This is required for Vulkan. Sampling integer textures with float handles is illegal.
*	state_tracker: Remove type traits with named structures	ReinUsesLisp	2020-02-28	1	-4/+8
\|
*	maxwell_3d: Use two tables instead of three for dirty flags	ReinUsesLisp	2020-02-28	1	-1/+1
\|
*	maxwell_3d: Change write dirty flags to a bitset	ReinUsesLisp	2020-02-28	1	-4/+2
\|
*	maxwell_3d: Flatten cull and front face registers	ReinUsesLisp	2020-02-28	2	-19/+17
\|
*	video_core: Reintroduce dirty flags infrastructure	ReinUsesLisp	2020-02-28	5	-1/+36
\|
*	gl_state: Remove clip distances tracking	ReinUsesLisp	2020-02-28	1	-10/+1
\|
*	gl_state: Remove viewport and depth range tracking	ReinUsesLisp	2020-02-28	1	-9/+9
\|
*	gl_rasterizer: Remove dirty flags	ReinUsesLisp	2020-02-28	5	-264/+1
\|
*	Merge pull request #3425 from ReinUsesLisp/layered-framebuffer	bunnei	2020-02-24	1	-2/+7
\|\ \| \| \| \|	texture_cache: Implement layered framebuffer attachments
\| *	texture_cache: Implement layered framebuffer attachments	ReinUsesLisp	2020-02-16	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \|	Layered framebuffer attachments is a feature that allows applications to write attach layered textures to a single attachment. What layer the fragments are written to is decided from the shader using gl_Layer.
* \|	Merge pull request #3414 from ReinUsesLisp/maxwell-3d-draw	bunnei	2020-02-19	1	-2/+2
\|\ \ \| \| \| \| \| \|	maxwell_3d: Unify draw methods
\| * \|	maxwell_3d: Unify draw methods	ReinUsesLisp	2020-02-14	1	-2/+2
\| \|/ \| \| \| \| \| \| \| \|	Pass instanced state of a draw invocation as an argument instead of having two separate virtual methods.
* \|	Merge pull request #3409 from ReinUsesLisp/host-queries	Fernando Sahmkow	2020-02-18	2	-31/+88
\|\ \ \| \|/ \|/\|	query_cache: Implement a query cache and query 21 (samples passed)
\| *	gl_query_cache: Optimize query cache	ReinUsesLisp	2020-02-14	1	-3/+8
\| \| \| \| \| \| \| \|	Use a custom cache instead of relying on a ranged cache.
\| *	gl_query_cache: Implement host queries using a deferred cache	ReinUsesLisp	2020-02-14	2	-18/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Instead of waiting immediately for executed commands, defer the query until the guest CPU reads it. This way we get closer to what the guest program is doing. To archive this we have to build a dependency queue, because host APIs (like OpenGL and Vulkan) use ranged queries instead of counters like NVN. Waiting for queries implicitly uses fences and this requires a command being queued, otherwise the driver will lock waiting until a timeout. To fix this when there are no commands queued, we explicitly call glFlush.
\| *	maxwell_3d: Slow implementation of passed samples (query 21)	ReinUsesLisp	2020-02-14	2	-17/+60
\| \| \| \| \| \| \| \|	Implements GL_SAMPLES_PASSED by waiting immediately for queries.
* \|	Merge pull request #3379 from ReinUsesLisp/cbuf-offset	bunnei	2020-02-14	1	-2/+2
\|\ \ \| \|/ \|/\|	shader/decode: Fix constant buffer offsets
\| *	shader/decode: Fix constant buffer offsets	ReinUsesLisp	2020-02-05	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Some instances were using cbuf34.offset instead of cbuf34.GetOffset(). This returned the an invalid offset. Address those instances and rename offset to "shifted_offset" to avoid future bugs.
* \|	Merge pull request #3395 from FernandoS27/queries	bunnei	2020-02-14	2	-51/+56
\|\ \ \| \| \| \| \| \|	GPU: Refactor queries implementation and correct GPU Clock.
\| * \|	GPU: Address Feedback.	Fernando Sahmkow	2020-02-13	1	-5/+2
\| \| \|
\| * \|	GPU: Implement GPU Clock correctly.	Fernando Sahmkow	2020-02-10	1	-1/+2
\| \| \|
\| * \|	Maxwell3D: Correct query reporting.	Fernando Sahmkow	2020-02-10	2	-51/+58
\| \| \|
* \| \|	Merge pull request #3376 from ReinUsesLisp/point-sprite	bunnei	2020-02-11	1	-1/+6
\|\ \ \ \| \|/ / \|/\| \|	gl_rasterizer: Implement GL_POINT_SPRITE
\| * \|	gl_rasterizer: Implement GL_POINT_SPRITE	ReinUsesLisp	2020-02-04	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	OpenGL core defaults to GL_POINT_SPRITE, meanwhile on OpenGL compatibility we have to explicitly enable it. This fixes gl_PointCoord's behaviour.
* \| \|	Merge pull request #3372 from ReinUsesLisp/fix-back-stencil	bunnei	2020-02-10	1	-3/+3
\|\ \ \ \| \| \| \| \| \| \| \|	maxwell_3d: Fix stencil back mask
\| * \| \|	maxwell_3d: Fix stencil back mask	ReinUsesLisp	2020-02-02	1	-3/+3
\| \| \| \|
* \| \| \|	Merge pull request #3369 from ReinUsesLisp/shf	bunnei	2020-02-08	1	-0/+20
\|\ \ \ \ \| \|_\|_\|/ \|/\| \| \|	shader/shift: Implement SHF
\| * \| \|	shader/shift: Implement SHF_LEFT_{IMM,R}	ReinUsesLisp	2020-02-02	1	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Shifts a pair of registers to the left and returns the high register.
* \| \| \|	Merge pull request #3357 from ReinUsesLisp/bfi-rc	bunnei	2020-02-04	1	-0/+2
\|\ \ \ \ \| \|_\|_\|/ \|/\| \| \|	shader/bfi: Implement register-constant buffer variant
\| * \| \|	shader/bfi: Implement register-constant buffer variant	ReinUsesLisp	2020-01-27	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It's the same as the variant that was implemented, but it takes the operands from another source.
* \| \| \|	Merge pull request #3356 from ReinUsesLisp/fcmp	bunnei	2020-02-04	1	-0/+7
\|\ \ \ \ \| \|_\|_\|/ \|/\| \| \|	shader/arithmetic: Implement FCMP
\| * \| \|	shader/arithmetic: Implement FCMP	ReinUsesLisp	2020-01-27	1	-0/+7
\| \|/ / \| \| \| \| \| \| \| \| \| \| \| \|	Compares the third operand with zero, then selects between the first and second.
* \| \|	Merge pull request #3282 from FernandoS27/indexed-samplers	bunnei	2020-02-02	5	-0/+28
\|\ \ \ \| \|_\|/ \|/\| \|	Partially implement Indexed samplers in general and specific code in GLSL
\| * \|	Shader_IR: Allow constant access of guest driver.	Fernando Sahmkow	2020-01-24	5	-0/+13
\| \| \|
\| * \|	GPU: Implement guest driver profile and deduce texture handler sizes.	Fernando Sahmkow	2020-01-24	5	-0/+15
\| \|/
* /	shader/memory: Implement ATOM.ADD	ReinUsesLisp	2020-01-26	1	-0/+30
\|/ \| \| \| \| \| \| \| \| \| \| \| \|	ATOM operates atomically on global memory. For now only add ATOM.ADD since that's what was found in commercial games. This asserts for ATOM.ADD.S32 (handling the others as unimplemented), although ATOM.ADD.U32 shouldn't be any different. This change forces us to change the default type on SPIR-V storage buffers from float to uint. We could also alias the buffers, but it's simpler for now to just use uint. While we are at it, abstract the code to avoid repetition.
*	Merge pull request #3322 from ReinUsesLisp/vk-front-face	bunnei	2020-01-20	1	-0/+1
\|\ \| \| \| \|	vk_graphics_pipeline: Set front facing properly
\| *	vk_graphics_pipeline: Set front facing properly	ReinUsesLisp	2020-01-18	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Front face was being forced to a certain value when cull face is disabled. Set a default value on initialization and drop the forcefully set front facing value with culling disabled.
* \|	Merge pull request #3305 from ReinUsesLisp/point-size-program	bunnei	2020-01-18	1	-1/+9
\|\ \ \| \| \| \| \| \|	gl_state: Implement PROGRAM_POINT_SIZE
\| * \|	gl_state: Implement PROGRAM_POINT_SIZE	ReinUsesLisp	2020-01-15	1	-1/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For gl_PointSize to have effect we have to activate GL_PROGRAM_POINT_SIZE.
* \| \|	shader/memory: Implement ATOMS.ADD.U32	ReinUsesLisp	2020-01-16	1	-3/+34
\| \|/ \|/\|
* \|	maxwell_3d: Make dirty_pointers private	Lioncash	2020-01-16	1	-2/+2
\|/ \| \| \| \|	This isn't used outside of the class itself, so we can make it private for the time being.
*	yuzu: Remove Maxwell debugger	ReinUsesLisp	2020-01-03	1	-31/+0
\| \| \| \| \|	This was carried from Citra and wasn't really used on yuzu. It also adds some runtime overhead. This commit removes it from yuzu's codebase.
*	Merge pull request #3239 from ReinUsesLisp/p2r	bunnei	2020-01-01	1	-1/+3
\|\ \| \| \| \|	shader/p2r: Implement P2R Pr
\| *	shader/r2p: Refactor P2R to support P2R	ReinUsesLisp	2019-12-20	1	-1/+3
\| \|
* \|	Merge pull request #3228 from ReinUsesLisp/ptp	bunnei	2019-12-27	1	-6/+6
\|\ \ \| \| \| \| \| \|	shader/texture: Implement AOFFI and PTP for TLD4 and TLD4S
\| * \|	shader/texture: Implement TLD4.PTP	ReinUsesLisp	2019-12-16	1	-6/+6
\| \| \|
* \| \|	Merge pull request #3244 from ReinUsesLisp/vk-fps	Fernando Sahmkow	2019-12-25	1	-6/+14
\|\ \ \ \| \| \| \| \| \| \| \|	fixed_pipeline_state: Define structure and loaders
\| * \| \|	maxwell_3d: Add depth bounds registers	ReinUsesLisp	2019-12-23	1	-6/+14
\| \| \|/ \| \|/\|
* \| \|	Merge pull request #3236 from ReinUsesLisp/rasterize-enable	bunnei	2019-12-25	2	-4/+9
\|\ \ \ \| \|/ / \|/\| \|	gl_rasterizer: Implement RASTERIZE_ENABLE
\| * \|	gl_rasterizer: Implement RASTERIZE_ENABLE	ReinUsesLisp	2019-12-18	2	-4/+9
\| \|/ \| \| \| \| \| \| \| \| \| \| \| \| \| \|	RASTERIZE_ENABLE is the opposite of GL_RASTERIZER_DISCARD. Implement it naturally using this. NVN games expect rasterize to be enabled by default, reflect that in our initial GPU state.
* /	shader_bytecode: Fix TLD4S encoding	ReinUsesLisp	2019-12-18	1	-1/+1
\|/
*	Shader_Ir: Correct TLD4S encoding and implement f16 flag.	Fernando Sahmkow	2019-12-12	1	-1/+2
\|
*	Merge pull request #3210 from ReinUsesLisp/memory-barrier	bunnei	2019-12-11	1	-1/+17
\|\ \| \| \| \|	shader: Implement MEMBAR.GL
\| *	shader: Implement MEMBAR.GL	ReinUsesLisp	2019-12-10	1	-1/+17
\| \| \| \| \| \| \| \|	Implement using memoryBarrier in GLSL and OpMemoryBarrier on SPIR-V.
* \|	Maxwell3D: Implement Depth Mode.	Fernando Sahmkow	2019-12-11	1	-6/+7
\|/ \| \| \| \|	This commit finishes adding depth mode that was reverted before due to other unresolved issues.
*	shader_ir/memory: Implement patch stores	ReinUsesLisp	2019-12-10	1	-1/+2
\|
*	maxwell_3d: Add tessellation tess level registers	ReinUsesLisp	2019-12-07	1	-1/+6
\|
*	maxwell_3d: Add tessellation mode register	ReinUsesLisp	2019-12-07	1	-1/+28
\|
*	maxwell_3d: Add patch vertices register	ReinUsesLisp	2019-12-07	1	-1/+4
\|
*	shader_bytecode: Remove corrupted character	ReinUsesLisp	2019-12-07	1	-1/+1
\|
*	Merge pull request #3109 from FernandoS27/new-instr	bunnei	2019-12-07	1	-0/+44
\|\ \| \| \| \|	Implement FLO & TXD Instructions on GPU Shaders
\| *	Shader_IR: Implement TXD instruction.	Fernando Sahmkow	2019-11-14	1	-0/+20
\| \|
\| *	Shader_IR: Implement FLO instruction.	Fernando Sahmkow	2019-11-14	1	-0/+6
\| \|
\| *	Shader_Bytecode: Add encodings for FLO, SHF and TXD	Fernando Sahmkow	2019-11-14	1	-0/+18
\| \|
* \|	Merge pull request #3098 from ReinUsesLisp/shader-invalidations	bunnei	2019-11-25	6	-38/+51
\|\ \ \| \| \| \| \| \|	gl_shader_cache: Miscellaneous changes to shaders
\| * \|	gl_shader_cache: Remove dynamic BaseBinding specialization	ReinUsesLisp	2019-11-23	2	-1/+1
\| \| \|
\| * \|	video_core: Unify ProgramType and ShaderStage into ShaderType	ReinUsesLisp	2019-11-23	6	-35/+43
\| \| \|
\| * \|	gl_shader_cache: Specialize local memory size for compute shaders	ReinUsesLisp	2019-11-23	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Local memory size in compute shaders was stubbed with an arbitary size. This commit specializes local memory size from guest GPU parameters.
\| * \|	gl_shader_cache: Specialize shader workgroup	ReinUsesLisp	2019-11-23	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Drop the usage of ARB_compute_variable_group_size and specialize compute shaders instead. This permits compute to run on AMD and Intel proprietary drivers.
* \| \|	Merge pull request #3105 from ReinUsesLisp/fix-stencil-reg	bunnei	2019-11-24	1	-3/+3
\|\ \ \ \| \|/ / \|/\| \|	maxwell_3d: Fix stencil_back_func_mask offset
\| * \|	maxwell_3d: Fix stencil_back_func_mask offset	ReinUsesLisp	2019-11-13	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	stencil_back_func_mask and stencil_back_mask were misplaced. This commit addresses that issue.
* \| \|	texture_cache: Use a table instead of switch for texture formats	ReinUsesLisp	2019-11-15	1	-8/+0
\| \|/ \|/\| \| \| \| \| \| \| \| \|	Use a large flat array to look up texture formats. This allows us to properly implement formats with different component types. It should also be faster.
* \|	Merge pull request #3081 from ReinUsesLisp/fswzadd-shuffles	Fernando Sahmkow	2019-11-14	1	-0/+10
\|\ \ \| \|/ \|/\|	shader: Implement FSWZADD and reimplement SHFL
\| *	shader_ir/warp: Implement FSWZADD	ReinUsesLisp	2019-11-08	1	-0/+10
\| \|
* \|	video_core: Silence implicit conversion warnings	ReinUsesLisp	2019-11-08	2	-6/+9
\|/
*	Merge pull request #2914 from FernandoS27/fermi-fix	bunnei	2019-11-06	1	-3/+27
\|\ \| \| \| \|	Fermi2D: limit blit area to only available area
\| *	Fermi2D: Use a different formula for delimiting blit areas.	Fernando Sahmkow	2019-10-18	1	-14/+28
\| \|
\| *	Fermi2D: limit blit area to only available area	Fernando Sahmkow	2019-10-17	1	-4/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Normaly OpenGL does not care if the areas exceed the texture regions but other backends such as Vulkan do care about the limits of this areas. This PR crops the areas of the blit in order that they don't surpass the limits of the textures. This should help Vulkan and faulty OpenGL drivers
* \|	common_func: Use std::array for INSERT_PADDING_* macros.	bunnei	2019-11-04	6	-104/+106
\| \| \| \| \| \| \| \|	- Zero initialization here is useful for determinism.
* \|	Merge pull request #3050 from FernandoS27/fix-tld4	Rodrigo Locatti	2019-10-30	1	-1/+29
\|\ \ \| \| \| \| \| \|	shader_ir: Fix TLD4 and add bindless variant
\| * \|	Shader_IR: Fix TLD4 and add Bindless Variant.	Fernando Sahmkow	2019-10-30	1	-1/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit fixes an issue where not all 4 results of tld4 were being written, the color component was defaulted to red, among other things. It also implements the bindless variant.
* \| \|	maxwell_3d/kepler_compute: Remove unused arguments in GetTexture	ReinUsesLisp	2019-10-28	4	-31/+14
\| \| \|
* \| \|	video_core/textures: Remove unused index entry in FullTextureInfo	ReinUsesLisp	2019-10-28	1	-1/+0
\| \| \|
* \| \|	maxwell_3d: Remove unused method GetStageTextures	ReinUsesLisp	2019-10-28	2	-42/+0
\|/ /
* \|	maxwell_3d: Silence implicit conversion warnings	ReinUsesLisp	2019-10-27	2	-24/+25
\| \| \| \| \| \| \| \|	While we are at it, unify types for dirty reg pointers.
* \|	Shader_IR: Address Feedback.	Fernando Sahmkow	2019-10-26	1	-3/+6
\| \|
* \|	Shader_IR: Clang format	Fernando Sahmkow	2019-10-25	1	-2/+1
\| \|
* \|	gl_shader_disk_cache: Store and load fast BRX	ReinUsesLisp	2019-10-25	1	-19/+16
\| \|
* \|	Shader_IR: allow lookup of texture samplers within the shader_ir for instructions that don't provide it	Fernando Sahmkow	2019-10-25	5	-4/+151
\| \|
* \|	VideoCore: Unify const buffer accessing along engines and provide ConstBufferLocker class to shaders.	Fernando Sahmkow	2019-10-25	5	-6/+36
\| \|
* \|	shader_bytecode: Make Matcher constexpr capable	Lioncash	2019-10-24	1	-13/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Greatly shrinks the amount of generated code for GetDecodeTable(). Collapses an assembly output of 9000+ lines down to ~3621 with Clang, and 6513 down to ~2616 with GCC, given it's now allowed to construct all the entries as a sequence of constant data.
* \|	maxwell_3d: Reduce FlushMMEInlineDraw logging to Trace	ReinUsesLisp	2019-10-20	1	-1/+1
\| \|
* \|	maxwell_3d: Silence truncation warnings	Lioncash	2019-10-15	1	-1/+2
\|/ \| \| \| \|	A trivial warning caused by not using size_t as the argument types instead of u32.
*	maxwell_3d: Add dirty flags for depth bounds values	ReinUsesLisp	2019-10-05	2	-1/+10
\| \| \| \| \|	This is useful in Vulkan where we want to update depth bounds without caring if it's enabled or disabled through vkCmdSetDepthBounds.
*	Merge pull request #2869 from ReinUsesLisp/suld	bunnei	2019-09-24	1	-3/+5
\|\ \| \| \| \|	shader/image: Implement SULD and fix SUATOM
\| *	gl_shader_decompiler: Use uint for images and fix SUATOM	ReinUsesLisp	2019-09-21	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	In the process remove implementation of SUATOM.MIN and SUATOM.MAX as these require a distinction between U32 and S32. These have to be implemented with imageCompSwap loop.
\| *	shader/image: Implement SULD and remove irrelevant code	ReinUsesLisp	2019-09-21	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	* Implement SULD as float. * Remove conditional declaration of GL_ARB_shader_viewport_layer_array.
\| *	shader_bytecode: Add SULD encoding	ReinUsesLisp	2019-09-21	1	-0/+2
\| \|
* \|	Merge pull request #2870 from FernandoS27/multi-draw	David	2019-09-22	2	-2/+129
\|\ \ \| \| \| \| \| \|	Implement a MME Draw commands Inliner and correct host instance drawing
\| * \|	Maxwell3D: Corrections and refactors to MME instance refactor	Fernando Sahmkow	2019-09-22	2	-33/+43
\| \| \|
\| * \|	Rasterizer: Refactor and simplify DrawBatch Interface.	Fernando Sahmkow	2019-09-19	1	-2/+2
\| \| \|
\| * \|	VideoCore: Corrections to the MME Inliner and removal of hacky instance management.	Fernando Sahmkow	2019-09-19	2	-10/+32
\| \| \|
\| * \|	Video Core: initial Implementation of InstanceDraw Packaging	Fernando Sahmkow	2019-09-19	2	-1/+96
\| \| \|
* \| \|	Fix clang-format	FearlessTobi	2019-09-22	1	-1/+1
\| \| \|
* \| \|	fermi_2d: Lower surface copy log severity to DEBUG	FearlessTobi	2019-09-22	1	-1/+1
\| \| \|
* \| \|	Merge pull request #2878 from FernandoS27/icmp	Rodrigo Locatti	2019-09-21	1	-0/+13
\|\ \ \ \| \| \| \| \| \| \| \|	shader_ir: Implement ICMP
\| * \| \|	Shader_IR: ICMP corrections and fixes	Fernando Sahmkow	2019-09-21	1	-0/+2
\| \| \| \|
\| * \| \|	Shader_IR: Implement ICMP.	Fernando Sahmkow	2019-09-20	1	-0/+11
\| \|/ /
* \| /	Mark DrawArrays as LOG_TRACE	David Marcec	2019-09-21	1	-1/+1
\| \|/ \|/\| \| \| \| \|	There's no reason to clog logs with DrawArray.
* \|	shader_ir/warp: Implement SHFL	ReinUsesLisp	2019-09-17	1	-0/+18
\|/
*	Merge pull request #2851 from ReinUsesLisp/srgb	Fernando Sahmkow	2019-09-15	1	-0/+3
\|\ \| \| \| \|	renderer_opengl: Fix sRGB blits
\| *	renderer_opengl: Fix sRGB blits	ReinUsesLisp	2019-09-11	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Removes the sRGB hack of tracking if a frame used an sRGB rendertarget to apply at least once to blit the final texture as sRGB. Instead of doing this apply sRGB if the presented image has sRGB. Also enable sRGB by default on Maxwell3D registers as some games seem to assume this.
* \|	Merge pull request #2824 from ReinUsesLisp/mme	Fernando Sahmkow	2019-09-15	2	-1/+19
\|\ \ \| \| \| \| \| \|	Revert "Revert #2466" and stub FirmwareCall 4
\| * \|	maxwell_3d: Update firmware 4 call stub commentary	Rodrigo Locatti	2019-09-15	1	-1/+2
\| \| \|
\| * \|	Revert "Revert #2466" and stub FirmwareCall 4	ReinUsesLisp	2019-09-04	2	-1/+18
\| \| \|
* \| \|	shader/image: Implement SUATOM and fix SUST	ReinUsesLisp	2019-09-11	1	-0/+32
\| \|/ \|/\|
* \|	Merge pull request #2823 from ReinUsesLisp/shr-clamp	bunnei	2019-09-10	1	-0/+4
\|\ \ \| \| \| \| \| \|	shader/shift: Implement SHR wrapped and clamped variants
\| * \|	shader/shift: Implement SHR wrapped and clamped variants	ReinUsesLisp	2019-09-04	1	-0/+4
\| \|/ \| \| \| \| \| \| \| \| \| \|	Nvidia defaults to wrapped shifts, but this is undefined behaviour on OpenGL's spec. Explicitly mask/clamp according to what the guest shader requires.
* \|	Merge pull request #2810 from ReinUsesLisp/mme-opt	bunnei	2019-09-10	2	-4/+6
\|\ \ \| \| \| \| \| \|	maxwell_3d: Avoid moving macro_params
\| * \|	maxwell_3d: Avoid moving macro_params	ReinUsesLisp	2019-09-04	2	-4/+6
\| \|/
* \|	gl_rasterizer: Implement image bindings	ReinUsesLisp	2019-09-06	1	-0/+1
\| \|
* \|	kepler_compute: Implement texture queries	ReinUsesLisp	2019-09-06	2	-4/+72
\|/
*	Merge pull request #2812 from ReinUsesLisp/f2i-selector	bunnei	2019-09-04	1	-1/+7
\|\ \| \| \| \|	shader_ir/conversion: Implement F2I and F2F F16 selector
\| *	shader_ir/conversion: Split int and float selector and implement F2F H1	ReinUsesLisp	2019-08-28	1	-1/+8
\| \|
\| *	shader_ir/conversion: Implement F2I F16 Ra.H1	ReinUsesLisp	2019-08-28	1	-2/+1
\| \|
* \|	Merge pull request #2811 from ReinUsesLisp/fsetp-fix	bunnei	2019-09-04	1	-0/+1
\|\ \ \| \| \| \| \| \|	float_set_predicate: Add missing negation bit for the second operand
\| * \|	float_set_predicate: Add missing negation bit for the second operand	ReinUsesLisp	2019-08-28	1	-0/+1
\| \|/
* \|	Merge pull request #2826 from ReinUsesLisp/macro-binding	bunnei	2019-09-04	2	-10/+4
\|\ \ \| \| \| \| \| \|	maxwell_3d: Fix macro binding cursor
\| * \|	maxwell_3d: Fix macro binding cursor	ReinUsesLisp	2019-09-01	2	-10/+4
\| \| \|
* \| \|	Merge pull request #2765 from FernandoS27/dma-fix	bunnei	2019-09-01	1	-16/+26
\|\ \ \ \| \|/ / \|/\| \|	MaxwellDMA: Fixes, corrections and relaxations.
\| * \|	MaxwellDMA: Fixes, corrections and relaxations.	Fernando Sahmkow	2019-07-26	1	-16/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit fixes offsets on Linear -> Tiled copies, corrects z pos fortiled->linear copies, corrects bytes_per_pixel calculation in tiled -> linear copies and relaxes some limitations set by latest dma fixes refactors.
* \| \|	video_core: Silent miscellaneous warnings (#2820)	Rodrigo Locatti	2019-08-30	7	-23/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* texture_cache/surface_params: Remove unused local variable * rasterizer_interface: Add missing documentation commentary * maxwell_dma: Remove unused rasterizer reference * video_core/gpu: Sort member declaration order to silent -Wreorder warning * fermi_2d: Remove unused MemoryManager reference * video_core: Silent unused variable warnings * buffer_cache: Silent -Wreorder warnings * kepler_memory: Remove unused MemoryManager reference * gl_texture_cache: Add missing override * buffer_cache: Add missing include * shader/decode: Remove unused variables
* \| \|	shader_ir: Implement VOTE	ReinUsesLisp	2019-08-21	1	-0/+16
\| \|/ \|/\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated: * anyThreadNV(value) -> value * allThreadsNV(value) -> value * allThreadsEqualNV(value) -> true ballotARB, also known as "uint64_t(activeThreadsNV())", emits VOTE.ANY Rd, PT, PT; on nouveau's compiler. This doesn't match exactly to Nvidia's code VOTE.ALL Rd, PT, PT; Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers.
* \|	Merge pull request #2753 from FernandoS27/float-convert	bunnei	2019-08-21	1	-2/+0
\|\ \ \| \| \| \| \| \|	Shader_Ir: Implement F16 Variants of F2F, F2I, I2F.
\| * \|	Shader_Ir: Implement F16 Variants of F2F, F2I, I2F.	Fernando Sahmkow	2019-07-20	1	-2/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit takes care of implementing the F16 Variants of the conversion instructions and makes sure conversions are done.
* \| \|	shader_ir: Implement NOP	ReinUsesLisp	2019-08-04	1	-0/+7
\| \| \|
* \| \|	Merge pull request #2592 from FernandoS27/sync1	bunnei	2019-07-26	1	-2/+3
\|\ \ \ \| \| \| \| \| \| \| \|	Implement GPU Synchronization Mechanisms & Correct NVFlinger
\| * \| \|	video_core: Implement GPU side Syncpoints	Fernando Sahmkow	2019-07-05	1	-2/+3
\| \| \| \|
* \| \| \|	Merge pull request #2743 from FernandoS27/surpress-assert	bunnei	2019-07-25	1	-1/+1
\|\ \ \ \ \| \|_\|_\|/ \|/\| \| \|	Downgrade and suppress a series of GPU asserts and debug messages.
\| * \| \|	MaxwellDMA/KeplerCopy: Downgrade DMA log message to Trace.	Fernando Sahmkow	2019-07-18	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This log was just to know which games used DMA. It's no longer important.
* \| \| \|	Merge pull request #2704 from FernandoS27/conditional	bunnei	2019-07-24	2	-1/+88
\|\ \ \ \ \| \| \| \| \| \| \| \| \| \|	maxwell3d: Implement Conditional Rendering
\| * \| \| \|	maxwell3d: Implement Conditional Rendering	Fernando Sahmkow	2019-07-17	2	-1/+88
\| \|/ / / \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conditional Rendering takes care of conditionaly clearing or drawing depending on a set of queries. This PR implements the query checks to stablish if things can be rendered or not.
* \| \| \|	Merge pull request #2734 from ReinUsesLisp/compute-shaders	bunnei	2019-07-22	1	-3/+4
\|\ \ \ \ \| \| \| \| \| \| \| \| \| \|	gl_rasterizer: Implement compute shaders
\| * \| \| \|	gl_rasterizer: Implement compute shaders	ReinUsesLisp	2019-07-15	1	-3/+4
\| \|/ / /
* \| \| \|	Merge pull request #2735 from FernandoS27/pipeline-rework	bunnei	2019-07-21	5	-69/+287
\|\ \ \ \ \| \|_\|_\|/ \|/\| \| \|	Rework Dirty Flags in GPU Pipeline, Optimize CBData and Redo Clearing mechanism
\| * \| \|	Maxwell3D: Reorganize and address feedback	Fernando Sahmkow	2019-07-20	2	-2/+6
\| \| \| \|
\| * \| \|	GL_State: Feedback and fixes	Fernando Sahmkow	2019-07-17	1	-1/+7
\| \| \| \|
\| * \| \|	Maxwell3D: Address Feedback	Fernando Sahmkow	2019-07-17	2	-13/+10
\| \| \| \|
\| * \| \|	GL_Rasterizer: Corrections to Clearing.	Fernando Sahmkow	2019-07-17	1	-1/+1
\| \| \| \|
\| * \| \|	Maxwell3D: Correct marking dirtiness on CB upload	Fernando Sahmkow	2019-07-17	1	-0/+1
\| \| \| \|
\| * \| \|	GL_Rasterizer: Rework RenderTarget/DepthBuffer clearing	Fernando Sahmkow	2019-07-17	1	-1/+0
\| \| \| \|
\| * \| \|	Maxwell3D: Implement State Dirty Flags.	Fernando Sahmkow	2019-07-17	2	-6/+86
\| \| \| \|
\| * \| \|	Maxwell3D: Rework CBData Upload	Fernando Sahmkow	2019-07-17	2	-8/+45
\| \| \| \|
\| * \| \|	Maxwell3D: Rework the dirty system to be more consistant and scaleable	Fernando Sahmkow	2019-07-17	5	-61/+155
\| \|/ /
* / /	shader/half_set_predicate: Implement missing HSETP2 variants	ReinUsesLisp	2019-07-20	1	-6/+20
\|/ /
* \|	Merge pull request #2695 from ReinUsesLisp/layer-viewport	Fernando Sahmkow	2019-07-15	1	-1/+1
\|\ \ \| \| \| \| \| \|	gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders
\| * \|	gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders	ReinUsesLisp	2019-07-08	1	-1/+1
\| \|/ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit implements gl_ViewportIndex and gl_Layer in vertex and geometry shaders. In the case it's used in a vertex shader, it requires ARB_shader_viewport_layer_array. This extension is available on AMD and Nvidia devices (mesa and proprietary drivers), but not available on Intel on any platform. At the moment of writing this description I don't know if this is a hardware limitation or a driver limitation. In the case that ARB_shader_viewport_layer_array is not available, writes to these registers on a vertex shader are ignored, with the appropriate logging.
* \|	Merge pull request #2675 from ReinUsesLisp/opengl-buffer-cache	bunnei	2019-07-15	1	-0/+1
\|\ \ \| \| \| \| \| \|	buffer_cache: Implement a generic buffer cache and its OpenGL backend
\| * \|	gl_rasterizer: Minor style changes	ReinUsesLisp	2019-07-06	1	-0/+1
\| \|/
* \|	Merge pull request #2692 from ReinUsesLisp/tlds-f16	Fernando Sahmkow	2019-07-14	1	-1/+2
\|\ \ \| \| \| \| \| \|	shader/texture: Add F16 support for TLDS
\| * \|	shader/texture: Add F16 support for TLDS	ReinUsesLisp	2019-07-07	1	-1/+2
\| \|/
* /	shader_ir: Implement BRX & BRA.CC	Fernando Sahmkow	2019-07-09	1	-0/+16
\|/
*	shader_bytecode: Include missing <array>	ReinUsesLisp	2019-06-24	1	-0/+1
\|
*	surface: Correct format S8Z24	Fernando Sahmkow	2019-06-21	1	-1/+1
\|
*	decoders: correct block calculation	Fernando Sahmkow	2019-06-21	5	-10/+10
\|
*	fermi2d: Correct Origin Mode	Fernando Sahmkow	2019-06-21	1	-5/+10
\|
*	texture_cache: Fermi2D reform and implement View Mirage	Fernando Sahmkow	2019-06-21	2	-14/+40
\| \| \| \| \|	This also does some fixes on compressed textures reinterpret and on the Fermi2D engine in general.
*	shader: Decode SUST and implement backing image functionality	ReinUsesLisp	2019-06-21	1	-2/+64
\|
*	maxwell_3d: Partially implement texture buffers as 1D textures	ReinUsesLisp	2019-06-21	1	-8/+4
\|
*	shader: Implement texture buffers	ReinUsesLisp	2019-06-21	1	-0/+16
\|
*	texture_cache: loose TryReconstructSurface when accurate GPU is not on.	Fernando Sahmkow	2019-06-21	1	-1/+1
\| \| \| \|	Also corrects some asserts.
*	engine_upload: Addapt to new Texture Cache	Fernando Sahmkow	2019-06-21	2	-5/+5
\|
*	video_core: Use un-shifted block sizes to avoid integer divisions	ReinUsesLisp	2019-06-21	2	-8/+5
\| \| \| \| \| \| \| \| \| \| \| \|	Instead of storing all block width, height and depths in their shifted form: block_width = 1U << block_shift; Store them like they are provided by the emulated hardware (their block_shift form). This way we can avoid doing the costly Common::AlignUp operation to align texture sizes and drop CPU integer divisions with bitwise logic (defined in Common::AlignBits).
*	Merge pull request #2562 from ReinUsesLisp/split-cbuf-upload	bunnei	2019-06-18	3	-10/+19
\|\ \| \| \| \|	video_core/engines: Move ConstBufferInfo out of Maxwell3D
\| *	video_core/engines: Move ConstBufferInfo out of Maxwell3D	ReinUsesLisp	2019-06-08	3	-10/+19
\| \|
* \|	kepler_compute: Use std::array for cbuf info	ReinUsesLisp	2019-06-08	1	-2/+3
\| \|
* \|	kepler_compute: Fix block_dim_x encoding	ReinUsesLisp	2019-06-08	1	-1/+1
\|/
*	shader_bytecode: Mark EXIT as flow instruction	Fernando Sahmkow	2019-06-04	1	-1/+1
\|
*	shader/memory: Implement ST (generic memory)	ReinUsesLisp	2019-05-21	1	-0/+1
\|
*	shader/memory: Implement LD (generic memory)	ReinUsesLisp	2019-05-21	1	-4/+15
\|
*	Merge pull request #2441 from ReinUsesLisp/al2p	bunnei	2019-05-19	2	-2/+21
\|\ \| \| \| \|	shader: Implement AL2P and ALD.PHYS
\| *	shader_ir/other: Implement IPA.IDX	ReinUsesLisp	2019-05-03	1	-0/+1
\| \|
\| *	shader_ir/memory: Implement physical input attributes	ReinUsesLisp	2019-05-03	1	-0/+4
\| \|
\| *	gl_shader_decompiler: Declare all possible varyings on physical attribute usage	ReinUsesLisp	2019-05-03	1	-0/+1
\| \|
\| *	shader_bytecode: Add AL2P decoding	ReinUsesLisp	2019-05-03	1	-2/+15
\| \|
* \|	Merge pull request #2472 from FernandoS27/tic	Hexagon12	2019-05-19	1	-1/+1
\|\ \ \| \| \| \| \| \|	maxwell_3d: reduce severity of different component formats assert.
\| * \|	maxwell_3d: reduce sevirity of different component formats assert.	Fernando Sahmkow	2019-05-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was reduced due to happening on most games and at such constant rate that it affected performance heavily for the end user. In general, we are well aware of the assert and an implementation is already planned.
* \| \|	Merge pull request #2469 from lioncash/copyable	Hexagon12	2019-05-19	1	-0/+2
\|\ \ \ \| \| \| \| \| \| \| \|	video_core/engines/maxwell_3d: Add is_trivially_copyable_v check for Regs
\| * \| \|	video_core/engines/maxwell_3d: Add is_trivially_copyable_v check for Regs	Lioncash	2019-05-14	1	-0/+2
\| \|/ / \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	std::memset is used to clear the entire register structure, which requires that the Regs struct be trivially copyable (otherwise undefined behavior is invoked). This prevents the case where a non-trivial type is potentially added to the struct.
* \| \|	Merge pull request #2470 from lioncash/ranged-for	Sebastian Valle	2019-05-19	1	-18/+18
\|\ \ \ \| \| \| \| \| \| \| \|	video_core/engines/maxwell_3d: Simplify for loops into ranged for loops within InitializeRegisterDefaults()
\| * \| \|	video_core/engines/maxwell3d: Get rid of three magic values in CallMethod()	Lioncash	2019-05-14	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We can use the named constant instead of using 32 directly.
\| * \| \|	video_core/engines/maxwell_3d: Simplify for loops into ranged for loops within InitializeRegisterDefaults()	Lioncash	2019-05-14	1	-15/+15
\| \|/ / \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Lessens the amount of code that needs to be read, and gets rid of the need to introduce an indexing variable. Instead, we just operate on the objects directly.
* \| \|	video_core/engines/engine_upload: Amend constructor initializer list order	Lioncash	2019-05-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Silences a -Wreorder warning.
* \| \|	video_core/engines/engine_upload: Default destructor in the cpp file	Lioncash	2019-05-14	2	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Avoids inlining destruction logic where applicable, and also makes forward declarations not cause unexpected compilation errors depending on where the State class is used.
* \| \|	video_core/engines/engine_upload: Remove unnecessary const on parameters in function declarations	Lioncash	2019-05-14	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	These only apply in the definition of the function. They can be omitted from the declaration.
* \| \|	video_core/engines/engine_upload: Remove unnecessary includes	Lioncash	2019-05-14	2	-2/+2
\|/ /
* \|	Merge pull request #2429 from FernandoS27/compute	bunnei	2019-05-09	11	-140/+479
\|\ \ \| \|/ \|/\|	Corrections and Implementation on GPU Engines
\| *	Refactors and name corrections.	Fernando Sahmkow	2019-05-01	6	-35/+35
\| \|
\| *	Fixes and Corrections to DMA Engine	Fernando Sahmkow	2019-04-23	2	-37/+57
\| \|
\| *	Add Swizzle Parameters to the DMA engine	Fernando Sahmkow	2019-04-23	2	-2/+27
\| \|
\| *	Add Documentation Headers to all the GPU Engines	Fernando Sahmkow	2019-04-23	5	-0/+29
\| \|
\| *	Corrections and styling	Fernando Sahmkow	2019-04-23	5	-6/+9
\| \|
\| *	Implement Maxwell3D Data Upload	Fernando Sahmkow	2019-04-23	2	-3/+32
\| \|
\| *	Introduce skeleton of the GPU Compute Engine.	Fernando Sahmkow	2019-04-23	2	-7/+201
\| \|
\| *	Revamp Kepler Memory to use a subegine to manage uploads	Fernando Sahmkow	2019-04-23	4	-92/+131
\| \|
* \|	Merge pull request #2322 from ReinUsesLisp/wswitch	bunnei	2019-04-29	1	-2/+3
\|\ \ \| \|/ \|/\|	video_core: Silent -Wswitch warnings
\| *	video_core: Silent -Wswitch warnings	ReinUsesLisp	2019-04-18	1	-2/+3
\| \|
* \|	Merge pull request #2411 from FernandoS27/unsafe-gpu	bunnei	2019-04-22	1	-2/+2
\|\ \ \| \| \| \| \| \|	GPU Manager: Implement ReadBlockUnsafe and WriteBlockUnsafe
\| * \|	Use ReadBlockUnsafe on TIC and TSC reading	Fernando Sahmkow	2019-04-16	1	-2/+2
\| \|/ \| \| \| \| \| \| \| \|	Use ReadBlockUnsafe on TIC and TSC reading as memory is never flushed from host GPU there.
* \|	Merge pull request #2400 from FernandoS27/corret-kepler-mem	bunnei	2019-04-22	2	-17/+54
\|\ \ \| \| \| \| \| \|	Implement Kepler Memory on both Linear and BlockLinear.
\| * \|	Use WriteBlock and ReadBlock.	Fernando Sahmkow	2019-04-16	1	-10/+6
\| \| \|
\| * \|	Implement Block Linear copies in Kepler Memory.	Fernando Sahmkow	2019-04-16	1	-5/+14
\| \| \|
\| * \|	Correct Kepler Memory on Linear Pushes.	Fernando Sahmkow	2019-04-15	2	-16/+48
\| \|/
* \|	Merge pull request #2407 from FernandoS27/f2f	bunnei	2019-04-20	1	-7/+20
\|\ \ \| \| \| \| \| \|	Do some corrections in conversion shader instructions.
\| * \|	Do some corrections in conversion shader instructions.	Fernando Sahmkow	2019-04-16	1	-7/+20
\| \|/ \| \| \| \| \| \| \| \| \| \|	Corrects encodings for I2F, F2F, I2I and F2I Implements Immediate variants of all four conversion types. Add assertions to unimplemented stuffs.
* \|	Merge pull request #2348 from FernandoS27/guest-bindless	bunnei	2019-04-18	3	-13/+68
\|\ \ \| \| \| \| \| \|	Implement Bindless Textures on Shader Decompiler and GL backend
\| * \|	Move ConstBufferAccessor to Maxwell3d, correct mistakes and clang format.	Fernando Sahmkow	2019-04-08	3	-3/+13
\| \| \|
\| * \|	Implement TXQ_B	Fernando Sahmkow	2019-04-08	1	-0/+2
\| \| \|
\| * \|	Corrections to TEX_B	Fernando Sahmkow	2019-04-08	1	-0/+32
\| \| \|
\| * \|	Implement Bindless Handling on SetupTexture	Fernando Sahmkow	2019-04-08	2	-13/+22
\| \| \|
\| * \|	Implement Bindless Samplers and TEX_B in the IR.	Fernando Sahmkow	2019-04-08	1	-0/+2
\| \| \|
* \| \|	Merge pull request #2315 from ReinUsesLisp/severity-decompiler	bunnei	2019-04-17	1	-1/+15
\|\ \ \ \| \| \| \| \| \| \| \|	shader_ir/decode: Reduce the severity of common assertions
\| * \| \|	shader_ir/memory: Reduce severity of LD_L cache management and log it	ReinUsesLisp	2019-04-03	1	-0/+7
\| \| \| \|
\| * \| \|	shader_ir/memory: Reduce severity of ST_L cache management and log it	ReinUsesLisp	2019-04-03	1	-1/+8
\| \| \| \|
* \| \| \|	shader_ir: Implement STG, keep track of global memory usage and flush	ReinUsesLisp	2019-04-14	1	-0/+6
\| \|_\|/ \|/\| \|
* \| \|	Merge pull request #2366 from FernandoS27/xmad-fix	bunnei	2019-04-10	1	-0/+3
\|\ \ \ \| \| \| \| \| \| \| \|	Correct XMAD mode, psl and high_b on different encodings.
\| * \| \|	Correct XMAD mode, psl and high_b on different encodings.	Fernando Sahmkow	2019-04-08	1	-0/+3
\| \| \|/ \| \|/\|
* / \|	Correct LOP_IMN encoding	Fernando Sahmkow	2019-04-08	1	-1/+1
\|/ /
* \|	maxwell_3d: Reduce severity of ProcessSyncPoint	ReinUsesLisp	2019-04-06	1	-2/+2
\| \|
* \|	Merge pull request #2317 from FernandoS27/sync	bunnei	2019-04-06	2	-1/+27
\|\ \ \| \| \| \| \| \|	Implement SyncPoint Register in the GPU.
\| * \|	Implement SyncPoint Register in the GPU.	Fernando Sahmkow	2019-04-06	2	-1/+27
\| \|/
* \|	video_core/engines: Make memory manager members private	Lioncash	2019-04-06	9	-13/+14
\| \| \| \| \| \| \| \| \| \|	These aren't used externally by anything, so they can be made private data members.
* \|	video_core/engines: Remove unnecessary inclusions where applicable	Lioncash	2019-04-06	9	-9/+24
\|/ \| \| \| \| \|	Replaces header inclusions with forward declarations where applicable and also removes unused headers within the cpp file. This reduces a few more dependencies on core/memory.h
*	maxwell_dma: Check for valid source in destination before copy.	bunnei	2019-03-21	1	-0/+10
\| \| \| \|	- Avoid a crash in Octopath Traveler.
*	gpu: Rewrite virtual memory manager using PageTable.	bunnei	2019-03-21	2	-5/+5
\|
*	video_core: Refactor to use MemoryManager interface for all memory access.	bunnei	2019-03-16	3	-55/+29
\| \| \| \| \| \| \| \| \| \| \|	# Conflicts: # src/video_core/engines/kepler_memory.cpp # src/video_core/engines/maxwell_3d.cpp # src/video_core/morton.cpp # src/video_core/morton.h # src/video_core/renderer_opengl/gl_global_cache.cpp # src/video_core/renderer_opengl/gl_global_cache.h # src/video_core/renderer_opengl/gl_rasterizer_cache.cpp
*	gpu: Use host address for caching instead of guest address.	bunnei	2019-03-15	3	-4/+12
\|
*	Merge pull request #2147 from ReinUsesLisp/texture-clean	bunnei	2019-03-10	1	-12/+13
\|\ \| \| \| \|	shader_ir: Remove "extras" from the MetaTexture
\| *	shader/decode: Remove extras from MetaTexture	ReinUsesLisp	2019-02-26	1	-4/+4
\| \|
\| *	shader/decode: Split memory and texture instructions decoding	ReinUsesLisp	2019-02-26	1	-8/+9
\| \|
* \|	gpu: Move command processing to another thread.	bunnei	2019-03-07	2	-3/+3
\| \|
* \|	video_core/engines: Remove unnecessary includes	Lioncash	2019-03-06	8	-10/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Removes a few unnecessary dependencies on core-related machinery, such as the core.h and memory.h, which reduces the amount of rebuilding necessary if those files change. This also uncovered some indirect dependencies within other source files. This also fixes those.
* \|	Merge pull request #2163 from ReinUsesLisp/bitset-dirty	bunnei	2019-02-28	2	-41/+40
\|\ \ \| \| \| \| \| \|	maxwell_3d: Use std::bitset to manage dirty flags
\| * \|	maxwell_3d: Use std::bitset to manage dirty flags	ReinUsesLisp	2019-02-26	2	-41/+40
\| \|/
* /	common/math_util: Move contents into the Common namespace	Lioncash	2019-02-27	2	-5/+5
\|/ \| \| \| \|	These types are within the common library, so they should be within the Common namespace.
*	Merge pull request #2118 from FernandoS27/ipa-improve	bunnei	2019-02-25	2	-6/+41
\|\ \| \| \| \|	shader_decompiler: Improve Accuracy of Attribute Interpolation.
\| *	shader_decompiler: Improve Accuracy of Attribute Interpolation.	Fernando Sahmkow	2019-02-14	2	-6/+41
\| \|
* \|	video_core: Remove usages of System::GetInstance() within the engines	Lioncash	2019-02-16	6	-16/+39
\| \| \| \| \| \| \| \| \| \|	Avoids the use of the global accessor in favor of explicitly making the system a dependency within the interface.
* \|	core_timing: Convert core timing into a class	Lioncash	2019-02-16	1	-1/+1
\|/ \| \| \| \| \| \| \| \| \| \|	Gets rid of the largest set of mutable global state within the core. This also paves a way for eliminating usages of GetInstance() on the System class as a follow-up. Note that no behavioral changes have been made, and this simply extracts the functionality into a class. This also has the benefit of making dependencies on the core timing functionality explicit within the relevant interfaces.
*	Merge pull request #2110 from lioncash/namespace	bunnei	2019-02-13	1	-1/+1
\|\ \| \| \| \|	core_timing: Rename CoreTiming namespace to Core::Timing
\| *	core_timing: Rename CoreTiming namespace to Core::Timing	Lioncash	2019-02-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Places all of the timing-related functionality under the existing Core namespace to keep things consistent, rather than having the timing utilities sitting in its own completely separate namespace.
* \|	Merge pull request #2104 from ReinUsesLisp/compute-assert	bunnei	2019-02-13	3	-43/+50
\|\ \ \| \| \| \| \| \|	kepler_compute: Fixup assert and rename the engine
\| * \|	kepler_compute: Fixup assert and rename engines	ReinUsesLisp	2019-02-10	3	-43/+50
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When I originally added the compute assert I used the wrong documentation. This addresses that. The dispatch register was tested with homebrew against hardware and is triggered by some games (e.g. Super Mario Odyssey). What exactly is missing to get a valid program bound by this engine requires more investigation.
* \| \|	Corrected F2I None mode to RoundEven.	Fernando Sahmkow	2019-02-11	1	-1/+1
\| \|/ \|/\|
* \|	gl_rasterizer: Implement a more accurate fermi 2D copy.	bunnei	2019-02-07	2	-52/+39
\|/ \| \| \|	- This is a blit, use the blit registers.
*	Merge pull request #2042 from ReinUsesLisp/nouveau-tex	bunnei	2019-02-07	4	-67/+67
\|\ \| \| \| \|	maxwell_3d: Allow texture handles with TIC id zero
\| *	video_core: Assert on invalid GPU to CPU address queries	ReinUsesLisp	2019-02-03	4	-41/+54
\| \|
\| *	maxwell_3d: Allow sampler handles with TSC id zero	ReinUsesLisp	2019-02-03	1	-10/+6
\| \|
\| *	maxwell_3d: Allow texture handles with TIC id zero	ReinUsesLisp	2019-02-03	1	-16/+7
\| \| \| \| \| \| \| \| \| \|	Also remove "enabled" field from Tegra::Texture::FullTextureInfo because it would become unused.
* \|	Merge pull request #2081 from ReinUsesLisp/lmem-64	bunnei	2019-02-05	1	-3/+3
\|\ \ \| \| \| \| \| \|	shader_ir/memory: Add LD_L 64 bits loads
\| * \|	shader_bytecode: Rename BytesN enums to BitsN	ReinUsesLisp	2019-02-03	1	-3/+3
\| \|/
* \|	Merge pull request #2082 from FernandoS27/txq-stl	bunnei	2019-02-05	1	-0/+4
\|\ \ \| \|/ \|/\|	Fix TXQ not using the component mask.
\| *	Update src/video_core/engines/shader_bytecode.h	Mat M	2019-02-04	1	-1/+1
\| \| \| \| \| \|	Co-Authored-By: FernandoS27 <fsahmkow27@gmail.com>
\| *	Fix TXQ not using the component mask.	Fernando Sahmkow	2019-02-03	1	-0/+4
\| \|
* \|	shader_ir: Unify constant buffer offset values	ReinUsesLisp	2019-01-30	1	-0/+8
\|/ \| \| \| \| \| \|	Constant buffer values on the shader IR were using different offsets if the access direct or indirect. cbuf34 has a non-multiplied offset while cbuf36 does. On shader decoding this commit multiplies it by four on cbuf34 queries.
*	shader_decode: Implement LDG and basic cbuf tracking	ReinUsesLisp	2019-01-30	1	-0/+8
\|
*	Merge pull request #1927 from ReinUsesLisp/shader-ir	bunnei	2019-01-26	2	-3/+9
\|\ \| \| \| \|	video_core: Replace gl_shader_decompiler with an IR based decompiler
\| *	shader_decode: Implement VMAD and VSETP	ReinUsesLisp	2019-01-15	1	-2/+3
\| \|
\| *	shader_decode: Implement HFMA2	ReinUsesLisp	2019-01-15	1	-0/+1
\| \|
\| *	shader_decode: Fixup clang-format	ReinUsesLisp	2019-01-15	1	-1/+1
\| \|
\| *	shader_ir: Initial implementation	ReinUsesLisp	2019-01-15	1	-0/+4
\| \|
\| *	shader_bytecode: Fixup encoding	ReinUsesLisp	2019-01-15	1	-1/+1
\| \|
\| *	shader_header: Make local memory size getter constant	ReinUsesLisp	2019-01-15	1	-1/+1
\| \|
* \|	maxwell_3d: Set rt_separate_frag_data to 1 by default	ReinUsesLisp	2019-01-22	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Commercial games assume that this value is 1 but they never set it. On the other hand nouveau manually sets this register. On ConfigureFramebuffers we were asserting for what we are actually implementing (according to envytools).
* \|	gl_rasterizer_cache: Use dirty flags for the depth buffer	ReinUsesLisp	2019-01-07	2	-0/+12
\| \|
* \|	gl_rasterizer_cache: Use dirty flags for color buffers	ReinUsesLisp	2019-01-07	2	-0/+12
\|/
*	gl_shader_cache: Use dirty flags for shaders	ReinUsesLisp	2019-01-07	2	-0/+11
\|
*	shader_bytecode: Fixup TEXS.F16 encoding	ReinUsesLisp	2018-12-26	1	-1/+1
\|
*	Fixed uninitialized memory due to missing returns in canary	David Marcec	2018-12-19	2	-0/+4
\| \| \| \|	Functions which are suppose to crash on non canary builds usually don't return anything which lead to uninitialized memory being used.
*	shader_bytecode: Fixup half float's operator B encoding	ReinUsesLisp	2018-12-18	1	-1/+1
\|
*	Implement postfactor multiplication/division for fmul instructions	heapo	2018-12-17	1	-1/+1
\|
*	gl_shader_decompiler: Implement TEXS.F16	ReinUsesLisp	2018-12-05	1	-1/+2
\|
*	gl_rasterizer: Enable clip distances when set in register and in shader	ReinUsesLisp	2018-11-29	1	-0/+1
\|
*	Merge pull request #1808 from Tinob/master	bunnei	2018-11-28	1	-1/+15
\|\ \| \| \| \|	Fix clip distance and viewport
\| *	Add support for Clip Distance enabled register	Rodolfo Bogado	2018-11-27	1	-1/+15
\| \|
* \|	Merge pull request #1786 from Tinob/DepthClamp	bunnei	2018-11-28	1	-1/+9
\|\ \ \| \| \| \| \| \|	Add Depth Clamp Support
\| * \|	Implement depth clamp	Rodolfo Bogado	2018-11-27	1	-1/+9
\| \|/
* \|	Merge pull request #1792 from bunnei/dma-pusher	bunnei	2018-11-28	10	-47/+52
\|\ \ \| \| \| \| \| \|	gpu: Rewrite GPU command list processing with DmaPusher class.
\| * \|	gpu: Rewrite GPU command list processing with DmaPusher class.	bunnei	2018-11-27	10	-47/+52
\| \|/ \| \| \| \| \| \|	- More accurate impl., fixes Undertale (among other games).
* \|	Merge pull request #1735 from FernandoS27/tex-spacing	bunnei	2018-11-28	1	-2/+2
\|\ \ \| \|/ \|/\|	Texture decoder: Implemented Tile Width Spacing
\| *	Implemented Tile Width Spacing	FernandoS27	2018-11-26	1	-2/+2
\| \|
* \|	Merge pull request #1794 from Tinob/master	bunnei	2018-11-27	1	-1/+9
\|\ \ \| \| \| \| \| \|	Add support for viewport_transfom_enable register
\| * \|	Add support for viewport_transfom_enable register	Rodolfo Bogado	2018-11-24	1	-1/+9
\| \| \|
* \| \|	Merge pull request #1723 from degasus/dirty_flags	bunnei	2018-11-27	5	-0/+34
\|\ \ \ \| \| \| \| \| \| \| \|	gl_rasterizer: Skip VB upload if the state is clean.
\| * \| \|	gl_rasterizer: Skip VB upload if the state is clean.	Markus Wick	2018-11-17	5	-0/+34
\| \| \| \|
* \| \| \|	GPU States: Implement Polygon Offset. This is used in SMO all the time. (#1784)	Marcos	2018-11-27	1	-4/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* GPU States: Implement Polygon Offset. This is used in SMO all the time. * Clang Format fixes. * Initialize polygon_offset in the constructor.
* \| \| \|	Merge pull request #1798 from ReinUsesLisp/y-direction	bunnei	2018-11-27	1	-0/+1
\|\ \ \ \ \| \|_\|_\|/ \|/\| \| \|	gl_shader_decompiler: Implement S2R's Y_DIRECTION
\| * \| \|	gl_shader_decompiler: Implement S2R's Y_DIRECTION	ReinUsesLisp	2018-11-25	1	-0/+1
\| \| \|/ \| \|/\|
* \| \|	Merge pull request #1763 from ReinUsesLisp/bfi	bunnei	2018-11-26	1	-0/+3
\|\ \ \ \| \| \| \| \| \| \| \|	gl_shader_decompiler: Implement BFI_IMM_R
\| * \| \|	gl_shader_decompiler: Implement BFI_IMM_R	ReinUsesLisp	2018-11-21	1	-0/+3
\| \| \| \|
* \| \| \|	Merge pull request #1760 from ReinUsesLisp/r2p	bunnei	2018-11-26	1	-0/+14
\|\ \ \ \ \| \| \| \| \| \| \| \| \| \|	gl_shader_decompiler: Implement R2P_IMM
\| * \| \| \|	gl_shader_decompiler: Implement R2P_IMM	ReinUsesLisp	2018-11-21	1	-0/+14
\| \|/ / /
* \| \| \|	Merge pull request #1783 from ReinUsesLisp/clip-distances	bunnei	2018-11-26	2	-1/+12
\|\ \ \ \ \| \|_\|/ / \|/\| \| \|	gl_shader_decompiler: Implement clip distances
\| * \| \|	gl_shader_decompiler: Implement clip distances	ReinUsesLisp	2018-11-23	2	-1/+12
\| \| \| \|
* \| \| \|	Merge pull request #1785 from Tinob/master	bunnei	2018-11-24	1	-1/+11
\|\ \ \ \ \| \| \| \| \| \| \| \| \| \|	Add support for clear_flags register
\| * \| \| \|	Add support for clear_flags register	Rodolfo Bogado	2018-11-24	1	-1/+11
\| \| \| \| \|
* \| \| \| \|	Merge pull request #1769 from ReinUsesLisp/cc	bunnei	2018-11-24	1	-4/+3
\|\ \ \ \ \ \| \|/ / / / \|/\| \| \| \|	gl_shader_decompiler: Rename cc to condition code and name internal flags
\| * \| \| \|	gl_shader_decompiler: Rename control codes to condition codes	ReinUsesLisp	2018-11-22	1	-4/+3
\| \| \|/ / \| \|/\| \|
* \| \| \|	Added predicate comparison LessEqualWithNan (#1736)	Hexagon12	2018-11-23	1	-0/+1
\| \|/ / \|/\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Added predicate comparison LessEqualWithNan * oops * Clang fix
* \| \|	maxwell_3d: Implement alternate blend equations.	bunnei	2018-11-22	1	-0/+7
\|/ / \| \| \| \| \| \|	- Used by Undertale.
* \|	maxwell_3d: Initialize rasterizer color mask registers as enabled.	bunnei	2018-11-21	1	-0/+9
\| \| \| \| \| \| \| \|	- Fixes rendering regression with Sonic Mania.
* \|	small fix for alphaToOne bit location	Rodolfo Bogado	2018-11-17	1	-2/+2
\| \|
* \|	fix for gcc compilation	Rodolfo Bogado	2018-11-17	1	-60/+61
\| \|
* \|	add AlphaToCoverage and AlphaToOne	Rodolfo Bogado	2018-11-17	1	-1/+7
\| \|
* \|	add support for fragment_color_clamp	Rodolfo Bogado	2018-11-17	1	-1/+4
\| \|
* \|	set default value for point size register	Rodolfo Bogado	2018-11-17	1	-0/+3
\| \|
* \|	fix viewport and scissor behavior	Rodolfo Bogado	2018-11-17	2	-12/+18
\|/
*	gl_rasterizer: Minor cleanup	Frederic L	2018-11-13	1	-4/+2
\| \| \|	Minor code cleanup from unaddressed feedback in #1654
*	Try to fix problems with stencil test in some games, relax translation to opengl enums to avoid crashing and only generate logs of the errors.	Rodolfo Bogado	2018-11-11	2	-0/+21
\|
*	Merge pull request #1654 from degasus/dirty_flags	bunnei	2018-11-11	2	-0/+14
\|\ \| \| \| \|	gl_rasterizer: Skip VAO binding if the state is clean.
\| *	gl_rasterizer: Skip VAO binding if the state is clean.	Markus Wick	2018-11-06	2	-0/+14
\| \|
* \|	Add support to color mask to avoid issues in blending caused by wrong values in the alpha channel in some render targets.	Rodolfo Bogado	2018-11-05	1	-3/+20
\| \|
* \|	Implement multi-target viewports and blending	Rodolfo Bogado	2018-11-05	2	-2/+28
\|/
*	Merge pull request #1527 from FernandoS27/assert-flow	bunnei	2018-11-01	1	-0/+1
\|\ \| \| \| \|	Assert Control Flow Instructions using Control Codes
\| *	Assert Control Flow Instructions using Control Codes	FernandoS27	2018-10-29	1	-1/+2
\| \|
* \|	maxwell_3d: Restructure macro upload to use a single macro code memory.	bunnei	2018-11-01	2	-12/+39
\| \| \| \| \| \| \| \| \| \|	- Fixes an issue where macros could be skipped. - Fixes rendering of distant objects in Super Mario Odyssey.
* \|	Merge pull request #1528 from FernandoS27/assert-control-codes	bunnei	2018-11-01	1	-1/+5
\|\ \ \| \| \| \| \| \|	Assert Control Codes Generation on Shader Instructions
\| * \|	Assert Control Codes Generation	FernandoS27	2018-10-30	1	-1/+5
\| \|/
* /	global: Use std::optional instead of boost::optional (#1578)	Frederic L	2018-10-30	2	-9/+9
\|/ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* get rid of boost::optional * Remove optional references * Use std::reference_wrapper for optional references * Fix clang format * Fix clang format part 2 * Adressed feedback * Fix clang format and MacOS build
*	Implement sRGB Support, including workarounds for nvidia driver issues and QT sRGB support	Rodolfo Bogado	2018-10-28	1	-1/+6
\|
*	gl_rasterizer: Implement primitive restart.	bunnei	2018-10-26	1	-1/+9
\|
*	Merge pull request #1533 from FernandoS27/lmem	bunnei	2018-10-26	2	-0/+36
\|\ \| \| \| \|	Implemented Shader Local Memory
\| *	Implemented LD_L and ST_L	FernandoS27	2018-10-24	2	-0/+36
\| \|
* \|	maxwell_3d: Add code for initializing register defaults.	bunnei	2018-10-26	2	-1/+21
\|/
*	Merge pull request #1554 from FernandoS27/pointsize	bunnei	2018-10-24	1	-0/+1
\|\ \| \| \| \|	Implement PointSize Output Attribute.
\| *	Implement PointSize	FernandoS27	2018-10-23	1	-0/+1
\| \|
* \|	maxwell_3d: Remove unused variable within ProcessQueryGet()	Lioncash	2018-10-24	1	-1/+0
\|/
*	Merge pull request #1519 from ReinUsesLisp/vsetp	bunnei	2018-10-23	1	-3/+15
\|\ \| \| \| \|	gl_shader_decompiler: Implement VSETP
\| *	gl_shader_decompiler: Implement VSETP	ReinUsesLisp	2018-10-23	1	-0/+2
\| \|
\| *	gl_shader_decompiler: Abstract VMAD into a video subset	ReinUsesLisp	2018-10-23	1	-3/+13
\| \|
* \|	Merge pull request #1539 from lioncash/dma	bunnei	2018-10-23	3	-19/+10
\|\ \ \| \| \| \| \| \|	maxwell_dma: Silence compilation warnings
\| * \|	engines/maxwell_*: Use nested namespace specifiers where applicable	Lioncash	2018-10-20	3	-12/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	These three source files are the only ones within the engines directory that don't use nested namespaces. We may as well change these over to keep things consistent.
\| * \|	maxwell_dma: Make variables const where applicable within HandleCopy()	Lioncash	2018-10-20	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \|	These are never modified, so we can make that assumption explicit.
\| * \|	maxwell_dma: Make FlushAndInvalidate's size parameter a u64	Lioncash	2018-10-20	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	This prevents truncation warnings at the lambda's usage sites.
\| * \|	maxwell_dma: Remove unused variables in HandleCopy()	Lioncash	2018-10-20	1	-3/+0
\| \| \| \| \| \| \| \| \| \| \| \|	These pointer variables are never used, so we can get rid of them.
* \| \|	Merge pull request #1470 from FernandoS27/alpha_testing	bunnei	2018-10-23	1	-1/+3
\|\ \ \ \| \| \| \| \| \| \| \|	Implemented Alpha Test using Shader Emulation
\| * \| \|	Implemented Alpha Testing	FernandoS27	2018-10-22	1	-1/+3
\| \| \| \|
* \| \| \|	Merge pull request #1512 from ReinUsesLisp/brk	bunnei	2018-10-23	1	-3/+7
\|\ \ \ \ \| \|_\|_\|/ \|/\| \| \|	gl_shader_decompiler: Implement PBK and BRK
\| * \| \|	gl_shader_decompiler: Implement PBK and BRK	ReinUsesLisp	2018-10-18	1	-3/+7
\| \| \| \|
* \| \| \|	Added Saturation to FMUL32I	FernandoS27	2018-10-23	1	-0/+4
\| \|/ / \|/\| \|
* \| \|	Fixed FSETP and FSET	FernandoS27	2018-10-22	1	-2/+0
\| \|/ \|/\|
* \|	Merge pull request #1501 from ReinUsesLisp/half-float	bunnei	2018-10-20	1	-0/+145
\|\ \ \| \| \| \| \| \|	gl_shader_decompiler: Implement H* instructions
\| * \|	gl_shader_decompiler: Implement HSET2_R	ReinUsesLisp	2018-10-15	1	-0/+18
\| \| \|
\| * \|	gl_shader_decompiler: Implement HSETP2_R	ReinUsesLisp	2018-10-15	1	-0/+20
\| \| \|
\| * \|	gl_shader_decompiler: Implement HFMA2 instructions	ReinUsesLisp	2018-10-15	1	-0/+32
\| \| \|
\| * \|	gl_shader_decompiler: Implement HADD2_IMM and HMUL2_IMM	ReinUsesLisp	2018-10-15	1	-0/+30
\| \| \|
\| * \|	gl_shader_decompiler: Implement non-immediate HADD2 and HMUL2 instructions	ReinUsesLisp	2018-10-15	1	-0/+25
\| \| \|
\| * \|	gl_shader_decompiler: Setup base for half float unpacking and setting	ReinUsesLisp	2018-10-15	1	-0/+20
\| \| \|
* \| \|	GPU: Improved implementation of maxwell DMA (Subv).	bunnei	2018-10-19	2	-16/+65
\| \| \|
* \| \|	GPU: Invalidate destination address of kepler_memory writes.	bunnei	2018-10-19	2	-2/+16
\| \| \|
* \| \|	fermi_2d: Add support for more accurate surface copies.	bunnei	2018-10-19	1	-3/+6
\| \| \|
* \| \|	Implement 3D Textures	FernandoS27	2018-10-18	1	-1/+4
\| \|/ \|/\|
* \|	shader_bytecode: Add Control Code enum 0xf	ReinUsesLisp	2018-10-15	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Control Code 0xf means to unconditionally execute the instruction. This value is passed to most BRA, EXIT and SYNC instructions (among others) but this may not always be the case.
* \|	Propagate depth and depth_block on modules using decoders	FernandoS27	2018-10-13	3	-10/+18
\|/
*	gl_shader_decompiler: Implement VMAD	ReinUsesLisp	2018-10-11	1	-0/+36
\|
*	Merge pull request #1458 from FernandoS27/fix-render-target-block-settings	bunnei	2018-10-11	2	-4/+34
\|\ \| \| \| \|	Fixed block height settings for RenderTargets and Depth Buffers
\| *	Add memory Layout to Render Targets and Depth Buffers	FernandoS27	2018-10-10	1	-2/+14
\| \|
\| *	Fixed block height settings for RenderTargets and Depth Buffers, and added block width and block depth	FernandoS27	2018-10-10	2	-4/+22
\| \|
* \|	Merge pull request #1460 from FernandoS27/scissor_test	bunnei	2018-10-10	1	-1/+16
\|\ \ \| \| \| \| \| \|	Implemented Scissor Testing
\| * \|	Assert Scissor tests	FernandoS27	2018-10-09	1	-1/+16
\| \|/
* /	gl_shader_decompiler: Implement geometry shaders	ReinUsesLisp	2018-10-07	1	-0/+112
\|/
*	fermi_2d: Implement simple copies with AccelerateSurfaceCopy.	bunnei	2018-10-06	2	-23/+35
\|
*	gl_rasterizer: Implement quads topology	ReinUsesLisp	2018-10-04	1	-0/+6
\|
*	Merge pull request #1411 from ReinUsesLisp/point-size	bunnei	2018-09-29	1	-1/+6
\|\ \| \| \| \|	video_core: Implement point_size and add point state sync
\| *	video_core: Implement point_size and add point state sync	ReinUsesLisp	2018-09-28	1	-1/+6
\| \|
* \|	gl_state: Pack sampler bindings into a single ARB_multi_bind	ReinUsesLisp	2018-09-28	1	-0/+1
\|/
*	video_core: Add asserts for CS, TFB and alpha testing	ReinUsesLisp	2018-09-26	3	-3/+64
\| \| \| \| \| \|	Add asserts for compute shader dispatching, transform feedback being enabled and alpha testing. These have in common that they'll probably break rendering without logging.
*	shader_bytecode: Lay out the Ipa-related enums better	Lioncash	2018-09-21	1	-2/+12
\| \| \| \|	This is more consistent with the surrounding enums.
*	shader_bytecode: Make operator== and operator!= of IpaMode const qualified	Lioncash	2018-09-21	1	-6/+7
\| \| \| \| \|	These don't affect the state of the struct and can be const member functions.
*	Merge pull request #1279 from FernandoS27/csetp	bunnei	2018-09-19	1	-0/+47
\|\ \| \| \| \|	shader_decompiler: Implemented (Partialy) Control Codes and CSETP
\| *	Implemented I2I.CC on the NEU control code, used by SMO	FernandoS27	2018-09-17	1	-1/+1
\| \|
\| *	Implemented CSETP	FernandoS27	2018-09-17	1	-0/+11
\| \|
\| *	Implemented Control Codes	FernandoS27	2018-09-17	1	-0/+36
\| \|
* \|	Merge pull request #1299 from FernandoS27/texture-sanatize	bunnei	2018-09-19	1	-1/+147
\|\ \ \| \| \| \| \| \|	shader_decompiler: Asserts for Texture Instructions
\| * \|	Added texture misc modes to texture instructions	FernandoS27	2018-09-17	1	-1/+147
\| \|/
* \|	Merge pull request #1290 from FernandoS27/shader-header	bunnei	2018-09-18	1	-0/+103
\|\ \ \| \|/ \|/\|	Implemented (Partialy) Shader Header
\| *	Replace old FragmentHeader for the new Header	FernandoS27	2018-09-11	1	-9/+15
\| \|
\| *	Implemented (Partialy) Shader Header	FernandoS27	2018-09-11	1	-0/+97
\| \|
* \|	Merge pull request #1326 from FearlessTobi/port-4182	bunnei	2018-09-17	6	-32/+33
\|\ \ \| \| \| \| \| \|	Port #4182 from Citra: "Prefix all size_t with std::"
\| * \|	Port #4182 from Citra: "Prefix all size_t with std::"	fearlessTobi	2018-09-15	6	-32/+33
\| \| \|
* \| \|	Merge pull request #1273 from Subv/ld_sizes	bunnei	2018-09-15	1	-1/+9
\|\ \ \ \| \| \| \| \| \| \| \|	Shaders: Implemented multiple-word loads and stores to and from attribute memory.
\| * \| \|	Shaders: Implemented multiple-word loads and stores to and from attribute memory.	Subv	2018-09-15	1	-1/+9
\| \|/ / \| \| \| \| \| \| \| \| \|	This seems to be an optimization performed by nouveau.
* \| \|	Merge pull request #1271 from Subv/kepler_engine	bunnei	2018-09-15	2	-0/+135
\|\ \ \ \| \|/ / \|/\| \|	GPU: Basic implementation of the Kepler Inline Memory engine (p2mf).
\| * \|	GPU: Basic implementation of the Kepler Inline Memory engine (p2mf).	Subv	2018-09-12	2	-0/+135
\| \| \| \| \| \| \| \| \| \| \| \|	This engine writes data from a FIFO register into the configured address.
* \| \|	Merge pull request #1263 from FernandoS27/tex-mode	bunnei	2018-09-12	1	-0/+10
\|\ \ \ \| \|/ / \|/\| \|	shader_decompiler: Implemented (Partially) Texture Processing Modes
\| * \|	Implemented Texture Processing Modes	FernandoS27	2018-09-12	1	-0/+10
\| \|/
* /	Implemented encodings for LEA and PSET	FernandoS27	2018-09-11	1	-0/+64
\|/
*	rasterizer: Drop unused handler.	Markus Wick	2018-09-10	1	-2/+0
\| \| \| \| \| \| \| \|	This virtual function is called in a very hot spot, and it does nothing. If this kind of feature is required, please be more specific and add callbacks in the switch statement within Maxwell3D::WriteReg. There is no point in having another switch statement within the rasterizer.
*	gl_rasterizer: Implement multiple color attachments.	bunnei	2018-09-10	1	-1/+21
\|
*	Merge pull request #1268 from FernandoS27/tmml	bunnei	2018-09-10	1	-5/+19
\|\ \| \| \| \|	shader_decompiler: Implemented TMML
\| *	Implemented TMML	FernandoS27	2018-09-10	1	-5/+19
\| \|
* \|	Merge pull request #1272 from Subv/dma_2d	bunnei	2018-09-10	1	-2/+10
\|\ \ \| \|/ \|/\|	GPU/DMA: Partially implemented the 'enable_2d' bit in the DMA engine.
\| *	GPU/DMA: Partially implemented the 'enable_2d' bit in the DMA engine.	Subv	2018-09-08	1	-2/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	When not set, this tells the GPU to only use the X size when performing a DMA copy. This is only implemented for linear->linear and tiled->tiled copies. Conversion copies still retain the assert. This bit is unset by some games for various purposes, and by nouveau when copying the vertex buffers.
* \|	Implemented TXQ dimension query type, used by SMO.	FernandoS27	2018-09-09	1	-1/+16
\| \|
* \|	Change name of TEXQ to TXQ, in order to match NVIDIA's naming	FernandoS27	2018-09-09	1	-2/+2
\| \|
* \|	maxwell_3d: Remove assert that no longer applies.	bunnei	2018-09-08	1	-4/+0
\|/
*	Merge pull request #1243 from degasus/VAO_cache	bunnei	2018-09-06	1	-2/+7
\|\ \| \| \| \|	gl_rasterizer: Implement a VAO cache.
\| *	gl_rasterizer: Implement a VAO cache.	Markus Wick	2018-09-05	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \|	This patch caches VAO objects instead of re-emiting all pointers per draw call. Configuring this pointers is known as a fast task, but it yields too many GL calls. So for better performance, just bind the VAO instead of 16 pointers.
* \|	Implemented IPA Properly	FernandoS27	2018-09-06	1	-0/+12
\|/
*	Merge pull request #1213 from DarkLordZach/octopath-fs	bunnei	2018-09-02	1	-2/+3
\|\ \| \| \| \|	filesystem/maxwell_3d: Various changes to boot Project Octopath Traveller
\| *	maxwell_3d: Use CoreTiming for query timestamp	Zach Hilman	2018-09-01	1	-2/+3
\| \|
* \|	Merge pull request #1215 from ogniK5377/texs-nodep-assert	bunnei	2018-09-02	1	-0/+1
\|\ \ \| \| \| \| \| \|	Added assert for TEXS nodep
\| * \|	Added assert for TEXS nodep	David Marcec	2018-09-01	1	-0/+1
\| \|/
* \|	Merge pull request #1214 from ogniK5377/ipa-assert	bunnei	2018-09-02	1	-2/+5
\|\ \ \| \| \| \| \| \|	Added better asserts to IPA, Renamed IPA modes to match mesa
\| * \|	Added better asserts to IPA, Renamed IPA modes to match mesa	David Marcec	2018-09-01	1	-2/+5
\| \|/ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	IpaMode is changed to IpaInterpMode IpaMode is suppose to be 2 bits not 3 Added IpaSampleMode Added Saturate Renamed modes based on https://github.com/mesa3d/mesa/blob/d27c7918916cdc8092959124955f887592e37d72/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp#L2530
* \|	Merge pull request #1216 from ogniK5377/ffma-assert	bunnei	2018-09-02	1	-0/+3
\|\ \ \| \| \| \| \| \|	Added FFMA asserts and missing fields
\| * \|	Removed saturate assert	David Marcec	2018-09-01	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \|	Saturate already implemented
\| * \|	Added FFMA asserts	David Marcec	2018-09-01	1	-0/+4
\| \|/
* \|	Removed saturate assert	David Marcec	2018-09-01	1	-1/+0
\| \| \| \| \| \| \| \|	Unneeded as we already implement it
* \|	Added FMUL asserts	David Marcec	2018-09-01	1	-0/+5
\|/
*	core/core: Replace includes with forward declarations where applicable	Lioncash	2018-08-31	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \|	The follow-up to e2457418dae19b889b2ad85255bb95d4cd0e4bff, which replaces most of the includes in the core header with forward declarations. This makes it so that if any of the headers the core header was previously including change, then no one will need to rebuild the bulk of the core, due to core.h being quite a prevalent inclusion. This should make turnaround for changes much faster for developers.
*	Added predicate comparison GreaterEqualWithNan	Hexagon12	2018-08-31	1	-0/+1
\|
*	gl_shader_decompiler: Implement POPC (#1203)	Laku	2018-08-31	1	-0/+10
\| \| \| \| \| \|	* Implement POPC * implement invert
*	Merge pull request #1200 from bunnei/improve-ipa	bunnei	2018-08-30	1	-0/+6
\|\ \| \| \| \|	gl_shader_decompiler: Improve IPA for Pass mode with Position attribute.
\| *	gl_shader_decompiler: Improve IPA for Pass mode with Position attribute.	bunnei	2018-08-29	1	-0/+6
\| \|
* \|	Shaders: Implemented IADD3	tech4me	2018-08-29	1	-1/+23
\|/
*	Merge pull request #1169 from Lakumakkara/sel	bunnei	2018-08-28	1	-1/+1
\|\ \| \| \| \|	shader_bytecode: fix SEL_IMM bitstring
\| *	fix SEL_IMM bitstring	Laku	2018-08-24	1	-1/+1
\| \|
* \|	Merge pull request #1173 from lioncash/batch	bunnei	2018-08-25	1	-4/+4
\|\ \ \| \|/ \|/\|	maxwell3d: Move FinishedPrimitiveBatch event after AcceleratedDrawBatch()
\| *	maxwell3d: Move FinishedPrimitiveBatch event after AcceleratedDrawBatch()	Lioncash	2018-08-25	1	-4/+4
\| \| \| \| \| \| \| \| \| \|	The start and finish events should likely not be right after one another like this, otherwise the batch will appear to complete immediately
* \|	Shaders: Added decodings for IADD3 instructions	tech4me	2018-08-23	1	-0/+6
\|/
*	maxwell_3d: Update to include additional stencil registers.	bunnei	2018-08-23	1	-20/+50
\|
*	implement lop3	Laku	2018-08-22	1	-0/+19
\|
*	Merge pull request #1124 from Subv/logic_ops	bunnei	2018-08-22	1	-1/+28
\|\ \| \| \| \|	GPU: Implemented logic ops.
\| *	GPU: Added registers for the logicop functionality.	Subv	2018-08-21	1	-1/+28
\| \|
* \|	shader_bytecode: Parenthesize conditional expression within GetTextureType()	Lioncash	2018-08-21	1	-1/+1
\| \| \| \| \| \| \| \|	Resolves a -Wlogical-op-parentheses warning.
* \|	shader_bytecode: Replace some UNIMPLEMENTED logs.	bunnei	2018-08-21	1	-2/+6
\|/
*	Merge pull request #1104 from Subv/instanced_arrays	bunnei	2018-08-20	1	-1/+14
\|\ \| \| \| \|	GLRasterizer: Implemented instanced vertex arrays.
\| *	GLRasterizer: Implemented instanced vertex arrays.	Subv	2018-08-18	1	-1/+14
\| \| \| \| \| \| \| \|	Before each draw call, for every enabled vertex array configured as instanced, we take the current instance id and divide it by its configured divisor, then we multiply that by the corresponding stride and increment the start address by the resulting amount. This way we can simulate the vertex array being incremented once per instance without actually using OpenGL's instancing functions.
* \|	Merge pull request #1112 from Subv/sampler_types	bunnei	2018-08-20	1	-4/+72
\|\ \ \| \| \| \| \| \|	Shaders: Use the correct shader type when sampling textures.
\| * \|	Shader: Added bitfields for the texture type of the various sampling instructions.	Subv	2018-08-19	1	-1/+65
\| \| \|
\| * \|	Shaders: Added decodings for TLD4 and TLD4S	Subv	2018-08-19	1	-3/+7
\| \| \|
* \| \|	Merge pull request #1089 from Subv/neg_bits	bunnei	2018-08-19	1	-0/+4
\|\ \ \ \| \| \| \| \| \| \| \|	Shaders: Corrected the 'abs' and 'neg' bit usage in the float arithmetic instructions.
\| * \| \|	Shaders: Corrected the 'abs' and 'neg' bit usage in the float arithmetic instructions.	Subv	2018-08-18	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We should definitely audit our shader generator for more errors like this.
* \| \| \|	Shaders/TEXS: Fixed the component mask in the TEXS instruction.	Subv	2018-08-19	1	-6/+11
\| \|/ / \|/\| \| \| \| \| \| \| \|	Previously we could end up with a TEXS that didn't write any outputs, this was wrong.
* \| \|	Merge pull request #1109 from Subv/ldg_decode	bunnei	2018-08-19	1	-0/+4
\|\ \ \ \| \| \| \| \| \| \| \|	Shaders: Added decodings for the LDG and STG instructions.
\| * \| \|	Shaders: Added decodings for the LDG and STG instructions.	Subv	2018-08-19	1	-0/+4
\| \| \|/ \| \|/\|
* \| \|	Merge pull request #1108 from Subv/front_facing	bunnei	2018-08-19	1	-0/+3
\|\ \ \ \| \| \| \| \| \| \| \|	Shaders: Implemented the gl_FrontFacing input attribute (attr 63).
\| * \| \|	Shaders: Implemented the gl_FrontFacing input attribute (attr 63).	Subv	2018-08-19	1	-0/+3
\| \|/ /
* / /	Shader: Implemented the predicate and mode arguments of LOP.	Subv	2018-08-18	1	-1/+6
\|/ / \| \| \| \| \| \| \| \| \| \|	The mode can be used to set the predicate to true depending on the result of the logic operation. In some cases, this means discarding the result (writing it to register 0xFF (Zero)). This is used by Super Mario Odyssey.
* \|	Added predcondition GreaterThanWithNan	David Marcec	2018-08-18	1	-0/+1
\| \|
* \|	Rasterizer: Implemented instanced rendering.	Subv	2018-08-15	2	-0/+15
\|/ \| \| \| \| \|	We keep track of the current instance and update an uniform in the shaders to let them know which instance they are. Instanced vertex arrays are not yet implemented.
*	gl_shader_decompiler: Implement XMAD instruction.	bunnei	2018-08-13	1	-4/+25
\|
*	Merge pull request #1024 from Subv/blend_gl	bunnei	2018-08-12	1	-0/+21
\|\ \| \| \| \|	GPU/Maxwell3D: Implemented an alternative set of blend factors.
\| *	GPU/Maxwell3D: Implemented an alternative set of blend factors.	Subv	2018-08-12	1	-0/+21
\| \| \| \| \| \| \| \|	These are used by nouveau and some games like SMO.
* \|	RasterizerGL: Ignore invalid/unset vertex attributes.	Subv	2018-08-12	1	-0/+5
\|/ \| \| \|	This should make the es2gears example not crash anymore.
*	Merge pull request #1010 from bunnei/unk-vert-attrib-shader	bunnei	2018-08-12	1	-2/+1
\|\ \| \| \| \|	gl_shader_decompiler: Improve handling of unknown input/output attributes.
\| *	gl_shader_decompiler: Improve handling of unknown input/output attributes.	bunnei	2018-08-12	1	-2/+1
\| \|
* \|	Merge pull request #1018 from Subv/ssy_sync	bunnei	2018-08-12	1	-0/+7
\|\ \ \| \|/ \|/\|	GPU/Shader: Implemented SSY and SYNC as a set_target/jump pair.
\| *	GPU/Shader: Don't predicate instructions that don't have a predicate field (SSY).	Subv	2018-08-11	1	-0/+7
\| \|
* \|	video_core: Use variable template variants of type_traits interfaces where applicable	Lioncash	2018-08-10	1	-2/+1
\|/
*	maxwell_3d: Ignore macros that have not been uploaded yet.	bunnei	2018-08-09	1	-4/+9
\| \| \| \|	- Used by Super Mario Odyssey (in game).
*	Merge pull request #982 from bunnei/stub-unk-63	bunnei	2018-08-09	1	-0/+2
\|\ \| \| \| \|	gl_shader_decompiler: Stub input attribute Unknown_63.
\| *	gl_shader_decompiler: Stub input attribute Unknown_63.	bunnei	2018-08-08	1	-0/+2
\| \|
* \|	Merge pull request #976 from bunnei/shader-imm	bunnei	2018-08-09	1	-9/+4
\|\ \ \| \| \| \| \| \|	gl_shader_decompiler: Let OpenGL interpret floats.
\| * \|	gl_shader_decompiler: Let OpenGL interpret floats.	bunnei	2018-08-08	1	-9/+4
\| \|/ \| \| \| \| \| \| \| \|	- Accuracy is lost in translation to string, e.g. with NaN. - Needed for Super Mario Odyssey.
* /	maxwell_3d: Use correct const buffer size and check bounds.	bunnei	2018-08-08	2	-1/+3
\|/ \| \| \|	- Fixes mem corruption with Super Mario Odyssey and Pokkén Tournament DX.
*	maxwell_3d: Remove outdated assert.	bunnei	2018-08-06	1	-2/+0
\|
*	video_core: Eliminate the g_renderer global variable	Lioncash	2018-08-04	2	-6/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We move the initialization of the renderer to the core class, while keeping the creation of it and any other specifics in video_core. This way we can ensure that the renderer is initialized and doesn't give unfettered access to the renderer. This also makes dependencies on types more explicit. For example, the GPU class doesn't need to depend on the existence of a renderer, it only needs to care about whether or not it has a rasterizer, but since it was accessing the global variable, it was also making the renderer a part of its dependency chain. By adjusting the interface, we can get rid of this dependency.
*	GPU: Remove the assert that required the CODE_ADDRESS to be 0.	Subv	2018-07-24	1	-8/+0
\| \| \| \|	Games usually just leave it at 0 but nouveau sets it to something else. This already works fine, the assert is useless.
*	shader_bytecode: Implement other TEXS masks.	bunnei	2018-07-22	1	-5/+9
\|
*	gl_shader_decompiler: Implement SEL instruction.	bunnei	2018-07-22	1	-0/+11
\|
*	maxwell_3d: Add depth buffer enable, width, and height registers.	bunnei	2018-07-22	1	-2/+14
\|
*	video_core: Use nested namespaces where applicable	Lioncash	2018-07-21	6	-28/+14
\| \| \| \|	Compresses a few namespace specifiers to be more compact.
*	maxwell_3d: Remove unused variable within GetStageTextures()	Lioncash	2018-07-20	1	-2/+0
\|
*	GPU: Added register definitions for the stencil parameters.	Subv	2018-07-17	1	-2/+25
\|
*	gl_rasterizer: Fix check for if a shader stage is enabled.	bunnei	2018-07-13	2	-24/+8
\|
*	Merge pull request #655 from bunnei/pred-lt-nan	bunnei	2018-07-13	1	-0/+1
\|\ \| \| \| \|	gl_shader_decompiler: Implement PredCondition::LessThanWithNan.
\| *	gl_shader_decompiler: Implement PredCondition::LessThanWithNan.	bunnei	2018-07-13	1	-0/+1
\| \|
* \|	gl_shader_decompiler: Use FlowCondition field in EXIT instruction.	bunnei	2018-07-13	1	-0/+9
\|/
*	Merge pull request #652 from Subv/fadd32i	Sebastian Valle	2018-07-13	1	-0/+9
\|\ \| \| \| \|	GPU: Implement the FADD32I shader instruction.
\| *	GPU: Implement the FADD32I shader instruction.	Subv	2018-07-12	1	-0/+9
\| \|
* \|	Merge pull request #651 from Subv/ffma_decode	bunnei	2018-07-12	1	-1/+1
\|\ \ \| \| \| \| \| \|	GPU: Corrected the decoding of FFMA for immediate operands.
\| * \|	GPU: Corrected the decoding of FFMA for immediate operands.	Subv	2018-07-12	1	-1/+1
\| \|/
* \|	Merge pull request #625 from Subv/imnmx	bunnei	2018-07-08	1	-3/+17
\|\ \ \| \|/ \|/\|	GPU: Implemented the IMNMX shader instruction.
\| *	GPU: Implemented the IMNMX shader instruction.	Subv	2018-07-04	1	-3/+17
\| \| \| \| \| \| \| \|	It's similar to the FMNMX instruction but it works on integers.
* \|	Merge pull request #629 from Subv/depth_test	bunnei	2018-07-05	1	-9/+21
\|\ \ \| \| \| \| \| \|	GPU: Allow using the old NV04 values for the depth test function.
\| * \|	GPU: Allow using the old NV04 values for the depth test function.	Subv	2018-07-05	1	-9/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	These seem to be just a valid as the GL token values. Thanks @ReinUsesLisp This restores graphical output to Disgaea 5
* \| \|	Merge pull request #626 from Subv/shader_sync	bunnei	2018-07-05	1	-0/+5
\|\ \ \ \| \|/ / \|/\| \|	GPU: Stub the shader SYNC and DEPBAR instructions.
\| * \|	GPU: Stub the shader SYNC and DEPBAR instructions.	Subv	2018-07-04	1	-0/+5
\| \|/ \| \| \| \| \| \|	It is unknown at this moment if we actually need to do something with these instructions or if the GLSL compiler takes care of that for us.
* \|	Merge pull request #622 from Subv/unused_tex	bunnei	2018-07-05	1	-1/+1
\|\ \ \| \| \| \| \| \|	GPU: Ignore unused textures and corrected the TEX shader instruction decoding.
\| * \|	GPU: Corrected the decoding for the TEX shader instruction.	Subv	2018-07-04	1	-1/+1
\| \|/
* \|	Merge pull request #621 from Subv/psetp_	bunnei	2018-07-05	1	-0/+13
\|\ \ \| \| \| \| \| \|	GPU: Implemented the PSETP shader instruction.
\| * \|	GPU: Implemented the PSETP shader instruction.	Subv	2018-07-04	1	-0/+13
\| \|/ \| \| \| \| \| \|	It's similar to the isetp and fsetp instructions but it works on predicates instead.
* /	GPU: Flip the triangle front face winding if the GPU is configured to not flip the triangles.	Subv	2018-07-04	1	-3/+19
\|/ \| \| \| \| \|	OpenGL's default behavior is already correct when the GPU is configured to flip the triangles. This fixes 1-2 Switch's splash screen.
*	Merge pull request #609 from Subv/clear_buffers	bunnei	2018-07-04	2	-2/+39
\|\ \| \| \| \|	GPU: Implemented the CLEAR_BUFFERS register.
\| *	GPU: Support clears that don't clear the color buffer.	Subv	2018-07-03	1	-2/+3
\| \|
\| *	GPU: Bind and clear the render target when the CLEAR_BUFFERS register is written to.	Subv	2018-07-03	1	-0/+11
\| \|
\| *	GPU: Added registers for the CLEAR_BUFFERS and CLEAR_COLOR methods.	Subv	2018-07-03	1	-2/+27
\| \|
* \|	Merge pull request #607 from jroweboy/logging	bunnei	2018-07-03	3	-5/+5
\|\ \ \| \| \| \| \| \|	Logging - Customizable backends
\| * \|	Update clang format	James Rowe	2018-07-03	2	-3/+3
\| \| \|
\| * \|	Rename logging macro back to LOG_*	James Rowe	2018-07-03	3	-3/+3
\| \|/
* \|	Merge pull request #611 from Subv/enabled_depth_test	bunnei	2018-07-03	1	-9/+9
\|\ \ \| \| \| \| \| \|	GPU: Don't try to parse the depth test function if the depth test is disabled and use only the least significant 3 bits in the depth test func
\| * \|	GPU: Use only the least significant 3 bits when reading the depth test func.	Subv	2018-07-03	1	-9/+9
\| \|/ \| \| \| \| \| \|	Some games set the full GL define value here (including nouveau), but others just seem to set those last 3 bits.
* \|	Merge pull request #610 from Subv/mufu_8	bunnei	2018-07-03	1	-0/+1
\|\ \ \| \|/ \|/\|	GPU: Implemented MUFU suboperation 8, sqrt.
\| *	GPU: Implemented MUFU suboperation 8, sqrt.	Subv	2018-07-03	1	-0/+1
\| \|
* \|	Merge pull request #608 from Subv/depth	bunnei	2018-07-03	1	-4/+52
\|\ \ \| \| \| \| \| \|	GPU: Implemented the depth buffer and depth test + culling
\| * \|	GPU: Added registers for depth test and cull mode.	Subv	2018-07-02	1	-3/+51
\| \| \|
\| * \|	GPU: Implemented the Z24S8 depth format and load the depth framebuffer.	Subv	2018-07-02	1	-1/+1
\| \|/
* \|	Merge pull request #606 from Subv/base_vertex	Sebastian Valle	2018-07-02	1	-1/+6
\|\ \ \| \| \| \| \| \|	GPU: Fixed the index offset and implement BaseVertex when doing indexed rendering.
\| * \|	GPU: Added register definitions for the vertex buffer base element.	Subv	2018-07-02	1	-1/+6
\| \|/
* \|	Merge pull request #605 from Subv/dma_copy	Sebastian Valle	2018-07-02	1	-1/+5
\|\ \ \| \|/ \|/\|	GPU: Directly copy the pixels when performing a same-layout DMA.
\| *	GPU: Directly copy the pixels when performing a same-layout DMA.	Subv	2018-07-02	1	-1/+5
\| \|
* \|	Merge pull request #602 from Subv/mufu_subop	bunnei	2018-07-01	1	-2/+1
\|\ \ \| \| \| \| \| \|	GPU: Corrected the size of the MUFU subop field, and removed incorrect "min" operation.
\| * \|	GPU: Corrected the size of the MUFU subop field, and removed incorrect "min" operation.	Subv	2018-06-30	1	-2/+1
\| \|/
* /	gl_shader_decompiler: Implement predicate NotEqualWithNan.	bunnei	2018-06-30	1	-0/+1
\|/
*	maxwell_3d: Add a struct for RenderTargetConfig.	bunnei	2018-06-27	1	-17/+19
\|
*	Build: Fixed some MSVC warnings in various parts of the code.	Subv	2018-06-20	2	-4/+5
\|
*	GPU: Don't mark uniform buffers and registers as used for instructions which don't have them.	Subv	2018-06-19	1	-2/+3
\| \| \| \| \|	Like the MOV32I and FMUL32I instructions. This fixes a potential crash when using these instructions.
*	gl_shader_decompiler: Implement LOP instructions.	bunnei	2018-06-17	1	-0/+14
\|
*	gl_shader_decompiler: Refactor LOP32I instruction a bit in support of LOP.	bunnei	2018-06-17	1	-3/+2
\|
*	gl_shader_decompiler: Implement integer size conversions for I2I/I2F/F2I.	bunnei	2018-06-16	1	-1/+2
\|
*	Merge pull request #556 from Subv/dma_engine	bunnei	2018-06-12	3	-0/+225
\|\ \| \| \| \|	GPU: Partially implemented the Maxwell DMA engine.
\| *	GPU: Partially implemented the Maxwell DMA engine.	Subv	2018-06-12	3	-0/+225
\| \| \| \| \| \| \| \|	Only tiled->linear and linear->tiled copies that aren't offsetted are supported for now. Queries are not supported. Swizzled copies are not supported.
* \|	Merge pull request #558 from Subv/iadd32i	bunnei	2018-06-12	1	-2/+10
\|\ \ \| \| \| \| \| \|	GPU: Implemented the iadd32i shader instruction.
\| * \|	GPU: Implemented the iadd32i shader instruction.	Subv	2018-06-12	1	-2/+10
\| \|/
* /	gl_shader_decompiler: Implement saturate for float instructions.	bunnei	2018-06-12	1	-2/+1
\|/
*	GPU: Implement the iset family of shader instructions.	Subv	2018-06-09	1	-0/+9
\|
*	GPU: Added decodings for the ISET family of instructions.	Subv	2018-06-09	1	-0/+7
\|
*	Merge pull request #550 from Subv/ssy	bunnei	2018-06-09	1	-0/+2
\|\ \| \| \| \|	GPU: Stub the SSY shader instruction.
\| *	GPU: Stub the SSY shader instruction.	Subv	2018-06-09	1	-0/+2
\| \| \| \| \| \| \| \|	This instruction tells the GPU where the flow reconverges in a non-uniform control flow scenario, we can ignore this when generating GLSL code.
* \|	Merge pull request #551 from bunnei/shr	bunnei	2018-06-09	1	-0/+4
\|\ \ \| \| \| \| \| \|	gl_shader_decompiler: Implement SHR instruction.
\| * \|	gl_shader_decompiler: Implement SHR instruction.	bunnei	2018-06-09	1	-0/+4
\| \|/
* \|	gl_shader_decompiler: Implement IADD instruction.	bunnei	2018-06-09	1	-5/+11
\| \|
* \|	gl_shader_decompiler: Add missing asserts for saturate_a instructions.	bunnei	2018-06-09	1	-1/+1
\|/
*	GPU: Added registers for normal and independent blending.	Subv	2018-06-09	1	-5/+26
\|
*	gl_shader_decompiler: Implement BFE_IMM instruction.	bunnei	2018-06-07	1	-3/+15
\|
*	gl_shader_decompiler: F2F: Implement rounding modes.	bunnei	2018-06-07	1	-3/+12
\|
*	shader_bytecode: Add instruction decodings for BFE, IMNMX, and XMAD.	bunnei	2018-06-07	1	-0/+20
\|
*	Merge pull request #534 from Subv/multitexturing	bunnei	2018-06-07	2	-0/+37
\|\ \| \| \| \|	GPU: Implement sampling multiple textures in the generated glsl shaders.
\| *	GPU: Implement sampling multiple textures in the generated glsl shaders.	Subv	2018-06-06	2	-0/+37
\| \| \| \| \| \| \| \| \| \| \| \|	All tested games that use a single texture show no regression. Only Texture2D textures are supported right now, each shader gets its own "tex_fs/vs/gs" sampler array to maintain independent textures between shader stages, the textures themselves are reused if possible.
* \|	gl_shader_decompiler: Implement LD_C instruction.	bunnei	2018-06-07	1	-0/+16
\| \|
* \|	gl_shader_decompiler: Refactor uniform handling to allow different decodings.	bunnei	2018-06-06	1	-6/+10
\|/
*	Merge pull request #516 from Subv/f2i_r	bunnei	2018-06-06	1	-4/+20
\|\ \| \| \| \|	GPU: Implemented the F2I_R shader instruction.
\| *	GPU: Implemented the F2I_R shader instruction.	Subv	2018-06-05	1	-4/+20
\| \|
* \|	Merge pull request #521 from Subv/bra	bunnei	2018-06-05	1	-4/+5
\|\ \ \| \| \| \| \| \|	GPU: Corrected the branch targets for the shader bra instruction.
\| * \|	GPU: Corrected the branch targets for the shader bra instruction.	Subv	2018-06-05	1	-4/+5
\| \| \|
* \| \|	gl_shader_decompiler: Implement SHL instruction.	bunnei	2018-06-05	1	-13/+17
\|/ /
* \|	GPU: Implement the ISCADD shader instructions.	Subv	2018-06-05	1	-0/+16
\| \|
* \|	GPU: Added decodings for the ISCADD instructions.	Subv	2018-06-05	1	-0/+7
\|/
*	Merge pull request #514 from Subv/lop32i	bunnei	2018-06-05	1	-1/+15
\|\ \| \| \| \|	GPU: Implemented the LOP32I instruction.
\| *	GPU: Implemented the LOP32I instruction.	Subv	2018-06-04	1	-1/+15
\| \|
* \|	Merge pull request #510 from Subv/isetp	bunnei	2018-06-05	1	-0/+10
\|\ \ \| \| \| \| \| \|	GPU: Implemented the ISETP_R and ISETP_C instructions
\| * \|	GPU: Implemented the ISETP_R and ISETP_C shader instructions.	Subv	2018-06-04	1	-0/+10
\| \|/
* \|	Merge pull request #512 from Subv/fset	bunnei	2018-06-05	1	-1/+1
\|\ \ \| \| \| \| \| \|	GPU: Corrected the FSET and I2F instructions.
\| * \|	GPU: Use the bf bit in FSET to determine whether to write 0xFFFFFFFF or 1.0f.	Subv	2018-06-04	1	-1/+1
\| \|/
* \|	Merge pull request #501 from Subv/shader_bra	bunnei	2018-06-05	1	-0/+15
\|\ \ \| \| \| \| \| \|	GPU: Partially implemented the bra shader instruction
\| * \|	GPU: Partially implemented the shader BRA instruction.	Subv	2018-06-04	1	-0/+13
\| \| \|
\| * \|	GPU: Added decoding for the BRA instruction.	Subv	2018-06-04	1	-0/+2
\| \|/
* /	GPU: Calculate the correct viewport dimensions based on the scale and translate registers.	Subv	2018-06-04	1	-12/+28
\|/ \| \| \|	This is how nouveau calculates the viewport width and height. For some reason some games set 0xFFFF in the VIEWPORT_HORIZ and VIEWPORT_VERT registers, maybe those are a misnomer and actually refer to something else?
*	Merge pull request #500 from Subv/long_queries	bunnei	2018-06-04	1	-9/+24
\|\ \| \| \| \|	GPU: Partial implementation of long GPU queries.
\| *	GPU: Partial implementation of long GPU queries.	Subv	2018-06-04	1	-9/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Long queries write a 128-bit result value to memory, which consists of a 64 bit query value and a 64 bit timestamp. In this implementation, only select=Zero of the Crop unit is implemented, this writes the query sequence as a 64 bit value, and a 0u64 value for the timestamp, since we emulate an infinitely fast GPU. This specific type was hwtested, but more rigorous tests should be performed in the future for the other types.
* \|	gl_shader_decompiler: Implement TEXS component mask.	bunnei	2018-06-03	1	-2/+16
\| \|
* \|	Merge pull request #494 from bunnei/shader-tex	bunnei	2018-06-03	1	-0/+15
\|\ \ \| \| \| \| \| \|	gl_shader_decompiler: Implement TEX, fixes for TEXS.
\| * \|	gl_shader_decompiler: Implement TEX instruction.	bunnei	2018-06-01	1	-0/+10
\| \| \|
\| * \|	gl_shader_decompiler: Support multi-destination for TEXS.	bunnei	2018-06-01	1	-0/+5
\| \|/
* /	gl_shader_decompiler: Implement RRO as a register move.	bunnei	2018-06-03	1	-3/+7
\|/
*	Merge pull request #489 from Subv/vertexid	bunnei	2018-05-30	1	-0/+4
\|\ \| \| \| \|	Shaders: Implemented reading the gl_InstanceID and gl_VertexID variables in the vertex shader.
\| *	Shaders: Implemented reading the gl_InstanceID and gl_VertexID variables in the vertex shader.	Subv	2018-05-30	1	-0/+4
\| \|
* \|	gl_shader_decompiler: Partially implement F2F_R instruction.	bunnei	2018-05-30	1	-3/+3
\|/
*	shader_bytecode: Implement other variants of FMNMX.	bunnei	2018-05-26	1	-3/+7
\|
*	Merge pull request #458 from Subv/fmnmx	bunnei	2018-05-21	1	-0/+5
\|\ \| \| \| \|	Shaders: Implemented the FMNMX shader instruction.
\| *	Shaders: Implemented the FMNMX shader instruction.	Subv	2018-05-21	1	-0/+5
\| \|
* \|	ShadersDecompiler: Added decoding for the PSETP instruction.	Subv	2018-05-19	1	-0/+3
\|/
*	maxwell_3d: Reset vertex counts after drawing.	bunnei	2018-04-29	1	-0/+10
\|
*	shader_bytecode: Add decoding for FMNMX instruction.	bunnei	2018-04-29	1	-0/+2
\|
*	Merge pull request #416 from bunnei/shader-ints-p3	bunnei	2018-04-29	1	-8/+25
\|\ \| \| \| \|	gl_shader_decompiler: Implement MOV32I, partially implement I2I, I2F
\| *	gl_shader_decompiler: Partially implement I2I_R, and I2F_R.	bunnei	2018-04-29	1	-8/+8
\| \|
\| *	shader_bytecode: Add decodings for i2i instructions.	bunnei	2018-04-29	1	-3/+20
\| \|
\| *	gl_shader_decompiler: Implement MOV32_IMM instruction.	bunnei	2018-04-29	1	-2/+2
\| \|
* \|	fermi_2d: Fix surface copy block height.	bunnei	2018-04-29	2	-2/+7
\|/
*	general: Convert assertion macros over to be fmt-compatible	Lioncash	2018-04-27	1	-2/+2
\|
*	gl_shader_decompiler: Boilerplate for handling integer instructions.	bunnei	2018-04-26	1	-1/+9
\|
*	Merge pull request #396 from Subv/shader_ops	bunnei	2018-04-26	1	-8/+35
\|\ \| \| \| \|	Shaders: Implemented the FSET instruction.
\| *	Shaders: Added bit decodings for the I2I instruction.	Subv	2018-04-25	1	-0/+6
\| \|
\| *	Shaders: Added decodings for the FSET instructions.	Subv	2018-04-25	1	-8/+29
\| \|
* \|	GPU: Partially implemented the Fermi2D surface copy operation.	Subv	2018-04-25	2	-0/+59
\| \| \| \| \| \| \| \| \| \|	The hardware allows for some rather complicated operations to be performed on the data during the copy, this is not implemented. Only same-format same-size raw copies are implemented for now.
* \|	GPU: Added surface copy registers to Fermi2D	Subv	2018-04-25	1	-1/+57
\| \|
* \|	GPU: Added boilerplate code for the Fermi2D engine	Subv	2018-04-25	2	-2/+33
\| \|
* \|	GPU: Reduce the number of registers of Maxwell3D to 0xE00.	Subv	2018-04-25	2	-5/+5
\| \| \| \| \| \| \| \|	The rest are just macro shim registers.
* \|	GPU: Move the Maxwell3D macro uploading code to the inside of the Maxwell3D processor.	Subv	2018-04-25	2	-8/+23
\| \| \| \| \| \| \| \|	It doesn't belong in the PFIFO handler.
* \|	video-core: Move logging macros over to new fmt-capable ones	Lioncash	2018-04-25	1	-2/+2
\|/
*	memory_manager: Make GpuToCpuAddress return an optional.	bunnei	2018-04-24	1	-10/+11
\|
*	memory_manager: Use GPUVAdddr, not PAddr, for GPU addresses.	bunnei	2018-04-24	1	-6/+5
\|
*	Merge pull request #386 from Subv/gpu_query	bunnei	2018-04-24	2	-2/+53
\|\ \| \| \| \|	GPU: Added asserts to our code for handling the QUERY_GET GPU command.
\| *	GPU: Added asserts to our code for handling the QUERY_GET GPU command.	Subv	2018-04-24	2	-2/+53
\| \| \| \| \| \| \| \| \| \|	This is based on research from nouveau. Many things are currently unknown and will require hwtests in the future. This commit also stubs QueryMode::Write2 to do the same as Write. Nouveau code treats them interchangeably, it is currently unknown what the difference is.
* \|	GPU: Support multiple enabled vertex arrays.	Subv	2018-04-23	1	-0/+5
\|/ \| \| \| \| \|	The vertex arrays will be copied to the stream buffer one after the other, and the attributes will be set using the ARB_vertex_attrib_binding extension. yuzu now thus requires OpenGL 4.3 or the ARB_vertex_attrib_binding extension.
*	shader_bytecode: Add several more instruction decodings.	bunnei	2018-04-21	1	-5/+52
\|
*	shader_bytecode: Decode instructions based on bit strings.	bunnei	2018-04-21	1	-185/+172
\|
*	ShaderGen: Implemented predicated instruction execution.	Subv	2018-04-21	1	-1/+5
\| \| \| \|	Each predicated instruction will be wrapped in an `if (predicate) { instruction_body; }` in the GLSL, where `predicate` is one of the predicate boolean variables previously set by fsetp.
*	ShaderGen: Implemented the fsetp instruction.	Subv	2018-04-21	1	-3/+40
\| \| \| \| \| \| \| \| \| \|	Predicate variables are now added to the generated shader code in the form of 'pX' where X is the predicate id. These predicate variables are initialized to false on shader startup and are set via the fsetp instructions. TODO: * Not all the comparison types are implemented. * Only the single-predicate version is implemented.
*	ShaderGen: Register id 255 is special and is hardcoded to return 0 (SR_ZERO).	Subv	2018-04-20	1	-0/+3
\|
*	ShaderGen: Implemented the fmul32i shader instruction.	Subv	2018-04-19	1	-3/+14
\|
*	gl_shader_gen: Support vertical/horizontal viewport flipping. (#347)	bunnei	2018-04-18	1	-1/+10
\| \| \| \| \| \|	* gl_shader_gen: Support vertical/horizontal viewport flipping. * fixup! gl_shader_gen: Support vertical/horizontal viewport flipping.
*	GPU: Pitch textures are now supported, don't assert when encountering them.	Subv	2018-04-18	1	-2/+3
\|
*	Merge pull request #346 from bunnei/misc-gpu-improvements	bunnei	2018-04-18	1	-1/+2
\|\ \| \| \| \|	Misc gpu improvements
\| *	maxwell3d: Allow Texture2DNoMipmap as Texture2D.	bunnei	2018-04-18	1	-1/+2
\| \|
* \|	Merge pull request #344 from bunnei/shader-decompiler-p2	bunnei	2018-04-18	1	-10/+33
\|\ \ \| \| \| \| \| \|	Shader decompiler changes part 2
\| * \|	shader_bytecode: Make ctor's constexpr and explicit.	bunnei	2018-04-18	1	-7/+7
\| \| \|
\| * \|	gl_shader_decompiler: Implement FMUL/FADD/FFMA immediate instructions.	bunnei	2018-04-17	1	-0/+14
\| \| \|
\| * \|	gl_shader_decompiler: Add support for TEXS instruction.	bunnei	2018-04-17	1	-5/+14
\| \|/
* /	renderer_opengl: Implement BlendEquation and BlendFunc.	bunnei	2018-04-18	2	-4/+48
\|/
*	gl_rasterizer: Implement indexed vertex mode.	bunnei	2018-04-17	2	-2/+46
\|
*	GPU: Added a function to determine whether a shader stage is enabled or not.	Subv	2018-04-15	2	-0/+24
\|
*	shaders: Add NumTextureSamplers const, remove unused #pragma.	bunnei	2018-04-15	1	-2/+0
\|
*	shaders: Address PR review feedback.	bunnei	2018-04-14	1	-1/+1
\|
*	shaders: Fix GCC and clang build issues.	bunnei	2018-04-14	1	-3/+3
\|
*	gl_shader_decompiler: Implement negate, abs, etc. and lots of cleanup.	bunnei	2018-04-14	1	-20/+39
\|
*	shader_bytecode: Add FSETP and KIL to GetInfo.	bunnei	2018-04-14	1	-0/+3
\|
*	shader_bytecode: Add SubOp decoding.	bunnei	2018-04-14	1	-0/+10
\|
*	maxwell_3d: Make memory_manager public.	bunnei	2018-04-14	1	-2/+1
\|
*	maxwell_3d: Fix shader_config decodings.	bunnei	2018-04-14	1	-6/+3
\|
*	shader_bytecode: Add initial module for shader decoding.	bunnei	2018-04-14	1	-0/+297
\|
*	GPU: Assert when finding a texture with a format type other than UNORM.	Subv	2018-04-07	1	-0/+2
\|
*	GPU: Use the MacroInterpreter class to execute the GPU macros instead of HLEing them.	Subv	2018-04-01	2	-121/+13
\|
*	GPU: Implemented a gpu macro interpreter.	Subv	2018-04-01	2	-0/+8
\| \| \| \| \| \|	The Ryujinx macro interpreter and envydis were used as reference. Macros are programs that are uploaded by the games during boot and can later be called by writing to their method id in a GPU command buffer.
*	gl_rasterizer: Add a SyncViewport method.	bunnei	2018-03-27	1	-0/+10
\|
*	gl_rasterizer: Normalize vertex array data as appropriate.	bunnei	2018-03-27	1	-0/+4
\|
*	maxwell_3d: Use names that match envytools for VertexType.	bunnei	2018-03-27	1	-8/+8
\|
*	maxwell_3d: Add VertexAttribute struct and cleanup.	bunnei	2018-03-27	1	-121/+160
\|
*	Maxwell3D: Call AccelerateDrawBatch on DrawArrays.	bunnei	2018-03-27	1	-1/+8
\|
*	gl_rasterizer: Implement AnalyzeVertexArray.	bunnei	2018-03-27	1	-0/+35
\|
*	maxwell: Add RenderTargetFormat enum.	bunnei	2018-03-27	1	-3/+4
\|
*	GPU: Load the sampler info (TSC) when retrieving active textures.	Subv	2018-03-26	2	-21/+67
\|
*	GPU: Make the debug_context variable a member of the frontend instead of a global.	Subv	2018-03-25	1	-11/+13
\|
*	GPU: Added a function to retrieve the active textures for a shader stage.	Subv	2018-03-24	2	-50/+59
\| \| \| \|	TODO: A shader may not use all of these textures at the same time, shader analysis should be performed to determine which textures are actually sampled.
*	GPU: Implement the Incoming/FinishedPrimitiveBatch debug breakpoints.	Subv	2018-03-24	1	-0/+7
\|
*	GPU: Implement the MaxwellCommandLoaded/Processed debug breakpoints.	Subv	2018-03-24	1	-0/+10
\|
*	GPU: Added a method to unswizzle a texture without decoding it.	Subv	2018-03-24	1	-1/+1
\| \| \| \|	Allow unswizzling of DXT1 textures.
*	GPU: Preliminary work for texture decoding.	Subv	2018-03-24	1	-0/+45
\|
*	GPU: Added viewport registers to Maxwell3D's reg structure.	Subv	2018-03-24	1	-1/+18
\|
*	maxwell_3d: Add some format decodings and string helper functions.	bunnei	2018-03-23	1	-3/+107
\|
*	GPU: Added vertex attribute format registers.	Subv	2018-03-21	1	-1/+14
\|
*	GPU: Added registers for the number of vertices to render.	Subv	2018-03-21	1	-2/+13
\|
*	Merge pull request #253 from Subv/rt_depth	Mat M	2018-03-20	1	-1/+48
\|\ \| \| \| \|	GPU: Added registers for color and Z buffers.
\| *	GPU: Added Z buffer registers to Maxwell3D's reg structure.	Subv	2018-03-19	1	-1/+17
\| \|
\| *	GPU: Added the render target (RT) registers to Maxwell3D's reg structure.	Subv	2018-03-19	1	-1/+32
\| \|
* \|	Clang Fixes	N00byKing	2018-03-19	1	-1/+2
\| \|
* \|	Clean Warnings (?)	N00byKing	2018-03-19	1	-1/+1
\|/
*	GPU: Added the TSC registers to the Maxwell3D register structure.	Subv	2018-03-19	1	-1/+15
\|
*	GPU: Added the TIC registers to the Maxwell3D register structure.	Subv	2018-03-19	1	-1/+16
\|
*	GPU: Implement macro 0xE1A BindTextureInfoBuffer in HLE.	Subv	2018-03-19	2	-1/+29
\| \| \| \|	This macro simply sets the current CB_ADDRESS to the texture buffer address for the input shader stage.
*	GPU: Implement the BindStorageBuffer macro method in HLE.	Subv	2018-03-18	2	-1/+36
\| \| \| \| \| \|	This macro binds the SSBO Info Buffer as the current ConstBuffer. This buffer is usually bound to c0 during shader execution. Games seem to use this macro instead of directly writing the address for some reason.
*	GPU: Handle writes to the CB_DATA method.	Subv	2018-03-18	2	-0/+39
\| \| \| \| \| \|	Writing to this method will cause the written value to be stored in the currently-set ConstBuffer plus CB_POS. This method is usually used to upload uniforms or other shader-visible data.
*	GPU: Store uploaded GPU macros and keep track of the number of method parameters.	Subv	2018-03-18	2	-11/+24
\|
*	GPU: Macros are specific to the Maxwell3D engine, so handle them internally.	Subv	2018-03-18	6	-31/+55
\|
*	GPU: Renamed ShaderType to ShaderStage as that is less confusing.	Subv	2018-03-18	2	-19/+19
\|
*	GPU: Store shader constbuffer bindings in the GPU state.	Subv	2018-03-18	2	-5/+61
\|
*	GPU: Corrected some register offsets and removed superfluous macro registers.	Subv	2018-03-18	1	-9/+3
\|
*	GPU: Make the SetShader macro call do the same as the real macro's code.	Subv	2018-03-18	2	-3/+44
\| \| \| \| \| \|	It'll now set the CB_SIZE, CB_ADDRESS and CB_BIND registers when it's called. Presumably this SetShader function is binding the constant shader uniforms to buffer 1 (c1[]).
*	GPU: Corrected the parameter documentation for the SetShader macro call.	Subv	2018-03-17	2	-11/+12
\| \| \| \| \| \|	Register 0xE24 is actually a macro that sets some shader parameters in the register structure. Macros are uploaded to the GPU at startup and have their own ISA, we'll probably write an interpreter for this in the future.
*	Merge pull request #242 from Subv/set_shader	bunnei	2018-03-17	2	-4/+38
\|\ \| \| \| \|	GPU: Handle the SetShader method call (0xE24) and store the shader config.
\| *	GPU: Handle the SetShader method call (0xE24) and store the shader config.	Subv	2018-03-17	2	-4/+38
\| \|
* \|	GPU: Added the vertex array registers.	Subv	2018-03-17	1	-2/+33
\|/
*	Merge pull request #241 from Subv/gpu_method_call	bunnei	2018-03-17	6	-1/+56
\|\ \| \| \| \|	GPU: Process command mode 5 (IncreaseOnce) differently from other commands
\| *	GPU: Process command mode 5 (IncreaseOnce) differently from other commands.	Subv	2018-03-17	6	-1/+56
\| \| \| \| \| \| \| \| \| \| \| \|	Accumulate all arguments before calling the desired method. Note: Maybe we should do the same for the NonIncreasing mode?
* \|	GPU: Assert that we get a 0 CODE_ADDRESS register in the 3D engine.	Subv	2018-03-17	1	-0/+8
\| \| \| \| \| \| \| \|	Shader address calculation depends on this value to some extent, we do not currently know what it being 0 entails.
* \|	GPU: Added Maxwell registers for Shader Program control.	Subv	2018-03-17	1	-2/+55
\|/
*	GPU: Intercept writes to the VERTEX_END_GL register.	Subv	2018-03-05	2	-1/+18
\| \| \| \| \| \|	This is the register that gets written after a game calls DrawArrays(). We should collect all GPU state and draw using our graphics API here.
*	maxwell_3d: Make constructor explicit	Lioncash	2018-02-14	1	-1/+1
\|
*	GPU: Partially implemented the QUERY_* registers in the Maxwell3D engine.	Subv	2018-02-12	2	-2/+94
\| \| \| \|	Only QueryMode::Write is supported at the moment.
*	Make a GPU class in VideoCore to contain the GPU state.	Subv	2018-02-12	6	-18/+24
\| \| \| \|	Also moved the GPU MemoryManager class to video_core since it makes more sense for it to be there.
*	GPU: Added a command processor to decode the GPU pushbuffers and forward the commands to their respective engines.	Subv	2018-02-12	6	-0/+99