Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Merge pull request #2441 from ReinUsesLisp/al2p | bunnei | 2019-05-19 | 2 | -2/+21 |
|\ | | | | | shader: Implement AL2P and ALD.PHYS | ||||
| * | shader_ir/other: Implement IPA.IDX | ReinUsesLisp | 2019-05-03 | 1 | -0/+1 |
| | | |||||
| * | shader_ir/memory: Implement physical input attributes | ReinUsesLisp | 2019-05-03 | 1 | -0/+4 |
| | | |||||
| * | gl_shader_decompiler: Declare all possible varyings on physical attribute usage | ReinUsesLisp | 2019-05-03 | 1 | -0/+1 |
| | | |||||
| * | shader_bytecode: Add AL2P decoding | ReinUsesLisp | 2019-05-03 | 1 | -2/+15 |
| | | |||||
* | | Merge pull request #2472 from FernandoS27/tic | Hexagon12 | 2019-05-19 | 1 | -1/+1 |
|\ \ | | | | | | | maxwell_3d: reduce severity of different component formats assert. | ||||
| * | | maxwell_3d: reduce severity of different component formats assert. | Fernando Sahmkow | 2019-05-14 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | This was reduced due to happening on most games and at such constant rate that it affected performance heavily for the end user. In general, we are well aware of the assert and an implementation is already planned. | ||||
* | | | Merge pull request #2469 from lioncash/copyable | Hexagon12 | 2019-05-19 | 1 | -0/+2 |
|\ \ \ | | | | | | | | | video_core/engines/maxwell_3d: Add is_trivially_copyable_v check for Regs | ||||
| * | | | video_core/engines/maxwell_3d: Add is_trivially_copyable_v check for Regs | Lioncash | 2019-05-14 | 1 | -0/+2 |
| |/ / | | | | | | | | | | | | | | | | | | | std::memset is used to clear the entire register structure, which requires that the Regs struct be trivially copyable (otherwise undefined behavior is invoked). This prevents the case where a non-trivial type is potentially added to the struct. | ||||
* | | | Merge pull request #2470 from lioncash/ranged-for | Sebastian Valle | 2019-05-19 | 1 | -18/+18 |
|\ \ \ | | | | | | | | | video_core/engines/maxwell_3d: Simplify for loops into ranged for loops within InitializeRegisterDefaults() | ||||
| * | | | video_core/engines/maxwell3d: Get rid of three magic values in CallMethod() | Lioncash | 2019-05-14 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | We can use the named constant instead of using 32 directly. | ||||
| * | | | video_core/engines/maxwell_3d: Simplify for loops into ranged for loops within InitializeRegisterDefaults() | Lioncash | 2019-05-14 | 1 | -15/+15 |
| |/ / | | | | | | | | | | | | | | | | Lessens the amount of code that needs to be read, and gets rid of the need to introduce an indexing variable. Instead, we just operate on the objects directly. | ||||
* | | | video_core/engines/engine_upload: Amend constructor initializer list order | Lioncash | 2019-05-14 | 1 | -1/+1 |
| | | | | | | | | | | | | Silences a -Wreorder warning. | ||||
* | | | video_core/engines/engine_upload: Default destructor in the cpp file | Lioncash | 2019-05-14 | 2 | -1/+3 |
| | | | | | | | | | | | | | | | | | | Avoids inlining destruction logic where applicable, and also makes forward declarations not cause unexpected compilation errors depending on where the State class is used. | ||||
* | | | video_core/engines/engine_upload: Remove unnecessary const on parameters in function declarations | Lioncash | 2019-05-14 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | These only apply in the definition of the function. They can be omitted from the declaration. | ||||
* | | | video_core/engines/engine_upload: Remove unnecessary includes | Lioncash | 2019-05-14 | 2 | -2/+2 |
|/ / | |||||
* | | Merge pull request #2429 from FernandoS27/compute | bunnei | 2019-05-09 | 11 | -140/+479 |
|\ \ | |/ |/| | Corrections and Implementation on GPU Engines | ||||
| * | Refactors and name corrections. | Fernando Sahmkow | 2019-05-01 | 6 | -35/+35 |
| | | |||||
| * | Fixes and Corrections to DMA Engine | Fernando Sahmkow | 2019-04-23 | 2 | -37/+57 |
| | | |||||
| * | Add Swizzle Parameters to the DMA engine | Fernando Sahmkow | 2019-04-23 | 2 | -2/+27 |
| | | |||||
| * | Add Documentation Headers to all the GPU Engines | Fernando Sahmkow | 2019-04-23 | 5 | -0/+29 |
| | | |||||
| * | Corrections and styling | Fernando Sahmkow | 2019-04-23 | 5 | -6/+9 |
| | | |||||
| * | Implement Maxwell3D Data Upload | Fernando Sahmkow | 2019-04-23 | 2 | -3/+32 |
| | | |||||
| * | Introduce skeleton of the GPU Compute Engine. | Fernando Sahmkow | 2019-04-23 | 2 | -7/+201 |
| | | |||||
| * | Revamp Kepler Memory to use a subengine to manage uploads | Fernando Sahmkow | 2019-04-23 | 4 | -92/+131 |
| | | |||||
* | | Merge pull request #2322 from ReinUsesLisp/wswitch | bunnei | 2019-04-29 | 1 | -2/+3 |
|\ \ | |/ |/| | video_core: Silence -Wswitch warnings | ||||
| * | video_core: Silence -Wswitch warnings | ReinUsesLisp | 2019-04-18 | 1 | -2/+3 |
| | | |||||
* | | Merge pull request #2411 from FernandoS27/unsafe-gpu | bunnei | 2019-04-22 | 1 | -2/+2 |
|\ \ | | | | | | | GPU Manager: Implement ReadBlockUnsafe and WriteBlockUnsafe | ||||
| * | | Use ReadBlockUnsafe on TIC and TSC reading | Fernando Sahmkow | 2019-04-16 | 1 | -2/+2 |
| |/ | | | | | | | | | Use ReadBlockUnsafe on TIC and TSC reading as memory is never flushed from host GPU there. | ||||
* | | Merge pull request #2400 from FernandoS27/corret-kepler-mem | bunnei | 2019-04-22 | 2 | -17/+54 |
|\ \ | | | | | | | Implement Kepler Memory on both Linear and BlockLinear. | ||||
| * | | Use WriteBlock and ReadBlock. | Fernando Sahmkow | 2019-04-16 | 1 | -10/+6 |
| | | | |||||
| * | | Implement Block Linear copies in Kepler Memory. | Fernando Sahmkow | 2019-04-16 | 1 | -5/+14 |
| | | | |||||
| * | | Correct Kepler Memory on Linear Pushes. | Fernando Sahmkow | 2019-04-15 | 2 | -16/+48 |
| |/ | |||||
* | | Merge pull request #2407 from FernandoS27/f2f | bunnei | 2019-04-20 | 1 | -7/+20 |
|\ \ | | | | | | | Do some corrections in conversion shader instructions. | ||||
| * | | Do some corrections in conversion shader instructions. | Fernando Sahmkow | 2019-04-16 | 1 | -7/+20 |
| |/ | | | | | | | | | | Corrects encodings for I2F, F2F, I2I and F2I. Implements Immediate variants of all four conversion types. Adds assertions for unimplemented functionality. | ||||
* | | Merge pull request #2348 from FernandoS27/guest-bindless | bunnei | 2019-04-18 | 3 | -13/+68 |
|\ \ | | | | | | | Implement Bindless Textures on Shader Decompiler and GL backend | ||||
| * | | Move ConstBufferAccessor to Maxwell3d, correct mistakes and clang format. | Fernando Sahmkow | 2019-04-08 | 3 | -3/+13 |
| | | | |||||
| * | | Implement TXQ_B | Fernando Sahmkow | 2019-04-08 | 1 | -0/+2 |
| | | | |||||
| * | | Corrections to TEX_B | Fernando Sahmkow | 2019-04-08 | 1 | -0/+32 |
| | | | |||||
| * | | Implement Bindless Handling on SetupTexture | Fernando Sahmkow | 2019-04-08 | 2 | -13/+22 |
| | | | |||||
| * | | Implement Bindless Samplers and TEX_B in the IR. | Fernando Sahmkow | 2019-04-08 | 1 | -0/+2 |
| | | | |||||
* | | | Merge pull request #2315 from ReinUsesLisp/severity-decompiler | bunnei | 2019-04-17 | 1 | -1/+15 |
|\ \ \ | | | | | | | | | shader_ir/decode: Reduce the severity of common assertions | ||||
| * | | | shader_ir/memory: Reduce severity of LD_L cache management and log it | ReinUsesLisp | 2019-04-03 | 1 | -0/+7 |
| | | | | |||||
| * | | | shader_ir/memory: Reduce severity of ST_L cache management and log it | ReinUsesLisp | 2019-04-03 | 1 | -1/+8 |
| | | | | |||||
* | | | | shader_ir: Implement STG, keep track of global memory usage and flush | ReinUsesLisp | 2019-04-14 | 1 | -0/+6 |
| |_|/ |/| | | |||||
* | | | Merge pull request #2366 from FernandoS27/xmad-fix | bunnei | 2019-04-10 | 1 | -0/+3 |
|\ \ \ | | | | | | | | | Correct XMAD mode, psl and high_b on different encodings. | ||||
| * | | | Correct XMAD mode, psl and high_b on different encodings. | Fernando Sahmkow | 2019-04-08 | 1 | -0/+3 |
| | |/ | |/| | |||||
* / | | Correct LOP_IMN encoding | Fernando Sahmkow | 2019-04-08 | 1 | -1/+1 |
|/ / | |||||
* | | maxwell_3d: Reduce severity of ProcessSyncPoint | ReinUsesLisp | 2019-04-06 | 1 | -2/+2 |
| | | |||||
* | | Merge pull request #2317 from FernandoS27/sync | bunnei | 2019-04-06 | 2 | -1/+27 |
|\ \ | | | | | | | Implement SyncPoint Register in the GPU. | ||||
| * | | Implement SyncPoint Register in the GPU. | Fernando Sahmkow | 2019-04-06 | 2 | -1/+27 |
| |/ | |||||
* | | video_core/engines: Make memory manager members private | Lioncash | 2019-04-06 | 9 | -13/+14 |
| | | | | | | | | | | These aren't used externally by anything, so they can be made private data members. | ||||
* | | video_core/engines: Remove unnecessary inclusions where applicable | Lioncash | 2019-04-06 | 9 | -9/+24 |
|/ | | | | | | Replaces header inclusions with forward declarations where applicable and also removes unused headers within the cpp file. This reduces a few more dependencies on core/memory.h | ||||
* | maxwell_dma: Check for valid source in destination before copy. | bunnei | 2019-03-21 | 1 | -0/+10 |
| | | | | - Avoid a crash in Octopath Traveler. | ||||
* | gpu: Rewrite virtual memory manager using PageTable. | bunnei | 2019-03-21 | 2 | -5/+5 |
| | |||||
* | video_core: Refactor to use MemoryManager interface for all memory access. | bunnei | 2019-03-16 | 3 | -55/+29 |
| | | | | | | | | | | | # Conflicts: # src/video_core/engines/kepler_memory.cpp # src/video_core/engines/maxwell_3d.cpp # src/video_core/morton.cpp # src/video_core/morton.h # src/video_core/renderer_opengl/gl_global_cache.cpp # src/video_core/renderer_opengl/gl_global_cache.h # src/video_core/renderer_opengl/gl_rasterizer_cache.cpp | ||||
* | gpu: Use host address for caching instead of guest address. | bunnei | 2019-03-15 | 3 | -4/+12 |
| | |||||
* | Merge pull request #2147 from ReinUsesLisp/texture-clean | bunnei | 2019-03-10 | 1 | -12/+13 |
|\ | | | | | shader_ir: Remove "extras" from the MetaTexture | ||||
| * | shader/decode: Remove extras from MetaTexture | ReinUsesLisp | 2019-02-26 | 1 | -4/+4 |
| | | |||||
| * | shader/decode: Split memory and texture instructions decoding | ReinUsesLisp | 2019-02-26 | 1 | -8/+9 |
| | | |||||
* | | gpu: Move command processing to another thread. | bunnei | 2019-03-07 | 2 | -3/+3 |
| | | |||||
* | | video_core/engines: Remove unnecessary includes | Lioncash | 2019-03-06 | 8 | -10/+9 |
| | | | | | | | | | | | | | | | | | | Removes a few unnecessary dependencies on core-related machinery, such as the core.h and memory.h, which reduces the amount of rebuilding necessary if those files change. This also uncovered some indirect dependencies within other source files. This also fixes those. | ||||
* | | Merge pull request #2163 from ReinUsesLisp/bitset-dirty | bunnei | 2019-02-28 | 2 | -41/+40 |
|\ \ | | | | | | | maxwell_3d: Use std::bitset to manage dirty flags | ||||
| * | | maxwell_3d: Use std::bitset to manage dirty flags | ReinUsesLisp | 2019-02-26 | 2 | -41/+40 |
| |/ | |||||
* / | common/math_util: Move contents into the Common namespace | Lioncash | 2019-02-27 | 2 | -5/+5 |
|/ | | | | | These types are within the common library, so they should be within the Common namespace. | ||||
* | Merge pull request #2118 from FernandoS27/ipa-improve | bunnei | 2019-02-25 | 2 | -6/+41 |
|\ | | | | | shader_decompiler: Improve Accuracy of Attribute Interpolation. | ||||
| * | shader_decompiler: Improve Accuracy of Attribute Interpolation. | Fernando Sahmkow | 2019-02-14 | 2 | -6/+41 |
| | | |||||
* | | video_core: Remove usages of System::GetInstance() within the engines | Lioncash | 2019-02-16 | 6 | -16/+39 |
| | | | | | | | | | | Avoids the use of the global accessor in favor of explicitly making the system a dependency within the interface. | ||||
* | | core_timing: Convert core timing into a class | Lioncash | 2019-02-16 | 1 | -1/+1 |
|/ | | | | | | | | | | | Gets rid of the largest set of mutable global state within the core. This also paves a way for eliminating usages of GetInstance() on the System class as a follow-up. Note that no behavioral changes have been made, and this simply extracts the functionality into a class. This also has the benefit of making dependencies on the core timing functionality explicit within the relevant interfaces. | ||||
* | Merge pull request #2110 from lioncash/namespace | bunnei | 2019-02-13 | 1 | -1/+1 |
|\ | | | | | core_timing: Rename CoreTiming namespace to Core::Timing | ||||
| * | core_timing: Rename CoreTiming namespace to Core::Timing | Lioncash | 2019-02-12 | 1 | -1/+1 |
| | | | | | | | | | | | | Places all of the timing-related functionality under the existing Core namespace to keep things consistent, rather than having the timing utilities sitting in its own completely separate namespace. | ||||
* | | Merge pull request #2104 from ReinUsesLisp/compute-assert | bunnei | 2019-02-13 | 3 | -43/+50 |
|\ \ | | | | | | | kepler_compute: Fixup assert and rename the engine | ||||
| * | | kepler_compute: Fixup assert and rename engines | ReinUsesLisp | 2019-02-10 | 3 | -43/+50 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | When I originally added the compute assert I used the wrong documentation. This addresses that. The dispatch register was tested with homebrew against hardware and is triggered by some games (e.g. Super Mario Odyssey). What exactly is missing to get a valid program bound by this engine requires more investigation. | ||||
* | | | Corrected F2I None mode to RoundEven. | Fernando Sahmkow | 2019-02-11 | 1 | -1/+1 |
| |/ |/| | |||||
* | | gl_rasterizer: Implement a more accurate fermi 2D copy. | bunnei | 2019-02-07 | 2 | -52/+39 |
|/ | | | | - This is a blit, use the blit registers. | ||||
* | Merge pull request #2042 from ReinUsesLisp/nouveau-tex | bunnei | 2019-02-07 | 4 | -67/+67 |
|\ | | | | | maxwell_3d: Allow texture handles with TIC id zero | ||||
| * | video_core: Assert on invalid GPU to CPU address queries | ReinUsesLisp | 2019-02-03 | 4 | -41/+54 |
| | | |||||
| * | maxwell_3d: Allow sampler handles with TSC id zero | ReinUsesLisp | 2019-02-03 | 1 | -10/+6 |
| | | |||||
| * | maxwell_3d: Allow texture handles with TIC id zero | ReinUsesLisp | 2019-02-03 | 1 | -16/+7 |
| | | | | | | | | | | Also remove "enabled" field from Tegra::Texture::FullTextureInfo because it would become unused. | ||||
* | | Merge pull request #2081 from ReinUsesLisp/lmem-64 | bunnei | 2019-02-05 | 1 | -3/+3 |
|\ \ | | | | | | | shader_ir/memory: Add LD_L 64 bits loads | ||||
| * | | shader_bytecode: Rename BytesN enums to BitsN | ReinUsesLisp | 2019-02-03 | 1 | -3/+3 |
| |/ | |||||
* | | Merge pull request #2082 from FernandoS27/txq-stl | bunnei | 2019-02-05 | 1 | -0/+4 |
|\ \ | |/ |/| | Fix TXQ not using the component mask. | ||||
| * | Update src/video_core/engines/shader_bytecode.h | Mat M | 2019-02-04 | 1 | -1/+1 |
| | | | | | | Co-Authored-By: FernandoS27 <fsahmkow27@gmail.com> | ||||
| * | Fix TXQ not using the component mask. | Fernando Sahmkow | 2019-02-03 | 1 | -0/+4 |
| | | |||||
* | | shader_ir: Unify constant buffer offset values | ReinUsesLisp | 2019-01-30 | 1 | -0/+8 |
|/ | | | | | | Constant buffer values on the shader IR were using different offsets depending on whether the access was direct or indirect. cbuf34 has a non-multiplied offset while cbuf36 does. On shader decoding this commit multiplies it by four on cbuf34 queries. | ||||
* | shader_decode: Implement LDG and basic cbuf tracking | ReinUsesLisp | 2019-01-30 | 1 | -0/+8 |
| | |||||
* | Merge pull request #1927 from ReinUsesLisp/shader-ir | bunnei | 2019-01-26 | 2 | -3/+9 |
|\ | | | | | video_core: Replace gl_shader_decompiler with an IR based decompiler | ||||
| * | shader_decode: Implement VMAD and VSETP | ReinUsesLisp | 2019-01-15 | 1 | -2/+3 |
| | | |||||
| * | shader_decode: Implement HFMA2 | ReinUsesLisp | 2019-01-15 | 1 | -0/+1 |
| | | |||||
| * | shader_decode: Fixup clang-format | ReinUsesLisp | 2019-01-15 | 1 | -1/+1 |
| | | |||||
| * | shader_ir: Initial implementation | ReinUsesLisp | 2019-01-15 | 1 | -0/+4 |
| | | |||||
| * | shader_bytecode: Fixup encoding | ReinUsesLisp | 2019-01-15 | 1 | -1/+1 |
| | | |||||
| * | shader_header: Make local memory size getter constant | ReinUsesLisp | 2019-01-15 | 1 | -1/+1 |
| | | |||||
* | | maxwell_3d: Set rt_separate_frag_data to 1 by default | ReinUsesLisp | 2019-01-22 | 1 | -0/+5 |
| | | | | | | | | | | | | | | Commercial games assume that this value is 1 but they never set it. On the other hand nouveau manually sets this register. On ConfigureFramebuffers we were asserting for what we are actually implementing (according to envytools). | ||||
* | | gl_rasterizer_cache: Use dirty flags for the depth buffer | ReinUsesLisp | 2019-01-07 | 2 | -0/+12 |
| | | |||||
* | | gl_rasterizer_cache: Use dirty flags for color buffers | ReinUsesLisp | 2019-01-07 | 2 | -0/+12 |
|/ | |||||
* | gl_shader_cache: Use dirty flags for shaders | ReinUsesLisp | 2019-01-07 | 2 | -0/+11 |
| | |||||
* | shader_bytecode: Fixup TEXS.F16 encoding | ReinUsesLisp | 2018-12-26 | 1 | -1/+1 |
| | |||||
* | Fixed uninitialized memory due to missing returns in canary | David Marcec | 2018-12-19 | 2 | -0/+4 |
| | | | | Functions which are supposed to crash on non-canary builds usually don't return anything, which led to uninitialized memory being used. | ||||
* | shader_bytecode: Fixup half float's operator B encoding | ReinUsesLisp | 2018-12-18 | 1 | -1/+1 |
| | |||||
* | Implement postfactor multiplication/division for fmul instructions | heapo | 2018-12-17 | 1 | -1/+1 |
| | |||||
* | gl_shader_decompiler: Implement TEXS.F16 | ReinUsesLisp | 2018-12-05 | 1 | -1/+2 |
| | |||||
* | gl_rasterizer: Enable clip distances when set in register and in shader | ReinUsesLisp | 2018-11-29 | 1 | -0/+1 |
| | |||||
* | Merge pull request #1808 from Tinob/master | bunnei | 2018-11-28 | 1 | -1/+15 |
|\ | | | | | Fix clip distance and viewport | ||||
| * | Add support for Clip Distance enabled register | Rodolfo Bogado | 2018-11-27 | 1 | -1/+15 |
| | | |||||
* | | Merge pull request #1786 from Tinob/DepthClamp | bunnei | 2018-11-28 | 1 | -1/+9 |
|\ \ | | | | | | | Add Depth Clamp Support | ||||
| * | | Implement depth clamp | Rodolfo Bogado | 2018-11-27 | 1 | -1/+9 |
| |/ | |||||
* | | Merge pull request #1792 from bunnei/dma-pusher | bunnei | 2018-11-28 | 10 | -47/+52 |
|\ \ | | | | | | | gpu: Rewrite GPU command list processing with DmaPusher class. | ||||
| * | | gpu: Rewrite GPU command list processing with DmaPusher class. | bunnei | 2018-11-27 | 10 | -47/+52 |
| |/ | | | | | | | - More accurate impl., fixes Undertale (among other games). | ||||
* | | Merge pull request #1735 from FernandoS27/tex-spacing | bunnei | 2018-11-28 | 1 | -2/+2 |
|\ \ | |/ |/| | Texture decoder: Implemented Tile Width Spacing | ||||
| * | Implemented Tile Width Spacing | FernandoS27 | 2018-11-26 | 1 | -2/+2 |
| | | |||||
* | | Merge pull request #1794 from Tinob/master | bunnei | 2018-11-27 | 1 | -1/+9 |
|\ \ | | | | | | | Add support for viewport_transfom_enable register | ||||
| * | | Add support for viewport_transfom_enable register | Rodolfo Bogado | 2018-11-24 | 1 | -1/+9 |
| | | | |||||
* | | | Merge pull request #1723 from degasus/dirty_flags | bunnei | 2018-11-27 | 5 | -0/+34 |
|\ \ \ | | | | | | | | | gl_rasterizer: Skip VB upload if the state is clean. | ||||
| * | | | gl_rasterizer: Skip VB upload if the state is clean. | Markus Wick | 2018-11-17 | 5 | -0/+34 |
| | | | | |||||
* | | | | GPU States: Implement Polygon Offset. This is used in SMO all the time. (#1784) | Marcos | 2018-11-27 | 1 | -4/+26 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * GPU States: Implement Polygon Offset. This is used in SMO all the time. * Clang Format fixes. * Initialize polygon_offset in the constructor. | ||||
* | | | | Merge pull request #1798 from ReinUsesLisp/y-direction | bunnei | 2018-11-27 | 1 | -0/+1 |
|\ \ \ \ | |_|_|/ |/| | | | gl_shader_decompiler: Implement S2R's Y_DIRECTION | ||||
| * | | | gl_shader_decompiler: Implement S2R's Y_DIRECTION | ReinUsesLisp | 2018-11-25 | 1 | -0/+1 |
| | |/ | |/| | |||||
* | | | Merge pull request #1763 from ReinUsesLisp/bfi | bunnei | 2018-11-26 | 1 | -0/+3 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Implement BFI_IMM_R | ||||
| * | | | gl_shader_decompiler: Implement BFI_IMM_R | ReinUsesLisp | 2018-11-21 | 1 | -0/+3 |
| | | | | |||||
* | | | | Merge pull request #1760 from ReinUsesLisp/r2p | bunnei | 2018-11-26 | 1 | -0/+14 |
|\ \ \ \ | | | | | | | | | | | gl_shader_decompiler: Implement R2P_IMM | ||||
| * | | | | gl_shader_decompiler: Implement R2P_IMM | ReinUsesLisp | 2018-11-21 | 1 | -0/+14 |
| |/ / / | |||||
* | | | | Merge pull request #1783 from ReinUsesLisp/clip-distances | bunnei | 2018-11-26 | 2 | -1/+12 |
|\ \ \ \ | |_|/ / |/| | | | gl_shader_decompiler: Implement clip distances | ||||
| * | | | gl_shader_decompiler: Implement clip distances | ReinUsesLisp | 2018-11-23 | 2 | -1/+12 |
| | | | | |||||
* | | | | Merge pull request #1785 from Tinob/master | bunnei | 2018-11-24 | 1 | -1/+11 |
|\ \ \ \ | | | | | | | | | | | Add support for clear_flags register | ||||
| * | | | | Add support for clear_flags register | Rodolfo Bogado | 2018-11-24 | 1 | -1/+11 |
| | | | | | |||||
* | | | | | Merge pull request #1769 from ReinUsesLisp/cc | bunnei | 2018-11-24 | 1 | -4/+3 |
|\ \ \ \ \ | |/ / / / |/| | | | | gl_shader_decompiler: Rename cc to condition code and name internal flags | ||||
| * | | | | gl_shader_decompiler: Rename control codes to condition codes | ReinUsesLisp | 2018-11-22 | 1 | -4/+3 |
| | |/ / | |/| | | |||||
* | | | | Added predicate comparison LessEqualWithNan (#1736) | Hexagon12 | 2018-11-23 | 1 | -0/+1 |
| |/ / |/| | | | | | | | | | | | | | | | | | | | | * Added predicate comparison LessEqualWithNan * oops * Clang fix | ||||
* | | | maxwell_3d: Implement alternate blend equations. | bunnei | 2018-11-22 | 1 | -0/+7 |
|/ / | | | | | | | - Used by Undertale. | ||||
* | | maxwell_3d: Initialize rasterizer color mask registers as enabled. | bunnei | 2018-11-21 | 1 | -0/+9 |
| | | | | | | | | - Fixes rendering regression with Sonic Mania. | ||||
* | | small fix for alphaToOne bit location | Rodolfo Bogado | 2018-11-17 | 1 | -2/+2 |
| | | |||||
* | | fix for gcc compilation | Rodolfo Bogado | 2018-11-17 | 1 | -60/+61 |
| | | |||||
* | | add AlphaToCoverage and AlphaToOne | Rodolfo Bogado | 2018-11-17 | 1 | -1/+7 |
| | | |||||
* | | add support for fragment_color_clamp | Rodolfo Bogado | 2018-11-17 | 1 | -1/+4 |
| | | |||||
* | | set default value for point size register | Rodolfo Bogado | 2018-11-17 | 1 | -0/+3 |
| | | |||||
* | | fix viewport and scissor behavior | Rodolfo Bogado | 2018-11-17 | 2 | -12/+18 |
|/ | |||||
* | gl_rasterizer: Minor cleanup | Frederic L | 2018-11-13 | 1 | -4/+2 |
| | | | Minor code cleanup from unaddressed feedback in #1654 | ||||
* | Try to fix problems with stencil test in some games, relax translation to opengl enums to avoid crashing and only generate logs of the errors. | Rodolfo Bogado | 2018-11-11 | 2 | -0/+21 |
| | |||||
* | Merge pull request #1654 from degasus/dirty_flags | bunnei | 2018-11-11 | 2 | -0/+14 |
|\ | | | | | gl_rasterizer: Skip VAO binding if the state is clean. | ||||
| * | gl_rasterizer: Skip VAO binding if the state is clean. | Markus Wick | 2018-11-06 | 2 | -0/+14 |
| | | |||||
* | | Add support to color mask to avoid issues in blending caused by wrong values in the alpha channel in some render targets. | Rodolfo Bogado | 2018-11-05 | 1 | -3/+20 |
| | | |||||
* | | Implement multi-target viewports and blending | Rodolfo Bogado | 2018-11-05 | 2 | -2/+28 |
|/ | |||||
* | Merge pull request #1527 from FernandoS27/assert-flow | bunnei | 2018-11-01 | 1 | -0/+1 |
|\ | | | | | Assert Control Flow Instructions using Control Codes | ||||
| * | Assert Control Flow Instructions using Control Codes | FernandoS27 | 2018-10-29 | 1 | -1/+2 |
| | | |||||
* | | maxwell_3d: Restructure macro upload to use a single macro code memory. | bunnei | 2018-11-01 | 2 | -12/+39 |
| | | | | | | | | | | - Fixes an issue where macros could be skipped. - Fixes rendering of distant objects in Super Mario Odyssey. | ||||
* | | Merge pull request #1528 from FernandoS27/assert-control-codes | bunnei | 2018-11-01 | 1 | -1/+5 |
|\ \ | | | | | | | Assert Control Codes Generation on Shader Instructions | ||||
| * | | Assert Control Codes Generation | FernandoS27 | 2018-10-30 | 1 | -1/+5 |
| |/ | |||||
* / | global: Use std::optional instead of boost::optional (#1578) | Frederic L | 2018-10-30 | 2 | -9/+9 |
|/ | | | | | | | | | | | | | | | * get rid of boost::optional * Remove optional references * Use std::reference_wrapper for optional references * Fix clang format * Fix clang format part 2 * Addressed feedback * Fix clang format and MacOS build | ||||
* | Implement sRGB Support, including workarounds for nvidia driver issues and QT sRGB support | Rodolfo Bogado | 2018-10-28 | 1 | -1/+6 |
| | |||||
* | gl_rasterizer: Implement primitive restart. | bunnei | 2018-10-26 | 1 | -1/+9 |
| | |||||
* | Merge pull request #1533 from FernandoS27/lmem | bunnei | 2018-10-26 | 2 | -0/+36 |
|\ | | | | | Implemented Shader Local Memory | ||||
| * | Implemented LD_L and ST_L | FernandoS27 | 2018-10-24 | 2 | -0/+36 |
| | | |||||
* | | maxwell_3d: Add code for initializing register defaults. | bunnei | 2018-10-26 | 2 | -1/+21 |
|/ | |||||
* | Merge pull request #1554 from FernandoS27/pointsize | bunnei | 2018-10-24 | 1 | -0/+1 |
|\ | | | | | Implement PointSize Output Attribute. | ||||
| * | Implement PointSize | FernandoS27 | 2018-10-23 | 1 | -0/+1 |
| | | |||||
* | | maxwell_3d: Remove unused variable within ProcessQueryGet() | Lioncash | 2018-10-24 | 1 | -1/+0 |
|/ | |||||
* | Merge pull request #1519 from ReinUsesLisp/vsetp | bunnei | 2018-10-23 | 1 | -3/+15 |
|\ | | | | | gl_shader_decompiler: Implement VSETP | ||||
| * | gl_shader_decompiler: Implement VSETP | ReinUsesLisp | 2018-10-23 | 1 | -0/+2 |
| | | |||||
| * | gl_shader_decompiler: Abstract VMAD into a video subset | ReinUsesLisp | 2018-10-23 | 1 | -3/+13 |
| | | |||||
* | | Merge pull request #1539 from lioncash/dma | bunnei | 2018-10-23 | 3 | -19/+10 |
|\ \ | | | | | | | maxwell_dma: Silence compilation warnings | ||||
| * | | engines/maxwell_*: Use nested namespace specifiers where applicable | Lioncash | 2018-10-20 | 3 | -12/+6 |
| | | | | | | | | | | | | | | | | | | These three source files are the only ones within the engines directory that don't use nested namespaces. We may as well change these over to keep things consistent. | ||||
| * | | maxwell_dma: Make variables const where applicable within HandleCopy() | Lioncash | 2018-10-20 | 1 | -3/+3 |
| | | | | | | | | | | | | These are never modified, so we can make that assumption explicit. | ||||
| * | | maxwell_dma: Make FlushAndInvalidate's size parameter a u64 | Lioncash | 2018-10-20 | 1 | -1/+1 |
| | | | | | | | | | | | | This prevents truncation warnings at the lambda's usage sites. | ||||
| * | | maxwell_dma: Remove unused variables in HandleCopy() | Lioncash | 2018-10-20 | 1 | -3/+0 |
| | | | | | | | | | | | | These pointer variables are never used, so we can get rid of them. | ||||
* | | | Merge pull request #1470 from FernandoS27/alpha_testing | bunnei | 2018-10-23 | 1 | -1/+3 |
|\ \ \ | | | | | | | | | Implemented Alpha Test using Shader Emulation | ||||
| * | | | Implemented Alpha Testing | FernandoS27 | 2018-10-22 | 1 | -1/+3 |
| | | | | |||||
* | | | | Merge pull request #1512 from ReinUsesLisp/brk | bunnei | 2018-10-23 | 1 | -3/+7 |
|\ \ \ \ | |_|_|/ |/| | | | gl_shader_decompiler: Implement PBK and BRK | ||||
| * | | | gl_shader_decompiler: Implement PBK and BRK | ReinUsesLisp | 2018-10-18 | 1 | -3/+7 |
| | | | | |||||
* | | | | Added Saturation to FMUL32I | FernandoS27 | 2018-10-23 | 1 | -0/+4 |
| |/ / |/| | | |||||
* | | | Fixed FSETP and FSET | FernandoS27 | 2018-10-22 | 1 | -2/+0 |
| |/ |/| | |||||
* | | Merge pull request #1501 from ReinUsesLisp/half-float | bunnei | 2018-10-20 | 1 | -0/+145 |
|\ \ | | | | | | | gl_shader_decompiler: Implement H* instructions | ||||
| * | | gl_shader_decompiler: Implement HSET2_R | ReinUsesLisp | 2018-10-15 | 1 | -0/+18 |
| | | | |||||
| * | | gl_shader_decompiler: Implement HSETP2_R | ReinUsesLisp | 2018-10-15 | 1 | -0/+20 |
| | | | |||||
| * | | gl_shader_decompiler: Implement HFMA2 instructions | ReinUsesLisp | 2018-10-15 | 1 | -0/+32 |
| | | | |||||
| * | | gl_shader_decompiler: Implement HADD2_IMM and HMUL2_IMM | ReinUsesLisp | 2018-10-15 | 1 | -0/+30 |
| | | | |||||
| * | | gl_shader_decompiler: Implement non-immediate HADD2 and HMUL2 instructions | ReinUsesLisp | 2018-10-15 | 1 | -0/+25 |
| | | | |||||
| * | | gl_shader_decompiler: Setup base for half float unpacking and setting | ReinUsesLisp | 2018-10-15 | 1 | -0/+20 |
| | | | |||||
* | | | GPU: Improved implementation of maxwell DMA (Subv). | bunnei | 2018-10-19 | 2 | -16/+65 |
| | | | |||||
* | | | GPU: Invalidate destination address of kepler_memory writes. | bunnei | 2018-10-19 | 2 | -2/+16 |
| | | | |||||
* | | | fermi_2d: Add support for more accurate surface copies. | bunnei | 2018-10-19 | 1 | -3/+6 |
| | | | |||||
* | | | Implement 3D Textures | FernandoS27 | 2018-10-18 | 1 | -1/+4 |
| |/ |/| | |||||
* | | shader_bytecode: Add Control Code enum 0xf | ReinUsesLisp | 2018-10-15 | 1 | -1/+1 |
| | | | | | | | | | | | | Control Code 0xf means to unconditionally execute the instruction. This value is passed to most BRA, EXIT and SYNC instructions (among others) but this may not always be the case. | ||||
* | | Propagate depth and depth_block on modules using decoders | FernandoS27 | 2018-10-13 | 3 | -10/+18 |
|/ | |||||
* | gl_shader_decompiler: Implement VMAD | ReinUsesLisp | 2018-10-11 | 1 | -0/+36 |
| | |||||
* | Merge pull request #1458 from FernandoS27/fix-render-target-block-settings | bunnei | 2018-10-11 | 2 | -4/+34 |
|\ | | | | | Fixed block height settings for RenderTargets and Depth Buffers | ||||
| * | Add memory Layout to Render Targets and Depth Buffers | FernandoS27 | 2018-10-10 | 1 | -2/+14 |
| | | |||||
| * | Fixed block height settings for RenderTargets and Depth Buffers, and added block width and block depth | FernandoS27 | 2018-10-10 | 2 | -4/+22 |
| | | |||||
* | | Merge pull request #1460 from FernandoS27/scissor_test | bunnei | 2018-10-10 | 1 | -1/+16 |
|\ \ | | | | | | | Implemented Scissor Testing | ||||
| * | | Assert Scissor tests | FernandoS27 | 2018-10-09 | 1 | -1/+16 |
| |/ | |||||
* / | gl_shader_decompiler: Implement geometry shaders | ReinUsesLisp | 2018-10-07 | 1 | -0/+112 |
|/ | |||||
* | fermi_2d: Implement simple copies with AccelerateSurfaceCopy. | bunnei | 2018-10-06 | 2 | -23/+35 |
| | |||||
* | gl_rasterizer: Implement quads topology | ReinUsesLisp | 2018-10-04 | 1 | -0/+6 |
| | |||||
* | Merge pull request #1411 from ReinUsesLisp/point-size | bunnei | 2018-09-29 | 1 | -1/+6 |
|\ | | | | | video_core: Implement point_size and add point state sync | ||||
| * | video_core: Implement point_size and add point state sync | ReinUsesLisp | 2018-09-28 | 1 | -1/+6 |
| | | |||||
* | | gl_state: Pack sampler bindings into a single ARB_multi_bind | ReinUsesLisp | 2018-09-28 | 1 | -0/+1 |
|/ | |||||
* | video_core: Add asserts for CS, TFB and alpha testing | ReinUsesLisp | 2018-09-26 | 3 | -3/+64 |
| | | | | | | Add asserts for compute shader dispatching, transform feedback being enabled and alpha testing. These have in common that they'll probably break rendering without logging. | ||||
* | shader_bytecode: Lay out the Ipa-related enums better | Lioncash | 2018-09-21 | 1 | -2/+12 |
| | | | | This is more consistent with the surrounding enums. | ||||
* | shader_bytecode: Make operator== and operator!= of IpaMode const qualified | Lioncash | 2018-09-21 | 1 | -6/+7 |
| | | | | | These don't affect the state of the struct and can be const member functions. | ||||
* | Merge pull request #1279 from FernandoS27/csetp | bunnei | 2018-09-19 | 1 | -0/+47 |
|\ | | | | shader_decompiler: Implemented (Partially) Control Codes and CSETP | ||||
| * | Implemented I2I.CC on the NEU control code, used by SMO | FernandoS27 | 2018-09-17 | 1 | -1/+1 |
| | | |||||
| * | Implemented CSETP | FernandoS27 | 2018-09-17 | 1 | -0/+11 |
| | | |||||
| * | Implemented Control Codes | FernandoS27 | 2018-09-17 | 1 | -0/+36 |
| | | |||||
* | | Merge pull request #1299 from FernandoS27/texture-sanatize | bunnei | 2018-09-19 | 1 | -1/+147 |
|\ \ | | | | | | | shader_decompiler: Asserts for Texture Instructions | ||||
| * | | Added texture misc modes to texture instructions | FernandoS27 | 2018-09-17 | 1 | -1/+147 |
| |/ | |||||
* | | Merge pull request #1290 from FernandoS27/shader-header | bunnei | 2018-09-18 | 1 | -0/+103 |
|\ \ | |/ |/| | Implemented (Partially) Shader Header | ||||
| * | Replace old FragmentHeader for the new Header | FernandoS27 | 2018-09-11 | 1 | -9/+15 |
| | | |||||
| * | Implemented (Partially) Shader Header | FernandoS27 | 2018-09-11 | 1 | -0/+97 |
| | | |||||
* | | Merge pull request #1326 from FearlessTobi/port-4182 | bunnei | 2018-09-17 | 6 | -32/+33 |
|\ \ | | | | | | | Port #4182 from Citra: "Prefix all size_t with std::" | ||||
| * | | Port #4182 from Citra: "Prefix all size_t with std::" | fearlessTobi | 2018-09-15 | 6 | -32/+33 |
| | | | |||||
* | | | Merge pull request #1273 from Subv/ld_sizes | bunnei | 2018-09-15 | 1 | -1/+9 |
|\ \ \ | | | | | | | | | Shaders: Implemented multiple-word loads and stores to and from attribute memory. | ||||
| * | | | Shaders: Implemented multiple-word loads and stores to and from attribute memory. | Subv | 2018-09-15 | 1 | -1/+9 |
| |/ / | | | | | | | | | | This seems to be an optimization performed by nouveau. | ||||
* | | | Merge pull request #1271 from Subv/kepler_engine | bunnei | 2018-09-15 | 2 | -0/+135 |
|\ \ \ | |/ / |/| | | GPU: Basic implementation of the Kepler Inline Memory engine (p2mf). | ||||
| * | | GPU: Basic implementation of the Kepler Inline Memory engine (p2mf). | Subv | 2018-09-12 | 2 | -0/+135 |
| | | | | | | | | | | | | This engine writes data from a FIFO register into the configured address. | ||||
* | | | Merge pull request #1263 from FernandoS27/tex-mode | bunnei | 2018-09-12 | 1 | -0/+10 |
|\ \ \ | |/ / |/| | | shader_decompiler: Implemented (Partially) Texture Processing Modes | ||||
| * | | Implemented Texture Processing Modes | FernandoS27 | 2018-09-12 | 1 | -0/+10 |
| |/ | |||||
* / | Implemented encodings for LEA and PSET | FernandoS27 | 2018-09-11 | 1 | -0/+64 |
|/ | |||||
* | rasterizer: Drop unused handler. | Markus Wick | 2018-09-10 | 1 | -2/+0 |
| | | | | | | | | This virtual function is called in a very hot spot, and it does nothing. If this kind of feature is required, please be more specific and add callbacks in the switch statement within Maxwell3D::WriteReg. There is no point in having another switch statement within the rasterizer. | ||||
* | gl_rasterizer: Implement multiple color attachments. | bunnei | 2018-09-10 | 1 | -1/+21 |
| | |||||
* | Merge pull request #1268 from FernandoS27/tmml | bunnei | 2018-09-10 | 1 | -5/+19 |
|\ | | | | | shader_decompiler: Implemented TMML | ||||
| * | Implemented TMML | FernandoS27 | 2018-09-10 | 1 | -5/+19 |
| | | |||||
* | | Merge pull request #1272 from Subv/dma_2d | bunnei | 2018-09-10 | 1 | -2/+10 |
|\ \ | |/ |/| | GPU/DMA: Partially implemented the 'enable_2d' bit in the DMA engine. | ||||
| * | GPU/DMA: Partially implemented the 'enable_2d' bit in the DMA engine. | Subv | 2018-09-08 | 1 | -2/+10 |
| | | | | | | | | | | | | | | When not set, this tells the GPU to only use the X size when performing a DMA copy. This is only implemented for linear->linear and tiled->tiled copies. Conversion copies still retain the assert. This bit is unset by some games for various purposes, and by nouveau when copying the vertex buffers. | ||||
* | | Implemented TXQ dimension query type, used by SMO. | FernandoS27 | 2018-09-09 | 1 | -1/+16 |
| | | |||||
* | | Change name of TEXQ to TXQ, in order to match NVIDIA's naming | FernandoS27 | 2018-09-09 | 1 | -2/+2 |
| | | |||||
* | | maxwell_3d: Remove assert that no longer applies. | bunnei | 2018-09-08 | 1 | -4/+0 |
|/ | |||||
* | Merge pull request #1243 from degasus/VAO_cache | bunnei | 2018-09-06 | 1 | -2/+7 |
|\ | | | | | gl_rasterizer: Implement a VAO cache. | ||||
| * | gl_rasterizer: Implement a VAO cache. | Markus Wick | 2018-09-05 | 1 | -2/+7 |
| | | | | | | | | | | | | This patch caches VAO objects instead of re-emitting all pointers per draw call. Configuring these pointers is known to be a fast task, but it yields too many GL calls. So for better performance, just bind the VAO instead of 16 pointers. | ||||
* | | Implemented IPA Properly | FernandoS27 | 2018-09-06 | 1 | -0/+12 |
|/ | |||||
* | Merge pull request #1213 from DarkLordZach/octopath-fs | bunnei | 2018-09-02 | 1 | -2/+3 |
|\ | | | | | filesystem/maxwell_3d: Various changes to boot Project Octopath Traveller | ||||
| * | maxwell_3d: Use CoreTiming for query timestamp | Zach Hilman | 2018-09-01 | 1 | -2/+3 |
| | | |||||
* | | Merge pull request #1215 from ogniK5377/texs-nodep-assert | bunnei | 2018-09-02 | 1 | -0/+1 |
|\ \ | | | | | | | Added assert for TEXS nodep | ||||
| * | | Added assert for TEXS nodep | David Marcec | 2018-09-01 | 1 | -0/+1 |
| |/ | |||||
* | | Merge pull request #1214 from ogniK5377/ipa-assert | bunnei | 2018-09-02 | 1 | -2/+5 |
|\ \ | | | | | | | Added better asserts to IPA, Renamed IPA modes to match mesa | ||||
| * | | Added better asserts to IPA, Renamed IPA modes to match mesa | David Marcec | 2018-09-01 | 1 | -2/+5 |
| |/ | | | | | | | | | | | | | | | | | | IpaMode is changed to IpaInterpMode. IpaMode is supposed to be 2 bits, not 3. Added IpaSampleMode. Added Saturate. Renamed modes based on https://github.com/mesa3d/mesa/blob/d27c7918916cdc8092959124955f887592e37d72/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp#L2530 | ||||
* | | Merge pull request #1216 from ogniK5377/ffma-assert | bunnei | 2018-09-02 | 1 | -0/+3 |
|\ \ | | | | | | | Added FFMA asserts and missing fields | ||||
| * | | Removed saturate assert | David Marcec | 2018-09-01 | 1 | -1/+0 |
| | | | | | | | | | | | | Saturate already implemented | ||||
| * | | Added FFMA asserts | David Marcec | 2018-09-01 | 1 | -0/+4 |
| |/ | |||||
* | | Removed saturate assert | David Marcec | 2018-09-01 | 1 | -1/+0 |
| | | | | | | | | Unneeded as we already implement it | ||||
* | | Added FMUL asserts | David Marcec | 2018-09-01 | 1 | -0/+5 |
|/ | |||||
* | core/core: Replace includes with forward declarations where applicable | Lioncash | 2018-08-31 | 1 | -2/+1 |
| | | | | | | | | | | | The follow-up to e2457418dae19b889b2ad85255bb95d4cd0e4bff, which replaces most of the includes in the core header with forward declarations. This makes it so that if any of the headers the core header was previously including change, then no one will need to rebuild the bulk of the core, due to core.h being quite a prevalent inclusion. This should make turnaround for changes much faster for developers. | ||||
* | Added predicate comparison GreaterEqualWithNan | Hexagon12 | 2018-08-31 | 1 | -0/+1 |
| | |||||
* | gl_shader_decompiler: Implement POPC (#1203) | Laku | 2018-08-31 | 1 | -0/+10 |
| | | | | | | * Implement POPC * implement invert | ||||
* | Merge pull request #1200 from bunnei/improve-ipa | bunnei | 2018-08-30 | 1 | -0/+6 |
|\ | | | | | gl_shader_decompiler: Improve IPA for Pass mode with Position attribute. | ||||
| * | gl_shader_decompiler: Improve IPA for Pass mode with Position attribute. | bunnei | 2018-08-29 | 1 | -0/+6 |
| | | |||||
* | | Shaders: Implemented IADD3 | tech4me | 2018-08-29 | 1 | -1/+23 |
|/ | |||||
* | Merge pull request #1169 from Lakumakkara/sel | bunnei | 2018-08-28 | 1 | -1/+1 |
|\ | | | | | shader_bytecode: fix SEL_IMM bitstring | ||||
| * | fix SEL_IMM bitstring | Laku | 2018-08-24 | 1 | -1/+1 |
| | | |||||
* | | Merge pull request #1173 from lioncash/batch | bunnei | 2018-08-25 | 1 | -4/+4 |
|\ \ | |/ |/| | maxwell3d: Move FinishedPrimitiveBatch event after AcceleratedDrawBatch() | ||||
| * | maxwell3d: Move FinishedPrimitiveBatch event after AcceleratedDrawBatch() | Lioncash | 2018-08-25 | 1 | -4/+4 |
| | | | | | | | | | | The start and finish events should likely not be right after one another like this, otherwise the batch will appear to complete immediately | ||||
* | | Shaders: Added decodings for IADD3 instructions | tech4me | 2018-08-23 | 1 | -0/+6 |
|/ | |||||
* | maxwell_3d: Update to include additional stencil registers. | bunnei | 2018-08-23 | 1 | -20/+50 |
| | |||||
* | implement lop3 | Laku | 2018-08-22 | 1 | -0/+19 |
| | |||||
* | Merge pull request #1124 from Subv/logic_ops | bunnei | 2018-08-22 | 1 | -1/+28 |
|\ | | | | | GPU: Implemented logic ops. | ||||
| * | GPU: Added registers for the logicop functionality. | Subv | 2018-08-21 | 1 | -1/+28 |
| | | |||||
* | | shader_bytecode: Parenthesize conditional expression within GetTextureType() | Lioncash | 2018-08-21 | 1 | -1/+1 |
| | | | | | | | | Resolves a -Wlogical-op-parentheses warning. | ||||
* | | shader_bytecode: Replace some UNIMPLEMENTED logs. | bunnei | 2018-08-21 | 1 | -2/+6 |
|/ | |||||
* | Merge pull request #1104 from Subv/instanced_arrays | bunnei | 2018-08-20 | 1 | -1/+14 |
|\ | | | | | GLRasterizer: Implemented instanced vertex arrays. | ||||
| * | GLRasterizer: Implemented instanced vertex arrays. | Subv | 2018-08-18 | 1 | -1/+14 |
| | | | | | | | | Before each draw call, for every enabled vertex array configured as instanced, we take the current instance id and divide it by its configured divisor, then we multiply that by the corresponding stride and increment the start address by the resulting amount. This way we can simulate the vertex array being incremented once per instance without actually using OpenGL's instancing functions. | ||||
* | | Merge pull request #1112 from Subv/sampler_types | bunnei | 2018-08-20 | 1 | -4/+72 |
|\ \ | | | | | | | Shaders: Use the correct shader type when sampling textures. | ||||
| * | | Shader: Added bitfields for the texture type of the various sampling instructions. | Subv | 2018-08-19 | 1 | -1/+65 |
| | | | |||||
| * | | Shaders: Added decodings for TLD4 and TLD4S | Subv | 2018-08-19 | 1 | -3/+7 |
| | | | |||||
* | | | Merge pull request #1089 from Subv/neg_bits | bunnei | 2018-08-19 | 1 | -0/+4 |
|\ \ \ | | | | | | | | | Shaders: Corrected the 'abs' and 'neg' bit usage in the float arithmetic instructions. | ||||
| * | | | Shaders: Corrected the 'abs' and 'neg' bit usage in the float arithmetic instructions. | Subv | 2018-08-18 | 1 | -0/+4 |
| | | | | | | | | | | | | | | | | We should definitely audit our shader generator for more errors like this. | ||||
* | | | | Shaders/TEXS: Fixed the component mask in the TEXS instruction. | Subv | 2018-08-19 | 1 | -6/+11 |
| |/ / |/| | | | | | | | | Previously we could end up with a TEXS that didn't write any outputs, this was wrong. | ||||
* | | | Merge pull request #1109 from Subv/ldg_decode | bunnei | 2018-08-19 | 1 | -0/+4 |
|\ \ \ | | | | | | | | | Shaders: Added decodings for the LDG and STG instructions. | ||||
| * | | | Shaders: Added decodings for the LDG and STG instructions. | Subv | 2018-08-19 | 1 | -0/+4 |
| | |/ | |/| | |||||
* | | | Merge pull request #1108 from Subv/front_facing | bunnei | 2018-08-19 | 1 | -0/+3 |
|\ \ \ | | | | | | | | | Shaders: Implemented the gl_FrontFacing input attribute (attr 63). | ||||
| * | | | Shaders: Implemented the gl_FrontFacing input attribute (attr 63). | Subv | 2018-08-19 | 1 | -0/+3 |
| |/ / | |||||
* / / | Shader: Implemented the predicate and mode arguments of LOP. | Subv | 2018-08-18 | 1 | -1/+6 |
|/ / | | | | | | | | | | | The mode can be used to set the predicate to true depending on the result of the logic operation. In some cases, this means discarding the result (writing it to register 0xFF (Zero)). This is used by Super Mario Odyssey. | ||||
* | | Added PredCondition GreaterThanWithNan | David Marcec | 2018-08-18 | 1 | -0/+1 |
| | | |||||
* | | Rasterizer: Implemented instanced rendering. | Subv | 2018-08-15 | 2 | -0/+15 |
|/ | | | | | We keep track of the current instance and update a uniform in the shaders to let them know which instance they are. Instanced vertex arrays are not yet implemented. | ||||
* | gl_shader_decompiler: Implement XMAD instruction. | bunnei | 2018-08-13 | 1 | -4/+25 |
| | |||||
* | Merge pull request #1024 from Subv/blend_gl | bunnei | 2018-08-12 | 1 | -0/+21 |
|\ | | | | | GPU/Maxwell3D: Implemented an alternative set of blend factors. | ||||
| * | GPU/Maxwell3D: Implemented an alternative set of blend factors. | Subv | 2018-08-12 | 1 | -0/+21 |
| | | | | | | | | These are used by nouveau and some games like SMO. | ||||
* | | RasterizerGL: Ignore invalid/unset vertex attributes. | Subv | 2018-08-12 | 1 | -0/+5 |
|/ | | | | This should make the es2gears example not crash anymore. | ||||
* | Merge pull request #1010 from bunnei/unk-vert-attrib-shader | bunnei | 2018-08-12 | 1 | -2/+1 |
|\ | | | | | gl_shader_decompiler: Improve handling of unknown input/output attributes. | ||||
| * | gl_shader_decompiler: Improve handling of unknown input/output attributes. | bunnei | 2018-08-12 | 1 | -2/+1 |
| | | |||||
* | | Merge pull request #1018 from Subv/ssy_sync | bunnei | 2018-08-12 | 1 | -0/+7 |
|\ \ | |/ |/| | GPU/Shader: Implemented SSY and SYNC as a set_target/jump pair. | ||||
| * | GPU/Shader: Don't predicate instructions that don't have a predicate field (SSY). | Subv | 2018-08-11 | 1 | -0/+7 |
| | | |||||
* | | video_core: Use variable template variants of type_traits interfaces where applicable | Lioncash | 2018-08-10 | 1 | -2/+1 |
|/ | |||||
* | maxwell_3d: Ignore macros that have not been uploaded yet. | bunnei | 2018-08-09 | 1 | -4/+9 |
| | | | | - Used by Super Mario Odyssey (in game). | ||||
* | Merge pull request #982 from bunnei/stub-unk-63 | bunnei | 2018-08-09 | 1 | -0/+2 |
|\ | | | | | gl_shader_decompiler: Stub input attribute Unknown_63. | ||||
| * | gl_shader_decompiler: Stub input attribute Unknown_63. | bunnei | 2018-08-08 | 1 | -0/+2 |
| | | |||||
* | | Merge pull request #976 from bunnei/shader-imm | bunnei | 2018-08-09 | 1 | -9/+4 |
|\ \ | | | | | | | gl_shader_decompiler: Let OpenGL interpret floats. | ||||
| * | | gl_shader_decompiler: Let OpenGL interpret floats. | bunnei | 2018-08-08 | 1 | -9/+4 |
| |/ | | | | | | | | | - Accuracy is lost in translation to string, e.g. with NaN. - Needed for Super Mario Odyssey. | ||||
* / | maxwell_3d: Use correct const buffer size and check bounds. | bunnei | 2018-08-08 | 2 | -1/+3 |
|/ | | | | - Fixes mem corruption with Super Mario Odyssey and Pokkén Tournament DX. | ||||
* | maxwell_3d: Remove outdated assert. | bunnei | 2018-08-06 | 1 | -2/+0 |
| | |||||
* | video_core: Eliminate the g_renderer global variable | Lioncash | 2018-08-04 | 2 | -6/+12 |
| | | | | | | | | | | | | | | We move the initialization of the renderer to the core class, while keeping the creation of it and any other specifics in video_core. This way we can ensure that the renderer is initialized and doesn't give unfettered access to the renderer. This also makes dependencies on types more explicit. For example, the GPU class doesn't need to depend on the existence of a renderer, it only needs to care about whether or not it has a rasterizer, but since it was accessing the global variable, it was also making the renderer a part of its dependency chain. By adjusting the interface, we can get rid of this dependency. | ||||
* | GPU: Remove the assert that required the CODE_ADDRESS to be 0. | Subv | 2018-07-24 | 1 | -8/+0 |
| | | | | Games usually just leave it at 0 but nouveau sets it to something else. This already works fine, the assert is useless. | ||||
* | shader_bytecode: Implement other TEXS masks. | bunnei | 2018-07-22 | 1 | -5/+9 |
| | |||||
* | gl_shader_decompiler: Implement SEL instruction. | bunnei | 2018-07-22 | 1 | -0/+11 |
| | |||||
* | maxwell_3d: Add depth buffer enable, width, and height registers. | bunnei | 2018-07-22 | 1 | -2/+14 |
| | |||||
* | video_core: Use nested namespaces where applicable | Lioncash | 2018-07-21 | 6 | -28/+14 |
| | | | | Compresses a few namespace specifiers to be more compact. | ||||
* | maxwell_3d: Remove unused variable within GetStageTextures() | Lioncash | 2018-07-20 | 1 | -2/+0 |
| | |||||
* | GPU: Added register definitions for the stencil parameters. | Subv | 2018-07-17 | 1 | -2/+25 |
| | |||||
* | gl_rasterizer: Fix check for if a shader stage is enabled. | bunnei | 2018-07-13 | 2 | -24/+8 |
| | |||||
* | Merge pull request #655 from bunnei/pred-lt-nan | bunnei | 2018-07-13 | 1 | -0/+1 |
|\ | | | | | gl_shader_decompiler: Implement PredCondition::LessThanWithNan. | ||||
| * | gl_shader_decompiler: Implement PredCondition::LessThanWithNan. | bunnei | 2018-07-13 | 1 | -0/+1 |
| | | |||||
* | | gl_shader_decompiler: Use FlowCondition field in EXIT instruction. | bunnei | 2018-07-13 | 1 | -0/+9 |
|/ | |||||
* | Merge pull request #652 from Subv/fadd32i | Sebastian Valle | 2018-07-13 | 1 | -0/+9 |
|\ | | | | | GPU: Implement the FADD32I shader instruction. | ||||
| * | GPU: Implement the FADD32I shader instruction. | Subv | 2018-07-12 | 1 | -0/+9 |
| | | |||||
* | | Merge pull request #651 from Subv/ffma_decode | bunnei | 2018-07-12 | 1 | -1/+1 |
|\ \ | | | | | | | GPU: Corrected the decoding of FFMA for immediate operands. | ||||
| * | | GPU: Corrected the decoding of FFMA for immediate operands. | Subv | 2018-07-12 | 1 | -1/+1 |
| |/ | |||||
* | | Merge pull request #625 from Subv/imnmx | bunnei | 2018-07-08 | 1 | -3/+17 |
|\ \ | |/ |/| | GPU: Implemented the IMNMX shader instruction. | ||||
| * | GPU: Implemented the IMNMX shader instruction. | Subv | 2018-07-04 | 1 | -3/+17 |
| | | | | | | | | It's similar to the FMNMX instruction but it works on integers. | ||||
* | | Merge pull request #629 from Subv/depth_test | bunnei | 2018-07-05 | 1 | -9/+21 |
|\ \ | | | | | | | GPU: Allow using the old NV04 values for the depth test function. | ||||
| * | | GPU: Allow using the old NV04 values for the depth test function. | Subv | 2018-07-05 | 1 | -9/+21 |
| | | | | | | | | | | | | | | | | | These seem to be just as valid as the GL token values. Thanks @ReinUsesLisp. This restores graphical output to Disgaea 5. | ||||
* | | | Merge pull request #626 from Subv/shader_sync | bunnei | 2018-07-05 | 1 | -0/+5 |
|\ \ \ | |/ / |/| | | GPU: Stub the shader SYNC and DEPBAR instructions. | ||||
| * | | GPU: Stub the shader SYNC and DEPBAR instructions. | Subv | 2018-07-04 | 1 | -0/+5 |
| |/ | | | | | | | It is unknown at this moment if we actually need to do something with these instructions or if the GLSL compiler takes care of that for us. | ||||
* | | Merge pull request #622 from Subv/unused_tex | bunnei | 2018-07-05 | 1 | -1/+1 |
|\ \ | | | | | | | GPU: Ignore unused textures and corrected the TEX shader instruction decoding. | ||||
| * | | GPU: Corrected the decoding for the TEX shader instruction. | Subv | 2018-07-04 | 1 | -1/+1 |
| |/ | |||||
* | | Merge pull request #621 from Subv/psetp_ | bunnei | 2018-07-05 | 1 | -0/+13 |
|\ \ | | | | | | | GPU: Implemented the PSETP shader instruction. | ||||
| * | | GPU: Implemented the PSETP shader instruction. | Subv | 2018-07-04 | 1 | -0/+13 |
| |/ | | | | | | | It's similar to the isetp and fsetp instructions but it works on predicates instead. | ||||
* / | GPU: Flip the triangle front face winding if the GPU is configured to not flip the triangles. | Subv | 2018-07-04 | 1 | -3/+19 |
|/ | | | | | | OpenGL's default behavior is already correct when the GPU is configured to flip the triangles. This fixes 1-2 Switch's splash screen. | ||||
* | Merge pull request #609 from Subv/clear_buffers | bunnei | 2018-07-04 | 2 | -2/+39 |
|\ | | | | | GPU: Implemented the CLEAR_BUFFERS register. | ||||
| * | GPU: Support clears that don't clear the color buffer. | Subv | 2018-07-03 | 1 | -2/+3 |
| | | |||||
| * | GPU: Bind and clear the render target when the CLEAR_BUFFERS register is written to. | Subv | 2018-07-03 | 1 | -0/+11 |
| | | |||||
| * | GPU: Added registers for the CLEAR_BUFFERS and CLEAR_COLOR methods. | Subv | 2018-07-03 | 1 | -2/+27 |
| | | |||||
* | | Merge pull request #607 from jroweboy/logging | bunnei | 2018-07-03 | 3 | -5/+5 |
|\ \ | | | | | | | Logging - Customizable backends | ||||
| * | | Update clang format | James Rowe | 2018-07-03 | 2 | -3/+3 |
| | | | |||||
| * | | Rename logging macro back to LOG_* | James Rowe | 2018-07-03 | 3 | -3/+3 |
| |/ | |||||
* | | Merge pull request #611 from Subv/enabled_depth_test | bunnei | 2018-07-03 | 1 | -9/+9 |
|\ \ | | | | | | | GPU: Don't try to parse the depth test function if the depth test is disabled and use only the least significant 3 bits in the depth test func | ||||
| * | | GPU: Use only the least significant 3 bits when reading the depth test func. | Subv | 2018-07-03 | 1 | -9/+9 |
| |/ | | | | | | | Some games set the full GL define value here (including nouveau), but others just seem to set those last 3 bits. | ||||
* | | Merge pull request #610 from Subv/mufu_8 | bunnei | 2018-07-03 | 1 | -0/+1 |
|\ \ | |/ |/| | GPU: Implemented MUFU suboperation 8, sqrt. | ||||
| * | GPU: Implemented MUFU suboperation 8, sqrt. | Subv | 2018-07-03 | 1 | -0/+1 |
| | | |||||
* | | Merge pull request #608 from Subv/depth | bunnei | 2018-07-03 | 1 | -4/+52 |
|\ \ | | | | | | | GPU: Implemented the depth buffer and depth test + culling | ||||
| * | | GPU: Added registers for depth test and cull mode. | Subv | 2018-07-02 | 1 | -3/+51 |
| | | | |||||
| * | | GPU: Implemented the Z24S8 depth format and load the depth framebuffer. | Subv | 2018-07-02 | 1 | -1/+1 |
| |/ | |||||
* | | Merge pull request #606 from Subv/base_vertex | Sebastian Valle | 2018-07-02 | 1 | -1/+6 |
|\ \ | | | | | | | GPU: Fixed the index offset and implement BaseVertex when doing indexed rendering. | ||||
| * | | GPU: Added register definitions for the vertex buffer base element. | Subv | 2018-07-02 | 1 | -1/+6 |
| |/ | |||||
* | | Merge pull request #605 from Subv/dma_copy | Sebastian Valle | 2018-07-02 | 1 | -1/+5 |
|\ \ | |/ |/| | GPU: Directly copy the pixels when performing a same-layout DMA. | ||||
| * | GPU: Directly copy the pixels when performing a same-layout DMA. | Subv | 2018-07-02 | 1 | -1/+5 |
| | | |||||
* | | Merge pull request #602 from Subv/mufu_subop | bunnei | 2018-07-01 | 1 | -2/+1 |
|\ \ | | | | | | | GPU: Corrected the size of the MUFU subop field, and removed incorrect "min" operation. | ||||
| * | | GPU: Corrected the size of the MUFU subop field, and removed incorrect "min" operation. | Subv | 2018-06-30 | 1 | -2/+1 |
| |/ | |||||
* / | gl_shader_decompiler: Implement predicate NotEqualWithNan. | bunnei | 2018-06-30 | 1 | -0/+1 |
|/ | |||||
* | maxwell_3d: Add a struct for RenderTargetConfig. | bunnei | 2018-06-27 | 1 | -17/+19 |
| | |||||
* | Build: Fixed some MSVC warnings in various parts of the code. | Subv | 2018-06-20 | 2 | -4/+5 |
| | |||||
* | GPU: Don't mark uniform buffers and registers as used for instructions which don't have them. | Subv | 2018-06-19 | 1 | -2/+3 |
| | | | | | Like the MOV32I and FMUL32I instructions. This fixes a potential crash when using these instructions. | ||||
* | gl_shader_decompiler: Implement LOP instructions. | bunnei | 2018-06-17 | 1 | -0/+14 |
| | |||||
* | gl_shader_decompiler: Refactor LOP32I instruction a bit in support of LOP. | bunnei | 2018-06-17 | 1 | -3/+2 |
| | |||||
* | gl_shader_decompiler: Implement integer size conversions for I2I/I2F/F2I. | bunnei | 2018-06-16 | 1 | -1/+2 |
| | |||||
* | Merge pull request #556 from Subv/dma_engine | bunnei | 2018-06-12 | 3 | -0/+225 |
|\ | | | | | GPU: Partially implemented the Maxwell DMA engine. | ||||
| * | GPU: Partially implemented the Maxwell DMA engine. | Subv | 2018-06-12 | 3 | -0/+225 |
| | | | | | | | | Only tiled->linear and linear->tiled copies that have no offset are supported for now. Queries are not supported. Swizzled copies are not supported. | ||||
* | | Merge pull request #558 from Subv/iadd32i | bunnei | 2018-06-12 | 1 | -2/+10 |
|\ \ | | | | | | | GPU: Implemented the iadd32i shader instruction. | ||||
| * | | GPU: Implemented the iadd32i shader instruction. | Subv | 2018-06-12 | 1 | -2/+10 |
| |/ | |||||
* / | gl_shader_decompiler: Implement saturate for float instructions. | bunnei | 2018-06-12 | 1 | -2/+1 |
|/ | |||||
* | GPU: Implement the iset family of shader instructions. | Subv | 2018-06-09 | 1 | -0/+9 |
| | |||||
* | GPU: Added decodings for the ISET family of instructions. | Subv | 2018-06-09 | 1 | -0/+7 |
| | |||||
* | Merge pull request #550 from Subv/ssy | bunnei | 2018-06-09 | 1 | -0/+2 |
|\ | | | | | GPU: Stub the SSY shader instruction. | ||||
| * | GPU: Stub the SSY shader instruction. | Subv | 2018-06-09 | 1 | -0/+2 |
| | | | | | | | | This instruction tells the GPU where the flow reconverges in a non-uniform control flow scenario; we can ignore this when generating GLSL code. | ||||
* | | Merge pull request #551 from bunnei/shr | bunnei | 2018-06-09 | 1 | -0/+4 |
|\ \ | | | | | | | gl_shader_decompiler: Implement SHR instruction. | ||||
| * | | gl_shader_decompiler: Implement SHR instruction. | bunnei | 2018-06-09 | 1 | -0/+4 |
| |/ | |||||
* | | gl_shader_decompiler: Implement IADD instruction. | bunnei | 2018-06-09 | 1 | -5/+11 |
| | | |||||
* | | gl_shader_decompiler: Add missing asserts for saturate_a instructions. | bunnei | 2018-06-09 | 1 | -1/+1 |
|/ | |||||
* | GPU: Added registers for normal and independent blending. | Subv | 2018-06-09 | 1 | -5/+26 |
| | |||||
* | gl_shader_decompiler: Implement BFE_IMM instruction. | bunnei | 2018-06-07 | 1 | -3/+15 |
| | |||||
* | gl_shader_decompiler: F2F: Implement rounding modes. | bunnei | 2018-06-07 | 1 | -3/+12 |
| | |||||
* | shader_bytecode: Add instruction decodings for BFE, IMNMX, and XMAD. | bunnei | 2018-06-07 | 1 | -0/+20 |
| | |||||
* | Merge pull request #534 from Subv/multitexturing | bunnei | 2018-06-07 | 2 | -0/+37 |
|\ | | | | | GPU: Implement sampling multiple textures in the generated glsl shaders. | ||||
| * | GPU: Implement sampling multiple textures in the generated glsl shaders. | Subv | 2018-06-06 | 2 | -0/+37 |
| | | | | | | | | | | | | All tested games that use a single texture show no regression. Only Texture2D textures are supported right now, each shader gets its own "tex_fs/vs/gs" sampler array to maintain independent textures between shader stages, the textures themselves are reused if possible. | ||||
* | | gl_shader_decompiler: Implement LD_C instruction. | bunnei | 2018-06-07 | 1 | -0/+16 |
| | | |||||
* | | gl_shader_decompiler: Refactor uniform handling to allow different decodings. | bunnei | 2018-06-06 | 1 | -6/+10 |
|/ | |||||
* | Merge pull request #516 from Subv/f2i_r | bunnei | 2018-06-06 | 1 | -4/+20 |
|\ | | | | | GPU: Implemented the F2I_R shader instruction. | ||||
| * | GPU: Implemented the F2I_R shader instruction. | Subv | 2018-06-05 | 1 | -4/+20 |
| | | |||||
* | | Merge pull request #521 from Subv/bra | bunnei | 2018-06-05 | 1 | -4/+5 |
|\ \ | | | | | | | GPU: Corrected the branch targets for the shader bra instruction. | ||||
| * | | GPU: Corrected the branch targets for the shader bra instruction. | Subv | 2018-06-05 | 1 | -4/+5 |
| | | | |||||
* | | | gl_shader_decompiler: Implement SHL instruction. | bunnei | 2018-06-05 | 1 | -13/+17 |
|/ / | |||||
* | | GPU: Implement the ISCADD shader instructions. | Subv | 2018-06-05 | 1 | -0/+16 |
| | | |||||
* | | GPU: Added decodings for the ISCADD instructions. | Subv | 2018-06-05 | 1 | -0/+7 |
|/ | |||||
* | Merge pull request #514 from Subv/lop32i | bunnei | 2018-06-05 | 1 | -1/+15 |
|\ | | | | | GPU: Implemented the LOP32I instruction. | ||||
| * | GPU: Implemented the LOP32I instruction. | Subv | 2018-06-04 | 1 | -1/+15 |
| | | |||||
* | | Merge pull request #510 from Subv/isetp | bunnei | 2018-06-05 | 1 | -0/+10 |
|\ \ | | | | | | | GPU: Implemented the ISETP_R and ISETP_C instructions | ||||
| * | | GPU: Implemented the ISETP_R and ISETP_C shader instructions. | Subv | 2018-06-04 | 1 | -0/+10 |
| |/ | |||||
* | | Merge pull request #512 from Subv/fset | bunnei | 2018-06-05 | 1 | -1/+1 |
|\ \ | | | | | | | GPU: Corrected the FSET and I2F instructions. | ||||
| * | | GPU: Use the bf bit in FSET to determine whether to write 0xFFFFFFFF or 1.0f. | Subv | 2018-06-04 | 1 | -1/+1 |
| |/ | |||||
* | | Merge pull request #501 from Subv/shader_bra | bunnei | 2018-06-05 | 1 | -0/+15 |
|\ \ | | | | | | | GPU: Partially implemented the bra shader instruction | ||||
| * | | GPU: Partially implemented the shader BRA instruction. | Subv | 2018-06-04 | 1 | -0/+13 |
| | | | |||||
| * | | GPU: Added decoding for the BRA instruction. | Subv | 2018-06-04 | 1 | -0/+2 |
| |/ | |||||
* / | GPU: Calculate the correct viewport dimensions based on the scale and translate registers. | Subv | 2018-06-04 | 1 | -12/+28 |
|/ | | | | This is how nouveau calculates the viewport width and height. For some reason some games set 0xFFFF in the VIEWPORT_HORIZ and VIEWPORT_VERT registers, maybe those are a misnomer and actually refer to something else? | ||||
* | Merge pull request #500 from Subv/long_queries | bunnei | 2018-06-04 | 1 | -9/+24 |
|\ | | | | | GPU: Partial implementation of long GPU queries. | ||||
| * | GPU: Partial implementation of long GPU queries. | Subv | 2018-06-04 | 1 | -9/+24 |
| | | | | | | | | | | | | | | | | Long queries write a 128-bit result value to memory, which consists of a 64 bit query value and a 64 bit timestamp. In this implementation, only select=Zero of the Crop unit is implemented, this writes the query sequence as a 64 bit value, and a 0u64 value for the timestamp, since we emulate an infinitely fast GPU. This specific type was hwtested, but more rigorous tests should be performed in the future for the other types. | ||||
* | | gl_shader_decompiler: Implement TEXS component mask. | bunnei | 2018-06-03 | 1 | -2/+16 |
| | | |||||
* | | Merge pull request #494 from bunnei/shader-tex | bunnei | 2018-06-03 | 1 | -0/+15 |
|\ \ | | | | | | | gl_shader_decompiler: Implement TEX, fixes for TEXS. | ||||
| * | | gl_shader_decompiler: Implement TEX instruction. | bunnei | 2018-06-01 | 1 | -0/+10 |
| | | | |||||
| * | | gl_shader_decompiler: Support multi-destination for TEXS. | bunnei | 2018-06-01 | 1 | -0/+5 |
| |/ | |||||
* / | gl_shader_decompiler: Implement RRO as a register move. | bunnei | 2018-06-03 | 1 | -3/+7 |
|/ | |||||
* | Merge pull request #489 from Subv/vertexid | bunnei | 2018-05-30 | 1 | -0/+4 |
|\ | | | | | Shaders: Implemented reading the gl_InstanceID and gl_VertexID variables in the vertex shader. | ||||
| * | Shaders: Implemented reading the gl_InstanceID and gl_VertexID variables in the vertex shader. | Subv | 2018-05-30 | 1 | -0/+4 |
| | | |||||
* | | gl_shader_decompiler: Partially implement F2F_R instruction. | bunnei | 2018-05-30 | 1 | -3/+3 |
|/ | |||||
* | shader_bytecode: Implement other variants of FMNMX. | bunnei | 2018-05-26 | 1 | -3/+7 |
| | |||||
* | Merge pull request #458 from Subv/fmnmx | bunnei | 2018-05-21 | 1 | -0/+5 |
|\ | | | | | Shaders: Implemented the FMNMX shader instruction. | ||||
| * | Shaders: Implemented the FMNMX shader instruction. | Subv | 2018-05-21 | 1 | -0/+5 |
| | | |||||
* | | ShadersDecompiler: Added decoding for the PSETP instruction. | Subv | 2018-05-19 | 1 | -0/+3 |
|/ | |||||
* | maxwell_3d: Reset vertex counts after drawing. | bunnei | 2018-04-29 | 1 | -0/+10 |
| | |||||
* | shader_bytecode: Add decoding for FMNMX instruction. | bunnei | 2018-04-29 | 1 | -0/+2 |
| | |||||
* | Merge pull request #416 from bunnei/shader-ints-p3 | bunnei | 2018-04-29 | 1 | -8/+25 |
|\ | | | | | gl_shader_decompiler: Implement MOV32I, partially implement I2I, I2F | ||||
| * | gl_shader_decompiler: Partially implement I2I_R, and I2F_R. | bunnei | 2018-04-29 | 1 | -8/+8 |
| | | |||||
| * | shader_bytecode: Add decodings for i2i instructions. | bunnei | 2018-04-29 | 1 | -3/+20 |
| | | |||||
| * | gl_shader_decompiler: Implement MOV32_IMM instruction. | bunnei | 2018-04-29 | 1 | -2/+2 |
| | | |||||
* | | fermi_2d: Fix surface copy block height. | bunnei | 2018-04-29 | 2 | -2/+7 |
|/ | |||||
* | general: Convert assertion macros over to be fmt-compatible | Lioncash | 2018-04-27 | 1 | -2/+2 |
| | |||||
* | gl_shader_decompiler: Boilerplate for handling integer instructions. | bunnei | 2018-04-26 | 1 | -1/+9 |
| | |||||
* | Merge pull request #396 from Subv/shader_ops | bunnei | 2018-04-26 | 1 | -8/+35 |
|\ | | | | | Shaders: Implemented the FSET instruction. | ||||
| * | Shaders: Added bit decodings for the I2I instruction. | Subv | 2018-04-25 | 1 | -0/+6 |
| | | |||||
| * | Shaders: Added decodings for the FSET instructions. | Subv | 2018-04-25 | 1 | -8/+29 |
| | | |||||
* | | GPU: Partially implemented the Fermi2D surface copy operation. | Subv | 2018-04-25 | 2 | -0/+59 |
| | | | | | | | | | | The hardware allows for some rather complicated operations to be performed on the data during the copy; this is not implemented. Only same-format same-size raw copies are implemented for now. | ||||
* | | GPU: Added surface copy registers to Fermi2D | Subv | 2018-04-25 | 1 | -1/+57 |
| | | |||||
* | | GPU: Added boilerplate code for the Fermi2D engine | Subv | 2018-04-25 | 2 | -2/+33 |
| | | |||||
* | | GPU: Reduce the number of registers of Maxwell3D to 0xE00. | Subv | 2018-04-25 | 2 | -5/+5 |
| | | | | | | | | The rest are just macro shim registers. | ||||
* | | GPU: Move the Maxwell3D macro uploading code to the inside of the Maxwell3D processor. | Subv | 2018-04-25 | 2 | -8/+23 |
| | | | | | | | | It doesn't belong in the PFIFO handler. | ||||
* | | video-core: Move logging macros over to new fmt-capable ones | Lioncash | 2018-04-25 | 1 | -2/+2 |
|/ | |||||
* | memory_manager: Make GpuToCpuAddress return an optional. | bunnei | 2018-04-24 | 1 | -10/+11 |
| | |||||
* | memory_manager: Use GPUVAddr, not PAddr, for GPU addresses. | bunnei | 2018-04-24 | 1 | -6/+5 |
| | |||||
* | Merge pull request #386 from Subv/gpu_query | bunnei | 2018-04-24 | 2 | -2/+53 |
|\ | | | | | GPU: Added asserts to our code for handling the QUERY_GET GPU command. | ||||
| * | GPU: Added asserts to our code for handling the QUERY_GET GPU command. | Subv | 2018-04-24 | 2 | -2/+53 |
| | | | | | | | | | | This is based on research from nouveau. Many things are currently unknown and will require hwtests in the future. This commit also stubs QueryMode::Write2 to do the same as Write. Nouveau code treats them interchangeably; it is currently unknown what the difference is. | ||||
* | | GPU: Support multiple enabled vertex arrays. | Subv | 2018-04-23 | 1 | -0/+5 |
|/ | | | | | | The vertex arrays will be copied to the stream buffer one after the other, and the attributes will be set using the ARB_vertex_attrib_binding extension. yuzu now thus requires OpenGL 4.3 or the ARB_vertex_attrib_binding extension. | ||||
* | shader_bytecode: Add several more instruction decodings. | bunnei | 2018-04-21 | 1 | -5/+52 |
| | |||||
* | shader_bytecode: Decode instructions based on bit strings. | bunnei | 2018-04-21 | 1 | -185/+172 |
| | |||||
* | ShaderGen: Implemented predicated instruction execution. | Subv | 2018-04-21 | 1 | -1/+5 |
| | | | | Each predicated instruction will be wrapped in an `if (predicate) { instruction_body; }` in the GLSL, where `predicate` is one of the predicate boolean variables previously set by fsetp. | ||||
* | ShaderGen: Implemented the fsetp instruction. | Subv | 2018-04-21 | 1 | -3/+40 |
| | | | | | | | | | | Predicate variables are now added to the generated shader code in the form of 'pX' where X is the predicate id. These predicate variables are initialized to false on shader startup and are set via the fsetp instructions. TODO: * Not all the comparison types are implemented. * Only the single-predicate version is implemented. | ||||
* | ShaderGen: Register id 255 is special and is hardcoded to return 0 (SR_ZERO). | Subv | 2018-04-20 | 1 | -0/+3 |
| | |||||
* | ShaderGen: Implemented the fmul32i shader instruction. | Subv | 2018-04-19 | 1 | -3/+14 |
| | |||||
* | gl_shader_gen: Support vertical/horizontal viewport flipping. (#347) | bunnei | 2018-04-18 | 1 | -1/+10 |
| | | | | | | * gl_shader_gen: Support vertical/horizontal viewport flipping. * fixup! gl_shader_gen: Support vertical/horizontal viewport flipping. | ||||
* | GPU: Pitch textures are now supported, don't assert when encountering them. | Subv | 2018-04-18 | 1 | -2/+3 |
| | |||||
* | Merge pull request #346 from bunnei/misc-gpu-improvements | bunnei | 2018-04-18 | 1 | -1/+2 |
|\ | | | | | Misc gpu improvements | ||||
| * | maxwell3d: Allow Texture2DNoMipmap as Texture2D. | bunnei | 2018-04-18 | 1 | -1/+2 |
| | | |||||
* | | Merge pull request #344 from bunnei/shader-decompiler-p2 | bunnei | 2018-04-18 | 1 | -10/+33 |
|\ \ | | | | | | | Shader decompiler changes part 2 | ||||
| * | | shader_bytecode: Make ctors constexpr and explicit. | bunnei | 2018-04-18 | 1 | -7/+7 |
| | | | |||||
| * | | gl_shader_decompiler: Implement FMUL/FADD/FFMA immediate instructions. | bunnei | 2018-04-17 | 1 | -0/+14 |
| | | | |||||
| * | | gl_shader_decompiler: Add support for TEXS instruction. | bunnei | 2018-04-17 | 1 | -5/+14 |
| |/ | |||||
* / | renderer_opengl: Implement BlendEquation and BlendFunc. | bunnei | 2018-04-18 | 2 | -4/+48 |
|/ | |||||
* | gl_rasterizer: Implement indexed vertex mode. | bunnei | 2018-04-17 | 2 | -2/+46 |
| | |||||
* | GPU: Added a function to determine whether a shader stage is enabled or not. | Subv | 2018-04-15 | 2 | -0/+24 |
| | |||||
* | shaders: Add NumTextureSamplers const, remove unused #pragma. | bunnei | 2018-04-15 | 1 | -2/+0 |
| | |||||
* | shaders: Address PR review feedback. | bunnei | 2018-04-14 | 1 | -1/+1 |
| | |||||
* | shaders: Fix GCC and clang build issues. | bunnei | 2018-04-14 | 1 | -3/+3 |
| | |||||
* | gl_shader_decompiler: Implement negate, abs, etc. and lots of cleanup. | bunnei | 2018-04-14 | 1 | -20/+39 |
| | |||||
* | shader_bytecode: Add FSETP and KIL to GetInfo. | bunnei | 2018-04-14 | 1 | -0/+3 |
| | |||||
* | shader_bytecode: Add SubOp decoding. | bunnei | 2018-04-14 | 1 | -0/+10 |
| | |||||
* | maxwell_3d: Make memory_manager public. | bunnei | 2018-04-14 | 1 | -2/+1 |
| | |||||
* | maxwell_3d: Fix shader_config decodings. | bunnei | 2018-04-14 | 1 | -6/+3 |
| | |||||
* | shader_bytecode: Add initial module for shader decoding. | bunnei | 2018-04-14 | 1 | -0/+297 |
| | |||||
* | GPU: Assert when finding a texture with a format type other than UNORM. | Subv | 2018-04-07 | 1 | -0/+2 |
| | |||||
* | GPU: Use the MacroInterpreter class to execute the GPU macros instead of HLEing them. | Subv | 2018-04-01 | 2 | -121/+13 |
| | |||||
* | GPU: Implemented a gpu macro interpreter. | Subv | 2018-04-01 | 2 | -0/+8 |
| | | | | | | The Ryujinx macro interpreter and envydis were used as reference. Macros are programs that are uploaded by the games during boot and can later be called by writing to their method id in a GPU command buffer. | ||||
* | gl_rasterizer: Add a SyncViewport method. | bunnei | 2018-03-27 | 1 | -0/+10 |
| | |||||
* | gl_rasterizer: Normalize vertex array data as appropriate. | bunnei | 2018-03-27 | 1 | -0/+4 |
| | |||||
* | maxwell_3d: Use names that match envytools for VertexType. | bunnei | 2018-03-27 | 1 | -8/+8 |
| | |||||
* | maxwell_3d: Add VertexAttribute struct and cleanup. | bunnei | 2018-03-27 | 1 | -121/+160 |
| | |||||
* | Maxwell3D: Call AccelerateDrawBatch on DrawArrays. | bunnei | 2018-03-27 | 1 | -1/+8 |
| | |||||
* | gl_rasterizer: Implement AnalyzeVertexArray. | bunnei | 2018-03-27 | 1 | -0/+35 |
| | |||||
* | maxwell: Add RenderTargetFormat enum. | bunnei | 2018-03-27 | 1 | -3/+4 |
| | |||||
* | GPU: Load the sampler info (TSC) when retrieving active textures. | Subv | 2018-03-26 | 2 | -21/+67 |
| | |||||
* | GPU: Make the debug_context variable a member of the frontend instead of a global. | Subv | 2018-03-25 | 1 | -11/+13 |
| | |||||
* | GPU: Added a function to retrieve the active textures for a shader stage. | Subv | 2018-03-24 | 2 | -50/+59 |
| | | | | TODO: A shader may not use all of these textures at the same time, shader analysis should be performed to determine which textures are actually sampled. | ||||
* | GPU: Implement the Incoming/FinishedPrimitiveBatch debug breakpoints. | Subv | 2018-03-24 | 1 | -0/+7 |
| | |||||
* | GPU: Implement the MaxwellCommandLoaded/Processed debug breakpoints. | Subv | 2018-03-24 | 1 | -0/+10 |
| | |||||
* | GPU: Added a method to unswizzle a texture without decoding it. | Subv | 2018-03-24 | 1 | -1/+1 |
| | | | | Allow unswizzling of DXT1 textures. | ||||
* | GPU: Preliminary work for texture decoding. | Subv | 2018-03-24 | 1 | -0/+45 |
| | |||||
* | GPU: Added viewport registers to Maxwell3D's reg structure. | Subv | 2018-03-24 | 1 | -1/+18 |
| | |||||
* | maxwell_3d: Add some format decodings and string helper functions. | bunnei | 2018-03-23 | 1 | -3/+107 |
| | |||||
* | GPU: Added vertex attribute format registers. | Subv | 2018-03-21 | 1 | -1/+14 |
| | |||||
* | GPU: Added registers for the number of vertices to render. | Subv | 2018-03-21 | 1 | -2/+13 |
| | |||||
* | Merge pull request #253 from Subv/rt_depth | Mat M | 2018-03-20 | 1 | -1/+48 |
|\ | | | | | GPU: Added registers for color and Z buffers. | ||||
| * | GPU: Added Z buffer registers to Maxwell3D's reg structure. | Subv | 2018-03-19 | 1 | -1/+17 |
| | | |||||
| * | GPU: Added the render target (RT) registers to Maxwell3D's reg structure. | Subv | 2018-03-19 | 1 | -1/+32 |
| | | |||||
* | | Clang Fixes | N00byKing | 2018-03-19 | 1 | -1/+2 |
| | | |||||
* | | Clean Warnings (?) | N00byKing | 2018-03-19 | 1 | -1/+1 |
|/ | |||||
* | GPU: Added the TSC registers to the Maxwell3D register structure. | Subv | 2018-03-19 | 1 | -1/+15 |
| | |||||
* | GPU: Added the TIC registers to the Maxwell3D register structure. | Subv | 2018-03-19 | 1 | -1/+16 |
| | |||||
* | GPU: Implement macro 0xE1A BindTextureInfoBuffer in HLE. | Subv | 2018-03-19 | 2 | -1/+29 |
| | | | | This macro simply sets the current CB_ADDRESS to the texture buffer address for the input shader stage. | ||||
* | GPU: Implement the BindStorageBuffer macro method in HLE. | Subv | 2018-03-18 | 2 | -1/+36 |
| | | | | | | This macro binds the SSBO Info Buffer as the current ConstBuffer. This buffer is usually bound to c0 during shader execution. Games seem to use this macro instead of directly writing the address for some reason. | ||||
* | GPU: Handle writes to the CB_DATA method. | Subv | 2018-03-18 | 2 | -0/+39 |
| | | | | | | Writing to this method will cause the written value to be stored in the currently-set ConstBuffer plus CB_POS. This method is usually used to upload uniforms or other shader-visible data. | ||||
* | GPU: Store uploaded GPU macros and keep track of the number of method parameters. | Subv | 2018-03-18 | 2 | -11/+24 |
| | |||||
* | GPU: Macros are specific to the Maxwell3D engine, so handle them internally. | Subv | 2018-03-18 | 6 | -31/+55 |
| | |||||
* | GPU: Renamed ShaderType to ShaderStage as that is less confusing. | Subv | 2018-03-18 | 2 | -19/+19 |
| | |||||
* | GPU: Store shader constbuffer bindings in the GPU state. | Subv | 2018-03-18 | 2 | -5/+61 |
| | |||||
* | GPU: Corrected some register offsets and removed superfluous macro registers. | Subv | 2018-03-18 | 1 | -9/+3 |
| | |||||
* | GPU: Make the SetShader macro call do the same as the real macro's code. | Subv | 2018-03-18 | 2 | -3/+44 |
| | | | | | | It'll now set the CB_SIZE, CB_ADDRESS and CB_BIND registers when it's called. Presumably this SetShader function is binding the constant shader uniforms to buffer 1 (c1[]). | ||||
* | GPU: Corrected the parameter documentation for the SetShader macro call. | Subv | 2018-03-17 | 2 | -11/+12 |
| | | | | | | Register 0xE24 is actually a macro that sets some shader parameters in the register structure. Macros are uploaded to the GPU at startup and have their own ISA; we'll probably write an interpreter for this in the future. | ||||
* | Merge pull request #242 from Subv/set_shader | bunnei | 2018-03-17 | 2 | -4/+38 |
|\ | | | | | GPU: Handle the SetShader method call (0xE24) and store the shader config. | ||||
| * | GPU: Handle the SetShader method call (0xE24) and store the shader config. | Subv | 2018-03-17 | 2 | -4/+38 |
| | | |||||
* | | GPU: Added the vertex array registers. | Subv | 2018-03-17 | 1 | -2/+33 |
|/ | |||||
* | Merge pull request #241 from Subv/gpu_method_call | bunnei | 2018-03-17 | 6 | -1/+56 |
|\ | | | | | GPU: Process command mode 5 (IncreaseOnce) differently from other commands | ||||
| * | GPU: Process command mode 5 (IncreaseOnce) differently from other commands. | Subv | 2018-03-17 | 6 | -1/+56 |
| | | | | | | | | | | | | Accumulate all arguments before calling the desired method. Note: Maybe we should do the same for the NonIncreasing mode? | ||||
* | | GPU: Assert that we get a 0 CODE_ADDRESS register in the 3D engine. | Subv | 2018-03-17 | 1 | -0/+8 |
| | | | | | | | | Shader address calculation depends on this value to some extent; we do not currently know what it being 0 entails. | ||||
* | | GPU: Added Maxwell registers for Shader Program control. | Subv | 2018-03-17 | 1 | -2/+55 |
|/ | |||||
* | GPU: Intercept writes to the VERTEX_END_GL register. | Subv | 2018-03-05 | 2 | -1/+18 |
| | | | | | | This is the register that gets written after a game calls DrawArrays(). We should collect all GPU state and draw using our graphics API here. | ||||
* | maxwell_3d: Make constructor explicit | Lioncash | 2018-02-14 | 1 | -1/+1 |
| | |||||
* | GPU: Partially implemented the QUERY_* registers in the Maxwell3D engine. | Subv | 2018-02-12 | 2 | -2/+94 |
| | | | | Only QueryMode::Write is supported at the moment. | ||||
* | Make a GPU class in VideoCore to contain the GPU state. | Subv | 2018-02-12 | 6 | -18/+24 |
| | | | | Also moved the GPU MemoryManager class to video_core since it makes more sense for it to be there. | ||||
* | GPU: Added a command processor to decode the GPU pushbuffers and forward the commands to their respective engines. | Subv | 2018-02-12 | 6 | -0/+99 |