Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | SPIR-V: Merge two ifs in EmitGetAttribute | Valeri | 2021-08-19 | 1 | -6/+2 |
| | |||||
* | Merge pull request #6767 from ReinUsesLisp/fold-float-pack | Morph | 2021-07-30 | 1 | -0/+4 |
|\ | | | | | shader: Fold UnpackFloat2x16 and PackFloat2x16 | ||||
| * | shader: Fold UnpackFloat2x16 and PackFloat2x16 | ReinUsesLisp | 2021-07-30 | 1 | -0/+4 |
| | | | | | | | | | | Simplifies the code a bit when possible. These instructions should be no-ops codegen wise. | ||||
* | | Merge pull request #6722 from ReinUsesLisp/xmad-opts | bunnei | 2021-07-30 | 2 | -14/+195 |
|\ \ | |/ |/| | shader: Fold integer FMA from Nvidia's pattern | ||||
| * | shader: Fold integer FMA from Nvidia's pattern | ReinUsesLisp | 2021-07-26 | 1 | -0/+175 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Fold shaders doing "a * b + c" on integers from the pattern generated by Nvidia's GL compiler. On a somewhat complex compute shader it reduces the code size by 16 instructions from 2 matches on Turing GPUs. On Intel as extracted from KHR_pipeline_executable_properties: Before the optimization: ``` Instruction Count: 2057 Basic Block Count: 45 Scratch Memory Size: 14752 Spill Count: 232 Fill Count: 261 SEND Count: 610 Cycle Count: 11325 ``` After the optimization: ``` Instruction Count: 2046 Basic Block Count: 44 Scratch Memory Size: 13728 Spill Count: 219 Fill Count: 268 SEND Count: 604 Cycle Count: 11367 ``` | ||||
| * | shader: Use TryInstRecursive on XMAD multiply folding | ReinUsesLisp | 2021-07-26 | 1 | -14/+12 |
| | | | | | | | | Simplify a bit the logic. | ||||
| * | shader: Add TryInstRecursive utility to values | ReinUsesLisp | 2021-07-26 | 1 | -0/+8 |
| | | |||||
* | | shader: Mark ConvertF16F32 and ConvertF32F16 as fp16 instructions | ReinUsesLisp | 2021-07-28 | 1 | -0/+2 |
| | | | | | | | | | | | | Fixes instances where fp16 types are not declared on SPIR-V but they are used. This shouldn't happen on master, as it's been uncovered by an additional optimization pass. | ||||
* | | exception: Make constructors explicit | Lioncash | 2021-07-27 | 1 | -4/+4 |
| | | | | | | | | Ensures that exception construction is always explicit. | ||||
* | | exception: Make what() member function nodiscard | Lioncash | 2021-07-27 | 1 | -1/+1 |
| | | |||||
* | | exception: Narrow down specific header | Lioncash | 2021-07-27 | 1 | -1/+1 |
| | | | | | | | | | | We can use the <exception> header instead of pulling in all of the exception-style classes. | ||||
* | | Merge pull request #6724 from lioncash/nodisc-shader | Rodrigo Locatti | 2021-07-26 | 2 | -4/+4 |
|\ \ | | | | | | | shader_recompiler: Remove unnecessary [[nodiscard]] instances | ||||
| * | | shader_recompiler: Remove unnecessary [[nodiscard]] instances | Lioncash | 2021-07-26 | 2 | -4/+4 |
| |/ | | | | | | | | | [[nodiscard]] doesn't do anything on functions with a void return type and causes superfluous warnings. | ||||
* | | Merge pull request #6726 from lioncash/hguard | Rodrigo Locatti | 2021-07-26 | 1 | -0/+2 |
|\ \ | | | | | | | emit_spirv_instructions: Add missing header guard | ||||
| * | | emit_spirv_instructions: Add missing header guard | Lioncash | 2021-07-26 | 1 | -0/+2 |
| |/ | |||||
* | | Merge pull request #6727 from lioncash/topology | Rodrigo Locatti | 2021-07-26 | 1 | -1/+1 |
|\ \ | | | | | | | emit_glasm: Fix LINESS_ADJACENCY typo in InputPrimitive() | ||||
| * | | emit_glasm: Fix LINESS_ADJACENCY typo in InputPrimitive() | Lioncash | 2021-07-26 | 1 | -1/+1 |
| |/ | | | | | | | This should be LINES_ADJACENCY | ||||
* | | Merge pull request #6723 from lioncash/shader | Rodrigo Locatti | 2021-07-26 | 1 | -0/+1 |
|\ \ | | | | | | | object_pool: Add missing return in Chunk move assignment operator | ||||
| * | | object_pool: Add missing return in Chunk move assignment operator | Lioncash | 2021-07-26 | 1 | -0/+1 |
| |/ | | | | | | | Prevents undefined behavior from occurring. | ||||
* / | control_flow: Fix duplicate switch case in OpcodeToken | Lioncash | 2021-07-26 | 1 | -1/+1 |
|/ | | | | This previously duplicated the case of the PBK case above it. | ||||
* | shader: Support out of bound local memory reads and immediate writes | ReinUsesLisp | 2021-07-23 | 1 | -4/+21 |
| | | | | | | | | | Support ignoring immediate out of bound writes. Writing dynamically out of bounds is not yet supported (e.g. R0+0x4). Reading out of bounds yields zero. This is supported checking for the size from the IR; if the input is immediate, the optimization passes will drop it. | ||||
* | shader: Fix disabled attribute default values | ameerj | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | glsl: Simplify FCMP emission | ameerj | 2021-07-23 | 1 | -6/+4 |
| | |||||
* | glsl: Update TessellationControl gl_in | ameerj | 2021-07-23 | 1 | -0/+28 |
| | | | | Adheres to GL_ARB_separate_shader_objects requirements | ||||
* | shader: Implement ISETP.X | ameerj | 2021-07-23 | 4 | -44/+57 |
| | |||||
* | shader: Avoid usage of C++20 ranges to build in clang | ReinUsesLisp | 2021-07-23 | 11 | -39/+47 |
| | |||||
* | glsl: Clamp shared mem size to GL_MAX_COMPUTE_SHARED_MEMORY_SIZE | ameerj | 2021-07-23 | 2 | -2/+11 |
| | |||||
* | shader_recompiler, video_core: Resolve clang errors | lat9nq | 2021-07-23 | 13 | -41/+39 |
| | | | | | | | | | | Silences the following warnings-turned-errors: -Wsign-conversion -Wunused-private-field -Wbraced-scalar-init -Wunused-variable And some other errors | ||||
* | shader: Manually convert from array<u32> to bitset instead of using bit_cast | ReinUsesLisp | 2021-07-23 | 1 | -2/+3 |
| | |||||
* | glsl: Fix tracking of info.uses_shadow_lod | ameerj | 2021-07-23 | 1 | -4/+4 |
| | |||||
* | shader: Ignore global memory ops on devices lacking int64 support | ameerj | 2021-07-23 | 6 | -30/+77 |
| | |||||
* | dual_vertex_pass: Clang format | ameerj | 2021-07-23 | 1 | -14/+14 |
| | |||||
* | emit_spirv: Workaround VK_KHR_shader_float_controls on fp16 Nvidia | ReinUsesLisp | 2021-07-23 | 2 | -5/+10 |
| | | | | Fix regression on Fire Emblem: Three Houses when using native fp16. | ||||
* | shader: GCC fmt 8.0.0 fixes | lat9nq | 2021-07-23 | 7 | -16/+19 |
| | |||||
* | shader: Account for 33-bit IADD3 scenario | ameerj | 2021-07-23 | 1 | -2/+10 |
| | |||||
* | shader: Only apply shift on register mode for IADD3 | ReinUsesLisp | 2021-07-23 | 1 | -10/+14 |
| | |||||
* | shader: Fix disabled and unwritten attributes and varyings | ReinUsesLisp | 2021-07-23 | 2 | -3/+11 |
| | |||||
* | glsl: Fix shared and local memory declarations | ameerj | 2021-07-23 | 1 | -3/+3 |
| | | | | account for the fact that program.*memory_size is in units of bytes. | ||||
* | opengl: Implement LOP.CC | ameerj | 2021-07-23 | 2 | -6/+38 |
| | | | | Used by MH:Rise | ||||
* | spirv: Fix code emission when descriptor aliasing is unsupported | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | | | | Fixes OpenGL. | ||||
* | glsl: Declare local memory in main | ameerj | 2021-07-23 | 1 | -3/+3 |
| | |||||
* | glsl: Add passthrough geometry shader support | ameerj | 2021-07-23 | 3 | -7/+27 |
| | |||||
* | shader: Use std::bit_cast instead of Common::BitCast for passthrough | ReinUsesLisp | 2021-07-23 | 1 | -2/+3 |
| | |||||
* | glasm: Add passthrough geometry shader support | ReinUsesLisp | 2021-07-23 | 2 | -7/+26 |
| | |||||
* | shader: Rework varyings and implement passthrough geometry shaders | ReinUsesLisp | 2021-07-23 | 22 | -316/+302 |
| | | | | | | Put all varyings into a single std::bitset with helpers to access it. Implement passthrough geometry shaders using host's. | ||||
* | shader: Only verify shader when graphics debugging is enabled | ReinUsesLisp | 2021-07-23 | 1 | -2/+7 |
| | |||||
* | shader: Unify shader stage types | ReinUsesLisp | 2021-07-23 | 1 | -2/+9 |
| | |||||
* | lower_int64_to_int32: Add missing include | lat9nq | 2021-07-23 | 1 | -0/+1 |
| | |||||
* | shader: Emulate 64-bit integers when not supported | ReinUsesLisp | 2021-07-23 | 1 | -0/+3 |
| | | | | Useful for mobile and Intel Xe devices. | ||||
* | shader: Add int64 to int32 lowering pass | ReinUsesLisp | 2021-07-23 | 3 | -0/+218 |
| | |||||
* | shader: Teach global memory base tracker to follow vectors | ReinUsesLisp | 2021-07-23 | 1 | -15/+14 |
| | |||||
* | shader: Add constant propagation to integer vectors | ReinUsesLisp | 2021-07-23 | 1 | -0/+9 |
| | |||||
* | glsl: Better IAdd Overflow CC fix | ameerj | 2021-07-23 | 2 | -11/+13 |
| | | | | This ensures the original operand values are not overwritten when being used in the overflow detection. | ||||
* | shader: Remove IAbs64 | ReinUsesLisp | 2021-07-23 | 9 | -26/+3 |
| | |||||
* | glsl: Fix IADD CC | ameerj | 2021-07-23 | 2 | -5/+7 |
| | |||||
* | shader_recompiler: Fix IADD3 input partitioning | ameerj | 2021-07-23 | 1 | -14/+13 |
| | |||||
* | shader: Move loop safety tests to code emission | ReinUsesLisp | 2021-07-23 | 16 | -108/+54 |
| | |||||
* | glsl: Remove frag color initialization | ameerj | 2021-07-23 | 1 | -9/+0 |
| | |||||
* | glasm: Implement SetAttribute ViewportMask | ameerj | 2021-07-23 | 2 | -1/+10 |
| | |||||
* | emit_glsl_special: Skip initialization of frag_color0 | ameerj | 2021-07-23 | 1 | -1/+1 |
| | | | | Fixes rendering in Devil May Cry without regressing Ori and the Blind Forest. | ||||
* | shader: Calibrate loop safety threshold | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | glsl: Add missing ; in EmitSetSampleMask | Morph | 2021-07-23 | 1 | -1/+1 |
| | | | | Fixes shader compilation in Okami HD | ||||
* | glsl: Fix output varying initialization when transform feedback is used | ameerj | 2021-07-23 | 1 | -3/+37 |
| | |||||
* | texture_pass: Fix is_read image qualification | ameerj | 2021-07-23 | 1 | -1/+1 |
| | | | | Atomic operations are considered to have both read and write access. This was not being accounted for. | ||||
* | shader: Align constant buffer sizes to 16 bytes | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | | | | WAR for AMD reading zeroes on uniform buffers of size 2. | ||||
* | spirv: Properly handle devices without int8 and int16 | ReinUsesLisp | 2021-07-23 | 2 | -39/+67 |
| | |||||
* | spirv: Handle small storage buffer loads on devices with no support | ReinUsesLisp | 2021-07-23 | 2 | -6/+6 |
| | |||||
* | glsl: Fix cbuf component indexing bug falback | ameerj | 2021-07-23 | 1 | -7/+6 |
| | |||||
* | shader: Simplify MergeDualVertexPrograms | ReinUsesLisp | 2021-07-23 | 1 | -6/+4 |
| | |||||
* | shader: Properly manage attributes not written from previous stages | ReinUsesLisp | 2021-07-23 | 10 | -36/+40 |
| | |||||
* | glsl: Only declare fragment outputs on fragment shaders | ReinUsesLisp | 2021-07-23 | 1 | -4/+6 |
| | |||||
* | shader: Split profile and runtime info headers | ReinUsesLisp | 2021-07-23 | 11 | -76/+91 |
| | |||||
* | shader: Add support for native 16-bit floats | ReinUsesLisp | 2021-07-23 | 4 | -4/+26 |
| | |||||
* | shader: Rename maxwell/program.h to translate_program.h | ReinUsesLisp | 2021-07-23 | 3 | -9/+4 |
| | |||||
* | glsl: Obey need_declared_frag_colors to declare and initialize all frag_color | ameerj | 2021-07-23 | 2 | -1/+10 |
| | | | | Fixes Ori and the blind forest title screen | ||||
* | glsl: Address rest of feedback | ameerj | 2021-07-23 | 7 | -21/+64 |
| | |||||
* | glsl: Move gl_Position/generic attribute initialization to EmitProlgue | ameerj | 2021-07-23 | 2 | -14/+12 |
| | |||||
* | glsl: Conditionally use fine/coarse derivatives based on device support | ameerj | 2021-07-23 | 3 | -4/+28 |
| | |||||
* | glsl: Cleanup/Address feedback | ameerj | 2021-07-23 | 9 | -28/+22 |
| | |||||
* | glsl: Add Shader_GLSL logging | ameerj | 2021-07-23 | 3 | -28/+32 |
| | |||||
* | glsl: Add LoopSafety instructions | ameerj | 2021-07-23 | 2 | -0/+10 |
| | |||||
* | glsl: Conditionally add EXT_texture_shadow_lod | ameerj | 2021-07-23 | 3 | -4/+15 |
| | |||||
* | glsl: Add stubs for sparse queries and variable aoffi when not supported | ameerj | 2021-07-23 | 4 | -13/+39 |
| | |||||
* | glsl: Implement legacy varyings | ameerj | 2021-07-23 | 6 | -8/+81 |
| | |||||
* | glsl: Minor cleanup | ameerj | 2021-07-23 | 2 | -19/+15 |
| | |||||
* | glsl: Fix Cbuf getters for F32 type | ameerj | 2021-07-23 | 1 | -12/+15 |
| | |||||
* | glsl: Add immediate index oob checking for Cbuf getters | ameerj | 2021-07-23 | 1 | -0/+16 |
| | |||||
* | glsl: Refactor GetCbuf functions to reduce code duplication | ameerj | 2021-07-23 | 1 | -104/+66 |
| | |||||
* | glsl: Address more feedback. Implement indexed texture reads | ameerj | 2021-07-23 | 5 | -111/+109 |
| | |||||
* | glsl: Remove Signed Integer variables | ameerj | 2021-07-23 | 8 | -43/+13 |
| | |||||
* | glsl: Address Rodrigo's feedback | ameerj | 2021-07-23 | 13 | -75/+87 |
| | |||||
* | glsl: Reorganize backend code, remove unneeded [[maybe_unused]] | ameerj | 2021-07-23 | 12 | -315/+251 |
| | |||||
* | glsl: Implement SampleId and SetSampleMask | ameerj | 2021-07-23 | 3 | -30/+35 |
| | | | | plus some minor refactoring of implementations | ||||
* | glsl: Add gl_PerVertex in for GS | ameerj | 2021-07-23 | 1 | -1/+2 |
| | |||||
* | glsl: Use existing tracking for enabling EXT_shader_image_load_formatted | ameerj | 2021-07-23 | 1 | -15/+1 |
| | |||||
* | glsl: Enable early fragment tests | ameerj | 2021-07-23 | 2 | -4/+7 |
| | |||||
* | glsl: Implement more attribute getters and setters | ameerj | 2021-07-23 | 2 | -12/+60 |
| | |||||
* | glsl: Implement fswzadd | ameerj | 2021-07-23 | 4 | -5/+44 |
| | | | | and wip nv thread shuffle impl | ||||
* | glsl: Implement indexed attribute loads | ameerj | 2021-07-23 | 5 | -29/+64 |
| | |||||
* | glsl: Conditionally add GL_ARB_sparse_texture2 | ameerj | 2021-07-23 | 1 | -2/+3 |
| | |||||
* | glsl: Conditionally use GL_EXT_shader_image_load_formatted | ameerj | 2021-07-23 | 1 | -2/+18 |
| | | | | Fix for SULD.D | ||||
* | glsl: Remove output generic indexing for geometry stage | ameerj | 2021-07-23 | 1 | -5/+3 |
| | |||||
* | glsl: Allow dynamic tracking of variable allocation | ameerj | 2021-07-23 | 3 | -21/+35 |
| | |||||
* | glsl: Implement barriers | ameerj | 2021-07-23 | 3 | -13/+21 |
| | |||||
* | glsl: Implement image atomics and set layer | ameerj | 2021-07-23 | 5 | -153/+202 |
| | | | | along with some more cleanup/oversight fixes | ||||
* | glsl: Fix image gather logic | ameerj | 2021-07-23 | 1 | -0/+4 |
| | |||||
* | glsl: Add cbuf access workaround for devices with component indexing bug | ameerj | 2021-07-23 | 2 | -51/+112 |
| | |||||
* | glsl: Use textureGrad fallback when EXT_texture_shadow_lod is unsupported | ameerj | 2021-07-23 | 3 | -8/+41 |
| | |||||
* | emit_glsl_image: Use immediate offsets when possible | ameerj | 2021-07-23 | 1 | -12/+33 |
| | |||||
* | glsl: Fix <32-bit SSBO writes | ameerj | 2021-07-23 | 4 | -50/+43 |
| | | | | and more cleanup | ||||
* | glsl: Cleanup and address feedback | ameerj | 2021-07-23 | 10 | -86/+69 |
| | |||||
* | glsl: Refactor Global memory functions | ameerj | 2021-07-23 | 2 | -71/+73 |
| | |||||
* | glsl: Increase NUM_VARS that can be allocated | ameerj | 2021-07-23 | 1 | -1/+1 |
| | | | | needed for HW:AoC. | ||||
* | glsl: Implement Load/WriteGlobal | ameerj | 2021-07-23 | 9 | -98/+185 |
| | | | | along with some other misc changes and fixes | ||||
* | glsl: Implement Images | ameerj | 2021-07-23 | 2 | -9/+74 |
| | |||||
* | glsl: skip gl_ViewportIndex write if device does not support it | ameerj | 2021-07-23 | 4 | -8/+17 |
| | |||||
* | glsl: Implement transform feedback | ameerj | 2021-07-23 | 3 | -13/+63 |
| | |||||
* | glsl: Yet another gl_ViewportIndex fix attempt | ameerj | 2021-07-23 | 1 | -3/+19 |
| | |||||
* | glsl: Add gl_ViewportIndex out attribute | ameerj | 2021-07-23 | 1 | -1/+3 |
| | |||||
* | emit_glsl_context_get_set: Remove unused function | lat9nq | 2021-07-23 | 1 | -4/+0 |
| | |||||
* | glsl: Fix precise variable declaration | ameerj | 2021-07-23 | 3 | -24/+25 |
| | | | | and add some more separation in the shader for better debugability when dumped | ||||
* | glsl: Implement tessellation shaders | ameerj | 2021-07-23 | 5 | -27/+146 |
| | |||||
* | glsl: Implement ImageGradient and other texture function variants | ameerj | 2021-07-23 | 2 | -32/+73 |
| | |||||
* | glsl: Fix atomic SSBO offsets | ameerj | 2021-07-23 | 4 | -67/+74 |
| | | | | and implement misc getters | ||||
* | glsl: Implement geometry shaders | ameerj | 2021-07-23 | 4 | -9/+62 |
| | |||||
* | glsl: Use NotImplemented macro with function name output | ameerj | 2021-07-23 | 10 | -104/+103 |
| | |||||
* | glsl: Implement gl_ViewportIndex | ameerj | 2021-07-23 | 3 | -5/+14 |
| | | | | SSBU now working | ||||
* | glsl: SHFL fix and prefer shift operations over divide in glsl shader | ameerj | 2021-07-23 | 5 | -63/+64 |
| | |||||
* | glsl: Implement precise fp variable allocation | ameerj | 2021-07-23 | 4 | -8/+67 |
| | |||||
* | HACK glsl: Write defaults to unused generic attributes | ameerj | 2021-07-23 | 2 | -2/+11 |
| | |||||
* | glsl: Fix ssbo indexing and name shadowing between shader stages | ameerj | 2021-07-23 | 3 | -77/+101 |
| | |||||
* | glsl: implement set clip distance | ameerj | 2021-07-23 | 2 | -0/+15 |
| | | | | and missed a diff in emit_glsl relating to var alloc ref counting | ||||
* | glsl: Rework var alloc to not assign unused results | ameerj | 2021-07-23 | 9 | -49/+91 |
| | |||||
* | glsl: Rework variable allocator to allow for variable reuse | ameerj | 2021-07-23 | 14 | -353/+482 |
| | |||||
* | glsl: Fix ATOM and implement ATOMS | ameerj | 2021-07-23 | 5 | -114/+136 |
| | |||||
* | glsl: Use gl_SubGroupInvocationARB | ameerj | 2021-07-23 | 2 | -8/+7 |
| | |||||
* | glsl: Implement VOTE for subgroup size potentially larger | ameerj | 2021-07-23 | 2 | -19/+36 |
| | |||||
* | glsl: Implement VOTE | ameerj | 2021-07-23 | 4 | -50/+64 |
| | |||||
* | glsl: Implement ST{LS} | ameerj | 2021-07-23 | 6 | -69/+106 |
| | |||||
* | glsl: Implement more instructions used by SMO | ameerj | 2021-07-23 | 1 | -3/+3 |
| | |||||
* | glsl: Implement more instructions used by SMO | ameerj | 2021-07-23 | 5 | -10/+16 |
| | |||||
* | glsl: Fix GetAttribute return values | ameerj | 2021-07-23 | 2 | -7/+9 |
| | | | | fixes font rendering issues as these were used to index into the ssbos | ||||
* | glsl: minor cleanup | ameerj | 2021-07-23 | 4 | -20/+19 |
| | |||||
* | glsl: Fix and implement rest of cbuf access | ameerj | 2021-07-23 | 1 | -7/+43 |
| | |||||
* | glsl: Implement TXQ and other misc changes | ameerj | 2021-07-23 | 5 | -6/+36 |
| | |||||
* | glsl: TLD4 implementation | ameerj | 2021-07-23 | 1 | -2/+89 |
| | |||||
* | glsl: Implement TLD instruction | ameerj | 2021-07-23 | 1 | -1/+55 |
| | |||||
* | glsl: Implement TEXS | ameerj | 2021-07-23 | 1 | -1/+29 |
| | |||||
* | glsl: Cleanup texture functions | ameerj | 2021-07-23 | 1 | -13/+11 |
| | |||||
* | shader_recompiler: GCC fixes | lat9nq | 2021-07-23 | 14 | -3/+13 |
| | |||||
* | glsl: Implement TEX depth functions | ameerj | 2021-07-23 | 2 | -4/+46 |
| | |||||
* | glsl: Implement TEX ImageSample functions | ameerj | 2021-07-23 | 3 | -11/+71 |
| | |||||
* | glsl: Rework Shuffle emit instructions to align with SPIR-V | ameerj | 2021-07-23 | 1 | -19/+40 |
| | |||||
* | glsl: Better Storage access and wip warps | ameerj | 2021-07-23 | 8 | -62/+133 |
| | |||||
* | glsl: Fix integer conversions, implement clamp CC | ameerj | 2021-07-23 | 2 | -27/+36 |
| | |||||
* | glsl: Implement IADD CC | ameerj | 2021-07-23 | 2 | -2/+17 |
| | |||||
* | glsl: SSBO access fixes and wip SampleExplicitLod implementation. | ameerj | 2021-07-23 | 2 | -4/+19 |
| | |||||
* | glsl: WIP var forward declaration | ameerj | 2021-07-23 | 6 | -49/+60 |
| | | | | to fix Loop control flow. | ||||
* | glsl: Fix bindings, add some CC ops | ameerj | 2021-07-23 | 8 | -57/+91 |
| | |||||
* | glsl: remove unused headers | ameerj | 2021-07-23 | 14 | -34/+10 |
| | |||||
* | glsl: Implement derivatives and YDirection | ameerj | 2021-07-23 | 8 | -81/+87 |
| | | | | plus some other misc additions/changed | ||||
* | glsl: Fix non-immediate buffer access | ameerj | 2021-07-23 | 12 | -72/+133 |
| | | | | and many other misc implementations | ||||
* | glsl: textures wip | ameerj | 2021-07-23 | 9 | -75/+139 |
| | |||||
* | glsl: Implement some attribute getters and setters | ameerj | 2021-07-23 | 9 | -191/+337 |
| | |||||
* | glsl: Track S32 atomics | ameerj | 2021-07-23 | 3 | -6/+16 |
| | |||||
* | glsl: Update phi node management | ameerj | 2021-07-23 | 4 | -21/+53 |
| | |||||
* | glsl: Fix floating point compare ops | ameerj | 2021-07-23 | 1 | -28/+28 |
| | | | | Logic for ordered/unordered ops was wrong. | ||||
* | glsl: Query GL Device for FP16 extension support | ameerj | 2021-07-23 | 2 | -2/+9 |
| | |||||
* | glsl: Simply FP storage atomics | ameerj | 2021-07-23 | 2 | -48/+28 |
| | |||||
* | glsl: F16x2 storage atomics | ameerj | 2021-07-23 | 7 | -58/+64 |
| | |||||
* | glsl: Revert ssbo aliasing. Storage Atomics impl | ameerj | 2021-07-23 | 5 | -75/+134 |
| | |||||
* | glsl: implement phi nodes | ameerj | 2021-07-23 | 4 | -20/+54 |
| | |||||
* | glsl: Wip storage atomic ops | ameerj | 2021-07-23 | 10 | -327/+414 |
| | |||||
* | glsl: Implement FCMP | ameerj | 2021-07-23 | 3 | -242/+185 |
| | |||||
* | glsl: Add a more robust fp formatter | ameerj | 2021-07-23 | 4 | -9/+14 |
| | |||||
* | glsl: More FP fixes | ameerj | 2021-07-23 | 2 | -9/+16 |
| | |||||
* | glsl: FP function fixes | ameerj | 2021-07-23 | 7 | -17/+25 |
| | |||||
* | glsl: More FP instructions/fixes | ameerj | 2021-07-23 | 5 | -28/+41 |
| | |||||
* | glsl: Add many FP32/64 instructions | ameerj | 2021-07-23 | 12 | -765/+1011 |
| | |||||
* | glsl: Implement more Integer ops | ameerj | 2021-07-23 | 3 | -119/+72 |
| | |||||
* | glsl: Implement BF* | ameerj | 2021-07-23 | 3 | -9/+10 |
| | |||||
* | glsl: Implement a few Integer instructions | ameerj | 2021-07-23 | 10 | -260/+398 |
| | |||||
* | glsl: Use std::string_view for Emit function args. | ameerj | 2021-07-23 | 6 | -760/+838 |
| | |||||
* | glsl: Pass IR::Inst& to Emit functions | ameerj | 2021-07-23 | 6 | -171/+169 |
| | |||||
* | glsl: INeg and IAdd negate tests | ameerj | 2021-07-23 | 3 | -94/+106 |
| | |||||
* | glsl: Reusable typed variables. IADD32 | ameerj | 2021-07-23 | 6 | -203/+311 |
| | |||||
* | glsl: Fix program linking and cbuf | ameerj | 2021-07-23 | 2 | -3/+5 |
| | |||||
* | glsl: Fix "reg" allocing | ameerj | 2021-07-23 | 10 | -898/+938 |
| | | | | based on glasm with some tweaks | ||||
* | glsl: Initial backend | ameerj | 2021-07-23 | 27 | -0/+3292 |
| | |||||
* | spirv: Reduce log severity of mismatching denorm rules | ReinUsesLisp | 2021-07-23 | 1 | -2/+2 |
| | |||||
* | shader: Fix loop safety to SSA pass | ReinUsesLisp | 2021-07-23 | 2 | -2/+4 |
| | |||||
* | shader: Add logging | ReinUsesLisp | 2021-07-23 | 13 | -28/+30 |
| | |||||
* | shader: Add shader loop safety check settings | lat9nq | 2021-07-23 | 9 | -33/+130 |
| | | | | Also add a setting for enable Nsight Aftermath. | ||||
* | shader: Comment why the array component is not read in TMML | ReinUsesLisp | 2021-07-23 | 1 | -0/+2 |
| | |||||
* | tmml: Remove index component from coords vec | ameerj | 2021-07-23 | 1 | -4/+3 |
| | | | | The lod query functions exposed by the rendering API's do not make use of the texturearray layer indexing. | ||||
* | spirv/convert: Catch more signed operations oversights | ameerj | 2021-07-23 | 1 | -5/+5 |
| | | | | The sign bit on integers of size < 32 was not properly preserved in casts | ||||
* | spirv/convert: Catch more broken signed operations on Nvidia OpenGL | ReinUsesLisp | 2021-07-23 | 1 | -0/+6 |
| | | | | | BitCast U32 to S32 before converting to float on drivers with broken signed operations. | ||||
* | shader_environment: Add shader_local_memory_crs_size to local memory size | ameerj | 2021-07-23 | 1 | -2/+2 |
| | | | | Fixes DOOM 2016 missing local memory | ||||
* | shader: Fix VertexA Shaders. | FernandoS27 | 2021-07-23 | 3 | -14/+30 |
| | |||||
* | shader: Add 2D and 3D variants to SUATOM and SURED | ReinUsesLisp | 2021-07-23 | 1 | -0/+4 |
| | | | | Used by Claybook. | ||||
* | shader: Avoid CPU side undefined behavior on I2F | ReinUsesLisp | 2021-07-23 | 1 | -0/+2 |
| | |||||
* | glasm: Use ARB_derivative_control conditionally | ReinUsesLisp | 2021-07-23 | 3 | -7/+30 |
| | |||||
* | buffer_cache: Reduce uniform buffer size from shader usage | ReinUsesLisp | 2021-07-23 | 2 | -3/+17 |
| | | | | Increases performance significantly on certain titles. | ||||
* | emit_glasm_context_get_set: Remove unused variable | lat9nq | 2021-07-23 | 1 | -1/+0 |
| | |||||
* | shader,glasm: Implement legacy texcoord loads | ReinUsesLisp | 2021-07-23 | 3 | -54/+29 |
| | |||||
* | glasm: Implement legacy varyings | ReinUsesLisp | 2021-07-23 | 1 | -17/+56 |
| | |||||
* | shader: Track legacy varyings | ReinUsesLisp | 2021-07-23 | 2 | -17/+105 |
| | |||||
* | shader: Add support for "negative" and unaligned offsets | ReinUsesLisp | 2021-07-23 | 3 | -8/+13 |
| | | | | | | | | | "Negative" offsets don't exist. They are shown as such due to a bug in nvdisasm. Unaligned offsets have been proved to read the aligned offset. For example, when reading an U32, if the offset is 6, the offset read will be 4. | ||||
* | shader: Implement ISCADD32I | ReinUsesLisp | 2021-07-23 | 1 | -17/+31 |
| | |||||
* | spirv: Fix output generics with components | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | opengl: Declare fragment outputs even if they are not used | ReinUsesLisp | 2021-07-23 | 4 | -10/+9 |
| | | | | | | Fixes Ori and the Blind Forest's menu on GLASM. For some reason (probably high level optimizations) it is not sanitized on SPIR-V for OpenGL. Vulkan is unaffected by this change. | ||||
* | shader: Always initialize up reference in structure control flow | ReinUsesLisp | 2021-07-23 | 1 | -31/+36 |
| | | | | Fixes ubsan issue. | ||||
* | shader: Fix ImageWrite indexing | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | spirv: Fix image and image buffer descriptor index usage | ReinUsesLisp | 2021-07-23 | 1 | -5/+7 |
| | |||||
* | glasm: Fix immediate texture coordinate | ReinUsesLisp | 2021-07-23 | 1 | -0/+1 |
| | |||||
* | shader: Clang-format secondary textures | ReinUsesLisp | 2021-07-23 | 1 | -2/+2 |
| | |||||
* | shader: Fix secondary textures | ReinUsesLisp | 2021-07-23 | 1 | -2/+2 |
| | |||||
* | shader: Fix TMML queries | ReinUsesLisp | 2021-07-23 | 1 | -5/+9 |
| | |||||
* | shader: Fix FSwizzleAdd folding when going through phi nodes | ReinUsesLisp | 2021-07-23 | 1 | -2/+2 |
| | |||||
* | shader/exception: Fix compilation errors on gcc | ReinUsesLisp | 2021-07-23 | 1 | -6/+6 |
| | |||||
* | glasm: Reduce reg allocation leaks from an exception to a log | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Handle host exceptions | ReinUsesLisp | 2021-07-23 | 4 | -13/+43 |
| | |||||
* | glasm: Use integer lod for TXQ | ReinUsesLisp | 2021-07-23 | 2 | -2/+2 |
| | |||||
* | glasm: Fix global memory fallbacks | ReinUsesLisp | 2021-07-23 | 1 | -9/+10 |
| | |||||
* | Revert "glasm: Skip phi moves on undefined instructions" | ReinUsesLisp | 2021-07-23 | 2 | -16/+1 |
| | | | | Causes regressions on Bowser's Fury. | ||||
* | glasm: Remove unintentional '\n' on Undef32 | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | glasm: Use storage buffers instead of global memory when possible | ReinUsesLisp | 2021-07-23 | 6 | -370/+383 |
| | |||||
* | glasm: Implement Y direction | ReinUsesLisp | 2021-07-23 | 4 | -3/+9 |
| | |||||
* | glasm: Skip phi moves on undefined instructions | ReinUsesLisp | 2021-07-23 | 2 | -1/+16 |
| | |||||
* | glasm: Implement undef instructions | ReinUsesLisp | 2021-07-23 | 2 | -15/+15 |
| | |||||
* | glasm: Fix global memory callbacks | ReinUsesLisp | 2021-07-23 | 1 | -5/+6 |
| | |||||
* | video_core,shader: Clang-format fixes | ReinUsesLisp | 2021-07-23 | 2 | -2/+2 |
| | |||||
* | glasm: Release phi node registers after they are no longer needed | ReinUsesLisp | 2021-07-23 | 2 | -38/+54 |
| | |||||
* | glasm: Remove unintentionally committed fmt::prints | ReinUsesLisp | 2021-07-23 | 1 | -2/+0 |
| | |||||
* | glasm: Fix INeg32 on negative immediates | ReinUsesLisp | 2021-07-23 | 1 | -1/+5 |
| | |||||
* | glasm: Remove unnecessary value types | ReinUsesLisp | 2021-07-23 | 3 | -47/+6 |
| | |||||
* | glasm: Throw when there are register leaks | ReinUsesLisp | 2021-07-23 | 2 | -0/+7 |
| | |||||
* | glasm: Catch more register leaks | ReinUsesLisp | 2021-07-23 | 8 | -41/+114 |
| | | | | | | | | | | | | | Add support for null registers. These are used when an instruction has no usages. This comes handy when an instruction is only used for its CC value, with the caveat of having to invalidate all pseudo-instructions before defining the instruction itself in the register allocator. This commits changes this. Workaround a bug on Nvidia's condition codes conditional execution using branches. | ||||
* | glasm: Fix usage counting on phi nodes | ReinUsesLisp | 2021-07-23 | 3 | -8/+22 |
| | |||||
* | glasm: Implement global memory fallbacks | ReinUsesLisp | 2021-07-23 | 2 | -50/+89 |
| | |||||
* | glasm: Implement int64 add and subtract | ReinUsesLisp | 2021-07-23 | 2 | -8/+6 |
| | |||||
* | emit_glasm_context_get_set: Remove unused variable | lat9nq | 2021-07-23 | 1 | -1/+0 |
| | |||||
* | glasm: Implement indirect attribute loads | ReinUsesLisp | 2021-07-23 | 4 | -6/+65 |
| | |||||
* | glasm: Implement image atomics | ReinUsesLisp | 2021-07-23 | 3 | -166/+153 |
| | |||||
* | glasm: Reorder unreachable image atomic insts | ReinUsesLisp | 2021-07-23 | 1 | -66/+66 |
| | | | | Reorder them to the bottom of the file for readability. | ||||
* | glasm: Implement gl_Layer stores | ReinUsesLisp | 2021-07-23 | 1 | -0/+7 |
| | |||||
* | glasm: Implement SampleId | ReinUsesLisp | 2021-07-23 | 2 | -3/+3 |
| | |||||
* | glasm: Implement IsHelperInvocation | ReinUsesLisp | 2021-07-23 | 2 | -3/+3 |
| | |||||
* | glasm: Fix EmitVertex's optimization | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | gl_shader_cache,glasm: Conditionally use typeless image reads extension | ReinUsesLisp | 2021-07-23 | 1 | -2/+4 |
| | |||||
* | glasm: Implement forced early Z | ReinUsesLisp | 2021-07-23 | 1 | -2/+6 |
| | |||||
* | glasm: Simplify patch reads | ReinUsesLisp | 2021-07-23 | 1 | -5/+2 |
| | |||||
* | glasm: Fix output patch reads | ReinUsesLisp | 2021-07-23 | 2 | -13/+22 |
| | | | | With this, Luigi's Mansion's sand renders properly. | ||||
* | shader: Split profile and runtime information in separate structs | ReinUsesLisp | 2021-07-23 | 11 | -71/+88 |
| | |||||
* | emit_glasm_context_get_and_set.cpp: Add missing semicolons | ameerj | 2021-07-23 | 1 | -2/+2 |
| | |||||
* | glasm: Fix patch attribute declarations | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | glasm: Implement FSWZADD | ameerj | 2021-07-23 | 3 | -4/+28 |
| | |||||
* | glasm: Implement PrimitiveId attribute read | ReinUsesLisp | 2021-07-23 | 1 | -0/+3 |
| | |||||
* | glasm: Implement clip distance stores | ReinUsesLisp | 2021-07-23 | 2 | -0/+15 |
| | |||||
* | glasm: Fix tessellation input attributes | ReinUsesLisp | 2021-07-23 | 1 | -2/+5 |
| | |||||
* | glasm: Add missing semicolon on tesscoord reading | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | glasm: Fix tessellation headers | ReinUsesLisp | 2021-07-23 | 1 | -2/+2 |
| | |||||
* | glasm: Add tessellation shader declarations | ReinUsesLisp | 2021-07-23 | 1 | -0/+35 |
| | |||||
* | glasm: Implement TessellationEvaluationPoint | ReinUsesLisp | 2021-07-23 | 1 | -0/+4 |
| | |||||
* | glasm: Implement patch memory | ReinUsesLisp | 2021-07-23 | 3 | -6/+51 |
| | |||||
* | glasm: Fix InvocationId declaration | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | glasm: Implement InvocationId | ReinUsesLisp | 2021-07-23 | 2 | -2/+5 |
| | |||||
* | glasm: Optimize EmitVertex into EMIT | ReinUsesLisp | 2021-07-23 | 1 | -1/+5 |
| | |||||
* | glasm: Implement geometry shader attribute reads | ReinUsesLisp | 2021-07-23 | 2 | -4/+18 |
| | |||||
* | glasm: Properly declare attributes on geometry programs | ReinUsesLisp | 2021-07-23 | 3 | -6/+14 |
| | |||||
* | glasm: Declare geometry program headers | ReinUsesLisp | 2021-07-23 | 1 | -0/+35 |
| | |||||
* | glasm: Fix potential aliasing bug on cube array samples | ReinUsesLisp | 2021-07-23 | 2 | -35/+44 |
| | |||||
* | glasm: Implement ImageWrite | ReinUsesLisp | 2021-07-23 | 1 | -4/+7 |
| | |||||
* | glasm: Implement ImageRead | ReinUsesLisp | 2021-07-23 | 4 | -4/+56 |
| | |||||
* | glasm: Implement EmitVertex and EndPrimitive | ReinUsesLisp | 2021-07-23 | 2 | -4/+8 |
| | |||||
* | glasm: Implement ImageGradient | ReinUsesLisp | 2021-07-23 | 2 | -7/+65 |
| | |||||
* | glasm: Implement 64-bit shifts | ReinUsesLisp | 2021-07-23 | 2 | -12/+14 |
| | |||||
* | glasm: Implement barriers | ReinUsesLisp | 2021-07-23 | 1 | -3/+3 |
| | |||||
* | glasm: Fix compute stage name | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | glasm: Fix phi instruction types | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | glasm: Implement PREC on relevant instructions | ReinUsesLisp | 2021-07-23 | 1 | -6/+12 |
| | |||||
* | glasm: Implement stores to gl_ViewportIndex | ReinUsesLisp | 2021-07-23 | 4 | -7/+29 |
| | |||||
* | glasm: Implement gl_PointSize stores | ReinUsesLisp | 2021-07-23 | 1 | -0/+3 |
| | |||||
* | glasm: Implement gl_PointCoord | ReinUsesLisp | 2021-07-23 | 1 | -0/+4 |
| | |||||
* | glasm: Implement ImageQueryLod | ReinUsesLisp | 2021-07-23 | 1 | -3/+5 |
| | |||||
* | glasm: Implement ImageFetch | ReinUsesLisp | 2021-07-23 | 4 | -13/+38 |
| | |||||
* | glasm: Implement IADD.CC | ameerj | 2021-07-23 | 1 | -1/+26 |
| | |||||
* | glasm: Implement BFE.CC | ReinUsesLisp | 2021-07-23 | 1 | -0/+8 |
| | |||||
* | glasm: Implement SelectU1 | ReinUsesLisp | 2021-07-23 | 2 | -4/+5 |
| | |||||
* | glasm: Implement gl_WorkGroupID | ReinUsesLisp | 2021-07-23 | 2 | -3/+3 |
| | |||||
* | glasm: Implement TXQ and improve texture info reads | ReinUsesLisp | 2021-07-23 | 2 | -50/+51 |
| | |||||
* | glasm: Implement gl_FrongFacing attribute | ReinUsesLisp | 2021-07-23 | 1 | -0/+3 |
| | |||||
* | glasm: Support textures used in more than one stage | ReinUsesLisp | 2021-07-23 | 3 | -4/+24 |
| | |||||
* | glasm: Implement textureGather instructions | ReinUsesLisp | 2021-07-23 | 2 | -15/+97 |
| | |||||
* | glasm: Implement gl_FragDepth and gl_SampleMask stores | ReinUsesLisp | 2021-07-23 | 2 | -5/+5 |
| | |||||
* | glasm: Do not alias ConditionRef for now | ReinUsesLisp | 2021-07-23 | 2 | -3/+2 |
| | | | | | Immediate condition refs where not handled correctly. Just move the value for now. | ||||
* | shader: Read branch conditions from an instruction | ReinUsesLisp | 2021-07-23 | 12 | -16/+36 |
| | | | | Fixes the identity removal pass. | ||||
* | glasm: Implement InstanceId and VertexId | ReinUsesLisp | 2021-07-23 | 1 | -0/+6 |
| | |||||
* | glasm: Add missing return value on move assignment | ReinUsesLisp | 2021-07-23 | 1 | -0/+1 |
| | |||||
* | glasm: Fix aliased bitcasts ref counting | ReinUsesLisp | 2021-07-23 | 3 | -13/+42 |
| | |||||
* | glasm: Remove unintentional comma on vector insert | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | glasm: Implement TEX and TEXS instructions | ReinUsesLisp | 2021-07-23 | 10 | -69/+275 |
| | | | | | Remove lod clamp from texture instructions with lod, as this is not needed (nor supported). | ||||
* | glasm: Add support for non-2D texture samples | ReinUsesLisp | 2021-07-23 | 1 | -4/+26 |
| | |||||
* | glasm: Reorder unreachable image instructions to the bottom | ReinUsesLisp | 2021-07-23 | 1 | -97/+97 |
| | |||||
* | glasm: Add support for texture offsets | ReinUsesLisp | 2021-07-23 | 1 | -11/+15 |
| | |||||
* | glasm: Improve texture sampling instructions | ReinUsesLisp | 2021-07-23 | 2 | -50/+70 |
| | |||||
* | emit_glasm: Enable ARB_draw_buffers when needed | ReinUsesLisp | 2021-07-23 | 2 | -1/+5 |
| | |||||
* | emit_glasm: Add support for reading position attributes | ReinUsesLisp | 2021-07-23 | 1 | -3/+13 |
| | |||||
* | shader_recompiler: GCC fixes | lat9nq | 2021-07-23 | 7 | -58/+55 |
| | | | | | Fixes members of unnamed union not being accessible, and one function without a declaration. | ||||
* | glasm: Implement rest of shared mem | ameerj | 2021-07-23 | 2 | -35/+29 |
| | |||||
* | shader: Use a non-trivial dummy to construct ASL node union | ReinUsesLisp | 2021-07-23 | 1 | -1/+6 |
| | |||||
* | emit_spirv: Jump to loop body with local variable | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | | Silence unused variable warning | ||||
* | glasm: Implement derivative instructions on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -12/+12 |
| | |||||
* | glasm: Initial (broken) implementation of TEX on GLASM | ReinUsesLisp | 2021-07-23 | 3 | -299/+386 |
| | |||||
* | glasm: Implement some graphics instructions on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -6/+5 |
| | |||||
* | glasm: Add Void type to GLASM values | ReinUsesLisp | 2021-07-23 | 3 | -0/+15 |
| | |||||
* | glasm: Add graphics specific shader declarations to GLASM | ReinUsesLisp | 2021-07-23 | 2 | -6/+63 |
| | |||||
* | glasm: Implement local memory for glasm | ameerj | 2021-07-23 | 4 | -9/+12 |
| | |||||
* | emit_spirv: Add missing block in case | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | |||||
* | glasm: Initial implementation of phi nodes on GLASM | ReinUsesLisp | 2021-07-23 | 12 | -25/+117 |
| | |||||
* | glasm: Write result to scalar on integer comparison instructions | ReinUsesLisp | 2021-07-23 | 1 | -10/+10 |
| | |||||
* | glasm: Declare NV_shader_thread_group when needed | ReinUsesLisp | 2021-07-23 | 1 | -3/+4 |
| | |||||
* | glasm: Rework control flow introducing a syntax list | ReinUsesLisp | 2021-07-23 | 33 | -505/+437 |
| | | | | | This commit regresses VertexA shaders, their transformation pass has to be adapted to the new control flow. | ||||
* | glasm: Implement Storage atomics | ameerj | 2021-07-23 | 5 | -109/+156 |
| | | | | | StorageAtomicExchangeU64 is failing test seemingly due to failure storing 64-bit result into the register | ||||
* | glasm: Ensure reg alloc order across compilers on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -11/+14 |
| | | | | | | | | | | | | | | | | | | Use a struct constructor to serialize register allocation arguments to ensure registers are allocated in the same order regardless of the compiler used. The A and B functions can be called in any order when passed as arguments to "foo": foo(A(), B()) But the order is guaranteed for curly-braced constructor calls in classes: Foo{A(), B()} Use this to get consistent behavior. | ||||
* | glasm: Enable unintentionally disabled register aliasing on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -16/+11 |
| | |||||
* | glasm: Review all GLASM insts to be aware of register aliasing | ReinUsesLisp | 2021-07-23 | 4 | -20/+51 |
| | |||||
* | glasm: Implement shuffle and vote instructions on GLASM | ReinUsesLisp | 2021-07-23 | 10 | -100/+166 |
| | |||||
* | glasm: Add MUFU instructions to GLASM | ReinUsesLisp | 2021-07-23 | 2 | -21/+22 |
| | |||||
* | glasm: Implement IAbs64 and INeg64 on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -6/+6 |
| | |||||
* | shader: Add floating-point rounding to I2F | ReinUsesLisp | 2021-07-23 | 3 | -35/+42 |
| | |||||
* | glasm: Properly clamp Fp64 on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -6/+6 |
| | |||||
* | glasm: Fix register allocation when moving immediate on GLASM | ReinUsesLisp | 2021-07-23 | 3 | -42/+89 |
| | |||||
* | glasm: Implement SelectU64 on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -4/+20 |
| | |||||
* | glasm: Fix clamps so the min value has priority on NAN on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -12/+15 |
| | |||||
* | glasm: Fix moving U64 immediates to registers in GLASM | ReinUsesLisp | 2021-07-23 | 2 | -3/+4 |
| | |||||
* | glasm: Implement storage atomic ops | ameerj | 2021-07-23 | 4 | -305/+358 |
| | |||||
* | glasm: Add conversion instructions to GLASM | ReinUsesLisp | 2021-07-23 | 9 | -282/+351 |
| | |||||
* | glasm: Add fp min/max insts and fix store for fp64 on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -10/+8 |
| | |||||
* | glasm: Add logical instructions on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -12/+12 |
| | |||||
* | glasm: Remove duplicated Fp64 pack instructions on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -8/+0 |
| | |||||
* | glasm: Remove unnecesary new white space on Clamp GLASM | ReinUsesLisp | 2021-07-23 | 1 | -4/+4 |
| | |||||
* | glasm: Add floating-point comparisons on GLASM | ReinUsesLisp | 2021-07-23 | 3 | -120/+116 |
| | |||||
* | emit_glasm: Implement more integer alu ops | ameerj | 2021-07-23 | 2 | -47/+41 |
| | |||||
* | glasm: Reimplement bitwise ops and BFI/BFE | ameerj | 2021-07-23 | 4 | -88/+108 |
| | |||||
* | glasm: Initial GLASM fp64 support | ReinUsesLisp | 2021-07-23 | 9 | -55/+152 |
| | |||||
* | glasm: Implement GLASM fp16 packing and move bitwise insns | ReinUsesLisp | 2021-07-23 | 4 | -66/+77 |
| | |||||
* | glasm: Remove unused functions left from rebase | ReinUsesLisp | 2021-07-23 | 1 | -12/+0 |
| | |||||
* | glasm: Specify namespace when using FormatTo | ReinUsesLisp | 2021-07-23 | 1 | -6/+6 |
| | |||||
* | glasm: Implement more GLASM composite instructions | ReinUsesLisp | 2021-07-23 | 2 | -54/+63 |
| | |||||
* | glasm: Make GLASM aware of types | ReinUsesLisp | 2021-07-23 | 12 | -1244/+1380 |
| | |||||
* | glasm: Use CMP.S for Select32 | ameerj | 2021-07-23 | 3 | -12/+8 |
| | | | | also fixes ADD and SUB to use U modifier | ||||
* | glasm: Implement more logical ops | ameerj | 2021-07-23 | 2 | -5/+5 |
| | |||||
* | glasm: Implement BFI, BFE | ameerj | 2021-07-23 | 4 | -138/+164 |
| | | | | Along with implementations of common instructions along the way | ||||
* | glasm: Use BitField instead of C bitfields | ReinUsesLisp | 2021-07-23 | 2 | -8/+12 |
| | |||||
* | glasm: Remove unused argument in identity instructions on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -7/+7 |
| | |||||
* | glasm: Implement basic GLASM instructions | ReinUsesLisp | 2021-07-23 | 10 | -840/+1173 |
| | |||||
* | glasm: Changes to GLASM register allocator and emit context | ReinUsesLisp | 2021-07-23 | 4 | -26/+64 |
| | |||||
* | glasm: Add GLASM backend infrastructure | ReinUsesLisp | 2021-07-23 | 28 | -4/+3115 |
| | |||||
* | shader: ISET.X implementation | ameerj | 2021-07-23 | 1 | -8/+58 |
| | |||||
* | shader: Fixup SPIR-V emit header namespaces | ReinUsesLisp | 2021-07-23 | 1 | -2/+2 |
| | |||||
* | Move SPIR-V emission functions to their own header | ReinUsesLisp | 2021-07-23 | 24 | -572/+631 |
| | |||||
* | shader: Optimize NVN Fallthrough | FernandoS27 | 2021-07-23 | 4 | -9/+83 |
| | |||||
* | shader: Stub SR_AFFINITY | FernandoS27 | 2021-07-23 | 1 | -0/+3 |
| | |||||
* | shader: Implement Int32 SUATOM/SURED | ameerj | 2021-07-23 | 17 | -6/+733 |
| | |||||
* | shader: Initial OpenGL implementation | ReinUsesLisp | 2021-07-23 | 3 | -0/+12 |
| | |||||
* | spirv: Be aware of NAN unaware drivers | ReinUsesLisp | 2021-07-23 | 1 | -18/+40 |
| | |||||
* | spirv: Add SSBO read fallbacks when no aliasing is available | ReinUsesLisp | 2021-07-23 | 1 | -37/+99 |
| | |||||
* | spirv: Add OpKill fallback to demote | ReinUsesLisp | 2021-07-23 | 1 | -2/+6 |
| | |||||
* | spirv: Do not enable ShaderLayer | ReinUsesLisp | 2021-07-23 | 1 | -3/+0 |
| | | | | This is enabled by an extension instead of the capability. | ||||
* | spirv: Enable DemoteToHelperInvocationEXT only when supported | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | spirv: Use OriginLowerLeft when requested | ReinUsesLisp | 2021-07-23 | 1 | -1/+5 |
| | |||||
* | spirv: Only add image operands mask when needed | ReinUsesLisp | 2021-07-23 | 1 | -5/+9 |
| | |||||
* | spirv: Workaround image unsigned offset bug | ReinUsesLisp | 2021-07-23 | 2 | -9/+26 |
| | | | | | Workaround bug on Nvidia's OpenGL SPIR-V compiler when using unsigned texture offsets. | ||||
* | spirv: Add int8 and int16 capabilities only when supported | ReinUsesLisp | 2021-07-23 | 1 | -2/+2 |
| | |||||
* | spirv: Add integer clamping workarounds | ReinUsesLisp | 2021-07-23 | 1 | -4/+34 |
| | | | | Workaround more bugs on Nvidia's OpenGL SPIR-V compiler. | ||||
* | spirv: Implement int8 and int16 conversion fallbacks | ReinUsesLisp | 2021-07-23 | 1 | -19/+80 |
| | |||||
* | spirv: Support OpenGL uniform buffers and change bindings | ReinUsesLisp | 2021-07-23 | 5 | -56/+163 |
| | |||||
* | spirv: Desambiguate descriptor names | ReinUsesLisp | 2021-07-23 | 1 | -9/+37 |
| | | | | | Worksaround a bug on Nvidia's OpenGL SPIR-V compiler where names are used for name matching. | ||||
* | shader: Add OpenGL shader profile options | ReinUsesLisp | 2021-07-23 | 1 | -0/+11 |
| | |||||
* | shader: Remove shader util | ReinUsesLisp | 2021-07-23 | 4 | -176/+0 |
| | |||||
* | shader: Address feedback | FernandoS27 | 2021-07-23 | 4 | -35/+33 |
| | |||||
* | shader: Implement VertexA stage | FernandoS27 | 2021-07-23 | 11 | -0/+166 |
| | |||||
* | shader: Implement delegation of Exit to dispatcher on CFG | FernandoS27 | 2021-07-23 | 2 | -3/+47 |
| | |||||
* | shader: Fix IADD3.CC | ameerj | 2021-07-23 | 1 | -12/+5 |
| | |||||
* | shader: Fix BFE s32 undefined check | ameerj | 2021-07-23 | 1 | -1/+1 |
| | | | | Our unit tests were hitting this exception. | ||||
* | shader: Fix error checking in bitfieldExtract and implement bitfieldInsert folding | ReinUsesLisp | 2021-07-23 | 1 | -5/+14 |
| | |||||
* | shader: Fix storage type when reading patches on tess control | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | |||||
* | shader: Fix VMNMX selector B | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | |||||
* | shader: Increase the maximum number of storage buffers | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | | | Compute shaders spill uniform buffers on storage buffers, increasing the expected number. | ||||
* | shader: Remove identity removal pass for better build times | ReinUsesLisp | 2021-07-23 | 1 | -1/+0 |
| | |||||
* | shader: Add more strict validation the pass | ReinUsesLisp | 2021-07-23 | 1 | -0/+42 |
| | |||||
* | shader: Fix forward referencing identity instructions when inserting phi | ReinUsesLisp | 2021-07-23 | 1 | -11/+13 |
| | |||||
* | shader: Remove invalidated blocks in dead code elimination pass | ReinUsesLisp | 2021-07-23 | 1 | -3/+6 |
| | |||||
* | shader: Add missing UndoUse case for GetSparseFromOp | ReinUsesLisp | 2021-07-23 | 1 | -0/+4 |
| | |||||
* | shader: Simplify code in opcodes.h to fix Intellisense | ReinUsesLisp | 2021-07-23 | 1 | -8/+6 |
| | | | | | | | | Avoid using std::array to fix Intellisense not properly compiling this code and disabling itself on all files that include it. While we are at it, change the code to use u8 instead of size_t for the number of instructions in an opcode. | ||||
* | shader: Implement indexed textures | ReinUsesLisp | 2021-07-23 | 7 | -93/+189 |
| | |||||
* | shader: Refactor atomic_operations_global_memory | ameerj | 2021-07-23 | 1 | -44/+36 |
| | |||||
* | shader: add missing include guard in half_floating_point_helper.h | ameerj | 2021-07-23 | 1 | -0/+2 |
| | |||||
* | shader: Fix gcc warnings | ReinUsesLisp | 2021-07-23 | 2 | -2/+2 |
| | |||||
* | shader: Inline common Value getters | ReinUsesLisp | 2021-07-23 | 2 | -109/+102 |
| | |||||
* | shader: Intrusively store in a block if it's sealed or not | ReinUsesLisp | 2021-07-23 | 2 | -3/+11 |
| | |||||
* | cmake: Link to common in shader_recompiler | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Improve goto removal algorithm complexity | ReinUsesLisp | 2021-07-23 | 1 | -49/+28 |
| | | | | | Find sibling node containing a nephew searching from the nephew itself instead of the uncle. | ||||
* | shader: Use memset to reset instruction arguments | ReinUsesLisp | 2021-07-23 | 2 | -4/+7 |
| | |||||
* | shader: Inline common Value functions into the header | ReinUsesLisp | 2021-07-23 | 2 | -19/+23 |
| | |||||
* | shader: Move microinstruction header to the value header | ReinUsesLisp | 2021-07-23 | 19 | -180/+161 |
| | |||||
* | shader: Move siblings check to a separate function and comment them out | ReinUsesLisp | 2021-07-23 | 1 | -16/+21 |
| | |||||
* | shader: Intrusively store register values in block for SSA pass | ReinUsesLisp | 2021-07-23 | 2 | -21/+53 |
| | |||||
* | shader: Inline common Opcode and Inst functions | ReinUsesLisp | 2021-07-23 | 4 | -112/+83 |
| | |||||
* | shader: Inline common IR::Block methods | ReinUsesLisp | 2021-07-23 | 2 | -17/+12 |
| | |||||
* | shader: Use a small_vector for phi blocks | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | |||||
* | shader: Calculate number of arguments in an opcode at compile time | ReinUsesLisp | 2021-07-23 | 1 | -3/+12 |
| | |||||
* | shader: Implement D3D samplers | ReinUsesLisp | 2021-07-23 | 3 | -12/+76 |
| | |||||
* | shader: Add constant propagation for arithmetic right shifts | ReinUsesLisp | 2021-07-23 | 1 | -0/+3 |
| | |||||
* | shader: Simplify code for local memory | ReinUsesLisp | 2021-07-23 | 1 | -6/+11 |
| | |||||
* | shader: Add NVN storage buffer fallbacks | ReinUsesLisp | 2021-07-23 | 9 | -62/+214 |
| | | | | | | | When we can't track the SSBO origin of a global memory instruction, leave it as a global memory operation and assume these pointers are in the NVN storage buffer slots, then apply a linear search in the shader's runtime. | ||||
* | spirv: Fix ViewportMask | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | |||||
* | spirv: Replace Constant/ConstantComposite with Const helper | ameerj | 2021-07-23 | 12 | -112/+101 |
| | |||||
* | shader: Address feedback | FernandoS27 | 2021-07-23 | 2 | -7/+10 |
| | |||||
* | shader: Implement F2F (Imm) | FernandoS27 | 2021-07-23 | 1 | -2/+28 |
| | |||||
* | shader: Implement IADD3.CC/.X | FernandoS27 | 2021-07-23 | 1 | -7/+22 |
| | |||||
* | shader: Address feedback | FernandoS27 | 2021-07-23 | 4 | -7/+4 |
| | |||||
* | shader: Add coarse derivatives | FernandoS27 | 2021-07-23 | 7 | -8/+28 |
| | |||||
* | shader: Implement fine derivates constant propagation | FernandoS27 | 2021-07-23 | 9 | -0/+101 |
| | |||||
* | shader: Implement SR_Y_DIRECTION | FernandoS27 | 2021-07-23 | 7 | -0/+18 |
| | |||||
* | shader: Fix Phi node types | ReinUsesLisp | 2021-07-23 | 2 | -4/+4 |
| | |||||
* | shader: Fix memory barriers | ReinUsesLisp | 2021-07-23 | 8 | -62/+30 |
| | |||||
* | spirv: Fix implicit lod type | ReinUsesLisp | 2021-07-23 | 2 | -1/+5 |
| | |||||
* | spirv: Use explicit lods outside of fragment shaders | ReinUsesLisp | 2021-07-23 | 1 | -5/+16 |
| | |||||
* | spirv: Use ConstOffset instead of Offset when possible | ReinUsesLisp | 2021-07-23 | 3 | -21/+67 |
| | |||||
* | shader: Implement BFE and BFI CC | ameerj | 2021-07-23 | 3 | -14/+17 |
| | | | | Fix two bugs in BFI. | ||||
* | shader: Implement SampleMask | ReinUsesLisp | 2021-07-23 | 11 | -2/+22 |
| | |||||
* | shader: Implement PIXLD.MY_INDEX | ReinUsesLisp | 2021-07-23 | 13 | -4/+69 |
| | |||||
* | spirv: Bitcast non-F32 output attributes to their type before store | ReinUsesLisp | 2021-07-23 | 1 | -13/+28 |
| | |||||
* | spirv: Implement ViewportMask with NV_viewport_array2 | ReinUsesLisp | 2021-07-23 | 7 | -0/+20 |
| | |||||
* | spirv: Bitcast non-F32 attributes to F32 | ReinUsesLisp | 2021-07-23 | 1 | -7/+9 |
| | |||||
* | shader: Implement PrimitiveId | ReinUsesLisp | 2021-07-23 | 5 | -0/+10 |
| | |||||
* | shader: Implement tessellation shaders, polygon mode and invocation id | ReinUsesLisp | 2021-07-23 | 22 | -88/+555 |
| | |||||
* | shader: Mark atomic instructions as writes | ReinUsesLisp | 2021-07-23 | 1 | -0/+27 |
| | |||||
* | spirv: Implement image buffers | ReinUsesLisp | 2021-07-23 | 5 | -23/+86 |
| | |||||
* | spirv: Implement Layer stores | ReinUsesLisp | 2021-07-23 | 6 | -9/+30 |
| | |||||
* | spirv: Fix alpha test | FernandoS27 | 2021-07-23 | 1 | -0/+5 |
| | |||||
* | spirv: Fix non-atomic 64-bit store | ameerj | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | spirv: Implement alpha test | ameerj | 2021-07-23 | 2 | -1/+59 |
| | |||||
* | shader: Implement transform feedbacks and define file format | ReinUsesLisp | 2021-07-23 | 8 | -16/+116 |
| | |||||
* | shader: Implement early Z tests | ReinUsesLisp | 2021-07-23 | 2 | -0/+4 |
| | |||||
* | shader: Document and relax cache control on surface instructions | ReinUsesLisp | 2021-07-23 | 1 | -10/+11 |
| | |||||
* | spirv: Rework storage buffers and shader memory | ReinUsesLisp | 2021-07-23 | 8 | -499/+553 |
| | |||||
* | shader: Fix fixed pipeline point size on geometry shaders | ReinUsesLisp | 2021-07-23 | 1 | -10/+18 |
| | |||||
* | shader: Add constant propagation for *&^| binary operations | ReinUsesLisp | 2021-07-23 | 1 | -0/+12 |
| | |||||
* | shader: Implement geometry shaders | ReinUsesLisp | 2021-07-23 | 12 | -84/+221 |
| | |||||
* | shader: Implement OUT | ReinUsesLisp | 2021-07-23 | 10 | -17/+73 |
| | |||||
* | internal_stage_buffer_entry_read: Remove pragma optimize off | lat9nq | 2021-07-23 | 1 | -2/+0 |
| | |||||
* | shader: Stub SR_INVOCATION_INFO | ReinUsesLisp | 2021-07-23 | 1 | -2/+5 |
| | |||||
* | shader: Stub ISBERD | ReinUsesLisp | 2021-07-23 | 3 | -4/+56 |
| | |||||
* | shader: Fix CC in I2I | ReinUsesLisp | 2021-07-23 | 1 | -0/+2 |
| | |||||
* | spirv: Define StorageImageWriteWithoutFormat capability when used | ReinUsesLisp | 2021-07-23 | 3 | -0/+9 |
| | |||||
* | shader: Simplify FLO and throw on CC | ReinUsesLisp | 2021-07-23 | 1 | -12/+13 |
| | |||||
* | shader: Mark blocks with no end branch as unreachable | ReinUsesLisp | 2021-07-23 | 1 | -2/+7 |
| | |||||
* | shader: Implement LOP CC | ReinUsesLisp | 2021-07-23 | 3 | -12/+29 |
| | |||||
* | shader: Implement SR_THREAD_KILL | ReinUsesLisp | 2021-07-23 | 10 | -0/+22 |
| | |||||
* | shader: Apply sign bit in FCMP (imm) | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Implement ATOM/S and RED | ameerj | 2021-07-23 | 18 | -19/+1724 |
| | |||||
* | spirv: Move phi node patching to a separate function | ReinUsesLisp | 2021-07-23 | 1 | -13/+16 |
| | |||||
* | spirv: Guard against typeless image reads on unsupported devices | ReinUsesLisp | 2021-07-23 | 5 | -1/+16 |
| | |||||
* | shader: Move LaneId to the warp emission file and fix AMD | ReinUsesLisp | 2021-07-23 | 5 | -7/+11 |
| | |||||
* | spirv: Fix forward declarations on phi nodes | ReinUsesLisp | 2021-07-23 | 1 | -47/+25 |
| | |||||
* | shader: Mark ImageWrite with side effects | ReinUsesLisp | 2021-07-23 | 1 | -0/+3 |
| | |||||
* | shader: Implement CC for ISET, FSET, PSET, CSET, and DSET | FernandoS27 | 2021-07-23 | 18 | -13/+136 |
| | | | | Throw when other instructions are missing CC. | ||||
* | shader: Remove outdated comment in F2I | ReinUsesLisp | 2021-07-23 | 1 | -4/+0 |
| | |||||
* | shader: Implement SULD and SUST | ReinUsesLisp | 2021-07-23 | 23 | -137/+597 |
| | |||||
* | shader: Fix Windows build issues | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Address feedback + clang format | lat9nq | 2021-07-23 | 11 | -22/+20 |
| | |||||
* | shader_recompiler,video_core: Cleanup some GCC and Clang errors | lat9nq | 2021-07-23 | 59 | -297/+289 |
| | | | | | | | | | | | | | | | | | Mostly fixing unused *, implicit conversion, braced scalar init, fpermissive, and some others. Some Clang errors likely remain in video_core, and std::ranges is still a pertinent issue in shader_recompiler shader_recompiler: cmake: Force bracket depth to 1024 on Clang Increases the maximum fold expression depth thread_worker: Include condition_variable Don't use list initializers in control flow Co-authored-by: ReinUsesLisp <reinuseslisp@airmail.cc> | ||||
* | shader: Fix FCMP immediate variant | ReinUsesLisp | 2021-07-23 | 1 | -1/+9 |
| | |||||
* | shader: Fix dangling labels | ReinUsesLisp | 2021-07-23 | 1 | -0/+5 |
| | |||||
* | shader: Interact texture buffers with buffer cache | ReinUsesLisp | 2021-07-23 | 3 | -29/+29 |
| | |||||
* | shader: Fix F2I | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Fix TextureGrad | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Implement texture buffers | ReinUsesLisp | 2021-07-23 | 6 | -23/+125 |
| | |||||
* | shader: Address feedback | FernandoS27 | 2021-07-23 | 5 | -53/+54 |
| | |||||
* | shader: Implement indexed Position and ClipDistances | FernandoS27 | 2021-07-23 | 3 | -11/+100 |
| | |||||
* | shader: Implement indexed attributes | FernandoS27 | 2021-07-23 | 12 | -35/+279 |
| | |||||
* | shader: Implement AL2P | FernandoS27 | 2021-07-23 | 3 | -4/+36 |
| | |||||
* | shader: Fix BRX tracking | FernandoS27 | 2021-07-23 | 2 | -3/+4 |
| | |||||
* | shader: Move recursive SSA rewrite to the heap | ReinUsesLisp | 2021-07-23 | 1 | -29/+89 |
| | |||||
* | shader: Fix ShadowCube declaration type, set number of pipeline threads based on hardware | FernandoS27 | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Fix splits on blocks using indirect branches | ReinUsesLisp | 2021-07-23 | 3 | -17/+38 |
| | |||||
* | shader: Eliminate orphan blocks more efficiently | ReinUsesLisp | 2021-07-23 | 1 | -7/+8 |
| | |||||
* | shader: Add subgroup masks | ReinUsesLisp | 2021-07-23 | 10 | -45/+169 |
| | |||||
* | shader: Implement BAR and fix memory barriers | ReinUsesLisp | 2021-07-23 | 7 | -5/+79 |
| | |||||
* | shader: Abstract breadth searches and use the abstraction | ReinUsesLisp | 2021-07-23 | 4 | -104/+106 |
| | |||||
* | shader: Reimplement GetCbufU64 as GetCbufU32x2 | ReinUsesLisp | 2021-07-23 | 9 | -22/+21 |
| | | | | It may generate better code on some compilers and it's easier to handle. | ||||
* | shader: Remove unused header in VOTE | ReinUsesLisp | 2021-07-23 | 1 | -2/+0 |
| | |||||
* | shader: Rework global memory tracking to use breadth-first search | ReinUsesLisp | 2021-07-23 | 1 | -69/+80 |
| | |||||
* | shader: Fix fp16 merge when using native fp16 | ReinUsesLisp | 2021-07-23 | 1 | -3/+3 |
| | |||||
* | shader: Fix FADD32I | ReinUsesLisp | 2021-07-23 | 1 | -6/+4 |
| | |||||
* | shader: Fix undetected bug from review | FernandoS27 | 2021-07-23 | 1 | -0/+3 |
| | |||||
* | shader: Address feedback | FernandoS27 | 2021-07-23 | 3 | -13/+16 |
| | |||||
* | shader: "Implement" NOP | FernandoS27 | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Address Feedback | FernandoS27 | 2021-07-23 | 16 | -211/+60 |
| | |||||
* | shader: Implement SR_LaneId | FernandoS27 | 2021-07-23 | 7 | -0/+15 |
| | |||||
* | shader: Fix shared memory on cool drivers | FernandoS27 | 2021-07-23 | 1 | -0/+1 |
| | |||||
* | shader: Implement MEMBAR | FernandoS27 | 2021-07-23 | 9 | -11/+121 |
| | |||||
* | shader: Improve VOTE.VTG stub | FernandoS27 | 2021-07-23 | 7 | -4/+147 |
| | |||||
* | shader: Mark SSBOs as written when they are | FernandoS27 | 2021-07-23 | 2 | -2/+30 |
| | |||||
* | shader: Implement ViewportIndex | FernandoS27 | 2021-07-23 | 7 | -2/+32 |
| | |||||
* | shader: Stub TLD4's PTP when it isn't constant | FernandoS27 | 2021-07-23 | 1 | -1/+2 |
| | |||||
* | shader: Stub VOTE.VTG | FernandoS27 | 2021-07-23 | 4 | -4/+15 |
| | |||||
* | shader: Fold composite extract | FernandoS27 | 2021-07-23 | 1 | -0/+62 |
| | |||||
* | shader: Fold comparisons and Pack/Unpack16 | FernandoS27 | 2021-07-23 | 1 | -1/+41 |
| | |||||
* | shader: Fix branches to visited virtual blocks | ReinUsesLisp | 2021-07-23 | 2 | -0/+12 |
| | |||||
* | shader: Fix dependency on identity removal pass | ReinUsesLisp | 2021-07-23 | 2 | -3/+8 |
| | |||||
* | shader: Fix constant propagation to use reverse post order | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | |||||
* | shader: Implement LDG .U.128 as .128 | ReinUsesLisp | 2021-07-23 | 1 | -3/+2 |
| | |||||
* | shader: Unroll "using enum" for opcode declarations | ReinUsesLisp | 2021-07-23 | 1 | -1/+27 |
| | |||||
* | spirv: Remove unnecesary variable for clip distances | ReinUsesLisp | 2021-07-23 | 2 | -6/+2 |
| | |||||
* | shader: Implement ClipDistance | FernandoS27 | 2021-07-23 | 5 | -0/+36 |
| | |||||
* | shader: Fix TXD | FernandoS27 | 2021-07-23 | 2 | -2/+2 |
| | |||||
* | shader: Address feedback | FernandoS27 | 2021-07-23 | 4 | -52/+48 |
| | |||||
* | shader: Always pass a lod for TexelFetch | ReinUsesLisp | 2021-07-23 | 3 | -25/+17 |
| | |||||
* | shader: Implement TXD | FernandoS27 | 2021-07-23 | 4 | -10/+183 |
| | |||||
* | shader: Implement ImageGradient | FernandoS27 | 2021-07-23 | 8 | -2/+84 |
| | |||||
* | shader: Implement TMML partially | FernandoS27 | 2021-07-23 | 6 | -13/+137 |
| | |||||
* | shader,spirv: Implement ImageQueryLod. | FernandoS27 | 2021-07-23 | 9 | -1/+38 |
| | |||||
* | shader: Implement TLDS | FernandoS27 | 2021-07-23 | 3 | -4/+253 |
| | |||||
* | shader: Implement TLD | FernandoS27 | 2021-07-23 | 7 | -14/+173 |
| | |||||
* | spirv: Add fixed pipeline point size | ReinUsesLisp | 2021-07-23 | 3 | -1/+8 |
| | |||||
* | shader: Add PointCoord attribute | FernandoS27 | 2021-07-23 | 5 | -0/+16 |
| | |||||
* | shader: Add PointSize attribute | ameerj | 2021-07-23 | 5 | -0/+13 |
| | |||||
* | shader: Store type of phi nodes in flags | ReinUsesLisp | 2021-07-23 | 3 | -2/+11 |
| | | | | This is needed because pseudo-instructions where invalidated. | ||||
* | shader: Fix indirect branches to scheduler instructions | ReinUsesLisp | 2021-07-23 | 3 | -7/+17 |
| | |||||
* | spirv: Fix default output attribute initialization | ReinUsesLisp | 2021-07-23 | 1 | -3/+3 |
| | |||||
* | shader: Add missing new lines | ReinUsesLisp | 2021-07-23 | 1 | -0/+2 |
| | |||||
* | shader: Implement FSWZADD | ameerj | 2021-07-23 | 14 | -4/+87 |
| | |||||
* | shader: Implement BRX | FernandoS27 | 2021-07-23 | 20 | -47/+388 |
| | |||||
* | shader: Fix alignment checks on RZ | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Implement I2I CC | ameerj | 2021-07-23 | 3 | -24/+45 |
| | |||||
* | shader: Implement I2I SAT | ameerj | 2021-07-23 | 6 | -10/+52 |
| | |||||
* | shader: Fix ISCADD logic for PO/CC | ameerj | 2021-07-23 | 1 | -7/+8 |
| | |||||
* | shader: Implement LDS, STS, LDL, and STS and use SPIR-V 1.4 when available | ReinUsesLisp | 2021-07-23 | 17 | -17/+626 |
| | |||||
* | shader: Implement ISCADD CC | ameerj | 2021-07-23 | 1 | -1/+4 |
| | |||||
* | shader: Implement VMAD, VMNMX, VSETP | ameerj | 2021-07-23 | 9 | -23/+319 |
| | |||||
* | shader: Add missing I2I exception when CC is used | ReinUsesLisp | 2021-07-23 | 1 | -0/+4 |
| | |||||
* | shader: Better interpolation and disabled attributes support | ReinUsesLisp | 2021-07-23 | 7 | -23/+96 |
| | |||||
* | spirv: Remove dependencies on Environment when generating SPIR-V | ReinUsesLisp | 2021-07-23 | 4 | -9/+12 |
| | |||||
* | shader: Implement front face | ReinUsesLisp | 2021-07-23 | 5 | -0/+12 |
| | |||||
* | shader: Fix structured control flow on KIL instructions | ReinUsesLisp | 2021-07-23 | 2 | -3/+7 |
| | | | | | This could potentially leave unvisited blocks, leading to illegal phi nodes. | ||||
* | shader: Fix TXQ | FernandoS27 | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Implement TXQ and fix FragDepth | ReinUsesLisp | 2021-07-23 | 14 | -21/+172 |
| | |||||
* | shader: Refactor PTP and other minor changes | ReinUsesLisp | 2021-07-23 | 14 | -123/+67 |
| | |||||
* | shader: Add IR opcode for ImageFetch | FernandoS27 | 2021-07-23 | 7 | -5/+55 |
| | |||||
* | shader: Implement TLD4.PTP | FernandoS27 | 2021-07-23 | 15 | -28/+111 |
| | |||||
* | shader: Fix Array Indices in TEX/TLD4 | FernandoS27 | 2021-07-23 | 2 | -6/+6 |
| | |||||
* | shader: Implement FragDepth | FernandoS27 | 2021-07-23 | 2 | -1/+7 |
| | |||||
* | shader: Implement TLD4S. | FernandoS27 | 2021-07-23 | 3 | -4/+134 |
| | |||||
* | shader: Implement TLD4 and TLD4_B | FernandoS27 | 2021-07-23 | 13 | -11/+315 |
| | |||||
* | shader: Implement SHFL | ameerj | 2021-07-23 | 16 | -69/+284 |
| | |||||
* | shader: Track first bindless argument instead of the instruction itself | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Properly insert Prologue instruction | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | |||||
* | shader: Minor style nits | ReinUsesLisp | 2021-07-23 | 1 | -2/+4 |
| | |||||
* | shader: Fix F2I | FernandoS27 | 2021-07-23 | 10 | -9/+147 |
| | |||||
* | shader: Implement NDC [-1, 1], attribute types and default varying initialization | ReinUsesLisp | 2021-07-23 | 12 | -40/+149 |
| | |||||
* | shader: Fix use-after-free bug in object_pool | ReinUsesLisp | 2021-07-23 | 1 | -3/+3 |
| | |||||
* | shader: Implement VOTE | ameerj | 2021-07-23 | 14 | -5/+167 |
| | |||||
* | shader: Fix TEX mask | ReinUsesLisp | 2021-07-23 | 1 | -1/+3 |
| | |||||
* | vk_pipeline_cache: Add pipeline cache | ReinUsesLisp | 2021-07-23 | 4 | -8/+15 |
| | |||||
* | shader: Fold interpolation multiplications | ReinUsesLisp | 2021-07-23 | 1 | -0/+34 |
| | |||||
* | shader: Better but still partial interpolation support | ReinUsesLisp | 2021-07-23 | 1 | -5/+7 |
| | |||||
* | shader: Implement DMNMX, DSET, DSETP | ameerj | 2021-07-23 | 15 | -59/+208 |
| | |||||
* | shader: Implement FADD32I | FernandoS27 | 2021-07-23 | 1 | -2/+15 |
| | |||||
* | shader: Implement F2F | FernandoS27 | 2021-07-23 | 6 | -20/+192 |
| | |||||
* | shader: Add missing fp64 usage flags | ReinUsesLisp | 2021-07-23 | 1 | -0/+34 |
| | |||||
* | shader: Implement DMUL and DFMA | ameerj | 2021-07-23 | 8 | -30/+111 |
| | | | | Also add a missing const on DADD | ||||
* | shader: Add FP64 register load/store helpers | ameerj | 2021-07-23 | 3 | -21/+24 |
| | |||||
* | shader: Add support for fp16 comparisons and misc fixes | ReinUsesLisp | 2021-07-23 | 11 | -14/+56 |
| | |||||
* | shader: Fix floating point comparison for FP16 | FernandoS27 | 2021-07-23 | 5 | -32/+56 |
| | |||||
* | shader: Implement HSETP2 | FernandoS27 | 2021-07-23 | 3 | -12/+117 |
| | |||||
* | shader: Implement HSET2 | FernandoS27 | 2021-07-23 | 5 | -14/+119 |
| | |||||
* | shader: Implement HMUL2 | FernandoS27 | 2021-07-23 | 3 | -16/+144 |
| | |||||
* | shader: Implement HFMA2 | FernandoS27 | 2021-07-23 | 5 | -20/+192 |
| | |||||
* | spirv: Implement VertexId and InstanceId, refactor code | ReinUsesLisp | 2021-07-23 | 9 | -144/+243 |
| | |||||
* | shader: Refactor half floating instructions | FernandoS27 | 2021-07-23 | 4 | -58/+84 |
| | |||||
* | shader: Implement I2F | ReinUsesLisp | 2021-07-23 | 16 | -69/+427 |
| | |||||
* | shader: Implement ISCADD (imm) | ReinUsesLisp | 2021-07-23 | 1 | -2/+2 |
| | |||||
* | shader: Implement LOP32I | ReinUsesLisp | 2021-07-23 | 2 | -18/+45 |
| | |||||
* | shader: Add partial rasterizer integration | ReinUsesLisp | 2021-07-23 | 34 | -156/+629 |
| | |||||
* | shader: Implement DADD | ameerj | 2021-07-23 | 8 | -14/+132 |
| | |||||
* | shader: Implement CSET and CSETP | ameerj | 2021-07-23 | 6 | -15/+114 |
| | |||||
* | shader: Reorder phi nodes when redefined as undefined opcodes | ReinUsesLisp | 2021-07-23 | 1 | -1/+9 |
| | |||||
* | shader: Fix instruction transitions in and out of Phi | ReinUsesLisp | 2021-07-23 | 1 | -9/+11 |
| | |||||
* | shader: Implement FSET and FSETP | ameerj | 2021-07-23 | 9 | -94/+204 |
| | | | | Also fix oversight with adding SignedZeroInfNanPreserve execution mode. | ||||
* | shader: Implement TEXS | ReinUsesLisp | 2021-07-23 | 8 | -7/+287 |
| | |||||
* | shader: Implement CAL inlining function calls | ReinUsesLisp | 2021-07-23 | 24 | -330/+286 |
| | |||||
* | spirv: Add SignedZeroInfNanPreserve logic | ameerj | 2021-07-23 | 2 | -0/+8 |
| | |||||
* | shader: Implement FMNMX | ameerj | 2021-07-23 | 8 | -25/+101 |
| | | | | And add a const in FCMP | ||||
* | shader: Fix rebase issue | ReinUsesLisp | 2021-07-23 | 1 | -1/+0 |
| | |||||
* | shader: Implement FCMP | ameerj | 2021-07-23 | 9 | -50/+203 |
| | | | | still need to configure some settings for NV denorm flush and intel NaN | ||||
* | shader: Partial implementation of LDC | ReinUsesLisp | 2021-07-23 | 16 | -50/+405 |
| | |||||
* | shader: Initial support for textures and TEX | ReinUsesLisp | 2021-07-23 | 29 | -341/+1378 |
| | |||||
* | shader: Implement R2P | ameerj | 2021-07-23 | 8 | -15/+88 |
| | |||||
* | shader: Implement SHF | ameerj | 2021-07-23 | 8 | -31/+119 |
| | |||||
* | shader: Implement LEA | ameerj | 2021-07-23 | 9 | -29/+136 |
| | |||||
* | shader: Deduplicate HADD2 code | ReinUsesLisp | 2021-07-23 | 1 | -19/+16 |
| | |||||
* | shader: Implement I2I | ameerj | 2021-07-23 | 3 | -12/+100 |
| | |||||
* | shader: Implement HADD2 | ReinUsesLisp | 2021-07-23 | 12 | -42/+400 |
| | |||||
* | shader: Implement LOP and LOP3 | ameerj | 2021-07-23 | 8 | -31/+227 |
| | |||||
* | shader: Implement IADD3 | ameerj | 2021-07-23 | 3 | -12/+104 |
| | |||||
* | shader: Implement PSETP | ameerj | 2021-07-23 | 4 | -5/+40 |
| | |||||
* | Implement PSET, refactor common comparison funcs | ameerj | 2021-07-23 | 9 | -101/+88 |
| | |||||
* | shader: Implement FLO | ameerj | 2021-07-23 | 8 | -18/+75 |
| | |||||
* | shader: Implement ISET, add common_funcs | ameerj | 2021-07-23 | 8 | -50/+150 |
| | |||||
* | shader: Make IMNMX, SHR, SEL stylistically more consistent | ameerj | 2021-07-23 | 3 | -5/+5 |
| | |||||
* | shader: Implement ICMP | ameerj | 2021-07-23 | 3 | -16/+84 |
| | |||||
* | shader: Implement IMNMX | ameerj | 2021-07-23 | 8 | -12/+105 |
| | |||||
* | shader: Implement BFI | ameerj | 2021-07-23 | 3 | -16/+57 |
| | |||||
* | shader: Implement BFE | ameerj | 2021-07-23 | 3 | -12/+67 |
| | |||||
* | shader: Implement POPC | ameerj | 2021-07-23 | 8 | -12/+59 |
| | |||||
* | shader: Implement SHR | ameerj | 2021-07-23 | 8 | -18/+80 |
| | |||||
* | shader: Implement SEL | ameerj | 2021-07-23 | 4 | -16/+53 |
| | |||||
* | spirv: Move phi arguments emit to a separate function | ReinUsesLisp | 2021-07-23 | 1 | -27/+27 |
| | |||||
* | shader: Avoid infinite recursion when tracking global memory | ReinUsesLisp | 2021-07-23 | 1 | -5/+26 |
| | |||||
* | shader: Fix conditional execution of exit instructions | ReinUsesLisp | 2021-07-23 | 2 | -5/+6 |
| | |||||
* | spirv: Add support for self-referencing phi nodes | ReinUsesLisp | 2021-07-23 | 1 | -3/+10 |
| | |||||
* | shader: Fix control flow | ReinUsesLisp | 2021-07-23 | 8 | -20/+39 |
| | |||||
* | shader: Implement more of XMAD and FFMA32I and fix XMAD.CBCC | ReinUsesLisp | 2021-07-23 | 5 | -28/+76 |
| | |||||
* | shader: FMUL, select, RRO, and MUFU fixes | ReinUsesLisp | 2021-07-23 | 18 | -119/+507 |
| | |||||
* | shader: Fix MOV(reg), add SHL variants and emit neg and abs instructions | ReinUsesLisp | 2021-07-23 | 4 | -11/+11 |
| | |||||
* | spirv: Fixes and Intel specific workarounds | ReinUsesLisp | 2021-07-23 | 10 | -32/+43 |
| | |||||
* | shader: Rename, implement FADD.SAT and P2R (imm) | ReinUsesLisp | 2021-07-23 | 17 | -125/+211 |
| | |||||
* | shader: Add denorm flush support | ReinUsesLisp | 2021-07-23 | 15 | -60/+210 |
| | |||||
* | spirv: Add lower fp16 to fp32 pass | ReinUsesLisp | 2021-07-23 | 28 | -276/+465 |
| | |||||
* | shader: Primitive Vulkan integration | ReinUsesLisp | 2021-07-23 | 28 | -498/+573 |
| | |||||
* | shader: Add XMAD multiplication folding optimization | ReinUsesLisp | 2021-07-23 | 1 | -5/+77 |
| | |||||
* | shader: Simplify ISCADD | ReinUsesLisp | 2021-07-23 | 1 | -6/+1 |
| | |||||
* | shader: Add utility to resolve identities on a value | ReinUsesLisp | 2021-07-23 | 2 | -0/+8 |
| | |||||
* | spirv: Implement EmitIdentity | ReinUsesLisp | 2021-07-23 | 2 | -3/+3 |
| | |||||
* | spirv: Initial bindings support | ReinUsesLisp | 2021-07-23 | 22 | -292/+671 |
| | |||||
* | shader: Improve object pool | ReinUsesLisp | 2021-07-23 | 3 | -50/+66 |
| | |||||
* | shader: Fix tracking | ReinUsesLisp | 2021-07-23 | 1 | -50/+72 |
| | |||||
* | shader: Add support for forward declarations | ReinUsesLisp | 2021-07-23 | 10 | -68/+79 |
| | |||||
* | shader: Support SSA loops on IR | ReinUsesLisp | 2021-07-23 | 12 | -46/+150 |
| | |||||
* | shader: Misc fixes | ReinUsesLisp | 2021-07-23 | 10 | -89/+104 |
| | |||||
* | shader: Initial implementation of an AST | ReinUsesLisp | 2021-07-23 | 32 | -589/+1345 |
| | |||||
* | spirv: Initial SPIR-V support | ReinUsesLisp | 2021-07-23 | 18 | -34/+1400 |
| | |||||
* | shader: Better constant folding | ReinUsesLisp | 2021-07-23 | 2 | -13/+48 |
| | |||||
* | shader: Properly store phi on Inst | ReinUsesLisp | 2021-07-23 | 6 | -75/+132 |
| | |||||
* | shader: Add pools and rename files | ReinUsesLisp | 2021-07-23 | 30 | -108/+255 |
| | |||||
* | shader: Make typed IR | ReinUsesLisp | 2021-07-23 | 19 | -269/+495 |
| | |||||
* | shader: Remove illegal character in SSA pass | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | |||||
* | shader: Constant propagation and global memory to storage buffer | ReinUsesLisp | 2021-07-23 | 17 | -63/+652 |
| | |||||
* | shader: Initial instruction support | ReinUsesLisp | 2021-07-23 | 28 | -334/+1450 |
| | |||||
* | shader: SSA and dominance | ReinUsesLisp | 2021-07-23 | 24 | -77/+570 |
| | |||||
* | shader: Initial recompiler work | ReinUsesLisp | 2021-07-23 | 56 | -0/+7060 |