path: root/src/shader_recompiler
Commit message | Author | Age | Files | Lines
* glsl: F16x2 storage atomics | ameerj | 2021-07-23 | 7 | -58/+64
|
* glsl: Revert ssbo aliasing. Storage Atomics impl | ameerj | 2021-07-23 | 5 | -75/+134
|
* glsl: implement phi nodes | ameerj | 2021-07-23 | 4 | -20/+54
|
* glsl: Wip storage atomic ops | ameerj | 2021-07-23 | 10 | -327/+414
|
* glsl: Implement FCMP | ameerj | 2021-07-23 | 3 | -242/+185
|
* glsl: Add a more robust fp formatter | ameerj | 2021-07-23 | 4 | -9/+14
|
* glsl: More FP fixes | ameerj | 2021-07-23 | 2 | -9/+16
|
* glsl: FP function fixes | ameerj | 2021-07-23 | 7 | -17/+25
|
* glsl: More FP instructions/fixes | ameerj | 2021-07-23 | 5 | -28/+41
|
* glsl: Add many FP32/64 instructions | ameerj | 2021-07-23 | 12 | -765/+1011
|
* glsl: Implement more Integer ops | ameerj | 2021-07-23 | 3 | -119/+72
|
* glsl: Implement BF* | ameerj | 2021-07-23 | 3 | -9/+10
|
* glsl: Implement a few Integer instructions | ameerj | 2021-07-23 | 10 | -260/+398
|
* glsl: Use std::string_view for Emit function args. | ameerj | 2021-07-23 | 6 | -760/+838
|
* glsl: Pass IR::Inst& to Emit functions | ameerj | 2021-07-23 | 6 | -171/+169
|
* glsl: INeg and IAdd negate tests | ameerj | 2021-07-23 | 3 | -94/+106
|
* glsl: Reusable typed variables. IADD32 | ameerj | 2021-07-23 | 6 | -203/+311
|
* glsl: Fix program linking and cbuf | ameerj | 2021-07-23 | 2 | -3/+5
|
* glsl: Fix "reg" allocing | ameerj | 2021-07-23 | 10 | -898/+938
|   based on glasm with some tweaks
* glsl: Initial backend | ameerj | 2021-07-23 | 27 | -0/+3292
|
* spirv: Reduce log severity of mismatching denorm rules | ReinUsesLisp | 2021-07-23 | 1 | -2/+2
|
* shader: Fix loop safety to SSA pass | ReinUsesLisp | 2021-07-23 | 2 | -2/+4
|
* shader: Add logging | ReinUsesLisp | 2021-07-23 | 13 | -28/+30
|
* shader: Add shader loop safety check settings | lat9nq | 2021-07-23 | 9 | -33/+130
|   Also add a setting to enable Nsight Aftermath.
* shader: Comment why the array component is not read in TMML | ReinUsesLisp | 2021-07-23 | 1 | -0/+2
|
* tmml: Remove index component from coords vec | ameerj | 2021-07-23 | 1 | -4/+3
|   The lod query functions exposed by the rendering APIs do not make use of the texture array layer indexing.
* spirv/convert: Catch more signed operations oversights | ameerj | 2021-07-23 | 1 | -5/+5
|   The sign bit on integers of size < 32 was not properly preserved in casts
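The sign-bit issue mentioned in the entry above can be illustrated with a short C++ sketch. It assumes a narrow integer stored in the low bits of a 32-bit word. `SignExtend` is a hypothetical helper written for this illustration and is not taken from the recompiler or from the SPIR-V backend.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: widen a `bits`-wide two's-complement value, stored in
// the low bits of a 32-bit word, to a signed 32-bit integer.
// Assumes 1 <= bits < 32, matching the "size < 32" case in the commit message.
constexpr std::int32_t SignExtend(std::uint32_t value, unsigned bits) {
    assert(bits >= 1 && bits < 32);
    const std::uint32_t mask = (std::uint32_t{1} << bits) - 1;
    const std::uint32_t sign = std::uint32_t{1} << (bits - 1);
    value &= mask; // drop anything above the narrow type
    // XOR-then-subtract copies the narrow sign bit into every higher bit.
    return static_cast<std::int32_t>((value ^ sign) - sign);
}

// A zero-extending cast would turn the 8-bit pattern 0xFF into 255; keeping
// the sign bit gives -1 instead.
static_assert(SignExtend(0xFFu, 8) == -1, "8-bit all-ones is -1");
static_assert(SignExtend(0x7Fu, 8) == 127, "positive values are unchanged");
static_assert(SignExtend(0x8000u, 16) == -32768, "16-bit minimum value");
```

The XOR-and-subtract form avoids shifting signed values, which keeps the sketch free of implementation-specific shift behavior.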
* spirv/convert: Catch more broken signed operations on Nvidia OpenGL | ReinUsesLisp | 2021-07-23 | 1 | -0/+6
|   BitCast U32 to S32 before converting to float on drivers with broken signed operations.
* shader_environment: Add shader_local_memory_crs_size to local memory size | ameerj | 2021-07-23 | 1 | -2/+2
|   Fixes DOOM 2016 missing local memory
* shader: Fix VertexA Shaders. | FernandoS27 | 2021-07-23 | 3 | -14/+30
|
* shader: Add 2D and 3D variants to SUATOM and SURED | ReinUsesLisp | 2021-07-23 | 1 | -0/+4
|   Used by Claybook.
* shader: Avoid CPU side undefined behavior on I2F | ReinUsesLisp | 2021-07-23 | 1 | -0/+2
|
* glasm: Use ARB_derivative_control conditionally | ReinUsesLisp | 2021-07-23 | 3 | -7/+30
|
* buffer_cache: Reduce uniform buffer size from shader usage | ReinUsesLisp | 2021-07-23 | 2 | -3/+17
|   Increases performance significantly on certain titles.
* emit_glasm_context_get_set: Remove unused variable | lat9nq | 2021-07-23 | 1 | -1/+0
|
* shader,glasm: Implement legacy texcoord loads | ReinUsesLisp | 2021-07-23 | 3 | -54/+29
|
* glasm: Implement legacy varyings | ReinUsesLisp | 2021-07-23 | 1 | -17/+56
|
* shader: Track legacy varyings | ReinUsesLisp | 2021-07-23 | 2 | -17/+105
|
* shader: Add support for "negative" and unaligned offsets | ReinUsesLisp | 2021-07-23 | 3 | -8/+13
|   "Negative" offsets don't exist. They are shown as such due to a bug in nvdisasm.
|   Unaligned offsets have been proved to read the aligned offset. For example, when reading a U32, if the offset is 6, the offset read will be 4.
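The unaligned-offset rule in the entry above (a U32 read at offset 6 reads offset 4) amounts to rounding the offset down to a multiple of the access size. The following is a minimal sketch of that arithmetic only. `AlignDown` is a hypothetical name, and how the recompiler itself applies the rule is not shown in this log.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: round a byte offset down to a multiple of the access
// size. `access_size` is assumed to be a non-zero power of two (4 for a U32).
constexpr std::uint32_t AlignDown(std::uint32_t offset, std::uint32_t access_size) {
    assert(access_size != 0 && (access_size & (access_size - 1u)) == 0);
    return offset & ~(access_size - 1u);
}

// The example from the commit message: reading a U32 at offset 6 reads offset 4.
static_assert(AlignDown(6, 4) == 4, "unaligned U32 offset rounds down");
static_assert(AlignDown(8, 4) == 8, "aligned offsets are unchanged");
```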
* shader: Implement ISCADD32I | ReinUsesLisp | 2021-07-23 | 1 | -17/+31
|
* spirv: Fix output generics with components | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* opengl: Declare fragment outputs even if they are not used | ReinUsesLisp | 2021-07-23 | 4 | -10/+9
|   Fixes Ori and the Blind Forest's menu on GLASM. For some reason (probably high level optimizations) it is not sanitized on SPIR-V for OpenGL. Vulkan is unaffected by this change.
* shader: Always initialize up reference in structure control flow | ReinUsesLisp | 2021-07-23 | 1 | -31/+36
|   Fixes ubsan issue.
* shader: Fix ImageWrite indexing | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* spirv: Fix image and image buffer descriptor index usage | ReinUsesLisp | 2021-07-23 | 1 | -5/+7
|
* glasm: Fix immediate texture coordinate | ReinUsesLisp | 2021-07-23 | 1 | -0/+1
|
* shader: Clang-format secondary textures | ReinUsesLisp | 2021-07-23 | 1 | -2/+2
|
* shader: Fix secondary textures | ReinUsesLisp | 2021-07-23 | 1 | -2/+2
|
* shader: Fix TMML queries | ReinUsesLisp | 2021-07-23 | 1 | -5/+9
|
* shader: Fix FSwizzleAdd folding when going through phi nodes | ReinUsesLisp | 2021-07-23 | 1 | -2/+2
|
* shader/exception: Fix compilation errors on gcc | ReinUsesLisp | 2021-07-23 | 1 | -6/+6
|
* glasm: Reduce reg allocation leaks from an exception to a log | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* shader: Handle host exceptions | ReinUsesLisp | 2021-07-23 | 4 | -13/+43
|
* glasm: Use integer lod for TXQ | ReinUsesLisp | 2021-07-23 | 2 | -2/+2
|
* glasm: Fix global memory fallbacks | ReinUsesLisp | 2021-07-23 | 1 | -9/+10
|
* Revert "glasm: Skip phi moves on undefined instructions" | ReinUsesLisp | 2021-07-23 | 2 | -16/+1
|   Causes regressions on Bowser's Fury.
* glasm: Remove unintentional '\n' on Undef32 | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* glasm: Use storage buffers instead of global memory when possible | ReinUsesLisp | 2021-07-23 | 6 | -370/+383
|
* glasm: Implement Y direction | ReinUsesLisp | 2021-07-23 | 4 | -3/+9
|
* glasm: Skip phi moves on undefined instructions | ReinUsesLisp | 2021-07-23 | 2 | -1/+16
|
* glasm: Implement undef instructions | ReinUsesLisp | 2021-07-23 | 2 | -15/+15
|
* glasm: Fix global memory callbacks | ReinUsesLisp | 2021-07-23 | 1 | -5/+6
|
* video_core,shader: Clang-format fixes | ReinUsesLisp | 2021-07-23 | 2 | -2/+2
|
* glasm: Release phi node registers after they are no longer needed | ReinUsesLisp | 2021-07-23 | 2 | -38/+54
|
* glasm: Remove unintentionally committed fmt::prints | ReinUsesLisp | 2021-07-23 | 1 | -2/+0
|
* glasm: Fix INeg32 on negative immediates | ReinUsesLisp | 2021-07-23 | 1 | -1/+5
|
* glasm: Remove unnecessary value types | ReinUsesLisp | 2021-07-23 | 3 | -47/+6
|
* glasm: Throw when there are register leaks | ReinUsesLisp | 2021-07-23 | 2 | -0/+7
|
* glasm: Catch more register leaks | ReinUsesLisp | 2021-07-23 | 8 | -41/+114
|   Add support for null registers. These are used when an instruction has no usages. This comes in handy when an instruction is only used for its CC value, with the caveat of having to invalidate all pseudo-instructions before defining the instruction itself in the register allocator. This commit changes this.
|   Work around a bug on Nvidia's condition codes conditional execution using branches.
* glasm: Fix usage counting on phi nodes | ReinUsesLisp | 2021-07-23 | 3 | -8/+22
|
* glasm: Implement global memory fallbacks | ReinUsesLisp | 2021-07-23 | 2 | -50/+89
|
* glasm: Implement int64 add and subtract | ReinUsesLisp | 2021-07-23 | 2 | -8/+6
|
* emit_glasm_context_get_set: Remove unused variable | lat9nq | 2021-07-23 | 1 | -1/+0
|
* glasm: Implement indirect attribute loads | ReinUsesLisp | 2021-07-23 | 4 | -6/+65
|
* glasm: Implement image atomics | ReinUsesLisp | 2021-07-23 | 3 | -166/+153
|
* glasm: Reorder unreachable image atomic insts | ReinUsesLisp | 2021-07-23 | 1 | -66/+66
|   Reorder them to the bottom of the file for readability.
* glasm: Implement gl_Layer stores | ReinUsesLisp | 2021-07-23 | 1 | -0/+7
|
* glasm: Implement SampleId | ReinUsesLisp | 2021-07-23 | 2 | -3/+3
|
* glasm: Implement IsHelperInvocation | ReinUsesLisp | 2021-07-23 | 2 | -3/+3
|
* glasm: Fix EmitVertex's optimization | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* gl_shader_cache,glasm: Conditionally use typeless image reads extension | ReinUsesLisp | 2021-07-23 | 1 | -2/+4
|
* glasm: Implement forced early Z | ReinUsesLisp | 2021-07-23 | 1 | -2/+6
|
* glasm: Simplify patch reads | ReinUsesLisp | 2021-07-23 | 1 | -5/+2
|
* glasm: Fix output patch reads | ReinUsesLisp | 2021-07-23 | 2 | -13/+22
|   With this, Luigi's Mansion's sand renders properly.
* shader: Split profile and runtime information in separate structs | ReinUsesLisp | 2021-07-23 | 11 | -71/+88
|
* emit_glasm_context_get_and_set.cpp: Add missing semicolons | ameerj | 2021-07-23 | 1 | -2/+2
|
* glasm: Fix patch attribute declarations | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* glasm: Implement FSWZADD | ameerj | 2021-07-23 | 3 | -4/+28
|
* glasm: Implement PrimitiveId attribute read | ReinUsesLisp | 2021-07-23 | 1 | -0/+3
|
* glasm: Implement clip distance stores | ReinUsesLisp | 2021-07-23 | 2 | -0/+15
|
* glasm: Fix tessellation input attributes | ReinUsesLisp | 2021-07-23 | 1 | -2/+5
|
* glasm: Add missing semicolon on tesscoord reading | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* glasm: Fix tessellation headers | ReinUsesLisp | 2021-07-23 | 1 | -2/+2
|
* glasm: Add tessellation shader declarations | ReinUsesLisp | 2021-07-23 | 1 | -0/+35
|
* glasm: Implement TessellationEvaluationPoint | ReinUsesLisp | 2021-07-23 | 1 | -0/+4
|
* glasm: Implement patch memory | ReinUsesLisp | 2021-07-23 | 3 | -6/+51
|
* glasm: Fix InvocationId declaration | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* glasm: Implement InvocationId | ReinUsesLisp | 2021-07-23 | 2 | -2/+5
|
* glasm: Optimize EmitVertex into EMIT | ReinUsesLisp | 2021-07-23 | 1 | -1/+5
|
* glasm: Implement geometry shader attribute reads | ReinUsesLisp | 2021-07-23 | 2 | -4/+18
|
* glasm: Properly declare attributes on geometry programs | ReinUsesLisp | 2021-07-23 | 3 | -6/+14
|
* glasm: Declare geometry program headers | ReinUsesLisp | 2021-07-23 | 1 | -0/+35
|
* glasm: Fix potential aliasing bug on cube array samples | ReinUsesLisp | 2021-07-23 | 2 | -35/+44
|
* glasm: Implement ImageWrite | ReinUsesLisp | 2021-07-23 | 1 | -4/+7
|
* glasm: Implement ImageRead | ReinUsesLisp | 2021-07-23 | 4 | -4/+56
|
* glasm: Implement EmitVertex and EndPrimitive | ReinUsesLisp | 2021-07-23 | 2 | -4/+8
|
* glasm: Implement ImageGradient | ReinUsesLisp | 2021-07-23 | 2 | -7/+65
|
* glasm: Implement 64-bit shifts | ReinUsesLisp | 2021-07-23 | 2 | -12/+14
|
* glasm: Implement barriers | ReinUsesLisp | 2021-07-23 | 1 | -3/+3
|
* glasm: Fix compute stage name | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* glasm: Fix phi instruction types | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* glasm: Implement PREC on relevant instructions | ReinUsesLisp | 2021-07-23 | 1 | -6/+12
|
* glasm: Implement stores to gl_ViewportIndex | ReinUsesLisp | 2021-07-23 | 4 | -7/+29
|
* glasm: Implement gl_PointSize stores | ReinUsesLisp | 2021-07-23 | 1 | -0/+3
|
* glasm: Implement gl_PointCoord | ReinUsesLisp | 2021-07-23 | 1 | -0/+4
|
* glasm: Implement ImageQueryLod | ReinUsesLisp | 2021-07-23 | 1 | -3/+5
|
* glasm: Implement ImageFetch | ReinUsesLisp | 2021-07-23 | 4 | -13/+38
|
* glasm: Implement IADD.CC | ameerj | 2021-07-23 | 1 | -1/+26
|
* glasm: Implement BFE.CC | ReinUsesLisp | 2021-07-23 | 1 | -0/+8
|
* glasm: Implement SelectU1 | ReinUsesLisp | 2021-07-23 | 2 | -4/+5
|
* glasm: Implement gl_WorkGroupID | ReinUsesLisp | 2021-07-23 | 2 | -3/+3
|
* glasm: Implement TXQ and improve texture info reads | ReinUsesLisp | 2021-07-23 | 2 | -50/+51
|
* glasm: Implement gl_FrongFacing attribute | ReinUsesLisp | 2021-07-23 | 1 | -0/+3
|
* glasm: Support textures used in more than one stage | ReinUsesLisp | 2021-07-23 | 3 | -4/+24
|
* glasm: Implement textureGather instructions | ReinUsesLisp | 2021-07-23 | 2 | -15/+97
|
* glasm: Implement gl_FragDepth and gl_SampleMask stores | ReinUsesLisp | 2021-07-23 | 2 | -5/+5
|
* glasm: Do not alias ConditionRef for now | ReinUsesLisp | 2021-07-23 | 2 | -3/+2
|   Immediate condition refs were not handled correctly. Just move the value for now.
* shader: Read branch conditions from an instruction | ReinUsesLisp | 2021-07-23 | 12 | -16/+36
|   Fixes the identity removal pass.
* glasm: Implement InstanceId and VertexId | ReinUsesLisp | 2021-07-23 | 1 | -0/+6
|
* glasm: Add missing return value on move assignment | ReinUsesLisp | 2021-07-23 | 1 | -0/+1
|
* glasm: Fix aliased bitcasts ref counting | ReinUsesLisp | 2021-07-23 | 3 | -13/+42
|
* glasm: Remove unintentional comma on vector insert | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* glasm: Implement TEX and TEXS instructions | ReinUsesLisp | 2021-07-23 | 10 | -69/+275
|   Remove lod clamp from texture instructions with lod, as this is not needed (nor supported).
* glasm: Add support for non-2D texture samples | ReinUsesLisp | 2021-07-23 | 1 | -4/+26
|
* glasm: Reorder unreachable image instructions to the bottom | ReinUsesLisp | 2021-07-23 | 1 | -97/+97
|
* glasm: Add support for texture offsets | ReinUsesLisp | 2021-07-23 | 1 | -11/+15
|
* glasm: Improve texture sampling instructions | ReinUsesLisp | 2021-07-23 | 2 | -50/+70
|
* emit_glasm: Enable ARB_draw_buffers when needed | ReinUsesLisp | 2021-07-23 | 2 | -1/+5
|
* emit_glasm: Add support for reading position attributes | ReinUsesLisp | 2021-07-23 | 1 | -3/+13
|
* shader_recompiler: GCC fixes | lat9nq | 2021-07-23 | 7 | -58/+55
|   Fixes members of unnamed union not being accessible, and one function without a declaration.
* glasm: Implement rest of shared mem | ameerj | 2021-07-23 | 2 | -35/+29
|
* shader: Use a non-trivial dummy to construct ASL node union | ReinUsesLisp | 2021-07-23 | 1 | -1/+6
|
* emit_spirv: Jump to loop body with local variable | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|   Silence unused variable warning
* glasm: Implement derivative instructions on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -12/+12
|
* glasm: Initial (broken) implementation of TEX on GLASM | ReinUsesLisp | 2021-07-23 | 3 | -299/+386
|
* glasm: Implement some graphics instructions on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -6/+5
|
* glasm: Add Void type to GLASM values | ReinUsesLisp | 2021-07-23 | 3 | -0/+15
|
* glasm: Add graphics specific shader declarations to GLASM | ReinUsesLisp | 2021-07-23 | 2 | -6/+63
|
* glasm: Implement local memory for glasm | ameerj | 2021-07-23 | 4 | -9/+12
|
* emit_spirv: Add missing block in case | ReinUsesLisp | 2021-07-23 | 1 | -1/+2
|
* glasm: Initial implementation of phi nodes on GLASM | ReinUsesLisp | 2021-07-23 | 12 | -25/+117
|
* glasm: Write result to scalar on integer comparison instructions | ReinUsesLisp | 2021-07-23 | 1 | -10/+10
|
* glasm: Declare NV_shader_thread_group when needed | ReinUsesLisp | 2021-07-23 | 1 | -3/+4
|
* glasm: Rework control flow introducing a syntax list | ReinUsesLisp | 2021-07-23 | 33 | -505/+437
|   This commit regresses VertexA shaders; their transformation pass has to be adapted to the new control flow.
* glasm: Implement Storage atomics | ameerj | 2021-07-23 | 5 | -109/+156
|   StorageAtomicExchangeU64 is failing its test, seemingly due to a failure to store the 64-bit result into the register
* glasm: Ensure reg alloc order across compilers on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -11/+14
|   Use a struct constructor to serialize register allocation arguments to ensure registers are allocated in the same order regardless of the compiler used.
|   The A and B functions can be called in any order when passed as arguments to "foo":
|       foo(A(), B())
|   But the order is guaranteed for curly-braced constructor calls in classes:
|       Foo{A(), B()}
|   Use this to get consistent behavior.
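The evaluation-order point in the entry above can be reproduced in a few lines of standalone C++. This is a sketch under stated assumptions: `OrderLog`, `Operands`, `AllocateInOrder` and `AllocateUnordered` are hypothetical names standing in for the register allocator and its callers, and none of them come from the GLASM backend.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for a register allocator: it records the order in
// which operands ask for a register and hands out increasing indices.
struct OrderLog {
    std::vector<std::string> calls;

    int Alloc(const std::string& name) {
        calls.push_back(name);
        return static_cast<int>(calls.size()) - 1;
    }
};

// Aggregate filled through a braced initializer list.
struct Operands {
    int a;
    int b;
};

// The clauses of a braced initializer list are evaluated left to right, so
// "A" always receives register 0 and "B" register 1.
inline Operands AllocateInOrder(OrderLog& log) {
    return Operands{log.Alloc("A"), log.Alloc("B")};
}

inline int Sum(int a, int b) {
    return a + b;
}

// Function arguments have no guaranteed evaluation order: a compiler may
// allocate "B" before "A" here, so the two indices can swap between compilers.
inline int AllocateUnordered(OrderLog& log) {
    return Sum(log.Alloc("A"), log.Alloc("B"));
}

// Usage: the braced form yields the same assignment on every conforming compiler.
inline void CheckBracedOrder() {
    OrderLog log;
    const Operands ops = AllocateInOrder(log);
    assert(ops.a == 0 && ops.b == 1);
    assert(log.calls.front() == "A");
}
```

Routing the operands through a braced initializer costs nothing at run time and makes the emitted register numbers reproducible across compilers, which is the property the commit message asks for.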
* glasm: Enable unintentionally disabled register aliasing on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -16/+11
|
* glasm: Review all GLASM insts to be aware of register aliasing | ReinUsesLisp | 2021-07-23 | 4 | -20/+51
|
* glasm: Implement shuffle and vote instructions on GLASM | ReinUsesLisp | 2021-07-23 | 10 | -100/+166
|
* glasm: Add MUFU instructions to GLASM | ReinUsesLisp | 2021-07-23 | 2 | -21/+22
|
* glasm: Implement IAbs64 and INeg64 on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -6/+6
|
* shader: Add floating-point rounding to I2F | ReinUsesLisp | 2021-07-23 | 3 | -35/+42
|
* glasm: Properly clamp Fp64 on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -6/+6
|
* glasm: Fix register allocation when moving immediate on GLASM | ReinUsesLisp | 2021-07-23 | 3 | -42/+89
|
* glasm: Implement SelectU64 on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -4/+20
|
* glasm: Fix clamps so the min value has priority on NAN on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -12/+15
|
* glasm: Fix moving U64 immediates to registers in GLASM | ReinUsesLisp | 2021-07-23 | 2 | -3/+4
|
* glasm: Implement storage atomic ops | ameerj | 2021-07-23 | 4 | -305/+358
|
* glasm: Add conversion instructions to GLASM | ReinUsesLisp | 2021-07-23 | 9 | -282/+351
|
* glasm: Add fp min/max insts and fix store for fp64 on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -10/+8
|
* glasm: Add logical instructions on GLASM | ReinUsesLisp | 2021-07-23 | 2 | -12/+12
|
* glasm: Remove duplicated Fp64 pack instructions on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -8/+0
|
* glasm: Remove unnecesary new white space on Clamp GLASM | ReinUsesLisp | 2021-07-23 | 1 | -4/+4
|
* glasm: Add floating-point comparisons on GLASM | ReinUsesLisp | 2021-07-23 | 3 | -120/+116
|
* emit_glasm: Implement more integer alu ops | ameerj | 2021-07-23 | 2 | -47/+41
|
* glasm: Reimplement bitwise ops and BFI/BFE | ameerj | 2021-07-23 | 4 | -88/+108
|
* glasm: Initial GLASM fp64 support | ReinUsesLisp | 2021-07-23 | 9 | -55/+152
|
* glasm: Implement GLASM fp16 packing and move bitwise insns | ReinUsesLisp | 2021-07-23 | 4 | -66/+77
|
* glasm: Remove unused functions left from rebase | ReinUsesLisp | 2021-07-23 | 1 | -12/+0
|
* glasm: Specify namespace when using FormatTo | ReinUsesLisp | 2021-07-23 | 1 | -6/+6
|
* glasm: Implement more GLASM composite instructions | ReinUsesLisp | 2021-07-23 | 2 | -54/+63
|
* glasm: Make GLASM aware of types | ReinUsesLisp | 2021-07-23 | 12 | -1244/+1380
|
* glasm: Use CMP.S for Select32 | ameerj | 2021-07-23 | 3 | -12/+8
|   also fixes ADD and SUB to use U modifier
* glasm: Implement more logical ops | ameerj | 2021-07-23 | 2 | -5/+5
|
* glasm: Implement BFI, BFE | ameerj | 2021-07-23 | 4 | -138/+164
|   Along with implementations of common instructions along the way
* glasm: Use BitField instead of C bitfields | ReinUsesLisp | 2021-07-23 | 2 | -8/+12
|
* glasm: Remove unused argument in identity instructions on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -7/+7
|
* glasm: Implement basic GLASM instructions | ReinUsesLisp | 2021-07-23 | 10 | -840/+1173
|
* glasm: Changes to GLASM register allocator and emit context | ReinUsesLisp | 2021-07-23 | 4 | -26/+64
|
* glasm: Add GLASM backend infrastructure | ReinUsesLisp | 2021-07-23 | 28 | -4/+3115
|
* shader: ISET.X implementation | ameerj | 2021-07-23 | 1 | -8/+58
|
* shader: Fixup SPIR-V emit header namespaces | ReinUsesLisp | 2021-07-23 | 1 | -2/+2
|
* Move SPIR-V emission functions to their own header | ReinUsesLisp | 2021-07-23 | 24 | -572/+631
|
* shader: Optimize NVN Fallthrough | FernandoS27 | 2021-07-23 | 4 | -9/+83
|
* shader: Stub SR_AFFINITY | FernandoS27 | 2021-07-23 | 1 | -0/+3
|
* shader: Implement Int32 SUATOM/SURED | ameerj | 2021-07-23 | 17 | -6/+733
|
* shader: Initial OpenGL implementation | ReinUsesLisp | 2021-07-23 | 3 | -0/+12
|
* spirv: Be aware of NAN unaware drivers | ReinUsesLisp | 2021-07-23 | 1 | -18/+40
|
* spirv: Add SSBO read fallbacks when no aliasing is available | ReinUsesLisp | 2021-07-23 | 1 | -37/+99
|
* spirv: Add OpKill fallback to demote | ReinUsesLisp | 2021-07-23 | 1 | -2/+6
|
* spirv: Do not enable ShaderLayer | ReinUsesLisp | 2021-07-23 | 1 | -3/+0
|   This is enabled by an extension instead of the capability.
* spirv: Enable DemoteToHelperInvocationEXT only when supported | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* spirv: Use OriginLowerLeft when requested | ReinUsesLisp | 2021-07-23 | 1 | -1/+5
|
* spirv: Only add image operands mask when needed | ReinUsesLisp | 2021-07-23 | 1 | -5/+9
|
* spirv: Workaround image unsigned offset bug | ReinUsesLisp | 2021-07-23 | 2 | -9/+26
|   Work around a bug in Nvidia's OpenGL SPIR-V compiler when using unsigned texture offsets.
* spirv: Add int8 and int16 capabilities only when supported | ReinUsesLisp | 2021-07-23 | 1 | -2/+2
|
* spirv: Add integer clamping workarounds | ReinUsesLisp | 2021-07-23 | 1 | -4/+34
|   Work around more bugs in Nvidia's OpenGL SPIR-V compiler.
* spirv: Implement int8 and int16 conversion fallbacks | ReinUsesLisp | 2021-07-23 | 1 | -19/+80
|
* spirv: Support OpenGL uniform buffers and change bindings | ReinUsesLisp | 2021-07-23 | 5 | -56/+163
|
* spirv: Desambiguate descriptor names | ReinUsesLisp | 2021-07-23 | 1 | -9/+37
|   Works around a bug in Nvidia's OpenGL SPIR-V compiler where names are used for name matching.
* shader: Add OpenGL shader profile options | ReinUsesLisp | 2021-07-23 | 1 | -0/+11
|
* shader: Remove shader util | ReinUsesLisp | 2021-07-23 | 4 | -176/+0
|
* shader: Address feedback | FernandoS27 | 2021-07-23 | 4 | -35/+33
|
* shader: Implement VertexA stage | FernandoS27 | 2021-07-23 | 11 | -0/+166
|
* shader: Implement delegation of Exit to dispatcher on CFG | FernandoS27 | 2021-07-23 | 2 | -3/+47
|
* shader: Fix IADD3.CC | ameerj | 2021-07-23 | 1 | -12/+5
|
* shader: Fix BFE s32 undefined check | ameerj | 2021-07-23 | 1 | -1/+1
|   Our unit tests were hitting this exception.
* shader: Fix error checking in bitfieldExtract and implement bitfieldInsert folding | ReinUsesLisp | 2021-07-23 | 1 | -5/+14
|
* shader: Fix storage type when reading patches on tess control | ReinUsesLisp | 2021-07-23 | 1 | -1/+2
|
* shader: Fix VMNMX selector B | ReinUsesLisp | 2021-07-23 | 1 | -1/+2
|
* shader: Increase the maximum number of storage buffers | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|   Compute shaders spill uniform buffers on storage buffers, increasing the expected number.
* shader: Remove identity removal pass for better build times | ReinUsesLisp | 2021-07-23 | 1 | -1/+0
|
* shader: Add more strict validation the pass | ReinUsesLisp | 2021-07-23 | 1 | -0/+42
|
* shader: Fix forward referencing identity instructions when inserting phi | ReinUsesLisp | 2021-07-23 | 1 | -11/+13
|
* shader: Remove invalidated blocks in dead code elimination pass | ReinUsesLisp | 2021-07-23 | 1 | -3/+6
|
* shader: Add missing UndoUse case for GetSparseFromOp | ReinUsesLisp | 2021-07-23 | 1 | -0/+4
|
* shader: Simplify code in opcodes.h to fix Intellisense | ReinUsesLisp | 2021-07-23 | 1 | -8/+6
|   Avoid using std::array to fix Intellisense not properly compiling this code and disabling itself on all files that include it.
|   While we are at it, change the code to use u8 instead of size_t for the number of instructions in an opcode.
* shader: Implement indexed textures | ReinUsesLisp | 2021-07-23 | 7 | -93/+189
|
* shader: Refactor atomic_operations_global_memory | ameerj | 2021-07-23 | 1 | -44/+36
|
* shader: add missing include guard in half_floating_point_helper.h | ameerj | 2021-07-23 | 1 | -0/+2
|
* shader: Fix gcc warnings | ReinUsesLisp | 2021-07-23 | 2 | -2/+2
|
* shader: Inline common Value getters | ReinUsesLisp | 2021-07-23 | 2 | -109/+102
|
* shader: Intrusively store in a block if it's sealed or not | ReinUsesLisp | 2021-07-23 | 2 | -3/+11
|
* cmake: Link to common in shader_recompiler | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* shader: Improve goto removal algorithm complexity | ReinUsesLisp | 2021-07-23 | 1 | -49/+28
|   Find sibling node containing a nephew searching from the nephew itself instead of the uncle.
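The complexity change described in the entry above can be pictured on a tree with parent links: instead of searching every sibling subtree of the uncle for the nephew, walk upwards from the nephew until a child of the uncle's parent is reached. The sketch below assumes such a tree. `Node` and `SiblingContaining` are hypothetical names, and the actual data structure used by the goto removal pass is not shown in this log.

```cpp
#include <cassert>

// Hypothetical tree node with a parent link.
struct Node {
    Node* parent = nullptr;
};

// Walk up from `nephew` until reaching the node whose parent is also
// `uncle`'s parent. That node is the sibling of `uncle` that contains
// `nephew`. The cost is proportional to the depth of `nephew`, not to the
// size of the sibling subtrees. Returns nullptr when `nephew` lies below
// `uncle` itself, or when no sibling of `uncle` contains it.
inline Node* SiblingContaining(const Node& uncle, Node& nephew) {
    assert(uncle.parent != nullptr); // a root node has no siblings
    for (Node* node = &nephew; node != nullptr; node = node->parent) {
        if (node->parent == uncle.parent) {
            return node == &uncle ? nullptr : node;
        }
    }
    return nullptr;
}
```

Searching from the uncle would visit every node under each sibling in the worst case, while the upward walk touches only the ancestors of the nephew.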
* shader: Use memset to reset instruction arguments | ReinUsesLisp | 2021-07-23 | 2 | -4/+7
|
* shader: Inline common Value functions into the header | ReinUsesLisp | 2021-07-23 | 2 | -19/+23
|
* shader: Move microinstruction header to the value header | ReinUsesLisp | 2021-07-23 | 19 | -180/+161
|
* shader: Move siblings check to a separate function and comment them out | ReinUsesLisp | 2021-07-23 | 1 | -16/+21
|
* shader: Intrusively store register values in block for SSA pass | ReinUsesLisp | 2021-07-23 | 2 | -21/+53
|
* shader: Inline common Opcode and Inst functions | ReinUsesLisp | 2021-07-23 | 4 | -112/+83
|
* shader: Inline common IR::Block methods | ReinUsesLisp | 2021-07-23 | 2 | -17/+12
|
* shader: Use a small_vector for phi blocks | ReinUsesLisp | 2021-07-23 | 1 | -1/+2
|
* shader: Calculate number of arguments in an opcode at compile time | ReinUsesLisp | 2021-07-23 | 1 | -3/+12
|
* shader: Implement D3D samplers | ReinUsesLisp | 2021-07-23 | 3 | -12/+76
|
* shader: Add constant propagation for arithmetic right shifts | ReinUsesLisp | 2021-07-23 | 1 | -0/+3
|
* shader: Simplify code for local memory | ReinUsesLisp | 2021-07-23 | 1 | -6/+11
|
* shader: Add NVN storage buffer fallbacks | ReinUsesLisp | 2021-07-23 | 9 | -62/+214
|   When we can't track the SSBO origin of a global memory instruction, leave it as a global memory operation and assume these pointers are in the NVN storage buffer slots, then apply a linear search in the shader's runtime.
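The linear search mentioned in the entry above can be sketched on the host side as follows. In the commit the search is emitted into the generated shader; this C++ version only illustrates the lookup logic. The slot layout (`StorageSlot` with a base address and a size), the `Resolved` result type and the function name are all assumptions made for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical description of one storage buffer slot: base address and size in bytes.
struct StorageSlot {
    std::uint64_t base;
    std::uint32_t size;
};

// Result of a successful lookup: which slot matched and the offset inside it.
struct Resolved {
    std::size_t slot;
    std::uint32_t offset;
};

// Linear search over the slots: return the first slot whose address range
// contains `address`, or std::nullopt when the pointer is in none of them.
inline std::optional<Resolved> ResolveGlobalAddress(const std::vector<StorageSlot>& slots,
                                                    std::uint64_t address) {
    for (std::size_t i = 0; i < slots.size(); ++i) {
        const StorageSlot& slot = slots[i];
        if (address >= slot.base && address - slot.base < slot.size) {
            return Resolved{i, static_cast<std::uint32_t>(address - slot.base)};
        }
    }
    return std::nullopt;
}

// Usage: an address inside the second slot resolves to slot 1, offset 0x10.
inline void ExampleLookup() {
    const std::vector<StorageSlot> slots{{0x1000, 0x100}, {0x4000, 0x40}};
    const std::optional<Resolved> hit = ResolveGlobalAddress(slots, 0x4010);
    assert(hit.has_value() && hit->slot == 1 && hit->offset == 0x10);
}
```

Comparing `address - slot.base` against the size, after checking `address >= slot.base`, avoids overflow that `slot.base + slot.size` could cause near the top of the address space.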
* spirv: Fix ViewportMask | ReinUsesLisp | 2021-07-23 | 1 | -1/+2
|
* spirv: Replace Constant/ConstantComposite with Const helper | ameerj | 2021-07-23 | 12 | -112/+101
|
* shader: Address feedback | FernandoS27 | 2021-07-23 | 2 | -7/+10
|
* shader: Implement F2F (Imm) | FernandoS27 | 2021-07-23 | 1 | -2/+28
|
* shader: Implement IADD3.CC/.X | FernandoS27 | 2021-07-23 | 1 | -7/+22
|
* shader: Address feedback | FernandoS27 | 2021-07-23 | 4 | -7/+4
|
* shader: Add coarse derivatives | FernandoS27 | 2021-07-23 | 7 | -8/+28
|
* shader: Implement fine derivates constant propagation | FernandoS27 | 2021-07-23 | 9 | -0/+101
|
* shader: Implement SR_Y_DIRECTION | FernandoS27 | 2021-07-23 | 7 | -0/+18
|
* shader: Fix Phi node types | ReinUsesLisp | 2021-07-23 | 2 | -4/+4
|
* shader: Fix memory barriers | ReinUsesLisp | 2021-07-23 | 8 | -62/+30
|
* spirv: Fix implicit lod type | ReinUsesLisp | 2021-07-23 | 2 | -1/+5
|
* spirv: Use explicit lods outside of fragment shaders | ReinUsesLisp | 2021-07-23 | 1 | -5/+16
|
* spirv: Use ConstOffset instead of Offset when possible | ReinUsesLisp | 2021-07-23 | 3 | -21/+67
|
* shader: Implement BFE and BFI CC | ameerj | 2021-07-23 | 3 | -14/+17
|   Fix two bugs in BFI.
* shader: Implement SampleMask | ReinUsesLisp | 2021-07-23 | 11 | -2/+22
|
* shader: Implement PIXLD.MY_INDEX | ReinUsesLisp | 2021-07-23 | 13 | -4/+69
|
* spirv: Bitcast non-F32 output attributes to their type before store | ReinUsesLisp | 2021-07-23 | 1 | -13/+28
|
* spirv: Implement ViewportMask with NV_viewport_array2 | ReinUsesLisp | 2021-07-23 | 7 | -0/+20
|
* spirv: Bitcast non-F32 attributes to F32 | ReinUsesLisp | 2021-07-23 | 1 | -7/+9
|
* shader: Implement PrimitiveId | ReinUsesLisp | 2021-07-23 | 5 | -0/+10
|
* shader: Implement tessellation shaders, polygon mode and invocation id | ReinUsesLisp | 2021-07-23 | 22 | -88/+555
|
* shader: Mark atomic instructions as writes | ReinUsesLisp | 2021-07-23 | 1 | -0/+27
|
* spirv: Implement image buffers | ReinUsesLisp | 2021-07-23 | 5 | -23/+86
|
* spirv: Implement Layer stores | ReinUsesLisp | 2021-07-23 | 6 | -9/+30
|
* spirv: Fix alpha test | FernandoS27 | 2021-07-23 | 1 | -0/+5
|
* spirv: Fix non-atomic 64-bit store | ameerj | 2021-07-23 | 1 | -1/+1
|
* spirv: Implement alpha test | ameerj | 2021-07-23 | 2 | -1/+59
|
* shader: Implement transform feedbacks and define file format | ReinUsesLisp | 2021-07-23 | 8 | -16/+116
|
* shader: Implement early Z tests | ReinUsesLisp | 2021-07-23 | 2 | -0/+4
|
* shader: Document and relax cache control on surface instructions | ReinUsesLisp | 2021-07-23 | 1 | -10/+11
|
* spirv: Rework storage buffers and shader memory | ReinUsesLisp | 2021-07-23 | 8 | -499/+553
|
* shader: Fix fixed pipeline point size on geometry shaders | ReinUsesLisp | 2021-07-23 | 1 | -10/+18
|
* shader: Add constant propagation for *&^| binary operations | ReinUsesLisp | 2021-07-23 | 1 | -0/+12
|
* shader: Implement geometry shaders | ReinUsesLisp | 2021-07-23 | 12 | -84/+221
|
* shader: Implement OUT | ReinUsesLisp | 2021-07-23 | 10 | -17/+73
|
* internal_stage_buffer_entry_read: Remove pragma optimize off | lat9nq | 2021-07-23 | 1 | -2/+0
|
* shader: Stub SR_INVOCATION_INFO | ReinUsesLisp | 2021-07-23 | 1 | -2/+5
|
* shader: Stub ISBERD | ReinUsesLisp | 2021-07-23 | 3 | -4/+56
|
* shader: Fix CC in I2I | ReinUsesLisp | 2021-07-23 | 1 | -0/+2
|
* spirv: Define StorageImageWriteWithoutFormat capability when used | ReinUsesLisp | 2021-07-23 | 3 | -0/+9
|
* shader: Simplify FLO and throw on CC | ReinUsesLisp | 2021-07-23 | 1 | -12/+13
|
* shader: Mark blocks with no end branch as unreachable | ReinUsesLisp | 2021-07-23 | 1 | -2/+7
|
* shader: Implement LOP CC | ReinUsesLisp | 2021-07-23 | 3 | -12/+29
|
* shader: Implement SR_THREAD_KILL | ReinUsesLisp | 2021-07-23 | 10 | -0/+22
|
* shader: Apply sign bit in FCMP (imm) | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* shader: Implement ATOM/S and RED | ameerj | 2021-07-23 | 18 | -19/+1724
|
* spirv: Move phi node patching to a separate function | ReinUsesLisp | 2021-07-23 | 1 | -13/+16
|
* spirv: Guard against typeless image reads on unsupported devices | ReinUsesLisp | 2021-07-23 | 5 | -1/+16
|
* shader: Move LaneId to the warp emission file and fix AMD | ReinUsesLisp | 2021-07-23 | 5 | -7/+11
|
* spirv: Fix forward declarations on phi nodes | ReinUsesLisp | 2021-07-23 | 1 | -47/+25
|
* shader: Mark ImageWrite with side effects | ReinUsesLisp | 2021-07-23 | 1 | -0/+3
|
* shader: Implement CC for ISET, FSET, PSET, CSET, and DSET | FernandoS27 | 2021-07-23 | 18 | -13/+136
|   Throw when other instructions are missing CC.
* shader: Remove outdated comment in F2I | ReinUsesLisp | 2021-07-23 | 1 | -4/+0
|
* shader: Implement SULD and SUST | ReinUsesLisp | 2021-07-23 | 23 | -137/+597
|
* shader: Fix Windows build issues | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* shader: Address feedback + clang format | lat9nq | 2021-07-23 | 11 | -22/+20
|
* shader_recompiler,video_core: Cleanup some GCC and Clang errors | lat9nq | 2021-07-23 | 59 | -297/+289
|   Mostly fixing unused *, implicit conversion, braced scalar init, fpermissive, and some others.
|   Some Clang errors likely remain in video_core, and std::ranges is still a pertinent issue in shader_recompiler
|   shader_recompiler: cmake: Force bracket depth to 1024 on Clang
|   Increases the maximum fold expression depth
|   thread_worker: Include condition_variable
|   Don't use list initializers in control flow
|   Co-authored-by: ReinUsesLisp <reinuseslisp@airmail.cc>
* shader: Fix FCMP immediate variant | ReinUsesLisp | 2021-07-23 | 1 | -1/+9
|
* shader: Fix dangling labels | ReinUsesLisp | 2021-07-23 | 1 | -0/+5
|
* shader: Interact texture buffers with buffer cache | ReinUsesLisp | 2021-07-23 | 3 | -29/+29
|
* shader: Fix F2I | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* shader: Fix TextureGrad | ReinUsesLisp | 2021-07-23 | 1 | -1/+1
|
* shader: Implement texture buffers | ReinUsesLisp | 2021-07-23 | 6 | -23/+125
|
* shader: Address feedback | FernandoS27 | 2021-07-23 | 5 | -53/+54
|
* shader: Implement indexed Position and ClipDistances | FernandoS27 | 2021-07-23 | 3 | -11/+100
|
* shader: Implement indexed attributes | FernandoS27 | 2021-07-23 | 12 | -35/+279
|
* shader: Implement AL2P | FernandoS27 | 2021-07-23 | 3 | -4/+36
|
* shader: Fix BRX tracking | FernandoS27 | 2021-07-23 | 2 | -3/+4
|
* shader: Move recursive SSA rewrite to the heap | ReinUsesLisp | 2021-07-23 | 1 | -29/+89
|
* shader: Fix ShadowCube declaration type, set number of pipeline threads based on hardware | FernandoS27 | 2021-07-23 | 1 | -1/+1
|
* shader: Fix splits on blocks using indirect branches | ReinUsesLisp | 2021-07-23 | 3 | -17/+38
|
* shader: Eliminate orphan blocks more efficiently | ReinUsesLisp | 2021-07-23 | 1 | -7/+8
|
* shader: Add subgroup masks | ReinUsesLisp | 2021-07-23 | 10 | -45/+169
|
* shader: Implement BAR and fix memory barriers | ReinUsesLisp | 2021-07-23 | 7 | -5/+79
|
* shader: Abstract breadth searches and use the abstraction | ReinUsesLisp | 2021-07-23 | 4 | -104/+106
|
* shader: Reimplement GetCbufU64 as GetCbufU32x2 | ReinUsesLisp | 2021-07-23 | 9 | -22/+21
|   It may generate better code on some compilers and it's easier to handle.
* shader: Remove unused header in VOTE | ReinUsesLisp | 2021-07-23 | 1 | -2/+0
|
* shader: Rework global memory tracking to use breadth-first search | ReinUsesLisp | 2021-07-23 | 1 | -69/+80
|
* shader: Fix fp16 merge when using native fp16 | ReinUsesLisp | 2021-07-23 | 1 | -3/+3
|
* shader: Fix FADD32I | ReinUsesLisp | 2021-07-23 | 1 | -6/+4
|
* shader: Fix undetected bug from review | FernandoS27 | 2021-07-23 | 1 | -0/+3
|
* shader: Address feedback | FernandoS27 | 2021-07-23 | 3 | -13/+16
|
* shader: "Implement" NOP | FernandoS27 | 2021-07-23 | 1 | -1/+1
|
* shader: Address Feedback | FernandoS27 | 2021-07-23 | 16 | -211/+60
|
* shader: Implement SR_LaneId | FernandoS27 | 2021-07-23 | 7 | -0/+15
|
* shader: Fix shared memory on cool drivers | FernandoS27 | 2021-07-23 | 1 | -0/+1
|
* shader: Implement MEMBAR | FernandoS27 | 2021-07-23 | 9 | -11/+121
|
* shader: Improve VOTE.VTG stub | FernandoS27 | 2021-07-23 | 7 | -4/+147
|
* shader: Mark SSBOs as written when they are | FernandoS27 | 2021-07-23 | 2 | -2/+30
|
* shader: Implement ViewportIndex | FernandoS27 | 2021-07-23 | 7 | -2/+32
|
* shader: Stub TLD4's PTP when it isn't constant | FernandoS27 | 2021-07-23 | 1 | -1/+2
|
* shader: Stub VOTE.VTG | FernandoS27 | 2021-07-23 | 4 | -4/+15
|
* shader: Fold composite extract | FernandoS27 | 2021-07-23 | 1 | -0/+62
|
* shader: Fold comparisons and Pack/Unpack16 | FernandoS27 | 2021-07-23 | 1 | -1/+41
|
* shader: Fix branches to visited virtual blocks | ReinUsesLisp | 2021-07-23 | 2 | -0/+12
|
* shader: Fix dependency on identity removal pass | ReinUsesLisp | 2021-07-23 | 2 | -3/+8
|
* shader: Fix constant propagation to use reverse post order | ReinUsesLisp | 2021-07-23 | 1 | -1/+2
|
* shader: Implement LDG .U.128 as .128 | ReinUsesLisp | 2021-07-23 | 1 | -3/+2
|
* shader: Unroll "using enum" for opcode declarations | ReinUsesLisp | 2021-07-23 | 1 | -1/+27
|
* spirv: Remove unnecesary variable for clip distances | ReinUsesLisp | 2021-07-23 | 2 | -6/+2
|
* shader: Implement ClipDistance | FernandoS27 | 2021-07-23 | 5 | -0/+36
|
* shader: Fix TXD | FernandoS27 | 2021-07-23 | 2 | -2/+2
|
* shader: Address feedback | FernandoS27 | 2021-07-23 | 4 | -52/+48
|
* shader: Always pass a lod for TexelFetch | ReinUsesLisp | 2021-07-23 | 3 | -25/+17
|
* shader: Implement TXD | FernandoS27 | 2021-07-23 | 4 | -10/+183
|
* shader: Implement ImageGradient | FernandoS27 | 2021-07-23 | 8 | -2/+84
|
* shader: Implement TMML partially | FernandoS27 | 2021-07-23 | 6 | -13/+137
|
* shader,spirv: Implement ImageQueryLod. | FernandoS27 | 2021-07-23 | 9 | -1/+38
|
* shader: Implement TLDS | FernandoS27 | 2021-07-23 | 3 | -4/+253
|
* shader: Implement TLD | FernandoS27 | 2021-07-23 | 7 | -14/+173
|
* spirv: Add fixed pipeline point size | ReinUsesLisp | 2021-07-23 | 3 | -1/+8
|
* shader: Add PointCoord attribute | FernandoS27 | 2021-07-23 | 5 | -0/+16
|
* shader: Add PointSize attribute | ameerj | 2021-07-23 | 5 | -0/+13
|
* shader: Store type of phi nodes in flags | ReinUsesLisp | 2021-07-23 | 3 | -2/+11
|   This is needed because pseudo-instructions were invalidated.
* shader: Fix indirect branches to scheduler instructionsReinUsesLisp2021-07-233-7/+17
|
* spirv: Fix default output attribute initializationReinUsesLisp2021-07-231-3/+3
|
* shader: Add missing new linesReinUsesLisp2021-07-231-0/+2
|
* shader: Implement FSWZADDameerj2021-07-2314-4/+87
|
* shader: Implement BRXFernandoS272021-07-2320-47/+388
|
* shader: Fix alignment checks on RZReinUsesLisp2021-07-231-1/+1
|
* shader: Implement I2I CCameerj2021-07-233-24/+45
|
* shader: Implement I2I SATameerj2021-07-236-10/+52
|
* shader: Fix ISCADD logic for PO/CCameerj2021-07-231-7/+8
|
* shader: Implement LDS, STS, LDL, and STL and use SPIR-V 1.4 when availableReinUsesLisp2021-07-2317-17/+626
|
* shader: Implement ISCADD CCameerj2021-07-231-1/+4
|
* shader: Implement VMAD, VMNMX, VSETPameerj2021-07-239-23/+319
|
* shader: Add missing I2I exception when CC is usedReinUsesLisp2021-07-231-0/+4
|
* shader: Better interpolation and disabled attributes supportReinUsesLisp2021-07-237-23/+96
|
* spirv: Remove dependencies on Environment when generating SPIR-VReinUsesLisp2021-07-234-9/+12
|
* shader: Implement front faceReinUsesLisp2021-07-235-0/+12
|
* shader: Fix structured control flow on KIL instructionsReinUsesLisp2021-07-232-3/+7
| | | | | This could potentially leave unvisited blocks, leading to illegal phi nodes.
* shader: Fix TXQFernandoS272021-07-231-1/+1
|
* shader: Implement TXQ and fix FragDepthReinUsesLisp2021-07-2314-21/+172
|
* shader: Refactor PTP and other minor changesReinUsesLisp2021-07-2314-123/+67
|
* shader: Add IR opcode for ImageFetchFernandoS272021-07-237-5/+55
|
* shader: Implement TLD4.PTPFernandoS272021-07-2315-28/+111
|
* shader: Fix Array Indices in TEX/TLD4FernandoS272021-07-232-6/+6
|
* shader: Implement FragDepthFernandoS272021-07-232-1/+7
|
* shader: Implement TLD4S.FernandoS272021-07-233-4/+134
|
* shader: Implement TLD4 and TLD4_BFernandoS272021-07-2313-11/+315
|
* shader: Implement SHFLameerj2021-07-2316-69/+284
|
* shader: Track first bindless argument instead of the instruction itselfReinUsesLisp2021-07-231-1/+1
|
* shader: Properly insert Prologue instructionReinUsesLisp2021-07-231-1/+2
|
* shader: Minor style nitsReinUsesLisp2021-07-231-2/+4
|
* shader: Fix F2IFernandoS272021-07-2310-9/+147
|
* shader: Implement NDC [-1, 1], attribute types and default varying initializationReinUsesLisp2021-07-2312-40/+149
|
* shader: Fix use-after-free bug in object_poolReinUsesLisp2021-07-231-3/+3
|
* shader: Implement VOTEameerj2021-07-2314-5/+167
|
* shader: Fix TEX maskReinUsesLisp2021-07-231-1/+3
|
* vk_pipeline_cache: Add pipeline cacheReinUsesLisp2021-07-234-8/+15
|
* shader: Fold interpolation multiplicationsReinUsesLisp2021-07-231-0/+34
|
* shader: Better but still partial interpolation supportReinUsesLisp2021-07-231-5/+7
|
* shader: Implement DMNMX, DSET, DSETPameerj2021-07-2315-59/+208
|
* shader: Implement FADD32IFernandoS272021-07-231-2/+15
|
* shader: Implement F2FFernandoS272021-07-236-20/+192
|
* shader: Add missing fp64 usage flagsReinUsesLisp2021-07-231-0/+34
|
* shader: Implement DMUL and DFMAameerj2021-07-238-30/+111
| | | | Also add a missing const on DADD
* shader: Add FP64 register load/store helpersameerj2021-07-233-21/+24
|
* shader: Add support for fp16 comparisons and misc fixesReinUsesLisp2021-07-2311-14/+56
|
* shader: Fix floating point comparison for FP16FernandoS272021-07-235-32/+56
|
* shader: Implement HSETP2FernandoS272021-07-233-12/+117
|
* shader: Implement HSET2FernandoS272021-07-235-14/+119
|
* shader: Implement HMUL2FernandoS272021-07-233-16/+144
|
* shader: Implement HFMA2FernandoS272021-07-235-20/+192
|
* spirv: Implement VertexId and InstanceId, refactor codeReinUsesLisp2021-07-239-144/+243
|
* shader: Refactor half floating instructionsFernandoS272021-07-234-58/+84
|
* shader: Implement I2FReinUsesLisp2021-07-2316-69/+427
|
* shader: Implement ISCADD (imm)ReinUsesLisp2021-07-231-2/+2
|
* shader: Implement LOP32IReinUsesLisp2021-07-232-18/+45
|
* shader: Add partial rasterizer integrationReinUsesLisp2021-07-2334-156/+629
|
* shader: Implement DADDameerj2021-07-238-14/+132
|
* shader: Implement CSET and CSETPameerj2021-07-236-15/+114
|
* shader: Reorder phi nodes when redefined as undefined opcodesReinUsesLisp2021-07-231-1/+9
|
* shader: Fix instruction transitions in and out of PhiReinUsesLisp2021-07-231-9/+11
|
* shader: Implement FSET and FSETPameerj2021-07-239-94/+204
| | | | Also fix oversight with adding SignedZeroInfNanPreserve execution mode.
* shader: Implement TEXSReinUsesLisp2021-07-238-7/+287
|
* shader: Implement CAL inlining function callsReinUsesLisp2021-07-2324-330/+286
|
* spirv: Add SignedZeroInfNanPreserve logicameerj2021-07-232-0/+8
|
* shader: Implement FMNMXameerj2021-07-238-25/+101
| | | | And add a const in FCMP
* shader: Fix rebase issueReinUsesLisp2021-07-231-1/+0
|
* shader: Implement FCMPameerj2021-07-239-50/+203
| | | | still need to configure some settings for NV denorm flush and intel NaN
* shader: Partial implementation of LDCReinUsesLisp2021-07-2316-50/+405
|
* shader: Initial support for textures and TEXReinUsesLisp2021-07-2329-341/+1378
|
* shader: Implement R2Pameerj2021-07-238-15/+88
|
* shader: Implement SHFameerj2021-07-238-31/+119
|
* shader: Implement LEAameerj2021-07-239-29/+136
|
* shader: Deduplicate HADD2 codeReinUsesLisp2021-07-231-19/+16
|
* shader: Implement I2Iameerj2021-07-233-12/+100
|
* shader: Implement HADD2ReinUsesLisp2021-07-2312-42/+400
|
* shader: Implement LOP and LOP3ameerj2021-07-238-31/+227
|
* shader: Implement IADD3ameerj2021-07-233-12/+104
|
* shader: Implement PSETPameerj2021-07-234-5/+40
|
* Implement PSET, refactor common comparison funcsameerj2021-07-239-101/+88
|
* shader: Implement FLOameerj2021-07-238-18/+75
|
* shader: Implement ISET, add common_funcsameerj2021-07-238-50/+150
|
* shader: Make IMNMX, SHR, SEL stylistically more consistentameerj2021-07-233-5/+5
|
* shader: Implement ICMPameerj2021-07-233-16/+84
|
* shader: Implement IMNMXameerj2021-07-238-12/+105
|
* shader: Implement BFIameerj2021-07-233-16/+57
|
* shader: Implement BFEameerj2021-07-233-12/+67
|
* shader: Implement POPCameerj2021-07-238-12/+59
|
* shader: Implement SHRameerj2021-07-238-18/+80
|
* shader: Implement SELameerj2021-07-234-16/+53
|
* spirv: Move phi arguments emit to a separate functionReinUsesLisp2021-07-231-27/+27
|
* shader: Avoid infinite recursion when tracking global memoryReinUsesLisp2021-07-231-5/+26
|
* shader: Fix conditional execution of exit instructionsReinUsesLisp2021-07-232-5/+6
|
* spirv: Add support for self-referencing phi nodesReinUsesLisp2021-07-231-3/+10
|
* shader: Fix control flowReinUsesLisp2021-07-238-20/+39
|
* shader: Implement more of XMAD and FFMA32I and fix XMAD.CBCCReinUsesLisp2021-07-235-28/+76
|
* shader: FMUL, select, RRO, and MUFU fixesReinUsesLisp2021-07-2318-119/+507
|
* shader: Fix MOV(reg), add SHL variants and emit neg and abs instructionsReinUsesLisp2021-07-234-11/+11
|
* spirv: Fixes and Intel specific workaroundsReinUsesLisp2021-07-2310-32/+43
|
* shader: Rename, implement FADD.SAT and P2R (imm)ReinUsesLisp2021-07-2317-125/+211
|
* shader: Add denorm flush supportReinUsesLisp2021-07-2315-60/+210
|
* spirv: Add lower fp16 to fp32 passReinUsesLisp2021-07-2328-276/+465
|
* shader: Primitive Vulkan integrationReinUsesLisp2021-07-2328-498/+573
|
* shader: Add XMAD multiplication folding optimizationReinUsesLisp2021-07-231-5/+77
|
* shader: Simplify ISCADDReinUsesLisp2021-07-231-6/+1
|
* shader: Add utility to resolve identities on a valueReinUsesLisp2021-07-232-0/+8
|
* spirv: Implement EmitIdentityReinUsesLisp2021-07-232-3/+3
|
* spirv: Initial bindings supportReinUsesLisp2021-07-2322-292/+671
|
* shader: Improve object poolReinUsesLisp2021-07-233-50/+66
|
* shader: Fix trackingReinUsesLisp2021-07-231-50/+72
|
* shader: Add support for forward declarationsReinUsesLisp2021-07-2310-68/+79
|
* shader: Support SSA loops on IRReinUsesLisp2021-07-2312-46/+150
|
* shader: Misc fixesReinUsesLisp2021-07-2310-89/+104
|
* shader: Initial implementation of an ASTReinUsesLisp2021-07-2332-589/+1345
|
* spirv: Initial SPIR-V supportReinUsesLisp2021-07-2318-34/+1400
|
* shader: Better constant foldingReinUsesLisp2021-07-232-13/+48
|
* shader: Properly store phi on InstReinUsesLisp2021-07-236-75/+132
|
* shader: Add pools and rename filesReinUsesLisp2021-07-2330-108/+255
|
* shader: Make typed IRReinUsesLisp2021-07-2319-269/+495
|
* shader: Remove illegal character in SSA passReinUsesLisp2021-07-231-1/+1
|
* shader: Constant propagation and global memory to storage bufferReinUsesLisp2021-07-2317-63/+652
|
* shader: Initial instruction supportReinUsesLisp2021-07-2328-334/+1450
|
* shader: SSA and dominanceReinUsesLisp2021-07-2324-77/+570
|
* shader: Initial recompiler workReinUsesLisp2021-07-2356-0/+7060