Commit message (Collapse) | Author | Files | Lines | ||
---|---|---|---|---|---|
2019-08-21 | shader_ir: Implement VOTE | ReinUsesLisp | 1 | -0/+5 | |
Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated: * anyThreadNV(value) -> value * allThreadsNV(value) -> value * allThreadsEqualNV(value) -> true ballotARB, also known as "uint64_t(activeThreadsNV())", emits VOTE.ANY Rd, PT, PT; on nouveau's compiler. This doesn't match exactly to Nvidia's code VOTE.ALL Rd, PT, PT; Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers. | |||||
2019-07-20 | Shader_Ir: Implement F16 Variants of F2F, F2I, I2F. | Fernando Sahmkow | 1 | -11/+14 | |
This commit takes care of implementing the F16 Variants of the conversion instructions and makes sure conversions are done. | |||||
2019-07-20 | shader/half_set_predicate: Fix HSETP2 implementation | ReinUsesLisp | 1 | -2/+1 | |
2019-07-09 | shader_ir: Unify blocks in decompiled shaders. | Fernando Sahmkow | 1 | -6/+6 | |
2019-07-09 | shader_ir: Implement BRX & BRA.CC | Fernando Sahmkow | 1 | -0/+1 | |
2019-06-21 | texture_cache: Style and Corrections | Fernando Sahmkow | 1 | -1/+2 | |
2019-06-21 | shader: Implement bindless images | ReinUsesLisp | 1 | -0/+9 | |
2019-06-21 | shader: Decode SUST and implement backing image functionality | ReinUsesLisp | 1 | -1/+41 | |
2019-06-07 | shader: Split SSY and PBK stack | ReinUsesLisp | 1 | -1/+6 | |
Hardware testing revealed that SSY and PBK push to a different stack, allowing code like this: SSY label1; PBK label2; SYNC; label1: PBK; label2: EXIT; | |||||
2019-06-07 | shader/node: Minor changes | ReinUsesLisp | 1 | -50/+54 | |
Reflect std::shared_ptr nature of Node on initializers and remove constant members in nodes. Add some commentaries. | |||||
2019-06-07 | shader: Move Node declarations out of the shader IR header | ReinUsesLisp | 1 | -322/+4 | |
Analysis passes do not have a good reason to depend on shader_ir.h to work on top of nodes. This splits node-related declarations to their own file and leaves the IR in shader_ir.h | |||||
2019-06-06 | shader: Use shared_ptr to store nodes and move initialization to file | ReinUsesLisp | 1 | -83/+25 | |
Instead of having a vector of unique_ptr stored in a vector and returning star pointers to this, use shared_ptr. While changing initialization code, move it to a separate file when possible. This is a first step to allow code analysis and node generation beyond the ShaderIR class. | |||||
2019-05-23 | shader/shader_ir: Make Comment() take a std::string by value | Lioncash | 1 | -1/+1 | |
This allows for forming comment nodes without making unnecessary copies of the std::string instance. e.g. previously: Comment(fmt::format("Base address is c[0x{:x}][0x{:x}]", cbuf->GetIndex(), cbuf_offset)); Would result in a copy of the string being created, as CommentNode() takes a std::string by value (a const ref passed to a value parameter results in a copy). Now, only one instance of the string is ever moved around. (fmt::format returns a std::string, and since it's returned from a function by value, this is a prvalue (which can be treated like an rvalue), so it's moved into Comment's string parameter), we then move it into the CommentNode constructor, which then moves the string into its member variable). | |||||
2019-05-21 | shader/memory: Implement LD (generic memory) | ReinUsesLisp | 1 | -4/+2 | |
2019-05-20 | shader: Implement S2R Tid{XYZ} and CtaId{XYZ} | ReinUsesLisp | 1 | -1/+7 | |
2019-05-19 | shader/shader_ir: Remove unnecessary inline specifiers | Lioncash | 1 | -2/+2 | |
constexpr internally links by default, so the inline specifier is unnecessary. | |||||
2019-05-19 | shader/shader_ir: Simplify constructors for OperationNode | Lioncash | 1 | -15/+6 | |
Many of these constructors don't even need to be templated. The only ones that need to be templated are the ones that actually make use of the parameter pack. Even then, since std::vector accepts an initializer list, we can supply the parameter pack directly to it instead of creating our own copy of the list, then copying it again into the std::vector. | |||||
2019-05-19 | shader/shader_ir: Remove unnecessary template parameter packs from Operation() overloads where applicable | Lioncash | 1 | -2/+0 | |
These overloads don't actually make use of the parameter pack, so they can be turned into regular non-template function overloads. | |||||
2019-05-19 | shader/shader_ir: Mark tracking functions as const member functions | Lioncash | 1 | -3/+4 | |
These don't actually modify instance state, so they can be marked as const member functions | |||||
2019-05-19 | shader/shader_ir: Place implementations of constructor and destructor in cpp file | Lioncash | 1 | -5/+2 | |
Given the class contains quite a lot of non-trivial types, place the constructor and destructor within the cpp file to avoid inlining construction and destruction code everywhere the class is used. | |||||
2019-05-03 | shader: Add physical attributes commentaries | ReinUsesLisp | 1 | -2/+4 | |
2019-05-03 | gl_shader_decompiler: Implement GLSL physical attributes | ReinUsesLisp | 1 | -1/+1 | |
2019-05-03 | shader_ir/memory: Implement physical input attributes | ReinUsesLisp | 1 | -3/+17 | |
2019-05-03 | shader: Remove unused AbufNode Ipa mode | ReinUsesLisp | 1 | -16/+4 | |
2019-05-03 | shader_ir/memory: Emit AL2P IR | ReinUsesLisp | 1 | -0/+5 | |
2019-04-26 | shader_ir: Move Sampler index entry in operand< to sort declarations | ReinUsesLisp | 1 | -2/+2 | |
2019-04-26 | shader_ir: Add missing entry to Sampler operand< comparison | ReinUsesLisp | 1 | -2/+3 | |
2019-04-16 | shader_ir/decode: Fix half float pre-operations and remove MetaHalfArithmetic | ReinUsesLisp | 1 | -16/+10 | |
Operations done before the main half float operation (like HAdd) were managing a packed value instead of the unpacked one. Adding an unpacked operation allows us to drop the per-operand MetaHalfArithmetic entry, simplifying the code overall. | |||||
2019-04-16 | shader_ir/decode: Implement half float saturation | ReinUsesLisp | 1 | -0/+3 | |
2019-04-16 | renderer_opengl: Implement half float NaN comparisons | ReinUsesLisp | 1 | -6/+12 | |
2019-04-14 | shader_ir: Implement STG, keep track of global memory usage and flush | ReinUsesLisp | 1 | -3/+13 | |
2019-04-08 | Move ConstBufferAccessor to Maxwell3d, correct mistakes and clang format. | Fernando Sahmkow | 1 | -2/+2 | |
2019-04-08 | Refactor GetTextureCode and GetTexCode to use an optional instead of optional parameters | Fernando Sahmkow | 1 | -11/+9 | |
2019-04-08 | Implement Bindless Handling on SetupTexture | Fernando Sahmkow | 1 | -4/+3 | |
2019-04-08 | Unify both sampler types. | Fernando Sahmkow | 1 | -8/+28 | |
2019-04-08 | Implement Bindless Samplers and TEX_B in the IR. | Fernando Sahmkow | 1 | -9/+22 | |
2019-03-30 | shader_ir/decode: Implement AOFFI for TEX and TLD4 | ReinUsesLisp | 1 | -3/+6 | |
2019-03-30 | shader_ir: Implement immediate register tracking | ReinUsesLisp | 1 | -0/+3 | |
2019-02-26 | shader/decode: Remove extras from MetaTexture | ReinUsesLisp | 1 | -1/+3 | |
2019-02-26 | shader/decode: Split memory and texture instructions decoding | ReinUsesLisp | 1 | -0/+1 | |
2019-02-07 | shader_ir: Remove F4 prefix to texture operations | ReinUsesLisp | 1 | -6/+6 | |
This was originally included because texture operations returned a vec4. These operations now return a single float and the F4 prefix doesn't mean anything. | |||||
2019-02-07 | shader_ir: Clean texture management code | ReinUsesLisp | 1 | -5/+5 | |
Previous code relied on GLSL parameter order (something that's always ill-formed on an IR design). This approach passes spatial coordiantes through operation nodes and array and depth compare values in the the texture metadata. It still contains an "extra" vector containing generic nodes for bias and component index (for example) which is still a bit ill-formed but it should be better than the previous approach. | |||||
2019-02-07 | gl_shader_disk_cache: Save GLSL and entries into the precompiled file | ReinUsesLisp | 1 | -0/+9 | |
2019-02-03 | shader_ir: Rename BasicBlock to NodeBlock | ReinUsesLisp | 1 | -46/+46 | |
It's not always used as a basic block. Rename it for consistency. | |||||
2019-02-03 | shader_ir: Pass decoded nodes as a whole instead of per basic blocks | ReinUsesLisp | 1 | -25/+26 | |
Some games call LDG at the top of a basic block, making the tracking heuristic to fail. This commit lets the heuristic the decoded nodes as a whole instead of per basic blocks. This may lead to some false positives but allows it the heuristic to track cases it previously couldn't. | |||||
2019-01-30 | shader_ir: Unify constant buffer offset values | ReinUsesLisp | 1 | -1/+1 | |
Constant buffer values on the shader IR were using different offsets if the access direct or indirect. cbuf34 has a non-multiplied offset while cbuf36 does. On shader decoding this commit multiplies it by four on cbuf34 queries. | |||||
2019-01-30 | shader_decode: Implement LDG and basic cbuf tracking | ReinUsesLisp | 1 | -4/+34 | |
2019-01-28 | shader/shader_ir: Amend three comment typos | Lioncash | 1 | -3/+3 | |
Given we're in the area, these are three trivial typos that can be corrected. | |||||
2019-01-28 | shader/shader_ir: Amend constructor initializer ordering for AbufNode | Lioncash | 1 | -2/+2 | |
Orders the class members in the same order that they would actually be initialized in. Gets rid of two compiler warnings. | |||||
2019-01-15 | shader_ir: Pass to decoder functions basic block's code | ReinUsesLisp | 1 | -25/+25 | |
2019-01-15 | shader_decode: Improve zero flag implementation | ReinUsesLisp | 1 | -2/+7 | |
2019-01-15 | shader_ir: Remove composite primitives and use temporals instead | ReinUsesLisp | 1 | -30/+30 | |
2019-01-15 | shader_decode: Use proper primitive names | ReinUsesLisp | 1 | -7/+5 | |
2019-01-15 | shader_decode: Use BitfieldExtract instead of shift + and | ReinUsesLisp | 1 | -2/+7 | |
2019-01-15 | shader_ir: Remove Ipa primitive | ReinUsesLisp | 1 | -2/+0 | |
2019-01-15 | shader_ir: Remove RZ and use Register::ZeroIndex instead | ReinUsesLisp | 1 | -2/+0 | |
2019-01-15 | shader_decode: Implement TEXS.F16 | ReinUsesLisp | 1 | -2/+6 | |
2019-01-15 | video_core: Implement IR based geometry shaders | ReinUsesLisp | 1 | -0/+3 | |
2019-01-15 | shader_decode: Implement VMAD and VSETP | ReinUsesLisp | 1 | -0/+4 | |
2019-01-15 | shader_decode: Implement HSET2 | ReinUsesLisp | 1 | -0/+1 | |
2019-01-15 | shader_decode: Rework HSETP2 | ReinUsesLisp | 1 | -7/+8 | |
2019-01-15 | shader_decode: Implement HFMA2 | ReinUsesLisp | 1 | -0/+1 | |
2019-01-15 | shader_decode: Implement POPC | ReinUsesLisp | 1 | -1/+3 | |
2019-01-15 | shader_decode: Implement TLDS (untested) | ReinUsesLisp | 1 | -0/+4 | |
2019-01-15 | shader_decode: Update TLD4 reflecting #1862 changes | ReinUsesLisp | 1 | -0/+3 | |
2019-01-15 | shader_ir: Fixup TEX and TEXS and partially fix TLD4 decompiling | ReinUsesLisp | 1 | -1/+3 | |
2019-01-15 | video_core: Address feedback | ReinUsesLisp | 1 | -11/+10 | |
2019-01-15 | shader_ir: Fixup file inclusions and clang-format | ReinUsesLisp | 1 | -1/+0 | |
2019-01-15 | shader_ir: Move comment node string | Mat M | 1 | -2/+2 | |
Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | |||||
2019-01-15 | shader_ir: Address feedback to avoid UB in bit casting | ReinUsesLisp | 1 | -2/+4 | |
2019-01-15 | shader_decode: Implement LOP3 | ReinUsesLisp | 1 | -0/+2 | |
2019-01-15 | shader_decode: Implement LOP32I | ReinUsesLisp | 1 | -0/+5 | |
2019-01-15 | shader_decode: Implement TEX and TXQ | ReinUsesLisp | 1 | -0/+4 | |
2019-01-15 | shader_decode: Implement TEXS (F32) | ReinUsesLisp | 1 | -0/+18 | |
2019-01-15 | shader_ir: Add condition code helper | ReinUsesLisp | 1 | -0/+3 | |
2019-01-15 | shader_ir: Add predicate combiner helper | ReinUsesLisp | 1 | -0/+3 | |
2019-01-15 | shader_ir: Add comparison helpers | ReinUsesLisp | 1 | -0/+9 | |
2019-01-15 | shader_ir: Add half float helpers | ReinUsesLisp | 1 | -0/+7 | |
2019-01-15 | shader_ir: Add integer helpers | ReinUsesLisp | 1 | -0/+5 | |
2019-01-15 | shader_ir: Add float helpers | ReinUsesLisp | 1 | -0/+5 | |
2019-01-15 | shader_ir: Add setters | ReinUsesLisp | 1 | -0/+8 | |
2019-01-15 | shader_ir: Add local memory getters | ReinUsesLisp | 1 | -0/+3 | |
2019-01-15 | shader_ir: Add internal flag getters | ReinUsesLisp | 1 | -0/+2 | |
2019-01-15 | shader_ir: Add attribute getters | ReinUsesLisp | 1 | -0/+5 | |
2019-01-15 | shader_ir: Add constant buffer getters | ReinUsesLisp | 1 | -0/+4 | |
2019-01-15 | shader_ir: Add register getter | ReinUsesLisp | 1 | -0/+2 | |
2019-01-15 | shader_ir: Add immediate node constructors | ReinUsesLisp | 1 | -1/+18 | |
2019-01-15 | shader_ir: Initial implementation | ReinUsesLisp | 1 | -0/+662 | |