summaryrefslogtreecommitdiffstats
path: root/src/video_core/shader/shader_jit_x64.cpp (follow)
Commit message (Collapse)AuthorAgeFilesLines
* VideoCore/Shader: Move entry_point to SetupBatchYuri Kunde Schlesner2017-01-261-4/+6
|
* VideoCore/Shader: Move per-batch ShaderEngine state into ShaderSetupYuri Kunde Schlesner2017-01-261-15/+10
|
* shader_jit_x64: Don't read program from global stateYuri Kunde Schlesner2017-01-261-1/+1
|
* VideoCore/Shader: Move ProduceDebugInfo to InterpreterEngineYuri Kunde Schlesner2017-01-261-5/+0
|
* VideoCore/Shader: Split interpreter and JIT into separate ShaderEnginesYuri Kunde Schlesner2017-01-261-0/+56
|
* VideoCore/Shader: Rename shader_jit_x64{ => _compiler}.{cpp,h}Yuri Kunde Schlesner2017-01-261-888/+0
|
* VideoCore/Shader: Extract DebugData out from UnitStateYuri Kunde Schlesner2016-12-161-2/+2
|
* Remove unnecessary castYuri Kunde Schlesner2016-12-161-3/+1
|
* VideoCore/Shader: Remove dynamic control flow in (Get)UniformOffsetYuri Kunde Schlesner2016-12-161-5/+3
|
* shader_jit_x64: Use LOOPCOUNT_REG as a 64-bit reg when indexingYuri Kunde Schlesner2016-12-151-1/+1
|
* shader_jit_x64: Use Reg32 for LOOP* registers, eliminating castsYuri Kunde Schlesner2016-12-151-16/+16
|
* VideoCore: Convert x64 shader JIT to use Xbyak for assemblyYuri Kunde Schlesner2016-12-151-214/+218
|
* shader_jit: Fix non-SSE4.1 path where FLR would not truncateJannik Vogel2016-12-041-1/+1
|
* shader_jit: Load LOOPCOUNT_REG and LOOPINC 4 bit left-shiftedJannik Vogel2016-12-021-6/+9
|
* Remove special rules for Windows.h and library includesYuri Kunde Schlesner2016-09-211-1/+1
|
* Use negative priorities to avoid special-casing the self-includeYuri Kunde Schlesner2016-09-211-1/+1
|
* Remove empty newlines in #include blocks.Emmanuel Gil Peyrot2016-09-211-5/+1
| | | | | | | This makes clang-format useful on those. Also add a bunch of forgotten transitive includes, which otherwise prevented compilation.
* Manually tweak source formatting and then re-run clang-formatYuri Kunde Schlesner2016-09-191-2/+1
|
* Sources: Run clang-format on everything.Emmanuel Gil Peyrot2016-09-181-99/+113
|
* Refactor access to state in shader-jitJannik Vogel2016-05-131-16/+16
|
* VideoCore: Run include-what-you-use and fix most includes.Emmanuel Gil Peyrot2016-04-301-1/+9
|
* shader: Shader size is long uint, not uint.Sam Spilsbury2016-04-241-1/+1
|
* shader: Handle non-CALL opcodes with a breakSam Spilsbury2016-04-241-0/+2
|
* shader: Format string must be provided inline and not as a variableSam Spilsbury2016-04-241-1/+1
|
* shader_jit_x64: Rename RuntimeAssert to Compile_Assert.bunnei2016-04-141-4/+4
|
* shader_jit_x64.cpp: Rename JitCompiler to JitShader.bunnei2016-04-141-87/+87
|
* shader_jit_x64: Free memory that's no longer needed after compilation.bunnei2016-04-141-0/+6
|
* shader_jit_x64: Use a sorted vector instead of a set for keeping track of return addresses.bunnei2016-04-141-3/+6
|
* shader_jit_x64: Use CALL/RET instead of JMP for subroutines.bunnei2016-04-141-17/+7
|
* shader_jit_x64: Separate initialization and code generation for readability.bunnei2016-04-141-9/+8
|
* shader_jit_x64: Get rid of unnecessary last_program_counter variable.bunnei2016-04-141-5/+2
|
* shader_jit_x64: Execute certain asserts at runtime.bunnei2016-04-141-5/+13
| | | | - This is because we compile the full shader code space, and therefore its common to compile malformed instructions.
* shader_jit_x64: Specify shader main offset at runtime.bunnei2016-04-141-3/+1
|
* shader_jit_x64: Allocate each program independently and persist for emu session.bunnei2016-04-141-9/+8
|
* shader_jit_x64: Rewrite flow control to support arbitrary CALL and JMP instructions.bunnei2016-04-141-30/+92
|
* shader_jit_x64: Fix strict memory aliasing issues.bunnei2016-04-141-1/+3
|
* Merge pull request #1503 from bunnei/clear-jit-cachebunnei2016-03-161-5/+8
|\ | | | | Clear JIT cache
| * shader_jit_x64: Clear cache after code space fills up.bunnei2016-03-121-1/+1
| |
| * shader_jit_x64: Make assert outputs more useful & cleanup formatting.bunnei2016-03-121-4/+7
| |
* | PICA: Fix MAD/MADI encodingJannik Vogel2016-03-151-27/+28
|/
* Add immediate mode vertex submissionDwayne Slater2016-03-031-0/+2
|
* Merge pull request #1367 from yuriks/jit-jmpbunnei2016-01-271-5/+5
|\ | | | | Shader JIT: Fix off-by-one error when compiling JMPs
| * Shader JIT: Fix off-by-one error when compiling JMPsYuri Kunde Schlesner2016-01-241-5/+5
| | | | | | | | | | | | | | There was a mistake in the JMP code which meant that one instruction at the destination would be skipped when the jump was taken. This commit also changes the meaning of the culprit parameter to make it less confusing and avoid similar mistakes in the future.
* | Shader: Implement "invert condition" feature of IFU instructionYuri Kunde Schlesner2016-01-251-1/+3
|/ | | | | | If the bit 0 of the JMPU instruction is set, then the jump condition will be inverted. That is, a jump will happen when the boolean is false instead of when it is true.
* video_core: Remove unused variablesLioncash2015-09-101-1/+0
|
* Shader JIT: Use SCALE constant from emitteraroulin2015-09-071-4/+4
|
* Shader: Fix size_t to int casts of register offsetsaroulin2015-09-071-9/+15
|
* Merge pull request #1088 from aroulin/x64-emitter-abi-callbunnei2015-09-021-26/+17
|\ | | | | x64: Proper stack alignment in shader JIT function calls
| * x64: Proper stack alignment in shader JIT function callsaroulin2015-09-011-26/+17
| | | | | | | | | | Import Dolphin stack handling and register saving routines Also removes the x86 parts from abi files
* | video_core: Fix format specifiers warningsaroulin2015-09-021-1/+2
|/
* Shader JIT: Fix SGE/SGEI NaN behavioraroulin2015-08-311-3/+3
| | | | | SGE was incorrectly emulated w.r.t. NaN behavior as the CMPSS SSE instruction was used with NLT
* Merge pull request #1065 from yuriks/shader-fpYuri Kunde Schlesner2015-08-281-54/+73
|\ | | | | Shader FP compliance fixes
| * Shader JIT: Tiny micro-optimization in DPHYuri Kunde Schlesner2015-08-241-4/+4
| |
| * Shaders: Fix multiplications between 0.0 and infYuri Kunde Schlesner2015-08-241-39/+39
| | | | | | | | | | | | | | | | The PICA200 semantics for multiplication are so that when multiplying inf by exactly 0.0, the result is 0.0, instead of NaN, as defined by IEEE. This is relied upon by games. Fixes #1024 (missing OoT interface items)
| * Shaders: Explicitly conform to PICA semantics in MAX/MINYuri Kunde Schlesner2015-08-241-0/+2
| |
| * Shader JIT: Add name to second scratch register (XMM4)Yuri Kunde Schlesner2015-08-241-3/+5
| |
| * Shader JIT: Fix CMP NaN behavior to match hardwareYuri Kunde Schlesner2015-08-241-8/+23
| |
* | Shader JIT: Fix float to integer rounding in MOVAaroulin2015-08-271-2/+2
| | | | | | | | MOVA converts new address register values from floats to integers using truncation
* | shader_jit: Replace two MDisp usages with MatRLioncash2015-08-241-2/+2
|/
* Merge pull request #1062 from aroulin/shader-rcp-rsqbunnei2015-08-231-4/+6
|\ | | | | Shader: RCP and RSQ computes only the 1st component
| * Shader: RCP and RSQ computes only the 1st componentaroulin2015-08-231-4/+6
| |
* | Shader: implement DPH/DPHI in JITaroulin2015-08-221-2/+35
|/
* Shader: implement SGE, SGEI and SLT in JITaroulin2015-08-191-14/+34
|
* Shader: Save caller-saved registers in JIT before a CALLaroulin2015-08-191-0/+30
|
* Shader: implement EX2 and LG2 in JITaroulin2015-08-171-2/+20
|
* Introduce a shader tracer to allow inspection of input/output values for each processed instruction.Tony Wasserka2015-08-161-4/+4
|
* Shader: Use a POD struct for registers.bunnei2015-08-161-6/+6
|
* Common: Cleanup CPU capability detection code.bunnei2015-08-161-5/+5
|
* Common: Move cpu_detect to x64 directory.bunnei2015-08-161-2/+1
|
* x64: Refactor to remove fake interfaces and general cleanups.bunnei2015-08-161-6/+14
|
* JIT: Support negative address offsets.bunnei2015-08-161-26/+25
|
* Shader: Initial implementation of x86_x64 JIT compiler for Pica vertex shaders.bunnei2015-08-161-0/+669
- Config: Add an option for selecting to use shader JIT or interpreter. - Qt: Add a menu option for enabling/disabling the shader JIT.