Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | shader_jit_x64: Use LOOPCOUNT_REG as a 64-bit reg when indexing | Yuri Kunde Schlesner | 2016-12-15 | 1 | -1/+1 |
| | |||||
* | VideoCore: Eliminate an unnecessary copy in the drawcall loop | Yuri Kunde Schlesner | 2016-12-15 | 2 | -2/+2 |
| | |||||
* | shader_jit_x64: Use Reg32 for LOOP* registers, eliminating casts | Yuri Kunde Schlesner | 2016-12-15 | 1 | -16/+16 |
| | |||||
* | VideoCore: Convert x64 shader JIT to use Xbyak for assembly | Yuri Kunde Schlesner | 2016-12-15 | 2 | -223/+225 |
| | |||||
* | shader_jit: Fix non-SSE4.1 path where FLR would not truncate | Jannik Vogel | 2016-12-04 | 1 | -1/+1 |
| | |||||
* | shader_jit: Load LOOPCOUNT_REG and LOOPINC 4 bit left-shifted | Jannik Vogel | 2016-12-02 | 1 | -6/+9 |
| | |||||
* | VideoCore: Shader interpreter cleanups | Yuri Kunde Schlesner | 2016-09-30 | 1 | -32/+42 |
| | |||||
* | VideoCore: Fix out-of-bounds read in ShaderSetup::ProduceDebugInfo | Yuri Kunde Schlesner | 2016-09-30 | 1 | -3/+1 |
| | | | | | | As far as I can tell, memset was replaced by a fill without correcting the parameter type, causing an out-of-bounds array read in the Vec4 constructor. | ||||
* | Remove special rules for Windows.h and library includes | Yuri Kunde Schlesner | 2016-09-21 | 1 | -1/+1 |
| | |||||
* | Use negative priorities to avoid special-casing the self-include | Yuri Kunde Schlesner | 2016-09-21 | 3 | -3/+3 |
| | |||||
* | Remove empty newlines in #include blocks. | Emmanuel Gil Peyrot | 2016-09-21 | 5 | -22/+3 |
| | | | | | | | This makes clang-format useful on those. Also add a bunch of forgotten transitive includes, which otherwise prevented compilation. | ||||
* | Manually tweak source formatting and then re-run clang-format | Yuri Kunde Schlesner | 2016-09-19 | 4 | -9/+6 |
| | |||||
* | Sources: Run clang-format on everything. | Emmanuel Gil Peyrot | 2016-09-18 | 6 | -311/+335 |
| | |||||
* | VideoCore: Fix dangling lambda context in shader interpreter | Yuri Kunde Schlesner | 2016-09-16 | 1 | -1/+1 |
| | | | | | | The static meant that after the first execution, these lambda context would be pointing to a random location on the stack. Fixes a random crash when using the interpreter. | ||||
* | Retrieve shader result from new OutputRegisters-type | Jannik Vogel | 2016-05-16 | 3 | -56/+68 |
| | |||||
* | Use new shader-jit signature for interpreter | Jannik Vogel | 2016-05-13 | 3 | -8/+8 |
| | |||||
* | Refactor access to state in shader-jit | Jannik Vogel | 2016-05-13 | 4 | -24/+42 |
| | |||||
* | Move program_counter and call_stack from UnitState to interpreter | Jannik Vogel | 2016-05-12 | 3 | -45/+42 |
| | |||||
* | Move default_attributes into Pica state | Jannik Vogel | 2016-05-12 | 1 | -2/+0 |
| | |||||
* | Merge pull request #1690 from JayFoxRox/tex-type-3 | bunnei | 2016-05-12 | 1 | -1/+2 |
|\ | | | | | Pica: Implement texture type 3 (Projection2D) | ||||
| * | Pica: Add tc0.w to OutputVertex | Jannik Vogel | 2016-05-11 | 1 | -1/+2 |
| | | |||||
* | | Turn ShaderSetup into struct | Jannik Vogel | 2016-05-11 | 2 | -52/+53 |
|/ | |||||
* | Pica: Replace logic in shader.cpp with loop | Jannik Vogel | 2016-05-03 | 1 | -34/+4 |
| | |||||
* | VideoCore: Run include-what-you-use and fix most includes. | Emmanuel Gil Peyrot | 2016-04-30 | 6 | -14/+43 |
| | |||||
* | Merge pull request #1730 from hrydgard/vertex-loader | bunnei | 2016-04-29 | 1 | -1/+1 |
|\ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Remove late accesses to attribute_config * Refactor: Extract VertexLoader from command_processor.cpp. Preparation for a similar concept to Dolphin or PPSSPP. These can be JIT-ed and cached. * Move "&" to their proper place, add missing includes and make some properly relative. * Don't keep base_address in the loader, it doesn't belong there (with it, the loader can't be cached). * Optimize the vertex loader, nearly doubling its speed. * Debugger fix * Move and rename the MemoryAccesses class to MemoryAccessTracker. | ||||
| * | Refactor: Extract VertexLoader from command_processor.cpp. | Henrik Rydgard | 2016-04-28 | 1 | -1/+1 |
| | | | | | | | | Preparation for a similar concept to Dolphin or PPSSPP. These can be JIT-ed and cached. | ||||
* | | Common: Remove section measurement from profiler (#1731) | Yuri Kunde Schlesner | 2016-04-29 | 1 | -3/+0 |
| | | | | | | | | This has been entirely superseded by MicroProfile. The rest of the code can go when a simpler frametime/FPS meter is added to the GUI. | ||||
* | | shader: Shader size is long uint, not uint. | Sam Spilsbury | 2016-04-24 | 1 | -1/+1 |
| | | |||||
* | | shader: Handle non-CALL opcodes with a break | Sam Spilsbury | 2016-04-24 | 1 | -0/+2 |
| | | |||||
* | | shader: Format string must be provided inline and not as a variable | Sam Spilsbury | 2016-04-24 | 1 | -1/+1 |
|/ | |||||
* | shader_jit_x64: Rename RuntimeAssert to Compile_Assert. | bunnei | 2016-04-14 | 2 | -5/+5 |
| | |||||
* | shader_jit_x64.cpp: Rename JitCompiler to JitShader. | bunnei | 2016-04-14 | 3 | -92/+92 |
| | |||||
* | shader_jit_x64: Free memory that's no longer needed after compilation. | bunnei | 2016-04-14 | 1 | -0/+6 |
| | |||||
* | shader_jit_x64: Use a sorted vector instead of a set for keeping track of return addresses. | bunnei | 2016-04-14 | 2 | -5/+8 |
| | |||||
* | shader_jit_x64: Use CALL/RET instead of JMP for subroutines. | bunnei | 2016-04-14 | 1 | -17/+7 |
| | |||||
* | shader_jit_x64: Separate initialization and code generation for readability. | bunnei | 2016-04-14 | 1 | -9/+8 |
| | |||||
* | shader_jit_x64: Get rid of unnecessary last_program_counter variable. | bunnei | 2016-04-14 | 2 | -6/+2 |
| | |||||
* | shader_jit_x64: Execute certain asserts at runtime. | bunnei | 2016-04-14 | 2 | -5/+19 |
| | | | | - This is because we compile the full shader code space, and therefore its common to compile malformed instructions. | ||||
* | shader: Remove unused 'state' argument from 'Setup' function. | bunnei | 2016-04-14 | 2 | -3/+2 |
| | |||||
* | shader_jit_x64: Specify shader main offset at runtime. | bunnei | 2016-04-14 | 3 | -10/+6 |
| | |||||
* | shader_jit_x64: Allocate each program independently and persist for emu session. | bunnei | 2016-04-14 | 3 | -38/+28 |
| | |||||
* | shader_jit_x64: Rewrite flow control to support arbitrary CALL and JMP instructions. | bunnei | 2016-04-14 | 2 | -35/+119 |
| | |||||
* | shader_jit_x64: Fix strict memory aliasing issues. | bunnei | 2016-04-14 | 1 | -1/+3 |
| | |||||
* | Merge pull request #1643 from MerryMage/make_unique | Mathew Maidment | 2016-04-06 | 1 | -1/+0 |
|\ | | | | | Common: Remove Common::make_unique, use std::make_unique | ||||
| * | Common: Remove Common::make_unique, use std::make_unique | MerryMage | 2016-04-05 | 1 | -1/+0 |
| | | |||||
* | | Merge pull request #1508 from JayFoxRox/vs-output-map | bunnei | 2016-03-22 | 1 | -4/+14 |
|\ \ | |/ |/| | Respect vs output map | ||||
| * | Respect vs output map | Jannik Vogel | 2016-03-14 | 1 | -4/+14 |
| | | |||||
* | | Merge pull request #1538 from lioncash/dot | bunnei | 2016-03-20 | 1 | -5/+3 |
|\ \ | | | | | | | shader_interpreter: use std::inner_product for the dot product | ||||
| * | | shader_interpreter: use std::inner_product for the dot product | Lioncash | 2016-03-17 | 1 | -5/+3 |
| | | | | | | | | | | | | Same thing, less code. | ||||
* | | | video_core: Don't cast away const | Lioncash | 2016-03-17 | 1 | -1/+1 |
|/ / | |||||
* | | Merge pull request #1503 from bunnei/clear-jit-cache | bunnei | 2016-03-16 | 3 | -7/+27 |
|\ \ | | | | | | | Clear JIT cache | ||||
| * | | shader_jit_x64: Clear cache after code space fills up. | bunnei | 2016-03-12 | 3 | -2/+19 |
| | | | |||||
| * | | shader_jit_x64: Make assert outputs more useful & cleanup formatting. | bunnei | 2016-03-12 | 1 | -4/+7 |
| | | | |||||
| * | | shader: Update log message to use proper log class. | bunnei | 2016-03-12 | 1 | -1/+1 |
| |/ | |||||
* / | PICA: Fix MAD/MADI encoding | Jannik Vogel | 2016-03-15 | 2 | -29/+33 |
|/ | |||||
* | Common: Get rid of alignment macros | Lioncash | 2016-03-09 | 1 | -4/+4 |
| | | | | | The gl rasterizer already uses alignas, so we may as well move everything over. | ||||
* | Add immediate mode vertex submission | Dwayne Slater | 2016-03-03 | 4 | -2/+22 |
| | |||||
* | pica: Implement decoding of basic fragment lighting components. | bunnei | 2016-02-05 | 2 | -5/+9 |
| | | | | | | | - Diffuse - Distance attenuation - float16/float20 types - Vertex Shader 'view' output | ||||
* | Merge pull request #1367 from yuriks/jit-jmp | bunnei | 2016-01-27 | 2 | -6/+6 |
|\ | | | | | Shader JIT: Fix off-by-one error when compiling JMPs | ||||
| * | Shader JIT: Fix off-by-one error when compiling JMPs | Yuri Kunde Schlesner | 2016-01-24 | 2 | -6/+6 |
| | | | | | | | | | | | | | | There was a mistake in the JMP code which meant that one instruction at the destination would be skipped when the jump was taken. This commit also changes the meaning of the culprit parameter to make it less confusing and avoid similar mistakes in the future. | ||||
* | | Shader: Implement "invert condition" feature of IFU instruction | Yuri Kunde Schlesner | 2016-01-25 | 2 | -2/+5 |
|/ | | | | | | If the bit 0 of the JMPU instruction is set, then the jump condition will be inverted. That is, a jump will happen when the boolean is false instead of when it is true. | ||||
* | video_core: Reorganize headers | Lioncash | 2015-09-11 | 3 | -6/+4 |
| | |||||
* | video_core: Remove unnecessary includes from headers | Lioncash | 2015-09-11 | 1 | -2/+0 |
| | |||||
* | video_core: Remove unused variables | Lioncash | 2015-09-10 | 2 | -2/+0 |
| | |||||
* | Shader JIT: Use SCALE constant from emitter | aroulin | 2015-09-07 | 1 | -4/+4 |
| | |||||
* | Shader: Fix size_t to int casts of register offsets | aroulin | 2015-09-07 | 2 | -15/+21 |
| | |||||
* | Merge pull request #1088 from aroulin/x64-emitter-abi-call | bunnei | 2015-09-02 | 2 | -28/+18 |
|\ | | | | | x64: Proper stack alignment in shader JIT function calls | ||||
| * | x64: Proper stack alignment in shader JIT function calls | aroulin | 2015-09-01 | 2 | -28/+18 |
| | | | | | | | | | | Import Dolphin stack handling and register saving routines Also removes the x86 parts from abi files | ||||
* | | video_core: Fix format specifiers warnings | aroulin | 2015-09-02 | 1 | -1/+2 |
|/ | |||||
* | Shader JIT: Fix SGE/SGEI NaN behavior | aroulin | 2015-08-31 | 1 | -3/+3 |
| | | | | | SGE was incorrectly emulated w.r.t. NaN behavior as the CMPSS SSE instruction was used with NLT | ||||
* | Merge pull request #1065 from yuriks/shader-fp | Yuri Kunde Schlesner | 2015-08-28 | 3 | -56/+87 |
|\ | | | | | Shader FP compliance fixes | ||||
| * | Shader JIT: Tiny micro-optimization in DPH | Yuri Kunde Schlesner | 2015-08-24 | 1 | -4/+4 |
| | | |||||
| * | Shaders: Fix multiplications between 0.0 and inf | Yuri Kunde Schlesner | 2015-08-24 | 2 | -39/+45 |
| | | | | | | | | | | | | | | | | The PICA200 semantics for multiplication are so that when multiplying inf by exactly 0.0, the result is 0.0, instead of NaN, as defined by IEEE. This is relied upon by games. Fixes #1024 (missing OoT interface items) | ||||
| * | Shaders: Explicitly conform to PICA semantics in MAX/MIN | Yuri Kunde Schlesner | 2015-08-24 | 2 | -2/+10 |
| | | |||||
| * | Shader JIT: Add name to second scratch register (XMM4) | Yuri Kunde Schlesner | 2015-08-24 | 1 | -3/+5 |
| | | |||||
| * | Shader JIT: Fix CMP NaN behavior to match hardware | Yuri Kunde Schlesner | 2015-08-24 | 1 | -8/+23 |
| | | |||||
* | | Shader JIT: Fix float to integer rounding in MOVA | aroulin | 2015-08-27 | 1 | -2/+2 |
| | | | | | | | | MOVA converts new address register values from floats to integers using truncation | ||||
* | | Shader JIT: ifdef out reference to ifdef'd out shader_map | archshift | 2015-08-27 | 1 | -0/+2 |
| | | | | | | | | | | shader_map was only defined on x86 architectures, but was cleared on shutdown with no ifdef protection. Ifdef this out so non-x86 architectures can be built. | ||||
* | | Integrate the MicroProfile profiling library | Yuri Kunde Schlesner | 2015-08-25 | 1 | -0/+3 |
| | | | | | | | | | | This brings goodies such as a configurable user interface and multi-threaded timeline view. | ||||
* | | shader_jit: Replace two MDisp usages with MatR | Lioncash | 2015-08-24 | 1 | -2/+2 |
|/ | |||||
* | Merge pull request #1062 from aroulin/shader-rcp-rsq | bunnei | 2015-08-23 | 2 | -10/+10 |
|\ | | | | | Shader: RCP and RSQ computes only the 1st component | ||||
| * | Shader: Use std::sqrt for float instead of sqrt | aroulin | 2015-08-23 | 1 | -1/+1 |
| | | |||||
| * | Shader: RCP and RSQ computes only the 1st component | aroulin | 2015-08-23 | 2 | -10/+10 |
| | | |||||
* | | Shader: implement DPH/DPHI in JIT | aroulin | 2015-08-22 | 2 | -2/+36 |
| | | |||||
* | | Shader: implement DPH/DPHI in interpreter | aroulin | 2015-08-22 | 1 | -1/+8 |
|/ | | | | | Tests revealed that the component with w=1 is SRC1 and not SRC2, it is now fixed on 3dbrew. | ||||
* | Shader: implement SGE, SGEI and SLT in JIT | aroulin | 2015-08-19 | 2 | -15/+36 |
| | |||||
* | Shader: implement SGE, SGEI in interpreter | aroulin | 2015-08-19 | 1 | -0/+14 |
| | |||||
* | Shader: Save caller-saved registers in JIT before a CALL | aroulin | 2015-08-19 | 2 | -0/+33 |
| | |||||
* | Shader: implement EX2 and LG2 in JIT | aroulin | 2015-08-17 | 2 | -2/+22 |
| | |||||
* | Shader: implement EX2 and LG2 in interpreter | aroulin | 2015-08-16 | 1 | -0/+36 |
| | |||||
* | Build fix for Debug configurations. | Tony Wasserka | 2015-08-16 | 1 | -1/+1 |
| | |||||
* | Introduce a shader tracer to allow inspection of input/output values for each processed instruction. | Tony Wasserka | 2015-08-16 | 5 | -37/+322 |
| | |||||
* | citra-qt: Improve shader debugger. | Tony Wasserka | 2015-08-16 | 1 | -6/+0 |
| | | | | Now supports dumping the current shader and recognizes a larger number of output semantics. | ||||
* | Shader: Use a POD struct for registers. | bunnei | 2015-08-16 | 5 | -40/+43 |
| | |||||
* | Rename ARCHITECTURE_X64 definition to ARCHITECTURE_x86_64. | bunnei | 2015-08-16 | 1 | -6/+5 |
| | |||||
* | Common: Cleanup CPU capability detection code. | bunnei | 2015-08-16 | 1 | -5/+5 |
| | |||||
* | Common: Move cpu_detect to x64 directory. | bunnei | 2015-08-16 | 1 | -2/+1 |
| | |||||
* | x64: Refactor to remove fake interfaces and general cleanups. | bunnei | 2015-08-16 | 5 | -144/+22 |
| | |||||
* | JIT: Support negative address offsets. | bunnei | 2015-08-16 | 1 | -26/+25 |
| | |||||
* | Shader: Initial implementation of x86_x64 JIT compiler for Pica vertex shaders. | bunnei | 2015-08-16 | 6 | -2/+924 |
| | | | | | - Config: Add an option for selecting to use shader JIT or interpreter. - Qt: Add a menu option for enabling/disabling the shader JIT. | ||||
* | Common: Added MurmurHash3 hash function for general-purpose use. | bunnei | 2015-08-15 | 1 | -1/+1 |
| | |||||
* | Shader: Define a common interface for running vertex shader programs. | bunnei | 2015-08-15 | 4 | -184/+278 |
| | |||||
* | Shader: Move shader code to its own subdirectory, "shader". | bunnei | 2015-08-15 | 2 | -0/+701 |