llvm-6502/lib
Jakob Stoklund Olesen 352aa503fa Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings.
On Nehalem and newer CPUs there is a 2 cycle latency penalty on using a register
in a different domain than where it was defined. Some instructions have
equvivalents for different domains, like por/orps/orpd.

The SSEDomainFix pass tries to minimize the number of domain crossings by
changing between equvivalent opcodes where possible.

This is a work in progress, in particular the pass doesn't do anything yet. SSE
instructions are tagged with their execution domain in TableGen using the last
two bits of TSFlags. Note that not all instructions are tagged correctly. Life
just isn't that simple.

The SSE execution domain issue is very similar to the ARM NEON/VFP pipeline
issue handled by NEONMoveFixPass. This pass may become target independent to
handle both.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@99524 91177308-0d34-0410-b5e6-96231b3b80d8
2010-03-25 17:25:00 +00:00
..
Analysis Reapply r99451 with a fix to move the NoInline check to the cost functions 2010-03-25 04:49:10 +00:00
Archive From PR6228: 2010-02-04 06:19:43 +00:00
AsmParser Introduce isOpaqueTy and use it rather than isa<OpaqueType>. Also, move some 2010-02-16 14:50:09 +00:00
Bitcode Finally land the InvokeInst operand reordering. 2010-03-24 13:21:49 +00:00
CodeGen Add comment. 2010-03-25 15:09:44 +00:00
CompilerDriver Use FindExecutable as a fall-back search method. 2010-03-05 04:46:28 +00:00
ExecutionEngine Avoid a memory leak in JITDebugRegisterer. 2010-03-16 05:54:54 +00:00
Linker Introduce isOpaqueTy and use it rather than isa<OpaqueType>. Also, move some 2010-02-16 14:50:09 +00:00
MC MC/Mach-O: Switch to MCSectionData::getOrdinal. 2010-03-25 08:08:54 +00:00
Support Fix minor style issues. 2010-03-24 19:38:02 +00:00
System Fix a false-positive memory leak in code using RemoveFileOnSignal(). Because 2010-03-17 07:08:12 +00:00
Target Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. 2010-03-25 17:25:00 +00:00
Transforms fix PR6642, GVN forwarding from memset to load of the base of the memset. 2010-03-25 05:58:19 +00:00
VMCore Include isFunctionLocal while calculating folding node set profile for a MDNode. 2010-03-25 06:04:47 +00:00
Makefile