llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-12-15 04:30:12 +00:00

History

Adam Nemet d761cc1dfa [LoopStrengthReduce] Don't trim formula that uses a subset of required registers Consider this use from the new testcase: LSR Use: Kind=ICmpZero, Offsets={0}, widest fixup type: i32 reg({1000,+,-1}<nw><%for.body>) -3003 + reg({3,+,3}<nw><%for.body>) -1001 + reg({1,+,1}<nuw><nsw><%for.body>) -1000 + reg({0,+,1}<nw><%for.body>) -3000 + reg({0,+,3}<nuw><%for.body>) reg({-1000,+,1}<nw><%for.body>) reg({-3000,+,3}<nsw><%for.body>) This is the last use we consider for a solution in SolveRecurse, so CurRegs is a large set. (CurRegs is the set of registers that are needed by the previously visited uses in the in-progress solution.) ReqRegs is { {3,+,3}<nw><%for.body>, {1,+,1}<nuw><nsw><%for.body> } This is the intersection of the regs used by any of the formulas for the current use and CurRegs. Now, the code requires a formula to contain all these regs (the comment is simply wrong), otherwise the formula is immediately disqualified. Obviously, no formula for this use contains two regs so they will all get disqualified. The fix modifies the check to allow the formula in this case. The idea is that neither of these formulae is introducing any new registers which is the point of this early pruning as far as I understand. In terms of set arithmetic, we now allow formulas whose used regs are a subset of the required regs not just the other way around. There are few more loops in the test-suite that are now successfully LSRed. I have benchmarked those and found very minimal change. Fixes <rdar://problem/13965777> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@207271 91177308-0d34-0410-b5e6-96231b3b80d8		2014-04-25 21:02:21 +00:00
..
ADCE
AddDiscriminators	Fix bug 19437 - Only add discriminators for DWARF 4 and above.	2014-04-17 22:33:50 +00:00
ArgumentPromotion	Update optimization passes to handle inalloca arguments	2014-01-28 02:38:36 +00:00
AtomicExpandLoadLinked/ARM	Atomics: promote ARM's IR-based atomics pass to CodeGen.	2014-04-17 18:22:47 +00:00
BBVectorize	Allow vectorization of bit intrinsics in BB Vectorizer.	2014-04-25 03:33:48 +00:00
BranchFolding	Re-commit: Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call	2014-02-13 14:44:26 +00:00
CodeExtractor
CodeGenPrepare	CodeGenPrep: sink extends of illegal types into use block.	2014-03-13 13:36:25 +00:00
ConstantHoisting	[Constant Hoisting] Materialize the constant before the cloned cast instruction.	2014-04-22 18:06:58 +00:00
ConstantMerge	Remove the linker_private and linker_private_weak linkages.	2014-03-13 23:18:37 +00:00
ConstProp
CorrelatedValuePropagation
DeadArgElim	IR: Change inalloca's grammar a bit	2014-03-09 06:41:58 +00:00
DeadStoreElimination	Update optimization passes to handle inalloca arguments	2014-01-28 02:38:36 +00:00
DebugIR
EarlyCSE
FunctionAttrs	Update optimization passes to handle inalloca arguments	2014-01-28 02:38:36 +00:00
GCOVProfiling	Debug Info: update testing cases to specify the debug info version number.	2013-11-22 21:49:45 +00:00
GlobalDCE
GlobalMerge	ARM64: initial backend import	2014-03-29 10:18:08 +00:00
GlobalOpt	Prevent alias from pointing to weak aliases.	2014-03-27 15:26:56 +00:00
GVN	Revert "GVN: merge overflow intrinsics with non-overflow instructions."	2014-03-28 14:42:34 +00:00
IndVarSimplify	[LPM] Fix PR18642, a pretty nasty bug in IndVars that "never mattered"	2014-01-29 04:40:19 +00:00
Inline	[inline cold threshold] Command line argument for inline threshold will	2014-04-25 17:34:55 +00:00
InstCombine	[InstCombine][x86] Constant fold psll intrinsics.	2014-04-24 00:58:18 +00:00
InstSimplify	InstSimplify: Make shift, select and GEP simplifications vector-aware.	2014-01-24 17:09:53 +00:00
Internalize	Correct word hyphenations	2013-12-05 05:44:44 +00:00
IPConstantProp
JumpThreading
LCSSA
LICM	[LPM] Switch LICM to actively use LCSSA in addition to preserving it.	2014-02-11 12:52:27 +00:00
LoopDeletion
LoopIdiom	Debug Info: update testing cases to specify the debug info version number.	2013-11-23 01:16:29 +00:00
LoopReroll	Fix loop rerolling pass failure with non-consant loop lower bound	2014-01-03 17:20:01 +00:00
LoopRotate	[LPM] Fix PR18643, another scary place where loop transforms failed to	2014-01-29 13:16:53 +00:00
LoopSimplify	[LPM] Switch LICM to actively use LCSSA in addition to preserving it.	2014-02-11 12:52:27 +00:00
LoopStrengthReduce	[LoopStrengthReduce] Don't trim formula that uses a subset of required registers	2014-04-25 21:02:21 +00:00
LoopUnroll	Implement X86TTI::getUnrollingPreferences	2014-04-01 18:50:34 +00:00
LoopUnswitch
LoopVectorize	[CLNUP] Test commit. Remove newline.	2014-04-24 08:42:58 +00:00
LowerAtomic	IR: add a second ordering operand to cmpxhg for failure	2014-03-11 10:48:52 +00:00
LowerExpectIntrinsic	Lower llvm.expect intrinsic correctly for i1	2014-02-02 22:43:55 +00:00
LowerInvoke	Remove LowerInvoke's obsolete "-enable-correct-eh-support" option	2014-03-20 19:54:47 +00:00
LowerSwitch
Mem2Reg	Debug Info: update testing cases to specify the debug info version number.	2013-11-22 21:49:45 +00:00
MemCpyOpt	Treat lifetime.start'd memory like we treat freshly alloca'd memory. Patch by Björn Steinbrink!	2014-03-26 23:45:15 +00:00
MergeFunc	PR17925 bugfix.	2013-11-26 16:11:03 +00:00
MetaRenamer	Reject alias to undefined symbols in the verifier.	2014-03-12 20:15:49 +00:00
ObjCARC	Fix use_iterator crash in ObjCArc from r203364	2014-03-18 22:32:43 +00:00
PhaseOrdering
PruneEH
Reassociate
Reg2Mem
SampleProfile	Tolerate unmangled names in sample profiles.	2014-03-18 12:03:12 +00:00
Scalarizer	Fix Scalarizer insertion point when replacing PHIs with insertelements	2013-12-23 14:51:56 +00:00
ScalarRepl	Fix PR18800. llvm intrinsic memcpy takes 5 arguments void @llvm.memcpy.p0i8.p0i8.i32(i8* <dest>, i8* <src>, i32 <len>, i32 <align>, i1 <isvolatile>).The test case incorrectly uses the old format resulting in isVolatile function in MemIntrinsic to crash during SROA transformation.Modified the test case to use correct signature of memcpy and memset.	2014-03-13 04:50:29 +00:00
SCCP
SimplifyCFG	Allow switch-to-lookup table for tables with holes by adding bitmask check	2014-03-12 18:35:40 +00:00
Sink	Sink: Don't sink static allocas from the entry block	2014-03-21 15:51:51 +00:00
SLPVectorizer	Reapply "SLPVectorizer: Ignore users that are insertelements we can reschedule them"	2014-04-10 13:41:35 +00:00
SROA	[SROA] Use the correct index integer size in GEPs through non-default	2014-02-26 10:08:16 +00:00
StripSymbols	Add a debug info code generation level to the compile unit metadata	2014-02-27 01:24:56 +00:00
StructurizeCFG	StructurizeCFG: Fix verification failure with some loops.	2013-11-22 19:24:39 +00:00
TailCallElim	Fix PR7272 in -tailcallelim instead of the inliner	2014-04-21 20:48:47 +00:00
TailDup