llvm-6502/Transforms at d761cc1dfa7bb55c0c995dd409025147a51c1258 - llvm-6502 - Applefritter: Git


mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-02-22 13:29:44 +00:00


Adam Nemet d761cc1dfa [LoopStrengthReduce] Don't trim formula that uses a subset of required registers

Consider this use from the new testcase:

  LSR Use: Kind=ICmpZero, Offsets={0}, widest fixup type: i32
    reg({1000,+,-1}<nw><%for.body>)
    -3003 + reg({3,+,3}<nw><%for.body>)
    -1001 + reg({1,+,1}<nuw><nsw><%for.body>)
    -1000 + reg({0,+,1}<nw><%for.body>)
    -3000 + reg({0,+,3}<nuw><%for.body>)
    reg({-1000,+,1}<nw><%for.body>)
    reg({-3000,+,3}<nsw><%for.body>)
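In the dump above, `{start,+,step}` is a SCEV add recurrence whose value on loop iteration `i` is `start + step * i`. The sketch below is only an illustration, not LLVM code: it models each formula as a constant offset plus one such recurrence, uses unbounded Python integers (so the `<nw>`/`<nuw>`/`<nsw>` wrap flags and the i32 width are ignored), and checks on which iteration each formula first reaches zero. The helper names `addrec` and `value` are hypothetical.

```python
def addrec(start, step, i):
    """Value of the add recurrence {start,+,step} on iteration i."""
    return start + step * i

# (constant offset, (start, step)) for each formula listed in the LSR use.
formulas = [
    (0,     (1000, -1)),
    (-3003, (3, 3)),
    (-1001, (1, 1)),
    (-1000, (0, 1)),
    (-3000, (0, 3)),
    (0,     (-1000, 1)),
    (0,     (-3000, 3)),
]

def value(formula, i):
    """Evaluate offset + reg({start,+,step}) on iteration i."""
    offset, (start, step) = formula
    return offset + addrec(start, step, i)

# First iteration on which each formula compares equal to zero.
zero_iters = [
    next(i for i in range(2000) if value(f, i) == 0)
    for f in formulas
]
print(zero_iters)
```

Under these assumptions every formula first hits zero on the same iteration, which is why they are interchangeable candidates for the same ICmpZero use and differ only in which register they need.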

This is the last use we consider for a solution in SolveRecurse, so CurRegs is
a large set.  (CurRegs is the set of registers that are needed by the
previously visited uses in the in-progress solution.)

ReqRegs is {
  {3,+,3}<nw><%for.body>,
  {1,+,1}<nuw><nsw><%for.body>
}

This is the intersection of the regs used by any of the formulas for the
current use and CurRegs.

Now, the code requires a formula to contain *all* these regs (the comment is
simply wrong), otherwise the formula is immediately disqualified.  Obviously,
no formula for this use contains two regs so they will all get disqualified.

The fix modifies the check to allow the formula in this case.  The idea is
that neither of these formulae introduces any new registers, which, as far as
I understand, is the point of this early pruning.

In terms of set arithmetic, we now allow formulas whose used regs are a subset
of the required regs, not just the other way around.
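The difference between the two checks can be sketched with plain sets. This is a minimal model of the rule as the message describes it, not the actual SolveRecurse code: registers are represented as strings, each formula as the set of registers it uses, and the names `old_check` and `new_check` are hypothetical.

```python
def old_check(formula_regs, req_regs):
    """Old behaviour: the formula must contain *all* required registers."""
    return req_regs <= formula_regs

def new_check(formula_regs, req_regs):
    """New behaviour: also accept a formula whose registers are a subset
    of the required registers, since it introduces no new register."""
    return req_regs <= formula_regs or formula_regs <= req_regs

# ReqRegs from the example in the commit message.
req_regs = {"{3,+,3}", "{1,+,1}"}

# Registers used by each of the seven formulas of the ICmpZero use.
formulas = [
    {"{1000,+,-1}"},
    {"{3,+,3}"},
    {"{1,+,1}"},
    {"{0,+,1}"},
    {"{0,+,3}"},
    {"{-1000,+,1}"},
    {"{-3000,+,3}"},
]

old_kept = [f for f in formulas if old_check(f, req_regs)]
new_kept = [f for f in formulas if new_check(f, req_regs)]

print("old:", old_kept)
print("new:", new_kept)
```

In this model the old check keeps nothing, because no single-register formula can contain both required registers, while the new check keeps exactly the two formulas that reuse `{3,+,3}` or `{1,+,1}`.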

There are a few more loops in the test-suite that are now successfully LSRed.
I have benchmarked those and found very minimal change.

Fixes <rdar://problem/13965777>

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@207271 91177308-0d34-0410-b5e6-96231b3b80d8

2014-04-25 21:02:21 +00:00


[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

AddDiscriminators

Fix bug 19437 - Only add discriminators for DWARF 4 and above.

2014-04-17 22:33:50 +00:00

ArgumentPromotion

Update optimization passes to handle inalloca arguments

2014-01-28 02:38:36 +00:00

AtomicExpandLoadLinked/ARM

Atomics: promote ARM's IR-based atomics pass to CodeGen.

2014-04-17 18:22:47 +00:00

Allow vectorization of bit intrinsics in BB Vectorizer.

2014-04-25 03:33:48 +00:00

Re-commit: Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call

2014-02-13 14:44:26 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

CodeGenPrep: sink extends of illegal types into use block.

2014-03-13 13:36:25 +00:00

ConstantHoisting

[Constant Hoisting] Materialize the constant before the cloned cast instruction.

2014-04-22 18:06:58 +00:00

Remove the linker_private and linker_private_weak linkages.

2014-03-13 23:18:37 +00:00

Teach ConstantFolding about pointer address spaces

2013-08-20 21:20:04 +00:00

CorrelatedValuePropagation

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

IR: Change inalloca's grammar a bit

2014-03-09 06:41:58 +00:00

DeadStoreElimination

Update optimization passes to handle inalloca arguments

2014-01-28 02:38:36 +00:00

Use right pointer type in DebugIR

2013-09-27 22:26:25 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

Update optimization passes to handle inalloca arguments

2014-01-28 02:38:36 +00:00

Debug Info: update testing cases to specify the debug info version number.

2013-11-22 21:49:45 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

ARM64: initial backend import

2014-03-29 10:18:08 +00:00

Prevent alias from pointing to weak aliases.

2014-03-27 15:26:56 +00:00

Revert "GVN: merge overflow intrinsics with non-overflow instructions."

2014-03-28 14:42:34 +00:00

[LPM] Fix PR18642, a pretty nasty bug in IndVars that "never mattered"

2014-01-29 04:40:19 +00:00

[inline cold threshold] Command line argument for inline threshold will

2014-04-25 17:34:55 +00:00

[InstCombine][x86] Constant fold psll intrinsics.

2014-04-24 00:58:18 +00:00

InstSimplify: Make shift, select and GEP simplifications vector-aware.

2014-01-24 17:09:53 +00:00

Correct word hyphenations

2013-12-05 05:44:44 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

Don't eliminate a partially redundant load if it's in a landing pad.

2013-10-21 04:09:17 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

[LPM] Switch LICM to actively use LCSSA in addition to preserving it.

2014-02-11 12:52:27 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

Debug Info: update testing cases to specify the debug info version number.

2013-11-23 01:16:29 +00:00

Fix loop rerolling pass failure with non-constant loop lower bound

2014-01-03 17:20:01 +00:00

[LPM] Fix PR18643, another scary place where loop transforms failed to

2014-01-29 13:16:53 +00:00

[LPM] Switch LICM to actively use LCSSA in addition to preserving it.

2014-02-11 12:52:27 +00:00

LoopStrengthReduce

[LoopStrengthReduce] Don't trim formula that uses a subset of required registers

2014-04-25 21:02:21 +00:00

Implement X86TTI::getUnrollingPreferences

2014-04-01 18:50:34 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

[CLNUP] Test commit. Remove newline.

2014-04-24 08:42:58 +00:00

IR: add a second ordering operand to cmpxchg for failure

2014-03-11 10:48:52 +00:00

LowerExpectIntrinsic

Lower llvm.expect intrinsic correctly for i1

2014-02-02 22:43:55 +00:00

Remove LowerInvoke's obsolete "-enable-correct-eh-support" option

2014-03-20 19:54:47 +00:00

Revert patches to add case-range support for PR1255.

2013-09-09 19:14:35 +00:00

Debug Info: update testing cases to specify the debug info version number.

2013-11-22 21:49:45 +00:00

Treat lifetime.start'd memory like we treat freshly alloca'd memory. Patch by Björn Steinbrink!

2014-03-26 23:45:15 +00:00

PR17925 bugfix.

2013-11-26 16:11:03 +00:00

Reject alias to undefined symbols in the verifier.

2014-03-12 20:15:49 +00:00

Fix use_iterator crash in ObjCArc from r203364

2014-03-18 22:32:43 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

Tolerate unmangled names in sample profiles.

2014-03-18 12:03:12 +00:00

Fix Scalarizer insertion point when replacing PHIs with insertelements

2013-12-23 14:51:56 +00:00

Fix PR18800. llvm intrinsic memcpy takes 5 arguments void @llvm.memcpy.p0i8.p0i8.i32(i8* <dest>, i8* <src>, i32 <len>, i32 <align>, i1 <isvolatile>). The test case incorrectly uses the old format resulting in isVolatile function in MemIntrinsic to crash during SROA transformation. Modified the test case to use correct signature of memcpy and memset.

2014-03-13 04:50:29 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00

Allow switch-to-lookup table for tables with holes by adding bitmask check

2014-03-12 18:35:40 +00:00

Sink: Don't sink static allocas from the entry block

2014-03-21 15:51:51 +00:00

Reapply "SLPVectorizer: Ignore users that are insertelements we can reschedule them"

2014-04-10 13:41:35 +00:00

[SROA] Use the correct index integer size in GEPs through non-default

2014-02-26 10:08:16 +00:00

Add a debug info code generation level to the compile unit metadata

2014-02-27 01:24:56 +00:00

StructurizeCFG: Fix verification failure with some loops.

2013-11-22 19:24:39 +00:00

Fix PR7272 in -tailcallelim instead of the inliner

2014-04-21 20:48:47 +00:00

[tests] Cleanup initialization of test suffixes.

2013-08-16 00:37:11 +00:00