llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-01-10 02:36:06 +00:00

History

Elena Demikhovsky b5c82c079a Fold fcmp in cases where value is provably non-negative. By Arch Robison.

This patch folds fcmp in some cases of interest in Julia. The patch adds a function CannotBeOrderedLessThanZero that returns true if a value is provably not less than zero. I.e. the function returns true if the value is provably -0, +0, positive, or a NaN. The patch extends InstructionSimplify.cpp to fold instances of fcmp where:
 - the predicate is olt or uge
 - the first operand is provably not less than zero
 - the second operand is zero
The motivation for handling these cases optimizing away domain checks for sqrt in Julia for common idioms such as sqrt(x*x+y*y)..

http://reviews.llvm.org/D6972



git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@227298 91177308-0d34-0410-b5e6-96231b3b80d8

2015-01-28 08:03:58 +00:00

IPA

[cleanup] Re-sort all the #include lines in LLVM using

2015-01-14 11:23:27 +00:00

AliasAnalysis.cpp

[PM] Separate the TargetLibraryInfo object from the immutable pass.

2015-01-15 10:41:28 +00:00

AliasAnalysisCounter.cpp

…

AliasAnalysisEvaluator.cpp

…

AliasDebugger.cpp

…

AliasSetTracker.cpp

…

Analysis.cpp

[PM] Split the LoopInfo object apart from the legacy pass, creating

2015-01-17 14:16:18 +00:00

AssumptionCache.cpp

[PM] Actually add the new pass manager support for the assumption cache.

2015-01-22 21:53:09 +00:00

BasicAliasAnalysis.cpp

[PM] Split the LoopInfo object apart from the legacy pass, creating

2015-01-17 14:16:18 +00:00

BlockFrequencyInfo.cpp

[PM] Split the LoopInfo object apart from the legacy pass, creating

2015-01-17 14:16:18 +00:00

BlockFrequencyInfoImpl.cpp

…

BranchProbabilityInfo.cpp

[PM] Split the LoopInfo object apart from the legacy pass, creating

2015-01-17 14:16:18 +00:00

CaptureTracking.cpp

[cleanup] Re-sort all the #include lines in LLVM using

2015-01-14 11:23:27 +00:00

CFG.cpp

Standardize {pred,succ,use,user}_empty()

2015-01-13 03:46:47 +00:00

CFGPrinter.cpp

…

CFLAliasAnalysis.cpp

Fix incorrect partial aliasing

2015-01-26 17:31:17 +00:00

CGSCCPassManager.cpp

[PM] Remove the defunt CGSCC-specific debug flag.

2015-01-13 22:45:13 +00:00

CMakeLists.txt

[PM] Move TargetLibraryInfo into the Analysis library.

2015-01-15 02:16:27 +00:00

CodeMetrics.cpp

…

ConstantFolding.cpp

[PM] Move TargetLibraryInfo into the Analysis library.

2015-01-15 02:16:27 +00:00

CostModel.cpp

…

Delinearization.cpp

[PM] Split the LoopInfo object apart from the legacy pass, creating

2015-01-17 14:16:18 +00:00

DependenceAnalysis.cpp

[PM] Split the LoopInfo object apart from the legacy pass, creating

2015-01-17 14:16:18 +00:00

DominanceFrontier.cpp

…

DomPrinter.cpp

…

FunctionTargetTransformInfo.cpp

…

InstCount.cpp

…

InstructionSimplify.cpp

Fold fcmp in cases where value is provably non-negative. By Arch Robison.

2015-01-28 08:03:58 +00:00

Interval.cpp

…

IntervalPartition.cpp

…

IVUsers.cpp

[PM] Split the LoopInfo object apart from the legacy pass, creating

2015-01-17 14:16:18 +00:00

JumpInstrTableInfo.cpp

…

LazyCallGraph.cpp

Revert r225854: [PM] Move the LazyCallGraph printing functionality to

2015-01-14 00:27:45 +00:00

LazyValueInfo.cpp

[PM] Separate the TargetLibraryInfo object from the immutable pass.

2015-01-15 10:41:28 +00:00

LibCallAliasAnalysis.cpp

…

LibCallSemantics.cpp

Move EH personality type classification to Analysis/LibCallSemantics.h

2015-01-28 01:17:38 +00:00

Lint.cpp

[PM] Separate the TargetLibraryInfo object from the immutable pass.

2015-01-15 10:41:28 +00:00

LLVMBuild.txt

Update libdeps since TLI was moved from Target to Analysis in r226078.

2015-01-15 05:21:00 +00:00

Loads.cpp

…

LoopInfo.cpp

[PM] Port LoopInfo to the new pass manager, adding both a LoopAnalysis

2015-01-20 10:58:50 +00:00

LoopPass.cpp

[PM] Split the LoopInfo object apart from the legacy pass, creating

2015-01-17 14:16:18 +00:00

Makefile

…

MemDepPrinter.cpp

[REFACTOR] Push logic from MemDepPrinter into getNonLocalPointerDependency

2015-01-09 00:26:45 +00:00

MemoryBuiltins.cpp

[PM] Move TargetLibraryInfo into the Analysis library.

2015-01-15 02:16:27 +00:00

MemoryDependenceAnalysis.cpp

Refine memory dependence's notion of volatile semantics

2015-01-26 18:54:27 +00:00

ModuleDebugInfoPrinter.cpp

…

NoAliasAnalysis.cpp

…

PHITransAddr.cpp

…

PostDominators.cpp

…

PtrUseVisitor.cpp

…

README.txt

…

RegionInfo.cpp

[cleanup] Re-sort all the #include lines in LLVM using

2015-01-14 11:23:27 +00:00

RegionPass.cpp

[cleanup] Re-sort all the #include lines in LLVM using

2015-01-14 11:23:27 +00:00

RegionPrinter.cpp

…

ScalarEvolution.cpp

Make ScalarEvolution less aggressive with respect to no-wrap flags.

2015-01-22 00:48:47 +00:00

ScalarEvolutionAliasAnalysis.cpp

…

ScalarEvolutionExpander.cpp

…

ScalarEvolutionNormalization.cpp

…

ScopedNoAliasAA.cpp

[cleanup] Re-sort all the #include lines in LLVM using

2015-01-14 11:23:27 +00:00

SparsePropagation.cpp

…

StratifiedSets.h

…

TargetLibraryInfo.cpp

[PM] Rework how the TargetLibraryInfo pass integrates with the new pass

2015-01-24 02:06:09 +00:00

TargetTransformInfo.cpp

Commoning of target specific load/store intrinsics in Early CSE.

2015-01-26 22:51:15 +00:00

Trace.cpp

…

TypeBasedAliasAnalysis.cpp

…

ValueTracking.cpp

Fold fcmp in cases where value is provably non-negative. By Arch Robison.

2015-01-28 08:03:58 +00:00

README.txt

Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n), however
ScalarEvolution currently evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n) to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll,

ScalarEvolution is forming this expression:

((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) + (-1 * (trunc i64 undef to i32)))

This could be folded to

(-1 * (trunc i64 undef to i32))

//===---------------------------------------------------------------------===//