llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-11-02 22:04:55 +00:00

History

Philip Reames cce3c83917 Refine memory dependence's notion of volatile semantics According to my reading of the LangRef, volatiles are only ordered with respect to other volatiles. It is entirely legal and profitable to forward unrelated loads over the volatile load. This patch implements this for GVN by refining the transition rules MemoryDependenceAnalysis uses when encountering a volatile. The added test cases show where the extra flexibility is profitable for local dependence optimizations. I have a related change (227110) which will extend this to non-local dependence (i.e. PRE), but that's essentially orthogonal to the semantic change in this patch. I have tested the two together and can confirm that PRE works over a volatile load with both changes. I will be submitting a PRE w/volatiles test case seperately in the near future. Differential Revision: http://reviews.llvm.org/D6901 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@227112 91177308-0d34-0410-b5e6-96231b3b80d8		2015-01-26 18:54:27 +00:00
..
IPA	[cleanup] Re-sort all the #include lines in LLVM using	2015-01-14 11:23:27 +00:00
AliasAnalysis.cpp	[PM] Separate the TargetLibraryInfo object from the immutable pass.	2015-01-15 10:41:28 +00:00
AliasAnalysisCounter.cpp
AliasAnalysisEvaluator.cpp
AliasDebugger.cpp
AliasSetTracker.cpp
Analysis.cpp	[PM] Split the LoopInfo object apart from the legacy pass, creating	2015-01-17 14:16:18 +00:00
AssumptionCache.cpp	[PM] Actually add the new pass manager support for the assumption cache.	2015-01-22 21:53:09 +00:00
BasicAliasAnalysis.cpp	[PM] Split the LoopInfo object apart from the legacy pass, creating	2015-01-17 14:16:18 +00:00
BlockFrequencyInfo.cpp	[PM] Split the LoopInfo object apart from the legacy pass, creating	2015-01-17 14:16:18 +00:00
BlockFrequencyInfoImpl.cpp
BranchProbabilityInfo.cpp	[PM] Split the LoopInfo object apart from the legacy pass, creating	2015-01-17 14:16:18 +00:00
CaptureTracking.cpp	[cleanup] Re-sort all the #include lines in LLVM using	2015-01-14 11:23:27 +00:00
CFG.cpp	Standardize {pred,succ,use,user}_empty()	2015-01-13 03:46:47 +00:00
CFGPrinter.cpp
CFLAliasAnalysis.cpp	Fix incorrect partial aliasing	2015-01-26 17:31:17 +00:00
CGSCCPassManager.cpp	[PM] Remove the defunt CGSCC-specific debug flag.	2015-01-13 22:45:13 +00:00
CMakeLists.txt	[PM] Move TargetLibraryInfo into the Analysis library.	2015-01-15 02:16:27 +00:00
CodeMetrics.cpp	[PM] Split the AssumptionTracker immutable pass into two separate APIs:	2015-01-04 12:03:27 +00:00
ConstantFolding.cpp	[PM] Move TargetLibraryInfo into the Analysis library.	2015-01-15 02:16:27 +00:00
CostModel.cpp
Delinearization.cpp	[PM] Split the LoopInfo object apart from the legacy pass, creating	2015-01-17 14:16:18 +00:00
DependenceAnalysis.cpp	[PM] Split the LoopInfo object apart from the legacy pass, creating	2015-01-17 14:16:18 +00:00
DominanceFrontier.cpp
DomPrinter.cpp
FunctionTargetTransformInfo.cpp
InstCount.cpp
InstructionSimplify.cpp	[PM] Split the AssumptionTracker immutable pass into two separate APIs:	2015-01-04 12:03:27 +00:00
Interval.cpp
IntervalPartition.cpp
IVUsers.cpp	[PM] Split the LoopInfo object apart from the legacy pass, creating	2015-01-17 14:16:18 +00:00
JumpInstrTableInfo.cpp
LazyCallGraph.cpp	Revert r225854: [PM] Move the LazyCallGraph printing functionality to	2015-01-14 00:27:45 +00:00
LazyValueInfo.cpp	[PM] Separate the TargetLibraryInfo object from the immutable pass.	2015-01-15 10:41:28 +00:00
LibCallAliasAnalysis.cpp
LibCallSemantics.cpp
Lint.cpp	[PM] Separate the TargetLibraryInfo object from the immutable pass.	2015-01-15 10:41:28 +00:00
LLVMBuild.txt	Update libdeps since TLI was moved from Target to Analysis in r226078.	2015-01-15 05:21:00 +00:00
Loads.cpp
LoopInfo.cpp	[PM] Port LoopInfo to the new pass manager, adding both a LoopAnalysis	2015-01-20 10:58:50 +00:00
LoopPass.cpp	[PM] Split the LoopInfo object apart from the legacy pass, creating	2015-01-17 14:16:18 +00:00
Makefile
MemDepPrinter.cpp	[REFACTOR] Push logic from MemDepPrinter into getNonLocalPointerDependency	2015-01-09 00:26:45 +00:00
MemoryBuiltins.cpp	[PM] Move TargetLibraryInfo into the Analysis library.	2015-01-15 02:16:27 +00:00
MemoryDependenceAnalysis.cpp	Refine memory dependence's notion of volatile semantics	2015-01-26 18:54:27 +00:00
ModuleDebugInfoPrinter.cpp
NoAliasAnalysis.cpp
PHITransAddr.cpp	[PM] Split the AssumptionTracker immutable pass into two separate APIs:	2015-01-04 12:03:27 +00:00
PostDominators.cpp
PtrUseVisitor.cpp
README.txt
RegionInfo.cpp	[cleanup] Re-sort all the #include lines in LLVM using	2015-01-14 11:23:27 +00:00
RegionPass.cpp	[cleanup] Re-sort all the #include lines in LLVM using	2015-01-14 11:23:27 +00:00
RegionPrinter.cpp
ScalarEvolution.cpp	Make ScalarEvolution less aggressive with respect to no-wrap flags.	2015-01-22 00:48:47 +00:00
ScalarEvolutionAliasAnalysis.cpp
ScalarEvolutionExpander.cpp	[PM] Split the AssumptionTracker immutable pass into two separate APIs:	2015-01-04 12:03:27 +00:00
ScalarEvolutionNormalization.cpp
ScopedNoAliasAA.cpp	[cleanup] Re-sort all the #include lines in LLVM using	2015-01-14 11:23:27 +00:00
SparsePropagation.cpp
StratifiedSets.h
TargetLibraryInfo.cpp	[PM] Rework how the TargetLibraryInfo pass integrates with the new pass	2015-01-24 02:06:09 +00:00
TargetTransformInfo.cpp	Implemented cost model for masked load/store operations.	2015-01-25 08:44:46 +00:00
Trace.cpp
TypeBasedAliasAnalysis.cpp
ValueTracking.cpp	[cleanup] Re-sort all the #include lines in LLVM using	2015-01-14 11:23:27 +00:00

README.txt

Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n), however
ScalarEvolution currently evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n) to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll,

ScalarEvolution is forming this expression:

((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) + (-1 * (trunc i64 undef to i32)))

This could be folded to

(-1 * (trunc i64 undef to i32))

//===---------------------------------------------------------------------===//