llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-02-26 10:29:36 +00:00

History

Jingyue Wu 5733100450 Divergence analysis for GPU programs

Summary:
Some optimizations such as jump threading and loop unswitching can negatively
affect performance when applied to divergent branches. The divergence analysis
added in this patch conservatively estimates which branches in a GPU program
can diverge. This information can then help LLVM to run certain optimizations
selectively.

Test Plan: test/Analysis/DivergenceAnalysis/NVPTX/diverge.ll

Reviewers: resistor, hfinkel, eliben, meheff, jholewinski

Subscribers: broune, bjarke.roune, madhur13490, tstellarAMD, dberlin, echristo, jholewinski, llvm-commits

Differential Revision: http://reviews.llvm.org/D8576

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@234567 91177308-0d34-0410-b5e6-96231b3b80d8

2015-04-10 05:03:50 +00:00

IPA

Correctly estimate SROA savings for store operands in inline cost analysis.

2015-03-20 18:33:12 +00:00

AliasAnalysis.cpp

DataLayout is mandatory, update the API to reflect it with references.

2015-03-10 02:37:25 +00:00

AliasAnalysisCounter.cpp

Make DataLayout Non-Optional in the Module

2015-03-04 18:43:29 +00:00

AliasAnalysisEvaluator.cpp

…

AliasDebugger.cpp

Make DataLayout Non-Optional in the Module

2015-03-04 18:43:29 +00:00

AliasSetTracker.cpp

AliasSet: Simplify mergeSetIn

2014-11-19 19:36:18 +00:00

Analysis.cpp

Divergence analysis for GPU programs

2015-04-10 05:03:50 +00:00

AssumptionCache.cpp

[PM] Actually add the new pass manager support for the assumption cache.

2015-01-22 21:53:09 +00:00

BasicAliasAnalysis.cpp

[opaque pointer type] Change GetElementPtrInst::getIndexedType to take the pointee type

2015-03-30 21:41:43 +00:00

BlockFrequencyInfo.cpp

Remove superfluous .str() and replace std::string concatenation with Twine.

2015-03-27 17:51:30 +00:00

BlockFrequencyInfoImpl.cpp

Remove 4,096 loop scale limitation.

2015-04-01 17:42:27 +00:00

BranchProbabilityInfo.cpp

Purge unused includes throughout libSupport.

2015-03-23 18:07:13 +00:00

CaptureTracking.cpp

[cleanup] Re-sort all the #include lines in LLVM using

2015-01-14 11:23:27 +00:00

CFG.cpp

Standardize {pred,succ,use,user}_empty()

2015-01-13 03:46:47 +00:00

CFGPrinter.cpp

Remove superfluous .str() and replace std::string concatenation with Twine.

2015-03-27 17:51:30 +00:00

CFLAliasAnalysis.cpp

Re-sort includes with sort-includes.py and insert raw_ostream.h where it's used.

2015-03-23 19:32:43 +00:00

CGSCCPassManager.cpp

[PM] Remove the defunt CGSCC-specific debug flag.

2015-01-13 22:45:13 +00:00

CMakeLists.txt

Divergence analysis for GPU programs

2015-04-10 05:03:50 +00:00

CodeMetrics.cpp

Re-sort includes with sort-includes.py and insert raw_ostream.h where it's used.

2015-03-23 19:32:43 +00:00

ConstantFolding.cpp

[opaque pointer type] API migration for GEP constant factories

2015-04-02 18:55:32 +00:00

CostModel.cpp

[multiversion] Thread a function argument through all the callers of the

2015-02-01 12:01:35 +00:00

Delinearization.cpp

[PM] Split the LoopInfo object apart from the legacy pass, creating

2015-01-17 14:16:18 +00:00

DependenceAnalysis.cpp

Fix a memory corruption in Dependency Analysis.

2015-03-10 14:32:02 +00:00

DivergenceAnalysis.cpp

Divergence analysis for GPU programs

2015-04-10 05:03:50 +00:00

DominanceFrontier.cpp

…

DomPrinter.cpp

…

InstCount.cpp

…

InstructionSimplify.cpp

[opaque pointer type] API migration for GEP constant factories

2015-04-02 18:55:32 +00:00

Interval.cpp

…

IntervalPartition.cpp

…

IVUsers.cpp

DataLayout is mandatory, update the API to reflect it with references.

2015-03-10 02:37:25 +00:00

LazyCallGraph.cpp

Revert r225854: [PM] Move the LazyCallGraph printing functionality to

2015-01-14 00:27:45 +00:00

LazyValueInfo.cpp

[ConstantRange] Split makeICmpRegion in two.

2015-03-18 00:41:24 +00:00

LibCallAliasAnalysis.cpp

Make DataLayout Non-Optional in the Module

2015-03-04 18:43:29 +00:00

LibCallSemantics.cpp

Stop calling DwarfEHPrepare from WinEHPrepare

2015-03-12 00:36:20 +00:00

Lint.cpp

Fix doxygen comments from r232268

2015-03-16 17:49:03 +00:00

LLVMBuild.txt

Update libdeps since TLI was moved from Target to Analysis in r226078.

2015-01-15 05:21:00 +00:00

Loads.cpp

DataLayout is mandatory, update the API to reflect it with references.

2015-03-10 02:37:25 +00:00

LoopAccessAnalysis.cpp

[LoopAccesses] Allow analysis to complete in the presence of uniform stores

2015-04-08 17:48:40 +00:00

LoopInfo.cpp

Re-sort includes with sort-includes.py and insert raw_ostream.h where it's used.

2015-03-23 19:32:43 +00:00

LoopPass.cpp

Purge unused includes throughout libSupport.

2015-03-23 18:07:13 +00:00

Makefile

…

MemDepPrinter.cpp

MemDepPrinter: Fix some nits introduced in r228596

2015-02-25 23:55:00 +00:00

MemDerefPrinter.cpp

DataLayout is mandatory, update the API to reflect it with references.

2015-03-10 02:37:25 +00:00

MemoryBuiltins.cpp

DataLayout is mandatory, update the API to reflect it with references.

2015-03-10 02:37:25 +00:00

MemoryDependenceAnalysis.cpp

!invariant.load semantics with potentially clobbering calls

2015-03-24 23:54:54 +00:00

ModuleDebugInfoPrinter.cpp

Transforms: Stop using DIDescriptor::is*() and auto-casting

2015-04-06 23:27:00 +00:00

NoAliasAnalysis.cpp

Make DataLayout Non-Optional in the Module

2015-03-04 18:43:29 +00:00

PHITransAddr.cpp

[opaque pointer type] more gep API migration

2015-03-14 19:53:33 +00:00

PostDominators.cpp

…

PtrUseVisitor.cpp

Update SetVector to rely on the underlying set's insert to return a pair<iterator, bool>

2014-11-19 07:49:26 +00:00

README.txt

…

RegionInfo.cpp

[cleanup] Re-sort all the #include lines in LLVM using

2015-01-14 11:23:27 +00:00

RegionPass.cpp

Re-sort includes with sort-includes.py and insert raw_ostream.h where it's used.

2015-03-23 19:32:43 +00:00

RegionPrinter.cpp

…

ScalarEvolution.cpp

[opaque pointer type] API migration for GEP constant factories

2015-04-02 18:55:32 +00:00

ScalarEvolutionAliasAnalysis.cpp

Make DataLayout Non-Optional in the Module

2015-03-04 18:43:29 +00:00

ScalarEvolutionExpander.cpp

[opaque pointer type] More GEP API migrations in IRBuilder uses

2015-04-03 19:41:44 +00:00

ScalarEvolutionNormalization.cpp

…

ScopedNoAliasAA.cpp

Make DataLayout Non-Optional in the Module

2015-03-04 18:43:29 +00:00

SparsePropagation.cpp

…

StratifiedSets.h

Update SetVector to rely on the underlying set's insert to return a pair<iterator, bool>

2014-11-19 07:49:26 +00:00

TargetLibraryInfo.cpp

[WinEH] Run cleanup handlers when an exception is thrown

2015-03-30 22:58:10 +00:00

TargetTransformInfo.cpp

Divergence analysis for GPU programs

2015-04-10 05:03:50 +00:00

Trace.cpp

…

TypeBasedAliasAnalysis.cpp

Teach TBAA analysis to report errors on cyclic TBAA metadata rather than hanging.

2015-03-13 07:09:33 +00:00

ValueTracking.cpp

[ValueTracking] Fix PR23011.

2015-03-25 22:33:53 +00:00

README.txt

Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n), however
ScalarEvolution currently evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n) to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll,

ScalarEvolution is forming this expression:

((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) + (-1 * (trunc i64 undef to i32)))

This could be folded to

(-1 * (trunc i64 undef to i32))

//===---------------------------------------------------------------------===//