llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-01-18 13:34:04 +00:00

History

Wan Xiaofei 3cda2d3885 Change data structure to memorize computed result in ScalarEvolution

Replace std::map with SmallVector to memorize the cached result since SCEV usually belongs to little Loop/BB
Linear scan on SmallVector is faster than std::map.

Code reviewer : Andrew Trick.
Test result   : Pass Unit Test & LLVM Test Suite

401.bzip2	0.425721	0.419981	101.37%
403.gcc		24.53855	24.2667		101.12%
429.mcf		0.060847	0.059944	101.51%
433.milc	0.646009	0.636119	101.55%
444.namd	1.383928	1.370614	100.97%
445.gobmk	5.836575	5.800225	100.63%
450.soplex	1.911257	1.895963	100.81%
456.hmmer	1.039565	1.032534	100.68%
458.sjeng	0.897401	0.885567	101.34%
464.h264ref	3.645908	3.577991	101.90%
470.lbm		0.049456	0.048398	102.19%
471.omnetpp	5.638575	5.60435		100.61%
bitmnp01	0.045738	0.045291	100.99%
cjpegv2data	0.304359	0.302833	100.50%
idctrn01	0.046433	0.045763	101.46%
quake2		4.534416	4.4952		100.87%
quake		2.688566	2.659208	101.10%
xcsoar		12.42545	12.30385	100.99%
linpack		0.038739	0.03803		101.86%
matrix01	0.053564	0.0528		101.45%
nbench		0.402867	0.395803	101.78%
tblook01	0.021265	0.021015	101.19%
ttsprk01	0.066384	0.065566	101.25%

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@194459 91177308-0d34-0410-b5e6-96231b3b80d8

2013-11-12 09:40:41 +00:00

IPA

Move the old pass manager infrastructure into a legacy namespace and

2013-11-09 12:26:54 +00:00

AliasAnalysis.cpp

Reimplement isPotentiallyReachable to make nocapture deduction much stronger.

2013-07-27 01:24:00 +00:00

AliasAnalysisCounter.cpp

…

AliasAnalysisEvaluator.cpp

Support in AAEvaluator to print alias queries of loads/stores with TBAA tags.

2013-03-22 22:34:41 +00:00

AliasDebugger.cpp

…

AliasSetTracker.cpp

In AliasSetTracker, do not change the alias set to "mod/ref" when adding

2013-09-12 20:15:50 +00:00

Analysis.cpp

Remove the very substantial, largely unmaintained legacy PGO

2013-10-02 15:42:23 +00:00

BasicAliasAnalysis.cpp

Revert r193251 : Use address-taken to disambiguate global variable and indirect memops.

2013-10-27 03:08:44 +00:00

BlockFrequencyInfo.cpp

BlockFrequency: Bump up the entry frequency a bit.

2013-06-25 13:34:40 +00:00

BranchProbabilityInfo.cpp

Consider (x == -1) unlikely in BranchProbabilityInfo

2013-11-01 10:58:22 +00:00

CaptureTracking.cpp

CaptureTracking: Plug a loophole in the "too many uses" heuristic.

2013-10-03 13:24:02 +00:00

CFG.cpp

Add some constantness.

2013-08-20 23:04:15 +00:00

CFGPrinter.cpp

…

CMakeLists.txt

Remove the very substantial, largely unmaintained legacy PGO

2013-10-02 15:42:23 +00:00

CodeMetrics.cpp

Begin fleshing out an interface in TTI for modelling the costs of

2013-01-22 11:26:02 +00:00

ConstantFolding.cpp

Fix another constant folding address space place I missed.

2013-11-04 20:46:52 +00:00

CostModel.cpp

Move variable into assert to avoid unused variable warning.

2013-09-17 21:13:57 +00:00

DependenceAnalysis.cpp

Remove extraneous semicolon.

2013-08-06 16:40:40 +00:00

DominanceFrontier.cpp

…

DomPrinter.cpp

…

InstCount.cpp

…

InstructionSimplify.cpp

Teach MemoryBuiltins and InstructionSimplify that operator new never returns NULL.

2013-09-24 16:37:51 +00:00

Interval.cpp

…

IntervalPartition.cpp

…

IVUsers.cpp

…

LazyValueInfo.cpp

Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size.

2013-07-04 01:31:24 +00:00

LibCallAliasAnalysis.cpp

…

LibCallSemantics.cpp

…

Lint.cpp

Use size function instead of manually calculating it.

2013-11-10 03:18:50 +00:00

LLVMBuild.txt

…

Loads.cpp

Change GetPointerBaseWithConstantOffset's DataLayout argument from a

2013-01-31 02:00:45 +00:00

LoopInfo.cpp

Quick look-up for block in loop.

2013-10-26 03:08:02 +00:00

LoopPass.cpp

Comment: try to clarify loop iteration order.

2013-07-20 23:10:31 +00:00

Makefile

…

MemDepPrinter.cpp

…

MemoryBuiltins.cpp

fix PR17635: false positive with packed structures

2013-10-24 09:17:24 +00:00

MemoryDependenceAnalysis.cpp

Fix xemacs mode line, don't put them in .cpp files (just header files). No

2013-06-10 23:10:59 +00:00

ModuleDebugInfoPrinter.cpp

…

NoAliasAnalysis.cpp

…

PHITransAddr.cpp

…

PostDominators.cpp

…

PtrUseVisitor.cpp

…

README.txt

…

RegionInfo.cpp

Reorder headers according to lint.

2013-08-21 21:14:19 +00:00

RegionPass.cpp

…

RegionPrinter.cpp

…

ScalarEvolution.cpp

Change data structure to memorize computed result in ScalarEvolution

2013-11-12 09:40:41 +00:00

ScalarEvolutionAliasAnalysis.cpp

…

ScalarEvolutionExpander.cpp

Fix SCEVExpander: don't try to expand quadratic recurrences outside a loop.

2013-10-25 21:35:56 +00:00

ScalarEvolutionNormalization.cpp

Fix LSR: don't normalize quadratic recurrences.

2013-10-25 21:35:52 +00:00

SparsePropagation.cpp

…

TargetTransformInfo.cpp

Costmodel: Add support for horizontal vector reductions

2013-09-17 18:06:50 +00:00

Trace.cpp

…

TypeBasedAliasAnalysis.cpp

TBAA: fix PR17620.

2013-10-22 01:40:25 +00:00

ValueTracking.cpp

Remove x86_sse42_crc32_64_8 intrinsic. It has no functional difference from x86_sse42_crc32_32_8 and was not mapped to a clang builtin. I'm not even sure why this form of the instruction is even called out explicitly in the docs. Also add AutoUpgrade support to convert it into the other intrinsic with appropriate trunc and zext.

2013-10-15 05:20:47 +00:00

README.txt

Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n), however
ScalarEvolution currently evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n) to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll,

ScalarEvolution is forming this expression:

((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) + (-1 * (trunc i64 undef to i32)))

This could be folded to

(-1 * (trunc i64 undef to i32))

//===---------------------------------------------------------------------===//