llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-11-02 22:04:55 +00:00

Diego Novillo aa46024ea3 2015-05-07 17:22:06 +00:00

Fix information loss in branch probability computation.

Summary:
This addresses PR 22718. When branch weights are too large, they were
being clamped to the range [1, MaxWeightForBB]. But this clamping is
only applied to edges that go outside the range, so it distorts the
relative branch probabilities.

This patch changes the weight calculation to scale every branch so the
relative probabilities are preserved. The scaling is done differently
now. First, all the branch weights are added up, and if the sum exceeds
32 bits, it computes an integer scale to bring all the weights within
the range.

The patch fixes an existing test that had slightly wrong branch
probabilities due to the previous clamping. It now gets branch weights
scaled accordingly.

Reviewers: dexonsmith

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D9442

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@236750 91177308-0d34-0410-b5e6-96231b3b80d8
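
The scaling idea in the commit message above can be sketched in a few
lines. This is only an illustration under stated assumptions, not the
code from the patch: the function name scale_branch_weights, the choice
of "sum // UINT32_MAX + 1" as the integer scale, and the use of plain
integer division are all hypothetical.

```python
UINT32_MAX = 2**32 - 1  # assumed limit: the weight sum must fit in 32 bits


def scale_branch_weights(weights):
    """Divide every weight by one common integer so the sum fits in 32 bits.

    Hypothetical sketch: all edges are scaled by the same factor, so the
    ratios between them are kept up to integer rounding. Very small
    weights may round down to zero in this simplified version.
    """
    total = sum(weights)
    # Only scale when the sum overflows 32 bits; otherwise keep weights as-is.
    scale = total // UINT32_MAX + 1 if total > UINT32_MAX else 1
    return [w // scale for w in weights]


if __name__ == "__main__":
    print(scale_branch_weights([2**33, 2**32]))
    print(scale_branch_weights([10, 20]))
```

By contrast, clamping only the oversized weights to a fixed maximum would
map both 2**33 and 2**32 to the same value and turn a 2:1 split into
1:1, which is the kind of distortion the commit message describes.
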
IPA	[Inliner] Don't inline functions with frameescape calls	2015-04-14 20:38:14 +00:00
AliasAnalysis.cpp	Make getModRefInfo(Instruction *) not crash on certain types of instructions	2015-04-28 19:19:14 +00:00
AliasAnalysisCounter.cpp	Use 'override/final' instead of 'virtual' for overridden methods	2015-04-11 02:11:45 +00:00
AliasAnalysisEvaluator.cpp	[CallSite] Make construction from Value* (or Instruction*) explicit.	2015-04-10 14:50:08 +00:00
AliasDebugger.cpp
AliasSetTracker.cpp	[CallSite] Make construction from Value* (or Instruction*) explicit.	2015-04-10 14:50:08 +00:00
Analysis.cpp	Divergence analysis for GPU programs	2015-04-10 05:03:50 +00:00
AssumptionCache.cpp
BasicAliasAnalysis.cpp	Update BasicAliasAnalysis to understand that nothing aliases with undef values.	2015-05-05 18:10:49 +00:00
BlockFrequencyInfo.cpp	Remove superfluous .str() and replace std::string concatenation with Twine.	2015-03-27 17:51:30 +00:00
BlockFrequencyInfoImpl.cpp	Remove 4,096 loop scale limitation.	2015-04-01 17:42:27 +00:00
BranchProbabilityInfo.cpp	Fix information loss in branch probability computation.	2015-05-07 17:22:06 +00:00
CaptureTracking.cpp
CFG.cpp
CFGPrinter.cpp	Remove superfluous .str() and replace std::string concatenation with Twine.	2015-03-27 17:51:30 +00:00
CFLAliasAnalysis.cpp	Use 'override/final' instead of 'virtual' for overridden methods	2015-04-11 02:11:45 +00:00
CGSCCPassManager.cpp
CMakeLists.txt	Move IDF Calculation to a separate file, expose an interface to it.	2015-04-21 19:13:02 +00:00
CodeMetrics.cpp	Re-sort includes with sort-includes.py and insert raw_ostream.h where it's used.	2015-03-23 19:32:43 +00:00
ConstantFolding.cpp	Added support for building against Android API-9 SDK	2015-05-07 00:05:26 +00:00
CostModel.cpp
Delinearization.cpp
DependenceAnalysis.cpp
DivergenceAnalysis.cpp	Divergence analysis for GPU programs	2015-04-10 05:03:50 +00:00
DominanceFrontier.cpp
DomPrinter.cpp
InstCount.cpp
InstructionSimplify.cpp	[opaque pointer type] API migration for GEP constant factories	2015-04-02 18:55:32 +00:00
Interval.cpp
IntervalPartition.cpp
IteratedDominanceFrontier.cpp	Move IDF Calculation to a separate file, expose an interface to it.	2015-04-21 19:13:02 +00:00
IVUsers.cpp
LazyCallGraph.cpp
LazyValueInfo.cpp	[ConstantRange] Split makeICmpRegion in two.	2015-03-18 00:41:24 +00:00
LibCallAliasAnalysis.cpp
LibCallSemantics.cpp	[WinEH] Start EH preparation for 32-bit x86, it uses no arguments	2015-04-29 22:49:54 +00:00
Lint.cpp	Fix doxygen comments from r232268	2015-03-16 17:49:03 +00:00
LLVMBuild.txt
Loads.cpp
LoopAccessAnalysis.cpp	[getUnderlyingOjbects] Analyze loop PHIs further to remove false positives	2015-04-23 20:09:20 +00:00
LoopInfo.cpp	Fix -Wpessimizing-move warnings by removing std::move calls.	2015-04-30 23:07:00 +00:00
LoopPass.cpp	Purge unused includes throughout libSupport.	2015-03-23 18:07:13 +00:00
Makefile
MemDepPrinter.cpp	[CallSite] Make construction from Value* (or Instruction*) explicit.	2015-04-10 14:50:08 +00:00
MemDerefPrinter.cpp	Move Value.isDereferenceablePointer to ValueTracking [NFC]	2015-04-23 17:36:48 +00:00
MemoryBuiltins.cpp
MemoryDependenceAnalysis.cpp	Revamp PredIteratorCache interface to be cleaner.	2015-04-21 21:11:50 +00:00
ModuleDebugInfoPrinter.cpp	IR: Give 'DI' prefix to debug info metadata	2015-04-29 16:38:44 +00:00
NoAliasAnalysis.cpp
PHITransAddr.cpp	[opaque pointer type] more gep API migration	2015-03-14 19:53:33 +00:00
PostDominators.cpp
PtrUseVisitor.cpp
README.txt
RegionInfo.cpp
RegionPass.cpp	Change range-based for-loops to be -Wrange-loop-analysis clean.	2015-04-15 01:21:15 +00:00
RegionPrinter.cpp	One more -Wrange-loop-analysis cleanup.	2015-04-15 21:40:50 +00:00
ScalarEvolution.cpp	Fix a type mismatch assert in SCEV division	2015-04-22 15:06:40 +00:00
ScalarEvolutionAliasAnalysis.cpp
ScalarEvolutionExpander.cpp	[SCEV] Strengthen SCEVExpander::isHighCostExpansion.	2015-04-14 03:20:32 +00:00
ScalarEvolutionNormalization.cpp
ScopedNoAliasAA.cpp
SparsePropagation.cpp
StratifiedSets.h
TargetLibraryInfo.cpp	Populate list of vectorizable functions for Accelerate library.	2015-05-07 17:11:51 +00:00
TargetTransformInfo.cpp	[X86] Disable loop unrolling in loop vectorization pass when VF is 1.	2015-05-06 17:12:25 +00:00
Trace.cpp
TypeBasedAliasAnalysis.cpp	Teach TBAA analysis to report errors on cyclic TBAA metadata rather than hanging.	2015-03-13 07:09:33 +00:00
ValueTracking.cpp	[Statepoint] Clean up Statepoint.h: accessor names.	2015-05-06 02:36:26 +00:00

README.txt

Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n); however,
ScalarEvolution currently evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n) to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.
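
As a sanity check on the claim above, the following sketch evaluates the
chain of recurrences with binomial coefficients and compares it against
both (%n * %n) and the expanded expression. It assumes the exit value is
taken at iteration %n - 1 (inferred from the printed expression, not
stated in the text), and the helper names eval_addrec and
exit_value_as_printed are made up for illustration.

```python
from math import comb

MASK64 = (1 << 64) - 1
MASK65 = (1 << 65) - 1


def eval_addrec(coeffs, i):
    """Value of {c0,+,c1,+,c2,...} at iteration i >= 0: sum of c_k * C(i, k)."""
    return sum(c * comb(i, k) for k, c in enumerate(coeffs))


def exit_value_as_printed(n):
    """The expanded expression from the text, with i64/i65 wrap-around."""
    a = (n - 2) & MASK64            # zext i64 (-2 + %n) to i65
    b = (n - 1) & MASK64            # zext i64 (-1 + %n) to i65
    half = ((a * b) & MASK65) // 2  # i65 multiply, then /u 2
    # trunc to i64, then the remaining i64 arithmetic
    return (-2 + 2 * (half & MASK64) + 3 * n) & MASK64


if __name__ == "__main__":
    for n in (1, 2, 3, 10, 2**63 + 5):
        square = (n * n) & MASK64
        assert eval_addrec([1, 3, 2], n - 1) & MASK64 == square
        assert exit_value_as_printed(n) == square
```

Algebraically, 1 + 3*(n-1) + 2*C(n-1, 2) = -2 + 3*n + (n-1)*(n-2) = n*n.
The extra bit in i65 is only needed so that the product (n-2)*(n-1) can
be halved without losing its top bit.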

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll,
ScalarEvolution is forming this expression:

((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) + (-1 * (trunc i64 undef to i32)))

This could be folded to

(-1 * (trunc i64 undef to i32))
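
The fold works because truncation commutes with negation and addition in
modular arithmetic, so the two %arg5 terms cancel. The sketch below
checks this numerically. It models undef as one arbitrary but fixed
value u, which is an assumption made only for this illustration; the
function names are likewise hypothetical.

```python
MASK32 = (1 << 32) - 1
MASK64 = (1 << 64) - 1


def trunc32(x):
    """trunc i64 ... to i32: keep the low 32 bits."""
    return x & MASK32


def unfolded(arg5, u):
    """The expression ScalarEvolution forms, evaluated in i32."""
    neg = (-arg5) & MASK64  # (-1 * %arg5) in i64
    return (trunc32(neg) + trunc32(arg5) + (-trunc32(u))) & MASK32


def folded(u):
    """The proposed simplification: (-1 * (trunc i64 u to i32))."""
    return (-trunc32(u)) & MASK32


if __name__ == "__main__":
    for arg5 in (0, 1, 12345, 2**32, 2**63 + 7, MASK64):
        for u in (0, 1, 2**31, 2**40 + 3):
            assert unfolded(arg5, u) == folded(u)
```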

//===---------------------------------------------------------------------===//