Mirror of https://github.com/c64scene-ar/llvm-6502.git
commit ca323cf916:

Teach the constant folder to look through bitcast constant expressions
much more effectively when trying to constant fold a load of a constant.

Previously, we only handled bitcasts by trying to find a totally generic
byte representation of the constant and use that.  Now, we look through
the bitcast to see what constant we might fold the load into, and then
try to form a constant expression cast of the found value that would be
equivalent to loading the value.

You might wonder why on earth this actually matters.  Well, it turns out
that the Itanium ABI causes us to create a single array for a vtable
where the first elements are virtual base offsets, followed by the
virtual function pointers.  Because the array is homogeneous, the element
type is consistently i8* and we inttoptr the virtual base offsets into
the initial elements.  Then constructors bitcast these pointers to i64
pointers prior to loading them.  Boom, no more constant folding of
virtual base offsets.

This is the first fix to LLVM to address the *insane* performance Eric
Niebler discovered with Clang on his range comprehensions [1].  There is
more to come, though; this doesn't *really* fix the problem fully.

[1]: http://ericniebler.com/2014/04/27/range-comprehensions/

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@208856 91177308-0d34-0410-b5e6-96231b3b80d8
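The pattern at issue looks roughly like the hand-written IR below: a
reduced, hypothetical sketch in this tree's typed-pointer syntax, not
code taken from the commit or from Clang's actual vtable output.

  ; Itanium ABI: vbase offsets and virtual function pointers share one
  ; homogeneous i8* array, so the offset is stored through an inttoptr.
  @vtable = constant [3 x i8*] [
    i8* inttoptr (i64 -16 to i8*),          ; virtual base offset as i8*
    i8* null,                               ; offset-to-top slot
    i8* bitcast (void (i8*)* @dtor to i8*)  ; first virtual function
  ]

  declare void @dtor(i8*)

  ; A constructor reads the offset back through a bitcast to i64*.  With
  ; this change, the load constant-folds to -16 instead of surviving
  ; into the optimized IR.
  define i64 @vbase_offset() {
    %off = load i64* bitcast ([3 x i8*]* @vtable to i64*)
    ret i64 %off
  }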
Directory listing:

IPA/
AliasAnalysis.cpp
AliasAnalysisCounter.cpp
AliasAnalysisEvaluator.cpp
AliasDebugger.cpp
AliasSetTracker.cpp
Analysis.cpp
BasicAliasAnalysis.cpp
BlockFrequencyInfo.cpp
BlockFrequencyInfoImpl.cpp
BranchProbabilityInfo.cpp
CaptureTracking.cpp
CFG.cpp
CFGPrinter.cpp
CGSCCPassManager.cpp
CMakeLists.txt
CodeMetrics.cpp
ConstantFolding.cpp
CostModel.cpp
Delinearization.cpp
DependenceAnalysis.cpp
DominanceFrontier.cpp
DomPrinter.cpp
InstCount.cpp
InstructionSimplify.cpp
Interval.cpp
IntervalPartition.cpp
IVUsers.cpp
LazyCallGraph.cpp
LazyValueInfo.cpp
LibCallAliasAnalysis.cpp
LibCallSemantics.cpp
Lint.cpp
LLVMBuild.txt
Loads.cpp
LoopInfo.cpp
LoopPass.cpp
Makefile
MemDepPrinter.cpp
MemoryBuiltins.cpp
MemoryDependenceAnalysis.cpp
ModuleDebugInfoPrinter.cpp
NoAliasAnalysis.cpp
PHITransAddr.cpp
PostDominators.cpp
PtrUseVisitor.cpp
README.txt
RegionInfo.cpp
RegionPass.cpp
RegionPrinter.cpp
ScalarEvolution.cpp
ScalarEvolutionAliasAnalysis.cpp
ScalarEvolutionExpander.cpp
ScalarEvolutionNormalization.cpp
SparsePropagation.cpp
TargetTransformInfo.cpp
TypeBasedAliasAnalysis.cpp
ValueTracking.cpp
Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n) (see the
expansion after these notes); however, ScalarEvolution currently
evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n)
  to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll, ScalarEvolution is
forming this expression:

  ((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) +
   (-1 * (trunc i64 undef to i32)))

This could be folded to

  (-1 * (trunc i64 undef to i32))

since the first two operands cancel (a worked instance follows these notes).

//===---------------------------------------------------------------------===//
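As a sanity check on the first entry (using the standard binomial
expansion of a chain of recurrences, with i the iteration number counting
from 0):

  {1,+,3,+,2}<loop> at iteration i
    = 1 + 3*i + 2*(i*(i-1)/2)
    = i*i + 2*i + 1
    = (i+1)*(i+1)

So if the loop runs %n iterations (i = 0 .. %n-1), the value of %r in the
final iteration is %n * %n, which is why the simple exit value above is
correct.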
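The cancellation claimed in the second entry follows because truncation
commutes with multiplication by -1 in two's complement.  A worked instance
with an arbitrarily chosen value (not taken from the test):

  x = 0x100000007
  trunc i64 (-1 * x) to i32 = trunc 0xFFFFFFFEFFFFFFF9 = 0xFFFFFFF9
  -1 * (trunc i64 x to i32) = -1 * 0x00000007          = 0xFFFFFFF9

Hence (trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) is always
zero, leaving only the (-1 * (trunc i64 undef to i32)) term.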