llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-11-03 14:08:57 +00:00

History

Benjamin Kramer b6fdd022b7 PR13095: Give an inline cost bonus to functions using byval arguments. We give a bonus for every argument because the argument setup is not needed anymore when the function is inlined. With this patch we interpret byval arguments as a compact representation of many arguments. The byval argument setup is implemented in the backend as an inline memcpy, so to model the cost as accurately as possible we take the number of pointer-sized elements in the byval argument and give a bonus of 2 instructions for every one of those. The bonus is capped at 8 elements, which is the number of stores at which the x86 backend switches from an expanded inline memcpy to a real memcpy. It would be better to use the real memcpy threshold from the backend, but it's not available via TargetData. This change brings the performance of c-ray in line with gcc 4.7. The included test case tries to reproduce the c-ray problem to catch regressions for this benchmark early, its performance is dominated by the inline decision of a specific call. This only has a small impact on most code, more on x86 and arm than on x86_64 due to the way the ABI works. When building LLVM for x86 it gives a small inline cost boost to virtually any function using StringRef or STL allocators, but only a 0.01% increase in overall binary size. The size of gcc compiled by clang actually shrunk by a couple bytes with this patch applied, but not significantly. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@161413 91177308-0d34-0410-b5e6-96231b3b80d8		2012-08-07 11:13:19 +00:00
..
IPA	RefreshCallGraph: ignore 'invoke intrinsic'. IntrinsicInst doesnt not recognize invoke, and shouldnt at this point, since the rest of LLVM codebase doesnt expect invoke of intrinsics	2012-06-29 17:49:32 +00:00
AliasAnalysis.cpp
AliasAnalysisCounter.cpp
AliasAnalysisEvaluator.cpp
AliasDebugger.cpp
AliasSetTracker.cpp	Reduce use list thrashing by using DenseMap's find_as for maps with ValueHandle keys.	2012-06-30 22:37:15 +00:00
Analysis.cpp
BasicAliasAnalysis.cpp	refactor the MemoryBuiltin analysis:	2012-06-21 15:45:28 +00:00
BlockFrequencyInfo.cpp
BranchProbabilityInfo.cpp
CaptureTracking.cpp
CFGPrinter.cpp
CMakeLists.txt	Update the CMake files.	2012-06-29 09:01:47 +00:00
CodeMetrics.cpp
ConstantFolding.cpp	When constant folding GEP expressions, keep the address space information of pointers.	2012-07-30 07:25:20 +00:00
DbgInfoPrinter.cpp	Move lib/Analysis/DebugInfo.cpp to lib/VMCore/DebugInfo.cpp and	2012-06-28 00:05:13 +00:00
DominanceFrontier.cpp
DomPrinter.cpp
InlineCost.cpp	PR13095: Give an inline cost bonus to functions using byval arguments.	2012-08-07 11:13:19 +00:00
InstCount.cpp
InstructionSimplify.cpp	Fix PR13412, a nasty miscompile due to the interleaved	2012-08-07 10:59:59 +00:00
Interval.cpp
IntervalPartition.cpp
IVUsers.cpp	IVUsers should only generate SCEV's for values that are safe to speculate.	2012-07-13 23:33:05 +00:00
LazyValueInfo.cpp	make LazyValueInfo analyze the default case of switch statements (we know that in the default branch the value cannot be any of the switch cases)	2012-06-28 16:13:37 +00:00
LibCallAliasAnalysis.cpp
LibCallSemantics.cpp
Lint.cpp
LLVMBuild.txt
Loads.cpp
LoopDependenceAnalysis.cpp
LoopInfo.cpp	Enable the new LoopInfo algorithm by default.	2012-06-26 04:11:38 +00:00
LoopPass.cpp	Enable the new LoopInfo algorithm by default.	2012-06-26 04:11:38 +00:00
Makefile
MemDepPrinter.cpp
MemoryBuiltins.cpp	fix PR13390: do not loop forever with self-referencing self instructions	2012-07-27 18:21:15 +00:00
MemoryDependenceAnalysis.cpp	refactor the MemoryBuiltin analysis:	2012-06-21 15:45:28 +00:00
ModuleDebugInfoPrinter.cpp	Move lib/Analysis/DebugInfo.cpp to lib/VMCore/DebugInfo.cpp and	2012-06-28 00:05:13 +00:00
NoAliasAnalysis.cpp
PathNumbering.cpp	Move llvm/Support/TypeBuilder.h -> llvm/TypeBuilder.h. This completes	2012-07-15 23:45:24 +00:00
PathProfileInfo.cpp
PathProfileVerifier.cpp
PHITransAddr.cpp
PostDominators.cpp
ProfileEstimatorPass.cpp
ProfileInfo.cpp
ProfileInfoLoader.cpp	Remove unused private member variables uncovered by the recent changes to clang's -Wunused-private-field.	2012-07-20 22:05:57 +00:00
ProfileInfoLoaderPass.cpp
ProfileVerifierPass.cpp
README.txt
RegionInfo.cpp	Implement the block_iterator of Region based on df_iterator.	2012-08-02 14:20:02 +00:00
RegionPass.cpp
RegionPrinter.cpp
ScalarEvolution.cpp	Stay rational; don't assert trying to take the square root of a negative value.	2012-08-01 09:14:36 +00:00
ScalarEvolutionAliasAnalysis.cpp
ScalarEvolutionExpander.cpp	Fix a typo (the the => the)	2012-07-23 08:51:15 +00:00
ScalarEvolutionNormalization.cpp
SparsePropagation.cpp
Trace.cpp
TypeBasedAliasAnalysis.cpp
ValueTracking.cpp	PHINode::hasConstantValue(): return undef if the PHI is fully recursive.	2012-07-03 21:15:40 +00:00

README.txt

Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n), however
ScalarEvolution currently evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n) to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll,

ScalarEvolution is forming this expression:

((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) + (-1 * (trunc i64 undef to i32)))

This could be folded to

(-1 * (trunc i64 undef to i32))

//===---------------------------------------------------------------------===//