llvm-6502/include/llvm/Analysis/CodeMetrics.h

//===- CodeMetrics.h - Code cost measurements -------------------*- C++ -*-===//
//
//                     The LLVM Compiler Infrastructure
//
// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.
//
//===----------------------------------------------------------------------===//
//
// This file implements various weight measurements for code, helping
// the Inliner and other passes decide whether to duplicate that code.
//
//===----------------------------------------------------------------------===//

#ifndef LLVM_ANALYSIS_CODEMETRICS_H
#define LLVM_ANALYSIS_CODEMETRICS_H

#include "llvm/ADT/DenseMap.h"
#include "llvm/IR/CallSite.h"

namespace llvm {
class BasicBlock;
class Function;
class Instruction;
class DataLayout;
class TargetTransformInfo;
class Value;

/// \brief Check whether a call will lower to something small.
///
/// This checks whether this callsite will lower to something
/// significantly cheaper than a traditional call, often a single
/// instruction. Note that if isInstructionFree(CS.getInstruction()) would
/// return true, so will this function.
bool callIsSmall(ImmutableCallSite CS);

/// \brief Utility to calculate the size and a few similar metrics for a set
/// of basic blocks.
struct CodeMetrics {
  /// \brief True if this function contains a call to setjmp or other functions
  /// with attribute "returns twice" without having the attribute itself.
  bool exposesReturnsTwice;

  /// \brief True if this function calls itself.
  bool isRecursive;

  /// \brief True if this function cannot be duplicated.
  ///
  /// True if this function contains one or more indirect branches, or it contains
  /// one or more 'noduplicate' instructions.
  bool notDuplicatable;

  /// \brief True if this function calls alloca (in the C sense).
  bool usesDynamicAlloca;

  /// \brief Number of instructions in the analyzed blocks.
  unsigned NumInsts;

  /// \brief Number of analyzed blocks.
  unsigned NumBlocks;

  /// \brief Keeps track of basic block code size estimates.
  DenseMap<const BasicBlock *, unsigned> NumBBInsts;

  /// \brief Keeps track of the number of calls to 'big' functions.
  unsigned NumCalls;

  /// \brief The number of calls to internal functions with a single caller.
  ///
  /// These are likely targets for future inlining, typically exposed by
  /// interleaved devirtualization.
  unsigned NumInlineCandidates;

  /// \brief How many instructions produce vector values.
  ///
  /// The inliner is more aggressive with inlining vector kernels.
  unsigned NumVectorInsts;

  /// \brief How many 'ret' instructions the blocks contain.
  unsigned NumRets;

  CodeMetrics()
      : exposesReturnsTwice(false), isRecursive(false), notDuplicatable(false),
        usesDynamicAlloca(false), NumInsts(0), NumBlocks(0), NumCalls(0),
        NumInlineCandidates(0), NumVectorInsts(0), NumRets(0) {}

  /// \brief Add information about a block to the current state.
  void analyzeBasicBlock(const BasicBlock *BB, const TargetTransformInfo &TTI);
};

} // end namespace llvm

#endif
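
The header only declares analyzeBasicBlock, so the way the counters accumulate across blocks is not visible here. The following is a minimal standalone sketch of that accumulation pattern, not the LLVM implementation: ToyInst, ToyBlock, and ToyMetrics are hypothetical stand-ins for Instruction, BasicBlock, and CodeMetrics, and the IsFree flag stands in for whatever TargetTransformInfo would report about an instruction. The rule that free instructions are skipped before any counter is touched is an assumption made for this sketch.

```cpp
#include <cassert>
#include <map>
#include <vector>

// Hypothetical stand-in for llvm::Instruction: just the properties the
// sketch needs, with IsFree modelling a target's "costs nothing" answer.
struct ToyInst {
  bool IsFree;
  bool IsCall;
  bool IsRet;
  bool ProducesVector;
};

// Hypothetical stand-in for llvm::BasicBlock.
struct ToyBlock {
  std::vector<ToyInst> Insts;
};

// Mirrors a subset of the CodeMetrics counters declared above.
struct ToyMetrics {
  unsigned NumInsts = 0;
  unsigned NumBlocks = 0;
  unsigned NumCalls = 0;
  unsigned NumVectorInsts = 0;
  unsigned NumRets = 0;
  std::map<const ToyBlock *, unsigned> NumBBInsts;

  // Adds one block's contribution to the running totals.
  void analyzeBasicBlock(const ToyBlock *BB) {
    ++NumBlocks;
    const unsigned NumInstsBefore = NumInsts;
    for (const ToyInst &I : BB->Insts) {
      if (I.IsFree)
        continue; // Assumption: free instructions affect no counter.
      if (I.IsCall)
        ++NumCalls;
      if (I.ProducesVector)
        ++NumVectorInsts;
      if (I.IsRet)
        ++NumRets;
      ++NumInsts;
    }
    // Per-block size is the delta this block added to the total.
    NumBBInsts[BB] = NumInsts - NumInstsBefore;
  }
};
```

A caller would default-construct one ToyMetrics, call analyzeBasicBlock once per block of interest, and then read the totals. Because the state is cumulative, the same object can cover a whole function or only a region of it, such as the blocks of one loop.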
Pulled CodeMetrics out of InlineCost.h and made it a bit more general, so it can be reused from PartialSpecializationCost git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@105725 91177308-0d34-0410-b5e6-96231b3b80d8 2010-06-09 15:11:37 +00:00

make this file properly self contained. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@123059 91177308-0d34-0410-b5e6-96231b3b80d8 2011-01-08 08:19:49 +00:00

remove the partial specialization pass. It is unmaintained and has bugs. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@123554 91177308-0d34-0410-b5e6-96231b3b80d8 2011-01-16 00:27:10 +00:00

Inlining and unrolling heuristics should be aware of free truncs. We want heuristics to be based on accurate data, but more importantly we don't want llvm to behave randomly. A benign trunc inserted by an upstream pass should not cause wild swings in optimization level. See PR11034. It's a general problem with threshold-based heuristics, but we can make it less bad. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140919 91177308-0d34-0410-b5e6-96231b3b80d8 2011-10-01 01:39:05 +00:00

Pull the implementation of the code metrics out of the inline cost analysis implementation. The header was already separated. Also clean up all the comments in the header to follow a nice modern doxygen form. There is still plenty of cruft here, but some of that will fall out in subsequent refactorings and this was an easy step in the right direction. No functionality changed here. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@152898 91177308-0d34-0410-b5e6-96231b3b80d8 2012-03-16 05:51:52 +00:00

Initial commit for the rewrite of the inline cost analysis to operate on a per-callsite walk of the called function's instructions, in breadth-first order over the potentially reachable set of basic blocks. This is a major shift in how inline cost analysis works to improve the accuracy and rationality of inlining decisions. A brief outline of the algorithm this moves to:
- Build a simplification mapping based on the callsite arguments to the function arguments.
- Push the entry block onto a worklist of potentially-live basic blocks.
- Pop the first block off of the front of the worklist (for breadth-first ordering) and walk its instructions using a custom InstVisitor.
- For each instruction's operands, re-map them based on the simplification mappings available for the given callsite.
- Compute any simplification possible of the instruction after re-mapping, and store that back into the simplification mapping.
- Compute any bonuses, costs, or other impacts of the instruction on the cost metric.
- When the terminator is reached, replace any conditional value in the terminator with any simplifications from the mapping we have, and add any successors which are not proven to be dead from these simplifications to the worklist.
- Pop the next block off of the front of the worklist, and repeat.
- As soon as the cost of inlining exceeds the threshold for the callsite, stop analyzing the function in order to bound cost.
The primary goal of this algorithm is to perfectly handle dead code paths. We do not want any code in trivially dead code paths to impact inlining decisions. The previous metric was extremely flawed here, and would always subtract the average cost of two successors of a conditional branch when it was proven to become an unconditional branch at the callsite. There was no handling of wildly different costs between the two successors, which would cause inlining when the path actually taken was too large, and no inlining when the path actually taken was trivially simple. There was also no handling of the code path, only the immediate successors. These problems vanish completely now. See the added regression tests for the shiny new features -- we skip recursive function calls, SROA-killing instructions, and high cost complex CFG structures when dead at the callsite being analyzed.
Switching to this algorithm required refactoring the inline cost interface to accept the actual threshold rather than simply returning a single cost. The resulting interface is pretty bad, and I'm planning to do lots of interface cleanup after this patch. Several other refactorings fell out of this, but I've tried to minimize them for this patch. =/ There is still more cleanup that can be done here. Please point out anything that you see in review.
I've worked really hard to try to mirror at least the spirit of all of the previous heuristics in the new model. It's not clear that they are all correct any more, but I wanted to minimize the change in this single patch, it's already a bit ridiculous. One heuristic that is not yet mirrored is to allow inlining of functions with a dynamic alloca if the caller has a dynamic alloca. I will add this back, but I think the most reasonable way requires changes to the inliner itself rather than just the cost metric, and so I've deferred this for a subsequent patch. The test case is XFAIL-ed until then.
As mentioned in the review mail, this seems to make Clang run about 1% to 2% faster in -O0, but makes its binary size grow by just under 4%. I've looked into the 4% growth, and it can be fixed, but requires changes to other parts of the inliner. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@153812 91177308-0d34-0410-b5e6-96231b3b80d8 2012-03-31 12:42:41 +00:00

Fix indentation and formatting. This change brought to you by clang-format. =] git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@173034 91177308-0d34-0410-b5e6-96231b3b80d8 2013-01-21 12:14:42 +00:00

Switch CodeMetrics itself over to use TTI to determine if an instruction is free. The whole CodeMetrics API should probably be reworked more, but this is enough to allow deleting the duplicate code there for computing whether an instruction is free. All of the passes using this have been updated to pull in TTI and hand it to the CodeMetrics stuff. Further, a dead CodeMetrics API (analyzeFunction) is nuked for lack of users. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@173036 91177308-0d34-0410-b5e6-96231b3b80d8 2013-01-21 13:04:33 +00:00

[Modules] Move CallSite into the IR library where it belongs. It is abstracting between a CallInst and an InvokeInst, both of which are IR concepts. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@202816 91177308-0d34-0410-b5e6-96231b3b80d8 2014-03-04 11:01:28 +00:00