mirror of
https://github.com/c64scene-ar/llvm-6502.git
synced 2024-12-13 04:30:23 +00:00
7d4d116067
Summary:
Make Scalar Evolution able to propagate NSW and NUW flags from instructions to SCEVs in some cases. This is based on reasoning about when poison from instructions with these flags would trigger undefined behavior. This gives a 13% speed-up on some Eigen3-based Google-internal microbenchmarks for NVPTX.

There does not seem to be clear agreement about when poison should be considered to propagate through instructions. In this analysis, poison propagates only in cases where that should be uncontroversial.

This change makes LSR able to create induction variables for expressions like &ptr[i + offset] for loops like this:

    for (int i = 0; i < limit; ++i) {
      sum += ptr[i + offset];
    }

Here ptr is a 64 bit pointer and offset is a 32 bit integer. For NVPTX, LSR currently creates an induction variable for i + offset instead, which is not as fast. Improving this situation is what brings the 13% speed-up on some Eigen3-based Google-internal microbenchmarks for NVPTX. There are more details in this discussion on llvmdev.

June: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-June/thread.html#87234

July: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-July/thread.html#87392

Patch by Bjarke Roune

Reviewers: eliben, atrick, sanjoy

Subscribers: majnemer, hfinkel, jingyue, meheff, llvm-commits

Differential Revision: http://reviews.llvm.org/D11212

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@243460 91177308-0d34-0410-b5e6-96231b3b80d8
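The difference between the two induction-variable choices described in the commit message can be sketched at the source level. This is a hand-written illustration, not compiler output: the function names `sum_indexed` and `sum_pointer_iv` and the `int` element type are assumptions made for the example. The first function is the loop from the commit message. The second is roughly the shape LSR can aim for once it knows that `i + offset` does not wrap, because the sign-extended index then splits into `sext(i) + sext(offset)` and the address advances by one element per iteration.

```cpp
#include <cstdint>

// Loop as written in the commit message. `i + offset` is a 32-bit signed add;
// signed overflow is undefined behavior in C++, which is why a front end can
// mark the corresponding IR add as nsw.
int sum_indexed(const int *ptr, int offset, int limit) {
  int sum = 0;
  for (int i = 0; i < limit; ++i) {
    sum += ptr[i + offset];
  }
  return sum;
}

// Hand-written equivalent with an induction variable on the address itself.
// Hoisting `offset` out of the loop is only valid because `i + offset` is
// assumed not to wrap in 32 bits, which is the fact the nsw flag encodes.
int sum_pointer_iv(const int *ptr, int offset, int limit) {
  int sum = 0;
  // Widen the offset once, outside the loop.
  const int *p = ptr + static_cast<std::int64_t>(offset);
  for (int i = 0; i < limit; ++i, ++p) {
    sum += *p;  // address is a simple add-recurrence: start p, step 1 element
  }
  return sum;
}
```

For in-range inputs both functions return the same sum; the second form simply avoids recomputing and sign-extending `i + offset` on every iteration.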
File | ||
---|---|---|
.. | ||
a.ll | ||
divide_by_one.ll | ||
gcd_multiply_expr.ll | ||
himeno_1.ll | ||
himeno_2.ll | ||
iv_times_constant_in_subscript.ll | ||
lit.local.cfg | ||
multidim_ivs_and_integer_offsets_3d.ll | ||
multidim_ivs_and_integer_offsets_nts_3d.ll | ||
multidim_ivs_and_parameteric_offsets_3d.ll | ||
multidim_only_ivs_2d_nested.ll | ||
multidim_only_ivs_2d.ll | ||
multidim_only_ivs_3d_cast.ll | ||
multidim_only_ivs_3d.ll | ||
multidim_two_accesses_different_delinearization.ll | ||
type_mismatch.ll | ||
undef.ll |