llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-02-14 02:33:53 +00:00

Justin Holewinski a1535e3b9b [NVPTX] Honor alignment on vector loads/stores

We were not considering the stated alignment on vector loads/stores,
leading us to generate vector instructions even when we do not have
sufficient alignment.

Now, for IR like:

  %1 = load <4 x float>, <4 x float>* %ptr, align 4

we will generate correct, conservative PTX like:

  ld.f32 ... [%ptr]
  ld.f32 ... [%ptr+4]
  ld.f32 ... [%ptr+8]
  ld.f32 ... [%ptr+12]

Or if we have an alignment of 8 (for example), we can
generate code like:

  ld.v2.f32 ... [%ptr]
  ld.v2.f32 ... [%ptr+8]
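
The splitting rule described above can be sketched in a few lines. This is an illustrative model only, not the NVPTX backend's actual code: the helper name `split_vector_load` and its parameters are hypothetical. It assumes PTX vector accesses come in widths of 2 and 4 elements and that an access must be aligned to its total size in bytes.

```python
def split_vector_load(num_elts, elt_bytes, align):
    """Return a list of (width, byte_offset) pairs covering the vector.

    Hypothetical helper for illustration; not taken from the LLVM sources.
    Assumes a vector access of `width` elements needs an alignment that is
    a multiple of width * elt_bytes.
    """
    # Try the widest PTX vector form first (v4, then v2).
    for width in (4, 2):
        if num_elts % width == 0 and align % (width * elt_bytes) == 0:
            break
    else:
        # Alignment is insufficient for any vector form: use scalar accesses.
        width = 1
    # One access per group of `width` elements, at increasing byte offsets.
    return [(width, i * elt_bytes) for i in range(0, num_elts, width)]


if __name__ == "__main__":
    # <4 x float> with align 4, 8, and 16.
    for align in (4, 8, 16):
        print(align, split_vector_load(4, 4, align))
```

Under these assumptions, `align 4` yields four scalar loads at offsets 0, 4, 8, and 12, and `align 8` yields two `v2` loads at offsets 0 and 8, matching the PTX shown in the commit message.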

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@213186 91177308-0d34-0410-b5e6-96231b3b80d8

2014-07-16 19:45:35 +00:00

access-non-generic.ll

…

add-128bit.ll

…

addrspacecast-gvar.ll

…

addrspacecast.ll

…

aggr-param.ll

…

annotations.ll

…

arg-lowering.ll

…

arithmetic-fp-sm20.ll

…

arithmetic-int.ll

…

atomics.ll

…

bfe.ll

…

bug17709.ll

…

call-with-alloca-buffer.ll

…

callchain.ll

…

calling-conv.ll

…

compare-int.ll

…

constant-vectors.ll

…

convert-fp.ll

…

convert-int-sm20.ll

…

ctlz.ll

…

ctpop.ll

…

cttz.ll

…

div-ri.ll

…

envreg.ll

…

fast-math.ll

…

fma-disable.ll

…

fma.ll

…

fp-literals.ll

…

generic-to-nvvm.ll

…

global-ordering.ll

…

gvar-init.ll

…

i1-global.ll

…

i1-int-to-fp.ll

…

i1-param.ll

…

i8-param.ll

…

imad.ll

…

implicit-def.ll

…

inline-asm.ll

…

intrin-nocapture.ll

…

intrinsic-old.ll

…

intrinsics.ll

…

isspacep.ll

…

ld-addrspace.ll

…

ld-generic.ll

…

ldparam-v4.ll

…

ldu-i8.ll

…

ldu-ldg.ll

…

ldu-reg-plus-offset.ll

…

lit.local.cfg

…

load-sext-i1.ll

…

local-stack-frame.ll

…

managed.ll

…

misaligned-vector-ldst.ll

…

module-inline-asm.ll

…

mulwide.ll

…

noduplicate-syncthreads.ll

…

nvvm-reflect.ll

…

param-align.ll

…

pr13291-i1-store.ll

…

pr16278.ll

…

pr17529.ll

…

ptx-version-30.ll

…

ptx-version-31.ll

…

refl1.ll

…

rotate.ll

…

rsqrt.ll

…

sched1.ll

…

sched2.ll

…

sext-in-reg.ll

…

sext-params.ll

…

shift-parts.ll

…

simple-call.ll

…

sm-version-20.ll

…

sm-version-21.ll

…

sm-version-30.ll

…

sm-version-35.ll

…

st-addrspace.ll

…

st-generic.ll

…

surf-read.ll

…

surf-write.ll

…

symbol-naming.ll

…

tex-read.ll

…

tuple-literal.ll

…

vec8.ll

…

vec-param-load.ll

…

vector-args.ll

…

vector-compare.ll

…

vector-loads.ll

…

vector-select.ll

…

vector-stores.ll

…

weak-global.ll

…

weak-linkage.ll

…