llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-02-22 13:29:44 +00:00

History

Chris Lattner ce7cafa960 shld is a very high latency operation. Instead of emitting it for shifts of

two or three, open code the equivalent operation which is faster on athlon
and P4 (by a substantial margin).

For example, instead of compiling this:

long long X2(long long Y) { return Y << 2; }

to:

X3_2:
        movl 4(%esp), %eax
        movl 8(%esp), %edx
        shldl $2, %eax, %edx
        shll $2, %eax
        ret

Compile it to:

X2:
        movl 4(%esp), %eax
        movl 8(%esp), %ecx
        movl %eax, %edx
        shrl $30, %edx
        leal (%edx,%ecx,4), %edx
        shll $2, %eax
        ret

Likewise, for << 3, compile to:

X3:
        movl 4(%esp), %eax
        movl 8(%esp), %ecx
        movl %eax, %edx
        shrl $29, %edx
        leal (%edx,%ecx,8), %edx
        shll $3, %eax
        ret

This matches icc, except that icc open codes the shifts as adds on the P4.


git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@17707 91177308-0d34-0410-b5e6-96231b3b80d8

2004-11-13 20:48:57 +00:00

.cvsignore

Tell CVS to ignore all .inc files

2003-08-03 15:50:17 +00:00

Makefile

Change Library Names Not To Conflict With Others When Installed

2004-10-27 23:18:45 +00:00

X86.h

Add -sse[,2,3] arguments to LLC

2004-08-24 08:18:44 +00:00

X86.td

Add support for the -x86-asm-syntax flag, which can be used to choose between