llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-11-15 20:06:46 +00:00

Author	SHA1	Message	Date
Evan Cheng	df2a1908b2	Fixed an encoding bug: movd from XMM to R32. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27807 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 18:19:00 +00:00
Chris Lattner	f70f8d91a7	pretty print node name git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27806 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 18:05:58 +00:00
Chris Lattner	90564f26d1	Implement an important entry from README_ALTIVEC: If an altivec predicate compare is used immediately by a branch, don't use a (serializing) MFCR instruction to read the CR6 register, which requires a compare to get it back to CR's. Instead, just branch on CR6 directly. :) For example, for: void foo2(vector float A, vector float B) { if (!vec_any_eq(A, B)) *B = (vector float){0,0,0,0}; } We now generate: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 bne cr6, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr instead of: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 mfcr r3, 2 rlwinm r3, r3, 27, 31, 31 cmpwi cr0, r3, 0 beq cr0, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr This implements CodeGen/PowerPC/vec_br_cmp.ll. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27804 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 17:59:36 +00:00
Chris Lattner	99590420de	new testcase git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27803 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 17:56:30 +00:00
Chris Lattner	3be29059ab	move some stuff around, clean things up git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27802 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 17:52:36 +00:00
Chris Lattner	993c897390	Teach the codegen about instructions used for SSE spill code, allowing it to optimize cases where it has to spill a lot git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27801 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 16:44:51 +00:00
Nate Begeman	6baa2b7220	Fix a copy & paste error from long ago. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27800 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 16:03:18 +00:00
Chris Lattner	738bd3031c	Add some more notes, many still missing git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27799 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 06:32:08 +00:00
Reid Spencer	30ea4787ea	Have the AutoRegen.sh script prompt the user for the LLVM src and obj directories if it can't find them. Then, replace those values into the configure.ac script and pass them to the LLVM_CONFIG_PROJECT so that the values become the default for llvm_src and llvm_obj variables. In this way the user is required to input this exactly once, and the scripts take it from there. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27798 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 06:27:47 +00:00
Reid Spencer	746047a979	Make it possible to default the llvm_src and llvm_obj variables based on the arguments to the macro. This better supports the AutoRegen.sh script in projects/sample/autoconf. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27797 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 06:25:37 +00:00
Chris Lattner	44c933ec9a	add a bunch of stuff, pieces still missing git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27796 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 06:18:36 +00:00
Chris Lattner	8b57ff75be	Add a warning. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27795 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 05:31:20 +00:00
Chris Lattner	6a58e0e144	Add a warning git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27794 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 05:26:10 +00:00
Chris Lattner	cea2aa77eb	Use vmladduhm to do v8i16 multiplies which is faster and simpler than doing even/odd halves. Thanks to Nate telling me what's what. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27793 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 04:28:57 +00:00
Chris Lattner	19a815238e	Implement v16i8 multiply with this code: vmuloub v5, v3, v2 vmuleub v2, v3, v2 vperm v2, v2, v5, v4 This implements CodeGen/PowerPC/vec_mul.ll. With this, v16i8 multiplies are 6.79x faster than before. Overall, UnitTests/Vector/multiplies.c is now 2.45x faster with LLVM than with GCC. Remove the 'integer multiplies' todo from the README file. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27792 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 03:57:35 +00:00
Chris Lattner	a637e58918	Add tests for v8i16 and v16i8 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27791 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 03:54:50 +00:00
Evan Cheng	4980467476	Correct comments git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27790 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 03:45:01 +00:00
Chris Lattner	72dd9bdcc5	Lower v8i16 multiply into this code: li r5, lo16(LCPI1_0) lis r6, ha16(LCPI1_0) lvx v4, r6, r5 vmulouh v5, v3, v2 vmuleuh v2, v3, v2 vperm v2, v2, v5, v4 where v4 is: LCPI1_0: ; <16 x ubyte> .byte 2 .byte 3 .byte 18 .byte 19 .byte 6 .byte 7 .byte 22 .byte 23 .byte 10 .byte 11 .byte 26 .byte 27 .byte 14 .byte 15 .byte 30 .byte 31 This is 5.07x faster on the G5 (measured) than lowering to scalar code + loads/stores. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27789 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 03:43:48 +00:00
Chris Lattner	e7c768ea24	Custom lower v4i32 multiplies into a cute sequence, instead of having legalize scalarize the sequence into 4 mullw's and a bunch of load/store traffic. This speeds up v4i32 multiplies 4.1x (measured) on a G5. This implements PowerPC/vec_mul.ll git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27788 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 03:24:30 +00:00
Chris Lattner	843ecd647c	new testcase git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27787 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 03:22:16 +00:00
Evan Cheng	74e955d931	Another entry git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27786 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 01:22:57 +00:00
Chris Lattner	309db81c10	Fix a build failure on Vladimir's tester. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27785 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 00:21:25 +00:00
Evan Cheng	7fa094a261	Another entry. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27784 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 00:21:01 +00:00
Evan Cheng	cdfc3c82a7	Use movss to insert_vector_elt(v, s, 0). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27782 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 22:45:49 +00:00
Chris Lattner	fd6bdf0b0f	Turn x86 unaligned load/store intrinsics into aligned load/store instructions if the pointer is known aligned. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27781 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 22:26:56 +00:00
Chris Lattner	80edfb3af5	Fix handling of calls in functions that use vectors. This fixes a crash on the code in GCC PR26546. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27780 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 22:10:08 +00:00
Evan Cheng	5edb8d270c	Use two pinsrw to insert an element into v4i32 / v4f32 vector. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27779 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 22:04:06 +00:00
Chris Lattner	22fcbb1320	remove done item git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27778 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:52:03 +00:00
Chris Lattner	f9568d8700	Don't diddle VRSAVE if no registers need to be added/removed from it. This allows us to codegen functions as: _test_rol: vspltisw v2, -12 vrlw v2, v2, v2 blr instead of: _test_rol: mfvrsave r2, 256 mr r3, r2 mtvrsave r3 vspltisw v2, -12 vrlw v2, v2, v2 mtvrsave r2 blr Testcase here: CodeGen/PowerPC/vec_vrsave.ll git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27777 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:48:13 +00:00
Chris Lattner	d5c654bb0a	New testcase, shouldn't touch vrsave git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27776 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:48:03 +00:00
Chris Lattner	48d7c069c7	Add a MachineInstr::eraseFromParent convenience method. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27775 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:35:41 +00:00
Chris Lattner	6b56091842	Add some convenience methods. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27774 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:35:08 +00:00
Evan Cheng	23b72005fa	Encoding bug git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27773 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:33:57 +00:00
Chris Lattner	402504b1ba	Vectors that are known live-in and live-out are clearly already marked in the vrsave register for the caller. This allows us to codegen a function as: _test_rol: mfspr r2, 256 mr r3, r2 mtspr 256, r3 vspltisw v2, -12 vrlw v2, v2, v2 mtspr 256, r2 blr instead of: _test_rol: mfspr r2, 256 oris r3, r2, 40960 mtspr 256, r3 vspltisw v0, -12 vrlw v2, v0, v0 mtspr 256, r2 blr git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27772 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:22:06 +00:00
Chris Lattner	939274fcfd	Prefer to allocate V2-V5 before V0,V1. This lets us generate code like this: vspltisw v2, -12 vrlw v2, v2, v2 instead of: vspltisw v0, -12 vrlw v2, v0, v0 when a function is returning a value. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27771 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:19:12 +00:00
Chris Lattner	369503f841	Move some knowledge about registers out of the code emitter into the register info. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27770 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:07:20 +00:00
Chris Lattner	f7d2372b74	Use a small table instead of macros to do this conversion. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27769 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 20:59:25 +00:00
Evan Cheng	c575ca22ea	Implement v8i16, v16i8 splat using unpckl + pshufd. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27768 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 20:43:08 +00:00
Chris Lattner	b2be4032c5	implement returns of a vector, testcase here: CodeGen/X86/vec_return.ll git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27767 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 20:32:50 +00:00
Chris Lattner	bddea0561b	New testcase git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27766 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 20:32:27 +00:00
Chris Lattner	8d5a894501	Codegen insertelement with constant insertion points as scalar_to_vector and a shuffle. For this: void %test2(<4 x float>* %F, float %f) { %tmp = load <4 x float>* %F ; <<4 x float>> [#uses=2] %tmp3 = add <4 x float> %tmp, %tmp ; <<4 x float>> [#uses=1] %tmp2 = insertelement <4 x float> %tmp3, float %f, uint 2 ; <<4 x float>> [#uses=2] %tmp6 = add <4 x float> %tmp2, %tmp2 ; <<4 x float>> [#uses=1] store <4 x float> %tmp6, <4 x float>* %F ret void } we now get this on X86 (which will get better): _test2: movl 4(%esp), %eax movaps (%eax), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, %xmm1 shufps $3, %xmm1, %xmm1 movaps %xmm0, %xmm2 shufps $1, %xmm2, %xmm2 unpcklps %xmm1, %xmm2 movss 8(%esp), %xmm1 unpcklps %xmm1, %xmm0 unpcklps %xmm2, %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%eax) ret instead of: _test2: subl $28, %esp movl 32(%esp), %eax movaps (%eax), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%esp) movss 36(%esp), %xmm0 movss %xmm0, 8(%esp) movaps (%esp), %xmm0 addps %xmm0, %xmm0 movaps %xmm0, (%eax) addl $28, %esp ret git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27765 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 19:21:01 +00:00
Chris Lattner	dbce85dedf	Make sure to check splats of every constant we can, handle splat(31) by being a bit more clever, add support for odd splats from -31 to -17. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27764 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 18:09:22 +00:00
Evan Cheng	51c9c43656	Incorrect foldMemoryOperand entries git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27763 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 18:06:12 +00:00
Evan Cheng	083248e143	Errors in patterns preventing load folding git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27762 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 18:05:01 +00:00
Jeff Cohen	3c280bf4d1	Add checks for __OpenBSD__. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27761 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 17:55:41 +00:00
Chris Lattner	bdd558cd94	Teach the ppc backend to use rol and vsldoi to generate splatted constants. This implements vec_constants.ll:test_vsldoi and test_rol git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27760 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 17:55:10 +00:00
Chris Lattner	2f70c8cfa2	Some more cases that can be generated with two instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27759 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 17:54:18 +00:00
Chris Lattner	966083fd1a	add a note git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27758 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 17:29:41 +00:00
Evan Cheng	5001ea1078	FP SETOLT, SETOLT, SETUGE, SETUGT conditions were implemented incorrectly git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27755 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 07:24:10 +00:00
Chris Lattner	6876e66e5d	Make some code more general, adding support for constant formation of several new patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27754 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 06:58:41 +00:00


25288 Commits