llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-01-22 10:33:23 +00:00

Author	SHA1	Message	Date
Chris Lattner	3e663a6c16	add some low-prio notes git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27934 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-21 21:03:21 +00:00
Evan Cheng	017dcc6e55	Now generating perfect (I think) code for "vector set" with a single non-zero scalar value. e.g. _mm_set_epi32(0, a, 0, 0); ==> movd 4(%esp), %xmm0 pshufd $69, %xmm0, %xmm0 _mm_set_epi8(0, 0, 0, 0, 0, a, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0); ==> movzbw 4(%esp), %ax movzwl %ax, %eax pxor %xmm0, %xmm0 pinsrw $5, %eax, %xmm0 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27923 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-21 01:05:10 +00:00
Chris Lattner	ba2194ae84	Fix the CodeGen/PowerPC/buildvec_canonicalize.ll regression last night. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27908 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-20 19:01:30 +00:00
Chris Lattner	879acefed5	add a note git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27907 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-20 18:49:28 +00:00
Chris Lattner	a29526275b	remove some v9 specific code git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27900 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-20 18:33:11 +00:00
Chris Lattner	060054b9d9	Remove this obsolete file git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27895 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-20 18:16:45 +00:00
Chris Lattner	2706983c48	This target is no longer built. The ,v files now live in the reoptimizer. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27885 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-20 17:15:44 +00:00
Evan Cheng	39623daef6	- Added support to turn "vector clear elements", e.g. pand V, <-1, -1, 0, -1> to a vector shuffle. - VECTOR_SHUFFLE lowering change in preparation for more efficient codegen of vector shuffle with zero (or any splat) vector. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27875 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-20 08:58:49 +00:00
Chris Lattner	0231007269	Make sure that the new instructions selected have the right type. This fixes CodeGen/PowerPC/2006-04-19-vmaddfp-crash.ll git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27868 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-20 05:58:10 +00:00
Evan Cheng	72cd9a9439	Handle v2i64 BUILD_VECTOR custom lowering correctly. v2i64 is a legal type, but i64 is not. If possible, change a i64 op to a f64 (e.g. load, constant) and then cast it back. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27849 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-20 00:11:39 +00:00
Evan Cheng	94fe5eb14a	isSplatMask() bug: first element can be an undef. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27847 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 23:28:59 +00:00
Evan Cheng	80d428c370	- Added support to do aribitrary 4 wide shuffle with no more than three instructions. - Fixed a commute vector_shuff bug. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27845 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 22:48:17 +00:00
Evan Cheng	fd111b5be5	Prefer {p}unpack* and movdup over {p}shuf as well. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27844 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 21:15:24 +00:00
Evan Cheng	f5e1dc278b	Renamed AddedCost to AddedComplexity. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27843 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 20:38:28 +00:00
Evan Cheng	2dadaea5d2	- Renamed AddedCost to AddedComplexity. - Added more movhlps and movlhps patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27842 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 20:37:34 +00:00
Evan Cheng	533a0aa9ba	Commute vector_shuffle to match more movlhps, movlp{s\|d} cases. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27840 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 20:35:22 +00:00
Evan Cheng	f66a094cac	More mov{h\|l}p{d\|s} patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27836 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 18:20:17 +00:00
Evan Cheng	cc0e98c8ed	- More mov{h\|l}ps patterns. - Increase cost (complexity) of patterns which match mov{h\|l}ps ops. These are preferred over shufps in most cases. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27835 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 18:11:52 +00:00
Evan Cheng	5941320c0d	Allow "let AddedCost = n in" to increase pattern complexity. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27834 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 18:07:24 +00:00
Chris Lattner	8cc5ffb27f	add a note git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27832 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 16:22:38 +00:00
Chris Lattner	bf9b3716ed	add a note git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27828 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 05:55:06 +00:00
Chris Lattner	ff5cdc2ccd	Add a note. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27827 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 05:53:27 +00:00
Evan Cheng	f0d4e3d7c0	- PEXTRW cannot take a memory location as its first source operand. - PINSRWrmi encoding bug. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27818 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 21:59:43 +00:00
Evan Cheng	f463f51161	SHUFP{S\|D}, PSHUF* encoding bugs. Left out the mask immediate operand. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27817 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 21:56:36 +00:00
Evan Cheng	b7a5c527ae	Name change for clarity sake git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27816 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 21:55:35 +00:00
Evan Cheng	a52b214f56	Encoding bug: CMPPSrmi, CMPPDrmi dropped operand 2 (condtion immediate). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27815 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 21:31:08 +00:00
Evan Cheng	7b7bd57abd	Name change for clarity sake git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27814 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 21:29:50 +00:00
Evan Cheng	fb2a3b2964	Left a pattern out git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27813 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 21:29:08 +00:00
Chris Lattner	80f362a48f	These are correctly encoded by the JIT. I checked :) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27810 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 19:03:38 +00:00
Chris Lattner	87140126e8	add a note git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27809 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 18:30:19 +00:00
Chris Lattner	0090120c2b	Fix a crash on: void foo2(vector float A, vector float B) { vector float C = (vector float)vec_cmpeq(A, B); if (!vec_any_eq(A, B)) B = (vector float){0,0,0,0}; A = C; } git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27808 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 18:28:22 +00:00
Evan Cheng	df2a1908b2	Fixed an encoding bug: movd from XMM to R32. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27807 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 18:19:00 +00:00
Chris Lattner	f70f8d91a7	pretty print node name git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27806 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 18:05:58 +00:00
Chris Lattner	90564f26d1	Implement an important entry from README_ALTIVEC: If an altivec predicate compare is used immediately by a branch, don't use a (serializing) MFCR instruction to read the CR6 register, which requires a compare to get it back to CR's. Instead, just branch on CR6 directly. :) For example, for: void foo2(vector float A, vector float B) { if (!vec_any_eq(A, B)) *B = (vector float){0,0,0,0}; } We now generate: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 bne cr6, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr instead of: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 mfcr r3, 2 rlwinm r3, r3, 27, 31, 31 cmpwi cr0, r3, 0 beq cr0, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr This implements CodeGen/PowerPC/vec_br_cmp.ll. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27804 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 17:59:36 +00:00
Chris Lattner	3be29059ab	move some stuff around, clean things up git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27802 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 17:52:36 +00:00
Chris Lattner	993c897390	Teach the codegen about instructions used for SSE spill code, allowing it to optimize cases where it has to spill a lot git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27801 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 16:44:51 +00:00
Chris Lattner	cea2aa77eb	Use vmladduhm to do v8i16 multiplies which is faster and simpler than doing even/odd halves. Thanks to Nate telling me what's what. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27793 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 04:28:57 +00:00
Chris Lattner	19a815238e	Implement v16i8 multiply with this code: vmuloub v5, v3, v2 vmuleub v2, v3, v2 vperm v2, v2, v5, v4 This implements CodeGen/PowerPC/vec_mul.ll. With this, v16i8 multiplies are 6.79x faster than before. Overall, UnitTests/Vector/multiplies.c is now 2.45x faster with LLVM than with GCC. Remove the 'integer multiplies' todo from the README file. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27792 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 03:57:35 +00:00
Evan Cheng	4980467476	Correct comments git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27790 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 03:45:01 +00:00
Chris Lattner	72dd9bdcc5	Lower v8i16 multiply into this code: li r5, lo16(LCPI1_0) lis r6, ha16(LCPI1_0) lvx v4, r6, r5 vmulouh v5, v3, v2 vmuleuh v2, v3, v2 vperm v2, v2, v5, v4 where v4 is: LCPI1_0: ; <16 x ubyte> .byte 2 .byte 3 .byte 18 .byte 19 .byte 6 .byte 7 .byte 22 .byte 23 .byte 10 .byte 11 .byte 26 .byte 27 .byte 14 .byte 15 .byte 30 .byte 31 This is 5.07x faster on the G5 (measured) than lowering to scalar code + loads/stores. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27789 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 03:43:48 +00:00
Chris Lattner	e7c768ea24	Custom lower v4i32 multiplies into a cute sequence, instead of having legalize scalarize the sequence into 4 mullw's and a bunch of load/store traffic. This speeds up v4i32 multiplies 4.1x (measured) on a G5. This implements PowerPC/vec_mul.ll git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27788 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 03:24:30 +00:00
Evan Cheng	74e955d931	Another entry git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27786 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 01:22:57 +00:00
Evan Cheng	7fa094a261	Another entry. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27784 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 00:21:01 +00:00
Evan Cheng	cdfc3c82a7	Use movss to insert_vector_elt(v, s, 0). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27782 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 22:45:49 +00:00
Evan Cheng	5edb8d270c	Use two pinsrw to insert an element into v4i32 / v4f32 vector. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27779 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 22:04:06 +00:00
Chris Lattner	22fcbb1320	remove done item git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27778 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:52:03 +00:00
Chris Lattner	f9568d8700	Don't diddle VRSAVE if no registers need to be added/removed from it. This allows us to codegen functions as: _test_rol: vspltisw v2, -12 vrlw v2, v2, v2 blr instead of: _test_rol: mfvrsave r2, 256 mr r3, r2 mtvrsave r3 vspltisw v2, -12 vrlw v2, v2, v2 mtvrsave r2 blr Testcase here: CodeGen/PowerPC/vec_vrsave.ll git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27777 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:48:13 +00:00
Evan Cheng	23b72005fa	Encoding bug git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27773 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:33:57 +00:00
Chris Lattner	402504b1ba	Vectors that are known live-in and live-out are clearly already marked in the vrsave register for the caller. This allows us to codegen a function as: _test_rol: mfspr r2, 256 mr r3, r2 mtspr 256, r3 vspltisw v2, -12 vrlw v2, v2, v2 mtspr 256, r2 blr instead of: _test_rol: mfspr r2, 256 oris r3, r2, 40960 mtspr 256, r3 vspltisw v0, -12 vrlw v2, v0, v0 mtspr 256, r2 blr git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27772 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:22:06 +00:00
Chris Lattner	939274fcfd	Prefer to allocate V2-V5 before V0,V1. This lets us generate code like this: vspltisw v2, -12 vrlw v2, v2, v2 instead of: vspltisw v0, -12 vrlw v2, v0, v0 when a function is returning a value. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27771 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:19:12 +00:00

... 2 3 4 5 6 ...

5994 Commits