llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-08-13 07:29:38 +00:00

Author	SHA1	Message	Date
Dan Gohman	f5aeb1a8e4	Rename ConstantSDNode::getValue to getZExtValue, for consistency with ConstantInt. This led to fixing a bug in TargetLowering.cpp using getValue instead of getAPIntValue. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@56159 91177308-0d34-0410-b5e6-96231b3b80d8	2008-09-12 16:56:44 +00:00
Eli Friedman	d0c0fae63b	Fix for PR2687: Add patterns to match sint_to_fp and fp_to_sint for <2 x i32>. This is a little messy, but it works. We should really get rid of the intrinsics, though, since they map perfectly well to standard LLVM instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@55864 91177308-0d34-0410-b5e6-96231b3b80d8	2008-09-05 23:07:03 +00:00
Evan Cheng	66e13153bd	FsFLD0S{S\|D} and V_SETALLONES are as cheap as moves. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@55466 91177308-0d34-0410-b5e6-96231b3b80d8	2008-08-28 07:52:25 +00:00
Dan Gohman	67ca6be16a	Tablegen generated code already tests the opcode value, so it's not necessary to use dyn_cast in these predicates. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@55055 91177308-0d34-0410-b5e6-96231b3b80d8	2008-08-20 15:24:22 +00:00
Dan Gohman	d9ced09299	Add an EXTRACTPSmr pattern to match the pattern that X86ISelLowering creates. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@54544 91177308-0d34-0410-b5e6-96231b3b80d8	2008-08-08 18:30:21 +00:00
Evan Cheng	e9d5035838	Fix PR2620: Fix X86cmppd selection code so it expects operands to be v2f64. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@54376 91177308-0d34-0410-b5e6-96231b3b80d8	2008-08-05 22:19:15 +00:00
Nate Begeman	e99b255b5c	Fix a typo in last commit git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@53720 91177308-0d34-0410-b5e6-96231b3b80d8	2008-07-17 17:04:58 +00:00
Nate Begeman	30a0de94e7	SSE codegen for vsetcc nodes git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@53719 91177308-0d34-0410-b5e6-96231b3b80d8	2008-07-17 16:51:19 +00:00
Evan Cheng	331e2bd942	Fix for PR2472. Use movss to set lower 32-bits of a zero XMM vector. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@53386 91177308-0d34-0410-b5e6-96231b3b80d8	2008-07-10 01:08:23 +00:00
Evan Cheng	4e444436f2	Horizontal-add instructions are not commutative. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@52363 91177308-0d34-0410-b5e6-96231b3b80d8	2008-06-16 21:16:24 +00:00
Evan Cheng	35b9a7790e	mpsadbw is commutable. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@52352 91177308-0d34-0410-b5e6-96231b3b80d8	2008-06-16 20:25:59 +00:00
Duncan Sands	d4b9c17fb7	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@52254 91177308-0d34-0410-b5e6-96231b3b80d8	2008-06-13 19:07:40 +00:00
Evan Cheng	f26ffe987c	Implement vector shift up / down and insert zero with ps{rl}lq / ps{rl}ldq. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51667 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-29 08:22:04 +00:00
Dan Gohman	c2ecdc5a26	Fix the encoding for two more "rm" instructions that were using MRMSrcReg. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51630 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-28 01:50:19 +00:00
Mon P Wang	bfbbd4d221	Fixed X86 encoding error CVTPS2PD and CVTPD2PS when the source operand is a memory location git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51626 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-28 00:42:27 +00:00
Evan Cheng	a31593901d	Eliminate x86.sse2.punpckh.qdq and x86.sse2.punpckl.qdq. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51533 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-24 02:56:30 +00:00
Evan Cheng	e716bb1c59	Eliminate x86.sse2.movs.d, x86.sse2.shuf.pd, x86.sse2.unpckh.pd, and x86.sse2.unpckl.pd intrinsics. These will be lowered into shuffles. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51531 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-24 02:14:05 +00:00
Evan Cheng	999dbe6bbc	Remove x86.sse2.loadh.pd and x86.sse2.loadl.pd. These will be lowered into load and shuffle instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51522 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-24 00:07:29 +00:00
Evan Cheng	cd0baf21a1	Use movlps / movhps to modify low / high half of 16-byet memory location. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51501 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-23 21:23:16 +00:00
Evan Cheng	50f778deed	Fix a duplicated pattern. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51490 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-23 18:00:18 +00:00
Dan Gohman	0b924dcef8	Use PMULDQ for v2i64 multiplies when SSE4.1 is available. And add load-folding table entries for PMULDQ and PMULLD. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51489 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-23 17:49:40 +00:00
Evan Cheng	b1938263c7	Bug: rcpps can only folds a load if the address is 16-byte aligned. Fixed many 'ps' load folding patterns in X86InstrSSE.td which are missing the proper alignment checks. Also fixed some 80 col. violations. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51462 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-23 00:37:07 +00:00
Evan Cheng	c36c0ab44b	Add missing patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51435 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-22 18:56:56 +00:00
Evan Cheng	8e8de684c7	movsd and movq do not require 16-byte alignment. This fixes vec_set-5.ll on Linux. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51327 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-20 18:24:47 +00:00
Nate Begeman	32097bdbf6	Fix one more encoding bug. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51057 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-13 17:52:09 +00:00
Nate Begeman	c9bdb00683	Fix and encoding error in the psrad xmm, imm8 instruction. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51020 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-13 01:47:52 +00:00
Nate Begeman	0d1704b955	Teach Legalize how to scalarize VSETCC Teach X86 a few more vsetcc patterns. Custom lowering for unsupported ones is next. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51009 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-12 23:09:43 +00:00
Nate Begeman	c2616e43fd	Initial X86 codegen support for VSETCC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@51000 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-12 20:34:32 +00:00
Evan Cheng	b70ea0bd03	Some clean up. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@50929 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-10 00:59:18 +00:00
Evan Cheng	23573e5be6	Add a pattern to do move the low element of a v4f32 and zero extend the rest. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@50922 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-09 23:37:55 +00:00
Evan Cheng	d880b97257	Handle a few more cases of folding load i64 into xmm and zero top bits. Note, some of the code will be moved into target independent part of DAG combiner in a subsequent patch. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@50918 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-09 21:53:03 +00:00
Evan Cheng	fd17f42bab	Use movq to move low half of XMM register and zero-extend the rest. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@50874 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-08 22:35:02 +00:00
Evan Cheng	7e2ff77ef0	Handle vector move / load which zero the destination register top bits (i.e. movd, movq, movss (addr), movsd (addr)) with X86 specific dag combine. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@50838 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-08 00:57:18 +00:00
Evan Cheng	22b942aa4d	Add separate intrinsics for MMX / SSE shifts with i32 integer operands. This allow us to simplify the horribly complicated matching code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@50601 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-03 00:52:09 +00:00
Evan Cheng	b609339a5c	80 column violation. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@50575 91177308-0d34-0410-b5e6-96231b3b80d8	2008-05-02 07:53:32 +00:00
Chris Lattner	bd381a777b	A better fix for my previous patch, MOVZQI2PQIrr just requires SSE2. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@49986 91177308-0d34-0410-b5e6-96231b3b80d8	2008-04-20 05:52:46 +00:00
Dan Gohman	171c11ec93	Add support for the form of the SSE41 extractps instruction that puts its result in a 32-bit GPR. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@49762 91177308-0d34-0410-b5e6-96231b3b80d8	2008-04-16 02:32:24 +00:00
Chris Lattner	db66750753	Fix the x86-64 side of PR2108 by adding a v2f64 version of MOVZQI2PQIrr. This would be better handled as a dag combine (with the goal of eliminating the bitconvert) but I don't know how to do that safely. Thoughts welcome. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@49463 91177308-0d34-0410-b5e6-96231b3b80d8	2008-04-10 05:13:43 +00:00
Evan Cheng	0c0f83ff5d	Favors pshufd over shufps when shuffling elements from one vector. pshufd is faster than shufps. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@49244 91177308-0d34-0410-b5e6-96231b3b80d8	2008-04-05 00:30:36 +00:00
Evan Cheng	7aae876db1	Fix some SSE4.1 instruction encoding bugs. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@48815 91177308-0d34-0410-b5e6-96231b3b80d8	2008-03-26 08:11:49 +00:00
Evan Cheng	62a3f1538c	- SSE4.1 extractfps extracts a f32 into a gr32 register. Very useful! Not. Fix the instruction specification and teaches lowering code to use it only when the only use is a store instruction. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@48746 91177308-0d34-0410-b5e6-96231b3b80d8	2008-03-24 21:52:23 +00:00
Nate Begeman	bc4efb8ac7	Add a couple missing SSE4 instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@48430 91177308-0d34-0410-b5e6-96231b3b80d8	2008-03-16 21:14:46 +00:00
Evan Cheng	da47e6e0d0	Replace all target specific implicit def instructions with a target independent one: TargetInstrInfo::IMPLICIT_DEF. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@48380 91177308-0d34-0410-b5e6-96231b3b80d8	2008-03-15 00:03:38 +00:00
Evan Cheng	029d9dafa0	Fix some 80 col violations. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@48361 91177308-0d34-0410-b5e6-96231b3b80d8	2008-03-14 07:46:48 +00:00
Evan Cheng	172b794cd5	Fix a number of encoding bugs. SSE 4.1 instructions MPSADBWrri, PINSRDrr, etc. have 8-bits immediate field (ImmT == Imm8). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@48360 91177308-0d34-0410-b5e6-96231b3b80d8	2008-03-14 07:39:27 +00:00
Evan Cheng	c8e3b147ee	Clean up my own mess. X86 lowering normalize vector 0 to v4i32. However DAGCombine can fold (sub x, x) -> 0 after legalization. It can create a zero vector of a type that's not expected (e.g. v8i16). We don't want to disable the optimization since leaving a (sub x, x) is really bad. Add isel patterns for other types of vector 0 to ensure correctness. It's highly unlikely to happen other than in bugpoint reduced test cases. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@48279 91177308-0d34-0410-b5e6-96231b3b80d8	2008-03-12 07:02:50 +00:00
Evan Cheng	27b7db549e	Implement x86 support for @llvm.prefetch. It corresponds to prefetcht{0\|1\|2} and prefetchnta instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@48042 91177308-0d34-0410-b5e6-96231b3b80d8	2008-03-08 00:58:38 +00:00
Evan Cheng	e9083d669a	isTwoAddress = 1 -> Constraints. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@47941 91177308-0d34-0410-b5e6-96231b3b80d8	2008-03-05 08:19:16 +00:00
Evan Cheng	e7b8a8b713	PSLLWri etc. are two-address instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@47940 91177308-0d34-0410-b5e6-96231b3b80d8	2008-03-05 08:11:27 +00:00
Evan Cheng	efec751a1b	- When DAG combiner is folding a bit convert into a BUILD_VECTOR, it should check if it's essentially a SCALAR_TO_VECTOR. Avoid turning (v8i16) <10, u, u, u> to <10, 0, u, u, u, u, u, u>. Instead, simply convert it to a SCALAR_TO_VECTOR of the proper type. - X86 now normalize SCALAR_TO_VECTOR to (BIT_CONVERT (v4i32 SCALAR_TO_VECTOR)). Get rid of X86ISD::S2VEC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@47290 91177308-0d34-0410-b5e6-96231b3b80d8	2008-02-18 23:04:32 +00:00
Andrew Lenharth	22c5c1b2df	llvm.memory.barrier, and impl for x86 and alpha git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@47204 91177308-0d34-0410-b5e6-96231b3b80d8	2008-02-16 01:24:58 +00:00
Nate Begeman	cdd1eeca2c	SSE4.1 64b integer insert/extract pattern support Move formats into the formats file git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@47035 91177308-0d34-0410-b5e6-96231b3b80d8	2008-02-12 22:51:28 +00:00
Nate Begeman	14d12caf1d	Enable SSE4 codegen and pattern matching. Add some notes to the README. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@46949 91177308-0d34-0410-b5e6-96231b3b80d8	2008-02-11 04:19:36 +00:00
Nate Begeman	ab5d56c6b9	xmm0 variable blends git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@46931 91177308-0d34-0410-b5e6-96231b3b80d8	2008-02-10 18:47:57 +00:00
Nate Begeman	fea2be50b9	memopv16i8 had wrong alignment requirement, would have broken pabsb pabs{b,w,d} are not two address fix extract-to-mem sse4 ops add sse4 vector sign extend nodes git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@46915 91177308-0d34-0410-b5e6-96231b3b80d8	2008-02-09 23:46:37 +00:00
Nate Begeman	1426d52cab	Skeleton of insert and extract matching, more to come git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@46902 91177308-0d34-0410-b5e6-96231b3b80d8	2008-02-09 01:38:08 +00:00
Nate Begeman	204e84e138	The rest of the SSE4.1 intrinsic patterns that are obvious to me. Getting Evan's help with the rest. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@46697 91177308-0d34-0410-b5e6-96231b3b80d8	2008-02-04 06:00:24 +00:00
Nate Begeman	2f6f1c02ca	Some more SSE 4.1 intrinsic patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@46696 91177308-0d34-0410-b5e6-96231b3b80d8	2008-02-04 05:34:34 +00:00
Nate Begeman	63ec90a6a8	SSE 4.1 Intrinsics and detection git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@46681 91177308-0d34-0410-b5e6-96231b3b80d8	2008-02-03 07:18:54 +00:00
Chris Lattner	d43d00cf3a	Significantly simplify and improve handling of FP function results on x86-32. This case returns the value in ST(0) and then has to convert it to an SSE register. This causes significant codegen ugliness in some cases. For example in the trivial fp-stack-direct-ret.ll testcase we used to generate: _bar: subl $28, %esp call L_foo$stub fstpl 16(%esp) movsd 16(%esp), %xmm0 movsd %xmm0, 8(%esp) fldl 8(%esp) addl $28, %esp ret because we move the result of foo() into an XMM register, then have to move it back for the return of bar. Instead of hacking ever-more special cases into the call result lowering code we take a much simpler approach: on x86-32, fp return is modeled as always returning into an f80 register which is then truncated to f32 or f64 as needed. Similarly for a result, we model it as an extension to f80 + return. This exposes the truncate and extensions to the dag combiner, allowing target independent code to hack on them, eliminating them in this case. This gives us this code for the example above: _bar: subl $12, %esp call L_foo$stub addl $12, %esp ret The nasty aspect of this is that these conversions are not legal, but we want the second pass of dag combiner (post-legalize) to be able to hack on them. To handle this, we lie to legalize and say they are legal, then custom expand them on entry to the isel pass (PreprocessForFPConvert). This is gross, but less gross than the code it is replacing :) This also allows us to generate better code in several other cases. For example on fp-stack-ret-conv.ll, we now generate: _test: subl $12, %esp call L_foo$stub fstps 8(%esp) movl 16(%esp), %eax cvtss2sd 8(%esp), %xmm0 movsd %xmm0, (%eax) addl $12, %esp ret where before we produced (incidentally, the old bad code is identical to what gcc produces): _test: subl $12, %esp call L_foo$stub fstpl (%esp) cvtsd2ss (%esp), %xmm0 cvtss2sd %xmm0, %xmm0 movl 16(%esp), %eax movsd %xmm0, (%eax) addl $12, %esp ret Note that we generate slightly worse code on pr1505b.ll due to a scheduling deficiency that is unrelated to this patch. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@46307 91177308-0d34-0410-b5e6-96231b3b80d8	2008-01-24 08:07:48 +00:00
Chris Lattner	f77e037309	add some missing flags. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@45859 91177308-0d34-0410-b5e6-96231b3b80d8	2008-01-11 06:59:07 +00:00
Chris Lattner	ba7e756c22	Start inferring side effect information more aggressively, and fix many bugs in the x86 backend where instructions were not marked maystore/mayload, and perf issues where instructions were not marked neverHasSideEffects. It would be really nice if we could write patterns for copy instructions. I have audited all the x86 instructions down to MOVDQAmr. The flags on others and on other targets are probably not right in all cases, but no clients currently use this info that are enabled by default. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@45829 91177308-0d34-0410-b5e6-96231b3b80d8	2008-01-10 07:59:24 +00:00
Chris Lattner	dd41527a7d	remove explicit sets of 'neverHasSideEffects' that can now be inferred from the instr patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@45824 91177308-0d34-0410-b5e6-96231b3b80d8	2008-01-10 05:45:39 +00:00
Chris Lattner	834f1ce031	rename isLoad -> isSimpleLoad due to evan's desire to have such a predicate. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@45667 91177308-0d34-0410-b5e6-96231b3b80d8	2008-01-06 23:38:27 +00:00
Chris Lattner	4ee451de36	Remove attribution from file headers, per discussion on llvmdev. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@45418 91177308-0d34-0410-b5e6-96231b3b80d8	2007-12-29 20:36:04 +00:00
Evan Cheng	700a0fba97	Fix JIT encoding for CMPSD as well. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@45268 91177308-0d34-0410-b5e6-96231b3b80d8	2007-12-20 19:57:09 +00:00
Bill Wendling	627c00b663	Add "mayHaveSideEffects" and "neverHasSideEffects" flags to some instructions. I based what flag to set on whether it was already marked as "isRematerializable". If there was a further check to determine if it's "really" rematerializable, then I marked it as "mayHaveSideEffects" and created a check in the X86 back-end similar to the remat one. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@45132 91177308-0d34-0410-b5e6-96231b3b80d8	2007-12-17 23:07:56 +00:00
Chris Lattner	d7610e16fc	Fix the JIT encoding of cmpss, which aborts with this assertion currently: X86CodeEmitter.cpp:378: failed assertion `0 && "Immediate size not set!"' I think* this is right, but Evan, please verify. It also looks like CMPSDrr and maybe others are missing this info. Evan, plz investigate. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@45074 91177308-0d34-0410-b5e6-96231b3b80d8	2007-12-16 20:12:41 +00:00
Evan Cheng	7a831ce85f	Make better use of instructions that clear high bits; fix various 2-wide shuffle bugs. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@45058 91177308-0d34-0410-b5e6-96231b3b80d8	2007-12-15 03:00:47 +00:00
Evan Cheng	6e141fd048	Implicit def instructions, e.g. X86::IMPLICIT_DEF_GR32, are always re-materializable and they should not be spilled. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@44960 91177308-0d34-0410-b5e6-96231b3b80d8	2007-12-12 23:12:09 +00:00
Evan Cheng	1076210ee3	Remove a bogus optimization. It's not possible to do a move to low element to a <8 x i16> or <16 x i8> vector. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@44669 91177308-0d34-0410-b5e6-96231b3b80d8	2007-12-06 22:14:22 +00:00
Chris Lattner	8a594489bf	Fix a long standing deficiency in the X86 backend: we would sometimes emit "zero" and "all one" vectors multiple times, for example: _test2: pcmpeqd %mm0, %mm0 movq %mm0, _M1 pcmpeqd %mm0, %mm0 movq %mm0, _M2 ret instead of: _test2: pcmpeqd %mm0, %mm0 movq %mm0, _M1 movq %mm0, _M2 ret This patch fixes this by always arranging for zero/one vectors to be defined as v4i32 or v2i32 (SSE/MMX) instead of letting them be any random type. This ensures they get trivially CSE'd on the dag. This fix is also important for LegalizeDAGTypes, as it gets unhappy when the x86 backend wants BUILD_VECTOR(i64 0) to be legal even when 'i64' isn't legal. This patch makes the following changes: 1) X86TargetLowering::LowerBUILD_VECTOR now lowers 0/1 vectors into their canonical types. 2) The now-dead patterns are removed from the SSE/MMX .td files. 3) All the patterns in the .td file that referred to immAllOnesV or immAllZerosV in the wrong form now use *_bc to match them with a bitcast wrapped around them. 4) X86DAGToDAGISel::SelectScalarSSELoad is generalized to handle bitcast'd zero vectors, which simplifies the code actually. 5) getShuffleVectorZeroOrUndef is updated to generate a shuffle that is legal, instead of generating one that is illegal and expecting a later legalize pass to clean it up. 6) isZeroShuffle is generalized to handle bitcast of zeros. 7) several other minor tweaks. This patch is definite goodness, but has the potential to cause random code quality regressions. Please be on the lookout for these and let me know if they happen. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@44310 91177308-0d34-0410-b5e6-96231b3b80d8	2007-11-25 00:24:49 +00:00
Nate Begeman	b348d18caf	Add support for vectors to int <-> float casts. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@44204 91177308-0d34-0410-b5e6-96231b3b80d8	2007-11-17 03:58:34 +00:00
Dale Johannesen	c784208a73	Add missing SSE builtins: CVTPD2PI, CVTPS2PI, CVTTPD2PI, CVTTPS2PI, CVTPI2PD, CVTPI2PS. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@43523 91177308-0d34-0410-b5e6-96231b3b80d8	2007-10-30 22:15:38 +00:00
Arnold Schwaighofer	48abc5cf6b	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@42934 91177308-0d34-0410-b5e6-96231b3b80d8	2007-10-12 21:30:57 +00:00
Dale Johannesen	83e105c600	Add missing argument to PALIGNR git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@42874 91177308-0d34-0410-b5e6-96231b3b80d8	2007-10-11 20:58:37 +00:00
Evan Cheng	c231e8c8a5	Added DAG xforms. e.g. (vextract (v4f32 s2v (f32 load $addr)), 0) -> (f32 load $addr) (vextract (v4i32 bc (v4f32 s2v (f32 load $addr))), 0) -> (i32 load $addr) Remove x86 specific patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@42677 91177308-0d34-0410-b5e6-96231b3b80d8	2007-10-06 02:46:29 +00:00
Evan Cheng	fef922a4d5	Typo. X86comi doesn't read / write chain's. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@42492 91177308-0d34-0410-b5e6-96231b3b80d8	2007-10-01 18:12:48 +00:00
Evan Cheng	e5f6204cd5	Enabling new condition code modeling scheme. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@42459 91177308-0d34-0410-b5e6-96231b3b80d8	2007-09-29 00:00:36 +00:00
Evan Cheng	0488db9b99	Added support for new condition code modeling scheme (i.e. physical register dependency). These are a bunch of instructions that are duplicated so the x86 backend can support both the old and new schemes at the same time. They will be deleted after all the kinks are worked out. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@42285 91177308-0d34-0410-b5e6-96231b3b80d8	2007-09-25 01:57:46 +00:00
Dale Johannesen	f1fc3a8fa6	Fix PR 1681. When X86 target uses +sse -sse2, keep f32 in SSE registers and f64 in x87. This is effectively a new codegen mode. Change addLegalFPImmediate to permit float and double variants to do different things. Adjust callers. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@42246 91177308-0d34-0410-b5e6-96231b3b80d8	2007-09-23 14:52:20 +00:00
Evan Cheng	24f2ea3971	Add implicit def of EFLAGS on those instructions that may modify flags. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@41962 91177308-0d34-0410-b5e6-96231b3b80d8	2007-09-14 21:48:26 +00:00
Evan Cheng	071a279e94	Remove (somewhat confusing) Imp<> helper, use let Defs = [], Uses = [] instead. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@41863 91177308-0d34-0410-b5e6-96231b3b80d8	2007-09-11 19:55:27 +00:00
Dan Gohman	1ab79897e2	Avoid storing and reloading zeros and other constants from stack slots by flagging the associated instructions as being trivially rematerializable. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@41775 91177308-0d34-0410-b5e6-96231b3b80d8	2007-09-07 21:32:51 +00:00
Evan Cheng	2f39426ec9	Mark load instructions with isLoad = 1. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@41595 91177308-0d34-0410-b5e6-96231b3b80d8	2007-08-30 05:49:43 +00:00
Bill Wendling	01284b4d55	64-bit SSSE3 ops that use MMX registers don't require 16-byte alignment. Make a 'memop' pattern just for them. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@41017 91177308-0d34-0410-b5e6-96231b3b80d8	2007-08-11 09:52:53 +00:00
Bill Wendling	ae9671b838	For kicks, I though it would be fun to use the correct opcode. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40985 91177308-0d34-0410-b5e6-96231b3b80d8	2007-08-10 09:00:17 +00:00
Bill Wendling	76d708b76f	Adding SSSE3 intrinsics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40982 91177308-0d34-0410-b5e6-96231b3b80d8	2007-08-10 06:22:27 +00:00
Dan Gohman	7f55fcbc6b	Fix the alignment requirements of several unpck and shuf instructions. Generalize isPSHUFDMask and add a unary SHUFPD pattern so that SHUFPD's memory operand alignment can be tested as well, with a fix to avoid breaking MMX's use of isPSHUFDMask. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40756 91177308-0d34-0410-b5e6-96231b3b80d8	2007-08-02 21:17:01 +00:00
Dan Gohman	f3372d1d64	Fix pastos in vector arithmetic intrinsics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40754 91177308-0d34-0410-b5e6-96231b3b80d8	2007-08-02 21:06:40 +00:00
Dan Gohman	73a902b228	Mark the SSE and MMX load instructions that X86InstrInfo::isReallyTriviallyReMaterializable knows how to handle with the isReMaterializable flag so that it is given a chance to handle them. Without hoisting constant-pool loads from loops this isn't very visible, though it does keep CodeGen/X86/constant-pool-remat-0.ll from making a copy of the constant pool on the stack. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40736 91177308-0d34-0410-b5e6-96231b3b80d8	2007-08-02 14:27:55 +00:00
Evan Cheng	c5dd54154a	Missing Requires. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40691 91177308-0d34-0410-b5e6-96231b3b80d8	2007-08-01 21:42:24 +00:00
Dan Gohman	b1576f56c8	Change the x86 assembly output to use tab characters to separate the mnemonics from their operands instead of single spaces. This makes the assembly output a little more consistent with various other compilers (f.e. GCC), and slightly easier to read. Also, update the regression tests accordingly. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40648 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-31 20:11:57 +00:00
Evan Cheng	c64a1a921c	Redo and generalize previously removed opt for pinsrw: (vextract (v4i32 bc (v4f32 s2v (f32 load ))), 0) -> (i32 load ) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40628 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-31 08:04:03 +00:00
Dan Gohman	d300622eba	Re-apply 40504, but with a fix for the segfault it caused in oggenc: Make the alignedload and alignedstore patterns always require 16-byte alignment. This way when they are used in the "Fs" instructions, in which a vector instruction is used for a scalar purpose, they can still require the full vector alignment. And add a regression test for this. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40555 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-27 17:16:43 +00:00
Evan Cheng	3e22947d9a	Reverting 40504 for now. It's breaking oggenc. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40547 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-27 01:37:47 +00:00
Dan Gohman	1704c2f9b9	Fix a whitespace difference between CMPSSrr and CMPSDrr. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40528 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-26 15:11:50 +00:00
Dan Gohman	d3283832aa	Remove X86ISD::LOAD_PACK and X86ISD::LOAD_UA and associated code from the x86 target, replacing them with the new alignment attributes on memory references. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40504 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-26 00:31:09 +00:00
Evan Cheng	b4162fd393	Because we promote SSE logical ops and loads to v2i64, we often end up generate code that cross integer / floating point domains (e.g. generate pxor / pand for logical ops on floating point value, movdqa to load / store floating point SSE values). Given that, it's better to use movaps instead of movdqa and movups instead of movdqu. They have the same latency but the "aps" variants are one byte shorter. If the domain crossing problem is a real performance issue, then we will have to fix it with dynamic programming based isel. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40076 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-20 00:27:43 +00:00
Evan Cheng	31d3a65052	Fix patterns so we isel the xorps, etc. for floating pt logical SSE ops. DAG combiner may fold away the (bit_convert (load)). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40070 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-19 23:34:10 +00:00
Evan Cheng	64d80e3387	Change instruction description to split OperandList into OutOperandList and InOperandList. This gives one piece of important information: # of results produced by an instruction. An example of the change: def ADD32rr : I<0x01, MRMDestReg, (ops GR32:$dst, GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; => def ADD32rr : I<0x01, MRMDestReg, (outs GR32:$dst), (ins GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40033 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-19 01:14:50 +00:00
Dan Gohman	4106f3714e	Implement initial memory alignment awareness for SSE instructions. Vector loads and stores that have a specified alignment of less than 16 bytes now use instructions that support misaligned memory references. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@40015 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-18 20:23:34 +00:00
Dan Gohman	2038252c6a	Define non-intrinsic instructions for vector min, max, sqrt, rsqrt, and rcp, in addition to the intrinsic forms. Add spill-folding entries for these new instructions, and for the scalar min and max instrinsic instructions which were missing. And add some preliminary ISelLowering code for using the new non-intrinsic vector sqrt instruction, and fneg and fabs. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@38478 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-10 00:05:58 +00:00
Dale Johannesen	849f214a4e	Fix for PR 1505 (and 1489). Rewrite X87 register model to include f32 variants. Some factoring improvments forthcoming. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@37847 91177308-0d34-0410-b5e6-96231b3b80d8	2007-07-03 00:53:03 +00:00
Dan Gohman	d45eddd214	Revert the earlier change that removed the M_REMATERIALIZABLE machine instruction flag, and use the flag along with a virtual member function hook for targets to override if there are instructions that are only trivially rematerializable with specific operands (i.e. constant pool loads). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@37728 91177308-0d34-0410-b5e6-96231b3b80d8	2007-06-26 00:48:07 +00:00
Dan Gohman	32791e06d8	Make minor adjustments to whitespace and comments to reduce differences between SSE1 instructions and their respective SSE2 analogues. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@37718 91177308-0d34-0410-b5e6-96231b3b80d8	2007-06-25 15:44:19 +00:00
Dan Gohman	01976307d2	Fix loadv2i32 to be loadv4i32, though it isn't actually used anywhere yet. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@37717 91177308-0d34-0410-b5e6-96231b3b80d8	2007-06-25 15:19:03 +00:00
Dan Gohman	82a87a0172	Replace M_REMATERIALIZIBLE and the newly-added isOtherReMaterializableLoad with a general target hook to identify rematerializable instructions. Some instructions are only rematerializable with specific operands, such as loads from constant pools, while others are always rematerializable. This hook allows both to be identified as being rematerializable with the same mechanism. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@37644 91177308-0d34-0410-b5e6-96231b3b80d8	2007-06-19 01:48:05 +00:00
Evan Cheng	174f803395	Added missing patterns for UNPCKH* and PUNPCKH*. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@37172 91177308-0d34-0410-b5e6-96231b3b80d8	2007-05-17 18:44:37 +00:00
Bill Wendling	ddd35321fb	Non-algorithmic change. Moved definitions around into separate sections for SSE1, SSE2, SSE3, and SSSE3. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@36656 91177308-0d34-0410-b5e6-96231b3b80d8	2007-05-02 23:11:52 +00:00
Dan Gohman	23420c09c0	Fix the spelling of the prefetchnta instruction. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@36256 91177308-0d34-0410-b5e6-96231b3b80d8	2007-04-18 14:09:14 +00:00
Bill Wendling	bb1ee05253	Add support for our first SSSE3 instruction "pmulhrsw". git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@35869 91177308-0d34-0410-b5e6-96231b3b80d8	2007-04-10 22:10:25 +00:00
Evan Cheng	768143547b	Mark re-materializable instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@35230 91177308-0d34-0410-b5e6-96231b3b80d8	2007-03-21 00:16:56 +00:00
Chris Lattner	569bdc7bb7	add missing braces git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@34905 91177308-0d34-0410-b5e6-96231b3b80d8	2007-03-04 06:13:52 +00:00
Evan Cheng	58866f43fa	How the heck did I forget patterns for llvm.x86.sse2.cmp.sd? git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@34434 91177308-0d34-0410-b5e6-96231b3b80d8	2007-02-20 00:39:09 +00:00
Evan Cheng	73d6cf12ad	- FCOPYSIGN custom lowering bug. Clear the sign bit of operand 0 first before or'ing in the sign bit of operand 1. - Tweaking: rather than left shift the sign bit, fp_extend operand 1 first before taking its sign bit if its type is smaller than that of operand 0. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@32932 91177308-0d34-0410-b5e6-96231b3b80d8	2007-01-05 21:37:56 +00:00
Evan Cheng	68c47cba35	With SSE2, expand FCOPYSIGN to a series of SSE bitwise operations. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@32900 91177308-0d34-0410-b5e6-96231b3b80d8	2007-01-05 07:55:56 +00:00
Evan Cheng	c9f0923f17	- Rename MOVDSS2DIrr to MOVSS2DIrr for consistency sake. - Add MOVDI2SSrm and MOVSS2DImr to fold load / store for i32 <-> f32 bit_convert patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@32582 91177308-0d34-0410-b5e6-96231b3b80d8	2006-12-14 19:43:11 +00:00
Chris Lattner	f3597a13ae	If we have ScalarSSE, we can select bitconvert into single instructions. This compiles bitcast.ll:test3/test4 into: _test3: movd %xmm0, %eax ret _test4: movd %edi, %xmm0 ret git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@32230 91177308-0d34-0410-b5e6-96231b3b80d8	2006-12-05 18:45:06 +00:00
Evan Cheng	ebf01d63b0	Correct instructions for moving data between GR64 and SSE registers; also correct load i64 / store i64 from v2i64. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@31795 91177308-0d34-0410-b5e6-96231b3b80d8	2006-11-16 23:33:25 +00:00
Evan Cheng	8ca29326e1	Don't dag combine floating point select to max and min intrinsics. Those take v4f32 / v2f64 operands and may end up causing larger spills / restores. Added X86 specific nodes X86ISD::FMAX, X86ISD::FMIN instead. This fixes PR996. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@31645 91177308-0d34-0410-b5e6-96231b3b80d8	2006-11-10 21:43:37 +00:00
Evan Cheng	6e56e2c602	Fixed a bug which causes x86 be to incorrectly match shuffle v, undef, <2, ?, 3, ?> to movhlps It should match to unpckhps instead. Added proper matching code for shuffle v, undef, <2, 3, 2, 3> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@31519 91177308-0d34-0410-b5e6-96231b3b80d8	2006-11-07 22:14:24 +00:00
Chris Lattner	3751844b39	remove dead/redundant vars git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@31435 91177308-0d34-0410-b5e6-96231b3b80d8	2006-11-03 23:48:56 +00:00
Evan Cheng	acf7f2e3a9	Fix ldmxcsr JIT encoding. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@31343 91177308-0d34-0410-b5e6-96231b3b80d8	2006-11-01 06:53:52 +00:00
Evan Cheng	f686d9b71f	Fixed a significant bug where unpcklpd is incorrectly used to extract element 1 from a v2f64 value. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@31228 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-27 21:08:32 +00:00
Evan Cheng	009073d839	X86ISD::PEXTRW 3rd operand type is always target pointer type. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@31185 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-25 21:35:05 +00:00
Evan Cheng	82a9164fb4	ComplexPatterns sse_load_f32 and sse_load_f64 returns in / out chain operands. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30892 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-11 21:06:01 +00:00
Evan Cheng	f2ea84aadc	Don't go too crazy with these AddComplexity. Try matching shufps with load folding first. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30848 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-09 21:42:15 +00:00
Evan Cheng	466685d41a	Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30844 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-09 20:57:25 +00:00
Chris Lattner	3a7cd951c1	completely disable folding of loads into scalar sse instructions and provide a framework for doing it right. This fixes CodeGen/X86/2006-10-07-ScalarSSEMiscompile.ll. Once X86DAGToDAGISel::SelectScalarSSELoad is implemented right, this task will be done. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30817 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 21:55:32 +00:00
Chris Lattner	6f98773203	convert packed FP add/sub/mul/div to use a multiclass. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30815 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 21:17:13 +00:00
Chris Lattner	941cc4561c	one multiclass now defines all 8 variants of binary-scalar-sse-fp operations. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30814 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 20:55:57 +00:00
Chris Lattner	d2c99d5f7a	Switch ADD/MUL/DIV/SUB scalarsse fp ops to a multiclass git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30813 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 20:35:44 +00:00
Chris Lattner	6970eda7ca	Random acts of shrinkage git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30812 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 19:49:05 +00:00
Chris Lattner	a7ebe556c8	Convert pand/por/pxor to use multiclass git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30811 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 19:37:30 +00:00
Chris Lattner	70f4f2e513	Convert some more instructions over to use a new multiclass. Fix a bug where the asmstring for PSUBQrm was wrong. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30810 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 19:34:33 +00:00
Chris Lattner	5650eeb38e	Fix a bug where PADDQrm printed paddd instead of paddq. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30809 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 19:15:46 +00:00
Chris Lattner	7c47f9a7ea	Add multiclass for SSE2 instructions that correspond to simple binops. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30808 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 19:14:49 +00:00
Chris Lattner	45e123c62a	rename: PDI_binop_rm -> PDI_binop_rm_int PDI_binop_rmi -> PDI_binop_rmi_int to make it clear that these are for use with intrinsics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30807 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 19:02:31 +00:00
Chris Lattner	d4060cc989	Convert saturating PADD/PSUB's to use a multiclass git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30806 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 18:48:46 +00:00
Chris Lattner	8139e28a79	Convert PAVG, PMADDWD, and PMUL to use multiclasses. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30805 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 18:39:00 +00:00
Chris Lattner	3dca490ff0	Fix typo in packsswb instr definition, where the load had the wrong type. This allows us to use the multiclass for other packs. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30804 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 18:23:58 +00:00
Chris Lattner	783d45ef14	handle pmin/pmax with multiclasses git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30800 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 07:49:33 +00:00
Chris Lattner	7733799464	simplify pack and shift intrinsics with multiclasses git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30797 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 07:06:17 +00:00
Chris Lattner	01998742c3	Use a multiclass to simplify 'SSE2 Integer comparison' git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30796 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 06:47:08 +00:00
Chris Lattner	736c020fc8	move class defns close to uses to make it easier to read git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30795 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 06:33:36 +00:00
Chris Lattner	fb996ee727	simplify horizontal op definitions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30794 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 06:31:41 +00:00
Chris Lattner	3b57a833a5	remove more unneeded type info git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30793 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 06:27:03 +00:00
Chris Lattner	aab370db24	remove unneeded definitions and type info git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30792 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 06:19:41 +00:00
Chris Lattner	15258d5f9d	remove some unneeded type info git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30791 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 06:17:43 +00:00
Chris Lattner	845fb75536	simplify patterns by merging in operand info git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30790 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 05:50:25 +00:00
Chris Lattner	d1b651d5de	Factor operands into packed unary classes git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30789 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 05:47:20 +00:00
Chris Lattner	dc5aa21b10	remove dead/duplicate instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30788 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 05:41:52 +00:00
Chris Lattner	a0ea63db75	Pull operand info up into parent class for scalar sse intrinsics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30787 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 05:26:13 +00:00
Chris Lattner	86c1b3a2fd	convert the sole sd unary intrinsic to a multiclass for consistency git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30786 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 05:19:31 +00:00
Chris Lattner	3b8378552e	pull operand string into the multiclass git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30785 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 05:13:26 +00:00
Chris Lattner	9498ed8ac9	Remove RSQRTSS[rm] RCPSS[rm], which are dead. Introduce SS_IntUnary, a multiclass to replace SS_Int[rm]. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30784 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 05:09:48 +00:00
Chris Lattner	4cc84edd3b	eliminate redundancy git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30783 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-07 04:52:09 +00:00
Evan Cheng	485130fbf4	These don't have immediate operands. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30694 91177308-0d34-0410-b5e6-96231b3b80d8	2006-10-03 06:55:11 +00:00
Evan Cheng	734503be59	X86ISD::CMP now produces a chain as well as a flag. Make that the chain operand of a conditional branch to allow load folding into CMP / TEST instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30241 91177308-0d34-0410-b5e6-96231b3b80d8	2006-09-11 02:19:56 +00:00
Evan Cheng	23b3122c44	JIT encoding bug. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@30112 91177308-0d34-0410-b5e6-96231b3b80d8	2006-09-05 05:59:25 +00:00
Evan Cheng	55371739de	Can't commute shufps. The high / low parts elements come from different vectors. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@29275 91177308-0d34-0410-b5e6-96231b3b80d8	2006-07-25 20:25:40 +00:00
Evan Cheng	206ee9d86c	X86 target specific DAG combine: turn build_vector (load x), (load x+4), (load x+8), (load x+12), <0, 1, 2, 3> to a single 128-bit load (aligned and unaligned). e.g. __m128 test(float a, float b, float c, float d) { return _mm_set_ps(d, c, b, a); } _test: movups 4(%esp), %xmm0 ret git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@29042 91177308-0d34-0410-b5e6-96231b3b80d8	2006-07-07 08:33:52 +00:00
Evan Cheng	775ff18257	Should just use xorps to clear XMM registers for all data types. pxor is also one byte longer. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28984 91177308-0d34-0410-b5e6-96231b3b80d8	2006-06-29 18:04:54 +00:00
Evan Cheng	a8e83ec8c3	Always use xorps to clear XMM registers. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28979 91177308-0d34-0410-b5e6-96231b3b80d8	2006-06-29 00:34:23 +00:00
Chris Lattner	30da68acce	Remove some ugly now-redundant casts. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28864 91177308-0d34-0410-b5e6-96231b3b80d8	2006-06-20 00:25:29 +00:00
Chris Lattner	a973993c0c	Fix some mismatched type constraints git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28862 91177308-0d34-0410-b5e6-96231b3b80d8	2006-06-20 00:12:37 +00:00
Evan Cheng	b21495043e	Minor clean up. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28860 91177308-0d34-0410-b5e6-96231b3b80d8	2006-06-19 19:25:30 +00:00
Evan Cheng	015188ffbc	Type of vector extract / insert index operand should be iPTR. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28796 91177308-0d34-0410-b5e6-96231b3b80d8	2006-06-15 08:14:54 +00:00
Evan Cheng	190717d3cf	Rename instructions for consistency sake. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28594 91177308-0d34-0410-b5e6-96231b3b80d8	2006-05-31 19:00:07 +00:00
Evan Cheng	9d09b89f39	Select vector_shuffle v1, undef <2, 3, ?, ?> to MOVHLPS. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28582 91177308-0d34-0410-b5e6-96231b3b80d8	2006-05-31 00:51:37 +00:00
Evan Cheng	b5e406afc3	MAXP{D\|S} and MINP{D\|S} are commutable. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28578 91177308-0d34-0410-b5e6-96231b3b80d8	2006-05-30 23:47:30 +00:00
Evan Cheng	efeaed8fb4	Commute shufps / shufpd. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28577 91177308-0d34-0410-b5e6-96231b3b80d8	2006-05-30 23:34:30 +00:00
Evan Cheng	ccba76bb25	Allow shufps x, x, mask to be converted to pshufd x, mask to save a move. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28565 91177308-0d34-0410-b5e6-96231b3b80d8	2006-05-30 20:26:50 +00:00
Evan Cheng	069287d460	X86 integer register classes naming changes. Make them consistent with FP, vector classes. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28324 91177308-0d34-0410-b5e6-96231b3b80d8	2006-05-16 07:21:53 +00:00
Chris Lattner	bd04aa5796	Teach the code generator to use cvtss2sd as extload f32 -> f64 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28131 91177308-0d34-0410-b5e6-96231b3b80d8	2006-05-05 21:35:18 +00:00
Evan Cheng	9e062ed516	Use movsd to shuffle in the lowest two elements of a v4f32 / v4i32 vector when movlps cannot be used (e.g. when load from m64 has multiple uses). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@28089 91177308-0d34-0410-b5e6-96231b3b80d8	2006-05-03 20:32:03 +00:00
Evan Cheng	3d1be07141	Fix a typo. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27968 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-25 17:48:41 +00:00
Evan Cheng	a2137b592e	Explicitly specify result type for def : Pat<> patterns (if it produces a vector result). Otherwise tblgen will pick the default (v16i8 for 128-bit vector). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27965 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-25 00:50:01 +00:00
Evan Cheng	a7fc64222a	Added X86 SSE2 intrinsics which can be represented as vector_shuffles. This is a temporary workaround for the 2-wide vector_shuffle problem (i.e. its mask would have type v2i32 which is not legal). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27964 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-24 23:34:56 +00:00
Evan Cheng	64e9769339	Some missing movlps, movhps, movlpd, and movhpd patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27960 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-24 21:58:20 +00:00
Evan Cheng	017dcc6e55	Now generating perfect (I think) code for "vector set" with a single non-zero scalar value. e.g. _mm_set_epi32(0, a, 0, 0); ==> movd 4(%esp), %xmm0 pshufd $69, %xmm0, %xmm0 _mm_set_epi8(0, 0, 0, 0, 0, a, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0); ==> movzbw 4(%esp), %ax movzwl %ax, %eax pxor %xmm0, %xmm0 pinsrw $5, %eax, %xmm0 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27923 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-21 01:05:10 +00:00
Evan Cheng	fd111b5be5	Prefer {p}unpack* and movdup over {p}shuf as well. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27844 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 21:15:24 +00:00
Evan Cheng	2dadaea5d2	- Renamed AddedCost to AddedComplexity. - Added more movhlps and movlhps patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27842 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 20:37:34 +00:00
Evan Cheng	f66a094cac	More mov{h\|l}p{d\|s} patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27836 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 18:20:17 +00:00
Evan Cheng	cc0e98c8ed	- More mov{h\|l}ps patterns. - Increase cost (complexity) of patterns which match mov{h\|l}ps ops. These are preferred over shufps in most cases. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27835 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-19 18:11:52 +00:00
Evan Cheng	f0d4e3d7c0	- PEXTRW cannot take a memory location as its first source operand. - PINSRWrmi encoding bug. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27818 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 21:59:43 +00:00
Evan Cheng	b7a5c527ae	Name change for clarity sake git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27816 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 21:55:35 +00:00
Evan Cheng	7b7bd57abd	Name change for clarity sake git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27814 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 21:29:50 +00:00
Evan Cheng	fb2a3b2964	Left a pattern out git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27813 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 21:29:08 +00:00
Evan Cheng	df2a1908b2	Fixed an encoding bug: movd from XMM to R32. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27807 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-18 18:19:00 +00:00
Evan Cheng	cdfc3c82a7	Use movss to insert_vector_elt(v, s, 0). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27782 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 22:45:49 +00:00
Evan Cheng	23b72005fa	Encoding bug git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27773 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 21:33:57 +00:00
Evan Cheng	083248e143	Errors in patterns preventing load folding git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27762 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-17 18:05:01 +00:00
Evan Cheng	06aef15843	movduprm, movshduprm bugs git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27734 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-16 18:11:28 +00:00
Evan Cheng	d8e8223ea1	Encoding bugs git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27733 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-16 07:02:22 +00:00
Evan Cheng	60d3fa24ba	More encoding bugs git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27722 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-15 06:10:09 +00:00
Evan Cheng	1af18985b8	pslldrm, psrawrm, etc. encoding bug git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27721 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-15 05:59:08 +00:00
Evan Cheng	7076e2daee	hsubp{s\|d} encoding bug git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27720 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-15 05:52:42 +00:00
Evan Cheng	57ebe9fbf0	Silly bug git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27719 91177308-0d34-0410-b5e6-96231b3b80d8	2006-04-15 05:37:34 +00:00

... 2 3 4 5 6 ...

441 Commits