llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-07-30 03:29:23 +00:00

Author	SHA1	Message	Date
Craig Topper	5a313bb7e8	Remove GCC builtins for vpermilp* intrinsics as clang no longer needs them. Custom lower the intrinsics to the vpermilp target specific node and remove intrinsic patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@150060 91177308-0d34-0410-b5e6-96231b3b80d8	2012-02-08 06:36:57 +00:00
Craig Topper	dbd98a4b1b	Add instruction selection for 256-bit VPSHUFD and 128-bit VPERMILPS/VPERMILPD. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@149968 91177308-0d34-0410-b5e6-96231b3b80d8	2012-02-07 06:28:42 +00:00
Craig Topper	5b209e84f4	Add target specific node for PMULUDQ. Change patterns to use it and custom lower intrinsics to it. Use it instead of intrinsic to handle 64-bit vector multiplies. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@149807 91177308-0d34-0410-b5e6-96231b3b80d8	2012-02-05 03:14:49 +00:00
Elena Demikhovsky	dcabc7bca9	Optimization for SIGN_EXTEND operation on AVX. Special handling was added for v4i32 -> v4i64 and v8i16 -> v8i32 extensions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@149600 91177308-0d34-0410-b5e6-96231b3b80d8	2012-02-02 09:10:43 +00:00
Andrew Trick	922d314e8f	Instruction scheduling itinerary for Intel Atom. Adds an instruction itinerary to all x86 instructions, giving each a default latency of 1, using the InstrItinClass IIC_DEFAULT. Sets specific latencies for Atom for the instructions in files X86InstrCMovSetCC.td, X86InstrArithmetic.td, X86InstrControl.td, and X86InstrShiftRotate.td. The Atom latencies for the remainder of the x86 instructions will be set in subsequent patches. Adds a test to verify that the scheduler is working. Also changes the scheduling preference to "Hybrid" for i386 Atom, while leaving x86_64 as ILP. Patch by Preston Gurd! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@149558 91177308-0d34-0410-b5e6-96231b3b80d8	2012-02-01 23:20:51 +00:00
Craig Topper	cc30006391	Fix pattern for memory form of PSHUFD for use with FP vectors to remove bitcast to an integer vector that normal code wouldn't have. Also remove bitcasts from code that turns splat vector loads into a shuffle as it was making the broken pattern necessary. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@149232 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-30 07:50:31 +00:00
Craig Topper	3982b3cc7b	Move some patterns back near their instructions and use AddedComplexity to fix priority. Merge some patterns into their instruction definition. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@149122 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-27 07:09:40 +00:00
Victor Umansky	668f7ac9e4	Fix for the following bug in AVX codegen for double-to-int conversions: . "fptosi" and "fptoui" IR instructions are defined with round-to-zero rounding mode. . Currently for AVX mode for <4xdouble> and <8xdouble> the "VCVTPD2DQ.128" and "VCVTPD2DQ.256" instructions are selected (for .fp_to_sint. DAG node operation ) by AVX codegen. However they use round-to-nearest-even rounding mode. . Consequently, the conversion produces incorrect numbers. The fix is to replace selection of VCVTPD2DQ instructions with VCVTTPD2DQ instructions. The latter use truncate (i.e. round-to-zero) rounding mode. As .fp_to_sint. DAG node operation is used only for lowering of "fptosi" and "fptoui" IR instructions, the fix in X86InstrSSE.td definition file doesn.t have an impact on other LLVM flows. The patch includes changes in the .td file, LIT test for the changes and a fix in a legacy LIT test (which produced asm code conflicting with LLVN IR spec). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@149056 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-26 08:51:39 +00:00
Craig Topper	15388c4666	Fix AVX vs SSE patterns ordering issue for VPCMPESTRM and VPCMPISTRM. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@149053 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-26 07:31:30 +00:00
Craig Topper	e566cd0f4d	Remove some more patterns by custom lowering intrinsics to target specific nodes. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@149052 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-26 07:18:03 +00:00
Craig Topper	969ba287cd	Custom lower PSIGN and PSHUFB intrinsics to their corresponding target specific nodes so we can remove the isel patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148933 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-25 06:43:11 +00:00
Craig Topper	4bb3f34b22	Custom lower phadd and phsub intrinsics to target specific nodes. Remove the patterns that are no longer necessary. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148927 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-25 05:37:32 +00:00
Craig Topper	bce73e0a8c	Remove AVX 256-bit unaligned load intrinsics. 128-bit versions had been removed a while ago. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148922 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-25 04:42:03 +00:00
Craig Topper	042883f5da	Merge intrinsic pattern and no pattern versions of VCVTSD2SI intruction definitions. Matches non-AVX version of same instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148914 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-25 03:52:09 +00:00
Craig Topper	7925e2555d	Custom lower PCMPEQ/PCMPGT intrinsics to target specific nodes and remove the intrinsic patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148687 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-23 08:18:28 +00:00
Craig Topper	80e46360e9	Custom lower vector shift intrinsics to target specific nodes and remove the patterns that are no longer needed. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148684 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-23 06:16:53 +00:00
Craig Topper	2b21fbaf11	Remove pattern fragments for v32i8, v16i16, v8i32, v16i8, v8i16, and v4i32 loads. All integer vector loads are promoted to v2i64 or v4i64 so these pattern fragments can never match. Fix or remove patterns that used these fragments. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148672 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-23 00:06:44 +00:00
Craig Topper	1906d32e55	Combine X86 CMPPD and CMPPS node types. Simplifies selection code and pattern matching. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148670 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-22 23:36:02 +00:00
Craig Topper	67609fd0eb	Merge PCMPEQB/PCMPEQW/PCMPEQD/PCMPEQQ and PCMPGTB/PCMPGTW/PCMPGTD/PCMPGTQ X86 ISD node types into only two node types. Simplifying opcode selection and pattern matching. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148667 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-22 22:42:16 +00:00
Craig Topper	ed2e13d667	Add target specific ISD node types for SSE/AVX vector shuffle instructions and change all the code that used to create intrinsic nodes to create the new nodes instead. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148664 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-22 19:15:14 +00:00
Craig Topper	6fdf3d54d2	Move some vector shift patterns into their instruction definitions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148643 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-22 00:41:20 +00:00
Craig Topper	babb1459f3	Add memory patterns for some of the fp<->integer conversion instructions. Fold some patterns into instruction definitions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148641 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-21 18:37:15 +00:00
Craig Topper	0e2037ba2b	Add support for selecting 256-bit PALIGNR. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148532 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-20 05:53:00 +00:00
Craig Topper	b7ab7fe053	Give priority to AVX over SSE for 128-bit floating point unpck instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148233 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-16 09:56:42 +00:00
Craig Topper	d07ef50ca1	Fix the memop type on a couple 256-bit AVX instructions that were using f128mem instead of f256mem. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148196 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-14 18:29:57 +00:00
Chad Rosier	d32d3b758f	Fix pasto from r146196. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148167 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-14 01:50:21 +00:00
Craig Topper	0518970dc8	Convert SHUFPD with the same register for both sources to PSHUFD if it would prevent a register copy. Similar to SHUFPS, but requires the mask to be converted. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148112 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-13 09:21:41 +00:00
Craig Topper	12216172c0	Make X86 instruction selection use 256-bit VPXOR for build_vector of all ones if AVX2 is enabled. This gives the ExeDepsFix pass a chance to choose FP vs int as appropriate. Also use v8i32 as the type for getZeroVector if AVX2 is enabled. This is consistent with SSE2 using prefering v4i32. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148108 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-13 08:12:35 +00:00
Craig Topper	c30432ab57	Add patterns for v16i16 and v32i8 immAllZerosV to select VPXOR to match v4i64 and v8i32. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@148106 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-13 06:59:47 +00:00
Chad Rosier	1b2983bb23	Add missing VEX predicates to VMOVSDto64rr/VMOVSDto64mr. This fixes a few failing test cases on our internal AVX nightly tester. rdar://10663637 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147881 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-10 22:14:06 +00:00
Craig Topper	c6d59954d8	Instruction selection priority fixes to remove the XMM/XMMInt/orAVX predicates. Another commit will remove orAVX functions from X86SubTarget. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147841 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-10 06:30:56 +00:00
Craig Topper	8ffc964582	Add HasAVX predicate to some of the AVX patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147769 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-09 08:34:00 +00:00
Craig Topper	47cf1003fa	Reorder a bunch of patterns to put the AVX version first thus giving it priority over the SSE version. Another step towards trying to remove the AVX hack that disables SSE from X86Subtarget. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147768 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-09 08:10:38 +00:00
Craig Topper	5feb5dae93	Clean up patterns for MOVNT*. Not sure why there were floating point types on MOVNTPS and MOVNTDQ. And v4i64 was completely missing. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147767 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-09 06:52:46 +00:00
Craig Topper	8974cd85cc	Mark MOVNTI as being supported in SSE2 OR AVX mode. This instruction has no AVX equivalent so we should use the SSE version. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147766 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-09 06:38:55 +00:00
Craig Topper	dfa5f573e7	Move SSE2 logical operations PAND/POR/PXOR/PANDN above SSE1 logical operations ANDPS/ORPS/XORPS/ANDNPS. This fixes a pattern ordering issue that meant that the SSE2 instructions could never be directly selected since the SSE1 patterns would always match first. This is largely moot with the ExeDepsFix pass, but I'm trying to audit for all such ordering issues. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147765 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-09 05:07:01 +00:00
Chad Rosier	3d1161e9ae	Enhance DAGCombine for transforming 128->256 casts into a vmovaps, rather then a vxorps + vinsertf128 pair if the original vector came from a load. rdar://10594409 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147481 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-03 21:05:52 +00:00
Craig Topper	a51bb3aa75	Make CanXFormVExtractWithShuffleIntoLoad reject loads with multiple uses. Also make it return false if there's not even a load at all. This makes the code better match the code in DAGCombiner that it tries to match. These two changes prevent some cases where vector_shuffles were making it to instruction selection and causing the older shuffle selection code to be triggered. Also needed to fix a bad pattern that this change exposed. This is the first step towards getting rid of the old shuffle selection support. No test cases yet because there's no way to tell whether a shuffle was handled in the legalize stage or at instruction selection. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147428 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-02 08:46:48 +00:00
Craig Topper	de9e4c728e	Fix sfence, lfence, mfence, and clflush to be able to be selected when AVX is enabled. Fix monitor and mwait to require SSE3 or AVX, previously they worked even if SSE3 was disabled. Make prefetch instructions not set the execution domain since they don't use XMM registers. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147409 91177308-0d34-0410-b5e6-96231b3b80d8	2012-01-01 19:40:22 +00:00
Craig Topper	b3982da7d2	Merge X86 SHUFPS and SHUFPD node types. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147394 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-31 23:50:21 +00:00
Craig Topper	3ee6d22c78	Add patterns for integer forms of SHUFPD/VSHUFPD with a memory load. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147393 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-31 23:24:49 +00:00
Craig Topper	e00805d52f	Fix typo in a SHUFPD and VSHUFPD pattern that prevented SHUFPD/VSHUFPD with a load from being selected. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147392 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-31 23:15:11 +00:00
Craig Topper	d65c7da5b0	Remove the separate explicit AES instruction patterns. They are equivalent to the patterns specified by the instructions. Also remove unnecessary bitconverts from the AES patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147342 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-29 17:41:56 +00:00
Chad Rosier	5c0d761d63	Fix 80-column violations. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147095 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-21 20:59:09 +00:00
Elena Demikhovsky	ba4f83b4e9	This is the second fix related to VZEXT_MOVL node. The failure that I see in the current version is: LLVM ERROR: Cannot select: 0x18b8f70: v4i64 = X86ISD::VZEXT_MOVL 0x18beee0 [ID=14] 0x18beee0: v4i64 = insert_subvector 0x18b8c70, 0x18b9170, 0x18b9570 [ID=13] 0x18b8c70: v4i64 = insert_subvector 0x18b9870, 0x18bf4e0, 0x18b9970 [ID=12] 0x18b9870: v4i64 = undef [ID=4] 0x18bf4e0: v2i64 = bitcast 0x18bf3e0 [ID=10] 0x18bf3e0: v4i32 = BUILD_VECTOR 0x18b9770, 0x18b9770, 0x18b9770, 0x18b9770 [ID=8] 0x18b9770: i32 = TargetConstant<0> [ID=6] 0x18b9770: i32 = TargetConstant<0> [ID=6] 0x18b9770: i32 = TargetConstant<0> [ID=6] 0x18b9770: i32 = TargetConstant<0> [ID=6] 0x18b9970: i32 = Constant<0> [ID=3] 0x18b9170: v2i64 = undef [ORD=1] [ID=1] 0x18b9570: i32 = Constant<2> [ID=5] git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@146975 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-20 13:34:28 +00:00
Eli Friedman	7e840efc23	Make sure we correctly note the existence of an i8 immediate for vblendvps and friends, so we compute fixups correctly. PR11586. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@146709 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-15 23:46:18 +00:00
Chad Rosier	c8dd20170e	Add missing zmovl AVX patterns which were causing crashes. Patch by Elena Demikhovsky <elena.demikhovsky@intel.com>! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@146689 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-15 22:11:31 +00:00
Benjamin Kramer	b653397dcd	X86: Add patterns for the various rounding ops for SSE4.1 and AVX. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@146257 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-09 15:44:03 +00:00
Benjamin Kramer	a73fb9adbb	X86: Split (v)rounds[sd] into a normal and an intrinsic version. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@146256 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-09 15:43:55 +00:00
Evan Cheng	e955726a0e	Add 256-bit variant vmovss and vmovsd patterns. rdar://10538417 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@146196 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-08 22:30:45 +00:00
Evan Cheng	13d2ba34f2	Add various missing AVX patterns which was causing crashes. Sadly, the generated code looks pretty bad compared to SSE. rdar://10538793 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@146191 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-08 22:05:28 +00:00
Evan Cheng	2f435511e9	Many of the SSE patterns should not be selected when AVX is available. This led to the following code in X86Subtarget.cpp if (HasAVX) X86SSELevel = NoMMXSSE; This is so patterns that are predicated on hasSSE3, etc. would not be selected when avx is available. Instead, the AVX variant is selected. However, this breaks instructions which do not have AVX variants. The right way to fix this is for the SSE but not-AVX patterns to predicate on something like hasSSE3() && !hasAVX(). Then we can take out the hack in X86Subtarget.cpp. Patterns which do not have AVX variants do not need to change. However, we need to audit all the patterns before we make the change. This patch is workaround that fixes one specific case, the prefetch instructions. rdar://10538297 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@146163 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-08 19:00:42 +00:00
Craig Topper	d802326335	Fix a bunch of SSE/AVX patterns to use proper memop types. In particular, not using integer loads other than v2i64/v4i64 since the others are all promoted. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@146031 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-07 08:30:53 +00:00
Craig Topper	cb6bd11bd6	Fix a bunch of SSE/AVX patterns to use v2i64/v4i64 loads since all other integer vector loads are promoted to those. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145927 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-06 09:04:59 +00:00
Craig Topper	34671b812a	Merge floating point and integer UNPCK X86ISD node types. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145926 91177308-0d34-0410-b5e6-96231b3b80d8	2011-12-06 08:21:25 +00:00
Craig Topper	ec24e61ab0	Merge VPERM2F128/VPERM2I128 ISD node types. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145485 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-30 07:47:51 +00:00
Craig Topper	316cd2a2c5	Merge decoding of VPERMILPD and VPERMILPS shuffle masks. Merge X86ISD node type for VPERMILPD/PS. Add instruction selection support for VINSERTI128/VEXTRACTI128. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145483 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-30 06:25:25 +00:00
Evan Cheng	a3438cf48b	Add another missing pattern. llvm-gcc likes f64 but clang likes i64 so it was generating poor code for some SSE builtins. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145448 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-29 22:48:34 +00:00
Jakob Stoklund Olesen	0edd83bfff	Make X86::FsFLD0SS / FsFLD0SD real pseudo-instructions. Like V_SET0, these instructions are expanded by ExpandPostRA to xorps / vxorps so they can participate in execution domain swizzling. This also makes the AVX variants redundant. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145440 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-29 22:27:25 +00:00
Elena Demikhovsky	f68b214e2d	Fixed vsqrt.ss intrinsic usage - order of input operands was wrong. Added a test. Thanks Bruno for reviewing the patch. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145403 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-29 15:00:45 +00:00
Craig Topper	36e36ace77	Fix issues in shuffle decoding around VPERM* instructions. Fix shuffle decoding for VSHUFPS/D for 256-bit types. Add pattern matching for memory forms of VPERMILPS/VPERMILPD. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145390 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-29 07:49:05 +00:00
Craig Topper	fe2a6c584a	Fix VINSERTF128/VEXTRACTF128 to be marked as FP instructions. Allow execution dependency fix pass to convert them to their integer equivalents when AVX2 is enabled. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145376 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-29 05:37:58 +00:00
Craig Topper	108126cfc6	Correctly mark VPERM2F128 as being an FP instruction and add execution domain fixing support to convert it to VPERM2I128 for AVX2. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145370 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-29 03:57:34 +00:00
Evan Cheng	678cda052c	Add missing avx pattern. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145272 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-28 20:27:23 +00:00
Craig Topper	70b883b3a7	Add X86 instruction selection for VPERM2I128 when AVX2 is enabled. Merge VPERMILPS/VPERMILPD detection since they are pretty similar. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145238 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-28 10:14:51 +00:00
Craig Topper	38034c568c	Merge 128-bit and 256-bit X86ISD node types for VPERMILPS and VPERMILPD. Simplify some shuffle lowering code since V1 can never be UNDEF due to canonalizing that occurs when shuffle nodes are created. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145153 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-26 22:55:48 +00:00
Craig Topper	06cb680779	Collapse X86ISD node types for PUNPCKH, PUNPCKL, UNPCKLP, and UNPCKHP to not be type specific. Now we just have integer high and low and floating point high and low. Pattern matching will choose the correct instruction based on the vector type. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145148 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-26 20:47:44 +00:00
Craig Topper	705f2431a0	Remove 256-bit specific node types for UNPCKHPS/D and instead use the 128-bit versions and let the operand type disinquish. Also fix the load form of the v8i32 patterns for these to realize that the load would be promoted to v4i64. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145126 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-24 22:57:10 +00:00
Craig Topper	f475a55bd4	Remove AVX2 specific X86ISD node types for PUNPCKH/L and instead just reuse the 128-bit versions and let the vector type distinguish. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145125 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-24 22:20:08 +00:00
Craig Topper	6fa583d787	Lowering for v32i8 to VPUNPCKLBW/VPUNPCKHBW when AVX2 is enabled. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145028 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-21 08:26:50 +00:00
Craig Topper	6347e8662c	Add support for lowering 256-bit shuffles to VPUNPCKL/H for i16, i32, i64 if AVX2 is enabled. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145026 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-21 06:57:39 +00:00
Craig Topper	0d86d462f8	Add code for lowering v32i8 shifts by a splat to AVX2 immediate shift instructions. Remove 256-bit splat handling from LowerShift as it was already handled by PerformShiftCombine. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145005 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-20 00:12:05 +00:00
Craig Topper	745a86bac9	Use 256-bit vcmpeqd for creating an all ones vector when AVX2 is enabled. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145004 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-19 22:34:59 +00:00
Craig Topper	ba798c5e51	Remove some of the special classes that worked around an old tablegen limitation of not being able to remove redundant bitconverts from patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@145003 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-19 21:01:54 +00:00
Craig Topper	98fc72940b	Custom lower AVX2 variable shift intrinsics to shl/srl/sra nodes and remove the intrinsic patterns. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144999 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-19 17:46:46 +00:00
Craig Topper	54f952afac	Synthesize SSSE3/AVX 128-bit horizontal integer add/sub instructions from add/sub of appropriate shuffle vectors. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144989 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-19 09:02:40 +00:00
Craig Topper	3113384a34	Collapse X86 PSIGNB/PSIGNW/PSIGND node types. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144988 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-19 07:33:10 +00:00
Craig Topper	1666cb6d63	Extend VPBLENDVB and VPSIGN lowering to work for AVX2. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144987 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-19 07:07:26 +00:00
Craig Topper	60d9a9206e	Remove unused parameters from the AVX maskmov classes. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144985 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-19 04:49:22 +00:00
Nadav Rotem	cbbe33fde4	Add AVX2 vpbroadcast support git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144967 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-18 02:49:55 +00:00
Craig Topper	d90a191685	Fix SSE/AVX integer comparison patterns to understand that all integer vector loads are promoted to i64 vector loads so patterns need a bitconvert. Also slightly simplify the AVX2 variable shift patterns by using the predefined bitconvert pattern fragments. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144896 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-17 07:49:38 +00:00
Craig Topper	ec43d1f553	Remove seemingly unnecessary duplicate VROUND definitions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144885 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-17 07:04:00 +00:00
Evan Cheng	2b89498979	Another missing X86ISD::MOVLPD pattern. rdar://10450317 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144839 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-16 22:24:44 +00:00
Craig Topper	12755b07ab	Fix the execution domain on a bunch of SSE/AVX instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144784 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-16 07:30:46 +00:00
Evan Cheng	76c8f08567	Add a missing pattern for X86ISD::MOVLPD. rdar://10436044 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144566 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-14 20:35:52 +00:00
Craig Topper	3426a3efef	Add neverHasSideEffects, mayLoad, and mayStore to many patternless SSE/AVX instructions. Remove MMX check from LowerVECTOR_SHUFFLE since MMX vector types won't go through it anyway. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144522 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-14 06:46:21 +00:00
Craig Topper	7be5dfd1a1	Add more AVX2 shift lowering support. Move AVX2 variable shift to use patterns instead of custom lowering code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144457 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-12 09:58:49 +00:00
Craig Topper	46154eb6fd	Add lowering for AVX2 shift instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144380 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-11 07:39:23 +00:00
Nadav Rotem	4dbe96e22f	AVX2: Add variable shift from memory. Note: These patterns only works in some cases because many times the load sd node is bitcasted from a load node of a different type. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144266 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-10 06:54:20 +00:00
Nadav Rotem	c6c7e85a71	AVX2: Add patterns for variable shift operations git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144212 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-09 21:22:13 +00:00
Nadav Rotem	bb539bf973	Add AVX2 support for vselect of v32i8 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144187 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-09 13:21:28 +00:00
Craig Topper	0a15035f52	Add instruction selection for AVX2 integer comparisons. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144176 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-09 08:06:13 +00:00
Evan Cheng	7bc389b6b0	Add x86 isel logic and patterns to match movlps from clang generated IR for _mm_loadl_pi(). rdar://10134392, rdar://10050222 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@144052 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-08 00:31:58 +00:00
Craig Topper	4c763ee613	Add AVX2 variable shift instructions and intrinsics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@143915 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-07 08:26:24 +00:00
Craig Topper	28692044db	Add AVX2 VPMOVMASK instructions and intrinsics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@143904 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-07 03:20:35 +00:00
Craig Topper	69f5df7778	Add AVX2 VEXTRACTI128 and VINSERTI128 instructions. Fix VPERM2I128 to be qualified with HasAVX2 instead of HasAVX. Mark VINSERTF128 and VEXTRACTF128 as never having side effects. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@143902 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-07 02:00:04 +00:00
Craig Topper	c8eb880a7f	More AVX2 instructions and their intrinsics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@143895 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-06 23:04:08 +00:00
Craig Topper	27e5d0c72a	Add more AVX2 instructions and intrinsics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@143861 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-06 06:12:20 +00:00
Craig Topper	018262768f	Add intrinsics for X86 vcvtps2ph and vcvtph2ps instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@143683 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-04 06:59:49 +00:00
Craig Topper	98e0b9c86d	Add new X86 AVX2 VBROADCAST instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@143612 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-03 07:35:53 +00:00
Craig Topper	205e3378fd	More AVX2 instructions and intrinsics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@143536 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-02 06:54:17 +00:00
Craig Topper	3f2b2c218f	Add a bunch more X86 AVX2 instructions and their corresponding intrinsics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@143529 91177308-0d34-0410-b5e6-96231b3b80d8	2011-11-02 04:42:13 +00:00
Craig Topper	6b1c5fc02a	Begin adding AVX2 instructions. No selection support yet other than intrinsics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@143331 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-31 02:15:10 +00:00
Jakob Stoklund Olesen	0a951fba75	V_SET0 has no side effects. TableGen will mark any pattern-less instruction as having unmodeled side effects. This is extra bad for V_SET0 which gets rematerialized a lot. This was part of the cause for PR11125, but the real bug was fixed in r141923. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141924 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-14 00:39:50 +00:00
Craig Topper	d501c714cd	Add 'implicit EFLAGS' to patterns for popcnt and lzcnt git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141853 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-13 06:18:52 +00:00
Craig Topper	c48b301fb0	Add HasPOPCNT predicate to the POPCNT instructions. Also mark POPCNT as modifying EFLAGS. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141656 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-11 07:13:09 +00:00
Craig Topper	227358e93c	Make Ivy Bridge 16-bit floating point conversion instructions require AVX. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141654 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-11 07:01:37 +00:00
Craig Topper	da394041c4	Add Ivy Bridge 16-bit floating point conversion instructions for the X86 disassembler. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141505 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-09 07:31:39 +00:00
Craig Topper	6744a17dcf	Add support in the disassembler for ignoring the L-bit on certain VEX instructions. Mark instructions that have this behavior. Fixes PR10676. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141065 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-04 06:30:42 +00:00
Craig Topper	581fe82c84	Add support for MOVBE and RDRAND instructions for the assembler and disassembler. Includes feature flag checking, but no instrinsic support. Fixes PR10832, PR11026 and PR11027. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141007 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-03 17:28:23 +00:00
Jakob Stoklund Olesen	92fb79b7a6	Expand the x86 V_SET0* pseudos right after register allocation. This also makes it possible to reduce the number of pseudo instructions and get rid of the encoding information. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140776 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-29 05:10:54 +00:00
Duncan Sands	04aa4aee89	Implement Chris's suggestion of legalizing the various SSE and AVX hadd/hsub intrinsics into the new fhadd/fhsub X86 node. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140383 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-23 16:10:22 +00:00
Duncan Sands	17470bee5f	Synthesize SSE3/AVX 128 bit horizontal add/sub instructions from floating point add/sub of appropriate shuffle vectors. Does not synthesize the 256 bit AVX versions because they work differently. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140332 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-22 20:15:48 +00:00
Bruno Cardoso Lopes	f4b841d4e2	Revert r140097, working on a better approach git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140203 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-20 23:19:29 +00:00
Bruno Cardoso Lopes	448d986858	The wrong relocation was being emitted for several SSSE3 instructions. This fixes PR10963. Thanks to Benjamin for finding the wrong tablegen declaration. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140184 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-20 21:39:21 +00:00
Bruno Cardoso Lopes	d91c6e058b	Fix PR10949. Fix the encoding of VMOVPQIto64rr. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140098 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-19 23:36:59 +00:00
Bruno Cardoso Lopes	97136c922e	Based on the small opt Zvi's patch was trying to achieve, eliminate 128-bit undef subvector insertion into a 256-bit vector git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140097 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-19 23:36:50 +00:00
Bruno Cardoso Lopes	97dc60b759	Match X86ISD::FSETCCsd and X86ISD::FSETCCss while in AVX mode. This fix PR10955 and PR10948. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140069 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-19 21:29:24 +00:00
Bruno Cardoso Lopes	2c693dc126	Describe more AVX 128-bit convert instructions without patterns to have mayLoad = 1 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139973 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-16 23:41:29 +00:00
Bruno Cardoso Lopes	7291272ab2	Add mayLoad attribute to AVX convert instructions, since non of them are declared with load patterns. This fix the crash in PR10941. No testcases, since a fold is triggered and then converted back to the register form afterwards. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139953 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-16 22:02:14 +00:00
Craig Topper	a08e255e1e	Fix mem type for VEX.128 form of VROUNDP*. Remove filter preventing VROUND from being recognized by disassembler. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139691 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-14 06:41:26 +00:00
Bruno Cardoso Lopes	484ddf54c9	Teach the foldable tables about 128-bit AVX instructions and make the alignment check for 256-bit classes more strict. There're no testcases but we catch more folding cases for AVX while running single and multi sources in the llvm testsuite. Since some 128-bit AVX instructions have different number of operands than their SSE counterparts, they are placed in different tables. 256-bit AVX instructions should also be added in the table soon. And there a few more 128-bit versions to handled, which should come in the following commits. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139687 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-14 02:36:58 +00:00
Nadav Rotem	dfb5935c76	swap vselect operand order - pr10907 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139630 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 19:56:38 +00:00
Bruno Cardoso Lopes	df24e1fb08	Add versions 256-bit versions of alignedstore and alignedload, to be more strict about the alignment checking. This was found by inspection and I don't have any testcases so far, although the llvm testsuite runs without any problem. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139625 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 19:33:03 +00:00
Craig Topper	58bbb81764	Remove filter that was preventing MOVDQU/MOVDQA and their VEX forms from being disassembled. Also added encodings for the other register/register form of these instructions. Fixes PR10848. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139588 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 06:54:58 +00:00
Craig Topper	6b0b2d6c41	Fix encoding of VMOVDQU to not simultaneously be 'TB OpSize' and 'XS'. 'XS' is correct and seems to have been taking priority. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139587 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 06:39:34 +00:00
Bruno Cardoso Lopes	5fc48100ee	Fix PR10845. SUBREG_TO_REG shouldn't be used when the input and destination types are equal! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139553 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 22:59:23 +00:00
Bruno Cardoso Lopes	93474f5f7f	Organize a bit the operand names for CMPPS and CMPPD git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139527 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 19:30:36 +00:00
Bruno Cardoso Lopes	cf355422d6	Realign BLEND patterns to match the general style for patterns in .td file. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139526 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 19:30:33 +00:00
Bruno Cardoso Lopes	3445df77d4	Fix 80-columns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139525 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 19:30:29 +00:00
Nadav Rotem	5ed0983200	Format patterns, remove unused X86blend patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139491 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 08:41:50 +00:00
Craig Topper	136046c9a2	Fix disassembling of one of the register/register forms of MOVUPS/MOVUPD/MOVAPS/MOVAPD/MOVSS/MOVSD and their VEX equivalents. Fixes PR10877. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139486 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-11 23:19:54 +00:00
Nadav Rotem	fbad25e120	CR fixes per Bruno's request. Undo the changes from r139285 which added custom lowering to vselect. Add tablegen lowering for vselect. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139479 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-11 15:02:23 +00:00
Nadav Rotem	8ffad56f8e	Implement vector-select support for avx256. Refactor the vblend implementation to have tablegen match the instruction by the node type git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139400 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-09 20:29:17 +00:00
Bruno Cardoso Lopes	7ec8fb8830	Add a AVX version of a simple i64 -> f64 bitcast. This could be triggered using llc with -O0, which wouldn't let it be folded and expose the lack of this pattern. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139320 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 21:52:33 +00:00
Bruno Cardoso Lopes	814c6ced85	Add AVX versions of blend vector operations and fix some issues noticed in Nadav's r139285 and r139287 commits. 1) Rename vsel.ll to a more descriptive name 2) Change the order of BLEND operands to "Op1, Op2, Cond", this is necessary because PBLENDVB is already used in different places with this order, and it was being emitted in the wrong way for vselect 3) Add AVX patterns and tests for the same SSE41 instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139305 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 18:05:08 +00:00
Bruno Cardoso Lopes	7db2d3a504	Fix PR10844: Add patterns to cover non foldable versions of X86vzmovl. Triggered using llc -O0. Also fix some SET0PS patterns to their AVX forms and test it on the testcase. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139304 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 18:05:02 +00:00
Nadav Rotem	ffe3e7da84	Add X86-SSE4 codegen support for vector-select. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139285 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 08:11:19 +00:00
Bruno Cardoso Lopes	2c84e96d3e	Add AVX versions to match AESENC/AESDEC intrinsics. This hopefully ends the cycle of missing AVX counterparts of already present SSE* patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139073 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:08 +00:00
Bruno Cardoso Lopes	9f63615b17	Add AVX version of a SSE4.1 VPBLENDVB pattern git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139072 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:05 +00:00
Bruno Cardoso Lopes	d01ef7d978	Add AVX versions of SSE4.1 EXTRACTPS patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139071 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:03 +00:00
Bruno Cardoso Lopes	2b0e0a42d1	Add AVX versions for SSE4.1 MOVZX* patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139070 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:01 +00:00
Bruno Cardoso Lopes	a67806530c	Add one more AVX pattern for MOVZPQILo2PQI git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139069 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:58 +00:00
Bruno Cardoso Lopes	d29dd5ec9f	Move PUNPCKLQDQ splat pattern close to the instruction definition and duplicate it for AVX mode. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139068 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:56 +00:00
Bruno Cardoso Lopes	914a2a319c	Add AVX pattern versions for PSHUFB,PSIGN{B,W,D} git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139067 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:54 +00:00
Bruno Cardoso Lopes	a4ac989a1c	Add AVX versions of MOVZDI2PDI patterns. Use SUBREG_TO_REG to indicate that the AVX versions (even the 128-bit ones) all clear the upper part of the destination register. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139066 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:51 +00:00
Bruno Cardoso Lopes	152a287374	Enforce subtarget checks in a few places to be explicit when the pattern should be matched git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139065 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:49 +00:00
Bruno Cardoso Lopes	5ab6dcc4bb	Tidy up code moving patterns to their appropriate place! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139064 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:47 +00:00
Bruno Cardoso Lopes	0e59a04849	Add AVX versions of FsMOVAPS and FsMOVAPS. Teach X86InstrInfo how to use it! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139063 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:45 +00:00
Bruno Cardoso Lopes	1aab5515f6	Fix 80-column and style git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139061 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:40 +00:00

1 2 3 4 5 ...

939 Commits