llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-10-07 20:57:12 +00:00

Author	SHA1	Message	Date
Craig Topper	dc479c4a89	Add X86 INVPCID instruction. Add 32/64-bit predicates to INVEPT, INVVPID, VMREAD, and VMWRITE to remove hack from X86RecognizableInstr. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@142117 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-16 07:05:40 +00:00
Chris Lattner	d8b7aa2613	Enhance llvm::SourceMgr to support diagnostic ranges, the same way clang does. Enhance the X86 asmparser to produce ranges in the one case that was annoying me, for example: test.s:10:15: error: invalid operand for instruction movl 0(%rax), 0(%edx) ^~~~~~~ It should be straight-forward to enhance filecheck, tblgen, and/or the .ll parser to use ranges where appropriate if someone is interested. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@142106 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-16 04:47:35 +00:00
Craig Topper	17730847d5	Add X86 BEXTR instruction. This instruction uses VEX.vvvv to encode Operand 3 instead of Operand 2 so needs special casing in the disassembler and code emitter. Ultimately, should pass this information from tablegen git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@142105 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-16 03:51:13 +00:00
Craig Topper	4145c49aa0	Add X86 feature detection support for BMI instructions. Added new cpuid function for accessing leafs with sub leafs specified in ECX. Also added code to keep track of the max cpuid level supported in both basic and extended leaves and qualified the existing cpuid calls and the new call to leaf 7. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@142089 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-16 00:21:51 +00:00
Craig Topper	566f233ba6	Add support for X86 blsr, blsmsk, and blsi instructions. Required extra work because these are the first VEX encoded instructions to use the reg field as an opcode extension. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@142082 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-15 20:46:47 +00:00
Benjamin Kramer	003fad98cc	SmallVector -> array git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@142073 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-15 13:28:31 +00:00
Evan Cheng	b10946a5a9	A few 80-col violations. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141988 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-14 20:36:23 +00:00
Craig Topper	54a11176f6	Add X86 ANDN instruction. Including instruction selection. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141947 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-14 07:06:56 +00:00
Craig Topper	909652f687	Add X86 TZCNT instruction and patterns to select it. Also added core-avx2 processor which is gcc's name for Haswell. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141939 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-14 03:21:46 +00:00
Jakob Stoklund Olesen	ccbe603869	Ban rematerializable instructions with side effects. TableGen infers unmodeled side effects on instructions without a pattern. Fix some instruction definitions where that was overlooked. Also raise an error if a rematerializable instruction has unmodeled side effects. That doesn't make any sense. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141929 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-14 01:00:49 +00:00
Jakob Stoklund Olesen	0a951fba75	V_SET0 has no side effects. TableGen will mark any pattern-less instruction as having unmodeled side effects. This is extra bad for V_SET0 which gets rematerialized a lot. This was part of the cause for PR11125, but the real bug was fixed in r141923. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141924 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-14 00:39:50 +00:00
Eli Friedman	d83a54fd35	Simplify assertion, and avoid undefined shift. Based on patch by Ahmed Charles. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141912 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-13 23:27:48 +00:00
Bill Wendling	4e68054b20	More closely follow libgcc, which has code after the `ret' instruction to release the stack segment and reset the stack pointer. Place the code in its own MBB to make the verifier happy. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141859 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-13 08:24:19 +00:00
Bill Wendling	1203fe7fc8	Revert r141854 because it was causing failures: http://lab.llvm.org:8011/builders/llvm-x86_64-linux/builds/101 --- Reverse-merging r141854 into '.': U test/MC/Disassembler/X86/x86-32.txt U test/MC/Disassembler/X86/simple-tests.txt D test/CodeGen/X86/bmi.ll U lib/Target/X86/X86InstrInfo.td U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86.td U lib/Target/X86/X86Subtarget.h git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141857 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-13 07:48:07 +00:00
Bill Wendling	82222c20be	Should not add instructions to a BB after a return instruction. The machine instruction verifier doesn't like this, nor do I. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141856 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-13 07:42:32 +00:00
Craig Topper	8ab1d1e900	Add X86 TZCNT instruction and patterns to select it. Also added core-avx2 processor which is gcc's name for Haswell. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141854 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-13 07:09:14 +00:00
Craig Topper	d501c714cd	Add 'implicit EFLAGS' to patterns for popcnt and lzcnt git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141853 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-13 06:18:52 +00:00
Nick Lewycky	1f9c6860bf	Fix indent in comment. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141749 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-12 00:14:12 +00:00
Craig Topper	c48b301fb0	Add HasPOPCNT predicate to the POPCNT instructions. Also mark POPCNT as modifying EFLAGS. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141656 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-11 07:13:09 +00:00
Craig Topper	227358e93c	Make Ivy Bridge 16-bit floating point conversion instructions require AVX. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141654 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-11 07:01:37 +00:00
Craig Topper	37f2167f15	Add X86 LZCNT instruction. Including instruction selection support. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141651 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-11 06:44:02 +00:00
Craig Topper	29480fd798	Fix disassembling of popcntw. Also remove some code that says it accounts for 64BIT_REXW_XD not existing, but it does exist. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141642 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-11 04:34:23 +00:00
Lang Hames	4ad06e61c0	Fixed natural stack alignment for Linux x86-32. Thanks Eli. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141616 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-11 00:51:36 +00:00
Lang Hames	bb5b3f3359	Add a natural stack alignment field to TargetData, and prevent InstCombine from promoting allocas to preferred alignments that exceed the natural alignment. This avoids some potentially expensive dynamic stack realignments. The natural stack alignment is set in target data strings via the "S<size>" option. Size is in bits and must be a multiple of 8. The natural stack alignment defaults to "unspecified" (represented by a zero value), and the "unspecified" value does not prevent any alignment promotions. Target maintainers that care about avoiding promotions should explicitly add the "S<size>" option to their target data strings. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141599 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-10 23:42:08 +00:00
Eli Friedman	dca62d53b7	Make sure the X86 backend doesn't explode on 128-bit shuffles in AVX mode. Fixes PR11102. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141585 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-10 22:28:47 +00:00
Benjamin Kramer	717073c237	X86: Add a subtarget definition for core-avx-i, which is GCC's name for ivy bridge. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141571 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-10 19:35:07 +00:00
Nadav Rotem	a7934dd8e4	Fix 10892 - When lowering SIGN_EXTEND_INREG do not lower v2i64 because the instruction set has no 64-bit SRA support. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141570 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-10 19:31:45 +00:00
Benjamin Kramer	a86a58695d	X86: Add patterns for the movbe instruction (mov + bswap, only available on atom) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141563 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-10 18:34:56 +00:00
Craig Topper	1f104804bf	Put a bunch of calls to ToggleFeature behind proper if statements. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141527 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-10 05:34:02 +00:00
Craig Topper	da394041c4	Add Ivy Bridge 16-bit floating point conversion instructions for the X86 disassembler. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141505 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-09 07:31:39 +00:00
Jakob Stoklund Olesen	b7994fedcb	Prevent potential NOREX bug. A GR8_NOREX virtual register is created when extracting a sub_8bit_hi sub-register: %vreg2<def> = COPY %vreg1:sub_8bit_hi; GR8_NOREX:%vreg2 %GR64_ABCD:%vreg1 TEST8ri_NOREX %vreg2, 1, %EFLAGS<imp-def>; GR8_NOREX:%vreg2 If such a live range is ever split, its register class must not be inflated to GR8. The sub-register copy can only target GR8_NOREX. I don't have a test case for this theoretical bug. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141500 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-08 20:20:03 +00:00
Jakob Stoklund Olesen	ed74482704	Add TEST8ri_NOREX pseudo to constrain sub_8bit_hi copies. In 64-bit mode, sub_8bit_hi sub-registers can only be used by NOREX instructions. The COPY created from the EXTRACT_SUBREG DAG node cannot target all GR8 registers, only those in GR8_NOREX. To enforce this, we ensure that all instructions using the EXTRACT_SUBREG are GR8_NOREX constrained. This fixes PR11088. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141499 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-08 18:28:28 +00:00
Jakob Stoklund Olesen	b66f18486a	Constrain both operands on MOVZX32_NOREXrr8. This instruction is explicitly encoded without an REX prefix, so both operands must be *_NOREX. Also add an assertion to copyPhysReg() that fires when the MOV8rr_NOREX constraints are not satisfied. This fixes a miscompilation in 20040709-2 in the gcc test suite. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141410 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-07 20:15:54 +00:00
Evan Cheng	7c1780c5fe	High bits of movmskp{s|d} and pmovmskb are known zero. rdar://10247336 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141371 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-07 17:21:44 +00:00
Craig Topper	75fe5f3bab	Add X86 disassembler support for RDFSBASE, RDGSBASE, WRFSBASE, and WRGSBASE. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141358 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-07 07:02:24 +00:00
Craig Topper	1b526a98e3	Add X86 disassembler support for XSAVE, XRSTOR, and XSAVEOPT. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141354 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-07 05:53:50 +00:00
Craig Topper	25f6dfd108	Revert part of r141274. Only need to change encoding for xchg %eax, %eax in 64-bit mode. This is because in 64-bit mode xchg %eax, %eax implies zeroing the upper 32-bits of RAX which makes it not a NOP. In 32-bit mode using NOP encoding is fine. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141353 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-07 05:35:38 +00:00
Craig Topper	7ea16b01fa	Fix assembling of xchg %eax, %eax to not use the NOP encoding of 0x90. This was done by creating a new register group that excludes AX registers. Fixes PR10345. Also added aliases for flipping the order of the operands of xchg <reg>, %eax. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141274 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-06 06:44:41 +00:00
Peter Collingbourne	de8f33c199	Build system infrastructure for multiple tblgens. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141266 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-06 01:51:51 +00:00
Jakob Stoklund Olesen	9bb272c900	Override TRI::getSubClassWithSubReg for X86. There are fewer registers with sub_8bit sub-registers in 32-bit mode than in 64-bit mode. In 32-bit mode, sub_8bit behaves the same as sub_8bit_hi. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141206 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-05 20:26:33 +00:00
Craig Topper	41e59c7c34	Change C++ style comments to C style comments in X86 disassembler. Patch from Joe Abbey. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141162 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-05 03:29:32 +00:00
Owen Anderson	2fec6c5ff1	Teach the MC to output code/data region marker labels in MachO and ELF modes. These are used by disassemblers to provide better disassembly, particularly on targets like ARM Thumb that like to intermingle data in the TEXT segment. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141135 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-04 23:26:17 +00:00
Craig Topper	6744a17dcf	Add support in the disassembler for ignoring the L-bit on certain VEX instructions. Mark instructions that have this behavior. Fixes PR10676. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141065 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-04 06:30:42 +00:00
Craig Topper	581fe82c84	Add support for MOVBE and RDRAND instructions for the assembler and disassembler. Includes feature flag checking, but no intrinsic support. Fixes PR10832, PR11026 and PR11027. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@141007 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-03 17:28:23 +00:00
Craig Topper	04c5be9f12	Treat VEX.vvvv as a 3-bit field outside of 64-bit mode. Prevents access to registers xmm8-xmm15 outside 64-bit mode. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140997 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-03 08:14:29 +00:00
Craig Topper	7b22976de3	Fix VEX disassembling to ignore REX.RXBW bits in 32-bit mode. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140993 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-03 07:51:09 +00:00
Craig Topper	82f131a017	Fix some Intel syntax disassembly issues with instructions that implicitly use AL/AX/EAX/RAX such as ADD/SUB/ADC/SUBB/XOR/OR/AND/CMP/MOV/TEST. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140974 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-02 21:08:12 +00:00
Craig Topper	146c6d77f0	Special case disassembler handling of REX.B prefix on NOP instruction to decode as XCHG R8D, EAX instead. Fixes PR10344. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140971 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-02 16:56:09 +00:00
Craig Topper	846a2dcada	Fix disassembling of INVEPT and INVVPID to take operands git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140955 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-01 21:20:14 +00:00
Craig Topper	e1b4a1a07e	Fix disassembler handling of CRC32 which is an odd instruction that uses 0xf2 as an opcode extension and allows the opsize prefix. This necessitated adding IC_XD_OPSIZE and IC_64BIT_XD_OPSIZE contexts. Unfortunately, this increases the size of the disassembler tables. Fixes PR10702. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140954 91177308-0d34-0410-b5e6-96231b3b80d8	2011-10-01 19:54:56 +00:00
Jakob Stoklund Olesen	c8e2bb68bb	Store sub-class lists as a bit vector. This uses less memory and it reduces the complexity of sub-class operations: - hasSubClassEq() and friends become O(1) instead of O(N). - getCommonSubClass() becomes O(N) instead of O(N^2). In the future, TableGen will infer register classes. This makes it cheap to add them. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140898 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-30 22:19:07 +00:00
Jakob Stoklund Olesen	92fb79b7a6	Expand the x86 V_SET0* pseudos right after register allocation. This also makes it possible to reduce the number of pseudo instructions and get rid of the encoding information. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140776 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-29 05:10:54 +00:00
Eli Friedman	7d3e2b78c7	PR11033: Make sure we don't generate PCMPGTQ and PCMPEQQ if the target CPU does not support them. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140723 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-28 21:00:25 +00:00
Jakob Stoklund Olesen	d4d4fca9c3	Rename SSEDomainFix -> lib/CodeGen/ExecutionDepsFix. I'll clean up the source in the next commit. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140663 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-28 00:01:54 +00:00
Jakob Stoklund Olesen	df4b35e3dd	Remove X86-dependent stuff from SSEDomainFix. This also enables domain swizzling for AVX code which required a few trivial test changes. The pass will be moved to lib/CodeGen shortly. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140659 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-27 23:50:46 +00:00
Jakob Stoklund Olesen	98e933f9ad	Promote the X86 Get/SetSSEDomain functions to TargetInstrInfo. I am going to unify the SSEDomainFix and NEONMoveFix passes into a single target independent pass. They are essentially doing the same thing. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140652 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-27 22:57:18 +00:00
Craig Topper	100d86ada5	Fix VEX decoding in i386 mode. Fixes PR11008. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140515 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-26 05:12:43 +00:00
Jakob Stoklund Olesen	51f0c76419	Only run MF.verify() with EXPENSIVE_CHECKS=1. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140441 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-24 01:11:19 +00:00
Duncan Sands	04aa4aee89	Implement Chris's suggestion of legalizing the various SSE and AVX hadd/hsub intrinsics into the new fhadd/fhsub X86 node. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140383 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-23 16:10:22 +00:00
Eli Friedman	a6176adc8a	PR10991: make fast-isel correctly check whether accessing a global through an alias involves thread-local storage. (I'm not entirely sure how this is supposed to work, but this patch makes fast-isel consistent with the normal isel path.) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140355 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-22 23:41:28 +00:00
Jakob Stoklund Olesen	4bd89873be	Add support for GR32 <-> FR32 cross class copies. We already support GR64 <-> VR128 copies. All of these copies break partial register dependencies by zeroing the high part of the target register. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140348 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-22 22:45:24 +00:00
Duncan Sands	17470bee5f	Synthesize SSE3/AVX 128 bit horizontal add/sub instructions from floating point add/sub of appropriate shuffle vectors. Does not synthesize the 256 bit AVX versions because they work differently. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140332 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-22 20:15:48 +00:00
Craig Topper	adf01b3f18	Fix register printing in disassembling of push/pop of segment registers and in/out in Intel syntax mode. Fixes PR10960 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140299 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-22 07:01:50 +00:00
Benjamin Kramer	2c2ccbf108	The SSE version differences for fmin/fmax are more involved than I thought. - x87: no min or max. - SSE1: min/max for single precision scalars and vectors. - SSE2: min/max for single and double precision scalars and vectors. - AVX: as SSE2, but also supports the wider ymm vectors. (this is covered by the isTypeLegal check) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140296 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-22 03:27:22 +00:00
Benjamin Kramer	74f3501d15	X86: Don't form min/max nodes if the target is missing SSE. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140294 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-22 03:01:42 +00:00
Benjamin Kramer	15c9a1f60c	X86Disassembler: if verbose logging is going to nulls(), disable logging completely. Otherwise we'll spend a ridiculous amount of time pretty printing debug output and then discarding it. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140276 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-21 21:47:35 +00:00
Nadav Rotem	64ac73bb15	fix comment git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140258 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-21 17:14:40 +00:00
Nadav Rotem	9c6cdf4c1c	Insert a sanity check on the combining of x86 truncing-store nodes. This comes to replace the problematic check that was removed in r139995. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140246 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-21 08:45:10 +00:00
Richard Trieu	23946fcaae	Change: assert(!"error message"); To: assert(0 && "error message"); which is more consistent across the code base. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140234 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-21 03:09:09 +00:00
Owen Anderson	317eaf1993	In the disassembler C API, be careful not to confuse the comment streamer that the disassembler outputs annotations on with the streamer that the InstPrinter will print them on. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140217 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-21 00:25:23 +00:00
Bruno Cardoso Lopes	f4b841d4e2	Revert r140097, working on a better approach git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140203 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-20 23:19:29 +00:00
Bruno Cardoso Lopes	149f29f1fd	Simplify max/minp[s|d] dagcombine matching git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140199 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-20 22:34:45 +00:00
Bruno Cardoso Lopes	4e42335972	Tidy up a bit more, fix tab and remove trailing whitespaces git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140186 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-20 21:45:26 +00:00
Bruno Cardoso Lopes	448d986858	The wrong relocation was being emitted for several SSSE3 instructions. This fixes PR10963. Thanks to Benjamin for finding the wrong tablegen declaration. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140184 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-20 21:39:21 +00:00
Bruno Cardoso Lopes	77169a9197	Tidy up code! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140183 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-20 21:39:06 +00:00
Craig Topper	3699261d3f	Extend changes from r139986 to produce 256-bit AVX minps/minpd/maxps/maxpd. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140140 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-20 07:38:59 +00:00
Bruno Cardoso Lopes	d91c6e058b	Fix PR10949. Fix the encoding of VMOVPQIto64rr. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140098 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-19 23:36:59 +00:00
Bruno Cardoso Lopes	97136c922e	Based on the small opt Zvi's patch was trying to achieve, eliminate 128-bit undef subvector insertion into a 256-bit vector git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140097 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-19 23:36:50 +00:00
Bruno Cardoso Lopes	97dc60b759	Match X86ISD::FSETCCsd and X86ISD::FSETCCss while in AVX mode. This fixes PR10955 and PR10948. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140069 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-19 21:29:24 +00:00
Nadav Rotem	ca6f296b48	Fix typos in my prev commit, found by Tobi. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140003 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-18 19:00:23 +00:00
Nadav Rotem	354efd88db	setOperationAction should be done on the return value of the type, not the operands. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@140001 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-18 14:57:03 +00:00
Nadav Rotem	91e43fd17a	When promoting integer vectors we often create ext-loads. This patch adds a dag-combine optimization to implement the ext-load efficiently (using shuffles). For example the type <4 x i8> is stored in memory as i32, but it needs to find its way into a <4 x i32> register. Previously we scalarized the memory access, now we use shuffles. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139995 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-18 10:39:32 +00:00
Craig Topper	89af15ee11	Fix typo by changing Lower256IntVETCC to Lower256IntVSETCC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139993 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-18 08:03:58 +00:00
Duncan Sands	6bcd2196e5	Synthesize x86 max/min instructions also for vectors (i.e. produce maxps and maxpd). This broke the sse41-blend.ll testcase by causing maxpd to be produced rather than a cmp+blend pair, which is the reason I tweaked it. Gives a small speedup on doduc with dragonegg when the GCC vectorizer is used. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139986 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-17 16:49:39 +00:00
Bruno Cardoso Lopes	2c693dc126	Describe more AVX 128-bit convert instructions without patterns to have mayLoad = 1 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139973 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-16 23:41:29 +00:00
Bruno Cardoso Lopes	7291272ab2	Add mayLoad attribute to AVX convert instructions, since none of them are declared with load patterns. This fixes the crash in PR10941. No testcases, since a fold is triggered and then converted back to the register form afterwards. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139953 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-16 22:02:14 +00:00
Bruno Cardoso Lopes	08ecb711ac	Fix PR10884. This PR basically reports a problem where a crash in generated code happened due to %rbp being clobbered: pushq %rbp movq %rsp, %rbp .... vmovmskps %ymm12, %ebp .... movq %rbp, %rsp popq %rbp ret Since Eric's r123367 commit, the default stack alignment for x86 32-bit has changed to be 16-bytes. Since then, the MaxStackAlignmentHeuristicPass hasn't been really used, but with AVX it becomes useful again, since per ABI compliance we don't always align the stack to 256-bit, but only when there are 256-bit incoming arguments. ReserveFP was only used by this pass, but there's no RA target hook that uses getReserveFP() to check for the presence of FP (since nothing was triggering the pass to run, the uses of getReserveFP() were removed through time without being noticed). Change this pass to use setForceFramePointer, which is properly called by MachineFunction hasFP method. The testcase is very big and dependent on RA, not sure if it's worth adding to test/CodeGen/X86. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139939 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-16 20:58:28 +00:00
Owen Anderson	98c5ddabca	Don't attach annotations to MCInst's. Instead, have the disassembler return, and the printer accept, an annotation string which can be passed through if the client cares about annotations. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139876 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-15 23:38:46 +00:00
Bruno Cardoso Lopes	6b5b79c7e8	Add a fixme note! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139872 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-15 23:04:24 +00:00
Bruno Cardoso Lopes	b4e905d027	Add the remaining AVX versions of instructions to X86InstrInfo, this time for describing high latency ones and for recognizing loads from the same base pointer git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139864 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-15 22:15:52 +00:00
Bruno Cardoso Lopes	cd2857ee67	Factor out partial register update checks for some SSE instructions. Also add the AVX versions and add comments! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139854 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-15 21:42:23 +00:00
Owen Anderson	ede042dc8d	Add support for stored annotations to MCInst, and provide facilities for MC-based InstPrinters to print them out. Enhance the ARM and X86 InstPrinter's to do so in verbose mode. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139820 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-15 18:36:29 +00:00
Bruno Cardoso Lopes	0c4b9ff077	Change all checks regarding the presence of any SSE level to always take into consideration the presence of AVX. This change, together with the SSEDomainFix enabled for AVX, makes AVX codegen always (hopefully) emit the same code as SSE for 128-bit vector ops. I don't have a testcase for this, but AVX now beats SSE in performance for 128-bit ops in the majority of programs in the llvm testsuite git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139817 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-15 18:27:36 +00:00
Bruno Cardoso Lopes	41a9635292	Enable SSEDomainFix pass for AVX mode. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139816 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-15 18:27:32 +00:00
Eli Friedman	322ea080ad	Fix the code creating VZEXT_LOAD so that it creates the right memoperand. Issue spotted in -debug output. I can't think of any practical effects at the moment, but it might matter if we start doing more aggressive alias analysis in CodeGen. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139758 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-14 23:42:45 +00:00
Craig Topper	a08e255e1e	Fix mem type for VEX.128 form of VROUNDP*. Remove filter preventing VROUND from being recognized by disassembler. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139691 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-14 06:41:26 +00:00
Craig Topper	3bb43a829e	Make disassembling of VBLEND* print immediate as a XMM/YMM register name. Fixes PR10917. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139690 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-14 05:55:28 +00:00
Bruno Cardoso Lopes	484ddf54c9	Teach the foldable tables about 128-bit AVX instructions and make the alignment check for 256-bit classes more strict. There are no testcases but we catch more folding cases for AVX while running single and multi sources in the llvm testsuite. Since some 128-bit AVX instructions have a different number of operands than their SSE counterparts, they are placed in different tables. 256-bit AVX instructions should also be added in the table soon. And there are a few more 128-bit versions to be handled, which should come in the following commits. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139687 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-14 02:36:58 +00:00
Bruno Cardoso Lopes	5ca0d14915	Vector shuffle mask <i32 4, i32 5, i32 2, i32 3> should yield "movsd", not "movss". git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139686 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-14 02:36:14 +00:00
Nadav Rotem	dfb5935c76	swap vselect operand order - pr10907 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139630 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 19:56:38 +00:00
Bruno Cardoso Lopes	df24e1fb08	Add 256-bit versions of alignedstore and alignedload, to be more strict about the alignment checking. This was found by inspection and I don't have any testcases so far, although the llvm testsuite runs without any problem. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139625 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 19:33:03 +00:00
Bruno Cardoso Lopes	809f17fbb1	Revert the remaining part of r139528. According to PR10907 the bug seems to be in the VSELECT operands order, so I'll leave the fix for Nadav. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139624 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 19:33:00 +00:00
Nadav Rotem	aec5861bb6	Add vselect target support for targets that do not support blend but do support xor/and/or (For example SSE2). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139623 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 19:17:42 +00:00
Craig Topper	4bbeb18f76	Only disassemble instructions with vvvv != 1111 if the instruction actually uses the vvvv field to encode an operand. Fixes PR10851. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139591 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 07:37:44 +00:00
Craig Topper	58bbb81764	Remove filter that was preventing MOVDQU/MOVDQA and their VEX forms from being disassembled. Also added encodings for the other register/register form of these instructions. Fixes PR10848. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139588 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 06:54:58 +00:00
Craig Topper	6b0b2d6c41	Fix encoding of VMOVDQU to not simultaneously be 'TB OpSize' and 'XS'. 'XS' is correct and seems to have been taking priority. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139587 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 06:39:34 +00:00
Eli Friedman	f73c881f4a	Fix the assembler strings for a couple of atomic instructions. Doesn't really matter much in practice, but it's a bit cleaner. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139563 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 00:27:04 +00:00
Bruno Cardoso Lopes	5fc48100ee	Fix PR10845. SUBREG_TO_REG shouldn't be used when the input and destination types are equal! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139553 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 22:59:23 +00:00
Bruno Cardoso Lopes	457d53d9ce	Revert the wrong part of r139528, and fix testcases. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139541 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 21:24:07 +00:00
Bruno Cardoso Lopes	8e03a821f9	Not sure how CMPPS and CMPPD had ever worked; I guess they didn't. However, with this fix they do now. Basically, the operand order for the x86 target-specific node is not the same as the instruction's, but since the intrinsics need that specific order at the instruction definition, just change the order during legalization. Also, there were some wrong inversions of condition codes, such as GE => LE, GT => LT; fix that too. Fix PR10907. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139528 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 19:30:40 +00:00
Bruno Cardoso Lopes	93474f5f7f	Organize a bit the operand names for CMPPS and CMPPD git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139527 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 19:30:36 +00:00
Bruno Cardoso Lopes	cf355422d6	Realign BLEND patterns to match the general style for patterns in .td file. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139526 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 19:30:33 +00:00
Bruno Cardoso Lopes	3445df77d4	Fix 80-columns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139525 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 19:30:29 +00:00
Nadav Rotem	5ed0983200	Format patterns, remove unused X86blend patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139491 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 08:41:50 +00:00
Craig Topper	136046c9a2	Fix disassembling of one of the register/register forms of MOVUPS/MOVUPD/MOVAPS/MOVAPD/MOVSS/MOVSD and their VEX equivalents. Fixes PR10877. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139486 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-11 23:19:54 +00:00
Craig Topper	038197988b	Fix disassembling of reverse register/register forms of ADD/SUB/XOR/OR/AND/SBB/ADC/CMP/MOV. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139485 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-11 21:41:45 +00:00
Nadav Rotem	fbad25e120	CR fixes per Bruno's request. Undo the changes from r139285 which added custom lowering to vselect. Add tablegen lowering for vselect. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139479 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-11 15:02:23 +00:00
Eli Friedman	106f6e7a27	r139454 activates an assert in a case where we were doing the right thing anyway. Make that explicit, and un-XFAIL the testcase. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139458 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-10 02:01:42 +00:00
Richard Trieu	81cbb0ad60	Fix the asserts in lib/Target/X86/X86ELFWriterInfo.cpp and lib/ExecutionEngine/MCJIT/MCJIT.cpp from: assert("error"); to: assert(0 && "error"); git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139456 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-10 01:42:07 +00:00
Richard Trieu	2db8628085	Fixed an assert from: assert("not implemented for target shuffle node"); to: assert(0 && "not implemented for target shuffle node"); This causes a test failure in CodeGen/X86/palignr.ll which has been marked as XFAIL for the time being. Test failure filed at PR10901. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139454 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-10 01:26:21 +00:00
Nadav Rotem	8ffad56f8e	Implement vector-select support for avx256. Refactor the vblend implementation to have tablegen match the instruction by the node type git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139400 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-09 20:29:17 +00:00
Craig Topper	ccfa4ed4e0	Fix handling of Intel syntax disassembling of movs and stos to stop being blank. Also fixed scas and cmps to always print the size suffix in Intel syntax since it's ambiguous without arguments. Fixes PR10875. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139353 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-09 05:40:53 +00:00
Nadav Rotem	ee64be9c17	Fix the 80-columns and remove unsupported v8i16 type from the list of legal vselect types. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139324 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 22:17:35 +00:00
Bruno Cardoso Lopes	7ec8fb8830	Add an AVX version of a simple i64 -> f64 bitcast. This could be triggered using llc with -O0, which wouldn't let it be folded and exposed the lack of this pattern. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139320 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 21:52:33 +00:00
Bruno Cardoso Lopes	cbf479df8a	* Combines Alignment, AuxInfo, and TB_NOT_REVERSABLE flag into a single field (Flags), which is a bitwise OR of items from the TB_* enum. This makes it easier to add new information in the future. * Gives every static array an equivalent layout: { RegOp, MemOp, Flags } * Adds a helper function, AddTableEntry, to avoid duplication of the insertion code. * Renames TB_NOT_REVERSABLE to TB_NO_REVERSE. * Adds TB_NO_FORWARD, which is analogous to TB_NO_REVERSE, except that it prevents addition of the Reg->Mem entry. (This is going to be used by Native Client, in the next CL). Patch by David Meyer git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139311 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 18:35:57 +00:00
Bruno Cardoso Lopes	814c6ced85	Add AVX versions of blend vector operations and fix some issues noticed in Nadav's r139285 and r139287 commits. 1) Rename vsel.ll to a more descriptive name 2) Change the order of BLEND operands to "Op1, Op2, Cond", this is necessary because PBLENDVB is already used in different places with this order, and it was being emitted in the wrong way for vselect 3) Add AVX patterns and tests for the same SSE41 instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139305 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 18:05:08 +00:00
Bruno Cardoso Lopes	7db2d3a504	Fix PR10844: Add patterns to cover non foldable versions of X86vzmovl. Triggered using llc -O0. Also fix some SET0PS patterns to their AVX forms and test it on the testcase. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139304 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 18:05:02 +00:00
Nadav Rotem	ffe3e7da84	Add X86-SSE4 codegen support for vector-select. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139285 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 08:11:19 +00:00
Eli Friedman	d5ccb0558f	Fix atomic load and store on x86 to pass -verify-machineinstrs (and possibly fix some subtle bugs involving passes which check mayStore()). This isn't exactly ideal, but it is good enough for the moment. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139245 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-07 18:48:32 +00:00
James Molloy	b950585cc5	Refactor instprinter and mcdisassembler to take a SubtargetInfo. Add -mattr= handling to llvm-mc. Reviewed by Owen Anderson. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139237 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-07 17:24:38 +00:00
Rafael Espindola	ca59221fdc	Detect attempt to use segmented stacks on non ELF systems and error (not assert) early. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139233 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-07 16:10:57 +00:00
Bill Wendling	c8725d11f8	Reenable compact unwind by default. However, also emit the old version of unwind information for older linkers. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139206 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-06 23:47:14 +00:00
Rafael Espindola	5c984df26b	Fix comment. Noticed by Duncan. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139161 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-06 19:29:31 +00:00
Duncan Sands	28b77e968d	Add codegen support for vector select (in the IR this means a select with a vector condition); such selects become VSELECT codegen nodes. This patch also removes VSETCC codegen nodes, unifying them with SETCC nodes (codegen was actually often using SETCC for vector SETCC already). This ensures that various DAG combiner optimizations kick in for vector comparisons. Passes dragonegg bootstrap with no testsuite regressions (nightly testsuite as well as "make check-all"). Patch mostly by Nadav Rotem. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139159 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-06 19:07:46 +00:00
Rafael Espindola	96428cea3d	Fix style issues and typos found by Duncan. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139154 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-06 18:43:08 +00:00
Duncan Sands	4a544a79bd	Split the init.trampoline intrinsic, which currently combines GCC's init.trampoline and adjust.trampoline intrinsics, into two intrinsics like in GCC. While having one combined intrinsic is tempting, it is not natural because typically the trampoline initialization needs to be done in one function, and the result of adjust trampoline is needed in a different (nested) function. To get around this llvm-gcc hacks the nested function lowering code to insert an additional parent variable holding the adjust.trampoline result that can be accessed from the child function. Dragonegg doesn't have the luxury of tweaking GCC code, so it stored the result of adjust.trampoline in the memory GCC set aside for the trampoline itself (this is always available in the child function), and set up some new memory (using an alloca) to hold the trampoline. Unfortunately this breaks Go which allocates trampoline memory on the heap and wants to use it even after the parent has exited (!). Rather than doing even more hacks to get Go working, it seemed best to just use two intrinsics like in GCC. Patch mostly by Sanjoy Das. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139140 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-06 13:37:06 +00:00
Nick Lewycky	1fac6b50ea	Add a new MC bit for NaCl (Native Client) mode. NaCl requires that certain instructions are more aligned than the CPU requires, and adds some additional directives, to follow in future patches. Patch by David Meyer! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139125 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-05 21:51:43 +00:00
Benjamin Kramer	c53479d9c2	Use internal storage for command line option. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139079 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 03:45:06 +00:00
Bruno Cardoso Lopes	2c84e96d3e	Add AVX versions to match AESENC/AESDEC intrinsics. This hopefully ends the cycle of missing AVX counterparts of already present SSE* patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139073 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:08 +00:00
Bruno Cardoso Lopes	9f63615b17	Add AVX version of a SSE4.1 VPBLENDVB pattern git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139072 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:05 +00:00
Bruno Cardoso Lopes	d01ef7d978	Add AVX versions of SSE4.1 EXTRACTPS patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139071 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:03 +00:00
Bruno Cardoso Lopes	2b0e0a42d1	Add AVX versions for SSE4.1 MOVZX* patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139070 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:01 +00:00
Bruno Cardoso Lopes	a67806530c	Add one more AVX pattern for MOVZPQILo2PQI git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139069 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:58 +00:00
Bruno Cardoso Lopes	d29dd5ec9f	Move PUNPCKLQDQ splat pattern close to the instruction definition and duplicate it for AVX mode. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139068 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:56 +00:00
Bruno Cardoso Lopes	914a2a319c	Add AVX pattern versions for PSHUFB,PSIGN{B,W,D} git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139067 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:54 +00:00
Bruno Cardoso Lopes	a4ac989a1c	Add AVX versions of MOVZDI2PDI patterns. Use SUBREG_TO_REG to indicate that the AVX versions (even the 128-bit ones) all clear the upper part of the destination register. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139066 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:51 +00:00
Bruno Cardoso Lopes	152a287374	Enforce subtarget checks in a few places to be explicit when the pattern should be matched git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139065 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:49 +00:00
Bruno Cardoso Lopes	5ab6dcc4bb	Tidy up code moving patterns to their appropriate place! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139064 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:47 +00:00
Bruno Cardoso Lopes	0e59a04849	Add AVX versions of FsMOVAPS and FsMOVAPS. Teach X86InstrInfo how to use it! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139063 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:45 +00:00
Bruno Cardoso Lopes	645b8be38a	Teach X86FastISel to use AVX versions of instructions when possible git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139062 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:42 +00:00
Bruno Cardoso Lopes	1aab5515f6	Fix 80-column and style git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139061 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:40 +00:00
Bruno Cardoso Lopes	e4ccf8a86c	Tidy up some SSE/AVX convert intrinsics. Also add an AVX version of OptForSize pattern git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139060 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:38 +00:00
Jakob Stoklund Olesen	5047d76575	Pseudo CMOV instructions don't clobber EFLAGS. The explanation about a 0 argument being materialized as xor is no longer valid. Rematerialization will check if EFLAGS is live before clobbering it. The code produced by X86TargetLowering::EmitLoweredSelect does not clobber EFLAGS. This causes one less testb instruction to be generated in the cmov.ll test case. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139057 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 23:52:55 +00:00
Jakob Stoklund Olesen	b8e052e123	Check for EFLAGS live-out before clobbering it. It is only allowed to clobber EFLAGS at the end of a block if it isn't live-in to any successor. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139056 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 23:52:52 +00:00
Jakob Stoklund Olesen	4a1b9d82a4	Use existing function. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139055 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 23:52:49 +00:00
Jakob Stoklund Olesen	439f71eb30	Remove unused variables. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139047 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 22:41:25 +00:00
Eli Friedman	4136d23c48	Don't fast-isel for atomic load/store; some cases require extra handling missing from fast-isel. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139044 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 22:33:24 +00:00
Kevin Enderby	d5705fe50d	Change X86 disassembly to print immediate values as signed by default. Special case those instructions where the immediate is not sign-extended. radr://8795217 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139028 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 20:01:23 +00:00
Bill Wendling	d199aa012b	Revert r138826 until PR10834 can be fixed. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139018 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 18:15:04 +00:00
Bruno Cardoso Lopes	a39ccdb9d4	Fix vbroadcast matching logic to early unmatch if the node doesn't have only one use. Fix PR10825. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138951 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-01 18:15:06 +00:00
Bruno Cardoso Lopes	fc7bc5889b	Move more code around and duplicate AVX patterns: MOVHPS and MOVLPS git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138897 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 21:15:32 +00:00
Bruno Cardoso Lopes	06c982d0e0	Move MOVAPS,MOVUPS patterns close to the instructions definition git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138896 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 21:15:29 +00:00
Bruno Cardoso Lopes	453f4954f2	Remove "_Int" forms of MOVUPSmr and MOVAPSmr git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138895 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 21:15:22 +00:00
Rafael Espindola	e81abfd30b	Spelling and grammar fixes to problems found by Duncan. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138858 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 16:43:33 +00:00
Eli Friedman	ac86d43eae	Make sure we don't crash when -miphoneos-version-min is specified on x86. Hopefully this will fix gcc testsuite failures. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138856 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 16:19:51 +00:00
Eric Christopher	c967ad8c88	Rework this conditional a bit. Patch by Sanjoy Das git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138853 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 04:17:21 +00:00
Bruno Cardoso Lopes	57d6a5e491	- Move all MOVSS and MOVSD patterns close to their definitions - Duplicate some store patterns to their AVX forms! - Caught a bug while restricting the patterns subtarget, fix it and update a testcase to check it properly git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138851 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 03:04:20 +00:00
Bruno Cardoso Lopes	fc646a6b06	Remove unnecessary AVX checks git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138850 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 03:04:14 +00:00
Bruno Cardoso Lopes	5affa5196f	Teach more places to use VMOVAPS,VMOVUPS instead of MOVAPS,MOVUPS, whenever AVX is enabled. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138849 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 03:04:09 +00:00
Evan Cheng	0899f5c62d	Fix (movhps load) lowering / pattern to match more cases. rdar://10050549 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138848 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 02:05:24 +00:00
Bill Wendling	e716124feb	Fix off-by-one error Benjamin noticed. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138832 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 21:23:24 +00:00
Bill Wendling	011a8e1684	Enable compact unwind info by default. This only applies to Darwin when CFI is disabled. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138826 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 20:54:11 +00:00
Jeffrey Yasskin	cda2a146d1	Fix C++0x narrowing errors when char is unsigned. In the case of EDInstInfo, this would actually cause a bug when -1 became 255 and was then compared >=0 in llvm-mc/Disassembler.cpp. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138825 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 20:53:29 +00:00
Rafael Espindola	151ab3e2f7	Adds support for variable sized allocas. For a variable sized alloca, code is inserted to first check if the current stacklet has enough space. If so, space is allocated by simply decrementing the stack pointer. Otherwise a runtime routine (__morestack_allocate_stack_space in libgcc) is called which allocates the required memory from the heap. Patch by Sanjoy Das. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138818 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 19:47:04 +00:00
Rafael Espindola	d07b7ec772	Adds a SelectionDAG node X86SegAlloca which will be custom lowered from DYNAMIC_STACKALLOC. Two new pseudo instructions (SEG_ALLOCA_32 and SEG_ALLOCA_64) which will match X86SegAlloca (based on word size) are also added. They will be custom emitted to inject the actual stack handling code. Patch by Sanjoy Das. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138814 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 19:43:21 +00:00
Rafael Espindola	76927d7586	Emit segmented-stack specific code into function prologues for X86. Modify the pass added in the previous patch to call this new code. The new prologues generated will call a libgcc routine (__morestack) to allocate more stack space from the heap when required. Patch by Sanjoy Das. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138812 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 19:39:58 +00:00
Eli Friedman	f3704769bb	Explicitly zero out parts of a vector which are required to be zero by the algorithm in LowerUINT_TO_FP_i32. This only has a substantial effect on the generated code when the input is extracted from a vector register; other ways of loading an i32 do the appropriate zeroing implicitly. Fixes PR10802. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138768 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-29 21:15:46 +00:00
Bruno Cardoso Lopes	41dfabb0e3	Move non-instruction patterns to a more appropriate place! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138744 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-29 17:51:24 +00:00
Nicolas Geoffray	1c36ba50ac	Remove premature previous commit. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138725 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-28 14:52:51 +00:00
Nicolas Geoffray	c98da24bed	Encoding of instructions referencing segments has changed. Do what X86MCCodeEmitter does. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138723 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-28 13:07:57 +00:00
Benjamin Kramer	2753ae314f	Silence GCC warnings and make an array const. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138706 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-27 17:36:14 +00:00
Eli Friedman	43f51aeca8	Add support for generating CMPXCHG16B on x86-64 for the cmpxchg IR instruction. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138660 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-26 21:21:21 +00:00
Craig Topper	8fd13b6de5	Fix disassembling of VCVTSD2SI git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138623 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-26 04:49:29 +00:00
Bruno Cardoso Lopes	f1a264232c	Do the same as r138461. Mark VZEROALL as clobbering all YMM registers git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138592 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 22:23:58 +00:00
Bruno Cardoso Lopes	6292eceea0	Add support for AVX 256-bit version of MOVDDUP! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138588 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 21:40:37 +00:00
Bruno Cardoso Lopes	06ef923d14	Make isMOVDDUP mask check more strict and update comments! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138587 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 21:40:34 +00:00
Craig Topper	ebc1db0fac	Add more missing TB encodings to VEX instructions to allow them to be disassembled. Fixes remainder of PR10678. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138553 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 08:11:01 +00:00
Craig Topper	ea03659d23	Add TB encoding to VZEROALL, VZEROUPPER, and VCVTPS2PD to allow them to be disassembled. Fixes PR10723. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138551 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 06:57:46 +00:00
Bruno Cardoso Lopes	07b7f672a0	Add support for 256-bit versions of VSHUFPD and VSHUFPS. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138546 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 02:58:26 +00:00
Bruno Cardoso Lopes	e7461c0353	Add memory version of SHUFPD to mask decoding! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138545 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 02:58:21 +00:00
Bruno Cardoso Lopes	27831e5e6f	Create a section for non-instruction patterns at the beginning of the file, and move more code around! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138521 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:18:11 +00:00
Bruno Cardoso Lopes	9993499057	Move code around! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138520 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:18:09 +00:00
Bruno Cardoso Lopes	de79231468	Organize UNPCK* patterns, also add remaining for AVX. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138519 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:18:06 +00:00
Bruno Cardoso Lopes	4cf4778ac4	Move remaining MOVDDUP patterns close to MOVDDUP definition and duplicate the missing ones for AVX. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138518 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:18:04 +00:00
Bruno Cardoso Lopes	4724f25ed6	Organize and tidy up MOVDDUP section. Also update comments! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138517 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:18:02 +00:00
Bruno Cardoso Lopes	6140294363	Move MOVHLPS patterns close to MOVHLPS definition, and duplicate the pattern for 128-bit AVX mode. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138516 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:17:59 +00:00
Bruno Cardoso Lopes	954d5eabb7	Move all PSHUF* patterns close to the PSHUF* definitions. Also be explicit about which subtarget they refer to, and add AVX versions of the ones we currently don't. Remove old and now wrong comments! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138515 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:17:57 +00:00
Bruno Cardoso Lopes	af002d8405	Move all SHUFP* patterns close to the SHUFP* definitions. Also be explicit about which subtarget they refer to, and add AVX versions of the ones we currently don't. Make the mask check more strict, to be clear it won't be used to match to 256-bit versions! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138514 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:17:55 +00:00
Eli Friedman	f8f90f0174	Hook up 64-bit atomic load/store on x86-32. I plan to write more efficient implementations eventually. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138505 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 22:33:28 +00:00
Eli Friedman	4317fe1fc6	Fix whitespace. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138487 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 21:17:30 +00:00
Eli Friedman	327236cd6c	Basic x86 code generation for atomic load and store instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138478 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 20:50:09 +00:00
Bruno Cardoso Lopes	356e988110	Mark VZEROALL as clobbering all YMM registers git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138461 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 18:48:33 +00:00
Evan Cheng	3e74d6fdd2	Move TargetRegistry and TargetSelect from Target to Support where they belong. These are strictly utilities for registering targets and components. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138450 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 18:08:43 +00:00
Craig Topper	13894fa135	Break 256-bit vector int add/sub/mul into two 128-bit operations to avoid costly scalarization. Fixes PR10711. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138427 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 06:14:18 +00:00
Bruno Cardoso Lopes	d8b7dd5252	Fix a nasty bug where a v4i64 was being wrongly emitted with 32-bit permutations. Also tidy up some patterns and move them close to their instruction definition! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138392 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-23 22:06:37 +00:00
Evan Cheng	7801136b95	Some refactoring so TargetRegistry.h no longer has to include any files from MC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138367 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-23 20:15:21 +00:00
Nick Lewycky	726ebd6ff3	PerformSubCombine to work on integers larger than i128. Fixes a crasher. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138354 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-23 19:01:24 +00:00
Craig Topper	a534780da0	Add support for breaking 256-bit v16i16 and v32i8 VSETCC into two 128-bit ones, avoiding scalarization. Add vex form of pcmpeqq and pcmpgtq. Fixes more cases for PR10712. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138321 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-23 04:36:33 +00:00
Bruno Cardoso Lopes	3bde6fe0df	Introduce a pass to insert vzeroupper instructions to avoid AVX to SSE transition penalty. The pass is enabled through the "x86-use-vzeroupper" llc command line option. This is only the first step (very naive and conservative one) to sketch out the idea, but proper DFA is coming next to allow smarter decisions. Comments and ideas now and in further commits will be very appreciated. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138317 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-23 01:14:17 +00:00
Benjamin Kramer	3c1fece071	X86: Add some operand types required to identify calls. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138285 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-22 22:55:32 +00:00
Bruno Cardoso Lopes	2ac8111159	Add support for breaking 256-bit int VSETCC into two 128-bit ones, avoiding scalarization of the compare. Reduces code from 59 to 6 instructions. Fix PR10712. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138271 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-22 20:31:04 +00:00
Bruno Cardoso Lopes	bde9f1b302	Add 128-bit AVX codegen for PCMP* family of integer instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138270 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-22 20:31:00 +00:00
Bruno Cardoso Lopes	0c9acfcb50	Re-write part of VEX encoding logic, to be easier to read! Also fix a bug and add a testcase! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138123 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-19 22:27:29 +00:00
Craig Topper	e004d941ec	Add TB encoding to VEX versions of SSE fp logical operations to fix disassembler git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138034 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-19 05:28:50 +00:00
Bruno Cardoso Lopes	863e0f25b7	Fix PR10677. Initial patch and idea by Peter Cooper but I've changed the implementation! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138029 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-19 02:23:56 +00:00
Bruno Cardoso Lopes	df01610d6f	Re-encoded 128-bit AVX versions of SQRT, RSQRT, RCP have 3 operands instead of 2. They were already defined this way in their regular version, but not for the intrinsics versions (_Int), and that would work for assembly emission but not for object code, since a MachineOperand would be missing. This commit fixes PR10697. Also removed the {VSQRT,VRSQRT,VRCP}r_Int forms and match the intrinsic via INSERT_SUBREG+EXTRACT_SUBREG patterns. The same couldn't be done for memory versions because sse_load_f32/sse_load_f64 operands need special handling and don't work like regular "addr" operands. There are right now 114 "_Int" and 98 "Int_*" forms! I'm slowly removing them as I step through, but hope we can get rid of these someday, they are really annoying :) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138012 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-18 23:59:21 +00:00
Bruno Cardoso Lopes	24b90e2287	Cleanup vector logical ops in AVX and add use int versions for simple v2i64 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137919 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-18 02:11:34 +00:00
Bruno Cardoso Lopes	0dd80b0d69	Fix PR10688. Add support for splitting 256-bit vector shifts when the shift amount is variable git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137885 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-17 22:12:20 +00:00
Owen Anderson	83e3f67fb6	Allow the MCDisassembler to return a "soft fail" status code, indicating an instruction that is disassemblable, but invalid. Only used for ARM UNPREDICTABLE instructions at the moment. Patch by James Molloy. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137830 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-17 17:44:15 +00:00
Bruno Cardoso Lopes	0e6d230abd	Introduce matching patterns for vbroadcast AVX instruction. The idea is to match splats in the form (splat (scalar_to_vector (load ...))) whenever the load can be folded. All the logic and instruction emission is working but because of PR8156, there is no way to match loads, because they can never be folded for splats. Thus, the tests are XFAILed, but I've tested and exercised all the logic using a relaxed version for checking the foldable loads, as if the bug was already fixed. This should work out of the box once PR8156 gets fixed since MayFoldLoad will work as expected. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137810 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-17 02:29:19 +00:00
Bruno Cardoso Lopes	8a5b262e80	Update comments about vector splat handling in x86 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137808 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-17 02:29:13 +00:00
Bruno Cardoso Lopes	fc0a702128	Now that we have a canonical way to handle 256-bit splats: vinsertf128 $1 + vpermilps $0, remove the old code that used to first do the splat in a 128-bit vector and then insert it into a larger one. This is better because the handling code gets simpler and also makes room for the upcoming vbroadcast! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137807 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-17 02:29:10 +00:00
Bruno Cardoso Lopes	3b86598cfa	Instead of always leaving the work to the generic legalizer when there is no support for native 256-bit shuffles, be smarter in some cases, for example, when you can extract specific 128-bit parts and use regular 128-bit shuffles for them. Example: For this shuffle: shufflevector <4 x i64> %a, <4 x i64> %b, <4 x i32> <i32 1, i32 0, i32 7, i32 6> This was expanded to: vextractf128 $1, %ymm1, %xmm2 vpextrq $0, %xmm2, %rax vmovd %rax, %xmm1 vpextrq $1, %xmm2, %rax vmovd %rax, %xmm2 vpunpcklqdq %xmm1, %xmm2, %xmm1 vpextrq $0, %xmm0, %rax vmovd %rax, %xmm2 vpextrq $1, %xmm0, %rax vmovd %rax, %xmm0 vpunpcklqdq %xmm2, %xmm0, %xmm0 vinsertf128 $1, %xmm1, %ymm0, %ymm0 ret Now we get: vshufpd $1, %xmm0, %xmm0, %xmm0 vextractf128 $1, %ymm1, %xmm1 vshufpd $1, %xmm1, %xmm1, %xmm1 vinsertf128 $1, %xmm1, %ymm0, %ymm0 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137733 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-16 18:21:54 +00:00
Bruno Cardoso Lopes	8400bfe9fa	While I'm here, remove the "_alt" hacks to a series of INSERT_SUBREG and also add the AVX versions of the 128-bit patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137685 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-15 23:36:51 +00:00
Bruno Cardoso Lopes	1deddbbd56	Reorder declarations of vmovmskp* and also put the necessary AVX predicate and TB encoding fields. This fixes the encoding for the attached testcase. This fixes PR10625. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137684 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-15 23:36:45 +00:00
Jim Grosbach	19cb7f491f	MCTargetAsmParser target match predicate support. Allow a target assembly parser to do context sensitive constraint checking on a potential instruction match. This will be used, for example, to handle Thumb2 IT block parsing. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137675 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-15 23:03:29 +00:00
Bruno Cardoso Lopes	50b37c7920	Fix PR10656. It's only profitable to use 128-bit inserts and extracts when AVX mode is on. Otherwise it is just more work for the type legalizer. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137661 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-15 21:45:54 +00:00
Bruno Cardoso Lopes	4002d7e1e6	Fix comment! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137521 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-12 21:54:42 +00:00
Bruno Cardoso Lopes	53cae1362d	VPERM2F128 is an AVX instruction which permutes between two 256-bit vectors. It operates on 128-bit elements instead of regular scalar types. Recognize shuffles that are suitable for VPERM2F128 and teach the x86 legalizer how to handle them. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137519 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-12 21:48:26 +00:00
Bruno Cardoso Lopes	fa2f4fd9a2	Move code around and add comments git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137518 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-12 21:48:22 +00:00
Duncan Sands	1f6a329f79	Silence a bunch (but not all) "variable written but not read" warnings when building with assertions disabled. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137460 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-12 14:54:45 +00:00
Andrew Trick	32a183c84a	findDeadCallerSavedReg fix: Missing NULL terminator in register arrays. Fix by Ivan Baev. Sorry I don't have a unit test, but the fix is obvious so I don't want to delay it. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137404 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-12 00:49:19 +00:00
Bruno Cardoso Lopes	ef8d6999f3	Add a dag combine to xform 256-bit shuffles into simple vector inserts and extracts. This simple combine makes us generate only 1 instruction instead of 11 in the v8 case. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137362 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 21:50:44 +00:00
Bruno Cardoso Lopes	59353b436a	Fix PR10492 by teaching MOVHLPS and MOVLPS mask matching to be more strict. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137324 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 18:59:13 +00:00
Nadav Rotem	6236f7f2b6	Add a comment, per Bruno's CR. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137313 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 17:05:47 +00:00
Nadav Rotem	5e742a3e1b	[AVX] If the data which is going to be saved is already in two XMM registers (for example, after an integer operation), do not pack the registers into a YMM before saving. It's better to save as two XMM registers. Before: vinsertf128 $1, %xmm3, %ymm0, %ymm3 vinsertf128 $0, %xmm1, %ymm3, %ymm1 vmovaps %ymm1, 416(%rsp) After: vmovaps %xmm3, 416+16(%rsp) vmovaps %xmm1, 416(%rsp) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137308 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 16:41:21 +00:00
Bruno Cardoso Lopes	b02c0ace20	Cleanup: Remove Int_ CVTSS2SI* forms git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137297 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 02:52:36 +00:00
Bruno Cardoso Lopes	5f1d8abf75	Splats for v8i32/v8f32 can be handled by VPERMILPSY. This was causing infinite recursive calls in legalize. Fix PR10562 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137296 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 02:49:44 +00:00
Bruno Cardoso Lopes	a5134a0ea3	Use the splat index to generate the desired shuffle. Otherwise we could only get undefs and the vector shuffle becomes an undef, generating wrong code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137295 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 02:49:41 +00:00
Eli Friedman	586272d67c	Fix X86TargetLowering::LowerExternalSymbol so that it actually works in non-trivial cases. This hasn't been an issue before because the function isn't normally called (but apparently is used to generate a tail-call to sin() on ELF x86-32 with PIC and SSE2). Fixes PR9693. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137292 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 01:48:05 +00:00
Nadav Rotem	614061bfb4	When performing a truncating store, it is sometimes possible to rearrange the data in-register prior to saving to memory. When we reorder the data in memory we prevent the need to save multiple scalars to memory, making a single regular store. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137238 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-10 19:30:14 +00:00
Bruno Cardoso Lopes	6ad251358e	The following X86 pattern is incorrect: def : Pat<(X86Movss VR128:$src1, (bc_v4i32 (v2i64 (load addr:$src2)))), (MOVLPSrm VR128:$src1, addr:$src2)>; This matches a MOVSS dag with a MOVLPS instruction. However, MOVSS will replace only the low 32 bits of the register, while the MOVLPS instruction will replace the low 64 bits. A testcase is added to illustrate the bug, and the one that was already present is modified. Patch by Tanya Lattner. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137227 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-10 17:45:17 +00:00
Bruno Cardoso Lopes	155a92a491	Fix a bug in vpermilps mask checking. Fix PR10560 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137194 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-10 01:54:17 +00:00
Bruno Cardoso Lopes	d40aa24ebf	Add 256-bit support for v8i32, v4i64 and v4f64 ISD::SELECT. Fix PR10556 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137179 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 23:27:13 +00:00
Bruno Cardoso Lopes	18deb04e9c	Add v16i16 and v32i8 store patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137166 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 22:39:53 +00:00
Bruno Cardoso Lopes	cde4a1abd5	Use fp unpack instructions to unpack int types. Until we have AVX2, this is the best we can do for these patterns. This fixes PR10554. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137161 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 22:18:37 +00:00
Eli Friedman	fc430a662f	Fix a couple ridiculous copy-paste errors. rdar://9914773 . git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137160 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 22:17:39 +00:00
Bruno Cardoso Lopes	e2406dfd89	Reapply a more appropriate solution than in r137114. AVX supports v4f64 = sitofp v4i32. This fixes PR10559. Also add support for v4i32 = fptosi v4f64. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137128 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 17:39:13 +00:00
Bruno Cardoso Lopes	a511b8e519	Revert r137114 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137127 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 17:39:01 +00:00
Bruno Cardoso Lopes	e321d7ffc5	Handle sitofp between v4f64 <- v4i32. Fix PR10559 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137114 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 05:48:01 +00:00

7809 Commits