llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-07-29 11:29:39 +00:00

Author	SHA1	Message	Date
Craig Topper	58bbb81764	Remove filter that was preventing MOVDQU/MOVDQA and their VEX forms from being disassembled. Also added encodings for the other register/register form of these instructions. Fixes PR10848. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139588 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 06:54:58 +00:00
Craig Topper	6b0b2d6c41	Fix encoding of VMOVDQU to not simultaneously be 'TB OpSize' and 'XS'. 'XS' is correct and seems to have been taking priority. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139587 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 06:39:34 +00:00
Eli Friedman	f73c881f4a	Fix the assembler strings for a couple of atomic instructions. Doesn't really matter much in practice, but it's a bit cleaner. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139563 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-13 00:27:04 +00:00
Bruno Cardoso Lopes	5fc48100ee	Fix PR10845. SUBREG_TO_REG shouldn't be used when the input and destination types are equal! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139553 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 22:59:23 +00:00
Bruno Cardoso Lopes	457d53d9ce	Revert the wrong part of r139528, and fix testcases. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139541 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 21:24:07 +00:00
Bruno Cardoso Lopes	8e03a821f9	Not sure how CMPPS and CMPPD ever worked; I guess they didn't. With this fix they do. The operand order for the x86 target-specific node is not the same as the instruction's, but since the intrinsics need that specific order in the instruction definition, just change the order during legalization. Also, there were some wrong inversions of condition codes, such as GE => LE and GT => LT; fix those too. Fix PR10907. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139528 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 19:30:40 +00:00
Bruno Cardoso Lopes	93474f5f7f	Organize a bit the operand names for CMPPS and CMPPD git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139527 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 19:30:36 +00:00
Bruno Cardoso Lopes	cf355422d6	Realign BLEND patterns to match the general style for patterns in .td file. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139526 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 19:30:33 +00:00
Bruno Cardoso Lopes	3445df77d4	Fix 80-columns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139525 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 19:30:29 +00:00
Nadav Rotem	5ed0983200	Format patterns, remove unused X86blend patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139491 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-12 08:41:50 +00:00
Craig Topper	136046c9a2	Fix disassembling of one of the register/register forms of MOVUPS/MOVUPD/MOVAPS/MOVAPD/MOVSS/MOVSD and their VEX equivalents. Fixes PR10877. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139486 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-11 23:19:54 +00:00
Craig Topper	038197988b	Fix disassembling of reverse register/register forms of ADD/SUB/XOR/OR/AND/SBB/ADC/CMP/MOV. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139485 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-11 21:41:45 +00:00
Nadav Rotem	fbad25e120	CR fixes per Bruno's request. Undo the changes from r139285 which added custom lowering to vselect. Add tablegen lowering for vselect. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139479 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-11 15:02:23 +00:00
Eli Friedman	106f6e7a27	r139454 activates an assert in a case where we were doing the right thing anyway. Make that explicit, and un-XFAIL the testcase. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139458 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-10 02:01:42 +00:00
Richard Trieu	81cbb0ad60	Fix the asserts in lib/Target/X86/X86ELFWriterInfo.cpp and lib/ExecutionEngine/MCJIT/MCJIT.cpp from: assert("error"); to: assert(0 && "error"); git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139456 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-10 01:42:07 +00:00
Richard Trieu	2db8628085	Fixed an assert from: assert("not implemented for target shuffle node"); to: assert(0 && "not implemented for target shuffle node"); This causes a test failure in CodeGen/X86/palignr.ll which has been marked as XFAIL for the time being. Test failure filed at PR10901. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139454 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-10 01:26:21 +00:00
Nadav Rotem	8ffad56f8e	Implement vector-select support for avx256. Refactor the vblend implementation to have tablegen match the instruction by the node type git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139400 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-09 20:29:17 +00:00
Craig Topper	ccfa4ed4e0	Fix Intel syntax disassembly of movs and stos so the output is no longer blank. Also fix scas and cmps to always print the size suffix in Intel syntax, since it is ambiguous without arguments. Fixes PR10875. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139353 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-09 05:40:53 +00:00
Nadav Rotem	ee64be9c17	Fix the 80-column violations and remove unsupported v8i16 type from the list of legal vselect types. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139324 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 22:17:35 +00:00
Bruno Cardoso Lopes	7ec8fb8830	Add a AVX version of a simple i64 -> f64 bitcast. This could be triggered using llc with -O0, which wouldn't let it be folded and expose the lack of this pattern. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139320 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 21:52:33 +00:00
Bruno Cardoso Lopes	cbf479df8a	* Combines Alignment, AuxInfo, and TB_NOT_REVERSABLE flag into a single field (Flags), which is a bitwise OR of items from the TB_* enum. This makes it easier to add new information in the future. * Gives every static array an equivalent layout: { RegOp, MemOp, Flags } * Adds a helper function, AddTableEntry, to avoid duplication of the insertion code. * Renames TB_NOT_REVERSABLE to TB_NO_REVERSE. * Adds TB_NO_FORWARD, which is analogous to TB_NO_REVERSE, except that it prevents addition of the Reg->Mem entry. (This is going to be used by Native Client, in the next CL). Patch by David Meyer git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139311 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 18:35:57 +00:00
Bruno Cardoso Lopes	814c6ced85	Add AVX versions of blend vector operations and fix some issues noticed in Nadav's r139285 and r139287 commits. 1) Rename vsel.ll to a more descriptive name 2) Change the order of BLEND operands to "Op1, Op2, Cond", this is necessary because PBLENDVB is already used in different places with this order, and it was being emitted in the wrong way for vselect 3) Add AVX patterns and tests for the same SSE41 instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139305 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 18:05:08 +00:00
Bruno Cardoso Lopes	7db2d3a504	Fix PR10844: Add patterns to cover non foldable versions of X86vzmovl. Triggered using llc -O0. Also fix some SET0PS patterns to their AVX forms and test it on the testcase. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139304 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 18:05:02 +00:00
Nadav Rotem	ffe3e7da84	Add X86-SSE4 codegen support for vector-select. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139285 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-08 08:11:19 +00:00
Eli Friedman	d5ccb0558f	Fix atomic load and store on x86 to pass -verify-machineinstrs (and possibly fix some subtle bugs involving passes which check mayStore()). This isn't exactly ideal, but it is good enough for the moment. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139245 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-07 18:48:32 +00:00
James Molloy	b950585cc5	Refactor instprinter and mcdisassembler to take a SubtargetInfo. Add -mattr= handling to llvm-mc. Reviewed by Owen Anderson. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139237 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-07 17:24:38 +00:00
Rafael Espindola	ca59221fdc	Detect attempt to use segmented stacks on non ELF systems and error (not assert) early. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139233 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-07 16:10:57 +00:00
Bill Wendling	c8725d11f8	Reenable compact unwind by default. However, also emit the old version of unwind information for older linkers. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139206 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-06 23:47:14 +00:00
Rafael Espindola	5c984df26b	Fix comment. Noticed by Duncan. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139161 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-06 19:29:31 +00:00
Duncan Sands	28b77e968d	Add codegen support for vector select (in the IR this means a select with a vector condition); such selects become VSELECT codegen nodes. This patch also removes VSETCC codegen nodes, unifying them with SETCC nodes (codegen was actually often using SETCC for vector SETCC already). This ensures that various DAG combiner optimizations kick in for vector comparisons. Passes dragonegg bootstrap with no testsuite regressions (nightly testsuite as well as "make check-all"). Patch mostly by Nadav Rotem. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139159 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-06 19:07:46 +00:00
Rafael Espindola	96428cea3d	Fix style issues and typos found by Duncan. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139154 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-06 18:43:08 +00:00
Duncan Sands	4a544a79bd	Split the init.trampoline intrinsic, which currently combines GCC's init.trampoline and adjust.trampoline intrinsics, into two intrinsics like in GCC. While having one combined intrinsic is tempting, it is not natural because typically the trampoline initialization needs to be done in one function, and the result of adjust trampoline is needed in a different (nested) function. To get around this llvm-gcc hacks the nested function lowering code to insert an additional parent variable holding the adjust.trampoline result that can be accessed from the child function. Dragonegg doesn't have the luxury of tweaking GCC code, so it stored the result of adjust.trampoline in the memory GCC set aside for the trampoline itself (this is always available in the child function), and set up some new memory (using an alloca) to hold the trampoline. Unfortunately this breaks Go which allocates trampoline memory on the heap and wants to use it even after the parent has exited (!). Rather than doing even more hacks to get Go working, it seemed best to just use two intrinsics like in GCC. Patch mostly by Sanjoy Das. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139140 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-06 13:37:06 +00:00
Nick Lewycky	1fac6b50ea	Add a new MC bit for NaCl (Native Client) mode. NaCl requires that certain instructions are more aligned than the CPU requires, and adds some additional directives, to follow in future patches. Patch by David Meyer! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139125 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-05 21:51:43 +00:00
Benjamin Kramer	c53479d9c2	Use internal storage for command line option. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139079 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 03:45:06 +00:00
Bruno Cardoso Lopes	2c84e96d3e	Add AVX versions to match AESENC/AESDEC intrinsics. This hopefully ends the cycle of missing AVX counterparts of already present SSE* patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139073 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:08 +00:00
Bruno Cardoso Lopes	9f63615b17	Add AVX version of a SSE4.1 VPBLENDVB pattern git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139072 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:05 +00:00
Bruno Cardoso Lopes	d01ef7d978	Add AVX versions of SSE4.1 EXTRACTPS patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139071 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:03 +00:00
Bruno Cardoso Lopes	2b0e0a42d1	Add AVX versions for SSE4.1 MOVZX* patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139070 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:47:01 +00:00
Bruno Cardoso Lopes	a67806530c	Add one more AVX pattern for MOVZPQILo2PQI git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139069 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:58 +00:00
Bruno Cardoso Lopes	d29dd5ec9f	Move PUNPCKLQDQ splat pattern close to the instruction definition and duplicate it for AVX mode. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139068 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:56 +00:00
Bruno Cardoso Lopes	914a2a319c	Add AVX pattern versions for PSHUFB,PSIGN{B,W,D} git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139067 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:54 +00:00
Bruno Cardoso Lopes	a4ac989a1c	Add AVX versions of MOVZDI2PDI patterns. Use SUBREG_TO_REG to indicate that the AVX versions (even the 128-bit ones) all clear the upper part of the destination register. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139066 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:51 +00:00
Bruno Cardoso Lopes	152a287374	Enforce subtarget checks in a few places to be explicit when the pattern should be matched git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139065 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:49 +00:00
Bruno Cardoso Lopes	5ab6dcc4bb	Tidy up code moving patterns to their appropriate place! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139064 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:47 +00:00
Bruno Cardoso Lopes	0e59a04849	Add AVX versions of FsMOVAPS and FsMOVAPD. Teach X86InstrInfo how to use them! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139063 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:45 +00:00
Bruno Cardoso Lopes	645b8be38a	Teach X86FastISel to use AVX versions of instructions when possible git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139062 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:42 +00:00
Bruno Cardoso Lopes	1aab5515f6	Fix 80-column and style git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139061 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:40 +00:00
Bruno Cardoso Lopes	e4ccf8a86c	Tidy up some SSE/AVX convert intrinsics. Also add an AVX version of OptForSize pattern git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139060 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-03 00:46:38 +00:00
Jakob Stoklund Olesen	5047d76575	Pseudo CMOV instructions don't clobber EFLAGS. The explanation about a 0 argument being materialized as xor is no longer valid. Rematerialization will check if EFLAGS is live before clobbering it. The code produced by X86TargetLowering::EmitLoweredSelect does not clobber EFLAGS. This causes one less testb instruction to be generated in the cmov.ll test case. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139057 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 23:52:55 +00:00
Jakob Stoklund Olesen	b8e052e123	Check for EFLAGS live-out before clobbering it. It is only allowed to clobber EFLAGS at the end of a block if it isn't live-in to any successor. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139056 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 23:52:52 +00:00
Jakob Stoklund Olesen	4a1b9d82a4	Use existing function. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139055 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 23:52:49 +00:00
Jakob Stoklund Olesen	439f71eb30	Remove unused variables. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139047 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 22:41:25 +00:00
Eli Friedman	4136d23c48	Don't fast-isel for atomic load/store; some cases require extra handling missing from fast-isel. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139044 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 22:33:24 +00:00
Kevin Enderby	d5705fe50d	Change X86 disassembly to print immediate values as signed by default. Special-case those instructions whose immediate is not sign-extended. radr://8795217 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139028 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 20:01:23 +00:00
Bill Wendling	d199aa012b	Revert r138826 until PR10834 can be fixed. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@139018 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-02 18:15:04 +00:00
Bruno Cardoso Lopes	a39ccdb9d4	Fix vbroadcast matching logic to early unmatch if the node doesn't have only one use. Fix PR10825. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138951 91177308-0d34-0410-b5e6-96231b3b80d8	2011-09-01 18:15:06 +00:00
Bruno Cardoso Lopes	fc7bc5889b	Move more code around and duplicate AVX patterns: MOVHPS and MOVLPS git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138897 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 21:15:32 +00:00
Bruno Cardoso Lopes	06c982d0e0	Move MOVAPS,MOVUPS patterns close to the instructions definition git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138896 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 21:15:29 +00:00
Bruno Cardoso Lopes	453f4954f2	Remove "_Int" forms of MOVUPSmr and MOVAPSmr git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138895 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 21:15:22 +00:00
Rafael Espindola	e81abfd30b	Spelling and grammar fixes to problems found by Duncan. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138858 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 16:43:33 +00:00
Eli Friedman	ac86d43eae	Make sure we don't crash when -miphoneos-version-min is specified on x86. Hopefully this will fix gcc testsuite failures. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138856 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 16:19:51 +00:00
Eric Christopher	c967ad8c88	Rework this conditional a bit. Patch by Sanjoy Das git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138853 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 04:17:21 +00:00
Bruno Cardoso Lopes	57d6a5e491	- Move all MOVSS and MOVSD patterns close to their definitions - Duplicate some store patterns to their AVX forms! - Caught a bug while restricting the patterns' subtarget; fix it and update a testcase to check it properly git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138851 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 03:04:20 +00:00
Bruno Cardoso Lopes	fc646a6b06	Remove unnecessary AVX checks git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138850 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 03:04:14 +00:00
Bruno Cardoso Lopes	5affa5196f	Teach more places to use VMOVAPS,VMOVUPS instead of MOVAPS,MOVUPS, whenever AVX is enabled. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138849 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 03:04:09 +00:00
Evan Cheng	0899f5c62d	Fix (movhps load) lowering / pattern to match more cases. rdar://10050549 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138848 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-31 02:05:24 +00:00
Bill Wendling	e716124feb	Fix off-by-one error Benjamin noticed. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138832 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 21:23:24 +00:00
Bill Wendling	011a8e1684	Enable compact unwind info by default. This only applies to Darwin when CFI is disabled. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138826 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 20:54:11 +00:00
Jeffrey Yasskin	cda2a146d1	Fix C++0x narrowing errors when char is unsigned. In the case of EDInstInfo, this would actually cause a bug when -1 became 255 and was then compared >=0 in llvm-mc/Disassembler.cpp. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138825 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 20:53:29 +00:00
Rafael Espindola	151ab3e2f7	Adds support for variable sized allocas. For a variable sized alloca, code is inserted to first check if the current stacklet has enough space. If so, space is allocated by simply decrementing the stack pointer. Otherwise a runtime routine (__morestack_allocate_stack_space in libgcc) is called which allocates the required memory from the heap. Patch by Sanjoy Das. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138818 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 19:47:04 +00:00
Rafael Espindola	d07b7ec772	Adds a SelectionDAG node X86SegAlloca which will be custom lowered from DYNAMIC_STACKALLOC. Two new pseudo instructions (SEG_ALLOCA_32 and SEG_ALLOCA_64) which will match X86SegAlloca (based on word size) are also added. They will be custom emitted to inject the actual stack handling code. Patch by Sanjoy Das. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138814 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 19:43:21 +00:00
Rafael Espindola	76927d7586	Emit segmented-stack specific code into function prologues for X86. Modify the pass added in the previous patch to call this new code. The new prologues call a libgcc routine (__morestack) to allocate more stack space from the heap when required. Patch by Sanjoy Das. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138812 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-30 19:39:58 +00:00
Eli Friedman	f3704769bb	Explicitly zero out parts of a vector which are required to be zero by the algorithm in LowerUINT_TO_FP_i32. This only has a substantial effect on the generated code when the input is extracted from a vector register; other ways of loading an i32 do the appropriate zeroing implicitly. Fixes PR10802. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138768 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-29 21:15:46 +00:00
Bruno Cardoso Lopes	41dfabb0e3	Move non-instruction patterns to a more appropriate place! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138744 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-29 17:51:24 +00:00
Nicolas Geoffray	1c36ba50ac	Remove premature previous commit. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138725 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-28 14:52:51 +00:00
Nicolas Geoffray	c98da24bed	Encoding of instructions referencing segments has changed. Do what X86MCCodeEmitter does. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138723 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-28 13:07:57 +00:00
Benjamin Kramer	2753ae314f	Silence GCC warnings and make an array const. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138706 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-27 17:36:14 +00:00
Eli Friedman	43f51aeca8	Add support for generating CMPXCHG16B on x86-64 for the cmpxchg IR instruction. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138660 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-26 21:21:21 +00:00
Craig Topper	8fd13b6de5	Fix disassembling of VCVTSD2SI git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138623 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-26 04:49:29 +00:00
Bruno Cardoso Lopes	f1a264232c	Do the same as r138461. Mark VZEROALL as clobbering all YMM registers git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138592 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 22:23:58 +00:00
Bruno Cardoso Lopes	6292eceea0	Add support for AVX 256-bit version of MOVDDUP! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138588 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 21:40:37 +00:00
Bruno Cardoso Lopes	06ef923d14	Make isMOVDDUP mask check more strict and update comments! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138587 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 21:40:34 +00:00
Craig Topper	ebc1db0fac	Add more missing TB encodings to VEX instructions to allow them to be disassembled. Fixes remainder of PR10678. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138553 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 08:11:01 +00:00
Craig Topper	ea03659d23	Add TB encoding to VZEROALL, VZEROUPPER, and VCVTPS2PD to allow them to be disassembled. Fixes PR10723. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138551 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 06:57:46 +00:00
Bruno Cardoso Lopes	07b7f672a0	Add support for 256-bit versions of VSHUFPD and VSHUFPS. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138546 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 02:58:26 +00:00
Bruno Cardoso Lopes	e7461c0353	Add memory version of SHUFPD to mask decoding! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138545 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-25 02:58:21 +00:00
Bruno Cardoso Lopes	27831e5e6f	Create a section for non-instructions patterns in the beginning of the file, and move more code around! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138521 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:18:11 +00:00
Bruno Cardoso Lopes	9993499057	Move code around! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138520 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:18:09 +00:00
Bruno Cardoso Lopes	de79231468	Organize UNPCK* patterns, also add remaining for AVX. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138519 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:18:06 +00:00
Bruno Cardoso Lopes	4cf4778ac4	Move remaining MOVDDUP patterns close to MOVDDUP definition and duplicate the missing ones for AVX. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138518 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:18:04 +00:00
Bruno Cardoso Lopes	4724f25ed6	Organize and tidy up MOVDDUP section. Also update comments! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138517 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:18:02 +00:00
Bruno Cardoso Lopes	6140294363	Move MOVHLPS patterns close to MOVHLPS definition, and duplicate the pattern for 128-bit AVX mode. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138516 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:17:59 +00:00
Bruno Cardoso Lopes	954d5eabb7	Move all PSHUF* patterns close to the PSHUF* definitions. Also be explicit about which subtarget they refer to, and add AVX versions of the ones we currently don't. Remove old and now wrong comments! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138515 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:17:57 +00:00
Bruno Cardoso Lopes	af002d8405	Move all SHUFP* patterns close to the SHUFP* definitions. Also be explicit about which subtarget they refer to, and add AVX versions of the ones we currently don't. Make the mask check more strict, to be clear it won't be used to match to 256-bit versions! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138514 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 23:17:55 +00:00
Eli Friedman	f8f90f0174	Hook up 64-bit atomic load/store on x86-32. I plan to write more efficient implementations eventually. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138505 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 22:33:28 +00:00
Eli Friedman	4317fe1fc6	Fix whitespace. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138487 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 21:17:30 +00:00
Eli Friedman	327236cd6c	Basic x86 code generation for atomic load and store instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138478 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 20:50:09 +00:00
Bruno Cardoso Lopes	356e988110	Mark VZEROALL as clobbering all YMM registers git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138461 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 18:48:33 +00:00
Evan Cheng	3e74d6fdd2	Move TargetRegistry and TargetSelect from Target to Support where they belong. These are strictly utilities for registering targets and components. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138450 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 18:08:43 +00:00
Craig Topper	13894fa135	Break 256-bit vector int add/sub/mul into two 128-bit operations to avoid costly scalarization. Fixes PR10711. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138427 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-24 06:14:18 +00:00
Bruno Cardoso Lopes	d8b7dd5252	Fix a nasty bug where a v4i64 was being wrongly emitted with 32-bit permutations. Also tidy up some patterns and move them close to their instruction definitions! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138392 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-23 22:06:37 +00:00
Evan Cheng	7801136b95	Some refactoring so TargetRegistry.h no longer has to include any files from MC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138367 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-23 20:15:21 +00:00
Nick Lewycky	726ebd6ff3	PerformSubCombine to work on integers larger than i128. Fixes a crasher. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138354 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-23 19:01:24 +00:00
Craig Topper	a534780da0	Add support for breaking 256-bit v16i16 and v32i8 VSETCC into two 128-bit ones, avoiding scalarization. Add vex form of pcmpeqq and pcmpgtq. Fixes more cases for PR10712. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138321 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-23 04:36:33 +00:00
Bruno Cardoso Lopes	3bde6fe0df	Introduce a pass to insert vzeroupper instructions to avoid AVX to SSE transition penalty. The pass is enabled through the "x86-use-vzeroupper" llc command line option. This is only the first step (very naive and conservative one) to sketch out the idea, but proper DFA is coming next to allow smarter decisions. Comments and ideas now and in further commits will be very appreciated. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138317 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-23 01:14:17 +00:00
Benjamin Kramer	3c1fece071	X86: Add some operand types required to identify calls. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138285 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-22 22:55:32 +00:00
Bruno Cardoso Lopes	2ac8111159	Add support for breaking 256-bit int VSETCC into two 128-bit ones, avoiding scalarization of the compare. Reduces code from 59 to 6 instructions. Fix PR10712. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138271 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-22 20:31:04 +00:00
Bruno Cardoso Lopes	bde9f1b302	Add 128-bit AVX codegen for PCMP* family of integer instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138270 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-22 20:31:00 +00:00
Bruno Cardoso Lopes	0c9acfcb50	Rewrite part of the VEX encoding logic to make it easier to read! Also fix a bug and add a testcase! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138123 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-19 22:27:29 +00:00
Craig Topper	e004d941ec	Add TB encoding to VEX versions of SSE fp logical operations to fix disassembler git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138034 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-19 05:28:50 +00:00
Bruno Cardoso Lopes	863e0f25b7	Fix PR10677. Initial patch and idea by Peter Cooper but I've changed the implementation! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138029 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-19 02:23:56 +00:00
Bruno Cardoso Lopes	df01610d6f	Re-encoded 128-bit AVX versions of SQRT, RSQRT, RCP have 3 operands instead of 2. They were already defined this way in their regular version, but not for the intrinsics versions (_Int), and that would work for assembly emission but not for object code, since a MachineOperand would be missing. This commit fixes PR10697. Also removed the {VSQRT,VRSQRT,VRCP}r_Int forms and match the intrinsic via INSERT_SUBREG+EXTRACT_SUBREG patterns. The same couldn't be done for memory versions because sse_load_f32/sse_load_f64 operands need special handling and don't work like regular "addr" operands. There are right now 114 "_Int" and 98 "Int_*" forms! I'm slowly removing them as I step through, but hope we can get rid of these someday, they are really annoying :) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@138012 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-18 23:59:21 +00:00
Bruno Cardoso Lopes	24b90e2287	Cleanup vector logical ops in AVX and add use int versions for simple v2i64 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137919 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-18 02:11:34 +00:00
Bruno Cardoso Lopes	0dd80b0d69	Fix PR10688. Add support for splitting 256-bit vector shifts when the shift amount is variable git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137885 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-17 22:12:20 +00:00
Owen Anderson	83e3f67fb6	Allow the MCDisassembler to return a "soft fail" status code, indicating an instruction that is disassemblable, but invalid. Only used for ARM UNPREDICTABLE instructions at the moment. Patch by James Molloy. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137830 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-17 17:44:15 +00:00
Bruno Cardoso Lopes	0e6d230abd	Introduce matching patterns for vbroadcast AVX instruction. The idea is to match splats in the form (splat (scalar_to_vector (load ...))) whenever the load can be folded. All the logic and instruction emission is working but because of PR8156, there is no way to match loads, because they can never be folded for splats. Thus, the tests are XFAILed, but I've tested and exercised all the logic using a relaxed version for checking the foldable loads, as if the bug was already fixed. This should work out of the box once PR8156 gets fixed since MayFoldLoad will work as expected. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137810 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-17 02:29:19 +00:00
Bruno Cardoso Lopes	8a5b262e80	Update comments about vector splat handling in x86 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137808 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-17 02:29:13 +00:00
Bruno Cardoso Lopes	fc0a702128	Now that we have a canonical way to handle 256-bit splats: vinsertf128 $1 + vpermilps $0, remove the old code that used to first do the splat in a 128-bit vector and then insert it into a larger one. This is better because the handling code gets simpler and also makes a better room for the upcoming vbroadcast! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137807 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-17 02:29:10 +00:00
Bruno Cardoso Lopes	3b86598cfa	Instead of always leaving the work to the generic legalizer when there is no support for native 256-bit shuffles, be more smart in some cases, for example, when you can extract specific 128-bit parts and use regular 128-bit shuffles for them. Example: For this shuffle: shufflevector <4 x i64> %a, <4 x i64> %b, <4 x i32> <i32 1, i32 0, i32 7, i32 6> This was expanded to: vextractf128 $1, %ymm1, %xmm2 vpextrq $0, %xmm2, %rax vmovd %rax, %xmm1 vpextrq $1, %xmm2, %rax vmovd %rax, %xmm2 vpunpcklqdq %xmm1, %xmm2, %xmm1 vpextrq $0, %xmm0, %rax vmovd %rax, %xmm2 vpextrq $1, %xmm0, %rax vmovd %rax, %xmm0 vpunpcklqdq %xmm2, %xmm0, %xmm0 vinsertf128 $1, %xmm1, %ymm0, %ymm0 ret Now we get: vshufpd $1, %xmm0, %xmm0, %xmm0 vextractf128 $1, %ymm1, %xmm1 vshufpd $1, %xmm1, %xmm1, %xmm1 vinsertf128 $1, %xmm1, %ymm0, %ymm0 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137733 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-16 18:21:54 +00:00
Bruno Cardoso Lopes	8400bfe9fa	While I'm here, remove the "_alt" hacks to a series of INSERT_SUBREG and also add the AVX versions of the 128-bit patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137685 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-15 23:36:51 +00:00
Bruno Cardoso Lopes	1deddbbd56	Reorder declarations of vmovmskp* and also put the necessary AVX predicate and TB encoding fields. This fixes the encoding for the attached testcase. This fixes PR10625. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137684 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-15 23:36:45 +00:00
Jim Grosbach	19cb7f491f	MCTargetAsmParser target match predicate support. Allow a target assembly parser to do context sensitive constraint checking on a potential instruction match. This will be used, for example, to handle Thumb2 IT block parsing. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137675 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-15 23:03:29 +00:00
Bruno Cardoso Lopes	50b37c7920	Fix PR10656. It's only profitable to use 128-bit inserts and extracts when AVX mode is on. Otherwise it is just more work for the type legalizer. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137661 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-15 21:45:54 +00:00
Bruno Cardoso Lopes	4002d7e1e6	Fix comment! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137521 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-12 21:54:42 +00:00
Bruno Cardoso Lopes	53cae1362d	The VPERM2F128 is a AVX instruction which permutes between two 256-bit vectors. It operates on 128-bit elements instead of regular scalar types. Recognize shuffles that are suitable for VPERM2F128 and teach the x86 legalizer how to handle them. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137519 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-12 21:48:26 +00:00
Bruno Cardoso Lopes	fa2f4fd9a2	Move code around and add comments git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137518 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-12 21:48:22 +00:00
Duncan Sands	1f6a329f79	Silence a bunch (but not all) "variable written but not read" warnings when building with assertions disabled. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137460 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-12 14:54:45 +00:00
Andrew Trick	32a183c84a	findDeadCallerSavedReg fix: Missing NULL terminator in register arrays. Fix by Ivan Baev. Sorry I don't have a unit test, but the fix is obvious so I don't want to delay it. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137404 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-12 00:49:19 +00:00
Bruno Cardoso Lopes	ef8d6999f3	Add a dag combine to xform 256-bit shuffles into simple vector inserts and extracts. This simple combine makes us generate only 1 instruction instead of 11 in the v8 case. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137362 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 21:50:44 +00:00
Bruno Cardoso Lopes	59353b436a	Fix PR10492 by teaching MOVHLPS and MOVLPS mask matching to be more strict. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137324 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 18:59:13 +00:00
Nadav Rotem	6236f7f2b6	Add a comment, per Bruno's CR. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137313 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 17:05:47 +00:00
Nadav Rotem	5e742a3e1b	[AVX] If the data which is going to be saved is already in two XMM registers (for example, after integer operation), do not pack the registers into a YMM before saving. It's better to save as two XMM registers. Before: vinsertf128 $1, %xmm3, %ymm0, %ymm3 vinsertf128 $0, %xmm1, %ymm3, %ymm1 vmovaps %ymm1, 416(%rsp) After: vmovaps %xmm3, 416+16(%rsp) vmovaps %xmm1, 416(%rsp) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137308 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 16:41:21 +00:00
Bruno Cardoso Lopes	b02c0ace20	Cleanup: Remove Int_ CVTSS2SI* forms git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137297 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 02:52:36 +00:00
Bruno Cardoso Lopes	5f1d8abf75	Splats for v8i32/v8f32 can be handled by VPERMILPSY. This was causing infinite recursive calls in legalize. Fix PR10562 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137296 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 02:49:44 +00:00
Bruno Cardoso Lopes	a5134a0ea3	Use the splat index to generate the desired shuffle. Otherwise we could only get undefs and the vector shuffle becomes an undef, generating wrong code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137295 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 02:49:41 +00:00
Eli Friedman	586272d67c	Fix X86TargetLowering::LowerExternalSymbol so that it actually works in non-trivial cases. This hasn't been an issue before because the function isn't normally called (but apparently is used to generate a tail-call to sin() on ELF x86-32 with PIC and SSE2). Fixes PR9693. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137292 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-11 01:48:05 +00:00
Nadav Rotem	614061bfb4	When performing a truncating store, it is sometimes possible to rearrange the data in-register prior to saving to memory. When we reorder the data in memory we prevent the need to save multiple scalars to memory, making a single regular store. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137238 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-10 19:30:14 +00:00
Bruno Cardoso Lopes	6ad251358e	The following X86 pattern is incorrect: def : Pat<(X86Movss VR128:$src1, (bc_v4i32 (v2i64 (load addr:$src2)))), (MOVLPSrm VR128:$src1, addr:$src2)>; This matches a MOVSS dag with a MOVLPS instruction. However, MOVSS will replace only the low 32 bits of the register, while the MOVLPS instruction will replace the low 64 bits. A testcase is added and illustrates the bug and also modified the one that was already present. Patch by Tanya Lattner. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137227 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-10 17:45:17 +00:00
Bruno Cardoso Lopes	155a92a491	Fix a bug in vpermilps mask checking. Fix PR10560 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137194 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-10 01:54:17 +00:00
Bruno Cardoso Lopes	d40aa24ebf	Add 256-bit support for v8i32, v4i64 and v4f64 ISD::SELECT. Fix PR10556 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137179 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 23:27:13 +00:00
Bruno Cardoso Lopes	18deb04e9c	Add v16i16 and v32i8 store patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137166 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 22:39:53 +00:00
Bruno Cardoso Lopes	cde4a1abd5	Use fp unpack instructions to unpack int types. Until we have AVX2, this is the best we can do for these patterns. This fixes PR10554. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137161 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 22:18:37 +00:00
Eli Friedman	fc430a662f	Fix a couple ridiculous copy-paste errors. rdar://9914773 . git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137160 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 22:17:39 +00:00
Bruno Cardoso Lopes	e2406dfd89	Reapply a more appropriate solution than in r137114. AVX supports v4f64 = sitofp v4i32. This fixes PR10559. Also add support for v4i32 = fptosi v4f64. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137128 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 17:39:13 +00:00
Bruno Cardoso Lopes	a511b8e519	Revert r137114 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137127 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 17:39:01 +00:00
Bruno Cardoso Lopes	e321d7ffc5	Handle sitofp between v4f64 <- v4i32. Fix PR10559 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137114 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 05:48:01 +00:00
Bruno Cardoso Lopes	2f613c5fff	Add support for avx vector fextend git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137105 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 03:04:29 +00:00
Bruno Cardoso Lopes	a1dfb63b78	Add AVX versions of 128-bit sitofp and fptosi git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137104 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 03:04:25 +00:00
Bruno Cardoso Lopes	e5118ab7bb	Add two patterns to match special vmovss and vmovsd cases. Also fix the patterns already there to be more strict regarding the predicate. This fixes PR10558 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137100 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 01:43:09 +00:00
Bruno Cardoso Lopes	0f0e0a0e58	Make LowerVSETCC aware of AVX types and add patterns to match them. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137090 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-09 00:46:57 +00:00
Bruno Cardoso Lopes	328a9d4a0f	Add support for several vector shifts operations while in AVX mode. Fix PR10581 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137067 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-08 21:31:08 +00:00
Jakob Stoklund Olesen	2df3f58a0b	Hoist hasLoadFromStackSlot and hasStoreToStackSlot. These methods are target-independent since they simply scan the memory operands. They can live in TargetInstrInfoImpl. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137063 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-08 20:53:24 +00:00
Jakob Stoklund Olesen	66b0f515d5	Don't clobber pending ST regs when FP regs are killed. X86FloatingPoint keeps track of pending ST registers for an upcoming inline asm instruction with fixed stack register constraints. It does this by remembering which FP register holds the value that should appear at a fixed stack position for the inline asm. When that FP register is killed before the inline asm, make sure to duplicate it to a scratch register, so the ST register still has a live FP reference. This could happen when the same FP register was copied to two ST registers, or when a spill instruction is inserted between the ST copy and the inline asm. This fixes PR10602. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@137050 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-08 17:15:43 +00:00
Chandler Carruth	8d8fa2506d	Silence unused variable warnings in release builds. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136956 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-05 01:08:21 +00:00
Jason W Kim	4dd963b196	Fix http://llvm.org/bugs/show_bug.cgi?id=10583 - test for 1 and 2 byte fixups to be added git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136954 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-05 00:53:03 +00:00
Evan Cheng	dd5663c8e4	Fix an obvious typo. Patch by Ivan Krasin. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136899 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-04 18:38:15 +00:00
Duncan Sands	8036586229	Add obviously missing "break". Noticed by Andrey Karpov with the PVS-studio tool. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136878 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-04 15:45:59 +00:00
Jason W Kim	e651983e71	Fix http://llvm.org/bugs/show_bug.cgi?id=10568 Move the reloc size assert into AsmBackend - where it is more apropos. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136855 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-04 00:38:45 +00:00
Bill Wendling	456a925c61	Only access both operands of an INSERT_SUBVECTOR if it is an INSERT_SUBVECTOR. Fixes PR10527. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136853 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-04 00:32:58 +00:00
Benjamin Kramer	1488f76ed9	Remove unused variables. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136803 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-03 19:53:48 +00:00
Jakob Stoklund Olesen	56e3232d5a	Handle IMPLICIT_DEF instructions in X86FloatingPoint. This fixes PR10575. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136787 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-03 16:33:19 +00:00
Eli Friedman	6cdc1f43e6	Don't create a ridiculous EXTRACT_ELEMENT. PR10563. The testcase looks extremely fragile, so I'm adding an assertion which should catch any cases like this. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136711 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-02 18:38:35 +00:00
Bruno Cardoso Lopes	ac5f13fe3f	Make this kind of lowering to be supported by 256-bit instructions: shuffle (scalar_to_vector (load (ptr + 4))), undef, <0, 0, 0, 0> To: shuffle (vload ptr), undef, <1, 1, 1, 1> Fix PR10494 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136691 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-02 16:06:18 +00:00
Nick Lewycky	3207c9a440	Bail from FastISel when we encounter a volatile memset intrinsic. Patch by Ivan Krasin! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136663 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-02 00:40:16 +00:00
Bruno Cardoso Lopes	55244ceac4	Add v4f64 -> v2f32 fp_round support. Also add a testcase to exercise the legalizer. This commit together with the two previous ones fixes PR10495. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136654 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-01 21:54:09 +00:00
Bruno Cardoso Lopes	aed890bee0	Teach PreprocessISelDAG to be aware of vector types and to not process them. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136653 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-01 21:54:05 +00:00
Bruno Cardoso Lopes	8af2451679	Lower CONCAT_VECTORS to use two VINSERTF128 instructions instead of using a stack store. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136652 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-01 21:54:02 +00:00
Bruno Cardoso Lopes	531f19f767	Since vectors with all ones can't be created with a 256-bit instruction, avoid returning early for v8i32 types, which would only be valid for vector with all zeros. Also split the handling of zeros and ones into separate checking logic since they are handled differently. This fixes PR10547 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136642 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-01 19:51:53 +00:00
Douglas Gregor	32ab312e3f	Update CMake target names for tablegen-generated data in the X86 and ARM targets. This should fix the CMake build with MSVC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136621 91177308-0d34-0410-b5e6-96231b3b80d8	2011-08-01 16:29:27 +00:00
Eli Friedman	55ba816883	Misc optimizer+codegen work for 'cmpxchg' and 'atomicrmw'. They appear to be working on x86 (at least for trivial testcases); other architectures will need more work so that they actually emit the appropriate instructions for orderings stricter than 'monotonic'. (As far as I can tell, the ARM, PPC, Mips, and Alpha backends need such changes.) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136457 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-29 03:05:32 +00:00
Bruno Cardoso Lopes	6126005259	Fix two tests that I crashed in the previous commits. The mask elts on the second half must be reindexed. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136454 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-29 02:05:28 +00:00
Bruno Cardoso Lopes	dd6353073f	Match VPERMIL masks more strictly and update the target specific mask generation to always catch the weird cases. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136453 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-29 01:31:15 +00:00
Bruno Cardoso Lopes	2eb4c2bcad	Add DecodeShuffle shuffle support for VPERMILPD variants git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136452 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-29 01:31:11 +00:00
Bruno Cardoso Lopes	e89c7d4ce3	Add v8i32 and v4i64 vpermil patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136451 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-29 01:31:07 +00:00
Bruno Cardoso Lopes	377baa5bc9	Fix a bug while generating target specific VPERMIL masks: skip undef mask elements. This fixes PR10529. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136450 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-29 01:31:04 +00:00
Bruno Cardoso Lopes	0b0a09f4bf	Enable usage of SSE4 extracts and inserts in their 128-bit AVX forms. Also tidy up code a bit. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136449 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-29 01:31:02 +00:00
Bruno Cardoso Lopes	9065d4b65f	Cleanup PALIGNR handling and remove the old palign pattern fragment. Also make PALIGNR masks not match 256-bit vectors, which aren't supported. It's also a step to solve PR10489 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136448 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-29 01:30:59 +00:00
Chandler Carruth	ac03e736c7	Rewrite the CMake build to use explicit dependencies between libraries, specified in the same file that the library itself is created. This is more idiomatic for CMake builds, and also allows us to correctly specify dependencies that are missed due to bugs in the GenLibDeps perl script, or change from compiler to compiler. On Linux, this returns CMake to a place where it can reliably rebuild several targets of LLVM. I have tried not to change the dependencies from the ones in the current auto-generated file. The only places I've really diverged are in places where I was seeing link failures, and added a dependency. The goal of this patch is not to start changing the dependencies, merely to move them into the correct location, and an explicit form that we can control and change when necessary. This also removes a serialization point in the build because we don't have to scan all the libraries before we begin building various tools. We no longer have a step of the build that regenerates a file inside the source tree. A few other associated cleanups fall out of this. This isn't really finished yet though. After talking to dgregor he urged switching to a single CMake macro to construct libraries with both sources and dependencies in the arguments. Migrating from the two macros to that style will be a follow-up patch. Also, llvm-config is still generated with GenLibDeps.pl, which means it still has slightly buggy dependencies. The internal CMake 'llvm-config-like' macro uses the correct explicitly specified dependencies however. A future patch will switch llvm-config generation (when using CMake) to be based on these deps as well. This may well break Windows. I'm getting a machine set up now to dig into any failures there. If anyone can chime in with problems they see or ideas of how to solve them for Windows, much appreciated. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136433 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-29 00:14:25 +00:00
Oscar Fuentes	4ebc5916e9	Explicitly declare a library dependency of LLVMDesc to LLVMAsmPrinter. GenLibDeps.pl fails to detect vtable references. As this is the only referenced symbol from LLVMDesc to LLVMAsmPrinter on optimized builds, the algorithm that creates the list of libraries to be linked into tools doesn't know about the dependency and sometimes places the libraries on the wrong order, yielding error messages like this: ../../lib/libLLVMARMDesc.a(ARMMCTargetDesc.cpp.o): In function `llvm::ARMInstPrinter::ARMInstPrinter(llvm::MCAsmInfo const&)': ARMMCTargetDesc.cpp:(.text._ZN4llvm14ARMInstPrinterC1ERKNS_9MCAsmInfoE [llvm::ARMInstPrinter::ARMInstPrinter(llvm::MCAsmInfo const&)]+0x2a): undefined reference to `vtable for llvm::ARMInstPrinter' git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136328 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-28 02:33:52 +00:00
Bruno Cardoso Lopes	15d03fb7f4	Invert the subvector insertion to be more likely to be taken as a COPY git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136324 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-28 01:26:53 +00:00
Bruno Cardoso Lopes	93fa4766c2	Add patterns to generate copies for extract_subvector instead of using vextractf128. This will reduce the number of issued instruction for several avx codes. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136323 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-28 01:26:50 +00:00
Bruno Cardoso Lopes	735bccda65	movd/movq write zeros in the high 128-bit part of the vector. Use them to match 256-bit scalar_to_vector+zext. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136322 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-28 01:26:46 +00:00
Bruno Cardoso Lopes	a23236c360	Add a few patterns to match allzeros without having to use the fp unit. Take advantage that the 128-bit vpxor zeros the higher part and use it. This also fixes PR10491 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136321 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-28 01:26:43 +00:00
Bruno Cardoso Lopes	2e64ae4101	Add SINT_TO_FP and FP_TO_SINT support for v8i32 types. Also move a convert pattern close to the instruction definition. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136320 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-28 01:26:39 +00:00
Evan Cheng	5de728cfe1	Emit an error if the asm parser parsed X86_64 only registers, e.g. %rax, %sil. This can happen in cases where TableGen generated asm matcher cannot check whether a register operand is in the right register class. e.g. mem operands. rdar://8204588 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136292 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-27 23:22:03 +00:00
Kevin Enderby	c37d4bbf1f	Fix llvm-mc handling of x86 instructions that take 8-bit unsigned immediates. llvm-mc gives an "invalid operand" error for instructions that take an unsigned immediate which have the high bit set such as: pblendw $0xc5, %xmm2, %xmm1 llvm-mc treats all x86 immediates as signed values and range checks them. A small number of x86 instructions use the imm8 field as a set of bits. This change only changes those instructions and where the high bit is not ignored. The others remain unchanged. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136287 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-27 23:01:50 +00:00
Eli Friedman	1464846801	Code generation for 'fence' instruction. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136283 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-27 22:21:52 +00:00
Eli Friedman	84e7f7e267	X86ISD::MEMBARRIER does not require SSE2; it doesn't actually generate any code, and all x86 processors will honor the required semantics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136249 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-27 19:43:50 +00:00
Jeffrey Yasskin	a44defeb22	Explicitly cast narrowing conversions inside {}s that will become errors in C++0x. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136211 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-27 06:22:51 +00:00
Bruno Cardoso Lopes	9b4ad12b1e	Move some code around to open opportunity for more shuffle matching git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136201 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-27 00:56:37 +00:00
Bruno Cardoso Lopes	cea34e41fa	The vpermilps and vpermilpd have different behaviour regarding the usage of the shuffle bitmask. Both work in 128-bit lanes without crossing, but in the former the mask of the high part is the same used by the low part while in the latter both lanes have independent masks. Handle this properly and add support for vpermilpd. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136200 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-27 00:56:34 +00:00
Bruno Cardoso Lopes	cd9e5aed53	Remove more dead code! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136199 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-27 00:56:27 +00:00
Evan Cheng	bd27f5adbd	Support .code32 and .code64 in X86 assembler. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136197 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-27 00:38:12 +00:00
Benjamin Kramer	162ee5c725	Add a neat little two's complement hack for x86. On x86 we can't encode an immediate LHS of a sub directly. If the RHS comes from a XOR with a constant we can fold the negation into the xor and add one to the immediate of the sub. Then we can turn the sub into an add, which can be commuted and encoded efficiently. This code is generated for __builtin_clz and friends. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136167 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 22:42:13 +00:00
Bruno Cardoso Lopes	4ea496846a	Recognize unpckh* masks and match 256-bit versions. The new versions are different from the previous 128-bit because they work in lanes. Update a few comments and add testcases git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136157 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 22:03:40 +00:00
Eli Friedman	61cc47e15d	Prevent x86-specific DAGCombine from creating nodes with illegal type (which could not be selected). Fixes a minor isel issue that was breaking the testcase from r136130. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136148 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 21:02:58 +00:00
Bruno Cardoso Lopes	cf128eab90	Remove now unused patterns. 0 insertions(+), 98 deletions(-) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136109 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 18:22:39 +00:00
Bruno Cardoso Lopes	5e3267dac9	Cleanup old matching for PUNPCK* variants git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136108 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 18:22:27 +00:00
Bill Wendling	de77055a68	The compact unwinding offsets are divided by 8 on 64-bit machines. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136065 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 08:03:49 +00:00
Bruno Cardoso Lopes	5d348b4dc4	Add 256-bit isel for movsldup/movshdup git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136051 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 02:39:32 +00:00
Bruno Cardoso Lopes	9123c6fea0	More movsldup/movshdup cleanup. Rewrite the mask matching function and add support for 256-bit versions (but no instruction selection yet, coming next). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136050 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 02:39:28 +00:00
Bruno Cardoso Lopes	5f6c440e53	More cleanup, subtarget info isn't used here. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136049 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 02:39:25 +00:00
Bruno Cardoso Lopes	cc1c3526a7	Add 128-bit AVX versions of movshdup/movsldup git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136048 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 02:39:23 +00:00
Bruno Cardoso Lopes	3e9235c720	Cleanup movsldup/movshdup matching. 27 insertions(+), 62 deletions(-) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136047 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 02:39:13 +00:00
Evan Cheng	28c85a81a1	Rename createCodeEmitter to createMCCodeEmitter; createObjectStreamer to createMCObjectStreamer. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136031 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 00:42:34 +00:00
Evan Cheng	94b9550a32	Rename TargetAsmParser to MCTargetAsmParser and TargetAsmLexer to MCTargetAsmLexer; rename createAsmLexer to createMCAsmLexer and createAsmParser to createMCAsmParser. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136027 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 00:24:13 +00:00
Chandler Carruth	b35552d440	Clean up a pile of hacks in our CMake build relating to TableGen. The first problem to fix is to stop creating synthetic Table_gen targets next to all of the LLVM libraries. These had no real effect as CMake specifies that add_custom_command(OUTPUT ...) directives (what the 'tablegen(...)' stuff expands to) are implicitly added as dependencies to all the rules in that CMakeLists.txt. These synthetic rules started to cause problems as we started more and more heavily using tablegen files from subdirectories of the one where they were generated. Within those directories, the set of tablegen outputs was still available and so these synthetic rules added them as dependencies of those subdirectories. However, they were no longer properly associated with the custom command to generate them. Most of the time this "just worked" because something would get to the parent directory first, and run tablegen there. Once run, the files existed and the build proceeded happily. However, as more and more subdirectories have started using this, the probability of this failing to happen has increased. Recently with the MC refactorings, it became quite common for me when touching a large enough number of targets. To add insult to injury, several of the backends tried to fix this by adding explicit dependencies back to the parent directory's tablegen rules, but those dependencies didn't work as expected -- they weren't forming a linear chain, they were adding another thread in the race. This patch removes these synthetic rules completely, and adds a much simpler function to declare explicitly that a collection of tablegen'ed files are referenced by other libraries. From that, we can add explicit dependencies from the smaller libraries (such as every architecture's Desc library) on this and correctly form a linear sequence. All of the backends are updated to use it, sometimes replacing the existing attempt at adding a dependency, sometimes adding a previously missing dependency edge. Please let me know if this causes any problems, but it fixes a rather persistent and problematic source of build flakiness on our end. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136023 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-26 00:09:08 +00:00
Evan Cheng	78c10eeaa5	Rename TargetAsmBackend to MCAsmBackend; rename createAsmBackend to createMCAsmBackend. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136010 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 23:24:55 +00:00
Bruno Cardoso Lopes	863bd9d5cf	Codegen allonesvector better while using AVX: vpcmpeqd + vinsertf128. This also fixes PR10452. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136004 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 23:05:32 +00:00
Bruno Cardoso Lopes	51e92e8e41	Add remaining 256-bit vector bitcasts. This also fixes PR10451 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136003 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 23:05:28 +00:00
Bruno Cardoso Lopes	6a32adc4e5	- Handle special scalar_to_vector case: splats. Use a native 128-bit shuffle before inserting into a 256-bit vector. - Add AVX versions of movd/movq instructions - Introduce a few COPY patterns to match insert_subvector instructions. This turns a trivial insert_subvector instruction into a register copy, coalescing the xmm into a ymm and avoiding emitting one more instruction. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136002 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 23:05:25 +00:00
Bruno Cardoso Lopes	233fa39245	Reintroduce r135730, this is indeed the right approach, there is no native 256-bit vector instruction to do scalar_to_vector. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@136001 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 23:05:16 +00:00
Benjamin Kramer	a3d782f40c	Add a note about efficient codegen for binary log. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135996 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 22:30:00 +00:00
Eli Friedman	63f8dde482	Get rid of an incorrect optimization for shuffles with PALIGNR and simplify isPALIGNRMask. Addresses PR10466, although the crash from that PR only triggers in cases where DAGCombine misses optimizing a shuffle. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135980 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 21:36:45 +00:00
Evan Cheng	4b64e8a9e1	Separate MCInstPrinter registration from AsmPrinter registration. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135974 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 21:20:24 +00:00
Evan Cheng	1b0fc9b418	Fix last bits of MC layer issues. llvm-mc doesn't need to initialize TargetMachine's anymore. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135963 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 20:53:02 +00:00
Evan Cheng	54134708f5	Code clean up. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135954 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 20:18:48 +00:00
Bill Wendling	39055b3a5c	Update the comment. This feature is available only on Darwin at the moment. Though it's not Darwin-specific. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135951 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 20:15:15 +00:00
Oscar Fuentes	7331ac47b9	Unbreak the build. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135949 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 20:13:36 +00:00
Evan Cheng	a87e40f16f	More refactoring. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135939 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 19:33:48 +00:00
Evan Cheng	8c3fee5903	Refactor X86 target to separate MC code from Target code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135930 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 18:43:53 +00:00
Bill Wendling	c57e7dbfaf	Changed disabled code into a flag. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135924 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 18:04:49 +00:00
Bill Wendling	091a864f06	Remove dead variable. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135923 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 18:01:27 +00:00
Bill Wendling	09b02c8ddd	After we've modified the prolog to save volatile registers, generate the compact unwind encoding for that function. This simply crawls through the prolog looking for machine instrs marked as "frame setup". It can calculate from these what the compact unwind should look like. This is currently disabled because of needed linker support. But initial tests look good. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135922 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-25 18:00:28 +00:00
Evan Cheng	a7cfc08ebe	Move TargetAsmParser.h TargetAsmBackend.h and TargetAsmLexer.h to MC where they belong. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135833 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-23 00:45:41 +00:00
Evan Cheng	7f8dff6571	createXXXMCCodeGenInfo should be static. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135826 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-23 00:01:04 +00:00
Evan Cheng	e78085a3c0	Combine all MC initialization routines into one. e.g. InitializeX86MCAsmInfo, InitializeX86MCInstrInfo, etc. are combined into InitializeX86TargetMC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135812 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-22 21:58:54 +00:00
Bruno Cardoso Lopes	dad38638e1	Fix PR10422 by adding the necessary AVX UCOMISD memory versions to load folding logic git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135801 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-22 20:53:20 +00:00
Bruno Cardoso Lopes	8360b5fa81	Add v8f32->v8i32 bitcast. Fixes PR10440 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135794 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-22 19:51:02 +00:00
Rafael Espindola	23e31011fb	Turn shuffles into unpacks for VT == MVT::v2i64 and MVT::v2f64 too. Patch by Jeff Muizelaar. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135789 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-22 18:56:05 +00:00
Dan Gohman	6e5fda213f	Fix x86's XALUO lowering to return its replacement values instead of doing the RAUW calls for the overflow value itself. This makes it more consistent with how the rest of LegalizeDAG works. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135788 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-22 18:45:15 +00:00
Benjamin Kramer	558cc5a914	GCC complains about the angle of this line. Remove the escaped newline. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135739 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-22 01:02:57 +00:00
Bruno Cardoso Lopes	c41e593509	Remove the 128-bit special handling from SCALAR_TO_VECTOR. This isn't the way to go. Doing this here will prevent several node matches later, and would have to force looking all the way through several VINSERTF128/VEXTRACTF128 chains to optimize simple things. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135730 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-22 00:15:10 +00:00
Bruno Cardoso Lopes	6683efb4cd	- Inspected an AVX code block added by someone in early Feb. This was never used and was actually very wrong, fix it and make it simpler. Also remove the ConcatVectors function, which is unused now. - Fix an introduction of useless nodes in r126664 and r126264. The VUNPCKL* should never be introduced because we don't want duplicate nodes for 128 AVX and non-AVX modes, the actual instruction difference only exists during isel, but not for target specific DAG nodes. We only introduce V* target nodes when there is no 128-bit version already there. - Fix a fragile test and make it more useful. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135729 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-22 00:15:07 +00:00
Bruno Cardoso Lopes	74dad551d8	Add a DAGCombine for transforming 128->256 casts into a simple vxorps + vinsertf128 pair of instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135727 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-22 00:15:00 +00:00
Bruno Cardoso Lopes	d088834fb9	Introduce a new function to lower 256-bit vectors which are not directly supported and should be promoted and handled by smaller shuffles git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135726 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-22 00:14:56 +00:00
Bruno Cardoso Lopes	589b897a31	Rename function to be more specific and be more strict about its usage git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135725 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-22 00:14:53 +00:00
Bruno Cardoso Lopes	dbd4fe2b0a	- Register v16i16 as valid VR256 register class - Add more bitcasts for v16i16 - Since 135661 and 135662 already added the splat logic, just add one more splat test for v16i16 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135663 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-21 02:24:08 +00:00
Bruno Cardoso Lopes	65b74e1d00	Add support for 256-bit versions of VPERMIL instruction. This is a new instruction introduced in AVX, which can operate on 128 and 256-bit vectors. It considers a 256-bit vector as two independent 128-bit lanes. It can permute any 32-bit or 64-bit elements inside a lane, and restricts the second lane to have the same permutation as the first one. With the improved splat support introduced earlier today, adding codegen for this instruction enables more efficient 256-bit code: Instead of: vextractf128 $0, %ymm0, %xmm0 punpcklbw %xmm0, %xmm0 punpckhbw %xmm0, %xmm0 vinsertf128 $0, %xmm0, %ymm0, %ymm1 vinsertf128 $1, %xmm0, %ymm1, %ymm0 vextractf128 $1, %ymm0, %xmm1 shufps $1, %xmm1, %xmm1 movss %xmm1, 28(%rsp) movss %xmm1, 24(%rsp) movss %xmm1, 20(%rsp) movss %xmm1, 16(%rsp) vextractf128 $0, %ymm0, %xmm0 shufps $1, %xmm0, %xmm0 movss %xmm0, 12(%rsp) movss %xmm0, 8(%rsp) movss %xmm0, 4(%rsp) movss %xmm0, (%rsp) vmovaps (%rsp), %ymm0 We get: vextractf128 $0, %ymm0, %xmm0 punpcklbw %xmm0, %xmm0 punpckhbw %xmm0, %xmm0 vinsertf128 $0, %xmm0, %ymm0, %ymm1 vinsertf128 $1, %xmm0, %ymm1, %ymm0 vpermilps $85, %ymm0, %ymm0 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135662 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-21 01:55:47 +00:00
Bruno Cardoso Lopes	9283b668a1	Improve splat promotion to handle AVX types: v32i8 and v16i16. Also refactor the code and add a bunch of comments. The final shuffle emitted by handling 256-bit types is suitable for the VPERM shuffle instruction which is going to be introduced in a following commit (with a testcase which covers this commit) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135661 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-21 01:55:42 +00:00
Bruno Cardoso Lopes	0e87805074	Add additional patterns for vextractf128 instruction git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135660 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-21 01:55:39 +00:00
Bruno Cardoso Lopes	df0e03ceb8	Add additional patterns for vinsertf128 instruction git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135659 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-21 01:55:36 +00:00
Bruno Cardoso Lopes	11bbb2003a	Add v16i16 type to VR256 class git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135658 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-21 01:55:33 +00:00
Bruno Cardoso Lopes	bca4781b61	Move code around. No functionality changes git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135657 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-21 01:55:30 +00:00
Bruno Cardoso Lopes	67727cac2f	Tidy up code git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135656 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-21 01:55:27 +00:00
Bill Wendling	fb4eb165d6	Mark instructions which are part of the frame setup with the MachineInstr::FrameSetup flag. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135645 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-21 00:44:56 +00:00
Bill Wendling	7bc3178182	Remove unused function. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135635 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-20 23:07:42 +00:00
Bill Wendling	16da7366d5	Remove the now defunct getCompactUnwindEncoding method from the frame lowering code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135634 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-20 23:04:09 +00:00
Evan Cheng	203576aa0c	Goodbye TargetAsmInfo. This eliminates the last bit of CodeGen and Target in llvm-mc. There is still a bit more refactoring left to do in Targets. But we are now very close to fixing all the layering issues in MC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135611 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-20 19:50:42 +00:00
Eli Friedman	a4d0bd84f7	Extend the hack for _GLOBAL_OFFSET_TABLE_ slightly; PR10389. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@135607 91177308-0d34-0410-b5e6-96231b3b80d8	2011-07-20 19:36:11 +00:00

7705 Commits