llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-02-25 03:30:37 +00:00

Author	SHA1	Message	Date
Chandler Carruth	2d58cc5f1b	[x86] Teach a bunch of the x86-specific shuffle combining to work with 256-bit vectors as well as 128-bit vectors. Fixes some of the redundant shuffles for v16i16. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230752 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-27 11:45:13 +00:00
Chandler Carruth	8c71e440a2	[x86] Make the v8i16 clever single-input shuffle lowering usable for repeated 128-bit lane shuffles of wider vector types and use it to lower 256-bit v16i16 vector shuffles where applicable. This should let us perfectly lowering the pattern of pshuflw and pshufhw even for AVX2 256-bit patterns. I've not added AVX-512 support, but it should be trivial for someone working on that to wire up. Note that currently this generates bad, long shuffle chains because we don't combine 256-bit target shuffles. The subsequent patches will fix that. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230751 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-27 11:33:46 +00:00
Chandler Carruth	b1961a3896	[x86] Make the single-input v8i16 lowering directly recurse rather than going back through the entire vector shuffle lowering. This is an important step to being able to re-use this logic. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230743 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-27 09:11:38 +00:00
Charles Davis	d51be017f0	Target/X86: Save Win64 non-volatile registers in a Win64 ABI function. Summary: This change causes us to actually save non-volatile registers in a Win64 ABI function that calls a System V ABI function, and vice-versa. Reviewers: rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7919 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230714 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-27 00:57:01 +00:00
Eric Christopher	9656d2d2bc	Rewrite MachineOperand::print and MachineInstr::print to avoid uses of TM->getSubtargetImpl and propagate to all calls. This could be a debugging regression in places where we had a TargetMachine and/or MachineFunction but don't have it as part of the MachineInstr. Fixing this would require passing a MachineFunction/Function down through the print operator, but none of the existing uses in tree seem to do this. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230710 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-27 00:11:34 +00:00
Eric Christopher	acdd4442cb	getRegForInlineAsmConstraint wants to use TargetRegisterInfo for a lookup, pass that in rather than use a naked call to getSubtargetImpl. This involved passing down and around either a TargetMachine or TargetRegisterInfo. Update all callers/definitions around the targets and SelectionDAG. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230699 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 22:38:43 +00:00
Chandler Carruth	b54c36fb4d	[x86] Fix PR22706 where we would incorrectly try lower a v32i8 dynamic blend as legal. We made the same mistake in two different places. Whenever we are custom lowering a v32i8 blend we need to check whether we are custom lowering it only for constant conditions that can be shuffled, or whether we actually have AVX2 and full dynamic blending support on bytes. Both are fixed, with comments added to make it clear what is going on and a new test case. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230695 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 22:15:34 +00:00
Chandler Carruth	205a9a3aec	[x86] Restructure the comments and the conditions for handling dynamic blends. This makes it much more clear what is going on. The case we're handling is that of dynamic conditions, and we're bailing when the nature of the vector types and subtarget preclude lowering the dynamic condition vselect as an actual blend. No functionality changed here, but this will make a subsequent bug-fix to this code much more clear. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230690 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 21:29:06 +00:00
Chandler Carruth	9b9d0fcfe9	[x86] Re-order the combines of select in the X86 backend. This doesn't change functionality, but makes it more clear that the dynamic case and the shuffle case don't overlap in any interesting way. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230689 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 21:21:36 +00:00
Chandler Carruth	34f88924e1	[x86] Add an assert to catch if we ever try to blend a v32i8 without AVX2. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230688 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 21:18:20 +00:00
Reid Kleckner	783f7f989e	Don't sibcall between SysV and Win64 convention functions The shadow stack space expectations won't match. Fixes PR22709. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230667 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 19:43:20 +00:00
Michael Kuperstein	b2b5ffd452	[X86][Haswell][SchedModel] Fix WriteMULm latency. The latency for the WriteMULm class was set to 4, which is actually lower than the latency for WriteMULr (5). A better estimate would be 4 added to WriteMULr, that is, 9. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230634 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 14:30:09 +00:00
Chandler Carruth	eabb1227f6	[x86] Sink the single-input v8i16 lowering code that is actually formulaic into the top v8i16 lowering routine. This makes the generalized lowering a completely general and single path lowering which will allow generalizing it in turn for multiple 128-bit lanes. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230623 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 11:00:40 +00:00
Chandler Carruth	177498a4e0	[x86] Remove a SimpleTy usage. No need for it here, we already have the MVT. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230622 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 10:37:01 +00:00
Chandler Carruth	19c267aed1	[x86] Make the vector shuffle helpers order the SDLoc and MVT arguments. This ordering matches that of DAG.getNode. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230617 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 08:19:24 +00:00
Reid Kleckner	3142798ac8	Pass /nologo to ml64 for quieter builds It still prints "Assembling path/to/X86CompilationCallback_Win64.asm", but linking does the same thing. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230596 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 00:51:33 +00:00
Eric Christopher	341f17d0f0	Remove a FIXME. Explanation: This function is in TargetLowering because it uses RegClassForVT which would need to be moved to TargetRegisterInfo and would necessitate moving isTypeLegal over as well - a massive change that would just require TargetLowering having a TargetRegisterInfo class member that it would use. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230585 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 00:00:35 +00:00
Eric Christopher	a01bc6a59f	Remove an argument-less call to getSubtargetImpl from TargetLoweringBase. This required plumbing a TargetRegisterInfo through computeRegisterProperties and into findRepresentativeClass which uses it for register class iteration. This required passing a subtarget into a few target specific initializations of TargetLowering. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230583 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-26 00:00:24 +00:00
David Majnemer	92d1637e2f	X86, Win64: Allow 'mov' to restore the stack pointer if we have a FP The Win64 epilogue structure is very restrictive, it permits a very small number of opcodes and none of them are 'mov'. This means that given: mov %rbp, %rsp pop %rbp The mov isn't the epilogue, only the pop is. This is problematic unless a frame pointer is present in which case we are free to do whatever we'd like in the "body" of the function. If a frame pointer is present, unwinding will undo the prologue operations in reverse order regardless of the fact that we are at an instruction which is reseting the stack pointer. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230543 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-25 21:13:37 +00:00
Bruno Cardoso Lopes	51fc7f5afa	[X86][MMX] Reapply: Add MMX instructions to foldable tables Reapply r230248. Teach the peephole optimizer to work with MMX instructions by adding entries into the foldable tables. This covers folding opportunities not handled during isel. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230499 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-25 15:14:02 +00:00
Bruno Cardoso Lopes	8ad268fd61	[X86][MMX] Prevent MMX_MOVD64rm folding MMX_MOVD64rm zero-extends i32 load results into i64 registers. The peephole optimizer will try to fold it in other MMX foldable instructions, the wrong thing to do, since there's no MMX memory instruction that loads from i32 and does implict zero extension. Remove 'canFoldAsLoad' from MOVD64rm in order to prevent such folding. The current MMX tests already test this, but since there are no MMX instructions in the foldable tables yet, this did not trigger. This commit prepares the addition of those instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230498 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-25 15:13:52 +00:00
Elena Demikhovsky	4105fd49d4	AVX-512: Gather and Scatter patterns Gather and scatter instructions additionally write to one of the source operands - mask register. In this case Gather has 2 destination values - the loaded value and the mask. Till now we did not support code gen pattern for gather - the instruction was generated from intrinsic only and machine node was hardcoded. When we introduce the masked_gather node, we need to select instruction automatically, in the standard way. I added a flag "hasTwoExplicitDefs" that allows to handle 2 destination operands. (Some code in the X86InstrFragmentsSIMD.td is commented out, just to split one big patch in many small patches) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230471 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-25 09:46:31 +00:00
Sanjay Patel	a90eb87f7e	simplify control flow; NFC git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230342 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-24 16:26:02 +00:00
Michael Kuperstein	09d756a7e0	[x32] Mark RBX as reserved when EBX is the base pointer. This should have gone into r230334. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230339 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-24 16:13:16 +00:00
Sanjay Patel	269510242b	fix typo in comment; NFC git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230338 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-24 16:11:05 +00:00
Michael Kuperstein	2379e8a2ee	[x32] x32 should use ebx as the base pointer. This fixes the original issue in PR22655, but not the secondary one. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230334 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-24 15:27:13 +00:00
Craig Topper	c3b9d471f6	[X86] Remove the AbsMem32 type from the assembly parser. Only really need the 16-bit version which will automatically get prioritized over AbsMem. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230313 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-24 08:02:13 +00:00
David Majnemer	fbdee9f0c0	X86: Only use 'lea' in Win64 epilogues if a frame pointer exists We can only use 'add' in epilogues, 'lea' is not permitted unless we've established a frame pointer in the prologue. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230286 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-24 00:11:32 +00:00
David Majnemer	ad6622575c	X86: Use a smaller 'mov' instruction for stack probe calls Prologue emission, in some cases, requires calls to a stack probe helper function. The amount of stack to probe is passed as a register argument in the Win64 ABI but the instruction sequence used is pessimistic: it assumes that the number of bytes to probe is greater than 4 GB. Instead, select a more appropriate opcode depending on the number of bytes we are going to probe. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230270 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-23 21:50:30 +00:00
David Majnemer	d71e4c6218	X86: Use 'mov' instead of 'lea' in Win64 SEH prologues when possible 'mov' and 'lea' are equivalent when the displacement applied with 'lea' is zero. However, 'mov' should encode smaller. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230269 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-23 21:50:27 +00:00
David Majnemer	16ae406776	X86: Explain why we cannot use a 'mov' in a Win64 epilogue git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230268 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-23 21:50:25 +00:00
David Majnemer	10c4458d7d	X86: Consistently use 'epilogue' instead of 'epilog' git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230267 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-23 21:50:18 +00:00
Bruno Cardoso Lopes	6bf5b2b094	[AsmPrinter] Access pointers to globals via pcrel GOT entries Front-ends could use global unnamed_addr to hold pointers to other symbols, like @gotequivalent below: @foo = global i32 42 @gotequivalent = private unnamed_addr constant i32* @foo @delta = global i32 trunc (i64 sub (i64 ptrtoint (i32** @gotequivalent to i64), i64 ptrtoint (i32* @delta to i64)) to i32) The global @delta holds a data "PC"-relative offset to @gotequivalent, an unnamed pointer to @foo. The darwin/x86-64 assembly output for this follows: .globl _foo _foo: .long 42 .globl _gotequivalent _gotequivalent: .quad _foo .globl _delta _delta: .long _gotequivalent-_delta Since unnamed_addr indicates that the address is not significant, only the content, we can optimize the case above by replacing pc-relative accesses to "GOT equivalent" globals, by a PC relative access to the GOT entry of the final symbol instead. Therefore, "delta" can contain a pc relative relocation to foo's GOT entry and we avoid the emission of "gotequivalent", yielding the assembly code below: .globl _foo _foo: .long 42 .globl _delta _delta: .long _foo@GOTPCREL+4 There are a couple of advantages of doing this: (1) Front-ends that need to emit a great deal of data to store pointers to external symbols could save space by not emitting such "got equivalent" globals and (2) IR constructs combined with this opt opens a way to represent GOT pcrel relocations by using the LLVM IR, which is something we previously had no way to express. Differential Revision: http://reviews.llvm.org/D6922 rdar://problem/18534217 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230264 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-23 21:26:18 +00:00
Bruno Cardoso Lopes	ee7b509aa3	Revert "[X86][MMX] Add MMX instructions to foldable tables" This reverts commit r230226 since it breaks win buildbots. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230248 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-23 19:53:37 +00:00
Bruno Cardoso Lopes	77d2363908	[X86][MMX] Add MMX instructions to foldable tables Teach the peephole optimizer to work with MMX instructions by adding entries into the foldable tables. This covers folding opportunities not handled during isel. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230226 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-23 15:23:22 +00:00
Bruno Cardoso Lopes	c606f3a3cb	[X86][MMX] Support folding loads in psll, psrl and psra intrinsics git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230225 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-23 15:23:14 +00:00
Elena Demikhovsky	fdafc8fd5e	AVX-512: recommitted 229837 + bugfix + test git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230223 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-23 15:12:31 +00:00
Elena Demikhovsky	d8e5adcd92	restructured X86 scalar unary operation templates I made the templates general, no need to define pattern separately for each instruction/intrinsic. Now only need to add r_Int pattern for AVX. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230221 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-23 14:14:02 +00:00
Craig Topper	f9c1605d56	[X86] Add some missing redundant MMX and SSE encodings for disassembler. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230165 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-22 07:50:41 +00:00
Benjamin Kramer	2b17108064	Remove dead prototype. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230137 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-21 14:35:00 +00:00
Benjamin Kramer	edf99a5e3f	X86: Remove custom lowering of SIGN_EXTEND_INREG This was just replicating logic from the legalizer. Covered by existing tests. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230136 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-21 14:31:29 +00:00
David Majnemer	164db1c6b9	X86: Call __main using the SelectionDAG Synthesizing a call directly using the MI layer would confuse the frame lowering code. This is problematic as frame lowering is highly sensitive the particularities of calls, etc. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230129 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-21 05:49:45 +00:00
Tim Northover	ca7e0787f0	CodeGen: convert CCState interface to using ArrayRefs Everyone except R600 was manually passing the length of a static array at each callsite, calculated in a variety of interesting ways. Far easier to let ArrayRef handle that. There should be no functional change, but out of tree targets may have to tweak their calls as with these examples. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230118 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-21 02:11:17 +00:00
David Majnemer	e95985d3a0	Win64: Stack alignment constraints aren't applied during SET_FPREG Stack realignment occurs after the prolog, not during, for Win64. Because of this, don't factor in the maximum stack alignment when establishing a frame pointer. This fixes PR22572. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230113 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-21 01:04:47 +00:00
Reid Kleckner	4b91be0289	X86: Remove pre-2010 dead code in mergeSPUpdatesDown git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230075 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-20 22:13:25 +00:00
Simon Pilgrim	0de2c870d8	LowerScalarImmediateShift - Merged v16i8 and v32i8 shift lowering. NFC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230074 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-20 22:13:03 +00:00
Andrea Di Biagio	3583d23018	[X86][FastIsel] Teach how to select float-half conversion intrinsics. This patch teaches X86FastISel how to select intrinsic 'convert_from_fp16' and intrinsic 'convert_to_fp16'. If the target has F16C, we can select VCVTPS2PHrr for a float-half conversion, and VCVTPH2PSrr for a half-float conversion. Differential Revision: http://reviews.llvm.org/D7673 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230043 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-20 19:37:14 +00:00
Sanjay Patel	74e8bf678a	canonicalize a v2f64 blendi of 2 registers This canonicalization step saves us 3 pattern matching possibilities * 4 math ops for scalar FP math that uses xmm regs. The backend can re-commute the operands post-instruction-selection if that makes register allocation better. The tests in llvm/test/CodeGen/X86/sse-scalar-fp-arith.ll cover this scenario already, so there are no new tests with this patch. Differential Revision: http://reviews.llvm.org/D7777 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230024 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-20 16:55:27 +00:00
Chandler Carruth	634fc5f26b	[x86] Switching the shuffle equivalence test to a variadic template was the wrong answer. We also got initializer lists which are way cleaner for this kind of thing. Let's use those and make this a normal, boring functionn accepting ArrayRef. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@230004 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-20 10:47:28 +00:00
Eric Christopher	b661ab1cbd	Save the MachineFunction in startFunction so that we can use it for lookups of the subtarget later. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@229996 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-20 08:01:55 +00:00

1 2 3 4 5 ...

11388 Commits