llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-03-25 00:35:30 +00:00

Author	SHA1	Message	Date
Nadav Rotem	6890be345e	Fix an overly aggressive assertion in getCopyFromPartsVector. The assertion in getCopyFromPartsVector assumed that the vector 'part' must match the type of argument (arguments are potentially split into multiple parts). However, in some cases the targets return a 'part' of the right size but with a different type. We already handle this case correctly later on and generate a bitcast. This commit just makes sure that we are actually checking the property that we care about. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241312 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 23:23:52 +00:00
Adrian Prantl	c3d9cd3f73	DIBuilder: Now that DICompileUnit is distinct, stop using temporary nodes for the arrays. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241308 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 22:32:52 +00:00
Akira Hatanaka	516286ff69	Use function attribute "trap-func-name" and remove TargetOptions::TrapFuncName. This commit changes normal isel and fast isel to read the user-defined trap function name from function attribute "trap-func-name" attached to llvm.trap or llvm.debugtrap instead of from TargetOptions::TrapFuncName. This is needed to use clang's command line option "-ftrap-function" for LTO and enable changing the trap function name on a per-call-site basis. Out-of-tree projects currently using TargetOptions::TrapFuncName to specify the trap function name should attach attribute "trap-func-name" to the call sites of llvm.trap and llvm.debugtrap instead. rdar://problem/21225723 Differential Revision: http://reviews.llvm.org/D10832 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241305 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 22:13:27 +00:00
Akira Hatanaka	7ef8957773	Add functions for adding and testing string attributes to CallInst. NFC. This change is needed later when I make changes to attach string function attributes to llvm.trap and llvm.debugtrap. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241304 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 22:08:48 +00:00
Dan Gohman	44bc052f4d	[WebAssembly] Set the HasFloatingPointExceptions flag for WebAssembly. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241302 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 21:36:25 +00:00
Rafael Espindola	c223f39fa7	Try to fix the build of IntelJITEventListener. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241301 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 21:24:12 +00:00
Rafael Espindola	8a80641a85	Return ErrorOr from SymbolRef::getName. This function can really fail since the string table offset can be out of bounds. Using ErrorOr makes sure the error is checked. Hopefully a lot of the boilerplate code in tools/* can go away once we have a diagnostic manager in Object. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241297 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 20:55:21 +00:00
Bill Schmidt	397fac95d5	[PPC64LE] Remove implicit-subreg restriction from VSX swap removal In r241285, I removed the SUBREG_TO_REG restriction from VSX swap removal, determining that this was overly conservative. We have another form of the same restriction in that we check for the presence of implicit subregs in vector operations. As with SUBREG_TO_REG for partial register conversions, an implicit subreg is safe in and of itself, provided no other operation makes a lane-sensitive assumption about the result. This patch removes that restriction, by removing the HasImplicitSubreg flag and all code that relies on it. I've added a test case that fails to optimize before this patch is applied, and optimizes properly with the patch. Test based on a report from Anton Blanchard. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241290 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 19:01:22 +00:00
Sanjoy Das	1e09eba39d	[Statepoints] Make operator bool() explicit. (Addressing post-commit review.) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241288 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 18:15:18 +00:00
Rafael Espindola	98857b60ed	Remove a report_fatal_error that should be unreachable. If we created a relocation iterator, we have a valid relocation section. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241286 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 17:16:07 +00:00
Bill Schmidt	a5a5a62fff	[PPC64LE] Teach swap optimization about the doubleword splat idiom With a previous patch, the VSX swap optimization is able to recognize the doubleword load-splat idiom that can be implemented using lxvdsx. However, that does not cover a doubleword splat where the source is a register. We can implement this using xxspltd (a special form of xxpermdi). This patch teaches the swap optimization pass about this idiom. As a prerequisite, it also permits swap optimization to succeed for all forms of SUBREG_TO_REG. Previously we were conservative and only allowed SUBREG_TO_REG when it copied a full register. However, on reflection any form of SUBREG_TO_REG is safe in and of itself, so long as an unsafe operation is not performed on its result. In particular, a widening SUBREG_TO_REG often occurs as an input to a doubleword splat idiom, particularly in auto-vectorized code. The doubleword splat idiom is an XXPERMDI operation where both source registers are identical, and the selection mask is either 0 (splat the first element) or 3 (splat the second element). To determine whether the registers are identical, we use the existing mechanism for looking through "copy-like" operations. That mechanism has a side effect of marking the XXPERMDI operation as using a physical register, which would invalidate its presence in a swap-optimized region. This is correct for the form of XXPERMDI that performs a swap and hence would be removed, but is not what we want for a doubleword-splat variety of XXPERMDI. Therefore we reset the physical-register flag on the XXPERMDI when it represents a splat. A simple test case is added to verify that we generate the splat and that we also remove the xxswapd instructions that would otherwise be associated with the load and store of another operand. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241285 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 17:03:06 +00:00
Rafael Espindola	9561506e34	Convert a member variable to a local one. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241284 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 16:59:57 +00:00
Gabor Ballabas	4e0f6008b9	Reworking the test part of r241149 The test part of r241149 has been reverted in r241451, due to misplaced test cases. This patch splits those test cases among the appropriate targets. Differential Revision: http://reviews.llvm.org/D10897 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241283 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 16:53:23 +00:00
Rafael Espindola	9afb3a2589	Fix for PR23310: llvm-dis crashes when trying to upgrade an intrinsic. When trying to upgrade @llvm.x86.sse2.psrl.dq while parsing a module, BitcodeReader adds the function to its worklist twice, resulting in a crash when accessing it the second time. This patch replaces the worklist vector by a map. Patch by Philip Pfaffe. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241281 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 16:22:40 +00:00
Rafael Espindola	3a27979633	Handle .dynsym a bit more like we handle .symtab. They have the same format and we find them in the same way, no reason to handle them differently. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241280 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 16:03:38 +00:00
Rafael Espindola	002ff89e22	Rangify some loops. Patch by Philip Pfaffe! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241279 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 15:55:09 +00:00
Rafael Espindola	adc9db9825	Use default member initializes. NFC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241277 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 15:48:05 +00:00
Leny Kholodov	479dc2c0fe	[Support] Lazy load of dbghlp.dll on Windows This patch changes linkage with dbghlp.dll for clang from static (at load time) to on demand (at the first use of required functions). Clang uses dbghlp.dll only in minor use-cases. First of all in case of crash and in case of plugin load. The dbghlp.dll library can be absent on system. In this case clang will fail to load. With lazy load of dbghlp.dll clang can work even if dbghlp.dll is not available. Differential Revision: http://reviews.llvm.org/D10737 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241271 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 14:34:57 +00:00
Michael Kuperstein	7edba9eec3	[X86] Convert an instruction relaxation test to use objdump instead of readobj Patch by: david.l.kreitzer@intel.com git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241270 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 14:27:35 +00:00
Rafael Espindola	7540435bef	Move error checking from once per relocation to once per relocation section. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241269 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 14:27:07 +00:00
Yaron Keren	8bbc5bbd34	Remove whitespace from start of line, NFC. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241268 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 14:25:09 +00:00
Rafael Espindola	ab6b4cfe2a	Expose getRel and getRela to reduce code duplication. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241266 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 14:21:38 +00:00
Yaron Keren	016970f468	Delete whitespace at start of line. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241265 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 14:17:12 +00:00
Rafael Espindola	e4e6f29c93	Improve error message. Thanks to Sean Silva for the suggestion. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241255 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 11:48:48 +00:00
Pawel Bylica	074d71dea6	Reapply r240291: Fix shl folding in DAG combiner. The code responsible for shl folding in the DAGCombiner was assuming incorrectly that all constants are less than 64 bits. This patch simply changes the way values are compared. It has been reverted previously because of some problems with comparing APInt with raw uint64_t. That has been fixed/changed with r241204. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241254 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 11:44:54 +00:00
Charlie Turner	8992bae990	[GraphWriter] Don't wait on xdg-open when not on Apple. By default, the GraphWriter code assumes that the generic file open program (`open` on Apple, `xdg-open` on other systems) can wait on the forked proces to complete. When the fork ends, the code would delete the temporary dot files created, and return. On GNU/Linux, the xdg-open program does not have a "wait for your fork to complete before dying" option. So the behaviour was that xdg-open would launch a process, quickly die itself, and then the GraphWriter code would think its OK to quickly delete all the temporary files. Once the temporary files were deleted, the dot viewers would get very upset, and often give you weird errors. This change only waits on the generic open program on Apple platforms. Elsewhere, we don't wait on the process, and hence we don't try and clean up the temporary files. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241250 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 09:32:07 +00:00
Charlie Turner	f1b26fc0d0	[NFC] Minor editorial fixes to the CodeGen docs. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241249 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 09:32:01 +00:00
Sanjoy Das	d0872b393c	[NFC] Make the Statepoint class more like CallSite Summary: Rename some methods to make Statepoint look more like CallSite. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10756 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241235 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 02:53:45 +00:00
Sanjoy Das	70be6950a5	[Statepoints][NFC] Constify accessors on Statepoint. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10755 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241234 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 02:53:41 +00:00
Sanjoy Das	6302378736	[Statepoints][NFC] Rename variables to llvm style. Summary: (I don't think this change needs review, this was uploaded to phabricator to provide context for later dependent changes.) Differential Revision: http://reviews.llvm.org/D10630 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241233 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 02:53:39 +00:00
Sanjoy Das	05a5f1d637	[Statepoints][NFC] Add Statepoint::operator bool() Summary: This allows the "if (Statepoint SP = Statepoint(I))" idiom. (I don't think this change needs review, this was uploaded to phabricator to provide context for later dependent changes.) Differential Revision: http://reviews.llvm.org/D10629 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241232 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 02:53:36 +00:00
Sanjoy Das	c720b8690d	[Statepoints][NFC] Add Statepoint::getGCResult. Summary: Introduce a simple accessor to get the gc_result hanging off of a statepoint. (I don't think this change needs review, this was uploaded to phabricator to provide context for later dependent changes.) Differential Revision: http://reviews.llvm.org/D10627 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241231 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 02:53:34 +00:00
Sanjoy Das	c6f1b8a4ba	[LazyCallGraph] Port test case from r240039 to LCG. Summary: r240039 adds a test case to check that CallGraph does the right thing with respect to non-leaf intrinsics like statepoint and patchpoint. This ports the same test case to LazyCallGraph. LazyCallGraph already does the right thing with respect to escaping function pointers so there is no need to change any code. Reviewers: chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10582 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241226 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 02:03:58 +00:00
Eric Christopher	bbc90cea14	Make an X86 specific directory and put the recent X86 tti specific inlining test into it. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241223 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 01:36:31 +00:00
Eric Christopher	320404f134	Implement TargetTransformInfo::hasCompatibleFunctionAttributes for X86. This checks subtarget feature compatibility for inlining by verifying that the callee is a strict subset of the caller's features. This includes the cpu as part of the subtarget we can get via the incoming functions as the backend takes CPUs as feature sets. This allows us to inline things like: int foo() { return baz(); } int __attribute__((target("sse4.2"))) bar() { return foo(); } so that generic code can be inlined into specialized functions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241221 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 01:11:50 +00:00
Eric Christopher	5d5b914c59	Add a routine to TargetTransformInfo that will allow targets to look at the attributes on a function to determine whether or not to allow inlining. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241220 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-02 01:11:47 +00:00
JF Bastien	1ff585db47	WebAssembly: start instructions Summary: * Add 64-bit address space feature. * Rename SIMD feature to SIMD128. * Handle single-thread model with an IR pass (same way ARM does). * Rename generic processor to MVP, to follow design's lead. * Add bleeding-edge processors, with all features included. * Fix a few DEBUG_TYPE to match other backends. Test Plan: ninja check Reviewers: sunfish Subscribers: jfb, llvm-commits Differential Revision: http://reviews.llvm.org/D10880 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241211 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 23:41:25 +00:00
Quentin Colombet	a1a323c637	[TwoAddressInstructionPass] Try 3 Addr Conversion After Commuting. TwoAddressInstructionPass stops after a successful commuting but 3 Addr conversion might be good for some cases. Consider: int foo(int a, int b) { return a + b; } Before this commit, we emit: addl %esi, %edi movl %edi, %eax ret After this commit, we try 3 Addr conversion: leal (%rsi,%rdi), %eax ret Patch by Volkan Keles <vkeles@apple.com>! Differential Revision: http://reviews.llvm.org/D10851 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241206 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 23:12:13 +00:00
Pawel Bylica	c098b6e6e2	Change APInt comparison with uint64_t. Summary: This patch changes the way APInt is compared with a value of type uint64_t. Before the uint64_t value was truncated to the size of APInt before comparison. Now the comparison takes into account full 64-bit precision. Test Plan: Unit tests added. No regressions. Self-hosted check-all done as well. Reviewers: chandlerc, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10655 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241204 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 22:56:43 +00:00
Matthias Braun	3c76e5f588	Test for specific output in lit test git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241200 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 22:34:59 +00:00
Alexey Samsonov	739ded16a5	[LoopVectorize] Use ReplaceInstWithInst() helper where appropriate. This is mostly an NFC, which increases code readability (instead of saving old terminator, generating new one in front of old, and deleting old, we just call a function). However, it would additionaly copy the debug location from old instruction to replacement, which would help PR23837. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241197 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 22:18:30 +00:00
Pete Cooper	f89e1bdc1a	Pack MCSymbol::Flags in to the bitfield with other members. NFC. All file formats only needed 16-bits right now which is enough to fit in to the padding with other fields. This reduces the size of MCSymbol to 24-bytes on a 64-bit system. The layout is now 0 \| class llvm::MCSymbol 0 \| class llvm::PointerIntPair SectionOrFragmentAndHasName 0 \| intptr_t Value \| [sizeof=8, dsize=8, align=8 \| nvsize=8, nvalign=8] 8 \| unsigned int IsTemporary 8 \| unsigned int IsRedefinable 8 \| unsigned int IsUsed 8 \| _Bool IsRegistered 8 \| unsigned int IsExternal 8 \| unsigned int IsPrivateExtern 8 \| unsigned int Kind 9 \| unsigned int IsUsedInReloc 9 \| unsigned int SymbolContents 9 \| unsigned int CommonAlignLog2 10 \| uint32_t Flags 12 \| uint32_t Index 16 \| union 16 \| uint64_t Offset 16 \| uint64_t CommonSize 16 \| const class llvm::MCExpr * Value \| [sizeof=8, dsize=8, align=8 \| nvsize=8, nvalign=8] \| [sizeof=24, dsize=24, align=8 \| nvsize=24, nvalign=8] git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241196 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 21:57:51 +00:00
Dan Gohman	1efcdba0e2	[WebAssembly] Define separate Target instances for 32-bit and 64-bit. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241193 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 21:42:34 +00:00
Jingyue Wu	e08f05f3a5	[NVPTX] expand extload/truncstore for vectors of floats Summary: According to PTX ISA: For convenience, ld, st, and cvt instructions permit source and destination data operands to be wider than the instruction-type size, so that narrow values may be loaded, stored, and converted using regular-width registers. For example, 8-bit or 16-bit values may be held directly in 32-bit or 64-bit registers when being loaded, stored, or converted to other types and sizes. The operand type checking rules are relaxed for bit-size and integer (signed and unsigned) instruction types; floating-point instruction types still require that the operand type-size matches exactly, unless the operand is of bit-size type. So, the ISA does not support load with extending/store with truncatation for floating numbers. This is reflected in setting the loadext/truncstore actions to expand in the code for floating numbers, but vectors of floating numbers are not taken care of. As a result, loading a vector of floats followed by a fp_extend may be combined by DAGCombiner to a extload, and the extload may be lowered to NVPTXISD::LoadV2 with extending information. However, NVPTXISD::LoadV2 does not perform extending, and no extending instructions are inserted. Finally, PTX instructions with mismatched types are generated, like ld.v2.f32 {%fd3, %fd4}, [%rd2] This patch adds the correct actions for vectors of floats, so DAGCombiner would not create loads with extending, and correct code is generated. Patched by Gang Hu. Test Plan: Test case attached. Reviewers: jingyue Reviewed By: jingyue Subscribers: llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D10876 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241191 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 21:32:42 +00:00
Pete Cooper	f77ca9f85f	Encode MCSymbol alignment as log2(align). Given that alignments are always powers of 2, just encode it this way. This matches how we encode alignment on IR GlobalValue's for example. This compresses the CommonAlign member down to 5 bits which allows it to pack better with the surrounding fields. Reviewed by Duncan Exon Smith. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241189 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 21:07:03 +00:00
Reid Kleckner	8a1df9b25f	[WinEH] Use llvm.x86.seh.recoverfp in WinEHPrepare Don't pattern match for frontend outlined finally calls on non-x64 platforms. The 32-bit runtime uses a different funclet prototype. Now, the frontend is pre-outlining the finally bodies so that it ends up doing most of the heavy lifting for variable capturing. We're just outlining the callsite, and adapting the frameaddress(0) call to line up the frame pointer recovery. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241186 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 20:59:25 +00:00
Jingyue Wu	8f2981cb40	[NVPTX] Move NVPTXPeephole after NVPTXPrologEpilogPass Summary: Offset of frame index is calculated by NVPTXPrologEpilogPass. Before that the correct offset of stack objects cannot be obtained, which leads to wrong offset if there are more than 2 frame objects. This patch move NVPTXPeephole after NVPTXPrologEpilogPass. Because the frame index is already replaced by %VRFrame in NVPTXPrologEpilogPass, we check VRFrame register instead, and try to remove the VRFrame if there is no usage after NVPTXPeephole pass. Patched by Xuetian Weng. Test Plan: Strengthened test/CodeGen/NVPTX/local-stack-frame.ll to check the offset calculation based on SP and SPL. Reviewers: jholewinski, jingyue Reviewed By: jingyue Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D10853 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241185 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 20:08:06 +00:00
Bill Schmidt	9215b9ab25	[PPC64LE] Enable missing lxvdsx optimization, and related swap optimization When adding little-endian vector support for PowerPC last year, I inadvertently disabled an optimization that recognizes a load-splat idiom and generates the lxvdsx instruction. This patch moves the offending logic so lxvdsx is once again generated. This pattern is frequently generated by the vectorizer for scalar loads of an effective constant. Previously the lxvdsx instruction was wrongly listed as lane-sensitive for the VSX swap optimization (since both doublewords are identical, swaps are safe). This patch fixes this as well, so that vectorized code using lxvdsx can now have swaps removed from the computation. There is an existing test (@test50) in test/CodeGen/PowerPC/vsx.ll that checks for the missing optimization. However, vsx.ll was only being tested for POWER7 with big-endian code generation. I've added a little-endian RUN statement and expected LE code generation for all the tests in vsx.ll to give us a bit better VSX coverage, including what's needed for this patch. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241183 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 19:40:07 +00:00
Sanjay Patel	7472456fe9	add a cl::opt override for TargetLoweringBase's JumpIsExpensive This patch is not intended to change existing codegen behavior for any target. It just exposes the JumpIsExpensive setting on the command-line to allow for easier testing and emergency overrides. Also, change the existing regression test to use FileCheck, explicitly specify the jump-is-expensive option, and use more precise checks. Differential Revision: http://reviews.llvm.org/D10846 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241179 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 18:10:20 +00:00
Jonathan Roelofs	4b64e72fdd	Disallow in-source builds (as we already do for the cmake build). http://reviews.llvm.org/D10614 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@241178 91177308-0d34-0410-b5e6-96231b3b80d8	2015-07-01 18:09:21 +00:00

... 4 5 6 7 8 ...

119341 Commits