llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-09-29 13:54:57 +00:00

Author	SHA1	Message	Date
Bill Wendling	2ca55e9ced	Merging r197492: ------------------------------------------------------------------------ r197492 \| dyatkovskiy \| 2013-12-17 04:07:33 -0800 (Tue, 17 Dec 2013) \| 26 lines Fix for PR18045: http://llvm.org/bugs/show_bug.cgi?id=18045 Short issue description: For X86 machines with sse < sse4.1 we got failures for some particular load/store vector sequences: $ clang-trunk -m32 -O2 test-case.c fatal error: error in backend: Cannot select: 0x4200920: v4i32,ch = load 0x41d6ab0, 0x4205850, 0x41dcb10<LD16[getelementptr inbounds ([4 x i32]* @e, i32 0, i32 0)](align=4)> [ORD=82] [ID=58] 0x4205850: i32 = X86ISD::Wrapper 0x41d5490 [ORD=26] [ID=43] 0x41d5490: i32 = TargetGlobalAddress<[4 x i32]* @e> 0 [ORD=26] [ID=23] 0x41dcb10: i32 = undef [ID=2] The reason is that EltsFromConsecutiveLoads could emit such load instruction both before and after legalize stage. Though this instruction is not legal for machines with SSSE3 and lower. The fix: In EltsFromConsecutiveLoads, if we have passed legalize stage, we check whether nodes it emits are legal. P.S.: If you get failure in time from 12:00 and till 22:00 (UTC-8), perhaps I'll slow with response, so you better reject this commit. Thanks! ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@197779 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-20 04:29:56 +00:00
Bill Wendling	e39b15195a	Merging r197449: ------------------------------------------------------------------------ r197449 \| arnolds \| 2013-12-16 17:11:01 -0800 (Mon, 16 Dec 2013) \| 7 lines LoopVectorizer: Don't if-convert constant expressions that can trap A phi node operand or an instruction operand could be a constant expression that can trap (division). Check that we don't vectorize such cases. PR16729 radar://15653590 ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@197453 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-17 01:28:35 +00:00
Bill Wendling	b525888b1f	Merging r197216: ------------------------------------------------------------------------ r197216 \| chandlerc \| 2013-12-13 00:00:01 -0800 (Fri, 13 Dec 2013) \| 9 lines [inliner] Fix PR18206 by preventing inlining functions that call setjmp through an invoke instruction. The original patch for this was written by Mark Seaborn, but I've reworked his test case into the existing returns_twice test case and implemented the fix by the prior refactoring to actually run the cost analysis over invoke instructions, and then here fixing our detection of the returns_twice attribute to work for both calls and invokes. We never noticed because we never saw an invoke. =[ ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@197352 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-15 20:55:09 +00:00
Bill Wendling	e6725194d1	Merging r197215: ------------------------------------------------------------------------ r197215 \| chandlerc \| 2013-12-12 23:59:56 -0800 (Thu, 12 Dec 2013) \| 24 lines [inliner] Completely change (and fix) how the inline cost analysis handles terminator instructions. The inline cost analysis inheritted some pretty rough handling of terminator insts from the original cost analysis, and then made it much, much worse by factoring all of the important analyses into a separate instruction visitor. That instruction visitor never visited the terminator. This works fine for things like conditional branches, but for many other things we simply computed The Wrong Value. First example are unconditional branches, which should be free but were counted as full cost. This is most significant for conditional branches where the condition simplifies and folds during inlining. We paid a 1 instruction tax on every branch in a straight line specialized path. =[ Oh, we also claimed that the unreachable instruction had cost. But it gets worse. Let's consider invoke. We never applied the call penalty. We never accounted for the cost of the arguments. Nope. Worse still, we didn't handle the correctness constraints of not inlining recursive invokes, or exception throwing returns_twice functions. Oops. See PR18206. Sadly, PR18206 requires yet another fix, but this refactoring is at least a huge step in that direction. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@197351 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-15 20:54:53 +00:00
Bill Wendling	dd36ddfaec	Merging r197178: ------------------------------------------------------------------------ r197178 \| hfinkel \| 2013-12-12 12:45:24 -0800 (Thu, 12 Dec 2013) \| 9 lines Fix a use-after-free error in GlobalOpt CleanupConstantGlobalUsers GlobalOpt's CleanupConstantGlobalUsers function uses a worklist array to manage constant users to be visited. The pointers in this array need to be weak handles because when we delete a constant array, we may also be holding a pointer to one of its elements (or an element of one of its elements if we're dealing with an array of arrays) in the worklist. Fixes PR17347. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@197322 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-14 08:04:09 +00:00
Bill Wendling	e09cd8d42b	Merging r197228: ------------------------------------------------------------------------ r197228 \| d0k \| 2013-12-13 05:40:24 -0800 (Fri, 13 Dec 2013) \| 8 lines X86: When lowering shl_parts, don't emit shift amounts larger than the bit width. While it's safe for the X86-specific shift nodes, dag combining will kill generic nodes. Insert an AND to make it safe, isel will nuke it as x86's shift instructions have an implicit AND. Fixes PR16108, which contains a contraption to hit this case in between constant folders. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@197321 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-14 08:01:30 +00:00
Bill Wendling	b29de8ba00	Merging r197089: ------------------------------------------------------------------------ r197089 \| hfinkel \| 2013-12-11 15:12:25 -0800 (Wed, 11 Dec 2013) \| 6 lines Fix the PPC subsumes-predicate check For one predicate to subsume another, they must both check the same condition register. Failure to check this prerequisite was causing miscompiles. Fixes PR18003. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@197126 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-12 04:28:52 +00:00
Bill Wendling	b1eb9dd018	Merging r196858: ------------------------------------------------------------------------ r196858 \| nadav \| 2013-12-09 17:13:59 -0800 (Mon, 09 Dec 2013) \| 1 line Fix PR18162 - Incorrect assertion assumed that the SDValue resno is zero. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196886 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-10 06:42:24 +00:00
Bill Wendling	31985c7d2a	Merging r196806: ------------------------------------------------------------------------ r196806 \| apazos \| 2013-12-09 11:29:14 -0800 (Mon, 09 Dec 2013) \| 11 lines Fix pattern match for movi with 0D result Patch by Jiangning Liu. With some test case changes: - intrinsic test added to the existing /test/CodeGen/AArch64/neon-aba-abd.ll. - New test cases to cover movi 1D scenario without using the intrinsic in test/CodeGen/AArch64/neon-mov.ll. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196872 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-10 04:31:42 +00:00
Manman Ren	27457ac42f	Merging r196158: ------------------------------------------------------------------------ r196158 \| mren \| 2013-12-02 13:29:56 -0800 (Mon, 02 Dec 2013) \| 12 lines Debug Info: drop debug info via upgrading path if version number does not match. Add a helper function getDebugInfoVersionFromModule to return the debug info version number for a module. "Verifier/module-flags-1.ll" checks for verification errors. It will seg fault when calling getDebugInfoVersionFromModule because of the incorrect format for module flags in the testing case. We make getModuleFlagsMetadata more robust by checking for error conditions. PR17982 ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196822 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 21:06:30 +00:00
Manman Ren	782ff3b700	Merging r196156: ------------------------------------------------------------------------ r196156 \| mren \| 2013-12-02 13:25:56 -0800 (Mon, 02 Dec 2013) \| 2 lines Update Ocaml/vmcore.ml to emit a "Debug Info Version" module flag. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196821 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 21:05:36 +00:00
Manman Ren	3533340399	Merging r195535: ------------------------------------------------------------------------ r195535 \| mren \| 2013-11-22 17:16:29 -0800 (Fri, 22 Nov 2013) \| 8 lines Debug Info: update testing cases to specify the debug info version number. We are going to drop debug info without a version number or with a different version number, to make sure we don't crash when we see bitcode files with different debug info metadata format. Make tests more robust by removing hard-coded metadata numbers in CHECK lines. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196817 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 21:01:06 +00:00
Manman Ren	41245b4e2a	Merging r195504: ------------------------------------------------------------------------ r195504 \| mren \| 2013-11-22 13:49:45 -0800 (Fri, 22 Nov 2013) \| 6 lines Debug Info: update testing cases to specify the debug info version number. We are going to drop debug info without a version number or with a different version number, to make sure we don't crash when we see bitcode files with different debug info metadata format. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196815 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 20:58:24 +00:00
Tim Northover	863c7b48a6	Merge rest of r196210. Some bits strayed into r196701, turning 3.4 red. This should fix the issue. ------------------------------------------------------------------------ r196210 \| haoliu \| 2013-12-03 06:06:55 +0000 (Tue, 03 Dec 2013) \| 3 lines [AArch64]Add missing floating point convert, round and misc intrinsics. E.g. int64x1_t vcvt_s64_f64(float64x1_t a) -> FCVTZS Dd, Dn ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196772 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 10:48:32 +00:00
Tim Northover	54ed08e250	Merge r196725 (conflicts on same API as before): ------------------------------------------------------------------------ r196725 \| tnorthover \| 2013-12-08 15:56:50 +0000 (Sun, 08 Dec 2013) \| 19 lines ARM: fix folding of stack-adjustment (yet again). When trying to eliminate an "sub sp, sp, #N" instruction by folding it into an existing push/pop using dummy registers, we need to account for the fact that this might affect precisely how "fp" gets set in the prologue. We were attempting this, but assuming that whenever we performed a fold it would make a difference. This is false, for example, in: push {r4, r7, lr} add fp, sp, #4 vpush {d8} sub sp, sp, #8 we can fold the "sub" into the "vpush", forming "vpush {d7, d8}". However, in that case the "add fp" instruction mustn't change, which we were getting wrong before. Should fix PR18160. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196769 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 09:05:30 +00:00
Bill Wendling	7d9c02dc62	Merging r196751: ------------------------------------------------------------------------ r196751 \| venkatra \| 2013-12-08 20:02:15 -0800 (Sun, 08 Dec 2013) \| 3 lines [Sparc]: Implement getSetCCResultType() in SparcTargetLowering so that umulo/smulo can be lowered on sparcv9 without an assertion error. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196766 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 08:56:18 +00:00
Bill Wendling	571a02f291	Merging r196755: ------------------------------------------------------------------------ r196755 \| venkatra \| 2013-12-08 21:13:25 -0800 (Sun, 08 Dec 2013) \| 2 lines [SPARCV9]: Adjust the resultant pointer of DYNAMIC_STACKALLOC with the stack BIAS on sparcV9. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196764 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 08:55:55 +00:00
Bill Wendling	f9a98aeb5b	Merging r196735: ------------------------------------------------------------------------ r196735 \| venkatra \| 2013-12-08 14:06:07 -0800 (Sun, 08 Dec 2013) \| 3 lines [SparcV9]: Expand MULHU/MULHS:i64 and UMUL_LOHI/SMUL_LOHI:i64 on sparcv9. This fixes PR18150. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196744 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 01:54:36 +00:00
Tim Northover	e8098892f5	Merging r196493. Simple conflict due to change API of updated function. git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196717 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 08:12:20 +00:00
Bill Wendling	209178daca	Merging r196638: ------------------------------------------------------------------------ r196638 \| arsenm \| 2013-12-06 18:58:45 -0800 (Fri, 06 Dec 2013) \| 1 line Fix assert with copy from global through addrspacecast ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196709 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:25:40 +00:00
Bill Wendling	b7e206eab9	--- Reverse-merging r196668 into '.': U lib/Transforms/InstCombine/InstCombineLoadStoreAlloca.cpp U test/Transforms/InstCombine/addrspacecast.ll git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196705 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:19:49 +00:00
Bill Wendling	2bdc0dd2db	Merging r196588: ------------------------------------------------------------------------ r196588 \| weimingz \| 2013-12-06 09:56:48 -0800 (Fri, 06 Dec 2013) \| 7 lines Bug 18149: [AArch32] VSel instructions has no ARMCC field The current peephole optimizing for compare inst assumes an instr that uses CPSR has an MO for ARM Cond code.However, for VSEL instructions (vseqeq, vselgt, vselgt, vselvs), there is no such operand nor do they support the modification of Cond Code. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196704 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:17:29 +00:00
Bill Wendling	f04a4d74b8	Merging r196456: ------------------------------------------------------------------------ r196456 \| jiangning \| 2013-12-04 18:12:01 -0800 (Wed, 04 Dec 2013) \| 2 lines For AArch64, add missing register cost calculation for big value types like v4i64 and v8i64. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196700 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:07:48 +00:00
Bill Wendling	488aab6df3	Merging r196362: ------------------------------------------------------------------------ r196362 \| kevinqin \| 2013-12-04 00:02:34 -0800 (Wed, 04 Dec 2013) \| 1 line [AArch64 Neon] Add ACLE intrinsic vceqz_f64. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196699 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:07:30 +00:00
Bill Wendling	4d919e4ec4	Merging r196360: ------------------------------------------------------------------------ r196360 \| kevinqin \| 2013-12-03 23:53:28 -0800 (Tue, 03 Dec 2013) \| 1 line [AArch64 NEON] Add missing compare intrinsics. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196697 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:07:01 +00:00
Bill Wendling	3e87fe7690	Merging r196208: ------------------------------------------------------------------------ r196208 \| haoliu \| 2013-12-02 21:58:30 -0800 (Mon, 02 Dec 2013) \| 3 lines AArch64: add missing ACLE intrinsics mapping to general arithmetic operation from VFP instructions. E.g. float64x1_t vadd_f64(float64x1_t a, float64x1_t b) -> FADD Dd, Dn, Dm. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196693 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:06:05 +00:00
Bill Wendling	180eb04182	Merging r196198: ------------------------------------------------------------------------ r196198 \| haoliu \| 2013-12-02 19:39:47 -0800 (Mon, 02 Dec 2013) \| 3 lines AArch64: Add missing scalar pair intrinsics. E.g. "float32_t vaddv_f32(float32x2_t a)" to be matched into "faddp s0, v1.2s". ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196691 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:05:35 +00:00
Bill Wendling	a72b30d8e8	Merging r196192: ------------------------------------------------------------------------ r196192 \| jiangning \| 2013-12-02 17:33:52 -0800 (Mon, 02 Dec 2013) \| 2 lines Add some missing pattern matches for AArch64 Neon intrinsics like vuqadd_s64 and friends. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196690 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:05:18 +00:00
Bill Wendling	9584d3222f	Merging r196190: ------------------------------------------------------------------------ r196190 \| jiangning \| 2013-12-02 17:29:32 -0800 (Mon, 02 Dec 2013) \| 2 lines Add some missing pattern matches for AArch64 Neon intrinsics like vmull_high_n_s16 and friends. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196688 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:04:47 +00:00
Bill Wendling	fccbdd27bc	Merging r196638: ------------------------------------------------------------------------ r196638 \| arsenm \| 2013-12-06 18:58:45 -0800 (Fri, 06 Dec 2013) \| 1 line Fix assert with copy from global through addrspacecast ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196668 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-07 21:24:29 +00:00
Bill Wendling	2990853ea8	Merging r196261: ------------------------------------------------------------------------ r196261 \| hliao \| 2013-12-03 01:17:32 -0800 (Tue, 03 Dec 2013) \| 13 lines Enhance the fix of PR17631 - The fix to PR17631 fixes part of the cases where 'vzeroupper' should not be issued before 'call' insn. There're other cases where helper calls will be inserted not limited to epilog. These helper calls do not follow the standard calling convention and won't clobber any YMM registers. (So far, all call conventions will clobber any or part of YMM registers.) This patch enhances the previous fix to cover more cases 'vzerosupper' should not be inserted by checking if that function call won't clobber any YMM registers and skipping it if so. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196652 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-07 09:39:35 +00:00
Bill Wendling	31928dfc03	Merging r196269: ------------------------------------------------------------------------ r196269 \| jamesm \| 2013-12-03 03:23:11 -0800 (Tue, 03 Dec 2013) \| 5 lines Addrspacecasts are no-ops on ARM. Testcase added. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196651 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-07 09:36:35 +00:00
Bill Wendling	b84d18f576	Merging r196294: ------------------------------------------------------------------------ r196294 \| arnolds \| 2013-12-03 08:33:06 -0800 (Tue, 03 Dec 2013) \| 7 lines opt: Mirror vectorization presets of clang clang enables vectorization at optimization levels > 1 and size level < 2. opt should behave similarily. Loop vectorization and SLP vectorization can be disabled with the flags -disable-(loop/slp)-vectorization. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196649 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-07 09:31:26 +00:00
Bill Wendling	7b7037563b	Merging r196611: ------------------------------------------------------------------------ r196611 \| dexonsmith \| 2013-12-06 13:48:36 -0800 (Fri, 06 Dec 2013) \| 5 lines Don't use isNullValue to evaluate ConstantExpr ConstantExpr can evaluate to false even when isNullValue gives false. Fixes PR18143. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196614 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-06 22:12:13 +00:00
Bill Wendling	7f6926930f	Merging r196508: ------------------------------------------------------------------------ r196508 \| arnolds \| 2013-12-05 07:14:40 -0800 (Thu, 05 Dec 2013) \| 12 lines SLPVectorizer: An in-tree vectorized entry cannot also be a scalar external use We were creating external uses for scalar values in MustGather entries that also had a ScalarToTreeEntry (they also are present in a vectorized tuple). This meant we would keep a value 'alive' as a scalar and vectorized causing havoc. This is not necessary because when we create a MustGather vector we explicitly create external uses entries for the insertelement instructions of the MustGather vector elements. Fixes PR18129. radar://15582184 ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196571 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-06 09:10:19 +00:00
Bill Wendling	aee5c3e105	Revert r191049 and r191059. They were causing failures. See PR17975. git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196521 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-05 18:29:11 +00:00
Richard Sandiford	2a2a323488	Merging r196267: ------------------------------------------------------------------------ r196267 \| rsandifo \| 2013-12-03 11:01:54 +0000 (Tue, 03 Dec 2013) \| 12 lines [SystemZ] Fix choice of known-zero mask in insertion optimization The backend converts 64-bit ORs into subreg moves if the upper 32 bits of one operand and the low 32 bits of the other are known to be zero. It then tries to peel away redundant ANDs from the upper 32 bits. Since AND masks are canonicalized to exclude known-zero bits, the test ORs the mask and the known-zero bits together before checking for redundancy. The problem was that it was using the wrong node when checking for known-zero bits, so could drop ANDs that were still needed. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196268 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-03 11:05:09 +00:00
Bill Wendling	38348240d1	Merging r196151: ------------------------------------------------------------------------ r196151 \| mcrosier \| 2013-12-02 13:05:16 -0800 (Mon, 02 Dec 2013) \| 2 lines [AArch64] Implemented vcopy_lane patterns using scalar DUP instruction. Patch by Ana Pazos! ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196230 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-03 07:38:30 +00:00
Bill Wendling	cdf67d5791	Merging r196104: ------------------------------------------------------------------------ r196104 \| rafael \| 2013-12-02 06:59:34 -0800 (Mon, 02 Dec 2013) \| 1 line Output .eh_frames on COFF too now that the integrated as is used on mingw. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196137 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-02 19:24:08 +00:00
Bill Wendling	21f315bc88	Merging r196129: ------------------------------------------------------------------------ r196129 \| kkhoo \| 2013-12-02 10:43:59 -0800 (Mon, 02 Dec 2013) \| 1 line Conservative fix for PR17827 - don't optimize a shift + and + compare sequence where the shift is logical unless the comparison is unsigned ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196132 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-02 19:14:12 +00:00
Bill Wendling	1b26fdbf1f	Merging r196046: ------------------------------------------------------------------------ r196046 \| tnorthover \| 2013-12-01 06:16:24 -0800 (Sun, 01 Dec 2013) \| 8 lines ARM: fix bug in -Oz stack adjustment folding Previously, we clobbered callee-saved registers when folding an "add sp, #N" into a "pop {rD, ...}" instruction. This change checks whether a register we're going to add to the "pop" could actually be live outside the function before doing so and should fix the issue. This should fix PR18081. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196074 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-02 07:38:06 +00:00
Daniel Sanders	102f231863	Merged r195973: ------------------------------------------------------------------------ r195973 \| dsanders \| 2013-11-30 13:47:57 +0000 (Sat, 30 Nov 2013) \| 5 lines [mips][msa] MSA loads and stores have a 10-bit offset. Account for this when lowering FrameIndex. This prevents the compiler from emitting invalid ld.[bhwd]'s and st.[bhwd]'s when the stack frame is between 512 and 32,768 bytes in size. ------------------------------------------------------------------------ Review of this commit by Matheus Almeida revealed that it is still possible to emit invalid code (when the offset is not a multiple of the element size). However, we agreed that this commit still represents an improvement since it fixes many cases that previously emitted invalid code, and does not cause any cases that previously emitted valid code to emit invalid code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196049 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 15:54:07 +00:00
Daniel Sanders	88fc0183be	Merged from r195975 and r195976. ------------------------------------------------------------------------ r195975 \| zjovanovic \| 2013-11-30 19:12:28 +0000 (Sat, 30 Nov 2013) \| 1 line Fixed issue with microMIPS long branch. ------------------------------------------------------------------------ r195976 \| zjovanovic \| 2013-11-30 19:13:15 +0000 (Sat, 30 Nov 2013) \| 1 line Test case for issue with microMIPS long branch. ------------------------------------------------------------------------ To expand on those commit messages: The immediate in a MIPS branch is multiplied by the instruction size before use as an offset. For many MIPS ISA's this is 4 bytes, but for microMIPS it is 2 bytes. This commit corrects the scale factor used for microMIPS so that attempts to use large offsets result in a valid sequence of instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196043 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 10:45:26 +00:00
Bill Wendling	d0cf77ad59	--- Reverse-merging r195823 into '.': U lib/MC/MCSectionCOFF.cpp U lib/CodeGen/TargetLoweringObjectFileImpl.cpp U test/MC/COFF/weak-symbol.ll U test/MC/COFF/tricky-names.ll G . --- Recording mergeinfo for reverse merge of r195823 into '.': G . git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196036 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 04:40:32 +00:00
Bill Wendling	243896adcf	Merging r195941: ------------------------------------------------------------------------ r195941 \| haoliu \| 2013-11-28 18:11:22 -0800 (Thu, 28 Nov 2013) \| 4 lines AArch64: The pattern match should check the range of the immediate value. Or we can generate some illegal instructions. E.g. shrn2 v0.4s, v1.2d, #35. The legal range should be in [1, 16]. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196033 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 04:38:36 +00:00
Bill Wendling	bc976c8d0f	Merging r195939: ------------------------------------------------------------------------ r195939 \| jiangning \| 2013-11-28 17:38:08 -0800 (Thu, 28 Nov 2013) \| 2 lines Add missing test case for bsl_f64 support of AArch64 NEON. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196031 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 04:38:07 +00:00
Bill Wendling	ffafab0196	Merging r195936: ------------------------------------------------------------------------ r195936 \| kevinqin \| 2013-11-28 17:29:16 -0800 (Thu, 28 Nov 2013) \| 1 line [AArch64 NEON]Fix a assertion failure when disassemble SHLL instruction. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196028 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 04:37:25 +00:00
Bill Wendling	ae38e1a9b4	Merging r195903: ------------------------------------------------------------------------ r195903 \| haoliu \| 2013-11-27 17:07:45 -0800 (Wed, 27 Nov 2013) \| 2 lines AArch64: Fix a bug about disassembling post-index load single element to 4 vectors ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196025 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 04:36:39 +00:00
Bill Wendling	06866a72b0	Merging r195677: ------------------------------------------------------------------------ r195677 \| dpeixott \| 2013-11-25 11:11:13 -0800 (Mon, 25 Nov 2013) \| 41 lines ARM integrated assembler generates incorrect nop opcode This patch fixes a bug in the assembler that was causing bad code to be emitted. When switching modes in an assembly file (e.g. arm to thumb mode) we would always emit the opcode from the original mode. Consider this small example: $ cat align.s .code 16 foo: add r0, r0 .align 3 add r0, r0 $ llvm-mc -triple armv7-none-linux align.s -filetype=obj -o t.o $ llvm-objdump -triple thumbv7 -d t.o Disassembly of section .text: foo: 0: 00 44 add r0, r0 2: 00 f0 20 e3 blx #4195904 6: 00 00 movs r0, r0 8: 00 44 add r0, r0 This shows that we have actually emitted an arm nop (e320f000) instead of a thumb nop. Unfortunately, this encodes to a thumb branch which causes bad things to happen when compiling assembly code with align directives. The fix is to notify the ARMAsmBackend when we switch mode. The MCMachOStreamer was already doing this correctly. This patch makes the same change for the MCElfStreamer. There is still a bug in the way nops are emitted for alignment because the MCAlignment fragment does not store the correct mode. The ARMAsmBackend will emit nops for the last mode it knew about. In the example above, we still generate an arm nop if we add a `.code 32` to the end of the file. PR18019 ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196001 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 03:19:10 +00:00
Bill Wendling	ef39d3e9d0	Merging r195881: ------------------------------------------------------------------------ r195881 \| tstellar \| 2013-11-27 13:23:39 -0800 (Wed, 27 Nov 2013) \| 3 lines R600: Expand vector FABS NOTE: This is a candidate for the 3.4 branch. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196000 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 03:15:22 +00:00

21787 Commits