llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2026-04-19 08:35:45 +00:00

Author	SHA1	Message	Date
Tom Stellard	c69aaa1a2c	Merging r202192: ------------------------------------------------------------------------ r202192 \| hfinkel \| 2014-02-25 15:51:50 -0500 (Tue, 25 Feb 2014) \| 5 lines Account for 128-bit integer operations in PPCCTRLoops We need to abort the formation of counter-register-based loops where there are 128-bit integer operations that might become function calls. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@205822 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-09 00:20:52 +00:00
Tom Stellard	4b87fb1a4b	Merging r200288: ------------------------------------------------------------------------ r200288 \| hfinkel \| 2014-01-28 00:32:58 -0500 (Tue, 28 Jan 2014) \| 5 lines Handle spilling the PPC GPRC_NOR0 register class GPRC_NOR0 is not a subclass of GPRC (because it also contains the ZERO pseudo register). As a result, we also need to check for it in the spilling code. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@205821 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-09 00:20:48 +00:00
Tom Stellard	da282fcf6a	Merging r199763: ------------------------------------------------------------------------ r199763 \| hfinkel \| 2014-01-21 15:15:58 -0500 (Tue, 21 Jan 2014) \| 9 lines Fix pointer info on PPC byval stores For PPC64 SVR (and Darwin), the stores that take byval aggregate parameters from registers into the stack frame had MachinePointerInfo objects with incorrect offsets. These offsets are relative to the object itself, not to the stack frame base. This fixes self hosting on PPC64 when compiling with -enable-aa-sched-mi. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@205819 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-09 00:20:42 +00:00
Tom Stellard	dcf215a733	Merging r200202: ------------------------------------------------------------------------ r200202 \| stpworld \| 2014-01-27 04:43:10 -0500 (Mon, 27 Jan 2014) \| 2 lines Additional fix for 200201: due to dependence on bitwidth test was moved to X86 directory. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@205807 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-08 21:47:15 +00:00
Tom Stellard	1fc026af4b	Merging r200201: ------------------------------------------------------------------------ r200201 \| stpworld \| 2014-01-27 04:18:31 -0500 (Mon, 27 Jan 2014) \| 31 lines Fix for PR18102. Issue outcomes from DAGCombiner::MergeConsequtiveStores, more precisely from mem-ops sequence sorting. Consider, how MergeConsequtiveStores works for next example: store i8 1, a[0] store i8 2, a[1] store i8 3, a[1] ; a[1] again. return ; DAG starts here 1. Method will collect all the 3 stores. 2. It sorts them by distance from the base pointer (farthest with highest index). 3. It takes first consecutive non-overlapping stores and (if possible) replaces them with a single store instruction. The point is, we can't determine here which 'store' instruction would be the second after sorting ('store 2' or 'store 3'). It happens that 'store 3' would be the second, and 'store 2' would be the third. So after merging we have the next result: store i16 (1 \| 3 << 8), base ; is a[0] but bit-casted to i16 store i8 2, a[1] So actually we swapped 'store 3' and 'store 2' and got wrong contents in a[1]. Fix: In sort routine just also take into account mem-op sequence number. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@205806 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-08 21:47:12 +00:00
Tom Stellard	89bdc4dd55	Merging r198744: ------------------------------------------------------------------------ r198744 \| iain \| 2014-01-08 05:22:54 -0500 (Wed, 08 Jan 2014) \| 8 lines [patch] Adjust behavior of FDE cross-section relocs for targets that don't support abs-differences. Modern versions of OSX/Darwin's ld (ld64 > 97.17) have an optimisation present that allows the back end to omit relocations (and replace them with an absolute difference) for FDE some text section refs. This patch allows a backend to opt-in to this behaviour by setting "DwarfFDESymbolsUseAbsDiff". At present, this is only enabled for modern x86 OSX ports. test changes by David Fang. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@205768 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-08 14:28:03 +00:00
Tom Stellard	aa3637aafb	Merging r197572: ------------------------------------------------------------------------ r197572 \| rafael.espindola \| 2013-12-18 09:35:37 -0500 (Wed, 18 Dec 2013) \| 6 lines One ppc32-darwin, a i64 inside a structure can have 32 bit alignment. Thanks for Iain Sandoe for testing this with the original gcc. Clang was already getting this right. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@205766 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-08 14:27:55 +00:00
Tom Stellard	9157d273c4	Merging r203818: ------------------------------------------------------------------------ r203818 \| thomas.stellard \| 2014-03-13 10:13:04 -0700 (Thu, 13 Mar 2014) \| 7 lines R600: LDS instructions shouldn't implicitly define OQAP LDS instructions are pseudo instructions which model the OQAP defs and uses within a single instruction. This fixes a hang in the opencv MedianFilter tests. git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@204650 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-24 18:21:44 +00:00
Tom Stellard	9d1cd8e868	Merging r201097: ------------------------------------------------------------------------ r201097 \| thomas.stellard \| 2014-02-10 08:58:30 -0800 (Mon, 10 Feb 2014) \| 9 lines R600/SI: Initialize M0 and emit S_WQM_B64 whenever DS instructions are used DS instructions that access local memory can only uses addresses that are less than or equal to the value of M0. When M0 is uninitialized, then we experience undefined behavior. This patch also changes the behavior to emit S_WQM_B64 on pixel shaders no matter what kind of DS instruction is used. git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@204648 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-24 18:21:41 +00:00
Tom Stellard	5f71aeed1a	Merging r200830: ------------------------------------------------------------------------ r200830 \| michel.daenzer \| 2014-02-05 01:48:05 -0800 (Wed, 05 Feb 2014) \| 8 lines R600/SI: Add pattern for zero-extending i1 to i32 Fixes opencl-example if_* tests with radeonsi. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=74469 Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@204646 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-24 18:21:38 +00:00
Tom Stellard	4ac0136bbe	Merging r200743: ------------------------------------------------------------------------ r200743 \| michel.daenzer \| 2014-02-03 23:12:38 -0800 (Mon, 03 Feb 2014) \| 11 lines R600/SI: Fix fneg for 0.0 V_ADD_F32 with source modifier does not produce -0.0 for this. Just manipulate the sign bit directly instead. Also add a pattern for (fneg (fabs ...)). Fixes a bunch of bit encoding piglit tests with radeonsi. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@204643 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-24 18:21:34 +00:00
Tom Stellard	3e1bcea5d0	Merging r200283: ------------------------------------------------------------------------ r200283 \| michel.daenzer \| 2014-01-27 19:01:16 -0800 (Mon, 27 Jan 2014) \| 6 lines R600/SI: Add pattern for truncating i32 to i1 Fixes half a dozen piglit tests with radeonsi. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@204642 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-24 18:21:32 +00:00
Tom Stellard	bcc1d80afb	Merging r199918: ------------------------------------------------------------------------ r199918 \| thomas.stellard \| 2014-01-23 10:49:33 -0800 (Thu, 23 Jan 2014) \| 8 lines R600: Disable the BFE pattern This pattern uses an SDNodeXForm, which isn't being emitted for some reason. I can get it to work by attaching the PatLeaf that has the XForm to the argument in the output pattern, but this results in an immediate being used in a register operand, which the backend can't handle yet. git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@204640 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-24 18:21:29 +00:00
Tom Stellard	6392836b79	Merging r199917: ------------------------------------------------------------------------ r199917 \| thomas.stellard \| 2014-01-23 10:49:31 -0800 (Thu, 23 Jan 2014) \| 6 lines R600: Correctly handle vertex fetch clauses the precede ENDIFs The control flow finalizer would sometimes use an ALU_POP_AFTER instruction before the vetex fetch clause instead of using a POP instruction after it. git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@204639 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-24 18:21:27 +00:00
Tom Stellard	cba1dff815	Merging r202336: ------------------------------------------------------------------------ r202336 \| michel.daenzer \| 2014-02-26 17:47:02 -0800 (Wed, 26 Feb 2014) \| 4 lines R600/SI: Allow SI_KILL for geometry shaders Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@204638 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-24 18:21:25 +00:00
Tom Stellard	2087afe423	Merging r200196: ------------------------------------------------------------------------ r200196 \| michel.daenzer \| 2014-01-26 23:20:51 -0800 (Sun, 26 Jan 2014) \| 4 lines R600/SI: Add intrinsic for BUFFER_LOAD_DWORD* instructions Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@204637 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-24 18:21:23 +00:00
Tom Stellard	4e50186588	Merging r200195: ------------------------------------------------------------------------ r200195 \| michel.daenzer \| 2014-01-26 23:20:44 -0800 (Sun, 26 Jan 2014) \| 4 lines R600/SI: Add intrinsic for S_SENDMSG instruction Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@204636 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-24 18:21:21 +00:00
Tom Stellard	0a98bd5bb8	Merging r197503, r197505, r197520: ------------------------------------------------------------------------ r197520 \| dexonsmith \| 2013-12-17 12:28:21 -0800 (Tue, 17 Dec 2013) \| 7 lines Assert that the last operand is actually EFLAGS This is another follow-up to r197503, after a post-commit review by Andy. <rdar://problem/15627766> ------------------------------------------------------------------------ r197505 \| dexonsmith \| 2013-12-17 08:20:37 -0800 (Tue, 17 Dec 2013) \| 6 lines Setting the CPU in the new vaargs test Trying to fix buildbots after r197503 (test passes locally). <rdar://problem/15627766> ------------------------------------------------------------------------ r197503 \| dexonsmith \| 2013-12-17 07:54:45 -0800 (Tue, 17 Dec 2013) \| 17 lines Revert "Revert "Mark vastart_save_xmm_regs as changing EFLAGS"" This reverts commit r197481, recommiting r197469 with an extra fix. The vastart_save_xmm_regs pseudo-instruction expands to a test and a branch, so it modifies EFLAGS. Mark it so, or else the scheduler might place it in the middle of another test+branch. This fixes a bug exposed by r192750, which changed the initial scheduler to source-order as part of enabling the MI Scheduler for X86. This re-commit changes the VASTART_SAVE_XMM_REGS custom inserter not to try to save %flags, and adds a test that catches the bad behavior of r197469. <rdar://problem/15627766> ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@202060 91177308-0d34-0410-b5e6-96231b3b80d8	2014-02-24 18:47:58 +00:00
Bill Wendling	2ca55e9ced	Merging r197492: ------------------------------------------------------------------------ r197492 \| dyatkovskiy \| 2013-12-17 04:07:33 -0800 (Tue, 17 Dec 2013) \| 26 lines Fix for PR18045: http://llvm.org/bugs/show_bug.cgi?id=18045 Short issue description: For X86 machines with sse < sse4.1 we got failures for some particular load/store vector sequences: $ clang-trunk -m32 -O2 test-case.c fatal error: error in backend: Cannot select: 0x4200920: v4i32,ch = load 0x41d6ab0, 0x4205850, 0x41dcb10<LD16[getelementptr inbounds ([4 x i32]* @e, i32 0, i32 0)](align=4)> [ORD=82] [ID=58] 0x4205850: i32 = X86ISD::Wrapper 0x41d5490 [ORD=26] [ID=43] 0x41d5490: i32 = TargetGlobalAddress<[4 x i32]* @e> 0 [ORD=26] [ID=23] 0x41dcb10: i32 = undef [ID=2] The reason is that EltsFromConsecutiveLoads could emit such load instruction both before and after legalize stage. Though this instruction is not legal for machines with SSSE3 and lower. The fix: In EltsFromConsecutiveLoads, if we have passed legalize stage, we check whether nodes it emits are legal. P.S.: If you get failure in time from 12:00 and till 22:00 (UTC-8), perhaps I'll slow with response, so you better reject this commit. Thanks! ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@197779 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-20 04:29:56 +00:00
Bill Wendling	e09cd8d42b	Merging r197228: ------------------------------------------------------------------------ r197228 \| d0k \| 2013-12-13 05:40:24 -0800 (Fri, 13 Dec 2013) \| 8 lines X86: When lowering shl_parts, don't emit shift amounts larger than the bit width. While it's safe for the X86-specific shift nodes, dag combining will kill generic nodes. Insert an AND to make it safe, isel will nuke it as x86's shift instructions have an implicit AND. Fixes PR16108, which contains a contraption to hit this case in between constant folders. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@197321 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-14 08:01:30 +00:00
Bill Wendling	b29de8ba00	Merging r197089: ------------------------------------------------------------------------ r197089 \| hfinkel \| 2013-12-11 15:12:25 -0800 (Wed, 11 Dec 2013) \| 6 lines Fix the PPC subsumes-predicate check For one predicate to subsume another, they must both check the same condition register. Failure to check this prerequisite was causing miscompiles. Fixes PR18003. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@197126 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-12 04:28:52 +00:00
Bill Wendling	b1eb9dd018	Merging r196858: ------------------------------------------------------------------------ r196858 \| nadav \| 2013-12-09 17:13:59 -0800 (Mon, 09 Dec 2013) \| 1 line Fix PR18162 - Incorrect assertion assumed that the SDValue resno is zero. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196886 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-10 06:42:24 +00:00
Bill Wendling	31985c7d2a	Merging r196806: ------------------------------------------------------------------------ r196806 \| apazos \| 2013-12-09 11:29:14 -0800 (Mon, 09 Dec 2013) \| 11 lines Fix pattern match for movi with 0D result Patch by Jiangning Liu. With some test case changes: - intrinsic test added to the existing /test/CodeGen/AArch64/neon-aba-abd.ll. - New test cases to cover movi 1D scenario without using the intrinsic in test/CodeGen/AArch64/neon-mov.ll. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196872 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-10 04:31:42 +00:00
Manman Ren	3533340399	Merging r195535: ------------------------------------------------------------------------ r195535 \| mren \| 2013-11-22 17:16:29 -0800 (Fri, 22 Nov 2013) \| 8 lines Debug Info: update testing cases to specify the debug info version number. We are going to drop debug info without a version number or with a different version number, to make sure we don't crash when we see bitcode files with different debug info metadata format. Make tests more robust by removing hard-coded metadata numbers in CHECK lines. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196817 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 21:01:06 +00:00
Manman Ren	41245b4e2a	Merging r195504: ------------------------------------------------------------------------ r195504 \| mren \| 2013-11-22 13:49:45 -0800 (Fri, 22 Nov 2013) \| 6 lines Debug Info: update testing cases to specify the debug info version number. We are going to drop debug info without a version number or with a different version number, to make sure we don't crash when we see bitcode files with different debug info metadata format. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196815 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 20:58:24 +00:00
Tim Northover	863c7b48a6	Merge rest of r196210. Some bits strayed into r196701, turning 3.4 red. This should fix the issue. ------------------------------------------------------------------------ r196210 \| haoliu \| 2013-12-03 06:06:55 +0000 (Tue, 03 Dec 2013) \| 3 lines [AArch64]Add missing floating point convert, round and misc intrinsics. E.g. int64x1_t vcvt_s64_f64(float64x1_t a) -> FCVTZS Dd, Dn ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196772 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 10:48:32 +00:00
Tim Northover	54ed08e250	Merge r196725 (conflicts on same API as before): ------------------------------------------------------------------------ r196725 \| tnorthover \| 2013-12-08 15:56:50 +0000 (Sun, 08 Dec 2013) \| 19 lines ARM: fix folding of stack-adjustment (yet again). When trying to eliminate an "sub sp, sp, #N" instruction by folding it into an existing push/pop using dummy registers, we need to account for the fact that this might affect precisely how "fp" gets set in the prologue. We were attempting this, but assuming that whenever we performed a fold it would make a difference. This is false, for example, in: push {r4, r7, lr} add fp, sp, #4 vpush {d8} sub sp, sp, #8 we can fold the "sub" into the "vpush", forming "vpush {d7, d8}". However, in that case the "add fp" instruction mustn't change, which we were getting wrong before. Should fix PR18160. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196769 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 09:05:30 +00:00
Bill Wendling	7d9c02dc62	Merging r196751: ------------------------------------------------------------------------ r196751 \| venkatra \| 2013-12-08 20:02:15 -0800 (Sun, 08 Dec 2013) \| 3 lines [Sparc]: Implement getSetCCResultType() in SparcTargetLowering so that umulo/smulo can be lowered on sparcv9 without an assertion error. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196766 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 08:56:18 +00:00
Bill Wendling	571a02f291	Merging r196755: ------------------------------------------------------------------------ r196755 \| venkatra \| 2013-12-08 21:13:25 -0800 (Sun, 08 Dec 2013) \| 2 lines [SPARCV9]: Adjust the resultant pointer of DYNAMIC_STACKALLOC with the stack BIAS on sparcV9. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196764 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 08:55:55 +00:00
Bill Wendling	f9a98aeb5b	Merging r196735: ------------------------------------------------------------------------ r196735 \| venkatra \| 2013-12-08 14:06:07 -0800 (Sun, 08 Dec 2013) \| 3 lines [SparcV9]: Expand MULHU/MULHS:i64 and UMUL_LOHI/SMUL_LOHI:i64 on sparcv9. This fixes PR18150. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196744 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-09 01:54:36 +00:00
Tim Northover	e8098892f5	Merging r196493. Simple conflict due to change API of updated function. git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196717 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 08:12:20 +00:00
Bill Wendling	2bdc0dd2db	Merging r196588: ------------------------------------------------------------------------ r196588 \| weimingz \| 2013-12-06 09:56:48 -0800 (Fri, 06 Dec 2013) \| 7 lines Bug 18149: [AArch32] VSel instructions has no ARMCC field The current peephole optimizing for compare inst assumes an instr that uses CPSR has an MO for ARM Cond code.However, for VSEL instructions (vseqeq, vselgt, vselgt, vselvs), there is no such operand nor do they support the modification of Cond Code. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196704 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:17:29 +00:00
Bill Wendling	f04a4d74b8	Merging r196456: ------------------------------------------------------------------------ r196456 \| jiangning \| 2013-12-04 18:12:01 -0800 (Wed, 04 Dec 2013) \| 2 lines For AArch64, add missing register cost calculation for big value types like v4i64 and v8i64. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196700 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:07:48 +00:00
Bill Wendling	488aab6df3	Merging r196362: ------------------------------------------------------------------------ r196362 \| kevinqin \| 2013-12-04 00:02:34 -0800 (Wed, 04 Dec 2013) \| 1 line [AArch64 Neon] Add ACLE intrinsic vceqz_f64. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196699 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:07:30 +00:00
Bill Wendling	4d919e4ec4	Merging r196360: ------------------------------------------------------------------------ r196360 \| kevinqin \| 2013-12-03 23:53:28 -0800 (Tue, 03 Dec 2013) \| 1 line [AArch64 NEON] Add missing compare intrinsics. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196697 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:07:01 +00:00
Bill Wendling	3e87fe7690	Merging r196208: ------------------------------------------------------------------------ r196208 \| haoliu \| 2013-12-02 21:58:30 -0800 (Mon, 02 Dec 2013) \| 3 lines AArch64: add missing ACLE intrinsics mapping to general arithmetic operation from VFP instructions. E.g. float64x1_t vadd_f64(float64x1_t a, float64x1_t b) -> FADD Dd, Dn, Dm. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196693 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:06:05 +00:00
Bill Wendling	180eb04182	Merging r196198: ------------------------------------------------------------------------ r196198 \| haoliu \| 2013-12-02 19:39:47 -0800 (Mon, 02 Dec 2013) \| 3 lines AArch64: Add missing scalar pair intrinsics. E.g. "float32_t vaddv_f32(float32x2_t a)" to be matched into "faddp s0, v1.2s". ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196691 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:05:35 +00:00
Bill Wendling	a72b30d8e8	Merging r196192: ------------------------------------------------------------------------ r196192 \| jiangning \| 2013-12-02 17:33:52 -0800 (Mon, 02 Dec 2013) \| 2 lines Add some missing pattern matches for AArch64 Neon intrinsics like vuqadd_s64 and friends. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196690 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:05:18 +00:00
Bill Wendling	9584d3222f	Merging r196190: ------------------------------------------------------------------------ r196190 \| jiangning \| 2013-12-02 17:29:32 -0800 (Mon, 02 Dec 2013) \| 2 lines Add some missing pattern matches for AArch64 Neon intrinsics like vmull_high_n_s16 and friends. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196688 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-08 00:04:47 +00:00
Bill Wendling	2990853ea8	Merging r196261: ------------------------------------------------------------------------ r196261 \| hliao \| 2013-12-03 01:17:32 -0800 (Tue, 03 Dec 2013) \| 13 lines Enhance the fix of PR17631 - The fix to PR17631 fixes part of the cases where 'vzeroupper' should not be issued before 'call' insn. There're other cases where helper calls will be inserted not limited to epilog. These helper calls do not follow the standard calling convention and won't clobber any YMM registers. (So far, all call conventions will clobber any or part of YMM registers.) This patch enhances the previous fix to cover more cases 'vzerosupper' should not be inserted by checking if that function call won't clobber any YMM registers and skipping it if so. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196652 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-07 09:39:35 +00:00
Bill Wendling	31928dfc03	Merging r196269: ------------------------------------------------------------------------ r196269 \| jamesm \| 2013-12-03 03:23:11 -0800 (Tue, 03 Dec 2013) \| 5 lines Addrspacecasts are no-ops on ARM. Testcase added. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196651 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-07 09:36:35 +00:00
Bill Wendling	aee5c3e105	Revert r191049 and r191059. They were causing failures. See PR17975. git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196521 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-05 18:29:11 +00:00
Richard Sandiford	2a2a323488	Merging r196267: ------------------------------------------------------------------------ r196267 \| rsandifo \| 2013-12-03 11:01:54 +0000 (Tue, 03 Dec 2013) \| 12 lines [SystemZ] Fix choice of known-zero mask in insertion optimization The backend converts 64-bit ORs into subreg moves if the upper 32 bits of one operand and the low 32 bits of the other are known to be zero. It then tries to peel away redundant ANDs from the upper 32 bits. Since AND masks are canonicalized to exclude known-zero bits, the test ORs the mask and the known-zero bits together before checking for redundancy. The problem was that it was using the wrong node when checking for known-zero bits, so could drop ANDs that were still needed. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196268 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-03 11:05:09 +00:00
Bill Wendling	38348240d1	Merging r196151: ------------------------------------------------------------------------ r196151 \| mcrosier \| 2013-12-02 13:05:16 -0800 (Mon, 02 Dec 2013) \| 2 lines [AArch64] Implemented vcopy_lane patterns using scalar DUP instruction. Patch by Ana Pazos! ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196230 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-03 07:38:30 +00:00
Bill Wendling	1b26fdbf1f	Merging r196046: ------------------------------------------------------------------------ r196046 \| tnorthover \| 2013-12-01 06:16:24 -0800 (Sun, 01 Dec 2013) \| 8 lines ARM: fix bug in -Oz stack adjustment folding Previously, we clobbered callee-saved registers when folding an "add sp, #N" into a "pop {rD, ...}" instruction. This change checks whether a register we're going to add to the "pop" could actually be live outside the function before doing so and should fix the issue. This should fix PR18081. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196074 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-02 07:38:06 +00:00
Daniel Sanders	102f231863	Merged r195973: ------------------------------------------------------------------------ r195973 \| dsanders \| 2013-11-30 13:47:57 +0000 (Sat, 30 Nov 2013) \| 5 lines [mips][msa] MSA loads and stores have a 10-bit offset. Account for this when lowering FrameIndex. This prevents the compiler from emitting invalid ld.[bhwd]'s and st.[bhwd]'s when the stack frame is between 512 and 32,768 bytes in size. ------------------------------------------------------------------------ Review of this commit by Matheus Almeida revealed that it is still possible to emit invalid code (when the offset is not a multiple of the element size). However, we agreed that this commit still represents an improvement since it fixes many cases that previously emitted invalid code, and does not cause any cases that previously emitted valid code to emit invalid code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196049 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 15:54:07 +00:00
Bill Wendling	243896adcf	Merging r195941: ------------------------------------------------------------------------ r195941 \| haoliu \| 2013-11-28 18:11:22 -0800 (Thu, 28 Nov 2013) \| 4 lines AArch64: The pattern match should check the range of the immediate value. Or we can generate some illegal instructions. E.g. shrn2 v0.4s, v1.2d, #35. The legal range should be in [1, 16]. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196033 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 04:38:36 +00:00
Bill Wendling	bc976c8d0f	Merging r195939: ------------------------------------------------------------------------ r195939 \| jiangning \| 2013-11-28 17:38:08 -0800 (Thu, 28 Nov 2013) \| 2 lines Add missing test case for bsl_f64 support of AArch64 NEON. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196031 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 04:38:07 +00:00
Bill Wendling	ef39d3e9d0	Merging r195881: ------------------------------------------------------------------------ r195881 \| tstellar \| 2013-11-27 13:23:39 -0800 (Wed, 27 Nov 2013) \| 3 lines R600: Expand vector FABS NOTE: This is a candidate for the 3.4 branch. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@196000 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 03:15:22 +00:00
Bill Wendling	08885c6758	Merging r195879: ------------------------------------------------------------------------ r195879 \| tstellar \| 2013-11-27 13:23:29 -0800 (Wed, 27 Nov 2013) \| 6 lines R600/SI: Use SGPR_32 register class for 32-bit SMRD outputs Writing to the M0 register from an SMRD instruction hangs the GPU, so we need to use the SGPR_32 register class, which does not include M0. NOTE: This is a candidate for the 3.4 branch. ------------------------------------------------------------------------ git-svn-id: https://llvm.org/svn/llvm-project/llvm/branches/release_34@195999 91177308-0d34-0410-b5e6-96231b3b80d8	2013-12-01 03:14:50 +00:00

1 2 3 4 5 ...

9495 Commits