llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-08-13 23:26:25 +00:00

Author	SHA1	Message	Date
Daniel Sanders	8ef79fa708	[mips] Add initial (experimental) MIPS-IV support. Summary: Adds the 'mips4' processor and a simple test of the ELF e_flags. Patch by David Chisnall His work was sponsored by: DARPA, AFRL I made one small change to the testcase so that it uses mips64-unknown-linux instead of mips4-unknown-linux. This patch indirectly adds FeatureCondMov to FeatureMips64. This is ok because it's supposed to be there anyway and it turns out that FeatureCondMov is not a predicate of any instructions at the moment (this is a bug that hasn't been noticed because there are no targets without the conditional move instructions yet). CC: theraven Differential Revision: http://llvm-reviews.chandlerc.com/D3244 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205530 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-03 12:13:36 +00:00
Zoran Jovanovic	60f5dfea66	MicroMIPS specific little endian fixup data byte ordering. Differential Revision: http://llvm-reviews.chandlerc.com/D3245 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205528 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-03 12:01:01 +00:00
Tim Northover	badb137729	ARM: expand atomic ldrex/strex loops in IR The previous situation where ATOMIC_LOAD_WHATEVER nodes were expanded at MachineInstr emission time had grown to be extremely large and involved, to account for the subtly different code needed for the various flavours (8/16/32/64 bit, cmpxchg/add/minmax). Moving this transformation into the IR clears up the code substantially, and makes future optimisations much easier: 1. an atomicrmw followed by using the new value can be more efficient. As an IR pass, simple CSE could handle this efficiently. 2. Making use of cmpxchg success/failure orderings only has to be done in one (simpler) place. 3. The common "cmpxchg; did we store?" idiom can be exposed to optimisation. I intend to gradually improve this situation within the ARM backend and make sure there are no hidden issues before moving the code out into CodeGen to be shared with (at least ARM64/AArch64, though I think PPC & Mips could benefit too). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205525 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-03 11:44:58 +00:00
Stepan Dyatkovskiy	37e5cfa4aa	PR19320: The trouble as in ARMAsmParser, in ParseInstruction method. It assumes that ARM::R12 + 1 == ARM::SP. It is wrong, since ARM::<Register> codes are generated by tablegen and actually could be any random numbers. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205524 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-03 11:29:15 +00:00
Silviu Baranga	3f11cd0d25	[ARM] When generating a vpaddl node the input lane type is not always the type of the add operation since extract_vector_elt can perform an extend operation. Get the input lane type from the vector on which we're performing the vpaddl operation on and extend or truncate it to the output type of the original add node. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205523 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-03 10:44:27 +00:00
Sasa Stankovic	10914379f6	[mips] Extend MipsMCExpr class to handle %higher(sym1 - sym2 + const) and %highest(sym1 - sym2 + const) relocations. Remove "ABS_" from VK_Mips_HI and VK_Mips_LO enums in MipsMCExpr, to be consistent with VK_Mips_HIGHER and VK_Mips_HIGHEST. This change also deletes test file test/MC/Mips/higher_highest.ll and moves its CHECK's to the new test file test/MC/Mips/higher-highest-addressing.s. The deleted file tests that R_MIPS_HIGHER and R_MIPS_HIGHEST relocations are emitted in the .o file. Since it uses -force-mips-long-branch option, it was created when MipsLongBranch's implementation was emitting R_MIPS_HIGHER and R_MIPS_HIGHEST relocations in the .o file. It was disabled when MipsLongBranch started to directly calculate offsets. Differential Revision: http://llvm-reviews.chandlerc.com/D3230 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205522 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-03 10:37:45 +00:00
Tim Northover	27d489f3b2	ARM64: always use i64 for the RHS of shift operations Switching between i32 and i64 based on the LHS type is a good idea in theory, but pre-legalisation uses i64 regardless of our choice, leading to potential ISel errors. Should fix PR19294. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205519 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-03 09:26:16 +00:00
Oliver Stannard	b8c20fdb2b	ARM: Use __STACK_LIMIT symbol for segmented stacks We cannot use STACK_LIMIT, as it is not reserved for the compiler by the C spec. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205516 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-03 08:45:16 +00:00
Tim Northover	b642eb5dbc	ARM64: don't generate __sincos_stret calls unless on MachO This should fix PR19314. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205514 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-03 07:06:13 +00:00
Lang Hames	d285beabff	[X86] As per suggestion from Craig Topper and Hal Finkel, override TargetInstrInfo::findCommutedOpIndices to enable VFMA*231 commutation, rather than abusing commuteInstruction. Thanks very much for the suggestion guys! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205489 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 23:57:49 +00:00
Hal Finkel	1fb3df7a2e	[PowerPC] Make PPCTTI::getMemoryOpCost call BasicTTI::getMemoryOpCost PPCTTI::getMemoryOpCost will now make use of BasicTTI::getMemoryOpCost to calculate the base cost of the memory access, and then adjust on top of that. There is no functionality change from this modification, but it will become important so that PPCTTI can take advantage of scalarization information for which BasicTTI::getMemoryOpCost will account in the near future. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205476 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 22:43:49 +00:00
Lang Hames	b1b4d08195	[X86] Make the VFMA*231 variants commutable and relax the alignment restrictions on FMA3 memory operands. FMA3 instructions are VEX encoded, so they can load from unaligned memory. Testcase to follow, along with related patch. <rdar://problem/16478629> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205472 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 22:06:16 +00:00
Juergen Ributzka	172e0ca8c5	Add comments and test case for [X86TTI] Make constant base pointers for GetElementPtr opaque (r204739). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205468 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 21:45:36 +00:00
Saleem Abdulrasool	396e5e328c	ARM: update subtarget information for Windows on ARM Update the subtarget information for Windows on ARM. This enables using the MC layer to target Windows on ARM. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205459 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 20:32:05 +00:00
Jim Grosbach	bc413d65a2	Make a few more range-based loops use explicit types. No functional change. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205458 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 20:21:22 +00:00
Tom Stellard	adb852ddf3	TargetLibraryInfo: Disable memcpy and memset on R600 There are no implementations of these for R600. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205455 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 19:53:29 +00:00
Jim Grosbach	bc07242d9b	Simplify resolveFrameIndex() signature. Just pass a MachineInstr reference rather than an MBB iterator. Creating a MachineInstr& is the first thing every implementation did anyway. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205453 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 19:28:18 +00:00
Jim Grosbach	acb6d9834a	ARM: cortex-m0 doesn't support unaligned memory access. Unlike other v6+ processors, cortex-m0 never supports unaligned accesses. From the v6m ARM ARM: "A3.2 Alignment support: ARMv6-M always generates a fault when an unaligned access occurs." rdar://16491560 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205452 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 19:28:13 +00:00
Jim Grosbach	b4e30b31e9	Make some range based loop types more explicit. No functional change, but more readable code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205451 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 19:28:08 +00:00
Kai Nacke	b96fc4a5ea	[mips] Add more Octeon cnMips instructions Adds the instructions ext/ext32/cins/cins32. It also changes pop/dpop to accept the two operand version and adds a simple pattern to generate baddu. Tests for the two operand versions (including baddu/dmul/dpop/pop) and the code generation pattern for baddu are included. Reviewed by: Daniel.Sanders@imgtec.com git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205449 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 18:40:43 +00:00
Jim Grosbach	6408bdcacd	[C++11,ARM64] Range based for and explicit 'override' in STP cleanup. No functional change intended. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205446 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 18:00:59 +00:00
Jim Grosbach	252303f4ad	[C++11,ARM64] Range based for loops in constant promotion. No functional change intended. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205445 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 18:00:56 +00:00
Jim Grosbach	72ca0bfa7f	[C++11,ARM64] Range based for loops in load/store pair optimizer. No functional change intended. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205444 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 18:00:53 +00:00
Jim Grosbach	8c60cf143e	[C++11,ARM64] Range based for loops in target lowering. No functional change intended. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205443 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 18:00:51 +00:00
Jim Grosbach	cbc64ac10e	[C++11,ARM64] Range based for loops in frame lowering. No functional change intended. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205442 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 18:00:49 +00:00
Jim Grosbach	7e2d11d345	[C++11,ARM64] Range based for loops in pseudo expansion. No functional change intended. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205441 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 18:00:46 +00:00
Jim Grosbach	4d59fd9cf0	[C++11,ARM64] Range based for loops for LOH No functional change intended. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205440 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 18:00:44 +00:00
Jim Grosbach	86ae767ea4	[C++11,ARM64] Range based for loops TLS cleanup. No functional change intended. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205439 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 18:00:41 +00:00
Jim Grosbach	5e2ab67a19	[C++11,ARM64] Range based for loops in branch relaxation. No functional change intended. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205438 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 18:00:39 +00:00
Jim Grosbach	68c150834b	[C++11,ARM64] Range based for loops in address type promotion. No functional change intended. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205437 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 18:00:36 +00:00
Quentin Colombet	a5084593ba	[ARM64][CollectLOH] Remove the link to the radar from the comments. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205435 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 16:40:49 +00:00
Oliver Stannard	af48fc4136	ARM: Add support for segmented stacks Patch by Alex Crichton, ILyoan, Luqman Aden and Svetoslav. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205430 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 16:10:33 +00:00
Tim Northover	6584d94610	ARM64: use GOT for weak symbols & PIC. Weak symbols cannot use the small code model's usual ADRP sequences since the instruction simply may not be able to encode a value of 0. This redirects them to use the GOT, which hopefully linkers are able to cope with even in the static relocation model. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205426 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 14:39:11 +00:00
Tim Northover	671c92d886	ARM64: fix lowering of fp128 fptosi/fptoui We were creating libcall nodes that returned an MVT::f128, when these particular operations actually return an int of some stripe. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205425 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 14:39:07 +00:00
Tim Northover	3844cadc9a	ARM64: make sure first argument to INSERT_SUBVECTOR has right type. Again, coalescing and other optimisations swiftly made the MachineInstrs consistent again, but when compiled at -O0 a bad INSERT_SUBREGISTER was produced. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205423 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 14:38:58 +00:00
Tim Northover	87e824120d	ARM64: convert fp16 narrowing ISel to pseudo-instruction The previous attempt was fine with optimisations, but was actually rather cavalier with its types. When compiled at -O0, it produced invalid COPY MachineInstrs. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205422 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 14:38:54 +00:00
Job Noorman	4e7ec2b053	Mark FPB as a reserved register when needed. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205421 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 13:13:56 +00:00
Renato Golin	421397ac00	Remove duplicated DMB instructions ARM specific optimiztion, finding places in ARM machine code where 2 dmbs follow one another, and eliminating one of them. Patch by Reinoud Elhorst. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205409 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 09:03:43 +00:00
Yaron Keren	9ee14e3522	Added isTargetWindowsMSVC(), renamed isTargetMingw() to isTargetWindowsGNU() and isTargetCygwin() to isTargetWindowsCygwin() to be consistent with the four Windows environments in Triple.h. Suggestion by Saleem Abdulrasool! git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205393 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 04:27:51 +00:00
Quentin Colombet	91b97ccd87	[ARM64][CollectLOH] Add some comments to explain how the LOHs framework works (for the compiler part), since the design document is not available. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205379 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-02 01:02:28 +00:00
Hal Finkel	4a6c0afc52	[PowerPC] Add some missing VSX bitcast patterns git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205352 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-01 19:24:27 +00:00
Yaron Keren	f2dc47ce99	If isKnownWindowsMSVCEnvironment then getOS == Triple::Win32 and Environment == Triple::MSVC so it will never be MinGW or Cygwin. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205349 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-01 18:52:55 +00:00
Hal Finkel	e30aa957e3	Implement X86TTI::getUnrollingPreferences This provides an initial implementation of getUnrollingPreferences for x86. getUnrollingPreferences is used by the generic (concatenation) unroller, which is distinct from the unrolling done by the loop vectorizer. Many modern x86 cores have some kind of uop cache and loop-stream detector (LSD) used to efficiently dispatch small loops, and taking full advantage of this requires unrolling small loops (small here means 10s of uops). These caches also have limits on the number of taken branches in the loop, and so we also cap the loop unrolling factor based on the maximum "depth" of the loop. This is currently calculated with a partial DFS traversal (partial because it will stop early if the path length grows too much). This is still an approximation, and one that is both conservative (because it does not account for branches eliminated via block placement) and optimistic (because it is only recording the maximum depth over minimum paths). Nevertheless, because the loops that fit in these uop caches are so small, it is not clear how much the details matter. The original set of patches posted for review produced the following test-suite performance results (from the TSVC benchmark) at that time: ControlLoops-dbl - 13% speedup ControlLoops-flt - 15% speedup Reductions-dbl - 7.5% speedup git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205348 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-01 18:50:34 +00:00
Kai Nacke	54c55edb0a	[mips] Add Octeon cnMips instructions mtmX and mtpX Adds the Octeon cnMips instructions "load multiplier register MPLx" and "load product register Px". Includes tests. Reviews by: Daniel.Sanders@imgtec.com git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205343 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-01 18:35:26 +00:00
Reid Kleckner	f319a2c3b3	Support segmented stacks on Win64 Identical to Win32 method except the GS segment register is used for TLS instead of FS and pvArbitrary is at TEB offset 0x28 instead of 0x14. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205342 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-01 18:34:21 +00:00
Yaron Keren	bec4ff5781	isTargetWindows() renamed to isTargetKnownWindowsMSVC() to reflect its current functionality. Based on Takumi NAKAMURA suggestion. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205338 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-01 18:15:34 +00:00
Christian Pirker	3a64202502	ARM: rename ARMle/ARMbe with ARMLE/ARMBE, and Thumble/Thumbbe with ThumbLE/ThumbBE git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205317 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-01 15:19:30 +00:00
Tim Northover	cb68a2e3ab	ARM: teach LLVM that Cortex-A7 is very similar to A8. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205314 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-01 14:10:07 +00:00
Aaron Ballman	77d76519dc	Attempting to fix r205124, which had failed asserts when built with MSVC. Suggestion from Yaron Keren. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205313 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-01 13:56:35 +00:00
Tim Northover	c077472250	ARM: add cyclone CPU with ZeroCycleZeroing feature. The Cyclone CPU is similar to swift for most LLVM purposes, but does have two preferred instructions for zeroing a VFP register. This teaches LLVM about them. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205309 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-01 13:22:02 +00:00

1 2 3 4 5 ...

28331 Commits