llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-02-25 03:30:37 +00:00

Author	SHA1	Message	Date
Benjamin Kramer	1a50a12b43	Prefer SmallVector::append/insert over push_back loops. Same functionality, but hoists the vector growth out of the loop. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@229500 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-17 15:29:18 +00:00
Andrew Trick	4f7d60c1ea	AArch64: Safely handle the incoming sret call argument. This adds a safe interface to the machine independent InputArg struct for accessing the index of the original (IR-level) argument. When a non-native return type is lowered, we generate the hidden machine-level sret argument on-the-fly. Before this fix, we were representing this argument as OrigArgIndex == 0, which is an outright lie. In particular this crashed in the AArch64 backend where we actually try to access the type of the original argument. Now we use a sentinel value for machine arguments that have no original argument index. AArch64, ARM, Mips, and PPC now check for this case before accessing the original argument. Fixes <rdar://19792160> Null pointer assertion in AArch64TargetLowering git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@229413 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-16 18:10:47 +00:00
Matt Arsenault	8a44761afe	R600/SI: Implement correct f64 fdiv This version passes the OpenCL conformance test. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@229239 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-14 04:30:08 +00:00
Matt Arsenault	1751616522	R600/SI: Allow f64 inline immediates in i64 operands This requires considering the size of the operand when checking immediate legality. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@229135 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-13 19:05:03 +00:00
Tom Stellard	6378a7cb0b	R600/SI: Add soffset operand to mubuf addr64 instruction We were previously hard-coding soffset to 0. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@228775 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-11 00:34:32 +00:00
Tom Stellard	89c96b1cd0	R600/SI: Expand misaligned 16-bit memory accesses git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@228190 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-04 20:49:52 +00:00
Tom Stellard	fd4c349de2	R600/SI: Make more store operations legal v2i32, i32, trunc i32 to i16, and truc i32 to i8 stores are legal for all address spaces. We had marked them as custom in order to lower them for the private address space, but this is no longer necessary. This enables lowering of misaligned stores of these types in the DAGLegalizer. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@228189 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-04 20:49:51 +00:00
Tom Stellard	d73d1062fe	R600/SI: 64-bit and larger memory access must be at least 4-byte aligned This is true for SI only. CI+ supports unaligned memory accesses, but this requires driver support, so for now we disallow unaligned accesses for all GCN targets. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@227822 91177308-0d34-0410-b5e6-96231b3b80d8	2015-02-02 18:02:28 +00:00
Eric Christopher	0def30471a	Reuse a bunch of cached subtargets and remove getSubtarget calls without a Function argument. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@227638 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-30 23:24:40 +00:00
Matt Arsenault	fa711758df	R600/SI: Implement enableAggressiveFMAFusion Add tests for the various combines. This should always be at least cycle neutral on all subtargets for f64, and faster on some. For f32 we should prefer selecting v_mad_f32 over v_fma_f32. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@227484 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-29 19:34:32 +00:00
Matt Arsenault	c416d94735	R600/SI: Add subtarget feature for if f32 fma is fast git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@227483 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-29 19:34:25 +00:00
Tom Stellard	46846844ee	R600/SI: Add subtarget feature to enable VGPR spilling for all shader types This is disabled by default, but can be enabled with the subtarget feature: 'vgpr-spilling' git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@226597 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-20 19:33:04 +00:00
Matt Arsenault	781f7ee502	R600/SI: Fix bad code with unaligned byte vector loads Don't do the v4i8 -> v4f32 combine if the load will need to be expanded due to alignment. This stops adding instructions to repack into a single register that the v_cvt_ubyteN_f32 instructions read. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@225926 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-14 01:35:22 +00:00
Matt Arsenault	8b6a26ca85	Implement new way of expanding extloads. Now that the source and destination types can be specified, allow doing an expansion that doesn't use an EXTLOAD of the result type. Try to do a legal extload to an intermediate type and extend that if possible. This generalizes the special case custom lowering of extloads R600 has been using to work around this problem. This also happens to fix a bug that would incorrectly use more aligned loads than should be used. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@225925 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-14 01:35:17 +00:00
Tom Stellard	d90e5063ca	R600/SI: Add pattern for bitcasting fp immediates to integers The backend now assumes that all immediates are integers. This allows us to simplify immediate handling code, becasue we no longer need to handle fp and integer immediates differently. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@225844 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-13 22:59:41 +00:00
Matt Arsenault	549b6dbbb7	R600/SI: Remove redundant setting expand on f64 vectors None of these are legal types already, so they default to Expand. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@225728 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-12 23:13:00 +00:00
Tom Stellard	d275e025d2	R600/SI: Use RegisterOperands to specify which operands can accept immediates There are some operands which can take either immediates or registers and we were previously using different register class to distinguish between operands that could take immediates and those that could not. This patch switches to using RegisterOperands which should simplify the backend by reducing the number of register classes and also make it easier to implement the assembler. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@225662 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-12 19:33:18 +00:00
Tom Stellard	9a6e4f08fe	R600/SI: Remove SIISelLowering::legalizeOperands() Its functionality has been replaced by calling SIInstrInfo::legalizeOperands() from SIISelLowering::AdjstInstrPostInstrSelection() and running the SIFoldOperands and SIShrinkInstructions passes. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@225445 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-08 15:08:17 +00:00
Ahmed Bougacha	7fac1d945f	[SelectionDAG] Allow targets to specify legality of extloads' result type (in addition to the memory type). The LoadExt legalization handling used to only have one type, the memory type. This forced users to assume that as long as the extload for the memory type was declared legal, and the result type was legal, the whole extload was legal. However, this isn't always the case. For instance, on X86, with AVX, this is legal: v4i32 load, zext from v4i8 but this isn't: v4i64 load, zext from v4i8 Whereas v4i64 is (arguably) legal, even without AVX2. Note that the same thing was done a while ago for truncstores (r46140), but I assume no one needed it yet for extloads, so here we go. Calls to getLoadExtAction were changed to add the value type, found manually in the surrounding code. Calls to setLoadExtAction were mechanically changed, by wrapping the call in a loop, to match previous behavior. The loop iterates over the MVT subrange corresponding to the memory type (FP vectors, etc...). I also pulled neighboring setTruncStoreActions into some of the loops; those shouldn't make a difference, as the additional types are illegal. (e.g., i128->i1 truncstores on PPC.) No functional change intended. Differential Revision: http://reviews.llvm.org/D6532 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@225421 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-08 00:51:32 +00:00
Tom Stellard	e729f0f935	R600/SI: Remove VReg_32 register class Use VGPR_32 register class instead. These two register classes were identical and having separate classes was causing SIInstrInfo::isLegalOperands() to be overly conservative in some cases. This change is necessary to prevent future paches from missing a folding opportunity in fneg-fabs.ll. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@225382 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-07 20:59:25 +00:00
Matt Arsenault	6a72b20325	R600/SI: Add combine for isinfinite pattern git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@225310 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-06 23:00:46 +00:00
Matt Arsenault	42d9f7cf0a	R600/SI: Pattern match isinf to v_cmp_class instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@225307 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-06 23:00:41 +00:00
Matt Arsenault	a5b2b64292	R600/SI: Add basic DAG combines for fp_class git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@225306 91177308-0d34-0410-b5e6-96231b3b80d8	2015-01-06 23:00:39 +00:00
Matt Arsenault	d796cf2e01	Enable (sext x) == C --> x == (trunc C) combine Extend the existing code which handles this for zext. This makes this more useful for targets with ZeroOrNegativeOne BooleanContent and obsoletes a custom combine SI uses for i1 setcc (sext(i1), 0, setne) since the constant will now be shrunk to i1. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@224691 91177308-0d34-0410-b5e6-96231b3b80d8	2014-12-21 16:48:42 +00:00
Matt Arsenault	aa14ffddcf	R600/SI: Fix f64 inline immediates git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@224458 91177308-0d34-0410-b5e6-96231b3b80d8	2014-12-17 21:04:08 +00:00
Matt Arsenault	3d1ca355c4	R600/SI: Don't promote f32 select to i32 This is nice for the instruction patterns, but it complicates min / max matching. The select doesn't have the correct type and would require looking through the bitcasts for the real float operands. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@224092 91177308-0d34-0410-b5e6-96231b3b80d8	2014-12-12 02:30:29 +00:00
Matt Arsenault	29ae5b8a8c	R600/SI: Use unordered equal instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@224067 91177308-0d34-0410-b5e6-96231b3b80d8	2014-12-11 22:15:43 +00:00
Matt Arsenault	e5bd584683	R600/SI: Make more unordered comparisons legal This saves a second compare and an and / or by using the unordered comparison instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@224066 91177308-0d34-0410-b5e6-96231b3b80d8	2014-12-11 22:15:39 +00:00
Matt Arsenault	8651adfe4f	R600/SI: Use unordered not equal instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@224065 91177308-0d34-0410-b5e6-96231b3b80d8	2014-12-11 22:15:35 +00:00
Marek Olsak	a98fd8b640	R600/SI: Use getTargetConstant in AdjustRegClass git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@223940 91177308-0d34-0410-b5e6-96231b3b80d8	2014-12-10 19:25:31 +00:00
Marek Olsak	eca8933d58	R600/SI: Set 20-bit immediate byte offset for SMRD on VI git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@223614 91177308-0d34-0410-b5e6-96231b3b80d8	2014-12-07 17:17:38 +00:00
Tom Stellard	46c07c3dd8	R600/SI: Set correct number of user sgprs for HSA runtime We don't support scratch buffers yet with HSA. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@223130 91177308-0d34-0410-b5e6-96231b3b80d8	2014-12-02 17:41:43 +00:00
Tom Stellard	15e1919a76	R600/SI: Set the ATC bit on all resource descriptors for the HSA runtime git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@223125 91177308-0d34-0410-b5e6-96231b3b80d8	2014-12-02 17:05:41 +00:00
Matt Arsenault	6e5862cb8c	R600/SI: Fix assertion on sign extend of 3 vectors This was trying to create an MVT with 3x vectors which created an invalid EVT git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@222942 91177308-0d34-0410-b5e6-96231b3b80d8	2014-11-28 22:51:38 +00:00
Tom Stellard	573630a020	R600/SI: Emit s_mov_b32 m0, -1 before every DS instruction This s_mov_b32 will write to a virtual register from the M0Reg class and all the ds instructions now take an extra M0Reg explicit argument. This change is necessary to prevent issues with the scheduler mixing together instructions that expect different values in the m0 registers. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@222583 91177308-0d34-0410-b5e6-96231b3b80d8	2014-11-21 22:31:44 +00:00
Tom Stellard	891e9e7869	R600/SI: Make sure resource descriptors are always stored in SGPRs git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@222253 91177308-0d34-0410-b5e6-96231b3b80d8	2014-11-18 20:39:39 +00:00
Craig Topper	56391ddf5d	Convert some EVTs to MVTs where only a SimpleValueType is needed. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@222109 91177308-0d34-0410-b5e6-96231b3b80d8	2014-11-16 21:17:18 +00:00
Matt Arsenault	24e874a1dd	R600/SI: Combine min3/max3 instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@222032 91177308-0d34-0410-b5e6-96231b3b80d8	2014-11-14 20:08:52 +00:00
Matt Arsenault	8fd3b90c3f	R600/SI: Use S_BFE_I64 for 64-bit sext_inreg git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@222012 91177308-0d34-0410-b5e6-96231b3b80d8	2014-11-14 18:18:16 +00:00
Matt Arsenault	b44e43623d	R600/SI: Get rid of FCLAMP_SI pseudo It's not necessary. Also use complex patterns to allow src modifier usage. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@221916 91177308-0d34-0410-b5e6-96231b3b80d8	2014-11-13 19:49:04 +00:00
Matt Arsenault	d8586375b8	R600/SI: Move all rsrc building functions to SIISelLowering git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@221383 91177308-0d34-0410-b5e6-96231b3b80d8	2014-11-05 19:01:19 +00:00
Matt Arsenault	12bd9f11c0	R600/SI: Remove SI_ADDR64_RSRC git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@221382 91177308-0d34-0410-b5e6-96231b3b80d8	2014-11-05 19:01:17 +00:00
Matt Arsenault	37b154c175	R600/SI: Use REG_SEQUENCE instead of INSERT_SUBREGs git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@221118 91177308-0d34-0410-b5e6-96231b3b80d8	2014-11-02 23:46:54 +00:00
Matt Arsenault	015776f38c	Add minnum / maxnum codegen git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@220342 91177308-0d34-0410-b5e6-96231b3b80d8	2014-10-21 23:01:01 +00:00
Matt Arsenault	8287a4b564	R600/SI: Add pattern for bswap git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@220304 91177308-0d34-0410-b5e6-96231b3b80d8	2014-10-21 16:25:08 +00:00
Matt Arsenault	46b53c9e4b	R600/SI: Remove SI_BUFFER_RSRC pseudo Just use REG_SEQUENCE directly, so there are fewer instructions to need to deal with later. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@220056 91177308-0d34-0410-b5e6-96231b3b80d8	2014-10-17 17:42:56 +00:00
Jan Vesely	d6315ea5a5	Reapply "R600: Add new intrinsic to read work dimensions" This effectively reverts revert 219707. After fixing the test to work with new function name format and renamed intrinsic. Reviewed-by: Tom Stellard <tom@stellard.net> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@219710 91177308-0d34-0410-b5e6-96231b3b80d8	2014-10-14 20:05:26 +00:00
Rafael Espindola	e8e8db7ff6	Revert "R600: Add new intrinsic to read work dimensions" This reverts commit r219705. CodeGen/R600/work-item-intrinsics.ll was failing on linux. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@219707 91177308-0d34-0410-b5e6-96231b3b80d8	2014-10-14 18:58:04 +00:00
Jan Vesely	6a529850f3	R600: Add new intrinsic to read work dimensions v2: Add SI lowering Add test v3: Place work dimensions after the kernel arguments. v4: Calculate offset while lowering arguments v5: rebase v6: change prefix to AMDGPU Reviewed-by: Tom Stellard <tom@stellard.net> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@219705 91177308-0d34-0410-b5e6-96231b3b80d8	2014-10-14 18:52:07 +00:00
Tom Stellard	a8b2e6f4af	R600/SI: Legalize CopyToReg during instruction selection The instruction emitter will crash if it encounters a CopyToReg node with a non-register operand like FrameIndex. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@219428 91177308-0d34-0410-b5e6-96231b3b80d8	2014-10-09 19:06:00 +00:00

1 2 3 4 5 ...

260 Commits