llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2025-02-24 12:29:33 +00:00

Author	SHA1	Message	Date
Matt Arsenault	cb1ac70623	R600: Add new functions for splitting vector loads and stores. These will be used in future patches and shouldn't change anything yet. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@213877 91177308-0d34-0410-b5e6-96231b3b80d8	2014-07-24 17:10:35 +00:00
Tom Stellard	3280804237	R600/SI: Use scratch memory for large private arrays git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@213551 91177308-0d34-0410-b5e6-96231b3b80d8	2014-07-21 15:45:01 +00:00
Tom Stellard	b664d47cb0	R600/SI: Store constant initializer data in constant memory This implements a solution for constant initializers suggested by Vadim Girlin, where we store the data after the shader code and then use the S_GETPC instruction to compute its address. This saves use the trouble of creating a new buffer for constant data and then having to pass the pointer to the kernel via user SGPRs or the input buffer. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@213530 91177308-0d34-0410-b5e6-96231b3b80d8	2014-07-21 14:01:14 +00:00
Matt Arsenault	5fbf09a69f	R600: Add dag combine for copy of an illegal type. This helps avoid redundant instructions to unpack, and repack the vectors. Ideally we could recognize that pattern and eliminate it. Currently v4i8 and other small element type vectors are scalarized, so this has the added bonus of avoiding that. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@213031 91177308-0d34-0410-b5e6-96231b3b80d8	2014-07-15 02:06:31 +00:00
Matt Arsenault	97fb702886	R600: Move mul combine to separate function git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@212052 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-30 17:55:48 +00:00
Aaron Ballman	2711c0a68b	Silencing a warning about isZExtFree hiding an inherited virtual function. No functional change intended. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211783 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-26 13:45:47 +00:00
Matt Arsenault	95eb45c5d9	R600: Fix inconsistency in rsq instructions. R600 was using a clamped version of rsq, but SI was not. Add a new rsq_clamped intrinsic and use them consistently. It's unclear to me from the documentation what behavior the R600 instructions have, so I assume they have the legacy behavior described by the SI documents. For R600, use RECIPSQRT_IEEE for both llvm.AMDGPU.rsq.legacy and llvm.AMDGPU.rsq. R600 also has RECIPSQRT_FF, which I'm not sure how it fits in here. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211637 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-24 22:13:39 +00:00
Matt Arsenault	a91ff54e43	R600: Remove DIV_INF This corresponded to an amdil instruction which there is a 2 instruction equivalent for. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211616 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-24 17:42:16 +00:00
Matt Arsenault	791c054391	R600: Remove AMDILISelLowering git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211519 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-23 18:00:55 +00:00
Jan Vesely	ddf2a7902a	R600: Use LowerSDIVREM for i64 node replace v2: move div/rem node replacement to R600ISelLowering make lowerSDIVREM protected Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211478 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-22 21:43:01 +00:00
Jan Vesely	cd88535ab9	R600: Implement custom SDIVREM. Instead of separate SDIV/SREM. SDIV used UDIV which in turn used UDIVREM anyway. SREM used SDIV(UDIV->UDIVREM)+MUL+SUB, using UDIVREM directly is more efficient. v2: Don't use all caps names Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211477 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-22 21:43:00 +00:00
Matt Arsenault	d9b35435b8	R600/SI: Add intrinsics for various math instructions. These will be used for custom lowering and for library implementations of various math functions, so it's useful to expose these as builtins. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211247 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-19 01:19:19 +00:00
Matt Arsenault	ce09bda96e	R600: Handle fnearbyint The difference from rint isn't really relevant here, so treat them as equivalent. OpenCL doesn't have nearbyint, so this is sort of pointless other than for completeness. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211229 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-18 22:03:45 +00:00
Matt Arsenault	2b6e6fc1a8	R600/SI: Add intrinsics for brev instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211187 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-18 17:13:57 +00:00
Matt Arsenault	debd831223	R600: Implement f64 ftrunc, ffloor and fceil. CI has instructions for these, so this fixes them for older hardware. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211183 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-18 17:05:30 +00:00
Matt Arsenault	a5395c03f0	R600: Custom lower f64 frint for pre-CI git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211182 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-18 17:05:26 +00:00
Tom Stellard	f56e7678d1	R600: Use LDS and vectors for private memory git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211110 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-17 16:53:14 +00:00
Matt Arsenault	62f6ab7a6d	R600: Move / cleanup more leftover AMDIL stuff. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@210998 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-15 20:23:38 +00:00
Matt Arsenault	57177e3361	R600: Move division custom lowering out of AMDILISelLowering git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@210997 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-15 20:08:02 +00:00
Matt Arsenault	36b9c7c872	R600: Remove dead code git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@210994 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-15 19:48:13 +00:00
Matt Arsenault	00c3986254	R600: Mostly remove remaining AMDIL intrinsics. Delete all unused ones, and add new AMDGPU named intrinsics for the ones that are. Handle the old AMDIL names for comptability (although remove their GCCBuiltin names) and add tests since there weren't any for these before. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@210827 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-12 21:15:44 +00:00
Matt Arsenault	8a9df8f92c	R600/SI: Use v_cvt_f32_ubyte* instructions This eliminates extra extract instructions when loading an i8 vector to a float vector. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@210666 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-11 17:50:44 +00:00
Matt Arsenault	e0162b9648	R600: Add helper functions. Extract these from some of my other patches, since this is the only thing really making them dependent on each other. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@210627 91177308-0d34-0410-b5e6-96231b3b80d8	2014-06-11 03:29:54 +00:00
Matt Arsenault	7e12b82625	R600: Implement ComputeNumSignBitsForTargetNode for BFE git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@209460 91177308-0d34-0410-b5e6-96231b3b80d8	2014-05-22 18:09:03 +00:00
Matt Arsenault	f49da4338a	R600: Add intrinsics for mad24 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@209456 91177308-0d34-0410-b5e6-96231b3b80d8	2014-05-22 18:00:15 +00:00
Matt Arsenault	f5d9170e67	Remove unused method declaration git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@209174 91177308-0d34-0410-b5e6-96231b3b80d8	2014-05-19 22:55:35 +00:00
Jay Foad	6b543713a2	Rename ComputeMaskedBits to computeKnownBits. "Masked" has been inappropriate since it lost its Mask parameter in r154011. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@208811 91177308-0d34-0410-b5e6-96231b3b80d8	2014-05-14 21:14:37 +00:00
Tom Stellard	87b983680c	R600: Move MIN/MAX matching from LowerOperation() to PerformDAGCombine() git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@208429 91177308-0d34-0410-b5e6-96231b3b80d8	2014-05-09 16:42:16 +00:00
Craig Topper	c279ae979e	[C++11] Add 'override' keywords and remove 'virtual'. Additionally add 'final' and leave 'virtual' on some methods that are marked virtual without overriding anything and have no obvious overrides themselves. R600 edition git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@207503 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-29 07:57:24 +00:00
Matt Arsenault	3682fdabef	R600: Emit error instead of unreachable on function call git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@206904 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-22 16:42:00 +00:00
Matt Arsenault	1b16515971	R600: Minor cleanups. Fix indentation, better line wrapping, unused includes. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@206562 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-18 07:40:20 +00:00
Matt Arsenault	d879166376	Move ExtractVectorElements to SelectionDAG. This seems generally useful, and makes sense to go along with SplitVector. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@206041 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-11 17:47:30 +00:00
Tom Stellard	5c9bb7119a	R600: Match 24-bit arithmetic patterns in a Target DAGCombine Moving these patterns from TableGen files to PerformDAGCombine() should allow us to generate better code by eliminating unnecessary shifts and extensions earlier. This also fixes a bug where the MAD pattern was calling SimplifyDemandedBits with a 24-bit mask on the first operand even when the full pattern wasn't being matched. This occasionally resulted in some instructions being incorrectly deleted from the program. v2: - Fix bug with 64-bit mul git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205731 91177308-0d34-0410-b5e6-96231b3b80d8	2014-04-07 19:45:41 +00:00
Matt Arsenault	894fa802f5	R600: Add target nodes for BFM and BFI git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@205235 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-31 18:21:13 +00:00
Matt Arsenault	0c6d96cf16	R600: Implement isZExtFree. This allows 64-bit operations that are truncated to be reduced to 32-bit ones. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@204946 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-27 17:23:31 +00:00
Matt Arsenault	94687c0f43	R600/SI: Fix unreachable with a sext_in_reg to an illegal type. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@204945 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-27 17:23:24 +00:00
Matt Arsenault	ab5382f5eb	R600: Move computeMaskedBitsForTargetNode out of AMDILISelLowering.cpp Remove handling of select_cc, since it makes no sense to be there. This now does nothing, but I'll be adding some handling of other target nodes soon. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@204743 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-25 18:18:27 +00:00
Matt Arsenault	6c199d8212	R600: Implement isNarrowingProfitable. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@204658 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-24 19:43:31 +00:00
Matt Arsenault	2683baa8ac	R600: Match sign_extend_inreg to BFE instructions git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@204072 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-17 18:58:11 +00:00
Craig Topper	629b96cb4f	Switch all uses of LLVM_OVERRIDE to just use 'override' directly. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@202621 91177308-0d34-0410-b5e6-96231b3b80d8	2014-03-02 09:09:27 +00:00
Matt Arsenault	bc247e4afd	R600/SI - Add new CI arithmetic instructions. Does not yet include larger part required to match v_mad_i64_i32 / v_mad_u64_u32. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@202077 91177308-0d34-0410-b5e6-96231b3b80d8	2014-02-24 21:01:28 +00:00
Benjamin Kramer	eee40f92a9	R600: Always implement both versions of isTruncateFree and add a sanity check. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@201222 91177308-0d34-0410-b5e6-96231b3b80d8	2014-02-12 10:17:54 +00:00
Matt Arsenault	700bba297b	R600: Implement isTruncateFree Truncation is just accessing a subregister for any multiple of the register size, so it's free. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@201107 91177308-0d34-0410-b5e6-96231b3b80d8	2014-02-10 19:57:42 +00:00
Tom Stellard	9c3e0ede1d	R600: Add support for global addresses with constant initializers git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@199825 91177308-0d34-0410-b5e6-96231b3b80d8	2014-01-22 19:24:21 +00:00
Tom Stellard	7dd37ae57a	R600/SI: Add support for i8 and i16 private loads/stores git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@199823 91177308-0d34-0410-b5e6-96231b3b80d8	2014-01-22 19:24:14 +00:00
Matt Arsenault	509a492442	Add target hook to prevent folding some bitcasted loads. This is to avoid this transformation in some cases: fold (conv (load x)) -> (load (conv*)x) On architectures that don't natively support some vector loads efficiently casting the load to a smaller vector of larger types and loading is more efficient. Patch by Micah Villmow. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@194783 91177308-0d34-0410-b5e6-96231b3b80d8	2013-11-15 04:42:23 +00:00
Tom Stellard	a2b4eb6d15	R600/SI: Add support for private address space load/store Private address space is emulated using the register file with MOVRELS and MOVRELD instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@194626 91177308-0d34-0410-b5e6-96231b3b80d8	2013-11-13 23:36:50 +00:00
Tom Stellard	aa1d078e7f	R600: Custom lower f32 = uint_to_fp i64 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@193701 91177308-0d34-0410-b5e6-96231b3b80d8	2013-10-30 17:22:05 +00:00
Tom Stellard	f95b162188	R600: Fix handling of vector kernel arguments The SelectionDAGBuilder was promoting vector kernel arguments to legal types, but this won't work for R600 and SI since kernel arguments are stored in memory and can't be promoted. In order to handle vector arguments correctly we need to look at the original types from the LLVM IR function. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@193215 91177308-0d34-0410-b5e6-96231b3b80d8	2013-10-23 00:44:32 +00:00
Tom Stellard	a3c2bcf0ee	R600/SI: expose TBUFFER_STORE_FORMAT_* for OpenGL transform feedback For _XYZ, the type of VDATA is v4i32, because v3i32 doesn't exist. The ADDR64 bit is not exposed. A simpler intrinsic that doesn't take a resource descriptor might be nicer. The maximum number of input SGPRs is bumped to 17. Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@190575 91177308-0d34-0410-b5e6-96231b3b80d8	2013-09-12 02:55:14 +00:00

1 2

76 Commits