ORCA-C

Commit Graph

Author	SHA1	Message	Date
Stephen Heumann	b43036409e	Add a new optimize flag for FP math optimizations that break IEEE rules. There were several existing optimizations that could change behavior in ways that violated the IEEE standard with regard to infinities, NaNs, or signed zeros. They are now gated behind a new #pragma optimize flag. This change allows intermediate code peephole optimization and common subexpression elimination to be used while maintaining IEEE conformance, but also keeps the rule-breaking optimizations available if desired. See section F.9.2 of recent C standards for a discussion of how these optimizations violate IEEE rules.	2021-11-29 20:31:15 -06:00
Stephen Heumann	fc515108f4	Make floating-point casts reduce the range and precision of numbers. The C standards generally allow floating-point operations to be done with extra range and precision, but they require that explicit casts convert to the actual type specified. ORCA/C was not previously doing that. This patch relies on some new library routines (currently in ORCALib) to do this precision reduction. This fixes #64.	2021-03-06 22:28:39 -06:00
Stephen Heumann	c0727315e0	Recognize byte swapping and generate an xba instruction for it. Specifically, this recognizes the pattern "(exp << 8) | (exp >> 8)", where exp has an unsigned 16-bit type and does not have side effects.	2021-03-05 22:00:13 -06:00
Stephen Heumann	4ad7a65de6	Process floating-point values within the compiler using the extended type. This means that floating-point constants can now have the range and precision of the extended type (aka long double), and floating-point constant expressions evaluated within the compiler also have that same range and precision (matching expressions evaluated at run time). This new behavior is intended to match the behavior specified in the C99 and later standards for FLT_EVAL_METHOD 2. This fixes the previous problem where long double constants and constant expressions of type long double were not represented and evaluated with the full range and precision that they should be. It also gives extra range and precision to constants and constant expressions of type double or float. This may have pluses and minuses, but at any rate it is consistent with the existing behavior for expressions evaluated at run time, and with one of the possible models of floating point evaluation specified in the C standards.	2021-03-04 23:58:08 -06:00
Stephen Heumann	f19d21365a	Recognize more indirect long instructions in the native code optimizer. These instructions can be generated for indirect accesses to quad values, and the optimization can sometimes make those code sequences more efficient (e.g. avoiding unnecessary reloads of Y).	2021-03-02 19:19:00 -06:00
Stephen Heumann	043124db93	Implement support for doing quad ops without loading operands on stack. This works when both operands are simple loads, such that they can be broken up into operations on their subwords in a standard format. Currently, this is implemented for bitwise binary ops, but it can also be expanded to arithmetic, etc.	2021-02-24 19:44:46 -06:00
Stephen Heumann	8faafcc7c8	Implement 64-bit shifts.	2021-02-12 15:06:15 -06:00
Stephen Heumann	cb97623878	Do not copy CGI.Comments into CGI.pas. This has no functional effect, since it is all comments. It does mean that printed listings of CGI.pas would not contain those comments, but it is easy enough to restore if someone wants such listings. This change should make compilation slightly faster, and it also avoids issues with filetypes when using certain tools (since they cannot infer the filetype of CGI.Comments from its extension).	2021-02-11 18:53:25 -06:00
Stephen Heumann	05868667b2	Implement 64-bit division and remainder, signed and unsigned. These operations rely on new library routines in ORCALib (~CDIV8 and ~UDIV8).	2021-02-05 12:42:48 -06:00
Stephen Heumann	08cf7a0181	Implement 64-bit multiplication support. Signed multiplication uses the existing ~MUL8 routine in SysLib. Unsigned multiplication will use a new ~UMUL8 library routine.	2021-02-04 22:23:59 -06:00
Stephen Heumann	168a06b7bf	Add support for emitting 64-bit constants in statically-initialized data.	2021-02-04 02:17:10 -06:00
Stephen Heumann	793f0a57cc	Initial support for constants with long long types. Currently, the actual values they can have are still constrained to the 32-bit range. Also, there are some bits of functionality (e.g. for initializers) that are not implemented yet.	2021-02-03 23:11:23 -06:00
Stephen Heumann	807a143e51	Implement 64-bit addition and subtraction.	2021-01-30 23:31:18 -06:00
Stephen Heumann	2e44c36c59	Implement unary negation and bitwise complement for 64-bit types.	2021-01-30 13:49:06 -06:00
Stephen Heumann	abb0fa0fc1	Implement bitwise and/or/xor for 64-bit types. This introduces three new intermediate codes for these operations.	2021-01-30 00:25:15 -06:00
Stephen Heumann	085cd7eb1b	Initial code to recognize 'long long' as a type.	2021-01-29 22:27:11 -06:00
Stephen Heumann	ffe6c4e924	Spellcheck comments throughout the code. There are no non-comment changes.	2020-01-29 17:09:52 -06:00
Stephen Heumann	4a7644e0b5	Don't allocate stack space for varargs stack repair unless it's needed. If there are no varargs calls (and nothing else that saves stack positions), then space doesn't need to be allocated for the saved stack position. This can also lead to more efficient prolog/epilog code for small functions.	2018-01-13 20:02:43 -06:00
Stephen Heumann	e7cc513ad4	Add support for inline procedure names as documented in IIgs tech note #103. These are enabled when bit 15 is set in the #pragma debug directive. Support is still needed to ensure these work properly with pre-compiled headers. This patch is from Kelvin Sherlock.	2017-10-21 20:36:21 -05:00
Stephen Heumann	c46cf79c79	Increase the maximum allowed number of local variables from 200 to 220.	2017-10-21 20:36:21 -05:00
Stephen Heumann	ccd653ddb9	Move some more code out of the blank segment to make space for static data.	2017-10-21 20:36:21 -05:00
Stephen Heumann	02de5f4137	Increase the total size of string constants permitted in each function. The size limit is increased from 8000 bytes to 12500 bytes. This was needed to compile some functions with many string constants.	2017-10-21 20:36:21 -05:00
Stephen Heumann	a4bffe65e5	Increase the limit on the number of intermediate code labels in a function from 2400 to 3200. This is necessary to compile some very large functions, such as the main interpreter loop in Git. This consumes about 8K of extra memory for the additional label records.	2017-10-21 20:36:21 -05:00
Stephen Heumann	709f9b3f25	Fix bug where comparing 32-bit values in static arrays or structs against 0 may give wrong results with large memory model. The issue was that 16-bit absolute addressing (in the data bank) was being used to access the data to compare, but with the large memory model the static arrays or structs are not necessarily in the same bank, so absolute long addressing should be used. This was sometimes causing failures in the C4.6.4.1.CC and C4.6.6.1.CC conformance tests in the ORCA/C test suite. The following program often demonstrates the problem (depending on memory layout and contents): #pragma memorymodel 1 #pragma optimize 1 #include <stdio.h> int i; char ch1[32000]; long L1[1]; int main (void) { if (L1 [0] != 0) printf("%li\n", L1[0]); /* shouldn't print */ /* buggy behavior can happen if the bank bytes of these pointers differ */ printf("%p %p\n", &L1[0], &i); }	2017-10-21 20:36:21 -05:00
Stephen Heumann	0df71da4f1	Change Byte -> UByte conversion to use a "Word -> UByte" conversion, rather than introducing a new "Byte -> UByte" conversion. The latter would require more changes to the code generator to understand it, whereas this approach doesn't require any changes. This is arguably less clean, but it matches other places where a byte value is subsequently operated on as a word without an explicit conversion, and the assembly instruction generated is the same.	2017-10-21 20:36:20 -05:00
Stephen Heumann	c28e48a54f	Do an explicit conversion when converting from signed to unsigned byte values. This is needed because the value is held in a 16-bit register, sign-extended. The high 8 bits need to be cleared to convert to an unsigned byte. This fixes the compca06.c test case. Note that this generates inefficient code in the case of loading a signed byte value and then immediately casting it to unsigned (it first sign-extends the value, then masks off the high bits). This should be optimized, but at least the generated code is correct now.	2017-10-21 20:36:20 -05:00
Stephen Heumann	46b6aa389f	Change all text/source files to LF line endings.	2017-10-21 18:40:19 -05:00
mikew50	e72177985e	ORCA/C 2.1.0 source from the Opus ][ CD	2017-10-01 17:47:47 -06:00
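
For reference, the byte-swap pattern named in commit c0727315e0 can be written in portable C roughly as follows. This is only an illustrative sketch: the names swap_bytes and check_swap_bytes are hypothetical and do not come from the ORCA/C sources, and whether a given compiler turns this into a single instruction depends on that compiler.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: swap the two bytes of an unsigned 16-bit value
   using the "(exp << 8) | (exp >> 8)" pattern from the commit message.
   The cast truncates the result back to 16 bits on platforms where
   the operand is promoted to a wider int. */
uint16_t swap_bytes(uint16_t x)
{
    return (uint16_t)((x << 8) | (x >> 8));
}

/* Hypothetical self-check: swapping twice restores the original value. */
void check_swap_bytes(void)
{
    assert(swap_bytes(0x1234) == 0x3412);
    assert(swap_bytes(swap_bytes(0xABCD)) == 0xABCD);
}
```

The operand here is a plain parameter with no side effects, which is the case the commit message says is recognized.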

28 Commits
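
The signed-to-unsigned byte conversion described in commit c28e48a54f can be modeled in portable C as below. This is a sketch under stated assumptions, not the compiler's code: a uint16_t stands in for the 16-bit register, and the function names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical model: a signed byte held sign-extended in a
   16-bit register, as the commit message describes. */
uint16_t sign_extend_byte(int8_t b)
{
    return (uint16_t)(int16_t)b;
}

/* Hypothetical model of the explicit conversion: clear the high
   8 bits so the register holds the unsigned byte value. */
uint16_t to_unsigned_byte(uint16_t reg)
{
    return (uint16_t)(reg & 0x00FF);
}

/* Hypothetical self-check: -1 is 0xFFFF when sign-extended,
   and 0x00FF (255) after the high bits are cleared. */
void check_byte_conversion(void)
{
    assert(sign_extend_byte(-1) == 0xFFFF);
    assert(to_unsigned_byte(sign_extend_byte(-1)) == 0x00FF);
}
```

Without the masking step, a sign-extended negative byte would keep its high bits set when later operated on as a word, which is the incorrect behavior the commit message says it fixes.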