ORCA-C

mirror of https://github.com/byteworksinc/ORCA-C.git synced 2024-11-17 05:06:10 +00:00

Author	SHA1	Message	Date
Stephen Heumann	3893db1346	Make sure #pragma expand is properly applied in all cases. There were various places where the flag for macro expansions was saved, set to false, and then later restored. If #pragma expand was used within those areas, it would not be properly applied. Here is an example showing that problem: void f(void #pragma expand 1 ) {} This could also affect some uses of #pragma expand within precompiled headers, e.g.: #pragma expand 1 #include "a.h" #undef foobar #include "b.h" ... Also, add a note saying that code in precompiled headers will not be expanded. (This has always been the case, but was not clearly documented.)	2022-02-15 20:50:02 -06:00
Stephen Heumann	785a6997de	Record source file changes within a function as part of debug info. This affects functions whose body spans multiple files due to includes, or is treated as doing so due to #line directives. ORCA/C will now generate a COP 6 instruction to record each source file change, allowing debuggers to properly track the flow of execution across files.	2022-02-05 18:32:11 -06:00
Stephen Heumann	e36503508a	Allow more forms of address expressions in static initializers. There were several forms that should be permitted but were not, such as &"str"[1], &"str", &a (where a is an array), and &f (where f is a function). This fixes #15 and also certain other cases illustrated in the following example: char a[10]; int main(void); static char s1 = &"string"[1]; static char s2 = &"string"; static char s3 = &a; static int (f2)(void)=&main;	2022-01-29 21:59:25 -06:00
Stephen Heumann	8eda03436a	Preserve qualifiers when changing float/double/comp parameters to extended. Changing the type is still non-standard, but at least this allows us to detect and report write-to-const errors.	2022-01-17 18:26:28 -06:00
Stephen Heumann	6f0b94bb7c	Allow the pascal qualifier to appear anywhere types are used. This is necessary to allow declarations of pascal-qualified function pointers as members of a structure, among other things. Note that the behavior for "pascal" now differs from that for the standard function specifiers, which have more restrictive rules for where they can be used. This is justified by the fact that the "pascal" qualifier is allowed and meaningful for function pointer types, so it should be able to appear anywhere they can. This fixes #28.	2022-01-13 20:11:43 -06:00
Stephen Heumann	b1bc840ec8	Reverse order of parameters for pascal function pointer types. The parameters of the underlying function type were not being reversed when applying the "pascal" qualifier to a function pointer type. This resulted in the parameters not being in the expected order when a call was made using such a function pointer. This could result in spurious errors in some cases or inappropriate parameter conversions in others. This fixes #75.	2022-01-13 19:38:22 -06:00
Stephen Heumann	3acf5844c2	Save and restore type spec when evaluating expressions in a type name. Failing to do this could allow the type spec to be overwritten if the expression contained another type name within it (e.g. a cast). This could cause the wrong type to be computed, which could lead to incorrect behavior for constructs that use type names, e.g. sizeof. Here is an example program that demonstrated the problem: int main(void) { return sizeof(short[(long)50]); }	2022-01-12 21:53:23 -06:00
Stephen Heumann	3b35a65b1d	Give an error if a function pointer is redefined as a function. This gives an error for code like the following, which was previously allowed: void (*p)(int); void p(int i) {} Note that the opposite order still does not give a compiler error, but does give linker errors. Making sure we give a compiler error for all similar cases would require larger changes, but this patch at least catches some erroneous cases that were previously being allowed.	2022-01-12 18:31:32 -06:00
Stephen Heumann	61a382de0b	Report parameter type errors the same way with and without strict type checks. They were being reported as "type conflict" by default, but as "duplicate symbol" if strict checks were used.	2022-01-12 18:31:04 -06:00
Stephen Heumann	b5b276d0f4	Do not give a spurious error for redeclarations of a pascal function. This could occur with strict type checking on, because the parameter types were compared at a point where they had been reversed for the original declaration but not for the subsequent one. Here is an example that would give an error: #pragma ignore 24 extern pascal void func(int, long); extern pascal void func(int, long);	2022-01-12 18:30:28 -06:00
Stephen Heumann	1fb1762458	Fix bug in initialization of auto arrays of strings. When initializing (e.g.) an array of arrays of char, a string literal would be taken as an initializer for the outer array rather than for an inner array, so not all elements would be initialized properly. This was a bug introduced in commit `222c34a385`. This bug affected the C4.6.4.2.CC test case, and the following reduced version: #include <stdio.h> #include <string.h> int main (void) { char ch2[][20] = {"for all good people", "to come to the aid "}; if (strcmp(ch2[1], "to come to the aid ")) puts("Failed"); }	2021-12-27 08:22:19 -06:00
Stephen Heumann	7ac3fe6424	Allow string constants in initializers for arrays of char. For example, declarations like the following should be accepted: char p[] = {"abc", "def"}; This previously worked, but it was broken by commit `5871820e0c`.	2021-11-12 21:17:37 -06:00
Stephen Heumann	a6359f67e0	Adjust parameters with typedef'd array types to have pointer types. Parameters declared directly with array types were already adjusted to pointer types in commit `5b953e2db0`, but this code is needed for the remaining case where a typedef'd array type is used. With these changes, 'array' parameters are treated for all purposes as really having pointer types, which is what the standards call for. This affects at least their size as reported by sizeof and the debugging information generated for them.	2021-11-07 18:54:27 -06:00
Stephen Heumann	906f9f6312	Get rid of a variable that was not really used for anything.	2021-11-05 22:40:27 -05:00
Stephen Heumann	5b953e2db0	Allow 'static' and type qualifiers in parameter array declarators (C99). This has the side effect of treating most parameters declared as arrays as actually having pointer types. This affects the value returned by sizeof, among other things. The new behavior is correct under the C standards; however, it does not yet apply when using a typedef'd array type.	2021-11-02 22:17:55 -05:00
Stephen Heumann	a20d69a211	Revise variable argument handling to better comply with standards. In the new implementation, variable arguments are not removed until the end of the function. This allows variable argument processing to be restarted, and it prevents the addresses of local variables from changing in the middle of the function. The requirement to turn off stack repair code around varargs functions is also removed. This fixes #58.	2021-10-23 22:36:34 -05:00
Stephen Heumann	772043241c	Force stack checking and repair off for internal calls to ~ZERO. This can make unoptimized initialization code a bit smaller and faster.	2021-10-19 22:17:09 -05:00
Stephen Heumann	daede21819	Fix bug with assembly-language functions that return structs/unions.	2021-10-19 18:12:46 -05:00
Stephen Heumann	f567d60429	Allow bit-fields in unions. All versions of standard C allow this, but ORCA/C previously did not.	2021-10-18 21:48:18 -05:00
Stephen Heumann	692ebaba85	Structs or arrays may not contain structs with a flexible array member. We previously ignored this, but it is a constraint violation under the C standards, so it should be reported as an error. GCC and Clang allow this as an extension, as we were effectively doing previously. We will follow the standards for now, but if there was demand for such an extension in ORCA/C, it could be re-introduced subject to a #pragma ignore flag.	2021-10-17 22:22:42 -05:00
Stephen Heumann	a888206111	Simplify address calculation for auto initializers. This simplifies the code in the compiler, and also generates better code when not using optimization.	2021-10-11 22:10:38 -05:00
Stephen Heumann	5871820e0c	Support UTF-8/16/32 string literals and character constants (C11). These have u8, u, or U prefixes, respectively. The types char16_t and char32_t (defined in <uchar.h>) are used for UTF-16 and UTF-32 code points.	2021-10-11 20:54:37 -05:00
Stephen Heumann	222c34a385	Fix bug in initialization using string literals with embedded nulls. When using such a string literal to initialize an array with automatic storage duration, the bytes after the first null would be set to 0, rather than the values from the string literal. Here is an example program showing the problem: #include <stdio.h> int main(void) { char s[] = "a\0b"; puts(s+2); }	2021-10-11 19:55:09 -05:00
Stephen Heumann	7ae830ae7e	Initial support for compound literals. Compound literals outside of functions should work at this point. Compound literals inside of functions are not fully implemented, so they are disabled for now. (There is some code to support them, but the code to actually initialize them at the appropriate time is not written yet.)	2021-09-16 18:34:55 -05:00
Stephen Heumann	894baac94f	Give an error if assigning to a whole struct or union that has a const member. Such structs or unions are not modifiable lvalues, so they cannot be assigned to as a whole. Any non-const fields can be assigned to individually.	2021-09-11 18:12:58 -05:00
Stephen Heumann	2614f10ced	Make the actual return type of a function be the unqualified version of the type specified. This is a change that was introduced in C17. However, it actually keeps things closer to ORCA/C's historical behavior, which generally ignored qualifiers in return types.	2021-09-10 18:09:50 -05:00
Stephen Heumann	2f7e71cd24	Treat the fields of const structs as const-qualified. This causes an error to be produced when trying to assign to these fields, which was being allowed before. It is also necessary for correct behavior of _Generic in some cases.	2021-09-09 18:39:19 -05:00
Stephen Heumann	00cc05a6a1	Move type qualifiers from array types to their element types. This behavior is specified by the C standards. It can come up when declaring an array using a typedef'd array type and a qualifier. This is necessary for correct behavior of _Generic, as well as to give an error if code tries to write to const arrays declared in this way. Here is an example showing these issues: #define f(e) _Generic((e), int : 1, const int :2, default: 0) int main(void) { typedef int A[2][3]; const A a = {{4, 5, 6}, {7, 8, 9}}; _Static_assert(f(&a[0][0]) == 2, "qualifier error"); // OK a[1][1] = 42; // error }	2021-08-30 18:30:05 -05:00
Stephen Heumann	b16210a50b	Record volatile and restrict qualifiers in types. These are needed to correctly distinguish pointer types in _Generic. They should also be used for type compatibility checks in other contexts, but currently are not. This also fixes a couple small problems related to type qualifiers: restrict was not allowed to appear after in type-names volatile status was not properly recorded in sym files Here is an example of using _Generic to distinguish pointer types based on the qualifiers of the pointed-to type: #include <stdio.h> #define f(e) _Generic((e),\ int restrict : 1,\ int volatile const : 2,\ int : 3,\ default: 0) #define g(e) _Generic((e),\ int : 1,\ const int : 2,\ volatile int : 3,\ default: 0) int main(void) { int * restrict * p1; int * volatile const * p2; int * const * p3; // should print "1 2 0 1" printf("%i %i %i %i\n", f(p1), f(p2), f(p3), f((int * restrict )0)); int q1; const int q2; volatile int q3; const volatile int q4; // should print "1 2 3 0" printf("%i %i %i %i\n", g(q1), g(q2), g(q3), g(q4)); } Here is an example of a problem resulting from volatile not being recorded in sym files (if a sym file was present, the read of x was lifted out of the loop): #pragma optimize -1 static volatile int x; #include <stdio.h> int main(void) { int y; for (unsigned i = 0; i < 100; i++) { y = x2 + 7; } }	2021-08-30 18:19:58 -05:00
Stephen Heumann	fbdbad1f45	Report an error for certain large unsigned enumeration constants. Enumeration constants must have values representable as an int (i.e. 16-bit signed values, in ORCA/C), but errors were not being reported if code tried to use the values 0xFFFF8000 to 0xFFFFFFFF. This problem could also affect certain larger values of type unsigned long long. The issue stemmed from not properly accounting for whether the constant expression had a signed or unsigned type. This sample code demonstrated the problem: enum E { a = 0xFFFFFFFF, b = 0xFFFF8000, y = 0x7FFFFFFFFFFFFFFFull, z = 0x8000000000000000 };	2021-07-07 20:06:05 -05:00
Stephen Heumann	979852be3c	Use the right types for constants cast to character types. These were previously treated as having type int. This resulted in incorrect results from sizeof, and would also be a problem for _Generic if it was implemented. Note that this creates a token kind of "charconst", but this is not the kind for character constants in the source code. Those have type int, so their kind is intconst. The new kinds of "tokens" are created only through casts of constant expressions.	2021-03-07 13:38:21 -06:00
Stephen Heumann	f9f79983f8	Implement the standard pragmas, in particular FENV_ACCESS. The FENV_ACCESS pragma is now implemented. It causes floating-point operations to be evaluated at run time to the maximum extent possible, so that they can affect and be affected by the floating-point environment. It also disables optimizations that might evaluate floating-point operations at compile time or move them around calls to the <fenv.h> functions. The FP_CONTRACT and CX_LIMITED_RANGE pragmas are also recognized, but they have no effect. (FP_CONTRACT relates to "contracting" floating-point expressions in a way that ORCA/C does not do, and CX_LIMITED_RANGE relates to complex arithmetic, which ORCA/C does not support.)	2021-03-06 00:57:13 -06:00
Stephen Heumann	77d66ab699	Support the predefined identifier __func__ (from C99). This gives the name of the current function, as if the following definition appeared at the beginning of the function body: static const char __func__[] = "function-name";	2021-03-02 22:28:28 -06:00
Stephen Heumann	cf463ff155	Support switch statements using long long expressions.	2021-02-17 19:41:46 -06:00
Stephen Heumann	32ae4c2e17	Allow unsigned constants in "address+constant" constant expressions. This affected initializers like the following: static int a[50]; static int *ip = &a[0] + 2U; Also, introduce some basic range checks for calculations that are obviously outside the 65816's address space.	2021-02-13 15:36:54 -06:00
Stephen Heumann	47fdd9e370	Implement support for functions returning (unsigned) long long. These use a new calling convention specific to functions returning these types. When such functions are called, the caller must set the X register to the address within bank 0 that the return value is to be saved to. The function is then responsible for saving it there before returning to the caller. Currently, the calling code always makes space for the return value on the stack and sets X to point to that. (As an optimization, it would be possible to have the return value written directly to a local variable on the direct page, with no change needed to the function being called, but that has not yet been implemented.)	2021-02-05 23:25:46 -06:00
Stephen Heumann	2408c9602c	Make expressionValue a saturating approximation of the true value for long long expressions. This gives sensible behavior for several things in the parser, e.g. where all negative values or all very large values should be disallowed.	2021-02-04 12:44:44 -06:00
Stephen Heumann	168a06b7bf	Add support for emitting 64-bit constants in statically-initialized data.	2021-02-04 02:17:10 -06:00
Stephen Heumann	793f0a57cc	Initial support for constants with long long types. Currently, the actual values they can have are still constrained to the 32-bit range. Also, there are some bits of functionality (e.g. for initializers) that are not implemented yet.	2021-02-03 23:11:23 -06:00
Stephen Heumann	085cd7eb1b	Initial code to recognize 'long long' as a type.	2021-01-29 22:27:11 -06:00
Stephen Heumann	52132db18a	Implement the _Bool type from C99.	2021-01-25 21:22:58 -06:00
Stephen Heumann	5014fb97f9	Make 32-bit int (with #pragma unix 1) a distinct type from long.	2021-01-24 13:31:12 -06:00
Stephen Heumann	c0b2b44cad	Add a new representation of C basic types and use it for type checking. This allows us to distinguish int from short, etc.	2020-03-01 15:00:02 -06:00
Stephen Heumann	c84c4d9c5c	Check for non-void functions that execute to the end without returning a value. This generalizes the heuristic approach for checking whether _Noreturn functions could execute to the end of the function, extending it to apply to any function with a non-void return type. These checks use the same #pragma lint bit but give different messages depending on the situation.	2020-02-02 13:50:15 -06:00
Stephen Heumann	bc951b6735	Make lint report some more cases where noreturn functions may return. This uses a heuristic that may produce both false positives and false negatives, but any false positives should reflect extraneous code at the end of the function that is not actually reachable.	2020-01-30 17:35:15 -06:00
Stephen Heumann	a4abc5e421	Recognize enum type specifiers as such. They were erroneously triggering the "type specifier missing" lint warning.	2020-01-29 21:15:33 -06:00
Stephen Heumann	80c513bbf2	Add a lint flag for checking if _Noreturn functions may return. Currently, this only flags return statements, not cases where they may execute to the end of the function. (Whether the function will actually return is not decidable in general, although it may be in special cases).	2020-01-29 19:26:45 -06:00
Stephen Heumann	4fd642abb4	Add lint check for return with no value in a non-void function. This is disallowed in C99 and later.	2020-01-29 18:50:45 -06:00
Stephen Heumann	a9f5fb13d8	Introduce a new #pragma lint bit for syntax that C99 disallows. This currently checks for: Calls to undefined functions (same as bit 0) Parameters not declared in K&R-style function definitions *Declarations or type names with no type specifiers (includes but is broader than the condition checked by bit 1)	2020-01-29 18:33:19 -06:00
Stephen Heumann	ffe6c4e924	Spellcheck comments throughout the code. There are no non-comment changes.	2020-01-29 17:09:52 -06:00

1 2 3

115 Commits