ORCA-C

Commit Graph

Author	SHA1	Message	Date
Stephen Heumann	26e1bfc253	Allow generation of digraphs via ## token merging.	2022-02-20 18:57:03 -06:00
Stephen Heumann	2b062a8392	Make ## token merging on character constants give an error. This ultimately should be supported, but that will be more work. For now, we just set the string representation to '?', which will usually give an error when merged. (Previously, whatever was at memory location 0 would be treated as the string representation of the token. Frequently this would just be an empty string, leading to no error but incorrect results.)	2022-02-20 16:19:00 -06:00
Stephen Heumann	da978932bf	Save string representation of macros defined on command line. This is necessary for correct operation of the # and ## preprocessor operators on the tokens from such macros. Integers with a sign character still have the non-standard property of being treated as a single token, so they cannot be used with ##, but in most cases such uses will now give an error.	2022-02-20 15:35:49 -06:00
Stephen Heumann	aabbadb34b	Terminate header generation if #warning is encountered. This is necessary to ensure that the warning message is printed on subsequent compiles.	2022-02-19 14:06:15 -06:00
Stephen Heumann	a73dce103b	Terminate PCH generation if an #append is encountered. If the appended file was another C file and that file contained an #include, this would create an invalid record in the sym file. It would record memory from the buffer holding the original file to the buffer holding the appended file. In general, these are not contiguous, so superfluous data from other parts of memory would be included in the sym file. This record would normally just be treated as invalid on subsequent compiles, but it could theoretically be very large (depending on the memory layout) and might contain sensitive data from other parts of memory.	2022-02-19 14:05:07 -06:00
Stephen Heumann	f2d6625300	Save #pragma path directives in sym files. They were not being saved, which would result in ORCA/C not searching the proper paths when looking for an include file after the sym file had ended. Here is an example showing the problem: #pragma path "include" #include <stdio.h> int k = 50; #include "n.h" /* will not find include:n.h */	2022-02-15 21:27:35 -06:00
Stephen Heumann	3893db1346	Make sure #pragma expand is properly applied in all cases. There were various places where the flag for macro expansions was saved, set to false, and then later restored. If #pragma expand was used within those areas, it would not be properly applied. Here is an example showing that problem: void f(void #pragma expand 1 ) {} This could also affect some uses of #pragma expand within precompiled headers, e.g.: #pragma expand 1 #include "a.h" #undef foobar #include "b.h" ... Also, add a note saying that code in precompiled headers will not be expanded. (This has always been the case, but was not clearly documented.)	2022-02-15 20:50:02 -06:00
Stephen Heumann	c96cf4f1dd	Do not save predefined and command-line macros in the sym file. Previously, these might or might not be saved (based on the contents of uninitialized memory), but in many cases they were. This was unnecessary, since these macros are automatically defined when the scanner is initialized. Reading them from the sym file could result in duplicate copies of them in the macro list. This is usually harmless, but might result in #undefs of macros from the command line not working properly.	2022-02-13 20:17:33 -06:00
Stephen Heumann	b493dcb1da	Add lint check to require whitespace after names of object-like macros. This is a requirement added in C99, so it is added as part of the C99 syntax checks. This affects definitions like: #define foo;	2022-02-13 19:44:56 -06:00
Stephen Heumann	c169c2bf92	Fully prohibit redefinition of predefined macros. Code like the following was previously being allowed: #define __STDC__ /* no tokens */	2022-02-13 18:10:45 -06:00
Stephen Heumann	5d7c002819	Fix bug causing some #undefs to be ignored when using a sym file. This would occur if the macro had already been saved in the sym file and the #undef occurred before a subsequent #include that was also recorded in the sym file. The solution is simply to terminate sym file generation if an #undef of an already-saved macro is encountered. Here is an example showing the problem: test.c: #include "test1.h" #undef x #include "test2.h" int main(void) { #ifdef x return x; #else return y; #endif } test1.h: #define x 27 test2.h: #define y 6	2022-02-13 16:33:43 -06:00
Stephen Heumann	b231782442	Add option to use a custom pre-include file. This is a file that will be included before the source file is processed. If specified, it is used instead of the default .h file.	2022-02-12 21:36:39 -06:00
Stephen Heumann	bd811559d6	Fix issues with keep names in sym files. There were a couple issues that could occur with #pragma keep and sym files: If a source file used #pragma keep but it was overridden by KEEP= on the command line or {KeepName} in the shell, then the overriding keep name would be saved to the sym file. It would therefore be applied to subsequent compilations even if it was no longer specified in the command line or shell variable. If a source file used #pragma keep, that keep name would be recorded in the sym file. On subsequent compilations, it would always be used, overriding any keep name specified by the command line or shell, contrary to the usual rule that the name on the command line takes priority. With this patch, the keep name recorded in the sym file (if any) should always be the one specified by #pragma keep, but it can be overridden as usual.	2022-02-06 21:49:08 -06:00
Stephen Heumann	5f03dee66a	Allow negated long long constants in cc= defines. These are still treated as one token, like other negated numbers specified in cc=(-d...).	2022-02-06 15:33:42 -06:00
Stephen Heumann	efb363a04d	Update a comment.	2022-02-06 15:08:04 -06:00
Stephen Heumann	7d4f923470	Improve error handling for cc= options on command line.	2022-02-06 14:24:22 -06:00
Stephen Heumann	785a6997de	Record source file changes within a function as part of debug info. This affects functions whose body spans multiple files due to includes, or is treated as doing so due to #line directives. ORCA/C will now generate a COP 6 instruction to record each source file change, allowing debuggers to properly track the flow of execution across files.	2022-02-05 18:32:11 -06:00
Stephen Heumann	7322428e1d	Add an option to print file names in error messages. This can help identify if an error is in the main source file or an include file.	2022-02-04 22:10:50 -06:00
Stephen Heumann	4cb2106ee4	Change the name of the current source file on an #include or #append. This causes __FILE__ to give the name of an include file if used within it, which seems to be what the standards intend (and what other compilers do). It also affects the file name recorded in debugging information for functions declared in an include file. (Note that occ will generate a #line directive before an #append, essentially to work around the problem this patch fixes. After the patch, such a #line directive is effectively ignored. This should be OK, although it may result in a difference in whether a full or partial pathname is used for __FILE__ and in debug info.)	2022-02-03 22:22:33 -06:00
Stephen Heumann	dce9d36edd	Comment out unused error messages and update docs about errors.	2022-02-01 22:16:57 -06:00
Stephen Heumann	b43036409e	Add a new optimize flag for FP math optimizations that break IEEE rules. There were several existing optimizations that could change behavior in ways that violated the IEEE standard with regard to infinities, NaNs, or signed zeros. They are now gated behind a new #pragma optimize flag. This change allows intermediate code peephole optimization and common subexpression elimination to be used while maintaining IEEE conformance, but also keeps the rule-breaking optimizations available if desired. See section F.9.2 of recent C standards for a discussion of how these optimizations violate IEEE rules.	2021-11-29 20:31:15 -06:00
Stephen Heumann	73d194c12f	Allow string constants with up to 32760 bytes. This allows the length of the string plus a few extra bytes used internally to be represented by a 16-bit integer. Since the size limit for memory allocations has been raised, there is no good reason to impose a shorter limit on strings. Note that C99 and later specify a minimum translation limit for string constants of at least 4095 characters.	2021-10-24 21:43:43 -05:00
Stephen Heumann	f567d60429	Allow bit-fields in unions. All versions of standard C allow this, but ORCA/C previously did not.	2021-10-18 21:48:18 -05:00
Stephen Heumann	692ebaba85	Structs or arrays may not contain structs with a flexible array member. We previously ignored this, but it is a constraint violation under the C standards, so it should be reported as an error. GCC and Clang allow this as an extension, as we were effectively doing previously. We will follow the standards for now, but if there was demand for such an extension in ORCA/C, it could be re-introduced subject to a #pragma ignore flag.	2021-10-17 22:22:42 -05:00
Stephen Heumann	ad5063a9a3	Support hexadecimal floating-point constants.	2021-10-17 18:19:29 -05:00
Stephen Heumann	5871820e0c	Support UTF-8/16/32 string literals and character constants (C11). These have u8, u, or U prefixes, respectively. The types char16_t and char32_t (defined in <uchar.h>) are used for UTF-16 and UTF-32 code points.	2021-10-11 20:54:37 -05:00
Stephen Heumann	b076f85149	Avoid possible stack overflow when merging adjacent string literals. The code for this was recursive and could overflow if there were several dozen consecutive string literals. It has been changed to only use one level of recursion, avoiding the problem.	2021-10-11 18:55:10 -05:00
Stephen Heumann	7ae830ae7e	Initial support for compound literals. Compound literals outside of functions should work at this point. Compound literals inside of functions are not fully implemented, so they are disabled for now. (There is some code to support them, but the code to actually initialize them at the appropriate time is not written yet.)	2021-09-16 18:34:55 -05:00
Stephen Heumann	a8682e28d3	Give an error for pointer assignments that discard qualifiers. This is controlled by #pragma ignore bit 5, which is now a more general "loose type checks" bit.	2021-09-10 17:58:20 -05:00
Stephen Heumann	9c04b94093	Allow invalid escape sequences and UCN-like sequences in skipped code. The standard wording is not always clear on these cases, but I think at least some of them should be allowed and others may be undefined behavior (which we can choose to allow). At any rate, this allows non-standard escape sequences targeted at other compilers to appear in skipped-over code. There probably ought to be similar handling for #defines that are never expanded, but that would require more code changes.	2021-09-06 20:37:17 -05:00
Stephen Heumann	ea461dba7b	Give clearer error messages for errors in the command line.	2021-08-31 19:23:10 -05:00
Stephen Heumann	b8c332deeb	Treat invalid escape sequences as errors. This applies to octal and hexadecimal sequences with out-of-range values, and also to unrecognized escape characters. The C standards say both of these cases are syntax/constraint violations requiring a diagnostic.	2021-08-31 18:36:06 -05:00
Stephen Heumann	2b9d332580	Give an appropriate error for an illegal operator in a constant expression. This was being reported as an "illegal type cast".	2021-08-22 20:33:34 -05:00
Stephen Heumann	5faf219eff	Update comments about pragma flags.	2021-08-22 17:35:16 -05:00
Stephen Heumann	03f267ac02	Write out long long constants when using #pragma expand.	2021-03-11 23:20:14 -06:00
Stephen Heumann	9cd2807bc8	Do not leave behind detritus from the spinner when using #pragma expand. This could happen with the following example (under ORCA/Shell with output to the screen only): #include <stdio.h> #pragma expand 1 int main(void) { }	2021-03-11 19:01:38 -06:00
Stephen Heumann	2de8ac993e	Fix to make _Generic handle struct types properly. Also, use an existing error message instead of creating a new equivalent one.	2021-03-07 23:35:12 -06:00
Stephen Heumann	bccd86a627	Implement _Generic expressions (from C11). Note that this code relies on CompTypes for type compatibility testing, and it has slightly non-standard behavior in some cases.	2021-03-07 21:59:37 -06:00
Stephen Heumann	979852be3c	Use the right types for constants cast to character types. These were previously treated as having type int. This resulted in incorrect results from sizeof, and would also be a problem for _Generic if it was implemented. Note that this creates a token kind of "charconst", but this is not the kind for character constants in the source code. Those have type int, so their kind is intconst. The new kinds of "tokens" are created only through casts of constant expressions.	2021-03-07 13:38:21 -06:00
Stephen Heumann	8f8e7f12e2	Distinguish the different types of floating-point constants. As with expressions, the type does not actually limit the precision and range of values represented.	2021-03-07 00:48:51 -06:00
Stephen Heumann	f9f79983f8	Implement the standard pragmas, in particular FENV_ACCESS. The FENV_ACCESS pragma is now implemented. It causes floating-point operations to be evaluated at run time to the maximum extent possible, so that they can affect and be affected by the floating-point environment. It also disables optimizations that might evaluate floating-point operations at compile time or move them around calls to the <fenv.h> functions. The FP_CONTRACT and CX_LIMITED_RANGE pragmas are also recognized, but they have no effect. (FP_CONTRACT relates to "contracting" floating-point expressions in a way that ORCA/C does not do, and CX_LIMITED_RANGE relates to complex arithmetic, which ORCA/C does not support.)	2021-03-06 00:57:13 -06:00
Stephen Heumann	4ad7a65de6	Process floating-point values within the compiler using the extended type. This means that floating-point constants can now have the range and precision of the extended type (aka long double), and floating-point constant expressions evaluated within the compiler also have that same range and precision (matching expressions evaluated at run time). This new behavior is intended to match the behavior specified in the C99 and later standards for FLT_EVAL_METHOD 2. This fixes the previous problem where long double constants and constant expressions of type long double were not represented and evaluated with the full range and precision that they should be. It also gives extra range and precision to constants and constant expressions of type double or float. This may have pluses and minuses, but at any rate it is consistent with the existing behavior for expressions evaluated at run time, and with one of the possible models of floating point evaluation specified in the C standards.	2021-03-04 23:58:08 -06:00
Stephen Heumann	4020098dd6	Evaluate constant expressions with long long and floating operands. Note that we currently defer evaluation of such expressions to run time if the long long value cannot be represented exactly in a double, because statically-evaluated floating point expressions use the double format rather than the extended (long double) format used at run time.	2021-02-21 18:43:53 -06:00
Stephen Heumann	6bb91d20e5	Add the predefined macro __ORCAC_HAS_LONG_LONG__. This allows headers or other code to test for the presence of this feature.	2021-02-17 14:41:09 -06:00
Stephen Heumann	b4604e079e	Do preprocessor arithmetic in intmax_t/uintmax_t (aka long long types). This is what C99 and later require.	2021-02-17 00:04:20 -06:00
Stephen Heumann	2e29390e8e	Support 64-bit decimal constants in code.	2021-02-15 12:28:30 -06:00
Stephen Heumann	5e5434987b	Give an error when trying to evaluate constant expressions with long long operands.	2021-02-04 14:56:15 -06:00
Stephen Heumann	c37fae0f3b	Add most of the infrastructure to support 64-bit decimal constants. Right now, decimal constants can have long long types based on their suffix, but they are still limited to a maximum value of 2^32-1. This also implements the C99 change where decimal constants without a u suffix always have signed types. Thus, decimal constants of 2^31 and up now have type long long, even if their values could be represented in the type unsigned long.	2021-02-04 00:22:56 -06:00
Stephen Heumann	058c0565c6	Support 64-bit integer constants in hex/octal/binary formats. 64-bit decimal constants are not supported yet.	2021-02-04 00:02:44 -06:00
Stephen Heumann	793f0a57cc	Initial support for constants with long long types. Currently, the actual values they can have are still constrained to the 32-bit range. Also, there are some bits of functionality (e.g. for initializers) that are not implemented yet.	2021-02-03 23:11:23 -06:00
Stephen Heumann	714b417261	Merge branch 'master' into longlong	2021-02-03 21:20:37 -06:00
Stephen Heumann	4a95dbc597	Give an error if you try to define a macro to + or - on the command line. This affects command lines like: cmpl myprog.c cc=(-da=+) ... Previously, this would be accepted, but a was actually defined to 0 rather than +. Now, this gives an error, consistent with other tokens that are not supported in such definitions on the command line. (Perhaps we should support definitions using any tokens, but that would require bigger code changes.) This also cleans up some related code to avoid possible null-pointer dereferences.	2021-02-03 21:06:58 -06:00
Stephen Heumann	1b9ee39de7	Disallow duplicate suffixes on numeric constants (e.g. "123ulu").	2021-02-02 18:28:49 -06:00
Stephen Heumann	8ac887f4dc	Hexadecimal/octal constants 0x80000000+ should have type unsigned long. They previously had type signed long (with negative values).	2021-02-02 18:26:31 -06:00
Stephen Heumann	085cd7eb1b	Initial code to recognize 'long long' as a type.	2021-01-29 22:27:11 -06:00
Stephen Heumann	f0a3808c18	Add a new #pragma ignore option to treat char and unsigned char as compatible. This is contrary to the C standards, but ORCA/C historically permitted it (as do some other compilers), and I think there is a fair amount of existing code that relies on it.	2020-05-22 17:11:13 -05:00
Stephen Heumann	5d64436e6e	Implement __STDC_HOSTED__ macro (from C99). This is normally 1 (indicating a hosted implementation, where the full standard library is available and the program starts by executing main()), but it is 0 if one of the pragmas for special types of programs with different entry points has been used.	2020-03-07 15:51:29 -06:00
Stephen Heumann	a62cbe531a	Implement __STDC_NO_...__ macros as specified by C11. These indicate that various optional features of the C standard are not supported.	2020-03-06 23:29:54 -06:00
Stephen Heumann	32614abfca	Allow '/*' or '//' in character constants. These should not start a comment.	2020-02-04 18:42:55 -06:00
Stephen Heumann	c84c4d9c5c	Check for non-void functions that execute to the end without returning a value. This generalizes the heuristic approach for checking whether _Noreturn functions could execute to the end of the function, extending it to apply to any function with a non-void return type. These checks use the same #pragma lint bit but give different messages depending on the situation.	2020-02-02 13:50:15 -06:00
Stephen Heumann	77dcfdf3ee	Implement support for macros with variable arguments (C99).	2020-01-31 20:07:10 -06:00
Stephen Heumann	bc951b6735	Make lint report some more cases where noreturn functions may return. This uses a heuristic that may produce both false positives and false negatives, but any false positives should reflect extraneous code at the end of the function that is not actually reachable.	2020-01-30 17:35:15 -06:00
Stephen Heumann	76eb476809	Address some issues with stringization of macro arguments. We now insert spaces corresponding to whitespace between tokens, and string tokens are enclosed in quotes. There are still issues with (at least) escape sequences in strings and comments between tokens.	2020-01-30 12:48:16 -06:00
Stephen Heumann	80c513bbf2	Add a lint flag for checking if _Noreturn functions may return. Currently, this only flags return statements, not cases where they may execute to the end of the function. (Whether the function will actually return is not decidable in general, although it may be in special cases).	2020-01-29 19:26:45 -06:00
Stephen Heumann	4fd642abb4	Add lint check for return with no value in a non-void function. This is disallowed in C99 and later.	2020-01-29 18:50:45 -06:00
Stephen Heumann	a9f5fb13d8	Introduce a new #pragma lint bit for syntax that C99 disallows. This currently checks for: Calls to undefined functions (same as bit 0) Parameters not declared in K&R-style function definitions *Declarations or type names with no type specifiers (includes but is broader than the condition checked by bit 1)	2020-01-29 18:33:19 -06:00
Stephen Heumann	ffe6c4e924	Spellcheck comments throughout the code. There are no non-comment changes.	2020-01-29 17:09:52 -06:00
Stephen Heumann	d60104cc47	Tweak handling of lint warnings. If there were a warning and an error on the same line, and errors were treated as terminal, the warning could sometimes be reported as an error.	2020-01-29 12:16:17 -06:00
Stephen Heumann	f5cd1e3e3a	Recognize designated initializers enough to give an error and skip them. Previously, the designated initializer syntax could confuse the parser enough to cause null pointer dereferences. This avoids that, and also gives a more meaningful error message to the user.	2020-01-28 12:48:09 -06:00
Stephen Heumann	c514c109ab	Allow for function-like macros taking no parameters. This was broken by commit `06a3719304`.	2020-01-25 19:44:29 -06:00
Stephen Heumann	fe6c410271	Allow #pragma lint messages to optionally be treated as warnings. In the #pragma lint line, the integer indicating the checks to perform can now optionally be followed by a semicolon and another integer. If these are present and the second integer is 0, then the lint checks will be performed, but will be treated as warnings rather than errors, so that they do not cause compilation to fail.	2020-01-25 11:29:12 -06:00
Stephen Heumann	d8097e6b31	Do not accept %:%: digraph in places where ## would not be accepted. This could happen in obscure cases like the following (outside a macro): for(int b;;-%:%:- b) ;	2020-01-21 07:21:58 -06:00
Stephen Heumann	06a3719304	Allow for empty macro arguments, as specified by C99 and later. These were previously allowed in some cases, but not as the last argument to a macro. Also, stringization and concatenation of them did not behave according to the standards.	2020-01-20 19:49:22 -06:00
Stephen Heumann	656868a095	Implement support for universal character names in identifiers.	2020-01-20 17:22:06 -06:00
Stephen Heumann	9862500dee	Give an error if a parameter in a function definition has an incomplete type. In combination with earlier patches, this fixes #53. Also, if the lint flag requiring explicit function types is set, then also require that K&R-style parameters be explicitly declared with types, rather than not being declared and defaulting to int. (This is a requirement in C99 and later.)	2020-01-20 12:43:01 -06:00
Stephen Heumann	d24dacf01a	Add initial support for universal character names. This currently only works in character constants or strings, not identifiers.	2020-01-19 23:59:54 -06:00
Stephen Heumann	6e89dc5883	Give a basic error message for use of _Generic.	2020-01-19 18:03:21 -06:00
Stephen Heumann	dd92585116	Give errors for most illegal uses of "restrict".	2020-01-19 17:31:20 -06:00
Stephen Heumann	49dea49cb8	Detect and give errors for various illegal uses of _Alignas.	2020-01-19 17:06:01 -06:00
Stephen Heumann	a130e79929	Prohibit _Noreturn specifier on non-functions.	2020-01-19 14:57:28 -06:00
Stephen Heumann	b4232fd4ea	Flag more appropriate errors about unexpected tokens in type names. Previously, these would report "identifier expected"; now they correctly say "')' expected". This introduces a new UnexpectedTokenError procedure that can be used more generally for cases where the expected token may differ based on context.	2020-01-18 16:43:25 -06:00
Stephen Heumann	df029ce06f	Handle storage class specifiers in DeclarationSpecifiers. _Thread_local is recognized but gives a "not supported" error. It could arguably be 'supported' trivially by saying the execution of an ORCA/C program is just one thread and so no special handling is needed, but that likely isn't what someone using it would expect. There would be a possible issue if a "static" or "typedef" storage class specifier occurred after a type specifier that required memory to be allocated for it, because that memory conceptually might be in the local pool, but static objects are processed at the end of the translation unit, so their types need to stick around. In practice, this should not occur, because the local pool isn't currently used for much (in particular, not for statements or declarations in the body of a function). We give an error in case this somehow might occur. In combination with preceding commits, this fixes #14. Declaration specifiers can now appear in any order, as required by the C standards.	2020-01-18 14:52:27 -06:00
Stephen Heumann	8341f71ffc	Initial phase of support for new C99/C11 type syntax. _Bool, _Complex, _Imaginary, _Atomic, restrict, and _Alignas are now recognized in types, but all except restrict and _Alignas will give an error saying they are not supported. This also introduces uniform definitions of the syntactic classes of tokens that can be used in declaration specifiers and related constructs (currently used in some places but not yet in others).	2020-01-12 15:43:30 -06:00
Stephen Heumann	428c991895	Rewrite type specifier parsing. Type specifiers and type qualifiers can now appear in any order, as specified by the C standards. However, storage class specifiers and function specifiers still cannot be freely mixed with them.	2020-01-07 20:26:56 -06:00
Stephen Heumann	3121a465f1	Implement the _Alignof operator (from C11). In ORCA/C, the alignment of all object types is 1.	2020-01-06 20:17:29 -06:00
Stephen Heumann	9036a98e1c	Implement support for digraphs. Specifically, the following six punctuator tokens are now supported: <: :> <% %> %: %:%: These behave the same as the existing tokens [, ], {, }, #, and ## (respectively), apart from their spelling. This can be useful when the full ASCII character set cannot easily be displayed or input (e.g. on the IIgs text screen with certain language settings).	2020-01-04 21:49:50 -06:00
Stephen Heumann	6f2eb301e5	Implement C11 _Static_assert mechanism. This allows code to contain static assertions (checked at compile time).	2020-01-04 18:16:29 -06:00
Stephen Heumann	0184e3db7b	Recognize the new keywords from C99 and C11 as such. Specifically, the following will now be tokenized as keywords: _Alignas _Alignof _Atomic _Bool _Complex _Generic _Imaginary _Noreturn _Static_assert _Thread_local restrict ('inline' was also added as a standard keyword in C99, but ORCA/C already treated it as such.) The parser currently has no support for any of these keywords, so for now errors will still be generated if they are used, but this is a first step toward adding support for them.	2020-01-03 22:48:53 -06:00
Stephen Heumann	ae6de310c7	Avoid storing stale values of __DATE__ or __TIME__ in sym files. This could happen in some very obscure cases like using these macros for the names of segments or include files. The fix is to just terminate precompiled header generation if they are encountered.	2019-12-24 15:58:12 -06:00
Stephen Heumann	095807517b	Fix bug leading to spurious errors in some cases when a sym file is present. The issue was that invalid sym files could be generated if an #include is encountered within an #if or #ifdef block in the main source file. The fix (for now) is to simply terminate precompiled header generation if such an #include is encountered. Fixes #2.	2019-12-24 15:45:32 -06:00
Stephen Heumann	60484d6f69	Fix for including system headers via macros. This makes something like the following work: #define STDIO_H <stdio.h> #include STDIO_H It didn't previously, because workString would be overwritten by NextToken. The effect in this case was that it would erroneously try to include the header <hh>, rather than <stdio.h>. Detected based on a couple programs from FizzBuzz-C.	2018-09-13 21:59:46 -05:00
Stephen Heumann	95f5ec9c13	Don't print a whole bunch of spaces for an error message if the column number is 0. This could happen, e.g., for a "'}' expected" error at end-of-file. It occurred because the 0..maxint type being used caused the Pascal compiler to use unsigned comparisons, which were inappropriate here.	2018-09-10 21:55:02 -05:00
Stephen Heumann	15b1c88d44	Give accurate error message if a numeric constant is too long. Previously, "integer overflow" was reported in this case, even for floating constants.	2018-09-08 14:40:06 -05:00
Stephen Heumann	f6381b7523	Indicate errors at correct positions when the source line contains tabs. Previously, the error markers would generally be misaligned in this case, because a tab would expand to no spaces (in ORCA/Shell) or multiple spaces (in most other environments), but the error-printing code would use a single space to try to line up with it. The solution adopted is just to print tabs in the error lines at the positions where they occur in the source lines. The actual amount of space displayed will depend on the console being used, but in any case it should line up correctly with the source line.	2018-09-07 17:48:19 -05:00
Stephen Heumann	d33ac61af3	Fix bug where exit-to-editor may use wrong position in certain cases. This could happen in certain situations where an error is detected at the end of a line (for example with "cannot redefine a macro" errors). Fixes #40.	2018-09-06 23:55:25 -05:00
Stephen Heumann	dc1b0aa29f	Add lint flag to check for several forms of undefined behavior in computations. This adds lint bit 5 (a value of 32), which currently enables checking for the following conditions: Integer overflow from arithmetic in constant expressions (currently only of type int). Invalid constant shift counts (negative, or >= the width of the type) *Division by (constant) zero. These (mainly the first two) can be indicative of code that was designed for larger type sizes and needs changes to support 16-bit int.	2018-09-05 23:48:35 -05:00
Stephen Heumann	55dbc718c1	Small format checker adjustments. Format checking for "%p" is improved: in the case of scanf, the corresponding argument must be a pointer to a pointer.	2018-09-02 15:12:52 -05:00
Stephen Heumann	69f086367c	Adjust how messages from the printf/scanf format checker are displayed. Mainly, this causes the messages from the format checker to be displayed after the relevant line is printed, along with any other error messages. The wording and formatting of some of the messages is also slightly adjusted, but there should be no substantive change in what is warned about.	2018-09-01 19:59:52 -05:00
Stephen Heumann	9ff3407c60	Avoid producing invalid string literals in #pragma expand output. Previously, the characters ", /, and ? within string literals were not escaped in #pragma expand output, which could result in them being erroneously interpreted as ending the string literal, starting an escape sequence, or being part of a trigraph (respectively). Also, escape sequences were output in hexadecimal format. Since there is no length limit on hexadecimal escape sequences, this could result in subsequent characters in the string being interpreted as part of the escape sequence. This fixes the issues by escaping the characters ", /, and ?, and by using three-digit octal escape sequences rather than hexadecimal ones.	2018-09-01 16:11:18 -05:00
Stephen Heumann	a6f1211ee6	Properly treat #line directive as giving the next line number, not the current one.	2018-08-31 21:46:10 -05:00

1 2 3 4

182 Commits