This document contains the release notes for the LLVM compiler
infrastructure, release 1.2. Here we describe the status of LLVM, including any
known problems and bug fixes from the previous release. The most up-to-date
version of this document can be found on the LLVM 1.2 web site. If you are
not reading this on the LLVM web pages, you should probably go there because
this document may be updated after the release.
For more information about LLVM, including information about potentially more
current releases, please check out the main
web site. If you have questions or comments, the LLVM developer's mailing
list is a good place to send them.
Note that if you are reading this file from CVS, this document applies
to the next release, not the current one. To see the release notes for
the current or previous releases, see the releases page.
This is the third public release of the LLVM compiler infrastructure.
At this time, LLVM is known to correctly compile and run all non-unwinding C
& C++ SPEC CPU2000 benchmarks, the Olden benchmarks, and the Ptrdist
benchmarks. It has also been used to compile many other programs. LLVM
now also works with a broad variety of C++ programs, though it has still
received much less testing than the C front-end.
The LLVM native code generators are very stable but do not currently support
unwinding (exception throwing or longjmping), which prevent them from
working with programs like the 253.perlbmk in SPEC CPU2000. The C
backend and the rest of LLVM supports these programs, so you can
still use LLVM with them. Support for unwinding will be added in a future
release.
This release implements the following new features:
In this release, the following missing features were implemented:
In this release, the following Quality of Implementation issues were fixed:
- JIT should lazily initialize global variables
- [X86] X86 Backend never releases memory for machine code structures
- [vmcore] OpaqueType objects memory leak
- [llvmgcc] C front-end does not compile "extern inline" into linkonce
- Bytecode format inconsistent
In this release, the following build problems were fixed:
- [build] Makefiles break if C frontend target string has unexpected value
- [build] hard-wired assumption that shared-library extension is ".so"
In this release, the following Code Quality issues were fixed:
- [loopsimplify] Many pointless phi nodes are created
- The X86 backend didn't generate fchs to negate floating point numbers
In this release, the following bugs in the previous release were fixed:
Bugs in the LLVM Core:
- [licm] LICM promotes volatile memory
locations to registers
- [licm] Memory read after free causes
infrequent crash
- [indvars] Induction variable
canonicalization always makes 32-bit indvars
- [constantmerge] Merging globals can
cause use of invalid pointers!
- [bcreader] Bytecode reader misreads 'long -9223372036854775808'!
- VMCore mishandles double -0.0
- [X86] X86 backend code generates -0.0 as +0.0
Bugs in the C/C++ front-end:
- Need weak linkage on memory
management functions in libc runtime to allow them to be overriden
- [llvm-gcc] asserts when an extern inline function is redefined
- [llvmg++] Dynamically initialized constants cannot be marked 'constant'
- [llvmgcc] floating-point unary minus is incorrect for +0.0
LLVM has been extensively tested on Intel and AMD machines running Red
Hat Linux and FreeBSD. It has also been tested on Sun UltraSPARC workstations running Solaris 8.
Additionally,
LLVM works on Mac OS X 10.3 and above, but only with the C backend or
interpreter (no native backend for the PowerPC is available yet).
The core LLVM infrastructure uses "autoconf" for portability, so hopefully we
work on more platforms than that. However, it is likely that we
missed something and that minor porting is required to get LLVM to work on
new platforms. We welcome portability patches and error messages.
This section contains all known problems with the LLVM system, listed by
component. As new problems are discovered, they will be added to these
sections. If you run into a problem, please check the LLVM bug database and submit a bug if
there isn't already one.
- Inline assembly is not yet supported.
- "long double" is transformed by the front-end into "double". There is no
support for floating point data types of any size other than 32 and 64
bits.
- The following Unix system functionality has not been tested and may not
work:
- sigsetjmp, siglongjmp - These are not turned into the
appropriate invoke/unwind instructions. Note that
setjmp and longjmp are compiled correctly.
- getcontext, setcontext, makecontext
- These functions have not been tested.
- Although many GCC extensions are supported, some are not. In particular,
the following extensions are known to not be supported:
- Local Labels: Labels local to a block.
- Labels as Values: Getting pointers to labels and computed gotos.
- Nested Functions: As in Algol and Pascal, lexical scoping of functions.
- Constructing Calls: Dispatching a call to another function.
- Extended Asm: Assembler instructions with C expressions as operands.
- Constraints: Constraints for asm operands.
- Asm Labels: Specifying the assembler name to use for a C symbol.
- Explicit Reg Vars: Defining variables residing in specified registers.
- Return Address: Getting the return or frame address of a function.
- Vector Extensions: Using vector instructions through built-in functions.
- Target Builtins: Built-in functions specific to particular targets.
- Thread-Local: Per-thread variables.
- Pragmas: Pragmas accepted by GCC.
The following GCC extensions are partially supported. An ignored
attribute means that the LLVM compiler ignores the presence of the attribute,
but the code should still work. An unsupported attribute is one which is
ignored by the LLVM compiler and will cause a different interpretation of
the program.
- Variable Length:
Arrays whose length is computed at run time.
Supported, but allocated stack space is not freed until the function returns (noted above).
- Function Attributes:
Declaring that functions have no side effects or that they can never
return.
Supported: format, format_arg, non_null,
constructor, destructor, unused,
deprecated, warn_unused_result, weak
Ignored: noreturn, noinline,
always_inline, pure, const, nothrow,
malloc, no_instrument_function, cdecl
Unsupported: used, section, alias,
visibility, regparm, stdcall,
fastcall, all other target specific attributes
- Variable Attributes:
Specifying attributes of variables.
Supported: cleanup, common, nocommon,
deprecated, transparent_union,
unused, weak
Unsupported: aligned, mode, packed,
section, shared, tls_model,
vector_size, dllimport,
dllexport, all target specific attributes.
- Type Attributes: Specifying attributes of types.
Supported: transparent_union, unused,
deprecated, may_alias
Unsupported: aligned, packed,
all target specific attributes.
- Other Builtins:
Other built-in functions.
We support all builtins which have a C language equivalent (e.g.,
__builtin_cos), __builtin_alloca,
__builtin_types_compatible_p, __builtin_choose_expr,
__builtin_constant_p, and __builtin_expect (ignored).
The following extensions are known to be supported:
- Statement Exprs: Putting statements and declarations inside expressions.
- Typeof:
typeof
: referring to the type of an expression.
- Lvalues: Using
?:
, ",
" and casts in lvalues.
- Conditionals: Omitting the middle operand of a
?:
expression.
- Long Long: Double-word integers.
- Complex: Data types for complex numbers.
- Hex Floats:Hexadecimal floating-point constants.
- Zero Length: Zero-length arrays.
- Empty Structures: Structures with no members.
- Variadic Macros: Macros with a variable number of arguments.
- Escaped Newlines: Slightly looser rules for escaped newlines.
- Subscripting: Any array can be subscripted, even if not an lvalue.
- Pointer Arith: Arithmetic on
void
-pointers and function pointers.
- Initializers: Non-constant initializers.
- Compound Literals: Compound literals give structures, unions,
or arrays as values.
- Designated Inits: Labeling elements of initializers.
- Cast to Union: Casting to union type from any member of the union.
- Case Ranges: `case 1 ... 9' and such.
- Mixed Declarations: Mixing declarations and code.
- Function Prototypes: Prototype declarations and old-style definitions.
- C++ Comments: C++ comments are recognized.
- Dollar Signs: Dollar sign is allowed in identifiers.
- Character Escapes:
\e
stands for the character <ESC>.
- Alignment: Inquiring about the alignment of a type or variable.
- Inline: Defining inline functions (as fast as macros).
- Alternate Keywords:
__const__
, __asm__
, etc., for header files.
- Incomplete Enums:
enum foo;
, with details to follow.
- Function Names: Printable strings which are the name of the current function.
- Unnamed Fields: Unnamed struct/union fields within structs/unions.
- Attribute Syntax: Formal syntax for attributes.
If you run into GCC extensions which have not been included in any of these
lists, please let us know (also including whether or not they work).
For this release, the C++ front-end is considered to be fully functional but
has not been tested as thoroughly as the C front-end. It has been tested and
works for a number of non-trivial programs, but there may be lurking bugs.
Please report any bugs or problems.
A wide variety of additional information is available on the LLVM web page,
including mailing lists and publications describing algorithms and components
implemented in LLVM. The web page also contains versions of the API
documentation which is up-to-date with the CVS version of the source code. You
can access versions of these documents specific to this release by going into
the "llvm/doc/" directory in the LLVM tree.
If you have any questions or comments about LLVM, please feel free to contact
us via the mailing
lists.