mirror of
https://github.com/c64scene-ar/llvm-6502.git
synced 2025-06-17 04:24:00 +00:00
Various updates suggested by Misha and Chris. Moved material that is
aimed more at the comiler writer than the llvmc user later in the document. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@15561 91177308-0d34-0410-b5e6-96231b3b80d8
This commit is contained in:
@ -33,6 +33,13 @@ following main goals or purposes:
|
|||||||
The tool itself does nothing with a user's program. It merely invokes other
|
The tool itself does nothing with a user's program. It merely invokes other
|
||||||
tools to get the compilation tasks done.
|
tools to get the compilation tasks done.
|
||||||
|
|
||||||
|
The options supported by B<llvmc> generalize the compilation process and
|
||||||
|
provide a consistent and simple interface for multiple programming languages.
|
||||||
|
This makes it easier for developers to get their software compiled with LLVM.
|
||||||
|
Without B<llvmc>, developers would need to understand how to invoke the
|
||||||
|
front-end compiler, optimizer, assembler, and linker in order to compile their
|
||||||
|
programs. B<llvmc>'s sole mission is to trivialize that process.
|
||||||
|
|
||||||
=head2 Basic Operation
|
=head2 Basic Operation
|
||||||
|
|
||||||
B<llvmc> always takes the following basic actions:
|
B<llvmc> always takes the following basic actions:
|
||||||
@ -41,23 +48,35 @@ B<llvmc> always takes the following basic actions:
|
|||||||
|
|
||||||
=item * Command line options and filenames are collected.
|
=item * Command line options and filenames are collected.
|
||||||
|
|
||||||
This provides the basic instruction to B<llvmc> on what actions it should
|
The command line options provide the marching orders to B<llvmc> on what actions
|
||||||
take. This is the I<request> the user is making of B<llvmc>.
|
it should perform. This is the I<request> the user is making of B<llvmc> and it
|
||||||
|
is interpreted first.
|
||||||
|
|
||||||
=item * Configuration files are read.
|
=item * Configuration files are read.
|
||||||
|
|
||||||
Based on the options and the suffixes of the filenames presented, a set
|
Based on the options and the suffixes of the filenames presented, a set of
|
||||||
of configuration files are read to configure the actions B<llvmc>
|
configuration files are read to configure the actions B<llvmc> will take
|
||||||
will take (more on this later).
|
(more on this later).
|
||||||
|
|
||||||
=item * Determine actions to take.
|
=item * Determine actions to take.
|
||||||
|
|
||||||
The tool chain needed to complete the task is determined. This is the
|
The tool chain needed to complete the task is determined. This is the primary
|
||||||
primary work of B<llvmc>. It breaks the request specified by the
|
work of B<llvmc>. It breaks the request specified by the command line options
|
||||||
command line options into a set of basic actions to be done:
|
into a set of basic actions to be done:
|
||||||
pre-processing, compilation, assembly, optimization, and linking.
|
|
||||||
For each applicable action, it selects the command to be run from
|
=over
|
||||||
the specifications in the configuration files.
|
|
||||||
|
=item * Pre-processing: gathering/filtering compiler input
|
||||||
|
|
||||||
|
=item * Compilation: source language to bytecode conversion
|
||||||
|
|
||||||
|
=item * Assembly: bytecode to native code conversion
|
||||||
|
|
||||||
|
=item * Optimization: conversion of bytecode to something that runs faster
|
||||||
|
|
||||||
|
=item * Linking: combining multiple bytecodes to produce executable program
|
||||||
|
|
||||||
|
=back
|
||||||
|
|
||||||
=item * Execute actions.
|
=item * Execute actions.
|
||||||
|
|
||||||
@ -66,17 +85,6 @@ B<llvmc> terminates.
|
|||||||
|
|
||||||
=back
|
=back
|
||||||
|
|
||||||
=head2 Configuration Files
|
|
||||||
|
|
||||||
B<llvmc> is highly configurable both on the command line and in configuration
|
|
||||||
files. Configuration files specify the details of what commands to run
|
|
||||||
for a given action. Each front end compiler must provide its own
|
|
||||||
configuration file to tell B<llvmc> how to invoke that compiler. The LLVM
|
|
||||||
toolset does not need to be configured as B<llvmc> just "knows" how to
|
|
||||||
invoke those tools.
|
|
||||||
|
|
||||||
Rest TBD.
|
|
||||||
|
|
||||||
=head1 OPTIONS
|
=head1 OPTIONS
|
||||||
|
|
||||||
=head2 Control Options
|
=head2 Control Options
|
||||||
@ -160,7 +168,7 @@ considering the program as a whole.
|
|||||||
This is the same as B<-O4> except it employs aggressive analyses and
|
This is the same as B<-O4> except it employs aggressive analyses and
|
||||||
aggressive inter-procedural optimization.
|
aggressive inter-procedural optimization.
|
||||||
|
|
||||||
=item B<-O6> (profile guided optimization - not implemented)
|
=item B<-O6> (profile guided optimization: not implemented)
|
||||||
|
|
||||||
This is the same as B<-O5> except that it employes profile-guided
|
This is the same as B<-O5> except that it employes profile-guided
|
||||||
reoptimization of the program after it has executed. Note that this implies
|
reoptimization of the program after it has executed. Note that this implies
|
||||||
@ -168,7 +176,7 @@ a single level of reoptimization based on runtime profile analysis. Once
|
|||||||
the re-optimization has completed, the profiling instrumentation is
|
the re-optimization has completed, the profiling instrumentation is
|
||||||
removed and final optimizations are employed.
|
removed and final optimizations are employed.
|
||||||
|
|
||||||
=item B<-O7> (lifelong optimization - not implemented)
|
=item B<-O7> (lifelong optimization: not implemented)
|
||||||
|
|
||||||
This is the same as B<-O5> and similar to B<-O6> except that reoptimization
|
This is the same as B<-O5> and similar to B<-O6> except that reoptimization
|
||||||
is performed through the life of the program. That is, each run will update
|
is performed through the life of the program. That is, each run will update
|
||||||
@ -225,48 +233,6 @@ options.
|
|||||||
|
|
||||||
=back
|
=back
|
||||||
|
|
||||||
=head2 Configuration Options
|
|
||||||
|
|
||||||
=over
|
|
||||||
|
|
||||||
=item B<--show-config> I<[suffixes...]>
|
|
||||||
|
|
||||||
When this option is given, the only action taken by B<llvmc> is to show its
|
|
||||||
final configuration state in the form of a configuration file. No compilation
|
|
||||||
tasks will be conducted when this option is given; processing will stop once
|
|
||||||
the configuration has been printed. The optional (comma separated) list of
|
|
||||||
suffixes controls what is printed. Without any suffixes, the configuration
|
|
||||||
for all languages is printed. With suffixes, only the languages pertaining
|
|
||||||
to those file suffixes will be printed. The configuration information is
|
|
||||||
printed after all command line options and configuration files have been
|
|
||||||
read and processed. This allows the user to verify that the correct
|
|
||||||
configuration data has been read by B<llvmc>.
|
|
||||||
|
|
||||||
=item B<--config> :I<section>:I<name>=I<value>
|
|
||||||
|
|
||||||
This option instructs B<llvmc> to accept I<value> as the value for configuration
|
|
||||||
item I<name> in the section named I<section>. This is a quick way to override
|
|
||||||
a configuration item on the command line without resorting to changing the
|
|
||||||
configuration files.
|
|
||||||
|
|
||||||
=item B<--config-file> F<dirname>
|
|
||||||
|
|
||||||
This option tells B<llvmc> to read configuration data from the I<directory>
|
|
||||||
named F<dirname>. Data from such directories will be read in the order
|
|
||||||
specified on the command line after all other standard config files have
|
|
||||||
been read. This allows users or groups of users to conveniently create
|
|
||||||
their own configuration directories in addition to the standard ones to which
|
|
||||||
they may not have write access.
|
|
||||||
|
|
||||||
=item B<--config-only-from> F<dirname>
|
|
||||||
|
|
||||||
This option tells B<llvmc> to skip the normal processing of configuration
|
|
||||||
files and only configure from the contents of the F<dirname> directory. Multiple
|
|
||||||
B<--config-only-from> options may be given in which case the directories are
|
|
||||||
read in the order given on the command line.
|
|
||||||
|
|
||||||
=back
|
|
||||||
|
|
||||||
=head2 Information Options
|
=head2 Information Options
|
||||||
|
|
||||||
=over
|
=over
|
||||||
@ -325,29 +291,13 @@ Pass an arbitrary optionsto the code generator.
|
|||||||
|
|
||||||
=over
|
=over
|
||||||
|
|
||||||
=item B<-I>F<path> (C/C++ Only)
|
=item B<-I>F<path>
|
||||||
|
|
||||||
This option is just passed through to a C or C++ front end compiler to tell it
|
This option is just passed through to a C or C++ front end compiler to tell it
|
||||||
where include files can be found.
|
where include files can be found.
|
||||||
|
|
||||||
=back
|
=back
|
||||||
|
|
||||||
=head2 Advanced Options
|
|
||||||
|
|
||||||
=over
|
|
||||||
|
|
||||||
=item B<--emit-raw-code>
|
|
||||||
|
|
||||||
No optimization is done whatsoever. The compilers invoked by B<llvmc> with
|
|
||||||
this option given will be instructed to produce raw, unoptimized code. This
|
|
||||||
option is useful only to front end language developers and therefore does not
|
|
||||||
participate in the list of B<-O> options. This is distinctly different from
|
|
||||||
the B<-O0> option (a synonym for B<-O1>) because those optimizations will
|
|
||||||
reduce code size to make compilation faster. With B<--emit-raw-code>, only
|
|
||||||
the full raw code produced by the compiler will be generated.
|
|
||||||
|
|
||||||
=back
|
|
||||||
|
|
||||||
=head2 Miscellaneous Options
|
=head2 Miscellaneous Options
|
||||||
|
|
||||||
=over
|
=over
|
||||||
@ -363,6 +313,205 @@ and terminate.
|
|||||||
|
|
||||||
=back
|
=back
|
||||||
|
|
||||||
|
=head2 Advanced Options
|
||||||
|
|
||||||
|
You better know what you're doing if you use these options. Improper use
|
||||||
|
of these options can produce drastically wrong results.
|
||||||
|
|
||||||
|
=over
|
||||||
|
|
||||||
|
=item B<--show-config> I<[suffixes...]>
|
||||||
|
|
||||||
|
When this option is given, the only action taken by B<llvmc> is to show its
|
||||||
|
final configuration state in the form of a configuration file. No compilation
|
||||||
|
tasks will be conducted when this option is given; processing will stop once
|
||||||
|
the configuration has been printed. The optional (comma separated) list of
|
||||||
|
suffixes controls what is printed. Without any suffixes, the configuration
|
||||||
|
for all languages is printed. With suffixes, only the languages pertaining
|
||||||
|
to those file suffixes will be printed. The configuration information is
|
||||||
|
printed after all command line options and configuration files have been
|
||||||
|
read and processed. This allows the user to verify that the correct
|
||||||
|
configuration data has been read by B<llvmc>.
|
||||||
|
|
||||||
|
=item B<--config> :I<section>:I<name>=I<value>
|
||||||
|
|
||||||
|
This option instructs B<llvmc> to accept I<value> as the value for configuration
|
||||||
|
item I<name> in the section named I<section>. This is a quick way to override
|
||||||
|
a configuration item on the command line without resorting to changing the
|
||||||
|
configuration files.
|
||||||
|
|
||||||
|
=item B<--config-file> F<dirname>
|
||||||
|
|
||||||
|
This option tells B<llvmc> to read configuration data from the I<directory>
|
||||||
|
named F<dirname>. Data from such directories will be read in the order
|
||||||
|
specified on the command line after all other standard config files have
|
||||||
|
been read. This allows users or groups of users to conveniently create
|
||||||
|
their own configuration directories in addition to the standard ones to which
|
||||||
|
they may not have write access.
|
||||||
|
|
||||||
|
=item B<--config-only-from> F<dirname>
|
||||||
|
|
||||||
|
This option tells B<llvmc> to skip the normal processing of configuration
|
||||||
|
files and only configure from the contents of the F<dirname> directory. Multiple
|
||||||
|
B<--config-only-from> options may be given in which case the directories are
|
||||||
|
read in the order given on the command line.
|
||||||
|
|
||||||
|
|
||||||
|
=item B<--emit-raw-code>
|
||||||
|
|
||||||
|
No optimization is done whatsoever. The compilers invoked by B<llvmc> with
|
||||||
|
this option given will be instructed to produce raw, unoptimized code. This
|
||||||
|
option is useful only to front end language developers and therefore does not
|
||||||
|
participate in the list of B<-O> options. This is distinctly different from
|
||||||
|
the B<-O0> option (a synonym for B<-O1>) because those optimizations will
|
||||||
|
reduce code size to make compilation faster. With B<--emit-raw-code>, only
|
||||||
|
the full raw code produced by the compiler will be generated.
|
||||||
|
|
||||||
|
=back
|
||||||
|
|
||||||
|
=head1 CONFIGURATION
|
||||||
|
|
||||||
|
=head2 Warning
|
||||||
|
|
||||||
|
Configuration information is relatively static for a given release of LLVM and
|
||||||
|
a front end compiler. However, the details may change from release to release.
|
||||||
|
Users are encouraged to simply use the various options of the B<llvmc> command
|
||||||
|
and ignore the configuration of the tool. These configuration files are for
|
||||||
|
compiler writers and LLVM developers. Those wishing to simply use B<llvmc>
|
||||||
|
don't need to understand this section but it may be instructive on what the tool
|
||||||
|
does.
|
||||||
|
|
||||||
|
=head2 Introduction
|
||||||
|
|
||||||
|
B<llvmc> is highly configurable both on the command line and in configuration
|
||||||
|
files. The options it understands are generic, consistent and simple by design.
|
||||||
|
Furthermore, the B<llvmc> options apply to the compilation of any LLVM enabled
|
||||||
|
programming language. To be enabled as a supported source language compiler, a
|
||||||
|
compiler writer must provide a configuration file that tells B<llvmc> how to
|
||||||
|
invoke the compiler and what its capabilities are. The purpose of the
|
||||||
|
configuration files then is to allow compiler writers to specify to B<llvmc> how
|
||||||
|
the compiler should be invoked. Users may but are not advised to alter the
|
||||||
|
compiler's B<llvmc> configuration.
|
||||||
|
|
||||||
|
Because B<llvmc> just invokes other programs, it must deal with the
|
||||||
|
available command line options for those programs regardless of whether they
|
||||||
|
were written for LLVM or not. Furthermore, not all compilation front ends will
|
||||||
|
have the same capabilities. Some front ends will simply generate LLVM assembly
|
||||||
|
code, others will be able to generate fully optimized byte code. In general,
|
||||||
|
B<llvmc> doesn't make any assumptions about the capabilities or command line
|
||||||
|
options of a sub-tool. It simply uses the details found in the configuration
|
||||||
|
files and leaves it to the compiler writer to specify the configuration
|
||||||
|
correctly.
|
||||||
|
|
||||||
|
Ths approach means that new compiler front ends can be up and working very
|
||||||
|
quickly. As a first cut, a front end can simply compile its source to raw
|
||||||
|
(unoptimized) bytecode or LLVM assembly and B<llvmc> can be configured to pick
|
||||||
|
up the slack (translate LLVm assembly to bytecode, optimize the bytecode,
|
||||||
|
generate native assembly, link, etc.). In fact, the front end need not use
|
||||||
|
any LLVM libraries, and it could be written in any language (instead of C++).
|
||||||
|
The configuration data will allow the full range of optimization, assembly,
|
||||||
|
and linking capabilities that LLVM provides to be added to these kinds of tools.
|
||||||
|
Enabling the rapid development of front-ends is one of the primary goals of
|
||||||
|
B<llvmc>.
|
||||||
|
|
||||||
|
As a compiler front end matures, it may utilize the LLVM libraries and tools to
|
||||||
|
more efficiently produce optimized bytecode directly in a single compilation and
|
||||||
|
optimization program. In these cases, multiple tools would not be needed and
|
||||||
|
the configuration data for the compiler would change.
|
||||||
|
|
||||||
|
Configuring B<llvmc> to the needs and capabilities of a source language compiler
|
||||||
|
is relatively straight forward. The compilation process is broken down into five
|
||||||
|
phases:
|
||||||
|
|
||||||
|
=over
|
||||||
|
|
||||||
|
=item * Pre-processing (filter and combine source files)
|
||||||
|
|
||||||
|
=item * Translation (translate source language to LLVM assembly or bytecode)
|
||||||
|
|
||||||
|
=item * Optimization (make bytecode execute quickly)
|
||||||
|
|
||||||
|
=item * Assembly (converting bytecode to object code)
|
||||||
|
|
||||||
|
=item * Linking (converting translated code to an executable)
|
||||||
|
|
||||||
|
=back
|
||||||
|
|
||||||
|
A compiler writer must provide a definition of what to do for each of these five
|
||||||
|
phases for each of the optimization levels. The specification consists simply of
|
||||||
|
prototypical command lines into which B<llvmc> can substitute command line
|
||||||
|
arguments and file names. Note that any given phase can be completely blank if
|
||||||
|
the source language's compiler combines multiple phases into a single program.
|
||||||
|
For example, quite often pre-processng, translation, and optimization are
|
||||||
|
combined into a single program. The specification for such a compiler would have
|
||||||
|
blank entries for pre-processing and translation but a full command line for
|
||||||
|
optimization.
|
||||||
|
|
||||||
|
=head2 Configuration File Types
|
||||||
|
|
||||||
|
There are two types of configuration files: the master configuration file
|
||||||
|
and the language specific configuration file.
|
||||||
|
|
||||||
|
The master configuration file contains the general configuration of B<llvmc>
|
||||||
|
itself. This includes things like the mapping between file extensions and
|
||||||
|
source languages. This mapping is needed in order to quickly read only the
|
||||||
|
applicable language-specific configuration files (avoiding reading every config
|
||||||
|
file for every compilation task).
|
||||||
|
|
||||||
|
Language specific configuration files tell B<llvmc> how to invoke the language's
|
||||||
|
compiler for a variety of different tasks and what other tools are needed to
|
||||||
|
I<backfill> the compiler's missing features (e.g. optimization).
|
||||||
|
|
||||||
|
Language specific configuration files are placed in directories and given
|
||||||
|
specific names to foster faster lookup. The name of a given configuration file
|
||||||
|
is the name of the source language.
|
||||||
|
|
||||||
|
=head2 Default Directory Locations
|
||||||
|
|
||||||
|
B<llvmc> will look for configuration files in two standard locations: the
|
||||||
|
LLVM installation directory (typically C</usr/local/llvm/etc>) and the user's
|
||||||
|
home directory (typically C</home/user/.llvm>). In these directories a file named
|
||||||
|
C<master> provides the master configuration for B<llvmc>. Language specific
|
||||||
|
files will have a language specific name (e.g. C++, Stacker, Scheme, FORTRAN).
|
||||||
|
When reading the configuration files, the master files are always read first in
|
||||||
|
the following order:
|
||||||
|
|
||||||
|
=over
|
||||||
|
|
||||||
|
=item 1 C<master> in LLVM installation directory
|
||||||
|
|
||||||
|
=item 2 C<master> in the user's home directory.
|
||||||
|
|
||||||
|
=back
|
||||||
|
|
||||||
|
Then, based on the command line options and the suffixes of the file names
|
||||||
|
provided on B<llvmc>'s command line, one or more language specific configuration
|
||||||
|
files are read. Only the language specific configuration files actually needed
|
||||||
|
to complete B<llvmc>'s task are read. Other language specific files will be
|
||||||
|
ignored.
|
||||||
|
|
||||||
|
Note that the user can affect this process in several ways using the various
|
||||||
|
B<--config-*> options and with the B<--x LANGUAGE> option.
|
||||||
|
|
||||||
|
Although a user I<can> override the master configuration file, this is not
|
||||||
|
advised. The capability is retained so that compiler writers can affect the
|
||||||
|
master configuration (such as adding new file suffixes) while developing a new
|
||||||
|
compiler front end since they might not have write access to the installed
|
||||||
|
master configuration.
|
||||||
|
|
||||||
|
=head2 Syntax
|
||||||
|
|
||||||
|
The syntax of the configuration files is yet to be determined. There are three
|
||||||
|
viable options:
|
||||||
|
|
||||||
|
=over
|
||||||
|
|
||||||
|
=item XML
|
||||||
|
=item Windows .ini
|
||||||
|
=item specific to B<llvmc>
|
||||||
|
|
||||||
|
=back
|
||||||
|
|
||||||
=head1 EXIT STATUS
|
=head1 EXIT STATUS
|
||||||
|
|
||||||
If B<llvmc> succeeds, it will exit with 0. Otherwise, if an error
|
If B<llvmc> succeeds, it will exit with 0. Otherwise, if an error
|
||||||
|
Reference in New Issue
Block a user