mirror of
https://github.com/autc04/Retro68.git
synced 2025-01-24 14:32:25 +00:00
352 lines
10 KiB
XML
352 lines
10 KiB
XML
<chapter xmlns="http://docbook.org/ns/docbook" version="5.0"
|
|
xml:id="manual.ext.concurrency" xreflabel="Concurrency Extensions">
|
|
<?dbhtml filename="ext_concurrency.html"?>
|
|
|
|
<info><title>Concurrency</title>
|
|
<keywordset>
|
|
<keyword>ISO C++</keyword>
|
|
<keyword>library</keyword>
|
|
</keywordset>
|
|
</info>
|
|
|
|
|
|
|
|
<section xml:id="manual.ext.concurrency.design" xreflabel="Design"><info><title>Design</title></info>
|
|
|
|
|
|
<section xml:id="manual.ext.concurrency.design.threads" xreflabel="Threads API"><info><title>Interface to Locks and Mutexes</title></info>
|
|
|
|
|
|
<para>The file <filename class="headerfile"><ext/concurrence.h></filename>
|
|
contains all the higher-level
|
|
constructs for playing with threads. In contrast to the atomics layer,
|
|
the concurrence layer consists largely of types. All types are defined within <code>namespace __gnu_cxx</code>.
|
|
</para>
|
|
|
|
<para>
|
|
These types can be used in a portable manner, regardless of the
|
|
specific environment. They are carefully designed to provide optimum
|
|
efficiency and speed, abstracting out underlying thread calls and
|
|
accesses when compiling for single-threaded situations (even on hosts
|
|
that support multiple threads.)
|
|
</para>
|
|
|
|
<para>The enumerated type <code>_Lock_policy</code> details the set of
|
|
available locking
|
|
policies: <code>_S_single</code>, <code>_S_mutex</code>,
|
|
and <code>_S_atomic</code>.
|
|
</para>
|
|
|
|
<itemizedlist>
|
|
<listitem><para><code>_S_single</code></para>
|
|
<para>Indicates single-threaded code that does not need locking.
|
|
</para>
|
|
|
|
</listitem>
|
|
<listitem><para><code>_S_mutex</code></para>
|
|
<para>Indicates multi-threaded code using thread-layer abstractions.
|
|
</para>
|
|
</listitem>
|
|
<listitem><para><code>_S_atomic</code></para>
|
|
<para>Indicates multi-threaded code using atomic operations.
|
|
</para>
|
|
</listitem>
|
|
</itemizedlist>
|
|
|
|
<para>The compile-time constant <code>__default_lock_policy</code> is set
|
|
to one of the three values above, depending on characteristics of the
|
|
host environment and the current compilation flags.
|
|
</para>
|
|
|
|
<para>Two more datatypes make up the rest of the
|
|
interface: <code>__mutex</code>, and <code>__scoped_lock</code>.
|
|
</para>
|
|
|
|
<para>The scoped lock idiom is well-discussed within the C++
|
|
community. This version takes a <code>__mutex</code> reference, and
|
|
locks it during construction of <code>__scoped_lock</code> and
|
|
unlocks it during destruction. This is an efficient way of locking
|
|
critical sections, while retaining exception-safety.
|
|
These types have been superseded in the ISO C++ 2011 standard by the
|
|
mutex and lock types defined in the header
|
|
<filename class="headerfile"><mutex></filename>.
|
|
</para>
|
|
</section>
|
|
|
|
<section xml:id="manual.ext.concurrency.design.atomics" xreflabel="Atomic API"><info><title>Interface to Atomic Functions</title></info>
|
|
|
|
|
|
|
|
<para>
|
|
Two functions and one type form the base of atomic support.
|
|
</para>
|
|
|
|
|
|
<para>The type <code>_Atomic_word</code> is a signed integral type
|
|
supporting atomic operations.
|
|
</para>
|
|
|
|
<para>
|
|
The two functions functions are:
|
|
</para>
|
|
|
|
<programlisting>
|
|
_Atomic_word
|
|
__exchange_and_add_dispatch(volatile _Atomic_word*, int);
|
|
|
|
void
|
|
__atomic_add_dispatch(volatile _Atomic_word*, int);
|
|
</programlisting>
|
|
|
|
<para>Both of these functions are declared in the header file
|
|
<ext/atomicity.h>, and are in <code>namespace __gnu_cxx</code>.
|
|
</para>
|
|
|
|
<itemizedlist>
|
|
<listitem><para>
|
|
<code>
|
|
__exchange_and_add_dispatch
|
|
</code>
|
|
</para>
|
|
<para>Adds the second argument's value to the first argument. Returns the old value.
|
|
</para>
|
|
</listitem>
|
|
<listitem><para>
|
|
<code>
|
|
__atomic_add_dispatch
|
|
</code>
|
|
</para>
|
|
<para>Adds the second argument's value to the first argument. Has no return value.
|
|
</para>
|
|
</listitem>
|
|
</itemizedlist>
|
|
|
|
<para>
|
|
These functions forward to one of several specialized helper
|
|
functions, depending on the circumstances. For instance,
|
|
</para>
|
|
|
|
<para>
|
|
<code>
|
|
__exchange_and_add_dispatch
|
|
</code>
|
|
</para>
|
|
|
|
<para>
|
|
Calls through to either of:
|
|
</para>
|
|
|
|
<itemizedlist>
|
|
<listitem><para><code>__exchange_and_add</code>
|
|
</para>
|
|
<para>Multi-thread version. Inlined if compiler-generated builtin atomics
|
|
can be used, otherwise resolved at link time to a non-builtin code
|
|
sequence.
|
|
</para>
|
|
</listitem>
|
|
|
|
<listitem><para><code>__exchange_and_add_single</code>
|
|
</para>
|
|
<para>Single threaded version. Inlined.</para>
|
|
</listitem>
|
|
</itemizedlist>
|
|
|
|
<para>However, only <code>__exchange_and_add_dispatch</code>
|
|
and <code>__atomic_add_dispatch</code> should be used. These functions
|
|
can be used in a portable manner, regardless of the specific
|
|
environment. They are carefully designed to provide optimum efficiency
|
|
and speed, abstracting out atomic accesses when they are not required
|
|
(even on hosts that support compiler intrinsics for atomic
|
|
operations.)
|
|
</para>
|
|
|
|
<para>
|
|
In addition, there are two macros
|
|
</para>
|
|
|
|
<para>
|
|
<code>
|
|
_GLIBCXX_READ_MEM_BARRIER
|
|
</code>
|
|
</para>
|
|
<para>
|
|
<code>
|
|
_GLIBCXX_WRITE_MEM_BARRIER
|
|
</code>
|
|
</para>
|
|
|
|
<para>
|
|
Which expand to the appropriate write and read barrier required by the
|
|
host hardware and operating system.
|
|
</para>
|
|
</section>
|
|
|
|
</section>
|
|
|
|
|
|
<section xml:id="manual.ext.concurrency.impl" xreflabel="Implementation"><info><title>Implementation</title></info>
|
|
<?dbhtml filename="ext_concurrency_impl.html"?>
|
|
|
|
<section xml:id="manual.ext.concurrency.impl.atomic_fallbacks" xreflabel="Atomic F"><info><title>Using Built-in Atomic Functions</title></info>
|
|
|
|
|
|
<para>The functions for atomic operations described above are either
|
|
implemented via compiler intrinsics (if the underlying host is
|
|
capable) or by library fallbacks.</para>
|
|
|
|
<para>Compiler intrinsics (builtins) are always preferred. However, as
|
|
the compiler builtins for atomics are not universally implemented,
|
|
using them directly is problematic, and can result in undefined
|
|
function calls.
|
|
</para>
|
|
|
|
<para>Prior to GCC 4.7 the older <code>__sync</code> intrinsics were used.
|
|
An example of an undefined symbol from the use
|
|
of <code>__sync_fetch_and_add</code> on an unsupported host is a
|
|
missing reference to <code>__sync_fetch_and_add_4</code>.
|
|
</para>
|
|
|
|
<para>Current releases use the newer <code>__atomic</code> intrinsics,
|
|
which are implemented by library calls if the hardware doesn't support them.
|
|
Undefined references to functions like
|
|
<code>__atomic_is_lock_free</code> should be resolved by linking to
|
|
<filename class="libraryfile">libatomic</filename>, which is usually
|
|
installed alongside libstdc++.
|
|
</para>
|
|
|
|
<para>In addition, on some hosts the compiler intrinsics are enabled
|
|
conditionally, via the <code>-march</code> command line flag. This makes
|
|
usage vary depending on the target hardware and the flags used during
|
|
compile.
|
|
</para>
|
|
|
|
|
|
|
|
<para>
|
|
<remark>
|
|
Incomplete/inconsistent. This is only C++11.
|
|
</remark>
|
|
</para>
|
|
|
|
<para>
|
|
If builtins are possible for bool-sized integral types,
|
|
<code>ATOMIC_BOOL_LOCK_FREE</code> will be defined.
|
|
If builtins are possible for int-sized integral types,
|
|
<code>ATOMIC_INT_LOCK_FREE</code> will be defined.
|
|
</para>
|
|
|
|
|
|
<para>For the following hosts, intrinsics are enabled by default.
|
|
</para>
|
|
|
|
<itemizedlist>
|
|
<listitem><para>alpha</para></listitem>
|
|
<listitem><para>ia64</para></listitem>
|
|
<listitem><para>powerpc</para></listitem>
|
|
<listitem><para>s390</para></listitem>
|
|
</itemizedlist>
|
|
|
|
<para>For others, some form of <code>-march</code> may work. On
|
|
non-ancient x86 hardware, <code>-march=native</code> usually does the
|
|
trick.</para>
|
|
|
|
<para> For hosts without compiler intrinsics, but with capable
|
|
hardware, hand-crafted assembly is selected. This is the case for the following hosts:
|
|
</para>
|
|
|
|
<itemizedlist>
|
|
<listitem><para>cris</para></listitem>
|
|
<listitem><para>hppa</para></listitem>
|
|
<listitem><para>i386</para></listitem>
|
|
<listitem><para>i486</para></listitem>
|
|
<listitem><para>m48k</para></listitem>
|
|
<listitem><para>mips</para></listitem>
|
|
<listitem><para>sparc</para></listitem>
|
|
</itemizedlist>
|
|
|
|
<para>And for the rest, a simulated atomic lock via pthreads.
|
|
</para>
|
|
|
|
<para> Detailed information about compiler intrinsics for atomic operations can be found in the GCC <link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://gcc.gnu.org/onlinedocs/gcc/_005f_005fatomic-Builtins.html"> documentation</link>.
|
|
</para>
|
|
|
|
<para> More details on the library fallbacks from the porting <link linkend="internals.thread_safety">section</link>.
|
|
</para>
|
|
|
|
|
|
</section>
|
|
<section xml:id="manual.ext.concurrency.impl.thread" xreflabel="Pthread"><info><title>Thread Abstraction</title></info>
|
|
|
|
|
|
<para>A thin layer above IEEE 1003.1 (i.e. pthreads) is used to abstract
|
|
the thread interface for GCC. This layer is called "gthread," and is
|
|
comprised of one header file that wraps the host's default thread layer with
|
|
a POSIX-like interface.
|
|
</para>
|
|
|
|
<para> The file <gthr-default.h> points to the deduced wrapper for
|
|
the current host. In libstdc++ implementation files,
|
|
<bits/gthr.h> is used to select the proper gthreads file.
|
|
</para>
|
|
|
|
<para>Within libstdc++ sources, all calls to underlying thread functionality
|
|
use this layer. More detail as to the specific interface can be found in the source <link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://gcc.gnu.org/onlinedocs/libstdc++/latest-doxygen/index.html">documentation</link>.
|
|
</para>
|
|
|
|
<para>By design, the gthread layer is interoperable with the types,
|
|
functions, and usage found in the usual <pthread.h> file,
|
|
including <code>pthread_t</code>, <code>pthread_once_t</code>, <code>pthread_create</code>,
|
|
etc.
|
|
</para>
|
|
|
|
</section>
|
|
</section>
|
|
|
|
<section xml:id="manual.ext.concurrency.use" xreflabel="Use"><info><title>Use</title></info>
|
|
<?dbhtml filename="ext_concurrency_use.html"?>
|
|
|
|
|
|
<para>Typical usage of the last two constructs is demonstrated as follows:
|
|
</para>
|
|
|
|
<programlisting>
|
|
#include <ext/concurrence.h>
|
|
|
|
namespace
|
|
{
|
|
__gnu_cxx::__mutex safe_base_mutex;
|
|
} // anonymous namespace
|
|
|
|
namespace other
|
|
{
|
|
void
|
|
foo()
|
|
{
|
|
__gnu_cxx::__scoped_lock sentry(safe_base_mutex);
|
|
for (int i = 0; i < max; ++i)
|
|
{
|
|
_Safe_iterator_base* __old = __iter;
|
|
__iter = __iter-<_M_next;
|
|
__old-<_M_detach_single();
|
|
}
|
|
}
|
|
</programlisting>
|
|
|
|
<para>In this sample code, an anonymous namespace is used to keep
|
|
the <code>__mutex</code> private to the compilation unit,
|
|
and <code>__scoped_lock</code> is used to guard access to the critical
|
|
section within the for loop, locking the mutex on creation and freeing
|
|
the mutex as control moves out of this block.
|
|
</para>
|
|
|
|
<para>Several exception classes are used to keep track of
|
|
concurrence-related errors. These classes
|
|
are: <code>__concurrence_lock_error</code>, <code>__concurrence_unlock_error</code>, <code>__concurrence_wait_error</code>,
|
|
and <code>__concurrence_broadcast_error</code>.
|
|
</para>
|
|
|
|
|
|
</section>
|
|
|
|
</chapter>
|