Commit Graph

1632 Commits

Author SHA1 Message Date
Rich Felker
708c91f4e9 remove useless failure-check from freopen (can't happen) 2012-10-24 20:03:43 -04:00
Rich Felker
838951c97e simplify logic in stpcpy; avoid copying first aligned byte twice
gcc seems to be generating identical or near-identical code for both
versions, but the newer code is more expressive of what it's doing.
2012-10-22 15:17:09 -04:00
Rich Felker
4eb4844b31 fix issues with wait constants in stdlib.h
the W* namespace is not reserved, so the nonstandard ones must be
moved under extension features. also WNOHANG and WUNTRACED were
missing.
2012-10-21 19:15:11 -04:00
Rich Felker
31a55f233b as an extension, have putenv("VAR") behave as unsetenv("VAR")
the behavior of putenv is left undefined if the argument does not
contain an equal sign, but traditional implementations behave this way
and gnulib replaces putenv if it doesn't do this.
2012-10-21 18:37:15 -04:00
Rich Felker
8489897e01 accept "nan(n-char-sequence)" in strtod/scanf functions
this will prevent gnulib from wrapping our strtod to handle this
useless feature.
2012-10-21 18:28:20 -04:00
Rich Felker
f1b23d41cf fix copy/paste error in popen changes that broke signals
signal mask was not being restored after fork, but instead blocked again.
2012-10-21 13:23:03 -04:00
nsz
2d321fa062 complex: make _Complex_I work with gcc -std=c99 -pedantic-errors 2012-10-21 02:31:04 +02:00
Rich Felker
0a1c2c1c17 support looking up thread-local objects with dlsym 2012-10-19 21:57:56 -04:00
Rich Felker
deb15b3cf2 fix breakage in dlsym for looking up RTLD_DEFAULT, etc.
this was broken during the early dynamic-linked TLS commits, which
rearranged some of the code for handling new relocation types.
2012-10-19 21:41:30 -04:00
Rich Felker
dde325d7b3 fix struct stat size/padding on microblaze 2012-10-19 18:52:14 -04:00
Rich Felker
599f973603 fix usage of locks with vfork
__release_ptc() is only valid in the parent; if it's performed in the
child, the lock will be unlocked early then double-unlocked later,
corrupting the lock state.
2012-10-19 15:02:37 -04:00
Rich Felker
ebee8c2b47 fix crashes in static-linked multithreaded programs without TLS 2012-10-19 01:33:52 -04:00
Rich Felker
6a04d1b8c6 fix order of syscall args for microblaze clone syscall
with this commit, based on testing with patches to qemu which are not
yet upstream,
2012-10-19 00:27:03 -04:00
Rich Felker
01e5a1b6cd use $CC rather than "$CC" in configure script
this is necessary to allow $CC with arguments in it
2012-10-18 23:02:53 -04:00
Rich Felker
25011215be inline syscalls for microblaze 2012-10-18 22:13:36 -04:00
Rich Felker
1f485230df ensure microblaze __set_thread_area returns success
since it did not set the return-value register, the caller could
wrongly interpret this as failure.
2012-10-18 22:01:24 -04:00
Rich Felker
64251d8bbd better support for reverse-endian variants of arm/mips/microblaze
these macros are supported by more compilers
2012-10-18 21:50:55 -04:00
Rich Felker
776251867d avoid raising spurious division-by-zero exception in printf 2012-10-18 20:26:41 -04:00
Rich Felker
f321de9e0a floating point environment/exceptions support for mips 2012-10-18 20:19:53 -04:00
Rich Felker
97c8bdd88a fix parent-memory-clobber in posix_spawn (environ) 2012-10-18 16:41:27 -04:00
Rich Felker
44eb4d8b9b overhaul system() and popen() to use vfork; fix various related bugs
since we target systems without overcommit, special care should be
taken that system() and popen(), like posix_spawn(), do not fail in
processes whose commit charges are too high to allow ordinary forking.

this in turn requires special precautions to ensure that the parent
process's signal handlers do not end up running in the shared-memory
child, where they could corrupt the state of the parent process.

popen has also been updated to use pipe2, so it does not have a
fd-leak race in multi-threaded programs. since pipe2 is missing on
older kernels, (non-atomic) emulation has been added.

some silly bugs in the old code should be gone too.
2012-10-18 15:58:23 -04:00
Rich Felker
f1e7a5e5f6 fix (hopefully; untested) completely broken/incomplete microblaze sigsetjmp 2012-10-18 00:09:36 -04:00
Rich Felker
ef2f595f74 fix microblaze asm relocations for shared libc
only @PLT relocations are considered functions for purposes of
-Bsymbolic-functions, so always use @PLT. it should not hurt in the
static-linked case.
2012-10-17 23:45:21 -04:00
Rich Felker
858df113bf assert() is supposed to have type void 2012-10-17 02:47:11 -04:00
Rich Felker
c86f2974e2 add memmem function (gnu extension)
based on strstr. passes gnulib tests and a few quick checks of my own.
2012-10-15 23:02:57 -04:00
Rich Felker
21284ec75d microblaze TLS relocation support, completely untested 2012-10-15 21:01:48 -04:00
Rich Felker
9ec4283b28 add support for TLS variant I, presently needed for arm and mips
despite documentation that makes it sound a lot different, the only
ABI-constraint difference between TLS variants II and I seems to be
that variant II stores the initial TLS segment immediately below the
thread pointer (i.e. the thread pointer points to the end of it) and
variant I stores the initial TLS segment above the thread pointer,
requiring the thread descriptor to be stored below. the actual value
stored in the thread pointer register also tends to have per-arch
random offsets applied to it for silly micro-optimization purposes.

with these changes applied, TLS should be basically working on all
supported archs except microblaze. I'm still working on getting the
necessary information and a working toolchain that can build TLS
binaries for microblaze, but in theory, static-linked programs with
TLS and dynamic-linked programs where only the main executable uses
TLS should already work on microblaze.

alignment constraints have not yet been heavily tested, so it's
possible that this code does not always align TLS segments correctly
on archs that need TLS variant I.
2012-10-15 18:51:53 -04:00
Rich Felker
d5304147b9 block uid/gid changes during posix_spawn
usage of vfork creates a situation where a process of lower privilege
may momentarily have write access to the memory of a process of higher
privilege.

consider the case of a multi-threaded suid program which is calling
posix_spawn in one thread while another thread drops the elevated
privileges then runs untrusted (relative to the elevated privilege)
code as the original invoking user. this untrusted code can then
potentially modify the data the child process will use before calling
exec, for example changing the pathname or arguments that will be
passed to exec.

note that if vfork is implemented as fork, the lock will not be held
until the child execs, but since memory is not shared it does not
matter.
2012-10-15 11:42:46 -04:00
Rich Felker
42c36f957d fix overlap of thread stacks with thread tls segments 2012-10-14 21:10:44 -04:00
Rich Felker
c62b9f3980 fix main program TLS alignment for dynamic-linked programs
this change brings the behavior in line with the static-linked code,
which seems to be correct.
2012-10-14 19:56:50 -04:00
Rich Felker
36be5284c2 workaround broken hidden-visibility handling in pcc
with this change, pcc-built musl libc.so seems to work correctly. the
problem is that pcc generates GOT lookups for external-linkage symbols
even if they are hidden, rather than using GOT-relative addressing.
the entire reason we're using hidden visibility on the __libc object
is to make it accessible prior to relocations -- not to mention
inexpensive to access. unfortunately, the workaround makes it even
more expensive on pcc.

when the pcc issue is fixed, an appropriate version test should be
added so new pcc can use the much more efficient variant.
2012-10-13 23:53:18 -04:00
Rich Felker
185a977074 ensure pointer decay in inline-asm arg for i386 syscall6
this is actually a rather subtle issue: do arrays decay to pointers
when used as inline asm args? gcc says yes, but currently pcc says no.
hopefully this discrepency in pcc will be fixed, but since the
behavior is not clearly defined anywhere I can find, I'm using an
explicit operation to cause the decay to occur.
2012-10-13 23:46:51 -04:00
Rich Felker
e23d358fd6 fix namespace clash (libc) in dynlink.c
this makes it so the #undef libc and __libc name are no longer needed,
which were problematic because the "accessor function" mode for
accessing the libc struct could not be used, breaking build on any
compiler without (working) visibility.
2012-10-13 23:25:20 -04:00
Rich Felker
31f340a17a remove dead code from dynamic linker 2012-10-13 23:23:29 -04:00
Rich Felker
216dca82f6 comment possibly-confusing i386 vsyscall asm 2012-10-11 23:09:17 -04:00
Rich Felker
964e9f3c4c avoid the thread-ptr-init behavior of sigaction when not installing handler
this is necessary because posix_spawn calls sigaction after vfork, and
if the thread pointer is not already initialized, initializing it in
the child corrupts the parent process's state.
2012-10-11 22:51:05 -04:00
Rich Felker
12e9b4faf6 i386 vsyscall support (vdso-provided sysenter/syscall instruction based)
this doubles the performance of the fastest syscalls on the atom I
tested it on; improvement is reportedly much more dramatic on
worst-case cpus. cannot be used for cancellable syscalls.
2012-10-11 22:47:07 -04:00
Rich Felker
f2b1f1af83 ensure that buffer for decoding auxv at startup is initially zero 2012-10-08 11:20:31 -04:00
Rich Felker
0a96a37f06 clean up and refactor program initialization
the code in __libc_start_main is now responsible for parsing auxv,
rather than duplicating the parsing all over the place. this should
shave off a few cycles and some code size. __init_libc is left as an
external-linkage function despite the fact that it could be static, to
prevent it from being inlined and permanently wasting stack space when
main is called.

a few other minor changes are included, like eliminating per-thread
ssp canaries (they were likely broken when combined with certain
dlopen usages, and completely unnecessary) and some other unnecessary
checks. since this code gets linked into every program, it should be
as small and simple as possible.
2012-10-07 21:43:46 -04:00
Rich Felker
017bf140ff fix breakage due to initializing thread pointer when loading libs
at initial program load, all libraries must be loaded before the
thread pointer can be setup, since the TP-relative addresses of all
initial TLS objects must be constant.
2012-10-07 20:19:11 -04:00
Rich Felker
00902c7384 make new TLS setup block even implementation-internals signals
this is needed to ensure async-cancel-safety, i.e. to make it safe to
access TLS objects when async cancellation is enabled. otherwise, if
cancellation were acter upon after the atomic fetch/add but before the
thread saved the obtained memory, another access to the same TLS in
the cancellation handler could end up performing the atomic fetch/add
again, consuming more memory than is actually available and
overflowing into other objects on the heap.
2012-10-06 23:57:51 -04:00
Rich Felker
92e1cd9b0b don't crash if TLS library is loaded into process with no thread pointer 2012-10-06 16:56:35 -04:00
Rich Felker
6a2eaa3c5b fix buggy TLS size/alignment computations in static-linked TLS 2012-10-06 16:51:03 -04:00
Rich Felker
bd17431a2c fix symbol acceptance/rejection rules for TLS
symbol value of 0 is not "undefined" for TLS; it's the address of the
first symbol in the TLS segment. however, non-definition TLS
references also have values of 0, so check the section.

hopefully the new logic is more clear, too.
2012-10-06 01:36:11 -04:00
Rich Felker
cf3fd3d002 TLS fixes, mainly alignment handling
compute offsets from the thread pointer statically when loading the
library, rather than repeating the logic on each thread creation. not
only is the latter less efficient at runtime; it also fails to provide
solid guarantees that the offsets will remain the same when the
initial alignment of memory is different. the new alignment handling
is both more rigorous and simpler.

the old code was also clobbering TLS bss with random image data in
some cases due to using tls_size (size of TLS segment) instead of
tls_len (length of the TLS data image).
2012-10-06 01:22:51 -04:00
Rich Felker
f4f77c068f fix/improve shared library ctor/dtor handling, allow recursive dlopen
some libraries call dlopen from their constructors, resulting in
recursive calls to dlopen. previously, this resulted in deadlock. I'm
now unlocking the dlopen lock before running constructors (this is
especially important since the lock also blocked pthread_create and
was being held while application code runs!) and using a separate
recursive mutex protecting the ctor/dtor state instead.

in order to prevent the same ctor from being called more than once, a
module is considered "constructed" just before the ctor runs.

also, switch from using atexit to register each dtor to using a single
atexit call to register the dynamic linker's dtor processing as just
one handler. this is necessary because atexit performs allocation and
may fail, but the library has already been loaded and cannot be
backed-out at the time dtor registration is performed. this change
also ensures that all dtors run after all atexit functions, rather
than in mixed order.
2012-10-05 13:09:09 -04:00
Rich Felker
5f88c0edd5 small dynamic linker module search fix
libraries loaded more than once by pathname should not get shortnames
that would cause them to later be used to satisfy non-pathname load
requests.
2012-10-05 12:09:54 -04:00
Rich Felker
dcd6037150 support for TLS in dynamic-loaded (dlopen) modules
unlike other implementations, this one reserves memory for new TLS in
all pre-existing threads at dlopen-time, and dlopen will fail with no
resources consumed and no new libraries loaded if memory is not
available. memory is not immediately distributed to running threads;
that would be too complex and too costly. instead, assurances are made
that threads needing the new TLS can obtain it in an async-signal-safe
way from a buffer belonging to the dynamic linker/new module (via
atomic fetch-and-add based allocator).

I've re-appropriated the lock that was previously used for __synccall
(synchronizing set*id() syscalls between threads) as a general
pthread_create lock. it's a "backwards" rwlock where the "read"
operation is safe atomic modification of the live thread count, which
multiple threads can perform at the same time, and the "write"
operation is making sure the count does not increase during an
operation that depends on it remaining bounded (__synccall or dlopen).
in static-linked programs that don't use __synccall, this lock is a
no-op and has no cost.
2012-10-05 11:51:50 -04:00
Rich Felker
642b7593c3 fix race condition in dlopen
orig_tail was being saved before the lock was obtained, allowing
dlopen failure to roll-back other dlopens that had succeeded.
2012-10-05 01:15:25 -04:00
Rich Felker
99a2af6f45 fix incorrect TLS reloc macro names in x86_64 reloc.h 2012-10-05 01:00:40 -04:00