NetBSD

Commit Graph

Author	SHA1	Message	Date
ad	9202b10ca9	Cosmetic change.	2007-11-13 01:21:32 +00:00
ad	f63239c2a0	Use _lwp_setname() to pass thread names to the kernel.	2007-11-07 00:55:22 +00:00
ad	84a6749ef2	Note that libpthread_dbg needs to be checked after making changes to libpthread.	2007-10-16 15:21:54 +00:00
ad	f1b2c1c4c9	... but preserve the linked list, for the debugger only.	2007-10-16 15:07:02 +00:00
ad	9583eeb248	Replace the global thread list with a red-black tree. From joerg@.	2007-10-16 13:41:18 +00:00
jnemeth	66687b0cb5	SSP doesn't like alloca...	2007-10-13 20:36:43 +00:00
rmind	25e540085b	Add cancellation stubs in libpthread for POSIX messages queues and asynchronous I/O. OK by <ad>.	2007-10-09 18:18:33 +00:00
skrll	c6deb42c81	Provide PTHREAD__ASM_RASOPS for alpha. The gcc generated lock try RAS is broken as the store needs to be the last instruction.	2007-10-08 16:04:43 +00:00
ad	507f1ca139	Compile pthread_getspecific / pthread_setspecific with -fomit-frame-pointer -falign-functions=32, since these two really get hammered on. To make them faster needs a threadreg or TLS, unless there is a way to tell gcc that a library-local (pthread__threadmask) variable does not need to be PIC.	2007-10-04 21:08:35 +00:00
ad	5059087834	Drop PTHREAD__NSPINS back from 1000 to 64. Setting the waiters bits and preparing to sleep on a mutex are slow in relative terms, so this allows us to recover from short lock holds without blocking, while not wasting too much time on longer holds.	2007-10-04 21:04:32 +00:00
ad	a4b475cd22	Fix a thinko.	2007-10-04 01:46:49 +00:00
ad	af8ed8ad89	Kill PTHREAD_SPIN_DEBUG - it's not must use with 1:1.	2007-09-24 13:56:42 +00:00
skrll	d32ed98975	Resurrect the function pointers for lock operations and allow each architecture to provide asm versions of the RAS operations. We do this because relying on the compiler to get the RAS right is not sensible. (It gets alpha wrong and hppa is suboptimal) Provide asm RAS ops for hppa. (A slightly different version) reviewed by Andrew Doran.	2007-09-24 12:19:39 +00:00
ad	598943b712	Adjust previous for clarity.	2007-09-21 21:28:11 +00:00
ad	92f04f1f80	pthread__mutex_unlock_slow: always catch up with deferred wakeups, because pthread_mutex_unlock clears the per-mutex indicator.	2007-09-21 21:09:25 +00:00
ad	cb10afa05c	pthread_rwlock_unlock: return EPERM if the caller tries to release a rwlock that is write held, but not by the caller.	2007-09-21 16:24:45 +00:00
ad	9212747aea	pthread_rwlock_unlock - Allow callers to try and release an unheld rwlock. Just return EPERM as mandated by IEEE Std 1003.1. - Use pthread__atomic_swap_ptr() to set in the new lock value. At this point the lock word can't have changed. pthread__rwlock_wrlock, pthread__rwlock_rdlock: - Mask out the waiter bits in the lock word before checking to see if the current thread is about to lock against itself.	2007-09-21 16:21:54 +00:00
skrll	f9577d3ada	Mostly fix the restartable atomic sequences by reversing the sense of the lock check and test for return value. At least alpha looks broken me and some are sub-optimal. "Looks good to me" from ad.	2007-09-17 13:25:59 +00:00
tnn	28d03ee594	sem_post(): pthread__self() is no longer used here.	2007-09-14 09:15:41 +00:00
ad	20e3392edc	Add a per-mutex deferred wakeup flag so that threads doing something like the following do not wake other threads early: pthread_mutex_lock(&mutex); pthread_cond_broadcast(&cond); foo = malloc(100); /* takes libc mutexes */ pthread_mutex_unlock(&mutex);	2007-09-13 23:51:47 +00:00
ad	b0efccf4cd	Make the new mutexes faster: - Eliminate mutexattr_private and just set a bit in ptm_owner if the mutex is recursive. This forces the slow path to be taken for recursive mutexes. Overload an unused field in pthread_mutex_t to record whether or not it's an errorcheck mutex. - Streamline pthread_mutex_lock / pthread_mutex_unlock a bit more. As a side effect makes it possible to have assembly stubs for them.	2007-09-11 18:11:29 +00:00
ad	4042b7d22a	Put new threads on the tail of pthread__allqueue, for the debugger.	2007-09-11 18:08:10 +00:00
ad	7db40473e2	Fix a dodgy bit of assembly.	2007-09-11 16:07:15 +00:00
ad	ee3459ca7b	Fix broken pthread_mutex_trylock().	2007-09-11 11:30:15 +00:00
ad	baebee83d5	Fix inverted test after merge of nick-csl-alignment.	2007-09-11 10:27:44 +00:00
skrll	9fdaf800d9	Merge nick-csl-alignment.	2007-09-10 11:34:05 +00:00
ad	e2c2fa5d09	Fix recursive mutexes to work again in non-threaded applications.	2007-09-09 20:13:23 +00:00
ad	debc2870e6	Update.	2007-09-08 23:00:31 +00:00
ad	f4fd6b797e	- Get rid of self->pt_mutexhint and use pthread__mutex_owned() instead. - Update some comments and fix minor bugs. Minor cosmetic changes. - Replace some spinlocks with mutexes and rwlocks. - Change the process private semaphores to use mutexes and condition variables instead of doing the synchronization directly. Spinlocks are no longer used by the semaphore code.	2007-09-08 22:49:50 +00:00
ad	8ccc6e060d	- Don't take the mutex's spinlock (ptr_interlock) in pthread_cond_wait(). Instead, make the deferred wakeup list a per-thread array and pass down the lwpid_t's that way. - In pthread_cond_wait(), take the mutex before dealing with early wakeup. In this way there should never be contention on the CV's spinlock if the app follows POSIX rules (there should only be contention on the user-provided mutex). - Add a port of the kernel's rwlocks. The rwlock's spinlock is only taken if there is contention. This is enabled where atomic ops are available. Right now that is only i386 and amd64 because I don't have other hardware to test with. It's trivial to add stubs for other architectures as long as they have compare-and-swap. When we have proper atomic ops the old rwlock code can be removed. - Add a new mutex implementation that's similar to the kernel's mutexes, but uses compare-and-swap to maintain the waiters list, so no spinlocks are involved. Same caveats apply as for the rwlocks.	2007-09-07 14:09:27 +00:00
ad	a6ed47a549	Add: pthread__atomic_cas_ptr, pthread__atomic_swap_ptr, pthread__membar_full This is a stopgap until the thorpej-atomic branch is complete.	2007-09-07 00:24:56 +00:00
ad	c67dc640cd	If a libpthread internal spinlock is held, nanosleep() for a tick instead of yielding. This is a nasty band-aid but with many threads, looping over sched_yield() wastes a huge amount of CPU time. It would be nice to have a way to temporarily disable preemption, but it turns out that's yet another no-brain concept that has been patented and the patent holder seems to be suing people lately. Another alternative is probably to have kernel-assisted spinlocks.	2007-09-07 00:07:54 +00:00
ad	342557fb0f	Update.	2007-08-23 19:21:40 +00:00
ad	0225b043d2	Acquire the correct lock in pthread_detach(). Spotted by Jan Kryl.	2007-08-23 19:13:23 +00:00
ad	40724da2ba	pthread_suspend_np, pthread_resume_np, pthread_detach: return correct code on error.	2007-08-17 14:28:31 +00:00
ad	cc383c5771	Minor correction for debug code and a cosmetic change.	2007-08-16 23:37:08 +00:00
ad	2639b90e3b	One more micro-optimization..	2007-08-16 14:53:45 +00:00
ad	d9adedd764	Trim fat off libpthread internal spinlock operations. Makes a mesurable improvement across the board.	2007-08-16 13:54:16 +00:00
ad	b8833ff53f	- Reinitialize the absolute minimum when recycling user thread state. Chops another ~10% off create/join in a loop on i386. - Disable low level debugging as this is stable. Improves benchmarks across the board by a small percentage. Uncontested mutex acquire and release in a loop becomes about 8% quicker. - Minor cleanup.	2007-08-16 12:01:49 +00:00
ad	9e28719960	Remove PT_FIXEDSTACKSIZE_LG.	2007-08-16 01:09:34 +00:00
ad	ed964af19e	Cache thread context for creation instead of setting it up every time. Speeds create/join loop by about 10-15% on i386.	2007-08-16 00:41:23 +00:00
ad	c94f5a91bc	Simplify pthread_join(). Knocks about 5% off pthread_create/join in a loop.	2007-08-15 22:48:52 +00:00
ad	1d42dbd29a	Missed variable rename..	2007-08-07 19:58:30 +00:00
ad	c3f8e2ee55	Change the signature of _lwp_park() to accept an lwpid_t and second hint pointer, but do so in a way that remains compatible with older pthread libraries. This can be used to wake another thread before the calling thread goes asleep, saving at least one syscall + involuntary context switch. This turns out to be a fairly large win on the condvar benchmarks that I have tried.	2007-08-07 19:04:21 +00:00
ad	7bf06aa722	Make libpthread_dbg build again.	2007-08-04 18:54:12 +00:00
ad	15c28fd2d8	Update.	2007-08-04 13:46:04 +00:00
ad	a014cf23cb	Fix error in previous.	2007-08-04 13:43:46 +00:00
ad	50fa8db4e4	Some significant performance improvements, and a fix for a race with pthread detach/join. - Make mutex acquire spin for a short time, as done with spinlocks. - Make the number of spins controllable with the env var PTHREAD_NSPINS. - Reduce the amount of time that libpthread internal spinlocks are held. - Rely more on the barrier effects of park/unpark to avoid taking spinlocks. - Simplify the locking around pthreads and the global queues. - Align per-thread sync data on a 128 byte boundary. - Offset thread stacks by a small amount to try and reduce cache thrash.	2007-08-04 13:37:48 +00:00
ad	2813873c47	Remove unused counters.	2007-08-01 21:48:19 +00:00
wiz	c54913c02b	Remove scheduler activations references. Bump date.	2007-05-19 14:24:42 +00:00

1 2 3 4 5 ...

512 Commits