NetBSD

Commit Graph

Author	SHA1	Message	Date
jdolecek	a1090edbd2	fix assertion - non-alive processes are in SZOMB state now fixes PR kern/24033 by Martin Husemann	2004-01-11 18:51:15 +00:00
hannken	ed68c4e34c	Allow vfs_write_suspend() to wait if the file system is already suspending. Move vfs_write_suspend() and vfs_write_resume() from kern/vfs_vnops.c to kern/vfs_subr.c. Change vnode write gating in ufs/ffs/ffs_softdep.c (from FreeBSD). When vnodes are throttled in softdep_trackbufs() check for file system suspension every 10 msecs to avoid a deadlock.	2004-01-10 17:16:38 +00:00
yamt	a3b2d1879c	add a new bufq strategy, BUFQ_PRIOCSCAN (per-priority CSCAN). discussed on tech-kern@	2004-01-10 14:49:44 +00:00
yamt	8c55727694	reset i/o priority in geteblk() as well.	2004-01-10 14:43:05 +00:00
yamt	7266a95907	store a i/o priority hint in struct buf for buffer queue discipline.	2004-01-10 14:39:50 +00:00
thorpej	4aeba6790d	Initialize buffer pools with PR_IMMEDRELEASE. Don't use pool_reclaim() on those pools; it is no longer necessary.	2004-01-09 19:01:01 +00:00
thorpej	7f125220f4	Add a new pool initialization flag, PR_IMMEDRELEASE. This flag causes idle pool pages to be returned to the system immediately upon becoming de-fragmented. Also, in pool_do_put(), don't free back an idle page unless we are over our minimum page claim.	2004-01-09 19:00:16 +00:00
tls	e4758a97ae	Change BUFCACHE (default hard limit on physmem consumption by metadata cache) from 30% to 20%. This seems to significantly smooth the oscillation between "almost no memory available" and "UVM free target available" caused by the current sudden, heavy backpressure on the metadata cache. We should revisit this again once the backpressure mechanism is better tuned; ideally, the hard limit should almost never come into play, because the metadata cache should gradually give back pages as buffers hit the AGE list and as the page cache demands them, rather than giving back a big slug of pages all at once when UVM decides it's in a hurry and fires off the page daemon. Just how well this adjustment works is likely to vary significantly from machine to machine depending on I/O mix, filesystem frag size, and total memory. However, 20% seems to be quite a bit better than 30% on several systems I've tested and is, coincidentally, more than enough to cache the entire metadata working set of the AnonCVS server with 100 clients, which is a useful worst-case stake in the ground...	2004-01-09 06:26:15 +00:00
tls	0d6723b09f	Bump default size of vnode cache to 1% of physical memory, instead of 0.5%, based on some quick measurements on a number of workstations and small fileservers (including my home fileserver running simultaneous builds of the NetBSD source tree and several NetBSD kernels). This brings the hit rate on my machines from below 70% to above 90%. We should be able to tune this as we run, by tracking the hit rate and increasing the size of the cache if memory permits. Some systems will still require significantly larger cache sizes. Some ports -- notably the 64-bit ones -- probably should use more than 1% of physmem as the default due to the larger size of struct vnode.	2004-01-09 00:04:53 +00:00
tls	28364b01be	Add pool_reclaim() on pool to which we just pool_put() a buffer in buf_mrelease(). Without this, though the pages are returned to the relevant pool, they are never available for any other use in the system. Now the backpressure on the physical size of the buffer cache through the buf_drain() call in the pagedaemon works correctly. If anything, it may be a bit more aggressive than intended. On my 256MB system, with vm.bufcache set to the default 30% of physmem, a kernel with this fix can do 5 simultaneous config/makedep/builds of different NetBSD kernels in 1313 seconds; with the "traditional" buffer cache code it requires 1320 seconds. Running "find / -type d -exec ls -l {}" while the build is going demonstrates that the backpressure is working correctly: free memory oscillates slowly between close to none and the UVM target free, and vmstat -m shows a large number of releases for the buffer pools. For future work: how is "bufpl" memory returned to the system? This is not obvious to me (I must be looking in the wrong place). Also, buf_mrelease() is also called from brelse() in some cases. Would it be better to add a pool flag causing automatic release of full pages as they become available (not fragmented)? Jason Thorpe proposed this and it seems more elegant than cleaning the _entire_ pool only upon memory pressure. Greg Oster did a lot of the work of figuring this out. Jason proposed the use of pool_reclaim as a way to fix it.	2004-01-08 23:41:14 +00:00
cube	3bf5e4c13b	If ksyms have not been initialized, return ENXIO in ksymsopen instead of ksymsread, because ksyms client test availability with open() and not read().	2004-01-08 22:48:26 +00:00
thorpej	d76fa360ef	Back out >2 PT_LOAD changes from rev 1.96. They cause older GCC3-compiled PowerPC binaries to fail. The compiler has since been fixed, but compatibility with older binaries needs to be maintained. PR kern/23758.	2004-01-07 16:42:53 +00:00
jdolecek	26767eb2ae	fix F_MAXFD fcntl - it returned the value as errno instead of return value from the syscall from mouss <usebsd at free dot fr>	2004-01-07 09:26:29 +00:00
atatat	5efc584023	Expose the buf_map symbol so that pmap(1) can find it. Split the sysctl setup routine into two routines, one for each "subtree". Perhaps it's a little pedantic, but it's cleaner. Also, assert that the "kern" and "vm" nodes exist.	2004-01-06 13:51:09 +00:00
lukem	7bb9d6c875	Store the copyright text in conf/copyright, and use conf/newvers.sh to generate the appropriate const char copyright[] = "..."; statement instead of hard coding it into kern/init_main.c. Idea from Simon Burge.	2004-01-05 03:33:06 +00:00
christos	b76a454b90	Ad F_CLOSEM, F_MAXFD from Matt Thomas.	2004-01-05 00:36:49 +00:00
pk	90cc172b86	bufpool_page_free: pass `buf_map' to uvm_km_free().	2004-01-04 16:17:13 +00:00
kleink	1b16e3f0a3	; may be a comment character in assembly, use \n as a separator instead.	2004-01-04 13:27:53 +00:00
jdolecek	089abdad44	Rearrange process exit path to avoid need to free resources from different process context ('reaper'). From within the exiting process context: * deactivate pmap and free vmspace while we can still block * introduce MD cpu_lwp_free() - this cleans all MD-specific context (such as FPU state), and is the last potentially blocking operation; all of cpu_wait(), and most of cpu_exit(), is now folded into cpu_lwp_free() * process is now immediatelly marked as zombie and made available for pickup by parent; the remaining last lwp continues the exit as fully detached * MI (rather than MD) code bumps uvmexp.swtch, cpu_exit() is now same for both 'process' and 'lwp' exit uvm_lwp_exit() is modified to never block; the u-area memory is now always just linked to the list of available u-areas. Introduce (blocking) uvm_uarea_drain(), which is called to release the excessive u-area memory; this is called by parent within wait4(), or by pagedaemon on memory shortage. uvm_uarea_free() is now private function within uvm_glue.c. MD process/lwp exit code now always calls lwp_exit2() immediatelly after switching away from the exiting lwp. g/c now unneeded routines and variables, including the reaper kernel thread	2004-01-04 11:33:29 +00:00
jdolecek	6ea538748e	constify a bit	2004-01-03 20:10:01 +00:00
jdolecek	1d86b1f39f	fix some comments, use NULL instead of 0 for pointer comparison	2004-01-03 19:43:55 +00:00
cl	d5645dec8e	regen	2004-01-02 18:53:45 +00:00
cl	e7045955c7	kernel part of no-syscall upcall stack return: libpthread registers an offset between ss_sp and struct sa_stackinfo_t (located in struct __pthread_st) when calling sa_register. The kernel increments the sast_gen counter in struct sastack when an upcall stack is used. libpthread increments the sasi_stackgen counter in struct sa_stackinfo_t when an upcall stack is freed. The kernel compares the two counters to decide if a stack is free or in use. - add struct sa_stackinfo_t with sasi_stackgen to count stack use in userland - add sast_gen to struct sastack to count stack use in kernel - add SA_FLAG_STACKINFO to enable the stackinfo_offset argument in the sa_register syscall - add sa_stackinfo_offset to struct sadata for offset between ss_sp and struct sa_stackinfo_t - add ssize_t stackinfo_offset argument to sa_register, initialize struct sadata's sa_stackinfo_offset from it if SA_FLAG_STACKINFO is set - add sa_getstack, sa_getstack0, sa_stackused and sa_setstackfree functions to find/use/free upcall stacks and use these where appropriate - don't record stack for upcall in sa_upcall0 - pass sau to sa_switchcall instead of l2 (l2 = curlwp in sa_switchcall) - add sa_vp_blocker to struct sadata to pass recently blocked lwp to sa_switchcall - delay finding a stack for blocked upcalls to sa_switchcall - add sa_stacknext to struct sadata pointing to next most likely free upcall stack; also g/c sa_stackslist in struct sadata and sast_list in struct sastack - add L_SA_WOKEN flag: LWP is on sa_woken queue - add L_SA_RECYCLE flag: LWP should be recycled in sa_setwoken - replace l_upcallstack with L_SA_WOKEN/L_SA_RECYCLE/L_SA_BLOCKING flags - g/c now unused sast_blocker in struct sastack - make sa_switchcall, sa_upcall0 and sa_upcall_getstate static in kern_sa.c - call sa_upcall_userret only once in userret - split sa_makeupcalls out of sa_upcall_userret and use to process the sa_upcalls queue - on process exit: mark LWPs sleeping in saunblock interruptible; also there are no LWPs sleeping on l->l_upcallstack anymore; also clear sa_wokenq_head to prevent unblocked upcalls additional changes: - cleanup timerupcall sa_vp == curlwp check - add check in sa_yield if we didn't block on our way here and we wouldn't any longer be the LWP on the VP - invalidate sa_vp_ofaultaddr after resolving pagefault	2004-01-02 18:52:17 +00:00
mycroft	a9866938b5	Welcome to 2004!	2004-01-01 00:00:05 +00:00
pk	dc6d5d0dd1	getnewbuf: return buffer locked.	2003-12-31 14:37:17 +00:00
thorpej	7e958083b1	Consistently use ANSI-style function decls.	2003-12-30 20:40:39 +00:00
thorpej	6a833751e0	Remove allocsys(); nothing uses it anymore.	2003-12-30 18:29:43 +00:00
pk	70f20a1217	Replace the traditional buffer memory management -- based on fixed per buffer virtual memory reservation and a private pool of memory pages -- by a scheme based on memory pools. This allows better utilization of memory because buffers can now be allocated with a granularity finer than the system's native page size (useful for filesystems with e.g. 1k or 2k fragment sizes). It also avoids fragmentation of virtual to physical memory mappings (due to the former fixed virtual address reservation) resulting in better utilization of MMU resources on some platforms. Finally, the scheme is more flexible by allowing run-time decisions on the amount of memory to be used for buffers. On the other hand, the effectiveness of the LRU queue for buffer recycling may be somewhat reduced compared to the traditional method since, due to the nature of the pool based memory allocation, the actual least recently used buffer may release its memory to a pool different from the one needed by a newly allocated buffer. However, this effect will kick in only if the system is under memory pressure.	2003-12-30 12:33:13 +00:00
martin	44b17951f2	Avoid using m_clget() on a mbuf already in use, especially when we need the data in the mbuf later and m_clget() changes some fields overlaid to regular mbuf data. Instead, rearange code a bit, create data into a new allocated buffer and and use MEXTADD to attach it to the mbuf, if the mbuf internal space is not sufficient. This fixes a crash on sparc64 (and probably all other archs where sizeof(int) != sizeof(struct file *)) when running regress/sys/kern/unfdpass. Idea for solution from Matt Thomas, with additional input from YAMAMOTO Takashi.	2003-12-29 22:08:02 +00:00
yamt	192843ffc2	pool_prime_page: initialize ph_time to mono_time instead of zero as it's a mono_time relative value.	2003-12-29 16:04:58 +00:00
atatat	1cab3635c2	Avoid dereferencing l in sysctl_lookup(), because it can be NULL. Note one point where a possibility of a fault exists.	2003-12-29 04:19:28 +00:00
atatat	74dad84b6e	Remove two uses of uvm_kernacc(), which wasn't quite getting the job done anyway. On a related change, use kcopy() instead of memcpy() for kernel-to-kernel copying so that the same service warranty can be given.	2003-12-29 04:16:25 +00:00
atatat	b1c111a62a	Sysctl functions called for "generic" nodes should forward "query" requests (where possible), rather than returning errors.	2003-12-28 22:36:37 +00:00
atatat	0f7550bbf8	Adjust error returns in kern.cp_time when a specific processor is being requested so that (1) the uniprocessor case and the multiprocessor case are more similar and (2) so that we return ENOENT when a non-existent processor is requested (which is both more sensible and follows the general order of things anyway).	2003-12-28 22:24:12 +00:00
atatat	c703d9821f	Rename sysctl_kern_hostname() to sysctl_setlen() and use it also for domainname. Note that there's no need to copy rnode since we're not changing any of it, nor protecting anything from change. Thanks to martin for initial work.	2003-12-28 22:19:59 +00:00
atatat	8e0c1f1594	RCSid police	2003-12-28 22:12:00 +00:00
martin	c22fd25c47	After changing hostname, adjust hostnamelen. This closes PR kern/23907.	2003-12-28 14:39:36 +00:00
martin	be59b63fe2	Make kern.rtc_offset writable at securelevel <= 0. This allows boot-time adjustment when a machine runs other OSes with RTC == localtime.	2003-12-26 23:49:39 +00:00
manu	ffb3de5522	Move the sigfilter hook to a more adequate location, and rename it to better fit what it does. The softsignal feature is used in Darwin to trace processes. When the traced process gets a signal, this raises an exception. The debugger will receive the exception message, use ptrace with PT_THUPDATE to pass the signal to the child or discard it, and then it will send a reply to the exception message, to resume the child. With the hook at the beginnng of kpsignal2, we are in the context of the signal sender, which can be the kill(1) command, for instance. We cannot afford to sleep until the debugger tells us if the signal should be delivered or not. Therefore, the hook to generate the Mach exception must be in the traced process context. That was we can sleep awaiting for the debugger opinion about the signal, this is not a problem. The hook is hence located into issignal, at the place where normally SIGCHILD is sent to the debugger, whereas the traced process is stopped. If the hook returns 0, we bypass thoses operations, the Mach exception mecanism will take care of notifying the debugger (through a Mach exception), and stop the faulting thread.	2003-12-24 22:53:59 +00:00
manu	54db0e51ad	Split sys_lwp_suspend, just like sys_lwp_unsuspend is split. We get sys_lwp_suspend, with the sanity checks, and lwp_suspend, with the actual implementation.	2003-12-24 22:42:11 +00:00
simonb	16846040d7	Remove trailing blank line.	2003-12-21 11:54:16 +00:00
fvdl	d99705e941	Put back Emmanuel's sigfilter hooks, as decided by Core.	2003-12-20 19:01:29 +00:00
manu	b23b73b953	Introduce lwp_emuldata and the associated hooks. No hook is provided for the exec case, as the emulation already has the ability to intercept that with the e_proc_exec hook. It is the responsability of the emulation to take appropriaye action about lwp_emuldata in e_proc_exec. Patch reviewed by Christos.	2003-12-20 18:22:16 +00:00
yamt	8b9614a490	update a comment to match with the previous change (rev.1.12).	2003-12-20 07:33:03 +00:00
yamt	4dd4230680	restore functionality to decrease kern.maxvnodes which has been backed out during sysctl rework.	2003-12-20 07:26:27 +00:00
dsl	97356a5fa8	Defer writing of KTR_EMUL entry until first trace done by target process. Stops ktrops sleeping with the pid table locked.	2003-12-14 22:56:45 +00:00
simonb	701a167dd3	In sysctl_kern_lwp adjust offsets into the mib entries so that they are now correct. Fixes problems with "ps -s" not working. Also use KERN_LWPSLOP instead of KERN_PROCSLOP. Both changes from Andrew Brown.	2003-12-12 23:21:44 +00:00
atatat	e3796202c5	Make kern.dump_on_panic writeable again, too	2003-12-10 14:16:12 +00:00
agc	7db1d33cba	Modify the licences of code written by Theo De Raadt from a 4-clause to a 2-clause licence (retaining UCB clauses (1) and (2)), per PR 22409 from Joel Baker, approved by Theo de Raadt, and ratified by myself - the only discrepancy being the handling of the original clause 3 in src/usr.sbin/yppoll/yppoll.c.	2003-12-10 12:06:25 +00:00
hannken	fbae381aaa	The file system snapshot pseudo driver. Uses a hook in spec_strategy() to save data written from a mounted file system to its block device and a hook in dounmount(). Not enabled by default in any kernel config. Approved by: Frank van der Linden <fvdl@netbsd.org>	2003-12-10 11:40:11 +00:00

1 2 3 4 5 ...

3709 Commits