NetBSD

Commit Graph

Author	SHA1	Message	Date
chs	33c1fd1917	add support for O_DIRECT (I/O directly to application memory, bypassing any kernel caching for file data).	2006-10-05 14:48:32 +00:00
matt	9e0ec4816e	Make PTRACE and COREDUMP optional. Make the default (status quo) by putting them in conf/std.	2006-08-29 23:34:48 +00:00
yamt	9606b0accf	uvm_swapin: process -> lwp in a comment.	2006-06-13 13:22:06 +00:00
yamt	1075c99d89	introduce macros, UAREA_TO_USER and USER_TO_UAREA, to convert uarea VA into a pointer to struct user and vice versa, so that MD code can change the layout in uarea.	2006-05-22 13:43:54 +00:00
drochner	e10923fd37	-clean up the interface to uvm_fault: the "fault type" didn't serve any purpose (done by a macro, so we don't save any cycles for now) -kill vm_fault_t; it is not needed for real faults, and for simulated faults (wiring) it can be replaced by UVM internal flags -remove <uvm/uvm_fault.h> from uvm_extern.h again	2006-03-15 18:09:25 +00:00
perry	3d4ed1fbc7	__inline__ -> inline	2005-12-24 23:41:33 +00:00
christos	95e1ffb156	merge ktrace-lwp.	2005-12-11 12:16:03 +00:00
chs	da3f825c26	remove the assertion in uvm_swapout_threads() about LSONPROC lwps not running on the same CPU as the swapper. l_stat is protected by sched_lock, which isn't held here, so we can race with that lwp starting to run and see its l_cpu not updated yet, as in PR 31870. we check l_stat again in uvm_swapout() while holding sched_lock, so the race itself is harmless.	2005-10-24 00:26:37 +00:00
thorpej	e569facced	Use ANSI function decls.	2005-06-27 02:19:48 +00:00
matt	e1245a3c46	Rework the coredump code to have no explicit knownledge of how coredump i/o is done. Instead, pass an opaque cookie which is then passed to a new routine, coredump_write, which does the actual i/o. This allows the method of doing i/o to change without affecting any future MD code. Also, make netbsd32_core.c [re]use core_netbsd.c (in a similar manner that core_elf64.c uses core_elf32.c) and eliminate that code duplication. cpu_coredump{,32} is now called twice, first with a NULL iocookie to fill the core structure and a second to actually write md parts of the coredump. All i/o is nolonger random access and is suitable for shipping over a stream.	2005-06-10 05:10:12 +00:00
matt	bb583a82ce	Make sure state.end has a valid initial value.	2005-06-07 22:02:48 +00:00
matt	25a0e29a75	When writing coredumps, don't write zero uninstantiated demand-zero pages. Also, with ELF core dumps, trim trailing zeroes from sections. These two changes can shrink coredumps by over 50% in size.	2005-06-02 17:01:43 +00:00
nathanw	92279c692f	uvm_coredump_walkmap(): Set UVM_COREDUMP_NODUMP on regions whose protection does not include VM_PROT_READ, so that the core dumping doesn't error out with EFAULT when trying to write that region. Addresses PR kern/30143; approach suggested by chs@.	2005-05-06 19:34:47 +00:00
yamt	6b2d8b66a4	merge yamt-km branch. - don't use managed mappings/backing objects for wired memory allocations. save some resources like pv_entry. also fix (most of) PR/27030. - simplify kernel memory management API. - simplify pmap bootstrap of some ports. - some related cleanups.	2005-04-01 11:59:21 +00:00
yamt	a8acd82f94	update a comment; malloc doesn't use uvm_kernacc anymore.	2005-02-08 08:22:37 +00:00
chs	7c203c91d4	reduce the size of user coredump files by not dumping regions of the address space that have never been touched (such as much of the virtual space allocated for pthread stacks).	2005-01-21 03:24:40 +00:00
yamt	5469c2b7c1	add assertions.	2004-05-12 20:09:50 +00:00
pk	9b96c17df2	Make uvm_uarea_free an inline function.	2004-05-02 13:04:57 +00:00
pk	daff668b49	Use maxdmap and maxsmap instead of MAXDSIZ and MAXSSIZ.	2004-04-04 18:21:48 +00:00
junyoung	1e2b269ded	- Nuke __P(). - Drop trailing spaces.	2004-03-24 07:50:48 +00:00
yamt	1e18e59746	- borrow vmspace0 in uvm_proc_exit instead of uvmspace_free. the latter is not a appropriate place to do so and it broke vfork. - deactivate pmap before calling cpu_exit() to keep a balance of pmap_activate/deactivate.	2004-02-09 13:11:21 +00:00
yamt	b81e2fa5c5	uvm_coredump_walkmap: use UVM_OBJ_IS_DEVICE macro.	2004-01-16 12:43:16 +00:00
jdolecek	089abdad44	Rearrange process exit path to avoid need to free resources from different process context ('reaper'). From within the exiting process context: * deactivate pmap and free vmspace while we can still block * introduce MD cpu_lwp_free() - this cleans all MD-specific context (such as FPU state), and is the last potentially blocking operation; all of cpu_wait(), and most of cpu_exit(), is now folded into cpu_lwp_free() * process is now immediatelly marked as zombie and made available for pickup by parent; the remaining last lwp continues the exit as fully detached * MI (rather than MD) code bumps uvmexp.swtch, cpu_exit() is now same for both 'process' and 'lwp' exit uvm_lwp_exit() is modified to never block; the u-area memory is now always just linked to the list of available u-areas. Introduce (blocking) uvm_uarea_drain(), which is called to release the excessive u-area memory; this is called by parent within wait4(), or by pagedaemon on memory shortage. uvm_uarea_free() is now private function within uvm_glue.c. MD process/lwp exit code now always calls lwp_exit2() immediatelly after switching away from the exiting lwp. g/c now unneeded routines and variables, including the reaper kernel thread	2004-01-04 11:33:29 +00:00
pk	70f20a1217	Replace the traditional buffer memory management -- based on fixed per buffer virtual memory reservation and a private pool of memory pages -- by a scheme based on memory pools. This allows better utilization of memory because buffers can now be allocated with a granularity finer than the system's native page size (useful for filesystems with e.g. 1k or 2k fragment sizes). It also avoids fragmentation of virtual to physical memory mappings (due to the former fixed virtual address reservation) resulting in better utilization of MMU resources on some platforms. Finally, the scheme is more flexible by allowing run-time decisions on the amount of memory to be used for buffers. On the other hand, the effectiveness of the LRU queue for buffer recycling may be somewhat reduced compared to the traditional method since, due to the nature of the pool based memory allocation, the actual least recently used buffer may release its memory to a pool different from the one needed by a newly allocated buffer. However, this effect will kick in only if the system is under memory pressure.	2003-12-30 12:33:13 +00:00
chs	e07f0b9362	eliminate uvm_useracc() in favor of checking the return value of copyin() or copyout(). uvm_useracc() tells us whether the mapping permissions allow access to the desired part of an address space, and many callers assume that this is the same as knowing whether an attempt to access that part of the address space will succeed. however, access to user space can fail for reasons other than insufficient permission, most notably that paging in any non-resident data can fail due to i/o errors. most of the callers of uvm_useracc() make the above incorrect assumption. the rest are all misguided optimizations, which optimize for the case where an operation will fail. we'd rather optimize for operations succeeding, in which case we should just attempt the access and handle failures due to insufficient permissions the same way we handle i/o errors. since there appear to be no good uses of uvm_useracc(), we'll just remove it.	2003-11-13 03:09:28 +00:00
yamt	933834a7ae	revert rev.1.70 as it was not needed. uvm_map_lookup_entry() should handle addresses out of the map.	2003-11-03 04:39:11 +00:00
jdolecek	5e94c73334	kill unneded SYSVSHM includes use ANSI C function definition for uvm_lwp_exit()	2003-11-02 16:53:43 +00:00
yamt	922ad03e28	don't try to lookup addresses out of the map in uvm_coredump_walkmap().	2003-11-01 10:43:27 +00:00
cl	e30be76fce	simplify tests: The case where l_stat == LSONPROC and l_cpu == curcpu cannot happen because the pagedaemon is the LWP on curcpu and the pagedaemon is a kernel thread and the code is only used by the pagedaemon. See also updated patch in PR kern/23095, which I ment to checkin originally.	2003-10-24 13:07:33 +00:00
cl	ed9c2d7075	don't uvm_swapout LWPs which are LSONPROC on another cpu. uvm_swapout_threads will swapout LWPs which are running on another CPU: - uvm_swapout_threads considers LWPs running on another CPU for swapout if their l_swtime is high - uvm_swapout_threads considers LWPs on the runqueue for swapout if their l_swtime is high but these LWPs might be running by the time uvm_swapout is called symptoms of failure: panic in setrunqueue fixes PR kern/23095	2003-10-19 17:45:35 +00:00
scw	4355b16f71	In uvm_lwp_fork(), check if PMAP_UAREA() is defined and if so, invoke it with the KVA of the newly-wired uarea. This is useful on some architectures (e.g. xscale) where the uarea mapping can be tweaked to use the mini-data cache instead of the main cache.	2003-10-13 20:43:03 +00:00
fvdl	d5aece61d6	Back out the lwp/ktrace changes. They contained a lot of colateral damage, and need to be examined and discussed more.	2003-06-29 22:28:00 +00:00
darrenr	960df3c8d1	Pass lwp pointers throughtout the kernel, as required, so that the lwpid can be inserted into ktrace records. The general change has been to replace "struct proc " with "struct lwp " in various function prototypes, pass the lwp through and use l_proc to get the process pointer when needed. Bump the kernel rev up to 1.6V	2003-06-28 14:20:43 +00:00
atatat	a57bcda26a	Rework the way in which the map is traversed when dumping core. Now we read-lock the map and call uvm_map_lookup_entry() instead of simply walking from the header to the next and to the next, etc. Dumping from sparsely populated amaps could cause faults that would result in amaps being split, which (in turn) resulted in the core dumping routines dumping some regions of memory twice. This makes the core file too large, the headers not match, gdb not work properly, and so on. Addresses PR 19260.	2003-02-14 16:25:12 +00:00
yamt	41ad61ee76	make KSTACK_CHECK_* compile after sa merge.	2003-01-22 12:52:14 +00:00
thorpej	b78f59b443	Merge the nathanw_sa branch.	2003-01-18 08:51:40 +00:00
chs	4b2625143d	change uvm_uarea_alloc() to indicate whether the returned uarea is already backed by physical pages (ie. because it reused a previously-freed one), so that we can skip a bunch of useless work in that case. this fixes the underlying problem behind PR 18543, and also speeds up fork() quite a bit (eg. 7% on my pc, 1% on my ultra2) when we get a cache hit.	2002-11-17 08:32:43 +00:00
chs	2b73cf7ece	encapsulate knowledge of uarea allocation in some new functions.	2002-09-22 07:20:29 +00:00
yamt	d96bff0e27	add KSTACK_CHECK_MAGIC. discussed on tech-kern.	2002-07-02 20:27:44 +00:00
matt	357945ce6f	When core dumping a process, don't dump maps backed up by the device pager. (move the pagerops externs to uvm_object.h and out the C files).	2002-05-15 06:57:49 +00:00
chs	43973be0c5	introduce a new UVM fault type, VM_FAULT_WIREMAX. this is different from VM_FAULT_WIRE in that when the pages being wired are faulted in, the simulated fault is at the maximum protection allowed for the mapping instead of the current protection. use this in uvm_map_pageable{,_all}() to fix the problem where writing via ptrace() to shared libraries that are also mapped with wired mappings in another process causes a diagnostic panic when the wired mapping is removed. this is a really obscure problem so it deserves some more explanation. ptrace() writing to another process ends up down in uvm_map_extract(), which for MAP_PRIVATE mappings (such as shared libraries) will cause the amap to be copied or created. then the amap is made shared (ie. the AMAP_SHARED flag is set) between the kernel and the ptrace()d process so that the kernel can modify pages in the amap and have the ptrace()d process see the changes. then when the page being modified is actually faulted on, the object pages (from the shared library vnode) is copied to a new anon page and inserted into the shared amap. to make all the processes sharing the amap actually see the new anon page instead of the vnode page that was there before, we need to invalidate all the pmap-level mappings of the vnode page in the pmaps of the processes sharing the amap, but we don't have a good way of doing this. the amap doesn't keep track of the vm_maps which map it. so all we can do at this point is to remove all the mappings of the page with pmap_page_protect(), but this has the unfortunate side-effect of removing wired mappings as well. removing wired mappings with pmap_page_protect() is a legitimate operation, it can happen when a file with a wired mapping is truncated. so the pmap has no way of knowing whether a request to remove a wired mapping is normal or when it's due to this weird situation. so the pmap has to remove the weird mapping. the process being ptrace()d goes away and life continues. then, much later when we go to unwire or remove the wired vm_map mapping, we discover that the pmap mapping has been removed when it should still be there, and we panic. so where did we go wrong? the problem is that we don't have any way to update just the pmap mappings that need to be updated in this scenario. we could invent a mechanism to do this, but that is much more complicated than this change and it doesn't seem like the right way to go in the long run either. the real underlying problem here is that wired pmap mappings just aren't a good concept. one of the original properties of the pmap design was supposed to be that all the information in the pmap could be thrown away at any time and the VM system could regenerate it all through fault processing, but wired pmap mappings don't allow that. a better design for UVM would not require wired pmap mappings, and Chuck C. and I are talking about this, but it won't be done anytime soon, so this change will do for now. this change has the effect of causing MAP_PRIVATE mappings to be copied to anonymous memory when they are mlock()d, so that uvm_fault() doesn't need to copy these pages later when called from ptrace(), thus avoiding the call to pmap_page_protect() and the panic that results from this when the mlock()d region is unlocked or freed. note that this change doesn't help the case where the wired mapping is MAP_SHARED. discussed at great length with Chuck Cranor. fixes PRs 10363, 12554, 12604, 13041, 13487, 14580 and 14853.	2001-12-31 22:34:39 +00:00
thorpej	06920aef28	Move the code that walks the process's VM map during a coredump into uvm_coredump_walkmap(), and use callbacks into the coredump routine to do something with each section.	2001-12-10 01:52:26 +00:00
lukem	b616d1ca1d	add RCSIDs, and in some cases, slightly cleanup #include order	2001-11-10 07:36:59 +00:00
chs	d8cbdbb0da	in uvm_exit(), don't bother to unwire the uarea before we free it, the pages will be freed anyway.	2001-11-06 05:34:42 +00:00
chs	a467bddfdc	bump the rusage counter for "swaps" when we swap out a process. addresses PR 6170.	2001-09-23 07:10:08 +00:00
chs	64c6d1d2dc	a whole bunch of changes to improve performance and robustness under load: - remove special treatment of pager_map mappings in pmaps. this is required now, since I've removed the globals that expose the address range. pager_map now uses pmap_kenter_pa() instead of pmap_enter(), so there's no longer any need to special-case it. - eliminate struct uvm_vnode by moving its fields into struct vnode. - rewrite the pageout path. the pager is now responsible for handling the high-level requests instead of only getting control after a bunch of work has already been done on its behalf. this will allow us to UBCify LFS, which needs tighter control over its pages than other filesystems do. writing a page to disk no longer requires making it read-only, which allows us to write wired pages without causing all kinds of havoc. - use a new PG_PAGEOUT flag to indicate that a page should be freed on behalf of the pagedaemon when it's unlocked. this flag is very similar to PG_RELEASED, but unlike PG_RELEASED, PG_PAGEOUT can be cleared if the pageout fails due to eg. an indirect-block buffer being locked. this allows us to remove the "version" field from struct vm_page, and together with shrinking "loan_count" from 32 bits to 16, struct vm_page is now 4 bytes smaller. - no longer use PG_RELEASED for swap-backed pages. if the page is busy because it's being paged out, we can't release the swap slot to be reallocated until that write is complete, but unlike with vnodes we don't keep a count of in-progress writes so there's no good way to know when the write is done. instead, when we need to free a busy swap-backed page, just sleep until we can get it busy ourselves. - implement a fast-path for extending writes which allows us to avoid zeroing new pages. this substantially reduces cpu usage. - encapsulate the data used by the genfs code in a struct genfs_node, which must be the first element of the filesystem-specific vnode data for filesystems which use genfs_{get,put}pages(). - eliminate many of the UVM pagerops, since they aren't needed anymore now that the pager "put" operation is a higher-level operation. - enhance the genfs code to allow NFS to use the genfs_{get,put}pages instead of a modified copy. - clean up struct vnode by removing all the fields that used to be used by the vfs_cluster.c code (which we don't use anymore with UBC). - remove kmem_object and mb_object since they were useless. instead of allocating pages to these objects, we now just allocate pages with no object. such pages are mapped in the kernel until they are freed, so we can use the mapping to find the page to free it. this allows us to remove splvm() protection in several places. The sum of all these changes improves write throughput on my decstation 5000/200 to within 1% of the rate of NetBSD 1.5 and reduces the elapsed time for "make release" of a NetBSD 1.5 source tree on my 128MB pc to 10% less than a 1.5 kernel took.	2001-09-15 20:36:31 +00:00
chris	0e7661f023	Update pmap_update to now take the updated pmap as an argument. This will allow improvements to the pmaps so that they can more easily defer expensive operations, eg tlb/cache flush, til the last possible moment. Currently this is a no-op on most platforms, so they should see no difference. Reviewed by Jason.	2001-09-10 21:19:08 +00:00
chs	821ec03ed9	replace vm_map{,_entry}_t with struct vm_map{,_entry} *.	2001-06-02 18:09:08 +00:00
lukem	d84d2c6c85	add missing #include "opt_kgdb.h"	2001-05-30 15:24:23 +00:00
chs	3845302904	remove trailing whitespace.	2001-05-25 04:06:11 +00:00

1 2

97 Commits