NetBSD

Commit Graph

Author	SHA1	Message	Date
ad	4a780c9ae2	Merge vmlocking2 to head.	2008-01-02 11:48:20 +00:00
yamt	e8abff70f2	constify pagerops.	2007-12-01 10:40:27 +00:00
yamt	062f8e82a2	use designated initializers for uvm_pagerops.	2007-12-01 10:18:21 +00:00
ad	4c92a21547	Remove LOCK_ASSERT(!simple_lock_held(&foo));	2007-10-11 19:53:37 +00:00
ad	7dad9f7391	Merge from vmlocking: - Split vnode::v_flag into three fields, depending on field locking. - simple_lock -> kmutex in a few places. - Fix some simple locking problems.	2007-10-10 20:42:20 +00:00
pooka	38ed0b2878	Use VSIZENOTSET only in KASSERTs	2007-08-04 09:42:58 +00:00
pooka	05ce20f4a0	Retire uvn_attach() - it abuses VXLOCK and its functionality, setting vnode sizes, is handled elsewhere: file system vnode creation or spec_open() for regular files or block special files, respectively. Add a call to VOP_MMAP() to the pagedvn exec path, since the vnode is being memory mapped. reviewed by tech-kern & wrstuden	2007-07-22 19:16:04 +00:00
ad	88ab7da936	Merge some of the less invasive changes from the vmlocking branch: - kthread, callout, devsw API changes - select()/poll() improvements - miscellaneous MT safety improvements	2007-07-09 20:51:58 +00:00
yamt	da51d139a4	improve post-ubc file overwrite performance in common cases. ie. when it's safe, actually overwrite blocks rather than doing read-modify-write. also fixes PR/33152 and PR/36303.	2007-06-05 12:31:30 +00:00
christos	53524e44ef	Kill caddr_t; there will be some MI fallout, but it will be fixed shortly.	2007-03-04 05:59:00 +00:00
thorpej	b3667ada6d	TRUE -> true, FALSE -> false	2007-02-22 06:05:00 +00:00
thorpej	712239e366	Replace the Mach-derived boolean_t type with the C99 bool type. A future commit will replace use of TRUE and FALSE with true and false.	2007-02-21 22:59:35 +00:00
chs	c398ae9734	a smorgasbord of improvements to vnode locking and path lookup: - LOCKPARENT is no longer relevant for lookup(), relookup() or VOP_LOOKUP(). these now always return the parent vnode locked. namei() works as before. lookup() and various other paths no longer acquire vnode locks in the wrong order via vrele(). fixes PR 32535. as a nice side effect, path lookup is also up to 25% faster. - the above allows us to get rid of PDIRUNLOCK. - also get rid of WANTPARENT (just use LOCKPARENT and unlock it). - remove an assumption in layer_node_find() that all file systems implement a recursive VOP_LOCK() (unionfs doesn't). - require that all file systems supply vfs_vptofh and vfs_fhtovp routines. fill in eopnotsupp() for file systems that don't support being exported and remove the checks for NULL. (layerfs calls these without checking.) - in union_lookup1(), don't change refcounts in the ISDOTDOT case, just adjust which vnode is locked. fixes PR 33374. - apply fixes for ufs_rename() from ufs_vnops.c rev. 1.61 to ext2fs_rename().	2006-12-09 16:11:50 +00:00
yamt	1a7bc55dcc	remove some __unused from function parameters.	2006-11-01 10:17:58 +00:00
yamt	61934cf2b4	uvm_vnp_setsize: put back v_size assignment after uvn_put. PR/34147 from Juergen Hannken-Illjes.	2006-10-14 09:20:35 +00:00
yamt	9de1cdc8fe	move some knowledge about vnode into uvm_vnode.c.	2006-10-12 10:14:20 +00:00
christos	4d595fd7b1	- sprinkle __unused on function decls. - fix a couple of unused bugs - no more -Wno-unused for i386	2006-10-12 01:30:41 +00:00
yamt	9d3e3eab23	merge yamt-pdpolicy branch. - separate page replacement policy from the rest of kernel - implement an alternative replacement policy	2006-09-15 15:51:12 +00:00
yamt	f9458a6ba1	- in genfs_getpages, take g_glock earlier so that it can't be intervened by truncation. it also fixes a deadlock. (g_glock vs pages locking order) - uvm_vnp_setsize: modify v_size while holding v_interlock. reviewed by Chuck Silvers.	2006-07-22 08:47:56 +00:00
ad	3029ac48c7	- Use the LWP cached credentials where sane. - Minor cosmetic changes.	2006-07-21 16:48:45 +00:00
elad	fc9422c9d9	integrate kauth.	2006-05-14 21:31:52 +00:00
christos	95e1ffb156	merge ktrace-lwp.	2005-12-11 12:16:03 +00:00
yamt	221616873d	merge yamt-readahead branch.	2005-11-29 22:52:02 +00:00
yamt	52f0a62851	read-ahead statistics.	2005-11-29 15:45:28 +00:00
thorpej	b651fb886d	Sprinkle some static.	2005-06-27 02:29:32 +00:00
thorpej	e569facced	Use ANSI function decls.	2005-06-27 02:19:48 +00:00
chs	8975a0856f	adjust the UBC mapping code to support non-vnode uvm_objects. this means we can no longer look at the vnode size to determine how many pages to request in a fault, which is good since for NFS the size can change out from under us on the server anyway. there's also a new flag UBC_UNMAP for ubc_release(), so that the file system code can make the decision about whether to cache mappings for files being used as executables.	2005-01-09 16:42:43 +00:00
junyoung	325f5482a8	Nuke __P().	2004-03-24 07:55:01 +00:00
fvdl	d5aece61d6	Back out the lwp/ktrace changes. They contained a lot of collateral damage, and need to be examined and discussed more.	2003-06-29 22:28:00 +00:00
darrenr	960df3c8d1	Pass lwp pointers throughout the kernel, as required, so that the lwpid can be inserted into ktrace records. The general change has been to replace "struct proc *" with "struct lwp *" in various function prototypes, pass the lwp through and use l_proc to get the process pointer when needed. Bump the kernel rev up to 1.6V	2003-06-28 14:20:43 +00:00
yamt	d99d457173	correct accounting of {exec,file}pages. they are not updated correctly when breaking loan.	2003-04-22 14:28:15 +00:00
gehenna	77a6b82b27	Merge the gehenna-devsw branch into the trunk. This merge changes the device switch tables from static array to dynamically generated by config(8). - All device switches are defined as a constant structure in device drivers. - The new grammar ``device-major'' is introduced to ``files''. device-major <prefix> char <num> [block <num>] [<rules>] - All device major numbers must be listed up in port dependent majors.<arch> by using this grammar. - Added the new naming convention. The name of the device switch must be <prefix>_[bc]devsw for auto-generation of device switch tables. - The backward compatibility of loading block/character device switch by LKM framework is broken. This is necessary to convert from block/character device major to device name in runtime and vice versa. - The restriction to assign device major by LKM is completely removed. We don't need to reserve LKM entries for dynamic loading of device switch. - In compile time, device major numbers list is packed into the kernel and the LKM framework will refer it to assign device major number dynamically.	2002-09-06 13:18:43 +00:00
enami	2afb4efc4c	Make uvn_findpages to return number of pages found so that caller can easily check if all requested pages are found or not.	2002-05-17 22:00:50 +00:00
chs	4d069e8517	in uvm_vnp_setsize(), wait for any i/o in progress on pages that we free.	2001-12-31 07:00:15 +00:00
chs	8e9cdbbd63	replace "vnode" and "vtext" with "file" and "exec" in uvmexp field names.	2001-12-09 03:07:43 +00:00
lukem	b616d1ca1d	add RCSIDs, and in some cases, slightly cleanup #include order	2001-11-10 07:36:59 +00:00
chs	365f4c4313	change the names of the arguments to uvn_put() to match their usage.	2001-09-26 07:23:51 +00:00
sommerfeld	cc8633edd3	VOP_PUTPAGES must release the uobj's lock for us, so ensure it's locked beforehand and unlocked afterwards using LOCK_ASSERT().	2001-09-22 22:33:16 +00:00
chs	64c6d1d2dc	a whole bunch of changes to improve performance and robustness under load: - remove special treatment of pager_map mappings in pmaps. this is required now, since I've removed the globals that expose the address range. pager_map now uses pmap_kenter_pa() instead of pmap_enter(), so there's no longer any need to special-case it. - eliminate struct uvm_vnode by moving its fields into struct vnode. - rewrite the pageout path. the pager is now responsible for handling the high-level requests instead of only getting control after a bunch of work has already been done on its behalf. this will allow us to UBCify LFS, which needs tighter control over its pages than other filesystems do. writing a page to disk no longer requires making it read-only, which allows us to write wired pages without causing all kinds of havoc. - use a new PG_PAGEOUT flag to indicate that a page should be freed on behalf of the pagedaemon when it's unlocked. this flag is very similar to PG_RELEASED, but unlike PG_RELEASED, PG_PAGEOUT can be cleared if the pageout fails due to eg. an indirect-block buffer being locked. this allows us to remove the "version" field from struct vm_page, and together with shrinking "loan_count" from 32 bits to 16, struct vm_page is now 4 bytes smaller. - no longer use PG_RELEASED for swap-backed pages. if the page is busy because it's being paged out, we can't release the swap slot to be reallocated until that write is complete, but unlike with vnodes we don't keep a count of in-progress writes so there's no good way to know when the write is done. instead, when we need to free a busy swap-backed page, just sleep until we can get it busy ourselves. - implement a fast-path for extending writes which allows us to avoid zeroing new pages. this substantially reduces cpu usage. - encapsulate the data used by the genfs code in a struct genfs_node, which must be the first element of the filesystem-specific vnode data for filesystems which use genfs_{get,put}pages(). - eliminate many of the UVM pagerops, since they aren't needed anymore now that the pager "put" operation is a higher-level operation. - enhance the genfs code to allow NFS to use the genfs_{get,put}pages instead of a modified copy. - clean up struct vnode by removing all the fields that used to be used by the vfs_cluster.c code (which we don't use anymore with UBC). - remove kmem_object and mb_object since they were useless. instead of allocating pages to these objects, we now just allocate pages with no object. such pages are mapped in the kernel until they are freed, so we can use the mapping to find the page to free it. this allows us to remove splvm() protection in several places. The sum of all these changes improves write throughput on my decstation 5000/200 to within 1% of the rate of NetBSD 1.5 and reduces the elapsed time for "make release" of a NetBSD 1.5 source tree on my 128MB pc to 10% less than a 1.5 kernel took.	2001-09-15 20:36:31 +00:00
chs	2c441082d4	allow mappings of VBLK vnodes.	2001-08-17 05:53:02 +00:00
chs	11a9651c8f	replace vm_page_t with struct vm_page *.	2001-05-26 21:27:10 +00:00
chs	3845302904	remove trailing whitespace.	2001-05-25 04:06:11 +00:00
chs	dd82ad8e2c	eliminate the VM_PAGER_* error codes in favor of the traditional E* codes. the mapping is: VM_PAGER_OK 0 VM_PAGER_BAD <unused> VM_PAGER_FAIL <unused> VM_PAGER_PEND 0 (see below) VM_PAGER_ERROR EIO VM_PAGER_AGAIN EAGAIN VM_PAGER_UNLOCK EBUSY VM_PAGER_REFAULT ERESTART for async i/o requests, it used to be possible for the request to be converted to sync, and the pager would return VM_PAGER_OK or VM_PAGER_PEND to indicate whether the caller should perform post-i/o cleanup. this is no longer allowed; pagers must now return 0 to indicate that the async i/o was successfully started, and the caller never needs to worry about doing the post-i/o cleanup.	2001-03-10 22:46:45 +00:00
chs	83d071a318	add UBC memory-usage balancing. we track the number of pages in use for each of the basic types (anonymous data, executable image, cached files) and prevent the pagedaemon from reusing a given page if that would reduce the count of that type of page below a sysctl-settable minimum threshold. the thresholds are controlled via three new sysctl tunables: vm.anonmin, vm.vnodemin, and vm.vtextmin. these tunables are the percentages of pageable memory reserved for each usage, and we do not allow the sum of the minimums to be more than 95% so that there's always some memory that can be reused.	2001-03-09 01:02:10 +00:00
enami	79dbb12278	When shrinking file size, don't dispose of a page still in use.	2001-02-22 01:02:09 +00:00
chs	7b76ca8254	in uvn_flush(), add a fast path for the case where the vnode has no pages. update the comment above this function while I'm here.	2001-02-18 19:40:25 +00:00
chs	4be5f47040	remove a debug printf() that has outlived its usefulness.	2001-02-08 06:43:05 +00:00
chs	43eb344e3f	in uvn_flush(), interpret a "stop" value of 0 as meaning all pages at offsets equal to or higher than "start". use this in uvm_vnp_setsize() instead of the vnode's size since there can be pages past EOF.	2001-02-06 10:53:23 +00:00
thorpej	1779f8f71b	Page scanner improvements, behavior is actually a bit more like Mach VM's now. Specific changes: - Pages now need not have all of their mappings removed before being put on the inactive list. They only need to have the "referenced" attribute cleared. This makes putting pages onto the inactive list much more efficient. In order to eliminate redundant clearings of "referenced", callers of uvm_pagedeactivate() must now do this themselves. - When checking the "modified" attribute for a page (for clearing PG_CLEAN), make sure to only do it if PG_CLEAN is currently set on the page (saves a potentially expensive pmap operation). - When scanning the inactive list, if a page is referenced, reactivate it (this part was actually added in uvm_pdaemon.c,v 1.27). This now works properly now that pages on the inactive list are allowed to have mappings. - When scanning the inactive list and considering a page for freeing, remove all mappings, and then check the "modified" attribute if the page is marked PG_CLEAN. - When scanning the active list, if the page was referenced since its last sweep by the scanner, don't deactivate it. (This part was actually added in uvm_pdaemon.c,v 1.28.) These changes greatly improve interactive performance during moderate to high memory and I/O load.	2001-01-28 23:30:42 +00:00
chs	f0ff6fc897	in uvn_flush(), when PGO_SYNCIO is specified then we should wait for pending i/os to complete before returning even if PGO_CLEANIT is not specified. this fixes two races: (1) NFS write rpcs vs. setattr operations which truncate the file. if the truncate doesn't wait for pending writes to complete then a later write rpc completion can undo the effect of the truncate. this problem has been reported by several people. (2) write i/os in disk-based filesystem vs. the disk block being freed by a truncation, allocated to a new file, and written again with different data. if the disk driver reorders the requests and does the second i/o first, the old data will clobber the new, corrupting the new file. I haven't heard of anyone experiencing this problem yet, but it's fixed now anyway.	2001-01-08 06:21:13 +00:00

90 Commits