NetBSD

Author	SHA1	Message	Date
atatat	19af35fd0d	Tango on sysctl_createv() and flags. The flags have all been renamed, and sysctl_createv() now uses more arguments.	2004-03-24 15:34:46 +00:00
junyoung	fdc32973e7	- Nuke __P(). - Drop trailing spaces.	2004-03-23 13:22:32 +00:00
hannken	142e9d5deb	Add a generic copy-on-write hook to add/remove functions that will be called with every buffer written through spec_strategy(). Used by fss(4). Future file-system-internal snapshots will need them too. Welcome to 1.6ZK Approved by: Jason R. Thorpe <thorpej@netbsd.org>	2004-02-14 00:00:56 +00:00
yamt	7f20b0c529	bump vnode hold count for page cache as well to resolve unfairness between page cache and traditional buffer cache. pointed by enami tsugutomo on current-users@.	2004-01-14 11:28:04 +00:00
hannken	ed68c4e34c	Allow vfs_write_suspend() to wait if the file system is already suspending. Move vfs_write_suspend() and vfs_write_resume() from kern/vfs_vnops.c to kern/vfs_subr.c. Change vnode write gating in ufs/ffs/ffs_softdep.c (from FreeBSD). When vnodes are throttled in softdep_trackbufs() check for file system suspension every 10 msecs to avoid a deadlock.	2004-01-10 17:16:38 +00:00
pk	70f20a1217	Replace the traditional buffer memory management -- based on fixed per buffer virtual memory reservation and a private pool of memory pages -- by a scheme based on memory pools. This allows better utilization of memory because buffers can now be allocated with a granularity finer than the system's native page size (useful for filesystems with e.g. 1k or 2k fragment sizes). It also avoids fragmentation of virtual to physical memory mappings (due to the former fixed virtual address reservation) resulting in better utilization of MMU resources on some platforms. Finally, the scheme is more flexible by allowing run-time decisions on the amount of memory to be used for buffers. On the other hand, the effectiveness of the LRU queue for buffer recycling may be somewhat reduced compared to the traditional method since, due to the nature of the pool based memory allocation, the actual least recently used buffer may release its memory to a pool different from the one needed by a newly allocated buffer. However, this effect will kick in only if the system is under memory pressure.	2003-12-30 12:33:13 +00:00
atatat	13f8d2ce5f	Dynamic sysctl. Gone are the old kern_sysctl(), cpu_sysctl(), hw_sysctl(), vfs_sysctl(), etc, routines, along with sysctl_int() et al. Now all nodes are registered with the tree, and nodes can be added (or removed) easily, and I/O to and from the tree is handled generically. Since the nodes are registered with the tree, the mapping from name to number (and back again) can now be discovered, instead of having to be hard coded. Adding new nodes to the tree is likewise much simpler -- the new infrastructure handles almost all the work for simple types, and just about anything else can be done with a small helper function. All existing nodes are where they were before (numerically speaking), so all existing consumers of sysctl information should notice no difference. PS - I'm sorry, but there's a distinct lack of documentation at the moment. I'm working on sysctl(3/8/9) right now, and I promise to watch out for buses.	2003-12-04 19:38:21 +00:00
dbj	37a927564e	In vclean(DOCLOSE), if vinvalbuf fails because of a write error, then redo the vinvalbuf without the V_SAVE which will force unflushed buffers to be discarded.	2003-12-01 18:53:10 +00:00
dbj	6a88e9174b	add "show mount" ddb command	2003-11-18 18:26:18 +00:00
dbj	d3bad238a2	XXX an impossible malloc failure check in set_statfs_info	2003-11-12 20:38:24 +00:00
hannken	a3a898ff0f	Add the gating of system calls that cause modifications to the underlying file system. The function vfs_write_suspend stops all new write operations to a file system, allows any file system modifying system calls already in progress to complete, then sync's the file system to disk and returns. The function vfs_write_resume allows the suspended write operations to complete. From FreeBSD with slight modifications. Approved by: Frank van der Linden <fvdl@netbsd.org>	2003-10-15 11:28:59 +00:00
dbj	fe7c786886	add mnt_iflag field to struct mount for internal flags mv MNT_GONE, MNT_UNMOUNT and MNT_WANTRDWR to this field additonally add mnt_writeopcountupper and mnt_writeopcountlower fields in preparation for pending write suspension support work bump kernel version to 1.6ZD	2003-10-14 14:02:56 +00:00
yamt	0a4f15d329	when allocating a new vnode, increment numvnodes before releasing vnode_free_list_slock.	2003-09-14 11:09:48 +00:00
yamt	ec0f99a001	acquire bqueue_slock around bremfree().	2003-09-11 15:34:26 +00:00
agc	aad01611e7	Move UCB-licensed code from 4-clause to 3-clause licence. Patches provided by Joel Baker in PR 22364, verified by myself.	2003-08-07 16:26:28 +00:00
yamt	cc104d0635	eliminate v_id.	2003-07-30 12:10:57 +00:00
yamt	b0cdf0a26d	maintain the list of namecaches attached to the vnode. it makes vnodes freeable.	2003-07-30 12:09:46 +00:00
fvdl	d5aece61d6	Back out the lwp/ktrace changes. They contained a lot of colateral damage, and need to be examined and discussed more.	2003-06-29 22:28:00 +00:00
thorpej	a06b275edc	Undo part of the ktrace/lwp changes. In particular: * Remove the "lwp " argument that was added to vget(). Turns out that nothing actually used it! Remove the "lwp " arguments that were added to VFS_ROOT(), VFS_VGET(), and VFS_FHTOVP(); all they did was pass it to vget() (which, as noted above, didn't use it). Remove all of the "lwp *" arguments to internal functions that were added just to appease the above.	2003-06-29 18:43:21 +00:00
darrenr	960df3c8d1	Pass lwp pointers throughtout the kernel, as required, so that the lwpid can be inserted into ktrace records. The general change has been to replace "struct proc " with "struct lwp " in various function prototypes, pass the lwp through and use l_proc to get the process pointer when needed. Bump the kernel rev up to 1.6V	2003-06-28 14:20:43 +00:00
dbj	d225e8e1a7	use %PRIx64 instead of %x to print bp->b_lblkno and bp->b_blkno in vfs_buf_print	2003-05-23 01:45:07 +00:00
thorpej	fcecc153fe	Use aprint_normal() for the non-error (and thus non-interative) case of mounting the root file system.	2003-05-17 22:22:41 +00:00
itojun	35e21131ac	use strlcat	2003-05-16 14:40:41 +00:00
christos	15fab6425c	Fix a variety of kernel panics related to unchecked export data: 1. sa_len was not properly checked. 2. sa_family was not properly checked [even used as an array index!] 3. we only know about inet4 and inet6, so make sure that the corresponding data is valid before using it. 4. keep reference counts of addresses used (is that necessary?)	2003-05-16 14:01:56 +00:00
christos	8906603597	fix wrong calculation. pointed by enami.	2003-04-22 13:11:23 +00:00
christos	cd174e0605	make the copy_statfs_args() function copy all the fields that are not set by the filesystems. Before my changes, the statfs code depended on calling it with mp->mnt_stat, and did not explicitly initialize anything!	2003-04-18 22:44:45 +00:00
christos	80ecd573c0	PR/1796: John Kohl: statfs misbehaves under chrooted environments. - Under chroot it displays only the visible filesystems with appropriate paths. - The statfs f_mntonname gets adjusted to contain the real path from root. - While was there, fixed a bug in ext2fs, locking problems with vfs_getfsstat(), and factored out some of the vfsop statfs() code to copy_statfs_info(). This fixes the problem where some filesystems forgot to set fsid. - Made coda look more like a normal fs.	2003-04-16 21:44:18 +00:00
enami	7d8cb58793	Set va_birthtime field in vattr_null().	2003-04-03 09:13:10 +00:00
jdolecek	3f362c0871	it appears one list of vnode type names should be enough	2003-02-25 23:35:03 +00:00
jdolecek	b06fa82f3e	make iftovt_tab[] const	2003-02-25 23:01:39 +00:00
jdolecek	2e574d6a8b	add "VT_SMBFS" to vnode_tags[] constify vnode_tags[] and vnode_types[]	2003-02-18 20:37:38 +00:00
pk	338f31f581	Make the buffer cache code MP-safe.	2003-02-05 21:38:38 +00:00
thorpej	b193480908	Add extensible malloc types, adapted from FreeBSD. This turns malloc types into a structure, a pointer to which is passed around, instead of an int constant. Allow the limit to be adjusted when the malloc type is defined, or with a function call, as suggested by Jonathan Stone.	2003-02-01 06:23:35 +00:00
christos	416978b15b	The hunt to make sync work from ddb again begins here. Protect against lwp null dereference. set proc p properly.	2003-01-20 23:59:14 +00:00
thorpej	e0d8d366df	Merge the nathanw_sa branch.	2003-01-18 10:06:22 +00:00
yamt	78d9abec0c	sync comment for vflush with reality. from FreeBSD.	2002-12-29 06:47:57 +00:00
blymn	29b7b4241f	Added support for fingerprinted executables aka verified exec	2002-10-29 12:31:20 +00:00
jdolecek	e0cc03a09b	merge kqueue branch into -current kqueue provides a stateful and efficient event notification framework currently supported events include socket, file, directory, fifo, pipe, tty and device changes, and monitoring of processes and signals kqueue is supported by all writable filesystems in NetBSD tree (with exception of Coda) and all device drivers supporting poll(2) based on work done by Jonathan Lemon for FreeBSD initial NetBSD port done by Luke Mewburn and Jason Thorpe	2002-10-23 09:10:23 +00:00
gmcgarry	3dae1c4857	vclean() isn't part of the interface so make it local. Sort prototypes by the interface they belong to.	2002-10-23 06:45:49 +00:00
simonb	63533e7e48	"tmp" in vfs_vnode_print() is set but not used; remove it.	2002-10-22 03:38:21 +00:00
gehenna	77a6b82b27	Merge the gehenna-devsw branch into the trunk. This merge changes the device switch tables from static array to dynamically generated by config(8). - All device switches is defined as a constant structure in device drivers. - The new grammer ``device-major'' is introduced to ``files''. device-major <prefix> char <num> [block <num>] [<rules>] - All device major numbers must be listed up in port dependent majors.<arch> by using this grammer. - Added the new naming convention. The name of the device switch must be <prefix>_[bc]devsw for auto-generation of device switch tables. - The backward compatibility of loading block/character device switch by LKM framework is broken. This is necessary to convert from block/character device major to device name in runtime and vice versa. - The restriction to assign device major by LKM is completely removed. We don't need to reserve LKM entries for dynamic loading of device switch. - In compile time, device major numbers list is packed into the kernel and the LKM framework will refer it to assign device major number dynamically.	2002-09-06 13:18:43 +00:00
matt	48bbf5f234	Use the queue macros from <sys/queue.h> instead of referring to the queue members directly. Use *_FOREACH whenever possible.	2002-09-04 01:32:31 +00:00
thorpej	3767580d1a	Fix a signed/unsigned comparison warning from GCC 3.3.	2002-08-26 01:26:29 +00:00
perseant	43ca783b4a	Back out rev 1.174 of vfs_subr.c, because the splbio() wasn't protecting enough to be useful, and broadening it so that it did would have meant that operations possibly requiring synchronous disk activity would have to be done in splbio(). This clearly was not going to work. Worked around this in the LFS case by having lfs_cluster_callback put an extra hold on the vnode before calling biodone(), and taking the hold off without HOLDRELE's problematic list swapping. lfs_vunref() will take care of that---in thread context---on the next write if need be. Also, ensure that the list walking in lfs_{writevnodes,segunlock,gather} takes into account the possibility that the list may change underneath it (possibly because it itself deleted an element). Tested on i386, test-compiled on alpha.	2002-05-23 23:05:25 +00:00
perseant	ec0ca919be	Protect v_freelist with splbio(), since HOLDRELE can be called in interrupt context (through brelvp). (LFS may be the only subsystem affected by this problem.) Tested on i386.	2002-05-20 22:50:57 +00:00
thorpej	605e664094	vfs_mountroot(): provide more info when we panic.	2002-04-04 01:44:30 +00:00
bjh21	dca4ae94d6	When checking that a potentially-unsigned enum is >= 0, assign it to an int first. This is necessary to avoid warnings with -fshort-enums. Casting to an int really should be enough, but turns out not to be. This change will be documented in doc/HACKS.	2002-03-09 13:22:52 +00:00
thorpej	a180cee23b	Pool deals fairly well with physical memory shortage, but it doesn't deal with shortages of the VM maps where the backing pages are mapped (usually kmem_map). Try to deal with this: * Group all information about the backend allocator for a pool in a separate structure. The pool references this structure, rather than the individual fields. * Change the pool_init() API accordingly, and adjust all callers. * Link all pools using the same backend allocator on a list. * The backend allocator is responsible for waiting for physical memory to become available, but will still fail if it cannot callocate KVA space for the pages. If this happens, carefully drain all pools using the same backend allocator, so that some KVA space can be freed. * Change pool_reclaim() to indicate if it actually succeeded in freeing some pages, and use that information to make draining easier and more efficient. * Get rid of PR_URGENT. There was only one use of it, and it could be dealt with by the caller. From art@openbsd.org.	2002-03-08 20:48:27 +00:00
simonb	9a942a34e0	Don't use local extern declarations for the mountroot variable or declare local prototypes for nfs_mountroot() or md_root_setconf().	2002-03-04 02:25:21 +00:00
chs	90503a3cda	add an assert (hopefully to find where we recycle vnodes without freeing all the pages, like I've seen recently).	2002-02-05 07:50:58 +00:00
chs	62c2e756ed	update vnode flags in ddb vnode-printing function.	2001-12-10 01:38:48 +00:00
chs	8e9cdbbd63	replace "vnode" and "vtext" with "file" and "exec" in uvmexp field names.	2001-12-09 03:07:43 +00:00
chs	163b4fbc50	in vinvalbuf(), vtruncbuf() and vflushbuf(), don't skip calling VOP_PUTPAGES() just because the vnode has no pages. layered filesystems will want to pass these calls on through to the underlying filesystem, and non-layered filesystems may need to remove the vnode from the syncer queues. fix up MP locking and add some locking assertions. fixes PRs 12284 and 14640.	2001-12-06 04:34:33 +00:00
msaitoh	1c87566f38	fix previous commit	2001-11-30 10:31:32 +00:00
msaitoh	797d100f77	fix printf format	2001-11-30 10:06:46 +00:00
christos	0fdce3fa8a	sprinkle crcvt()	2001-11-29 21:21:29 +00:00
lukem	adc783d537	add RCSIDs	2001-11-12 15:25:01 +00:00
thorpej	e8ee04475d	- Add a new vnode flag VEXECMAP, which indicates that a vnode has executable mappings. Stop overloading VTEXT for this purpose (VTEXT also has another meaning). - Rename vn_marktext() to vn_markexec(), and use it when executable mappings of a vnode are established. - In places where we want to set VTEXT, set it in v_flag directly, rather than making a function call to do this (it no longer makes sense to use a function call, since we no longer overload VTEXT with VEXECMAP's meaning). VEXECMAP suggested by Chuq Silvers.	2001-10-30 15:32:01 +00:00
chs	90a3a778a7	when attempting to reclaim a vnode, tell the lockmgr() that it's ok to just fail if we already hold the lock. we'll skip that vnode and try another. fixes PR 14090.	2001-10-04 05:46:45 +00:00
enami	b0df86c9e6	In the function getnewvnode: - Mark file system busy again on retry. - Don't use the variable `listhd' uninitialized.	2001-09-26 00:59:57 +00:00
chs	64c6d1d2dc	a whole bunch of changes to improve performance and robustness under load: - remove special treatment of pager_map mappings in pmaps. this is required now, since I've removed the globals that expose the address range. pager_map now uses pmap_kenter_pa() instead of pmap_enter(), so there's no longer any need to special-case it. - eliminate struct uvm_vnode by moving its fields into struct vnode. - rewrite the pageout path. the pager is now responsible for handling the high-level requests instead of only getting control after a bunch of work has already been done on its behalf. this will allow us to UBCify LFS, which needs tighter control over its pages than other filesystems do. writing a page to disk no longer requires making it read-only, which allows us to write wired pages without causing all kinds of havoc. - use a new PG_PAGEOUT flag to indicate that a page should be freed on behalf of the pagedaemon when it's unlocked. this flag is very similar to PG_RELEASED, but unlike PG_RELEASED, PG_PAGEOUT can be cleared if the pageout fails due to eg. an indirect-block buffer being locked. this allows us to remove the "version" field from struct vm_page, and together with shrinking "loan_count" from 32 bits to 16, struct vm_page is now 4 bytes smaller. - no longer use PG_RELEASED for swap-backed pages. if the page is busy because it's being paged out, we can't release the swap slot to be reallocated until that write is complete, but unlike with vnodes we don't keep a count of in-progress writes so there's no good way to know when the write is done. instead, when we need to free a busy swap-backed page, just sleep until we can get it busy ourselves. - implement a fast-path for extending writes which allows us to avoid zeroing new pages. this substantially reduces cpu usage. - encapsulate the data used by the genfs code in a struct genfs_node, which must be the first element of the filesystem-specific vnode data for filesystems which use genfs_{get,put}pages(). - eliminate many of the UVM pagerops, since they aren't needed anymore now that the pager "put" operation is a higher-level operation. - enhance the genfs code to allow NFS to use the genfs_{get,put}pages instead of a modified copy. - clean up struct vnode by removing all the fields that used to be used by the vfs_cluster.c code (which we don't use anymore with UBC). - remove kmem_object and mb_object since they were useless. instead of allocating pages to these objects, we now just allocate pages with no object. such pages are mapped in the kernel until they are freed, so we can use the mapping to find the page to free it. this allows us to remove splvm() protection in several places. The sum of all these changes improves write throughput on my decstation 5000/200 to within 1% of the rate of NetBSD 1.5 and reduces the elapsed time for "make release" of a NetBSD 1.5 source tree on my 128MB pc to 10% less than a 1.5 kernel took.	2001-09-15 20:36:31 +00:00
chs	adf5d360a7	add a new VFS op, vfs_reinit, which is called when desiredvnodes is adjusted via sysctl. file systems that have hash tables which are sized based on the value of this variable now resize those hash tables using the new value. the max number of FFS softdeps is also recalculated. convert various file systems to use the <sys/queue.h> macros for their hash tables.	2001-09-15 16:12:54 +00:00
jdolecek	332bb4894a	bound check mount args more thoroughly	2001-08-03 06:00:13 +00:00
jdolecek	9bbd53c2ba	vfs_sysctl(): cosmetic: provide explicit size for vfsnames[], to catch mistakes VFS_MAXID/CTL_VFS_NAMES are updated	2001-07-08 10:32:38 +00:00
jdolecek	77f0267d21	Use array based upon CTL_VFS_NAMES to get filesystem name for non-VFS_GENERIC syscall, instead of mountcompatnames[]. Move the extern mountcompatnames[], nmountcompatnames definition to COMPAT_09 \|\| COMPAT_43 section.	2001-06-28 08:12:08 +00:00
thorpej	25f00d4c18	In getnewvnode(), allocate a vnode from the pool with NOWAIT. If that fails, just try to recycle a vnode. If we can't allocate or recycle, issue a warning, sleep a bit, and try the whole thing again. This prevents us from blocking forever if we want to use a very large number of vnodes, but don't have {memory,kva} resources from which to allocate them.	2001-06-26 22:52:03 +00:00
jdolecek	e65c47a67f	vfs_rootmountalloc: take advantage of LIST_FOREACH()	2001-06-26 19:14:25 +00:00
wrstuden	716d3ae08f	In vcount(), when getting rid of unused aliases, don't vgone one which has VXLOCK set - it's already being vgoned, most likely by one of our callers. If we call vgone, we can end up sleeping against ourself with VXLOCK set - we'll start the race for root. Pointed out by Love <lha@stacken.kth.se> on tech-kern. Analysis from Artur Grabowski <art@openbsd.org> via Love. Should resolve PR kern/13077	2001-06-26 15:51:06 +00:00
thorpej	e93d1531c2	Avoid a sleeping malloc call while holding the spechash_slock. XXX This is kinda gross, but prevents complete lossage on an XXX MP system. From Bill Sommerfeld.	2001-06-05 04:42:05 +00:00
thorpej	5b35dc8136	When unmounting a file system, acquire the syncer_lock before vfs_busy'ing just before the dounmount() call. This is to avoid sleeping with the mountlist_slock held -- but we must acquire syncer_lock before vfs_busy because the syncer itself uses syncer_lock -> vfs_busy locking order.	2001-04-16 22:41:09 +00:00
enami	c75004b245	Fix the name of some bits in struct vnode.v_flag.	2001-04-09 14:14:10 +00:00
chs	83d071a318	add UBC memory-usage balancing. we track the number of pages in use for each of the basic types (anonymous data, executable image, cached files) and prevent the pagedaemon from reusing a given page if that would reduce the count of that type of page below a sysctl-setable minimum threshold. the thresholds are controlled via three new sysctl tunables: vm.anonmin, vm.vnodemin, and vm.vtextmin. these tunables are the percentages of pageable memory reserved for each usage, and we do not allow the sum of the minimums to be more than 95% so that there's always some memory that can be reused.	2001-03-09 01:02:10 +00:00
jdolecek	522f569810	make some more constant arrays 'const'	2001-02-21 21:39:52 +00:00
chs	eef7499a6c	in vtruncbuf(), pass 0 (meaning everything at or past the start of the range) instead of the vnode's size to pgo_flush() since there can be pages past EOF. in the same call, cast "lbn" to voff_t to avoid overflow.	2001-02-06 10:58:55 +00:00
chs	6717a2ac1b	in vtruncbuf(), use a "synchronous freeing" flush to prevent a race between write i/os in a disk-based filesystem vs. the disk block being freed by a truncation, allocated to a new file, and written again with different data. if the disk driver reorders the requests and does the second i/o first, the old data will clobber the new, corrupting the new file.	2001-01-08 07:05:47 +00:00
sommerfeld	f2bdd546dd	Add a missing simple_unlock() to the LK_NOWAIT/VXLOCK error case in vget().	2000-12-31 03:13:51 +00:00
chs	aeda8d3b77	Initial integration of the Unified Buffer Cache project.	2000-11-27 08:39:39 +00:00
chs	fa19fe52db	adjust the spinlock macros in the non-MULTIPROCESSOR, non-LOCKDEBUG case so that gcc will think that static spinlock are used. this allows us to remove the ugly conditionalization of static spinlock declarations.	2000-11-24 03:59:07 +00:00
fvdl	8c28d7e864	Adapt for VOP_FSYNC parameter change.	2000-09-19 22:00:01 +00:00
enami	445cbcb8c1	Accquire vnode interlock while playing with flags to see if there is someone waiting this vnode.	2000-09-05 05:13:43 +00:00
bouyer	efc4435cb3	in vfs_shutdown(), use sched_suspend() to suspend scheduling, and use tsleep() instead of DELAY. Also, keep trying flushing buffers when the number of dirty buffers decreases (20 rounds may not be enouth for a very large buffer cache). Using tsleep instead of delay gives a chance to others kernel threads to run, which is needed for raidframe. With this change I've not been able to reproduce the 'dirty buffer not flushed' problem with raidframe.	2000-08-31 14:41:35 +00:00
enami	d707b78562	Declare this static simplelock data only when MULTIPROCESSOR or LOCKDEBUG is defined to prevent compiler warning.	2000-08-21 06:42:57 +00:00
sommerfeld	78e4a089b8	Don't bother reinitializing statically-inited locks	2000-08-21 02:16:30 +00:00
sommerfeld	8875442492	Statically initialize statically-allocated locks	2000-08-19 17:25:33 +00:00
sommerfeld	861fcc44b7	Use ltsleep(...,PNORELOCK..) instead of simple_unlock()/tsleep()	2000-08-12 16:43:00 +00:00
fvdl	57e3691758	Don't wait for B_READ buffers to finish in vfs_shutdown, it makes no sense to do so. From Ethan Solomita.	2000-07-16 21:07:24 +00:00
jdolecek	1ec07d7439	change tablefull() to accept one more parameter - optional hint use that to inform about way to raise current limit when we reach maximum number of processes, descriptors or vnodes XXX hopefully I catched all users of tablefull()	2000-07-04 15:33:28 +00:00
fvdl	975751cda2	vinsheadfree -> ungetnewvnode	2000-06-27 23:51:51 +00:00
fvdl	c39797c045	Add vinsheadfree, a small function to push vnodes that have just been allocated by getnewvnode, back onto the head of the free list. Needed in some VFS_VGET functions to deal with races.	2000-06-27 23:34:45 +00:00
mrg	32aa199ccf	remove include of <vm/vm.h>	2000-06-27 17:41:07 +00:00
sommerfeld	e964d558a7	Fix assorted bugs around shutdown/reboot/panic time. - add a new global variable, doing_shutdown, which is nonzero if vfs_shutdown() or panic() have been called. - in panic, set RB_NOSYNC if doing_shutdown is already set on entry so we don't reenter vfs_shutdown if we panic'ed there. - in vfs_shutdown, don't use proc0's process for sys_sync unless curproc is NULL. - in lockmgr, attribute successful locks to proc0 if doing_shutdown && curproc==NULL, and panic if we can't get the lock right away; avoids the spurious lockmgr DIAGNOSTIC panic from the ddb reboot command. - in subr_pool, deal with curproc==NULL in the doing_shutdown case. - in mfs_strategy, bitbucket writes if doing_shutdown, so we don't wedge waiting for the mfs process. - in ltsleep, treat ((curproc == NULL) && doing_shutdown) like the panicstr case. Appears to fix: kern/9239, kern/10187, kern/9367. May also fix kern/10122.	2000-06-10 18:44:43 +00:00
assar	6c734cd283	make vfs_getnewfsid only take one argument and fetch the name of the filesystem from the supplied mount argument. also make makefstype take a const parameter. update all the callers.	2000-06-10 18:27:01 +00:00
mycroft	4656dfd24f	Add a new function to remove extra buffers when truncating a file. This is more generic than the vinvalbuf(V_SAVEMETA) case, avoiding synchronous operations when truncating to a non-zero length.	2000-05-28 04:13:56 +00:00
chs	1c084aee4f	add ddb commands for printing vnodes and bufs.	2000-04-10 02:22:13 +00:00
augustss	c87c1861bb	Add a special option, DEBUG_HALT_BUSY, that allows you to debug when the system doesn't want to halt cleanly. The code was there before, but only with the DEBUG option.	2000-03-30 09:32:25 +00:00
augustss	264f1d27c6	Get rid of register declarations.	2000-03-30 09:27:11 +00:00
fvdl	c3167b9545	Do previous better. Use FSYNC_RECLAIM as it was before.	2000-03-17 01:25:06 +00:00
jdolecek	89015c4648	Add new VFS op routine - vfs_done and call it on filesystem detach in vfs_detach(). vfs_done may free global filesystem's resources, typically those allocated in respective filesystem's init function. Needed so those filesystems which went in via LKM have a chance to clean after themselves before unloading. This fixes random panics when LKM for filesystem using pools was loaded and unloaded several times. For each leaf filesystem, add appropriate vfs_done routine.	2000-03-16 18:08:17 +00:00
fvdl	01db605567	Do the previous slightly different: any files on MNT_SOFTDEP filesystems do not want all their metadata dependencies flushed from vinvalbuf() if there are no dirty blocks.	2000-03-15 16:28:45 +00:00
perseant	61fa9e1409	Move vinvalbuf's check for dirty blocks into ffs_fsync, to ensure that mode and ownership bits are flushed to disk before the vnode is reclaimed. The check, introduced in the softdep merge, assumes that if no blocks are dirty, no file data or metadata needs to be flushed to disk. This is true of ffs, but is not true of lfs, and may not be true of other filesystems. Tested by myself and Bill Squier <groo@cs.stevens-tech.edu>.	2000-03-11 05:00:18 +00:00
mycroft	1d915f4130	Allow my disk to actually spin down using `-o async' again. Note: This uses the same questionable logic as vfs_bio.c to check MNT_ASYNC. Something needs to be done about this.	2000-03-03 05:21:03 +00:00
fvdl	c13f6dd258	Introduce a sysctl to enable/disable if non-root users can mount filesystems. Default: off.	2000-02-16 11:57:45 +00:00
perseant	fa6a733240	In lfs_bwrite, don't mark buffers dirty if lfs is mounted read-only. (Previously buffers could be marked dirty by the cleaner, and possibly by other means.) Also check for softdep mount in vfs_shutdown before trying to bawrite buffers, since other filesystems don't need it and lfs doesn't bawrite. (This fragment reviewed by fvdl.) Partially addresses PR#8964.	1999-12-15 07:10:32 +00:00
fvdl	d901f6eae0	Be more careful to block bio interrupts for some data structures. There were at least a few missed cases where vp->v_{clean,dirty}blkhd were unprotected since the softdep/trickle sync merge.	1999-11-23 23:52:40 +00:00
enami	f6b8114fc7	Initialize the vnode_hold_list correctly.	1999-11-18 05:50:25 +00:00
fvdl	0b1963121a	Add Kirk McKusick's soft updates code to the trunk. Not enabled by default, as the copyright on the main file (ffs_softdep.c) is such that is has been put into gnusrc. options SOFTDEP will pull this in. This code also contains the trickle syncer. Bump version number to 1.4O	1999-11-15 18:49:07 +00:00
mycroft	fde519b5e2	Widen usecount and writecount to prevent overflow.	1999-10-01 22:03:17 +00:00
mycroft	5e7ae44739	Correct spelling in an #ifdef.	1999-10-01 21:57:42 +00:00
wrstuden	ba891a728d	Deal with device vnodes which aren't on the spechash tables, rather than panicing. So now we make sure vp->v_hashchain != NULL before removing the node from the chain.	1999-08-20 22:21:25 +00:00
thorpej	f2c2e160b1	Fix "print vnodes for dirty buffers" change: use vprint(); VOP_PRINT() is only meant to be used by vprint(), and vprint() provides more information about the vnode.	1999-08-19 18:09:44 +00:00
simonb	c620766979	In vfs_shutdown() print any vnodes for busy buffers if DEBUG is defined. Patch from Bill Studenmund.	1999-08-19 13:54:06 +00:00
ross	2f76dd5371	In getnewvnode(), initialize v_interlock when the vnode comes from the pool allocator.	1999-08-14 06:23:59 +00:00
sommerfeld	3267e5e87d	Probable fix for PR7943: lookups fail spuriously over NFS. The problem was due to an interaction between the doomed unmounts done by amd and getnewvnode. I convinced myself that it's ok for getnewvnode() to do a sleeping vfs_busy(). Tested with multiple builds running while another process attempted to unmount /usr once a second.	1999-07-29 13:31:45 +00:00
wrstuden	a0f2937049	Define VLAYER and make layered fs's set this flag when creating their vnodes. getnewvnode now checks this bit, and it if's set makes sure a vnode's not locked before removing it from the free list. Closes PR 7954 by Alan Barrett <apb@iafrica.com>.	1999-07-15 21:30:31 +00:00
wrstuden	379a26972f	Modify file systems to deal with struct lock in struct vnode. All leaf fs's other than nfs use genfs_lock() for locking. Modify lookup routines to set PDIRUNLOCK when they unlock the parrent.	1999-07-08 01:05:58 +00:00
sommerfeld	e303e2ee8b	Fix kern/7906: race between unmount and getnewvnode() mp->mnt_flags & MNT_MWAIT is replaced by mp->mnt_wcnt, and a new mount flag MNT_GONE is created (reusing the same bit). In insmntque(), add DIAGNOSTIC check to fail if the filesystem vnode is being moved to is in the process of being unmounted. getnewvnode() now protects the list of vnodes active on mp with vfs_busy()/vfs_unbusy(). To avoid generating spurious errors during a doomed unmount, change the "wait for unmount to finish" protocol between dounmount() and vfs_busy(). In vfs_busy(), instead of only sleeping once, sleep until either MNT_UNMOUNT is clear or MNT_GONE is set; also, maintain a count of waiters in mp->mnt_wcnt so that dounmount() knows when it's safe to free mp. tested by running a "while :; do mount /d1; umount -f /d1; done" loop against multiple find(1) processes.	1999-07-04 16:20:12 +00:00
mrg	48c12bfeed	revert previous. oops.	1999-04-21 02:37:07 +00:00
mrg	58540a2274	properly test the msgsz as "msgsz - len". from PR#7386	1999-04-21 02:31:49 +00:00
mrg	d2397ac5f7	completely remove Mach VM support. all that is left is the all the header files as UVM still uses (most of) these.	1999-03-24 05:50:49 +00:00
sommerfe	36dc99adac	vinvalbuf, called from vclean, could cause a locking-against-self deadlock in VOP_FSYNC() if the unreferenced vnode picked for reclamation happened to be stacked on top of a vnode the process already had locked. This could happen if the same filesystem was accessed both through a union mount and directly; it seemed to happen most frequently when the direct access was through NFS. Avoid this deadlock by changing vinvalbuf to pass a new FSYNC_RECLAIM flag bit to VOP_FSYNC() to indicate that a reclaim is in progress and only a `shallow' fsync is necessary. Do nothing in _fsync() in umapfs, nullfs, and unionfs when FSYNC_RECLAIM is set; the underlying vnodes will shortly be released in _reclaim and may be reclaimed (and fsync'ed) later.	1999-03-22 17:24:19 +00:00
wrstuden	d0ab43c887	Oops. Need to have VOP_LOCK before calling uvm_vnp_terminate, not after (comments and code inspection in uvm_vnp_terminate agree on this).	1999-02-09 01:57:05 +00:00
christos	bee9dafdf5	defopt COMPAT_43	1998-12-10 15:07:01 +00:00
thorpej	eb8f1afb3e	Implement vdevgone(), to revoke all vnodes corresponding to the specified type, major, (low minor...high minor).	1998-11-18 20:24:59 +00:00
thorpej	88bc4b9f8d	Conditionally include the 4.4BSD-Lite2 compat vfs sysctl code.	1998-11-15 18:38:11 +00:00
thorpej	d23593a784	Make vfs_sysctl() work.	1998-11-13 20:15:32 +00:00
thorpej	830ea34819	Use the pool allocator and "nointr" pool page allocator for vnodes. The only benefit this provides is that we don't use kmem_map to map the memory used for vnodes (though, this is a 30 virtual page savings on my PPro) since vnodes are never freed (they have their own freelist).	1998-09-01 03:09:14 +00:00
thorpej	e00e495827	Add missing simple_unlock(), from Stefan Grefen, PR #5981 .	1998-08-17 17:29:20 +00:00
perry	275d1554aa	Abolition of bcopy, ovbcopy, bcmp, and bzero, phase one. bcopy(x, y, z) -> memcpy(y, x, z) ovbcopy(x, y, z) -> memmove(y, x, z) bcmp(x, y, z) -> memcmp(x, y, z) bzero(x, y) -> memset(x, 0, y)	1998-08-04 04:03:10 +00:00
perry	730baa7431	fix sizeofs so they comply with the KNF style guide. yes, it is pedantic.	1998-07-31 22:50:48 +00:00
kleink	636443752e	Nuke a couple of non-local prototypes which are already declared in either <sys/buf.h>, <sys/mount.h> or <sys/vnode.h>.	1998-06-08 15:52:07 +00:00
kleink	382743ada3	Convert fsync vnode operator implementations and usage from the old `waitfor' argument and MNT_WAIT/MNT_NOWAIT to `flags' and FSYNC_WAIT.	1998-06-05 19:53:00 +00:00
pk	d0e85dde99	Inline vref/vrele in vclean() because: * we already have the vnode interlock, so vref() should not ask for it again. * we call VOP_RECLAIM/VOP_INACTIVE(), which shouldn't be duplicated in vrele().	1998-05-18 14:59:49 +00:00
pk	addb5d9572	VOP_CLOSE() takes F* flags, not IO_* flags.	1998-05-08 21:02:35 +00:00
thorpej	7a239c12c6	In vfs_unmountall(), if curproc is NULL, abort, because unmounting puts the requesting process to sleep until the file systems buffers have flushed, and sleeping with a NULL curproc will cause a fault.	1998-04-26 19:10:33 +00:00
thorpej	cbc64bb02d	Make vfs_shutdown() look a little nicer.	1998-04-26 18:58:54 +00:00
fvdl	1d02bb10d8	Clarify vget() comment a bit.	1998-03-04 09:13:48 +00:00
thorpej	9ebbb62608	Export spechash_slock; it's used outside of vfs_subr.c	1998-03-03 02:22:00 +00:00
ross	94ae870894	Compile post-lite2 with #ifndef DIAGNOSTIC	1998-03-01 09:51:29 +00:00
fvdl	e5bc90f40c	Merge with Lite2 + local changes	1998-03-01 02:20:01 +00:00
thorpej	d1f0dbf1b1	Don't use vfssw[], it's gone; use vfs_list instead. Implement vfs_attach() and vfs_detach(), which add and remove file systems from the kernel.	1998-02-18 07:16:41 +00:00
mrg	d90485202c	- add defopt's for UVM, UVMHIST and PMAP_NEW. - remove unnecessary UVMHIST_DECL's.	1998-02-10 14:08:44 +00:00
mrg	1a8c7604f4	initial import of the new virtual memory system, UVM, into -current. UVM was written by chuck cranor <chuck@maria.wustl.edu>, with some minor portions derived from the old Mach code. i provided some help getting swap and paging working, and other bug fixes/ideas. chuck silvers <chuq@chuq.com> also provided some other fixes. this is the rest of the MI portion changes. this will be KNF'd shortly. :-)	1998-02-05 07:59:28 +00:00
christos	3c07a14a75	Separate assigments of tv_sec and tv_nsec since tv_sec is a time_t (int on the alpha) and tv_nsec is a long.	1997-10-18 16:34:17 +00:00
enami	acd4cefee0	In the function vattr_null(), assign each member individually to prevent unintended conversion due to different sign and size.	1997-10-18 11:51:32 +00:00
thorpej	176a81b2c5	Copyright assigned to The NetBSD Foundation.	1997-10-05 18:37:01 +00:00
thorpej	a2721a0f1b	In vfs_shutdown(), do the "sync and wait for it to finish" _before_ unmounting all of the file systems. If we encounter a condition where all of the dirty buffers could not flush, then don't unmount file systems, since it might be likely to wedge.	1997-09-24 21:40:55 +00:00
fvdl	4ad51c2811	Allow multiple export requests for a filesystem, host pair if the flags and anon cred are the same. Should probably be handled better in the mountd, but this will do for now. Fixes PR 469, submitted Sept 1994 by a certain "Jason R. Thorpe".. ;-)	1997-07-20 23:31:32 +00:00
fvdl	4702f17abc	Add functions to set/reset the info about the publicly exported (WebNFS) filesystem. At first glance this should go into the NFS code, but all the other export code is here as well.	1997-06-24 23:43:33 +00:00
cgd	21bbd49bf2	in vfs_shutdown(), print "syncing disks... " before dropping to spl0(), so that if the drop to spl0() causes another panic (e.g. because there's still some fatal hardware interrupt that's pending) we'll know that we dropped IPL to sync the disks.	1997-06-07 17:27:57 +00:00
mycroft	06fb68217b	Oops; fix reversed test for VDIR.	1997-05-08 16:34:54 +00:00
mycroft	e3f99a9397	Pass the vnode type to vaccess(), and use it when checking VEXEC. Make sure that the mode bits passed to vaccess() and returned by foo_getattr() contain only permission bits.	1997-05-08 16:19:43 +00:00
mycroft	8f5978a181	GC VS[UG]ID and VSVTX, and add a new VLOOKUP, since the semantics are now different from VEXEC.	1997-05-08 10:21:35 +00:00
mycroft	c32418bf82	Fix error in vfs_hang_addrlist() that caused file systems to be exported to more subnets than expected when using netmasks. From Mike Hibler.	1997-04-25 02:43:10 +00:00
mycroft	5e62a0725b	Change previous test slightly.	1997-04-23 20:19:45 +00:00
mycroft	b34794e10f	Do not return success when checking for execute permission by super-user and no execute bits are set. Also, this test is no longer needed in execve(2).	1997-04-23 20:18:16 +00:00
mycroft	6911ff7d13	Fix two performance issues: * When a delayed write buffer falls off the LRU queue, arrange for it to go on the AGE queue after being flushed out to disk. * When a delayed write buffer is synced, leave it in its relative position in the LRU queue.	1997-04-09 21:12:10 +00:00
kleink	5ec0772a62	In checkalias(), initialize the speclockf structure member invented with the specfs advisory locking support; this could cause a panic.	1997-04-03 23:15:52 +00:00
fvdl	501f1a3eb9	Do previous change properly (pasto; should have been inside the loop).	1997-02-23 00:07:18 +00:00
fvdl	0538233e2c	Implement changes to make fix for NQNFS and MFS unmounting (race conditions) work. Not quite as good as with the Lite2 merges, but it'll do until then. * dounmount() expects to be called with the mountpoint marked busy * all callers of dounmount() thus make the call themselves * if a filesystem was being unmounted, and we're woken up in vfs_busy(), don't reference the mountpoint struct pointer, as it has very probably been freed.	1997-02-22 03:22:32 +00:00
thorpej	2ca27c5550	Garbage-collect "argdev".	1997-01-31 19:10:27 +00:00
thorpej	68de7ca719	- Implement vfs_mountroot(). This function is called my main() to mount the root file system. If the operator specified the root file system type in the kernel configuration file, attempt to mount that file system type on the root device. If the root file system type was wildcarded (or unspecified), try all of the file systems statically built into the kernel until one succeeds. If no file systems succeed, return an error. The system will recover from this condition. - Implement vfs_getopsbyname(). This function returns the file system ops vector given a file system name.	1997-01-31 02:50:36 +00:00
christos	f443b89c92	backout previous kprintf change	1996-10-13 02:32:29 +00:00
christos	60d201973e	printf -> kprintf, sprintf -> ksprintf	1996-10-10 22:46:11 +00:00
cgd	60b9a20b61	initialize vnode_free_list and mountlist at compile time with the new queue.h list/queue head initializer macros. mountlist was converted so that panics (or other reboots) early on in kernel startup don't cause sys_sync() to croak. vnode_free_list was converted because it was nearby.	1996-10-01 22:49:11 +00:00
jtk	ef561b71a7	print out file systems being unmounted, #ifdef DEBUG. pr#1492	1996-06-01 20:24:05 +00:00
christos	4ef330b934	remove include of <sys/cpu.h>	1996-04-22 01:38:12 +00:00
christos	c9e746a335	Fix printf() formats.	1996-03-16 23:17:04 +00:00
christos	09afd77655	More proto fixes	1996-02-09 18:59:18 +00:00
christos	e630447d8c	First pass at prototyping	1996-02-04 02:17:43 +00:00
jtc	e19bfae4f9	Rename struct timespec fields to conform to POSIX.1b	1996-02-01 00:18:04 +00:00
mycroft	436d930db5	Use insmntque() rather than manually frobbing the mount list.	1996-01-30 18:21:08 +00:00
mycroft	245f292fed	Prefix names of system call implementation functions with `sys_'.	1995-10-07 06:25:19 +00:00
mycroft	083ba962e2	Oops; need fcntl.h.	1995-07-03 16:58:38 +00:00
mycroft	9a4505cb89	Close routines take file flags, not I/O flags. Fix two incorrect usages.	1995-07-02 18:13:02 +00:00
jtc	95ded74f58	Moved egid credential from cr_groups[0] to new field cr_gid. POSIX.1 requires that sgid executables and the setuid() syscall not change the supplemental group list.	1995-06-01 22:43:30 +00:00
mycroft	78356f06b3	Add two vprint()s, to give more informative panic messages.	1995-05-04 03:11:06 +00:00
mycroft	954487037b	Rearrange vfs_shutdown() slightly.	1995-04-21 22:09:53 +00:00
mycroft	84f803aef6	Add a return type for vaccess().	1995-04-21 22:03:24 +00:00
mycroft	f51cb8c974	Print a message for each file system that does not unmount cleanly. Add a vfs_shutdown() routine that does the unmount and sync.	1995-04-21 21:55:11 +00:00
mycroft	6cabaea642	Define vfs_unmountall(), to unmount file systems at shutdown time.	1995-04-10 19:46:56 +00:00
mycroft	9843f45605	Turn mountlist into a CIRCLEQ, and handle setting and checking of MNT_ROOTFS differently.	1995-01-18 06:19:49 +00:00
cgd	7fb59862ff	undo charles's accidental changes.	1995-01-15 09:23:05 +00:00
mycroft	d903b2aa28	Remove unused extern.	1995-01-09 19:54:28 +00:00
ws	2f0fb8ee09	Implement and use a common access checking routine	1994-12-24 16:44:12 +00:00
cgd	f3dc337d8a	fix done in rev. 1.23 over again. it was clobbered, and problem masked	1994-07-10 05:53:25 +00:00
deraadt	318b9c6b63	limit st_dev to 15 bits set for nfs filesystems	1994-07-02 04:51:18 +00:00
cgd	cf92afd66e	New RCS ID's, take two. they're more aesthecially pleasant, and use 'NetBSD'	1994-06-29 06:29:24 +00:00
mycroft	33d82e8a8b	Move definition of prtactive.	1994-06-13 15:37:55 +00:00
mycroft	699bbb84b6	Update to 4.4-Lite fs code.	1994-06-08 11:28:29 +00:00
cgd	91cf0fbaf3	copyright foo	1994-05-17 04:21:49 +00:00
cgd	e15a2ee17e	sysctl update	1994-05-07 00:53:37 +00:00
cgd	d071d1cf05	some prototype cleanup, eliminate/replace bogus types (e.g. quad and u_quad) -> use better types (e.g. quad_t & u_quad_t in inodes), some cleanup.	1994-04-25 03:49:27 +00:00
cgd	b1f4730729	some more queue code (that's #ifdef DEBUG)	1994-04-23 08:41:05 +00:00
cgd	4917d8beec	make fs types consistent over new kernels. also, some proto foo.	1994-04-23 07:54:38 +00:00
cgd	3dda0064a5	Convert mount, vnode, and buf structs to use <sys/queue.h>. Also, some knf and structure frobbing to do along with it.	1994-04-21 07:47:31 +00:00
cgd	3fe93ccc24	don't let cons dev vnode get subsumed by a 'real' vnode. the current scheme of vnode aliasing just has to go.	1994-04-18 21:03:14 +00:00
cgd	4be7b669e2	fs types are names now; accompanying changes.	1994-04-14 04:05:28 +00:00
mycroft	6076d8a10d	Fix typo.	1994-04-12 02:23:14 +00:00
mycroft	0600b23926	Remove a bogus optimization I did.	1994-04-11 23:43:04 +00:00
cgd	913fdbc06d	slight optimization, kill unnecessary label.	1994-04-11 22:03:17 +00:00

... 2 3 4 5 6 ...

368 Commits