NetBSD

Commit Graph

Author	SHA1	Message	Date
perseant	418bf18f53	Handle the "filesystem is clean" flag correctly when upgrading from read-only to read-write mount. This makes "root on lfs" work for me, although it looks like a different traceback from PR#32667.	2006-03-31 02:31:37 +00:00
yamt	c5fcdd1719	some cleanups after the introduction of GOP_SIZE_MEM flag. - remove GOP_SIZE_READ/GOP_SIZE_WRITE flags. they have not been used since the change. - ufs_balloc_range: remove code which has been no-op since the change. thanks Konrad Schroder for explaining the original intention of the code. - ffs_gop_size: don't extend past eof, in the case of GOP_SIZE_MEM. otherwise genfs_getpages end up to allocate pages past eof unnecessarily.	2006-03-30 12:40:06 +00:00
perseant	0a4e8d80c1	Double-checkpoint on unmount. This ensures that vnodes belonging to removed files are really freed, preventing occasional spurious EBUSY returns from vflush().	2006-03-28 23:57:41 +00:00
perseant	dddf5c5171	Improvements to LFS's paging mechanism, to wit: * Acknowledge that sometimes there are more dirty pages to be written to disk than clean segments. When we reach the danger line, lfs_gop_write() now returns EAGAIN. The caller of VOP_PUTPAGES(), if it holds the segment lock, drops it and waits for the cleaner to make room before continuing. * Note and avoid a three-way deadlock in lfs_putpages (a writer holding a page busy blocks on the cleaner while the cleaner blocks on the segment lock while lfs_putpages blocks on the page).	2006-03-24 20:05:32 +00:00
tls	a67eab5ee4	From Konrad Schroeder, in response to strange df output on anoncvs.netbsd.org: We were returning the wrong value for free space. Now we're not.	2006-03-17 23:21:01 +00:00
thorpej	58853410ae	Use device_class() instead of accessing dv_class directly.	2006-02-21 04:32:38 +00:00
yamt	03f80508d6	- unify ffs_blkatoff and lfs_blkatoff. - remove ufs_ops::uo_blkatoff. - add directory read-ahead code. (disabled for now.)	2006-01-14 17:41:16 +00:00
yamt	690d424f28	- add simple functions to allocate/free a buffer for i/o. - make bufpool static.	2006-01-04 10:13:05 +00:00
christos	95e1ffb156	merge ktrace-lwp.	2005-12-11 12:16:03 +00:00
yamt	a748ea88dd	merge yamt-vop branch. remove following VOPs. VOP_BLKATOFF VOP_VALLOC VOP_BALLOC VOP_REALLOCBLKS VOP_VFREE VOP_TRUNCATE VOP_UPDATE	2005-11-02 12:38:58 +00:00
yamt	baee927713	introduce "ufs_ops" and use it for ITIMES.	2005-09-27 06:48:55 +00:00
jmmv	2a3e5eeb7c	Apply the NFS exports list rototill patch: - Remove all NFS related stuff from file system specific code. - Drop the vfs_checkexp hook and generalize it in the new nfs_check_export function, thus removing redundancy from all file systems. - Move all NFS export-related stuff from kern/vfs_subr.c to the new file sys/nfs/nfs_export.c. The former was becoming large and its code is always compiled, regardless of the build options. Using the latter, the code is only compiled in when NFSSERVER is enabled. While doing this, also make some functions in nfs_subs.c conditional to NFSSERVER. - Add a new command in nfssvc(2), called NFSSVC_SETEXPORTSLIST, that takes a path and a set of export entries. At the moment it can only clear the exports list or append entries, one by one, but it is done in a way that allows setting the whole set of entries atomically in the future (see the comment in mountd_set_exports_list or in doc/TODO). - Change mountd(8) to use the nfssvc(2) system call instead of mount(2) so that it becomes file system agnostic. In fact, all this whole thing was done to remove a 'XXX' block from this utility! - Change the mount, newfs and fsck userland utilities to not deal with NFS exports initialization; done internally by the kernel when initializing the NFS support for each file system. - Implement an interface for VFS (called VFS hooks) so that several kernel subsystems can run arbitrary code upon receipt of specific VFS events. At the moment, this only provides support for unmount and is used to destroy NFS exports lists from the file systems being unmounted, though it has room for extension. Thanks go to yamt@, chs@, thorpej@, wrstuden@ and others for their comments and advice in the development of this patch.	2005-09-23 12:10:31 +00:00
christos	0b0eb1328b	Don't overload MAXNAMLEN, use a separate constant for each filesystem type.	2005-08-23 08:05:13 +00:00
christos	50f8955b6e	64 bit inode changes.	2005-08-19 02:04:03 +00:00
yamt	b7bfe82866	update file timestamps for nfsd loaned-read and mmap. PR/25279. discussed on tech-kern@.	2005-07-23 12:18:41 +00:00
yamt	44d128fa8e	- constify genfs_ops. - use member designators.	2005-06-28 09:30:37 +00:00
atatat	420d91208b	Properly fix the constipated lossage wrt -Wcast-qual and the sysctl code. I know it's not the prettiest code, but it seems to work rather well in spite of itself.	2005-06-09 02:19:59 +00:00
christos	273df63602	- sprinkle const - avoid shadow variables.	2005-05-29 21:25:24 +00:00
perseant	2ecd1730c0	Keep track of the number of segments reclaimed, since the cleaner doesn't do this anymore (it hasn't for quite some time). Add a couple of conditional debugging messages to indicate why segments are not cleaned, in the event that lfs_segclean is used. Make the LFCNSEGWAITALL fcntl work again.	2005-05-20 19:48:25 +00:00
perseant	5760c21b8b	Fill in the lfs_fsmnt field in the superblock when we mount the filesystem, so fsck(8) can tell where it was last mounted.	2005-05-20 19:03:11 +00:00
perseant	0d41dd0d46	Don't let the pager_map deadlock avoidance code in lfs_putpages() write segments containing zero-block FINFO records. These records cause segments to become uncleanable, which would eventually result in a "no clean segments" panic.	2005-05-04 04:58:22 +00:00
perseant	2f695b5476	Provide a resize_lfs(8), including kernel and cleaner support. The current implementation requires the fs to be mounted while resizing. Tested in both directions, and everything appears to work happily, but ymmv.	2005-04-23 19:47:51 +00:00
perseant	f4a7694fc9	Keep per-inode, per-fs, and subsystem-wide counts of blocks allocated through lfs_balloc(), and use that to estimate the number of dirty pages belonging to LFS (subsystem or filesystem). This is almost certainly wrong for the case of a large mmap()ed region, but the accounting is tighter than what we had before, and performs much better in the typical case of pages dirtied through write().	2005-04-19 20:59:05 +00:00
perseant	ad0169af41	Remove left-over reference to "lfs_blist", for _LKM case.	2005-04-16 18:10:12 +00:00
perseant	5ed792ecb0	Use splay trees, rather than a hash table, to manage the accounting of blocks allocated through VOP_BALLOC() for pages to be written to disk. This accounting no longer takes a noticeable fraction of the system CPU.	2005-04-16 17:35:58 +00:00
perseant	f08a1ca4fa	Consolidate the hash table we use to maintain the integrity of lfs_avail into a single, system-wide table, rather than having a separate hash table per inode. Significantly reduces the "system" cpu usage of your average file write.	2005-04-14 00:44:16 +00:00
perseant	2ee78c4fa9	Keep track of the highest block held by an LFS inode, so that we can be assured that the last byte of a file is always allocated. Previously a file extension could cause the filesystem to be flushed, writing an inconsistent inode to disk. Although this condition would be corrected the next time blocks were written to disk, an intervening crash would leave the filesystem in an inconsistent state, leaving fsck_lfs to complain of an inode "partially truncated".	2005-04-14 00:02:46 +00:00
perseant	af48a6d91c	Clean up the handling of the pager_map deadlock in lfs_putpages, after realizing that it is safe to sleep the second time through the loop.	2005-04-08 00:08:42 +00:00
perseant	c9d4fa4c0d	Fix some locking issues that appeared with the simple_lock work. Address a "pager_map" deadlock in lfs_putpages().	2005-04-06 04:30:46 +00:00
perseant	1ebfc508b6	Protect various per-fs structures with fs->lfs_interlock simple_lock, to improve behavior in the multiprocessor case. Add debugging segment-lock assertion statements.	2005-04-01 21:59:46 +00:00
thorpej	e633e8b61b	- Define a VFS_ATTACH() macro that places a reference to a vfsops structure into the "vfsops" link set. - Use VFS_ATTACH() where vfsops are declared for individual file systems. - In vfsinit(), traverse the "vfsops" link set, rather than vfs_list_initial[].	2005-03-29 02:41:05 +00:00
simonb	52c470b886	Tab Police.	2005-03-08 04:49:35 +00:00
perseant	eefd94b8e2	Straighten out the maze of ifdefs. Instead, consolidate all the debugging stuff under '#ifdef DEBUG', and use sysctl knobs to turn on/off particular parts of the debugging reporting (if DEBUG is enabled). Re-enable the LFS statistics in sysctl, while I'm there. A bit of a rototill.	2005-03-08 00:18:19 +00:00
perseant	8de99480fa	Move "ifile is too large for your NBUFS/BUFPAGES" messages into a function. Use log(9) to warn the user instead of printf(9). Since the theory is that the Ifile is "always in cache", but the greater performance risk is when the inode entries can't be held in cache, note these two cases separately, at different log levels (notice and warning, respectively).	2005-03-04 22:19:05 +00:00
perry	bcfcddbac1	nuke trailing whitespace	2005-02-26 22:31:44 +00:00
perseant	25f49c3c91	Various minor LFS improvements: * Note when lfs_putpages(9) thinks it is not going to be writing any pages before calling genfs_putpages(9). This prevents a situation in which blocks can be queued for writing without a segment header. * Correct computation of NRESERVE(), though it is still a gross overestimate in most cases. Note that if NRESERVE() is too high, it may be impossible to create files on the filesystem. We catch this case on filesystem mount and refuse to mount r/w. * Allow filesystems to be mounted whose block size is == MAXBSIZE. * Somewhere along the line, ufs_bmaparray(9) started mangling UNWRITTEN entries in indirect blocks again, triggering a failed assertion "daddr <= LFS_MAX_DADDR". Explicitly convert to and from int32_t to correct this. * Add a high-water mark for the number of dirty pages any given LFS can hold before triggering a flush. This is settable by sysctl, but off (zero) by default. * Be more careful about the MAX_BYTES and MAX_BUFS computations so we shouldn't see "please increase to at least zero" messages. * Note that VBLK and VCHR vnodes can have nonzero values in di_db[0] even though their v_size == 0. Don't panic when we see this. * Change lfs_bfree to a signed quantity. The manner in which it is processed before being passed to the cleaner means that sometimes it may drop below zero, and the cleaner must be aware of this. * Never report bfree < 0 (or higher than lfs_dsize) through lfs_statvfs(9). This prevents df(1) from ever telling us that our full filesystems have 16TB free. * Account space allocated through lfs_balloc(9) that does not have associated buffer headers, so that the pagedaemon doesn't run us out of segments. * Return ENOSPC from lfs_balloc(9) when bfree drops to zero. * Address a deadlock in lfs_bmapv/lfs_markv when the filesystem is being unmounted. Because vfs_busy() is a shared lock, and lfs_bmapv/lfs_markv mark the filesystem vfs_busy(), the cleaner can be holding the lock that umount() is blocking on, then try to vfs_busy() again in getnewvnode().	2005-02-26 05:40:42 +00:00
mycroft	7f1fe4e81f	Rearrange some code slightly to avoid uninitialized variable warnings.	2005-01-11 00:19:36 +00:00
mycroft	e72fc6717e	Whoops -- move the location of the VOP_OPEN()/VOP_CLOSE(), et al, from foo_mountfs() to foo_mount(), to match the new mountroot API. Also, for ext2fs and lfs, copy some restructuring from ffs to allow changing file system parameters without specifying the device name. (ntfs could use some more work.)	2005-01-09 09:27:17 +00:00
mycroft	0461b30ac3	Rework the mountroot interface so that vfs_mountroot() opens the root device and just passes it on to the file system functions. This avoids opening and closing the device several times. Mentioned on tech-kern some time ago, IIRC. I've been running this for a long time.	2005-01-09 03:11:48 +00:00
thorpej	1c95472d01	Add the system call and VFS infrastructure for file system extended attributes. From FreeBSD.	2005-01-02 16:08:28 +00:00
mycroft	2070a0c580	Make sure to set IMNT_DTYPE here...	2004-08-16 12:49:55 +00:00
mycroft	14f6fc2dfb	Need to set um_dirblksiz here...	2004-08-15 16:07:08 +00:00
mycroft	45a21b76f0	Fixing age old cruft: * Rather than using mnt_maxsymlinklen to indicate that a file systems returns d_type fields(!), add a new internal flag, IMNT_DTYPE. Add 3 new elements to ufsmount: * um_maxsymlinklen, replaces mnt_maxsymlinklen (which never should have existed in the first place). * um_dirblksiz, which tracks the current directory block size, eliminating the FS-specific checks littered throughout the code. This may be used later to make the block size variable. * um_maxfilesize, which is the maximum file size, possibly adjusted lower due to implementation issues. Sync some bug fixes from FFS into ext2fs, particularly: * ffs_lookup.c 1.21, 1.28, 1.33, 1.48 * ffs_inode.c 1.43, 1.44, 1.45, 1.66, 1.67 * ffs_vnops.c 1.84, 1.85, 1.86 Clean up some crappy pointer frobnication.	2004-08-15 07:19:54 +00:00
mycroft	bc25b30608	Add a new flag, IN_MODIFY. This is like IN_UPDATE\|IN_CHANGE, but unlike setting those flags, it does not cause the inode to be written in the periodic sync. This is used for writes to special files (devices and named pipes) and FIFOs. Do not preemptively sync updates to access times and modification times. They are now updated in the inode only opportunistically, or when the file or device is closed. (Really, it should be delayed beyond close, but this is enough to help substantially with device nodes.) And the most amusing part: Trickle sync was broken on both FFS and ext2fs, in different ways. In FFS, the periodic call to VFS_SYNC(MNT_LAZY) was still causing all file data to be synced. In ext2fs, it was causing the metadata to not be synced. We now only call VOP_UPDATE() on the node if we're doing MNT_LAZY. I've confirmed that we do in fact trickle correctly now.	2004-08-14 01:08:02 +00:00
pk	a7c40722d8	Call inittodr() from main(). Let file system code set the recorded `last update' time (if any) through the new function setrootfstime().	2004-07-05 07:28:45 +00:00
yamt	2209153ea4	lfs_gop_write: assert that ifile never come here.	2004-05-30 20:45:44 +00:00
hannken	8c21bc6224	Add ffs internal snapshots. Written by Marshall Kirk McKusick for FreeBSD. - Not enabled by default. Needs kernel option FFS_SNAPSHOT. - Change parameters of ffs_blkfree. - Let the copy-on-write functions return an error so spec_strategy may fail if the copy-on-write fails. - Change genfs_lock() to use vp->v_vnlock instead of &vp->v_lock. - Add flag B_METAONLY to VOP_BALLOC to return indirect block buffer. - Add a function ffs_checkfreefile needed for snapshot creation. - Add special handling of snapshot files: Snapshots may not be opened for writing and the attributes are read-only. Use the mtime as the time this snapshot was taken. Deny mtime updates for snapshot files. - Add function transferlockers to transfer any waiting processes from one lock to another. - Add vfsop VFS_SNAPSHOT to take a snapshot and make it accessible through a vnode. - Add snapshot support to ls, fsck_ffs and dump. Welcome to 2.0F. Approved by: Jason R. Thorpe <thorpej@netbsd.org>	2004-05-25 14:54:55 +00:00
atatat	53c625655c	Sysctl descriptions under vfs subtree	2004-05-25 04:44:43 +00:00
atatat	1d3a6a329e	Explicitly call pool_init() (and pool_destroy()) when being built as an _LKM. This adds pools to the list of things that lkms must do manually because they're set up with link sets. Not that there's anything wrong with link sets, but that we need to try harder to remember that lkms are second class citizens. Of a sort.	2004-05-20 05:39:34 +00:00
simonb	b5d0e6bf06	Initialise (most) pools from a link set instead of explicit calls to pool_init. Untouched pools are ones that either in arch-specific code, or aren't initialiased during initial system startup. Convert struct session, ucred and lockf to pools.	2004-04-25 16:42:40 +00:00

1 2 3 4

198 Commits