NetBSD

Commit Graph

Author	SHA1	Message	Date
yamt	e8d83a0b17	make this compilable with DIAGNOSTIC and without DEBUG. fix PR 20827 from FUKAUMI Naoki.	2003-03-21 06:09:08 +00:00
yamt	a8e8f3ea02	lfs_writevnodes: in the case of "starting over", kick lfs_writeseg in order to avoid deadlock in check_dirty.	2003-03-20 14:17:21 +00:00
yamt	91ce94db76	fix "more than one fragment" panics; direct and indirect block pointers are not valid in the case of shortlinks. while i'm here, move duplicated code in lfs_vget/fastvget into a new function, lfs_vinit.	2003-03-20 14:11:46 +00:00
perseant	12a78a5a7e	Don't break out of Ifile-writing loop in lfs_segwrite until nothing is left. Note however that blocks can be added to the Ifile even when the segment block is held because of inodes' atime. Do not panic with "dirty blocks" if these blocks are present.	2003-03-20 06:51:17 +00:00
perseant	c364d884f6	Hold the segment lock during truncation to prevent indirect blocks from being written by lfs_updatemeta while lfs_truncate is also writing them, a bug pointed out by YAMAMOTO Takashi <yamt@netbsd.org>.	2003-03-20 06:47:38 +00:00
perseant	20f569dad0	Remember to destroy lfs_inoext_pool when closing up the LFS subsystem.	2003-03-18 07:53:56 +00:00
perseant	1d23f2f4ea	Add write-behind to lfs_write().	2003-03-15 07:24:37 +00:00
perseant	b105ddb1d6	Make LFS LKM versions of ufs_makeinode and ufs_mkdir fail correctly. Note dependency of lfs_vnops.o on ufs_readwrite.c.	2003-03-15 07:20:22 +00:00
perseant	ea03a1ac09	Add simple_lock protection for lfs_seglock and lfs_subsys_pages; these will be expanded to cover other per-fs and subsystem-wide data as well. Fix a case of IN_MODIFIED being set without updating lfs_uinodes, resulting in a "lfs_uinodes < 0" panic. Fix a deadlock in lfs_putpages arising from the need to busy all pages in a block; unbusy any that had already been busied before starting over.	2003-03-15 06:58:49 +00:00
kristerw	ea98786439	SO C requires a statement after a label.	2003-03-15 02:27:18 +00:00
kristerw	f73d0d2d8c	ffs_gop_alloc() is not used any more. Remove it. OK:ed by Konrad Schroder.	2003-03-15 01:10:18 +00:00
perseant	ec13062af8	- Get rid of unused #ifdefs LFS_NO_PAGEMOVE and LFS_MALLOC_SUMMARY (both always true) and accompanying dead code. - When constructing write clusters in lfs_writeseg, if the block we are about to add is itself a cluster from GOP_WRITE, don't put a cluster in a cluster, just write the GOP_WRITE cluster on its own. This seems to represent a slight performance gain on my test machine. - Charge someone's rusage for writes on LFSes. It's difficult to tell who the "right" process to charge is; just charge whoever triggered the write.	2003-03-11 02:47:39 +00:00
perseant	a46b9ccf95	Only #define LFS if not already defined.	2003-03-08 23:18:54 +00:00
perseant	0493be6ba8	Take away "#ifdef LFS_UBC".	2003-03-08 22:14:31 +00:00
perseant	8feb2c22f5	Take away "#ifdef LFS_UBC".	2003-03-08 21:46:04 +00:00
perseant	4b4f884b89	Add an lfs_strategy() that checks to make sure we're not trying to read where the cleaner is trying to write, instead of tying up the "live" buffers (or pages). Fix a bug in the LFS_UBC case where oversized buffers would not be checksummed correctly, causing uncleanable segments. Make sure that wakeup(fs->lfs_iocount) is done if fs->lfs_iocount is 1 as well as 0, since we wait in some places for it to drop to 1. Activate all pages that make it into lfs_gop_write without the segment lock held, since they must have been dirtied very recently, even if PG_DELWRI is not set.	2003-03-08 02:55:47 +00:00
perseant	d51fdbef63	Make sure we hold the uobjlock when checking for dirty pages, in lfs_vflush. Note that pages can become dirty without our knowing it, anyway; don't panic if that happens.	2003-03-04 19:19:43 +00:00
perseant	003cfbd545	Don't add dirty blocks to the ifile in lfs_segunlock, if we're trying to unmount the filesystem. This avoids a "dirty blocks" panic.	2003-03-04 19:15:26 +00:00
perseant	958a4c008c	Don't force all truncations to be synchronous	2003-03-04 19:10:35 +00:00
perseant	9192f047ac	Account SEGUSE_ACTIVE correctly so that the automatic segment cleaning actually happens. Add a new fcntl call that will write the minimum necessary to checkpoint (i.e., for on-disk directory structure to be consistent, not including updates to file data) so that the cleaner can clean segments more quickly without sacrificing three-way commit for cleaning.	2003-03-02 04:34:30 +00:00
yamt	32a79f1dd0	use pid_t for pid.	2003-03-01 11:20:21 +00:00
perseant	cfc73a5fa9	Be careful to always zero pages on truncation/fragment extension, in the case where the filesystem block size is larger than PAGE_SIZE.	2003-03-01 05:07:51 +00:00
perseant	daeb6c37d1	Make lfs_truncate handle file extension correctly, in the LFS_UBC case.	2003-02-28 07:37:56 +00:00
perseant	c418e0c4d6	Fix a clrbuf() on an uninitialized pointer.	2003-02-28 07:36:32 +00:00
perseant	a94f9407dc	Quell a hasty panic in lfs_truncate: on-inode disk addresses can be different between the beginning and end of the call.	2003-02-28 04:37:07 +00:00
perseant	0b114d4e21	Do roundup and offset arithmetic in 64 bits, to allow >=2G files.	2003-02-27 07:10:27 +00:00
perseant	6f5626d112	Make fs-specific fcntl macros take three arguments (approved wrstuden). Let LFS use fcntl for cleaner functions.	2003-02-25 23:12:06 +00:00
thorpej	eb14e86676	Add a new BUF_INIT() macro which initializes b_dep and b_interlock, and use it. This fixes a few places where either b_dep or b_interlock were not properly initialized.	2003-02-25 20:35:31 +00:00
yamt	2bad134129	fix simplelocks	2003-02-25 13:47:44 +00:00
perseant	95137c8477	Add lfs_ioctl vnode op, with ioctls to take over cleaner system call functionality (not including segment clean, since that is now done automatically as checkpoints happen).	2003-02-24 08:42:49 +00:00
simonb	2a4457bd46	Remove assigned-to but not used variable.	2003-02-23 03:32:55 +00:00
perseant	3ab94fed93	Fix a buffer overflow bug in the LFS_UBC case that manifested itself either as a mysterious UVM error or as "panic: dirty bufs". Verify maximum size in lfs_malloc. Teach lfs_updatemeta and lfs_shellsort about oversized cluster blocks from lfs_gop_write. When unwiring pages in lfs_gop_write, deactivate them, under the theory that the pagedaemon wanted to free them last we knew.	2003-02-23 00:22:33 +00:00
yamt	1dd4645db4	fix simple_lock/unlock mismatches.	2003-02-22 01:52:25 +00:00
perseant	fdf4bfe002	Tabify, and fix some comment alignment problems.	2003-02-20 04:27:23 +00:00
yamt	5f444770aa	add debug code to lfs_free.	2003-02-19 12:58:53 +00:00
yamt	65fda8e404	workaround for "another flush is..." infinity loop in writerd. if we're writerd, sleep in lfs_flush until another writer goes away instead of busy loop in writed.	2003-02-19 12:49:10 +00:00
yamt	d9a4f81d1c	wire the pages instead of just dequeue'ing them. advised by Chuck Silvers.	2003-02-19 12:22:51 +00:00
yamt	18e00c1196	init b_interlock.	2003-02-19 12:18:59 +00:00
yamt	2be86f2ff8	acquire v_interlock before calling VOP_PUTPAGES.	2003-02-19 12:02:38 +00:00
yamt	0ad89cf93e	init b_interlock.	2003-02-19 12:01:17 +00:00
soren	3291a4522e	Make libsa compile again.	2003-02-18 14:58:31 +00:00
perseant	e61877243d	Make it compile again, grr....	2003-02-18 02:00:08 +00:00
perseant	b397c875ae	Add code to UBCify LFS. This is still behind "#ifdef LFS_UBC" for now (there are still some details to work out) but expect that to go away soon. To support these basic changes (creation of lfs_putpages, lfs_gop_write, mods to lfs_balloc) several other changes were made, to wit: * Create a writer daemon kernel thread whose purpose is to handle page writes for the pagedaemon, but which also takes over some of the functions of lfs_check(). This thread is started the first time an LFS is mounted. * Add a "flags" parameter to GOP_SIZE. Current values are GOP_SIZE_READ, meaning that the call should return the size of the in-core version of the file, and GOP_SIZE_WRITE, meaning that it should return the on-disk size. One of GOP_SIZE_READ or GOP_SIZE_WRITE must be specified. * Instead of using malloc(...M_WAITOK) for everything, reserve enough resources to get by and use malloc(...M_NOWAIT), using the reserves if necessary. Use the pool subsystem for structures small enough that this is feasible. This also obsoletes LFS_THROTTLE. And a few that are not strictly necessary: * Moves the LFS inode extensions off onto a separately allocated structure; getting closer to LFS as an LKM. "Welcome to 1.6O." * Unified GOP_ALLOC between FFS and LFS. * Update LFS copyright headers to correct values. * Actually cast to unsigned in lfs_shellsort, like the comment says. * Keep track of which segments were empty before the previous checkpoint; any segments that pass two checkpoints both dirty and empty can be summarily cleaned. Do this. Right now lfs_segclean still works, but this should be turned into an effectless compatibility syscall.	2003-02-17 23:48:08 +00:00
perseant	83adb24145	Allow negative values other than UNASSIGNED to be returned from ufs_bmap; fixes a bug introduced in the 64-bit daddr_t conversion, that manifests itself in LFS with kernels compiled with the FFS_EI option.	2003-02-09 03:26:59 +00:00
pk	338f31f581	Make the buffer cache code MP-safe.	2003-02-05 21:38:38 +00:00
perseant	14c17e57b4	Don't call a dirop within a dirop: if lfs_rename is actually deleting a link, call lfs_remove directly before starting dirop rather than having ufs_rename do it.	2003-02-03 00:32:35 +00:00
tron	f1eeaa9020	Only use MALLOC_DECLARE() in kernel namespace.	2003-02-01 18:34:14 +00:00
thorpej	b193480908	Add extensible malloc types, adapted from FreeBSD. This turns malloc types into a structure, a pointer to which is passed around, instead of an int constant. Allow the limit to be adjusted when the malloc type is defined, or with a function call, as suggested by Jonathan Stone.	2003-02-01 06:23:35 +00:00
yamt	84d61a1dc4	there's no need to treat VOP_WHITEOUT as dirop because it modifies only one inode.	2003-01-30 14:18:32 +00:00
yamt	53d6eb47ee	don't use daddr_t for segment summary since it's an on-disk structure.	2003-01-29 13:14:33 +00:00
simonb	0adecbd12b	Remove variable that is only assigned to but not referenced.	2003-01-29 03:06:40 +00:00
yamt	e41d3a6f1c	make these compilable with lfs debug options. (follow daddr_t change) XXX maybe segment number should be 64bit.	2003-01-27 23:17:56 +00:00
kleink	865868a8b1	Further printf format fixes in the wake of daddr_t. Note that PRI?64 and long long int arguments aren't made for each other, nor are %lld and int64_t arguments.	2003-01-27 21:45:52 +00:00
tsutsui	daf84696c6	More printf format cleanup to reduce casts.	2003-01-26 06:42:31 +00:00
kleink	4e0e5333ae	Fix further printf format warnings for DEBUG, in the wake of daddr_t having changed.	2003-01-25 23:00:09 +00:00
tron	5067836b9e	Use PRId64 instead of hard coding "%lld" to fix build problems under LP64 ports.	2003-01-25 18:12:31 +00:00
fvdl	a138610cac	The oldblks and newblks arrays are used to store direct copies of on-disk block pointers, so they should be int32_t. Error found by Izumi Tsutsui.	2003-01-25 16:40:28 +00:00
tron	63dda858c6	Fix printf() format strings problems caused by "daddr_t" change.	2003-01-25 12:50:38 +00:00
fvdl	a3ff3a3038	Bump daddr_t to 64 bits. Replace it with int32_t in all places where it was used on-disk, so that on-disk formats remain the same. Remove ufs_daddr_t and ufs_lbn_t for the time being.	2003-01-24 21:55:02 +00:00
thorpej	b78f59b443	Merge the nathanw_sa branch.	2003-01-18 08:51:40 +00:00
yamt	03e1a46833	- zerofill struct lfs when allocating it. - use M_ZERO instead of memset after malloc.	2003-01-12 13:04:52 +00:00
yamt	5f254d46cc	backout wrong assertions that i added.	2003-01-08 17:16:52 +00:00
yamt	99d625b53c	for lfs_remove/lfs_rmdir, keep removed vnodes marked VDIROP. (backout parts of rev.1.40) otherwise, directory structures can be corrupted because checkpoints can occur via eg. lfs_vflush before parent directory is written.	2003-01-08 17:14:58 +00:00
yamt	69f1c0cb29	in set_dirop/endop, use normal vref/vrele instead of lfs versions so that we don't miss lfs_inactivate.	2003-01-08 15:43:29 +00:00
yamt	ee36fccabb	add assertions.	2003-01-08 15:40:54 +00:00
yamt	49d2b56b43	use lfs_unmark_vnode instead of duplicated code fragments.	2003-01-08 15:40:04 +00:00
wiz	1035faff1d	writable, not writeable.	2003-01-06 20:30:28 +00:00
chs	822a8f2c0f	several bugs: - move calls to softdep_setup_pagecache() (which can sleep to allocate memory) outside the softdep lock. - replace the softdep_flush_indir() hack (which tries to find another vnode to fsync when we are holding lots of buffer-cache buffers locked for long periods of time) with softdep_trackbufs() (which just kicks the syncer and sleeps under the same circumstances). the former method had a lock-ordering problem which would occasionally deadlock. - relax the assertion in softdep_sync_metadata() which says that we should never see D_ALLOCDIRECT deps for VREG vnodes. it's ok to see those attached to indirect blocks. also, there's no need to splbio() while allocating the buffer headers to which pagecache dependencies are attached, so remove that. fixes all the problems in PR 19288.	2003-01-01 23:08:56 +00:00
yamt	a5bf83bbfc	don't set vnode type to VNON in error case of ufs_makeinode. (backout rev.1.74) it seems that there's no need to do it (anymore?) and LFS has trouble with it. (VNON vnodes marked VDIROP will never reclaimed) ok'ed by Frank van der Linden.	2002-12-31 15:00:18 +00:00
yamt	140a8e56ca	write ifile only when it has dirty buffers.	2002-12-31 14:54:32 +00:00
yamt	cb9613feef	comment and assertions	2002-12-30 05:34:17 +00:00
yamt	6fc496c67a	move check of lfs_unlockvp from lfs_reserveavail to lfs_reserve because lfs_reservebuf needs same check as well.	2002-12-30 05:31:53 +00:00
yamt	a05fbf74c0	fix vref/vunref mismatch.	2002-12-29 14:08:12 +00:00
yamt	88ae33f9e0	backout assertions in lfs_inactive. they can be false when unmounting forcibly.	2002-12-29 07:05:55 +00:00
christos	ae2bf40b7e	fix compile problem.	2002-12-28 20:08:36 +00:00
yamt	d840722863	avoid warnings without DIAGNOSTIC. pointed by Andreas Wrede.	2002-12-28 17:22:47 +00:00
yamt	a428d8a5af	dirop inode can't be passed to lfs_inactivate.	2002-12-28 15:12:26 +00:00
yamt	59be5399b7	- in lfs_reserve, vref vnodes that we're locking so that cleaner doesn't try to reclaim them. (workaround for deadlock noted in the comment in lfs_reserveavail) - in lfs_rename, mark vnodes which are being moved as well as directry vnodes.	2002-12-28 14:39:08 +00:00
hannken	c122326822	Clear IN_SPACECOUNTED on (re-)used inodes. This cures the "unmount pending error:" on softdep umounts. Approved by: Frank van der Linden <fvdl@netbsd.org>	2002-12-27 16:07:13 +00:00
yamt	4b9c604ba7	- in lfs_reserve, reserve locked buffer count as well. - don't wait for locking buf in lfs_bwrite_ext to avoid deadlocks. - skip lfs_reserve when we're doing dirop. reserve more (for lfs_truncate) in set_dirop instead. this mostly solves PR 18972. (and hopefully PR 19196)	2002-12-26 13:37:18 +00:00
yamt	e9bd1836a5	don't try to write all blocks passed to lfs_markv at once since it likely causes buf starvation.	2002-12-26 13:04:39 +00:00
yamt	362c57a2d2	add a XXX comment. (description of possible deadlock)	2002-12-22 17:31:52 +00:00
yamt	4370be165c	add a XXX comment	2002-12-21 05:35:54 +00:00
yamt	0d95cc5d66	correct/add assertion.	2002-12-18 14:05:50 +00:00
yamt	beff0dd387	#if 0 out vnode unlock/lock in lfs_reserve for now and add a comment about it. deadlock is better than corruption (or panic), IMO.	2002-12-17 15:23:37 +00:00
yamt	a999523301	no need for cleaner to hold vnode locks. cleaner and normal vnode operations are synchronized enough by seglock/fraglock and buf's B_BUSY-ness.	2002-12-17 14:37:49 +00:00
yamt	b2d5b49e2b	use ufs_daddr_t instead of int where appropriate.	2002-12-17 14:28:54 +00:00
yamt	a79cb6db43	- in lfs_bwrite_ext, if we're cleaner, mark inode IN_CLEANING rather then IN_MODIFIED. otherwise cleaned (indirect) blocks belongs to the inode isn't written until next sync. - add assertions.	2002-12-14 13:41:25 +00:00
yamt	e5ea55e4ea	in lfs_writefile, check v_type==VNON earlier. to avoid null dereference with DEBUG_LFS_VERBOSE.	2002-12-14 11:54:47 +00:00
yamt	8fe8a4ced8	save a segment write when doing checkpoint.	2002-12-13 14:40:02 +00:00
yamt	275b3a47a2	correct DIAGNOSTIC code for duplicated inodes in a segment and su_nbytes.	2002-12-12 12:28:13 +00:00
yamt	9097ffce96	take care of B_CLRBUF in lfs_balloc. otherwise you'll see uninitialized blocks.	2002-12-11 13:34:14 +00:00
matt	60db16d1ff	Add multiple inclusion protection for headers. Fix mismatched variable declarations (missing const's) as needed.	2002-12-01 00:12:06 +00:00
kristerw	fa033b67e7	Softdep is mature enough that it shouldn't define DEBUG and DIAGNOSTIC unconditionally.	2002-11-30 20:27:50 +00:00
yamt	2331faab98	more XXX comment.	2002-11-27 11:36:40 +00:00
lukem	0635de35a3	Remove KDIR=, since SYS_INCLUDE=symlinks and KDIR are not supported any more.	2002-11-26 23:30:07 +00:00
yamt	42a8b926f9	eliminate i_ino from in-core inode and use local variable instead. ok'ed by Frank van der Linden.	2002-11-26 01:23:30 +00:00
thorpej	d6f8cc841d	Avoid strict-alias warnings.	2002-11-25 01:55:21 +00:00
thorpej	b409a344b5	Avoid strict-alias warnings.	2002-11-25 01:44:21 +00:00
yamt	3af39ea015	in lfs_fakebuf, make corresponding buffer busy to avoid reading blocks that isn't written yet. it's needed because we'll update metadatas in lfs_updatemeta before data pointed by them is actually written to disk. XXX should be solved with fake inode/indirect blocks instead?	2002-11-24 16:39:13 +00:00
yamt	290fa35864	add a XXX comment to lfs_reserve. * it isn't safe to unlock vp here * because we're passing data using inode from namei. * (eg. i_offset)	2002-11-24 16:09:50 +00:00
scw	7009056578	Quell an uninitialised variable warning.	2002-11-24 11:09:13 +00:00
yamt	eca07565c3	make sure i_lfs_fragsize is initialized. fix panic "lfs_writefile: more than one fragment!" PR 18974.	2002-11-24 08:43:26 +00:00
yamt	16a26d41e8	lfs_sync should wait at lfs_writer, not lfs_dirops. PR 18973.	2002-11-24 08:37:43 +00:00
yamt	feacf34c09	lfs_reserve shouldn't block for lfs_unlockvp. otherwise cleaner deadlocks. PR 19134.	2002-11-24 08:32:22 +00:00
yamt	37b4f42285	blksize() macro shouldn't used for indirect blocks. this fixes "getblk: block size invariant failed" panic. PR 18977.	2002-11-24 08:27:00 +00:00
yamt	7d0ba73802	correct locking for lfs_rmdir. PR 18976.	2002-11-24 08:23:41 +00:00
wiz	29d58d0333	s/sqiud/squid/ in comment, reported by skrueger at europe com.	2002-11-04 16:59:37 +00:00
dbj	1d1cd19e5f	use be32toh instead of ntohl, etc.	2002-11-02 19:31:09 +00:00
kristerw	58efa0630e	Removed unused variables doclusterread and doclusterwrite.	2002-11-01 21:11:43 +00:00
chs	ea6ddab6a8	the work-around in rev. 1.37 (turn off async) wasn't enough to prevent hangs under heavy load. so we now apply the more extreme version: make MFS mounts "sync". fixes PRs 17128 and 17321.	2002-10-24 16:41:00 +00:00
jdolecek	f040fc2e24	ext2fs_remove(): use 'else' to eliminate need for goto (and improve readibility, even)	2002-10-23 19:52:16 +00:00
jdolecek	e0cc03a09b	merge kqueue branch into -current kqueue provides a stateful and efficient event notification framework currently supported events include socket, file, directory, fifo, pipe, tty and device changes, and monitoring of processes and signals kqueue is supported by all writable filesystems in NetBSD tree (with exception of Coda) and all device drivers supporting poll(2) based on work done by Jonathan Lemon for FreeBSD initial NetBSD port done by Luke Mewburn and Jason Thorpe	2002-10-23 09:10:23 +00:00
yamt	8872a5d6af	make sure to update the vnode's size even if uiomove failed. otherwise, softdep states can't be flushed later. ok'ed by Chuck Silvers. fix PR/16670.	2002-10-18 01:05:52 +00:00
dbj	43395bd5a8	Add support for the Apple UFS variation on ffs This is the bulk of PR #17345 The general approach is to use a run time deteriminable value for DIRBLKSIZ. Additional allowances are included for using MAXSYMLINKLEN with FS_42INODEFMT and a shift in the cylinder group cluster summary count array. Support is added for managing the Apple UFS volume label.	2002-09-28 20:11:05 +00:00
provos	0f09ed48a5	remove trailing \n in panic(). approved perry.	2002-09-27 15:35:29 +00:00
simonb	ad2a80f193	Move a brace that is in the wrong position when changes from FreeBSD were added in rev 1.51. This may fix the "N lost blocks" problem some people have noticed. Reviewed by fvdl.	2002-09-26 21:35:27 +00:00
jdolecek	a120eaa3ea	use ufs_balloc_range() rather than local (mostly identical, but with some bugs) ext2fs variant	2002-09-26 11:06:36 +00:00
thorpej	71404bb533	Don't include <sys/map.h>.	2002-09-25 22:21:01 +00:00
jdolecek	e305eb63e8	don't need <sys/conf.h> here	2002-09-22 19:32:54 +00:00
christos	6f3945a88d	MNT_GETARGS support	2002-09-21 18:10:34 +00:00
gehenna	77a6b82b27	Merge the gehenna-devsw branch into the trunk. This merge changes the device switch tables from static array to dynamically generated by config(8). - All device switches is defined as a constant structure in device drivers. - The new grammer ``device-major'' is introduced to ``files''. device-major <prefix> char <num> [block <num>] [<rules>] - All device major numbers must be listed up in port dependent majors.<arch> by using this grammer. - Added the new naming convention. The name of the device switch must be <prefix>_[bc]devsw for auto-generation of device switch tables. - The backward compatibility of loading block/character device switch by LKM framework is broken. This is necessary to convert from block/character device major to device name in runtime and vice versa. - The restriction to assign device major by LKM is completely removed. We don't need to reserve LKM entries for dynamic loading of device switch. - In compile time, device major numbers list is packed into the kernel and the LKM framework will refer it to assign device major number dynamically.	2002-09-06 13:18:43 +00:00
thorpej	139cdc3125	Make nbuf, nswbuf, and bufpages unsigned. Make all operations on these variables unsigned, and update places where their values are printed.	2002-08-25 20:21:33 +00:00
itojun	8dd04cdcd7	correct range check, have overflow check, fix type mismatches, for cmap args and some other calls. from openbsd	2002-08-03 00:12:48 +00:00
soren	178d83d503	Die, qaddr_t, die! - mnt_data in struct mount is already effectively a void *, so stop pretending otherwise.	2002-07-30 07:40:07 +00:00
wiz	645df36eff	Spell '[Rr]ight' correctly. From Jim Bernard.	2002-07-26 14:11:34 +00:00
hannken	7de36862a8	Rename bufq_init() to bufq_alloc(). Add bufq_free() to remove a buffer queue. Avoid MALLOC while holding a spinlock. From Chuck Silvers.	2002-07-21 15:32:17 +00:00
hannken	d4c062b4cc	Convert to new device buffer queue interface.	2002-07-19 16:26:01 +00:00
perseant	8f30dc2c9b	Remove lying comment on SEGM_PROT seglock.	2002-07-11 21:09:00 +00:00
briggs	77f5558791	Fix a printf format warning.	2002-07-07 14:29:06 +00:00
fredette	10d4232908	Fixed a printf argument type.	2002-07-06 15:39:07 +00:00
perseant	32ae84b188	Deal with fragment size changes better. For each fragment that can exist on an on-disk inode, we keep a record of its size in struct inode, which is updated when we write the block to disk. The cleaner routines thus have ready access to what size is the correct size for this block, on disk. Fixed a related bug: if a file with fragments is being cleaned (fragments being cleaned) at the same time it is being extended beyond NDADDR blocks, we could write a bogus FINFO record that has a frag in the middle; when it was cleaned this would give back bogus file data. Don't write the indirect blocks in this case, since there is no need. lfs_fragextend and lfs_truncate no longer require the seglock, but instead take a shared lock, which the seglock locks exclusively.	2002-07-06 01:30:11 +00:00
scw	881a4dcac0	Cast pointers first to uintptr_t before casting to register_t. On SH-5, sizeof(register_t) is always 8, even if sizeof(void *) is 4 as is the case when compiling for ILP32.	2002-07-05 13:49:26 +00:00
yamt	d566a58b5e	fix printf format for DEBUG_LFS.	2002-07-02 19:07:03 +00:00
perseant	0418a2c352	Fix miscalculation in lfs_fits found by Trevin Beattie <trevin@xmission.com>. Change some of the variable names from "nb", "db" to "fsb" to reflect their calling conventions.	2002-06-20 22:10:24 +00:00
perseant	ae37d9d186	Don't bomb out of lfs_bmapv if the caller is requesting blocks that live in the current segment. There's nothing wrong with this, and it is necessary for the correct operation of the coaleascer.	2002-06-20 20:43:17 +00:00
jdolecek	20644ff75f	clear_inodedeps(): use CIRCLEQ_FOREACH() appropriately	2002-06-18 20:24:31 +00:00
perseant	ddfb1dbb92	For synchronous writes, keep separate i/o counters for each write, so processes don't have to wait for one another to finish (e.g., nfsd seems to be a little happier now, though I haven't measured the difference). Synchronous checkpoints, however, must always wait for all i/o to finish. Take the contents of the callback functions and have them run in thread context instead (aiodoned thread). lfs_iocount no longer has to be protected in splbio(), and quite a bit less of the segment construction loop needs to be in splbio() as well. If lfs_markv is handed a block that is not the correct size according to the inode, refuse to process it. (Formerly it was extended to the "correct" size.) This is possibly more prone to deadlock, but less prone to corruption. lfs_segclean now outright refuses to clean segments that appear to have live bytes in them. Again this may be more prone to deadlock but avoids corruption. Replace ufsspec_close and ufsfifo_close with LFS equivalents; this means that no UFS functions need to know about LFS_ITIMES any more. Remove the reference from ufs/inode.h. Tested on i386, test-compiled on alpha.	2002-06-16 00:13:15 +00:00
chs	ea4c4a989f	allow read-only mounts even if we can't read the last fragment of the fs. this enables one to recover data from a failing disk (where the read failure is a hardware problem) while avoiding corrupting the fs further (in the case where the read failure is due to a misconfiguration).	2002-06-09 16:46:49 +00:00
perseant	c13ae45a2a	Let lfs_bmapv fill in the bi_size member of the BLOCK_INFO structure, as well as bi_daddr. This lets the cleaner have an idea of what the size of this block was at the time it was written without having to refer to a segment header (e.g., in the file coalescing case). Tested on i386.	2002-06-06 00:46:24 +00:00
chs	fffb1de109	get the units right when computing a blkno in the ENOSPC path for allocations involving indirect blocks. spotted by Trevin Beattie <trevin@xmission.com>.	2002-06-05 05:23:51 +00:00
thorpej	7903aba812	#if 0 a test that is always false (and the XXX comment above it indicates so).	2002-05-30 18:54:55 +00:00
perseant	d67a5bbb21	Fix a couple of instances where reassignbuf() was not done at splbio. Tested on i386.	2002-05-24 22:13:57 +00:00
perseant	43ca783b4a	Back out rev 1.174 of vfs_subr.c, because the splbio() wasn't protecting enough to be useful, and broadening it so that it did would have meant that operations possibly requiring synchronous disk activity would have to be done in splbio(). This clearly was not going to work. Worked around this in the LFS case by having lfs_cluster_callback put an extra hold on the vnode before calling biodone(), and taking the hold off without HOLDRELE's problematic list swapping. lfs_vunref() will take care of that---in thread context---on the next write if need be. Also, ensure that the list walking in lfs_{writevnodes,segunlock,gather} takes into account the possibility that the list may change underneath it (possibly because it itself deleted an element). Tested on i386, test-compiled on alpha.	2002-05-23 23:05:25 +00:00
perseant	ec0ca919be	Protect v_freelist with splbio(), since HOLDRELE can be called in interrupt context (through brelvp). (LFS may be the only subsystem affected by this problem.) Tested on i386.	2002-05-20 22:50:57 +00:00
perseant	36efaa3565	use macros from <sys/queue.h>	2002-05-17 21:42:38 +00:00
thorpej	6c1654256e	Fix LP64 printf format warning.	2002-05-16 02:23:55 +00:00
perseant	8886b0f4b2	Phase one of my three-phase plan to make LFS play nice with UBC, and bug-fixes I found while making sure there weren't any new ones. * Make the write clusters keep track of the buffers whose blocks they contain. This should make it possible to (1) write clusters using a page mapping instead of malloc, if desired, and (2) schedule blocks for rewriting (somewhere else) if a write error occurs. Code is present to use pagemove() to construct the clusters but that is untested and will go away anyway in favor of page mapping. * DEBUG now keeps a log of Ifile writes, so that any lingering instances of the "dirty bufs" problem can be properly debugged. * Keep track of whether the Ifile has been dirtied by various routines that can be called by lfs_segwrite, and loop on that until it is clean, for a checkpoint. Checkpoints need to be squeaky clean. * Warn the user (once) if the Ifile grows larger than is reasonable for their buffer cache. Both lfs_mountfs and lfs_unmount check since the Ifile can grow. * If an inode is not found in a disk block, try rereading the block, under the assumption that the block was copied to a cluster and then freed. * Protect WRITEINPROG() with splbio() to fix a hang in lfs_update.	2002-05-14 20:03:53 +00:00
mycroft	1523c4c12f	In ufs_mkdir(), write the data block before updating the inode with the block pointer, to prevent "DIRECTORY CORRUPTED" errors from fsck(8). Note: The behavior in the softdep case is unchanged, but needs to be fixed.	2002-05-14 17:37:52 +00:00
matt	fed7110558	Commit out code that's no longer used.	2002-05-14 02:46:22 +00:00
matt	0cb85bc7b9	Eliminate commons.	2002-05-12 23:06:27 +00:00
enami	b86c56a0b6	Add comment that getblk() in ufs_bmaparray() returns an error only if we are pagedaemon.	2002-05-11 12:23:53 +00:00
chs	e926e6ec99	use the correct size when zeroing an array.	2002-05-05 17:01:41 +00:00
chs	dcc6963777	for softdep vnodes, always write together the pages for any block that might have a dependency , since the accounting doesn't work otherwise. fixes PRs 15364 16336 16448.	2002-05-05 17:00:06 +00:00
perseant	76d2795556	Make exported LFSes not panic on the first file create.	2002-04-27 01:00:46 +00:00
thorpej	37dc008ca3	Cleanup how file system configuration information is declared, grouping related information together, with the file system code itself. This is just low-hanging fruit -- more to come.	2002-04-16 23:14:05 +00:00
mycroft	fd303c4dc5	Add a special case for nrpos=1 to cbtorpos(). This massively reduces CPU usage by newfs(8) -- and fsck_ffs(8) on a relatively empty file system. There is still one divide left in the inner loops, to calculate cylno values.	2002-04-10 14:31:07 +00:00
mycroft	afc5d40400	Use blkstofrags() and fragstoblks(). Use &(NBBY-1) rather than %NBBY. Switch off of fs_fragshift rather than fs_frag (generates better jump tables).	2002-04-10 08:05:11 +00:00
mycroft	0a9b835878	Use fsbtodb() rather than multiplying by NSPF().	2002-04-10 07:46:10 +00:00
enami	89cf6e2727	Hold an extra reference if updating and args.fspec == NULL.	2002-04-01 07:51:58 +00:00
christos	e356d686bb	Fixes from enami: - If VOP_ACCESS fails when updating mount, we will vrele() twice. - The check for update-only flags in mp->mnt_flag when not updating case is bogus. If we really want to check, we need to see flags in ufs_args, but I'm not sure if it is really necessary. - The credential passed to ffs_reload was credential of when looking up mount point, but now it is credential of when looking up device node. Anyway, it may be current process's credential.	2002-04-01 01:52:44 +00:00
christos	919d9f5617	PR/16136: Chris Jepeway: Bogus entry in /etc/fstab can panic kernel.	2002-03-31 20:53:25 +00:00
chs	fe10bac175	if the size argument to write(2) is 0, do not modify the file in any way, including updating timestamps. required for standards conformance.	2002-03-25 02:23:55 +00:00
chs	0f2018fc31	in lfs_write(), flush and invalidate any page cache pages in the range that we're about to modify. this weak attempt at coherency is enough to make some applications (eg. "tail -f") happy, so it's worth having.	2002-03-22 03:57:35 +00:00
wiz	358ed3f6d4	Fix a typo, a KNF-nit, and simplify a printf format string.	2002-03-18 13:38:52 +00:00
chs	79c365e60e	don't do any flush-behind for async mounts. this matches the traditional behaviour.	2002-03-17 23:58:09 +00:00
chs	c1d184702f	when mounting a filesystem, read the last block in the filesystem to verify that the device is at least as big as the superblock claims the filesystem is supposed to be, and if it's not then fail the mount. this should help reduce the type of confusion reported in PR 13228.	2002-03-17 00:02:34 +00:00
thorpej	a180cee23b	Pool deals fairly well with physical memory shortage, but it doesn't deal with shortages of the VM maps where the backing pages are mapped (usually kmem_map). Try to deal with this: * Group all information about the backend allocator for a pool in a separate structure. The pool references this structure, rather than the individual fields. * Change the pool_init() API accordingly, and adjust all callers. * Link all pools using the same backend allocator on a list. * The backend allocator is responsible for waiting for physical memory to become available, but will still fail if it cannot callocate KVA space for the pages. If this happens, carefully drain all pools using the same backend allocator, so that some KVA space can be freed. * Change pool_reclaim() to indicate if it actually succeeded in freeing some pages, and use that information to make draining easier and more efficient. * Get rid of PR_URGENT. There was only one use of it, and it could be dealt with by the caller. From art@openbsd.org.	2002-03-08 20:48:27 +00:00
simonb	9a942a34e0	Don't use local extern declarations for the mountroot variable or declare local prototypes for nfs_mountroot() or md_root_setconf().	2002-03-04 02:25:21 +00:00
pooka	360cafaddb	Don't add fs->fs_pendingblocks to f_bavail twice. It's already included in f_bfree, which is added to f_bavail. Fixes problem with statfs reporting too much free space for filesystems which have files pending to be freed by softdeps.	2002-02-28 21:59:23 +00:00
enami	70ca5d5195	Record some page cache related information into ubchist.	2002-02-22 08:23:16 +00:00
wiz	c809c3243b	Fix two problems with softdep_typenames (missing entry, wrong boundary check). Okayed by fvdl.	2002-02-14 00:49:56 +00:00
perseant	f41358613c	Include the space taken by inodes in the count made by lfs_check(); make VOP_SETATTR call lfs_check. This prevents large numbers of inode changes (say, at the end of tar(1)) from filling the buffer cache.	2002-02-11 02:47:29 +00:00
chs	94cfc87907	bring in the change from FreeBSD's rev. 1.107 of this file: date: 2002/02/07 00:54:32; author: mckusick; state: Exp; lines: +10 -7 Occationally deleted files would hang around for hours or days without being reclaimed. This bug was introduced in revision 1.95 dealing with filenames placed in newly allocated directory blocks, thus is not present in 4.X systems. The bug is triggered when a new entry is made in a directory after the data block containing the original new entry has been written, but before the inode that references the data block has been written. Submitted by: Bill Fenner <fenner@research.att.com> This should fix NetBSD PR 15531.	2002-02-10 18:06:03 +00:00
lukem	caa29fae38	#undef DIRBLKSIZ before #define-ing it	2002-02-06 15:44:49 +00:00
perseant	8ded9a2c7d	Correct free list tail pointer, when adding blocks of new inodes to v2 filesystems. Should fix PR #14408.	2002-02-04 03:32:16 +00:00
chs	eecf9e208a	fix PR 15299 by making MFS filesystems not be "async". in the longer term, MFS needs to be made a lot more VM-friendly.	2002-02-03 03:51:57 +00:00
tv	880a2cf970	These sources are pulled into makefs(8), so we need config.h and protection for __KERNEL_RCSID().	2002-01-31 19:19:22 +00:00
tv	5d28098c5b	Revert previous. This is actually being done a better way.	2002-01-31 19:18:18 +00:00
tv	8ec192426e	For makefs, only include <machine/bswap.h> if it exists.	2002-01-31 19:17:02 +00:00
tv	af3dca1ea8	#undef MAXNAMLEN before defining it; this lets ufs/ufs/dir.h be used properly on non-NetBSD hosts with makefs(8).	2002-01-31 19:16:34 +00:00
chs	1b454dbb4f	fix an error case.	2002-01-26 08:32:05 +00:00
enami	ac35ac58f5	- For CIRCLEQ, comparing the loop variable against NULL doesn't make sense. - Minor KNF while I'm here. # This doesn't fix real problems though.	2002-01-18 00:30:03 +00:00
enami	9ad4436bc2	Fix typo which prevents diagnostic test from working.	2002-01-16 08:33:12 +00:00
lukem	25ca00a979	Only pull in <sys/systm.h> #ifdef _KERNEL, since it's a kernel only header. In the ! _KERNEL case, provide own prototype for panic() instead.	2002-01-09 23:51:00 +00:00
lukem	202e920175	revert part of rev 1.14 - #include <ufs/ufs/dinode.h> - because that makes it MUCH more difficult to reference this file stand-alone.	2002-01-07 15:25:22 +00:00
thorpej	fdb5b56e5f	Do not compare an integer to NULL.	2001-12-31 21:37:22 +00:00
fvdl	a833eaf1fe	XXXX temporary measure: in the case of a softdep 'unmount pending error', do not mark the filesystem clean, as this will mean that one or more files were likely not completely removed (will show up as unconnected in fsck). Prevents filesystems from being marked clean while they're not until this problem has been figured out.	2001-12-30 15:46:53 +00:00
fvdl	d0f7c6fb96	Use softdep_change_linkcnt to note that the inode mode was set to 0. From FreeBSD.	2001-12-27 01:48:38 +00:00
fvdl	c9218f8686	The softdep code sometimes use vfs_vget .. vput. For removals, these would result in a vop_inactive call for the vnode each time, resulting in vinvalbuf->fsync. The original softdep code avoided the fsync in vinvalbuf by not calling it if there were no dirty blocks. This was changed in NetBSD. Also, flush_inodedeps was changed to mark the inode as modified so that it would do an inode update and flush the last one. This combination basically caused a sync write for each removed file in an rm -rf (showing up delayed from the syncer a lot of the time). If called from vinvalbuf (FSYNC_RECLAIM), and there were no dirty blocks or pages to begin with, still do everything as normal, so that possible dirty blocks in transit to disk are properly waited for, etc, but don't pass UPDATE_WAIT to VOP_UPDATE, since there is no need for it in that case.	2001-12-27 01:44:59 +00:00
fvdl	2b5fe12a98	Pull over one missed fix from FreeBSD wrt. running out of quota. Also reshuffle some code a bit to make it look more similar (no functional change).	2001-12-27 01:29:05 +00:00
fvdl	08f29df58c	As pointed out by mycroft and reflected in the comment, update the directory inode before creating the new entry (not the freshly alloced directory which isn't linked anywhere yet).	2001-12-23 16:16:59 +00:00
fvdl	983d322c7c	Fix botch in my original softdep code merge: remove redundant (and synchronous to boot) VOP_UPDATE call.	2001-12-23 14:00:21 +00:00
fvdl	f1db177e10	Fix from FreeBSD that I missed: speed up handling of short-lived files a bit.	2001-12-23 11:54:46 +00:00
chs	2ddcad30f6	process the delayed-free queue more often.	2001-12-23 08:53:46 +00:00
fvdl	68728c0901	ffs_reload may be called after an old fsck has run, and the pending* fields may not be zero. Just reset them silently, it's not an error.	2001-12-19 15:20:19 +00:00
fvdl	3d8b2ffe36	Bring over fixes from FreeBSD that weren't incorporated yet, mainly from Kirk McKusick. They implement taking pending block/inode frees into account for the sake of correct statfs() numbers, and adding a new softdep type (newdirblk) to correctly handle newly allocated directory blocks. Minor additional changes: 1) swap the newly introduced fs_pendinginodes and fs_pendingblock fields in ffs_sb_swap, and 2) declare lkt_held in the debug version of the softdep lock structure volatile, as it can be modified from interrupt context #ifdef DEBUG.	2001-12-18 10:57:21 +00:00
chs	0d70d731c2	use the new compatibility routines to allow mmap() to work (in the same non-coherent fashion that it worked pre-UBC) until someone has time to do it the right way.	2001-12-18 07:51:16 +00:00
chs	03dd7ce1e8	when truncating a file, make sure the last block of the file is actually allocated, since other parts of the code assume this.	2001-12-18 06:50:28 +00:00
chs	5a690c92a1	add a VOP_PUTPAGES method for all the filesystems that don't have pages, just unlock the interlock.	2001-12-06 04:27:40 +00:00

... 2 3 4 5 6 ...

962 Commits