NetBSD

Author	SHA1	Message	Date
jdolecek	a0deb3e9c4	move union filesystem code from sys/miscfs/union to sys/fs/union	2003-03-16 08:26:46 +00:00
enami	464f962310	Release the hash lock on failure.	2003-03-15 00:22:47 +00:00
tron	98e6fa0c08	Teach procfs_allocvp() about Puptime to avoid panics if "/proc/uptime" is opened.	2003-03-04 18:55:02 +00:00
jdolecek	b49d58ec90	use different wmesg for the reader and the writer	2003-03-02 18:54:50 +00:00
hannken	11d5f11e82	Change "%llu" to "PRIu64" to make it compile on sparc64.	2003-02-27 12:20:28 +00:00
jrf	a2d850baf8	This addresses PR kern/19989. Thanks to hamajima@nagoya.ydc.co.jp for submitting this patch which enables /proc/uptime for linux emul. Patch reviewed by atatat@netbsd.org and tron@netbsd.org, approved by tron@netbsd.org.	2003-02-25 21:00:31 +00:00
thorpej	eb14e86676	Add a new BUF_INIT() macro which initializes b_dep and b_interlock, and use it. This fixes a few places where either b_dep or b_interlock were not properly initialized.	2003-02-25 20:35:31 +00:00
pk	2931081a79	Make updating a file's reference and use count MP-safe.	2003-02-23 14:37:32 +00:00
simonb	ac161ae918	Remove assigned-to but not used variable.	2003-02-23 04:20:06 +00:00
perseant	b397c875ae	Add code to UBCify LFS. This is still behind "#ifdef LFS_UBC" for now (there are still some details to work out) but expect that to go away soon. To support these basic changes (creation of lfs_putpages, lfs_gop_write, mods to lfs_balloc) several other changes were made, to wit: * Create a writer daemon kernel thread whose purpose is to handle page writes for the pagedaemon, but which also takes over some of the functions of lfs_check(). This thread is started the first time an LFS is mounted. * Add a "flags" parameter to GOP_SIZE. Current values are GOP_SIZE_READ, meaning that the call should return the size of the in-core version of the file, and GOP_SIZE_WRITE, meaning that it should return the on-disk size. One of GOP_SIZE_READ or GOP_SIZE_WRITE must be specified. * Instead of using malloc(...M_WAITOK) for everything, reserve enough resources to get by and use malloc(...M_NOWAIT), using the reserves if necessary. Use the pool subsystem for structures small enough that this is feasible. This also obsoletes LFS_THROTTLE. And a few that are not strictly necessary: * Moves the LFS inode extensions off onto a separately allocated structure; getting closer to LFS as an LKM. "Welcome to 1.6O." * Unified GOP_ALLOC between FFS and LFS. * Update LFS copyright headers to correct values. * Actually cast to unsigned in lfs_shellsort, like the comment says. * Keep track of which segments were empty before the previous checkpoint; any segments that pass two checkpoints both dirty and empty can be summarily cleaned. Do this. Right now lfs_segclean still works, but this should be turned into an effectless compatibility syscall.	2003-02-17 23:48:08 +00:00
pk	338f31f581	Make the buffer cache code MP-safe.	2003-02-05 21:38:38 +00:00
jdolecek	b327a5601a	don't bother special-casing DTYPE_KQUEUE/DTYPE_MISC nor panic for unknown descriptors; just return with EOPNOTSUPP for any unsupported descriptor type	2003-02-03 22:29:07 +00:00
jdolecek	b1fc5ffac3	procfs_allocvp(): * do not set vpp unless successful, otherwise we'd trigger DIAGNOSTIC panic in lookup(9) on error return * on error, make sure to free malloc'ed memory and ungetnewvnode() the previously acquired vnode. This fixes panic on 'tail -f <file> &; ls -l /proc/$!/fd' reported by Andrew Brown. Fix reviewed by Christos Zoulas.	2003-02-03 22:27:42 +00:00
thorpej	b193480908	Add extensible malloc types, adapted from FreeBSD. This turns malloc types into a structure, a pointer to which is passed around, instead of an int constant. Allow the limit to be adjusted when the malloc type is defined, or with a function call, as suggested by Jonathan Stone.	2003-02-01 06:23:35 +00:00
christos	3908d39e06	step 3. Assign lwp properly if null, so that we can PHOLD without segfaulting.	2003-01-21 00:01:14 +00:00
thorpej	b78f59b443	Merge the nathanw_sa branch.	2003-01-18 08:51:40 +00:00
matt	5d09a84558	Add multiple inclusion protection.	2003-01-06 21:02:18 +00:00
wiz	1035faff1d	writable, not writeable.	2003-01-06 20:30:28 +00:00
martin	d30bd703a5	Cast off_t expression to long long to match format even on 64 bit platforms. Shouldn't we introduce a PRIoff_t macro to create such format strings?	2003-01-04 15:42:35 +00:00
christos	b5e0c9eb5b	add LK_CANRECURSE in the locking of /dev/<pid>/fd/<n> and remove the curproc kludge. Thanks to fvdl.	2003-01-03 13:54:22 +00:00
christos	dbe6c38bc2	Implement /proc/<pid>/fd/<n>. This is work in progress. Questionable things: - Is it ok to convert DTYPE_PIPE to VFIFO and DTYPE_SOCKET to VSOCK? - XXX: Avoid locking issue in ls -Rl /proc by avoiding curproc - Does I/O to pipes work? - XXX: Are there security implications?	2003-01-03 13:21:17 +00:00
lukem	0635de35a3	Remove KDIR=, since SYS_INCLUDE=symlinks and KDIR are not supported any more.	2002-11-26 23:30:07 +00:00
christos	f5ff34bbc6	si_ -> sel_	2002-11-26 18:51:18 +00:00
yamt	bbbe3e07d7	genfs_compat_gop_write: set uio_iovcnt correctly.	2002-11-15 14:01:57 +00:00
thorpej	ec2b09dc84	Fix a signed/unsigned comparison warning.	2002-11-07 08:21:36 +00:00
thorpej	b7d2ca250a	Fix signed/unsigned comparison warnings.	2002-11-07 08:13:10 +00:00
jdolecek	c82ab2eb79	now that mem_no is emitted by config(8), there is no reason to keep copy of more or less identical iskmemdev() for every arch; move the function to spec_vnop.c, and g/c machine-dependant copies	2002-10-26 13:50:17 +00:00
yamt	ac3a01e67e	use B_ASYNC for children of nested buffers in genfs_getpages. ok'ed by Chuck Silvers.	2002-10-25 05:44:41 +00:00
jdolecek	e0cc03a09b	merge kqueue branch into -current kqueue provides a stateful and efficient event notification framework currently supported events include socket, file, directory, fifo, pipe, tty and device changes, and monitoring of processes and signals kqueue is supported by all writable filesystems in NetBSD tree (with exception of Coda) and all device drivers supporting poll(2) based on work done by Jonathan Lemon for FreeBSD initial NetBSD port done by Luke Mewburn and Jason Thorpe	2002-10-23 09:10:23 +00:00
fvdl	eb485a7e27	Use B_ASYNC in the !PGO_SYNCIO case. Gets back most, if not all, NFS read throughput performance lost since the introduction of UBC. Spotted by YAMAMOTO Takashi, many thanks to him.	2002-10-21 15:21:35 +00:00
jdolecek	affeea595d	put back the while loop in kernfs_getattr() removed in rev 1.82; it's necessary to get the whole file length e.g. for msgbuf this fixes the '/kern/msgbuf & less' problem reported on port-i386 by Dave Tyson	2002-10-12 14:04:45 +00:00
provos	0f09ed48a5	remove trailing \n in panic(). approved perry.	2002-09-27 15:35:29 +00:00
christos	6868d0a7d6	MNT_GETARGS support	2002-09-21 18:08:27 +00:00
jtk	2bbcff81d5	restore ancestral RCS ID from 4.4BSD-Lite2	2002-09-10 02:52:40 +00:00
gehenna	77a6b82b27	Merge the gehenna-devsw branch into the trunk. This merge changes the device switch tables from static arrays to tables generated dynamically by config(8). - All device switches are defined as constant structures in device drivers. - The new grammar ``device-major'' is introduced to ``files''. device-major <prefix> char <num> [block <num>] [<rules>] - All device major numbers must be listed in the port-dependent majors.<arch> using this grammar. - Added the new naming convention. The name of the device switch must be <prefix>_[bc]devsw for auto-generation of device switch tables. - Backward compatibility for loading block/character device switches through the LKM framework is broken. This is necessary to convert from block/character device major to device name at run time and vice versa. - The restriction on assigning device majors by LKM is completely removed. We don't need to reserve LKM entries for dynamic loading of device switches. - At compile time, the list of device major numbers is packed into the kernel, and the LKM framework refers to it to assign device major numbers dynamically.	2002-09-06 13:18:43 +00:00
thorpej	3767580d1a	Fix a signed/unsigned comparison warning from GCC 3.3.	2002-08-26 01:26:29 +00:00
simonb	c475865f39	Just use the "time" variable in the *_getattr functions instead of a call to (the potentially expensive) microtime().	2002-08-03 04:52:44 +00:00
soren	178d83d503	Die, qaddr_t, die! - mnt_data in struct mount is already effectively a void *, so stop pretending otherwise.	2002-07-30 07:40:07 +00:00
chs	e697956ce6	we can't use the vnode's v_usecount to track how many times the vnode has been VOP_OPEN()'d. if the fifo is being accessed via a layered fs, v_usecount is always one (representing the hold by the layered vnode) regardless of how many times the vnode has been opened. instead, keep a separate counter for opens. fixes PR 17195 and probably 17724.	2002-07-27 16:43:36 +00:00
jdolecek	02eb342b57	Make sure that the pointer to old parent process for ptraced children gets reset properly when the old parent exits before the child. A flag is set in old parent process when the child is reparented in ptrace(2). If it's set when process is exiting, all running processes have their 'old parent process' pointer checked and reset if appropriate. Also change to use 'struct proc *' pointer directly, rather than pid_t. This fixes security/14444 by David Sainty. Reviewed by Christos Zoulas.	2002-07-25 20:04:02 +00:00
jdolecek	9c55530336	Properly detect error in kernfs_xread(). Fixes kern/10278 by IWAMOTO Toshihiro, though implemented different way. While here, clean up some int vs. size_t confusion, make kernfs_x{read|write}() static and g/c some #if 0 stuff.	2002-07-19 18:35:44 +00:00
wiz	4b20971f01	Spell acquire with a 'c'.	2002-07-10 23:16:32 +00:00
lukem	b8e7104984	be consistent about how va_[acm]time is set to the current time (inspired by how procfs does it)	2002-07-05 03:24:06 +00:00
lukem	20657af5c3	set vap->va_ctime to vap->va_atime (the current time), rather than vap->va_ctime (which is a no-op).	2002-07-05 02:02:00 +00:00
enami	9e1deeab34	Add missing pageq lock while uvm_pagefree() is called (either directly or indirectly). Reviewed by chuq.	2002-05-29 11:04:39 +00:00
enami	1578726840	Just give up on readahead rather than waiting for busy pages. While I'm here, added a few patchable variables so that one can easily measure readahead behaviour.	2002-05-18 02:54:57 +00:00
perseant	3fa1c8abe9	Protect v_synclist with splbio(); note that LIST_REMOVE is not an idempotent operation if more than one LIST_REMOVE happens on interrupt, so both the test for VONWORKLIST and the LIST_REMOVE(vp, v_synclist) need to be in splbio().	2002-05-14 19:37:18 +00:00
matt	ea5cd1699b	Extern speclisth	2002-05-12 20:42:03 +00:00
enami	293906a53a	Redo rev. 1.57 a bit different way; don't use `tpg' since it may be freed. Subtract the number of pages behind us when calculating new offset instead.	2002-05-10 07:51:37 +00:00
enami	911c9febb2	Don't modify the local variable `n' in genfs_putpages(). It should contain the number of elements in the page array at the beginning of every iteration.	2002-05-10 02:51:44 +00:00
thorpej	f1f51aa2b8	Move code shared by procfs and the kernel proper out of procfs and into the kernel proper (renaming functions from procfs_* to process_*).	2002-05-09 15:44:44 +00:00
enami	e3cc9c0682	When traversing by list, if the page next to us is a page in the cluster, advance the pointer.	2002-05-09 07:22:09 +00:00
enami	fabaf9a730	- In genfs_putpages(), no need to restrict the cluster within the given region. - In uvm_aio_aiodone(), remove assertions no longer true.	2002-05-09 07:14:37 +00:00
enami	8876669f4c	Since npages may include the number of pages behind us, we can't use it to update current offset. Instead, use the last page in the run of pages to calculate new offset.	2002-05-06 00:42:22 +00:00
enami	e6513c283e	Stylistic change; introduce new local variable and use it instead of sprinkling different expression to test if we're pagedaemon.	2002-05-06 00:18:15 +00:00
enami	6335b88f05	We don't need to re-activate page in genfs_putpages() when GOP_WRITE returns ENOMEM (temporary memory shortage) since it is already handled in uvm_aio_aiodone() for both async/sync case. Discussed with chuq.	2002-04-26 03:57:31 +00:00
thorpej	37dc008ca3	Cleanup how file system configuration information is declared, grouping related information together, with the file system code itself. This is just low-hanging fruit -- more to come.	2002-04-16 23:14:05 +00:00
enami	6cfcfb947c	genfs_{compat_}getpages(): For PGO_LOCKED request, it is safe to return read only page if it was due to read fault. This avoids many unnecessary read faults introduced by recent nfs_bio.c change. Reviewed by chuq.	2002-04-16 06:05:05 +00:00
enami	08625200a0	KNF and other misc. cosmetic changes.	2002-04-16 06:00:46 +00:00
jdolecek	83bb6f6dd7	Changes to make it less likely to need to be revisited later again: * fdesc_attr(): don't panic for 'unknown' descriptor types, rather use (fp->f_ops->fo_stat)() hook, as for DTYPE_SOCKET and DTYPE_PIPE XXX perhaps use different vnode type than VBAD for these? * fdesc_setattr(): just return 0 regardless of type, rather than panicking for 'unknown' descriptor types	2002-04-02 17:46:06 +00:00
jmc	70ba374a66	Treat pipes like sockets and don't do setattr on them	2002-04-02 10:34:54 +00:00
chs	72c455ce83	in genfs_compat_getpages(), clear any part of a page that VOP_READ() doesn't fill in (eg. because it's past EOF).	2002-03-22 03:51:51 +00:00
atatat	31144d9976	Convert ioctl code to use EPASSTHROUGH instead of -1 or ENOTTY for indicating an unhandled "command". ERESTART is -1, which can lead to confusion. ERESTART has been moved to -3 and EPASSTHROUGH has been placed at -4. No ioctl code should now return -1 anywhere. The ioctl() system call is now properly restartable.	2002-03-17 19:40:26 +00:00
chs	a51be40dcb	don't yield the cpu in genfs_putpages() if we're the pagedaemon. pointed out by enami. fixes PR 15784.	2002-03-02 06:58:01 +00:00
enami	9a623b9870	Don't use MALLOC for variable sized allocation.	2002-02-20 06:16:22 +00:00
chs	96f907f394	fix two problems: - when yielding the cpu while using the vnode's page list, use a marker page to keep our place in the list (like the other cases where we drop the lock). - wait until no one else has the page busy before deciding if the page needs to be cleaned. a page will be dirty while it's being initialized but will be marked clean before PG_BUSY is cleared. both found by enami.	2002-02-19 15:49:39 +00:00
enami	fe24174a3b	Don't bother to subtract 0.	2002-02-13 05:20:41 +00:00
enami	52a2a21502	Don't leave junk in pgs[] array since it will be passed to uvn_findpages() again.	2002-02-12 01:08:12 +00:00
chs	0365a63944	in genfs_putpages(): - yield the cpu if we've taken too long. - when traversing by offset, skip over any pages that we clustered.	2002-01-26 02:44:27 +00:00
christos	5ff60a19d9	Don't hide the real return code with EPERM.	2002-01-12 18:51:56 +00:00
christos	5266303896	When checking for permissions, include the P_INEXEC test and return EAGAIN if the process is exec'ing.	2002-01-12 18:51:31 +00:00
christos	67c310f08c	Apply the same P_INEXEC test to avoid the execve/trace problem using the procfs ptrace calls.	2002-01-11 22:02:56 +00:00
chs	2776bd066f	add the entry for layer_getpages() to the VOP tables of the layered file systems that need it.	2002-01-04 07:19:33 +00:00
chs	03ea276e84	in genfs_gop_write(), actually set the B_ASYNC flag on buffers that we're not going to wait for. this doesn't matter for real devices since we call VOP_STRATEGY() directly, but NFS uses this flag to decide whether or not to hand the buffer off to an nfsiod thread.	2001-12-31 06:44:58 +00:00
chs	64b0c2adbb	in genfs_putpages(), we must wait for any pending write i/os to complete if the putpages request is synchronous.	2001-12-31 06:40:08 +00:00
chs	40bf5f0e12	add some compatibility routines to allow mmap() to work on non-UBCified filesystems (in the same non-coherent fashion that they worked before).	2001-12-18 07:49:36 +00:00
chs	8e9cdbbd63	replace "vnode" and "vtext" with "file" and "exec" in uvmexp field names.	2001-12-09 03:07:43 +00:00
chs	0931f31deb	add a VOP_PUTPAGES method for all the filesystems that don't have pages. do all the work for destroying syncfs vnodes in vfs_deallocate_syncvnode() to make some new assertions work out.	2001-12-06 04:30:49 +00:00
chs	1361ca58eb	be consistent, clear VONWORKLST when we take a syncer vnode off the syncer list.	2001-12-06 04:29:55 +00:00
chs	4d14671458	add VOP_GETPAGES and VOP_PUTPAGES methods for layered filesystems. drop the interlock on the upper layer, acquire the interlock on the lower layer.	2001-12-06 04:29:23 +00:00
chs	5a690c92a1	add a VOP_PUTPAGES method for all the filesystems that don't have pages, just unlock the interlock.	2001-12-06 04:27:40 +00:00
thorpej	03efee5585	* Allow machine-dependent code to specify hooks for ptrace(2) (__HAVE_PTRACE_MACHDEP) and procfs (__HAVE_PROCFS_MACHDEP). These changes will allow platforms like x86 (XMM) and PowerPC (AltiVec) to export extended register sets in a sane manner. * Use __HAVE_PTRACE_MACHDEP to export x86 XMM registers (standard FP + SSE/SSE2) using PT_{GET,SET}XMMREGS (in the machdep ptrace request space). * Use __HAVE_PROCFS_MACHDEP to export x86 XMM registers via /proc/N/xmmregs in procfs.	2001-12-05 00:58:05 +00:00
christos	420771d7cc	PR/14781: Matthew Fredette: Clamp the number of read-ahead pages to 16 because other code has this limit. Also while I am here, convert the magic 16 into a #define constant and use it in the appropriate places. This is a temporary fix, since all this read-ahead business is XXXUBC anyway.	2001-11-30 15:18:39 +00:00
lukem	2565646230	don't need <sys/types.h> when including <sys/param.h>	2001-11-15 09:47:59 +00:00
lukem	e4b00f433c	add RCSIDs	2001-11-10 13:33:40 +00:00
enami	7b7e3f6b7e	Wrap long line and remove name of argument from function prototype.	2001-11-07 04:56:09 +00:00
enami	8e5afc2cbe	Make the size of null node hash table to desiredvnodes instead of 16.	2001-11-07 04:55:26 +00:00
enami	888e968e4d	Call hashdone where appropriate.	2001-11-07 04:21:26 +00:00
enami	579ece8405	Cosmetic changes.	2001-11-07 04:15:41 +00:00
enami	9af6d646c2	Fix typo in comment.	2001-11-07 04:04:10 +00:00
simonb	cf63a6b052	Remove some variables that are set but never used.	2001-11-06 07:20:37 +00:00
simonb	86c9e7e6d2	In procfs_domem() the addr variable is only needed if PMAP_NEED_PROCWR is defined.	2001-11-06 07:20:08 +00:00
enami	6e46b6ec2c	s/genfs_do_putpages/genfs_gop_write/ in uvmhist.	2001-10-03 14:13:08 +00:00
chs	f0b6149f57	change spec_{read,write}() to specify the device blkno in units of DEV_BSIZE rather than the device's sector size. this allows /dev/rcd0a and /dev/cd0a to return the same data. fixes PRs 3261 and 14026.	2001-09-23 04:39:25 +00:00
sommerfeld	181c4513dc	Add fifo_putpages() placebo so that the vnode's uobj is unlocked.	2001-09-22 22:35:18 +00:00
chs	d0b76cab5e	use shared locks instead of exclusive for VOP_READ() and VOP_READDIR().	2001-09-21 08:02:55 +00:00
chs	4111c37251	when zeroing pages past EOF, don't zero the page containing EOF if it already contains valid data. should fix PRs 13361 and 13436.	2001-09-21 07:52:25 +00:00
chs	5f5ac77eff	add a forward decl for struct vm_page.	2001-09-15 22:38:40 +00:00
chs	099a6b5258	interfaces and structures used by new genfs_{get,put}pages().	2001-09-15 21:33:05 +00:00
chs	64c6d1d2dc	a whole bunch of changes to improve performance and robustness under load: - remove special treatment of pager_map mappings in pmaps. this is required now, since I've removed the globals that expose the address range. pager_map now uses pmap_kenter_pa() instead of pmap_enter(), so there's no longer any need to special-case it. - eliminate struct uvm_vnode by moving its fields into struct vnode. - rewrite the pageout path. the pager is now responsible for handling the high-level requests instead of only getting control after a bunch of work has already been done on its behalf. this will allow us to UBCify LFS, which needs tighter control over its pages than other filesystems do. writing a page to disk no longer requires making it read-only, which allows us to write wired pages without causing all kinds of havoc. - use a new PG_PAGEOUT flag to indicate that a page should be freed on behalf of the pagedaemon when it's unlocked. this flag is very similar to PG_RELEASED, but unlike PG_RELEASED, PG_PAGEOUT can be cleared if the pageout fails due to eg. an indirect-block buffer being locked. this allows us to remove the "version" field from struct vm_page, and together with shrinking "loan_count" from 32 bits to 16, struct vm_page is now 4 bytes smaller. - no longer use PG_RELEASED for swap-backed pages. if the page is busy because it's being paged out, we can't release the swap slot to be reallocated until that write is complete, but unlike with vnodes we don't keep a count of in-progress writes so there's no good way to know when the write is done. instead, when we need to free a busy swap-backed page, just sleep until we can get it busy ourselves. - implement a fast-path for extending writes which allows us to avoid zeroing new pages. this substantially reduces cpu usage. 
- encapsulate the data used by the genfs code in a struct genfs_node, which must be the first element of the filesystem-specific vnode data for filesystems which use genfs_{get,put}pages(). - eliminate many of the UVM pagerops, since they aren't needed anymore now that the pager "put" operation is a higher-level operation. - enhance the genfs code to allow NFS to use the genfs_{get,put}pages instead of a modified copy. - clean up struct vnode by removing all the fields that used to be used by the vfs_cluster.c code (which we don't use anymore with UBC). - remove kmem_object and mb_object since they were useless. instead of allocating pages to these objects, we now just allocate pages with no object. such pages are mapped in the kernel until they are freed, so we can use the mapping to find the page to free it. this allows us to remove splvm() protection in several places. The sum of all these changes improves write throughput on my decstation 5000/200 to within 1% of the rate of NetBSD 1.5 and reduces the elapsed time for "make release" of a NetBSD 1.5 source tree on my 128MB pc to 10% less than a 1.5 kernel took.	2001-09-15 20:36:31 +00:00

600 Commits