NetBSD

Commit Graph

Author	SHA1	Message	Date
andvar	6f8dc1509f	fix various typos, mainly in comments, but also in man pages and log messages.	2021-10-21 13:21:53 +00:00
thorpej	982ae832c3	Overhaul of the EVFILT_VNODE kevent(2) filter: - Centralize vnode kevent handling in the VOP_*() wrappers, rather than forcing each individual file system to deal with it (except VOP_RENAME(), because VOP_RENAME() is a mess and we currently have 2 different ways of handling it; at least it's reasonably well-centralized in the "new" way). - Add support for NOTE_OPEN, NOTE_CLOSE, NOTE_CLOSE_WRITE, and NOTE_READ, compatible with the same events in FreeBSD. - Track which kevent notifications clients are interested in receiving to avoid doing work for events no one cares about (avoiding, e.g. taking locks and traversing the klist to send a NOTE_WRITE when someone is merely watching for a file to be deleted, for example). In support of the above: - Add support in vnode_if.sh for specifying PRE- and POST-op handlers, to be invoked before and after vop_pre() and vop_post(), respectively. Basic idea from FreeBSD, but implemented differently. - Add support in vnode_if.sh for specifying CONTEXT fields in the vop_*_args structures. These context fields are used to convey information between the file system VOP function and the VOP wrapper, but do not occupy an argument slot in the VOP_*() call itself. These context fields are initialized and subsequently interpreted by PRE- and POST-op handlers. - Version VOP_REMOVE(), uses the a context field for the file system to report back the resulting link count of the target vnode. Return this in tmpfs, udf, nfs, chfs, ext2fs, lfs, and ufs. NetBSD 9.99.92.	2021-10-20 03:08:16 +00:00
thorpej	bc827931b3	Mark the EVFILT_VNODE filters MP-safe.	2021-10-11 01:49:08 +00:00
thorpej	e305e37ace	Setting EV_EOF requires modifying kn->kn_flags. However, that relies on holding the kq_lock of that note's kq. Rather than exposing this directly, add new knote_set_eof() and knote_clear_eof() functions that handle the necessary locking and don't leak as many implementation details to modules. NetBSD 9.99.91	2021-10-11 01:07:36 +00:00
thorpej	7825206807	Must hold kn->kn_kq->kq_lock to modify kn->kn_flags.	2021-10-10 23:46:22 +00:00
thorpej	12ae65d98c	Change the kqueue filterops::f_isfd field to filterops::f_flags, and define a flag FILTEROP_ISFD that has the meaning of the prior f_isfd. Field and flag name aligned with OpenBSD. This does not constitute a functional or ABI change, as the field location and size, and the value placed in that field, are the same as the previous code, but we're bumping __NetBSD_Version__ so 3rd-party module source code can adapt, as needed. NetBSD 9.99.89	2021-09-26 01:16:07 +00:00
andvar	b780d9b67b	fix various typos, mainly in comments.	2021-09-16 20:17:46 +00:00
andvar	d4eac28cae	s/directry/directory/	2021-08-12 20:25:26 +00:00
dholland	4171507047	Abolish all the silly indirection macros for initializing vnode ops tables. These are things of the form #define foofs_op genfs_op, or #define foofs_op genfs_eopnotsupp, or similar. They serve no purpose besides obfuscation, and have gotten cutpasted all over everywhere.	2021-07-18 23:57:13 +00:00
dholland	d819c3614f	Use macros for the canned parts of device and fifo vnode op tables. Add GENFS_SPECOP_ENTRIES and GENFS_FIFOOP_ENTRIES macros that contain the portion of the vnode ops table declaration that is (conservatively) the same in every fs. Use these in every fs that supports devices and/or fifos with separate ops tables. Note that ptyfs works differently (it has one type of vnode with open-coded dispatch to the specfs code, which I haven't changed in this commit) and rump/librump/rumpvfs/rumpfs.c has an indirect dynamic dispatch that already does more or less the same thing, which I also haven't changed. Also note that this anticipates a few bits in the next changeset here and there, and adds missing but unreachable calls in some cases (e.g. most fses weren't defining whiteout on devices and fifos, but it isn't reachable there), and it changes parsepath on devices and fifos to genfs_badop from genfs_parsepath (but it's not reachable there either). It appears that devices in kernfs were missing kqfilter, so it's possible that if you try to use kqueue on /kern/rootdev that it'll explode. And finally note that the ops declaration tables aren't order-dependent. (Other than vop_default_desc has to come first.) Otherwise this wouldn't work.	2021-07-18 23:56:12 +00:00
dholland	c6c16cd073	- Add a new vnode op: VOP_PARSEPATH. - Move namei_getcomponent to genfs_vnops.c and call it genfs_parsepath. - Add a parsepath entry to every vnode ops table. VOP_PARSEPATH takes a directory vnode to be searched and a complete following path and chooses how much of that path to consume. To begin with, all parsepath calls are genfs_parsepath, which locates the first '/' as always. Note that the call doesn't take the whole struct componentname, only the string. The other bits of struct componentname should not be needed and there's no reason to cause potential complications by exposing them.	2021-06-29 22:34:05 +00:00
mlelstv	387503e686	Don't pretend that files are limited to 1TB on NFSv3.	2021-06-13 10:25:11 +00:00
hannken	5b8c1df03b	Add flag/command NFSSVC_REPLACEEXPORTSLIST to nfssvc(2) system call. Works like NFSSVC_SETEXPORTSLIST but supports "mel_nexports > 1" and will atomically update the complete exports list for a file system.	2021-06-04 10:44:58 +00:00
simonb	70da67d08d	Remove nfs_putpages() prototype; it's not defined anywhere.	2021-05-27 08:58:29 +00:00
christos	05909ab8d5	Set f_namemax during mount time like all the other filesystems so that it does gets the right data in copy_statvfs_info(). Otherwise f_namemax can end up being 0. To reproduce: unmount the remote filesystem, remount it, and kill -HUP mountd to refresh exports.	2021-04-02 03:07:54 +00:00
riastradh	9fc453562f	Round of uvm.h cleanup. The poorly named uvm.h is generally supposed to be for uvm-internal users only. - Narrow it to files that actually need it -- mostly files that need to query whether curlwp is the pagedaemon, which should maybe be exposed by an external header. - Use uvm_extern.h where feasible and uvm_*.h for things not exposed by it. We should split up uvm_extern.h but this will serve for now to reduce the uvm.h dependencies. - Use uvm_stat.h and #ifdef UVMHIST uvm.h for files that use UVMHIST(ubchist), since ubchist is declared in uvm.h but the reference evaporates if UVMHIST is not defined, so we reduce header file dependencies. - Make uvm_device.h and uvm_swap.h independently includable while here. ok chs@	2020-09-05 16:30:10 +00:00
christos	79e3c74f8e	Introduce genfs_pathconf() and use it for the default case in all filesystems.	2020-06-27 17:29:17 +00:00
ad	4bfe043955	- Alter the convention for uvm_page_array slightly, so the basic search parameters can't change part way through a search: move the "uobj" and "flags" arguments over to uvm_page_array_init() and store those with the array. - With that, detect when it's not possible to find any more pages in the tree with the given search parameters, and avoid repeated tree lookups if the caller loops over uvm_page_array_fill_and_peek().	2020-05-25 21:15:10 +00:00
ad	0eaaa024ea	Move proc_lock into the data segment. It was dynamically allocated because at the time we had mutex_obj_alloc() but not __cacheline_aligned.	2020-05-23 23:42:41 +00:00
ad	ff872804dc	Start trying to reduce cache misses on vm_page during fault processing. - Make PGO_LOCKED getpages imply PGO_NOBUSY and remove the latter. Mark pages busy only when there's actually I/O to do. - When doing COW on a uvm_object, don't mess with neighbouring pages. In all likelyhood they're already entered. - Don't mess with neighbouring VAs that have existing mappings as replacing those mappings with same can be quite costly. - Don't enqueue pages for neighbour faults unless not enqueued already, and don't activate centre pages unless uvmpdpol says its useful. Also: - Make PGO_LOCKED getpages on UAOs work more like vnodes: do gang lookup in the radix tree, and don't allocate new pages. - Fix many assertion failures around faults/loans with tmpfs.	2020-05-17 19:38:16 +00:00
christos	9aa2a9c323	Add ACL support for FFS. From FreeBSD.	2020-05-16 18:31:45 +00:00
hannken	d5cb0dea34	Resolve delayed truncation from nfs_inactive() too. Should prevent "locking against self" from nfs_unlock().	2020-05-01 08:43:00 +00:00
ad	f5ad84fdb3	PR kern/54759 (vm.ubc_direct deadlock when read()/write() into mapping of itself) - Add new flag UBC_ISMAPPED which tells ubc_uiomove() the object is mmap()ed somewhere. Use it to decide whether to do direct-mapped copy, rather than poking around directly in the vnode in ubc_uiomove(), which is ugly and doesn't work for tmpfs. It would be nicer to contain all this in UVM but the filesystem provides the needed locking here (VV_MAPPED) and to reinvent that would suck more. - Rename UBC_UNMAP_FLAG() to UBC_VNODE_FLAGS(). Pass in UBC_ISMAPPED where appropriate.	2020-04-23 21:47:07 +00:00
ad	23bf88000c	Replace most uses of vp->v_usecount with a call to vrefcnt(vp), a function that hides the details and does atomic_load_relaxed(). Signature matches FreeBSD.	2020-04-13 19:23:17 +00:00
mlelstv	3679de0323	NFSv2 is limited to use only 32bit in metadata. Prevent that larger metadata values are simply truncated. -> clamp filesystem block counts to signed 32bit. -> clamp file sizes to signed 32bit (*) Some NFSv2 clients also have problems to handle buffer sizes larger than (signed) 16bit. -> clamp buffer sizes to signed 16bit for better compatibility. (*) This can lead to erroneous behaviour for files larger than 2GB that NFSv2 cannot handle but it is still better than before. An alternative would be to (partially) reject operations on files larger than 2GB, but which causes other problems.	2020-04-04 07:07:20 +00:00
ad	1d7848ad43	Process concurrent page faults on individual uvm_objects / vm_amaps in parallel, where the relevant pages are already in-core. Proposed on tech-kern. Temporarily disabled on MP architectures with __HAVE_UNLOCKED_PMAP until adjustments are made to their pmaps.	2020-03-22 18:32:41 +00:00
pgoyette	9120d4511b	Use the module subsystem's ability to process SYSCTL_SETUP() entries to automate installation of sysctl nodes. Note that there are still a number of device and pseudo-device modules that create entries tied to individual device units, rather than to the module itself. These are not changed.	2020-03-16 21:20:09 +00:00
ad	16d4fad635	- Hide the details of SPCF_SHOULDYIELD and related behind a couple of small functions: preempt_point() and preempt_needed(). - preempt(): if the LWP has exceeded its timeslice in kernel, strip it of any priority boost gained earlier from blocking.	2020-03-14 18:08:38 +00:00
mgorny	35f46e0f22	Update NFS errno mapping and add assert for correctness Add the mapping for errno values missing in nfsrv_v2errmap[]. While at it, add a compile-time assert to make sure that the array does not become out-of-date again.	2020-03-08 22:12:42 +00:00
ad	bf79731039	Tighten up the locking around vp->v_iflag a little more after the recent split of vmobjlock & v_interlock.	2020-02-27 22:12:53 +00:00
ad	bfc37e9217	v_interlock -> vmobjlock	2020-02-24 20:11:45 +00:00
ad	d2a0ebb67a	UVM locking changes, proposed on tech-kern: - Change the lock on uvm_object, vm_amap and vm_anon to be a RW lock. - Break v_interlock and vmobjlock apart. v_interlock remains a mutex. - Do partial PV list locking in the x86 pmap. Others to follow later.	2020-02-23 15:46:38 +00:00
ad	c2e9cb9413	VFS_VGET(), VFS_ROOT(), VFS_FHTOVP(): give them a "int lktype" argument, to allow us to get shared locks (or no lock) on the returned vnode. Matches FreeBSD.	2020-01-17 20:08:06 +00:00
ad	05a3457e85	Merge from yamt-pagecache (after much testing): - Reduce unnecessary page scan in putpages esp. when an object has a ton of pages cached but only a few of them are dirty. - Reduce the number of pmap operations by tracking page dirtiness more precisely in uvm layer.	2020-01-15 17:55:43 +00:00
thorpej	d6c967bb85	- Eliminate the global "boottime" variable, which was being accessed without any synchronization against changes by e.g. clock_settime(). - Replace with new getbinboottime() / getnanoboottime() / getmicroboottime() functions (naming mirrors that of other time access functions in kern_tc.c). It returns the (maybe-converted) value of timebasebin, which also tracks our estimate of when the system was booted (i.e. the legacy "boottime" was redundant). XXX There needs to be a lockless synchronization mechanism for reading timebasebin, but this is a problem in kern_tc.c that pre-existed these "boottime" changes. At least now the problem is centralized in one location.	2020-01-02 15:42:26 +00:00
ad	7d06f3305f	Make mntvnode_lock per-mount, and address false sharing of struct mount.	2019-12-22 19:47:34 +00:00
ad	881d12e6f2	Merge from yamt-pagecache: - do gang lookup of pages using radixtree. - remove now unused uvm_object::uo_memq and vm_page::listq.queue.	2019-12-15 21:11:34 +00:00
ad	5978ddc663	Break the global uvm_pageqlock into a per-page identity lock and a private lock for use of the pagedaemon policy code. Discussed on tech-kern. PR kern/54209: NetBSD 8 large memory performance extremely low PR kern/54210: NetBSD-8 processes presumably not exiting PR kern/54727: writing a large file causes unreasonable system behaviour	2019-12-13 20:10:21 +00:00
msaitoh	c56890eeef	s/initalize/initialize/ in comment or printf message.	2019-10-18 04:09:01 +00:00
christos	9a1f52751e	remove NCHNAMLEN optimization	2019-09-10 23:19:34 +00:00
kamil	4067fe4673	Appease GCC and initialize arps_ip Fixes build as GCC errors with maybe-uninitialized that is a false positive.	2019-06-29 17:42:36 +00:00
hannken	3c4b857dd5	Bracket do_sys_renameat() and nfsrv_rename() with fstrans. The v_mount field for vnodes on the same file system as "from" is now stable for referenced vnodes. VFS_RENAMELOCK no longer may use lock from an unreferenced and freed "struct mount".	2019-02-20 10:05:20 +00:00
mrg	fbffadb9f8	- add or adjust /* FALLTHROUGH */ where appropriate - add __unreachable() after functions that can return but won't in this case, and thus can't be marked __dead easily	2019-02-03 03:19:25 +00:00
maxv	5b040abec8	Replace M_ALIGN and MH_ALIGN by m_align.	2018-12-22 14:28:56 +00:00
maxv	b1305a6d63	Replace: M_MOVE_PKTHDR -> m_move_pkthdr. No functional change, since the former is a macro to the latter.	2018-12-22 13:11:37 +00:00
riastradh	d1579b2d70	Rename min/max -> uimin/uimax for better honesty. These functions are defined on unsigned int. The generic name min/max should not silently truncate to 32 bits on 64-bit systems. This is purely a name change -- no functional change intended. HOWEVER! Some subsystems have #define min(a, b) ((a) < (b) ? (a) : (b)) #define max(a, b) ((a) > (b) ? (a) : (b)) even though our standard name for that is MIN/MAX. Although these may invite multiple evaluation bugs, these do _not_ cause integer truncation. To avoid `fixing' these cases, I first changed the name in libkern, and then compile-tested every file where min/max occurred in order to confirm that it failed -- and thus confirm that nothing shadowed min/max -- before changing it. I have left a handful of bootloaders that are too annoying to compile-test, and some dead code: cobalt ews4800mips hp300 hppa ia64 luna68k vax acorn32/if_ie.c (not included in any kernels) macppc/if_gm.c (superseded by gem(4)) It should be easy to fix the fallout once identified -- this way of doing things fails safe, and the goal here, after all, is to _avoid_ silent integer truncations, not introduce them. Maybe one day we can reintroduce min/max as type-generic things that never silently truncate. But we should avoid doing that for a while, so that existing code has a chance to be detected by the compiler for conversion to uimin/uimax without changing the semantics until we can properly audit it all. (Who knows, maybe in some cases integer truncation is actually intended!)	2018-09-03 16:29:22 +00:00
msaitoh	61e1eb0d0b	- Cleanup for dynamic sysctl: - Remove unused _NAMES macros for sysctl. - Remove unused _MAXID for sysctls. - Move CTL_MACHDEP sysctl definitions for m68k into m68k/include/cpu.h and use them on all m68k machines.	2018-08-22 01:05:21 +00:00
chs	e406c140eb	add a genfs method to allow a file system to limit the range of pages that are given to a single GOP_WRITE() call. needed by ZFS.	2018-05-28 21:04:37 +00:00
thorpej	e832c294bb	Default NFS mounts to using TCP transport instead of UDP. PR kern/53166	2018-05-17 02:34:31 +00:00
maxv	0039128179	Use M_MOVE_PKTHDR.	2018-05-08 16:47:58 +00:00

1362 Commits