NetBSD

Commit Graph

Author	SHA1	Message	Date
gwr	9b055e8c43	minor nits -- replace vsprintf with vsnprinf	1999-04-12 00:24:17 +00:00
gwr	8946cf4be4	minor nits -- strncpy into p->p_comm	1999-04-12 00:22:08 +00:00
kleink	bff1bef006	* Add a comment to ktrgetheader() mentioning that ktr_len and ktr_buf are being left uninitialized intentionally; addresses PR kern/6987. * In ktrsysret(), initialize the unused ktr_eosys to avoid writing random junk.	1999-04-11 18:44:00 +00:00
wrstuden	2e1fa90ecb	Fix obscure bug in namei(), which was the cause of PR 7306. The problem is that if "sl" is a symbolic link, a lookup on "sl/" will be flagged as the last component. Thus VOP_LOOKUP will lock the parent directory if LOCKPARENT is set. In order for the symbolic link to be resolved, this lock needs to be released. namei() would test for this by checking if ni_pathlen == 1, which it wouldn't as "/" is left in the name, and namei() would not unlock the parent. The next call to lookup() to resolve the symbolic link would fail as the parent was still locked.	1999-04-07 05:47:37 +00:00
thorpej	b2741be06e	More locking protocol fixes. Protect pool_head with a spin lock (statically initialized). This lock also protects the "next drain candidate" pointer. XXX There is still one locking protocol problem, which should not be a problem in practice, but is still marked as an issue in the code anyhow.	1999-04-06 23:32:44 +00:00
sommerfe	c97bb0650d	Fix compile warning (not seen with egcs, but...)	1999-04-05 03:33:31 +00:00
chs	c109816333	Undo the part of the last revision about pr_rmpage() referencing a data structure after it was freed. This wasn't actually a problem, and the change caused the wrong pool_item_header to be freed in the non-PR_PHINPAGE case.	1999-04-04 17:17:31 +00:00
thorpej	4fd2edfbe8	mbinit() can now allocate memory. Update a comment accordingly.	1999-04-01 00:23:25 +00:00
thorpej	c431ebc42f	Call cpu_startup() immediately after uvm_init(), but before mbinit(). Call configure() directly immediately after config_init(). This causes autoconfiguration to happen at the same time as before, but creates some kernel submaps earlier, so that e.g. mbinit() can now allocate memory.	1999-04-01 00:22:45 +00:00
thorpej	278e7ae222	Yet more fixes to the pool allocator: - Protect userspace from unnecessary header inclusions (as noted on current-users). - Some const poisioning. - GREATLY simplify the locking protocol, and fix potential deadlock scenarios. In particular, assume that the back-end page allocator provides its own locking mechanism (this is currently true for all such allocators in the NetBSD kernel). Doing so allows us to simply use one spin lock for serialized access to all r/w members of the pool descriptor. The spin lock is released before calling the back-end allocator, and re-acquired upon return from it. - Fix a problem in pr_rmpage() where a data structure was referenced after it was freed. - Minor tweak to page manaement. Migrate both idle and empty pages to the end of the page list. As soon as a page becomes un-empty (by a pool_put()), place it at the head of the page list, and set curpage to point to it. This reduces fragmentation as well as the time required to find a non-empty page as soon as curpage becomes empty again. - Use mono_time throughout, and protect access to it w/ splclock(). - In pool_reclaim(), if freeing an idle page would reduce the number of allocatable items to below the low water mark, don't.	1999-03-31 23:23:47 +00:00
mycroft	f145c291bd	If copyout() fails, make sure to unbusy the mount point before returning.	1999-03-31 19:18:45 +00:00
mycroft	0622545249	Previous change to vn_lock() was bogus. If we got EDEADLK, it was from lockmgr(), and it already unlocked v_interlock. So, just return in this case.	1999-03-31 18:30:13 +00:00
thorpej	98d006d2c6	Set a hard limit (rather than an advisory high water mark for pages) of NMBCLUSTERS for the mbuf cluster pool. On platforms which use direct-mapped segments for pool pages (MIPS and Alpha), this makes NMBCLUSTERS actually meaningful (such ports don't even allocate mb_map, as it is not used to map mbuf cluster pages). Improve the message logged at a maximum rate of once per second. The new message: "WARNING: mclpool limit reached; increase NMBCLUSTERS". In the back-end pool page allocator, remove the message about mb_map being full. The message was not necessarily correct as the allocator may have been starved for pages, rather than for space in the map. Also, the hard limit on the mbuf cluster pool will be reached before the map fills (the last cluster will always fit into the map), so the message is redundant. Add a comment in mbinit() about considering setting low water marks on the mbuf and mbuf cluster pools.	1999-03-31 01:26:40 +00:00
thorpej	d4d4e314e9	Fix several bugs/deficiencies in the pool allocator: - Add support for hard limits, with optional rate-limited logging of a warning message when the pool limit is reached. (This will be used to fix a bug in mbuf cluster allocation on the MIPS and Alpha ports.) - Fix some locking protocol errors. This required splitting pr_flags into pr_flags (which is protected by the spin lock) and pr_roflags (which are `read only' flags, set when the pool is initialized, and never changed again; these do not need to be protected by a mutex). - Make the low water support actually mean something. When a low water mark is set, add free items to the pool until the low water mark is reached. When an item allocation causes the number of free items to drop below the low water mark, make the pool catch up to it. This can make the pool allocator more useful for several applications (e.g. pmap `pv entry' management) and more robust for others (for e.g. mbuf and mbuf cluster allocation, so that the pagedaemon can use NFS to clean pages on diskless systems without completely running dry on buffers to receive packets in during extreme memory shoratages). - Add a comment where we sleep waiting for more pages for the back-end page allocator. Specifically, instead of sleeping potentially forever, perhaps we should just wake up once a second to try allocating a page again. XXX Revisit this soon.	1999-03-31 01:14:06 +00:00
wrstuden	bd2d8363bc	The mode for a node is a mode_t in both struct stat and struct vattr - don't use a u_short for intermediate storage in vn_stat.	1999-03-30 00:16:44 +00:00
kleink	2304549e9f	Sync __getlogin()'s namelen argument with the libc-internal declaration (u_int vs. size_t).	1999-03-28 17:34:33 +00:00
kleink	1c35ac8f6b	Regen.	1999-03-28 17:32:51 +00:00
kleink	38c7f65f0c	Sync __getlogin()'s namelen argument with the libc-internal declaration (u_int vs. size_t).	1999-03-28 17:29:52 +00:00
eeh	69d1f9d0d3	COMPAT_SPARC32 -> COMPAT_NETBSD32	1999-03-26 04:29:20 +00:00
thorpej	c647f127c1	Assign initproc in main(), not start_init(). It's conventient to do so.	1999-03-26 01:10:50 +00:00
sommerfe	ef07055d58	Disallow tracing of processes unless tracer's root directory is at or above tracee's root directory.	1999-03-25 04:45:56 +00:00
nathanw	e8b935d003	Garbage collect a non-useful variable in getcwd_scandir().	1999-03-25 02:32:18 +00:00
sommerfe	f1a508e354	Prevent deadlock cited in PR4629 from crashing the system. (copyout and system call now just return EFAULT). A complete fix will presumably have to wait for UBC and/or for vnode locking protocols to be revamped to allow use of shared locks.	1999-03-25 00:20:35 +00:00
mrg	d2397ac5f7	completely remove Mach VM support. all that is left is the all the header files as UVM still uses (most of) these.	1999-03-24 05:50:49 +00:00
lukem	8a931fcdd8	Ensure that you can only bind a more specific address when it is done by the same uid or by root. This code is from FreeBSD. (Whilst it was originally obtained from OpenBSD, FreeBSD fixed it to work with multicast. To quote the commit message: - Don't bother checking for conflicting sockets if we're binding to a multicast address. - Don't return an error if we're binding to INADDR_ANY, the conflicting socket is bound to INADDR_ANY, and the conflicting socket has SO_REUSEPORT set. )	1999-03-23 10:45:37 +00:00
thorpej	845b609f97	Set the high water mark on the mbuf cluster pool to NMBCLUSTERS.	1999-03-23 02:51:27 +00:00
thorpej	9614a68c70	Fix the order of arguments to roundup().	1999-03-23 02:49:03 +00:00
thorpej	f2a91c9b91	Put back the code to log `mb_map full' that was lost when mbuf clusters were converted to use the pool allocator.	1999-03-22 22:06:58 +00:00
sommerfe	098b6f8e8a	Disallow descriptor-passing of descriptors which are open on directories which aren't under the recipient's root. Clean up of many error conditions involving descriptor passing, to eliminate infinite loops, panics, premature garbage collection of sockets, and descriptor leaks: - Avoid letting unp_gc() see descriptors with a refcount of zero by removing them from the socket's queue before releasing them. - Avoid socket leak in PRU_ABORT (this will also gc descriptors queued on a not-yet accepted socket when the accepting socket goes away). - Put in block comment explaining how unp_gc() should work. - Correctly manage unp_defer count so we don't get stuck in an infinite loop with nothing to do. - Don't tie MARK and DEFER bits so closely together. - Mark descriptors queued on not-yet-accepted sockets as well. - Don't call sorflush on non-socket, it doesn't work very well. - Deal with discard of NULL file pointer. - Hopefully cause GC to converge faster by only deferring sockets in unp_mark().	1999-03-22 17:54:38 +00:00
sommerfe	2f0f84b8e8	bug fix to fdavail: be consistent about taking per-process descriptor limit into account when checking against the limit; fdp->fd_nfiles may be greater than the current descriptor limit, and there may be space in fdp->fd_ofiles beyond the limit. If we say it's available, unp_externalize will get confused and panic when fdalloc fails.	1999-03-22 17:39:44 +00:00
sommerfe	36dc99adac	vinvalbuf, called from vclean, could cause a locking-against-self deadlock in VOP_FSYNC() if the unreferenced vnode picked for reclamation happened to be stacked on top of a vnode the process already had locked. This could happen if the same filesystem was accessed both through a union mount and directly; it seemed to happen most frequently when the direct access was through NFS. Avoid this deadlock by changing vinvalbuf to pass a new FSYNC_RECLAIM flag bit to VOP_FSYNC() to indicate that a reclaim is in progress and only a `shallow' fsync is necessary. Do nothing in _fsync() in umapfs, nullfs, and unionfs when FSYNC_RECLAIM is set; the underlying vnodes will shortly be released in _reclaim and may be reclaimed (and fsync'ed) later.	1999-03-22 17:24:19 +00:00
sommerfe	6d0f67d6aa	Regen files based on changes to syscalls.master, vnode_if.src (latter was changes to comments only, but..) Build vfs_getcwd.c as standard part of kernel. Add implementation of fchroot(), since two emulations already had it. Call vn_isunder() in fchdir(), chroot(), and fchroot() to make it harder to escape chroot().	1999-03-22 17:13:34 +00:00
sommerfe	095cd96cd5	Change namei cache to record vnode->(parent,name) entries (for directories). Add kernel implementation of getcwd() which uses this cache, falling back to reading the filesystem on a cache miss. Along for the ride: add new VOP_FSYNC flag FSYNC_RECLAIM indicating that a reclaim is being done, so only a "shallow" fsync is needed.	1999-03-22 17:01:55 +00:00
sommerfe	75fb6d661f	Include some comments with more details on the VOP_LOOKUP locking protocol	1999-03-22 16:57:37 +00:00
bouyer	87662a89a2	Hinherit MNT_NOEXEC from the mount point. Without this a user can exec arbitrary binaries by doing a user mount, even if the admin has carefully setup his system to avoid arbitrary binaries execution.	1999-03-17 15:35:03 +00:00
kleink	51be278873	Defopt COMPAT_M68K4K.	1999-03-09 21:02:38 +00:00
mycroft	f3df26f6f3	This is sort of gratuitous, but... Strip the leading path off of init's argv[0].	1999-03-05 07:26:21 +00:00
fvdl	31289a4af8	Fill in vnodecovered in the mount structure before calling VFS_MOUNT anyway, some things (e.g. unionfs) may depend on it. It's currently ok for vnodecovered to be set already; it's not for v_mountedhere in the vnode, though. From John Darrow. XXX should probably just extend VFS_MOUNT to take the vnode pointer as an argument.	1999-03-02 07:47:49 +00:00
ross	e47e3c9f45	schedclk() -> schedclock(), for consistency with hardclock(), statclock(), ... update comments for recent scheduler mods	1999-02-28 18:14:57 +00:00
fvdl	c0fb2ab4f2	Use a SETRECURSE lock before calling VFS_MOUNT in the mount() system call, since the lock may be taken again. This was the intention of the CANRECURSE lock already there, but didn't work. Only fill in the vnode<->mountpoint links (mountedhere and vnodecovered) after VFS_MOUNT returned succesfully. It might happen that something called from VFS_MOUNT mistook the vnode for an already successfully mounted on one because of this.	1999-02-28 14:12:54 +00:00
fvdl	080ad305ff	Recursive locks were previously only available with LK_CANRECURSE. This could be done in one of 2 ways: * call lk_init with LK_CANRECURSE, resulting in a lock that always can be used recursively. * call lockmgr with LK_CANRECURSE, meaning that it's ok if this lock is already held by us. Sometimes we need a locking type that says: take this lock now, exclusively, but while I am holding it, I may go through a code path which could attempt to get the lock again, and which is unaware that the lock might already be taken. Implement LK_SETRECURSE for this purpose. Assume that locks and unlocks come in matching pairs (they should), and check for this 'level' using SETRECURSE locks.	1999-02-28 14:09:15 +00:00
wrstuden	0078fc50dc	Modify VOP_CLOSE vnode op to always take a locked vnode. Change vn_close to pass down a locked node. Modify union_copyup() to call VOP_CLOSE locked nodes. Also fix a bug in union_copyup() where a lock on the lower vnode would only be released if VOP_OPEN didn't fail.	1999-02-26 23:38:55 +00:00
christos	88162a0fe9	Make this compile without EXEC_AOUT and with COMPAT_FREEBSD	1999-02-23 18:19:29 +00:00
mycroft	d4955ba8a9	While I'm on a fixed point kick, improve the NTP clock factor correction to give <.1% error in all (supported) cases. It doesn't cost us much.	1999-02-23 17:41:48 +00:00
ross	6803dbddbb	Replace the recent scheduler mods with calls to scheduler_{fork,wait}_hook(), (inlined) so scheduler functionality can be kept in a single .h/.c set. Also, the wait hook has changed the way it clips the scheduler history.	1999-02-23 02:57:18 +00:00
ross	b4a33c4e60	Scheduler bug fixes and reorganization * fix the ancient nice(1) bug, where nice +20 processes incorrectly steal 10 - 20% of the CPU, (or even more depending on load average) * provide a new schedclk() mechanism at a new clock at schedhz, so high platform hz values don't cause nice +0 processes to look like they are niced * change the algorithm slightly, and reorganize the code a lot * fix percent-CPU calculation bugs, and eliminate some no-op code === nice bug === Correctly divide the scheduler queues between niced and compute-bound processes. The current nice weight of two (sort of, see `algorithm change' below) neatly divides the USRPRI queues in half; this should have been used to clip p_estcpu, instead of UCHAR_MAX. Besides being the wrong amount, clipping an unsigned char to UCHAR_MAX is a no-op, and it was done after decay_cpu() which can only _reduce_ the value. It has to be kept <= NICE_WEIGHT * PRIO_MAX - PPQ or processes can scheduler-penalize themselves onto the same queue as nice +20 processes. (Or even a higher one.) === New schedclk() mechansism === Some platforms should be cutting down stathz before hitting the scheduler, since the scheduler algorithm only works right in the vicinity of 64 Hz. Rather than prescale hz, then scale back and forth by 4 every time p_estcpu is touched (each occurance an abstraction violation), use p_estcpu without scaling and require schedhz to be generated directly at the right frequency. Use a default stathz (well, actually, profhz) / 4, so nothing changes unless a platform defines schedhz and a new clock. Define these for alpha, where hz==1024, and nice was totally broke. === Algorithm change === The nice value used to be added to the exponentially-decayed scheduler history value p_estcpu, in _addition_ to be incorporated directly (with greater wieght) into the priority calculation. At first glance, it appears to be a pointless increase of 1/8 the nice effect (pri = p_estcpu/4 + nice*2), but it's actually at least 3x that because it will ramp up linearly but be decayed only exponentially, thus converging to an additional .75 nice for a loadaverage of one. I killed this, it makes the behavior hard to control, almost impossible to analyze, and the effect (~~nothing at for the first second, then somewhat increased niceness after three seconds or more, depending on load average) pointless. === Other bugs === hz -> profhz in the p_pctcpu = f(p_cpticks) calcuation. Collect scheduler functionality. Try to put each abstraction in just one place.	1999-02-23 02:56:03 +00:00
drochner	220b8d9add	PR kern/7033 (Izumi Tsutsui <tsutsui@ceres.dti.ne.jp>): use device minor to unit/partition macros from sys/disklabel.h	1999-02-22 16:00:01 +00:00
cjs	d390501801	Safer use of printf.	1999-02-22 00:12:36 +00:00
mycroft	bb815bda9e	Fix a thinko in the previous. Also: * Do the boundary check when creating a new region as well. * If we crossed the boundary, don't just throw away the region; lop off the beginning and see if we still fit. SB16 is now fully functional on the Alpha.	1999-02-18 18:52:29 +00:00
mycroft	ca00239fe9	Speed up the calculation of dontcross; use EXTENT_ALIGN to calculate it directly rather than using a loop. Also adjust so that we can allocate at the end of the integer range.	1999-02-18 18:13:50 +00:00

1 2 3 4 5 ...

1798 Commits