NetBSD

Commit Graph

Author	SHA1	Message	Date
itojun	0a9d2ad2d7	provide KERN_URND, which extracts sizeof(int) random number from the kernel (similar to /dev/urandom, for use within chroot jail).	2002-07-02 17:06:17 +00:00
thorpej	11bb3d047b	Avoid NULL pointer dereference. kern/17438, xs@kittenz.org.	2002-07-02 16:22:28 +00:00
thorpej	f055878614	Fix using poll(2) on systrace descriptors.	2002-07-02 16:16:33 +00:00
thorpej	10c252ba47	Changes to allow the IPv4 and IPv6 layers to align headers themselves, as necessary: * Implement a new mbuf utility routine, m_copyup(), which is like m_pullup(), except that it always prepends and copies, rather than only doing so if the desired length is larger than m->m_len. m_copyup() also allows an offset into the destination mbuf, which allows space for packet headers, in the forwarding case. * Add _HDR_ALIGNED_P() macros for IP, IPv6, ICMP, and IGMP. These macros expand to 1 if __NO_STRICT_ALIGNMENT is defined, so that architectures which do not have strict alignment constraints don't pay for the test or visit the new align-if-needed path. Use the new macros to check if a header needs to be aligned, or to assert that it already is, as appropriate. Note: This code is still somewhat experimental. However, the new code path won't be visited if individual device drivers continue to guarantee that packets are delivered to layer 3 already properly aligned (which are rules that are already in use).	2002-06-30 22:40:32 +00:00
yamt	5154ea5e5b	constify diskerr().	2002-06-28 16:37:20 +00:00
itojun	67c6a98161	disallow ktrace on P_SUGID. from openbsd	2002-06-28 01:59:36 +00:00
christos	c96f693db0	PR/17407: Jim Wise: Systrace does not work without procfs. FIX: use process_domem as thorpej suggested.	2002-06-27 13:22:54 +00:00
wrstuden	6b88bf15ae	If we're in a chroot, and we are looking up '..', make sure we are still in the chroot. If not, teleport the lookup to the chroot and log. Closes an assisted-jail escape method pointed out by xs@kittenz.org. Patch from xs@kittenz.org and myself	2002-06-21 02:19:12 +00:00
eeh	4283dfcbf3	Use the number of args in the sysent rather than calculating it from the sysent's argsize. It should be faster and fix 32-bit compat emulations.	2002-06-19 23:35:35 +00:00
fvdl	87b285ba0d	Repair damage done in systrace commit. Since argsize is no longer passed, assume it's sy_narg * sizeof (register_t). The code made this assumption implicitly anyway. Fixes compat_*32 tracing.	2002-06-18 08:01:30 +00:00
thorpej	2d2b1d78ae	sys_fpathconf: Don't panic in the default case; just return EOPNOTSUPP.	2002-06-18 02:04:08 +00:00
thorpej	5cbb1e82f1	* Don't bother protecting the body of the source file with #ifdef SYSTRACE. It's not even built if the option isn't present. * Use cdev_decl() to generate prototypes for the devsw functions. * Minor whitespace cleanup. * Nuke the SYSTR_CLONE ioctl from orbit; instead, just clone it in systraceopen(), like we do with svr4_net.	2002-06-18 01:24:15 +00:00
christos	acf2d4083d	Niels Provos systrace work, ported to NetBSD by kittenz and reworked...	2002-06-17 16:22:50 +00:00
matt	91650be524	Fix 2 bugs with MSG_WAITALL. The first is to not block forever if one is trying to MSG_PEEK for more than the socket can hold. The second is that before sleeping waiting for more data, upcall the protocol telling it you have just received data so it can kick itself to re-fill the just drained socket buffer.	2002-06-11 00:21:33 +00:00
he	a8c83879a2	In soreceive(), if any part of a received record has been freed, and an error occurs, make sure the socket doesn't retain a partial copy by dropping the rest of the record. This would otherwise trigger a panic("receive 1a") under DIAGNOSTIC. Fixes PR#16990, suggested fix adapted. Reviewed by Matt Thomas.	2002-06-10 20:43:16 +00:00
drochner	49c4cfb2c5	provide a db_vprintf() which is exactly what the name suggests	2002-06-05 17:53:52 +00:00
simonb	4c67158999	Use "#ifdef __mips__" instead of "#ifdef mips"; shared libraries work for kernels compiled with gcc 3.x.	2002-06-03 13:07:00 +00:00
lukem	06de426449	SIMPLEQ rototill: - implement SIMPLEQ_REMOVE(head, elm, type, field). whilst it's O(n), this mirrors the functionality of SLIST_REMOVE() (the other singly-linked list type) and FreeBSD's STAILQ_REMOVE() - remove the unnecessary elm arg from SIMPLEQ_REMOVE_HEAD(). this mirrors the functionality of SLIST_REMOVE_HEAD() (the other singly-linked list type) and FreeBSD's STAILQ_REMOVE_HEAD() - remove notes about SIMPLEQ not supporting arbitrary element removal - use SIMPLEQ_FOREACH() instead of home-grown for loops - use SIMPLEQ_EMPTY() appropriately - use SIMPLEQ_*() instead of accessing sqh_first,sqh_last,sqe_next directly - reorder manual page; be consistent about how the types are listed - other minor cleanups	2002-06-01 23:50:52 +00:00
itojun	1638da93e1	support setsockopt() with larger data (up to MCLBYTES). From: Hitoshi Asaeda <Hitoshi.Asaeda@sophia.inria.fr>	2002-05-31 20:33:11 +00:00
atatat	c52a7f3287	"a.out" -> "elf" in a comment	2002-05-31 16:49:12 +00:00
eeh	cfee9f2e02	Since a register_t is probably not an int, don't try to print it with %d.	2002-05-31 01:10:35 +00:00
eeh	45a9507815	Native 32-bit and 32-bit emulation are mutually exclusive. We should not even try to execute 32-bit binaries as native on a 64-bit kernel.	2002-05-31 00:52:20 +00:00
itojun	ae5a9e211d	re-scan all ifnet after domaininit() for if_afdata initialization.	2002-05-27 13:46:45 +00:00
perseant	43ca783b4a	Back out rev 1.174 of vfs_subr.c, because the splbio() wasn't protecting enough to be useful, and broadening it so that it did would have meant that operations possibly requiring synchronous disk activity would have to be done in splbio(). This clearly was not going to work. Worked around this in the LFS case by having lfs_cluster_callback put an extra hold on the vnode before calling biodone(), and taking the hold off without HOLDRELE's problematic list swapping. lfs_vunref() will take care of that---in thread context---on the next write if need be. Also, ensure that the list walking in lfs_{writevnodes,segunlock,gather} takes into account the possibility that the list may change underneath it (possibly because it itself deleted an element). Tested on i386, test-compiled on alpha.	2002-05-23 23:05:25 +00:00
itojun	e54108f021	in m_aux_delete, no need to chase beyond victim. from Archie Cobbs, sync w/kame	2002-05-23 05:45:34 +00:00
thorpej	e839580821	Move kernel_lock manipulation into functions so that they will show up in a profile.	2002-05-21 01:38:26 +00:00
perseant	ec0ca919be	Protect v_freelist with splbio(), since HOLDRELE can be called in interrupt context (through brelvp). (LFS may be the only subsystem affected by this problem.) Tested on i386.	2002-05-20 22:50:57 +00:00
matt	0dc8ee943d	Eliminate more commons or redundant declarations.	2002-05-14 02:58:32 +00:00
matt	44b3464a48	debug0, debug1, debug15 are defined elsewhere. XXX there needs to be a better way of doing this.	2002-05-13 07:13:58 +00:00
matt	0cb85bc7b9	Eliminate commons.	2002-05-12 23:06:27 +00:00
matt	e89ff519c5	Make domains extern.	2002-05-12 20:36:58 +00:00
enami	af81fabf7c	Remove #ifdef DIAGNOSTIC around panic(). It is better than NULL pointer dereference.	2002-05-11 11:56:57 +00:00
enami	3700fdfba4	Don't release the lock on mount point vnode so early when doing update mount. Otherwise, a race condition occurs (e.g., between mountd(8) and the next mount(8) when multiple update mount commands are invoked from a shell script).	2002-05-11 00:45:06 +00:00
atatat	de7144e830	Maintain a short list of the actual descriptors that were closed and log that instead of being ambiguous about which of 0, 1, and/or 2 it was that was closed.	2002-05-09 17:57:07 +00:00
thorpej	f1f51aa2b8	Move code shared by procfs and the kernel proper out of procfs and into the kernel proper (renaming functions from procfs_* to process_*).	2002-05-09 15:44:44 +00:00
enami	b42b2c8323	In soreceive(), don't call sopendfree() if MSG_DONTWAIT is set since it may sleep. nfsrv_rcv() tries to do its jobs in softintr handler as far as possible.	2002-05-07 08:06:35 +00:00
eeh	80ef33c4c1	Back out previous.	2002-05-03 01:22:30 +00:00
thorpej	654768f185	Let the sosend_loan() path be selected at run-time; patch the variable use_sosend_loan to enable/disable it. The SOSEND_LOAN kernel option now causes it to default to 1.	2002-05-03 00:35:14 +00:00
eeh	89e107d917	Regen.	2002-05-03 00:22:16 +00:00
eeh	edc73e0d53	Move the copyout out of ntp_adjtime1 and into the syscall wrapper.	2002-05-03 00:22:15 +00:00
eeh	679cac37b9	Regen.	2002-05-03 00:22:15 +00:00
eeh	9bd29db83b	Use #if defined(_LP64) to determine if we're generating 64-bit binaries, not #if defined(alpha).	2002-05-03 00:20:56 +00:00
thorpej	7a49fee765	Add some experimental page-loaning for writes on sockets. It is disabled by default, and can be enabled by adding the SOSEND_LOAN option to your kernel config. The SOSEND_COUNTERS option can be used to provide some instrumentation. Use of this option, combined with an application that does large enough writes, gets us zero-copy on the TCP and UDP transmit path.	2002-05-02 17:55:48 +00:00
enami	e54e7305a2	- Fix more and more white space nits. - ANSIfy the last K&R function definition in this file.	2002-05-02 13:38:57 +00:00
enami	cb3d9867b6	Log who invoked the s[ug]id program. Tested by mozilla.	2002-04-28 22:35:19 +00:00
enami	88ee31f5b9	Fix the return value of logioctl() for unhandled ioctl; return EPASSTHROUGH instead of -1. Fix suggested by Andrew Brown.	2002-04-28 04:13:51 +00:00
thorpej	143be0a01b	Use M_READONLY() rather than a home-grown M_SHAREDCLUSTER().	2002-04-28 00:53:34 +00:00
enami	9a9bf29bcf	A loop to expand file descriptor table and retry is moved from fdalloc() to caller. So, no longer need to loop in fdalloc().	2002-04-27 21:36:50 +00:00
enami	a58680a4e2	KNF.	2002-04-27 21:31:41 +00:00
christos	ce8e03cd9f	Avoid file use underflow; thanks to YAMAMOTO Takashi for noticing.	2002-04-24 16:09:24 +00:00
christos	7ad1c0f407	Don't forget to set mature and unuse the file.	2002-04-23 17:20:58 +00:00
christos	1261c89715	From OpenBSD, via FreeBSD: If a set{u,g}id binary is invoked with fd < 3 closed, open those fds to /dev/null. XXX: This needs to be fixed in a better way. The kernel should not need to know about /dev/null or special case 0, 1, 2.	2002-04-23 15:11:25 +00:00
simonb	b0b78e1e37	Fix a white-space nit.	2002-04-20 06:04:35 +00:00
gmcgarry	acb682fad2	When searching for devices also skip those deleted by userconf. Problem reported by uwe.	2002-04-15 05:30:12 +00:00
christos	7d930e8572	Use SESSHOLD and SESSRELE consistently. Add SESSHOLD and SESSRELE to the t_session, so that we don't have dangling references [inspired by OpenBSD].	2002-04-12 17:02:33 +00:00
matt	2bf9358fc0	Don't use the tqh_ field names, instead use the corresponding TAILQ_* macro.	2002-04-06 08:04:17 +00:00
thorpej	605e664094	vfs_mountroot(): provide more info when we panic.	2002-04-04 01:44:30 +00:00
fvdl	c2bf6c9615	Split off the backend of the shmat system call, and find removed segments if requested. This enables the linux compat code to be bug compatible (under Linux, shmat() to a removed segment works).	2002-04-03 11:53:01 +00:00
fvdl	8cc2d64d7a	Include opt_malloc_debug.h so that debug_malloc will actually be used when enabled, thankyouverymuch.	2002-04-03 09:45:22 +00:00
simonb	8bcefa5e82	Fix a typo that stopped this compiling in the !LKM case. Remove the buf2 variable from sysctl_sysvipc(), which was only ever assigned to but not used.	2002-04-03 08:06:17 +00:00
rafal	8fcad97302	Fix up linux a.out compat entry after move of emul-related sysctl handler to struct emul.	2002-04-02 23:56:17 +00:00
jdolecek	3ffbb4470d	make emul_sysctl() work with emulations loaded via LKM, properly bound check name[0]	2002-04-02 20:21:51 +00:00
jdolecek	ebd670f0bc	move emulation-specific sysctl hook from struct execsw to struct emul, where it belongs	2002-04-02 20:19:59 +00:00
jdolecek	2e327b16be	expose emul_search()	2002-04-02 20:18:06 +00:00
itohy	ab5fe01c20	Print ttyinfo before (not after) sending SIGINFO to processes. This generates more useful information of a process who catches SIGINFO, rather than always printing "runnable" (the process is marked runnable because of the signal). Inspired by the behavior of BSD/OS.	2002-03-25 04:26:43 +00:00
jdolecek	943f2b4f8a	make COMPAT_11 or COMPAT_12 imply EXEC_ELF_NOTELESS; use #if for the execsw entries, to allow 'options EXEC_ELF_NOTELESS=0' to override this. This addresses kern/13520 and kern/14409 by Matthew Green.	2002-03-23 09:50:54 +00:00
jdolecek	083f26646d	rename EXEC_ELF_CATCHALL to EXEC_ELF_NOTELESS	2002-03-23 09:40:55 +00:00
jdolecek	225f07a8d4	Make sure file descriptor is properly FILE_UNUSE()d in error case of writev(2). Problem found by Alistair G. Crooks.	2002-03-22 18:58:59 +00:00
christos	195539260e	emulation specific sysctls.	2002-03-20 00:27:25 +00:00
oki	075769476a	Use /emul/pecoff as prefix to run PE/COFF binaries.	2002-03-18 07:11:06 +00:00
christos	18c7399ada	oops, forgot to commit the caller of exithook...	2002-03-17 23:41:30 +00:00
christos	22839acdb9	use the exithook mechanism to clean up semaphores.	2002-03-17 22:21:58 +00:00
christos	f7d98e2451	- merge all the hook insertion and deletion code so that we don't create a fourth copy. - add exithook to be executed when a process exits.	2002-03-17 22:19:20 +00:00
atatat	31144d9976	Convert ioctl code to use EPASSTHROUGH instead of -1 or ENOTTY for indicating an unhandled "command". ERESTART is -1, which can lead to confusion. ERESTART has been moved to -3 and EPASSTHROUGH has been placed at -4. No ioctl code should now return -1 anywhere. The ioctl() system call is now properly restartable.	2002-03-17 19:40:26 +00:00
thorpej	53df727d27	Add a new ptrace request, PT_IO, as proposed by Artur Grabowski on the <bsd-api-discuss@wasabisystems.com> mailing list. PT_IO is a more general inferior I/D space I/O mechanism. FreeBSD and OpenBSD have also added PT_IO. From lha@stacken.kth.se, kern/15945.	2002-03-17 17:02:45 +00:00
simonb	1f467013b2	White space nits.	2002-03-17 11:15:49 +00:00
simonb	21d2b8b53d	We don't need to include <uvm/uvm_extern.h> before <sys/sysctl.h> anymore.	2002-03-17 11:10:43 +00:00
chs	4d4825010d	fix bread() to return errors from reading past the end of the device. back in rev. 1.51, bread() and breadn() were changed to assume that if B_DONE is set on a buffer returned by bio_doread(), that the buffer must have already been in the cache, and thus the overall bread() should return success. but if the requested buffer is not in the cache and is past the end of the device, bounds_check_with_label() will set B_ERROR on the buffer and the caller will call biodone(), which will cause bread() to think the buffer was already in the cache and thus return success. to fix this, undo rev. 1.51 and instead have biowait() treat both B_DONE and B_DELWRI as indicators that it doesn't need to sleep waiting for an i/o to complete.	2002-03-16 23:49:59 +00:00
mjacob	ad97873103	size_t is a long on alpha. I forget whether there's actually a 'best' kernel printf format arg to print size_t in NetBSD, but coercing the arg to long probably works well enough.	2002-03-16 22:44:41 +00:00
eeh	8bf657ddf1	Don't forget to allocate space for the property name's trailing NUL.	2002-03-15 20:33:50 +00:00
jdolecek	fcc4c4d402	Merge the update to FreeBSD rev 1.95. Changes: * MP locking changes (mostly FreeBSD specific) XXXSMP the MP locking macros are noops on NetBSD for now * kevent fix (FreeBSD rev. 1.87): when the last reader/writer disconnects, ensure that anybody who is waiting for the kevent on the other end of the pipe gets EV_EOF * kill __P	2002-03-13 21:50:24 +00:00
simonb	f81b8604f8	Move 'struct pool_cache_group' definition into <sys/pool.h>	2002-03-13 10:57:18 +00:00
simonb	4067afd322	Remove two instances of an "error" variable that is only ever assigned to but not used.	2002-03-13 08:12:58 +00:00
thorpej	d100f2ae43	Put back pool_prime(); the i386 mp pmap uses it.	2002-03-09 18:06:54 +00:00
bjh21	dca4ae94d6	When checking that a potentially-unsigned enum is >= 0, assign it to an int first. This is necessary to avoid warnings with -fshort-enums. Casting to an int really should be enough, but turns out not to be. This change will be documented in doc/HACKS.	2002-03-09 13:22:52 +00:00
thorpej	59a96297ef	Fix a couple of typos in simple_{,un}lock()'s.	2002-03-09 05:14:33 +00:00
thorpej	c4a2ca6a3e	Remove pool_prime(). Nothing uses it, and how it should be used is not really well-defined in the absence of PR_STATIC.	2002-03-09 01:56:27 +00:00
thorpej	e21319b482	Make mbpool and mclpool use the new drain hook facility. Adjust m_reclaim() to match the drain hook signature. This allows us to delete m_retry() and m_retryhdr(), as the pool allocator will now perform the reclamation step for us. From art@openbsd.org.	2002-03-09 01:46:32 +00:00
thorpej	4be86fcc13	If, when a page becomes idle, the backend allocator is waiting for resources, release the page immediately, rather than letting it sit around cached. From art@openbsd.org.	2002-03-09 01:37:19 +00:00
thorpej	d54150e78a	Remove PR_MALLOCOK and PR_STATIC. The former wasn't actually used, and the latter, while there was some code that tested the bit, was woefully incomplete and also unused by anything. Besides, PR_STATIC functionality could be better handled by backend allocators anyhow. From art@openbsd.org	2002-03-09 01:33:33 +00:00
thorpej	ce635a702a	Add a missing simple_unlock.	2002-03-08 21:43:54 +00:00
thorpej	8cdc6121b7	Add an optional "drain" client callback, which can be set by the new pool_set_drain_hook(). This hook is called in three cases: * When a pool has hit the hard limit, just before either erroring out or sleeping. * When a backend allocator fails to allocate memory. * Just before trying to reclaim pages in pool_reclaim(). This hook requests the client to try and free some items back to the pool. From art@openbsd.org.	2002-03-08 21:41:59 +00:00
thorpej	7c94c68f10	Remove PR_FREEHEADER; nothing uses it anymore. From art@openbsd.org.	2002-03-08 20:51:25 +00:00
thorpej	a180cee23b	Pool deals fairly well with physical memory shortage, but it doesn't deal with shortages of the VM maps where the backing pages are mapped (usually kmem_map). Try to deal with this: * Group all information about the backend allocator for a pool in a separate structure. The pool references this structure, rather than the individual fields. * Change the pool_init() API accordingly, and adjust all callers. * Link all pools using the same backend allocator on a list. * The backend allocator is responsible for waiting for physical memory to become available, but will still fail if it cannot allocate KVA space for the pages. If this happens, carefully drain all pools using the same backend allocator, so that some KVA space can be freed. * Change pool_reclaim() to indicate if it actually succeeded in freeing some pages, and use that information to make draining easier and more efficient. * Get rid of PR_URGENT. There was only one use of it, and it could be dealt with by the caller. From art@openbsd.org.	2002-03-08 20:48:27 +00:00
nathanw	3f0d660168	Make the debugging printfs compile: - printf format checking doesn't like %x for pointers; use %p like we should. - LP64 fixes.	2002-03-05 23:28:58 +00:00
simonb	bc7301d415	nlinesw is already declared in <sys/conf.h>.	2002-03-04 02:32:58 +00:00
simonb	8ecad242b9	Use <sys/disk.h> for the prototype of disk_init() rather than declaring our own locally.	2002-03-04 02:30:27 +00:00
simonb	9a942a34e0	Don't use local extern declarations for the mountroot variable or declare local prototypes for nfs_mountroot() or md_root_setconf().	2002-03-04 02:25:21 +00:00
mrg	d3b3f3e560	move the COMPAT_SUNOS TIOCGPGRP handling in the compat sunos code proper. this is the final fix needed for it to run properly as an LKM. no more COMPAT_SUNOS hacks around the tree!	2002-03-02 12:30:43 +00:00
thorpej	92eb54d5a5	Don't assign NULL to non-pointer variables.	2002-02-28 04:43:16 +00:00
manu	89e46d8a99	Reworked the o32/n32 matching scheme. IRIX uses some e_flags in the ELF header to distinguish between o32, n32 and n64 ABIs. We now use this. This removes the need for the mips_option test, which had some false positives. This also removes the mandatory ordering of n32 vs o32 in the exec switch (exec_conf.c)	2002-02-21 21:53:00 +00:00
christos	8ec87caa7f	ICANON is in lflags, by Neelkanth Natu	2002-02-17 19:34:42 +00:00
enami	3528e53ca0	Use sizeof correctly. Fixes PR#15613.	2002-02-16 02:11:43 +00:00
enami	233c254079	- Wrap long line. - Remove unnecessary semi-colon.	2002-02-16 02:07:56 +00:00
simonb	14ac06f0bd	Add a "show event" ddb command to show the event counters.	2002-02-15 11:18:26 +00:00
chs	b744097a5f	allow writing to write-only mappings. fixes PR 3493.	2002-02-14 07:08:02 +00:00
thorpej	daaeb3910f	const char *mclpool_warnmsg -> const char mclpool_warnmsg[] Noted by Matt Thomas.	2002-02-12 00:52:33 +00:00
jdolecek	5798a9a710	Switch default for pipes to the faster John S. Dyson's implementation. Old, socketpair-based ones are available with option PIPE_SOCKETPAIR.	2002-02-11 18:11:41 +00:00
chs	9784635eb6	getblk()'s "block size invariant" isn't valid for VBLK vnodes since bounds_check_with_label() will truncate a buffer that crosses the end of the partition. adjust the assertion to account for this. fixes PRs 7938, 12156, 12698, 13076, 13210 and 13288.	2002-02-10 23:14:18 +00:00
bouyer	149c671b66	extent_alloc_subregion1(): when a candidate region doesn't fit after being rounded up to respect boundary limits, adjust newstart and last before skipping to the next region. Otherwise we may check the same candidate region against the start of the next region, not the one immediately following the hole, leading to a corrupted map. This fixes the panic seen on sparc64 with scsi drivers, and probably fixes PR 15489.	2002-02-09 01:00:09 +00:00
christos	980af1007c	Andrew Brown found that we overflowed all cases in the TTLINEDNAMELEN. ouch.	2002-02-08 18:36:53 +00:00
thorpej	7380aa1467	Add a "ddb" command (if DDB is configured into the kernel) for easy, convenient dropping into DDB at the "root device: " prompt. Useful if your console can't do it w/o actually taking an interrupt and you want to, say, look at the boot messages.	2002-02-08 00:58:58 +00:00
chs	90503a3cda	add an assert (hopefully to find where we recycle vnodes without freeing all the pages, like I've seen recently).	2002-02-05 07:50:58 +00:00
tls	eeda20f137	Kick DEFAULT_MAXPTYS from 256 to 992	2002-02-02 07:18:55 +00:00
kleink	12135360cf	fcntl(..., F_GETOWN, ...): fix LP64-BE bug; raised by der Mouse on tech-kern.	2002-01-31 22:17:33 +00:00
kleink	935d56d508	Add {POSIX_MONOTONIC_CLOCK} variables.	2002-01-31 00:32:47 +00:00
simonb	346fc0519d	Implement the CLOCK_MONOTONIC clock for the posix/opengroup realtime clock_() functions. This simply returns the kernel mono_time variable. As discussed on tech-kern.	2002-01-31 00:13:07 +00:00
thorpej	07d023a3aa	Define ELF32_EHDR_FLAGS_OK()/ELF64_EHDR_FLAGS_OK() and use it as an added measure to make sure that we can execute a binary. These default to (1) if elf_machdep.h does not override them. On Sun2, ELF32_EHDR_FLAGS_OK() checks for the presence of EF_M68000, since the 68010 cannot run binaries for the 68020-and-up.	2002-01-28 22:15:54 +00:00
simonb	4aceab2a3d	Remember to update the "size copied" counter in sysctl_diskstats().	2002-01-28 03:33:55 +00:00
simonb	9e42930807	Use TAILQ_FOREACH().	2002-01-28 03:12:13 +00:00
simonb	bfc71f3c10	Add sysctls to read tk_{nin,nout,cancc,rawcc} (under a kern.tkstat node). "extern" those variables in <sys/dkstat.h>, and add declarations for them in sys/tty.c	2002-01-28 02:06:02 +00:00
lukem	fea75ff830	- kern.maxvnodes: only call vfs_reinit() and nchreinit() if the value was successfully changed. previously, successfully viewing the current value would flush the cache :-/ - similarly, don't change hostid and sb_max unless the value was successfully changed	2002-01-27 13:33:36 +00:00
simonb	ed6b438e40	Implement the hw.disknames and hw.diskstats sysctl's that have been listed in <sys/sysctl.h> since day one but never implemented.	2002-01-27 12:41:07 +00:00
christos	83990b1879	Clear the P_INEXEC flag in other labels. Pointed out by he@netbsd.org	2002-01-12 14:20:30 +00:00
christos	d4e7e4c9c4	Fix a ptrace/execve race that could be used to modify the child process's image during execve. This is a security issue because one can do that to setuid programs... From FreeBSD.	2002-01-11 21:16:27 +00:00
manu	4398be0cfc	Added code to correctly match and launch o32 and n32 IRIX binaries (note that there is no support yet in the kernel for running n32 binaries)	2002-01-07 22:07:37 +00:00
dbj	d5738f5104	add support for the `D' kgdb detach command add comment about minimum requirement for KGDB_BUFLEN	2002-01-05 22:57:38 +00:00
thorpej	21df9c4a6e	Switch ELF emulations to ELF core files (except for netbsd32 and svr4-32).	2002-01-04 06:47:46 +00:00
mrg	d6328a8778	fix previous: actually remove the COMPAT_SUNOS code, not just #if 0 it.	2002-01-03 01:16:02 +00:00
mrg	af640de164	move the COMPAT_SUNOS SO_BROADCAST hack out of uipc_socket.c into the compat/sunos code. besides being cleaner this allows the sunos LKM to properly work without any special kernel hacks.	2002-01-03 00:59:00 +00:00
perry	dc0e791802	Happy New Year!	2002-01-01 09:17:28 +00:00
ad	281fb6d209	In clalloc(), clear the quoted char bitmap (part of the ring buffer was being cleared twice).	2001-12-27 02:27:50 +00:00
christos	eebee4d7f8	PR/14795: Christos Zoulas: Fix locking problem on MP systems where ltsleep() is calling CURSIG() which can call issignal() and issignal() could not deal with being called from a locked context. This happens when a process receives SIGTTIN, and issignal() calls psignal() to post SIGCHLD to the parent. XXX: It is really messy to have issignal() handle the job control functionality and the whole signal interlocking protocol needs to be re-designed. For now this fix (provided by enami) does the trick. I've been running with this fix for weeks, and atatat has stress-tested the kernel running ~30 make kernels...	2001-12-18 15:51:52 +00:00
chs	a8b519c880	unmap loaned pages before dropping the loan. some pmaps aren't expecting pmap_kenter_pa() to be used to replace an existing mapping, plus it just seems like a bad idea to keep around mappings of pages that may be freed and reused.	2001-12-18 08:49:40 +00:00
jdolecek	d7129f9255	fix typo in #ifdef __FreeBSD__ Pointed out by Chris Jepeway in private e-mail, thanks!	2001-12-11 18:15:09 +00:00
thorpej	0b63bf16b8	Cast off_t to (long long) for printing.	2001-12-10 02:07:37 +00:00
thorpej	06920aef28	Move the code that walks the process's VM map during a coredump into uvm_coredump_walkmap(), and use callbacks into the coredump routine to do something with each section.	2001-12-10 01:52:26 +00:00
enami	8d6b971560	KNF.	2001-12-10 01:49:26 +00:00
chs	62c2e756ed	update vnode flags in ddb vnode-printing function.	2001-12-10 01:38:48 +00:00
thorpej	f884f172e3	Don't compare an int against NULL.	2001-12-10 00:33:04 +00:00
thorpej	51535d4bf5	Add support for dumping ELF-format core files.	2001-12-09 23:05:56 +00:00
manu	3cdc6f6197	Changed clockctl interface to use syscallargs structures	2001-12-09 16:10:43 +00:00
chs	8e9cdbbd63	replace "vnode" and "vtext" with "file" and "exec" in uvmexp field names.	2001-12-09 03:07:43 +00:00
lukem	290cc55a28	- Implement uint32_t namei_hash(const char *p, const char **ep) which determines the equivalent MI hash32_str() hash for p. If *ep != NULL, calculate the hash to the character before *ep. If *ep == NULL, calculate the hash to the first / or NUL found, and point *ep to that location. - Use namei_hash() to calculate cn_hash in lookup() and relookup(). Hash distribution goes from 35-40% to 55-70%, with similar profiled time spent in cache_lookup() and cache_enter() on my P3-600. - Use namei_hash() to calculate cn_hash in nfs_readdirplusrpc(), instead of homegrown code (that differed from that in lookup() !) namei_hash() has better spread and is faster than previous code (which used a non-constant multiplication).	2001-12-08 04:09:59 +00:00
enami	df8cfd38a2	Test ".." correctly when creating reverse cache entry.	2001-12-08 04:09:56 +00:00
thorpej	205c159f0e	Make the coredump routine exec-format/emulation specific. Split out traditional NetBSD coredump routines into core_netbsd.c and netbsd32_core.c (for COMPAT_NETBSD32).	2001-12-08 00:35:25 +00:00
jdolecek	3d4c59a8ee	Back off previous for now, Jason thinks it's not right. Will discuss on tech-kern@	2001-12-07 07:09:29 +00:00
christos	d343ff07bf	remove diagnostic test checking for cr_ref = 0. It will never be, because free() will stick 0xdeadbeef into it.	2001-12-06 23:11:59 +00:00
jdolecek	255ce12e50	replace FIF_WANTCLOSE/FIF_LARVAL with FWANTCLOSE/FLARVAL, which are set in f_flag of struct file for now, keep former f_iflags of struct file as _f_spare0, it will be g/c'ed when struct file will be changed (this will happen soon)	2001-12-06 22:34:24 +00:00
chs	163b4fbc50	in vinvalbuf(), vtruncbuf() and vflushbuf(), don't skip calling VOP_PUTPAGES() just because the vnode has no pages. layered filesystems will want to pass these calls on through to the underlying filesystem, and non-layered filesystems may need to remove the vnode from the syncer queues. fix up MP locking and add some locking assertions. fixes PRs 12284 and 14640.	2001-12-06 04:34:33 +00:00
augustss	88b5ed0731	Use NULL instead of 0 in a few places.	2001-12-05 16:12:42 +00:00
enami	4654f5c754	Implement sigismasked() correctly. KNF while I'm here.	2001-12-05 07:32:24 +00:00
enami	996ee10484	Introduce new function malloc_roundup(), suggested by Bill Sommerfeld on tech-kern.	2001-12-05 01:29:04 +00:00
thorpej	03efee5585	* Allow machine-dependent code to specify hooks for ptrace(2) (__HAVE_PTRACE_MACHDEP) and procfs (__HAVE_PROCFS_MACHDEP). These changes will allow platforms like x86 (XMM) and PowerPC (AltiVec) to export extended register sets in a sane manner. * Use __HAVE_PTRACE_MACHDEP to export x86 XMM registers (standard FP + SSE/SSE2) using PT_{GET,SET}XMMREGS (in the machdep ptrace request space). * Use __HAVE_PROCFS_MACHDEP to export x86 XMM registers via /proc/N/xmmregs in procfs.	2001-12-05 00:58:05 +00:00
enami	769949847d	KNF and other cosmetic changes.	2001-12-04 23:56:36 +00:00
jdolecek	c152d680d8	Add M_CANFAIL malloc(9) flag. This flag changes behaviour in M_WAITOK case when the requested memory size can't ever be granted - instead of panic, malloc(9) would return failure (NULL). Note kernel code should do proper bound checking, rather than depend on M_CANFAIL. This flag is only supposed to be used in very special cases, where common bound checking is not appropriate. Discussed on tech-kern@, name ``M_CANFAIL'' suggested by Chuck Cranor.	2001-12-04 20:13:19 +00:00
augustss	20e5de88af	Add IRFRAMEDISC line discipline. Also, use symbolic names instead of numbers in a few places.	2001-12-03 23:30:37 +00:00
augustss	d845d14a6c	Move the code that expands the cd_devs array into a subroutine.	2001-12-02 02:40:57 +00:00
kleink	e9d7166203	asm -> __asm.	2001-11-30 16:21:16 +00:00
msaitoh	1c87566f38	fix previous commit	2001-11-30 10:31:32 +00:00
msaitoh	797d100f77	fix printf format	2001-11-30 10:06:46 +00:00
enami	b3d722b5b5	Since we know that `va' is non-NULL here, no need to test if it is NULL.	2001-11-30 01:54:21 +00:00
enami	ba7af71cd5	Use cached pointer to next buf instead of re-fetching it. GCC actually generates different code.	2001-11-30 01:31:30 +00:00
christos	0fdce3fa8a	sprinkle crcvt()	2001-11-29 21:21:29 +00:00
christos	2a73d40eab	introduce crcvt() that converts from struct uucred to struct ucred.	2001-11-29 21:21:13 +00:00
manu	9d2b93114a	Updated comment about COMPAT_IRIX: we start working on o32 first...	2001-11-28 11:45:36 +00:00
bouyer	1cd410986d	In cn_get_magic() don't increment i in the for(;;) loop, as it's already set to point to the next value with CNS_MAGIC_NEXT(). Now it's obvious that the magic string on my U5 is +++++ and not +++ :)	2001-11-27 20:00:36 +00:00
jdolecek	577b809930	crfree(): add DIAGNOSTIC sanity check of cr_ref; unfortunately, if the memory is freed prematurely the check won't be triggered immediately, probably since the memory is likely to be reused fast; but it _would_ be triggered eventually	2001-11-27 07:30:03 +00:00
manu	12c949a188	Added COMPAT_IRIX (being developed, not functional at this time)	2001-11-26 21:38:41 +00:00
jdolecek	bf7f67be85	if the LKM support is not compiled in, don't bother using exec_lock at all, it's only needed in LKM case; use #if defined(LKM) || defined(_LKM) condition for netbsd32_execve.c, to DTRT when either compiled statically into kernel with LKM support, or compiled as a LKM	2001-11-23 22:02:39 +00:00
jdolecek	59f63c238e	Two changes to setrlimit(2): * return EINVAL if specified current limit exceeds specified hard limit. This behaviour is required by SUSv2 (noted by Giles Lean on tech-kern) * return EINVAL if an attempt is made to lower stack size limit below current usage; this addresses bin/3045 by Jason Thorpe, and conforms to SUSv2	2001-11-23 18:56:33 +00:00
enami	f7fa67fcfc	Use round_page() instead of roundup(, PAGE_SIZE). Suggested by chs@netbsd.org.	2001-11-21 01:30:04 +00:00
enami	6a043f1db9	Protect multi statement macro with do {} while (0). This is not cosmetic.	2001-11-21 00:55:39 +00:00
lukem	99bccc3cab	- pull in opt_kgdb.h where necessary - replace opt_kgdb_machdep.h with opt_kgdb.h - defparam opt_kgdb.h: KGDB_DEV KGDB_DEVNAME KGDB_DEVADDR KGDB_DEVRATE KGDB_DEVMODE - move from opt_ddbparam.h to opt_ddb.h: DDB_FROMCONSOLE DDB_ONPANIC DDB_HISTORY_SIZE DDB_BREAK_CHAR SYMTAB_SPACE - replace KGDBDEV with KGDB_DEV - replace KGDBADDR with KGDB_DEVADDR - replace KGDBMODE with KGDB_DEVMODE - replace KGDBRATE with KGDB_DEVRATE - use `9600' instead of `0x2580' for 9600 baud rate - use correct quotes for options KGDB_DEVNAME="\"com\"" - use correct quotes for options KGDB_DEV="17*256+0" - remove unnecessary dependency on Makefile for kgdb_stub.o - minor whitespace cleanup	2001-11-20 08:43:19 +00:00
enami	39ab348507	Call pr_log(PRLOG_GET) when POOL_DIAGNOSTIC is defined instead of DIAGNOSTIC for consistency.	2001-11-20 06:57:04 +00:00
lukem	576eed5512	Add new malloc(9) flag M_ZERO - zeros memory before returning. From Poul-Henning Kamp's equivalent enhancement in FreeBSD.	2001-11-17 03:50:27 +00:00
lukem	2565646230	don't need <sys/types.h> when including <sys/param.h>	2001-11-15 09:47:59 +00:00
christos	88db539589	Make poll(2) ignore negative file descriptors (Bang Jun-Young).	2001-11-14 18:43:58 +00:00
lukem	43369f69d3	add RCSID	2001-11-13 13:33:44 +00:00
christos	d1dcd2f401	PR/8657: z@rentaboat.se: alarm takes more seconds than it can handle. This is a followup to PR/14558. - itimerfix(9) limited the number of seconds to 100M, before I changed it to 1000M for PR/14558. - nanosleep(2) documents a limit of 1000M seconds. - setitimer(2), select(2), and other library functions that indirectly use setitimer(2) for example alarm(3) don't specify a limit. So it only seems appropriate that any positive number of seconds in struct timeval should be accepted by any code that uses itimerfix(9) directly, except nanosleep(2) which should check for 1000M seconds manually. This change makes the manual pages of select(2), nanosleep(2), setitimer(2), and alarm(3) consistent with the code.	2001-11-13 00:34:21 +00:00
lukem	adc783d537	add RCSIDs	2001-11-12 15:25:01 +00:00
lukem	7bbd2f341b	clean up whitespace	2001-11-12 15:21:46 +00:00
lukem	6e65bad538	regen (for RCSID)	2001-11-12 15:03:35 +00:00
lukem	bbfda04203	add RCSID to generated .c files	2001-11-12 14:57:02 +00:00
christos	6ff985ffa8	PR/14558: Tero Kivinen: There is no point in limiting the number of seconds to 100 million. Use 1000 million like the man page for nanosleep suggests. This is much closer to MAXINT, and it conforms to POSIX.	2001-11-12 14:52:33 +00:00
lukem	6e69bda797	regen (for RCSID)	2001-11-12 14:50:28 +00:00
lukem	d8dd7943fe	add RCSID to generated .c file	2001-11-12 14:34:24 +00:00
christos	5c7505a94f	fix compile typo.	2001-11-10 20:49:52 +00:00
bouyer	f7bc541779	Fix a memory leak when we change pl_corename several times.	2001-11-08 21:24:47 +00:00
chs	6fbca7d0fc	use pmap_kenter_pa() instead of pmap_enter(), this is required for pages loaned to the kernel. this implies that we also need to call pmap_kremove() before uvm_km_free(). other general cleanup: remove argument names from prototypes, rename some variables, etc.	2001-11-06 07:30:14 +00:00
fvdl	f8ff451a0a	Fix reversed wait/nowait logic.	2001-10-31 22:23:18 +00:00
manu	40c8c13901	Made syscallargs.h lint-friendly, so that it can be included in libc sources more easily.	2001-10-30 21:52:26 +00:00
thorpej	e8ee04475d	- Add a new vnode flag VEXECMAP, which indicates that a vnode has executable mappings. Stop overloading VTEXT for this purpose (VTEXT also has another meaning). - Rename vn_marktext() to vn_markexec(), and use it when executable mappings of a vnode are established. - In places where we want to set VTEXT, set it in v_flag directly, rather than making a function call to do this (it no longer makes sense to use a function call, since we no longer overload VTEXT with VEXECMAP's meaning). VEXECMAP suggested by Chuq Silvers.	2001-10-30 15:32:01 +00:00
itohy	36a6ce75fe	Fix memory leak if - ktrace enabled for I/O operation, and - readv() causes an error.	2001-10-30 13:37:21 +00:00
simonb	5f717f7c33	Don't need to include <uvm/uvm_extern.h> just to include <sys/sysctl.h> anymore.	2001-10-29 07:02:30 +00:00
jdolecek	24ba90929c	Avoid using microtime(9) for atime/mtime, we don't need to have it THAT accurate and microtime(9) is painfully slow on i386 currently. This speeds up small transfers much. The gain for large transfers is less significant, but notable too. Bottleneck was found by Andreas Persson (Re: kern/14246). Performance improvement with PIII on 661 MHz according to hbench (with PIPE_MINDIRECT=8192), as buffersize / before / after: 512 / 17 / 49; 1024 / 33 / 110; 2048 / 52 / 143; 4096 / 77 / 163; 8192 / 142 / 190; 64K / 577 / 662; 128K / 372 / 392	2001-10-28 20:47:15 +00:00
christos	f7cb99f9c6	make a copy of the epp->ep_path, because it is too late to use it when copyargs is called since we've swapped vmspaces.	2001-10-28 04:46:42 +00:00
chs	a54f8441f8	in cache_lookup(), if we get a cache hit but then fail to vget() the found vnode, we should not attempt to remove the namecache entry. this is because vget() can sleep (eg. if VXLOCK is set because the vnode is being reclaimed), and so multiple threads can end up in this context at the same time. if this happens, each thread ends up removing the cache entry, but the code to remove the entry assumes that the entry is still valid. so we should just leave the (now stale) entry in the cache. if another thread finds the entry again before it is reused, that thread will notice that the entry is stale and remove it safely. fixes PR 14042.	2001-10-27 04:53:38 +00:00
chs	6bc926aa9e	in pool_drain(), call pool_reclaim() while we still have interrupts blocked since the pool in question might be one used in interrupt context.	2001-10-21 00:06:05 +00:00
thorpej	e45efd967f	Deprecate the "m_act" alias of "m_nextpkt" (m_act is a historical name), and just use m_nextpkt everywhere.	2001-10-18 20:17:24 +00:00
thorpej	9b2f51421c	Use a pool cache for namei buffers -- it's faster to allocate from a pool cache than a pool.	2001-10-17 23:33:29 +00:00
christos	bfe76ac87a	Allow userland to pass MNT_IGNORE (from enami)	2001-10-11 16:27:24 +00:00
mycroft	cbd7c4d140	When a pipe was grown to BIG_PIPE_SIZE, we could get in a select()/write() loop because pipe_poll() and pipe_write() did not agree on when it was okay to write more data. Fix pipe_write(), since it seems to be the broken one.	2001-10-08 07:50:17 +00:00
bjh21	d0844f93bb	Add support for allocating pool memory in units smaller than a whole page. This is activated by defining POOL_SUBPAGE to the size of the new allocation unit, and makes pools much more efficient on machines with obscenely large pages. It might even make four-megabyte arm26 systems usable.	2001-10-07 12:44:06 +00:00
eeh	c824ea9b57	Remove the property sysctl interface for now.	2001-10-05 19:05:05 +00:00
pooka	11d8d4dae9	add casts to debug printfs to make this compile on archs where size_t is of type int	2001-10-05 12:55:24 +00:00
eeh	e1bd5ac110	Add generic properties.	2001-10-04 18:56:06 +00:00
chs	90a3a778a7	when attempting to reclaim a vnode, tell the lockmgr() that it's ok to just fail if we already hold the lock. we'll skip that vnode and try another. fixes PR 14090.	2001-10-04 05:46:45 +00:00
chs	adb1a233b7	replace wakeup_one() with wakeup(). wakeup_one() can only be used if the woken-up thread is guaranteed to pass the buck to the next guy before going back to sleep, and the rest of the lockmgr() code doesn't do that. from Bill Sommerfeld. fixes PR 14097.	2001-09-29 21:27:49 +00:00
jdolecek	560e3c342e	Use lmin() instead of min(), and long for mlen & clen, to avoid integer overflow on LP64 architectures. This fixes kern/10070 by Juergen Weiss. Fix tested on NetBSD/alpha by Bernd Ernesti, on NetBSD/sparc64 by David Brownlee and Eduardo Horvath.	2001-09-29 14:16:19 +00:00
jdolecek	18c0643bfb	Update the uio resid counts appropriately when any error occurs (not just EPIPE), so that the higher-level code would note partial write has happened and DTRT if the write was interrupted due to e.g. delivery of signal. This fixes kern/14087 by Frank van der Linden. Much thanks to Frank for extensive help with debugging this, and review of the fix. Note: EPIPE/SIGPIPE delivery behaviour was retained - they're delivered even if the write was partially successful.	2001-09-29 13:48:11 +00:00
chs	a76b67cf09	jump through hoops to avoid calling uvm_km_free_poolpage() while holding spinlocks, since that function can sleep. (note that there's still one instance remaining to be fixed.) use TAILQ_FOREACH where appropriate.	2001-09-26 07:14:56 +00:00
enami	b0df86c9e6	In the function getnewvnode: - Mark file system busy again on retry. - Don't use the variable `listhd' uninitialized.	2001-09-26 00:59:57 +00:00
jdolecek	25bef3c837	Take care to transfer whole buffer passed via write(2); write(2) should not do short writes unless when using non-blocking I/O. This fixes kern/13744 by Geoff C. Wing. Note this partially undoes rev. 1.5 change. Upon closer examination, it's been apparent that hbench-OS expectations were not actually justified.	2001-09-25 19:01:21 +00:00
chs	039c1fd312	print a stack trace in more LOCKDEBUG cases. add a blank line between complaints. use TAILQ_FOREACH where appropriate.	2001-09-25 06:13:29 +00:00
chs	e7d9abce3e	in ltsleep(), assert that the interlock is held (if one is given).	2001-09-25 01:38:38 +00:00
chs	7bb91a0959	resize the namecache hash table also when desiredvnodes changes.	2001-09-24 06:01:13 +00:00
sommerfeld	b27c418d40	Deal with invokers who pass us environment variable settings as part of the command they want us to run. Fixes part 3A of PR#14046	2001-09-24 00:20:11 +00:00
sommerfeld	acf40b361c	Correct comment to match code	2001-09-22 22:36:30 +00:00
jdolecek	8573719e3d	add new UVM_LOAN_WIRED flag - the memory pages loaned in TOPAGE case are only wired if this flag is present (i.e. they are not wired by default now); loaned pages are unloaned via new uvm_unloan(), uvm_unloananon() and uvm_unloanpage() are no longer exported; adjust uvm_unloanpage() to unwire the pages if UVM_LOAN_WIRED is specified; mark uvm_loanuobj() and uvm_loanzero() static also in function implementation; kern/sys_pipe.c: uvm_unloanpage() --> uvm_unloan()	2001-09-22 05:58:04 +00:00
chs	d0b76cab5e	use shared locks instead of exclusive for VOP_READ() and VOP_READDIR().	2001-09-21 08:02:55 +00:00
jdolecek	1d161cb2d4	call pmap_update() after pmap_enter()s; ALWAYS call uvm_unloanpage() in cleanup - it's necessary even in pipe_loan_free() case, since uvm_km_free() doesn't seem to implicitly unloan the loaned pages	2001-09-20 19:09:13 +00:00
jdolecek	ef8abe0767	Make the setregs hook emulation-specific, rather than executable format specific. Struct emul has a e_setregs hook back, which points to emulation-specific setregs function. es_setregs of struct execsw now only points to optional executable-specific setup function (this is only used for ECOFF).	2001-09-18 19:36:32 +00:00
jdolecek	a7357fecf4	soreceive(): do not ignore uiomove() error. Problem reported and fix provided by Aaro Koskinen in kern/11692.	2001-09-17 18:59:29 +00:00
wiz	456dff6cb8	Spell 'occurred' with two 'r's.	2001-09-16 16:34:23 +00:00
jmc	87a006a7de	Fix issues around compilation errors (ntv is a pointer in adjtime1 and needs -> rather than . access and adjtime1 needs the syscall args passed in)	2001-09-16 09:55:26 +00:00
manu	adb22920bc	Split root-only time-related system calls so that we have an upper part, that checks root privs, and a lower part that does the actual job. The lower part will be called by the upcoming clockctl driver. Approved by Christos. Also fixed a few cosmetic things.	2001-09-16 06:50:06 +00:00
chs	d9faf4d4e7	regen	2001-09-15 20:37:36 +00:00
chs	64c6d1d2dc	a whole bunch of changes to improve performance and robustness under load: - remove special treatment of pager_map mappings in pmaps. this is required now, since I've removed the globals that expose the address range. pager_map now uses pmap_kenter_pa() instead of pmap_enter(), so there's no longer any need to special-case it. - eliminate struct uvm_vnode by moving its fields into struct vnode. - rewrite the pageout path. the pager is now responsible for handling the high-level requests instead of only getting control after a bunch of work has already been done on its behalf. this will allow us to UBCify LFS, which needs tighter control over its pages than other filesystems do. writing a page to disk no longer requires making it read-only, which allows us to write wired pages without causing all kinds of havoc. - use a new PG_PAGEOUT flag to indicate that a page should be freed on behalf of the pagedaemon when it's unlocked. this flag is very similar to PG_RELEASED, but unlike PG_RELEASED, PG_PAGEOUT can be cleared if the pageout fails due to eg. an indirect-block buffer being locked. this allows us to remove the "version" field from struct vm_page, and together with shrinking "loan_count" from 32 bits to 16, struct vm_page is now 4 bytes smaller. - no longer use PG_RELEASED for swap-backed pages. if the page is busy because it's being paged out, we can't release the swap slot to be reallocated until that write is complete, but unlike with vnodes we don't keep a count of in-progress writes so there's no good way to know when the write is done. instead, when we need to free a busy swap-backed page, just sleep until we can get it busy ourselves. - implement a fast-path for extending writes which allows us to avoid zeroing new pages. this substantially reduces cpu usage. - encapsulate the data used by the genfs code in a struct genfs_node, which must be the first element of the filesystem-specific vnode data for filesystems which use genfs_{get,put}pages(). - eliminate many of the UVM pagerops, since they aren't needed anymore now that the pager "put" operation is a higher-level operation. - enhance the genfs code to allow NFS to use the genfs_{get,put}pages instead of a modified copy. - clean up struct vnode by removing all the fields that used to be used by the vfs_cluster.c code (which we don't use anymore with UBC). - remove kmem_object and mb_object since they were useless. instead of allocating pages to these objects, we now just allocate pages with no object. such pages are mapped in the kernel until they are freed, so we can use the mapping to find the page to free it. this allows us to remove splvm() protection in several places. The sum of all these changes improves write throughput on my decstation 5000/200 to within 1% of the rate of NetBSD 1.5 and reduces the elapsed time for "make release" of a NetBSD 1.5 source tree on my 128MB pc to 10% less than a 1.5 kernel took.	2001-09-15 20:36:31 +00:00
chs	adf5d360a7	add a new VFS op, vfs_reinit, which is called when desiredvnodes is adjusted via sysctl. file systems that have hash tables which are sized based on the value of this variable now resize those hash tables using the new value. the max number of FFS softdeps is also recalculated. convert various file systems to use the <sys/queue.h> macros for their hash tables.	2001-09-15 16:12:54 +00:00
enami	163c9dd7c1	Defopt CALLWHEEL_STATS.	2001-09-13 05:22:16 +00:00
thorpej	16c229ea7c	Optimization suggested by Bill Sommerfeld: Keep a hint as to the "earliest" firing callout in a bucket. This allows us to skip the scan up the bucket if no callouts are due in the bucket. A cheap O(1) hint update is done at callout insertion (if new callout is earlier than hint) and removal (is bucket empty). A thorough refresh of the hint is done when the bucket is traversed. This doesn't matter much on machines with small values of hz (e.g. i386), but on systems with large values of hz (e.g. Alpha), it has a definite positive effect. Also, keep the callwheel stats in evcnts, so that you can view them with "vmstat -e".	2001-09-11 04:32:19 +00:00
christos	fdad5eb59c	Don't trash the ref count of cred. It causes a memory leak.	2001-09-08 15:34:06 +00:00
christos	f556e46298	Hijack the credentials used to evaluate access, to avoid a potential lwp race by modifying the proc's credentials temporarily. From Bill Sommerfeld. Thanks for forcing me to do this right :-)	2001-09-08 02:05:39 +00:00
christos	51d673156c	Set the credentials to be used in the NDINIT macro so that syscalls can hijack them.	2001-09-08 02:02:04 +00:00
enami	405f874060	extent_alloc_subregion shouldn't allocate a region past the end of subregion.	2001-08-27 13:35:44 +00:00
thorpej	bbd3d01db5	Artur Grabowski's simple brute-force malloc debugger, which uses guard pages. Can only debug one malloc type at a time, and nothing larger than 1 page. But can be useful for debugging certain types of "data modified on freelist" type problems. Modified from code in OpenBSD.	2001-08-17 00:48:29 +00:00
chs	e9fbc91f95	user maps are always pageable.	2001-08-16 01:37:50 +00:00
enami	256a367268	Give different names for different wait channels. # and minor knf fix while I'm here.	2001-08-05 08:25:39 +00:00
jdolecek	332bb4894a	bound check mount args more thoroughly	2001-08-03 06:00:13 +00:00
jdolecek	c650c8d91c	make local 'nsops' same unsigned type as the value we store there in sys_semop()	2001-08-03 05:58:18 +00:00
christos	f2a8a907b4	simplify an expression.	2001-07-29 21:24:23 +00:00
christos	3e1e7a68e5	adjust to the new copyargs footprint	2001-07-29 21:22:58 +00:00
christos	e2b51c04af	- make copyargs function return the reason it failed and pass a pointer to the stack, so that it can be modified. - pass the error code in the exit code in addition to aborting. - kill the second exit1() call; it does not make any sense.	2001-07-29 21:22:42 +00:00
thorpej	9e74a14e1f	Add a sysctl to frob sb_max.	2001-07-27 21:19:09 +00:00
thorpej	59612894e1	Now that M_TRAILINGSPACE() checks buffer writeability properly, we can greatly simplify sbcompress(). Slightly modified from a similar change in FreeBSD.	2001-07-27 19:27:49 +00:00
thorpej	fcc2e4e5f6	Use pool_cache_*() for mbufs and clusters. While we don't use the ctor/dtor feature, it's still faster to allocate from the cache groups than it is from the pool (cache groups are analogous to "magazines" in the Solaris SLAB allocator).	2001-07-26 19:05:04 +00:00
jdolecek	875b784599	pipe_create(): explicitly zero whole memory returned from pool_get(), instead of some selective pieces. This fixes problem with NEW_PIPE in kernels with DEBUG option, reported via e-mail by Chuck Silvers. sys_pipe(): g/c fdp, provide it at the chunk of FreeBSD code where it's used	2001-07-26 14:14:28 +00:00
wiz	f1bde32520	s/necesary/necessary/	2001-07-24 16:43:02 +00:00
assar	bec71dc090	change vop_symlink and vop_mknod to return vpp (the created node) refed, so that the caller can actually use it. update callers and file systems that implement these vnode operations	2001-07-24 15:39:30 +00:00

3081 Commits