NetBSD

Author	SHA1	Message	Date
itojun	04ac848d6f	introduce m->m_pkthdr.aux to hold random data which needs to be passed between protocol handlers. ipsec socket pointers, ipsec decryption/auth information, tunnel decapsulation information are in my mind - there can be several other usage. at this moment, we use this for ipsec socket pointer passing. this will avoid reuse of m->m_pkthdr.rcvif in ipsec code. due to the change, MHLEN will be decreased by sizeof(void *) - for example, for i386, MHLEN was 100 bytes, but is now 96 bytes. we may want to increase MSIZE from 128 to 256 for some of our architectures. take caution if you use it for keeping some data item for long period of time - use extra caution on M_PREPEND() or m_adj(), as they may result in loss of m->m_pkthdr.aux pointer (and mbuf leak). this will bump kernel version. (as discussed in tech-net, tested in kame tree)	2000-03-01 12:49:27 +00:00
thorpej	b178e1f58c	Add support for rate-limiting RSTs sent in response to no socket for an incoming packet. Default minimum interval is 10ms. The interval is changeable via the "net.inet.tcp.rstratelimit" sysctl variable.	2000-02-15 19:54:11 +00:00
thorpej	312cb38ccb	In the tcp_input() path: - Filter out multicast destinations explicitly for every incoming packet, not just SYNs. Previously, non-SYN multicast destination would be filtered out as a side effect of PCB lookup. Remove now redundant similar checks in the dropwithreset case and in syn_cache_add(). - Defer the TCP checksum until we know that we want to process the packet (i.e. have a non-CLOSED connection or a listen socket).	2000-02-12 17:19:34 +00:00
itojun	1a2a1e2b1f	bring in latest KAME ipsec tree. - interop issues in ipcomp is fixed - padding type (after ESP) is configurable - key database memory management (need more fixes) - policy specification is revisited XXX m->m_pkthdr.rcvif is still overloaded - hope to fix it soon	2000-01-31 14:18:52 +00:00
itojun	dc0f1c0435	drop IPv6 packets with v4 mapped address on src/dst. they are illegal and may be used to fool IPv6 implementations (by using ::ffff:127.0.0.1 as source you may be able to pretend the packet is from local node)	1999-12-22 04:03:01 +00:00
itojun	abddb5f851	do not overwrite traffic class field when we write IPv6 version field.	1999-12-15 06:28:43 +00:00
itojun	ea861f0183	sync IPv6 part with latest KAME tree. IPsec part is left unmodified due to massive changes in KAME side. - IPv6 output goes through nd6_output - faith can capture IPv4 packets as well - you can run IPv4-to-IPv6 translator using heavily modified DNS servers - per-interface statistics (required for IPv6 MIB) - interface autoconfig is revisited - udp input handling has a big change for mapped address support. - introduce in4_cksum() for non-overwriting checksumming - introduce m_pulldown() - neighbor discovery cleanups/improvements - netinet/in.h strictly conforms to RFC2553 (no extra defs visible to userland) - IFA_STATS is fixed a bit (not tested) - and more more more. TODO: - cleanup os-independency #ifdef - avoid rcvif dual use (for IPsec) to help ifdetach (sorry for jumbo commit, I can't separate this any more...)	1999-12-13 15:17:17 +00:00
itojun	4d757da195	implement upper-layer reachability confirmation for IPv6 ND (RFC2461 7.3.1). fix code to reject "tcp to IPv6 anycast". sync with recent KAME.	1999-12-11 09:55:14 +00:00
itojun	313f5eb9cd	do not drop from IP header to tcp option until sbappend(), to reduce requirement to mbuf chain. part of KAME sync, committed separately for its (possible) impact.	1999-12-08 16:22:20 +00:00
itojun	9474edfcd8	cleanup and correct TCP MSS consideration with IPsec headers. MSS advertisement must always be: max(if mtu) - ip hdr siz - tcp hdr siz We violated this in the previous code so it was fixed. tcp_mss_to_advertise() now takes af (af on wire) as its argument, to compute right ip hdr siz. tcp_segsize() will take care of IPsec header size. One thing I'm not really sure is how to handle IPsec header size in rxsegsizep (inbound segment size estimation). The current code subtracts possible outbound* IPsec size from *rxsegsizep, hoping that the peer is using the same IPsec policy as me. It may not be applicable, could TCP guru please comment...	1999-09-23 02:21:30 +00:00
simonb	fd8040a031	s/acknowledgment/acknowledgement/	1999-09-10 03:24:14 +00:00
thorpej	1e921673e3	Fix a problem discovered by the snd_recover update fix. A bit of the New Reno fast recovery code was being executed even when New Reno was disabled, resulting in an unfortunate interaction with the traditional fast recovery code, the end result being that the very condition that would trigger the traditional fast recovery mechanism caused fast recovery to be disabled! Problem reported by Ted Lemon, and some analytical help from Charles Hannum.	1999-08-26 00:04:30 +00:00
itojun	809ab7f1ff	When listening socket goes away, remove associated syn cache entries. Stale syn cache entries are useless because none of them will be used if there is no listening socket, as tcp_input looks up listening socket by in_pcblookup*() before looking into syn cache. This fixes race condition due to dangling socket pointer from syn cache entries to listening socket (this was introduced when ipsec is merged in). This should preserve currently implemented behavior (but not 4.4BSD behavior prior to syn cache). Tested in KAME repository before commit, but we'd better run some regression tests.	1999-08-25 15:23:12 +00:00
christos	d6f8878423	PR/8254: Wolfgang Rupprecht: Incorrect logging of tcp connections; Fix src/dst confusion.	1999-08-23 14:14:30 +00:00
thorpej	af1e02ad91	Fix a few bugs in the TCP New Reno code: - Make sure that snd_recover is always at least snd_una. If we don't do this, there can be confusion when sequence numbers wrap around on a large loss-free data transfer. - When doing a New Reno retransmit, snd_una hasn't been updated yet, and the socket's send buffer has not yet dropped off ACK'd data, so don't muddle with snd_una, so that tcp_output() gets the correct data offset. - When doing a New Reno retransmit, make sure the congestion window is open one segment beyond the ACK'd data, so that we can actually perform the retransmit. Partially derived from, although more complete than, similar changes in OpenBSD, which in turn originated from Tom Henderson <tomh@cs.berkeley.edu>.	1999-08-11 17:37:59 +00:00
thorpej	e48f29e82b	Make sure the echoed RFC 1323 timestamp is valid before using it to compute the round trip time. From Mark Allman <mallman@lerc.nasa.gov>.	1999-08-11 03:02:18 +00:00
itojun	7fee35f579	- implement IPv6 pmtud, which is necessary for TCP6. - fix memory leak on SO_DEBUG over TCP.	1999-07-22 12:56:56 +00:00
itojun	b479094c45	no need to include faith.h on non-IPv6 build, so wrap by #ifdef. (dunno if it's better to always include it or not)	1999-07-17 12:53:05 +00:00
itojun	c74f79d16f	fix faith interface support. need testing. (i understand this is a dirty hack, of course)	1999-07-17 07:07:08 +00:00
itojun	685747d56c	Use proper ip protocol # field and tcp hdr on sending RST against SYN, when ip header and tcp header are not adjacent to each other (i.e. when ip6 options are attached). To test this, try telnet @::1@::1 port toward a port without responding server. Prior to the fix, the kernel will generate broken RST packet.	1999-07-14 22:37:13 +00:00
thorpej	f9a7668b3f	defopt IPSEC and IPSEC_ESP (both into opt_ipsec.h).	1999-07-09 22:57:15 +00:00
itojun	4b961b81e3	avoid "variable not initialized" warnings on some of the platforms.	1999-07-02 12:45:32 +00:00
itojun	118d2b1d4f	IPv6 kernel code, based on KAME/NetBSD 1.4, SNAP kit 19990628. (Sorry for a big commit, I can't separate this into several pieces...) Pls check sys/netinet6/TODO and sys/netinet6/IMPLEMENTATION for details. - sys/kern: do not assume single mbuf, accept chained mbuf on passing data from userland to kernel (or other way round). - "midway" ATM card: ATM PVC pseudo device support, like those done in ALTQ package (ftp://ftp.csl.sony.co.jp/pub/kjc/). - sys/netinet/tcp: IPv4/v6 dual stack tcp support. - sys/netinet/{ip6,icmp6}.h, sys/net/pfkeyv2.h: IETF document assumes those file to be there so we patch it up. - sys/netinet: IPsec additions are here and there. - sys/netinet6/: most of IPv6 code sits here. - sys/netkey: IPsec key management code - dev/pci/pcidevs: regen In my understanding no code here is subject to export control so it should be safe.	1999-07-01 08:12:45 +00:00
ad	ccc7e59e1f	Add new sysctl (net.inet.tcp.log_refused) that when set, causes refused TCP connections to be logged.	1999-05-23 20:33:50 +00:00
thorpej	3faa72bd56	Fix an uninitialized variable that the MIPS compiler caught (but the SPARC, Alpha, Arm, and i386 compilers missed).	1999-05-03 23:30:27 +00:00
thorpej	2cd33a0ce1	Implement retransmit logic for the SYN cache engine. Fixes a rare condition where one side can think a connection exists, where the other side thinks the connection was never established. The original problem was first reported by Ty Sarna in PR #5909. The original fix I made to the code didn't cover all cases. The problem this fix addresses was reported by Christoph Badura via private e-mail. Many thanks to Bill Sommerfeld for helping me to test this code, and for finding a subtle bug.	1999-04-29 03:54:22 +00:00
simonb	be3adbebcc	Don't extern sb_max, <sys/socketvar.h> provides a definition.	1999-04-22 01:32:30 +00:00
kml	a7f8ef5e9b	Ensure that out of window SYNs receive an ACK in response, rather than being dropped. This fixes a bug reported by Jason Thorpe.	1999-04-09 22:01:07 +00:00
matt	7ebd19d744	According to Dave Borman, the iss should be using snd_nxt and not rcv_nxt (from tcp_impl mailing-list).	1999-02-05 22:37:24 +00:00
explorer	25d32ef34d	REALLY only update the window when we get an ACK. (the old code seemed broken)	1999-02-04 22:58:37 +00:00
thorpej	86e2c3fbc6	* Completely rewrite syn_cache_respond(). - Don't use tcp_respond(), instead create the tcp/ip header from scratch, and send it ourself. - Reuse the mbuf that carried the SYN, or allocate one if that is not available. - Cache the route we look up to do the Path MTU Discovery check, and transfer the reference to that route to the inpcb when the connection completes. * Macro'ize a small, but often repeated code fragment.	1999-01-24 01:19:28 +00:00
mycroft	7eeb5a04da	Don't screw with ip_len; just subtract from it where we actually use the value.	1999-01-19 23:03:20 +00:00
mycroft	fc1211a6ab	Don't overwrite the checksum fields when checking them. There's no reason to do this, and it screws up ICMP replies. XXX The returned IP checksum and length are still wrong.	1999-01-19 21:58:40 +00:00
thorpej	4f177aec90	Add a lock around the TCPCB's sequence queue, to prevent tcp_drain() from corrupting the queue if called from a device's interrupt context. Similar in nature to the problem reported in PR #5684.	1998-12-18 21:38:02 +00:00
thorpej	974aa74abd	Use the pool allocator for ipqent structures.	1998-10-08 01:19:25 +00:00
matt	bf4e491879	Fix boolean dyslexic test. Duh!	1998-10-06 00:41:13 +00:00
matt	8e8f38e0f2	Add a sysctl for newreno (default to off).	1998-10-06 00:20:44 +00:00
matt	25054b5cf7	Adapt the NEWRENO changes from the UCSB diffs of BSDI 3.0's TCP to NetBSD. Ignore the SACK & FACK stuff for now.	1998-10-04 21:33:52 +00:00
mycroft	4a000a54e6	Fix a typo (not mine) in a comment.	1998-09-19 04:34:34 +00:00
mycroft	04ef3bf88d	If we're in LISTEN state and all of RST, SYN and ACK are clear, send a RST.	1998-09-19 04:32:51 +00:00
mouse	b95116821c	Create tcp.keepidle, tcp.keepintvl, tcp.keepcnt, tcp.slowhz sysctls.	1998-09-10 10:46:03 +00:00
thorpej	4dbfe05f1f	Use an algorithm similar to that in tcp_notify() to determine if syn_cache_unreach() should remove the entry, or just continue on. Algorithm is to only remove the entry if we've had more than one unreach error and have retransmitted 3 or more times. This prevents the following scenario, as noted in PR #5909 (PR from Ty Sarna, scenario from Charles Hannum): * Host A sends a SYN. * Host A retransmits the SYN. * Host B gets the first SYN and sends a SYN-ACK. * Host B gets the second SYN and sends a SYN-ACK. * One of the SYN-ACK bounces with an ICMP unreachable, causing the `SYN cache' entry to be removed with no notification. * Host A receives the other SYN-ACK, sends an ACK, and goes to ESTABLISHED state. Should fix PR #5909.	1998-09-09 01:32:27 +00:00
thorpej	d319e4b419	Use the pool allocator for syn_cache entries.	1998-08-02 00:35:51 +00:00
thorpej	a3f4316cba	Clarify that we are using the Loss Window if a retransmission occurred during the three-way handshake.	1998-07-17 22:58:56 +00:00
thorpej	b22946827d	Add a comment explaining why we do _not_ ACK data that might accompany a SYN (avoidance of a DoS attack).	1998-06-02 18:33:02 +00:00
thorpej	5596fe2614	Nuke TUBA per my note to tech-net; there's no reason to keep it around.	1998-05-11 19:57:23 +00:00
thorpej	ce3d776874	Rework the syn cache code somewhat: - Don't use home-grown queue manipulation. Use <sys/queue.h> instead. The data structures are a little larger, but we are otherwise wasting the memory chunk anyway (we're already a 64-byte malloc bucket). - Fix a bug in the cache-is-full case: if the oldest element removed from the first non-empty bucket was the only element in the bucket, the bucket wouldn't be removed from the bucket cache, causing queue corruption later. - Optimize the syn cache timers by using PRT timers rather than home-grown decrement-and-propagate timers. This code is now a fair bit smaller, and significantly easier to read and understand.	1998-05-07 01:37:27 +00:00
thorpej	1ffa60ac01	Use macros from tcp_timer.h to manipulate TCP timers, so that their implementation can be changed easily.	1998-05-06 01:21:20 +00:00
thorpej	e44c4fb7d3	Once again, move a declaration for the benefit of TUBA (grumble).	1998-05-03 19:54:56 +00:00
thorpej	b9fc258065	Oops, move a variable declaration so TUBA won't lose.	1998-05-02 04:23:05 +00:00
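
Commit 9474edfcd8 states the MSS advertisement rule as "max(if mtu) - ip hdr siz - tcp hdr siz", with the address family on the wire selecting the IP header size. The sketch below only works through that arithmetic. It is not the kernel's tcp_mss_to_advertise(); the function name, the enum, and the option-free header sizes (20 bytes for IPv4, 40 for IPv6, 20 for TCP) are assumptions made for illustration.

```c
/* Base header sizes without options or extension headers (assumed). */
enum { IPV4_HDR_SIZE = 20, IPV6_HDR_SIZE = 40, TCP_HDR_SIZE = 20 };

/* Hypothetical tag for the address family seen on the wire. */
enum wire_af { WIRE_AF_INET, WIRE_AF_INET6 };

/*
 * MSS to advertise = largest interface MTU - IP header - TCP header.
 * IPsec overhead is deliberately left out here; the commit says that
 * is handled separately in tcp_segsize().
 */
int
mss_to_advertise_sketch(int max_if_mtu, enum wire_af af)
{
	int ip_hdr = (af == WIRE_AF_INET6) ? IPV6_HDR_SIZE : IPV4_HDR_SIZE;

	return max_if_mtu - ip_hdr - TCP_HDR_SIZE;
}
```

With a 1500-byte MTU this gives 1460 for IPv4 and 1440 for IPv6.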
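
Commit 4dbfe05f1f describes the rule syn_cache_unreach() uses on an ICMP unreachable: remove the entry only if there has been more than one unreach error and the SYN-ACK has been retransmitted 3 or more times. A minimal sketch of that predicate follows. The struct and field names are hypothetical and do not come from the NetBSD source, and it assumes the caller has already counted the current error.

```c
#include <stdbool.h>

/* Hypothetical per-entry counters; not the kernel's syn_cache layout. */
struct syn_entry_sketch {
	int unreach_errors;	/* ICMP unreachables seen, current one included */
	int retransmits;	/* SYN-ACK retransmissions so far */
};

/*
 * Return true when the entry should be dropped: more than one
 * unreach error AND at least three retransmissions. A single bounced
 * SYN-ACK therefore leaves the entry in place, which avoids the
 * scenario in PR #5909 where the peer still reaches ESTABLISHED.
 */
bool
syn_entry_should_remove(const struct syn_entry_sketch *e)
{
	return e->unreach_errors > 1 && e->retransmits >= 3;
}
```

Requiring both conditions means a transient unreachable early in the handshake is ignored, while a persistently unreachable peer is still cleaned up.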

105 Commits