NetBSD

Author	SHA1	Message	Date
ozaki-r	67412bb47f	Replace malloc for DAD with kmem and move them out of the lock for DAD	2017-02-21 03:58:23 +00:00
ozaki-r	c5696d3c25	Rename if_acquire_NOMPSAFE to if_acquire It can be used in MP-safe ways. So let's remove the confusing postfix. If it's used in a unsafe way, warn NOMPSAFE in a comment.	2017-02-17 03:57:17 +00:00
knakahara	706b73f634	add missing files.	2017-02-16 08:23:35 +00:00
knakahara	939a415a7d	add l2tp(4) L2TPv3 interface. originally implemented by IIJ SEIL team.	2017-02-16 08:12:43 +00:00
ozaki-r	3f909d1769	Do ND in L2_output in the same manner as arpresolve The benefits of this change are: - The flow is consistent with IPv4 (and FreeBSD and OpenBSD) - old: ip6_output => nd6_output (do ND if needed) => L2_output (lookup a stored cache) - new: ip6_output => L2_output (lookup a cache. Do ND if cache not found) - We can remove some workarounds in nd6_output - We can move L2 specific operations to their own place - The performance slightly improves because one cache lookup is reduced	2017-02-14 03:05:06 +00:00
ozaki-r	19c4d830db	Protect mtudisc and redirect stuffs of icmp/icmp6 with mutex We have to run pr_init of icmp and icmp6 prior to tcp and tcp6 ones for mutex initialization.	2017-02-13 07:18:20 +00:00
ozaki-r	b070ee09f7	Replace splnet with splsoftnet	2017-02-13 04:05:58 +00:00
ozaki-r	57c38b2894	Add missing NULL checks for m_get_rcvif	2017-02-07 02:38:08 +00:00
ozaki-r	589739056f	Defer some pr_input to workqueue pr_input is currently called in softint. Some pr_input such as ICMP, ICMPv6 and CARP can add/delete/update IP addresses and routing table entries. For example, icmp6_redirect_input updates an a routing table entry and nd6_ra_input may delete an IP address. Basically such operations shouldn't be done in softint. That aside, we have a reason to avoid the situation; psz/psref waits cannot be used in softint, however they are required to work in such pr_input in the MP-safe world. The change implements the workqueue pr_input framework called wqinput which provides a means to defer pr_input of a protocol to workqueue easily. Currently icmp_input, icmp6_input, carp_proto_input and carp6_proto_input are deferred to workqueue by the framework. Proposed and discussed on tech-kern and tech-net	2017-02-02 02:52:10 +00:00
ozaki-r	9e8d969cf0	Tweak softnet_lock and NET_MPSAFE - Don't hold softnet_lock in some functions if NET_MPSAFE - Add softnet_lock to sysctl_net_inet_icmp_redirtimeout - Add softnet_lock to expire_upcalls of ip_mroute.c - Restore softnet_lock for in{,6}_pcbpurgeif{,0} if NET_MPSAFE - Mark some softnet_lock for future work	2017-01-24 07:09:24 +00:00
ozaki-r	c26964ba3f	Replace some splnet with splsoftnet	2017-01-23 10:19:03 +00:00
ozaki-r	14cc93cb28	Get rid of splnet for pool(9) We don't need it anymore.	2017-01-23 09:14:24 +00:00
christos	35561f6b22	ip6_sprintf -> IN6_PRINT so that we pass the size.	2017-01-16 15:44:46 +00:00
ozaki-r	e4b1e1923a	Remove KASSERT (revert in6.c,v 1.232) We don't need it (it's harmless though).	2017-01-16 08:26:30 +00:00
ryo	28f4c24cc2	Make ip6_sprintf(), in_fmtaddr(), lla_snprintf() and icmp6_redirect_diag() mpsafe. Reviewed by ozaki-r@	2017-01-16 07:33:36 +00:00
ozaki-r	e4f13796f3	Tweak icmp6_input; always use off, not *offp	2017-01-13 10:38:37 +00:00
ozaki-r	046e2eafb0	Prevent in6_ifaddr from being freed with holding its psref This is a possible fix for PR kern/51828.	2017-01-12 04:43:59 +00:00
christos	88f657f139	Add KASSERT.	2017-01-11 18:25:46 +00:00
ozaki-r	2b82ef9b8f	Get rid of unnecessary header inclusions	2017-01-11 13:08:29 +00:00
ozaki-r	a94a205118	Enable some sysctl knobs on rump kernels for ifmcstat	2017-01-10 05:42:34 +00:00
knakahara	cc189cdb90	remove unnecessary conversion. gif_softc->gif_pdst is already valid sockaddr.	2017-01-06 03:25:13 +00:00
christos	92bd3d8559	- kill NULL argument from in6_update_ifa - amend in6_update_ifa1 to return the ia, so that we can use it in pfil hooks to avoid NULL pointer crash.	2017-01-04 19:37:14 +00:00
christos	042834e792	simplify, and call the hooks after the address has been deleted like we did for the ipv4 case.	2017-01-03 15:14:31 +00:00
ryo	30456e82a3	In the case of SIOCDIFADDR, call pfil_run_addrhooks before release ia.	2016-12-31 09:41:05 +00:00
ozaki-r	12da772ecc	Fix panic in pfil_run_hooks on bootup XXX a kernel with pf still fails to boot up. Please someone fix it.	2016-12-27 10:53:11 +00:00
ozaki-r	ec260ed075	Remove assertion that the lock isn't held It's useless in this case, because without it we can know that the lock is held or not on a next lock acquisition and even more if LOCKDEBUG is enabled a failure on the acquisition will provide useful information for debugging while an assertion failure will provide just the fact that the assertion failed.	2016-12-22 03:46:51 +00:00
ozaki-r	6261537b3d	Fix deadlock between llentry timers and destruction of llentry llentry timer (of nd6) holds both llentry's lock and softnet_lock. A caller also holds them and calls callout_halt to wait for the timer to quit. However we can pass only one lock to callout_halt, so passing either of them can cause a deadlock. Fix it by avoid calling callout_halt without holding llentry's lock. BTW in the first place we cannot pass llentry's lock to callout_halt because it's a rwlock...	2016-12-21 08:47:02 +00:00
ozaki-r	3f765c212e	Hold the big locks only where they are needed	2016-12-21 04:08:47 +00:00
ozaki-r	3adf4b3b3e	Protect IPv6 default router and prefix lists with coarse-grained rwlock in6_purgeaddr (in6_unlink_ifa) itself unrefernces a prefix entry and calls nd6_prelist_remove if the counter becomes 0, so callers doesn't need to handle the reference counting. Performance-sensitive paths (sending/forwarding packets) call just one reader lock. This is a trade-off between performance impact vs. the amount of efforts; if we want to remove the reader lock, we need huge amount of works including destroying objects with psz/psref in softint, for example.	2016-12-19 07:51:34 +00:00
ozaki-r	834995ee17	Kill pr->ndpr_refcnt = 0 The reference counter represents the numuber of references from IPv6 addresses to a prefix entry. If all IPv6 addresses assigned to an interface are purged, all references to a prefix for the interface are also released. For now nd6_purge is always called after purging all IPv6 addresses, so we can get rid of clearing pr->ndpr_refcnt from nd6_purge and instead we can assert it's 0 there. Note that nd6_ifdetach is only called via dom_ifdetach when processing if_detach where dom_ifdetach is called after pr_purgeif that eventually calls in6_ifdetach. So in the call path nd6_purge in nd6_ifdetach does nothing. That said, we should explicitly make it sure to purge all IPv6 addresses before nd6_purge for future changes (or the case I missed something). So if_purgeaddrs is added to nd6_ifdetach.	2016-12-19 04:52:17 +00:00
ozaki-r	1ce4a727f8	Get rid of extra nd6_purge from in6_ifdetach There were two nd6_purge in in6_ifdetach for some reason, but at least now We don't need extra nd6_purge. Remove it and instead add assertions that check if surely purged.	2016-12-19 03:32:54 +00:00
ozaki-r	dd8638eea5	Move bpf_mtap and if_ipackets++ on Rx of each driver to percpuq if_input The benefits of the change are: - We can reduce codes - We can provide the same behavior between drivers - Where/When if_ipackets is counted up - Note that some drivers still update packet statistics in their own way (periodical update) - Moved bpf_mtap run in softint - This makes it easy to MP-ify bpf Proposed on tech-kern and tech-net	2016-12-15 09:28:02 +00:00
knakahara	237f476937	fix race of gif_softc->gif_ro when we send multiple flows over gif on NET_MPSAFE enabled kernel. make gif_softc->gif_ro percpu as well as ipforward_rt to resolve this race. and add future TODO comment for etherip(4).	2016-12-14 11:19:15 +00:00
ozaki-r	8e7c1da780	Reduce return points No functional change intended.	2016-12-14 06:33:01 +00:00
ozaki-r	0195cee9ea	Use macro to iterate on the nd_prefix list	2016-12-14 04:13:50 +00:00
ozaki-r	9d79cf8c86	Make functions static	2016-12-14 04:05:11 +00:00
ozaki-r	44375ea93d	Remove unnecessary inclusions of nd6.h	2016-12-13 08:29:03 +00:00
ozaki-r	6fb8880601	Make the routing table and rtcaches MP-safe See the following descriptions for details. Proposed on tech-kern and tech-net Overview -------- We protect the routing table with a rwock and protect rtcaches with another rwlock. Each rtentry is protected from being freed or updated via reference counting and psref. Global rwlocks -------------- There are two rwlocks; one for the routing table (rt_lock) and the other for rtcaches (rtcache_lock). rtcache_lock covers all existing rtcaches; there may have room for optimizations (future work). The locking order is rtcache_lock first and rt_lock is next. rtentry references ------------------ References to an rtentry is managed with reference counting and psref. Either of the two mechanisms is used depending on where a rtentry is obtained. Reference counting is used when we obtain a rtentry from the routing table directly via rtalloc1 and rtrequest{,1} while psref is used when we obtain a rtentry from a rtcache via rtcache_* APIs. In both cases, a caller can sleep/block with holding an obtained rtentry. The reasons why we use two different mechanisms are (i) only using reference counting hurts the performance due to atomic instructions (rtcache case) (ii) ease of implementation; applying psref to APIs such rtaloc1 and rtrequest{,1} requires additional works (adding a local variable and an argument). We will finally migrate to use only psref but we can do it when we have a lockless routing table alternative. Reference counting for rtentry ------------------------------ rt_refcnt now doesn't count permanent references such as for rt_timers and rtcaches, instead it is used only for temporal references when obtaining a rtentry via rtalloc1 and rtrequest{,1}. We can do so because destroying a rtentry always involves removing references of rt_timers and rtcaches to the rtentry and we don't need to track such references. This also makes it easy to wait for readers to release references on deleting or updating a rtentry, i.e., we can simply wait until the reference counter is 0 or 1. (If there are permanent references the counter can be arbitrary.) rt_ref increments a reference counter of a rtentry and rt_unref decrements it. rt_ref is called inside APIs (rtalloc1 and rtrequest{,1} so users don't need to care about it while users must call rt_unref to an obtained rtentry after using it. rtfree is removed and we use rt_unref and rt_free instead. rt_unref now just decrements the counter of a given rtentry and rt_free just tries to destroy a given rtentry. See the next section for destructions of rtentries by rt_free. Destructions of rtentries ------------------------- We destroy a rtentry only when we call rtrequst{,1}(RTM_DELETE); the original implementation can destroy in any rtfree where it's the last reference. If we use reference counting or psref, it's easy to understand if the place that a rtentry is destroyed is fixed. rt_free waits for references to a given rtentry to be released before actually destroying the rtentry. rt_free uses a condition variable (cv_wait) (and psref_target_destroy for psref) to wait. Unfortunately rtrequst{,1}(RTM_DELETE) can be called in softint that we cannot use cv_wait. In that case, we have to defer the destruction to a workqueue. rtentry#rt_cv, rtentry#rt_psref and global variables (see rt_free_global) are added to conduct the procedure. Updates of rtentries -------------------- One difficulty to use refcnt/psref instead of rwlock for rtentry is updates of rtentries. We need an additional mechanism to prevent readers from seeing inconsistency of a rtentry being updated. We introduce RTF_UPDATING flag to rtentries that are updating. While the flag is set to a rtentry, users cannot acquire the rtentry. By doing so, we avoid users to see inconsistent rtentries. There are two options when a user tries to acquire a rtentry with the RTF_UPDATING flag; if a user runs in softint context the user fails to acquire a rtentry (NULL is returned). Otherwise a user waits until the update completes by waiting on cv. The procedure of a updater is simpler to destruction of a rtentry. Wait on cv (and psref) and after all readers left, proceed with the update. Global variables (see rt_update_global) are added to conduct the procedure. Currently we apply the mechanism to only RTM_CHANGE in rtsock.c. We would have to apply other codes. See "Known issues" section. psref for rtentry ----------------- When we obtain a rtentry from a rtcache via rtcache_* APIs, psref is used to reference to the rtentry. rtcache_ref acquires a reference to a rtentry with psref and rtcache_unref releases the reference after using it. rtcache_ref is called inside rtcache_* APIs and users don't need to take care of it while users must call rtcache_unref to release the reference. struct psref and int bound that is needed for psref is embedded into struct route. By doing so we don't need to add local variables and additional argument to APIs. However this adds another constraint to psref other than reference counting one's; holding a reference of an rtentry via a rtcache is allowed by just one caller at the same time. So we must not acquire a rtentry via a rtcache twice and avoid a recursive use of a rtcache. And also a rtcache must be arranged to be used by a LWP/softint at the same time somehow. For IP forwarding case, we have per-CPU rtcaches used in softint so the constraint is guaranteed. For a h rtcache of a PCB case, the constraint is guaranteed by the solock of each PCB. Any other cases (pf, ipf, stf and ipsec) are currently guaranteed by only the existence of the global locks (softnet_lock and/or KERNEL_LOCK). If we've found the cases that we cannot guarantee the constraint, we would need to introduce other rtcache APIs that use simple reference counting. psref of rtcache is created with IPL_SOFTNET and so rtcache shouldn't used at an IPL higher than IPL_SOFTNET. Note that rtcache_free is used to invalidate a given rtcache. We don't need another care by my change; just keep them as they are. Performance impact ------------------ When NET_MPSAFE is disabled the performance drop is 3% while when it's enabled the drop is increased to 11%. The difference comes from that currently we don't take any global locks and don't use psref if NET_MPSAFE is disabled. We can optimize the performance of the case of NET_MPSAFE on by reducing lookups of rtcache that uses psref; currently we do two lookups but we should be able to trim one of two. This is a future work. Known issues ------------ There are two known issues to be solved; one is that a caller of rtrequest(RTM_ADD) may change rtentry (see rtinit). We need to prevent new references during the update. Or we may be able to remove the code (perhaps, need more investigations). The other is rtredirect that updates a rtentry. We need to apply our update mechanism, however it's not easy because rtredirect is called in softint and we cannot apply our mechanism simply. One solution is to defer rtredirect to a workqueue but it requires some code restructuring.	2016-12-12 03:55:57 +00:00
ozaki-r	f2613b7f35	Introduce macros for the prefix list No functional change.	2016-12-12 03:14:01 +00:00
ozaki-r	f5a82c952d	Introduce macros for the default router list No functional change.	2016-12-12 03:13:14 +00:00
ozaki-r	95eeb954a9	Add nd6_ prefix to exported functions	2016-12-11 07:38:50 +00:00
ozaki-r	091c448c26	Move default interface things from nd6_rtr.c to nd6.c	2016-12-11 07:37:53 +00:00
ozaki-r	100c447a96	Make some functions static	2016-12-11 07:36:55 +00:00
ozaki-r	c43c0bf31c	Remove function declarations that have no actual definition	2016-12-11 07:36:20 +00:00
ozaki-r	eecbd48e68	Correct sanity checks of icmp6_redirect_output - rt->rt_ifp is always non-NULL - Checking RTF_UP here is just racy and meaningless - The arguments should be non-NULL (at least for now)	2016-12-11 07:35:42 +00:00
ozaki-r	4c25fb2f83	Add rtcache_unref to release points of rtentry stemming from rtcache In the MP-safe world, a rtentry stemming from a rtcache can be freed at any points. So we need to protect rtentries somehow say by reference couting or passive references. Regardless of the method, we need to call some release function of a rtentry after using it. The change adds a new function rtcache_unref to release a rtentry. At this point, this function does nothing because for now we don't add a reference to a rtentry when we get one from a rtcache. We will add something useful in a further commit. This change is a part of changes for MP-safe routing table. It is separated to avoid one big change that makes difficult to debug by bisecting.	2016-12-08 05:16:33 +00:00
knakahara	2126b5df9a	remove unnecessary extern declaration. inetsw has been declared since r1.1, however sctp6_usrreq.c can be built without the declaration. It must be removed.	2016-12-06 08:58:16 +00:00
ozaki-r	3de81a8881	CID 1396598, CID 1396634: Fix null pointer dereferences	2016-12-02 00:19:54 +00:00
ozaki-r	0645ba174f	Fix panic on destroying an interface with IPv6 addresses obtained with RA nd6_purge depends on that IPv6 addresses are purged. If addresses remain, pfxlist_onlink_check called from nd6_purge dereferences a dangling pointer (ia->ia6_ndpr) that is freed before calling pfxlist_onlink_check. Fix it by removing addresses before calling nd6_purge, which is the original behavior that was changed by in6.c,v 1.203 and in6_ifattach.c,v 1.99. Note that it seems the issue occurs because of a hack that forcibly destroys prefix list entries of a given interface in nd6_purge. We should tackle the hack in the future. Fix PR kern/51467	2016-11-30 02:08:57 +00:00
knakahara	2526d8f639	fix: "ifconfig destory" can stalls when "ifconfig" is done parallel. This problem occurs only if NET_MPSAFE on. ifconfig destroy side: kernel entry point is ifioctl => if_clone_destroy. pr_purgeif() acquires softnet_lock, and then ifa_remove() calls pserialize_perform() holding softnet_lock. ifconfig side: kernel entry point is socreate. pr_attach()(udp_attach_wrapper()) calls sosetlock(). In this call path, sosetlock() try to acquire softnet_lock. These can cause dead lock.	2016-11-18 06:50:04 +00:00
mlelstv	700032562a	nd6_dad_duplicated takes the lock itself. Move it out of the critical section.	2016-11-15 21:17:07 +00:00
mlelstv	845a599209	Enforce alignment requirements that are violated in some cases. For machines that don't need strict alignment (i386,amd64,vax,m68k) this is a no-op. Fixes PR kern/50766 but should be improved.	2016-11-15 20:50:28 +00:00
ozaki-r	5879478f65	Don't use rt_walktree to delete routes Some functions use rt_walktree to scan the routing table and delete matched routes. However, we shouldn't use rt_walktree to delete routes because rt_walktree is recursive to the routing table (radix tree) and isn't friendly to MP-ification. rt_walktree allows a caller to pass a callback function to delete an matched entry. The callback function is called from an API of the radix tree (rn_walktree) but also calls an API of the radix tree to delete an entry. This change adds a new API of the radix tree, rn_search_matched, which returns a matched entry that is selected by a callback function passed by a caller and the caller itself deletes the entry. By using the API, we can avoid the recursive form.	2016-11-15 01:50:06 +00:00
ozaki-r	c2f4bfe351	Add missing rtfree	2016-11-14 02:34:19 +00:00
ozaki-r	d0432711b6	Tidy up in6_select* This change tidies up in6_select* functions, especially selectroute. selectroute is annoying because: - It returns both/either of a rtentry and/or an ifp - Yes, it may return only an ifp! - It is valid but selectroute shouldn't handle the case - Such conditional behavior makes it difficult to apply locking/psref thingy - It may return a rtentry even if error - It may use opt->ip6po_nextroute rtcache implicitly - The caller can know if it is used by rtcache_validate(&opt->ip6po_nextroute) but it's racy in MP-safe world - Even if it uses opt->ip6po_nextroute, it may return a rtentry that isn't derived from the rtcache The change includes: - Rename selectroute to in6_selectroute - Let a remaining caller of selectroute, in6_selectif, use in6_selectroute instead - Let in6_selectroute return only an rtentry - If error, it doesn't return an rtentry - A caller gets an ifp from a returned rtentry - Allow in6_selectroute to modify a passed rtcache and a caller can know if opt->ip6po_nextroute is used via the rtcache - Let callers (ip6_output and in6_selectif) handle the case that only an ifp is required Inspired by OpenBSD Proposed on tech-kern and tech-net LGTM by roy@	2016-11-10 04:13:53 +00:00
ozaki-r	fbb7e30d1e	Reduce the number of return points of frag6_input No functional change.	2016-11-09 03:49:38 +00:00
ozaki-r	29a46075c0	Pull routing header handling out of ip6_output No functional change.	2016-11-07 01:55:17 +00:00
ozaki-r	2ad0a3feca	Tidy up ip6_getpmtu Pull rtcache thing out of ip6_getpmtu; that isn't an essential of the function. Add comments inspired by FreeBSD. No functional change.	2016-11-07 01:05:39 +00:00
ozaki-r	ede76fae4b	Add missing pserialize_read_exit	2016-11-02 03:43:27 +00:00
ozaki-r	d5dc0960bb	Reduce the number of return points No functional change.	2016-11-01 10:32:57 +00:00
christos	e3992d8536	restore previous logic.	2016-10-31 14:34:32 +00:00
ozaki-r	c5224ffd07	Pull best address selection code out of in6_selectsrc No functional change.	2016-10-31 04:57:10 +00:00
ozaki-r	0f3a44863e	Fix race condition of in6_selectsrc in6_selectsrc returned a pointer to in6_addr that wan't guaranteed to be safe by pserialize (or psref), which was racy. Let callers pass a pointer to in6_addr and in6_selectsrc copy a result to it inside pserialize critical sections.	2016-10-31 04:16:25 +00:00
ozaki-r	6e6136eaff	Remove unnecessary NULL checks	2016-10-31 02:50:31 +00:00
ozaki-r	cf96c34d79	Remove unnecessary argument No functional change.	2016-10-25 02:45:09 +00:00
ozaki-r	3be3142886	Don't hold global locks if NET_MPSAFE is enabled If NET_MPSAFE is enabled, don't hold KERNEL_LOCK and softnet_lock in part of the network stack such as IP forwarding paths. The aim of the change is to make it easy to test the network stack without the locks and reduce our local diffs. By default (i.e., if NET_MPSAFE isn't enabled), the locks are held as they used to be. Reviewed by knakahara@	2016-10-18 07:30:30 +00:00
ozaki-r	6cabf64625	Fix indentation	2016-10-18 02:46:50 +00:00
ozaki-r	5faeb64f4b	Remove unnecessary pserialize_read_enter	2016-10-18 02:46:21 +00:00
ozaki-r	48ec99bd49	Add missing pserialize_read_exit	2016-10-18 02:45:41 +00:00
roy	0dbee937df	Now that we disallow sending or receiving from invalid addresses, allow binding to tentative addresses.	2016-09-29 12:19:47 +00:00
roy	8066689d53	Drop UDP packets as well as TCP without error when sending from detached or tentative addresses.	2016-09-20 14:30:13 +00:00
roy	8c6871896f	Ensure that packets are sent from a valid address. If the packet is TCP and the address is detached or tentative then it's just dropped, otherwise an error is returned. This is needed because you can bind to a valid address and it can then become invalid. This satisfies RFC 4862 section 5.5.4.	2016-09-15 18:25:45 +00:00
christos	0d94f00ba4	fix typo	2016-09-14 16:17:17 +00:00
christos	959c247a60	revert previous, roy says it breaks DaD.	2016-09-13 15:57:50 +00:00
christos	acab31252a	When initializing addresses, reset the interface flags to 0. This fixes an issue where point to point addresses that started down, and then came up, were left with stale flags on one side of the point to point link.	2016-09-13 15:41:33 +00:00
christos	647765d084	remove trailing spaces. userland does not catch this?	2016-09-13 00:45:15 +00:00
christos	47afd135ed	add bits for address flags	2016-09-13 00:19:28 +00:00
roy	3e6930820d	Disallow input to detached addresses because they are not yet valid.	2016-09-07 15:41:44 +00:00
roy	c85195ff64	This comment no longer applies.	2016-09-02 15:57:54 +00:00
ozaki-r	ab06ed1240	Don't GC an NDP cache that is added just before GC This fixes unstable test results of ndp_neighborgcthresh.	2016-09-02 07:15:14 +00:00
ozaki-r	543e39c0d3	Make ipforward_rt and ip6_forward_rt percpu Sharing one rtcache between CPUs is just a bad idea. Reviewed by knakahara@	2016-08-31 09:14:47 +00:00
dholland	2df5f31439	PR 51434 David Binderman: remove redundant test.	2016-08-26 21:48:31 +00:00
roy	c63a839724	Simplify.	2016-08-26 20:29:31 +00:00
roy	333b0c4c48	Allow explicit binding to detached addresss. Fixes PR kern/51435.	2016-08-26 19:45:55 +00:00
roy	1893d82b49	White space police.	2016-08-23 19:39:57 +00:00
roy	da7a376e71	Sync denied flags.	2016-08-23 19:39:04 +00:00
knakahara	74c24413b3	improve fast-forward performance when the number of flows exceeds ip6_maxflows. This is porting of ip_flow.c:r1.76 In ip6flow case, the before degradation is about 45%, the after degradation is bout 55%.	2016-08-23 09:59:20 +00:00
roy	dfadc24d64	Revert r1.148 IP6_EXTHDR_GET ensures that a icmp6 header can be fetched from the mbuf so m_pullup does not need to be called. While here, we can safely increament interface error stats even with an invalidated mbuf because we have a saved reference to the interface.	2016-08-19 12:26:01 +00:00
roy	e52094cac4	Revert part of the prior patch so loopback lladdr gets a working prefix route.	2016-08-18 09:34:43 +00:00
roy	fe4671807c	Separate ioctl address prefix management from RA prefix management as we have no API for controlling the latter. This fixes a long standing problem where addresses added with non /128 prefixes and non infinte address lifetimes would register a prefix route which would expire. Subsequent calls set new lifetimes for the same address would not affect the prefix route management, so once expired, the prefix route would be impossible to add back as the kernel would remove it.	2016-08-16 10:31:57 +00:00
christos	fa02ef2c34	In rump (ifp)->if_afdata[AF_INET6] == NULL if we did not register netinet6 yet. Treat this like we don't have a scope, and make the sid tests consistent.	2016-08-12 11:44:24 +00:00
roy	e9c7e74884	Set RTF_CONNECTED instead of setting only RTF_CONNECTED.	2016-08-06 20:00:14 +00:00
ozaki-r	e8f81e31c2	CID 1364757: remove unnecessary branching	2016-08-05 00:51:14 +00:00
knakahara	48235e8230	ip6flow refactor like ipflow. - move ip6flow sysctls into ip6_flow.c like ip_flow.c:r1.64 - build ip6_flow.c only if GATEWAY kernel option is enabled	2016-08-02 04:50:16 +00:00
ozaki-r	466f21f0b9	Fix kernel builds (gcc 4.8)	2016-08-01 04:37:53 +00:00
ozaki-r	a403cbd4f5	Apply pserialize and psref to struct ifaddr and its variants This change makes struct ifaddr and its variants (in_ifaddr and in6_ifaddr) MP-safe by using pserialize and psref. At this moment, pserialize_perform and psref_target_destroy are disabled because (1) we don't need them because of softnet_lock (2) they cause a deadlock because of softnet_lock. So we'll enable them when we remove softnet_lock in the future.	2016-08-01 03:15:30 +00:00
ozaki-r	efee6976a2	Avoid memset and rtcache_free if unnecessary It's the same as ip_output.	2016-07-29 06:02:03 +00:00
ozaki-r	c68a77bc1d	Fix panic on adding/deleting IP addresses under network load Adding and deleting IP addresses aren't serialized with other network opeartions, e.g., forwarding packets. So if we add or delete an IP address under network load, a kernel panic may happen on manipulating network-related shared objects such as rtentry and rtcache. To avoid such panicks, we still need to hold softnet_lock in in_control and in6_control that are called via ioctl and do network-related operations including IP address additions/deletions. Fix PR kern/51356	2016-07-28 09:03:50 +00:00
ozaki-r	e449cc85bc	Simplify by using atomic_swap instead of mutex Suggested by kefren@	2016-07-26 05:53:30 +00:00
ozaki-r	a3625f4d7b	Make DAD of ARP/NDP MP-safe with coarse-grained locks The change also prevents arp_dad_timer/nd6_dad_timer from running if arp_dad_stop/nd6_dad_stop is called, which makes sure that callout_reset won't be called during callout_halt.	2016-07-25 04:21:19 +00:00
ozaki-r	6b3e3b4814	Use KASSERT for checking non-NULL of ifa->ifa_ifp ifa->ifa_ifp should be always non-NULL, so doing the check only if DIAGNOSTIC is ok.	2016-07-25 01:52:21 +00:00
ozaki-r	1f39eeaeb9	Get rid of extra ifafree It was wrongly imported from FreeBSD.	2016-07-20 07:56:10 +00:00
ozaki-r	4f21a42704	Apply pserialize to some iterations of IP address lists	2016-07-20 07:37:51 +00:00
ozaki-r	8759207c83	Use sin6tosa and sin6tocsa macros No functional change.	2016-07-15 07:40:09 +00:00
ozaki-r	328b3c6b85	Use ifatoia6 macro No functional change.	2016-07-15 07:33:41 +00:00
ozaki-r	dca032f9f4	Run timers in workqueue Timers (such as nd6_timer) typically free/destroy some data in callout (softint). If we apply psz/psref for such data, we cannot do free/destroy process in there because synchronization of psz/psref cannot be used in softint. So run timer callbacks in workqueue works (normal LWP context). Doing workqueue_enqueue a work twice (i.e., call workqueue_enqueue before a previous task is scheduled) isn't allowed. For nd6_timer and rt_timer_timer, this doesn't happen because callout_reset is called only from workqueue's work. OTOH, ip{,6}flow_slowtimo's callout can be called before its work starts and completes because the callout is periodically called regardless of completion of the work. To avoid such a situation, add a flag for each protocol; the flag is set true when a work is enqueued and set false after the work finished. workqueue_enqueue is called only if the flag is false. Proposed on tech-net and tech-kern.	2016-07-11 07:37:00 +00:00
ozaki-r	94dba1b837	CID 1363345: remove unreachable code and cleanup returns	2016-07-08 06:18:29 +00:00
ozaki-r	4133a8eca8	Replace macros to get an IP address with proper inline functions The inline functions are more friendly for applying psz/psref; they consist of only simple interations.	2016-07-08 04:33:30 +00:00
ozaki-r	75a23513d7	Kill remaining use of the old lists of IP addresses	2016-07-08 03:40:34 +00:00
ozaki-r	9e4c2bda8a	Switch the address list of intefaces to pslist(9) As usual, we leave the old list to avoid breaking kvm(3) users.	2016-07-07 09:32:01 +00:00
ozaki-r	6106c473fc	Move in6_ifaddr_list to a more proper place (from ip6_input.c to in6.c) It's a similar place as the IPv4 address list, i.e., in.c. More varibles will join together.	2016-07-06 10:49:49 +00:00
ozaki-r	806a31cb3c	Add missing IN6_ADDRLIST_ENTRY_DESTROY	2016-07-06 07:52:53 +00:00
ozaki-r	d04ff44ad6	Apply m_get_rcvif_psref (kill m_get_rcvif_NOMPSAFE)	2016-07-06 00:30:55 +00:00
ozaki-r	ff67da833b	Constify an argument of regen_tmpaddr	2016-07-05 06:32:18 +00:00
ozaki-r	2a3c249748	KNF	2016-07-05 04:25:23 +00:00
ozaki-r	c30ba26977	Use ia6 or ia instead of ifa as a variable name of struct in6_ifaddr We conventionally use ifa for struct ifaddr and use ia6 or ia for struct in6_ifaddr. No functional change.	2016-07-05 03:40:52 +00:00
ozaki-r	d961591ee9	Fix userland compilations of those including in6_var.h	2016-07-04 07:32:18 +00:00
ozaki-r	6cf9fce745	Use pslist(9) for the global in6_ifaddr list psz and psref will be applied in another commit. No functional change intended.	2016-07-04 06:48:14 +00:00
knakahara	a6d7586724	fix: gif(4) receive side race A panic cause in rn_match() called by encap[46]_lookup(). The reason is that gif(4) does not suspend receive packet processing in spite of suspending transmit packet processing while anyone is doing gif(4) ioctl.	2016-07-04 04:22:47 +00:00
knakahara	d81cd78ed7	let gif(4) promise softint(9) contract (1/2) : gif(4) side To prevent calling softint_schedule() after called softint_disestablish(), the following modifications are added + ioctl (writing configuration) side - off IFF_RUNNING flag before changing configuration - wait softint handler completion before changing configuration + packet processing (reading configuraiotn) side - if IFF_RUNNING flag is on, do nothing + in whole - add gif_list_lock_{enter,exit} to prevent the same configuration is set to other gif(4) interfaces	2016-07-04 04:14:47 +00:00
ozaki-r	feeae45125	Remove redundant codes purging IPv6 addresses Proposed on tech-net and tech-kern.	2016-07-04 02:41:18 +00:00
ozaki-r	17b4eb5edd	Make sure to free all interface addresses in if_detach Addresses of an interface (struct ifaddr) have a (reverse) pointer of an interface object (ifa->ifa_ifp). If the addresses are surely freed when their interface is destroyed, the pointer is always valid and we don't need a tweak of replacing the pointer to if_index like mbuf. In order to make sure the assumption, the following changes are required: - Deactivate the interface at the firstish of if_detach. This prevents in6_unlink_ifa from saving multicast addresses (wrongly) - Invalidate rtcache(s) and clear a rtentry referencing an address on RTM_DELETE. rtcache(s) may delay freeing an address - Replace callout_stop with callout_halt of DAD timers to ensure stopping such timers in if_detach	2016-07-01 05:22:33 +00:00
ozaki-r	d4c71b34a8	Make sure that ifaddr is published after its initialization finished Basically we should insert an item to a collection (say a list) after item's initialization has been completed to avoid accessing an item that is initialized halfway. ifaddr (in{,6}_ifaddr) isn't processed like so and needs to be fixed. In order to do so, we need to tweak {arp,nd6}_rtrequest that depend on that an ifaddr is inserted during its initialization; they explore interface's address list to determine that rt_getkey(rt) of a given rtentry is in the list to know whether the route's interface should be a loopback, which doesn't work after the change. To make it work, first check RTF_LOCAL flag that is set in rt_ifa_addlocal that calls {arp,nd6}_rtrequest eventually. Note that we still need the original code for the case to remove and re-add a local interface route.	2016-06-30 01:34:53 +00:00
ozaki-r	a577cf2aa0	Introduce if_is_deactivated Checking ifp->if_output == if_nulloutput is too implicit. No functional change.	2016-06-28 02:36:54 +00:00
ozaki-r	ca4ea29d93	Add missing NULL checks for m_get_rcvif_psref	2016-06-28 02:02:56 +00:00
christos	9471dccf97	CID 1362905: Initialize ifp early, so that we don't if_put garbage in the IPSEC case.	2016-06-27 18:35:54 +00:00
ozaki-r	4b54d200aa	Remove unnecessary NULL checks of ifa->ifa_addr If it's NULL, it should be a bug. There many IFADDR_FOREACH that don't do NULL check. If it can be NULL, they should fire already.	2016-06-22 07:48:17 +00:00
ozaki-r	4badfc204a	Make sure returning ifp from in6_select* functions psref-ed To this end, callers need to pass struct psref to the functions and the fuctions acquire a reference of ifp with it. In some cases, we can simply use if_get_byindex, however, in other cases (say rt->rt_ifp and ia->ifa_ifp), we have no MP-safe way for now. In order to take a reference anyway we use non MP-safe function if_acquire_NOMPSAFE for the latter cases. They should be fixed in the future somehow.	2016-06-21 10:25:27 +00:00
ozaki-r	f7107c248e	Protect if_byindex with pserialize	2016-06-21 10:21:04 +00:00
ozaki-r	43c5ab376f	Replace ifp of ip_moptions and ip6_moptions with if_index The motivation is the same as the mbuf's rcvif case; avoid having a pointer of an ifnet object in ip_moptions and ip6_moptions, which is not MP-safe. ip_moptions and ip6_moptions can be stored in a PCB for inet or inet6 that's life time is different from ifnet one and so an ifnet object can be disappeared anytime we get it via them. Thus we need to look up an ifnet object by if_index every time for safe.	2016-06-21 03:28:27 +00:00
ozaki-r	51db7a24e2	Fix nd6_output (if_output_lock conversion mistake)	2016-06-21 02:14:11 +00:00
knakahara	95fc145695	apply if_output_lock() to L3 callers which call ifp->if_output() of L2(or L3 tunneling).	2016-06-20 06:46:37 +00:00
ozaki-r	f0423d34e6	Use if_get_byindex instead of if_byindex for MP-safe	2016-06-16 03:03:33 +00:00
ozaki-r	e1135cd9b9	Use curlwp_bind and curlwp_bindx instead of open-coding LP_BOUND	2016-06-16 02:38:40 +00:00
ozaki-r	c7e18ccbde	Protect if_byindex by pserialize	2016-06-15 06:01:21 +00:00
knakahara	a6f4292e65	eliminate unnecessary splnet	2016-06-13 08:37:15 +00:00
knakahara	e4ff09f05d	MP-ify fastforward to support GATEWAY kernel option. I add "ipflow_lock" mutex in ip_flow.c and "ip6flow_lock" mutex in ip6_flow.c to protect all data in each file. Of course, this is not MP-scalable. However, it is sufficient as tentative workaround. We should make it scalable somehow in the future. ok by ozaki-r@n.o.	2016-06-13 08:34:23 +00:00
ozaki-r	fe6d427551	Avoid storing a pointer of an interface in a mbuf Having a pointer of an interface in a mbuf isn't safe if we remove big kernel locks; an interface object (ifnet) can be destroyed anytime in any packet processing and accessing such object via a pointer is racy. Instead we have to get an object from the interface collection (ifindex2ifnet) via an interface index (if_index) that is stored to a mbuf instead of an pointer. The change provides two APIs: m_{get,put}_rcvif_psref that use psref(9) for sleep-able critical sections and m_{get,put}_rcvif that use pserialize(9) for other critical sections. The change also adds another API called m_get_rcvif_NOMPSAFE, that is NOT MP-safe and for transition moratorium, i.e., it is intended to be used for places where are not planned to be MP-ified soon. The change adds some overhead due to psref to performance sensitive paths, however the overhead is not serious, 2% down at worst. Proposed on tech-kern and tech-net.	2016-06-10 13:31:43 +00:00
ozaki-r	d938d837b3	Introduce m_set_rcvif and m_reset_rcvif The API is used to set (or reset) a received interface of a mbuf. They are counterpart of m_get_rcvif, which will come in another commit, hide internal of rcvif operation, and reduce the diff of the upcoming change. No functional change.	2016-06-10 13:27:10 +00:00
ozaki-r	9f10dc7910	Get rcvif once and reuse it No functional change.	2016-05-19 08:53:25 +00:00
ozaki-r	348f728f8e	Replace DIAGNOSTIC & panic with KASSERT	2016-05-19 03:11:42 +00:00
ozaki-r	894d037bc1	Get rid of unnecessary assignment	2016-05-18 11:28:44 +00:00
ozaki-r	9f595a90fa	Get rid of unnecessary NULL check It's already checked just some lines above.	2016-05-18 09:32:05 +00:00
ozaki-r	27df9b11fc	Don't try to get outif unnecessarily from in6_selectsrc The got outif is unused.	2016-05-18 08:40:51 +00:00
ozaki-r	842c4ed6c1	Get rcvif once and reuse it No functional change.	2016-05-17 03:27:02 +00:00
ozaki-r	31da384114	Make sure icmp6_redirect_input frees mbuf before return	2016-05-17 03:24:46 +00:00
ozaki-r	040205ae93	Protect ifnet list with psz and psref The change ensures that ifnet objects in the ifnet list aren't freed during list iterations by using pserialize(9) and psref(9). Note that the change adds a pslist(9) for ifnet but doesn't remove the original ifnet list (ifnet_list) to avoid breaking kvm(3) users. We shouldn't use the original list in the kernel anymore.	2016-05-12 02:24:16 +00:00
is	142ff9d692	Let non-neighbor NS/NA debug error message include useful information.	2016-04-29 11:46:17 +00:00
ozaki-r	ad0fbab4d2	Get rid of unused argument from get_rand_ifid	2016-04-27 07:51:14 +00:00
ozaki-r	9e0f6c5e36	Stop using rt_gwroute on packet sending paths rt_gwroute of rtentry is a reference to a rtentry of the gateway for a rtentry with RTF_GATEWAY. That was used by L2 (arp and ndp) to look up L2 addresses. By separating L2 nexthop caches, we don't need a route for the purpose and we can stop using rt_gwroute. By doing so, we can reduce referencing and modifying rtentries, which makes it easy to apply a lock (and/or psref) to the routing table and rtentries. One issue to do this is to keep RTF_REJECT behavior. It seems it was broken when we moved rtalloc1 things from L2 output routines (e.g., ether_output) to ip_hresolv_output, but (fortunately?) it works unexpectedly. What we mistook are: - RTF_REJECT was checked for any routes in L2 output routines, but in ip_hresolv_output it is checked only when the route is RTF_GATEWAY - The RTF_REJECT check wasn't copied to IPv6 (nd6_output) It seems that rt_gwroute checks hid the mistakes and it looked work (unexpectedly) and removing rt_gwroute checks unveil the issue. So we need to fix RTF_REJECT checks in ip_hresolv_output and also add them to nd6_output. One more point we have to care is returning an errno; we need to mimic looutput behavior. Originally RTF_REJECT check was done either in L2 output routines or in looutput. The latter is applied when a reject route directs to a loopback interface. However, now RTF_REJECT check is done before looutput so to keep the original behavior we need to return an errno which looutput chooses. Added rt_check_reject_route does such tweaks.	2016-04-26 09:30:01 +00:00
ozaki-r	a79dfa5db0	Sweep unnecessary route.h inclusions	2016-04-26 08:44:44 +00:00
rjs	505ea9765f	Fix build when IPSEC enabled.	2016-04-25 21:21:02 +00:00
ozaki-r	0c74cec625	Check error of rt_setgate and rt_settag	2016-04-25 14:38:08 +00:00
ozaki-r	c325d0ca4f	Fix RTF_{REJECT,BLACKHOLE} behavior for IPv6 routes We still need a nexthop route to reflect RTF_{REJECT,BLACKHOLE}. In the future, we would do it w/o looking up a route.	2016-04-21 05:07:50 +00:00
ozaki-r	322b6a238d	Sweep unncessary radix.h inclusions	2016-04-11 08:56:16 +00:00
ozaki-r	dd3c4fc3e5	Don't call pfxlist_onlink_check with holding llentry lock From FreeBSD (as of 2016-04-11). Should fix PR kern/51060.	2016-04-11 01:16:20 +00:00
ozaki-r	f0071d85a1	Don't call pfxlist_onlink_check with holding llentry lock Sync nd6_free with FreeBSD (as of 2016-04-10). Should fix PR kern/51056.	2016-04-10 08:15:52 +00:00
roy	60a5a4a8a7	all1_sa is no longer used.	2016-04-04 12:05:40 +00:00
ozaki-r	09973b35ac	Separate nexthop caches from the routing table By this change, nexthop caches (IP-MAC address pair) are not stored in the routing table anymore. Instead nexthop caches are stored in each network interface; we already have lltable/llentry data structure for this purpose. This change also obsoletes the concept of cloning/cloned routes. Cloned routes no longer exist while cloning routes still exist with renamed to connected routes. Noticeable changes are: - Nexthop caches aren't listed in route show/netstat -r - sysctl(NET_RT_DUMP) doesn't return them - If RTF_LLDATA is specified, it returns nexthop caches - Several definitions of routing flags and messages are removed - RTF_CLONING, RTF_XRESOLVE, RTF_LLINFO, RTF_CLONED and RTM_RESOLVE - RTF_CONNECTED is added - It has the same value of RTF_CLONING for backward compatibility - route's -xresolve, -[no]cloned and -llinfo options are removed - -[no]cloning remains because it seems there are users - -[no]connected is introduced and recommended to be used instead of -[no]cloning - route show/netstat -r drops some flags - 'L' and 'c' are not seen anymore - 'C' now indicates a connected route - Gateway value of a route of an interface address is now not a L2 address but "link#N" like a connected (cloning) route - Proxy ARP: "arp -s ... pub" doesn't create a route You can know details of behavior changes by seeing diffs under tests/. Proposed on tech-net and tech-kern: http://mail-index.netbsd.org/tech-net/2016/03/11/msg005701.html	2016-04-04 07:37:07 +00:00
ozaki-r	35b18fbb1d	Remove unnecessary casts and do s/0/NULL/ for rtrequest	2016-04-01 09:16:02 +00:00
ozaki-r	103bd8df24	Refine nd6log Add __func__ to nd6log itself instead of adding it to callers.	2016-04-01 08:12:00 +00:00
ozaki-r	19fb0179dc	Use __func__ in log messages	2016-04-01 06:25:51 +00:00
ozaki-r	acdecad069	Tidy up nd6_timer initialization	2016-04-01 05:11:38 +00:00
knakahara	9b7918b3ee	remove unnecessary declarations and fix KNF Thanks to riastradh@	2016-02-29 01:29:15 +00:00
knakahara	e80f101289	To eliminate gif_softc_list linear search, add extra argument to encapsw.pr_ctlinput().	2016-02-26 07:35:17 +00:00
rtr	e2a3307b85	Reduce code duplication. Split creation of IPv4-Mapped IPv6 addresses into its own function and use it. No functional change intended. As posted to tech-net@	2016-02-15 14:59:03 +00:00
ozaki-r	9c4cd06355	Introduce softint-based if_input This change intends to run the whole network stack in softint context (or normal LWP), not hardware interrupt context. Note that the work is still incomplete by this change; to that end, we also have to softint-ify if_link_state_change (and bpf) which can still run in hardware interrupt. This change softint-ifies at ifp->if_input that is called from each device driver (and ieee80211_input) to ensure Layer 2 runs in softint (e.g., ether_input and bridge_input). To this end, we provide a framework (called percpuq) that utlizes softint(9) and percpu ifqueues. With this patch, rxintr of most drivers just queues received packets and schedules a softint, and the softint dequeues packets and does rest packet processing. To minimize changes to each driver, percpuq is allocated in struct ifnet for now and that is initialized by default (in if_attach). We probably have to move percpuq to softc of each driver, but it's future work. At this point, only wm(4) has percpuq in its softc as a reference implementation. Additional information including performance numbers can be found in the thread at tech-kern@ and tech-net@: http://mail-index.netbsd.org/tech-kern/2016/01/14/msg019997.html Acknowledgment: riastradh@ greatly helped this work. Thank you very much!	2016-02-09 08:32:07 +00:00
riastradh	3bc04b00b8	Declare in6_tmpaddrtimer_ch in in6_var.h. Do not declare extern variables in .c files!	2016-02-04 02:48:37 +00:00
knakahara	b546d5277b	implement encapsw instead of protosw and uniform prototype. suggested and advised by riastradh@n.o, thanks. BTW, It seems in_stf_input() had bugs...	2016-01-26 05:58:05 +00:00
riastradh	fa50b451d4	Those were local changes not meant to be part of the revert. SORRY!	2016-01-23 14:48:55 +00:00
christos	e28df70b14	make this compile again	2016-01-23 14:03:04 +00:00
riastradh	e588d95c25	Back out previous change to introduce struct encapsw. This change was intended, but Nakahara-san had already made a better one locally! So I'll let him commit that one, and I'll try not to step on anyone's toes again.	2016-01-22 23:27:12 +00:00
riastradh	87bc652e3d	Don't abuse struct protosw for ip_encap -- introduce struct encapsw. Mostly mechanical change to replace it, culling some now-needless boilerplate around all the users. This does not substantively change the ip_encap API or eliminate abuse of sketchy pointer casts -- that will come later, and will be easier now that it is not tangled up with struct protosw.	2016-01-22 05:15:10 +00:00
riastradh	7c7b1739c8	Revert previous: ran cvs commit when I meant cvs diff. Sorry! Hit up-arrow one too few times.	2016-01-21 15:41:29 +00:00
riastradh	b41d562bd0	Give proper prototype to ip_output.	2016-01-21 15:27:48 +00:00
riastradh	65a8f527af	Eliminate struct protosw::pr_output. You can't use this unless you know what it is a priori: the formal prototype is variadic, and the different instances (e.g., ip_output, route_output) have different real prototypes. Convert the only user of it, raw_send in net/raw_cb.c, to take an explicit callback argument. Convert the only instances of it, route_output and key_output, to such explicit callbacks for raw_send. Use assertions to make sure the conversion to explicit callbacks is warranted. Discussed on tech-net with no objections: https://mail-index.netbsd.org/tech-net/2016/01/16/msg005484.html	2016-01-20 21:43:59 +00:00
knakahara	d7b9bb29c0	Refactor protosw codes in gif(4). No functional change. - remove unnecessary include - reduce scopes	2016-01-18 06:08:26 +00:00
ozaki-r	5c49460e3c	Add missing RTF_LOCAL; sync with arp_setgate	2016-01-08 08:50:07 +00:00
knakahara	1c5d304e9c	eliminate ip_input.c and ip6_input.c dependency on gif(4)	2016-01-08 03:55:39 +00:00
knakahara	6d50f36d54	use satosin{,6} macros instead of casts.	2015-12-25 06:47:56 +00:00
ozaki-r	9c1d124220	Add missing LLE_WUNLOCK to nd6_free	2015-12-18 09:04:33 +00:00
christos	5b5956f338	Hook up the addrctl stuff that's already there.	2015-12-12 23:34:25 +00:00
knakahara	a00e94f4ff	PR kern/50522: gif(4) ioctl causes panic while someone is using the gif(4) interface. It is required to wait other CPU's softint completion before disestablishing the softint handler.	2015-12-11 07:59:14 +00:00
ozaki-r	c6e461ee0d	CID 1341546: Fix integer handling issue (CONSTANT_EXPRESSION_RESULT) n > INT_MAX where n is a long integer variable never be true on 32bit architectures. Use time_t(int64_t) instead of long for the variable.	2015-12-07 06:19:13 +00:00
ozaki-r	2c1e216cf8	Replace __debugused with __diagused Declaring __debugused was just a mistake. This fixes builds of kernels with DEBUG but without DIAGNOSTIC.	2015-11-27 02:54:22 +00:00
ozaki-r	ff97010dea	Declare __debugused for no DIAGNOSTIC kernels This unbreaks hpcsh GENERIC kernel build.	2015-11-25 07:06:19 +00:00
ozaki-r	ecd5b23eef	Use lltable/llentry for NDP lltable and llentry were introduced to replace ARP cache data structure for further restructuring of the routing table: L2 nexthop cache separation. This change replaces the NDP cache data structure (llinfo_nd6) with them as well as ARP. One noticeable change is for neighbor cache GC mechanism that was introduced to prevent IPv6 DoS attacks. net.inet6.ip6.neighborgcthresh was the max number of caches that we store in the system. After introducing lltable/llentry, the value is changed to be per-interface basis because lltable/llentry stores neighbor caches in each interface separately. And the change brings one degradation; the old GC mechanism dropped exceeded packets based on LRU while the new implementation drops packets in order from the beginning of lltable (a hash table + linked lists). It would be improved in the future. Added functions in in6.c come from FreeBSD (as of r286629) and are tweaked for NetBSD. Proposed on tech-kern and tech-net.	2015-11-25 06:21:26 +00:00
ozaki-r	0edb16352e	Call icmp6_error2 after releasing ln This is a restructuring for coming changes. From FreeBSD	2015-11-19 03:02:10 +00:00
ozaki-r	5d81659a46	Stop passing llinfo_nd6 to nd6_ns_output This is a restructuring for coming changes to nd6 (replacing llinfo_nd6 with llentry). Once we have a lock of llinfo_nd6, we need to pass it to nd6_ns_output with holding the lock. However, in a function subsequent to nd6_ns_output, the llinfo_nd6 may be looked up, i.e., its lock would be acquired again. To avoid such a situation, pass only required data (in6_addr) to nd6_ns_output instead of passing whole llinfo_nd6. Inspired by FreeBSD	2015-11-18 05:16:22 +00:00
ozaki-r	7cdf5bbe65	Unify nd6_ns_output calls in nd6_llinfo_timer Inspired by FreeBSD	2015-11-18 02:51:11 +00:00
joerg	a3e166507d	Ensure that the callout of the multicast address is valid before hooking it up.	2015-11-12 15:01:06 +00:00
rjs	8c2654abca	Add core networking support for SCTP.	2015-10-13 21:28:34 +00:00
ozaki-r	91afbd53fe	Use satosin6 instead of its own macro	2015-10-05 04:15:42 +00:00
ozaki-r	4f92eb6d47	Update icmp6_redirect_timeout_q when changing net.inet6.icmp6.redirtimeout We have to update icmp6_redirect_timeout_q as well as icmp6_redirtimeout when changing net.inet6.icmp6.redirtimeout via sysctl. The updating logic is copied from sysctl_net_inet_icmp_redirtimeout. This change is from s-yamaguchi@IIJ (with KNF by ozaki-r) and fixes PR kern/50240.	2015-09-14 05:34:28 +00:00
roy	f3b0c038a1	If, for whatever reason, a local interface route is removed and then re-added, mark it as a local route. While here, if changing the route to go via the loopback interface remove any inherited MTU value.	2015-09-11 10:33:32 +00:00
dholland	1fbab01a93	More on PR 41200: headers that declare ioctls should include sys/ioccom.h. This covers (I think) all the MI headers outside of external/ (and dist/).	2015-09-06 06:00:59 +00:00
ozaki-r	30a9349144	Pull nexthop determination routine from nd6_output It simplifies nd6_output and the nexthop determination routine slightly.	2015-09-04 05:33:23 +00:00
ozaki-r	6af5fcf207	Fix rtfree in nd6_output We have to check and avoid to rtfree the original rtentry passed to nd6_output even when manipulating gateway routes. This fixes panic on assertion "ro->_ro_rt ==NULL \|\| ro->_ro_rt->rt_refcnt > 0" failure and probably PR kern/50161.	2015-09-03 00:54:39 +00:00
ozaki-r	54c4f3b688	Do rt_refcnt++ when set a rtentry to another rtentry's rt_gwroute And also do rtfree when deref a rtentry from rt_gwroute.	2015-09-02 11:35:11 +00:00
ozaki-r	1231d10774	Use KASSERT to check programming errors	2015-09-02 08:03:10 +00:00
ozaki-r	04bf400967	Move a rtentry definition to reduce its scope No functional change.	2015-09-01 08:52:02 +00:00
ozaki-r	31cbc4a715	Cleanup nd6_nud_hint The deleted rtfree was never called.	2015-09-01 08:46:27 +00:00
ozaki-r	3aedc74443	Make rt_refcnt take into account rt_timer	2015-08-31 06:25:15 +00:00
ozaki-r	31874cd257	Remove leading whitespaces	2015-08-31 03:26:53 +00:00
pooka	1c4a50f192	sprinkle _KERNEL_OPT	2015-08-24 22:21:26 +00:00
ozaki-r	b635aa0309	Change 0 to NULL for rtrequest's last argument (struct rtentry **ret_nrt)	2015-08-24 09:45:29 +00:00
ozaki-r	aade6ffbb3	Fix double rtfree	2015-08-11 09:30:32 +00:00
ozaki-r	aa2414a0f0	Free rtentry when we successfully obtain it but return NULL	2015-08-11 08:27:08 +00:00
ozaki-r	55140c1926	Use time_uptime instead of time_second to avoid time leaps Some codes in sys/net* use time_second to manage time periods such as cache expirations. However, time_second doesn't increase monotonically and can leap by say settimeofday(2) according to time_second(9). We should use time_uptime instead of it to avoid such time leaps. This change replaces time_second with time_uptime. Additionally it converts a time based on time_uptime to a time based on time_second when the kernel passes the time to userland programs that expect the latter, and vice versa. Note that we shouldn't leak time_uptime to other hosts over the netowrk. My investigation shows there is no such leak: http://mail-index.netbsd.org/tech-net/2015/08/06/msg005332.html Discussed on tech-kern and tech-net.	2015-08-07 08:11:33 +00:00
ozaki-r	0e93629237	Fix rtfree-ing wrong rtentry	2015-07-24 07:36:29 +00:00
ozaki-r	9eae87d0c8	Reform use of rt_refcnt rt_refcnt of rtentry was used in bad manners, for example, direct rt_refcnt++ and rt_refcnt-- outside route.c, "rt->rt_refcnt++; rtfree(rt);" idiom, and touching rt after rt->rt_refcnt--. These abuses seem to be needed because rt_refcnt manages only references between rtentry and doesn't take care of references during packet processing (IOW references from local variables). In order to reduce the above abuses, the latter cases should be counted by rt_refcnt as well as the former cases. This change improves consistency of use of rt_refcnt: - rtentry is always accessed with rt_refcnt incremented - rtentry's rt_refcnt is decremented after use (rtfree is always used instead of rt_refcnt--) - functions returning rtentry increment its rt_refcnt (and caller rtfree it) Note that rt_refcnt prevents rtentry from being freed but doesn't prevent rtentry from being updated. Toward MP-safe, we need to provide another protection for rtentry, e.g., locks. (Or introduce a better data structure allowing concurrent readers during updates.)	2015-07-17 02:21:08 +00:00
ozaki-r	fcda92b6be	Remove unused arguments and the associated code from nd6_nud_hint() from OpenBSD	2015-07-15 09:20:18 +00:00
ozaki-r	452d01ddfd	Use KASSERT for argument NULL checks	2015-06-30 08:31:42 +00:00
ozaki-r	eeab7eecc6	Fix nd6_numroutes counting nd6_numroutes is intended to be incremented when a route is added via RA and decremented when a RA route is deleted. However, a decrement of a RA route was skipped when there remained references to the RA route.	2015-06-30 06:42:06 +00:00
rtr	e7083d7a4b	remove transitional functions in{,6}_pcbconnect_m() that were used in converting protocol user requests to accept sockaddr instead of mbufs. remove tcp_input copy in to mbuf from sockaddr and just copy to sockaddr to make it possible for the transitional functions to go away. no version bump since these functions only existed for a short time and were commented as adapters (they appeared in 7.99.15).	2015-05-24 15:43:45 +00:00
ozaki-r	b3ff39b0ab	Use NULL instead of 0 for pointers	2015-05-19 01:14:40 +00:00
rtr	fd12cf39ee	make connect syscall use sockaddr_big and modify pr_{send,connect} nam parameter type from buf * to sockaddr . final commit for parameter type changes to protocol user requests bump kernel version to 7.99.15 for parameter type changes to pr_{send,connect}	2015-05-02 17:18:03 +00:00
roy	9cdef53c9c	Mitigate Local Denial of Service with IPv6 Router Advertisements and log attack attempts. Fixes CVE-2015-2923, taken from FreeBSD.	2015-05-02 14:28:30 +00:00
ozaki-r	36d424c9ec	Don't take KERNEL_LOCK for if_output when NET_MPSAFE	2015-04-30 10:00:04 +00:00
ozaki-r	5f21075b8f	Add missing error checks on rtcache_setdst It can fail with ENOMEM.	2015-04-27 10:14:44 +00:00
ozaki-r	2373b55abc	Introduce in6_selecthlim_rt to consolidate an idiom for rt->rt_ifp It consolidates a scattered routine: (rt = rtcache_validate(&in6p->in6p_route)) != NULL ? rt->rt_ifp : NULL	2015-04-27 02:59:44 +00:00
rtr	d2aa9dd71f	remove pr_generic from struct pr_usrreqs and all implementations of pr_generic in protocols. bump to 7.99.13 approved by rmind@	2015-04-26 21:40:48 +00:00
rtr	89539c0d5f	return EINVAL if sin{,6}_len != sizeof(sockaddr_in{,6}) respectively in in{,6}_pcbconnect(). checking just m->m_len isn't enough because there are various places that assume sa_len has been properly populated.	2015-04-26 16:45:50 +00:00
rtr	403dacccdb	fix missed parameter type change in dccp6_accept() to sockaddr * from mbuf *	2015-04-25 14:56:05 +00:00
rtr	eddf3af3c6	make accept, getsockname and getpeername syscalls use sockaddr_big and modify pr_{accept,sockname,peername} nam parameter type from mbuf * to sockaddr . retained use of mbuftypes[MT_SONAME] for now. * bump to netbsd version 7.99.12 for parameter type change. patch posted to tech-net@ 2015/04/19	2015-04-24 22:32:37 +00:00
ozaki-r	2c22236376	Avoid NULL checks for a variable that is definitely NULL	2015-04-24 08:53:06 +00:00
ozaki-r	7600c4ec42	Add missing rtcache_free It's the same as other similar code paths in in_gif and ip6_etherip.	2015-04-24 07:51:43 +00:00
roy	b1f5fd8a7f	Move INET6 specific in6_if_{up,down}() and in6_if_link_{up,down}() into agnostic domain functions.	2015-04-22 19:46:08 +00:00
roy	33e035dca4	Introduce p2p_rtrequest() so that IFF_POINTOPOINT interfaces can work with RTF_LOCAL. Fixes PR kern/49829.	2015-04-20 10:19:54 +00:00
roy	2aa9f440e3	Move in6if_do_dad() to if_do_dad() as the routine is not INET6 specific and could equally be used by INET.	2015-04-07 23:30:36 +00:00
rtr	80ea8ccc7c	* update dccp_bind for struct mbuf * to struct sockaddr * parameter change * pass NULL instead of casting 0 to a pointer when calling in_pcbbind()	2015-04-04 04:33:38 +00:00
rtr	a2ba5e69ab	* change pr_bind to accept struct sockaddr * instead of struct mbuf * * update protocol bind implementations to use/expect sockaddr * instead of mbuf * * introduce sockaddr_big struct for storage of addr data passed via sys_bind; sockaddr_big is of sufficient size and alignment to accommodate all addr data sizes received. * modify sys_bind to allocate sockaddr_big instead of using an mbuf. * bump kernel version to 7.99.9 for change to pr_bind() parameter type. Patch posted to tech-net@ http://mail-index.netbsd.org/tech-net/2015/03/15/msg005004.html The choice to use a new structure sockaddr_big has been retained since changing sockaddr_storage size would lead to unnecessary ABI change. The use of the new structure does not preclude future work that increases the size of sockaddr_storage and at that time sockaddr_big may be trivially replaced. Tested by mrg@ and myself, discussed with rmind@, posted to tech-net@	2015-04-03 20:01:07 +00:00
ozaki-r	eefc30d59b	Pull out ipsec routines from ip6_input This change reduces symbol references from netinet6 to netipsec and improves modularity of netipsec. No functional change is intended.	2015-04-01 02:49:44 +00:00
ozaki-r	f35c2148c2	Tidy up opt_ipsec.h inclusions	2015-03-30 04:25:26 +00:00
ozaki-r	32be705817	Include ip6.h for ip6_hdr	2015-03-30 02:23:21 +00:00
roy	a37502b2b6	Add RTF_BROADCAST to mark routes used for the broadcast address when they are created on the fly. This makes it clear what the route is for and allows an optimisation in ip_output() by avoiding a call to in_broadcast() because most of the time we do talk to a host. It also avoids a needless allocation for the storage of llinfo_arp and thus vanishes from arp(8) - it showed as incomplete anyway so this is a nice side effect. Guard against this and routes marked with RTF_BLACKHOLE in ip_fastforward(). While here, guard against routes marked with RTF_BLACKHOLE in ip6_fastforward(). RTF_BROADCAST is IPv4 only, so don't bother checking that here.	2015-03-23 18:33:17 +00:00
roy	5170946304	Don't add local routes for the any address or p2p addresses where the address matches the destination.	2015-02-26 12:58:36 +00:00
roy	42900924fd	Introduce the routing flag RTF_LOCAL to track local address routes. Add functions rt_ifa_addlocal() and rt_ifa_remlocal() to add and remove local routes for the address and announce the new address and route to the routing socket. Add in_ifaddlocal() and in_ifremlocal() to use these functions. Rename in6_if{add,rem}loop() to in6_if{add,rem}local() and use these functions. rtinit() no longer announces the address, just the network route for the address. As such, calls to rt_newaddrmsg() have been removed from in_addprefix() and in_scrubprefix(). This solves the problem of potentially more than one announcement, or no announcement at all for the address in certain situations.	2015-02-26 09:54:46 +00:00
roy	1d0df6e404	Rename nd6_rtmsg() to rt_newmsg() and move into the generic routing code as it's not IPv6 specific and will be used elsewhere.	2015-02-25 12:45:34 +00:00
roy	1777c2ee4b	Retire nd6_newaddrmsg and use rt_newaddrmsg directly instead so that we don't spam route changes when the route hasn't changed.	2015-02-25 00:26:58 +00:00
martin	94a27aa4e3	Rearange interface detachement slightly: before we free the INET6 specific per-interface data, make sure to call nd6_purge() with it to remove routing entries pointing to the going interface. When we should happen to call this function again later, with the data already gone, just return. Fixes PR kern/49682, ok: christos.	2015-02-23 19:15:59 +00:00
rjs	3e6de5e8d2	Declare input argument to in6_sin_2_v4mapsin6 to be const, allows an address from the route cache to be used as the input. ok christos@.	2015-02-20 22:13:48 +00:00
christos	c4bbd62988	"something odd happens" is not a useful error message.	2015-02-17 15:14:28 +00:00
rjs	652788239c	Add DCCP protocol support from KAME.	2015-02-10 19:11:52 +00:00
christos	0090b13dae	CID/1267860: Missing break in switch	2015-02-02 03:14:02 +00:00
roy	a3c36dcba4	Fix IPV6_USE_MIN_MTU set by setsockopt(2) being ignored when IPV6_PKTINFO is set as a control with sendmsg(2).	2015-01-20 21:42:36 +00:00
roy	9daa8a6db0	Add net.inet6.ip6.prefer_tempaddr sysctl knob so that we can prefer IPv6 temporary addresses as the source address. Fixes PR kern/47100 based on a patch by Dieter Roelants.	2015-01-20 21:27:36 +00:00
roy	24c1397228	Report route additions/changes/deletions for cached neighbours to userland.	2014-12-16 11:42:27 +00:00
christos	d1456ccc1f	printable version of the scope. remove stray breaks.	2014-12-10 01:10:37 +00:00
christos	e0b4678125	call vsnprintf instead of snprintf; provide more detail	2014-12-10 01:10:14 +00:00
christos	cb7e0235f1	Merge some common code in the failed forwarding case, while providing better diagnostics, and fixing leaks.	2014-12-08 00:19:37 +00:00
seanb	1f56ae1036	- Fix comment which was no longer accurate after previous change to move from in_pcbconnect -> in6_pcbsetport.	2014-12-05 18:45:37 +00:00
christos	99c363a8a2	more debugging info...	2014-12-03 01:32:11 +00:00
christos	f89df58b37	use the new printing code.	2014-12-02 20:25:47 +00:00
christos	a5009781c6	add routines to print in6_addr and sockaddr_in6 (in6_print, sin6_print)	2014-12-02 19:36:58 +00:00
christos	52b8bb1b69	CID 977389: Out of bounds access.	2014-11-25 19:51:17 +00:00
seanb	ae36e3e5b1	Really make SO_REUSEPORT and SO_REUSEADDR equivalent for multicast sockets. From FreeBSD.	2014-11-25 19:09:13 +00:00
seanb	56c6664a5c	Clean up any dangling ifp references in (struct in6pcb *)->in6p_v4moptions (v4 multicast options off v4 mapped v6 socket) on interface destruction. The code to clean this up in a true v4 socket was moved to its own function which is now also called in the corresponding place for v6 sockets on interface destruction.	2014-11-25 15:04:37 +00:00
joerg	1a64665727	Drop impossible check.	2014-11-16 00:04:06 +00:00
maxv	833172a8e0	Do not uselessly include <sys/malloc.h>.	2014-11-14 17:34:23 +00:00
ozaki-r	d5cdd84d0a	Ensure callout isn't running and pending before callout_destroy Call callout_halt before callout_destroy. And also let callout (mld_timeo) not call callout_schedule when we already called callout_halt. This fixes PR 47881.	2014-11-12 03:24:25 +00:00
roy	23e96eacf2	Clear IN6_IFF_DUPLICATED when link goes down or up.	2014-11-03 13:04:12 +00:00
christos	192050492a	print mapped addresses better	2014-10-27 14:10:12 +00:00
roy	38d2e3f021	Remove the ability for userland to toggle IN6_IFF_TENTATIVE. Preserve IN6_IFF_TENTATIVE when updating address flags.	2014-10-20 14:50:09 +00:00
snj	f0a7346d21	src is too big these days to tolerate superfluous apostrophes. It's "its", people!	2014-10-18 08:33:23 +00:00
roy	15d73271e1	Tests for neighbour now work correctly on bridge(4) and carp(4) interfaces.	2014-10-14 15:29:43 +00:00
roy	1b519b6d17	Remove redundant logging.	2014-10-12 20:05:50 +00:00
christos	01fcb35dc2	document that we depend on the option numbers matching.	2014-10-12 19:02:18 +00:00
christos	4f85a755f8	Refactor the multicast membership code so that we can handle v4 mapped addresses using the v6 membership ioctls.	2014-10-12 19:00:21 +00:00
christos	48e7af1441	Succeed binding to multicast address for now: Open questions: Open questions: http://mail-index.netbsd.org/tech-net/2014/07/23/msg004714.html	2014-10-11 23:07:39 +00:00
christos	f26c7dc958	Make IPV4 mapped addresses able to do IPV4 multicast. Fixes needed: - allow binding to mapped v4 multicast addresses - define v4moptions, allow setting it via ioctl, pass it to ip_output, free it when killing the pcb. Ideally we would allow the IPV6 multicast setsockopts work on mapped addresses too, but this is a lot more work and linux does not do it either.	2014-10-11 20:53:16 +00:00
rmind	436f757159	Eliminate IFAREF() and IFAFREE() macros in favour of functions.	2014-09-09 20:16:12 +00:00
rmind	2082db2d3c	in_pcbdetach: move ip_freemoptions() under softnet_lock for now (this will be changed back once other IP paths become MP-safe). Same for IPv6 routine. This partially reverts 1.150 of in_pcb.c and 1.127 of in6_pcb.c changes.	2014-09-07 00:50:56 +00:00
matt	6f1589d59d	Don't use C++ keyword as variable. Use different prefix for nd6_prefixctl members than for nd6_prefix members.	2014-09-05 06:08:15 +00:00
matt	a9081927c7	Don't nest structure definitions.	2014-09-05 06:06:31 +00:00
matt	62dd88055e	Don't use new as a variable name.	2014-09-05 05:33:06 +00:00
maxv	b6cc446ce5	http://m00nbsd.net/ae123a9bae03f7dde5c6d654412daf5a.html#Report-2 #03-0x02: Memory leak ok ozaki-r@	2014-08-16 17:27:09 +00:00
rtr	8cf67cc6d5	split PRU_CONNECT2 & PRU_PURGEIF function out of pr_generic() usrreq switches and put into separate functions - always KASSERT(solocked(so)) even if not implemented (for PRU_CONNECT2 only) - replace calls to pr_generic() with req = PRU_CONNECT2 with calls to pr_connect2() - replace calls to pr_generic() with req = PRU_PURGEIF with calls to pr_purgeif() put common code from unp_connect2() (used by unp_connect() into unp_connect1() and call out to it when needed patch only briefly reviewed by rmind@	2014-08-09 05:33:00 +00:00
rtr	822872eada	split PRU_RCVD function out of pr_generic() usrreq switches and put into separate functions - always KASSERT(solocked(so)) even if not implemented - replace calls to pr_generic() with req = PRU_RCVD with calls to pr_rcvd()	2014-08-08 03:05:44 +00:00
rtr	651e5bd3f8	split PRU_SEND function out of pr_generic() usrreq switches and put into separate functions xxx_send(struct socket , struct mbuf , struct mbuf , struct mbuf , struct lwp *) - always KASSERT(solocked(so)) even if not implemented - replace calls to pr_generic() with req = PRU_SEND with calls to pr_send() rename existing functions that operate on PCB for consistency (and to free up their names for xxx_send() PRUs - l2cap_send() -> l2cap_send_pcb() - sco_send() -> sco_send_pcb() - rfcomm_send() -> rfcomm_send_pcb() patch reviewed by rmind	2014-08-05 07:55:31 +00:00
rtr	ce6a5ff64f	revert the removal of struct lwp * parameter from bind, listen and connect user requests. this should resolve the issue relating to nfs client hangs presented recently by wiz on current-users@	2014-08-05 05:24:26 +00:00
rmind	73e4b5c06b	in6_pcbdetach: now that IGMP and multicast groups are MP-safe, we can move the ip6_freemoptions() call outside the softnet_lock. Should fix PR/49065.	2014-08-03 22:55:24 +00:00
ozaki-r	a2084f3bc9	Define IFADDR_FOREACH_SAFE for on-the-fly element removal in a loop We have to use it when we purge an address element in an ifaddr loop. This change restores the original behavior that was accidentally degraded.	2014-07-31 06:35:47 +00:00
rtr	892163b8e9	split PRU_DISCONNECT, PRU_SHUTDOWN and PRU_ABORT function out of pr_generic() usrreq switches and put into separate functions xxx_disconnect(struct socket ) xxx_shutdown(struct socket ) xxx_abort(struct socket *) - always KASSERT(solocked(so)) even if not implemented - replace calls to pr_generic() with req = PRU_{DISCONNECT,SHUTDOWN,ABORT} with calls to pr_{disconnect,shutdown,abort}() respectively rename existing internal functions used to implement above functionality to permit use of the names for xxx_{disconnect,shutdown,abort}(). - {l2cap,sco,rfcomm}_disconnect() -> {l2cap,sco,rfcomm}_disconnect_pcb() - {unp,rip,tcp}_disconnect() -> {unp,rip,tcp}_disconnect1() - unp_shutdown() -> unp_shutdown1() patch reviewed by rmind	2014-07-31 03:39:35 +00:00
ozaki-r	f46609a687	Define IFNET_EMPTY() and replace !IFNET_FIRST() with it No functional change.	2014-07-31 02:21:51 +00:00
rtr	ad6ae402db	split PRU_CONNECT function out of pr_generic() usrreq switches and put into seaparate functions xxx_listen(struct socket , struct mbuf ) - always KASSERT(solocked(so)) and KASSERT(nam != NULL) - replace calls to pr_generic() with req = PRU_CONNECT with pr_connect() - rename existin {l2cap,sco,rfcomm}_connect() to {l2cap,sco,rfcomm}_connect_pcb() respectively to permit naming consistency with other protocols functions. - drop struct lwp * parameter from unp_connect() and at_pcbconnect() and use curlwp instead where appropriate. patch reviewed by rmind	2014-07-30 10:04:25 +00:00
joerg	0bb0ad593a	PR 49036: net.inet6 has not been created when the sysctl constructor for net.inet6.multicast is run.	2014-07-26 22:21:16 +00:00
ozaki-r	011b056d61	Use IFADDR_FOREACH for iterating if_addrlist of ifnet	2014-07-25 07:12:55 +00:00
rtr	6dd8eef044	split PRU_BIND and PRU_LISTEN function out of pr_generic() usrreq switches and put into separate functions xxx_bind(struct socket , struct mbuf ) xxx_listen(struct socket ) - always KASSERT(solocked(so)) even if not implemented - replace calls to pr_generic() with req = PRU_BIND with call to pr_bind() - replace calls to pr_generic() with req = PRU_LISTEN with call to pr_listen() - drop struct lwp parameter from at_pcbsetaddr(), in_pcbbind() and unp_bind() and always use curlwp. rename existing functions that operate on PCB for consistency (and to free up their names for xxx_{bind,listen}() PRUs - l2cap_{bind,listen}() -> l2cap_{bind,listen}_pcb() - sco_{bind,listen}() -> sco_{bind,listen}_pcb() - rfcomm_{bind,listen}() -> rfcomm_{bind,listen}_pcb() patch reviewed by rmind welcome to netbsd 6.99.48	2014-07-24 15:12:03 +00:00
rtr	35b22fa96a	split PRU_SENDOOB and PRU_RCVOOB function out of pr_generic() usrreq switches and put into separate functions xxx_sendoob(struct socket , struct mbuf , struct mbuf ) xxx_recvoob(struct socket , struct mbuf *, int) - always KASSERT(solocked(so)) even if request is not implemented - replace calls to pr_generic() with req = PRU_{SEND,RCV}OOB with calls to pr_{send,recv}oob() respectively. there is still some tweaking of m_freem(m) and m_freem(control) to come for consistency. not performed with this commit for clarity. reviewed by rmind	2014-07-23 13:17:18 +00:00
rtr	d27b133d27	* split PRU_ACCEPT function out of pr_generic() usrreq switches and put into a separate function xxx_accept(struct socket , struct mbuf ) note: future cleanup will take place to remove struct mbuf parameter type and replace it with a more appropriate type. patch reviewed by rmind	2014-07-09 14:41:42 +00:00
rtr	d575eb5454	* split PRU_PEERADDR and PRU_SOCKADDR function out of pr_generic() usrreq switches and put into separate functions xxx_{peer,sock}addr(struct socket , struct mbuf ). - KASSERT(solocked(so)) always in new functions even if request is not implemented - KASSERT(pcb != NULL) and KASSERT(nam) if the request is implemented and not for tcp. * for tcp roll #ifdef KPROF and #ifdef DEBUG code from tcp_usrreq() into easier to cut & paste functions tcp_debug_capture() and tcp_debug_trace() - functions provided by rmind - remaining use of PRU_{PEER,SOCK}ADDR #define to be removed in a future commit. * rename netbt functions to permit consistency of pru function names (as has been done with other requests already split out). - l2cap_{peer,sock}addr() -> l2cap_{peer,sock}_addr_pcb() - rfcomm_{peer,sock}addr() -> rfcomm_{peer,sock}_addr_pcb() - sco_{peer,sock}addr() -> sco_{peer,sock}_addr_pcb() * split/refactor do_sys_getsockname(lwp, fd, which, nam) into two functions do_sys_get{peer,sock}name(fd, nam). - move PRU_PEERADDR handling into do_sys_getpeername() from do_sys_getsockname() - have svr4_stream directly call do_sys_get{sock,peer}name() respectively instead of providing `which' & fix a DPRINTF string that incorrectly wrote "getpeername" when it meant "getsockname" - fix sys_getpeername() and sys_getsockname() to call do_sys_get{sock,peer}name() without `which' and `lwp' & adjust comments - bump kernel version for removal of lwp & which parameters from do_sys_getsockname() note: future cleanup to remove struct mbuf * abuse in xxx_{peer,sock}name() still to come, not done in this commit since it is easier to do post split. patch reviewed by rmind welcome to 6.99.47	2014-07-09 04:54:03 +00:00
rtr	ff90c29d04	* sprinkle KASSERT(solocked(so)); in all pr_stat() functions. * fix remaining inconsistent struct socket parameter names.	2014-07-07 17:13:56 +00:00
rtr	909a1fc699	backout change that made pr_stat return EOPNOTSUPP for protocols that were not filling in struct stat. decision made after further discussion with rmind and investigation of how other operating systems behave. soo_stat() is doing just enough to be able to call what gets returned valid and thus justifys a return of success. additional review will be done to determine of the pr_stat functions that were already returning EOPNOTSUPP can be considered successful with what soo_stat() is doing.	2014-07-07 15:13:21 +00:00
rtr	183fc9ab77	* have pr_stat return EOPNOTSUPP consistently for all protocols that do not fill in struct stat instead of returning success. * in pr_stat remove all checks for non-NULL so->so_pcb except where the pcb is actually used (i.e. cases where we don't return EOPNOTSUPP). proposed on tech-net@	2014-07-07 07:09:58 +00:00
rtr	a60320ca07	* split PRU_SENSE functionality out of xxx_usrreq() switches and place into separate xxx_stat(struct socket , struct stat ) functions. * replace calls using pr_generic with req == PRU_SENSE with pr_stat(). further change will follow that cleans up the pattern used to extract the pcb and test for its presence. reviewed by rmind	2014-07-06 03:33:33 +00:00
justin	de6ed608dd	On ARM the variable name 'delay' shadows a function here, rename to avoid -Wshadow objecting.	2014-07-01 23:01:54 +00:00
ozaki-r	f3479d05c4	Stop using callout randomly nd6_dad_start uses callout when xtick > 0 while doesn't when xtick == 0. So if we pass a random value ranging from 0 to N, nd6_dad_start uses callout randomly. This behavior makes debugging difficult. Discussed in http://mail-index.netbsd.org/tech-kern/2014/06/25/msg017278.html	2014-07-01 07:51:29 +00:00
rtr	0dedd9772f	fix parameter types in pr_ioctl, called xx_control() functions and remove abuse of pointer to struct mbuf type. param2 changed to u_long type and uses parameter name 'cmd' (ioctl command) param3 changed to void * type and uses parameter name 'data' param4 changed to struct ifnet * and uses parameter name 'ifp' param5 has been removed (formerly struct lwp ) and uses of 'l' have been replaced with curlwp from curproc(9). callers have had (now unnecessary) casts to struct mbuf removed, called code has had (now unnecessary) casts to u_long, void * and struct ifnet * respectively removed. reviewed by rmind@	2014-07-01 05:49:18 +00:00
rtr	c5cb349386	where appropriate rename xxx_ioctl() struct mbuf * parameters from `control' to `ifp' after split from xxx_usrreq(). sys_socket.c fix wrapping of arguments to be consistent with other function calls in the file after replacing pr_usrreq() call with pr_ioctl() which required one less argument. link_proto.c fix indentation of parameters in link_ioctl() prototype to be consistent with the rest of the file. discussed with rmind@	2014-06-23 17:18:45 +00:00
rtr	d54d7ab24a	* split PRU_CONTROL functionality out of xxx_userreq() switches and place into separate xxx_ioctl() functions. * place KASSERT(req != PRU_CONTROL) inside xxx_userreq() as it is now inappropriate for req = PRU_CONTROL in xxx_userreq(). * replace calls to pr_generic() with req = PRU_CONTROL with pr_ioctl(). * remove & fixup references to PRU_CONTROL xxx_userreq() function comments. * fix various comments references for xxx_userreq() that mentioned PRU_CONTROL as xxx_userreq() no longer handles the request. a further change will follow to fix parameter and naming inconsistencies retained from original code. Reviewed by rmind@	2014-06-22 08:10:18 +00:00
ozaki-r	e05f40117a	Add 3rd argument to pktq_create to pass sc It will be used to pass bridge sc for bridge_forward softint. ok rmind@	2014-06-16 00:33:39 +00:00
joerg	539332ecd5	Introduce new sysctls for obtaining interface-specific addresses: - net.sdl for the active link-layer adddress (the MAC) - net.ether.multicast for the Ethernet multicast addresses - net.inet6.multicast for the IPv6 multicast groups - net.inet6.multicast_kludge for temporarily removed multicast groups Use this sysctls for replacing the kmem grovelling in ifmcstat(8).	2014-06-10 09:38:30 +00:00
rmind	32293d340f	- Eliminate RTFREE() macro in favour of rtfree() function. - Make rtcache() function static.	2014-06-06 01:02:47 +00:00
rmind	60d350cf6d	- Implement pktqueue interface for lockless IP input queue. - Replace ipintrq and ip6intrq with the pktqueue mechanism. - Eliminate kernel-lock from ipintr() and ip6intr(). - Some preparation work to push softnet_lock out of ipintr(). Discussed on tech-net.	2014-06-05 23:48:16 +00:00
roy	0398025216	Add IPV6CTL_AUTO_LINKLOCAL and ND6_IFF_AUTO_LINKLOCAL toggles which control the automatic creation of IPv6 link-local addresses when an interface is brought up. Taken from FreeBSD.	2014-06-05 16:06:49 +00:00
joerg	fca46cebb6	Use explicit initializer.	2014-06-02 11:02:20 +00:00
christos	5d61e6c015	Introduce 2 new variables: ipsec_enabled and ipsec_used. Ipsec enabled is controlled by sysctl and determines if is allowed. ipsec_used is set automatically based on ipsec being enabled, and rules existing.	2014-05-30 01:39:03 +00:00
rmind	9a6de984e5	Move udp6_input(), udp6_sendup(), udp6_realinput() and udp6_input_checksum() from udp_usrreq.c to udp6_usrreq.c where they belong. No functional change.	2014-05-22 22:56:53 +00:00
bouyer	8ec9289dda	Sync with the ipv4 code and call ifp->if_output() with KERNEL_LOCK held. Problem reported and fix tested by njoly@ on current-users@	2014-05-20 20:23:56 +00:00
rmind	e401453f3f	Adjust PR_WRAP_USRREQS() to include the attach/detach functions. We still need the kernel-lock for some corner cases.	2014-05-20 19:04:00 +00:00
rmind	4ae03c1815	- Split off PRU_ATTACH and PRU_DETACH logic into separate functions. - Replace malloc with kmem and eliminate M_PCB while here. - Sprinkle more asserts.	2014-05-19 02:51:24 +00:00
rmind	39bd8dee77	Add struct pr_usrreqs with a pr_generic function and prepare for the dismantling of pr_usrreq in the protocols; no functional change intended. PRU_ATTACH/PRU_DETACH changes will follow soon. Bump for struct protosw. Welcome to 6.99.62!	2014-05-18 14:46:15 +00:00
rmind	94eca5faba	Use IFNET_FIRST() rather than open coding ifnet access.	2014-05-18 00:10:11 +00:00
rmind	bc9504c95e	Replace open-coded access (and boundary checking) of ifindex2ifnet with if_byindex() function.	2014-05-17 21:26:20 +00:00
rmind	f7741dab17	- Move IFNET_*() macros under #ifdef _KERNEL. - Replace TAILQ_FOREACH on ifnet with IFNET_FOREACH().	2014-05-17 20:44:24 +00:00
pooka	d610791504	Wrap ipflow_create() & ip6flow_create() in kernel lock. Prevents the interrupt side on another core from seeing the situation while the ipflow is being modified.	2014-04-01 13:11:44 +00:00
roy	263486c97b	If IPv6 is disabled for an interface, mark all addresses as tentative. If enabled, check for a duplicated link-local address and abort enabling as per RFC 4862, section 5.4.5. If allowed to enable, perform DAD on the tentative addresses. Taken from FreeBSD.	2014-03-20 13:34:35 +00:00
pooka	4f6fb3bf35	Ensure that the top level sysctl nodes (kern, vfs, net, ...) exist before the sysctl link sets are processed, and remove redundancy. Shaves >13kB off of an amd64 GENERIC, not to mention >1k duplicate lines of code.	2014-02-25 18:30:08 +00:00
joerg	2ab760d98c	Bail out in case m_pulldown failed.	2014-02-20 13:36:06 +00:00
roy	a277db7c28	Remove dead code.	2014-01-15 10:52:11 +00:00
roy	b122449be2	If the address matches a cloning route, it is also a neighbor. This allows us to use prefixes which userland may have added.	2014-01-15 10:25:04 +00:00
roy	e0146f9ccc	Remove the now un-used function in6ifa_ifplocaladdr.	2014-01-13 18:57:48 +00:00
roy	f29241a88d	When handling NS/NA we need to check our prefix list instead of our address list to work out if it came from a valid neighbor.	2014-01-13 18:23:36 +00:00
pooka	acb676442c	Allow kernels compiled with INET+INET6 to be booted as IPv4-only or IPv6-only.	2014-01-02 18:29:01 +00:00
martin	89c87ea341	Instead of voodo casts use simple byte pointer arithmetic and memcpy to create the "packed" binary format we pass out to userland when querying the router/prefix list.	2013-12-17 20:25:00 +00:00
christos	007db6ee9d	convert from CIRCLEQ to TAILQ.	2013-11-23 14:20:21 +00:00
riz	5236eb3350	Revert previous and solve in a different way, using __unused. Fixes building with MRT6DEBUG. ok martin.	2013-11-21 21:55:13 +00:00
martin	d393a539e6	Mark a variable as used only in diagnostic kernels	2013-10-25 15:44:39 +00:00
christos	c7c7d1300b	define constants for scopeid function flags.	2013-10-19 15:44:29 +00:00
christos	14a31944e9	add scopeid functions	2013-10-19 00:09:03 +00:00
mrg	16b81f3bcd	convert a DIAGNOSTIC / panic into a KASSERTMSG().	2013-10-18 02:20:15 +00:00
christos	191f4d1d8e	check result of setscope, from logan.	2013-10-04 14:23:14 +00:00
christos	ff9d8f8219	check sockopt_get() error, from logan.	2013-10-03 20:27:55 +00:00
martin	107b587925	Remove unused variable	2013-09-14 21:08:35 +00:00
martin	3d10084754	Remove unused variable and ifdef some others like their use	2013-09-14 11:33:59 +00:00
christos	952f93f19e	Include BRDADDR and NETMASK to the v4 ioctls we ban for v6; from FreeBSD. Remove X25 stuff which has been GC'ed. XXX: pullup-5,6	2013-09-11 23:15:47 +00:00
christos	d407b3e25b	draft-gont-6man-ipv6-atomic-fragment-00 is now RFC 6949 (Loganaden Velvindron logan at elandsys dot com)	2013-08-30 07:42:08 +00:00
rmind	f04a92b1d6	- Rewrite parts of pfil(9): use array to store hooks and thus be more cache friendly (there are only few hooks in the system). Make the structures opaque and the interface more strict. - Remove PFIL_HOOKS option by making pfil(9) mandatory.	2013-06-29 21:06:57 +00:00
roy	3643d6b4fe	Move the detaching and making tentative addresses out if in6_if_up and into in6_if_link_up. This fixes a possible panic where link is up but not the interface. Note that a better solution would be to listen to the routing socket in the kernel, but I don't know how to do that. Reachable Router tests for IFF_UP as well.	2013-06-20 13:56:29 +00:00
roy	49e60b0459	When an interface link state changes to down, mark all attached IPv6 addresses as detached. Likewise, when the link state changes to up, mark all detached IPv6 as tentative and start DAD on them. Advertised router reachability now checks that link state is not down. This means that when an interface link state changes, the default IPv6 router may change as well.	2013-06-11 12:08:29 +00:00
christos	27fe772ddc	IPSEC has not come in two speeds for a long time now (IPSEC == kame, FAST_IPSEC). Make everything refer to IPSEC to avoid confusion.	2013-06-05 19:01:26 +00:00
roy	cf9f00bd51	Generate RTM_NEWADDR when adding a pre-existing IPv6 address.	2013-05-29 12:07:58 +00:00
msaitoh	c259649f35	Clear mbuf's csum_flags in ip6flow_fastforward(). Fixes PR#47849.	2013-05-23 16:49:46 +00:00
roy	ad83294f6e	Disable nd6_newaddrmsg debug	2013-05-21 09:54:12 +00:00
roy	a34d72845c	For IPv6, emit RTM_NEWADDR once DAD completes and also when address flag changes. Tentative addresses are not emitted. Version bumped so userland can detect this behaviour change.	2013-05-21 08:37:27 +00:00
joerg	89a508fbb5	Systematically include sys/featuretest.h when _NETBSD_SOURCE is used. Some are redundant, but make verification with grep much easier.	2013-04-27 21:35:24 +00:00
christos	f46fe92653	PR/47738: connect(2) to 239.x.y.z should return error but does not.	2013-04-12 21:30:40 +00:00
gdt	2431ad86cc	Initialize variable used as (conditional) result parameter. ip6_insertfraghdr either sets a result parameter or returns an error. While the caller only uses the result parameter in the non-error case, knowing that requires cross-module static analysis, and that's not robust against distant code changes. Therfore, set ip6f to NULL before the function call that maybe sets it, avoiding a spuruious warning and changing the future possible bug from an unitialized dereference to a NULL deferrence.	2013-03-18 19:31:39 +00:00
joerg	e240adbd0b	Retire OSI network stack. OK core@	2013-03-01 18:25:13 +00:00

... 5 6 7 8 9 ...

1913 Commits