Mirror/frr - Forgejo: Beyond coding. We Forge.

Mirror/frr

mirror of https://github.com/FRRouting/frr.git synced 2025-04-30 13:37:17 +02:00

Author	SHA1	Message	Date
Mark Stapp	2c5c679e93	zebra: use new per-NS interface iteration Replace use of the old if_table with the new per-NS ifp iteration apis. Signed-off-by: Mark Stapp <mjs@cisco.com>	2024-10-28 14:54:06 -04:00
Mark Stapp	5af72e95b0	zebra: add new per-NS tree of interfaces Add new per-NS interface typerb tree, using external linkage, to replace the use of the if_table table. Add apis to iterate the per-NS collection - which is not public. Signed-off-by: Mark Stapp <mjs@cisco.com>	2024-10-28 14:54:06 -04:00
Donald Sharp	811168ecc3	zebra: Add safi to some debugs Trying to figure out what safi we are talking about is fun when it is not put into the debugs. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-28 14:29:31 -04:00
Philippe Guibert	a7fec9c387	zebra: add 'debug zebra srv6' command Add a specific debug command to handle srv6 troubleshooting. Move the srv6 traces that initially were under 'debug zebra packet' debug. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2024-10-28 16:40:33 +01:00
Mark Stapp	1f8d81335d	zebra: make a zif MTYPE internal/static Make an MTYPE used in zifs internal/static Signed-off-by: Mark Stapp <mjs@cisco.com>	2024-10-28 11:09:52 -04:00
Jafar Al-Gharaibeh	f11421d4ec	Merge pull request #17160 from opensourcerouting/fix/keep_zebra_on-rib-process_in_frr.conf lib, zebra: Keep `zebra on-rib-process script` in frr.conf	2024-10-27 18:23:36 -05:00
Donald Sharp	138935a5fd	bgpd: Fix wrong pthread event cancelling 0 __pthread_kill_implementation (no_tid=0, signo=6, threadid=130719886083648) at ./nptl/pthread_kill.c:44 1 __pthread_kill_internal (signo=6, threadid=130719886083648) at ./nptl/pthread_kill.c:78 2 __GI___pthread_kill (threadid=130719886083648, signo=signo@entry=6) at ./nptl/pthread_kill.c:89 3 0x000076e399e42476 in __GI_raise (sig=6) at ../sysdeps/posix/raise.c:26 4 0x000076e39a34f950 in core_handler (signo=6, siginfo=0x76e3985fca30, context=0x76e3985fc900) at lib/sigevent.c:258 5 <signal handler called> 6 __pthread_kill_implementation (no_tid=0, signo=6, threadid=130719886083648) at ./nptl/pthread_kill.c:44 7 __pthread_kill_internal (signo=6, threadid=130719886083648) at ./nptl/pthread_kill.c:78 8 __GI___pthread_kill (threadid=130719886083648, signo=signo@entry=6) at ./nptl/pthread_kill.c:89 9 0x000076e399e42476 in __GI_raise (sig=sig@entry=6) at ../sysdeps/posix/raise.c:26 10 0x000076e399e287f3 in __GI_abort () at ./stdlib/abort.c:79 11 0x000076e39a39874b in _zlog_assert_failed (xref=0x76e39a46cca0 <_xref.27>, extra=0x0) at lib/zlog.c:789 12 0x000076e39a369dde in cancel_event_helper (m=0x5eda32df5e40, arg=0x5eda33afeed0, flags=1) at lib/event.c:1428 13 0x000076e39a369ef6 in event_cancel_event_ready (m=0x5eda32df5e40, arg=0x5eda33afeed0) at lib/event.c:1470 14 0x00005eda0a94a5b3 in bgp_stop (connection=0x5eda33afeed0) at bgpd/bgp_fsm.c:1355 15 0x00005eda0a94b4ae in bgp_stop_with_notify (connection=0x5eda33afeed0, code=8 '\b', sub_code=0 '\000') at bgpd/bgp_fsm.c:1610 16 0x00005eda0a979498 in bgp_packet_add (connection=0x5eda33afeed0, peer=0x5eda33b11800, s=0x76e3880daf90) at bgpd/bgp_packet.c:152 17 0x00005eda0a97a80f in bgp_keepalive_send (peer=0x5eda33b11800) at bgpd/bgp_packet.c:639 18 0x00005eda0a9511fd in peer_process (hb=0x5eda33c9ab80, arg=0x76e3985ffaf0) at bgpd/bgp_keepalives.c:111 19 0x000076e39a2cd8e6 in hash_iterate (hash=0x76e388000be0, func=0x5eda0a95105e <peer_process>, arg=0x76e3985ffaf0) at lib/hash.c:252 20 0x00005eda0a951679 in bgp_keepalives_start (arg=0x5eda3306af80) at bgpd/bgp_keepalives.c:214 21 0x000076e39a2c9932 in frr_pthread_inner (arg=0x5eda3306af80) at lib/frr_pthread.c:180 22 0x000076e399e94ac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442 23 0x000076e399f26850 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 (gdb) f 12 12 0x000076e39a369dde in cancel_event_helper (m=0x5eda32df5e40, arg=0x5eda33afeed0, flags=1) at lib/event.c:1428 1428 assert(m->owner == pthread_self()); In this decode the attempt to cancel the connection's events from the wrong thread is causing the crash. Modify the code to create an event on the bm->master to cancel the events for the connection. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-24 21:01:26 -04:00
Donatas Abraitis	91e157f3ae	Merge pull request #17162 from louis-6wind/fix-bh-nh-vrf zebra: fix showing nexthop vrf for ipv6 blackhole	2024-10-23 17:34:44 +03:00
Jafar Al-Gharaibeh	0078472e19	Merge pull request #17180 from anlancs/zebra/review-move-dplane zebra: drop NEWLINK event handling in the main thread	2024-10-22 10:29:49 -05:00
anlan_cs	96192f6aee	zebra: drop NEWLINK event handling in the main thread NEWLINK is only registered by the dplane thread, the main thread doesn't care about it. So remove the real process of `netlink_link_change()` for NEWLINK event in main thread. And move NEWLINK/DELLINK event to the block where the dplane messages are kept together. Signed-off-by: anlan_cs <anlan_cs@126.com>	2024-10-22 09:05:00 +08:00
anlan_cs	5829fea1b5	zebra: remove useless code Signed-off-by: anlan_cs <anlan_cs@126.com>	2024-10-19 13:32:53 +08:00
Louis Scalbert	6cdc82b21b	zebra: fix showing nexthop vrf for ipv6 blackhole For some reasons the Linux kernel associates the ipv6 blackhole of non default table the lo interface. > root@r1# ip -6 route show table 100 > root@r1# ip -6 route add unreachable default metric 4278198272 table 100 > root@r1# ip -6 route show table 100 > unreachable default dev lo metric 4278198272 pref medium As a consequence, the VRF default that owns the lo interface is shown as the nexthop VRF: > r1# show ipv6 route table 20 > Table 20: > K>* ::/0 [255/8192] unreachable (ICMP unreachable) (vrf default), 00:18:12 Do not display the nexthop VRF of a blackhole. It does not make sense for a blackhole and it was not displayed in the past. Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>	2024-10-18 14:45:50 +02:00
Donatas Abraitis	1fe1f8d87c	lib, zebra: Keep `zebra on-rib-process script` in frr.conf After the change: ``` $ grep on-rib-process /etc/frr/frr.conf zebra on-rib-process script script4 $ systemctl restart frr $ vtysh -c 'show run' \| grep on-rib-process zebra on-rib-process script script4 ``` Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2024-10-18 15:36:52 +03:00
Donald Sharp	5a2a9e3b89	zebra: Fix possible null deref discovered by coverity Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-17 07:42:47 -04:00
Donald Sharp	466efab870	Merge pull request #17136 from opensourcerouting/clang-sa-19 *: fix clang-19 SA	2024-10-17 07:38:28 -04:00
Donatas Abraitis	1ce225d7e4	Merge pull request #17076 from donaldsharp/rnh_and_redistribution_nexthop_num_fix *: Fix up improper handling of nexthops for nexthop tracking	2024-10-16 16:34:08 +03:00
Donald Sharp	cc63dbb68f	Merge pull request #17020 from pguibert6WIND/asan_shutdown zebra: fix heap-use-after free on ns shutdown	2024-10-16 09:15:06 -04:00
David Lamparter	e6cb1a90f2	zebra: check `dirfd()` result `dirfd()` can theoretically return an error. Call it once and check the result. clang-SA: technically correct™. Ain't that the best kind of correct? Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2024-10-16 13:30:25 +02:00
David Lamparter	67b0a457ed	zebra: don't misappropriate `errno` `errno` has its own semantics. Sometimes it is correct to write to it. This is not one of those cases - just use a separate `nl_errno`. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2024-10-16 13:30:25 +02:00
David Lamparter	1350f8d1c1	zebra: don't try to read past EOF `FILE *` objects are theoretically in an invalid state if you try to use them past their reporting EOF. Adjust the code to make it correct. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2024-10-16 13:30:25 +02:00
David Lamparter	c071b4370d	*: clang-SA switch-enum initializer workarounds In these cases the value assigned by the switch block is used directly rather than returned. Mark the initial/default value as used so clang-SA doesn't complain about it. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2024-10-16 13:30:25 +02:00
David Lamparter	49cf311d46	*: clang-SA friendly switch-enum-return-string clang-19's SA complains about unused initializers for this kind of "switch (enum) { return string }" kind of code. Use direct string return values to avoid the issue. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2024-10-16 13:00:11 +02:00
Donatas Abraitis	c32bdc2469	Merge pull request #17116 from enkechen-panw/zfix-2 zebra: unlock node only after operation in zebra_free_rnh()	2024-10-16 08:12:28 +03:00
Jafar Al-Gharaibeh	b2eaf86fb5	Merge pull request #15586 from donaldsharp/nht_explain_doc zebra: Attempt to explain the rnh tracking code better	2024-10-15 14:25:35 -05:00
Jafar Al-Gharaibeh	b23bbb885a	Merge pull request #17088 from donaldsharp/connected_kernel_fun zebra: Prevent a kernel route from being there when a connected should	2024-10-15 14:04:51 -05:00
Enke Chen	5b6ff51b8a	zebra: unlock node only after operation in zebra_free_rnh() Move route_unlock_node() after rnh_list_del(). Signed-off-by: Enke Chen <enchen@paloaltonetworks.com>	2024-10-15 10:25:46 -07:00
Donald Sharp	28237d73ad	zebra: Attempt to explain the rnh tracking code better I got asked today what was going on in the rnh code. I had to take time off of what I was doing and rewrap my head around this code, since it's been a long time. As that this question may come up again in the future I am trying to document this better so that someone coming behind us will be able to read this and get a better idea of what the algorithm is attempting to do. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-15 12:42:17 -04:00
Donald Sharp	645a9e4f83	*: Fix up improper handling of nexthops for nexthop tracking Currently FRR needs to send a uint16_t value for the number of nexthops as well it needs the ability to properly decode all of this. Find and handle all the places that this happens. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-15 11:57:23 -04:00
Mark Stapp	6c1bc51bbb	Merge pull request #16737 from raja-rajasekar/rajasekarr/vlan_to_dplane zebra: vlan to dplane	2024-10-15 08:06:34 -04:00
Donald Sharp	74e25198e7	zebra: Prevent a kernel route from being there when a connected should There exists a series of events where a kernel route is learned first( that happens to be exactly what a connected route should be ) and FRR ends up with both a kernel route and a connected route, leaving us in a very strange spot. This code change just mirrors the existing code of if there is a connected route drop the kernel route. Here we just do the reverse, if we have a kernel route already and a connected should be created, remove the kernel and keep the connected. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-14 11:27:53 -04:00
Donatas Abraitis	d1433ee9a8	Merge pull request #17062 from donaldsharp/dplane_fpm_nl_problems zebra: Only notify dplane work pthread when needed	2024-10-14 08:14:34 +03:00
anlan_cs	05e2472de7	zebra: add back one field for debug The `flags` field is removed recently, so add back it for debug. Signed-off-by: anlan_cs <anlan_cs@126.com>	2024-10-13 21:30:46 +08:00
Donald Sharp	8aa97a439f	zebra: Slow down fpm_process_queue When the fpm_process_queue has run out of space but has written to the fpm output buffer, schedule it to wake up immediately, as that the write will go out pretty much immediately, since it was scheduled first. If the fpm_process_queue has not written to the output buffer then delay the processing by 10 milliseconds to allow a possibly backed up write processing to have a chance to complete it's work. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-11 09:37:37 -04:00
Donald Sharp	963792e8c5	zebra: Only notify dplane work pthread when needed The fpm_nl_process function was getting the count of the total number of ctx's processed. This leads to after having processed 1 context to always signal the dataplane that there is work to do. Change the code to only notify the dplane worker when a context was actually added to the outgoing context queue. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-11 09:37:37 -04:00
Donald Sharp	154a89bc31	zebra: Fix crash in pw code Recent PR #17009 introduced a crash in pw handing for deletion. Let's fix that problem. Fixes: #17041 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-09 07:17:29 -04:00
Philippe Guibert	7ae70eb5ef	zebra: fix heap-use-after free on ns shutdown The following ASAN issue has been observed: > ERROR: AddressSanitizer: heap-use-after-free on address 0x6160000acba4 at pc 0x55910c5694d0 bp 0x7ffe3a8ac850 sp 0x7ffe3a8ac840 > READ of size 4 at 0x6160000acba4 thread T0 > #0 0x55910c5694cf in ctx_info_from_zns zebra/zebra_dplane.c:3315 > #1 0x55910c569696 in dplane_ctx_ns_init zebra/zebra_dplane.c:3331 > #2 0x55910c56bf61 in dplane_ctx_nexthop_init zebra/zebra_dplane.c:3680 > #3 0x55910c5711ca in dplane_nexthop_update_internal zebra/zebra_dplane.c:4490 > #4 0x55910c571c5c in dplane_nexthop_delete zebra/zebra_dplane.c:4717 > #5 0x55910c61e90e in zebra_nhg_uninstall_kernel zebra/zebra_nhg.c:3413 > #6 0x55910c615d8a in zebra_nhg_decrement_ref zebra/zebra_nhg.c:1919 > #7 0x55910c6404db in route_entry_update_nhe zebra/zebra_rib.c:454 > #8 0x55910c64c904 in rib_re_nhg_free zebra/zebra_rib.c:2822 > #9 0x55910c655be2 in rib_unlink zebra/zebra_rib.c:4212 > #10 0x55910c6430f9 in zebra_rtable_node_cleanup zebra/zebra_rib.c:968 > #11 0x7f26f275b8a9 in route_node_free lib/table.c:75 > #12 0x7f26f275bae4 in route_table_free lib/table.c:111 > #13 0x7f26f275b749 in route_table_finish lib/table.c:46 > #14 0x55910c65db17 in zebra_router_free_table zebra/zebra_router.c:191 > #15 0x55910c65dfb5 in zebra_router_terminate zebra/zebra_router.c:244 > #16 0x55910c4f40db in zebra_finalize zebra/main.c:249 > #17 0x7f26f2777108 in event_call lib/event.c:2011 > #18 0x7f26f264180e in frr_run lib/libfrr.c:1212 > #19 0x55910c4f49cb in main zebra/main.c:531 > #20 0x7f26f2029d8f in __libc_start_call_main ../sysdeps/nptl/libc_start_call_main.h:58 > #21 0x7f26f2029e3f in __libc_start_main_impl ../csu/libc-start.c:392 > #22 0x55910c4b0114 in _start (/usr/lib/frr/zebra+0x1ae114) It happens with FRR using the kernel. During shutdown, the namespace identifier is attempted to be obtained by zebra, in an attempt to prepare zebra dataplane nexthop messages. Fix this by accessing the ns structure. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2024-10-08 22:25:55 +02:00
Russ White	b8c458622d	Merge pull request #17023 from donaldsharp/dplane_problems zebra: Allow dplane to pass larger number of nexthops down to dataplane	2024-10-08 11:45:27 -04:00
Donald Sharp	9f8968fc5a	*: Allow 16 bit size for nexthops Currently FRR is limiting the nexthop count to a uint8_t not a uint16_t. This leads to issues when the nexthop count is 256 which results in the count to overflow to 0 causing problems in the code. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-08 09:26:57 -04:00
Donald Sharp	a8af2b2a9d	zebra: Do not retry in 30 seconds on pw reachability failure Currently the zebra pw code has setup a retry to install the pw after 30 seconds when it is decided that reachability to the pw is gone. This causes a failure mode where the pw code just goes and re-installs the pw after 30 seconds in the non-reachability case. Instead it should just be reinstalling after reachability is restored. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-07 20:36:45 -04:00
Donald Sharp	f50b1f7c22	zebra: Move pw status settting until after we get results Currently the pw code sets the status of the pw for install and uninstall immediately when notifying the dplane. This is incorrect in that we do not actually know the status at this point in time. When we get the result is when to set the status. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-10-07 20:36:45 -04:00
Donatas Abraitis	ded59bcc72	Merge pull request #17013 from dksharp5/removal_functions Removal functions	2024-10-07 11:47:01 +03:00
Donna Sharp	f62dfc5d53	lib,zebra: remove unused ZEBRA_VRF_UNREGISTER Signed-off-by: Donna Sharp <dksharp5@gmail.com>	2024-10-06 19:40:49 -04:00
Donna Sharp	103f24485c	zebra: remove unsued function from tc_netlink.c Signed-off-by: Donna Sharp <dksharp5@gmail.com>	2024-10-06 19:30:56 -04:00
Donna Sharp	7a63799a84	zebra: remove unused function from if_netlink.c Signed-off-by: Donna Sharp <dksharp5@gmail.com>	2024-10-06 19:25:44 -04:00
Donna Sharp	b6dd4ff8bc	zebra: remove unused function from tc_netlink.c Signed-off-by: Donna Sharp <dksharp5@gmail.com>	2024-10-06 19:08:44 -04:00
Donna Sharp	8eb5f4f506	zebra: remove unused function rib_lookup_ipv4 Signed-off-by: Donna Sharp <dksharp5@gmail.com>	2024-10-06 18:53:11 -04:00
Russ White	15991e1a08	Merge pull request #16800 from donaldsharp/nhg_reuse_intf_down_up Nhg reuse intf down up	2024-10-04 10:28:58 -04:00
Igor Zhukov	a3877e4444	zebra: Fix crash during reconnect fpm_enqueue_rmac_table expects an fpm_rmac_arg* as its argument. The issue can be reproduced by dropping the TCP session using: ss -K dst 127.0.0.1 dport = 2620 I used Fedora 40 and frr 9.1.2 and I got the gdb backtrace: (gdb) bt 0 0x00007fdd7d6997ea in fpm_enqueue_rmac_table (bucket=0x2134dd0, arg=0x2132b60) at zebra/dplane_fpm_nl.c:1217 1 0x00007fdd7dd1560d in hash_iterate (hash=0x21335f0, func=0x7fdd7d6997a0 <fpm_enqueue_rmac_table>, arg=0x2132b60) at lib/hash.c:252 2 0x00007fdd7dd1560d in hash_iterate (hash=0x1e5bf10, func=func@entry=0x7fdd7d698900 <fpm_enqueue_l3vni_table>, arg=arg@entry=0x7ffed983bef0) at lib/hash.c:252 3 0x00007fdd7d698b5c in fpm_rmac_send (t=<optimized out>) at zebra/dplane_fpm_nl.c:1262 4 0x00007fdd7dd6ce22 in event_call (thread=thread@entry=0x7ffed983c010) at lib/event.c:1970 5 0x00007fdd7dd20758 in frr_run (master=0x1d27f10) at lib/libfrr.c:1213 6 0x0000000000425588 in main (argc=10, argv=0x7ffed983c2e8) at zebra/main.c:492 Signed-off-by: Igor Zhukov <fsb4000@yandex.ru>	2024-10-04 14:59:14 +07:00
Rajasekar Raja	aa4786642c	zebra: vlan to dplane Offload from main Trigger: Zebra core seen when we convert l2vni to l3vni and back BackTrace: /usr/lib/x86_64-linux-gnu/frr/libfrr.so.0(_zlog_assert_failed+0xe9) [0x7f4af96989d9] /usr/lib/frr/zebra(zebra_vxlan_if_vni_up+0x250) [0x5561022ae030] /usr/lib/frr/zebra(netlink_vlan_change+0x2f4) [0x5561021fd354] /usr/lib/frr/zebra(netlink_parse_info+0xff) [0x55610220d37f] /usr/lib/frr/zebra(+0xc264a) [0x55610220d64a] /usr/lib/x86_64-linux-gnu/frr/libfrr.so.0(thread_call+0x7d) [0x7f4af967e96d] /usr/lib/x86_64-linux-gnu/frr/libfrr.so.0(frr_run+0xe8) [0x7f4af9637588] /usr/lib/frr/zebra(main+0x402) [0x5561021f4d32] /lib/x86_64-linux-gnu/libc.so.6(+0x2724a) [0x7f4af932624a] /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0x85) [0x7f4af9326305] /usr/lib/frr/zebra(_start+0x21) [0x5561021f72f1] Root Cause: In working case, - We get a RTM_NEWLINK whose ctx is enqueued by zebra dplane and dequeued by zebra main and processed i.e. (102000 is deleted from vxlan99) before we handle RTM_NEWVLAN. - So in handling of NEWVLAN (vxlan99) we bail out since find with vlan id 703 does not exist. root@leaf2:mgmt:/var/log/frr# cat ~/raja_logs/working/nocras.log \| grep "RTM_NEWLINK\\|QUEUED\\|vxlan99\\|in thread" 2024/07/18 23:09:33.741105 ZEBRA: [KMXEB-K771Y] netlink_parse_info: netlink-dp-in (NS 0) type RTM_NEWLINK(16), len=616, seq=0, pid=0 2024/07/18 23:09:33.744061 ZEBRA: [K8FXY-V65ZJ] Intf dplane ctx 0x7f2244000cf0, op INTF_INSTALL, ifindex (65), result QUEUED 2024/07/18 23:09:33.767240 ZEBRA: [KMXEB-K771Y] netlink_parse_info: netlink-dp-in (NS 0) type RTM_NEWLINK(16), len=508, seq=0, pid=0 2024/07/18 23:09:33.767380 ZEBRA: [K8FXY-V65ZJ] Intf dplane ctx 0x7f2244000cf0, op INTF_INSTALL, ifindex (73), result QUEUED 2024/07/18 23:09:33.767389 ZEBRA: [NVFT0-HS1EX] INTF_INSTALL for vxlan99(73) 2024/07/18 23:09:33.767404 ZEBRA: [TQR2A-H2RFY] Vlan-Vni(1186:1186-6000002:6000002) update for VxLAN IF vxlan99(73) 2024/07/18 23:09:33.767422 ZEBRA: [TP4VP-XZ627] Del L2-VNI 102000 intf vxlan99(73) 2024/07/18 23:09:33.767858 ZEBRA: [QYXB9-6RNNK] RTM_NEWVLAN bridge IF vxlan99 NS 0 2024/07/18 23:09:33.767866 ZEBRA: [KKZGZ-8PCDW] Cannot find VNI for VID (703) IF vxlan99 for vlan state update >>>>BAIL OUT In failure case, - The NEWVLAN is received first even before processing RTM_NEWLINK. - Since the vxlan id 102000 is not removed from the vxlan99, the find with vlan id 703 returns the 102000 one and we invoke zebra_vxlan_if_vni_up where the interfaces don't match and assert. root@leaf2:mgmt:/var/log/frr# cat ~/raja_logs/noworking/crash.log \| grep "RTM_NEWLINK\\|QUEUED\\|vxlan99\\|in thread" 2024/07/18 22:26:43.829370 ZEBRA: [KMXEB-K771Y] netlink_parse_info: netlink-dp-in (NS 0) type RTM_NEWLINK(16), len=616, seq=0, pid=0 2024/07/18 22:26:43.829646 ZEBRA: [K8FXY-V65ZJ] Intf dplane ctx 0x7fe07c026d30, op INTF_INSTALL, ifindex (65), result QUEUED 2024/07/18 22:26:43.853930 ZEBRA: [QYXB9-6RNNK] RTM_NEWVLAN bridge IF vxlan99 NS 0 2024/07/18 22:26:43.853949 ZEBRA: [K61WJ-XQQ3X] Intf vxlan99(73) L2-VNI 102000 is UP >>> VLAN PROCESSED BEFORE INTF EVENT 2024/07/18 22:26:43.853951 ZEBRA: [SPV50-BX2RP] RAJA zevpn_vxlanif vxlan48 and ifp vxlan99 2024/07/18 22:26:43.854005 ZEBRA: [KMXEB-K771Y] netlink_parse_info: netlink-dp-in (NS 0) type RTM_NEWLINK(16), len=508, seq=0, pid=0 2024/07/18 22:26:43.854241 ZEBRA: [KMXEB-K771Y] netlink_parse_info: netlink-dp-in (NS 0) type RTM_NEWLINK(16), len=516, seq=0, pid=0 2024/07/18 22:26:43.854251 ZEBRA: [KMXEB-K771Y] netlink_parse_info: netlink-dp-in (NS 0) type RTM_NEWLINK(16), len=544, seq=0, pid=0 ZEBRA: in thread kernel_read scheduled from zebra/kernel_netlink.c:505 kernel_read() Fix: Similar to #13396, where link change handling was offloaded to dplane, same is being done for vlan events. Note: Prior to this change, zebra main thread was interested in the RTNLGRP_BRVLAN. So all the kernel events pertaining to vlan was handled by zebra main. With this change change as well the handling of vlan events is still with Zebra main. However we offload it via Dplane thread. Ticket :#3878175 Signed-off-by: Rajasekar Raja <rajasekarr@nvidia.com>	2024-09-26 20:17:35 -07:00
Rajasekar Raja	1632988acf	zebra: vlan to dplane, Relocating some functions Relocating functions used by vlan in if_netlink into zebra vxlan Note: Static variable to the functions will be added back in the next commit. Ticket :#3878175 Signed-off-by: Rajasekar Raja <rajasekarr@nvidia.com>	2024-09-25 11:56:06 -07:00
Donald Sharp	f53dde0e59	zebra: Add missing proto translations Add missing isis and eigrp proto translations. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-25 12:14:50 -04:00
Donald Sharp	e41ae0acc1	zebra: Correctly report metrics Report the routes metric in IPFORWARDMETRIC1 and return -1 for the other metrics as required by the IP-FORWARD-MIB. inetCidrRouteMetric2 OBJECT-TYPE SYNTAX Integer32 MAX-ACCESS read-create STATUS current DESCRIPTION "An alternate routing metric for this route. The semantics of this metric are determined by the routing- protocol specified in the route's inetCidrRouteProto value. If this metric is not used, its value should be set to -1." DEFVAL { -1 } ::= { inetCidrRouteEntry 13 } I've included metric2 but it's the same for all of them. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-25 12:09:40 -04:00
Donald Sharp	659cd66427	zebra: Let's use memset instead of walking bytes and setting to 0 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-25 12:08:03 -04:00
Donald Sharp	ecd9d441b0	zebra: Fix snmp walk of zebra rib The snmp walk of the zebra rib was skipping entries because in_addr_cmp was replaced with a prefix_cmp which worked slightly differently causing parts of the zebra rib tree to be skipped. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-25 12:06:29 -04:00
Donald Sharp	e54261e20d	lib, zebra: TABLE_NODE is not used No-one is using this, remove Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-24 16:05:54 -04:00
Donatas Abraitis	9616304e47	Merge pull request #16882 from mjstapp/fix_if_table_unlock zebra: unlock if_table route_nodes	2024-09-23 10:24:52 +02:00
Mark Stapp	c40635c5c2	zebra: unlock if_table route_nodes Must unlock if we break during iteration over any lib/table tree. Signed-off-by: Mark Stapp <mjs@cisco.com>	2024-09-20 12:24:01 -04:00
Donald Sharp	58722b9448	zebra: Pass in ZEBRA_ROUTE_MAX instead of true zebra_nhg_install_kernel takes a route type. We don't know it at that particular spot but we should not be passing in `true`. Let's use ZEBRA_ROUTE_MAX to indicate we do not know, so that the correct thing is done. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-20 11:00:11 -04:00
Donatas Abraitis	73d01a8e40	zebra: Send a correct size of ctx->nh6 for SRv6 SEG6_LOCAL_ACTION_END_DX6 Fixes: `f6e58d26f6` ("zebra, sharpd: add srv6 End.DX6 support") Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2024-09-19 23:54:43 +03:00
Donald Sharp	ccbfb46d28	zebra: Remove nl_addraw_l This function is never used. So let's remove it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-19 08:28:48 -04:00
Donald Sharp	1af0a67401	zebra: In zebra_evpn_mac.c remove bad comments Adding comments that tell what a variable is doing in the middle of a function call makes it extremely hard to read the formatting. Remove. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-18 07:35:54 -04:00
Donald Sharp	03a7ab10fe	zebra: Reindent some badly formatted functions in zebra_evpn_mac.c Fix some badly formatted code to fit better on the screen. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-18 07:33:17 -04:00
Donald Sharp	390406973c	zebra: Reframe zebra_evpn_mac.c to be properly formatted Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-18 07:33:17 -04:00
Donald Sharp	f02d76f0fd	zebra: Attempt to reuse NHG after interface up and route reinstall The previous commit modified zebra to reinstall the singleton nexthops for a nexthop group when a interface event comes up. Now let's modify zebra to attempt to reuse the nexthop group when this happens and the upper level protocol resends the route down with that. Only match if the protocol is the same as well as the instance and the nexthop groups would match. Here is the new behavior: eva(config)# do show ip route 9.9.9.9/32 Routing entry for 9.9.9.9/32 Known via "static", distance 1, metric 0, best Last update 00:00:08 ago * 192.168.99.33, via dummy1, weight 1 * 192.168.100.33, via dummy2, weight 1 * 192.168.101.33, via dummy3, weight 1 * 192.168.102.33, via dummy4, weight 1 eva(config)# do show ip route nexthop-group 9.9.9.9/32 % Unknown command: do show ip route nexthop-group 9.9.9.9/32 eva(config)# do show ip route 9.9.9.9/32 nexthop-group Routing entry for 9.9.9.9/32 Known via "static", distance 1, metric 0, best Last update 00:00:54 ago Nexthop Group ID: 57 * 192.168.99.33, via dummy1, weight 1 * 192.168.100.33, via dummy2, weight 1 * 192.168.101.33, via dummy3, weight 1 * 192.168.102.33, via dummy4, weight 1 eva(config)# exit eva# conf eva(config)# int dummy3 eva(config-if)# shut eva(config-if)# no shut eva(config-if)# do show ip route 9.9.9.9/32 nexthop-group Routing entry for 9.9.9.9/32 Known via "static", distance 1, metric 0, best Last update 00:00:08 ago Nexthop Group ID: 57 * 192.168.99.33, via dummy1, weight 1 * 192.168.100.33, via dummy2, weight 1 * 192.168.101.33, via dummy3, weight 1 * 192.168.102.33, via dummy4, weight 1 eva(config-if)# exit eva(config)# exit eva# exit sharpd@eva ~/frr1 (master) [255]> ip nexthop show id 57 id 57 group 37/43/50/58 proto zebra sharpd@eva ~/frr1 (master)> ip route show 9.9.9.9/32 9.9.9.9 nhid 57 proto 196 metric 20 nexthop via 192.168.99.33 dev dummy1 weight 1 nexthop via 192.168.100.33 dev dummy2 weight 1 nexthop via 192.168.101.33 dev dummy3 weight 1 nexthop via 192.168.102.33 dev dummy4 weight 1 sharpd@eva ~/frr1 (master)> Notice that we now no longer are creating a bunch of new nexthop groups. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-16 09:34:05 -04:00
Donald Sharp	3be8b48e6b	zebra: Reinstall nexthop when interface comes back up If a interface down event caused a nexthop group to remove one of the entries in the kernel, have it be reinstalled when the interface comes back up. Mark the nexthop as usable. new behavior: eva# show nexthop-group rib 181818168 ID: 181818168 (sharp) RefCnt: 1 Uptime: 00:00:23 VRF: default(bad-value) Valid, Installed Depends: (35) (38) (44) (51) via 192.168.99.33, dummy1 (vrf default), weight 1 via 192.168.100.33, dummy2 (vrf default), weight 1 via 192.168.101.33, dummy3 (vrf default), weight 1 via 192.168.102.33, dummy4 (vrf default), weight 1 eva# conf eva(config)# int dummy3 eva(config-if)# shut eva(config-if)# do show nexthop-group rib 181818168 ID: 181818168 (sharp) RefCnt: 1 Uptime: 00:00:44 VRF: default(bad-value) Depends: (35) (38) (44) (51) via 192.168.99.33, dummy1 (vrf default), weight 1 via 192.168.100.33, dummy2 (vrf default), weight 1 via 192.168.101.33, dummy3 (vrf default) inactive, weight 1 via 192.168.102.33, dummy4 (vrf default), weight 1 eva(config-if)# no shut eva(config-if)# do show nexthop-group rib 181818168 ID: 181818168 (sharp) RefCnt: 1 Uptime: 00:00:53 VRF: default(bad-value) Valid, Installed Depends: (35) (38) (44) (51) via 192.168.99.33, dummy1 (vrf default), weight 1 via 192.168.100.33, dummy2 (vrf default), weight 1 via 192.168.101.33, dummy3 (vrf default), weight 1 via 192.168.102.33, dummy4 (vrf default), weight 1 eva(config-if)# exit eva(config)# exit eva# exit sharpd@eva ~/frr1 (master) [255]> ip nexthop show id 181818168 id 181818168 group 35/38/44/51 proto 194 sharpd@eva ~/frr1 (master)> Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-16 09:34:05 -04:00
Donald Sharp	ce166ca789	zebra: Expose _route_entry_dump_nh so it can be used. Expose this helper function so it can be used in zebra_nhg.c Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-16 09:34:05 -04:00
Donald Sharp	1bbbcf043b	zebra: Properly note that a nhg's nexthop has gone down Current code when a link is set down is to just mark the nexthop group as not properly setup. Leaving situations where when an interface goes down and show output is entered we see incorrect state. This is true for anything that would be checking those flags at that point in time. Modify the interface down nexthop group code to notice the nexthops appropriately ( and I mean set the appropriate flags ) and to allow a `show ip route` command to actually display what is going on with the nexthops. eva# show ip route 1.0.0.0 Routing entry for 1.0.0.0/32 Known via "sharp", distance 150, metric 0, best Last update 00:00:06 ago * 192.168.44.33, via dummy1, weight 1 * 192.168.45.33, via dummy2, weight 1 sharpd@eva:~/frr1$ sudo ip link set dummy2 down eva# show ip route 1.0.0.0 Routing entry for 1.0.0.0/32 Known via "sharp", distance 150, metric 0, best Last update 00:00:12 ago * 192.168.44.33, via dummy1, weight 1 192.168.45.33, via dummy2 inactive, weight 1 Notice now that the 1.0.0.0/32 route now correctly displays the route for the nexthop group entry. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-16 09:34:05 -04:00
Enke Chen	f6e28717ec	zebra: include the prefix in nht show command Include the prefix in "show ip nht" and "show ipv6 nht". Signed-off-by: Enke Chen <enchen@paloaltonetworks.com>	2024-09-14 23:47:00 -07:00
Donald Sharp	e0437aba6d	zebra: Add more vrf name to debugs Trying to debug some cross vrf stuff in zebra and frankly it's hard to grep the file for the routes you are interested in. Let's clean this up some and get a bit better information for us developers Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-11 15:30:43 -04:00
Russ White	add56c61dd	Merge pull request #15259 from dmytroshytyi-6WIND/nexthop_resolution zebra: add LSP entry to nexthop via recursive (part 2)	2024-09-10 10:04:08 -04:00
Donald Sharp	98b11de9f6	zebra: Modify show `zebra dplane providers` to give more data The show zebra dplane provider command was ommitting the input and output queues to the dplane itself. It would be nice to have this insight as well. New output: r1# show zebra dplane providers dataplane Incoming Queue from Zebra: 100 Zebra dataplane providers: Kernel (1): in: 6, q: 0, q_max: 3, out: 6, q: 14, q_max: 3 dplane_fpm_nl (2): in: 6, q: 10, q_max: 3, out: 6, q: 0, q_max: 3 dataplane Outgoing Queue to Zebra: 43 r1# Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-05 15:52:05 -04:00
Donald Sharp	8926ac1984	zebra: Limit queue depth in dplane_fpm_nl The dplane providers have a concept of input queues and output queues. These queues are chained together during normal operation. The code in zebra also has a feedback mechanism where the MetaQ will not run when the first input queue is backed up. Having the dplane_fpm_nl code grab all contexts when it is backed up prevents this system from behaving appropriately. Modify the code to not add to the dplane_fpm_nl's internal queue when it is already full. This will allow the backpressure to work appropriately in zebra proper. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-05 15:52:05 -04:00
Donald Sharp	3af381b502	zebra: Modify dplane loop to allow backpressure to filter up Currently when the dplane_thread_loop is run, it moves contexts from the dg_update_list and puts the contexts on the input queue of the first provider. This provider is given a chance to run and then the items on the output queue are pulled off and placed on the input queue of the next provider. Rinse/Repeat down through the entire list of providers. Now imagine that we have a list of multiple providers and the last provider is getting backed up. Contexts will end up sticking in the input Queue of the `slow` provider. This can grow without bounds. This is a real problem when you have a situation where an interface is flapping and an upper level protocol is sending a continous stream of route updates to reflect the change in ecmp. You can end up with a very very large backlog of contexts. This is bad because zebra can easily grow to a very very large memory size and on restricted systems you can run out of memory. Fortunately for us, the MetaQ already participates with this process by not doing more route processing until the dg_update_list goes below the working limit of dg_updates_per_cycle. Thus if FRR modifies the behavior of this loop to not move more contexts onto the input queue if either the input queue or output queue of the next provider has reached this limit. FRR will naturaly start auto handling backpressure for the dplane context system and memory will not go out of control. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-05 15:44:34 -04:00
Donald Sharp	34670c476a	zebra: Use the ctx queue counters The ctx queue data structures already have a counter associated with them. Let's just use them instead. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-05 15:44:34 -04:00
Donald Sharp	d97c535c1e	*: Create termtable specific temp memory When trying to track down a MTYPE_TMP memory leak it's harder to search for it when you happen to have some usage of ttable_dump. Let's just give it it's own memory type so that we can avoid confusion in the future. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-09-01 13:07:46 -04:00
Donald Sharp	0c72a78930	zebra: Allow for initial deny of installation of nhe's Currently the FRR code will receive both kernel and connected routes that do not actually have an underlying nexthop group at all. Zebra turns around and creates a `matching` nexthop hash entry and installs it. For connected routes, this will create 2 singleton nexthops in the dplane per interface (v4 and v6). For kernel routes it would just create 1 singleton nexthop that might be used or not. This is bad because the dplane has a limited amount of space available for nexthop entries and if you happen to have a large number of interfaces then all of a sudden you have 2x(# of interfaces) singleton nexthops. Let's modify the code to delay creation of these singleton nexthops until they have been used by something else in the system. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-30 08:23:48 -04:00
Jafar Al-Gharaibeh	90787a57fd	Merge pull request #16689 from donaldsharp/blackhole_and_afi Blackhole and afi	2024-08-29 22:13:03 -04:00
Donald Sharp	8ad5643abe	zebra: Convince SA that the ng will always be valid There is a code path that could theoretically get you to a point where the ng->nexthop is a NULL value. Let's just make sure the SA system believes that cannot happen anymore. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-29 18:10:30 -04:00
Donald Sharp	f90989d52a	zebra: Allow blackhole singleton nexthops to be v6 A blackhole nexthop, according to the linux kernel, can be v4 or v6. A v4 blackhole nexthop cannot be used on a v6 route, but a v6 blackhole nexthop can be used with a v4 route. Convert all blackhole singleton nexthops to v6 and just use that. Possibly reducing the number of active nexthops by 1. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-29 15:06:31 -04:00
Donald Sharp	c10cdcd79a	zebra: Display afi of the nexthop hash entry Let's display the afi of the nexthop hash entry. Right now it is impossible to tell the difference between v4 or v6 nexthops, especially since it is important for the kernel. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-29 14:49:36 -04:00
Jafar Al-Gharaibeh	ffaa365cc4	Merge pull request #16681 from donaldsharp/zebra_re_after_rn zebra: Move prefix lookup to outside re loop	2024-08-28 23:43:40 -04:00
Mark Stapp	8b23abf36e	Merge pull request #16300 from donaldsharp/local_connected Local connected	2024-08-28 14:10:14 -04:00
Donald Sharp	184dccca60	zebra: Move prefix lookup to outside re loop Move the prefix lookup/comparison to outside the re loop and into the rn loop, since that is where the code should actually be. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-28 13:18:00 -04:00
Russ White	5a6cb0bf75	Merge pull request #16103 from mjstapp/fix_5549_nhg_type zebra: be consistent about v6 nexthops for v4 routes	2024-08-27 09:46:53 -04:00
Donald Sharp	9bc0cd8241	zebra: Prevent accidental re memory leak in odd case There exists a path in rib_add_multipath where if a decision is made to not use the passed in re, we just drop the memory instead of freeing it. Let's free it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-27 06:25:34 -04:00
Donald Sharp	d528c02a20	zebra: Handle kernel routes appropriately Current code intentionally ignores kernel routes. Modify zebra to allow these routes to be read in on linux. Also modify zebra to look to see if a route should be treated as a connected and mark it as such. Additionally this should properly handle some of the issues being seen with NOPREFIXROUTE. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-27 06:25:34 -04:00
Donald Sharp	bdfccf69fa	zebra: Expose rib_update_handle_vrf_all This function will be used on interface down events to allow for kernel routes to be cleaned up. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-27 06:25:34 -04:00
Donald Sharp	f450e1cda4	zebra: Make p and src_p const for rib_delete The prefix'es p and src_p are not const. Let's make them so. Useful to signal that we will not change this data. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-27 06:25:34 -04:00
Mark Stapp	b4dae97381	Merge pull request #16609 from donaldsharp/singleton_no_weight Reduce the number of Singleton objects when using weight for NHG's	2024-08-23 16:19:29 -04:00
Khem Raj	0ef71391f0	zebra: Mimic GNU basename() API for non-glibc library e.g. musl musl only provides POSIX version of basename and it has also removed providing it via string.h header [1] which now results in compile errors with newer compilers e.g. clang-18 [1] https://git.musl-libc.org/cgit/musl/commit/?id=725e17ed6dff4d0cd22487bb64470881e86a92e7 Signed-off-by: Khem Raj <raj.khem@gmail.com>	2024-08-22 14:50:41 -07:00
Donald Sharp	c20fa972d6	zebra: Create Singleton nhg's without weights Currently FRR when it has two nexthop groups: A nexthop 1 weight 5 nexthop 2 weight 6 nexthop 3 weight 7 B nexthop 1 weight 3 nexthop 2 weight 4 nexthop 3 weight 5 We end up with 5 singleton nexthops and two groups: ID: 181818168 (sharp) RefCnt: 1 Uptime: 00:04:52 VRF: default Valid, Installed Depends: (69) (70) (71) via 192.168.119.1, enp13s0 (vrf default), weight 182 via 192.168.119.2, enp13s0 (vrf default), weight 218 via 192.168.119.3, enp13s0 (vrf default), weight 255 ID: 181818169 (sharp) RefCnt: 1 Uptime: 00:02:08 VRF: default Valid, Installed Depends: (71) (127) (128) via 192.168.119.1, enp13s0 (vrf default), weight 127 via 192.168.119.2, enp13s0 (vrf default), weight 170 via 192.168.119.3, enp13s0 (vrf default), weight 255 id 69 via 192.168.119.1 dev enp13s0 scope link proto 194 id 70 via 192.168.119.2 dev enp13s0 scope link proto 194 id 71 via 192.168.119.3 dev enp13s0 scope link proto 194 id 127 via 192.168.119.1 dev enp13s0 scope link proto 194 id 128 via 192.168.119.2 dev enp13s0 scope link proto 194 id 181818168 group 69,182/70,218/71,255 proto 194 id 181818169 group 71,255/127,127/128,170 proto 194 This is not a desirable state to be in. If you have a link flapping in the network and weights are changing rapidly you end up with a large number of singleton nexthops that are being used by the nexthop groups. This fills up asic space and clutters the table. Additionally singleton nexthops cannot have any weight and the fact that you attempt to create a singleton nexthop with different weights means nothing to the linux kernel( or any asic dplane ). Let's modify the code to always create the singleton nexthops without a weight and then just creating the NHG's that use the singletons with the appropriate weight. ID: 181818168 (sharp) RefCnt: 1 Uptime: 00:00:32 VRF: default Valid, Installed Depends: (22) (24) (28) via 192.168.119.1, enp13s0 (vrf default), weight 182 via 192.168.119.2, enp13s0 (vrf default), weight 218 via 192.168.119.3, enp13s0 (vrf default), weight 255 ID: 181818169 (sharp) RefCnt: 1 Uptime: 00:00:14 VRF: default Valid, Installed Depends: (22) (24) (28) via 192.168.119.1, enp13s0 (vrf default), weight 153 via 192.168.119.2, enp13s0 (vrf default), weight 204 via 192.168.119.3, enp13s0 (vrf default), weight 255 id 22 via 192.168.119.1 dev enp13s0 scope link proto 194 id 24 via 192.168.119.2 dev enp13s0 scope link proto 194 id 28 via 192.168.119.3 dev enp13s0 scope link proto 194 id 181818168 group 22,182/24,218/28,255 proto 194 id 181818169 group 22,153/24,204/28,255 proto 194 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-22 13:22:06 -04:00
Donald Sharp	b62de7de2e	zebra, tests: Connected and Local routes should have a weight of 1 All routes received by zebra from upper level protocols have a weight of 1. Let's just make everything extremely consistent in our code. Lot's of tests needed to be fixed up to make this work. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-21 10:17:47 -04:00
Donald Sharp	db87ab3d86	zebra: Ensure we cannot send invalid range to kernel The linux kernel adds 1 upon receipt of a weight, if you send a 255 it gets unhappy. Let's Limit range to 254 as that kernel does not like sending of 255. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-21 10:15:55 -04:00
Jafar Al-Gharaibeh	d9775c690c	Merge pull request #16597 from kprovost/master zebra: fix loading kernel routes without netlink	2024-08-19 13:02:36 -04:00
Donald Sharp	38b9df8e58	zebra: Do not assign time and then never use it Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-18 16:18:54 -04:00
Carmine Scarpitta	354f9e9009	zebra: Fix compilation warning Make sure the variable `srv6` is not NULL before dereferencing it. Fixes the following compilation warnings: ``` error 15-Aug-2024 14:20:01 zebra/zebra_srv6_vty.c:974:8: warning: Access to field 'locators' results in a dereference of a null pointer (loaded from variable 'srv6') [core.NullDereference] error 15-Aug-2024 14:20:01 for (ALL_LIST_ELEMENTS_RO(srv6->locators, node, locator)) { error 15-Aug-2024 14:20:01 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ error 15-Aug-2024 14:20:01 ./lib/linklist.h:345:11: note: expanded from macro 'ALL_LIST_ELEMENTS_RO' error 15-Aug-2024 14:20:01 (node) = listhead(list), ((data) = NULL); \ error 15-Aug-2024 14:20:01 ^~~~~~~~~~~~~~ error 15-Aug-2024 14:20:01 ./lib/linklist.h:63:22: note: expanded from macro 'listhead' error 15-Aug-2024 14:20:01 #define listhead(X) ((X) ? ((X)->head) : NULL) error 15-Aug-2024 14:20:01 ^ ``` Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-08-18 13:03:04 +02:00
Kristof Provost	f0aa8d4f42	zebra: fix loading kernel routs without netlink Commit `605df8d44` zebra: Use zebra dplane for RTM link and addr broke loading of kernel routes at startup for configurations without netlink. It rearranged the startup sequence in zebra_ns_enable() to pass via zebra_ns_startup_continue(), triggered through zebra_dplane_startup_stage() calls. However, it neglected to make these calls in the non-netlink code path. As a result zebra failed to load kernel routes at startup on platforms such as FreeBSD. Insert these calls so we run through all of the expected startup stages. Signed-off-by: Kristof Provost <kprovost@netgate.com>	2024-08-16 10:42:33 +02:00
Donald Sharp	baa0a1df5b	*: Fix spelling errors found Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-15 09:19:37 -04:00
Donald Sharp	90a50fd4af	Merge pull request #16576 from opensourcerouting/fix/free_opaque_data_of_route_entry zebra: Do not forget to free opaque data for route entry	2024-08-13 14:14:36 -04:00
Donatas Abraitis	1e288c9b55	zebra: Do not forget to free opaque data for route entry Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2024-08-13 18:00:30 +03:00
Donald Sharp	5a1b61aeba	zebra: Ensure non-equal id's are not same nhg's The function zebra_nhg_hash_equal is only used as a hash function for storage of NHG's and retrieval. If you have say two nhg's: 31 (25/26) 32 (25/26) This function would return them as being equal. Which of course leads to the problem when you attempt to hash_release 32 but release 31 from the hash. Then later when you attempt to do hash comparisons 32 has actually been freed leaving to use after free situations and shit goes down hill fast. This hash is only used as part of the hash comparison function for nexthop group storage. Since this is so let's always return the 31/32 nhg's are not equal at all. We possibly have a different problem where we are creating 31 and 32 ( when 31 should have just been used instead of 32 ) but we need to prevent any type of hash release problem at all. This supercedes any other issue( that should be tracked down on it's own ). Since you can have use after free situation that leads to a crash -vs- some possible nexthop group duplication which is very minor in comparison. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-08-10 19:49:35 -04:00
Mark Stapp	0221ed2917	zebra: be consistent about v6 nexthops for v4 routes Treat TYPE_IPV6 and TYPE_IPV6_IFINDEX nexthops the same way when processing v4 (RFC 5549) routes. Signed-off-by: Mark Stapp <mjs@cisco.com>	2024-08-06 08:17:47 -04:00
Donatas Abraitis	2419fc5104	bgpd: Show which AFI is affected when prefix-list is not found When the prefix-list is not found, show which AFI is the real one we are looking for. E.g.: looking at this output is not clear: ``` [RYF1Z-ZKDRS] route_match_address_prefix_list: Prefix List p1 specified does not exist defaulting to NO_MATCH ``` route_match_address_prefix_list() is called by route_match_ipv6_address_prefix_list(), and route_match_ip_address_prefix_list(). Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2024-07-31 13:20:59 +03:00
Donald Sharp	e53fa582bc	zebra: Fix removal of routes on MetaQ when client goes down It is possible that right before an upper level protocol dies or is killed routes would be installed into zebra. These routes could be on the Meta-Q for early route-processing. Leaving us with a situation where the client is removed, and all it's routes that are in the rib at that time, and then after that the MetaQ is run and the routes are reprocessed leaving routes from an upper level daemon post daemon going away from zebra's perspective. These routes will be abandoned. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-07-30 08:34:06 -04:00
mobash-rasool	3ddf5cf28f	Merge pull request #16041 from sri-mohan1/srib-24-zebra-d zebra: changes for code maintainability	2024-07-25 22:36:17 +05:30
Carmine Scarpitta	846bbcba0d	zebra: Remove duplicate `#include "zebra/interface.h"` Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-07-24 23:50:02 +02:00
Carmine Scarpitta	e2cb3ab5c6	zebra: Remove duplicate `#include "zebra/interface.h"` Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-07-24 23:41:29 +02:00
Carmine Scarpitta	bcf7bc1ce8	zebra: Remove duplicate `#include "zebra/debug.h"` Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-07-24 23:41:09 +02:00
Carmine Scarpitta	8b206b0cd7	zebra: Remove duplicate `#include <stdio.h>` Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-07-24 23:39:34 +02:00
Carmine Scarpitta	4ca8332922	zebra: Remove duplicate `#include <string.h>` Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-07-24 23:39:14 +02:00
Carmine Scarpitta	22aa0ffb81	zebra: Remove duplicate `#include <stdlib.h>` Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-07-24 23:38:55 +02:00
Carmine Scarpitta	c432aa0bb4	zebra: Remove duplicate `#include <arpa/inet.h>` Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-07-24 23:38:27 +02:00
Carmine Scarpitta	0a68626e48	zebra: Remove duplicate `#include <netinet/in.h>` Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-07-24 23:38:04 +02:00
Russ White	aa9d66e922	Merge pull request #16410 from pguibert6WIND/show_zebra_dplane_nexthop zebra: add nexthop counter to 'show zebra dplane' command	2024-07-23 16:42:41 -04:00
Russ White	9db9128c27	Merge pull request #16330 from donaldsharp/nhg_going_down zebra: Properly note that a nhg's nexthop has gone down	2024-07-23 10:49:49 -04:00
Donatas Abraitis	8eb78b2d89	Merge pull request #16341 from crosser/preserve-needed-rmacs zebra: evpn: not coerce VTEP IP to IPv4 in nh_list	2024-07-22 02:53:26 -07:00
Philippe Guibert	e36e570c6c	zebra: add nexthop counter to 'show zebra dplane' command The nexthop updates counter value was never displayed. Add it. > # show zebra dplane > Zebra dataplane: > Route updates: 7673010 > Route update errors: 0 > Nexthop updates: 1100 > Nexthop update errors: 0 > [..] Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2024-07-17 11:58:22 +02:00
sri-mohan1	c2f2dde5c1	zebra: changes for code maintainability these changes are for improving the code maintainability and readability Signed-off-by: sri-mohan1 <sri.mohan@samsung.com>	2024-07-15 19:13:47 +05:30
Donatas Abraitis	659741f3fa	Merge pull request #16350 from raja-rajasekar/rajasekarr/table_id_for_2vrf_3970414 zebra: Fix to avoid two Vrfs with same table ids	2024-07-14 03:13:17 +03:00
Rajasekar Raja	c77e15710d	zebra: Fix to avoid two Vrfs with same table ids During internal testing, when the following sequence is followed, two non default vrfs end up pointing to the same table-id - Initially vrf201 has table id 1002 - ip link add dev vrf202 type vrf table 1002 - ip link set dev vrf202 up - ip link set dev <intrerface> master vrf202 This will ideally lead to zebra exit since this is a misconfiguration as expected. However if we perform a restart frr.service at this point, we end up having two vrfs pointing to same table-id and bad things can happen. This is because in the interface_vrf_change, we incorrectly check for vrf_lookup_by_id() to evaluate if there is a misconfig. This works well for a non restart case but not for the startup case. root@mlx-3700-20:mgmt:/var/log/frr# sudo vtysh -c "sh vrf" vrf mgmt id 37 table 1001 vrf vrf201 id 46 table 1002 vrf vrf202 id 59 table 1002 >>>> Fix: in all cases of misconfiguration, exit zebra as expected. Ticket :#3970414 Signed-off-by: Donald Sharp <sharpd@nvidia.com> Signed-off-by: Rajasekar Raja <rajasekarr@nvidia.com>	2024-07-12 14:25:33 -07:00
Philippe Guibert	731f74e35f	zebra, topotests: do not set nexthop's FIB flag when DUPLICATE present The bgp_duplicate_nexthop test installs routes with nexthop's flags set to both DUPLICATE and FIB: this should not happen. The DUPLICATE flag of a nexthop indicates this nexthop is already used in the same nexthop-group, and there is no need to install it twice in the system; having the FIB flag set indicates that the nexthop is installed in the system. This is why both flags should not be set on the same nexthop. This case happens at installation time, but can also happen at update time. - Fix this by not setting the FIB flag value when the DUPLICATE flag is present. - Modify the bgp_duplicate_test to check that the FIB flag is not present on duplicated nexthops. - Modify the bgp_peer_type_multipath_relax test. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2024-07-08 15:42:02 +02:00
Eugene Crosser	f883c7119e	zebra: evpn: not coerce VTEP IP to IPv4 in nh_list In L3 BGP-EVPN, if there are both IPv4 and IPv6 routes in the VPN, zebra maintains two instances of `struct zebra_neigh` object: one with IPv4 address of the nexthop, and another with IPv6 address that is an IPv4 mapped to IPv6, but only one intance of `struct zebra_mac` object, that contains a list of nexthop addresses that use this mac. The code in `zebra_vxlan` module uses the fact that the list is empty as the indication that the `zebra_mac` object is unused, and needs to be dropped. However, preexisting code used nexthop address converted to IPv4 notation for the element of this list. As a result, when two `zebra_neigh` objects, one IPv4 and one IPv6-mapped-IPv4 were linked to the `zebra_mac` object, only one element was added to the list. Consequently, when one of the two `zebra_neigh` objects was dropped, the only element in the list was removed, making it empty, and `zebra_mac` object was dropped, and neigbrour cache elements uninstalled from the kernel. As a result, after the last route in _one_ family was removed from a remote vtep, all remaining routes in the _other_ family became unreachable, because RMAC of the vtep was removed. This commit makes `zebra_mac` use uncoerced IP address of the `zebra_neigh` object for the entries in the `nh_list`. This way, `zebra_mac` object no longer loses track of `zebra_neigh` objects that need it. Bug-URL: https://github.com/FRRouting/frr/issues/16340 Signed-off-by: Eugene Crosser <crosser@average.org>	2024-07-08 10:32:03 +02:00
Donald Sharp	266b061994	zebra: Properly note that a nhg's nexthop has gone down Current code when a link is set down is to just mark the nexthop group as not properly setup. Leaving situations where when an interface goes down and show output is entered we see incorrect state. This is true for anything that would be checking those flags at that point in time. Modify the interface down nexthop group code to notice the nexthops appropriately ( and I mean set the appropriate flags ) and to allow a `show ip route` command to actually display what is going on with the nexthops. eva# show ip route 1.0.0.0 Routing entry for 1.0.0.0/32 Known via "sharp", distance 150, metric 0, best Last update 00:00:06 ago * 192.168.44.33, via dummy1, weight 1 * 192.168.45.33, via dummy2, weight 1 sharpd@eva:~/frr1$ sudo ip link set dummy2 down eva# show ip route 1.0.0.0 Routing entry for 1.0.0.0/32 Known via "sharp", distance 150, metric 0, best Last update 00:00:12 ago * 192.168.44.33, via dummy1, weight 1 192.168.45.33, via dummy2 inactive, weight 1 Notice now that the 1.0.0.0/32 route now correctly displays the route for the nexthop group entry. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-07-03 09:34:55 -04:00
vivek	b5682ffbf0	*: Add and use option for graceful (re)start Add a new start option "-K" to libfrr to denote a graceful start, and use it in zebra and bgpd. zebra will use this option to denote a planned FRR graceful restart (supporting only bgpd currently) to wait for a route sync completion from bgpd before cleaning up old stale routes from the FIB. An optional timer provides an upper-bounds for this cleanup. bgpd will use this option to denote either a planned FRR graceful restart or a bgpd-only graceful restart, and this will drive the BGP GR restarting router procedures. Signed-off-by: Vivek Venkatraman <vivek@nvidia.com>	2024-07-01 13:02:52 -07:00
Donatas Abraitis	e5faf01dcc	Merge pull request #16270 from cscarpitta/fix/fix-coverity-issues zebra: Fix coverity issues	2024-06-25 08:53:13 +03:00
Donatas Abraitis	67421ba30d	Merge pull request #16261 from chiragshah6/evpn_dev4 zebra: clear evpn dup-addr return error-msg when there is no vni	2024-06-25 08:07:17 +03:00
Carmine Scarpitta	df97a9d133	zebra: Fix NULL pointer dereference The `locator` pointer is dereferenced before ensuring it is not NULL. Fix the issue by checking that the pointer is not NULL before dereferencing it. Fixes 1594013 CID 1594013: Null pointer dereferences (REVERSE_INULL) /zebra/zebra_srv6.c: 961 in zebra_srv6_sid_compose() ________________________________________________________________________________________________________ * CID 1594013: Null pointer dereferences (REVERSE_INULL) /zebra/zebra_srv6.c: 961 in zebra_srv6_sid_compose() 955 struct srv6_locator locator, 956 uint32_t sid_func) 957 { 958 uint8_t offset, func_len; 959 struct srv6_sid_format format = locator->sid_format; 960 CID 1594013: Null pointer dereferences (REVERSE_INULL) Null-checking "locator" suggests that it may be null, but it has already been dereferenced on all paths leading to the check. 961 if (!sid_value \|\| !locator) 962 return false; 963 964 if (format) { 965 offset = format->block_len + format->node_len; 966 func_len = format->function_len; Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-24 10:44:14 +02:00
Carmine Scarpitta	375a02d2a3	zebra: Fix wrong variable used in `for` loop The `for` loop starting at line 1848 searches the `func_allocated` array for a pointer that points to a specific `sid_wide_func` element. The loop should iterate over all the elements of the `func_allocated` array and dereference each element to see if it is the one we are looking for. Currently, the loop is using the wrong variable to iterate over the array. Let's fix this issue by using the correct variable in the loop. Fixes CID 1594014 Fixes CID 1594016 CID 1594014: Null pointer dereferences (FORWARD_NULL) /zebra/zebra_srv6.c: 1860 in release_srv6_sid_func_explicit() ________________________________________________________________________________________________________ * CID 1594014: Null pointer dereferences (FORWARD_NULL) /zebra/zebra_srv6.c: 1860 in release_srv6_sid_func_explicit() 1854 1855 /* Lookup SID function in the functions allocated list of EWLIB range / 1856 for (ALL_LIST_ELEMENTS_RO(block->u.usid 1857 .wide_lib[sid_func] 1858 .func_allocated, 1859 node, sid_func_ptr)) CID 1594014: Null pointer dereferences (FORWARD_NULL) Dereferencing null pointer "sid_wide_func_ptr". 1860 if (sid_wide_func_ptr == sid_wide_func) 1861 break; 1862 1863 /* Ensure that the SID function is allocated / 1864 if (!sid_wide_func_ptr) { 1865 zlog_warn("%s: failed to release wide SID function %u, function is not allocated", * CID 1594016: Possible Control flow issues (DEADCODE) /zebra/zebra_srv6.c: 1871 in release_srv6_sid_func_explicit() ________________________________________________________________________________________________________ *** CID 1594016: Possible Control flow issues (DEADCODE) /zebra/zebra_srv6.c: 1871 in release_srv6_sid_func_explicit() 1865 zlog_warn("%s: failed to release wide SID function %u, function is not allocated", 1866 __func__, sid_wide_func); 1867 return -1; 1868 } 1869 1870 /* Release the SID function from the EWLIB range */ CID 1594016: Possible Control flow issues (DEADCODE) Execution cannot reach this statement: "listnode_delete(block->u.us...". 1871 listnode_delete(block->u.usid.wide_lib[sid_func] 1872 .func_allocated, 1873 sid_wide_func_ptr); 1874 zebra_srv6_sid_func_free(sid_wide_func_ptr); 1875 } else { 1876 zlog_warn("%s: function %u is outside ELIB [%u/%u] and EWLIB alloc ranges [%u/%u]", Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-24 10:44:14 +02:00
Carmine Scarpitta	fdb89ab51a	zebra: Remove dead SRv6 code At line 1736, `alloc_mode` is set to `SRV6_SID_ALLOC_MODE_EXPLICIT` or `SRV6_SID_ALLOC_MODE_DYNAMIC` depending on the `sid_value` variable. There will never be a case where alloc_mode will be `SRV6_SID_ALLOC_MODE_MAX` or `SRV6_SID_ALLOC_MODE_UNSPEC`. Let's replace the `switch(alloc_mode) {...}` with an if-else. Fixes CID 1594015. CID 1594015: (DEADCODE) /zebra/zebra_srv6.c: 1782 in get_srv6_sid() /zebra/zebra_srv6.c: 1781 in get_srv6_sid() ________________________________________________________________________________________________________ * CID 1594015: (DEADCODE) /zebra/zebra_srv6.c: 1782 in get_srv6_sid() 1776 } 1777 1778 ret = get_srv6_sid_dynamic(sid, ctx, locator); 1779 1780 break; 1781 case SRV6_SID_ALLOC_MODE_MAX: CID 1594015: (DEADCODE) Execution cannot reach this statement: "case SRV6_SID_ALLOC_MODE_UN...". 1782 case SRV6_SID_ALLOC_MODE_UNSPEC: 1783 default: 1784 flog_err(EC_ZEBRA_SM_CANNOT_ASSIGN_SID, 1785 "%s: SRv6 Manager: Unrecognized alloc mode %u", 1786 __func__, alloc_mode); 1787 /* We should never arrive here */ /zebra/zebra_srv6.c: 1781 in get_srv6_sid() 1775 return -1; 1776 } 1777 1778 ret = get_srv6_sid_dynamic(sid, ctx, locator); 1779 1780 break; CID 1594015: (DEADCODE) Execution cannot reach this statement: "case SRV6_SID_ALLOC_MODE_MAX:". 1781 case SRV6_SID_ALLOC_MODE_MAX: 1782 case SRV6_SID_ALLOC_MODE_UNSPEC: 1783 default: 1784 flog_err(EC_ZEBRA_SM_CANNOT_ASSIGN_SID, 1785 "%s: SRv6 Manager: Unrecognized alloc mode %u", 1786 __func__, alloc_mode); Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-24 10:43:00 +02:00
Sindhu Parvathi Gopinathan	56c16ee529	zebra: clear evpn dup-addr return error-msg when there is no vni clear evpn dup-addr cli returns error-msg for below conditions, - If evpn is not enabled & - If there is no VNI exists. supported command: ``` clear evpn dup-addr vni <vni-id> ``` Ticket: #3495573 Testing: bharat# clear evpn dup-addr vni all Error type: validation Error description: % EVPN not enabled bharat# clear evpn dup-addr vni 20 Error type: validation Error description: % VNI 20 does not exist Signed-off-by: Sindhu Parvathi Gopinathan's <sgopinathan@nvidia.com> Signed-off-by: Chirag Shah <chirag@nvidia.com>	2024-06-20 15:05:37 -07:00
Chirag Shah	e4d843b438	zebra: fix evpn mh bond member proto reinstall In case of EVPN MH bond, a member port going in protodown state due to external reason (one case being linkflap), frr updates the state correctly but upon manually clearing external reason trigger FRR to reinstate protodown without any reason code. Fix is to ensure if the protodown reason was external and new state is to have protodown 'off' then do no reinstate protodown. Ticket: #3947432 Testing: switch:#ip link show swp1 4: swp1: <NO-CARRIER,BROADCAST,MULTICAST,SLAVE,UP> mtu 9216 qdisc pfifo_fast master bond1 state DOWN mode DEFAULT group default qlen 1000 link/ether 1c:34:da:2c:aa:68 brd ff:ff:ff:ff:ff:ff protodown on protodown_reason <linkflap> switch:#ip link set swp1 protodown off protodown_reason linkflap off switch:#ip link show swp1 4: swp1: <NO-CARRIER,BROADCAST,MULTICAST,SLAVE,UP> mtu 9216 qdisc pfifo_fast master bond1 state DOWN mode DEFAULT group default qlen 1000 link/ether 1c:34:da:2c:aa:68 brd ff:ff:ff:ff:ff:ff Signed-off-by: Chirag Shah <chirag@nvidia.com>	2024-06-20 09:09:21 -07:00
Donatas Abraitis	64112ed9e6	Merge pull request #16224 from donaldsharp/zebra_dplane_event_loop_starvation zebra: Prevent starvation in dplane_thread_loop	2024-06-19 10:22:36 +03:00
Russ White	7834c38ab3	Merge pull request #15604 from cscarpitta/srv6-sid-manager Add support for SRv6 SID Manager	2024-06-18 12:16:24 -04:00
Donatas Abraitis	dbf83cfd36	zebra: Set the weight for non-recursive next-hop If using weighted ECMP, the weight for non-recursive next-hop should be inherited from recursive next-hop. Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2024-06-18 12:15:33 +03:00
Philippe Guibert	36a310cc9f	zebra, lib: add locator name in sid notify messages In the near future, some daemons may only register SIDs. This may be the case for the pathd daemon when creating SRv6 binding SIDs. When a locator is getting deleted at ZEBRA level, the daemon may have an easy way to find out the SIds to unregister to. This commit proposes to add the locator name to the SID_SRV6_NOTIFY message whenever possible. Only case when an allocation failure happens, the locator will not be present. In all other places, the notify API at procol levels has the locator name extra-parameter. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com> Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-17 17:09:45 +02:00
Donald Sharp	6faad863f3	zebra: Prevent starvation in dplane_thread_loop When removing a large number of routes, the linux kernel can take the cpu for an extended amount of time, leaving a situation where FRR detects a starvation event. r1# sharp install routes 10.0.0.0 nexthop 192.168.44.33 1000000 repeat 10 2024-06-14 12:55:49.365 [NTFY] sharpd: [M7Q4P-46WDR] vty[5]@# sharp install routes 10.0.0.0 nexthop 192.168.44.33 1000000 repeat 10 2024-06-14 12:55:49.365 [DEBG] sharpd: [YP4TQ-01TYK] Inserting 1000000 routes 2024-06-14 12:55:57.256 [DEBG] sharpd: [TPHKD-3NYSB] Installed All Items 7.890085 2024-06-14 12:55:57.256 [DEBG] sharpd: [YJ486-NX5R1] Removing 1000000 routes 2024-06-14 12:56:07.802 [WARN] zebra: [QH9AB-Y4XMZ][EC 100663314] STARVATION: task dplane_thread_loop (634377bc8f9e) ran for 7078ms (cpu time 220ms) 2024-06-14 12:56:25.039 [DEBG] sharpd: [WTN53-GK9Y5] Removed all Items 27.783668 2024-06-14 12:56:25.039 [DEBG] sharpd: [YP4TQ-01TYK] Inserting 1000000 routes 2024-06-14 12:56:32.783 [DEBG] sharpd: [TPHKD-3NYSB] Installed All Items 7.743524 2024-06-14 12:56:32.783 [DEBG] sharpd: [YJ486-NX5R1] Removing 1000000 routes 2024-06-14 12:56:41.447 [WARN] zebra: [QH9AB-Y4XMZ][EC 100663314] STARVATION: task dplane_thread_loop (634377bc8f9e) ran for 5175ms (cpu time 179ms) Let's modify the loop in dplane_thread_loop such that after a provider has been run, check to see if the event should yield, if so, stop and reschedule this for the future. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-06-14 13:36:51 -04:00
Donald Sharp	a23a9385a7	zebra: Use built in data structure counter Instead of keeping a counter that is independent of the queue's data structure. Just use the queue's built-in counter. Ensure that it's pthread safe by keeping it wrapped inside the mutex for adding/deleting to the queue. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-06-13 15:30:00 -04:00
Philippe Guibert	209223dae2	zebra: fix sid allocation should be different with 2 isis instances With 2 ISIS SRv6 instances, the second ISIS instance always gets the same End SID as the first one. > router isis 1 > segment-routing srv6 > locator loc1 > exit > exit > router isis 2 > segment-routing srv6 > locator loc2 > end > segment-routing > srv6 > locators > locator loc1 > prefix 2001::1/64 > exit > locator loc2 > prefix 3001::1/64 > output: > 2024/06/11 17:30:15 ISIS: [N6PCR-FQ5ZA] SRv6 locator (locator loc1, prefix 2001::1/64) set for IS-IS area 1 > 2024/06/11 17:30:15 ISIS: [V4RBG-TYW5S] Requesting SRv6 SIDs for IS-IS area 1 > 2024/06/11 17:30:15 ISIS: [ZRHYM-6RMYK] isis_zebra_srv6_sid_notify: received SRv6 SID notify: ctx End USP sid_value 2001::1 sid_func 0 note ZAPI_SRV6_SID_ALLOCATED > [..] > 2024/06/11 17:36:49 ISIS: [N6PCR-FQ5ZA] SRv6 locator (locator loc2, prefix 3001::1/64) set for IS-IS area 2 > 2024/06/11 17:36:49 ISIS: [V4RBG-TYW5S] Requesting SRv6 SIDs for IS-IS area 2 > 2024/06/11 17:36:49 ISIS: [ZRHYM-6RMYK] isis_zebra_srv6_sid_notify: received SRv6 SID notify: ctx End USP sid_value 2001::1 sid_func 0 note ZAPI_SRV6_SID_ALLOCATED Actually, at the second request, ZEBRA always gives an existing dynamic SID of the first available locator, because the locator name is never checked. > 2024/06/11 17:36:49 ZEBRA: [XMBTQ-GE6EY] get_srv6_sid: received SRv6 SID alloc request: SID ctx End USP ((null)), mode=dynamic > 2024/06/11 17:36:49 ZEBRA: [R61Q3-QWR23] get_srv6_sid_dynamic: returning existing SID End USP 2001::1 > 2024/06/11 17:36:49 ZEBRA: [J1GMY-B6CAK] srv6_manager_get_sid_internal: got existing SRv6 SID for ctx End USP: sid_value=2001::1 (func=0) (proto=9, instance=0, sessionId=0), notify client Fix this by checking the locator of the existing SID. Fixes: b771bf8ce687 ("zebra: Add functions to alloc/release SRv6 SIDs") Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2024-06-13 14:54:17 +02:00
Philippe Guibert	35a87b6acd	zebra: fix display explicit sid refused > 2024/06/07 15:09:43 ZEBRA: [MZYPC-GBDGR] srv6_manager_get_sid_internal: getting SRv6 SID for ctx End.DT4 vrf vrf1, sid_value=1003::4, locator_name= > [..] > 2024/06/07 15:09:43 ZEBRA: [QGJBT-YJ11W] zsend_srv6_sid_notify: notifying ZEBRA_SRV6_SID_NOTIFY ctx End.DT4 vrf vrf2, sid (null) note ZAPI_SRV6_SID_FAIL_ALLOC (proto=30, instance=0, sessionId=0) Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2024-06-13 14:54:16 +02:00
Philippe Guibert	708f20e58a	zebra: fix display srv6 address only for explicit-sid Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2024-06-13 14:54:16 +02:00
Carmine Scarpitta	5365c56614	zebra: Notify all daemons about locator delete Currently, when a locator is deleted in zebra, zebra notifies only the zclient that owns the locator. With the introduction of SID Manager, the locator is no longer owned by any client. Instead, the locator is owned by Zebra, and clients can allocate and release SIDs from the locator using the ZAPI ZEBRA_SRV6_MANAGER_GET_SID and ZEBRA_SRV6_MANAGER_RELEASE_SID. Therefore, when a locator is removed in Zebra, we need to notify all daemons so that they can release/uninstall the SIDs allocated by that locator. Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-13 14:54:16 +02:00
Carmine Scarpitta	efa830e89c	zebra: Notify daemons about SIDs Send asynchronous notifications to zclients when an SRv6 SID is allocated/released and when a SID alloc/release operation fails. Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-13 14:54:16 +02:00
Carmine Scarpitta	84dd482cb9	zebra: Alloc/Release SIDs to daemons upon request Previous commits introduced two new ZAPI operations, `ZEBRA_SRV6_MANAGER_GET_SRV6_SID` and `ZEBRA_SRV6_MANAGER_RELEASE_SRV6_SID`. These operations allow a daemon to interact with the SRv6 SID Manager to get and release an SRv6 SID, respectively. This commit extends the SID Manager by adding logic to process the requests `ZEBRA_SRV6_MANAGER_GET_SRV6_SID` and `ZEBRA_SRV6_MANAGER_RELEASE_SRV6_SID`, and allocate/release SIDs to requesting daemons. Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-13 14:54:16 +02:00
Carmine Scarpitta	c570d2bcae	zebra: Add functions to alloc/release SRv6 SIDs Add functions to allocate/release SRv6 SIDs. SIDs can be allocated either explicitly (allocate a specific SID) or dynamically (allocate any available SID). Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-13 14:54:16 +02:00
Carmine Scarpitta	f8da4a29e5	zebra: Repond to `GET_LOCATOR` ZAPI request The previous commits introduced a new operation, `ZEBRA_SRV6_MANAGER_GET_LOCATOR`, allowing a daemon to request information about a specific SRv6 locator from the SRv6 SID Manager. This commit extends the SID Manager to respond to a `ZEBRA_SRV6_MANAGER_GET_LOCATOR` request and provide the requested locator information. Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-13 14:54:16 +02:00
Carmine Scarpitta	8b3f378546	zebra: Add support for SRv6 SIDs Add a data structure to represent an SRv6 SID context and the related management functions (allocate/free). Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-13 14:54:16 +02:00
Carmine Scarpitta	779d4c2702	zebra: CLI to specify format of an SRv6 locator Add the CLI to choose the SID format of a locator. When the SID format of a locator is changed, the SIDs allocated from that locator might no longer be valid (for example, because the new format might involve a different SID allocation schema). In such a case, it is necessary to notify all the zclients so that they can withdraw/uninstall the old SIDs that use the previous format and allocate/install/advertise the new SIDs based on the new format. Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-13 14:54:16 +02:00
Carmine Scarpitta	bf10e4437e	zebra: Add support for SRv6 SID blocks An SRv6 block is an IPv6 prefix from which SIDs are allocated. This commit adds support for SRv6 SID blocks. Specifically, it adds a data structure to store information about an SRv6 block (e.g., its occupancy status, which SIDs have been allocated and which are available, which SID format is used for that block, etc.). It also adds some functions to manage the block (allocate / free / lookup). These functions will be used in the next commits to support the allocation of SIDs from a block in the SID Manager. Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-13 14:54:16 +02:00
Carmine Scarpitta	51d3cd8a00	zebra: CLI to override default SID format config Add CLI commands to support overriding default configuration of the SID format. Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-13 14:54:16 +02:00
Carmine Scarpitta	35f4bedfa3	lib: Add CLI nodes to support SRv6 SID format Add CLI commands to support overriding default configuration of the SID format. Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-13 14:54:16 +02:00
Carmine Scarpitta	1298867671	zebra: Add support for SRv6 SID formats Add functionalities to manage SRv6 SID formats (register / unregister / lookup) and create two SID formats upon SRv6 Manager initialization: `uncompressed-f4024` and `usid-f3216`. In future commits, we will add the CLI to allow the user to choose between the two formats. Signed-off-by: Carmine Scarpitta <cscarpit@cisco.com>	2024-06-13 14:54:16 +02:00
Donatas Abraitis	19c3e0eca6	Merge pull request #16093 from louis-6wind/fix-show-route-memory zebra: fix Out Of Memory issue when displaying large route tables in JSON	2024-06-10 11:26:23 +03:00
Donald Sharp	2a00a648f1	Merge pull request #15900 from mikemallin/v6-vtep-lib-upstream lib, bgpd, tests, zebra: prefix_sg changes for V6 VTEP	2024-06-07 14:34:11 -04:00
Dmytro Shytyi	f62ee81d1e	zebra: return void zebra_mpls_lsp_install zebra_mpls_lsp_install() returned integer is never checked. Return void instead. Signed-off-by: Dmytro Shytyi <dmytro.shytyi@6wind.com>	2024-06-07 12:33:48 +02:00
Philippe Guibert	229da537e6	zebra: update nhlfe when the outgoing labels differ Because the nhlfe label stack may contain more than one label, ensure to copy all labels. Co-developed-by: Dmytro Shytyi <dmytro.shytyi@6wind.com> Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com> Signed-off-by: Dmytro Shytyi <dmytro.shytyi@6wind.com>	2024-06-07 12:33:48 +02:00
Philippe Guibert	e9c7b5dee2	zebra: the header containing the re->flags is in zclient.h Change the comment in the code that refers to ZEBRA_FLAG_XXX defines. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com> Signed-off-by: Dmytro Shytyi <dmytro.shytyi@6wind.com>	2024-06-07 12:33:48 +02:00
Philippe Guibert	9f88ed98a4	zebra: relax mpls entries installation with labeled unicast entries Until now, when a FEC entry is added in zebra, driven by the reception of a BGP labeled unicast update, an LSP entry is created. That LSP entry is resolved by using the route entry which is also installed by BGP labeled unicast. This route entry is not available when we face with i-bgp peering session. I-BGP labeled sessions are used to establish IP connectivity across separate IGPs. The below dumps illustrate a 3 IGP topology. An attempt to create connectivity between the north and the south machines is done. The 3 separate IGPs are configured in Segment routing: - north-east - east-west - west-south We create BGP peerings between each endpoint of each IGP: - iBGP between (north) and (east) - iBGP between (east) and (west) - iBGP between (west) and (south) Before that patch, the FEC entries could not be resolved on the east machine: Before: east-vm# show mpls fec 192.0.2.1/32 Label: 18 Client list: bgp(fd 48) 192.0.2.5/32 Label: 17 Client list: bgp(fd 48) 192.0.2.7/32 Label: 19 Client list: bgp(fd 48) east-vm# show mpls table Inbound Label Type Nexthop Outbound Label -------------------------------------------------------- 1011 SR (OSPF) 192.168.2.2 1011 1022 SR (OSPF) 192.168.2.2 implicit-null 11044 SR (IS-IS) 192.168.3.4 implicit-null 11055 SR (IS-IS) 192.168.3.4 11055 30000 SR (OSPF) 192.168.2.2 implicit-null 30001 SR (OSPF) 192.168.2.2 implicit-null 36000 SR (IS-IS) 192.168.3.4 implicit-null east-vm# show ip route [..] B 192.0.2.1/32 [200/0] via 192.0.2.1 inactive, label implicit-null, weight 1, 00:17:45 O>* 192.0.2.1/32 [110/20] via 192.168.2.2, r3-eth0, label 1011, weight 1, 00:17:47 O>* 192.0.2.2/32 [110/10] via 192.168.2.2, r3-eth0, label implicit-null, weight 1, 00:17:47 O 192.0.2.3/32 [110/0] is directly connected, lo, weight 1, 00:17:57 C>* 192.0.2.3/32 is directly connected, lo, 00:18:03 I>* 192.0.2.4/32 [115/20] via 192.168.3.4, r3-eth1, label implicit-null, weight 1, 00:17:59 B 192.0.2.5/32 [200/0] via 192.0.2.5 inactive, label implicit-null, weight 1, 00:17:56 I>* 192.0.2.5/32 [115/30] via 192.168.3.4, r3-eth1, label 11055, weight 1, 00:17:58 B> 192.0.2.7/32 [200/0] via 192.0.2.5 (recursive), label 19, weight 1, 00:17:45 * via 192.168.3.4, r3-eth1, label 11055/19, weight 1, 00:17:45 [..] After command "mpls fec nexthop-resolution" was applied, the FEC entries will resolve over any non BGP route that has a labeled path selected. east-vm# show mpls table Inbound Label Type Nexthop Outbound Label -------------------------------------------------------- 17 SR (IS-IS) 192.168.3.4 11055 18 SR (OSPF) 192.168.2.2 1011 19 BGP 192.168.3.4 11055/19 1011 SR (OSPF) 192.168.2.2 1011 1022 SR (OSPF) 192.168.2.2 implicit-null 11044 SR (IS-IS) 192.168.3.4 implicit-null 11055 SR (IS-IS) 192.168.3.4 11055 30000 SR (OSPF) 192.168.2.2 implicit-null 30001 SR (OSPF) 192.168.2.2 implicit-null 36000 SR (IS-IS) 192.168.3.4 implicit-null Co-developed-by: Dmytro Shytyi <dmytro.shytyi@6wind.com> Signed-off-by: Dmytro Shytyi <dmytro.shytyi@6wind.com> Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2024-06-07 12:33:48 +02:00
Dmytro Shytyi	110cb369a3	zebra: when adding an LSP, update the log message with multiple label When an LSP entry is created from a FEC entry, multiple labels may now be appended to the LSP entry, instead of one single. Upon lsp creation, the LSP trace will display all the labels appended. Signed-off-by: Dmytro Shytyi <dmytro.shytyi@6wind.com>	2024-06-07 12:33:47 +02:00
Philippe Guibert	cc25d7bd92	zebra, lib: upon lsp install, iterate nexthop accordingly There are two ways of iterating over nexthops of a given route entry. - Either only the main nexthop are taken into account (which is the case today when attempting to install an LSP entry on a BGP connected labeled route. - Or by taking into account nexthops that are resolved and linked in nexthop->resolved of the previous nexthop which has RECURSIVE flag set. This second case has to be taken into account in the case where recursive routes may be used to install an LSP entry. Introduce a new API in nexthop that will parse over the appropriate nexthop, if the nexthop-resolution flag is turned on or not on the given VRF. Use that API in the lsp_install() function so as to walk over the appropriate nexthops. Co-developed-by: Dmytro Shytyi <dmytro.shytyi@6wind.com> Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com> Signed-off-by: Dmytro Shytyi <dmytro.shytyi@6wind.com>	2024-06-07 12:33:47 +02:00
Philippe Guibert	eb10a69766	zebra: handle nexthop-resolution updates Upon reconfiguring nexthop-resolution updates, update the lsp entries accordingly. If fec nexthop-resolution becomes true, then call again fec_change_update_lsp() for each fec entry available. If fec nexthop-resolution becomes false, then call again fec_change_update_lsp() for each fec entry available, and if the update fails, uninstall any lsp related with the fec entry. In the case lsp_install() and no lsp entry could be created or updated, then consider this call as a failure, and return -1. Co-developed-by: Dmytro Shytyi <dmytro.shytyi@6wind.com> Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com> Signed-off-by: Dmytro Shytyi <dmytro.shytyi@6wind.com>	2024-06-07 12:33:47 +02:00
Dmytro Shytyi	d875dc37ee	zebra: add 'mpls fec nexthop-resolution' command to vtysh Commands added: r3# configure r3(config)# mpls fec MPLS FEC table label Label configuration ldp Label Distribution Protocol lsp Establish label switched path r3(config)# mpls fec mpls fec nexthop-resolution Authorise nexthop resolution over all labeled routes. r3(config)# mpls fec mpls fec nexthop-resolution r3# configure r3(config)# vrf default r3(config-vrf)# mpls fec MPLS FEC table r3(config-vrf)# mpls fec mpls fec nexthop-resolution Authorise nexthop resolution over all labeled routes. r3(config-vrf)# mpls fec nexthop-resolution east-vm# show running-config Building configuration... ... ! mpls fec nexthop-resolution ! ... Signed-off-by: Dmytro Shytyi <dmytro.shytyi@6wind.com>	2024-06-07 12:33:47 +02:00
Louis Scalbert	85eb60ffd6	zebra: fix show route memory consumption When displaying a route table in JSON, a table JSON object is storing all the prefix JSON objects containing the prefix information. This results in excessive memory allocation for JSON objects, potentially leading to an out-of-memory error on the machine with large routing tables. To Fix the memory consumption issue for the "show ip[v6] route [vrf XX] json" command, display the prefixes one by one and free the memory of each JSON object after it has been displayed. Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>	2024-06-07 10:13:32 +02:00
Louis Scalbert	cb440058f2	zebra: fix show route vrf all memory consumption `0e2fc3d67f` ("vtysh, zebra: Fix malformed json output for multiple vrfs in command 'show ip route vrf all json'") has been reverted in the previous commit. Although the fix was correct, it was consuming too muca memory when displaying large route tables. A root JSON object was storing all the JSON objects containing the route tables, each containing their respective prefixes in JSON objects. This resulted in excessive memory allocation for JSON objects, potentially leading to an out-of-memory error on the machine. To Fix the memory consumption issue for the "show ip[v6] route vrf all json" command, display the tables one by one and free the memory of each JSON object after it has been displayed. Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>	2024-06-07 10:13:32 +02:00
Louis Scalbert	0c6f14ec14	Revert "vtysh, zebra: Fix malformed json output for multiple vrfs in command 'show ip route vrf all json'" This reverts commit `0e2fc3d67f`. This fix was correct but not optimal for memory consumption at scale. Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>	2024-06-07 10:12:42 +02:00
Donald Sharp	b70a773aa6	Merge pull request #16151 from pguibert6WIND/srv6_fix_source_address zebra: display srv6 encapsulation source-address when configured	2024-06-06 12:22:15 -04:00
Mark Stapp	28d2e126c7	zebra: fix incoming FPM message length validation Validate incoming message length against correct (struct rtmsg) len, not top-level netlink message header size. Signed-off-by: Mark Stapp <mjs@cisco.com>	2024-06-05 14:37:41 -04:00
Georgi Valkov	f7242fbf73	zebra: fix compilation with GCC14 Fixes: zebra/zebra_netns_notify.c: In function 'zebra_ns_ready_read': zebra/zebra_netns_notify.c:266:40: error: implicit declaration of function 'basename' [-Wimplicit-function-declaration] 266 \| if (strmatch(VRF_DEFAULT_NAME, basename(netnspath))) { \| ^~~~~~~~ Fixed by including libgen.h, then since basename may modify its parameter, allocate a copy on the stack, using strdupa, and pass the temporary string to basename. According to the man page for basename: With glibc, one gets the POSIX version of basename() when <libgen.h> is included, and the GNU version otherwise. The POSIX version of basename may modify the contents of path, so we should to pass a copy when calling this function. [1] https://man7.org/linux/man-pages/man3/basename.3.html Signed-off-by: Georgi Valkov <gvalkov@gmail.com>	2024-06-04 13:35:57 +03:00
Philippe Guibert	890b67d7a9	zebra: display srv6 encapsulation source-address when configured The 'show running-config' does not display the ipv6 source address when a locator is not configured. Fix this by systematically displaying the ipv6 source address. Fixes: `6a0956169b` ("zebra: Add encap source address to SRv6 config write function") Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2024-06-04 09:55:21 +02:00
Mike RE Mallin	a514e34e30	bgpd, lib, zebra: Extend ES_VTEP_LIST_STR_SZ to support IPv6 addresses Signed-off-by: Mike RE Mallin <mremallin@gmail.com>	2024-05-31 10:27:22 -04:00
Mike RE Mallin	f4f19cc20f	lib, zebra: Update prefix_sg structure for IPv6 group support Signed-off-by: Mike RE Mallin <mremallin@gmail.com>	2024-05-31 10:27:22 -04:00
Igor Ryzhov	f8abf96440	Merge pull request #15082 from louis-6wind/fix-iff-lower-up lib: take into account the Linux IFF_LOWER_UP flag	2024-05-29 00:01:20 +03:00
Russ White	4bd16486e9	Merge pull request #16032 from Pdoijode/pdoijode/zebra-rt-deny-fix zebra: Deny the routes if ip protocol CLI refers to an undefined rmap	2024-05-21 10:52:52 -04:00
Pooja Jagadeesh Doijode	705e8ef78f	zebra: Deny the routes if ip protocol CLI refers to an undefined rmap Currently zebra does not deny the routes if `ip protocol <proto> route-map FOO` commmand is configured with reference to an undefined route-map (FOO in this case). However, on FRR restart, in zebra_route_map_check() routes get denied if route-map name is available but the route-map is not defined. This change was introduced in `fd303a4ba1`. Fix: When `ip protocol <proto> route-map FOO` CLI is configured with reference to an undefined route-map FOO, let the processing in ip_protocol_rm_add() and ip_protocol_rm_del() go through so that zebra can deny the routes instead of simply returning. This will result in consistent behavior. Testing Done: Before fix: ``` spine-1# configure spine-1(config)# ip protocol bgp route-map rmap7 root@spine-1:mgmt:/var/home/cumulus# vtysh -c "show run" \| grep rmap7 ip protocol bgp route-map rmap7 root@spine-1:mgmt:/var/home/cumulus# spine-1(config)# do show ip route Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, A - Babel, D - SHARP, F - PBR, f - OpenFabric, Z - FRR, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure C>* 27.0.0.1/32 is directly connected, lo, 02:27:45 B>* 27.0.0.3/32 [20/0] via fe80::202:ff:fe00:21, downlink_1, weight 1, 02:27:35 B>* 27.0.0.4/32 [20/0] via fe80::202:ff:fe00:29, downlink_2, weight 1, 02:27:40 B>* 27.0.0.5/32 [20/0] via fe80::202:ff:fe00:31, downlink_3, weight 1, 02:27:40 B>* 27.0.0.6/32 [20/0] via fe80::202:ff:fe00:39, downlink_4, weight 1, 02:27:40 ``` After fix: ``` spine-1(config)# ip protocol bgp route-map route-map67 spine-1(config)# do show ip route Codes: K - kernel route, C - connected, S - static, R - RIP, O - OSPF, I - IS-IS, B - BGP, E - EIGRP, N - NHRP, T - Table, A - Babel, D - SHARP, F - PBR, f - OpenFabric, Z - FRR, > - selected route, * - FIB route, q - queued, r - rejected, b - backup t - trapped, o - offload failure C>* 27.0.0.1/32 is directly connected, lo, 00:35:03 B 27.0.0.3/32 [20/0] via fe80::202:ff:fe00:21, downlink_1 inactive, weight 1, 00:34:58 B 27.0.0.4/32 [20/0] via fe80::202:ff:fe00:29, downlink_2 inactive, weight 1, 00:34:57 B 27.0.0.5/32 [20/0] via fe80::202:ff:fe00:31, downlink_3 inactive, weight 1, 00:34:57 B 27.0.0.6/32 [20/0] via fe80::202:ff:fe00:39, downlink_4 inactive, weight 1, 00:34:58 spine-1(config)# root@spine-1:mgmt:/var/home/cumulus# ip route show root@spine-1:mgmt:/var/home/cumulus# ``` Signed-off-by: Pooja Jagadeesh Doijode <pdoijode@nvidia.com>	2024-05-20 12:50:18 -07:00
Donatas Abraitis	b87d5a467e	Merge pull request #15980 from donaldsharp/agentx_update *: Modify agentx to be allowed to be called	2024-05-20 22:33:01 +03:00
Donatas Abraitis	4769239967	Merge pull request #16019 from anlancs/ldpd/fix-cmd-1 zebra: fix mpls command	2024-05-18 16:11:39 +03:00
Donatas Abraitis	815b1d0e58	Merge pull request #16009 from anlancs/zebra/fix-fpm-nl zebra: fix wrong fpm packet header	2024-05-18 16:11:10 +03:00
Donald Sharp	e3fceef597	Merge pull request #16001 from EasyNetDev/dpdk->=22.11-API-fix zebra: Update Zebra DPDK >=22.11 API	2024-05-17 09:41:59 -04:00
anlan_cs	05ad3ccb03	zebra: fix mpls command Configured with "mpls label bind 1.1.1.1/32 explicit-null", the running configuration is: ``` ! mpls label bind 1.1.1.1/32 IPv4 Explicit Null ! ``` After this commit, the running configuration is: ``` ! mpls label bind 1.1.1.1/32 explicit-null ! ``` And add the support for the "no" form: ``` anlan(config)# mpls label bind 1.1.1.1/32 explicit-null anlan(config)# no mpls label bind 1.1.1.1/32 explicit-null ``` Signed-off-by: anlan_cs <anlan_cs@tom.com>	2024-05-16 23:30:41 +08:00
EasyNetDev	0e040960d7	zebra: Update Zebra for DPDK >=22.11 API * Starting from version DPDK 22.11 we have API changes: The rte_driver and rte_device objects are now opaque and must be manipulated through added accessors. We need to update Zebra DPDK sources to DPDK version >=22.11 * Fix clang-format Signed-off-by: EasyNet <devel@easynet.dev>	2024-05-16 14:26:26 +03:00
Philippe Guibert	05b6cfc71a	zebra, lib: move nexthop display helper to lib folder The zebra_nexthop_vty_helper() and zebra_nexthop_json_helper() functions could be very helpful to display nexthop information from whatever daemon. Move the core function in the nexthop_vty_helper() and the nexthop_json_helper() function. The zebra API call remains unchanged. Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2024-05-15 15:33:52 +02:00
Russ White	db1e2a094d	Merge pull request #16008 from mjstapp/show_routemap_source zebra: include route source set by route-map in show output	2024-05-15 06:18:41 -04:00
anlan_cs	2429dd130f	zebra: fix wrong fpm packet header Signed-off-by: anlan_cs <anlan_cs@tom.com>	2024-05-14 23:26:40 +08:00
Mark Stapp	d6835df9cc	zebra: include route source set by route-map in show output Include the prefix source address when set by a route-map in show output for routes, in various formats. Add some debugs when encoding netlink route messages with a source address. Signed-off-by: Mark Stapp <mjs@cisco.com>	2024-05-14 10:28:17 -04:00
Donatas Abraitis	5f8759a4aa	Merge pull request #15968 from donaldsharp/parse_attr_problems Parse attr problems	2024-05-11 14:44:29 +03:00
Donald Sharp	73ad64a6f4	*: Modify agentx to be allowed to be called If you had a situation where an operator turned on ospfd with snmp but not ospf6d and agentx was configured then you get into a situation where ospf6d would complain that the config for agentx did not exist. Let's modify the code to allow this situation to happen. Fixes: #15896 Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-05-10 10:16:29 -04:00
Mark Stapp	6afb6962be	zebra: add some more netlink RTA_ strings Add a few more RTA_ attribute names to the netlink debug output. Signed-off-by: Mark Stapp <mjs@cisco.com>	2024-05-09 16:56:07 -04:00
Donald Sharp	bd4fca132a	zebra: Ensure multipath encodes vxlan right for fpm usage The fpm code path for the dplane_fpm_nl module was improperly encoding the multipath nexthop data for vxlan type routes. Move this into the embedded nexthop encoding where it belongs. This change makes it so that the usage of `-M dplane_fpm_nl` is now producing the same netlink messages that `-M fpm` produces when using vxlan based nexthops. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-05-09 11:52:51 -04:00
Donald Sharp	569f9e4394	zebra: Move fpm check to inside of netlink_route_nexthop_encap Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-05-09 11:52:51 -04:00
Donald Sharp	ba5a3538e8	zebra: Move netlink_route_nexthop_encap Move this static function earlier so we can avoid a predecleartion. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-05-09 11:52:51 -04:00
Donald Sharp	084aba4ec0	zebra: Add 2 things to fpm_listener 1) Add ability to hex-dump the received packet for debugging 2) Receive encap type and vxlan vni and display them. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-05-09 11:52:51 -04:00
Donatas Abraitis	5ed6db5e9e	Merge pull request #15920 from chiragshah6/fdev2 zebra: fix EVPN svd based remote nh neigh del	2024-05-08 22:36:58 +03:00
Christian Hopps	c54bc7a8dd	Merge pull request #15594 from idryzhov/mgmt-rpc mgmtd: implement YANG RPC/action support	2024-05-06 16:05:50 -04:00
Donald Sharp	216bac2826	Merge pull request #15902 from opensourcerouting/build-xref-setup-hppa build: fix missing `XREF_SETUP` (and HP PA-RISC build failure)	2024-05-06 10:29:05 -04:00
Chirag Shah	fde6dd7bb9	zebra: fix EVPN svd based remote nh neigh del In the context of SVD (Single VxLAN Device) for L3VNI, the remote VTEP's nexthop is programmed neighbor entry against SVD along with neighbor entry against SVI. However, when L3VNI is removed or the VRF is disabled, all SVI based remote nexthop neighbors are uninstalled and deleted. The SVD based neigh entries remains in Zebra and the Kernel. Subsequently, when reconfiguring L3VNI and relearning the same nexthop, the neighbor entry is not programmed is because it is not removed from Zebra SVD neighbor hash table, leading to the failure to reprogram the entry. With this fix, the SVD nexthop neigh entry is uninstalled and deleted from Zebra and Kernel. Ticket: #3729045 Testing: borderleaf:# ip neigh show 2.2.2.2 2.2.2.2 dev vlan2560_l3 lladdr 00:01:00:00:1d:09 extern_learn NOARP proto zebra 2.2.2.2 dev vxlan99 lladdr 00:01:00:00:1d:09 extern_learn NOARP proto zebra With the fix: Zebra log shows both enties SVD (vxlan99) and SVI (vlan2560_l3) neighbor entries are deleted. 2024/05/03 18:41:33.527125 ZEBRA: [NH6N7-54CD1] Tx RTM_DELNEIGH family ipv4 IF vxlan99(16) Neigh 2.2.2.2 MAC null flags 0x10 state 0x0 ext_flags 0x0 2024/05/03 18:41:33.527128 ZEBRA: [NH6N7-54CD1] Tx RTM_DELNEIGH family ipv4 IF vlan2560_l3(18) Neigh 2.2.2.2 MAC null flags 0x10 state 0x0 ext_flags 0x0 borderleaf:# ip neigh show 2.2.2.2 borderleaf:# Signed-off-by: Chirag Shah <chirag@nvidia.com>	2024-05-03 12:43:01 -07:00
Chirag Shah	07005288c2	zebra: bridge flap handle vlan membership update Upon bridge flap, the associated SVD case, VLAN membership is not updated correctly. When SVI comes up, the VNI could not associate with it as bridge VLAN membership was not updated. Ticket: #3821632 Testing: Before fix: ----------- tor-1:#ifdown br_l3vni ; sleep 1 ; ifup br_l3vni tor-1:# vtysh -c 'show evpn vni 8888' VNI: 8888 Type: L3 Tenant VRF: sym_1 Vlan: 490 Bridge: br_l3vni Local Vtep Ip: 27.0.0.9 Vxlan-Intf: vxlan99 SVI-If: None <<<<<< SVI not found State: Down <<<<<< status remained in down BGP is not informed VNI Filter: none System MAC: None Router MAC: None L2 VNIs: 1800 1801 1900 1901 After fix: ---------- tor-1:# ifdown br_l3vni; sleep 1; ifup br_l3vni tor-1:# vtysh Hello, this is FRRouting (version 8.4.3). Copyright 1996-2005 Kunihiro Ishiguro, et al. tor-1# show evpn vni 8888 VNI: 8888 Type: L3 Tenant VRF: sym_1 Vlan: 490 Bridge: br_l3vni Local Vtep Ip: 27.0.0.9 Vxlan-Intf: vxlan99 SVI-If: vlan490_l3 <<<<<< State: Up <<<<<< VNI Filter: none System MAC: 44:38:39:ff:ff:29 Router MAC: 44:38:39:ff:ff:29 L2 VNIs: 1800 1801 1900 1901 Signed-off-by: Chirag Shah <chirag@nvidia.com>	2024-05-03 10:52:33 -07:00
David Lamparter	3ca60d00b1	: add XREF_SETUP() to libraries and utilites This is theoretically not needed if neither DEFUNs nor zlog_ calls are used, except I'm about to turn it into a build error to catch the cases where it _is_ necessary. Which is libmgmt_be_nb.la in this case, where it causes build failures on hppa. Signed-off-by: David Lamparter <equinox@opensourcerouting.org>	2024-05-02 23:03:08 +02:00
anlan_cs	8a02942eda	zebra: clean up one comment With the commit `605df8d4`, all real things are moved into dplane. So the operations mentioned in this comment have nothing to do with this function `netlink_link_change()`. Just remove that confusing and useless comment. Signed-off-by: anlan_cs <anlan_cs@tom.com>	2024-05-02 16:36:01 +08:00
Donald Sharp	26595e3162	zebra: Ensure proper decoding of netlink message As part of the kernel netlink functionality, it is possible that a bit of nested attributes can be passed up. This attribute has a type value which is stored in the lower 8 bits and in the upper 8 bits are a couple control flags that can be used. FRR can parse this data and then just throw away the value unless we mask off the upper 8 bits. Let's ensure that it can be properly parsed. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-04-30 14:46:08 -04:00
Donatas Abraitis	03a4765cda	Merge pull request #15841 from pguibert6WIND/dx6_support zebra, sharpd: add srv6 End.DX6 support	2024-04-26 23:25:34 +03:00
Russ White	f19817f71d	Merge pull request #15723 from opensourcerouting/feature/extended_link_bw_refactored_v1 bgpd: Implement extended link-bandwidth	2024-04-26 14:41:05 -04:00
Louis Scalbert	bb46027596	zebra: fix fpm_listener compilation I don't know why it appears only now. > ../sdist/zebra/fpm_listener.c:420:8: error: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Werror,-Wsign-compare] > if (!RTNH_OK(rtnh, len)) { > ^~~~~~~~~~~~~~~~~~ > ../sdist/include/linux/rtnetlink.h:437:31: note: expanded from macro 'RTNH_OK' > ((int)(rtnh)->rtnh_len) <= (len)) len is set with RTA_PAYLOAD and should be an integer. > len = RTA_PAYLOAD(mpath_rtattr); > #define RTA_PAYLOAD(rta) ((int)((rta)->rta_len) - RTA_LENGTH(0)) Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>	2024-04-26 17:11:53 +02:00
Philippe Guibert	f6e58d26f6	zebra, sharpd: add srv6 End.DX6 support Add the support for adding DX6 behavior into netlink layer of zebra. Add the necessary test in sharpd. > ubuntu2204# sharp install seg6local-routes 1:1::1:2 nexthop-seg6local loop1 End_DX6 4:4::4:6 1 > ubuntu2204# do show ipv6 route > [..] > D>* 1:1::1:2/128 [150/0] is directly connected, loop1, seg6local End.DX6 nh6 4:4::4:6, weight 1, 00:00:03 Signed-off-by: Philippe Guibert <philippe.guibert@6wind.com>	2024-04-25 13:54:25 +02:00
Donatas Abraitis	a988a0a88a	*: Use uint64_t for weight down the path to Zebra Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2024-04-22 17:50:08 +03:00
Donald Sharp	29c1ff446e	lib, zebra: Check for not being a blackhole route In zebra_interface_nhg_reinstall zebra is checking that the nhg is a singleton and not a blackhole nhg. This was originally done with checking that the nexthop is a NEXTHOP_TYPE_IFINDEX, NEXTHOP_TYPE_IPV4_IFINDEX and NEXTHOP_TYPE_IPV6_IFINDEX. This was excluding NEXTHOP_TYPE_IPV4 and NEXTHOP_TYPE_IPV6. These were both possible to be received and maintained from the upper level protocol for when a route is being recursively resolved. If we have gotten to this point in zebra_interface_nhg_reinstall the nexthop group has already been installed at least once and we know that it is actually a valid nexthop. What the test is really trying to do is ensure that we are not reinstalling a blackhole nexthop group( Which is not possible to even be here by the way, but safety first! ). So let's change to test for that instead. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-04-22 10:44:55 -04:00
Igor Ryzhov	58a8ebc1fc	lib: rework northbound RPC callback Change input/output arguments of the RPC callback from lists of (xpath/value) tuples to YANG data trees. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2024-04-22 16:36:22 +03:00
Igor Ryzhov	99a723e5a8	Merge pull request #15803 from LabNConsulting/chopps/add-missing-zif-type yang: add missing `zif-gre` zebra interface type	2024-04-20 21:25:28 +03:00
Christian Hopps	74fb4056cb	yang: add missing `zif-gre` zebra interface type This was caught by the grpc_basic test which was receiving an invalid error result, which was returned b/c inside zebra the libyang code was flagging the value as invalid for a derived zebra interface type. Signed-off-by: Christian Hopps <chopps@labn.net>	2024-04-20 08:04:00 -04:00
Igor Ryzhov	57811a53ba	lib, zebra: fix exit commands If a command is not marked as `YANG`-converted, the current command batching buffer is flushed before executing the command. We shouldn't flush the buffer when executing an `exit` command. It should only be flushed if the next command is not `YANG`-converted, which is checked by the command itself, not the previous `exit`. Fixes #15706. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2024-04-16 22:18:01 +03:00
Russ White	13d598712a	Merge pull request #15616 from pguibert6WIND/srv6_seg_list_sid_order zebra: fix static srv6 segment-list sid order	2024-04-16 07:55:05 -04:00
Igor Ryzhov	8d6f7a4397	zebra: fix encoded DNSSL length The encoded DNSSL length is not set so the value is missing from RA packet. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2024-04-15 23:31:26 +03:00
Donatas Abraitis	deef11a22b	Merge pull request #15612 from sri-mohan1/srib-24-zebra-b zebra: changes for code maintainability	2024-04-14 21:06:28 +03:00
Jafar Al-Gharaibeh	692f916b88	Merge pull request #15688 from mjstapp/dplane_api_version lib, zebra: Add a version scheme for the dataplane API	2024-04-11 23:54:02 -05:00
Donald Sharp	4fb336424e	zebra: rtadv uses uninited data to sendmsg valgrind is complaining about this problem. This fixes it. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-04-10 09:24:13 -04:00
Donatas Abraitis	73a6aa6a14	Merge pull request #15708 from donaldsharp/zebra_fix_io_display Zebra fix io display and shorten evpn zapi packet sizes	2024-04-10 08:26:02 +03:00
Donald Sharp	5d7b50d1c0	Merge pull request #15693 from anlancs/fix/zebra-label-bind zebra: fix wrong check for mpls label	2024-04-09 18:33:29 -04:00
Donald Sharp	94516d25aa	zebra: Use more appropriately sized stream packets for evpn When sending EVPN data, let's use more appropriately sized packets for the sending of data. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-04-09 10:08:03 -04:00
Donald Sharp	8d8f12ba8e	zebra: Actually display I/O buffer sizes An operator found a situation where zebra was backing up in a significant way towards BGP with EVPN changes taking up some serious amounts of memory. The key lines that would have clued us in on it were behind a dev build. Let's change this. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-04-09 10:08:03 -04:00
Mark Stapp	9f285dd0bc	zebra: add dataplane API version value Add a version value and accessor API for the zebra dataplane; plugins can test this to detect API changes. Signed-off-by: Mark Stapp <mjs@cisco.com>	2024-04-09 08:59:13 -04:00
Donald Sharp	8cfa3b57e9	Merge pull request #15645 from opensourcerouting/fix/show_ip_route_vrf_all_summary_json zebra: Fix JSON output for `show route summary json`	2024-04-08 11:12:40 -04:00
Dmytro Shytyi	8e82e2d5ed	zebra: fix static srv6 segment-list sid order When configuring a SID list by vtysh, the segment list obtained in iproute2 is the exact opposite: > >vtysh: >ipv6 route 2005::/64 eth0 segments 2001:db8:aaaa::7/2002::2/2003::3/2004::4 > >root@r1:/# ip -6 route >2005::/64 nhid 6 encap seg6 mode encap segs 4 [ 2004::4 2003::3 2002::2 2001:db8:aaaa::7 ] dev dummy0 proto 196 metric 20 pref medium > Fix this by keeping the same vtysh config and swap the segment's order of the list in the rt_netlink.c > >root@r1:/# ip -6 route >2005::/64 nhid 6 encap seg6 mode encap segs 4 [ 2001:db8:aaaa::7 2002::2 2003::3 2004::4 ] dev dummy0 proto 196 metric 20 pref medium > Fixes: `f20cf14` ("bgpd,lib,sharpd,zebra: srv6 introduce multiple segs/SIDs in nexthop") Signed-off-by: Dmytro Shytyi <dmytro.shytyi@6wind.com>	2024-04-08 15:45:26 +02:00
anlan_cs	cbd1f3239a	zebra: fix wrong check for mpls label Add a parameter for `zebra_mpls_fec_for_label()`, as before if it is `NULL`, otherwise exclude what this parameter represents. Before: ``` anlan(config)# mpls label bind 1.1.1.2/32 88 anlan(config)# mpls label bind 1.1.1.2/32 88 % Label already bound to a FEC anlan(config)# mpls label bind 1.1.1.3/32 88 % Label already bound to a FEC ``` After: ``` anlan(config)# mpls label bind 1.1.1.2/32 88 anlan(config)# mpls label bind 1.1.1.2/32 88 anlan(config)# mpls label bind 1.1.1.3/32 88 % Label already bound to a FEC ``` Signed-off-by: anlan_cs <anlan_cs@tom.com>	2024-04-06 09:01:39 +08:00
anlan_cs	9fb31c5e24	zebra: cosmetic change for one flag Use `CHECK_FLAG()` for readability. Signed-off-by: anlan_cs <anlan_cs@tom.com>	2024-04-05 14:30:26 +08:00
Mark Stapp	75ef259b1d	zebra: don't deref vxlan-vni array Null-check the vxlan-vni array info coming into zebra from the dataplane. Signed-off-by: Mark Stapp <mjs@cisco.com>	2024-04-01 13:00:04 -04:00
Donatas Abraitis	ca13121849	zebra: Fix JSON output for `show route summary json` Before: ``` { "routes":[ { "fib":1, "rib":1, "fibOffLoaded":0, "fibTrapped":0, "type":"connected" }, { "fib":1, "rib":1, "fibOffLoaded":0, "fibTrapped":0, "type":"local" }, { "fib":2, "rib":2, "fibOffLoaded":0, "fibTrapped":0, "type":"static" } ], "routesTotal":4, "routesTotalFib":4 } { "routes":[ { "fib":2, "rib":2, "fibOffLoaded":0, "fibTrapped":0, "type":"connected" }, { "fib":2, "rib":2, "fibOffLoaded":0, "fibTrapped":0, "type":"local" }, { "fib":2, "rib":2, "fibOffLoaded":0, "fibTrapped":0, "type":"static" } ], "routesTotal":6, "routesTotalFib":6 } ``` After: ``` { "ISR":{ "routes":[ { "fib":1, "rib":1, "fibOffLoaded":0, "fibTrapped":0, "type":"connected" }, { "fib":1, "rib":1, "fibOffLoaded":0, "fibTrapped":0, "type":"local" }, { "fib":2, "rib":2, "fibOffLoaded":0, "fibTrapped":0, "type":"static" } ], "routesTotal":4, "routesTotalFib":4 }, "default":{ "routes":[ { "fib":2, "rib":2, "fibOffLoaded":0, "fibTrapped":0, "type":"connected" }, { "fib":2, "rib":2, "fibOffLoaded":0, "fibTrapped":0, "type":"local" }, { "fib":2, "rib":2, "fibOffLoaded":0, "fibTrapped":0, "type":"static" } ], "routesTotal":6, "routesTotalFib":6 } } ``` Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2024-03-30 23:04:10 +02:00
Piotr Suchy	0e2fc3d67f	vtysh, zebra: Fix malformed json output for multiple vrfs in command 'show ip route vrf all json' Command 'show ip route vrf <vrf_name> json' returns a valid json object, however if instead of <vrf_name> we specify 'all', we get an invalid json object, like: {//vrf1 routes}{//vrf2 routes}{vrf3 routes} After the fix: {"vrf1":{//vrf1 routes},"vrf2:{//vrf2 routes},"vrf3":{//vrf3 routes}} Which is a valid json object, that can be parsed effectively using built-in modules. The rest of the commands remains unaffected and behave the same. Signed-off-by: Piotr Suchy <psuchy@akamai.com>	2024-03-29 14:01:21 +01:00
Mark Stapp	498967799d	Merge pull request #15598 from louis-6wind/fix-nhg-iface zebra: fix rejected route due to wrong nexthop-group	2024-03-26 11:49:03 -04:00
sri-mohan1	2df51c7fe3	zebra: changes for code maintainability these changes are for improving the code maintainability and readability Signed-off-by: sri-mohan1 <sri.mohan@samsung.com>	2024-03-26 10:21:45 +05:30
Mark Stapp	2313916a9e	Merge pull request #15590 from sri-mohan1/srib-24-zebra-b zebra: changes for code maintainability	2024-03-22 08:33:52 -04:00
Mark Stapp	20e017c379	Merge pull request #15010 from tlsalmin/master zebra: Fix crash on macvlan link down/up	2024-03-22 08:31:12 -04:00
Louis Scalbert	2a13b47185	zebra: fix rejected route due to wrong nexthop-group A specific sequence of actions involving the addition and removal of IP routes and network interfaces can lead to a route installation failure. The issue occurs under the following conditions: - Initially, there is no route present via the ens3 interface. - Adds a route: ip route 10.0.0.0/24 192.168.0.100 ens3 - Removes the same route: no ip route 10.0.0.0/24 192.168.0.100 ens3 - Removes the ens3 interface. - Re-adds the ens3 interface. - Again adds the same route: ip route 10.0.0.0/24 192.168.0.100 ens3 - And again removes it: no ip route 10.0.0.0/24 192.168.0.100 ens3 - Shuts down the ens3 interface - Reactivates the interface - Adds the route once more: ip route 10.0.0.0/24 192.168.0.100 ens3 The route appears to be rejected. > # show ip route nexthop > S>r 10.0.0.0/24 [1/0] (6) via 192.168.0.100, ens3, weight 1, 00:00:01 The commit `35729f38fa` ("zebra: Add a timer to nexthop group deletion") introduced a feature to keep a nexthop-group in Zebra for a certain period even when it is no longer in use. But if a nexthop-group interface is removed during this period, the association between the nexthop-group and the interface is lost in zebra memory. If the interface is later added back and a route is re-established, the nexthop-group interface dependency is not correctly reestablished. As a consequence, the nexthop-group flags remain unset when the interface is down. Upon the interface's reactivation, zebra does not reinstall the nexthop-group in the kernel because it is marked as valid and installed, but in reality, it does not exist in the kernel (it was removed when the interface was down). Thus, attempts to install a route via this nexthop-group ID fail. Stop maintaining a nexthop-group when its associated interface is no longer present. Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>	2024-03-22 11:29:38 +01:00
Donatas Abraitis	874242f129	*: Add missing SPDX-License-Identifier for some .c/.h files Adding them as others: GPL-2.0-or-later Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>	2024-03-21 16:46:58 +02:00
sri-mohan1	2501c3a4dc	zebra: changes for code maintainability these changes are for improving the code maintainability and readability Signed-off-by: sri-mohan1 <sri.mohan@samsung.com>	2024-03-21 14:56:24 +05:30
Russ White	502d1937da	Merge pull request #15552 from sri-mohan1/srib-24-zebra-a zebra: changes for code maintainability	2024-03-19 10:24:13 -04:00
Alexander Skorichenko	444ce317b2	zebra: fix route deletion during zebra shutdown Split zebra's vrf_terminate() into disable() and delete() stages. The former enqueues all events for the dplane thread. Memory freeing is performed in the second stage. Signed-off-by: Alexander Skorichenko <askorichenko@netgate.com>	2024-03-15 11:42:02 +01:00
sri-mohan1	efbf981d3f	zebra: changes for code maintainability these changes are for improving the code maintainability and readability Signed-off-by: sri-mohan1 <sri.mohan@samsung.com>	2024-03-15 10:30:23 +05:30
Donatas Abraitis	4b512f2f07	Merge pull request #15538 from idryzhov/fix-zebra-warning zebra: fix compilation warnings	2024-03-14 10:38:24 +02:00
Donatas Abraitis	f0686aaa2e	Merge pull request #15501 from donaldsharp/fpm_listener_send_response Fpm listener send response	2024-03-14 09:05:32 +02:00
Donatas Abraitis	caefee3571	Merge pull request #15526 from sri-mohan1/srib-24-zebra-a zebra: changes for code maintainability	2024-03-13 12:41:07 +02:00
Donald Sharp	cca4bb6527	zebra: Add ability to pretend routes are offloaded In the fpm_listener add a -r option to allow for routes to be notified back to zebra that the route has been `offloaded` properly. fpm_listener output: New route 4.5.6.7/32, Prot: Static(196), Metric: 20, nhgid: 23 Route Static(196) reflecting back FPM message - Type: 1, Length 56 New route 169.254.0.0/16, Prot: Kernel(2), Metric: 20, nhgid: 2 FPM message - Type: 1, Length 56 New route 192.168.99.0/24, Prot: Kernel(2), Metric: 20, nhgid: 42 FPM message - Type: 1, Length 56 New route 192.168.99.1/32, Prot: Kernel(2), Metric: 20, nhgid: 42 FPM message - Type: 1, Length 56 New route 192.168.119.0/24, Prot: OSPF(188), Metric: 20, nhgid: 20 Route OSPF(188) reflecting back Zebra output: 2024-03-06 21:48:54.613 [DEBG] zebra: [TJXPZ-RC5XQ] default(0:254):4.5.6.7/32 Processing dplane notif ctx 0x7160b4008780 2024-03-06 21:48:54.613 [DEBG] zebra: [TJXPZ-RC5XQ] default(0:254):192.168.119.0/24 Processing dplane notif ctx 0x7160b4008780 eva# show ip route 4.5.6.7 json { "4.5.6.7/32":[ { "prefix":"4.5.6.7/32", "prefixLen":32, "protocol":"static", "vrfId":0, "vrfName":"default", "selected":true, "destSelected":true, "distance":1, "metric":0, "installed":true, "offloaded":true, .... and eva# show ip route 192.168.119.0 json { "192.168.119.0/24":[ { "prefix":"192.168.119.0/24", "prefixLen":24, "protocol":"ospf", "vrfId":0, "vrfName":"default", "selected":true, "destSelected":true, "distance":110, "metric":100, "installed":true, "offloaded":true, ... Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-03-13 07:38:28 +00:00
Donald Sharp	4676c23cd8	zebra: fpm_listener start adding data about nexthops Start some very small bit of work into the fpm_listener about RTM_NEWNEXTHOPS. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-03-13 07:38:28 +00:00
Donald Sharp	58c53ab5ba	zebra: Add some additional known protocols for fpm_listener Just add a few more of the known protocols. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-03-13 07:38:27 +00:00
Igor Ryzhov	3280d81cc9	zebra: fix compilation warnings Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2024-03-13 07:08:44 +00:00
Donald Sharp	9ef76cff98	zebra: Cleanup leaked memory on shutdown from GR code Recent commit: `6b2554b94a` Exposed, via Address Sanitation, that memory was being leaked. Unfortunately the CI system did not catch this. Two pieces of memory were being lost: The zserv client data structure as well as anything on the client->gr_info_queue. Clean these up. Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-03-12 21:18:02 -04:00
sri-mohan1	19a0211b73	zebra: changes for code maintainability these changes are for improving the code maintainability and readability Signed-off-by: sri-mohan1 <sri.mohan@samsung.com>	2024-03-12 12:00:12 +05:30
Donatas Abraitis	1942951ea4	Merge pull request #15523 from donaldsharp/lib_mtype_cleanup Lib mtype cleanup	2024-03-12 07:14:10 +01:00
Donald Sharp	6b2554b94a	zebra: Clean up leaked linklist data structure on shutdown Signed-off-by: Donald Sharp <sharpd@nvidia.com>	2024-03-11 16:05:56 -04:00
Mark Stapp	d0afa12467	Merge pull request #15411 from raja-rajasekar/rajasekarr/backpressure_bgp_zebra_zserv zebra: backpressure - Zebra push back on Buffer/Stream creation	2024-03-11 16:04:13 -04:00
Donald Sharp	7e058c201a	Merge pull request #15469 from LabNConsulting/chopps/keychain-yang add ietf-key-chain YANG module support	2024-03-08 08:15:02 -05:00
Rajasekar Raja	a8efa994da	zebra: backpressure - Zebra push back on Buffer/Stream creation Currently, the way zebra works is it creates pthread per client (BGP is of interest in this case) and this thread loops itself in zserv_read() to check for any incoming data. If there is one, then it reads, validates and adds it in the ibuf_fifo signalling the main thread to process the message. The main thread when it gets a change, processes the message, and invokes the function pointer registered in the header command. (Ex: zserv_handlers). Finally, if all of this was successful, this task reschedules itself and loops in zserv_read() again However, if there are already items on ibuf FIFO, that means zebra is slow in processing. And with the current mechanism if Zebra main is busy, the ibuf FIFO keeps growing holding up the memory. Show memory zebra:(Example: 15k streams hoarding ~160 MB of data) --- qmem libfrr --- Stream : 44 variable 3432352 15042 161243800 Fix: Client IO Thread: (zserv_read) - Stop doing the read events when we know there are X number of items on the FIFO already.(X - zebra zapi-packets <1-10000> (Default-1000) - Determine the number of items on the zserv->ibuf_fifo. Subtract this from the work items and only pull the number of items off that would take us to X items on the ibuf_fifo again. - If the number of items in the ibuf_fifo has reached to the maximum * Either initially when zserv_read() is called (or) * when processing the remainders of the incoming buffer the client IO thread is woken by the the zebra main. Main thread: (zserv_process_message) If the client ibuf always schedules a wakeup to the client IO to read more items from the socked buffer. This way we ensure - Client IO thread always tries to read the socket buffer and add more items to the ibuf_fifo (until max limit) - hidden config change (zebra zapi-packets <>) is taken into account Ticket: #3390099 Signed-off-by: Donald Sharp <sharpd@nvidia.com> Signed-off-by: Rajasekar Raja <rajasekarr@nvidia.com>	2024-03-07 15:16:33 -08:00
Russ White	ac34badf71	Merge pull request #15397 from idryzhov/zebra-oper zebra: oper data additions	2024-03-05 11:41:35 -05:00

... 3 4 5 6 7 ...

6232 commits