Hi colleagues,

could you please suggest where to look for a solution to the following issue?

Ubuntu 22.04, FRR 8.5.2; the router configuration is at the end of this message.

After start, everything works for some time, but then, for some reason, the following happens (output reduced). It can be fixed with 'clear ip ospf proc'.

The FRR routing table is fine:
# sh ip route
[ ... ]
O   0.0.0.0/0 [110/10] via 100.64.98.3, blue, weight 1, 01:34:12
                       via 100.64.99.3, orange, weight 1, 01:34:12
B>* 0.0.0.0/0 [20/0] via 212.113.51.33, wan, weight 1, 1d00h30m
C>* 100.64.0.0/23 is directly connected, br-int, 1d00h30m
O>* 100.64.2.0/23 [110/20] via 100.64.98.2, blue, weight 1, 01:34:13
  *                        via 100.64.99.2, orange, weight 1, 01:34:13
O>* 100.64.4.0/23 [110/20] via 100.64.98.3, blue, weight 1, 01:34:13
  *                        via 100.64.99.3, orange, weight 1, 01:34:13
O   100.64.97.0/24 [110/10] is directly connected, wg0, weight 1, 01:34:24
C>* 100.64.97.0/24 is directly connected, wg0, 1d00h30m
But the Linux routing table lacks 100.64.2.0/23 and 100.64.4.0/23:
# ip route
default nhid 229 via 212.113.51.33 dev wan proto bgp metric 20
100.64.0.0/23 dev br-int proto kernel scope link src 100.64.0.1
100.64.97.0/24 dev wg0 proto kernel scope link src 100.64.97.1
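To catch the moment the two tables diverge, the comparison above could be automated. The following is only a minimal sketch: it assumes the text of 'sh ip route' and 'ip route' has already been captured (for example from a periodic job), and the function names are hypothetical, not part of FRR or iproute2. It relies on the FRR legend, where '*' marks a FIB route.

```python
import ipaddress
import re

# First line of an FRR route entry, e.g. "O>* 100.64.2.0/23 [110/20] via ..."
# (code letter, up to two flag characters, then an IPv4 prefix).
FRR_ROUTE = re.compile(
    r"^(?P<code>[A-Za-z])(?P<flags>[>* ]{0,2})\s*"
    r"(?P<prefix>\d+\.\d+\.\d+\.\d+/\d+)"
)


def frr_installed_prefixes(show_ip_route: str) -> set[str]:
    """Prefixes that FRR marks with '*' (FIB route) in 'sh ip route' output."""
    prefixes = set()
    for line in show_ip_route.splitlines():
        m = FRR_ROUTE.match(line.strip())
        if m and "*" in m.group("flags"):
            prefixes.add(m.group("prefix"))
    return prefixes


def kernel_prefixes(ip_route: str) -> set[str]:
    """Destination prefixes found in 'ip route' output."""
    prefixes = set()
    for line in ip_route.splitlines():
        fields = line.split()
        if not fields:
            continue
        dest = "0.0.0.0/0" if fields[0] == "default" else fields[0]
        try:
            prefixes.add(str(ipaddress.ip_network(dest, strict=False)))
        except ValueError:
            # Not a prefix: e.g. a "nexthop ..." continuation line.
            continue
    return prefixes


def missing_from_kernel(show_ip_route: str, ip_route: str) -> list[str]:
    """Prefixes FRR believes are installed but the kernel does not have."""
    return sorted(frr_installed_prefixes(show_ip_route) - kernel_prefixes(ip_route))
```

Fed with the two outputs above, missing_from_kernel() would be expected to report the two OSPF prefixes. Logging a timestamp whenever the result becomes non-empty may help correlate the loss with messages in syslog.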
Issuing 'clear ip ospf proc' solves the issue: these routes appear in the Linux routing table again, until the next time. The OSPF adjacencies themselves are always fine; hello packets reach all destinations (tcpdump confirms this :-) ):
vishnu.utc.mygaru# sh ip ospf neigh

Neighbor ID     Pri State           Up Time         Dead Time Address         Interface                        RXmtL RqstL DBsmL
100.64.2.1        1 Full/Backup     17m52s             3.823s 100.64.98.2     blue:100.64.98.1                     0     0     0
100.64.4.1        1 Full/DR         22m59s             3.403s 100.64.98.3     blue:100.64.98.1                     0     0     0
100.64.2.1        1 Full/Backup     17m52s             3.823s 100.64.99.2     orange:100.64.99.1                   0     0     0
100.64.4.1        1 Full/DR         22m59s             3.403s 100.64.99.3     orange:100.64.99.1                   0     0     0
The following daemons are running, with these arguments:
root@vishnu:~# ps ax |grep frr
 816287 ?        S<s    0:18 /usr/lib/frr/watchfrr -d -F traditional zebra bgpd ospfd staticd
 816303 ?        S<sl   0:05 /usr/lib/frr/zebra -d -F traditional -A 127.0.0.1 -r
 816308 ?        S<sl   0:06 /usr/lib/frr/bgpd -d -F traditional -A 127.0.0.1
 816315 ?        S<s    1:19 /usr/lib/frr/ospfd -d -F traditional -A 127.0.0.1
 816318 ?        S<s    0:03 /usr/lib/frr/staticd -d -F traditional -A 127.0.0.1
The only information I see in the log is the following. This is from the restart of the OSPF process:
Aug 9 18:45:03 vishnu ospfd[816315]: [T08NC-EWX63][EC 134217741] Link State Acknowledgment: Unknown Neighbor 100.64.4.1
Aug 9 18:45:03 vishnu ospfd[816315]: [T08NC-EWX63][EC 134217741] Link State Acknowledgment: Unknown Neighbor 100.64.2.1
Aug 9 18:45:03 vishnu ospfd[816315]: [X7SPE-Y4BTR][EC 134217741] Link State Update: Unknown Neighbor 100.64.2.1 on int: orange:100.64.99.1
Aug 9 18:45:03 vishnu ospfd[816315]: [X7SPE-Y4BTR][EC 134217741] Link State Update: Unknown Neighbor 100.64.2.1 on int: blue:100.64.98.1
Aug 9 18:45:03 vishnu ospfd[816315]: [X7SPE-Y4BTR][EC 134217741] Link State Update: Unknown Neighbor 100.64.2.1 on int: blue:100.64.98.1
Aug 9 18:45:03 vishnu ospfd[816315]: [X7SPE-Y4BTR][EC 134217741] Link State Update: Unknown Neighbor 100.64.2.1 on int: orange:100.64.99.1
A bit later the following appears, but I don't know whether it relates to the issue or whether it coincides with the routes disappearing:
Aug 9 18:57:41 vishnu zebra[816303]: [RG2NH-FTSDH][EC 4043309102] Kernel deleted a nexthop group with ID (300[253/254]) that we are still using for a route, sending it back down
Aug 9 18:57:41 vishnu zebra[816303]: [RG2NH-FTSDH][EC 4043309102] Kernel deleted a nexthop group with ID (280[234/235]) that we are still using for a route, sending it back down
Aug 9 18:57:41 vishnu zebra[816303]: [RG2NH-FTSDH][EC 4043309102] Kernel deleted a nexthop group with ID (276[]) that we are still using for a route, sending it back down
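To check whether these zebra messages coincide with the missing routes, the affected nexthop group IDs and their timestamps could be pulled out of syslog. This is a rough sketch only: the regular expression is written against the three lines quoted above and is an assumption about the message format, and the function name is hypothetical.

```python
import re

# Written against the zebra messages quoted above; the exact wording may
# differ between FRR versions, so treat this pattern as an assumption.
NHG_DELETED = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s+[\d:]{8}).*"
    r"Kernel deleted a nexthop group with ID "
    r"\((?P<id>\d+)\[(?P<members>[\d/]*)\]\)"
)


def deleted_nexthop_groups(log_text: str) -> list[tuple[str, int, list[int]]]:
    """Return (timestamp, group ID, member IDs) for each deletion message."""
    events = []
    for line in log_text.splitlines():
        m = NHG_DELETED.search(line)
        if m:
            members = [int(x) for x in m.group("members").split("/") if x]
            events.append((m.group("ts"), int(m.group("id")), members))
    return events
```

The extracted IDs can then be compared with the 'nhid' values shown by 'ip route' and with the timestamps at which the OSPF prefixes were last seen in the kernel table.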
The question: any suggestions on where to look for a solution to this problem? What additional information should I gather to help solve it? Maybe I'm missing something in the OS networking/kernel configuration? (I haven't actually tweaked the configuration, apart from ip.forward=1.)

I will appreciate any recommendations on this.

Thank you.

~~~~~~~~~~~

frr version 8.5.2
frr defaults traditional
hostname vishnu
log syslog informational
no ipv6 forwarding
service integrated-vtysh-config
!
interface blue
 description === Blue infra ===
 ip ospf dead-interval 4
 ip ospf hello-interval 1
 no ip ospf passive
exit
!
interface orange
 description === Orange infra ===
 ip ospf dead-interval 4
 ip ospf hello-interval 1
 no ip ospf passive
exit
!
! this is eBGP to ISP
router bgp NNNN
 [ ... ]
exit
!
!this is eBGP to ISP
router bgp 64512 view erspan
 [ ... ]
exit
!
router ospf
 ospf router-id 100.64.0.1
 passive-interface default
 network 100.64.0.0/23 area 0
 network 100.64.97.0/24 area 0
 network 100.64.98.0/24 area 0
 network 100.64.99.0/24 area 0
 network 192.x.x.x/24 area 0
 default-information originate
exit
!
! few these prefix lists used for BGP
ip prefix-list [ ... ]
!
end

--
Volodymyr Litovka
"Vision without Execution is Hallucination." -- Thomas Edison