LogicMonitor + Catchpoint: Enter the New Era of Autonomous IT

Learn more

Detect BGP session instability before it becomes an outage.

LogicMonitor monitors BGP session state, reset events, and convergence health, giving your network team early warning of instability and enabling them to respond before a reset cascades into a wider routing disruption.

What is a BGP reset?

A BGP reset is the teardown and re-establishment of a BGP peering session. When a session resets, all route information exchanged on that session is discarded and must be re-learned from scratch via a full routing table exchange. Resets can be intentional (triggered by an operator to apply policy changes) or unintentional (caused by hardware failures, software bugs, hold timer expiry, or network disruption).

What is the difference between a hard reset and a soft reset?

A hard reset tears down the BGP TCP session entirely, discarding all routing state. Both peers must re-establish the session and re-exchange complete routing tables. A soft reset applies policy changes without tearing down the session. Inbound soft resets use the BGP Route Refresh capability (RFC 2918) to request a re-advertisement from the peer; outbound soft resets use locally stored Adj-RIB-Out data to re-evaluate and re-advertise routes under the new policy.

What causes unplanned BGP session resets?

Common causes include: hold timer expiry (no KEEPALIVE received within the negotiated hold time, often caused by high CPU, network congestion, or routing daemon issues), NOTIFICATION message errors (capability mismatches, authentication failures, or malformed updates), TCP session failures (link down, MTU issues, or network path changes), hardware failures on the router or line card, and software bugs in the routing daemon. Each cause leaves a different signature in the session’s error counters and logs.

How does Graceful Restart reduce the impact of BGP resets?

BGP Graceful Restart (RFC 4724) allows a restarting router to maintain its forwarding state while the BGP session re-establishes. Neighboring routers preserve the routes they’ve learned from the restarting router and continue forwarding traffic, rather than immediately withdrawing those routes and causing a routing black hole. This significantly reduces the traffic impact of planned maintenance or software restarts, though it does not help with unplanned failures.