linux/net/tipc
Allan Stephens b4b5610223 tipc: Ensure both nodes recognize loss of contact between them
Enhances TIPC to ensure that a node that loses contact with a
neighboring node does not allow contact to be re-established until
it sees that its peer has also recognized the loss of contact.

Previously, nodes that were connected by two or more links could
encounter a situation in which node A would lose contact with node B
on all of its links, purge its name table of names published by B,
and then fail to repopulate those names once contact with B was restored.
This would happen because B was able to re-establish one or more links
so quickly that it never reached a point where it had no links to A --
meaning that B never saw a loss of contact with A, and consequently
didn't re-publish its names to A.

This problem is now prevented by enhancing the cleanup done by TIPC
following a loss of contact with a neighboring node to ensure that
node A ignores all messages sent by B until it receives a LINK_PROTOCOL
message that indicates B has lost contact with A, thereby preventing
the (re)establishment of links between the nodes. The loss of contact
is recognized when a RESET or ACTIVATE message is received that has
a "redundant link exists" field of 0, indicating that B's sending link
endpoint is in a reset state and that B has no other working links.

Additionally, TIPC now suppresses the sending of (most) link protocol
messages to a neighboring node while it is cleaning up after an earlier
loss of contact with that node. This stops the peer node from prematurely
activating its link endpoint, which would prevent TIPC from later
activating its own end. TIPC still allows outgoing RESET messages to
occur during cleanup, to avoid problems if its own node recognizes
the loss of contact first and tries to notify the peer of the situation.

Finally, TIPC now recognizes an impending loss of contact with a peer node
as soon as it receives a RESET message on a working link that is the
peer's only link to the node, and ensures that the link protocol
suppression mentioned above goes into effect right away -- that is,
even before its own link endpoints have failed. This is necessary to
ensure correct operation when there are redundant links between the nodes,
since otherwise TIPC would send an ACTIVATE message upon receiving a RESET
on its first link and only begin suppressing when a RESET on its second
link was received, instead of initiating suppression with the first RESET
message as it needs to.

Note: The reworked cleanup code also eliminates a check that prevented
a link endpoint's discovery object from responding to incoming messages
while stale name table entries are being purged. This check is now
unnecessary and would have slowed down re-establishment of communication
between the nodes in some situations.

Signed-off-by: Allan Stephens <allan.stephens@windriver.com>
Signed-off-by: Paul Gortmaker <paul.gortmaker@windriver.com>
2011-09-17 22:55:03 -04:00
..
addr.c tipc: Eliminate configuration for maximum number of cluster nodes 2011-03-13 16:35:17 -04:00
addr.h tipc: make zone/cluster mask constants a define 2011-05-10 16:03:54 -04:00
bcast.c tipc: Remove obsolete congestion handling when sending a broadcast NACK 2011-09-01 11:16:37 -04:00
bcast.h tipc: Remove non-executable code to handle broadcast bearer congestion 2011-09-01 11:16:36 -04:00
bearer.c tipc: Ensure congested links receive bearer status updates 2011-09-01 11:16:37 -04:00
bearer.h tipc: Eliminate unused field in bearer structure 2011-06-24 16:18:16 -04:00
config.c tipc: Correct misnamed references to neighbor discovery domain 2011-03-13 16:35:18 -04:00
config.h tipc: Remove obsolete inclusions of header files 2010-12-02 13:34:02 -08:00
core.c tipc: Drop __TIME__ usage 2011-05-10 10:31:03 -04:00
core.h atomic: use <linux/atomic.h> 2011-07-26 16:49:47 -07:00
discover.c tipc: Ensure both nodes recognize loss of contact between them 2011-09-17 22:55:03 -04:00
discover.h tipc: Add monitoring of number of nodes discovered by bearer 2011-05-10 16:04:01 -04:00
eth_media.c tipc: remove zeroing assignments to static global variables 2011-01-01 13:57:57 -08:00
handler.c tipc: remove zeroing assignments to static global variables 2011-01-01 13:57:57 -08:00
Kconfig tipc: Eliminate configuration for maximum number of cluster nodes 2011-03-13 16:35:17 -04:00
link.c tipc: Ensure both nodes recognize loss of contact between them 2011-09-17 22:55:03 -04:00
link.h tipc: Avoid recomputation of outgoing message length 2011-05-10 16:03:57 -04:00
log.c tipc: Finish streamlining of debugging code 2011-01-01 13:57:54 -08:00
log.h tipc: update log.h re-include protection to reflect new name 2011-01-01 14:56:18 -08:00
Makefile tipc: rename dbg.[ch] to log.[ch] 2011-01-01 13:57:51 -08:00
msg.c tipc: Eliminate useless check when creating internal message 2011-06-24 16:18:17 -04:00
msg.h tipc: Cleanup of message header size terminology 2011-06-24 16:18:17 -04:00
name_distr.c tipc: Cleanup of message header size terminology 2011-06-24 16:18:17 -04:00
name_distr.h [NET] TIPC: Fix whitespace errors. 2007-02-10 23:20:15 -08:00
name_table.c tipc: Eliminate checks for empty zone list during name translation 2011-06-24 16:18:16 -04:00
name_table.h tipc: Convert name table publication lists to standard kernel lists 2011-06-24 16:18:16 -04:00
net.c tipc: Remove obsolete manipulation of message re-route count field 2011-09-01 11:16:35 -04:00
net.h tipc: manually inline net_start/stop, make assoc. vars static 2011-03-13 16:35:17 -04:00
netlink.c genetlink: make netns aware 2009-07-12 14:03:27 -07:00
node_subscr.c tipc: Cosmetic changes to node subscription code 2011-03-13 16:35:16 -04:00
node_subscr.h tipc: Cosmetic changes to node subscription code 2011-03-13 16:35:16 -04:00
node.c tipc: Ensure both nodes recognize loss of contact between them 2011-09-17 22:55:03 -04:00
node.h tipc: Ensure both nodes recognize loss of contact between them 2011-09-17 22:55:03 -04:00
port.c tipc: Optimize creation of connection protocol messages 2011-06-24 16:18:18 -04:00
port.h tipc: Avoid recomputation of outgoing message length 2011-05-10 16:03:57 -04:00
ref.c tipc: remove zeroing assignments to static global variables 2011-01-01 13:57:57 -08:00
ref.h tipc: cleanup function namespace 2010-10-16 11:13:24 -07:00
socket.c tipc: Prevent rounding issues when saving connect timeout option 2011-09-01 11:16:37 -04:00
subscr.c tipc: Avoid recomputation of outgoing message length 2011-05-10 16:03:57 -04:00
subscr.h Revert d88dca79d3 2010-10-21 04:11:08 -07:00