Causal cluster follower falling behind

tim.hanssen · December 12, 2018, 11:09am

We're running a locally hosted 3-server causal cluster. A few times a day we see the message "follower has fallen behind" in debug.log. Sometimes it's just one entry; sometimes a server keeps falling behind again seconds after it moved back to PIPELINE mode.

2018-12-12 10:05:59.430+0000 INFO [o.n.c.c.c.s.RaftLogShipper] MemberId{f3b56e53}[matchIndex: 50170773, lastSentIndex: 50171030, localAppendIndex: 50171031, mode: PIPELINE]: follower has fallen behind (target prevLogIndex was 50171030, maxAllowedShippingLag is 256), moving to CATCHUP mode

2018-12-12 10:06:01.372+0000 INFO [o.n.c.c.c.s.RaftLogShipper] MemberId{f3b56e53}[matchIndex: 50171094, lastSentIndex: 50171141, localAppendIndex: 50171141, mode: CATCHUP]: caught up, moving to PIPELINE mode

Ubuntu 18 LTS, 62GB
Neo4j 3.4.10
BOLT (without routing)
Causal cluster (from 3 nodes)

Any suggestions on where we should start looking? We're thinking about raising the causal_clustering.log_shipping_max_lag but the docs are not really clear about the implications.
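To get a feel for how often this happens and by how much, the "fallen behind" lines can be pulled out of debug.log and summarised per member. This is only a minimal sketch: the function names are made up for illustration, the regex is built from the single log line quoted above, and treating the lag as target prevLogIndex minus matchIndex is an assumption about how the shipper compares against maxAllowedShippingLag.

```python
import re

# Built from the one "fallen behind" line quoted in this post; other Neo4j
# versions may format the message differently.
FALLEN_BEHIND = re.compile(
    r"(?P<ts>\S+ \S+) INFO \[o\.n\.c\.c\.c\.s\.RaftLogShipper\] "
    r"MemberId\{(?P<member>\w+)\}\[matchIndex: (?P<match>\d+), .*?\]: "
    r"follower has fallen behind \(target prevLogIndex was (?P<prev>\d+), "
    r"maxAllowedShippingLag is (?P<max_lag>\d+)\)"
)


def parse_fallen_behind(line):
    """Return a dict describing one 'fallen behind' event, or None."""
    m = FALLEN_BEHIND.search(line)
    if m is None:
        return None
    return {
        "timestamp": m.group("ts"),
        "member": m.group("member"),
        # Assumption: lag = target prevLogIndex - matchIndex.
        "lag": int(m.group("prev")) - int(m.group("match")),
        "max_lag": int(m.group("max_lag")),
    }


def summarize(lines):
    """Map member id -> (number of events, largest lag seen)."""
    summary = {}
    for line in lines:
        event = parse_fallen_behind(line)
        if event is None:
            continue
        count, worst = summary.get(event["member"], (0, 0))
        summary[event["member"]] = (count + 1, max(worst, event["lag"]))
    return summary


SAMPLE = (
    "2018-12-12 10:05:59.430+0000 INFO [o.n.c.c.c.s.RaftLogShipper] "
    "MemberId{f3b56e53}[matchIndex: 50170773, lastSentIndex: 50171030, "
    "localAppendIndex: 50171031, mode: PIPELINE]: follower has fallen behind "
    "(target prevLogIndex was 50171030, maxAllowedShippingLag is 256), "
    "moving to CATCHUP mode"
)

if __name__ == "__main__":
    print(parse_fallen_behind(SAMPLE))
```

For the line quoted above the lag works out to 50171030 - 50170773 = 257, one entry over the limit of 256. Passing an open debug.log file to summarize() would show whether the follower is usually just over the limit or far behind.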

tim.hanssen · December 17, 2018, 10:35am

We changed causal_clustering.log_shipping_max_lag from 256 to 512 and that seems to have fixed the issue.
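For reference, the change amounts to a single line in neo4j.conf. This is a sketch of our setup rather than a recommendation: 512 is just the value we picked, and we assume the setting has to be changed on every cluster member and only takes effect after a restart.

```
# neo4j.conf (value chosen by trial; the log above shows the previous limit of 256)
causal_clustering.log_shipping_max_lag=512
```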

yacine.2limi · August 26, 2019, 3:32pm

This works for me too.
Did you find out why it happens in the first place?
