We have a causal cluster with 5 core nodes and a DB size of ~60GB.
Every couple of days one of the nodes goes offline with the status: "Quarantine marker is present, but unable to read".
This seems to be caused by the node running out of disk space, even though each instance has a 250GB volume.
After closer inspection we see that the problem is the raft.log files growing and growing without being pruned as they are supposed to be. On this particular node we can now see over 700 files of 250MB each, accumulated over the last 3 weeks and taking up more than 175GB of the available space.
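In case anyone wants to reproduce the check, this is roughly how we tallied the segment files on the affected node (a minimal Python sketch; the raft-log directory path and the "raft.log*" file naming are assumptions from our own layout, adjust for yours):

from pathlib import Path

# Raft log location on the affected core member -- this path and the
# "raft.log*" naming pattern are assumptions; adjust to your deployment.
RAFT_LOG_DIR = Path("/var/lib/neo4j/data/cluster-state/db/neo4j/raft-log")

segments = sorted(RAFT_LOG_DIR.glob("raft.log*"))
total_bytes = sum(f.stat().st_size for f in segments if f.is_file())

print(f"segment files: {len(segments)}")
print(f"total size:    {total_bytes / 1024**3:.1f} GiB")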
This is our current (and also default) config for pruning:
"raft_log_entry_prefetch_buffer.max_entries":"1024"
"raft_log_implementation": "SEGMENTED"
"raft_log_prune_strategy": "1g size"
"raft_log_pruning_frequency": "10m"
"raft_log_reader_pool_size": "8"
"raft_log_rotation_size": "250.00MiB"
We would appreciate any ideas. Is there a specific reason why these logs are not being pruned?
Can we prune them manually without affecting the causal cluster?
Thanks in advance!