Neo4j Causal Cluster : Backup Strategies

ashutosh · March 27, 2019, 5:47am

Documentation is still bit sparse for cloud cluster backup starting this thread to document experiences and knowledge what some of us come to know.

I recently setup a cluster on google cloud using marketplace installation. Our production database is a single instance enterprise edition with around 20GB of data.

I used following steps to seed the cluster first for testing that it works fine.

On standalone prod DB I took full online backup that did not require any downtime.

neo4j-admin backup --backup-dir=dir-name --name=backup-name
stop all cluster member (mine was one leader + 2 follower core cluster) and unbind them.
sudo systemctl stop neo4j
sudo neo4j-admin unbind
used scp to copy backup on all 3 cluster machines. (maybe someone can suggest if alternative is there)
Seed from backup on all 3 machines
sudo rm -rf /var/lib/neo4j/data/databases/graph.db
sudo neo4j-admin restore --from=dir-name --database=graph.db
Hit an Intresting issue, after backup cluster DB won't start. After lot of efforts figured out that issue was due to changed permission on copied graph.db files. To solve this use following commands:
sudo chown -R neo4j /var/lib/neo4j/data/databases/
sudo chown -R neo4j /var/lib/neo4j/data/cluster-state/
One major doubt was do we need to stop all cluster machine then start backup/seed or it can be done one by one.

david_allen · March 29, 2019, 12:24pm

This is great guidance, thanks for posting it!

On your question 6 -- all machines should be stopped. The thing is, the nodes in the cluster all participate as part of the same cluster. If you are restoring a backup, they need to be in agreement about what is in the data set. If you ever have a situation where the nodes of the cluster have a very different perspective on what is in the graph, you could run into problems.

Neo4j uses the raft consensus protocol, and uses a lot of majority votes. So suppose you unbound the nodes of the cluster, and then 2 of the 3 have the backup dataset. Depending on the situation, you might see that either the majority pushes their updates to the third machine, or you could get some errors. If only one of the three had the backup dataset and the cluster had formed, the one with the good data would be in the minority.

To prevent these problems, the best thing to do is shut down all 3 and unbind them, restore to each, and then bring them all up after the restore is completed. In this way, when they form a cluster, they will already agree on the dataset.

nlaquan · October 9, 2019, 1:33am

Hi. Thankyou for your sharing. Can you share your config for online backup cluster. I am also working on performing a backup with cluster. But I never had it been successfull.

david_allen · October 9, 2019, 3:12pm

github.com

neo-technology/neo4j-google-k8s-marketplace/blob/3.5/backup/README-BACKUP.md

# Backing up Neo4j Containers

This directory contains files necessary for backing up Neo4j Docker containers
to google storage.

See backup.yaml for an example.   

The "credentials.json" file must be a base64-encoded version of a service key JSON that has permissions to write to the targeted google storage bucket.  The example provided is non-functional, and you must substitute your own.  To determine an appropriate value, perform the following:

- Create a service account with appropriate permissions to write to the google
storage bucket
- Save the key in JSON format to your local disk
- `cat my-key.json | base64`
- Use that resulting value in your `backup.yaml` file
- Finally, after adjusting parameters in backup.yaml, run `kubectl apply -f backup.yaml --namespace my-neo4j-deployed-namespace`

nlaquan · October 9, 2019, 4:54pm

Thankyou for your reply. But I am working on cluster without docker/k8s. Following neo4j document, I must use SSL policy. But I dont know how to setup it properly.

david_allen · October 9, 2019, 9:43pm

Please open up a separate thread -- sorry for my misunderstanding about k8s.

In your separate thread, please post what you did and what the error is, or what you're missing and we'll see if we can get someone on it.

Topic		Replies	Views
Backup/Restore Cluster	24	5421	October 16, 2018
Neo4j Causal Cluster Backup & Restore Neo4j Graph Platform	5	826	April 29, 2021
Backup and restore in Neo4j causal cluster (Azure vm) Cluster cypher-shell , browser , operations , backup , enterprise , azure , cluster , vm	1	763	May 7, 2021
Copying a graph.db Cluster	10	4622	September 24, 2018
Cluster not Forming up after database restore Cluster performance , operations	2	477	July 15, 2021

Take the Course Then Join The Aura Agent Hackathon

Neo4j Causal Cluster : Backup Strategies

Related topics

Take the Course Then Join
The Aura Agent Hackathon