Noe4j Performance Impact with deletion

jalesar2011 · November 21, 2018, 9:14am

My graph contains 2.5 Milion nodes.
Current graph size is 58G.
Machine RAM = 24G.
We need to do the cleanup of DB. We have to delete 10 Millions of nodes with their attached relations. As documented graph size (disk) will not reduce because of the re-use of ids.
Will this deletion improve query performance (Read/Write).
Any suggestions for running store utils for compacting DB on production systems.

michael.hunger · November 21, 2018, 10:27am

Actually re-use will allow you to update your graph without changing of graph sizes.
So your graph shouldn't grow if you are deleting/adding nodes over the course of a day.

If you do huge operations, you'd have to wait for or change the grace period for id-reuse.

Compaction will help with disk and memory use, esp. if you have fragementation and partially filled pages.

You can run store-utils on a copy/backup and then test it afterwards.

jalesar2011 · November 21, 2018, 3:36pm

Hi,

Number of nodes to be deleted are more than 40%. so deletion is the only way for cleanup.
We are setting up casual cluster, How should we run store-utils for compaction on production ?
Is there any chances of data mismatch in case of cluster (Replica or core) ?
And will only deletion is sufficient to reduce the RAM used by db or compaction will help in this?

Thanks

michael.hunger · November 23, 2018, 12:45am

I honestly think that you're best off with deletion and record-reuse.

You can reduce the grace period to something lower than 1hr, e.g. 1 to 10 minutes.

Then your the records marked as unused for deleted node and relationship records will be reused for the new data.

jalesar2011 · November 23, 2018, 8:05am

Hi Michael,

I didn't get the concept of the grace period. is it a config to delete automatically deleted ids.
After deletion of data, will it use less memory (RAM) because graph size is the same?

Thanks

michael.hunger · November 26, 2018, 2:39am

It does never delete records, just reuse them.
The grace period is for reuse.

Topic		Replies	Views
Deleting records does not reduce size of database Neo4j Graph Platform performance	15	3150	July 26, 2020
Reducing the size of a database by removing node properties Neo4j Graph Platform	5	623	April 12, 2021
Large Delete Transaction Best Practices in Neo4j Operations cypher , garbage-collection , heap , memory , knowledge-base , transaction	0	1246	August 23, 2018
How deletes work in Neo4j Neo4j Graph Platform migrated	3	171	June 23, 2022
How deletes work in Neo4j Operations knowledge-base , delete , disk , storage	0	990	August 24, 2018

Noe4j Performance Impact with deletion

Related topics