Head's Up! Site maintenance this Wednesday, February 1. Disruptions expected as we migrate the forums.
β06-03-2021 09:32 AM
Hello,
I've been reloading the same dataset to the same database repeatedly (for the purpose of optimizing my data pipeline) and observed that the store size has been always growing. What is the reason of this and what can I do to release the space? I have an database admin role but do not have access to server configuration.
Thanks in advance.
β06-03-2021 10:30 AM
Can you provide details of
a. what version of Neo4j?
b, how are you determining store size? A simple size of the OS directory? running :sysinfo
from the browser? from some jmx metric? or
β06-03-2021 11:39 AM
Hi there,
Thanks for replying. I'm using Neo4j Enterprise server 4.2.5 and determined the store size from Neo4j Browser using :sysinfo
Thanks
β06-03-2021 11:43 AM
@sabrina.liu thanks for this detail. although there is not enough detail to know what reloading the same dataset
means but if it involves deletes
then you may have encountered Understanding Database Growth - Knowledge Base
β06-03-2021 11:49 AM
You may also want to check if transaction logs are contributing to the size you're seeing. These build over time, I think the default is 7 days of tx logs. You can adjust the retention settings to keep logs for a lesser amount of time, or to keep only up to a certain limit.
β06-08-2021 09:24 PM
The growth is transaction logs. They will rotate out and delete according to the transaction log rotation policy in the neo configs. Transaction logs contain the data youβre sending in the database. So even if you are throwing the same data at it to test cypher or changing dates to ingest large amounts of data, the tx logs will grow. But rotation policy will prune out the logs based on the set schedule. And the DB size or Hdd usage will go down.
All the sessions of the conference are now available online