Apoc.periodic.commit within python/pandas help

FourMoBro · July 22, 2022, 5:30pm

I am looking for the correct syntax to help me load data into Neo4j, in particular using the periodic commit ability when loading from a python/pandas dataframe. My general workflow is as follows:

1. Within a Jupyter notebook, I load the 1M+ line tab-delimited text file into a dataframe.
2. Clean the data.
3. Create a smaller dataframe to be used as the input parameter for a function.
4. Run the function.

In general my functions look like this:

def add_data(df1):
    query = """
    UNWIND $rows as row
    MERGE
    SET
    RETURN COUNT(*) as total
    """
    return conn.query(query, parameters = {'rows':df1.to_dict('records')})

columns =
df1 = pd.DataFrame(df[columns])
df1 = df1.explode(columns).drop_duplicates()
add_data(df1)

This works great for creating nodes and relationships when the total count is under 1000, but when there are 1M+ nodes/relationships, it tends to not finish.

I know there are server parameters in neo4j.conf that can be adjusted, which may help with the load. I know I can save the dataframe to CSV and load it from disk with USING PERIODIC COMMIT. I know I can split my dataframe and process the pieces in a for loop from within Python. But I don't want to go those routes. I want to get apoc.periodic.commit to work within the add_data function.
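For illustration, here is a rough sketch of server-side batching from a `$rows` parameter. Note that it swaps in `apoc.periodic.iterate` rather than `apoc.periodic.commit`: the latter reruns a single statement in separate transactions until it returns 0, which does not map naturally onto an UNWIND over a parameter list. The `Item` label, the `id` and `name` properties, and the helper names are hypothetical (the MERGE and SET clauses above are elided), and `conn` is assumed to be the same connection helper used in `add_data`.

```python
# Hypothetical per-row action; replace with the real MERGE/SET clauses.
EXAMPLE_ACTION = "MERGE (n:Item {id: row.id}) SET n.name = row.name"


def build_iterate_query(action: str) -> str:
    """Wrap a per-row Cypher action in a call to apoc.periodic.iterate.

    The first statement streams rows from the $rows parameter; the second
    statement (the action) runs once per row and is committed in batches.
    """
    if "'" in action:
        # The action is embedded in a single-quoted Cypher string below,
        # so single quotes inside it would break the query.
        raise ValueError("use double quotes inside the action statement")
    return (
        "CALL apoc.periodic.iterate(\n"
        "  'UNWIND $rows AS row RETURN row',\n"
        f"  '{action}',\n"
        "  {batchSize: $batch_size, parallel: false, params: {rows: $rows}}\n"
        ")\n"
        "YIELD batches, total, errorMessages\n"
        "RETURN batches, total, errorMessages"
    )


def add_data_batched(conn, records, action=EXAMPLE_ACTION, batch_size=10_000):
    """Send all records once and let APOC commit them in batches.

    `conn` is assumed to expose query(query, parameters=...) as in add_data.
    """
    query = build_iterate_query(action)
    return conn.query(
        query, parameters={"rows": records, "batch_size": batch_size}
    )
```

With a dataframe, this would be called as `add_data_batched(conn, df1.to_dict('records'))`. The full list of rows is still sent to the server in one request; only the commits are split into batches.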

I have tried several iterations in an attempt to get it to work, but to no avail. I am hoping the community can help.

Thanks in advance.

bennu_neo · July 23, 2022, 8:08pm

Hi @FourMoBro,

Quick question: what does your MERGE statement look like? Do you have an index on the properties it uses?
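The index matters because a MERGE on an unindexed property generally has to scan every node with that label for each incoming row, which gets slow quickly at 1M+ rows. As a minimal sketch, assuming Neo4j 4.x or later syntax and a hypothetical `Item` label with an `id` property, the statement could be built and run once before the load like this (`build_index_statement` is an illustrative helper, not part of any library):

```python
def build_index_statement(label: str, prop: str, name: str = "") -> str:
    """Build a CREATE INDEX statement for a single-property index.

    The label and property are placeholders for whatever the MERGE
    statement actually matches on.
    """
    index_name = name or f"{label.lower()}_{prop}"
    return (
        f"CREATE INDEX {index_name} IF NOT EXISTS "
        f"FOR (n:{label}) ON (n.{prop})"
    )


# Hypothetical example: index the property used as the MERGE key.
statement = build_index_statement("Item", "id")
```

The resulting string can be passed to the same `conn.query(...)` helper used for the load.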

Regards
