Load CSV taking excessive amount of time

henry007 · February 22, 2023, 6:03pm

Hello, my LOAD CSV query is taking a very long time, despite having indexes on my columns and a unique constraint on PersonId.

It sometimes takes over 5 minutes for a CSV file with 27k records that is just under 3MB.

Here is my query:

LOAD CSV WITH HEADERS FROM ('myurl.csv') AS row 
MERGE (n:Person {name: row.PersonId }) 
SET n += {EmailAddress: row.EmailAddress, PhoneNumber: row.PhoneNumber}
RETURN true

I have also tried ON CREATE SET with no performance improvement.

Can anyone help me figure out what is causing it to take so long?

Thank you!

lyonwj · February 22, 2023, 6:31pm

Hi @henry007 -

Be sure that your uniqueness constraint has been created for the correct node property. Given the Cypher statement you shared the uniqueness constraint should be on :Person(name) (not PersonId).

You can confirm the index / constraint is being used by adding PROFILE to the beginning of your Cypher query and examining the query plan for usage of the index.

ON CREATE SET will improve performance by not setting property values for nodes that already exist (when the MERGE statement matches instead of creating).

henry007 · February 22, 2023, 6:37pm

Hello, creating the constraint on :Person(name) fixed my issue! Thank you so much!

Topic		Replies	Views
LOAD CSV taking time Import / Export cypher , import	6	702	September 25, 2021
Slow load_csv Cypher cypher	4	2039	July 29, 2019
hello guys, I have a problem when I want to load this query, it takes a very long time, up to 2 hours, does anyone know why? Cypher performance , cypher , operations , import	2	89	June 6, 2024
CSV import issue Import / Export	26	702	June 21, 2023
Import query runtime grows exponentially Cypher cypher , load-csv , import	2	310	March 21, 2023

Get Certified in June!

Load CSV taking excessive amount of time

Related topics