Create/Merge getting slower over time

infraplataforma · April 12, 2024, 1:40am

driver = GraphDatabase.driver(...,
auth=(...))
session = driver.session()
chunk = chunk.drop_duplicates()
for start in range(0, len(chunk), 500):
df_temp = chunk[start:start + 500].copy()
driver.execute_query(item[1], data=df_temp.to_dict('records'))
driver.close()

over time, the operations of create and merge are getting slower. does anyone know the reason? it starts fast but becomes slow over time.

dana_canzano · April 12, 2024, 1:44am

@infraplataforma

Any details relative to what version of Neo4j?

also and per MERGE - Cypher Manual

For performance reasons, creating a schema index on the label or property is highly recommended when using MERGE. See Create, show, and delete indexes for more information.

are there indexes to support the merge?

infraplataforma · April 12, 2024, 1:54am

im working with auradb and yes, i use constraints and indexes and i tried with merge and create, but it doesn't work well. all options are getting slow over time.

dana_canzano · April 12, 2024, 2:00am

@infraplataforma

Aura. And thus Neo4j v5? i presume?

if you preface the MERGE with PROFILE do you see the index being used?

infraplataforma · April 12, 2024, 2:11am

correct! and yes, the index is being used

dana_canzano · April 12, 2024, 2:12am

@infraplataforma

are you able to share the query plan?

infraplataforma · April 12, 2024, 4:00pm

Planner COST

Runtime PIPELINED

Runtime version 5.19

Batch size 128

+---------------------+----+--------------------------------------------------------------+----------------+------+---------+----------------+------------------------+-----------+---------------------+
| Operator | Id | Details | Estimated Rows | Rows | DB Hits | Memory (Bytes) | Page Cache Hits/Misses | Time (ms) | Pipeline |
+---------------------+----+--------------------------------------------------------------+----------------+------+---------+----------------+------------------------+-----------+---------------------+
| +ProduceResults | 0 | | 1 | 0 | 0 | 0 | | | |
| | +----+--------------------------------------------------------------+----------------+------+---------+----------------+ | | |
| +EmptyResult | 1 | | 1 | 0 | 0 | | | | |
| | +----+--------------------------------------------------------------+----------------+------+---------+----------------+ | | |
| +Create | 2 | (a)-[r:HAS_PARTNER_RELATION {type: $autoint_2}]->(b) | 1 | 1 | 3 | | | | |
| | +----+--------------------------------------------------------------+----------------+------+---------+----------------+ | | |
| +MultiNodeIndexSeek | 3 | RANGE INDEX a:Company(document) WHERE document = $autoint_0, | 1 | 0 | 0 | 240 | 4/5 | 34.648 | Fused in Pipeline 0 |
| | | RANGE INDEX b:Company(document) WHERE document = $autoint_1 | | | | | | | |
+---------------------+----+--------------------------------------------------------------+----------------+------+---------+----------------+------------------------+-----------+---------------------+

Total database accesses: 3, total allocated memory: 304

im working with millions nodes but the example is with two only

Topic		Replies	Views
Neo4j merge performance optimization Cypher performance , merge	4	2031	March 26, 2020
Neo4j 5.18.1: massive performance hit compared to 4.4.32 Neo4j Graph Platform performance	4	517	April 8, 2024
Make run the Query in multiple therads Cypher	13	211	November 24, 2024
Improving very slow MERGE on relationship Cypher	11	2321	March 24, 2022
Merge relationship dynamically with variable label is very slowly sometimes Procedures & APOC apoc , performance , cypher	17	1281	March 11, 2021

Create/Merge getting slower over time

Related topics