Slow Cypher Query Help Needed!

gavin.ooi · January 15, 2020, 3:53pm

Hi,

I apologise if this question has already been asked somewhere, but I've been looking for a while and can't seem to find out why my query is so slow.

I am trying to filter a subgraph with a specific start and end point, with certain restrictions on the connecting relationships. My graph is not particularly large (around 40 nodes, 270 relationships). The graph has several parallel edges, but I don't think that would be an issue for Neo4j.

This is my query:

MATCH paths = (from:COVERAGEAREA{name:'SGPUCAALL'})-[:CONNECTED_TO*]->(to:COVERAGEAREA{name:'PH - Luzon'})
WHERE  ANY (r IN relationships(paths) WHERE NOT 'UA' IN r.restrictedMerchants AND r.paymentType in ['Both', 'prepaid'])
RETURN paths

philiprichardjames · January 15, 2020, 4:21pm

Have you tried using PROFILE to see which part of the query is slow? Just checking what you are trying to do as well: is it any path (no maximum hop limit) between those two nodes, as long as the paths don't use an edge where restrictedMerchants is UA and the payment type is Both/prepaid?
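
For reference, a minimal sketch of how the plan can be inspected. EXPLAIN returns the planned operators without running the query, which helps when the query never finishes; PROFILE runs the query and reports rows and db hits per operator. The upper bound of 4 hops in the second statement is an assumed value, added here only so that PROFILE has a chance to return; it is not part of the original query.

```cypher
// Show the plan only; the query is not executed.
EXPLAIN
MATCH paths = (from:COVERAGEAREA {name: 'SGPUCAALL'})-[:CONNECTED_TO*]->(to:COVERAGEAREA {name: 'PH - Luzon'})
RETURN paths;

// Execute the query and report rows and db hits per operator.
// The *..4 upper bound is an assumption, not from the original post.
PROFILE
MATCH paths = (from:COVERAGEAREA {name: 'SGPUCAALL'})-[:CONNECTED_TO*..4]->(to:COVERAGEAREA {name: 'PH - Luzon'})
RETURN count(paths);
```

Raising the bound one hop at a time and watching the row count on the variable-length expand operator should show how quickly the number of paths grows.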

gavin.ooi · January 15, 2020, 4:30pm

Hi Philip, thanks for your help. Yes, you are mostly correct: I'm trying to get the subgraph between two points, as long as the relationship's payment type is either Both or prepaid and UA is not in its restrictedMerchants.

I have tried prepending PROFILE to the query, but the query itself always crashes without a result.

Edit:
The relationships between nodes represent time slots, with around 4 slots between each pair of nodes. I have reduced the slots/relationships between nodes to 1 and was able to get a result back.

When I increase it to 2, the profile shows what seems like a lot of hits to the db. Is there any way my query can be further optimized?

philiprichardjames · January 15, 2020, 5:17pm

Hmmmm, hard to tell, I will have a think. Have you indexed the coverage area nodes on name? You may want to change your data model to have the slots as nodes, and then you can either have nodes for restricted merchants, payment types, etc., or keep them as properties on the slot nodes and index them that way. That could potentially affect performance. That is one frustration I have had with Neo4j in general: you can't index relationship properties, as far as I'm aware. But to be fair, normally I realise I'm trying to fight a non-ideal data model.
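
A sketch of the index on name suggested above. The syntax depends on the Neo4j version, so only the statement matching the installed version should be run; the index name coveragearea_name is a made-up example.

```cypher
// Neo4j 3.x syntax.
CREATE INDEX ON :COVERAGEAREA(name);

// Neo4j 4.x and later syntax (the index name is hypothetical).
CREATE INDEX coveragearea_name FOR (n:COVERAGEAREA) ON (n.name);
```

Newer releases (Neo4j 4.3 and later) also accept relationship property indexes, for example `CREATE INDEX FOR ()-[r:CONNECTED_TO]-() ON (r.paymentType)`. These mainly help when a relationship is the starting point of a match, so they are unlikely to change the cost of filtering inside a variable-length expansion.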

philiprichardjames · January 16, 2020, 10:12am

Yeah, thinking about it now, you are really going to struggle, as it's basically hitting all the relationships from the nodes and then hitting them again to filter. Have you tried using any of the APOC path procedures to help? See the Neo4j APOC Procedures User Guide.
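
A minimal sketch of what an APOC path expander call might look like for this case, assuming the APOC plugin is installed. The maxLevel of 4 and the NODE_PATH uniqueness are assumptions chosen to keep the number of expanded paths down. The expander filters on relationship type and direction, not on relationship properties, so the property check is still applied afterwards. ALL is used on the assumption that every relationship on a path must pass the check, as described in the thread; the original query uses ANY, which only requires one matching relationship per path.

```cypher
MATCH (from:COVERAGEAREA {name: 'SGPUCAALL'}), (to:COVERAGEAREA {name: 'PH - Luzon'})
CALL apoc.path.expandConfig(from, {
  relationshipFilter: 'CONNECTED_TO>',  // follow outgoing CONNECTED_TO only
  minLevel: 1,
  maxLevel: 4,                          // assumed hop limit
  terminatorNodes: [to],                // return paths that reach `to` and stop expanding there
  uniqueness: 'NODE_PATH'               // a node may appear only once within a path
})
YIELD path
// Property filter applied to the returned paths.
WHERE ALL(r IN relationships(path)
          WHERE NOT 'UA' IN r.restrictedMerchants
            AND r.paymentType IN ['Both', 'prepaid'])
RETURN path;
```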

gavin.ooi · January 16, 2020, 11:19am

Hi Philip, thanks so much for the help! Unfortunately I'm unable to convert the slots to nodes, as we need to perform pathfinding between other nodes using these links. I have limited the number of hops per path and this has improved performance slightly.

Will definitely give APOC and indexing a try. Thanks!
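
The hop limit mentioned above can be sketched in plain Cypher as follows. The bound of 4 is an assumed value, and ALL replaces the ANY of the original query on the assumption that every relationship on a path has to satisfy the restrictions.

```cypher
// *..4 caps the path length at 4 hops (assumed value).
MATCH paths = (from:COVERAGEAREA {name: 'SGPUCAALL'})-[:CONNECTED_TO*..4]->(to:COVERAGEAREA {name: 'PH - Luzon'})
WHERE ALL(r IN relationships(paths)
          WHERE NOT 'UA' IN r.restrictedMerchants
            AND r.paymentType IN ['Both', 'prepaid'])
RETURN paths;
```

With a predicate of this ALL form, the planner may be able to check each relationship during the expansion and drop a path as soon as one relationship fails, rather than building every path first and filtering at the end. Whether that happens can be checked with EXPLAIN.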

philiprichardjames · January 16, 2020, 11:25am

No problem. Yeah, I don't think indexing will help much in this situation, as it is normally helpful for finding the start of your path, and that doesn't seem to be the cost here. Good luck!
