It takes too long "without a response" while Retrieving all Routes between two nodes for number of hops exceeding 30

mohamed_elzonko · July 24, 2023, 12:51pm

Hi,
The below query takes too much time "without a response" while Retrieving all Routes between two nodes for number of hops exceeding 30.

query = (f"""MATCH (from{{nodeName:"{srcNode}"}}),(to{{nodeName:"{dstNode}"}}),path=(from)-[:ots*1..{hops}]-(to)
WHERE NONE (n IN nodes(path) WHERE size([x IN nodes(path) WHERE n = x]) > 1 )
AND NOT NONE (n IN nodes(path) WHERE n.nodeName IN {includes})
AND NONE (n IN nodes(path) WHERE n.nodeName IN {excludes})
RETURN DISTINCT path AS shortestPath,
reduce(distance = 0, r in relationships(path) | distance + toInteger(r.distance)) AS totalDistance
ORDER BY totalDistance ASC LIMIT {limit}""")

Please support

bennu_neo · July 24, 2023, 2:02pm

Hi @mohamed_elzonko ,

This looks like a very specific traversal query. Without the second where condition AND NOT NONE (n IN nodes(path) WHERE n.nodeName IN {includes}), and if list of excludes names is not too big. (And assuming APOC installed) This query should help (notice labelWithAnIndexOnNodeName):

MATCH (from:labelWithAnIndexOnNodeName{{nodeName:"{srcNode}"}}),(to:labelWithAnIndexOnNodeName{{nodeName:"{dstNode}"}})
WITH from, to
MATCH(exc:labelWithAnIndexOnNodeName)
WHERE exc.nodeName in {excludes}
WITH from, to, collect(exc) as blk
CALL apoc.path.expandConfig(from, {
    minLevel : 1,
    maxLevel :{hops},
    blacklistNodes : blk

}) yield path
WITH path
//WHERE NOT NONE (n IN nodes(path) WHERE n.nodeName IN {includes})
RETURN DISTINCT path AS shortestPath,
reduce(distance = 0, r in relationships(path) | distance + toInteger(r.distance)) AS totalDistance
ORDER BY totalDistance ASC LIMIT {limit}

For a better traversal handling (if tuning on apoc path is not enough) you may like to check on https://neo4j.com/docs/java-reference/current/traversal-framework/

Bennu

mohamed_elzonko · July 24, 2023, 2:28pm

Thank you for your prompt reply. May I know the max number of nodes? //"list of excludes names is not too big"//

Regards,
Mohamed

bennu_neo · July 24, 2023, 2:30pm

Hi @mohamed_elzonko ,

There's no limit. But if you check the query I shared, you will collect nodes, so you will hydrate them in memory. If they are 3, a couple of Bytes will be needed on heap, if they are millions, you can run on memory issues.

Consider the usage of a label with and index on the name too.

mohamed_elzonko · July 24, 2023, 2:31pm

Great. I will try it and let u know. Thank you so much. Really appreciated!

mohamed_elzonko · July 24, 2023, 8:27pm

Hi Bennu,
I actually tried the query but it didn't return anything. I made sure that the data (node names) are valid and there are possible routes reverted after using the first query, but your query doesn't give any response. Would you please have a look maybe something is missing?

//
MATCH (from:labelWithAnIndexOnNodeName{nodeName:"ETSLEACJ-2PS"}),(to:labelWithAnIndexOnNodeName{nodeName:"20500205-1PSX"})
WITH from, to
MATCH(exc:labelWithAnIndexOnNodeName)
WHERE exc.nodeName in
WITH from, to, collect(exc) as blk
CALL apoc.path.expandConfig(from, {minLevel : 1,maxLevel :60,blacklistNodes : blk}) yield path
WITH path
RETURN DISTINCT path AS shortestPath,
reduce(distance = 0, r in relationships(path) | distance + toInteger(r.distance)) AS totalDistance
ORDER BY totalDistance ASC LIMIT 7
//

bennu_neo · July 25, 2023, 7:34am

Hi,

As stated on my previous messages, labelWithAnIndexOnNodeName is a place holder for an existence label on your model. You should replace it with an indexed label.

Topic		Replies	Views
Extremely slow Cypher query to generate all paths b/w 2 nodes Cypher	1	264	March 30, 2022
Shortestpath query is taking long time Cypher performance , cypher	5	52	April 11, 2025
Difficult query not working Cypher operations	5	1125	May 21, 2019
What is the most efficient way to traverse a path? Cypher performance	9	1198	June 3, 2021
Optimizing apoc.path.spanningTree and other cypher related isuue Procedures & APOC apoc , performance , cypher , subquery	1	36	February 28, 2025

It takes too long "without a response" while Retrieving all Routes between two nodes for number of hops exceeding 30

Related topics