How to save the result of a query (sub graph) in cypher

abhishekjayant1 · August 26, 2021, 8:09am

Hi all, this is my query:

match(n)-[r]->(m) where not m.name in ['JANSSEN','PFIZER\BIONTECH','ANOFI PASTEUR','Hospital','Recovered'] and not n.name in ['JANSSEN','PFIZER\BIONTECH','ANOFI PASTEUR','Hospital','Recovered'] return n,r,m

The result of the query will be a subgraph. How can I save this subgraph for future use or use it in some other query?

Thanks in advance

akollegger1 · August 26, 2021, 12:02pm

Hi @abhishekjayant1 ,

Unfortunately there's no way to save a sub-graph in Neo4j itself. While this may possible some day when multi-graph handling has become available, today you have to manage this manually.

There are two main approaches:

mark the query results with a label on the nodes, and a special property on the relationships
save the graph on the client-side, using it as the starting point for subsequent queries

Neo4j Bloom, for example, uses the second approach.

Hope that helps.

Best,
ABK

ameyasoft · August 26, 2021, 6:55pm

Here is a solution that I used, Collected the ids of nodes, created a node and added the ids list as a property. 

Step: 1
match(n)-[r]->(m) where not m.name in ['JANSSEN','PFIZER\BIONTECH','ANOFI PASTEUR','Hospital','Recovered'] and not n.name in ['JANSSEN','PFIZER\BIONTECH','ANOFI PASTEUR','Hospital','Recovered'] 
with collect(distinct id(n)) as n1, collect(distinct id(m)) as m1
with apoc.coll.union(n1, m1) as res1
with apoc.coll.sort(res1) as final

merge (a:SubGraph {name: "xyz", ids: final})

Step 2:

match (c:SubGraph) where c.name = "xyz"
with c.ids as sub1
unwind sub1 as sub2
with collect(sub2) as s1
match(x) where id(x) in s1
return x

Displays the subgraph. Another option is to use apoc.export.json.query to create a json file.

abhishekjayant1 · August 26, 2021, 7:14pm

Hey @abk, thanks for your reply. The first method is easy but also naive. Can you pls explain the 2nd method in some detail? Would appreciate the help :)

abhishekjayant1 · August 26, 2021, 7:15pm

Hey @ameyasoft , thanks a lot for the reply. I didn't think of this way, will definitely help me out.

dan.flavin_enterpris · August 30, 2021, 9:28pm

Abhishek - you might be able to use the function apoc.refactor.cloneSubgraphFromPaths.

match path=(n)-->(m) where not m.name in ['JANSSEN','PFIZER\BIONTECH','ANOFI PASTEUR','Hospital','Recovered'] and not n.name in ['JANSSEN','PFIZER\BIONTECH','ANOFI PASTEUR','Hospital','Recovered'] 
WITH path
CALL apoc.refactor.cloneSubgraphFromPaths([path], {}) 
YIELD input, output, error
RETURN input, output, error;

You will have to have some way of identifying the original nodes from the existing nodes found in the path. See the documentation for some ideas. Or put in a property that indicates "original data" and then skip that property when cloning

ameyasoft · August 31, 2021, 5:04am

The problem here is not cloning, but saving the query results for subsequent reproduction as and when required. Many users use the database and run their own anlaytics and want to save the query results,

dan.flavin_enterpris · March 3, 2022, 2:35am

Only replying here because there might be for those who stumble across this later.

There's no problem if the data in your graph never changes once it's loaded. But, be careful here for two reasons:

1 - internal node id's returned by the id() function are not stable outside of a transaction. They can be reused. See the docs here. Node with internal id of 100 may not be the same node later as id's get reused.

2 - The above does not take into account relationships. You might be making some assumptions based on the way the Neo4j browser behaves when you have the "connect result nodes" option enabled. It will prefetch and return data "in the neighborhood" as a convenience feature. See what happens when you disable for the above.

Solution would be to create your own uuid for each node and relationship, then use something similar to above to recreate the results. Of course if anything in your db changes (e.g. property value on a returned node, removed or added nodes / relationships, etc.) you will not be able to recreate what was seen in the original query. The state and existence of nodes and relationships are not preserved.

vinodpahuja · November 8, 2024, 6:52am

Good Approach,
For me this also worked

MATCH (n {name: 'M10-1'})-[r:CALLS *]->(m)
WITH collect(id(m)) as cg
MATCH (n1)-[r]->(m1)
WHERE id(m1) in cg
RETURN n1,r,m1

malsharafi · November 20, 2024, 11:15am

can't you reduce the query and eliminate the step of:
"
unwind sub1 as sub2
with collect(sub2) as s1
"
just keep:
match(x) where id(x) in sub1
?

Topic		Replies	Views
How to save the result of a query (sub graph) in cypher Neo4j Graph Platform migrated	1	95	July 26, 2022
Facing difficulties to create a subgraph from original one in NEO4j Cypher networkx	15	1918	November 19, 2019
Querying in cypher a subgraph from my DB Cypher cypher , operations	7	147	January 25, 2025
Issue of Performance in Generating our Graphs Cypher apoc , performance , plugin	2	76	November 21, 2024
Copy Subgraph from one neo4j server to another neo4j server Neo4j Graph Platform apoc , cypher , operations	1	328	August 13, 2021

How to save the result of a query (sub graph) in cypher

Related topics