Filter nodes by property list using intersection

mansour.mh.ali · January 3, 2024, 8:21pm

Hi guys!
I have nodes of type Subject and Object, with verbs as the relationships between them.
Each verb relationship has a list property called synonyms, e.g. synonyms = ["syn1", "syn2", "syn3"].
I want to get all Subject nodes that are related to an Object by any relationship whose name equals a given verb or one of its synonyms. The verb and its synonyms are passed to the query as a list (I get them from an ontology).
So I need the intersection between the passed list and the relationship's synonyms list.
If the result is not empty, return the Subject.
What is the best way to do that, please?

Something similar to this pseudocode

MATCH (s:Subject)-[r]->(o:Object)
with r
call apoc.coll.intersection(r.synonyms, ["make", "do"]) as res
if res is not empty
RETURN s.name

Thanks in advance

ameyasoft · January 3, 2024, 10:00pm

r.synonyms is not a collection. First you have to collect all distinct r.synonyms 
with collect(distinct r.synonyms) as synonyms
call apoc.coll.intersection(synonyms, ["make", "do"]) as res

glilienfield · January 4, 2024, 1:53am

You can use the 'any' list predicate.

MATCH (s:Subject)-[r]->(o:Object)
WHERE any(i in r.synonyms where i in ["make", "do"])
RETURN s.name

Test data:

create(s:Subject{name:"a"}),
(s)-[:REL{synonyms:['hi','by']}]->(:Object{id:0}),
(s)-[:REL{synonyms:['two','do']}]->(:Object{id:1}),
(s)-[:REL{synonyms:['make','no']}]->(:Object{id:2})
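
The original question also asks to match on the verb itself and to pass the list into the query. A minimal sketch of that, assuming the verb is stored as the relationship type and that the values arrive as query parameters named $verb and $synonyms (both names are made up for illustration):

```cypher
// Assumption: $verb is a string and $synonyms is a list of strings,
// both supplied by the caller; the parameter names are illustrative only.
// Assumption: the "name" of the relationship is its type, hence type(r).
MATCH (s:Subject)-[r]->(:Object)
WHERE type(r) = $verb
   OR any(syn IN r.synonyms WHERE syn IN $synonyms)
// DISTINCT collapses repeats when several relationships of the same Subject match
RETURN DISTINCT s.name
```

Against the test data above, the query without DISTINCT returns "a" twice, because two of the three relationships have a matching synonym. If the verb lives in a property instead of the relationship type, the first condition would need to compare that property instead.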

glilienfield · January 4, 2024, 7:57am

In case you prefer the APOC approach, you can try this. Note that apoc.coll.intersection is a function, not a procedure, so it is not invoked with CALL.

MATCH (s:Subject)-[r]->(o:Object)
WHERE size(apoc.coll.intersection(r.synonyms, ["make", "do"])) > 0
RETURN s.name
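
To see that both predicates agree, here is a small check that needs no graph data. It is only a sketch: it assumes APOC is installed and reuses the three synonym lists from the test data as literals.

```cypher
// Evaluate both predicates on each literal list from the test data.
UNWIND [['hi','by'], ['two','do'], ['make','no']] AS synonyms
RETURN synonyms,
       // true as soon as one element is found in the passed list
       any(i IN synonyms WHERE i IN ["make", "do"]) AS matchesAny,
       // true when the computed intersection has at least one element
       size(apoc.coll.intersection(synonyms, ["make", "do"])) > 0 AS matchesIntersection
```

Both columns should be false for ['hi','by'] and true for the other two lists.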

mansour.mh.ali · January 4, 2024, 8:29am

Thank you very much, you have saved me a lot of time and effort

mansour.mh.ali · January 4, 2024, 8:46am

I am still learning while creating my project. Please, according to your experience, which of the two solutions is better in terms of speed and computational complexity, assuming that the size of the data is much larger?

glilienfield · January 4, 2024, 11:56am

I would expect the list predicate approach to be faster, because any() stops searching as soon as one element of the first list is found in the second list. Computing the intersection requires comparing all elements of one list against the other, so the algorithm keeps going after the first common element is found instead of stopping.

Another benefit is you do not have a dependency on the APOC library.

If it is known a priori, set the smaller list as the one that the list predicate iterates over. That may help performance if there is a large difference in the size of the two lists.
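
As a sketch of that last point, assuming the passed list is the smaller one, the two lists in the earlier any() query simply swap roles:

```cypher
// Iterate over the passed list (assumed to be the smaller one)
// and probe the relationship's synonyms list for each of its elements.
MATCH (s:Subject)-[r]->(o:Object)
WHERE any(v IN ["make", "do"] WHERE v IN r.synonyms)
RETURN s.name
```

The result is the same either way, since both forms test whether the two lists share at least one element. Only the list being iterated changes.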

mansour.mh.ali · January 4, 2024, 12:40pm

Thank you very much @glilienfield for your response
