Which is better, more relationship types or less?

rich2 · April 7, 2022, 12:11pm

Hi all,

Newbie question here about modeling so thanks in advance for any help. I'm building an ontology laying out our company's information and I have what I think is a simple question but I can't convince myself to go one way or the other.

Here is the graph from the GraphGists Fraud Detection article.

My question is around the "HAS_*" relationships. Why is having "HAS_CREDITCARD", "HAS_BANKACCOUNT", and "HAS_ADDRESS" better than simply having a single "HAS" type that points to things like CREDITCARD, BANKACCOUNT, and ADDRESS?

In other words, why the specificity when I can easily tell from the associated node type?

I can certainly understand the need for the additional relationship types if there are properties on the relationship that make it specific to the end node type. For example, if the HAS_BANKACCOUNT has a property on it that identifies the branch or something associated with the link.

But in general, is it better to have more specific or more general relationships if possible?

Thanks for your help,
Rich'

glilienfield · April 7, 2022, 1:01pm

One reason I can think off is performance. Let’s say you want to know the number of bank accounts a person has. You can write the following query (ignoring the ‘where’ condition to get the specific bank holder):

match(n:AccountHolder)-[r:HAS_BANK_ACCOUNT]->()
return count(r)

If you used only ‘HAS’ relations, you would need to specify an bank account node on the other end as follows:

match(n:AccountHolder)-[r:HAS]-(:BankAccount)
return count(r)

So what is the different? In the first case, only the relation types of the account holder entity need to be interrogated to find the HAS_BANK_ACCOUNT relationships for the count. In the latter case, the endNode of each relationship needs to be interrogated to determine if the node has a label equal to BankAccount. This is more processing and data retrieval.

This is true for any complex match pattern, where the traversal algorithm can determine which relationships of a node to consider by their type, and not have to interrogate the end nodes of each to determine which paths to include when traversing to obtain the matching paths.

It can also make the code more understandable. Let’s assume you want the account holders that have a bank account, you could write the following queries for each scenario:

match(n:AccountHolder)
where exists( (n)-[:HAS_BANK_ACCOUNT]->() )
return n

Instead of

match(n:AccountHolder)
where exists( (n)-[:HAS]->(:BankAccount) )
return n

Of the two, I think performance would be the reason.

rich2 · April 7, 2022, 2:27pm

Thanks very much, Gary. Seems very logical and really shows the advantages of the graph model.

Topic		Replies	Views
Is it better to have many different relationship types or one relationship with properties? Cypher performance	10	8456	January 23, 2020
Specific relationship vs Label Neo4j Graph Platform migrated	3	275	August 26, 2022
Max Number of relationships to a node - Best Modelling Modeling performance , cypher	2	3773	November 12, 2019
What is the optimal number of specific relationship types to use in Neo4j without negatively impacting performance? Neo4j Graph Platform	3	117	June 4, 2025
Filtering by Relationship Type - Contains Neo4j Graph Platform performance , cypher , modeling , data-modeling	5	2302	May 9, 2020

Take the Course Then Join The Aura Agent Hackathon

Which is better, more relationship types or less?

Related topics

Take the Course Then Join
The Aura Agent Hackathon