I am relatively new to Neo4j and graph databases. Maybe someone could help and steer me in the right direction.
In general, we need to classify nodes with multiple labels according to certain criteria/rules, in order to build a normalized reasoning mechanism between node classes. Between the classified nodes there will be weighted edges.
Example:
Node A has classes/labels A' and A''; node B has only class B'.
In our knowledge domain:
- A' -> links to, weight = 0.8 -> B'
- A'' -> links to, weight = 0.4 -> B'
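To make this concrete, here is a minimal Cypher sketch of the example above (the label, relationship type, and property names are just placeholders we made up):

```
// Node A carries two class labels, node B carries one; the weighted edges
// between the classified nodes record which class pair they belong to.
CREATE (a:Item:APrime:ADoublePrime {name: 'A'}),
       (b:Item:BPrime {name: 'B'}),
       (a)-[:LINKS_TO {via: "A' -> B'",  weight: 0.8}]->(b),
       (a)-[:LINKS_TO {via: "A'' -> B'", weight: 0.4}]->(b);
```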
Our issues/questions
How can we normalize path lengths or manually assigned weights for our reasoning approach? By normalize we mean mapping them onto the range [0, 1].
Maybe we could also employ a kind of cluster similarity between classes. But then, for each class, we would need to compute its similarity to every other class, which we suspect would be slow.
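For illustration, the kind of all-pairs comparison we mean would look roughly like this in plain Cypher (Jaccard similarity over shared member nodes; all names are made up, and the query touches every pair of classes, which is exactly our performance worry):

```
// Pairwise Jaccard similarity between classes, based on the nodes they share.
MATCH (c1:Class), (c2:Class)
WHERE c1.name < c2.name
OPTIONAL MATCH (c1)<-[:CLASSIFIED_AS]-(shared)-[:CLASSIFIED_AS]->(c2)
WITH c1, c2, count(DISTINCT shared) AS overlap
MATCH (c1)<-[:CLASSIFIED_AS]-(m1)
WITH c1, c2, overlap, count(DISTINCT m1) AS size1
MATCH (c2)<-[:CLASSIFIED_AS]-(m2)
WITH c1, c2, overlap, size1, count(DISTINCT m2) AS size2
RETURN c1.name AS classA, c2.name AS classB,
       toFloat(overlap) / (size1 + size2 - overlap) AS jaccard
ORDER BY jaccard DESC;
```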
Maybe someone who had to do a similar thing can give me some thoughts or point me to some resources. That would be very much appreciated.
You can have multiple labels on a node. For your case, maybe you have 'A' and 'B' labels (and another classification). You can then mark any of them with a 'Prime' or 'DoublePrime' label. Your match for an A' node would be MATCH (n:A:Prime).
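For example (the label and property names here are just illustrative):

```
// Nodes carry two labels: the class ('A') plus a marker ('Prime' / 'DoublePrime').
CREATE (:A:Prime {name: 'firstNode'});
CREATE (:A:DoublePrime {name: 'secondNode'});

// All A' nodes:
MATCH (n:A:Prime)
RETURN n;
```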
It seems there is a small problem with the rendering on Windows machines, so I just wanted to clarify: if some weird symbols appear in the first two bullet points, they should be an arrow pointing to the right (->).
At the moment we are thinking about having a node for each class and connecting the classified nodes to them via a relationship, or multiple relationships if a node is classified as more than one class. This way we could give the relationships a weight property. Currently we are trying to figure out how best to classify nodes and how to do the weighting.
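A rough sketch of what we have in mind, with made-up label, relationship, and property names:

```
// One node per class; a classified node gets one weighted relationship
// per class it belongs to.
MERGE (c1:Class {name: "A'"})
MERGE (c2:Class {name: "A''"})
MERGE (n:Item {id: 'A'})
MERGE (n)-[r1:CLASSIFIED_AS]->(c1) SET r1.weight = 0.8
MERGE (n)-[r2:CLASSIFIED_AS]->(c2) SET r2.weight = 0.4;
```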
The weighting is also what we want to normalize, because if we want to compare different taxonomies the weights need to be on a common scale. The idea is that the longer the path, the more specific something is. But if in one taxonomy the maximum path length is 3 and in another it is 5, the raw path lengths are hard to compare directly (a depth of 3 means "maximally specific" in the first taxonomy but not in the second).
I think you will have to perform the normalization at the time when you want to compare multiple taxonomies or extract metrics from a taxonomy. Otherwise, every time you modify a taxonomy you would have to renormalize every weight whenever its maximum path length changes. This would not be hard using a custom procedure.
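As a sketch, assuming the taxonomy is modelled as :Class nodes linked to a single root via :SUBCLASS_OF relationships (both names are assumptions), the read-time normalization could also be done in plain Cypher:

```
// Depth of each class below the root, rescaled by the maximum depth
// found in this taxonomy at query time.
MATCH p = (c:Class)-[:SUBCLASS_OF*]->(root:Class {name: 'Root'})
WITH c, length(p) AS depth
WITH collect({cls: c, depth: depth}) AS rows, max(depth) AS maxDepth
UNWIND rows AS row
WITH row.cls AS cls, row.depth AS depth, maxDepth
RETURN cls.name AS className, depth, toFloat(depth) / maxDepth AS normalizedDepth
ORDER BY normalizedDepth DESC;
```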
So basically the general graph for this task is the one with the round nodes. Based on the classification, it should produce a somewhat sophisticated recommendation of products for the user, driven by personal characteristics. For example, we classify the user as overweight (the general class) based on weight and height. On the other side, the products have to be classified as well, primarily based on their descriptions and keywords, so that products relevant to overweight people end up in the same general class. We then bring these two general classes together so that we can recommend those products. The weighting should determine the order in which the products are recommended. A person can be associated with more than one general class, and a general product class is of course associated with many products, so we need some kind of ordering besides one based purely on recency (what is the newest thing we know about the person).
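With the class-node model above, the recommendation query we are aiming for would be roughly the following (all names are made up, and multiplying the two weights is just one possible way to combine them):

```
// Products that share a general class with the user, ordered by combined weight.
MATCH (u:User {id: $userId})-[uc:CLASSIFIED_AS]->(c:Class)<-[pc:CLASSIFIED_AS]-(p:Product)
RETURN p.name AS product, c.name AS sharedClass, uc.weight * pc.weight AS score
ORDER BY score DESC
LIMIT 10;
```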