We have a use case where we want to cluster individuals for a prediction of a classifier. A simplified representation is below
(Person) -- worked at --> (Org)
Nodes of both Person and Org have many properties, The worked_at will have properties such as from_dt, to_dt
We want to apply clustering on Person using a set of properties and also want to include some properties from the node with label Org.
I noticed that the movie database has a similar representation of actor working in multiple films so assume the modelling is fine, hence, keeping the mode intact what is the best way to achieve the clustering?