cancel
Showing results for 
Search instead for 
Did you mean: 

Head's Up! Site migration is underway. Phase 1: replicate users.

Linear Regression and Neo4j

familylife103
Node Link

Hello,
I need to apply linear regression machine learning method on Neo4j already created database to find node importance and correlation.
Any examples to learn from?
Thanks.

6 REPLIES 6

raghad_baiad
Node Clone

hello @familylife103 ,

Are you using neo4j 3.X or neo4j 4.X ?

If you're looking at measuring node importance probably a good place to start are the centrality algorithms - Centrality . They use the structure of the graph itself to measure the importance of nodes.

Similarly, the Similarity algorithms - Neo4j Graph Data Science algorithms, use standard metrics for correlation (jaccard, cosine, pearson, etc) to measure the similarity of nodes.

GDS doesn't currently offer node regression, but there are some apoc procedures you can take a look at.

Thanks a lot for the info @alicia.frame1 , apoc procedures are great start, specially apoc.math.regr(). However, it is missing a lot of regression parameters. There were a great effort done by Lauren Shin in https://towardsdatascience.com/graphs-and-ml-multiple-linear-regression-c6920a1f2e70

However, it doesn't work in neo4j 4.X. It would be great if it is integrated there

familylife103
Node Link

Thanks for your reply, maybe that's why I couldn't apply the regression example on Neo4j, because I'm using the latest version. Does it mean if I change the version I will be able to use regression functions?
Is there any example about this topic using Python connected to the Neo4j graph?

Check out the tutorials under our developer guide pages, here: Link Prediction with GDSL and scikit-learn - Developer Guides

We walk through using Neo4j with scitkit learn, sagemaker, and training models inside neo4j.