I am writing data from my HDFS cluster into my Neo4j server using PySpark:
df.write.format("org.neo4j.spark.DataSource") \
    .mode("Overwrite") \
    .option("url", "neo4j://192.xx.xx.xxx1:7687") \
    .option("authentication.basic.username", "username") \
    .option("authentication.basic.password", "password") \
    .option("query", "MATCH (n:entity {xnamex: 'john doe'}) WITH n SET n.xlabelx = 'temp'") \
    .save()
Then I realized that Spark (and PySpark) is not installed on my Neo4j server, yet the code still succeeded.
So I'm wondering: is the neo4j-spark-connector only needed on the Spark (data source) side? Or could I also deploy Spark on the Neo4j server, and would that speed things up?