I am running a pipeline from S3 to spark and finally to neo4j. I am using neo4j bolt driver for python to write data. I have 1600 files on S3, I used UNWIND to batch write and also create a constraint for name. Now my database is 31 GB and I am using...
I have 200,000,000 relationships to write into my database. The data is a nested list like b = [[1, 2],[3, 4], [5, 6]]. Right I am using Neo4j Bolt Driver for Python to write data. But the writing is very slow, it would take 5 days to finish all the ...
I am using a spark cluster for batch processing now. I just want to ask if anyone knows how to write Neo4j with neo4j-spark-connector in python. I read the repo README https://github.com/neo4j-contrib/neo4j-spark-connector, I can't find anything ...
Very happy to join the community.
I understand in Neo4j 4, we are able to switch to another database in Neo4j Browser. My question is how can we merge 2 or more databases? Say I have A.db, B.db, and C.db, how can we combine A, B, and C to generat...
Thanks for your reply Anthapu, I check my database and I am sure the constraint is there.
If I don't have the constraint at the beginning, there is no way I can run 30s per file.
Do you think it might be my memory? 8G is not enough?
I tried the code your provided above but got the error below
PicklingError: Could not serialize object: TypeError: Cannot serialize socket object
Do you have any idea how should I fix this?