Importing JSON into Neo4j from a file containing a list of JSON objects

apoc
import
json

(Parvez Hazari) #1

I have a file with one JSON object on each line.
Sample records from the file are below:

{"bookId": "1000027", "relatedBookIds": ["4330592", "4755603", "4330602", "1100247", "4330612", "3042379", "4330596", "4330610", "1100231", "4330606", "3440120", "999901413"]}
{"bookId": "1000029", "relatedBookIds": ["4330606", "4330592", "999931622", "4330576", "3273969", "4755989", "1100223", "4330588", "3339070", "999901411", "4755609", "4330602"]}

The file contains approximately a million JSON objects.

I was thinking of reading the file line by line and using apoc.load.json to load each JSON object. But considering the number of records, is there a better way to load the file?


(Michael Hunger) #2

Combine apoc.load.json with apoc.periodic.iterate

CALL apoc.periodic.iterate('
  CALL apoc.load.json("file:///path/to/file.json") YIELD value
','
  CREATE (n:Node) SET n += value
', {batchSize:10000})
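
If you also want to link each book to its related books, something like this should work (a sketch only; Book, RELATED_TO and the id property are illustrative names, adjust them to your model):

CALL apoc.periodic.iterate('
  CALL apoc.load.json("file:///path/to/file.json") YIELD value
','
  MERGE (b:Book {id: value.bookId})
  WITH b, value
  UNWIND value.relatedBookIds AS relatedId
  MERGE (r:Book {id: relatedId})
  MERGE (b)-[:RELATED_TO]->(r)
', {batchSize:10000})

For MERGE to stay fast at that volume, create a uniqueness constraint on :Book(id) first.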

(Parvez Hazari) #3

Thanks Michael for your reply.
Just to understand your suggestion: is this approach equivalent to "USING PERIODIC COMMIT 10000 LOAD CSV"?
I ask because I was planning to convert this file to CSV and then use LOAD CSV with periodic commit.
But if this approach matches the performance of LOAD CSV, I would rather skip the conversion and import directly from the JSON file.
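
For reference, the LOAD CSV variant I had in mind looks roughly like this (assuming I first convert each line into a row with a bookId column and a pipe-separated relatedBookIds column; the column names and separator are just my own choice):

USING PERIODIC COMMIT 10000
LOAD CSV WITH HEADERS FROM "file:///path/to/file.csv" AS row
CREATE (n:Node)
SET n.bookId = row.bookId,
    n.relatedBookIds = split(row.relatedBookIds, "|")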

Also, to clarify my requirement: I have just one input file, which contains one flattened JSON object per line. So if the file has 10 lines, I have 10 separate JSON objects.
The example in my initial post can be read as a file containing 2 lines with 2 JSON objects.


(Michael Hunger) #4

Yes, exactly. The APOC version has some benefits over regular LOAD CSV in terms of batching and handling large datasets.
And yes, you will have one JSON object per line, each of which is turned into one row, aka "value".
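
A quick way to check, assuming the same file path, is to return a few rows first:

CALL apoc.load.json("file:///path/to/file.json") YIELD value
RETURN value.bookId, value.relatedBookIds
LIMIT 5

Each "value" is a map, e.g. {bookId: "1000027", relatedBookIds: [...]}.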


(Parvez Hazari) #5

Thanks Michael.
Let me try it out and I will update with the results.