Null (global id space) is missing data neo4j-admin import

Khaled
Node Link

Hi there,
I am having a problem importing CSV files using neo4j-admin. I have the following node and relationship files.

#Nodes
-Node 1 header
addID:ID(Address),address:string,userID:IGNORE

-Node 1 csv
0,1A1zP1eP5QGefi2DMPTfTL5SLmv7DivfNa
1,12c6DSiU4Rq3P4ZxziKxzrL5LmMBrzjrJX
2,1HLoD9E4SDFFPDiYfNYnkBLQ85Y51J3Zb1
3,1FvzCLoTPGANNjWoUo6jUGuAG3wg1w4YjR
4,15ubicBBWFnvoZLT7GiU2qxjRaKJPdkDMG
5,1JfbZRwdDHKZmuiZgYArJZhcuuzuw2HuMu

-Node 2 header
txID:ID(Transaction),txhash,n_inputs,n_outputs

-Node 2 csv
0,4a5e1e4baab89f3a32518a88c31bc87f618f76673e2cc77ab2127b7afdeda33b,0,1
1,0e3e2357e806b6cdb1f70b54c3a3a17b6714ee1f0e68bebb44a74b1efd512098,0,1
2,9b0fc92260312ce44e74ef369f5c66bbb85848f2eddd5a7a1cde251e54ccfdd5,0,1
3,999e1c837c76a1b7fbb7e57baf87b309960f5ffefbf2a9b95dd890602272f644,0,1
4,df2b060fa2e5e9c8ed5eaf6a45c13753ec8c63282b2688322eba40cd98ea067a,0,1
5,63522845d294ee9b0188ae5cac91bf389a0c3723f084ca1025e7d9cdfe481ce1,0,1

#Relationships

  • Relationship header 1 (incoming transactions)
    :START_ID(Transaction),:END_ID(Address),value

-Relationship csv 1
504,367,5000000000
504,192,100000000
504,255,1000000000
533,293,5000000000
555,533,2500000000

  • Relationship header 2 (outgoing transactions)
    :END_ID(Address),:START_ID(Transaction),value

-Relationship csv 2
0,0,5000000000
1,1,5000000000
2,2,5000000000
3,3,5000000000
4,4,5000000000
5,5,5000000000

-Here is the script I am using to import the data

bin\neo4j-admin import --nodes=Address="import/addresses-header.csv,import/addresses." --nodes=Txhash="import/txhash-header.csv,import/txhash." --relationships=TO="import/incoming-header.csv,import/txin.* " --relationships=SENDING="import/outcoming-header.csv,import/txout.*" --skip-duplicate-nodes --high-io=true --skip-bad-relationships --delimiter=","

There are more than 400 files containing more than 2 billion lines. The import completes successfully, but I notice that 700 million of the relationships are missing and have been logged in import.report.
I get this message when the import completes:

"There were bad entries which were skipped and logged into import.report file"

and here is a sample from the import.report file:
370055140 465945065 398435658 (Address)-[SENDING]->null (global id space) is missing data

So far I have not been able to figure out what the error means or how to resolve it.
Can anyone help me to overcome this problem?

Thank you in advance

1 REPLY

sam_gijare
Graph Buddy

Hi

Please write your own script to realign your relationships according to your domain schema design (I don't currently have your domain model schema specification) and its constraints (unique and NOT NULL IDs). If there are no such constraints, then you can form your relationships yourself.
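
As a starting point for such a script, here is a minimal Python sketch that checks relationship rows against their header and against the node IDs before running the import. The function names `load_ids` and `find_bad_rows` are hypothetical and not part of any Neo4j tooling. The sketch assumes comma-delimited files with the header layout shown in the question, and it only flags rows that look suspicious (wrong field count, empty ID, or an ID that does not appear in the matching node group); it does not reproduce the importer's own checks.

```python
import csv
import re
from typing import Dict, Iterable, Iterator, List, Optional, Set, Tuple

# Matches header fields such as ":START_ID(Transaction)" or ":END_ID(Address)".
ID_FIELD = re.compile(r":(START_ID|END_ID)(?:\((.+)\))?$")


def load_ids(rows: Iterable[List[str]], id_column: int = 0) -> Set[str]:
    """Collect the non-empty values of the ID column of a node file."""
    return {row[id_column].strip() for row in rows if row and row[id_column].strip()}


def find_bad_rows(
    header: List[str],
    rows: Iterable[List[str]],
    ids_by_group: Dict[Optional[str], Set[str]],
) -> Iterator[Tuple[int, List[str], str]]:
    """Yield (line_number, row, reason) for relationship rows that look broken."""
    # Work out which columns hold the start and end IDs, and their ID groups.
    endpoints = []
    for index, field in enumerate(header):
        match = ID_FIELD.search(field.strip())
        if match:
            endpoints.append((index, match.group(1), match.group(2)))

    for line_number, row in enumerate(rows, start=1):
        # A wrong field count usually means a delimiter problem in that row.
        if len(row) != len(header):
            yield line_number, row, f"expected {len(header)} fields, got {len(row)}"
            continue
        for index, kind, group in endpoints:
            value = row[index].strip()
            if not value:
                yield line_number, row, f"{kind} is empty"
                break
            if value not in ids_by_group.get(group, set()):
                yield line_number, row, f"{kind} {value} not found in group {group}"
                break


if __name__ == "__main__":
    # Node rows copied from the question.
    addresses = csv.reader([
        "0,1A1zP1eP5QGefi2DMPTfTL5SLmv7DivfNa",
        "1,12c6DSiU4Rq3P4ZxziKxzrL5LmMBrzjrJX",
    ])
    transactions = csv.reader([
        "0,4a5e1e4baab89f3a32518a88c31bc87f618f76673e2cc77ab2127b7afdeda33b,0,1",
        "1,0e3e2357e806b6cdb1f70b54c3a3a17b6714ee1f0e68bebb44a74b1efd512098,0,1",
    ])
    ids_by_group = {
        "Address": load_ids(addresses),
        "Transaction": load_ids(transactions),
    }
    header = next(csv.reader([":END_ID(Address),:START_ID(Transaction),value"]))
    # The last two rows are hypothetical: one unknown ID, and one
    # space-separated line modelled on the import.report sample.
    rows = csv.reader([
        "0,0,5000000000",
        "504,1,100000000",
        "370055140 465945065 398435658",
    ])
    for line_number, row, reason in find_bad_rows(header, rows, ids_by_group):
        print(line_number, row, reason)
```

Holding every ID in an in-memory set will not scale to billions of rows, so this is better suited to a sample of each file, or to the rows already listed in import.report, than to the full data set.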

Thanking you
Sameer S Gijare