Hi there
I am facing a problem in importing CSV files using neo4j-admin. I have the following nodes and relationship files.
#Nodes
-Node 1 header
addID:ID(Address),address:string,userID:IGNORE
-Node 1 csv
0,1A1zP1eP5QGefi2DMPTfTL5SLmv7DivfNa
1,12c6DSiU4Rq3P4ZxziKxzrL5LmMBrzjrJX
2,1HLoD9E4SDFFPDiYfNYnkBLQ85Y51J3Zb1
3,1FvzCLoTPGANNjWoUo6jUGuAG3wg1w4YjR
4,15ubicBBWFnvoZLT7GiU2qxjRaKJPdkDMG
5,1JfbZRwdDHKZmuiZgYArJZhcuuzuw2HuMu
-Node 2 header
txID:ID(Transaction),txhash,n_inputs,n_outputs
-Node 1 csv
0,4a5e1e4baab89f3a32518a88c31bc87f618f76673e2cc77ab2127b7afdeda33b,0,1
1,0e3e2357e806b6cdb1f70b54c3a3a17b6714ee1f0e68bebb44a74b1efd512098,0,1
2,9b0fc92260312ce44e74ef369f5c66bbb85848f2eddd5a7a1cde251e54ccfdd5,0,1
3,999e1c837c76a1b7fbb7e57baf87b309960f5ffefbf2a9b95dd890602272f644,0,1
4,df2b060fa2e5e9c8ed5eaf6a45c13753ec8c63282b2688322eba40cd98ea067a,0,1
5,63522845d294ee9b0188ae5cac91bf389a0c3723f084ca1025e7d9cdfe481ce1,0,1
#Relationships
- Relationship header 1 (incoming transactions)
:START_ID(Transaction),:END_ID(Address),value
-Relationship csv 1
504,367,5000000000
504,192,100000000
504,255,1000000000
533,293,5000000000
555,533,2500000000
- Relationship header 2 (outgoing transactions)
:END_ID(Address),:START_ID(Transaction),value
--Relationship csv 2
0,0,5000000000
1,1,5000000000
2,2,5000000000
3,3,5000000000
4,4,5000000000
5,5,5000000000
-Here is the script I am using to import the data
bin\neo4j-admin import --nodes=Address="import/addresses-header.csv,import/addresses." --nodes=Txhash="import/txhash-header.csv,import/txhash." --relationships=TO="import/incoming-header.csv,import/txin.* " --relationships=SENDING="import/outcoming-header.csv,import/txout.*" --skip-duplicate-nodes --high-io=true --skip-bad-relationships --delimiter=","
There are more than 400 files containing more than 2 billion lines. The import is done successfully but I notice 700 million of the relationships are missing and logged into the import.report.
I am getting this error when the importing is completed
"There were bad entries which were skipped and logged into import.report file"
and here is the sample of the import.report file
370055140 465945065 398435658 (Address)-[SENDING]->null (global id space) is missing data
So far I could not figure out what the error is meaning or how to resolve it.
Can anyone help me to overcome this problem?
Thank you in advance