LOAD CSV from a shared folder?

daniel_amigos · September 9, 2019, 4:16pm

Hi all!

I have a cluster of servers and an ETL process that imports multiple files. The issue is that the leader of the cluster changes, and LOAD CSV looks for the import files in c:\neo4j\import on whichever server is currently the leader. I don't want to copy all of the files to each server.

The ideal solution would be a shared folder such as \sharedFolder\import that all servers can point to, so that LOAD CSV could load from that URL.

I've tried but with no success.

Is this possible?

michael.hunger · September 9, 2019, 10:13pm

You can either symlink the shared folder into your import folder, or disable the security restriction that limits Neo4j to importing only from the import folder:

# This setting constrains all `LOAD CSV` import files to be under the `import` directory. Remove or comment it out to
# allow files to be loaded from anywhere in the filesystem; this introduces possible security problems. See the
# `LOAD CSV` section of the manual for details.
dbms.directories.import=import
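
For the symlink option, a minimal Python sketch might look like the following. The share path \\fileserver\import and the link name "shared" are hypothetical; c:\neo4j\import is the import folder from the original question. On Windows, creating a symlink usually requires administrator rights or Developer Mode, and the account that runs Neo4j still needs access to the link target.

```python
import os


def link_into_import(shared_dir: str, import_dir: str, link_name: str) -> str:
    """Create a directory symlink inside import_dir that points at shared_dir.

    Returns the path of the new link. Raises OSError if the link cannot be
    created (for example, missing privileges on Windows).
    """
    link_path = os.path.join(import_dir, link_name)
    # target_is_directory only matters on Windows; it is ignored elsewhere.
    os.symlink(shared_dir, link_path, target_is_directory=True)
    return link_path


# Hypothetical usage on one cluster member (paths are assumptions):
# link_into_import(r"\\fileserver\import", r"c:\neo4j\import", "shared")
```

With the import restriction left in place, a file under the link would then be expected to resolve relative to the import folder, for example file:///shared/archive/file.csv.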

daniel_amigos · September 10, 2019, 7:53pm

I have removed the constraint in the config file:

# The name of the database to mount. Note that this is *not* to be confused with
# the causal_clustering.database setting, used to specify a logical database
# name when creating a multi-clustering deployment.
#dbms.active_database=graph.db

I then created a symlink to a remote folder:
[screenshot: symlink]

But I am still not able to load the file from the remote folder:

(I am using Windows servers.)

lju · September 12, 2019, 8:48am

Hi Daniel,

The Neo4j server will be running under a local account. Does that account have permission to access that location?

Thanks!

dana_canzano · September 12, 2019, 12:17pm

Given the error "Couldn't load the external resource at: file:/c:/symlink.........", can you please provide the LOAD CSV command? With a file: URL, the usual form is file:///......... with three slashes.

The LOAD CSV page of the Cypher Manual states:

"CSV files can be stored on the database server and are then accessible using a file:/// URL. Alternatively, LOAD CSV also supports accessing CSV files via HTTPS, HTTP, and FTP."
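
To illustrate the three-slash form, here is a small Python sketch that turns an absolute Windows path into a file:/// URL. The helper name windows_path_to_file_url is made up for this example, and it only formats the string; whether Neo4j resolves the URL against the import folder or the whole filesystem still depends on the dbms.directories.import setting.

```python
from urllib.parse import quote


def windows_path_to_file_url(path: str) -> str:
    """Convert an absolute Windows path (c:\\dir\\file.csv) to a file:/// URL."""
    # URLs use forward slashes regardless of the platform separator.
    forward = path.replace("\\", "/")
    # Percent-encode characters such as spaces; keep '/' and the drive colon.
    return "file:///" + quote(forward, safe="/:")


print(windows_path_to_file_url(r"c:\symlink\archive\file.csv"))
# prints: file:///c:/symlink/archive/file.csv
```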

daniel_amigos · September 12, 2019, 7:15pm

Thanks everyone for helping out!

The command was at the top of the second image and that command is:

LOAD CSV WITH HEADERS FROM "file:///c:/symlink/archive/file.csv" AS row RETURN row LIMIT 10

Regarding permissions, "Everyone" has read and write permissions as seen below:
[screenshot: temp]

lju · September 13, 2019, 8:57am

Hi!

Just to double-check, could you please confirm that one of the servers running Neo4j can access the path under the account that Neo4j runs as? Perhaps check this from a command prompt.
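
As an alternative to checking by hand, a rough Python sketch of the same test is shown below. It only tells you something useful if it is run as the same account that the Neo4j service uses; the function name check_readable and the sample path in the comment are assumptions for illustration.

```python
import os


def check_readable(path: str) -> str:
    """Report whether the current account can resolve and read the file at path."""
    if not os.path.exists(path):
        # Covers a missing file, a broken symlink, or an unreachable share.
        return "missing or unreachable"
    try:
        with open(path, "rb") as handle:
            handle.read(1)  # Force an actual read, not just a metadata lookup.
    except PermissionError:
        return "permission denied"
    except OSError as exc:
        return f"error: {exc}"
    return "readable"


# Hypothetical usage, run as the Neo4j service account:
# print(check_readable(r"c:\symlink\archive\file.csv"))
```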

Thanks!
