I am trying to load CSV files hosted in an S3 bucket into Neo4j standalone edition deployed in a local docker container.
I am using pre-signed urls for the CSV files and when I docker exec into the docker container that is running Neo4j, I can cURL the pre-signed url and access the file.
However when executing
LOAD CSV FROM ‘https://<pre-signed-url>’ as row RETURN count(row)
I have also tried using apoc.load.csv instead and get a socket timeout exception so suspect it is a networking issue but can access from the container fine.
Is there some kind of configuration in Neo4j.conf that would restrict this?
Using neo4j:4.4.20 docker image, exposing ports through local host.
Yeah I'm using the pre-signed urls and the bucket is open to internal traffic. I can access the link from within the docker container which makes me think it is a neo4j conf issue blocking access or not reading the file correctly.
Managed to get it working, it was down to a combination of bucket policy issues and security group rules so nothing neo4j related.
I don’t know if there is a feature request process but something that would’ve helped troubleshooting here would be returning the http response in the event of an error. At the moment it just raises the ExternalResourceFailed error however passing the http response would’ve highlighted whether it’s an AWS/source access issue or something internal to neo4j/networking.