Hi everyone, I wanted to ask if there is any possible way to import records from multiple rdb's into my neo4j using neo4j-import tool for initial offline bulk load.
I know there are other options available to import data to neo4j from rdbms but online import will be very slow for my needs.
What I want to achieve is
- Build a metadata graph from all my dbs schema info into neo4j. For this I could use SchemaCrawler.
- Import all my data records inside neo4j so I will be able to run graph analytics.
The offline bulk import tool needs an empty db to work so making the connection with multiple dbs and import the records is not feasible (official documentation)
Looking into the current architecture of etl tool:
My initial thought was that it would be easy to write a script that will generate the metadata-mapping
export relational data to CSV, generate Mapping Headers and then import all of them at once...something like the implementation of @michael.hunger in 2015 relational_to_neo4j_import_tool
.....but......the minimal command line example for export it needs also input parameters for import tool. it is also written in the documentation of 'export' command
neo4j-etl export - Export from RDBMS and import into NEO4J via CSV files.
Examples of command usage:
Minimal command line ./bin/neo4j-etl export \ --rdbms:url <url> --rdbms:user <user> --rdbms:password <password> \ --destination $NEO4J_HOME/data/databases/graph.db/ --import-tool $NEO4J_HOME/bin \ --csv-directory $NEO4J_HOME/import
Is it possible to separate export and import from command line ?
If not another option will be to create multiple dbs and then manage & query them to get my insights with Neo4j Fabric.
Any other option you may have to suggest?