Importing Data from Hive into Neo4J through Spark application

aloplop85 · March 5, 2019, 6:06pm

Hello everyone,

I am almost newbie to Neo4J and graph databases world, and I have some doubts about importing data into Neo4J.

In my case, I have several related tables in Hive which I would like to load to Neo4J. I wonder if I should use a Spark2 application which worked as a bridge between both technologies (e.g. transforming Hive data using dataframes) by using Neo4J connector or it is possible to convert Hive contents into edge and nodes directly by using the Hive JDBC driver. This second approach would be similar to the test described in this post of the community.

As I have not found several examples including a Spark application, what should be the best approach for this case? Is it possible to load data directly from Hive?

Furthermore, does it worth using another external tool such as StreamSets Data Collector?
I saw this video by their Technical Director, Pat Patterson, and it looks quite good.

Thanks in advance,

Álvaro López

metadaddy · May 10, 2019, 3:40pm

Hi Álvaro,

Thanks for the mention! I've been integrating Data Collector with Neo4j for some time now - it works well. Feel free to ask questions here, or over at our community: https://streamsets.com/community

Cheers,

Pat

Topic		Replies	Views
Rishi Software Engineer in Support at StreamSets Inc , Bangalore ( India ) Introduce-Yourself introduction	8	1420	May 10, 2019
Write in neo4j from hive Graph Algorithms/Graph Data Science spark , neo4j-python-driver	0	368	June 2, 2023
Neo4j Integration with Hadoop Integrations & Ecosystem	1	1436	October 19, 2018
Neo4j community edition - Can it integrate with Apache Spark Operations	11	770	November 16, 2020
Connect Spark with Neo4j to transform JSONs into Graphs Neo4j Graph Platform spark , import	2	654	November 12, 2020

Free Online Global Conference

Importing Data from Hive into Neo4J through Spark application

Related topics