Feedback requested on proposal for Spark Cypher in Spark 3.0

alastair.green · January 16, 2019, 12:48pm

Xiangrui Meng of Databricks recently posted this on the Apache Spark project users list:

http://apache-spark-user-list.1001560.n3.nabble.com/SPIP-DataFrame-based-Property-Graphs-Cypher-Queries-and-Algorithms-td34358.html

Databricks and Neo4j contributors are looking to bring Cypher queries into the core Spark project as part of Spark 3.0 (slated for release mid-year 2019). This will build on elements from the Cypher for Apache Spark and Graphframes projects.

All the details are in the links in Xiangrui's e-mail.

It would be great to see Neo4j and Spark community users expressing their support for/adding feedback on this Spark Project Improvement Proposal before it goes for a vote in the Spark dev community.

The more detail you can provide on your interest in this, the better, but a simple +1 in reply to Xiangrui's post would be just great if you are short of time ...

Thanks, Alastair

alastair.green · January 23, 2019, 10:08am

CORRECTION

It appears that the nabble.com list is not the right medium for replying to Xiangrui's post with feedback or messages of support.

Please take the following steps to comment on the Spark users list:

Subscribe to user@ by sending an email to user-subscribe@spark.apache.org.
Go to this link https://lists.apache.org/thread.html/269cbffb04a0fbfe2ec298c3e95f01c05b47b5a72838004d27b74169@<user.spark.apache.org> and click reply -> reply via mail client.

Thanks, and apologies for the mix-up.

alastair.green · January 23, 2019, 9:59pm

Sorry, but this appears not to be a simple process.

If you do Step 1, you will get a mail that requires you to reply to confirm. Then and only then will you be able to perform Step 2 (reply to the users list).

alastair.green · February 13, 2019, 10:31am

Following user comments (thanks everyone who pitched in with feedback), Xiangrui launched a vote on the proposal on the Spark devs list, and it closed yesterday with the following result:

Hi all,

The vote passed with the following +1s (* = binding) and no 0s/-1s:

Denny Lee

Jules Damji

Xiao Li*

Dongjoon Hyun

Mingjie Tang

Yanbo Liang*

Marco Gaido

Joseph Bradley*

Xiangrui Meng*

Please watch SPARK-25994 and join future discussions there. Thanks!

Best,
Xiangrui

The binding votes are Apache Spark PMC members. This is a great outcome, reflecting a ton of work from various contributors and backers.

There's going to be a discussion about how Cypher can feed into the proposed international standard GQL at the forthcoming Fifth openCypher Implementers' Meeting in Berlin in early March.

This news about Spark Cypher adds to the importance of making the long-term transition from Cypher to GQL as easy as possible. (There are also reports of one or two additional industrial implementations of Cypher in the works.) The ever-growing interest in a standard graph query language shows how graph data management is beginning to go mainstream, in my view.

Alastair

Topic		Replies	Views
State of Neo4j-Morpheus? Cypher gql	3	1144	November 2, 2020
Neo4j community edition - Can it integrate with Apache Spark Operations	11	782	November 16, 2020
Cypher developed database. GQL becomes an international standard. Whether Cypher automatically switches to GQL. Whether database needs to be redeveloped with GQL Neo4j Graph Platform	1	291	April 5, 2023
Support for running SPARQL into graph Linked Data, RDF, Ontology	6	3676	June 24, 2021
Query an existing Neo4j graph using SPARQL Cypher	0	1528	April 11, 2019

Feedback requested on proposal for Spark Cypher in Spark 3.0

Related topics