Hi, I am trying to test almost all of the GDS algorithms to select the best one for each of about 50 different target tasks for a customer, such as finding activity patterns on the internet.
Some tasks can be accomplished by several different algorithms, each with different results or run times.
Is there any existing comparison or report where the different GDS algorithms were run on the same data, comparing their quality and performance? That kind of information would be very helpful, even though I will still have to test almost all of the algorithms myself to build an application for each task.
For example, I found that NodeSimilarity produces very good results quickly when comparing thousands of sets of news content, but it cannot be used to compare hundreds of thousands of sentences, since it takes forever on my best test machine (Ryzen 5950X, 32 threads, 128 GB RAM), unless I am doing something wrong.
I am sorry I cannot share more specific details of the tasks, since it is a very confidential project.
If you're looking for run time estimates, you can check out our configuration guide, which includes run times for certain algorithms on a specified graph (LDBC100, ~300M relationships, 1B nodes) and lists the hardware we used to generate the benchmarks. It also provides some guidance on optimizing performance. In general, though, you want to set concurrency as high as possible (EE has unlimited concurrency), and make use of parameters like degreeCutoff, topK, and topN when available.
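As a rough sketch, a Node Similarity call tuned along those lines might look like the following (the graph name `myGraph` and the specific parameter values are illustrative, not recommendations):

```cypher
// Assumes a graph named 'myGraph' has already been projected into the GDS catalog.
// degreeCutoff prunes low-degree nodes before comparison; topK bounds results per node.
CALL gds.nodeSimilarity.stream('myGraph', {
  concurrency: 32,   // match your available threads (EE lifts the concurrency limit)
  degreeCutoff: 2,   // ignore nodes with fewer than 2 relationships
  topK: 10           // keep only the 10 most similar neighbors per node
})
YIELD node1, node2, similarity
RETURN gds.util.asNode(node1), gds.util.asNode(node2), similarity
ORDER BY similarity DESC;
```

Lowering `topK` and raising `degreeCutoff` shrinks the candidate space, which is usually the biggest lever on Node Similarity's run time.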
"Quality" is a much more nuanced metric - it depends strongly on the data set you're running an algorithm on and the problem at hand. Usually we recommend tuning your algo call on a subset of the data, to make sure your parameter combination gives you sensible results, before running over the full dataset.
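One way to get such a subset is a Cypher projection that limits the node set; relationships whose endpoints fall outside the sample are skipped by default. A minimal sketch, assuming hypothetical `:Article` nodes and `:SIMILAR_TO` relationships (the graph name `sampleGraph` and the limit are placeholders):

```cypher
// Project a small sample graph for parameter tuning before a full run.
CALL gds.graph.project.cypher(
  'sampleGraph',
  'MATCH (n:Article) RETURN id(n) AS id LIMIT 10000',
  'MATCH (a:Article)-[:SIMILAR_TO]->(b:Article)
   RETURN id(a) AS source, id(b) AS target'
);
```

Once the parameter combination looks sensible on `sampleGraph`, you can re-run the same algorithm call against the full projection.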