Hi all,
Check out my blog post about detecting clusters in probabilistic data, where false positives are possible. The example detects unique speakers from voiceprint similarities, but the approach can be applied to any similar use case.
https://graphaware.com/analytics/2019/01/28/speaker-identification-meets-graphs.html