Data Science Library to help us correlate through behaviors (histories)

We are kind of new to the Data Science Library. We have a dataset with profile handles and names and their recorded identifiers from several social media platforms (Facebook, X, Reddit, Weibo, Vk, etc.). The recorded identifiers consist of email address, company affiliation, WhatsApp, WeChat, Telegram, and more. Criminals are not going to use their real names and addresses, etc., however they create burner accounts that typically use the same identifiers within their network.

Is there a good mechanism(s) in the Data Science Library to help us try and correlate through their behaviors (histories) a set of unique profiles. In other words, sally123 and dave43 are the same person. Or is there a way to overlay the datasets to identify any relationships between the accounts and unique identifiers.

I was also thinking that each person when they write something uses the same patterns in their writing style. Is there some way to perhaps use this as method to, “lift the vail”? Can and how would we approach this?
Thoughts?

Can anyone help? I could use some guidance.. if you can.. thanks guys!

Try apoc.text.jaroWinklerDistance between the unique identifiers. This will give a similarity strength between 0 to 1.

Thanks.. I will play with this tonight.. THANKS