Performance issues as database gets bigger

Well what I meant is, that sometimes a person has a name like this:

“John Smith PhD.”, which needs to be correctly matched to a Person node with name “Smith John”

Or things like different address formatting:
Fashion Street 123/45, London, UK
Fashion Street 45, London, UK

There has to be some logic in the matching process here, right? Im not using any AI to parse everything into same form beforehand. However im doing some pre-procesing of the data, cleaning, or trying to put addresses in the uniform form like so:
Street StreetNumber, PostalCode City, Country

Beyond that, I tried to do the robust “algorithms” for comparing slightly different nodes, to be accurate, as the data wont always be in 100% state. Do u think this is wrong approach?