De-duplication with consolidation
It is easily overlooked that finding duplicates is not sufficient. They must also be cleansed.
Selection is the conventional procedure: one address is selected to survive, and the remaining addresses are eliminated. Priorities can be defined for the selection, e.g. addresses from a specific file or the address with the latest date of change are preferred. The problem remains, however, that the information from the eliminated addresses is lost.
Even in simple applications, such as preparing a mailing, Selection gives unsatisfactory results. If there are two duplicates, one with the title and the other with the first name (e.g. “Osborn, PhD” and “Joe Osborn”), then Selection keeps either “Osborn, PhD” or “Joe Osborn”. The desired completion “Joe Osborn, PhD” can only be achieved by Consolidation. It is therefore recommended to use consolidation even for simple tasks such as de-duplication for a mailing.
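The difference between the two procedures can be sketched in a few lines of Python. This is only an illustration of the idea, not q.address functionality: the field names (`first_name`, `last_name`, `title`) and the helper functions are assumptions made for the example, and the records are assumed to share the same fields.

```python
# Hypothetical duplicate group from the example above; field names are assumptions.
duplicates = [
    {"first_name": "", "last_name": "Osborn", "title": "PhD"},
    {"first_name": "Joe", "last_name": "Osborn", "title": ""},
]


def select(records):
    """Selection: one whole record survives, all others are discarded."""
    return dict(records[0])


def consolidate(records):
    """Consolidation: per field, take the first non-empty value in the group."""
    fields = records[0].keys()
    return {f: next((r[f] for r in records if r.get(f)), "") for f in fields}


def display(record):
    """Format a record as 'First Last, Title', skipping empty parts."""
    name = " ".join(p for p in (record["first_name"], record["last_name"]) if p)
    return f"{name}, {record['title']}" if record["title"] else name


print(display(select(duplicates)))       # Osborn, PhD
print(display(consolidate(duplicates)))  # Joe Osborn, PhD
```

With Selection the first name is lost whichever record survives. Consolidation fills each field from whichever duplicate has a value.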
q.address permits complex rules for consolidation, for example: take the name from a preferred file (if several files have different priorities) and, within this file, from the address with the latest date of change. (Refer to Image 1)
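A rule of this kind, preferred file first and then latest date of change, could look roughly like the following Python sketch. It does not reflect how q.address implements or configures its rules. The file names, the `FILE_PRIORITY` mapping, the record fields and the function `pick_name` are all hypothetical.

```python
from datetime import date

# Hypothetical file priorities: a lower number means a more preferred file.
FILE_PRIORITY = {"crm": 0, "newsletter": 1}

# Hypothetical duplicate group; "source" is the file the address came from.
records = [
    {"source": "newsletter", "changed": date(2023, 5, 1), "name": "J. Osborn"},
    {"source": "crm", "changed": date(2021, 3, 10), "name": "Joseph Osborn"},
    {"source": "crm", "changed": date(2022, 8, 20), "name": "Joe Osborn"},
]


def pick_name(records, file_priority):
    """Take the name from the preferred file; within it, the latest change wins."""
    candidates = [r for r in records if r["name"]]
    if not candidates:
        return ""
    best_rank = min(file_priority[r["source"]] for r in candidates)
    preferred = [r for r in candidates if file_priority[r["source"]] == best_rank]
    return max(preferred, key=lambda r: r["changed"])["name"]


print(pick_name(records, FILE_PRIORITY))  # Joe Osborn
```

Here the newsletter record is the most recent overall, but it is ignored because the CRM file has priority. Among the CRM records, the one changed last supplies the name.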
The option of concatenation is also important. If duplicate addresses contain different telephone numbers, the user will usually want to keep all of them. With concatenation, all the telephone numbers are carried over into the result record. (Refer to Image 2)
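As a minimal sketch of concatenation, the following Python function collects the values of one field across a duplicate group. The function name, the separator and the choice to drop identical repeats are assumptions for the example and are not taken from q.address.

```python
def concatenate(records, field, separator="; "):
    """Join all distinct, non-empty values of `field`, keeping first-seen order."""
    seen = []
    for record in records:
        value = record.get(field, "").strip()
        if value and value not in seen:  # assumption: identical values are kept once
            seen.append(value)
    return separator.join(seen)


# Hypothetical duplicate group with differing telephone numbers.
group = [
    {"name": "Joe Osborn", "phone": "555-0100"},
    {"name": "Osborn", "phone": "555-0199"},
    {"name": "J. Osborn", "phone": "555-0100"},
]

print(concatenate(group, "phone"))  # 555-0100; 555-0199
```

The result record then carries both numbers, whereas Selection would keep only the number of the surviving address.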
Observe the numerous possible consolidation functions in Images 1 and 2!
The consolidation (while cleansing address files) is available in: