Fuzzy matching software
What is fuzzy matching?
Fuzzy matching is used to link data residing at disparate tables or sources that do not contain unique identifiers or appropriate primary and foreign keys. In such cases, a combination of non-unique attributes (such as last name, company name, or street address) is used to find the probability of two records being similar.
To find matches accurately, we use a combination of proprietary and established probabilistic data matching techniques that compute the likelihood of two strings being similar. Instead of a Boolean response (in terms of Yes or No), a fuzzy matching algorithm outputs a percentage value or a relative term that marks the similarity index.
How does fuzzy matching work?
Fuzzy score calculation
Match scores are calculated using the best combination of proprietary and established fuzzy algorithms, such as Levenstein Distance, Edit Distance, Soundex, Metaphone, or Cosine Similarity etc.
Fuzzy match configuration
Select suitable weights (prioritize certain fields more than the other), threshold levels (set the boundary between matches and nonmatches), and the type of fuzzy matching (character-based, phonetic, etc.).
Classification and evaluation
Scores are used to classify and group records as a match or nonmatch. Depending on the nature of data, you may encounter some false positive and negative results which require further evaluation.
Let Data Ladder handle your fuzzy matching process
See DataMatch Enterprise at work
DataMatch Enterprise is a highly visual and intuitive fuzzy matching tool, that automates the entire fuzzy matching process, relieving you of the manual effort and labor required to match data fields. DME intelligently identifies acronyms, name reversals and variations, phonetic words, misspellings, as well as abbreviations.
DME leverages a number of fuzzy matching algorithms, along with exact and phonetic matching, to identify and match records across millions of data points from multiple and disparate data sources including relational databases, web applications, and CRMs.