Getting all “Fuzzy” about DeDuplication

Continuing on from last week’s exploration of duplicates (and why we should care about duplication in our systems), this week we will explore some of the different methodologies employed to deal with duplication: the merits of “fuzzy logic”, and the age...

Duplicates causing you Double Trouble?

This week let’s delve into an issue that affects many data sets: duplication. A duplicate is essentially the same entity existing more than once within the same data set. Duplicates exist for a number of reasons. The main reason is multiple data sets...