I have a large data set which contains numerous duplicate entries. The data consists of Company Name, Address fields, City, Town, Postcode, Bank details, VAT number etc etc... however not all the fileds are complete, many fields are blank.
I need to find a way to remove the duplicate entries, which of course is straight forward, I can just use 'Remove Duplicates', but I want to remove the duplicates with the least number of fields completed. There are 15 key fields which require data, but with multpile instances of the same Company Name, one iteration may have 8 fileds completed, another 10 and another 13, how can I ensure that when removing duplicates by looking at the Company Name field, I keep the most populated row in each case and remove the least populated rows?
Many thanks
Bookmarks