+ Reply to Thread
Results 1 to 2 of 2

Removing inexact duplicates

  1. #1
    Registered User
    Join Date
    10-08-2019
    Location
    England
    MS-Off Ver
    2016
    Posts
    1

    Removing inexact duplicates

    Hello! I have large dataset of institutions that contains messy duplicate entries for some of the institutions, like the following:

    1. University of Nairobi libraries
    2. University of nairobi - department of biology
    3. University of nairobi, dept of zoology
    4. The university of Nairobi
    5. University of Nairobi Faculty of Engineering
    Etc.

    Is there a way to (1) remove the inexact duplicates so I’m only left with one of these entries per institution, and/or (2) replace such entries in the example with just “University of Nairobi”?

    Thanks in advance!

  2. #2
    Forum Expert PaulM100's Avatar
    Join Date
    10-09-2017
    Location
    UK
    MS-Off Ver
    Office 365
    Posts
    2,108

    Re: Removing inexact duplicates

    You could create a separate list containing possible entries(like University of Nairobi") let's say in C2:C20. Assuming that your data is in A2 and down, then in B2 you could try a formula like the one below and afterwards you can remove them or filter them:
    =IF(COUNTIF($A$1:$A2,INDEX($C$2:$C$20,MATCH(TRUE,ISNUMBER(SEARCH($C$2:$C$20,A2)),0)))>1,"Remove",INDEX($C$2:$C$20,MATCH(TRUE,ISNUMBER(SEARCH($C$2:$C$20,A2)),0))))
    This is an CSE formula, so press Ctrl+Shift+Enter to activate it.
    Click the * to say thanks.

+ Reply to Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. Remove Inexact Duplicates
    By JillC in forum Excel General
    Replies: 3
    Last Post: 05-30-2017, 05:36 PM
  2. Remove duplicates not removing duplicates bug, so what???!!!
    By abdelrazzaq in forum Excel Formulas & Functions
    Replies: 5
    Last Post: 12-19-2016, 01:43 PM
  3. [SOLVED] removing duplicates only within duplicates on another column
    By eculver in forum Excel Programming / VBA / Macros
    Replies: 3
    Last Post: 10-10-2016, 08:08 PM
  4. Hiding all rows with inexact matches
    By Fair2Midland in forum Excel Programming / VBA / Macros
    Replies: 8
    Last Post: 06-28-2015, 05:27 PM
  5. Remove duplicates function not removing duplicates
    By Berilium2 in forum Excel General
    Replies: 3
    Last Post: 04-01-2015, 06:55 AM
  6. Formula for detecting inexact duplicates?
    By Chucky2222 in forum Excel General
    Replies: 10
    Last Post: 06-21-2012, 08:53 PM
  7. Replies: 6
    Last Post: 03-29-2012, 12:16 AM

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Friendly URLs by vBSEO 3.6.0 RC 1