Hello! I have large dataset of institutions that contains messy duplicate entries for some of the institutions, like the following:
1. University of Nairobi libraries
2. University of nairobi - department of biology
3. University of nairobi, dept of zoology
4. The university of Nairobi
5. University of Nairobi Faculty of Engineering
Etc.
Is there a way to (1) remove the inexact duplicates so I’m only left with one of these entries per institution, and/or (2) replace such entries in the example with just “University of Nairobi”?
Thanks in advance!
Bookmarks