+ Reply to Thread
Results 1 to 7 of 7

Duplicate removal in a large excel sheet

Hybrid View

  1. #1
    Registered User
    Join Date
    08-24-2013
    Location
    India
    MS-Off Ver
    Excel 2011
    Posts
    2

    Duplicate removal in a large excel sheet

    Hi,
    I have an excel sheet with around 12,000 records. This is a database of a set of people. I have around 27 columns and 12,000 rows

    I have the following issue

    I have multiple entries of people with records having some data for e,g, three records of John Doe but one has an address one has a telephone number and one will have nothing.

    How can I merge them such that 4 fields are the same, salutation, first name, last name and company name and records are merged in such a way that all data from rows are in a single record.

    Thanks
    Vijay

    I am on a mac but can get access to a PC

  2. #2
    Valued Forum Contributor Miraun's Avatar
    Join Date
    04-03-2009
    Location
    New England
    MS-Off Ver
    2003, 2007, 2010, 2013
    Posts
    554

    Re: Duplicate removal in a large excel sheet

    Hey Vj2k;

    So, in Excel 2011, there is a remove duplicates function in the Data Ribbon Tab. What this will do is remove all repeat instances of data for your criteria... But, you're making it a little more complex with potentially having incomplete data in the other columns.

    So, I just did some quick testing of the Remove Duplicates option, and what I've found is that it will always keep the first instance that it identifies.

    With that in mind, What we can do in this particular instance is add a 5th column that just simply tells us how many previous columns were filled out.

    By using a =Counta(a1:e1), pulled down for all 12000 records, it'll flag the complete entries.

    From here, sort your data on both the person's name, AND the new Counted field... You want the full counts on the top.

    From here, you can remove duplicates on the person's name, and it should only retain the most full entry associated with that person.

    I hope this helps, and that you can follow the methodology for the use. Post back if you have anymore questions!
    Going for Guru! Click the Star to the bottom left of this post if I helped!

  3. #3
    Registered User
    Join Date
    08-24-2013
    Location
    India
    MS-Off Ver
    Excel 2011
    Posts
    2

    Re: Duplicate removal in a large excel sheet

    Hi,
    Thanks for your reply,
    It does not help me with one thing, I need to merge data for fields which are incomplete

    Vijay

  4. #4
    Valued Forum Contributor Miraun's Avatar
    Join Date
    04-03-2009
    Location
    New England
    MS-Off Ver
    2003, 2007, 2010, 2013
    Posts
    554

    Re: Duplicate removal in a large excel sheet

    Sorry... I'm not sure what you're trying to say. But if it's what I think, when you use the remove duplicates, you should only be concerned with the completed name fields, and not include the entire 4 columns of data.

  5. #5
    Forum Contributor
    Join Date
    08-02-2013
    Location
    Wageningen, The Netherlands
    MS-Off Ver
    365
    Posts
    495

    Re: Duplicate removal in a large excel sheet

    I think VJ's problem is this: if there are several rows of John Doe that have the same numer of data, but still different ones (e.g. name-address-country and name-phonenumber-country) they both have the most data, but keeping only one will be a loss of information.

    But I don't think there is a fully automated solution for this... If you try something like a LOOKUP, you'll always have trouble with adresses that are spelled slightly different (but are in fact the same) etc etc.
    When I say semicolon, u say comma!

  6. #6
    Forum Expert
    Join Date
    12-11-2011
    Location
    Netherlands
    MS-Off Ver
    office 365
    Posts
    3,653

    Re: Duplicate removal in a large excel sheet

    A little excel file will help to find a solution
    Willem
    English is not my native language sorry for errors
    Please correct me if I'm completely wrong

  7. #7
    Administrator FDibbins's Avatar
    Join Date
    12-29-2011
    Location
    Duncansville, PA USA
    MS-Off Ver
    Excel 7/10/13/16/365 (PC ver 2310)
    Posts
    53,048

    Re: Duplicate removal in a large excel sheet

    Hi and welcome to the forum

    I agree with popipipo that you upload a sample work book, (all sensitive data removed), showing what data you are working with, a few examples of what your expected outcome is, and how you arrived at that


    To attach a file to your post,
    click advanced (next to quick post),
    scroll down until you see "manage file",
    click that and select "add files" (top right corner).
    click "select files" find your file, click "open" click "upload" click 'done" bottom right. click "submit reply"

    Once the upload is completed the file name will appear below the input boxes in this window.
    You can then close the window to return to the new post screen.
    1. Use code tags for VBA. [code] Your Code [/code] (or use the # button)
    2. If your question is resolved, mark it SOLVED using the thread tools
    3. Click on the star if you think someone helped you

    Regards
    Ford

+ Reply to Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. Complex Duplicate Removal
    By bluescreenofdeath in forum Excel Programming / VBA / Macros
    Replies: 2
    Last Post: 12-17-2012, 07:28 PM
  2. Duplicate removal
    By teresa1 in forum Excel General
    Replies: 6
    Last Post: 10-14-2010, 07:05 PM
  3. Duplicate data removal
    By dawsonsoo in forum Excel General
    Replies: 3
    Last Post: 07-20-2009, 11:29 AM
  4. Duplicate entry removal
    By north.penny in forum Excel - New Users/Basics
    Replies: 1
    Last Post: 01-09-2009, 09:00 AM
  5. Removal of Duplicate Rows in Large Files
    By Skoal in forum Excel General
    Replies: 2
    Last Post: 05-04-2005, 06:30 PM

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Friendly URLs by vBSEO 3.6.0 RC 1