Hello,
I've spent the last few days trying to come up with a solution in capturing the duplicated information I have in a very large (120000 rows) excel document.
I'm not stuck and would like some advise on solving this.
The spreadsheet, has 3 columns The first show each master record, its ID and the duplicated record and its ID, 2nd column shows the address and the 3rd has the names of the person.
A B C
Master Addresses Name
255812 1 test rd Sydney John Doe
1 test rd Sydney
PO BOX 2 Sydney
421504 Address Name
1 test rd Sydney John DOE
Duplicate records found: 1
Master Addresses Name
455444 1111 House rd Sydney Steven Jacks
1111 House rd Sydney
53421 Address Name
1111 House rd Sydney Steven Jacks
1111 House rd Sydney
Level 5, Kings St Sydney
13232 Address Name
1111 House rd Sydney Steven R Jacks
Level 5, Kings St Sydney
Duplicate records found: 2
What I need to do is being able to confirm that the records listed are really duplicates. I was looking at the row labelled Mater and the last row called "Duplicated records found" to identify the range of rows to search for. However its never the same amounts of rows as it could be one duplicate found or 50. I would like to highlight the matched rows of address in a colour from the master record and also from the duplicated records and also match and highlight the name from master record and duplicated records in another colour.
This would allow me to visually identify the duplicates. Ideally due to the 120000 rows to look at, I would like to be able to look at excel to do most of this work for me and put the results in another column.
Any help would be appreciated in solving this.
Bookmarks