Hey friends,
I have a database full of receipt IDs and need to filter out particular records (i.e. receipts belonging to buyers/sellers that no longer exist). I have spent almost 3-4 hours after work, every day, over the last 5 weeks, and have only finished around ~2200 records, one by one. I underestimated how long it would take, as there are around 330,000 records. So I’m 0.67% of the way there...
I’m thinking there must be a faster way to do this in Excel?
So the process is I’m performing:
1) A look-up of a specific string (in the ‘sub_seller’ column) that can appear in one to many (1:M) records in another table’s (‘Search Table’) column called ‘id’, and
2) Once found, across all the rows found, aggregates all the values in the column next to it (‘owner_id’), including the search/lookup string itself, and dumping this result set into the adjacent cell in the original table
3) I’d then like to repeat this process for the other column (obj_buyer) in that same original table.
4) Once the process (lookup, aggregation, data dump) has been performed for both tables, I’d like to join only the common values across the two data dumps only the common values (i.e. INTERSECT operation) into a separate column (‘INTERSECT’).
5) I then have a filter/exclusion table, where if any of the values (id’s) in ‘UNION’ (sorry I just realised I meant to use 'INTERSECT') are found in said table, they are to be excluded from the results (which will help to also exclude all the receipts associated with those excluded id’s
I’ve attached a clean file containing a complete example of this process above. I’ve also put screenshots below to show what each step above looks like / is doing:
(Also just a note, I realised '1636' (lookup value) should also be in the 'aggregate_seller_owners' column, apologies!
Step 1)
Step 2)
Step 3)
Step 4)
Step 5)
Any ideas? Any assistance would be greatly appreciated.
Thanks so much guys!
Bookmarks