Hi guys and girls,
I've got a sheet in which I need to look at data that I pull in from a CSV file.
Now I can pull in all the data into the sheet by just importing the CSV, but it will be hundreds of thousands of rows and to be honest, it makes the file too big and calculating it every time too slow.
So here's the setup :
I have around 500 different products - each identified by a PRODUCT_ID
I want to find out where these products are typically shipped to - that's where the CSV file comes in.
The CSV contains 6 columns as below (these are the headers) - for each day (DAY, column 1) there is a record of all (AMOUNT, column 6) products (PRODUCT_ID, column 3) that are shipped to a warehouse (WH, column 5).
DAY, LOC, PRODUCT_ID, PRODUCT_CODE, WH, AMOUNT
I want to be able to set a date period (for example 1/1/2015 - 1/31/2015) and for each product go through the rows of those dates and find the division of how many are shipped to each warehouse.
So in the end, I'd like the result to look something like this:
PRODUCT_ID TIME PERIOD WH AMOUNT % OF TOTAL
X322-AC 01/01/2015 - 01/31/2015 AMS 458 8.40%
X322-AC 01/01/2015 - 01/31/2015 BRU 1,118 20.50%
X322-AC 01/01/2015 - 01/31/2015 … … …
X322-AC 01/01/2015 - 01/31/2015 … … …
Now, I know how to do this slowly by importing all the data from the CSV into my sheet, then filtering on the product id, looping through the visible cells and assign each warehouse to a variable, then divide the variables by the total to get the percentage per warehouse.
But - IDEALLY, I'd like not to have to import the CSV contents at all, but just pull the relevant numbers from the closed file and only show the result.
Can this be done?
Name of the CSV will be "shipments.txt" and it's tab delimited.
Thanks,
Jasper
Bookmarks