Hi there,
I am currently trying to assess how small a sample of fruit size can be while still returning a value close enough to the true mean fruit size. I have a multiple datasets with 100 measurements in each. For each of these datasets I want to resample 30 times for n=10, n=20, n=50.... and so on. I don't need to see the values for the 30 resamples, I just want the 30 means, one from each resample. With these, I can assess the variation between them to decide on the smallest sample size with an acceptable amount of error.
What I would like to do would be to automate or speed up the random resampling from the n=100 dataset. I've tried multiple approaches (using random numbers and the sampling tool in data analysis), but when repeating 30 times and for various n= resamples, it is still very time consuming.
Any advice would be appreciated.
Cheers.
Bookmarks