I have a list of, say, 500 samples, some of which are duplicates.
I want to randomly assign all 500 samples to 20 groups (25 in each group). I can do this using rand() and a helper column.
However, I don't want any of the 20 groups to contain duplicates, i.e. all 25 members of each group must be unique.
How can I do this, please? Thanks.
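
One possible approach, sketched in Python rather than in the spreadsheet itself (the function name `assign_groups` and its parameters are made up for illustration): put identical samples next to each other, with the order of the distinct values shuffled, then deal the list out to the groups one at a time like a deck of cards. Because the copies of a value sit in consecutive positions, they land in different groups, provided no value occurs more than 20 times (otherwise the problem has no solution). This is only a rough sketch; it produces a random valid assignment, but not necessarily every valid assignment with equal probability.

```python
import random
from collections import Counter


def assign_groups(samples, n_groups=20, seed=None):
    """Split samples into n_groups equal groups with no duplicates inside a group.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)
    if len(samples) % n_groups != 0:
        raise ValueError("number of samples must be a multiple of n_groups")

    counts = Counter(samples)
    if max(counts.values(), default=0) > n_groups:
        raise ValueError("a value occurs more often than there are groups")

    # Shuffle the distinct values, then lay out all copies of each value
    # next to each other.
    values = list(counts)
    rng.shuffle(values)
    ordered = [v for v in values for _ in range(counts[v])]

    # Shuffle the group labels so the dealing order is random as well.
    labels = list(range(n_groups))
    rng.shuffle(labels)

    # Deal round-robin: the copies of a value occupy consecutive positions,
    # and at most n_groups consecutive positions always hit distinct groups.
    groups = [[] for _ in range(n_groups)]
    for i, v in enumerate(ordered):
        groups[labels[i % n_groups]].append(v)
    return groups


if __name__ == "__main__":
    # Made-up data: 200 values occurring twice plus 100 occurring once = 500.
    demo = [f"S{i % 300}" for i in range(500)]
    result = assign_groups(demo, n_groups=20, seed=1)
    print([len(g) for g in result])
```

The same idea should carry over to a helper-column setup: sort so that duplicates are adjacent, number the rows, and take the row number modulo 20 as the group.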