Hi there!
I'm new to the forum and came on as I've been trying to split a word list in groups that correspond to the words' syllabic structure, such as CV, VC, CVC, VCC and so on, (where 'C' = consonant and 'V' = vowel).

Here is a tiny sample list from 40,000 words:

bá bé ab ag aba abú cat bun agam agat


The logic of the macro would run as follows:

Vowels (V): a, á, o, ó, u, ú, abh, adh, agh etc [the last three are vowels or diphthongs as it happens]
Consonants (C): b, c, ch, f, gh etc

Operation: word parsed for C and V pattern (ex: 'agat' is VCVC) and placed within its cohort



Resulting in:

ab (CV)
ag

bá (VC)


aba (VCV)
abú

bun (CVC)
cat

agam (VCVC)
agat


[The codes are purely for explanatory purposes]


Any help with this would be greatly appreciated!

James