Hi All!
I am trying to sort 77,000+ lines of alphanumeric data. Ideally I don't want to split text to columns and filter on separate pieces as there is far too much data and the string is of varying lengths (I've so far only done this on the letter and still a large portion is out of sequence).
There are multiple variations of the alphanumeric data and i would first like it to order by the middle letter 'a','b','c','dthen by the first string of numbers, then by the other string of number, and then finally the numbers on the end.
[second element (number string with/without colon (separating the numbers - it is still one string of varying length)].[first element (one character letter).[third element (number of character length 2-3)].[fourth element (number of characters 1-4)]
The primary issue is there is a lot of decimalisation, varying punctuation, no fixed lengths which makes this a nightmare!
I can also access the raw data in SQL so i have the ability to Regex there too! e.g. evaluate('REGEXP_INSTR(%1, ''^[0-9].*A+$'')',"TABLE"."COLUMN")
example of the order i would like :
18.a.89.1
18:27.a.90.1
18:3.a.90.1
18:3.a.90.1a
18:3.a.90.11a
18:3.a.90.22
18:3.a.90.22a
18:3.a.90.23
67:7.a.90.5
69:01.a.1.1-
70:1.a.85.1
70:1.a.85.1-4
70:1.a.85.12-
100:01.b.7.53
10:1.b.90.1
10:1.b.95.1
10:1.b.95.2
10:1.b.95.3
10:2.b.95.1
10:2.b.95.2
58:5.b.201.1
58:5.b.95.1-2
58:5.b.95.6
59:1.b.200.1
69:01.b.36.55
2:55.c.95.422
2:55.c.95.423
2:55.c.95.424
44:5.c.201.16
44:5.c.201.5
44:5.c.90.22
60:1.c.200.14
60:1.c.200.2
60:1.c.202.2
1:8.d.95.2
1:8.d.95.3
1:8.d.95.4
61:29.d.90.1
Bookmarks