Match all occurrences Regex Java

Question

i'd like to recognize all sequences of "word-number-word" of a string with Regex Java API.

For example, if i have "ABC-122-JDHFHG-456-MKJD", i'd like the output : [ABC-122-JDHFHG, JDHFHG-456-MKJD].

String test = "ABC-122-JDHFHG-456-MKJD";

Matcher m = Pattern.compile("(([A-Z]+)-([0-9]+)-([A-Z]+))+")
            .matcher(test);
while (m.find()) {
    System.out.println(m.group());
}

The code above return only "ABC-122-JDHFHG".

Any ideas ?

Answer 1

The last ([AZ]+) matches and consumes JDHFHG , so the regex engine only "sees" -456-MKJD after the first match, and the pattern does not match this string remainder.

You want to get "whole word" overlapping matches.

Use

String test = "ABC-122-JDHFHG-456-MKJD";

Matcher m = Pattern.compile("(?=\\b([A-Z]+-[0-9]+-[A-Z]+)\\b)")
            .matcher(test);
while (m.find()) {
    System.out.println(m.group(1));
} // => [ ABC-122-JDHFHG, JDHFHG-456-MKJD ]

See the Java demo

Pattern details

(?= - start of a positive lookahead that matches a position that is immediately followed with
- \\\\b - a word boundary
- ( - start of a capturing group (to be able to grab the value you need)
- [AZ]+ - 1+ ASCII uppercase letters
- - - a hyphen
- [0-9]+ - 1+ digits
- - - a hyphen
- [AZ]+ - 1+ ASCII uppercase letters
) - end of the capturing group
\\\\b - a word boundary
) - end of the lookahead construct.

Answer 2

Here you go, overlap the last word.
Make an array out of capture group 1.

Basically, find 3 consume 2. This makes the next match position start
on the next possible known word.

(?=(([AZ]+-\\d+-)[AZ]+))\\2

https://regex101.com/r/Sl5FgT/1

Formatted

 (?=                           # Assert to find 
      (                             # (1 start), word,num,word
           (                             # (2 start), word,num
                [A-Z]+ 
                -
                \d+ 
                -
           )                             # (2 end)
           [A-Z]+ 
      )                             # (1 end)
 )
 \2                            # Consume word,num

Match all occurrences Regex Java

Question

2 answers

solution1
3 2018-01-17 22:34:47

solution2
0

Match all occurrences Regex Java

Question

2 answers

solution1 3 2018-01-17 22:34:47

solution2 0

solution1
3 2018-01-17 22:34:47

solution2
0