ruby regex scan multiple match

Question

I am trying to get the text between two tag.

 foobar  => bar

I tried using 'asdasdqwe '.scan(/[a-zA-Z0-9]*<\\/b>(.*)<br\\/>/) and it gives me proper result.

but when I try this :

'<b>exclude</b>op1<br/>exclude 2<b>exclude</b>op2<br/>exclude 2<b>exclude</b>op3<br/>exclude 2'.scan(/<b>[a-zA-Z0-9]*<\/b>(.*)<br\/>/) { |ele|
puts ele
}

It matches the first  tag and the last   tag and returns the whole string I was expecting an array of matches

Answer 1

Instead of using regex on html use nokogiri:

Nokogiri::HTML.fragment(str).css('b').each do |b|
    puts b.next.text
end

Answer 2

Change (.*) to (.*?) to make it ungreedy

/<b>[a-zA-Z0-9]*<\/b>(.*?)<br\/>/

Test

[2] pry(main)> '<b>exclude</b>op1<br/>exclude 2<b>exclude</b>op2<br/>exclude 2<b>exclude</b>op3<br/>exclude 2'.scan(/<b>[a-zA-Z0-9]*<\/b>(.*?)<br\/>/) { |ele|
[2] pry(main)*   puts ele
[2] pry(main)* }  
op1
op2
op3

ruby regex scan multiple match

Question

2 answers

solution1
9 2011-11-25 08:28:13

solution2
8 ACCPTED 2011-11-25 06:44:32

ruby regex scan multiple match

Question

2 answers

solution1 9 2011-11-25 08:28:13

solution2 8 ACCPTED 2011-11-25 06:44:32

solution1
9 2011-11-25 08:28:13

solution2
8 ACCPTED 2011-11-25 06:44:32