regex: find strings that start with [[ and specific string

Question

I have these strings:

[[:File:Example.jpg]]
[[:File:Example.jpg|this example]]
[[Media:Example.jpg]]
[[Georgia (U.S. state)|Georgia]]
[[Arkansas]]
[[Canada]]
[[Virginia]]
[[Image:Houstonia longifolia - Long Leaf Bluet 2.jpg|thumb|left]]

I want to extract with re the strings that start with [[Image or [[Media: or [[:file:

Answer 1

To find the strings beginning with [[:File: , you can use:

  re.search(r"\[\[:File.*?]]", your_strings)

Same with [[Media: and [[Image :

  re.search(r"\[\[Media:.*?]]", your_strings)

  re.search(r"\[\[Image.*?]]", your_strings)

See this example .

Answer 2

Try this Regex

Outputs only if there is [[Image , [[Media: or [[:File: at the beginning of the string (also added re.IGNORECASE flag to give the match at any case)

\\[\\[(?:Image|Media|:File):.+]]

Code:

import re

a = '''[[:File:Example.jpg]]
[[:File:Example.jpg|this example]]
[[Media:Example.jpg]]
[[Georgia (U.S. state)|Georgia]]
[[Arkansas]]
[[Canada]]
[[Virginia]]
[[Image:Houstonia longifolia - Long Leaf Bluet 2.jpg|thumb|left]]'''

print(re.findall(r'\[\[(?:Image|Media|:File):.+]]', a, flags=re.IGNORECASE))

Outputs:

[
 '[[:File:Example.jpg]]',
 '[[:File:Example.jpg|this example]]',
 '[[Media:Example.jpg]]',
 '[[Image:Houstonia longifolia - Long Leaf Bluet 2.jpg|thumb|left]]'
]

Regex101 Demo

Tell me if its not working...

regex: find strings that start with [[ and specific string

Question

2 answers

solution1
0 2021-10-30 13:54:58

solution2
0 2021-10-30 13:58:50

regex: find strings that start with [[ and specific string

Question

2 answers

solution1 0 2021-10-30 13:54:58

solution2 0 2021-10-30 13:58:50

solution1
0 2021-10-30 13:54:58

solution2
0 2021-10-30 13:58:50