Extract URL from lines with specific TLD ReGex

Question

Hi everyone I'm trying to extract URL from a file With the specific ending of ".eu" like.com.

I have this code to get a list of URLs but not with a specific ending. Can anyone improve it to get a specific TLD at the end?

urls = re.findall('https?://(?:[-\w.]|(?:%[\da-fA-F]{2}))+', line).

example of lines and expected results.

akijsdijas adsfaasd asfda https://www.google.eu/asd34a/as3df asdfs dsf76

a56 64ijas adsfaasd asfda https://www.facebook.eu/asd34a/as3df asdfs345 dsf76

fghddijas adsfaasd asfda https://www.facebook.com/asd34a/as3df asdfs dsf76

Expected results:

Answer 1

You may use

re.findall(r'https?://\S*?\.eu\b', line)

See the regex demo .

The regex matches:

Answer 2

try this

urls = re.findall(r'https?://\S*\.eu\b')