Find string after “task-” in a long substring using regex

Question

I have list of files with a pattern sub-*_task-XYZabc_run-*_bold.json and sub-*_task-PQRghu_bold.json , for example:

sub-03_task-dis_run-01_bold.json
sub-03_task-dis_run-02_bold.json
sub-03_task-dis_run-03_bold.json
sub-03_task-dis_run-04_bold.json
sub-03_task-dis_run-05_bold.json
sub-03_task-dis_run-06_bold.json
sub-03_task-fb_run-01_bold.json
sub-03_task-fb_run-02_bold.json
sub-03_task-fb_run-03_bold.json
sub-03_task-fb_run-04_bold.json

I intend to find all different task names from the filename. In the above example, dis and fb are the two tasks.

What kind of regex should I use to find TASKNAME from task-TASKNAME in a given filename?

Answer 1

The following regex should do it :

(?<=task-).*?(?=_)

see regex demo / explanation

python ( demo )

import re
regex = r"(?<=task-).*?(?=_)"
str = """sub-03_task-dis_run-01_bold.json
         sub-03_task-dis_run-02_bold.json
         sub-03_task-dis_run-03_bold.json
         sub-03_task-dis_run-04_bold.json
         sub-03_task-dis_run-05_bold.json
         sub-03_task-dis_run-06_bold.json
         sub-03_task-fb_run-01_bold.json
         sub-03_task-fb_run-02_bold.json
         sub-03_task-fb_run-03_bold.json
         sub-03_task-fb_run-04_bold.json"""
matches = re.finditer(regex, str)
for matchNum, match in enumerate(matches):
    matchNum = matchNum + 1
    print ("{match}".format(matchNum = matchNum, start = match.start(), end = match.end(), match = match.group()))

Find string after “task-” in a long substring using regex

Question

1 answers

solution1
2 2017-02-27 22:42:34

Find string after “task-” in a long substring using regex

Question

1 answers

solution1 2 2017-02-27 22:42:34

solution1
2 2017-02-27 22:42:34