Perl remove multiple line that match regex

Question

I have a file that looks like this:

*
TEST CASE1,
$ some text unque633
PLACEMENT 123
*
TEST CASE2,
$ some text unque759
PLACEMENT 321
*
TEST CASE3,
$ some text unque966
PLACEMENT 856
*

I want to remove multiple lines that match regex. For example, need to remove starting from TEST CASE2 until the line begining with * . How can this be done within perl script. Also, how would I edit text from TEST CASE2 if I only know unque759 . Much appreciated.

Answer 1

Desired goal can be easily achieved with following algorithm: read all data into a variable, substitute block 'TEST CASE2.....' before next '*' to nothing, output result

use strict;
use warnings;
use feature 'say';

my $data = do { local $/; <DATA> };     # read all data at once

$data =~ s/TEST CASE2[^*]*//s;          # substitute requested block with nothing

say $data;

__DATA__
*
TEST CASE1,
$ some text
PLACEMENT 123
*
TEST CASE2,
$ some text
PLACEMENT 321
*
TEST CASE3,
$ some text
PLACEMENT 856
*

Output

*
TEST CASE1,
$ some text
PLACEMENT 123
*
*
TEST CASE3,
$ some text
PLACEMENT 856
*

Answer 2

This will remove blocks beginning with TEST CASE2, , ending with * and containing unque759

cat file.txt 
*
TEST CASE1,
$ some text unque633
PLACEMENT 123
*
TEST CASE2,
$ some text unque759
PLACEMENT 321
*
TEST CASE2,
$ some text unque999
PLACEMENT 321
*
TEST CASE3,
$ some text unque966
PLACEMENT 856
*

perl -0777 -ape 's/TEST CASE2,[^*]+?\bunque759\b[^*]+?\*(?:\R|\z)//' file.txt 
*
TEST CASE1,
$ some text unque633
PLACEMENT 123
*
TEST CASE2,
$ some text unque999
PLACEMENT 321
*
TEST CASE3,
$ some text unque966
PLACEMENT 856
*

Explanation:

-0777            # “slurp” mode, read the file in a single string
s/               # substitute
    TEST CASE2,     # literally
    [^*]+?          # 1 or more non asterisk, not greedy
    \b              # word boundary
    unque759        # literally
    \b              # word boundary
    [^*]+?          # 1 or more non asterisk, not greedy
    \*              # an asterisk
    (?:\R|\z)       # non capture group, end of line OR end of string
//               with nothing

Perl remove multiple line that match regex

Question

2 answers

solution1
2 ACCPTED 2020-03-06 04:15:10

solution2
1 2020-03-06 09:34:39

Perl remove multiple line that match regex

Question

2 answers

solution1 2 ACCPTED 2020-03-06 04:15:10

solution2 1 2020-03-06 09:34:39

solution1
2 ACCPTED 2020-03-06 04:15:10

solution2
1 2020-03-06 09:34:39