Trimming a file with regex / sed

Question

I've got a file with several lines like this:

*wordX*-Sentence1.;Sentence2.;Sentence3.;Sentence4.

One of these sentences may or may not contain wordX. What I want is to trim the file to make it look like this:

*wordX*-Sentence1.;Sentence2.

Here Sentence3 was the first sentence to contain wordX, so it and everything after it are dropped.

How can I do this with sed/awk?

Edit:

Here's a sample file:

*WordA*-This sentence does not contain what i want.%Neither does this one.;Not here either.;Not here.;Here is WordA.;But not here.
*WordB*-WordA here.;WordB here, time to delete everything.;Including this sentece.
*WordC*-WordA, WordB. %Sample sentence one.;Sample Sentence 2.;Sample sentence 3.;Sample sentence 4.;WordC.;Discard this.

And here is the desired output:

*WordA*-This sentence does not contain what i want.%Neither does this one.;Not here either.;Not here.
*WordB*-WordA here.
*WordC*-WordA, WordB. %Sample sentence one.;Sample Sentence 2.;Sample sentence 3.;Sample sentence 4.

Answer 1

This task is better suited to awk. Use the following awk command:

awk -F ";" '/^ *\*.*?\*/ {printf("%s;%s\n", $1, $2)}' inFile

This assumes that the words you are trying to match are always wrapped in asterisks (*).
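The one-liner above always prints the first two ;-separated fields, which matches the abstract example in the question. For the sample file, where the cut point differs on every line, a minimal sketch could look like the following. It assumes each line starts with *wordX* immediately followed by a single - and that sentences are separated by ; (the function name trim_sentences is hypothetical, chosen only for illustration):

```shell
# trim_sentences: hypothetical helper, not part of the original answer.
# Reads lines of the form "*wordX*-Sentence1.;Sentence2.;..." and keeps only
# the sentences that come before the first one containing wordX.
trim_sentences() {
    awk '{
        # The word sits between the first two asterisks of the line.
        start = index($0, "*")
        rest  = substr($0, start + 1)
        end   = index(rest, "*")

        # Lines without a leading "*word*" header are passed through unchanged.
        if (start != 1 || end == 0) { print; next }

        word = substr(rest, 1, end - 1)
        body = substr(rest, end + 2)      # skip the closing "*" and the "-"

        # Copy sentences until one of them contains the word (literal match).
        n = split(body, s, ";")
        out = ""
        for (i = 1; i <= n; i++) {
            if (index(s[i], word) > 0) break
            out = out (i > 1 ? ";" : "") s[i]
        }
        print "*" word "*-" out
    }' "$@"
}
```

It can be called as trim_sentences inFile, or fed from a pipe. Because index() does a plain substring test, a word such as WordA would also match inside a longer token like WordAB; whether that is acceptable depends on the data.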

Answer 2

This might work for you (GNU sed):

sed -r 's/-/;/;:a;s/^(\*([^*]+)\*.*);[^;]*\2.*/\1;/;ta;s/;/-/;s/;$//' file

Convert the - following wordX to a ;. Delete sentences containing wordX (working from the back to the front of the line). Replace the original -. Delete the last ;.
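To see what each step does, the command can be split into three separate sed calls and run on one line of the sample file. This is only an illustrative sketch and assumes GNU sed. The variable names line, s1, s2 and s3 are made up for the example, and the sentence pattern here uses [^;]* so that a sentence which begins with the word is matched as well:

```shell
line='*WordB*-WordA here.;WordB here, time to delete everything.;Including this sentece.'

# Stage 1: turn the "-" after the header into ";" so every sentence,
# including the first, is preceded by ";".
s1=$(printf '%s\n' "$line" | sed -r 's/-/;/')

# Stage 2: the greedy .* finds the last sentence containing the word and cuts
# from there to the end of the line; the :a ... ta loop repeats until no
# sentence containing the word is left.
s2=$(printf '%s\n' "$s1" | sed -r ':a;s/^(\*([^*]+)\*.*);[^;]*\2.*/\1;/;ta')

# Stage 3: restore the "-" after the header and drop the trailing ";".
s3=$(printf '%s\n' "$s2" | sed -r 's/;/-/;s/;$//')

printf '%s\n' "$s1" "$s2" "$s3"
```

The last line printed is the trimmed result; the first two show the intermediate forms with ; standing in for the -.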

Answer 3

sed -r -e 's/\.;/\n/g' \
       -e 's/-/\n/' \
       -e 's/^(\*([^*]*).*\n)[^\n]*\2.*/\1/' \
       -e 's/\n/-/' \
       -e 's/\n/.;/g' \
       -e 's/;$//'

(edit: added the - : \n swaps to handle a match in the first sentence.)

3 answers

Answer 1
1 2013-05-08 19:08:39

Answer 2
0 2013-05-08 21:07:29

Answer 3
0 2013-05-09 15:06:53
