在bash脚本中首次出现模式之前删除所有字符串

Question

so i've got a problem. 所以我有一个问题。 I want to remove the whole content of a file before a certain pattern has been found, but only at the first occurrence of it. 我想在找到某种模式之前删除文件的全部内容，但是只在它第一次出现时才删除。 the pattern : ([0-9]{2}:[0-9]{2}:[0-9]{2}).([0-9]{6}) (it is fits the date part of the string). 模式： ([0-9]{2}:[0-9]{2}:[0-9]{2}).([0-9]{6}) （适合日期中的日期部分字符串）。

For example, this content: 例如，此内容：

-- 10:17:40.614568 00:00:00:00:00:00 > ff:ff:ff:ff:ff:ff, ethertype IPv4 (0x0800), length 303: (tos 0x0, ttl 64, id 0, offset 0, flags [DF], proto UDP (17), length 289) 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 00:00:00:00:00:00, length 261, xid 0x1466f606, Flags [Broadcast]

should be parsed to : 应该解析为：

10:17:40.614568 00:00:00:00:00:00 > ff:ff:ff:ff:ff:ff, ethertype IPv4 (0x0800), length 303: (tos 0x0, ttl 64, id 0, offset 0, flags [DF], proto UDP (17), length 289) 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 00:00:00:00:00:00, length 261, xid 0x1466f606, Flags [Broadcast]

Answer 1

EDIT: Since OP mentioned condition to be met only for very first regex match in entire Input_file so adding this solution now. 编辑：由于OP提到仅在整个Input_file中的第一个正则表达式匹配时才满足条件，所以现在添加此解决方案。

awk --re-interval '
match($0,/[0-9]{2}:[0-9]{2}:[0-9]{2}\.[0-9]{6}/)  && !index{
  print substr($0,RSTART)
  index=1
  next
}
index ' Input_file

Could you please try following. 您可以尝试以下吗？

awk --re-interval '
match($0,/[0-9]{2}:[0-9]{2}:[0-9]{2}\.[0-9]{6}/){
  print substr($0,RSTART)
}' Input_file

It will only print those lines which have a regex match found. 它只会打印那些找到正则表达式匹配项的行。 In case you want to print those lines too which NOT have a match then do following. 如果您也要打印不匹配的行，请执行以下操作。

awk --re-interval '
match($0,/[0-9]{2}:[0-9]{2}:[0-9]{2}\.[0-9]{6}/){
  print substr($0,RSTART)
  next
}
1' Input_file

Since I am using an OLD awk version so I have out --re-interval remove it in case for you above code works(new versions of awk it works) 由于我使用的是旧版本的awk因此我会--re-interval将其删除，以防上述代码对您--re-interval （新版本的awk可以使用）

Explanation of 1st code: 第一个代码的说明：

awk --re-interval '                                ##Starting awk program from here and --re-interval enables ERE for OLD versions of awk.
match($0,/[0-9]{2}:[0-9]{2}:[0-9]{2}\.[0-9]{6}/){  ##Using match utility of awk here which will check REGEX provided to it either it presents on line or not.
  print substr($0,RSTART)                          ##match utility/function has variable named RSTART which denotes the starting of REGEX point, so I am using substring function to print from starting of that point to till end of line, since OP want to remove everything before REGEX match.
}
' Input_file                                       ##Mentioning Input_file name here.

For 2nd code's explanation will be same as 1st code only difference is 2nd code has next will skip lines whose regex is matched and 1 will print non-matched lines in Input_file. 对于第二代码的说明将与第一代码相同，只是不同之处在于第二代码next将跳过其正则表达式匹配的行，而1将在Input_file中打印不匹配的行。

在bash脚本中首次出现模式之前删除所有字符串

问题描述

1 个解决方案

解决方案1
1 已采纳 2018-08-19 11:29:02

在bash脚本中首次出现模式之前删除所有字符串

问题描述

1 个解决方案

解决方案1 1 已采纳 2018-08-19 11:29:02

解决方案1
1 已采纳 2018-08-19 11:29:02