使用bash“拖尾”基于字符串位置的二进制文件？

Question

I've got a bunch of binary files, each containing an embedded string near the end of the file but at different places (only occurs once in each file). 我有一堆二进制文件，每个文件都包含一个嵌入字符串靠近文件末尾但在不同的地方（每个文件只出现一次）。 I need to extract the part of the file starting at the location of the string till the end of the file and dump it into a new file. 我需要从字符串的位置开始提取文件的一部分，直到文件的末尾并将其转储到新文件中。

eg. 例如。 If the file's contents is "AWREDEDEDEXXXERESSDSDS" and the string of interest is "XXX", then the part of the file I need is "XXXERESSDSDS". 如果文件的内容是“AWREDEDEDEXXXERESSDSDS”并且感兴趣的字符串是“XXX”，那么我需要的文件部分是“XXXERESSDSDS”。

What's the easiest way to do this in bash? 在bash中最简单的方法是什么？

Answer 1

In PERL, there is a variable built in that specifically refers to the part of the string after the matched regular expression. 在PERL中，有一个内置的变量，专门引用匹配的正则表达式后的字符串部分。 That would be the method I would use. 这将是我将使用的方法。 It is not just Bash and utilities, but PERL is so commonly installed that you should be OK. 它不只是Bash和实用程序，但PERL是如此常见的安装，你应该没问题。

Answer 2

Following is a small hack shell solution that is not very performant. 以下是一个不太高效的小型hack shell解决方案。 But it works. 但它的确有效。

Write the script file tail.sh as follows: 编写脚本文件tail.sh如下：

#!/bin/sh
dd bs=1 if=$1 of=$2 skip=`grep --binary-files=text -m1 -b -o $3 $1 | cut -d ':' -f 1 | head -1`

Call tail.sh INPUTNAME OUTPUTNAME PATTERN 调用tail.sh INPUTNAME OUTPUTNAME PATTERN

ps: sorry forgot one option to grep in first post ps：抱歉忘记了第一篇文章中grep的一个选项

Answer 3

Would strings and grep do you want? 你想要strings和grep吗？

eg 例如

strings -n 3 myfilename | grep XXX

Answer 4

 strings -n3 file_binary | awk '/XXX/{gsub(/.*XXX/,"");print}'

Answer 5

I came up with this solution: 我想出了这个解决方案：

ls -1 *.bin | xargs strings -n4 --radix=d -f | grep "string" | awk '{sub(/:/, ""); print $2 " " $1 " " $1".";}' | xargs -l1 split -b && rm *.aa

ls -1 *.bin Print only the filenames with the extension "bin" in a list format ls -1 * .bin 仅以列表格式打印扩展名为“bin”的文件名

xargs strings -n4 --radix=d -f List all the strings in the file and their positions and include the filename in the output xargs strings -n4 --radix = d -f 列出文件中的所有字符串及其位置，并在输出中包含文件名

grep "string" Print lines containing "string" (it only occurs once in each file) grep“string” 打印包含“string”的行（每个文件只出现一次）

awk '{sub(/:/, ""); awk'{sub（/：/，“”）; print $2 " " $1 " " $1".";}' Remove the colon after the filename added by strings, and print the position of the string, the filename, and the filename with a period (this line is used as the arguments for the split command print $ 2“”$ 1“”$ 1“。”;}' 在字符串添加文件名后删除冒号，并用句点打印字符串的位置，文件名和文件名（此行用作参数split命令

xargs -l1 split -b Execute the split command for each line using the output of awk as the rest of the arguments xargs -l1 split -b 使用awk的输出作为其余参数，为每一行执行split命令

rm *.aa Delete the first parts of the split files. rm * .aa 删除拆分文件的第一部分。 "aa" is the default suffix for the part of the split files. “aa”是拆分文件部分的默认后缀。

There are probably better/faster/safer ways of doing this but it's fine for my purposes. 可能有更好/更快/更安全的方法，但这对我的目的来说很好。

Answer 6

Try this: 尝试这个：

grep -ao string.* filename

Since you have binary data, you might want to redirect the output to a file. 由于您有二进制数据，因此您可能希望将输出重定向到文件。

grep -ao string.* filename > binary.out

Or pipe it through hexdump or similar for testing: 或者通过hexdump或类似方法将其管道进行测试：

grep -ao string.* filename | hd

使用bash“拖尾”基于字符串位置的二进制文件？

问题描述

6 个解决方案

解决方案1
1 2010-03-30 14:25:40

解决方案2
1 2010-03-30 14:34:49

解决方案3
0 2010-03-30 14:18:15

解决方案4
0 2010-03-30 14:30:08

解决方案5
0 已采纳 2010-03-31 08:42:14

解决方案6
-1 2010-03-30 15:12:36

使用bash“拖尾”基于字符串位置的二进制文件？

问题描述

6 个解决方案

解决方案1 1 2010-03-30 14:25:40

解决方案2 1 2010-03-30 14:34:49

解决方案3 0 2010-03-30 14:18:15

解决方案4 0 2010-03-30 14:30:08

解决方案5 0 已采纳 2010-03-31 08:42:14

解决方案6 -1 2010-03-30 15:12:36

解决方案1
1 2010-03-30 14:25:40

解决方案2
1 2010-03-30 14:34:49

解决方案3
0 2010-03-30 14:18:15

解决方案4
0 2010-03-30 14:30:08

解决方案5
0 已采纳 2010-03-31 08:42:14

解决方案6
-1 2010-03-30 15:12:36