Bash grep in file which is in another file

Question

I have 2 files, one contains this : file1.txt

632121S0 126.78.202.250 1
131145S0 126.178.20.250 1

the other contain this : file2.txt

632121S0        126.78.202.250  OBS
131145S0        126.178.20.250  OBS
313359S2        126.137.37.250  OBS

I want to end up with a third file which contains :

632121S0        126.78.202.250  OBS
131145S0        126.178.20.250  OBS

Only the lines which start by the same string in both files. I can't remember how to do it. I tried several grep, egrep and find, i still cannot use it properly... Can you help please ?

Answer 1

You can use this awk:

$ awk 'FNR==NR {a[$1]; next} $1 in a' f1 f2
632121S0        126.78.202.250  OBS
131145S0        126.178.20.250  OBS

It is based on the idea of two file processing , by looping through files as this:

first loop through first file, storing the first field in the array a .
then loop through second file, checking if its first field is in the array a . If that is true, the line is printed.

Answer 2

To do this with grep, you need to use a process substitution :

grep -f <(cut -d' ' -f1 file1.txt) file2.txt

grep -f uses a file as a list of patterns to search for within file2. In this case, instead of passing file1 unaltered, process substitution is used to output only the first column of the file.

Answer 3

If you have a lot of these lines, then the utility join would likely be useful.

join - join lines of two files on a common field

Here's a set of examples .

Bash grep in file which is in another file

Question

3 answers

solution1
3 ACCPTED 2014-05-06 09:47:28

solution2
1 2014-05-06 09:51:46

solution3
1 2014-05-06 09:52:35

Bash grep in file which is in another file

Question

3 answers

solution1 3 ACCPTED 2014-05-06 09:47:28

solution2 1 2014-05-06 09:51:46

solution3 1 2014-05-06 09:52:35

solution1
3 ACCPTED 2014-05-06 09:47:28

solution2
1 2014-05-06 09:51:46

solution3
1 2014-05-06 09:52:35