打印唯一的行，比较不超过N个字符

Question

With uniq , you can choose to compare only first N characters 使用uniq ，您可以选择仅比较前N字符

$ cat foo.txt
The quick brown fox jumps over the lazy dog.
The quick brown fox jumps over the lazy cat.
The quick brown fox jumps over the lazy mouse.

$ uniq -w 40 foo.txt
The quick brown fox jumps over the lazy dog.

Can the same effect be achieved using awk ? 使用awk可以达到相同的效果吗？ I read this example 我读了这个例子

awk '!a[$0]++'

but it compares the whole line. 但它会比较整行。

Answer 1

awk has substr() function: awk具有substr()函数：

awk '!a[substr($0,1,40)]++'

with your example: 与您的示例：

kent$  echo "The quick brown fox jumps over the lazy dog.
The quick brown fox jumps over the lazy cat.
The quick brown fox jumps over the lazy mouse."|awk '!a[substr($0,1,40)]++'
The quick brown fox jumps over the lazy dog

Answer 2

Two alternatives using FIELDWIDTHS and FPAT : 使用FIELDWIDTHS和FPAT两种选择：

awk '!a[$1]++' FIELDWIDTHS=40

awk '!a[$1]++' FPAT='.{40}'

打印唯一的行，比较不超过N个字符

问题描述

2 个解决方案

解决方案1
11 已采纳 2013-05-01 21:03:02

解决方案2
0 2013-05-02 08:44:45

打印唯一的行，比较不超过N个字符

问题描述

2 个解决方案

解决方案1 11 已采纳 2013-05-01 21:03:02

解决方案2 0 2013-05-02 08:44:45

解决方案1
11 已采纳 2013-05-01 21:03:02

解决方案2
0 2013-05-02 08:44:45