[英]AWK / Linux script calculations from a text file
我編寫了一個AWK腳本,該腳本讀取文件並將行乘以列並將它們相加。 我想一次將幾個文件(任意數量的文件)作為awk腳本中的參數傳遞,例如A.txt,B.txt,C.txt。 我希望AWK腳本能給我行和列的總和。 我一直想跳過每個文本文件的前5列。
每個文本文件可以具有任意數量的列。 一個文件夾中可以有幾個文本文件。
我想運行為:
awk -f foo.awk A.txt B.txt C.txt
例如
如果有3個不同的文件A.txt,B.txt,C.txt,則對每個3個文件的行和列的乘積求和。
輸出應為:
No of columns in A.txt: count of columns in A.txt with first 5 columns ignored
No of columns in B.txt: count of columns in B.txt with first 5 columns ignored
No of columns in C.txt: count of columns in C.txt with first 5 columns ignored
Sum of A.txt: rows in A.txt*columns in A.txt
Sum of B.txt: rows in B.txt*columns in B.txt
Sum of C.txt: rows in C.txt*columns in C.txt
Total Sum: A+B+C
以下是到目前為止(對於偽代碼而言)我對foo.awk的了解(不適用於多個文件):
#!/bin/gawk -f
BEGIN { rows=0; columns=0 }
{
FS="\t";
if(/^#COLS/) {
column=NF-5; #skip first 5 columns
columns+=column
}
if (!/^#/){
rows++;
files[FILENAME]++;
}
}
END {
for (fname in files) {
printf ("%'24d rows in %s\n",files[fname],fname);
}
printf("No of columns in A.txt= %'d\n", columnsA);
printf("No of columns in B.txt= %'d\n", columnsB);
printf("No of columns in C.txt= %'d\n", columnsC);
sum=columns*rows; # multiply no of rows by column in each file and add them up
printf( "Sum of A.txt %d\n", sumA);
printf( "Sum of B.txt %d\n", sumB);
printf( "Sum of C.txt %d\n", sumC);
printf( "Total sum is %d\n", sum_of_A+B+C);
}
例如
A.txt:
#ignore this line -- pattern does not match
#ignore this line -- pattern does not match
#COLS A B C D E F G H I
row1 1 2 3 4 5 6 7 8 9
row2 1 3 3 4 5 6 7 8 9
row3 1 3 3 4 5 6 7 8 9
B.txt:
#ignore this line -- pattern does not match
#ignore this line -- pattern does not match
#COLS A B C D E F G H
row1 1 2 3 4 5 6 7 8
row2 5 3 3 4 6 6 7 8
row3 8 3 3 4 5 6 7 8
C.txt:
#ignore this line -- pattern does not match
#ignore this line -- pattern does not match
#COLS A B C D E F G H I J
row1 1 2 3 3 5 6 7 8 9 2
row2 7 3 3 4 5 6 7 8 9 7
row3 9 3 3 4 5 6 7 8 9 6
row4 9 3 3 4 5 6 7 8 9 6
output:
No of columns in A.txt: 5
No of columns in B.txt: 4
No of columns in C.txt: 6
Sum of A.txt: 3*5=15
Sum of B.txt: 3*4=12
Sum of C.txt: 4*6=24
Total Sum: 12+9+20 = 51
謝謝。
用普通的awk
你可以做到這一點
$ awk '!/^#/{cols[FILENAME]=NF-5;
rows[FILENAME]++}
END{for(f in cols) print "No of columns in " f, cols[f];
for(f in cols)
{r=rows[f];
c=cols[f];
sum+=r*c;
sumstr=sumstr?sumstr"+"r*c:r*c;
print "Sum of "f ":",r "x" c "=" r*c}
print "Total Sum: ", sumstr, "=", sum}' {A,B,C}.txt
No of columns in C.txt 6
No of columns in B.txt 4
No of columns in A.txt 5
Sum of C.txt: 4x6=24
Sum of B.txt: 3x4=12
Sum of A.txt: 3x5=15
Total Sum: 24+12+15 = 51
跳過5或6時,列數不匹配。另外請注意,條目的順序不保留,可以用gawk
排序或下面的少量額外代碼來固定...
$ awk 'FNR==1{order[++k]=FILENAME}
!/^#/{cols[FILENAME]=NF-5; rows[FILENAME]++}
END{for(i=1;i<=k;i++) print "No of columns in " order[i], cols[order[i]];
for(i=1;i<=k;i++) {f=order[i];r=rows[f];c=cols[f];sum+=r*c; sumstr=sumstr?sumstr"+"r*c:r*c; print "Sum of "f ":",r "x" c "=" r*c}
print "Total Sum: ", sumstr, "=", sum}' {A,B,C}.txt
No of columns in A.txt 5
No of columns in B.txt 4
No of columns in C.txt 6
Sum of A.txt: 3x5=15
Sum of B.txt: 3x4=12
Sum of C.txt: 4x6=24
Total Sum: 15+12+24 = 51
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.