从多个文本文件中提取特定行

Question

我想从文件夹中的多个文本文件中打印某些行，具体取决于文件名。 考虑以下由下划线分隔的3个单词命名的以下文本文件：

Small_Apple_Red.txt
Small_Orange_Yellow.txt
Large_Apple_Green.txt
Large_Orange_Green.txt

如何实现以下目标？

if (first word of file name is "Small") {
   // Print row 3 column 2 of the file (space delimited);
}

if (second word of file name is "Orange") {
   // print row 1 column 4 of the file;
}

这有可能与awk？

Answer 1

尝试如下。

使用glob来处理文件夹中的文件。

然后使用正则表达式检查文件名。 grep用于从文件中提取特定内容。

my $path = "folderpath";
while (my $file = glob("$path/*"))
{
    if($file =~/\/Small_Apple/)
    {
        open my $fh, "<", "$file";
        print grep{/content what you want/ } <$fh>;
    }

}

Answer 2

use strict;
use warnings;

my @file_names = ("Small_Apple_Red.txt",
                  "Small_Orange_Yellow.txt",
                  "Large_Apple_Green.txt",
                  "Large_Orange_Green.txt");

foreach my $file ( @file_names) {
    if ( $file =~ /^Small/){ // "^" marks the begining of the string
         print "\n $file has the first word small";
    }
    elsif ( $file =~ /.*?_Orange/){  // .*? is non-greedy, this means that it matches anything<br>
                                  //  until the first "_" is found
        print "\n $file has the second word orange";
    }
}

还有一个特殊情况，你的文件有“Small_Orange”你必须决定哪个更重要。 如果第二个字是更重要的，然后切换内容if部分从内容elsif节

Answer 3

在Awk中：

awk 'FILENAME ~ /^Large/ {print $1,$4}
     FILENAME ~ /^Small/ {print $3,$2}' *

在Perl中：

perl -naE 'say "$F[0] $F[3]" if $ARGV =~ /^Large/;
           say "$F[2] $F[1]" if $ARGV =~ /^Small/ ' *

Answer 4

试试这个：

use strict;
use warnings;
use Cwd;
use File::Basename;

my $dir = getcwd(); #or shift the input values from the user 
my @txtfiles = glob("$dir/*.txt");

foreach my $each_txt_file (@txtfiles)
{
    open(DATA, $each_txt_file) || die "reason: $!";
    my @allLines = <DATA>;
    (my $removeExt = $each_txt_file)=~s/\.txt$//g;
    my($word1, $word2, $word3) = split/\_/, basename $removeExt; #Select the file name with matching case
    if($word1=~m/small/i) #Select your match case
    {
        my @split_space = "";
        my @allrows = split /\n/, $allLines[1]; #Mentioned the row number
        my @allcolns = split /\s/, $allrows[0]; 
        print "\n", $allcolns[1]; #Mentioned the column number
    }
}

从多个文本文件中提取特定行

问题描述

4 个解决方案

解决方案1
0 2016-07-19 05:48:39

解决方案2
0 2016-07-19 07:53:16

解决方案3
0 2016-07-19 08:16:14

解决方案4
0 2016-07-19 08:43:10

从多个文本文件中提取特定行

问题描述

4 个解决方案

解决方案1 0 2016-07-19 05:48:39

解决方案2 0 2016-07-19 07:53:16

解决方案3 0 2016-07-19 08:16:14

解决方案4 0 2016-07-19 08:43:10

解决方案1
0 2016-07-19 05:48:39

解决方案2
0 2016-07-19 07:53:16

解决方案3
0 2016-07-19 08:16:14

解决方案4
0 2016-07-19 08:43:10