[英]Perl: Sort hash to match order of array of cutsites
你好,我在以下代码上遇到麻烦,在与程序相同的文件夹中名为data的文件夹中有一个fasta文件(文本)。
这个文件就是这样
gi | 199581428 | gb | EU710615.1 | 柱状黄杆菌FclIS(fclIS)基因,全cds ATGGTAGAAACTCAATTTAAAAATACGGATATCGGGTTGATTCCTGAGGATTGGGAAGTGAAACAATTAG GAGAAGTTATAACATTAATTAATGGTCGAGCATATTCTCAGAATGAATTATTATTTAATGGAAAATACAG GGTTTTGAGAGTTGGAAATTTCTTTTCAAGTGACAAATGGTATTGGAGTAATTTAGAATTAGCAAGTAAA TTCTATGTAAATAAAGGGGATTTAATGTATGCTTGGTCTGCATCCTTTGGTCCAAAATTTTGGAAAAACG AAAAGACAATTTACCATTATCATATTTGGAAAATCGAATTATCTGAATATTTAGATAAGTTTTATCTTTT TTATGTACTTGAAAAGGATAAAGAGAATATTTTAAATCAATCACAAGGTGGAACAATGTTTCATATAACT AAGGAATCAATGGAAAAAAGAAAAATTCCAATTCCTTCTTTAAAAGAACAACAAGCCATCGCCGAAGTAT TATCTGATACCGATGCTTGGATAGAAAGCCTTGAAAAACTTATTACTAAAAAACGCTTGGTAAAACAAGG AGCAATGCAACAACTTTTAACACCCAAAGAGGATTGGGAGGTAAAGAAGTTGGGAGAAATAGCTGAAGTT AGAGATGGTACTCACCAAACTCCAACTTACGTAGAAAGTGGAATACCATTTTATAGTGTAGAAAGTGTCA CTAAAAACGATTTCAAGAATACAAAATACATTTCAGAACAAGAACATAAAATTCTTACAAAGTCCTTTAG AATTGAAAAAGGCGATATATTAATGACACGAATTGGGTCAATTGGAGATTGTAAATTAATTGATTGGGAT GTCAATGCAAGTTTTTATGTAAGCTTAGCATTATTAAAAGTAAAACCCATATTTTCAGCTAATTATTTAT GTCACTATTCAAAAACAGA AAATTTCAAAAAAGAGATAGATATAAATTCATTACAATCTGCAATTCCTAA AAAAATAAATCTAGGTCCTATTTCTAACGTAAAAATAGAATTGACGTCACTTGATGAACAACAACGTATC GCAACCATTCTGTCCGATATGGATGCCGAAATAGAACATTTAGAAAAGAAAACACAACCAGCCAAGCAACCATCACTCTCCATCGTCACG
我在用户STDIN为“ AluI”时无法对输出进行排序,我想对哈希进行排序,以便通过将输出中的'/'站点从最早到最新进行排序来对输出的哈希进行排序。
my $DIRNAME = 'data';
my %RE =( 'AatII' => 'GACGTC 5', 'AccI' => 'GTMKAC 2', 'AclI' => 'AACGTT 2', 'AcyI' => 'GRCGYC 2', 'AflII' => 'CTTAAG 1', 'AflIII' => 'ACRYGT 1', 'AgeI' => 'ACCGGT 1', 'AhaIII' => 'TTTAAA 3', 'AhdI' => 'GACNNNNNGTC 6', 'AluI' => 'AGCT 2', );
my $input='';
my @enz = keys %RE;
opendir(my $dh, $DIRNAME) || die 'Error opening directory: ' . $DIRNAME;
# Get the list of files
my @filenames = readdir($dh);
my $file_count = @filenames;
# Filter out files without .fa extension
@filenames = grep { /\.fa$/ } @filenames;
# Sort the filenames
@filenames = sort @filenames;
foreach my $filename (@filenames) {
my $fh;
my $path = sprintf('%s/%s', $DIRNAME, $filename);
if (!open($fh, $path)) {print 'Failed to open %s; skipping', $path;
next;
}
my $header = <$fh>;
chomp $header;
if ($header =~ />gi\|(\d+)\|gb\|(\w+)\.(\d+)\|(.+)$/) {
print '*' x 80;
print "\n";
print "filename: $filename";
print "\n";
print "gi number: $1";
print "\n";
print "accession number: $2";
print "\n";
print "version: $3";
print "\n";
print "description: $4";
print "\n";
while ($input ne 'quit') {
print "RE:";
$input = <STDIN>;
chomp $input;
foreach (@enz) {
if ($input =~ /$_/) {
#print "$_\n";
my $val = $RE{$_};
my $real = substr($val, 0, -2);
my $cut = chop $val;
my $length = length ($real);
my %ha;
my @fragaft = ();
my @fragbef = ();
my @endar = ();
while (my @line = <$fh>) {
my $join = join "", @line;
foreach ($join) {
while ($_ =~ /($real)/g) {
my $length = length($&);
#print "$length\n";
my $pos = length($`);
#print "$pos\n";
my $end = $pos + $length;
push @endar, $end;
my $endlen = $end - ($length-$cut);
#print "$endlen\n";
#print "$join\n";
my @fragbef = substr ($join, 0, $endlen);
my @fragaft = substr ($join, $endlen);
@ha{@fragbef} = @fragaft;
}
print "\n\n\t\t\tCUT POSITIONS: $_\n" foreach @endar;
foreach(keys %ha) { print "\n$_ / $ha{$_}\n"; }
}
}
}
}
}
}
}
输出在此处给出: http : //i.stack.imgur.com/17LLb.jpg (我没有足够的声誉在此处发布此图片)在这种情况下,我希望ASDASDA / ASDASDSD的顺序与“ CUTSITE”位于顶部,甚至更好。 要这样显示:Cutsite:1 => A / AAAA Cutsite:3 => AAA / AAAA Cutsite:7 => AAAAAA / A,但带有换行符。
任何帮助,将不胜感激。
您是否正在寻找
foreach ( sort keys %ha ) {
? – tjd是这样-Khuram Bhatti
不仅是这样 ,还可以完全满足您的要求
我希望ASDASDA / ASDASDSD的订单与顶部的“ CUTSITE”的订单相匹配...
现在为
甚至更好。 要这样显示:Cutsite:1 => A / AAAA Cutsite:3 => AAA / AAAA Cutsite:7 => AAAAAA / A,但带有换行符。
更改
my @fragbef = substr ($join, 0, $endlen);
my @fragaft = substr ($join, $endlen);
至
push @fragbef, substr($join, 0, $endlen);
push @fragaft, substr($join, $endlen);
和
print "\n\n\t\t\tCUT POSITIONS: $_\n" foreach @endar;
foreach(keys %ha) { print "\n$_ / $ha{$_}\n"; }
至
for (0..$#endar)
{ print "Cutsite: $endar[$_] =>\n$fragbef[$_]/$fragaft[$_]\n" }
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.