繁体   English   中英

根据每行的列中的最低数字过滤文件

[英]filtering file according to the lowest number in a column of each line

我有以下文件:

chr01_pilon3.g13.t1 trnscript:OIT01734  transcript:OIT01734 1.1e-107    389.8   1000    218 992 1   216 130 345 MDALTRHIQGDVPWCMLFADDIILIDETRAGVSERLEIWRQTLESKGFKISRSKTEYLECKFGDEPSGVGREVMLGSQAIAKRDSVRYLGSVIQGDGEIDGDVTHRIGAGWSKWRLASGVLCDKKIPHKLKGKFFRAMVRPAMFYEAECWPVKNSHIQRMKVAEMRMLRWMCGHTRLDKIKNEVIRQKVGVAPVDKKMGEARLRWFGHVRRRGPDA    MDALTRHIQGDVPWCMLFADDIVLIDETRVGVNERLEVWRQTLESKGFKLSRSKTEYLECKFSAESSEVGRDVKLGSQVIAKRDSFRYLGSVIQGEGEIDGDVTHRIGAGWSKWRLASGVLCDKKVPQKLKGKFYRAVVRPAMLYGAECWPVKNSHVQRMKVAEMRMLRWMRGLTRLDRIRNEVIREKVGVALVDEKMREARLRWYGHVRRRRPDA    MDALTRHIQGDVPWCMLFADDIILIDETRAGVSERLEIWRQTLESKGFKISRSKTEYLECKFGDEPSGVGREVMLGSQAIAKRDSVRYLGSVIQGDGEIDGDVTHRIGAGWSKWRLASGVLCDKKIPHKLKGKFFRAMVRPAMFYEAECWPVKNSHIQRMKVAEMRMLRWMCGHTRLDKIKNEVIRQKVGVAPVDKKMGEARLRWFGHVRRRGPDAR*  MKVWERVVEARVREMTSISVNQFGFMPGRSTTEAIHLVRRLVEHFRDKKKDLHMVFIDLENAYDKVPREVLWRCLEAKSVPEAYIRVIKDMYDGAKTRVRTVGGDSDHFPVVMGLHQGSALSPLLFALVMDALTRHIQGDVPWCMLFADDIVLIDETRVGVNERLEVWRQTLESKGFKLSRSKTEYLECKFSAESSEVGRDVKLGSQVIAKRDSFRYLGSVIQGEGEIDGDVTHRIGAGWSKWRLASGVLCDKKVPQKLKGKFYRAVVRPAMLYGAECWPVKNSHVQRMKVAEMRMLRWMRGLTRLDRIRNEVIREKVGVALVDEKMREARLRWYGHVRRRRPDAPVRIYKSAILGHLNSHGSQNALAGPVEAEENRQKTKKEVMEEIIQKSKFFKAQKAKDREENDELTEQLDKDFTSLVESKALLSLTQPDKINALKALVNKNISVGNVKKDEVADVPRKASIGKEKPDTYEMLVSEMALDMRARPSDRTKTPEEIAQEEKERLELLEQEXXXXXXXXXXXXXXDGNASDDNSKLVKDPRTVSGDDLGDDLEEVPRTKLGWIGEILRRKENELESEDAASSGDSDDGEDEGXXXXXXXXXXXXXXXXXXXXDEEQGKTQTIKDWEQSDDDIIDTELEDDDEGFGDDAKKVVKIKDHKEENLSITVAAENKKKMQVFYGVLLQYFAVLANKKPLNSKLLNLLVKPLMEMSAVSPYFAAICARQRLQRTRAQFCEDLKNTGKSSWPSLKTIFLLRLWSMIFPCSDFRHCVMTPAILLMCEYLMRCTIISGRDIAIASFLCSLLLSVIKQSQKFCPEAIVFIQTLLMAALDRKQRSNSQLDNLMEIKELGPLLCIRSSKVEMDSLDFLTLMDLPEDSQYFHSDNYRTSMLVTVLETLQGFVNVYKELISFPEIFMLISKLLCKMAGENHIPDALREKIKDVSQLIDTKAQEHHMLRQPLKMRKKKPVPIRMLNPKFEENFVKGRDYDPDRERA    389.8   1000    216 85.6    185 31  200 0   0   92.6    0   22IV6AV2SN4IV11IL12GSDA1PS1GE3ED1MK4AV6VF9DE29IV1HQ6FY2MV5FL1EG10IV14CR1HL4KR1KR5QE5PL2KE2GR6FY6GR3 85.6    1.1e-107    99.1
gene.92134.0.0.p1   NisylASAF01033898g0006.1    NisylASAF01033898g0006.1    2.6e-302    1037.7  2682    571 548 2   570 4   548 SRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHPRRRSLGDNDADTDEIEDSQSHVPARSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHALKFADPILGMGEKLVQRMRMRSSRFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTLLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDVFVTNNHGNMAKILAGRRRFFGHKPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATGQISHSTSRLETLSKVTSNDYDIDISENQELDMLLSD   SRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHLPRRRSSGDNDADNEIEDSQSHVPAGSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHSLKFADPILRMGEKLVQRMRMRSSHFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTPLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATVEISHSTSRLETSPKVTSNDYDTDISENPELDMLLSD   QSRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHPRRRSLGDNDADTDEIEDSQSHVPARSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHALKFADPILGMGEKLVQRMRMRSSRFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTLLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDVFVTNNHGNMAKILAGRRRFFGHKPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATGQISHSTSRLETLSKVTSNDYDIDISENQELDMLLSD* MTLSRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHLPRRRSSGDNDADNEIEDSQSHVPAGSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHSLKFADPILRMGEKLVQRMRMRSSHFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTPLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATVEISHSTSRLETSPKVTSNDYDTDISENPELDMLLSD    1037.7  2682    570 93.2    531 13  533 3   26  93.5    0   82-L5LS6TND-11RG175AS8GR14RH91LP42V-F-V-T-N-N-H-G-N-M-A-K-I-L-A-G-R-R-R-F-F-G-H-K-66GVQE10LSSP8IT5QP8   93.2    2.6e-302    99.6
gene.96656.0.5.p2   NisylKD954897g0030.1    NisylKD954897g0030.1    7.7e-75 280.0   715 140 968 1   139 371 509 MRCKYLTLNTPMKKWELHGIAILLQSCPWVEMLHINTESAFEVYHFGLHYKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYMLSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSS MRCKYLTLNTPMKKWELPGIAILLQSCPQVEMLHINTESAFEEYHFGSHFKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYILSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSN MRCKYLTLNTPMKKWELHGIAILLQSCPWVEMLHINTESAFEVYHFGLHYKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYMLSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSS*    MSESEGEHEENLDYDSPRYSPYSXXXXXXXXXXXXXXXXXXSDQSYYGGKCHKTEKTDRISALPDSLILHILSSLDMGEVVRTGVLSKRWHLLWTSQQSLIFSYSGQHVNGIYKFVIFIDNTLLLCRSGMVKKFSVDFIYSKRFVRHVNRWMIFIKNKLVEELDLNLRSRGNLIEIYNLPQIMYFDVRLRHLSLCNCNLVPKEEIYWPALRDLEIGYAELNRDVIKKICSGCRALESLKFRSCYGVDYFDIDSKSVKKLVIHEYGRQNHDDADDDDDELGIYARNVTSLEICGYFHKRILVLEDVKALLDAKLDFYRNTDDYEIEREFRTDQNMLKNLLVSLQHVEKLSIGTWCLQVLTSLEIRNLPCPRMRCKYLTLNTPMKKWELPGIAILLQSCPQVEMLHINTESAFEEYHFGSHFKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYILSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSNRFSSLPDSVLLHILSFLPFDDVVRTTLLCKQWRPLWSFSTSLNFIHRPKDFISLKKFASFVDKSLINLHCNNSSISKLHLDFPFKRCFSSDVTVWVLFAITHKVKELNLILSSDAEDLYKLPKRLFSNPFIEKVNWVGCKFDKVEVFRWDSLRELRIGSIEFCDDMVRKVVFGSPCLELLELDNCWGFKRLDLVGGKVSKLVVNGYNGEAVKKNSMLLDFEVVEIEAPCVKVLELKGCFRRMNNIQLKNVMSCVSVKLDFQFTKDEERVNYVDMLMGMIGSLRHVKDVMLGTWCIEVMSSWPMNILPFSMSSYECLTLHTPIQERYLPGIVRILQSSSNLRTLIIHMAPPYFEFEACFIPIVYDVYSVGGRCQLSMLSKNCGLHLKKIRICCFEGMRSGQEVLFLRDLLLVCANLEEMVIEWRSGHQNSSIRDASDEFVAESLLMVQKRSRNAVILFNN    280.0   715 139 95.0    132 7   135 0   0   97.1    0   17HP10WQ13VE4LS1YF38MI49SN  95.0    7.7e-75 99.3
gene.90968.0.2.p2   transcript:OIT02339 transcript:OIT02339 1.3e-209    729.2   1881    391 1270    1   388 881 1268    MNEEASQVANDAIGSIRTVASFCAEEKVMEMYQKKSEAPLKRGVKKGLVSGVGLGFSNFVLFCLYALAFYLGAVLVRHDKAKFSEVFKVFFALTMASIGLSFLSNLPSDLSKGKGAAASIFEILDSKPRIDSSSNEGITLDAIEGNIELQHISFRYPTRPDMQIFRDLSLSIPAGKTVALVGESGSGKSTVISLLERFYDPEQGNIYLDGVEIRKFNLRWLRQQMGLVGQEPILFNETISSNIAYGREGEVTEEEIISVAKSSNAHNFISSLPNGYKTTVGERGVHLSGGQKQRIAIARAILKDPKILLLDEATSALDTESERIVQEALDRVMVNRTTVVVAHRLTTVKNADVIAVVKNGVVAEKGTHDMLMNNPQGVYASLVALQTG    MNEEASQVANDAIGSIRTVASFCAEEKVMEMYQKKSEAPLKRGVKNGLVSGVGLGFSNFVLFCLYALAFYLGAVLVKHDKAKFSEVFKVFFALTMASIGLSVLSNLPSDLSKGKGAAASIIEILDSKPRIDSSSNEGITLDAIEGNIELQHISFRYPTRPDMQIFRDLSLSIPAGKTVALVGESGSGKSTVISLLERFYDPEQGNIYLDGVEIRKFNLRWLRQQMGLVGQEPILFNETISSNIAYGREGEVTEEEIISVAKSSNAHNFISSLPNGYKTTVGERGVQLSGGQKQRIAIARAILKDPKILLLDEATSALDTESERIVQEALDRVMVNRTTVVVAHRLTTVKNADVIAVVKNGVVAEKGTHDMLMNNPQGVYASLVALQTG    MNEEASQVANDAIGSIRTVASFCAEEKVMEMYQKKSEAPLKRGVKKGLVSGVGLGFSNFVLFCLYALAFYLGAVLVRHDKAKFSEVFKVFFALTMASIGLSFLSNLPSDLSKGKGAAASIFEILDSKPRIDSSSNEGITLDAIEGNIELQHISFRYPTRPDMQIFRDLSLSIPAGKTVALVGESGSGKSTVISLLERFYDPEQGNIYLDGVEIRKFNLRWLRQQMGLVGQEPILFNETISSNIAYGREGEVTEEEIISVAKSSNAHNFISSLPNGYKTTVGERGVHLSGGQKQRIAIARAILKDPKILLLDEATSALDTESERIVQEALDRVMVNRTTVVVAHRLTTVKNADVIAVVKNGVVAEKGTHDMLMNNPQGVYASLVALQTGAT* MAEGGEPSSARRKEEDDQKVPFYMLFAFADRTDVILMLFGTLGAIASGISKPLMSLIFGDLVNSYGKSNQSNILDQVSGISLKFVYLAIGSGIASVFQIACWVVTGERQATRIKCLYLKTILRQDIGFFDTQSATGEFIERMSGDTILVQEAMGDKVANFIMNISTFIGGFVVAFIKGWLLTLVLLTSIPATAISFGCVALVLSKMSGSGQVAYADAGKVVEQTVGGIRTVASFTGEKLAIEDYNSKLESAYSATIKQALASGLGLGTILTLIFFSYGLAIWYGAKLIIEKDYKGGDIISVIFAVMLGGSSLGQASPSLNAFSAGQAAAYKIFETIKRTPKIDPYDPSGIELEDIKGEIELKDVYFKYPARPDVQIFSGFSLYIPSGKTAALVGQSGSGKSTVISLLERFYDPEAGEILIDGVEIKKFQLKWLRQQMGLVSQEPVLFATTIRENIIYGKENASEEEIRNAIQLANAAKFIDKLPKGLDTMVGGHGTQISGGQKQRIAIARAILKDPRILLLDEATSALDVESERIVQDALSNIMVNRTTVVVAHRLTTIRNADLIAVVHLGKLVEQGTHDELIKDPEGAYSQLVQMQQKTKHVENTKGKEIEELNAPKRLSYSKNVSGRSRRFSLSGRKSASKGSSSKFSFAYDLGVSGVVDFHESIRREDGAGTSEYIADTKKKVSTQKLMSLAYLNKPELPIMLVGTVAAAINGMVFPVFGLLVSTIIKIFYESHHELRKDSRFWALMFVVIGIVVMIVSPLQNYAFGVAGAKLIQRIRSMTFSKLVYQEISWFDDPANSCGAIGARLSSDASTIRNMVGDALATLVQNISTIVTGLVIALIANWILALITIAIMPLLALQGYIQIKLLQESNNEAKLMNEEASQVANDAIGSIRTVASFCAEEKVMEMYQKKSEAPLKRGVKNGLVSGVGLGFSNFVLFCLYALAFYLGAVLVKHDKAKFSEVFKVFFALTMASIGLSVLSNLPSDLSKGKGAAASIIEILDSKPRIDSSSNEGITLDAIEGNIELQHISFRYPTRPDMQIFRDLSLSIPAGKTVALVGESGSGKSTVISLLERFYDPEQGNIYLDGVEIRKFNLRWLRQQMGLVGQEPILFNETISSNIAYGREGEVTEEEIISVAKSSNAHNFISSLPNGYKTTVGERGVQLSGGQKQRIAIARAILKDPKILLLDEATSALDTESERIVQEALDRVMVNRTTVVVAHRLTTVKNADVIAVVKNGVVAEKGTHDMLMNNPQGVYASLVALQTGTA  729.2   1881    388 98.7    383 5   384 0   0   99.0    0   45KN30RK24FV18FI164HQ102    98.7    1.3e-209    99.2
gene.69001.1.0.p1   NisylKD955766g0010.1    NisylKD955766g0010.1    1.8e-206    718.8   1854    393 530 1   384 1   384 MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHK    MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHK    MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKVSETVVLV*   MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  718.8   1854    384 95.6    367 17  374 0   0   97.4    0   21HR2LP9VA7GD29HP5EDSR4SA20MV25ED1FY40IL74HD62ED11MK10HR40TM7   95.6    1.8e-206    97.7
gene.35466.0.0.p2   NiotoAWOL01S0001629g0004.1  NiotoAWOL01S0001629g0004.1  1.0e-59 229.6   584 118 889 1   118 669 786 QKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETDEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGR  QKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETEEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGR  QKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETDEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGR  MEIDGRERSVEMRDHDDSPVKERWEDGHYDLEESGHDKSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGRDAVDKEKGXXXXXXXXXXADEXXXXXXXXXXGNRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKQEIVSYEDDDRARNNAVETAGSQSSASKLEERILKMKEERLKKKSEGASEVMAWVSKSRKIEEKRTAEKERALQLSKIFEEQDKINDEESDDEEKARLAAKELGGMKVLHGLDKVVEGGAVVLTLKDQSILAGDDINQEVDVLENVEIGEQKKRDDAYKAAKKKTGIYDDKFNDDPGFERKILPQYDDPAEEEGVTLDATGGFSVDAEKKLEELRKRIQGSSSKTLAEDLNSSGKLLSDYYTQEEMLQFKKPKKKKSLRKKEKMDLDALEVEAKSSGLGVGDLGSRNDKTRQALREEMERAEAETKSKSYQAAYAKAEEASKALRPEKTNXXXXXXXXXXXXXXXXELRKSLERARKLALQKQEGLAKTFPESIASLAISRANDSTVDNPSSVSGESQENKVVFTEMEEFVWGLQLDEEEQKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETEEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGRILTPKEAFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNSDTPSLSVERMREAQAQFKTPYLVLSGNVKPGQTSDPRSGFATVEKSLPGGLTPMLGDKK   229.6   584 118 99.2    117 1   118 0   0   100.0   0   36DE81  99.2    1.0e-59 100.0
gene.86248.0.0.p1   Nitab4.5_0000420g0110.1 Nitab4.5_0000420g0110.1 Protein of unknown function DUF538  8.2e-74 276.9   707 175 140 35  174 1   140 MTSQVTENHRENAEVFTNPAICKQKSLELLEQTNMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFVEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE    MSSQVSENHRENAEVFTDPAICKQKSLELLEQINMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFIEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE    THFLYFPFPLSHTEPQTKRNLNPISFPFSFAFTKMTSQVTENHRENAEVFTNPAICKQKSLELLEQTNMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFVEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE* MSSQVSENHRENAEVFTDPAICKQKSLELLEQINMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFIEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE    276.9   707 140 96.4    135 5   139 0   0   99.3    0   1TS3TS11ND14TI52VI54    96.4    8.2e-74 80.0
gene.9403.0.4.p1    transcript:OIT35479 transcript:OIT35479 8.5e-191    667.5   1721    690 406 1   378 1   378 MLSAPRVSPPAVAVAAPARFKFPNVCVNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIIWGGTEDDDSSIPSKEVLSWKPLASTPXXXXXXXXXXXXXDEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHNKHNIADASSRSSFSSYNEPDQLKEQQTLSLPRGRAKIQQLDDKKNFQKLIRVEDEDRGIAIENVSKHFAGYSIDSHAQSARVVHPGSKASASPLRGWGGGSSHYSLKRDEIFRERQNLGDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQV  MLSAPRAPPPAVAVAAPARFKFQNVCGNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIVWGGTEDDDSSIPSKEVLSWKPLASTSPDNNHPPPTQSSSNEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHDKHNTTDASSRSSFSSYNEPGQLKEQQTLSLPRGRAKIQQLEDRKNSQKLIRVEDEDRDIAIENVSKHFAGYSSDSHAHSARVVHPGSKASASPLRGWGGGSSHYSLKREEIFRQRRNLDDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQV  MLSAPRVSPPAVAVAAPARFKFPNVCVNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIIWGGTEDDDSSIPSKEVLSWKPLASTPXXXXXXXXXXXXXDEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHNKHNIADASSRSSFSSYNEPDQLKEQQTLSLPRGRAKIQQLDDKKNFQKLIRVEDEDRGIAIENVSKHFAGYSIDSHAQSARVVHPGSKASASPLRGWGGGSSHYSLKRDEIFRERQNLGDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQVLSTCRSFSKSGVPFHSMVVTGGFCQRTQLENLRQELDILIATPGRFMFLIKEGYLQLTNLKCAVLDEVDILFSDEDFETAFQCLINSSPITTQYLFVTATLPMDIYNKLVESFPDCELVSGPGMHRTSPGLEEFLVDCSGDETAEKSPDTAFINKKNALLHLVEDSPVPKTIVFCNKIDSCRKVENALKRFDRKGFSIKILPFHAALDQRRRLANMEEFRRSKMENVSLFLVCTDRASRGIDFEGVDHVVLFDYPRDPSEYVRRVGRTARGAGGKGKAFIFAVGKQVSLARRIMERNKKGHPVHDVPSILT*  MLSAPRAPPPAVAVAAPARFKFQNVCGNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIVWGGTEDDDSSIPSKEVLSWKPLASTSPDNNHPPPTQSSSNEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHDKHNTTDASSRSSFSSYNEPGQLKEQQTLSLPRGRAKIQQLEDRKNSQKLIRVEDEDRDIAIENVSKHFAGYSSDSHAHSARVVHPGSKASASPLRGWGGGSSHYSLKREEIFRQRRNLDDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQVCQISSSIKGTFATYSPYCSATTHTKRKK  667.5   1721    378 91.0    344 34  352 0   0   93.1    0   6VASP14PQ3VG50IV25PSXPXDXNXNXHXPXPXPXTXQXSXSXSDN38ND3ITAT14DG20DE1KR2FS11GD14IS4QH30DE4EQ1QR2GD102  91.0    8.5e-191    54.8
gene.69001.1.0.p2   NisylKD955766g0010.1    NisylKD955766g0010.1    1.8e-61 235.3   599 117 530 1   116 415 530 MSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRAIQKGSKDNITVIVVDLKAQRKFKSKT    MSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT    MSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRAIQKGSKDNITVIVVDLKAQRKFKSKT*   MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  235.3   599 116 99.1    115 1   115 0   0   99.1    0   90IT25  99.1    1.8e-61 99.1
gene.91393.0.0.p1   Solyc12g056340.2.1  Solyc12g056340.2.1 RNA helicase DEAD38  1.8e-223    775.4   2001    437 806 24  437 393 806 LPAIETVLKASNSKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHESIGVQTLVGGTRFKEDQKRLESNPCQIIVATPGRLLDHIENKSGFSTRLMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRKRQSLLFSATVPKEVRRVSQLVLKREHGYVDTVGLGLETNPKVKQFYLVAPHEQHFQLVHHLLTSHISEVPDYKVIVFCTTAMMTSLMFSLLREMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQIGIPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPHLDPRAKVKIEEAIGKMDASVKEAAYHAWLGYYNSVREIGRDKTTLVELANQFSESIGLQKPPSLFRRTALKMGLKDIPGIRIRK*  LPAIETVLKASRKKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHEGIGVQTLVGGTRFKEDQKRLECDPCQIIVATPGRLLDHIENKSGFSTRIMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRRRQSLLFSATVPKEVRRISQLVLKREYDYVDTVGLGLETNPKVKQFYLVAPHEQHFQVVHHLLSSHISEVPDYKVIVFCTTAMMTSLMFSLFHEMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQVGLPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPRLDPRVKVKMEEAMEKMDTSVKESAYHAWLGYYNSVREVGRDKTTLVELANQFSESIGLDKPPSLFRRTALKMGLKDIPGIRIRK*  MLFPADYLHVSPVLFIAAIKVQQLPAIETVLKASNSKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHESIGVQTLVGGTRFKEDQKRLESNPCQIIVATPGRLLDHIENKSGFSTRLMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRKRQSLLFSATVPKEVRRVSQLVLKREHGYVDTVGLGLETNPKVKQFYLVAPHEQHFQLVHHLLTSHISEVPDYKVIVFCTTAMMTSLMFSLLREMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQIGIPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPHLDPRAKVKIEEAIGKMDASVKEAAYHAWLGYYNSVREIGRDKTTLVELANQFSESIGLQKPPSLFRRTALKMGLKDIPGIRIRK*   MGGGPRTFPGGLNKWQWKRLHEKKARDKENRLLDQEKQLYQARIRSQIRAKLTSSGEQSDFSNEQQPNYSPVSPQDHIRGLADRFMKEGAEDLWNEDDGPVNTPQINQQSGGISESIDLRKLRDTKFNDVPRSYSFQKARNFCTNISDVFAENCRTRNPTFSDSWSRQNKFLMFGWRLVNIENRNVNNLNGFLNYRCYSVDRMNGNKLRKLDFTRNESSQSEDKLRSVGLVVKGERKAKWPRFRPKPEESXXXXXXXXXXXXXXXXXXRSRGSVKMMSSAALGKYDMKTKKRVPLKFVEDEDDLSLHVAAIRKEVKGRSMQKIETEEDEKETILSSKRFDEYDVSPLTVKALTAAGYVQMTKVQEATLSTCLEGKDALVKARTGTGKSAAFLLPAIETVLKASRKKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHEGIGVQTLVGGTRFKEDQKRLECDPCQIIVATPGRLLDHIENKSGFSTRIMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRRRQSLLFSATVPKEVRRISQLVLKREYDYVDTVGLGLETNPKVKQFYLVAPHEQHFQVVHHLLSSHISEVPDYKVIVFCTTAMMTSLMFSLFHEMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQVGLPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPRLDPRVKVKMEEAMEKMDTSVKESAYHAWLGYYNSVREVGRDKTTLVELANQFSESIGLDKPPSLFRRTALKMGLKDIPGIRIRK*  775.4   2001    414 94.0    389 25  402 0   0   97.1    0   11NRSK36SG20SCND25LI32KR16VI8HYGD29LV5TS27LFRH53IV1IL49HR4AV3IM3IMGE3AT4AS14IV20QD26    94.0    1.8e-223    94.7
gene.69001.1.3.p1   NisylKD955766g0010.1    NisylKD955766g0010.1    1.4e-228    792.3   2045    434 530 1   420 1   420 MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIG    MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIG    MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGMFLLPTLLSSICK*  MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  792.3   2045    420 96.0    403 17  410 0   0   97.6    0   21HR2LP9VA7GD29HP5EDSR4SA20MV25ED1FY40IL74HD62ED11MK10HR40TM43  96.0    1.4e-228    96.8
gene.18823.1.1.p2   transcript:OIT25066 transcript:OIT25066 1.0e-56 219.5   558 115 185 1   113 72  184 MLGQMVGRIRAKPGGKLEMGEASMVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQVQRILQFVSLPPEDTSKKRS   MLGQMVGRIQAKPGGKLEMGEASVVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQIQRILQFVSLPPEDTSKKRS   MLGQMVGRIRAKPGGKLEMGEASMVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQVQRILQFVSLPPEDTSKKRSI* MGQAFRRATGRIGSSNVDAASSQLKKPIDRTPPPVPAAIKTPSDNVAPVAGSSPKDAVGETLEERDPKFDAMLGQMVGRIQAKPGGKLEMGEASVVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQIQRILQFVSLPPEDTSKKRST   219.5   558 113 97.3    110 3   113 0   0   100.0   0   9RQ13MV69VI19   97.3    1.0e-56 98.3
gene.69001.1.2.p1   NisylKD955766g0010.1    NisylKD955766g0010.1    1.8e-206    718.8   1854    393 530 1   384 1   384 MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHK    MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHK    MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKVSETVVLV*   MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  718.8   1854    384 95.6    367 17  374 0   0   97.4    0   21HR2LP9VA7GD29HP5EDSR4SA20MV25ED1FY40IL74HD62ED11MK10HR40TM7   95.6    1.8e-206    97.7
gene.71087.0.0.p1   transcript:OIT01688 transcript:OIT01688 3.8e-101    367.9   943 190 639 1   189 451 639 DRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEALQNQQISPRSDVYCLGIIILEILTGKFPSQYLNNQKGGTDIVQWVQSAIVDNRESELIDQEIANATDSSEQMVKLLHVGAACTVSDPDERIDMKEASRRIEEISLI   DRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEAIQNEQISPKSDVYCLGIIILEILTGKFPSQYLNNQKGGTDVVQWVQSAIAENRESELIDQEIANATDSIEQMVKLLHVGAACTVSDPDKRIDMKEASRRIEEISLI   DRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEALQNQQISPRSDVYCLGIIILEILTGKFPSQYLNNQKGGTDIVQWVQSAIVDNRESELIDQEIANATDSSEQMVKLLHVGAACTVSDPDERIDMKEASRRIEEISLI*  MSSNKHGRCSPLPSSSSNLSQKLLFFYISIFFLIFLSNTPFSFSISEDEALIKFKESLKNTTALDSTWHKGSNPCDKNKKWTRVQCEGNAVEGLLLGEAGLSGEIDVDPLIALPGLRVLELANNSFSGTIPEFFLLGALKSIYIDGXXXXXXXPKDFFSKMXXXXXXXXXXXXXXXXXXESLANLKYLMELHLESXXXXXXXXSFSQASLASIDLSNNKLQGEIPQSMSKFGSDSFKGNNELCGKQLGKECNKEKENNTFQKAPMSKLKWIILGLVVGLLLITILFKAKRKEDHFDKLGKENLDEGLHVSSSNRKSMSIRSEGGDSVHGSSRRGAGSQRGKAMGDLVLVNEEKGTFGLPDLMKAAAEVLGNGVLGSAYKAKMVNGLSVVVKRLREMNKMNRDVFDTEIRKISKLRHRNILQLLAYHYRKEEKLLVSEYVPKGSLLYLLHGDRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEAIQNEQISPKSDVYCLGIIILEILTGKFPSQYLNNQKGGTDVVQWVQSAIAENRESELIDQEIANATDSIEQMVKLLHVGAACTVSDPDKRIDMKEASRRIEEISLI 367.9   943 189 95.8    181 8   187 0   0   98.9    0   83LI2QE4RK31IV8VADE17SI19EK17   95.8    3.8e-101    99.5
gene.69001.1.1.p1   NisylKD955766g0010.1    NisylKD955766g0010.1    1.4e-294    1011.9  2615    531 530 1   530 1   530 MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRAIQKGSKDNITVIVVDLKAQRKFKSKT  MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRAIQKGSKDNITVIVVDLKAQRKFKSKT* MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  1011.9  2615    530 96.6    512 18  519 0   0   97.9    0   21HR2LP9VA7GD29HP5EDSR4SA20MV25ED1FY40IL74HD62ED11MK10HR40TM127IT25 96.6    1.4e-294    99.8

下面的文件有一些类似的 ID

gene.69001.1.0.p1       
gene.69001.1.0.p2       
gene.69001.1.3.p1       
gene.69001.1.2.p1       
gene.69001.1.1.p1

通过仅gene.69001 ,ID 变得相同。 我使用这个 awk 脚本只保留具有最小值的相同 ID 的行(第 30 列)

awk '!(\$1 in min) || \$30<min[\$1] {min[\$1]=\$30; line[\$1]=\$0} END {for(k in line) print line[k]}' ${2}-ide${i}-cov${cov} > ${2}-ide${i}-cov${cov}-best-hit

不幸的是,我不知道如何修改上面的 awk 脚本来过滤上面的文件,只剩下第 30 列中最小数字的行?

更新作为输出,我想获得所有列的以下 ID。

chr01_pilon3.g13.t1
gene.92134.0.0.p1
gene.90968.0.2.p1
gene.96656.0.5.p2
gene.69001.1.1.p1
gene.35466.0.0.p2
gene.86248.0.0.p1
gene.9403.0.4.p1
gene.91393.0.0.p1
gene.18823.1.1.p2
gene.71087.0.0.p1

更新 2如果第 30 列的值相同,有没有办法保留多个副本?

更新 3我在这里找到了新数据,不幸的是以下解决方案都不起作用。

您可以使用此awk来获取截断的第一列值的最小值:

awk '{
   if (/^gene\./) {
      split($1, a, /\./)
      k = a[1] "." a[2]
    }
    else
       k = $1
}
!(k in min) || $30 <= min[k] {
   min[k] = $30
   if(!($1 in rec))
      ord[++n] = $1
   rec[$1] = $0
}
END {
   for (i=1; i<=n; ++i)
      print rec[ord[i]]
}' gene.txt
chr01_pilon3.g13.t1 trnscript:OIT01734  transcript:OIT01734 1.1e-107    389.8   1000    218 992 1   216 130 345 MDALTRHIQGDVPWCMLFADDIILIDETRAGVSERLEIWRQTLESKGFKISRSKTEYLECKFGDEPSGVGREVMLGSQAIAKRDSVRYLGSVIQGDGEIDGDVTHRIGAGWSKWRLASGVLCDKKIPHKLKGKFFRAMVRPAMFYEAECWPVKNSHIQRMKVAEMRMLRWMCGHTRLDKIKNEVIRQKVGVAPVDKKMGEARLRWFGHVRRRGPDA    MDALTRHIQGDVPWCMLFADDIVLIDETRVGVNERLEVWRQTLESKGFKLSRSKTEYLECKFSAESSEVGRDVKLGSQVIAKRDSFRYLGSVIQGEGEIDGDVTHRIGAGWSKWRLASGVLCDKKVPQKLKGKFYRAVVRPAMLYGAECWPVKNSHVQRMKVAEMRMLRWMRGLTRLDRIRNEVIREKVGVALVDEKMREARLRWYGHVRRRRPDA    MDALTRHIQGDVPWCMLFADDIILIDETRAGVSERLEIWRQTLESKGFKISRSKTEYLECKFGDEPSGVGREVMLGSQAIAKRDSVRYLGSVIQGDGEIDGDVTHRIGAGWSKWRLASGVLCDKKIPHKLKGKFFRAMVRPAMFYEAECWPVKNSHIQRMKVAEMRMLRWMCGHTRLDKIKNEVIRQKVGVAPVDKKMGEARLRWFGHVRRRGPDAR*  MKVWERVVEARVREMTSISVNQFGFMPGRSTTEAIHLVRRLVEHFRDKKKDLHMVFIDLENAYDKVPREVLWRCLEAKSVPEAYIRVIKDMYDGAKTRVRTVGGDSDHFPVVMGLHQGSALSPLLFALVMDALTRHIQGDVPWCMLFADDIVLIDETRVGVNERLEVWRQTLESKGFKLSRSKTEYLECKFSAESSEVGRDVKLGSQVIAKRDSFRYLGSVIQGEGEIDGDVTHRIGAGWSKWRLASGVLCDKKVPQKLKGKFYRAVVRPAMLYGAECWPVKNSHVQRMKVAEMRMLRWMRGLTRLDRIRNEVIREKVGVALVDEKMREARLRWYGHVRRRRPDAPVRIYKSAILGHLNSHGSQNALAGPVEAEENRQKTKKEVMEEIIQKSKFFKAQKAKDREENDELTEQLDKDFTSLVESKALLSLTQPDKINALKALVNKNISVGNVKKDEVADVPRKASIGKEKPDTYEMLVSEMALDMRARPSDRTKTPEEIAQEEKERLELLEQEXXXXXXXXXXXXXXDGNASDDNSKLVKDPRTVSGDDLGDDLEEVPRTKLGWIGEILRRKENELESEDAASSGDSDDGEDEGXXXXXXXXXXXXXXXXXXXXDEEQGKTQTIKDWEQSDDDIIDTELEDDDEGFGDDAKKVVKIKDHKEENLSITVAAENKKKMQVFYGVLLQYFAVLANKKPLNSKLLNLLVKPLMEMSAVSPYFAAICARQRLQRTRAQFCEDLKNTGKSSWPSLKTIFLLRLWSMIFPCSDFRHCVMTPAILLMCEYLMRCTIISGRDIAIASFLCSLLLSVIKQSQKFCPEAIVFIQTLLMAALDRKQRSNSQLDNLMEIKELGPLLCIRSSKVEMDSLDFLTLMDLPEDSQYFHSDNYRTSMLVTVLETLQGFVNVYKELISFPEIFMLISKLLCKMAGENHIPDALREKIKDVSQLIDTKAQEHHMLRQPLKMRKKKPVPIRMLNPKFEENFVKGRDYDPDRERA    389.8   1000    216 85.6    185 31  200 0   0   92.6    0   22IV6AV2SN4IV11IL12GSDA1PS1GE3ED1MK4AV6VF9DE29IV1HQ6FY2MV5FL1EG10IV14CR1HL4KR1KR5QE5PL2KE2GR6FY6GR3 85.6    1.1e-107    99.1
gene.92134.0.0.p1   NisylASAF01033898g0006.1    NisylASAF01033898g0006.1    2.6e-302    1037.7  2682    571 548 2   570 4   548 SRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHPRRRSLGDNDADTDEIEDSQSHVPARSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHALKFADPILGMGEKLVQRMRMRSSRFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTLLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDVFVTNNHGNMAKILAGRRRFFGHKPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATGQISHSTSRLETLSKVTSNDYDIDISENQELDMLLSD   SRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHLPRRRSSGDNDADNEIEDSQSHVPAGSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHSLKFADPILRMGEKLVQRMRMRSSHFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTPLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATVEISHSTSRLETSPKVTSNDYDTDISENPELDMLLSD   QSRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHPRRRSLGDNDADTDEIEDSQSHVPARSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHALKFADPILGMGEKLVQRMRMRSSRFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTLLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDVFVTNNHGNMAKILAGRRRFFGHKPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATGQISHSTSRLETLSKVTSNDYDIDISENQELDMLLSD* MTLSRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHLPRRRSSGDNDADNEIEDSQSHVPAGSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHSLKFADPILRMGEKLVQRMRMRSSHFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTPLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATVEISHSTSRLETSPKVTSNDYDTDISENPELDMLLSD    1037.7  2682    570 93.2    531 13  533 3   26  93.5    0   82-L5LS6TND-11RG175AS8GR14RH91LP42V-F-V-T-N-N-H-G-N-M-A-K-I-L-A-G-R-R-R-F-F-G-H-K-66GVQE10LSSP8IT5QP8   93.2    2.6e-302    99.6
gene.96656.0.5.p2   NisylKD954897g0030.1    NisylKD954897g0030.1    7.7e-75 280.0   715 140 968 1   139 371 509 MRCKYLTLNTPMKKWELHGIAILLQSCPWVEMLHINTESAFEVYHFGLHYKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYMLSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSS MRCKYLTLNTPMKKWELPGIAILLQSCPQVEMLHINTESAFEEYHFGSHFKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYILSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSN MRCKYLTLNTPMKKWELHGIAILLQSCPWVEMLHINTESAFEVYHFGLHYKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYMLSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSS*    MSESEGEHEENLDYDSPRYSPYSXXXXXXXXXXXXXXXXXXSDQSYYGGKCHKTEKTDRISALPDSLILHILSSLDMGEVVRTGVLSKRWHLLWTSQQSLIFSYSGQHVNGIYKFVIFIDNTLLLCRSGMVKKFSVDFIYSKRFVRHVNRWMIFIKNKLVEELDLNLRSRGNLIEIYNLPQIMYFDVRLRHLSLCNCNLVPKEEIYWPALRDLEIGYAELNRDVIKKICSGCRALESLKFRSCYGVDYFDIDSKSVKKLVIHEYGRQNHDDADDDDDELGIYARNVTSLEICGYFHKRILVLEDVKALLDAKLDFYRNTDDYEIEREFRTDQNMLKNLLVSLQHVEKLSIGTWCLQVLTSLEIRNLPCPRMRCKYLTLNTPMKKWELPGIAILLQSCPQVEMLHINTESAFEEYHFGSHFKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYILSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSNRFSSLPDSVLLHILSFLPFDDVVRTTLLCKQWRPLWSFSTSLNFIHRPKDFISLKKFASFVDKSLINLHCNNSSISKLHLDFPFKRCFSSDVTVWVLFAITHKVKELNLILSSDAEDLYKLPKRLFSNPFIEKVNWVGCKFDKVEVFRWDSLRELRIGSIEFCDDMVRKVVFGSPCLELLELDNCWGFKRLDLVGGKVSKLVVNGYNGEAVKKNSMLLDFEVVEIEAPCVKVLELKGCFRRMNNIQLKNVMSCVSVKLDFQFTKDEERVNYVDMLMGMIGSLRHVKDVMLGTWCIEVMSSWPMNILPFSMSSYECLTLHTPIQERYLPGIVRILQSSSNLRTLIIHMAPPYFEFEACFIPIVYDVYSVGGRCQLSMLSKNCGLHLKKIRICCFEGMRSGQEVLFLRDLLLVCANLEEMVIEWRSGHQNSSIRDASDEFVAESLLMVQKRSRNAVILFNN    280.0   715 139 95.0    132 7   135 0   0   97.1    0   17HP10WQ13VE4LS1YF38MI49SN  95.0    7.7e-75 99.3
gene.90968.0.2.p2   transcript:OIT02339 transcript:OIT02339 1.3e-209    729.2   1881    391 1270    1   388 881 1268    MNEEASQVANDAIGSIRTVASFCAEEKVMEMYQKKSEAPLKRGVKKGLVSGVGLGFSNFVLFCLYALAFYLGAVLVRHDKAKFSEVFKVFFALTMASIGLSFLSNLPSDLSKGKGAAASIFEILDSKPRIDSSSNEGITLDAIEGNIELQHISFRYPTRPDMQIFRDLSLSIPAGKTVALVGESGSGKSTVISLLERFYDPEQGNIYLDGVEIRKFNLRWLRQQMGLVGQEPILFNETISSNIAYGREGEVTEEEIISVAKSSNAHNFISSLPNGYKTTVGERGVHLSGGQKQRIAIARAILKDPKILLLDEATSALDTESERIVQEALDRVMVNRTTVVVAHRLTTVKNADVIAVVKNGVVAEKGTHDMLMNNPQGVYASLVALQTG    MNEEASQVANDAIGSIRTVASFCAEEKVMEMYQKKSEAPLKRGVKNGLVSGVGLGFSNFVLFCLYALAFYLGAVLVKHDKAKFSEVFKVFFALTMASIGLSVLSNLPSDLSKGKGAAASIIEILDSKPRIDSSSNEGITLDAIEGNIELQHISFRYPTRPDMQIFRDLSLSIPAGKTVALVGESGSGKSTVISLLERFYDPEQGNIYLDGVEIRKFNLRWLRQQMGLVGQEPILFNETISSNIAYGREGEVTEEEIISVAKSSNAHNFISSLPNGYKTTVGERGVQLSGGQKQRIAIARAILKDPKILLLDEATSALDTESERIVQEALDRVMVNRTTVVVAHRLTTVKNADVIAVVKNGVVAEKGTHDMLMNNPQGVYASLVALQTG    MNEEASQVANDAIGSIRTVASFCAEEKVMEMYQKKSEAPLKRGVKKGLVSGVGLGFSNFVLFCLYALAFYLGAVLVRHDKAKFSEVFKVFFALTMASIGLSFLSNLPSDLSKGKGAAASIFEILDSKPRIDSSSNEGITLDAIEGNIELQHISFRYPTRPDMQIFRDLSLSIPAGKTVALVGESGSGKSTVISLLERFYDPEQGNIYLDGVEIRKFNLRWLRQQMGLVGQEPILFNETISSNIAYGREGEVTEEEIISVAKSSNAHNFISSLPNGYKTTVGERGVHLSGGQKQRIAIARAILKDPKILLLDEATSALDTESERIVQEALDRVMVNRTTVVVAHRLTTVKNADVIAVVKNGVVAEKGTHDMLMNNPQGVYASLVALQTGAT* MAEGGEPSSARRKEEDDQKVPFYMLFAFADRTDVILMLFGTLGAIASGISKPLMSLIFGDLVNSYGKSNQSNILDQVSGISLKFVYLAIGSGIASVFQIACWVVTGERQATRIKCLYLKTILRQDIGFFDTQSATGEFIERMSGDTILVQEAMGDKVANFIMNISTFIGGFVVAFIKGWLLTLVLLTSIPATAISFGCVALVLSKMSGSGQVAYADAGKVVEQTVGGIRTVASFTGEKLAIEDYNSKLESAYSATIKQALASGLGLGTILTLIFFSYGLAIWYGAKLIIEKDYKGGDIISVIFAVMLGGSSLGQASPSLNAFSAGQAAAYKIFETIKRTPKIDPYDPSGIELEDIKGEIELKDVYFKYPARPDVQIFSGFSLYIPSGKTAALVGQSGSGKSTVISLLERFYDPEAGEILIDGVEIKKFQLKWLRQQMGLVSQEPVLFATTIRENIIYGKENASEEEIRNAIQLANAAKFIDKLPKGLDTMVGGHGTQISGGQKQRIAIARAILKDPRILLLDEATSALDVESERIVQDALSNIMVNRTTVVVAHRLTTIRNADLIAVVHLGKLVEQGTHDELIKDPEGAYSQLVQMQQKTKHVENTKGKEIEELNAPKRLSYSKNVSGRSRRFSLSGRKSASKGSSSKFSFAYDLGVSGVVDFHESIRREDGAGTSEYIADTKKKVSTQKLMSLAYLNKPELPIMLVGTVAAAINGMVFPVFGLLVSTIIKIFYESHHELRKDSRFWALMFVVIGIVVMIVSPLQNYAFGVAGAKLIQRIRSMTFSKLVYQEISWFDDPANSCGAIGARLSSDASTIRNMVGDALATLVQNISTIVTGLVIALIANWILALITIAIMPLLALQGYIQIKLLQESNNEAKLMNEEASQVANDAIGSIRTVASFCAEEKVMEMYQKKSEAPLKRGVKNGLVSGVGLGFSNFVLFCLYALAFYLGAVLVKHDKAKFSEVFKVFFALTMASIGLSVLSNLPSDLSKGKGAAASIIEILDSKPRIDSSSNEGITLDAIEGNIELQHISFRYPTRPDMQIFRDLSLSIPAGKTVALVGESGSGKSTVISLLERFYDPEQGNIYLDGVEIRKFNLRWLRQQMGLVGQEPILFNETISSNIAYGREGEVTEEEIISVAKSSNAHNFISSLPNGYKTTVGERGVQLSGGQKQRIAIARAILKDPKILLLDEATSALDTESERIVQEALDRVMVNRTTVVVAHRLTTVKNADVIAVVKNGVVAEKGTHDMLMNNPQGVYASLVALQTGTA  729.2   1881    388 98.7    383 5   384 0   0   99.0    0   45KN30RK24FV18FI164HQ102    98.7    1.3e-209    99.2
gene.69001.1.0.p1   NisylKD955766g0010.1    NisylKD955766g0010.1    1.8e-206    718.8   1854    393 530 1   384 1   384 MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHK    MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHK    MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKVSETVVLV*   MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  718.8   1854    384 95.6    367 17  374 0   0   97.4    0   21HR2LP9VA7GD29HP5EDSR4SA20MV25ED1FY40IL74HD62ED11MK10HR40TM7   95.6    1.8e-206    97.7
gene.35466.0.0.p2   NiotoAWOL01S0001629g0004.1  NiotoAWOL01S0001629g0004.1  1.0e-59 229.6   584 118 889 1   118 669 786 QKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETDEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGR  QKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETEEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGR  QKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETDEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGR  MEIDGRERSVEMRDHDDSPVKERWEDGHYDLEESGHDKSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGRDAVDKEKGXXXXXXXXXXADEXXXXXXXXXXGNRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKQEIVSYEDDDRARNNAVETAGSQSSASKLEERILKMKEERLKKKSEGASEVMAWVSKSRKIEEKRTAEKERALQLSKIFEEQDKINDEESDDEEKARLAAKELGGMKVLHGLDKVVEGGAVVLTLKDQSILAGDDINQEVDVLENVEIGEQKKRDDAYKAAKKKTGIYDDKFNDDPGFERKILPQYDDPAEEEGVTLDATGGFSVDAEKKLEELRKRIQGSSSKTLAEDLNSSGKLLSDYYTQEEMLQFKKPKKKKSLRKKEKMDLDALEVEAKSSGLGVGDLGSRNDKTRQALREEMERAEAETKSKSYQAAYAKAEEASKALRPEKTNXXXXXXXXXXXXXXXXELRKSLERARKLALQKQEGLAKTFPESIASLAISRANDSTVDNPSSVSGESQENKVVFTEMEEFVWGLQLDEEEQKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETEEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGRILTPKEAFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNSDTPSLSVERMREAQAQFKTPYLVLSGNVKPGQTSDPRSGFATVEKSLPGGLTPMLGDKK   229.6   584 118 99.2    117 1   118 0   0   100.0   0   36DE81  99.2    1.0e-59 100.0
gene.86248.0.0.p1   Nitab4.5_0000420g0110.1 Nitab4.5_0000420g0110.1 Protein of unknown function DUF538  8.2e-74 276.9   707 175 140 35  174 1   140 MTSQVTENHRENAEVFTNPAICKQKSLELLEQTNMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFVEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE    MSSQVSENHRENAEVFTDPAICKQKSLELLEQINMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFIEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE    THFLYFPFPLSHTEPQTKRNLNPISFPFSFAFTKMTSQVTENHRENAEVFTNPAICKQKSLELLEQTNMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFVEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE* MSSQVSENHRENAEVFTDPAICKQKSLELLEQINMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFIEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE    276.9   707 140 96.4    135 5   139 0   0   99.3    0   1TS3TS11ND14TI52VI54    96.4    8.2e-74 80.0
gene.9403.0.4.p1    transcript:OIT35479 transcript:OIT35479 8.5e-191    667.5   1721    690 406 1   378 1   378 MLSAPRVSPPAVAVAAPARFKFPNVCVNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIIWGGTEDDDSSIPSKEVLSWKPLASTPXXXXXXXXXXXXXDEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHNKHNIADASSRSSFSSYNEPDQLKEQQTLSLPRGRAKIQQLDDKKNFQKLIRVEDEDRGIAIENVSKHFAGYSIDSHAQSARVVHPGSKASASPLRGWGGGSSHYSLKRDEIFRERQNLGDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQV  MLSAPRAPPPAVAVAAPARFKFQNVCGNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIVWGGTEDDDSSIPSKEVLSWKPLASTSPDNNHPPPTQSSSNEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHDKHNTTDASSRSSFSSYNEPGQLKEQQTLSLPRGRAKIQQLEDRKNSQKLIRVEDEDRDIAIENVSKHFAGYSSDSHAHSARVVHPGSKASASPLRGWGGGSSHYSLKREEIFRQRRNLDDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQV  MLSAPRVSPPAVAVAAPARFKFPNVCVNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIIWGGTEDDDSSIPSKEVLSWKPLASTPXXXXXXXXXXXXXDEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHNKHNIADASSRSSFSSYNEPDQLKEQQTLSLPRGRAKIQQLDDKKNFQKLIRVEDEDRGIAIENVSKHFAGYSIDSHAQSARVVHPGSKASASPLRGWGGGSSHYSLKRDEIFRERQNLGDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQVLSTCRSFSKSGVPFHSMVVTGGFCQRTQLENLRQELDILIATPGRFMFLIKEGYLQLTNLKCAVLDEVDILFSDEDFETAFQCLINSSPITTQYLFVTATLPMDIYNKLVESFPDCELVSGPGMHRTSPGLEEFLVDCSGDETAEKSPDTAFINKKNALLHLVEDSPVPKTIVFCNKIDSCRKVENALKRFDRKGFSIKILPFHAALDQRRRLANMEEFRRSKMENVSLFLVCTDRASRGIDFEGVDHVVLFDYPRDPSEYVRRVGRTARGAGGKGKAFIFAVGKQVSLARRIMERNKKGHPVHDVPSILT*  MLSAPRAPPPAVAVAAPARFKFQNVCGNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIVWGGTEDDDSSIPSKEVLSWKPLASTSPDNNHPPPTQSSSNEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHDKHNTTDASSRSSFSSYNEPGQLKEQQTLSLPRGRAKIQQLEDRKNSQKLIRVEDEDRDIAIENVSKHFAGYSSDSHAHSARVVHPGSKASASPLRGWGGGSSHYSLKREEIFRQRRNLDDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQVCQISSSIKGTFATYSPYCSATTHTKRKK  667.5   1721    378 91.0    344 34  352 0   0   93.1    0   6VASP14PQ3VG50IV25PSXPXDXNXNXHXPXPXPXTXQXSXSXSDN38ND3ITAT14DG20DE1KR2FS11GD14IS4QH30DE4EQ1QR2GD102  91.0    8.5e-191    54.8
gene.91393.0.0.p1   Solyc12g056340.2.1  Solyc12g056340.2.1 RNA helicase DEAD38  1.8e-223    775.4   2001    437 806 24  437 393 806 LPAIETVLKASNSKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHESIGVQTLVGGTRFKEDQKRLESNPCQIIVATPGRLLDHIENKSGFSTRLMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRKRQSLLFSATVPKEVRRVSQLVLKREHGYVDTVGLGLETNPKVKQFYLVAPHEQHFQLVHHLLTSHISEVPDYKVIVFCTTAMMTSLMFSLLREMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQIGIPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPHLDPRAKVKIEEAIGKMDASVKEAAYHAWLGYYNSVREIGRDKTTLVELANQFSESIGLQKPPSLFRRTALKMGLKDIPGIRIRK*  LPAIETVLKASRKKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHEGIGVQTLVGGTRFKEDQKRLECDPCQIIVATPGRLLDHIENKSGFSTRIMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRRRQSLLFSATVPKEVRRISQLVLKREYDYVDTVGLGLETNPKVKQFYLVAPHEQHFQVVHHLLSSHISEVPDYKVIVFCTTAMMTSLMFSLFHEMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQVGLPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPRLDPRVKVKMEEAMEKMDTSVKESAYHAWLGYYNSVREVGRDKTTLVELANQFSESIGLDKPPSLFRRTALKMGLKDIPGIRIRK*  MLFPADYLHVSPVLFIAAIKVQQLPAIETVLKASNSKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHESIGVQTLVGGTRFKEDQKRLESNPCQIIVATPGRLLDHIENKSGFSTRLMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRKRQSLLFSATVPKEVRRVSQLVLKREHGYVDTVGLGLETNPKVKQFYLVAPHEQHFQLVHHLLTSHISEVPDYKVIVFCTTAMMTSLMFSLLREMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQIGIPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPHLDPRAKVKIEEAIGKMDASVKEAAYHAWLGYYNSVREIGRDKTTLVELANQFSESIGLQKPPSLFRRTALKMGLKDIPGIRIRK*   MGGGPRTFPGGLNKWQWKRLHEKKARDKENRLLDQEKQLYQARIRSQIRAKLTSSGEQSDFSNEQQPNYSPVSPQDHIRGLADRFMKEGAEDLWNEDDGPVNTPQINQQSGGISESIDLRKLRDTKFNDVPRSYSFQKARNFCTNISDVFAENCRTRNPTFSDSWSRQNKFLMFGWRLVNIENRNVNNLNGFLNYRCYSVDRMNGNKLRKLDFTRNESSQSEDKLRSVGLVVKGERKAKWPRFRPKPEESXXXXXXXXXXXXXXXXXXRSRGSVKMMSSAALGKYDMKTKKRVPLKFVEDEDDLSLHVAAIRKEVKGRSMQKIETEEDEKETILSSKRFDEYDVSPLTVKALTAAGYVQMTKVQEATLSTCLEGKDALVKARTGTGKSAAFLLPAIETVLKASRKKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHEGIGVQTLVGGTRFKEDQKRLECDPCQIIVATPGRLLDHIENKSGFSTRIMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRRRQSLLFSATVPKEVRRISQLVLKREYDYVDTVGLGLETNPKVKQFYLVAPHEQHFQVVHHLLSSHISEVPDYKVIVFCTTAMMTSLMFSLFHEMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQVGLPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPRLDPRVKVKMEEAMEKMDTSVKESAYHAWLGYYNSVREVGRDKTTLVELANQFSESIGLDKPPSLFRRTALKMGLKDIPGIRIRK*  775.4   2001    414 94.0    389 25  402 0   0   97.1    0   11NRSK36SG20SCND25LI32KR16VI8HYGD29LV5TS27LFRH53IV1IL49HR4AV3IM3IMGE3AT4AS14IV20QD26    94.0    1.8e-223    94.7
gene.69001.1.3.p1   NisylKD955766g0010.1    NisylKD955766g0010.1    1.4e-228    792.3   2045    434 530 1   420 1   420 MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIG    MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIG    MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGMFLLPTLLSSICK*  MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  792.3   2045    420 96.0    403 17  410 0   0   97.6    0   21HR2LP9VA7GD29HP5EDSR4SA20MV25ED1FY40IL74HD62ED11MK10HR40TM43  96.0    1.4e-228    96.8
gene.18823.1.1.p2   transcript:OIT25066 transcript:OIT25066 1.0e-56 219.5   558 115 185 1   113 72  184 MLGQMVGRIRAKPGGKLEMGEASMVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQVQRILQFVSLPPEDTSKKRS   MLGQMVGRIQAKPGGKLEMGEASVVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQIQRILQFVSLPPEDTSKKRS   MLGQMVGRIRAKPGGKLEMGEASMVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQVQRILQFVSLPPEDTSKKRSI* MGQAFRRATGRIGSSNVDAASSQLKKPIDRTPPPVPAAIKTPSDNVAPVAGSSPKDAVGETLEERDPKFDAMLGQMVGRIQAKPGGKLEMGEASVVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQIQRILQFVSLPPEDTSKKRST   219.5   558 113 97.3    110 3   113 0   0   100.0   0   9RQ13MV69VI19   97.3    1.0e-56 98.3
gene.71087.0.0.p1   transcript:OIT01688 transcript:OIT01688 3.8e-101    367.9   943 190 639 1   189 451 639 DRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEALQNQQISPRSDVYCLGIIILEILTGKFPSQYLNNQKGGTDIVQWVQSAIVDNRESELIDQEIANATDSSEQMVKLLHVGAACTVSDPDERIDMKEASRRIEEISLI   DRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEAIQNEQISPKSDVYCLGIIILEILTGKFPSQYLNNQKGGTDVVQWVQSAIAENRESELIDQEIANATDSIEQMVKLLHVGAACTVSDPDKRIDMKEASRRIEEISLI   DRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEALQNQQISPRSDVYCLGIIILEILTGKFPSQYLNNQKGGTDIVQWVQSAIVDNRESELIDQEIANATDSSEQMVKLLHVGAACTVSDPDERIDMKEASRRIEEISLI*  MSSNKHGRCSPLPSSSSNLSQKLLFFYISIFFLIFLSNTPFSFSISEDEALIKFKESLKNTTALDSTWHKGSNPCDKNKKWTRVQCEGNAVEGLLLGEAGLSGEIDVDPLIALPGLRVLELANNSFSGTIPEFFLLGALKSIYIDGXXXXXXXPKDFFSKMXXXXXXXXXXXXXXXXXXESLANLKYLMELHLESXXXXXXXXSFSQASLASIDLSNNKLQGEIPQSMSKFGSDSFKGNNELCGKQLGKECNKEKENNTFQKAPMSKLKWIILGLVVGLLLITILFKAKRKEDHFDKLGKENLDEGLHVSSSNRKSMSIRSEGGDSVHGSSRRGAGSQRGKAMGDLVLVNEEKGTFGLPDLMKAAAEVLGNGVLGSAYKAKMVNGLSVVVKRLREMNKMNRDVFDTEIRKISKLRHRNILQLLAYHYRKEEKLLVSEYVPKGSLLYLLHGDRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEAIQNEQISPKSDVYCLGIIILEILTGKFPSQYLNNQKGGTDVVQWVQSAIAENRESELIDQEIANATDSIEQMVKLLHVGAACTVSDPDKRIDMKEASRRIEEISLI 367.9   943 189 95.8    181 8   187 0   0   98.9    0   83LI2QE4RK31IV8VADE17SI19EK17   95.8    3.8e-101    99.5
gene.69001.1.1.p1   NisylKD955766g0010.1    NisylKD955766g0010.1    1.4e-294    1011.9  2615    531 530 1   530 1   530 MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRAIQKGSKDNITVIVVDLKAQRKFKSKT  MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRAIQKGSKDNITVIVVDLKAQRKFKSKT* MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  1011.9  2615    530 96.6    512 18  519 0   0   97.9    0   21HR2LP9VA7GD29HP5EDSR4SA20MV25ED1FY40IL74HD62ED11MK10HR40TM127IT25 96.6    1.4e-294    99.8
gene.69001.9.9.p1   NisylKD955766g0010.1    NisylKD955766g0010.1    1.4e-294    1011.9  2615    531 530 1   530 1   530 MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRAIQKGSKDNITVIVVDLKAQRKFKSKT  MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRAIQKGSKDNITVIVVDLKAQRKFKSKT* MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  1011.9  2615    530 96.6    512 18  519 0   0   97.9    0   21HR2LP9VA7GD29HP5EDSR4SA20MV25ED1FY40IL74HD62ED11MK10HR40TM127IT25 96.6    1.4e-294    99.8

如果我理解您的问题,并且您希望保留字段 30 中具有最低值的唯一记录,用于字段 1 中具有公共前缀的记录,例如, gene.90968gene.69001 ,它们将对应于以下值:

gene.90968 - 0.0e+00
gene.18823 - 1.0e-56
gene.9403 - 8.5e-191
gene.35466 - 1.0e-59
gene.91393 - 0
gene.92134 - 2.6e-302
gene.71087 - 3.8e-101
gene.69001 - 1.4e-294
gene.96656 - 7.7e-75
gene.86248 - 0

然后你可以split() field-1 on '.' ,并使用第一和第二部分作为数组索引的前缀(如上所示),维护两个数组(1)保存对应于字段 30 的最低值的整个记录​​和(2)第二个数组保存字段-30,您可以仅考虑以"gene"开头的记录执行以下操作:

awk ' /^gene/ {
    split ($1,a,".")
    if (a[1] SUBSEP a[2] in arr) {
        if ($30 < v[a[1],a[2]]) {
            arr[a[1],a[2]]=$0
            v[a[1],a[2]]=$30
        }
        else if ($30 == v[a[1],a[2]]) {       ## handle prefix where field-30
            arr[a[1],a[2],++n[a[1],a[2]]]=$0  ## are equal between the two
        }
    }
    else {
        arr[a[1],a[2]]=$0
        v[a[1],a[2]]=$30
    }
    next
}
{ print }
END { for(i in arr) print arr[i] }' file

"gene"以外的其他内容开头的所有其他记录均保持不变输出。 记录的顺序会改变。

这将输出字段 30 中具有最低值的10唯一记录作为公共前缀。

输出

gene.90968.0.2.p1   transcript:OIT02339 transcript:OIT02339 0.0e+00 1592.0  4121    887 1270    1   881 1   880 MAEGGEPSSARRKEEENDQKIPFYMLFAFADRTDVILMLFGTFGAIASGISQPLMSLIFGDLVNSYGKSDQSNILDQVSGISLKFVYLAIGSGIASVFQIACWVVTGERQATRIKCLYLKTILRQDIGFFDTQSATGEFIEKMSGDTILVQEAMGDKVANFIMNVSTSIGGFVVAFIKGWLLTLVLLTSIPATAISFGCVALVLSKMSGSGQVAYADAGKVVEQTVGGIRTVASFTGEKLAIEDYNSKLESADSATVKQALASGLGLGTILTLIFFSYGLAIWYGAKLIIEKDYKGGDIISVIFAVMLGGSSLGQASPSLNAFSAGQAAAYKIFETIKRTPKIDPYDPSGVKLEDIKGEIELKDVYFKYPARPDVQIFSGFSLYIPSGKTAALVGQSGSGKSTVISLLERFYDPEAGEILIDGVEIKKFQLEWLRQQMGLVSQEPVLFATTIRENIIYGKENASEEEIRNAIQLANAAKFIDKLPKGLDTMVGGHGTQISGGQKQRIAIARAILKDPRILLLDEATSALDVESERTVQDALSNIMINRTTVVVAHRLTTIRNADLIAVVHLGKLVEQGTHDELIKDPEGAYSQLVQMQQKTKHVENTKGKEIEDLNAQKRLSYSKNFSGRSRRFSLSGRKSASKGSSSKFSFAYDLGVSDVVDFHESIRREDEAGTSEYTVDTTKKVSTQKLMSLAYLNKPELPIMLVGTVAAAINGMVYPVFGLLVSTIIKIFYESHHELRKDSRFWALMFVVIGIVVMIVSPLQNYAFGVAGAKLIQRIRSMMFSKLVYQEISWFDDPANSCGAIGARLSSDASTIRNMVGDALATLVQNISTIVTGLVIALIANWILALITIAIMPVLALQGYIQIKLLQESNVEAKL   MAEGGEPSSARRKEEDDQKVPFYMLFAFADRTDVILMLFGTLGAIASGISKPLMSLIFGDLVNSYGKSNQSNILDQVSGISLKFVYLAIGSGIASVFQIACWVVTGERQATRIKCLYLKTILRQDIGFFDTQSATGEFIERMSGDTILVQEAMGDKVANFIMNISTFIGGFVVAFIKGWLLTLVLLTSIPATAISFGCVALVLSKMSGSGQVAYADAGKVVEQTVGGIRTVASFTGEKLAIEDYNSKLESAYSATIKQALASGLGLGTILTLIFFSYGLAIWYGAKLIIEKDYKGGDIISVIFAVMLGGSSLGQASPSLNAFSAGQAAAYKIFETIKRTPKIDPYDPSGIELEDIKGEIELKDVYFKYPARPDVQIFSGFSLYIPSGKTAALVGQSGSGKSTVISLLERFYDPEAGEILIDGVEIKKFQLKWLRQQMGLVSQEPVLFATTIRENIIYGKENASEEEIRNAIQLANAAKFIDKLPKGLDTMVGGHGTQISGGQKQRIAIARAILKDPRILLLDEATSALDVESERIVQDALSNIMVNRTTVVVAHRLTTIRNADLIAVVHLGKLVEQGTHDELIKDPEGAYSQLVQMQQKTKHVENTKGKEIEELNAPKRLSYSKNVSGRSRRFSLSGRKSASKGSSSKFSFAYDLGVSGVVDFHESIRREDGAGTSEYIADTKKKVSTQKLMSLAYLNKPELPIMLVGTVAAAINGMVFPVFGLLVSTIIKIFYESHHELRKDSRFWALMFVVIGIVVMIVSPLQNYAFGVAGAKLIQRIRSMTFSKLVYQEISWFDDPANSCGAIGARLSSDASTIRNMVGDALATLVQNISTIVTGLVIALIANWILALITIAIMPLLALQGYIQIKLLQESNNEAKL    MAEGGEPSSARRKEEENDQKIPFYMLFAFADRTDVILMLFGTFGAIASGISQPLMSLIFGDLVNSYGKSDQSNILDQVSGISLKFVYLAIGSGIASVFQIACWVVTGERQATRIKCLYLKTILRQDIGFFDTQSATGEFIEKMSGDTILVQEAMGDKVANFIMNVSTSIGGFVVAFIKGWLLTLVLLTSIPATAISFGCVALVLSKMSGSGQVAYADAGKVVEQTVGGIRTVASFTGEKLAIEDYNSKLESADSATVKQALASGLGLGTILTLIFFSYGLAIWYGAKLIIEKDYKGGDIISVIFAVMLGGSSLGQASPSLNAFSAGQAAAYKIFETIKRTPKIDPYDPSGVKLEDIKGEIELKDVYFKYPARPDVQIFSGFSLYIPSGKTAALVGQSGSGKSTVISLLERFYDPEAGEILIDGVEIKKFQLEWLRQQMGLVSQEPVLFATTIRENIIYGKENASEEEIRNAIQLANAAKFIDKLPKGLDTMVGGHGTQISGGQKQRIAIARAILKDPRILLLDEATSALDVESERTVQDALSNIMINRTTVVVAHRLTTIRNADLIAVVHLGKLVEQGTHDELIKDPEGAYSQLVQMQQKTKHVENTKGKEIEDLNAQKRLSYSKNFSGRSRRFSLSGRKSASKGSSSKFSFAYDLGVSDVVDFHESIRREDEAGTSEYTVDTTKKVSTQKLMSLAYLNKPELPIMLVGTVAAAINGMVYPVFGLLVSTIIKIFYESHHELRKDSRFWALMFVVIGIVVMIVSPLQNYAFGVAGAKLIQRIRSMMFSKLVYQEISWFDDPANSCGAIGARLSSDASTIRNMVGDALATLVQNISTIVTGLVIALIANWILALITIAIMPVLALQGYIQIKLLQESNVEAKLPVVMF* MAEGGEPSSARRKEEDDQKVPFYMLFAFADRTDVILMLFGTLGAIASGISKPLMSLIFGDLVNSYGKSNQSNILDQVSGISLKFVYLAIGSGIASVFQIACWVVTGERQATRIKCLYLKTILRQDIGFFDTQSATGEFIERMSGDTILVQEAMGDKVANFIMNISTFIGGFVVAFIKGWLLTLVLLTSIPATAISFGCVALVLSKMSGSGQVAYADAGKVVEQTVGGIRTVASFTGEKLAIEDYNSKLESAYSATIKQALASGLGLGTILTLIFFSYGLAIWYGAKLIIEKDYKGGDIISVIFAVMLGGSSLGQASPSLNAFSAGQAAAYKIFETIKRTPKIDPYDPSGIELEDIKGEIELKDVYFKYPARPDVQIFSGFSLYIPSGKTAALVGQSGSGKSTVISLLERFYDPEAGEILIDGVEIKKFQLKWLRQQMGLVSQEPVLFATTIRENIIYGKENASEEEIRNAIQLANAAKFIDKLPKGLDTMVGGHGTQISGGQKQRIAIARAILKDPRILLLDEATSALDVESERIVQDALSNIMVNRTTVVVAHRLTTIRNADLIAVVHLGKLVEQGTHDELIKDPEGAYSQLVQMQQKTKHVENTKGKEIEELNAPKRLSYSKNVSGRSRRFSLSGRKSASKGSSSKFSFAYDLGVSGVVDFHESIRREDGAGTSEYIADTKKKVSTQKLMSLAYLNKPELPIMLVGTVAAAINGMVFPVFGLLVSTIIKIFYESHHELRKDSRFWALMFVVIGIVVMIVSPLQNYAFGVAGAKLIQRIRSMTFSKLVYQEISWFDDPANSCGAIGARLSSDASTIRNMVGDALATLVQNISTIVTGLVIALIANWILALITIAIMPLLALQGYIQIKLLQESNNEAKLMNEEASQVANDAIGSIRTVASFCAEEKVMEMYQKKSEAPLKRGVKNGLVSGVGLGFSNFVLFCLYALAFYLGAVLVKHDKAKFSEVFKVFFALTMASIGLSVLSNLPSDLSKGKGAAASIIEILDSKPRIDSSSNEGITLDAIEGNIELQHISFRYPTRPDMQIFRDLSLSIPAGKTVALVGESGSGKSTVISLLERFYDPEQGNIYLDGVEIRKFNLRWLRQQMGLVGQEPILFNETISSNIAYGREGEVTEEEIISVAKSSNAHNFISSLPNGYKTTVGERGVQLSGGQKQRIAIARAILKDPKILLLDEATSALDTESERIVQEALDRVMVNRTTVVVAHRLTTVKNADVIAVVKNGVVAEKGTHDMLMNNPQGVYASLVALQTGTA  1592.0  4121    881 96.8    853 27  867 1   1   98.4    0   15EDN-3IV21FL8QK17DN71KR22VI2SF84DY3VI93VIKE79EK103TI9IV67DE3QP8FV32DG12EG6TIVA2TK35YF64MT74VL16VN4 96.8    0.0e+00 99.3
gene.18823.1.1.p2   transcript:OIT25066 transcript:OIT25066 1.0e-56 219.5   558 115 185 1   113 72  184 MLGQMVGRIRAKPGGKLEMGEASMVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQVQRILQFVSLPPEDTSKKRS   MLGQMVGRIQAKPGGKLEMGEASVVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQIQRILQFVSLPPEDTSKKRS   MLGQMVGRIRAKPGGKLEMGEASMVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQVQRILQFVSLPPEDTSKKRSI* MGQAFRRATGRIGSSNVDAASSQLKKPIDRTPPPVPAAIKTPSDNVAPVAGSSPKDAVGETLEERDPKFDAMLGQMVGRIQAKPGGKLEMGEASVVEKYDRALPKLRNTTSESSRYEERPAPPGTLNVAQIREIILLHQGRADDHKGSMDINQIAQRFRVDAAQIQRILQFVSLPPEDTSKKRST   219.5   558 113 97.3    110 3   113 0   0   100.0   0   9RQ13MV69VI19   97.3    1.0e-56 98.3
gene.9403.0.4.p1    transcript:OIT35479 transcript:OIT35479 8.5e-191    667.5   1721    690 406 1   378 1   378 MLSAPRVSPPAVAVAAPARFKFPNVCVNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIIWGGTEDDDSSIPSKEVLSWKPLASTPXXXXXXXXXXXXXDEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHNKHNIADASSRSSFSSYNEPDQLKEQQTLSLPRGRAKIQQLDDKKNFQKLIRVEDEDRGIAIENVSKHFAGYSIDSHAQSARVVHPGSKASASPLRGWGGGSSHYSLKRDEIFRERQNLGDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQV  MLSAPRAPPPAVAVAAPARFKFQNVCGNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIVWGGTEDDDSSIPSKEVLSWKPLASTSPDNNHPPPTQSSSNEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHDKHNTTDASSRSSFSSYNEPGQLKEQQTLSLPRGRAKIQQLEDRKNSQKLIRVEDEDRDIAIENVSKHFAGYSSDSHAHSARVVHPGSKASASPLRGWGGGSSHYSLKREEIFRQRRNLDDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQV  MLSAPRVSPPAVAVAAPARFKFPNVCVNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIIWGGTEDDDSSIPSKEVLSWKPLASTPXXXXXXXXXXXXXDEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHNKHNIADASSRSSFSSYNEPDQLKEQQTLSLPRGRAKIQQLDDKKNFQKLIRVEDEDRGIAIENVSKHFAGYSIDSHAQSARVVHPGSKASASPLRGWGGGSSHYSLKRDEIFRERQNLGDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQVLSTCRSFSKSGVPFHSMVVTGGFCQRTQLENLRQELDILIATPGRFMFLIKEGYLQLTNLKCAVLDEVDILFSDEDFETAFQCLINSSPITTQYLFVTATLPMDIYNKLVESFPDCELVSGPGMHRTSPGLEEFLVDCSGDETAEKSPDTAFINKKNALLHLVEDSPVPKTIVFCNKIDSCRKVENALKRFDRKGFSIKILPFHAALDQRRRLANMEEFRRSKMENVSLFLVCTDRASRGIDFEGVDHVVLFDYPRDPSEYVRRVGRTARGAGGKGKAFIFAVGKQVSLARRIMERNKKGHPVHDVPSILT*  MLSAPRAPPPAVAVAAPARFKFQNVCGNPVNLLLLHRNVGSSCKRVVVSTKAAYSRMPMDTPGAYQLIDKESGDKFIVWGGTEDDDSSIPSKEVLSWKPLASTSPDNNHPPPTQSSSNEASTRGLTGNFGRLKFRRMRDLVRKSYTKNKERDVIDHDKHNTTDASSRSSFSSYNEPGQLKEQQTLSLPRGRAKIQQLEDRKNSQKLIRVEDEDRDIAIENVSKHFAGYSSDSHAHSARVVHPGSKASASPLRGWGGGSSHYSLKREEIFRQRRNLDDENNFFSRKSFQELGCSDYMIESLRNQHFVRPSHIQAMTFGPIIAGKSCIISDQSGSGKTLAYLLPLIQRLRQEELQGLSKPSSQSPRVVVLAPTAELASQVCQISSSIKGTFATYSPYCSATTHTKRKK  667.5   1721    378 91.0    344 34  352 0   0   93.1    0   6VASP14PQ3VG50IV25PSXPXDXNXNXHXPXPXPXTXQXSXSXSDN38ND3ITAT14DG20DE1KR2FS11GD14IS4QH30DE4EQ1QR2GD102  91.0    8.5e-191    54.8
gene.35466.0.0.p2   NiotoAWOL01S0001629g0004.1  NiotoAWOL01S0001629g0004.1  1.0e-59 229.6   584 118 889 1   118 669 786 QKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETDEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGR  QKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETEEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGR  QKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETDEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGR  MEIDGRERSVEMRDHDDSPVKERWEDGHYDLEESGHDKSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGRDAVDKEKGXXXXXXXXXXADEXXXXXXXXXXGNRRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKQEIVSYEDDDRARNNAVETAGSQSSASKLEERILKMKEERLKKKSEGASEVMAWVSKSRKIEEKRTAEKERALQLSKIFEEQDKINDEESDDEEKARLAAKELGGMKVLHGLDKVVEGGAVVLTLKDQSILAGDDINQEVDVLENVEIGEQKKRDDAYKAAKKKTGIYDDKFNDDPGFERKILPQYDDPAEEEGVTLDATGGFSVDAEKKLEELRKRIQGSSSKTLAEDLNSSGKLLSDYYTQEEMLQFKKPKKKKSLRKKEKMDLDALEVEAKSSGLGVGDLGSRNDKTRQALREEMERAEAETKSKSYQAAYAKAEEASKALRPEKTNXXXXXXXXXXXXXXXXELRKSLERARKLALQKQEGLAKTFPESIASLAISRANDSTVDNPSSVSGESQENKVVFTEMEEFVWGLQLDEEEQKPGSDDVFMEEEVLPKPSDEEMKTEDGGWTEVKETEEEEPSVKEEEMEVTPDATIHEVPVGKGLSGALKLLQERGTLKEDIEWGGRNMDKKKSKLVGIRGEDGKKEIRIERTDEYGRILTPKEAFRLLSHKFHGKGPGKMKQEKRMRQYQEELKIKQMKNSDTPSLSVERMREAQAQFKTPYLVLSGNVKPGQTSDPRSGFATVEKSLPGGLTPMLGDKK   229.6   584 118 99.2    117 1   118 0   0   100.0   0   36DE81  99.2    1.0e-59 100.0
gene.91393.0.0.p1   Solyc12g056340.2.1  Solyc12g056340.2.1 RNA helicase DEAD38  1.8e-223    775.4   2001    437 806 24  437 393 806 LPAIETVLKASNSKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHESIGVQTLVGGTRFKEDQKRLESNPCQIIVATPGRLLDHIENKSGFSTRLMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRKRQSLLFSATVPKEVRRVSQLVLKREHGYVDTVGLGLETNPKVKQFYLVAPHEQHFQLVHHLLTSHISEVPDYKVIVFCTTAMMTSLMFSLLREMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQIGIPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPHLDPRAKVKIEEAIGKMDASVKEAAYHAWLGYYNSVREIGRDKTTLVELANQFSESIGLQKPPSLFRRTALKMGLKDIPGIRIRK*  LPAIETVLKASRKKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHEGIGVQTLVGGTRFKEDQKRLECDPCQIIVATPGRLLDHIENKSGFSTRIMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRRRQSLLFSATVPKEVRRISQLVLKREYDYVDTVGLGLETNPKVKQFYLVAPHEQHFQVVHHLLSSHISEVPDYKVIVFCTTAMMTSLMFSLFHEMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQVGLPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPRLDPRVKVKMEEAMEKMDTSVKESAYHAWLGYYNSVREVGRDKTTLVELANQFSESIGLDKPPSLFRRTALKMGLKDIPGIRIRK*  MLFPADYLHVSPVLFIAAIKVQQLPAIETVLKASNSKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHESIGVQTLVGGTRFKEDQKRLESNPCQIIVATPGRLLDHIENKSGFSTRLMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRKRQSLLFSATVPKEVRRVSQLVLKREHGYVDTVGLGLETNPKVKQFYLVAPHEQHFQLVHHLLTSHISEVPDYKVIVFCTTAMMTSLMFSLLREMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQIGIPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPHLDPRAKVKIEEAIGKMDASVKEAAYHAWLGYYNSVREIGRDKTTLVELANQFSESIGLQKPPSLFRRTALKMGLKDIPGIRIRK*   MGGGPRTFPGGLNKWQWKRLHEKKARDKENRLLDQEKQLYQARIRSQIRAKLTSSGEQSDFSNEQQPNYSPVSPQDHIRGLADRFMKEGAEDLWNEDDGPVNTPQINQQSGGISESIDLRKLRDTKFNDVPRSYSFQKARNFCTNISDVFAENCRTRNPTFSDSWSRQNKFLMFGWRLVNIENRNVNNLNGFLNYRCYSVDRMNGNKLRKLDFTRNESSQSEDKLRSVGLVVKGERKAKWPRFRPKPEESXXXXXXXXXXXXXXXXXXRSRGSVKMMSSAALGKYDMKTKKRVPLKFVEDEDDLSLHVAAIRKEVKGRSMQKIETEEDEKETILSSKRFDEYDVSPLTVKALTAAGYVQMTKVQEATLSTCLEGKDALVKARTGTGKSAAFLLPAIETVLKASRKKSAQRVPPIDVLILCPTRELASQIAAEANVLLKYHEGIGVQTLVGGTRFKEDQKRLECDPCQIIVATPGRLLDHIENKSGFSTRIMGLKMLILDEADHLLDLGFRKDIEKLVDCLPRRRQSLLFSATVPKEVRRISQLVLKREYDYVDTVGLGLETNPKVKQFYLVAPHEQHFQVVHHLLSSHISEVPDYKVIVFCTTAMMTSLMFSLFHEMKMNVREIHSRKPQLYRTRISDEFKETKRVILITSDVSARGMNYPDVTLVIQVGLPVDREQYIHRLGRTGREGKEGEGILLLAPWEQYFLDDIKDLPMENWPVPRLDPRVKVKMEEAMEKMDTSVKESAYHAWLGYYNSVREVGRDKTTLVELANQFSESIGLDKPPSLFRRTALKMGLKDIPGIRIRK*  775.4   2001    414 94.0    389 25  402 0   0   97.1    0   11NRSK36SG20SCND25LI32KR16VI8HYGD29LV5TS27LFRH53IV1IL49HR4AV3IM3IMGE3AT4AS14IV20QD26    94.0    1.8e-223    94.7
gene.92134.0.0.p1   NisylASAF01033898g0006.1    NisylASAF01033898g0006.1    2.6e-302    1037.7  2682    571 548 2   570 4   548 SRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHPRRRSLGDNDADTDEIEDSQSHVPARSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHALKFADPILGMGEKLVQRMRMRSSRFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTLLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDVFVTNNHGNMAKILAGRRRFFGHKPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATGQISHSTSRLETLSKVTSNDYDIDISENQELDMLLSD   SRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHLPRRRSSGDNDADNEIEDSQSHVPAGSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHSLKFADPILRMGEKLVQRMRMRSSHFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTPLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATVEISHSTSRLETSPKVTSNDYDTDISENPELDMLLSD   QSRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHPRRRSLGDNDADTDEIEDSQSHVPARSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHALKFADPILGMGEKLVQRMRMRSSRFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTLLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDVFVTNNHGNMAKILAGRRRFFGHKPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATGQISHSTSRLETLSKVTSNDYDIDISENQELDMLLSD* MTLSRDLRVAQLPLIFIGKLRQTGGESKLPSFTTVPMAFSRRXXXXXSRRRWLIPAISAAFGFLLIFIFFLSILAPSPNGNRLFHLPRRRSSGDNDADNEIEDSQSHVPAGSGGVSDRDIWSSRNSKFFYGCSNASNEFLKAQDITHPNRYLSIVTSGGLNQQRTGITDAVVAARILNATLVVPKLDKSSYWKDSSGFSDIFDVDWFIKYLAKDVSIVKELPLRRGQIWSPYRMRVPRKCTDRCYINRVLPVLNKKHAVQITKFDYRLANKLDTDLQKLRCRVNYHSLKFADPILRMGEKLVQRMRMRSSHFIALHLRFEPDMLAFSGCYYGGGDKERRELGKIRKKWKTLHDSDPDKARRHGRCPLTPEEVGLMLRSLGYGEDVHIYVASGEIYGGEETLTPLKALFPNFHTKDTLATKDELEPFSAFSSRMAALDFIVCDESDPTIRPNGRKLYRLFLNRNYMTEKEFVYRVGKYQRGFMGEPKEVGPSWGVFHENPSSCICEKVDNATVEISHSTSRLETSPKVTSNDYDTDISENPELDMLLSD    1037.7  2682    570 93.2    531 13  533 3   26  93.5    0   82-L5LS6TND-11RG175AS8GR14RH91LP42V-F-V-T-N-N-H-G-N-M-A-K-I-L-A-G-R-R-R-F-F-G-H-K-66GVQE10LSSP8IT5QP8   93.2    2.6e-302    99.6
gene.71087.0.0.p1   transcript:OIT01688 transcript:OIT01688 3.8e-101    367.9   943 190 639 1   189 451 639 DRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEALQNQQISPRSDVYCLGIIILEILTGKFPSQYLNNQKGGTDIVQWVQSAIVDNRESELIDQEIANATDSSEQMVKLLHVGAACTVSDPDERIDMKEASRRIEEISLI   DRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEAIQNEQISPKSDVYCLGIIILEILTGKFPSQYLNNQKGGTDVVQWVQSAIAENRESELIDQEIANATDSIEQMVKLLHVGAACTVSDPDKRIDMKEASRRIEEISLI   DRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEALQNQQISPRSDVYCLGIIILEILTGKFPSQYLNNQKGGTDIVQWVQSAIVDNRESELIDQEIANATDSSEQMVKLLHVGAACTVSDPDERIDMKEASRRIEEISLI*  MSSNKHGRCSPLPSSSSNLSQKLLFFYISIFFLIFLSNTPFSFSISEDEALIKFKESLKNTTALDSTWHKGSNPCDKNKKWTRVQCEGNAVEGLLLGEAGLSGEIDVDPLIALPGLRVLELANNSFSGTIPEFFLLGALKSIYIDGXXXXXXXPKDFFSKMXXXXXXXXXXXXXXXXXXESLANLKYLMELHLESXXXXXXXXSFSQASLASIDLSNNKLQGEIPQSMSKFGSDSFKGNNELCGKQLGKECNKEKENNTFQKAPMSKLKWIILGLVVGLLLITILFKAKRKEDHFDKLGKENLDEGLHVSSSNRKSMSIRSEGGDSVHGSSRRGAGSQRGKAMGDLVLVNEEKGTFGLPDLMKAAAEVLGNGVLGSAYKAKMVNGLSVVVKRLREMNKMNRDVFDTEIRKISKLRHRNILQLLAYHYRKEEKLLVSEYVPKGSLLYLLHGDRGISHAELNWPTRLKIIQGVASGMSFLHSEFASYVVPHGNLKSSNILLTEKYEPLLSDYAFYPLINNTQTVQCLFAYKSPEAIQNEQISPKSDVYCLGIIILEILTGKFPSQYLNNQKGGTDVVQWVQSAIAENRESELIDQEIANATDSIEQMVKLLHVGAACTVSDPDKRIDMKEASRRIEEISLI 367.9   943 189 95.8    181 8   187 0   0   98.9    0   83LI2QE4RK31IV8VADE17SI19EK17   95.8    3.8e-101    99.5
gene.69001.1.1.p1   NisylKD955766g0010.1    NisylKD955766g0010.1    1.4e-294    1011.9  2615    531 530 1   530 1   530 MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRAIQKGSKDNITVIVVDLKAQRKFKSKT  MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  MKEMCLAVAPLPFRLGNNLIFHNPLSIGSSSHMDVTRLNSMGGTTTSLYAESAEKDLSDTVSSSRSEGVPLLHMISENESNNWISGDAVVRESEDDEILSLDGDQMSCSLSVVSDSSSLCGDDFIGFEVASEIFGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKIEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGHRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQEQWKKAFTNCFLMVDDEVGGTGNHEAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPTALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRAIQKGSKDNITVIVVDLKAQRKFKSKT* MKEMCLAVAPLPFRLGNNLIFRNPPSIGSSSHMDATRLNSMGDTTTSLYAESAEKDLSDTVSSSRSEGVPLLPMISENDRNNWIAGDAVVRESEDDEILSLDGDQVSCSLSVVSDSSSLCGDDFIGFEVASDIYGQNFVDAEKSICSVELIAKPGDLVESGVEDDNVSKPFAVKLEEQITDGSSSKSSQVVVQLPLNKGLSAAVSRSVFEVDYIPLWGFTSVCGRRPEMEDALATVPRFLRIPLQMLVGDRVPDGVSRCLSHLTAHFFGVYDGHGGSQVANYCRDRVHAVLAEELEKFMANLNDESIRQNCQDQWKKAFTNCFLKVDDEVGGTGNREAVAAETVGSTAVVAIVCSSHIIVANCGDSRAVLCRGKEPMALSVDHKPNREDEYARIEAAGGKVIQWNGHRVFGVLAMSRSIGDRYLKPWIIPDPEVMFIPRTKDDECLILASDGLWDVMSNEEACELARKRILLWHKKNGVTLTLERGQGIDPAAQAAAECLSNRATQKGSKDNITVIVVDLKAQRKFKSKT  1011.9  2615    530 96.6    512 18  519 0   0   97.9    0   21HR2LP9VA7GD29HP5EDSR4SA20MV25ED1FY40IL74HD62ED11MK10HR40TM127IT25 96.6    1.4e-294    99.8
gene.96656.0.5.p2   NisylKD954897g0030.1    NisylKD954897g0030.1    7.7e-75 280.0   715 140 968 1   139 371 509 MRCKYLTLNTPMKKWELHGIAILLQSCPWVEMLHINTESAFEVYHFGLHYKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYMLSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSS MRCKYLTLNTPMKKWELPGIAILLQSCPQVEMLHINTESAFEEYHFGSHFKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYILSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSN MRCKYLTLNTPMKKWELHGIAILLQSCPWVEMLHINTESAFEVYHFGLHYKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYMLSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSS*    MSESEGEHEENLDYDSPRYSPYSXXXXXXXXXXXXXXXXXXSDQSYYGGKCHKTEKTDRISALPDSLILHILSSLDMGEVVRTGVLSKRWHLLWTSQQSLIFSYSGQHVNGIYKFVIFIDNTLLLCRSGMVKKFSVDFIYSKRFVRHVNRWMIFIKNKLVEELDLNLRSRGNLIEIYNLPQIMYFDVRLRHLSLCNCNLVPKEEIYWPALRDLEIGYAELNRDVIKKICSGCRALESLKFRSCYGVDYFDIDSKSVKKLVIHEYGRQNHDDADDDDDELGIYARNVTSLEICGYFHKRILVLEDVKALLDAKLDFYRNTDDYEIEREFRTDQNMLKNLLVSLQHVEKLSIGTWCLQVLTSLEIRNLPCPRMRCKYLTLNTPMKKWELPGIAILLQSCPQVEMLHINTESAFEEYHFGSHFKNSNDFNGENYWISRPCWVLHLKTLRIHGYEWWDGDEYILSFLQVVLKNGMVLQKIIIDFFEINSYEKLTKKLLSFPRSSREAVILFSNRFSSLPDSVLLHILSFLPFDDVVRTTLLCKQWRPLWSFSTSLNFIHRPKDFISLKKFASFVDKSLINLHCNNSSISKLHLDFPFKRCFSSDVTVWVLFAITHKVKELNLILSSDAEDLYKLPKRLFSNPFIEKVNWVGCKFDKVEVFRWDSLRELRIGSIEFCDDMVRKVVFGSPCLELLELDNCWGFKRLDLVGGKVSKLVVNGYNGEAVKKNSMLLDFEVVEIEAPCVKVLELKGCFRRMNNIQLKNVMSCVSVKLDFQFTKDEERVNYVDMLMGMIGSLRHVKDVMLGTWCIEVMSSWPMNILPFSMSSYECLTLHTPIQERYLPGIVRILQSSSNLRTLIIHMAPPYFEFEACFIPIVYDVYSVGGRCQLSMLSKNCGLHLKKIRICCFEGMRSGQEVLFLRDLLLVCANLEEMVIEWRSGHQNSSIRDASDEFVAESLLMVQKRSRNAVILFNN    280.0   715 139 95.0    132 7   135 0   0   97.1    0   17HP10WQ13VE4LS1YF38MI49SN  95.0    7.7e-75 99.3
gene.86248.0.0.p1   Nitab4.5_0000420g0110.1 Nitab4.5_0000420g0110.1 Protein of unknown function DUF538  8.2e-74 276.9   707 175 140 35  174 1   140 MTSQVTENHRENAEVFTNPAICKQKSLELLEQTNMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFVEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE    MSSQVSENHRENAEVFTDPAICKQKSLELLEQINMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFIEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE    THFLYFPFPLSHTEPQTKRNLNPISFPFSFAFTKMTSQVTENHRENAEVFTNPAICKQKSLELLEQTNMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFVEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE* MSSQVSENHRENAEVFTDPAICKQKSLELLEQINMPKGLLPLDDLIEVGRNHQTGFVWLKQKKAKEHRFKKIGKLVWYDTEVTAFIEDRRMKKLTGVKSKEILIWVTISDISIQDPEFQKITFATPTGISKAFPVSAFEE    276.9   707 140 96.4    135 5   139 0   0   99.3    0   1TS3TS11ND14TI52VI54    96.4    8.2e-74 80.0

您能否尝试仅使用显示的示例进行以下、编写和测试。 同样根据 OP 的评论,忽略不是从gene开始的行。

awk '
/^chr/ { print; next }
match($0,/^gene\.[0-9]+/){
  val=substr($0,RSTART,RLENGTH)
  arr[val]=(arr[val]>$30?$30:arr[val])
  valArr[val]=$0
}
END{
  for(i in arr){
    print valArr[i]
  }
}
' Input_file


编辑:根据 OP 的评论,以防最小值有多行,然后尝试以下操作。

awk '
/^chr/ { print; next }
match($0,/^gene\.[0-9]+/){
  val=substr($0,RSTART,RLENGTH)
  arr[val]=(arr[val]>$30?$30:arr[val])
  valArr[val]=(valArr[val]?valArr[val] ORS:"")$0
}
END{
  for(i in arr){
    print valArr[i]
  }
}
' Input_file

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM