Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_AP021884 | Sulfuriferula plumbiphila strain Gro7 | 4 crisprs | csa3,cas3,cas5,cas6e,cas2,DEDDh,DinG,WYL,cas8c,cas7,cas4,cas1 | 0 | 4 | 7 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_AP021884_1 | 1148298-1148383 | Unclear |
I-E
Consensus repeat of NZ_AP021884_1
|
1 spacers
spacers of NZ_AP021884_1
>1.1|1148323|36|NZ_AP021884|CRISPRCasFinder ACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAG |
cas2,cas6e,cas5 |
CRISPR arrays and Neighbor proteins around NZ_AP021884_1
The CRISPR arrays of NZ_AP021884_1 >merge|NZ_AP021884|1|1148298-1148383|CRISPRCasFinder GTGTTCCCCGCACCCGCGGGGATGAACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAGGTGTTCCCCGCACCCGCGGGATGAG >NZ_AP021884|1|1|1148298-1148383|CRISPRCasFinder GTGTTCCCCGCACCCGCGGGGATGA ACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAG GTGTTCCCCGCACCCGCGGGATGAG
>NZ_AP021884.1|WP_147070477.1|1147906_1148203_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MLVIVLENVPPRLRGRLAIWLLEIRAGVYVGNYSDKVRDHIWHQVEVGIGEGNAVMAWRTSSEAGFDFVTLGKNRRIPVELDGAKLVSFLPQTDTDAL >NZ_AP021884.1|WP_147070479.1|1146692_1146992_+|type-II-toxin-antitoxin-system-RelE/ParE-family-toxin MRYQVRFASAAADDLQRLFDFLAEQDLAAAERARAVISQAIEVLQIFPFSCRKASPENPFLRELVISFGSYGYVALFEVEDAESVTVLAVRHQREDDYH >NZ_AP021884.1|WP_147070480.1|1146414_1146696_+|prevent-host-death-protein MKNATLPPLRVESELRAAAESVLQEGETLSGFVLEAVRLNIARREAQREFITRGLVAREEAKLSGHYVSSDEMLKRLDASLAKARAKQAVGNR >NZ_AP021884.1|WP_147070482.1|1145722_1146346_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MFLSRVEIPWDAARNPYNLHRQLWHLFPGEDRESRSSDDETRQGFLFRIEENATGRPARLLVQSRRAPTRANGLLLVGTREITPCPSAGQRLAFVLTANPVKTIVDAQRDAKPGKQSEKCRVPFIKEEEQRQWLLRKLGEAGEVEAVSVLPHAPVYFHKGSRAGKLVTATFEGVLRVRDPDRLAALLANGIGPAKAFGCGLLLVRRI >NZ_AP021884.1|WP_147070484.1|1145195_1145732_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MFSREWPLLAESDIKPKTGKLGWTNSPAFSSCTRSWTARRAPITRTDLKARLECSSASLTSAHRRGAYLFDAAFTVAVGSKPGASVTLTQLAAALRQPLYTPSLGRRSCPLARPLLEGELEAEDALAALAKTAPVDGLVYSETQQSDQPLRLRDVPLHGHKRQFGTRLVYLHKDPTCS >NZ_AP021884.1|WP_147070486.1|1142295_1145139_-|aconitate-hydratase-AcnA MSTAHNLFNTLSEFTLGNGTPGRFYSLSALEAVGIGKISRLPVSIRIVLEAVLRNCDGRKITEQHIRELANWQPNGPRTEEIPFVVARILLQDFTGVPLLADLAAMRSAAAQAGKNPKVIEPLVPVDMVVDHSVQVDVFNQPDALQKNMELEFIRNRERYQFLKWGMQAFDTFKVVPPGIGIVHQVNLEYLARGVMEKDGVHYPDTLVGTDSHTTMINGLGIVAWGVGGIEAEAGMLGQPVYFLTPDVVGVHLKGQIREGVTATDVVLTVTEMLRKAKVVGKFVEFFGAGAAALSLPDRATIANMAPEYGATMGFFPVDEASCAYYAATGRSAEQVDTIRNYFMAQGLFGIPQAGDCDYSQELEIDLGSVVPSVAGPRRPQDRIELGHVKQAFAGLFAKPVAEGGYGKAAATLAQRVALAPAPAGTDIAGGGVQNSDTLPAGGTDPAVVIEREMVDNRPTPDHLASNAVYTAAQSGTLGHGDVVIAAITSCTNTSNPGVMLAAGLLAKKALEKGLTVPAHVKTSLGPGSRVVTEYLKAAGLLDALGEMGFKLVGYGCTTCIGNSGPLPAAIESAITGNDLIAASVLSGNRNFEARVHQNVKANFLMSPPLVVAYAIAGSMNTDLASEPLGTGRDGAPVYLKDIWPSLDEVAAVMATATNPDTYRKLYADFSADNPLWAAVPAPAGAVYDWDGASTYIRQPPFFDGAAGDSGVIRGARALAVFGDSVTTDHISPAGSIKPASPAGKFLLEHGVDRADFNSYGARRGNHEVMMRGTFANVRIRNLMLPGSEGGVTRHQPDGAEMAIYDAAMQYQAAGTPLMIFAGEEYGTGSSRDWAAKGTRLLGVKAVVAKSFERIHRANLVGMGVLPCQFRDGMGADSLKLDGSETFDLLGLEHGITPQQDITLVIHRADGSADAVAVKLRIDTPIEVDYYQSGGILPFVLAQLLAD >NZ_AP021884.1|WP_147070488.1|1141914_1142286_-|DUF202-domain-containing-protein MSDLNDPRVFFAAERTLLAWNRTCLTLMAFGFVVERFGLFLHMLAPQTPQHLERGISFWVGLGFILLGSLMAVLAVIQYRRVLRTLKPVEIPEGYWVNMAALSTLLLAVLGIVLSAYLTMGLK >NZ_AP021884.1|WP_147070490.1|1139808_1141854_-|methionine--tRNA-ligase MTRKILVTSALPYANGAIHLGHLVEYIQTDIWVRFQKMHGHECYYVCADDTHGTPIMLRAEKEGITPEQLIARVHGEHLRDFTGFHVGFDSYHSTNSGENRELSGTVYLKLREAGLIEQKTIEQYYDPVREMFLPDRFIKGQCPKCGAQDQYGDGCEVCGATYTPTDLINPVSAISGSTPVRRESEHYFFRLGACEAFLREWTRSGALQQEAANKLDEWFAAGLQNWDISRDAPYFGFEIPDAPGKYFYVWLDAPIGYMASFKKLAAEKNLDFDAWWQNDSGAELYHFIGKDILYFHALFWPAMLKNAGYRTPSGVFAHGFLTVNGAKMSKSRGTFITAESYLASGMDPEWLRYYYAAKINGSMEDLDLNLADFIARVNSDLVGKYVNIASRTAGFIARRFDGKLAARLPTSELLAEVQHAATLIGECYETREYGKALREIMRLTDLANQYVNDNKPWELAKQEGSEALLHEVCSVSVNLFRLLTLYLKPVLPRLATEVETFLNIAALAWVDAGTLLTSHSINAYSHLMTRVEQKQVDALVAANQQSLAASADAHSPARHAEAQNHVIAPIADTITADDFARIDLRVAKIVNAEHVEGADKLIRLTLDIGEGKTRNVFAGIKSAYDPEKLIGRMTVMVANLAPRKMKFGVSEGMVLAASGETAGLYILSPDDGAVPGMRVK >NZ_AP021884.1|WP_147070492.1|1138856_1139744_+|LysR-family-transcriptional-regulator MDIEQARTFLHVVAIGNFLGAAEKLHVTQSTVSARIQNLERKLGAKLFSRGKQGAALTAAGQRFVRHAQTLVRTADIAKQDVGLPDGYSGGLTVSGRIALWEGFLSRWVAWMRQAAPAISLRLEIGFEQDIMHGLVQNTLDIGLMYTPEARPGLGLERLFDETLVLVTTDRMRPWPDPGYVHVDWGTEFFHQFSLNFPDHPPPALSANVGWLGIQQLLTSGGSAYFPLRMVRTLLAKKRLHRVPGTPHFSVTAHMVYPLSRNDDFLQQALAGLRLLGREERRGQISMDTDNSPNP >NZ_AP021884.1|WP_147070494.1|1137963_1138608_-|maleylacetoacetate-isomerase MQLFSFFSSSTAFRVRIALALKGADYEYQAVNLRAGEQHQQAFLDRNPSGNVPALVDGDFNLGQSLAILDYLDSRYPEPRLIPADTIQRARVLELVNVIACDIHPVNNLRVQLYLKNILGVTEAQKNAWYRHWVAQGLDVVERLLARQEDTPYCFGTHPTLADCCLAPQVWSAARAGCDIAAYPRIDRIYRHCMAQPAFIQAAPEQQADAPQGG >NZ_AP021884.1|WP_147070475.1|1148555_1149455_+|enoyl-CoA-hydratase/isomerase-family-protein MNAVVEFQQFSNASLEQVRIRFDEEYGVMWSFMRPEPRPCFTRTTLQDLLQHHTYLESMKGRVVSNGNFQQTNYLILASDLQGVFNLGGDLAAFGEAIRAQTRKELLSYAKLCIDNVWTFYNLQAPITTISMVQGQAMGGGFEAALSAHVMIAEKSALMGLPEVLFNLFPGMGALSFLSRKIGMRAAEAMVRSGRVYTATELHEMGVVDVLAEDGQGEKTLYDWIRKNHRSLNSFQAIQRARQRVNPLTVEELYEITEIWVDAALRLSERDLRIMERLVRAQNRKVTEPEPVVAEQASA >NZ_AP021884.1|WP_147070472.1|1149459_1151595_-|response-regulator MLDKMKIWFTDRVAKCDRVELEQSLIRLGIGLAILVYLLYRYLTHTTLSHNDIVAFSILSVFLFLTLVLIGSILYSSKPSVVRRLAGAWVDQGGTTLFMAFTGEVGVMVVGVYLWVIFGNGFRFGRKYLIHAQVLSIVGFAITTQVNPYWDEHEAISYSVMLMLLALPIYVSALIRRMNEARQKAEEANAAKTRFVANMSHEIRTPLSGIIGISTLLKATPLNSEQQDLLGTLNSSSRLLVSLLNNVLDFAKIEDGKLAIEHTDFSVNSLLEETVKIFRSQAEAKSIRLDTHIAAAAGTLRGDPHRLQQVLANLVGNAVKFTERGSVTLSLSILGENEHHRNMRFEVADTGVGIPTSAQGKIFESFTQADISTTRRFGGSGLGLTITRHLVEAMGGRLSFESAEGLGSRFWFDLPLEKAVQAQPGSAEIVPLPATRDAGLENTLRILVCEDDATNQKILLRLLELAGHHVSLSANGEELLDQLEQSSFDLVIADLNMAGLSGTDALKLYRFTRADDTRTRFILFTADATLSARQAAKEAGFDAFLSKPVDASTLFGTIANLLGMPSASAEHWLNTVMGGSRSSPPASAETRAVLDAATLRELEILGAGDALFVQRLLRNYLRDSGELLDRIEHAVQQKQYGALRDHCHALKGNSLSIGARGVFGRAETIDRAGPGELRFRGSAMVGLLRTDYAAARAAIEDYLSRRQTAAR >NZ_AP021884.1|WP_147070471.1|1151614_1152046_-|response-regulator MSVRDIRSAPTYRQTVLIIDDQPMVLAIHTAVLKSLSMDLRIVSMTDPKAALEWLRQKPADLIVTDYRMHQMDGIHFVNAVRDSSIEPMRPIIVVTALKDEKIHQQLLAAGVSACLIKPARAAQLSKIARTLLEQSRRQYTTQ >NZ_AP021884.1|WP_147070469.1|1152139_1153207_-|response-regulator MTNFNLPDTSAVLILDDQATSRTILAQVVRSIGSGIRVQEETTPSAALAWAAAHPADLVLADYLMPDMNGVEFIGRLRQLPGYQHVPVVMVTIKQDMETRYAALDAGMTDFLTKPVDMRECLSRCRNLLTLRQQQLALEDKSRVLEDMVGQATEEIRCREKDTLMRLARAGEYRDTDTARHLLRMSRYSRVLADAIGLPEDEAELIELAAPLHDIGKIGIPDSILRKNGPLSDEELAIMRQHPKIGHDILEDSPSKYLRLGGEIALAHHERYDGSGYPFGTTGQDIPLSARIVAIADVFDALTSVRPYKSAWSIKSAMQYLLKESGRHFDPALVKAMLTLEASVEKIQEEHAEPG >NZ_AP021884.1|WP_147070467.1|1153469_1154669_+|malate-dehydrogenase MPTLKQQALDYHQFPKPGKLSVESSKPCATQHELSLAYSPGVAEPVRAIGADPELAYRYTNKGNLVAVITDGTAILGLGNLGPLAAKPVMEGKGVLFKRFANIDVFDIEVNAPSVQAFIDTVVNIAPTFGGINLEDIAAPHCFEIEKALSERLDIPVFHDDQHGTAVIICAGLINALHVQGKKLADARIVCLGAGAAGNASLRLLLAMGADKSRLLVVDKVGVLHTGMIDLPPHHAFFAADTDARTLADAMQGADAFIGVSAANLVTPAMIKSMADKPVVFALANPDPEIAPHDVHAARDDAIIATGRSDYPNQVNNILGFPFIFRGALDARAKRITQKMLIAAVHALMDLAREPVPADVLAIYNLTELAFGRDYILPKPFDARLIERIPPAVMKAAKE >NZ_AP021884.1|WP_147070465.1|1154672_1155050_+|succinate-dehydrogenase,-cytochrome-b556-subunit MRHPSRPVYLNIFKIHLPLPGWMSILQRMSGAVLFLVTPLLLYLLQTSFDADGYARLREWLHIPVVKALSTLLLWGYLLHLLGGLRFLLLDIHVGTALATARKLSAATLLASALLTLVIAGIGLW >NZ_AP021884.1|WP_147070463.1|1155043_1155373_+|succinate-dehydrogenase,-hydrophobic-membrane-anchor-protein MVGGALSAWLVQRVSALLLAAYALFFPVWVALHWPLDFAVWRGLFAPLPMRIVTLLFVVALALHAWVGMRDIFMDYVQPLGLRLALHVGALLWLATCVVWAGAVLWSLP >NZ_AP021884.1|WP_147070461.1|1155369_1157133_+|succinate-dehydrogenase-flavoprotein-subunit MMPVKRKFDAVIVGGGGAGLRAALQLSGSGLQVAVVSKVFPTRSHTVSAQGGITAALGNVTPDNWHWHMYDTVKGSDYLGDQDAIEFMCRHAAEAVIELEHMGLPFSRLDNGRIYQRAFGGQSMNYGGEQATRTCAAADRTGHALLHTLHQQNLKAHTHFFDEYFALDLLRDADGYVLGVTALCIETGAPLVIEARATLLATGGAGRIFRYSTNAHINTGDGLGMVLRAGLALQDMEFWQFHPTGLPGSGSLITEGVRGEGGYLVNNQGERFMERYAPHAKDLAGRDVVARALALEIHAGRGCGPHGDTIHLKLDHLGAALIKDKLPGIRELALRFAGVDPIDAPIPVVPTAHYMMGGIPTDLHGQVVMPARFGPEEPVPGLYAVGECACVSVHGANRLGGNSLLDLVVFGRAAGNHIIETLRDNPFPRLLPESAAEAALARLARWNKTGAGESVAELRLALQTLMQKHCGVFRTETLMGEGIAALDILQARLDNARLADHSQVFNTARIEALELENLFAVARATLVSAHARTESRGAHAREDYPERDDGHWLKHTLYTRENDQIDTKPVRLKPLTVEPFLPKERIY >NZ_AP021884.1|WP_147070460.1|1157132_1157831_+|succinate-dehydrogenase-iron-sulfur-subunit MRFSIYRYDPEHDTKPHMQAYDVDIEPAGNMLLDALLRIKDTLDSTLTLRRSCREGVCGSDGMNINGSNGLACITPLADLRQPVEVRPLPGLPVIRDLVVDMTPFNQQYRSVEPWLNNADPAPEIERLQSPEQRAQLDGLVECIQCGCCSSACPSFWWNPDKFVGPAGLLAAYRFIADSRDQGANQRLDNLQDPYRLFRCHGIMNCVSVCPKGLNPTAAIGKIKTLLVKRGA >NZ_AP021884.1|WP_161984192.1|1157869_1158085_+|succinate-dehydrogenase-assembly-factor-2 MLELDILLLDFLEQQYPVLPSSQQIAFGALLELGDSELWDMIQTGQSAAQPEQAKIIEWLRTGKQKNESTD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_AP021884_2 | 1831685-1831793 | Orphan |
NA
Consensus repeat of NZ_AP021884_2
|
1 spacers
spacers of NZ_AP021884_2
>2.1|1831723|33|NZ_AP021884|CRISPRCasFinder TTGCGTTGGATACCTCATCCTCATCATTGCGCT |
CRISPR arrays and Neighbor proteins around NZ_AP021884_2
The CRISPR arrays of NZ_AP021884_2 >merge|NZ_AP021884|2|1831685-1831793|CRISPRCasFinder GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGACTTGCGTTGGATACCTCATCCTCATCATTGCGCTGGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC >NZ_AP021884|2|2|1831685-1831793|CRISPRCasFinder GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC TTGCGTTGGATACCTCATCCTCATCATTGCGCT GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC
>NZ_AP021884.1|WP_147074724.1|1830036_1830279_-|HypC/HybG/HupF-family-hydrogenase-formation-chaperone MCLALPARIVEMRKQDIGIVDLGGVRKEVSLALVDDLQVDDYVIVHVGYALSKLDPEEAERTLRIFAEMESMPGNVGVGA >NZ_AP021884.1|WP_147074723.1|1828900_1830040_-|hydrogenase-formation-protein-HypD MKYVDEFRDGELANGLASTIARAADTGRNYSFMEFCGGHTHAISRYGVTDLLPANIQMIHGPGCPVCVLPIGRIDLAIGLALDQGVILCTYGDTLRVPASDGLSLMKAKARGGDVRMIYSTADVLAIARDNPDRDVVFLAIGFETTTPPTALLIEQAKNEGIGNLSVLCNHVLTPSAITHILESPEVREYGTLPLDGFIGPSHVSTVIGTQPYEHFAREYRKPVVISGFEPLDVMQGILMLVRQVNEGRAEVENEFFRAVTRGGNRKAQTLVAKIFELRRTFEWRGLGEVPYSALQIRSEYAAFDAEQRYGLRYAPVADNKACECGAILRGVKKPTDCKIFGTVCTPETPMGSCMVSSEGACAAHYTYGRFKDVEIVAA >NZ_AP021884.1|WP_147074722.1|1827848_1828904_-|hydrogenase-expression/formation-protein-HypE MSTVKPGYTRPLDVRNGRIDLSHGGGGRAMAQLIEELFAAAFDNEYLAQGNDGAVLAMPSAGGRLVMATDAHVVSPLFFPGGDIGCLSVHGTVNDVAVMGARPLWLAASFVLEEGFPLSDLKRIVESMANAAKSAGVSVVTGDTKVVERGKGDGVFITTTGVGVLPKGLDLSGNKATPGDVILLSGTIGDHGMAIMSKRENLAFDAPIESDTAALHGLVADMLASGSGIRVLRDPTRGGLATTLNEIAKQSGVGMQLDESSIPVRPVVDAACEFLGLDPLYIANEGKLVAICAPEDAGGLLAVMRAHPLGRESAIIGTVHADPHHFVQMKTRFGGRRNVDWLSGEQLPRIC >NZ_AP021884.1|WP_147074721.1|1826161_1827847_-|hydrogenase-maturation-protein MRILFLTHSFNSLTQRLYVALTELGHEVSVEFDIADSVTEEAVALYRPDIILAPFLKRAIPASVWRHHTCLVVHPGIVGDRGPSALDWAVQNAETEWGVTVLQANAVMDGGDIWANEIFPMRLAKKSSLYRNEVTEAATRAVTTAIERYAQRDFVPCPLEKWSNVAGQERPVMWQEDRRINWLRDDTQTILRRIHAADGFPGVRDSLFDHACFLFDAHAAPDYSGAPGTILGWQGTSLVRATVDGAIRIGHVRRPESAHPFKLPALVAFAAEQASIPVLCEADGESIRYEEQDGVGYLYFDFYNGAMSTAQCRELLAAYRQACSRPTRVIVLMGGDDFWSNGIHLNLIEASEHPAEESWENIQAMDDLAEAIITATSHITVSALANNAGAGGAFLALAADHVWARPSVLLNLHYKNMGNLYGSEFWTYTLPRRVGLEKTSRIVENRLPMSARQAARLGIVDACFGTDAAMFRREVKQRATAISRSPDYDVLRKTKTEARDRDESEKPLLRYRESELSEMHRNFFGFDPSYHYARRYFVHKTLPAWTPRHLCKHRGMVQGNH >NZ_AP021884.1|WP_147074720.1|1824193_1825636_-|sigma-54-interacting-transcriptional-regulator MSLPTVLIVDDEIRSLEALRRTLEEDFTVFTASNVDAALEILRQEFIQIIVCDQRMPVQSGVTFLKHVRADWPDVVRIMLSGYTDTEDIIAGINEAGIFQYLLKPWQPEQLMLVLRSAADVYRLQLENQRLSLELRDSPALLAERVANKRQHVREKFSLDRVARAPDSPLNATCEMIDRIAPYDISVLITGESGTGKELLAHALHYRSGRAAQAFVTQNCGALPDALLEAELFGYKRGAFTGAYSDRVGLFQQADGGTIFLDEIGETTPSFQVKLLRVLQEGEIRPLGSPRSVQVNVRVIAATNRDLEEEVRAGRLRQDLYYRIANLTMHLPPLRERPMDIPLIAEGLLQRAMRQLNRKVRGFTPETLDCFKAYRWPGNVRELQNEILRILALTDSEWLEARLLSPKVLRAAMEESEEQQLDLLAGLDGSLKDRMEQLEARLIRETLIRHRWNKTHAAQELGLSRVGLRSKLVRYGMDKT >NZ_AP021884.1|WP_147074751.1|1823172_1824168_-|HupU-protein MNLIWLQSGGCGGCTMSLLSADVRDLFGMLKDAGINIVWHPGLSEQTGSEAIEVLEACASGDLPLDILCVEGSLLRGPNGTGRFHVLSGTGKPMIEWVRQLAEKAQYTIAVGTCATYGGVTAGGCNPTDACGMQYDGASRGGLLGVDYLSQSGLPVINIAGCPTHPGWVLETLLALAMDSFTQADLDELGRPRFYADHLVHHGCARNEYYEFKASAEKPSDQGCMMENMGCKGTQAHADCNIRPWNGGGSCTDGGYACIGCTEPGFEEPGHPFTQTPKIAGIPIGLPTDMPKAWFVALATLSKSATPKRVKENATSDHLVIVPGIRKTGVK >NZ_AP021884.1|WP_147074719.1|1821718_1823176_-|nickel-dependent-hydrogenase-large-subunit MSRLVVGPFNRVEGDLEVTLDISGGRVDRAYVDSTLYRGFEQILRGKDPMDALVFVPRICGICSVTQSVAAANALRNAMGISIPRNGQLATNLILANENLTDHFTHFYLFFMPDFARDGYSGRPWHGMAEQRFKAVTGSAAGDALPARAAFLHMMGVLAGKWPHTLTLQPGGSSRAVSSTEKIRLYAMLREFRAYLEKIMFGDKLENIVKLDSMRALEAWRDARPPDASDFRLFLEVARDLELHRIGRATDIFLSYGSYEMAGEYLFSPGVWDASKGTLSAIDPSDIVEDLSHSRMTGERDARHPYQGETQPAPDKPDAYTWCKAPRWRGQVLECGALARQVVTGHPLIRDMVEKTGGNVTTRVVARLLEISRVIPAMESWIKSLSPGEPFCVQGRMPDNAKGVGMVEAARGALGHWLVVKEGKIANYQIVAPTTWNFSPRDRDGIPGALEQALVGVPVGEHERVPLAVQHVVRSFDPCMVCTVH >NZ_AP021884.1|WP_147074718.1|1821238_1821706_-|nickel-responsive-transcriptional-regulator-NikR MERFTISLDEDLAQEFDRLILARGYSNRSEAVRDMLRAELEKSRQVRYEGTHCIAALSYVYNHHERELAERLTALQHDHHDLTVSTLHAHLDHDNCIECVVLRGKTAEVRDFAGKLIAERGVRHGNLSVITVSQEQHKHRHGLFARSHIHYKPHN >NZ_AP021884.1|WP_147074717.1|1819887_1821237_-|PAS-domain-containing-protein MFSKTGLLSLPDMPIEGVGEQFWMEVIRKMDEVYSDLLKYQTALEEQNNKLEESQQFIFGVLAAMSDILVVCDQTGTIEDVNQSLIELTGKTSAEWRGHPLVELFADDISRKQAELKFNGLQGQAIHDCEMQIRMANGSSMPVSVNCTARFNKKGKSVGMVITGRPVGELRRAYHALQEAHEALKRTQQQLVHSEKMASLGRLVAGVAHELNNPISFVLGNVHVLERYAGRLKEYLDAVHAGRSGIELAELREKLKIDRILGDIRPLIEGTIEGAERTRDIVDGLKRFSAIDREEECEFNLVEIIQRAVHWVTNITSESFQVEMDLPHFIPVLGSAAQIQQVIMNLVQNAVDATAEVKSPRLRIQAKIEKDKAVVEFRDNGSGILPENFPKIFDPFFTTKPVGKGTGLGLAISYGIVERHNGALFAANDAHDGGTIFVLNLPLYQSAKN >NZ_AP021884.1|WP_147074716.1|1818851_1819808_+|HTH-type-transcriptional-regulator-CysB MKIQQLRYLHEVARQGLNVSLAAEKLHTSQPGVSKQIQLLEEELGVDILVRHGKRVTGITEPGQKILAITERILREAENLKRVGADFTNETHGSLSIATTHTQARYALPSVIKTFSERYPGVQLRLHQGNPAQIVEMVLSGEADIAIATEAIALHDELVTLPCYQWNRCVIVQPDHPLLGEPTLTLERIADYSIITYDFAFAGRSQINKAFMERNLSPNVVLTAIDADVIKTYVGIGLGIGIMASMAFDPGRDQNLRAIDASHLFEPSTTRIGIRQGTYLRGYTFEFIQMFAPHLNHEAVNMAISAACRSAHQEAPKI >NZ_AP021884.1|WP_147074726.1|1832692_1833727_-|hydrogenase-nickel-incorporation-protein-HypB MCTTCGCSAGETRIEGQAMDGHSHVHADGTVHDHRHEAPAADGKMQYHAHHDENAHGHRHADGTWHSHDHGHEGEHVHEHGEDVIDYGQGPAHAHAPGLTQSQMVRIEQDILGKNNAYAGRNRNYFDEHGIFALNLVSSPGSGKTTLLVRTIETLKSRIQVAVVEGDQQTSQDAERIRSTGVRALQINTGKGCHLDANMVGHALERLHPEDDSVLMIENVGNLVCPAAYDLGEAHKVVILSVTEGEDKPLKYPDMFRAASLMLLNKTDLLPYVPFNVQLAIEYAKQVNPGLHIIQTSSTNGDGYEAWLGWIETGLARQRKKRAQTVAVLQKRIQELEAHLAARG >NZ_AP021884.1|WP_147074727.1|1833787_1834129_-|hydrogenase-maturation-nickel-metallochaperone-HypA MHEMSLAEGVLQILEDTATHHGFQQIKRVRLEIGELACVEVESLRFCLDVVVRGSVAENTMLDIVQTPGGGWCMNCSDTVPISALFSACPRCGSYQVQPTHGTEMRVLELEGV >NZ_AP021884.1|WP_147074728.1|1834121_1835225_-|nickel-dependent-hydrogenase-large-subunit MSLAGKLTFSVGWDGYRVTSVEVRSSRPQAACLLEGKTVEEAMRLVPLLFGICGKAQTVAARSAAQAAQNLCGDKQLMLRQRRLVALEAAQEHLWRLLVDWPNRLGLPAKQGLMMEWVKRISISRGDDDVLALGEAMLTMIEQDVLDESLDCWAATLERAERTPMRGLAGASLEMLRGLEPLHSGHPVFGHFLPRQAACLWGNELQPYLDGHFAVRPLWRNAPAEAGALALHHQIPLLAELLRTGHAASARYLARLVDWVSCVRLLRGEASSTELRLDACKLGKNAGLACVDTARGLLLHYIEVALGQIVRYVIVAPTEWNFHPAGPFVQTLRSLRADDAASLYQRINILILAFDPCVEYEVNLHHA >NZ_AP021884.1|WP_161984236.1|1835224_1835764_-|[NiFe]-hydrogenase-assembly-chaperone-HybE MKLMNPRPLENPSRMIESVFDGIARHRMAGLPILNPSLHVEAVGFRLWEGLWLGILITPWTINLMLLPADNPDYAALGLGETRRWRFPSGQYDFMGGEEPGLGSYQACSLFSPVFEFASQEDAVATARAALEQLLLEDLEAAVKREKAQWDQARFSDAPLAEQALSRRGFLRGAFLRDP >NZ_AP021884.1|WP_147074730.1|1835770_1835986_-|rubredoxin MDTFEGSYLGHDDRIDESVRLECGICWLVYDPEVGDPYWHIPPGTPFSRLPEHWTCPNCDAPRHKFMVLKD >NZ_AP021884.1|WP_161984237.1|1836001_1836784_-|hydrogenase-expression/formation-protein MNMPKGMAVFNPPSVPDDVAPELRDQAANLIRQLLAQMRAYRFGATSYPKIDLLKYDPRVVPLINDILGQGEVSIIAHQPTALRAQETVFASVWRVCYPGADGVLERDYLEVCPIPAVVAEIALAPTLKQISPPPPPAGAMNSPALLHEILDVVSTYQAGNPAHIINLTLLPLTPDDLAYLVQALGPGSVSILSRGYGNCRITSSGLANVWWVQYFNSSDQLILNTIEVVEVPEVALAAEEDFSDSIERVEEWLGTMLAA >NZ_AP021884.1|WP_147074732.1|1836869_1837361_-|hydrogenase MSEGVLDYALVAQEKVATEQNALGMLITRLCEQHQFVLVDEGNLEALTQASGDMVLLLTEDVVRSPETWDVAIVLPEILKLFGGRLKAAIADTENSKKLQARFGTTRFPAMVFLRDGEYVDVIQRMLDWDEFVAEVTGVLEKPIGRAPTIGIPVRNEVASSCH >NZ_AP021884.1|WP_147074733.1|1837357_1837666_-|HypC/HybG/HupF-family-hydrogenase-formation-chaperone MCLGIPMQVIEAEESYAVCRGRDGNLARIDTMLVGSVQSGQWLMTFLGGAREILNEQQAEQVNSALNALAAVSRGASDVDVHFADLVGREPQLPDFLRKGGQ >NZ_AP021884.1|WP_147074734.1|1837670_1838294_-|HyaD/HybD-family-hydrogenase-maturation-endopeptidase MVEGSQFDTLILGIGNVLWADEGFGVRCVEAMNATYAFPDNVRVMDGGTQGLYLLPYVEAARRLVIFDAVDYGLEPGTLKLVENAEVPKFMGAKKMSLHQTGFQEVLACADLVDHLPEEMVLIGVQPEELEDYGGSLRPRIKARIPEVLEIAVERLVGWGIPVVARGTGETMRTESGILDIQRYEMERPTEEQACRLGDIRFLATGV >NZ_AP021884.1|WP_147074735.1|1838453_1838801_-|HigA-family-addiction-module-antidote-protein MVKTFLPSGLGFGAGALDPRFFSFSLRAPIAPGRFLESRFLHPLGLSQDRLARELGISRRRVNELIRGKRAITPDTAIRLGLFFGTGPVLWLTLQQAWDIHQEWRNFRRRSKAHG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_AP021884_3 | 2644962-2646084 | TypeI |
I-C
Consensus repeat of NZ_AP021884_3
|
15 spacers
spacers of NZ_AP021884_3
>3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG >3.2|2645072|37|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG >3.3|2645146|37|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG >3.4|2645220|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG >3.5|2645293|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG >3.6|2645366|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG >3.7|2645438|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT >3.8|2645510|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG >3.9|2645581|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG >3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC >3.11|2645725|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC >3.12|2645798|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA >3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA >3.14|2645942|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG >3.15|2646013|35|NZ_AP021884|CRISPRCasFinder,CRT GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC |
cas2,cas1,cas4,cas7,cas8c,cas5 |
CRISPR arrays and Neighbor proteins around NZ_AP021884_3
The CRISPR arrays of NZ_AP021884_3 >merge|NZ_AP021884|3|2644962-2646084|PILER-CR,CRISPRCasFinder,CRT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACTCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACCGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACATACCGGTAGCGTCGGCAATACCCTGACCGCAGCGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACACGACGCAGGTTATAGAGCGTTGCACGGCAAAATTGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACCTAACCTGCCTTACACAGCCAGCCGCTACGATGAGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGCGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACTCGGTCATGGGTGCCAGTTACACTATCCCGATGGACGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGAGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAACAATTTCTTGCTGGATAAAATCAAGCCGCTTAGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAGCATGATCCAAATGGATGCCTTGCGGTAGCTTGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGCCATGGGTAGCACCGATTAGCACCTTGCCAAAGCGCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC >NZ_AP021884|3|1|2644962-2646012|PILER-CR GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC >NZ_AP021884|3|3|2644962-2646084|CRISPRCasFinder GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC >NZ_AP021884|3|1|2644962-2646084|CRT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC
>NZ_AP021884.1|WP_147072286.1|2644492_2644783_+|CRISPR-associated-endonuclease-Cas2 MLIIVTYDVSTETAAGRKRLRRVAKACEKMGQRVQKSVFECTVNEMQFEQLERTLLAEIDETQDNLRFYRITEPVEVRVKQHGCFRSVDFEGPLIA >NZ_AP021884.1|WP_147072284.1|2643449_2644484_+|type-I-C-CRISPR-associated-endonuclease-Cas1 MHTIQNTLYVMTPHAYAHLENATLRIDVEREKKLQVPLHHLGGVVCFGNVMVSPALMHRLADEGKSLVLLDDSGRFKARLEGPVSGNILLRQAHHSKASEPAFALGVARAVVAGKLKNSRTNLQRGAREAADPDEAATLTRSADNLAASLRAAAVANTMDELRGVEGEAARGYFAALNLIVKPLARPSFALNGRSRRPPLDRFNALLSFLYAMLMNDCRSAVEAAGLDAQLGFLHAVRPGRAALALDLQEEFRSILADRLALTLINRGQINAADFDEREGGAVMLGDKGRRTVVTAWQERKQEEITHPLTENKIPIGLLPFIQARFIARTIRGEMEGYLPYQAK >NZ_AP021884.1|WP_147072282.1|2643084_2643402_-|ribbon-helix-helix-protein,-CopG-family MATLTLRLPDNLDRQLTALAAQTHQNRSELARTALEKFLRELEQEQLLAEMVEAARFLATNPEARAESIAIAEEFLPLDNEALDIAEGRKPGDPWPEELGEKWWK >NZ_AP021884.1|WP_147072279.1|2642722_2643097_-|type-II-toxin-antitoxin-system-PemK/MazF-family-toxin MVEIMRRGEIWLARLNPNTGAEAGKVRPVLILLNDALLATGMSPVLCIPLTSKLYKNLAGLRIAIAPRGLLLKPCYAMPEQARALDRNRFGEGSLATLTNAEMAQVEKLFIAACGMAQYLIPQH >NZ_AP021884.1|WP_147072277.1|2642112_2642739_+|CRISPR-associated-protein-Cas4 MANSADEIVALSALQHWIYCPRQCGLIHLEQAFEDNVHTARGQAVHHLVDTPGYEIKSGVRVERALPVWCDRLNLIGKADLVEFHPDDSVYPVEFKHGAKRQKLHDDIQLAAQAICLEEMLNRPVPKGAIFHATSHRRREVSITPELKQLVEETANAIRAMLASGKLPPPVNDARCRECSLKEICQPEALAERGRLERLREELFSAAG >NZ_AP021884.1|WP_147072275.1|2641869_2642112_+|type-II-toxin-antitoxin-system-HicA-family-toxin MRVPRDLSGADLVKRLERMGYCVTRQTGSHMRLTSTVRGEHHITIPNHDPLRLGTLASILASVAAHHGLTRDELIQRLFD >NZ_AP021884.1|WP_124705901.1|2641666_2641873_+|2-oxoisovalerate-dehydrogenase MSEIHFIVEEAPEGGYVARAVGVDIVTEADDLPSLHAQVRDAVHCHFDEGKLPGLIRLHITREEVLTA >NZ_AP021884.1|WP_147072272.1|2640483_2641584_+|type-I-C-CRISPR-associated-protein-Cas7/Csd2 MSIHNRYDFVLLFDVKDGNPNGDPDAGNLPRLDTETGQGLITDVSIKRKIRNFVGITKCKEDGTYETGFDIYIKEKAVLGRAHFAAFEKLGISLGQDATELIPDDLAEQFEALTLPEGMEIDTDEEGRSILNLSGATLDKKEAQKWLKDINPAKPLKNFISKVLKNVTARKPKQEESEKGRVQMCQDFYDIRTFGAVLSLKTAPNCGQVRGPVQITFARSIDPIVTLEHSITRCAVATEAEAEKQGGDNRTMGRKFTVPYGLYRTHGFVSAHLAGQTKFDESDLELLWEALKNMFEHDHSAARGEMATRGLYVFKHESHLGNEAAHKLFDRIKVNKTKDVPRGFEDYEVSVDETEMPSGVALLQKC >NZ_AP021884.1|WP_147072270.1|2638681_2640466_+|type-I-C-CRISPR-associated-protein-Cas8c/Csd1 MILQSLHEYYGRKRDSLPGDGIERKELPFLFVLKPDGAFLHIEDTRQGEGKRKRGNAFLVPQGVKKSVNVAANLLWGNVEYVIGQPDSKKLEEQRKKGKEKHYRERLGDMCSAFRTEIEQLPSEVKSTPEVAAVLAFLSSGNFTHVLADPLWPQVSATGANVSFKLTGAESPVCSASGILASVGQSTEDKGETRICLITGNSDVVERLHPPIKGVWGAQTSGANIVSFNLSAFNSFAREQGSNAPVGKRAAFAYTTALNHLLVSKQRIQIGDASTVFWAAEDNKMESLLSQFFDEPPQDNPDQGTNAVKELLEATLAGTPAIYDDGTRFYVLGLAPNAARIAVRFWHVATVGDLAGHIRQHFEDLEIVRPQYVERPFLSLKALLLAVSPLGDLDKLPPKLAGDFMKAILDGTSYPQTLLQAALRRIHAEQAKKDEKTGKHRDHVPYARAALIKAWLNRQTRNANPDQERKITMSLDESNINSGYRLGRLFAVLEKVQAEANPGLNTTIRDSYFGSASSTPSAVFPTLMRRNQHHMTKLRKEKPGLYVTRDKLIQTICNDGIDGQLGFRPILSLADQGRFVIGYYQQRQDLFTKS >NZ_AP021884.1|WP_147072268.1|2637986_2638685_+|type-I-C-CRISPR-associated-protein-Cas5 MPKTLCLKVWGDFACFTRPEMKVERVSYDVITPSAARAVFEAILWKPEIRWTVTKIEVLKPIKWISVRRNEIGKVASADNGQGDRGLYIEEHRQQRAGLFLRDVAYRLHAQFEVVDGSKHVHHYPELRGRFPAEPEESQPEHPAKYLSMFQRRAKKGQCFWQPYLGCREFSAHFELVDDAAAASLAEPPISDSPSLGWMLHDIDFADAMRPGFFRAEMKSGIIDLEDVEVRR >NZ_AP021884.1|WP_147072288.1|2647299_2647656_-|DUF2934-domain-containing-protein MAESKAKSKASGKPVSAVAETKPKAKTAQPAAGKAAAQSAVAAKPKVAKPKVAAPGANEPAAKRSVKLSNPAVSAEQRYRMIAEAAYYIAERRNFAPGDAAADWAQAEVQIVALLNKK >NZ_AP021884.1|WP_147072290.1|2647849_2649325_-|metalloprotease-TldD MTTSTLTGVLIPNPEMLFQTAHETLLVPNQLEASQLDGVFGRLMDHHVDYADLYFQYTRSEGWSLEEGQVKSGSFNIEQGVGVRAVSGEKTAFAYSDDISQPALLAAAEATRAIARSGAVRKPHAVARGGGHALYQPLDPLTTLKDAEKVALLEKLERYARAIDSRVTQVMASLASEYDVVLIARSDGHQAADVRPLVRLSLQVITEQDGRREQGSAGGGGRFGYDYFSDAMLKKYAEQAVHQALTNLAARPAPAGSMTVVLGAGWPGILLHEAIGHGLEGDFNRKGSSAFAGRVGERVAATGVTVVDDGTLMNRRGSLNVDDEGNLTQCTTLIENGVLKGYMQDTLNARLMGVPITGNARRESFAHIPMPRMTNTYMLNGDKDPEEIIASVKHGLYAVNFGGGQVDITSGKFVFSAAEAYMIEDGKITYPVKGATLIGNGPDVLTRVSMIGNDMALDPGVGTCGKEGQSVPVGVGQPTLRIDGLTVGGTA >NZ_AP021884.1|WP_147072292.1|2649382_2650324_-|carbon-nitrogen-hydrolase-family-protein MEKIVSDKTSKSPSSFAKPKSRTTVAKPARAPAPGVIRMAAIQMASGPNVSANLAEAERLVALAVAGGAKLVVLPEFFAIMGNKDTDKVAAREEEGKGPIQKFLASAAKKHKIWLVGGSVPLACDNPKKVRNSCLVYDDKGKLVARYDKIHLFGLDLGVEHYQEEKTIEPGDQIVVLDSPFGRIGLSVCYDLRFPELYRAMPNVDIILVPSAFTATTGKAHFETLVRARAIENLAYVIAPAQGGYHLSGRETHGDTMIVDPWGVVLDRLPRGSGVVMAGINPAYQASLRKSLPALKHRTLDCSHIQIKDKAIK >NZ_AP021884.1|WP_147072295.1|2650366_2654170_-|TIGR02099-family-protein MIAFSRRWIRRSVDYVVLPLALVVVVLVLLLRLWILPDIDRWRDDIAASISHSAGQRVTLGEINANWQGLHPHLRIRDIRVFGADGRPVLFLADVRATLSWTSLLHGELRLAVLTMDDVALTIRRDMQGIHVAGILLNQSDSSGGFGDWLLAQRHIQVNHATLAWNDERRGAPYLVARDVNLTLQNRGHRHRFRLTAIPPEQLAQPLDIRGDFSGRSLDDLASWHGQVYARVDRTDLGQWRQWLTLPYAISQGYGGLRMWLDVASRQVIAATVDASLRQVSVRFAADLPVLRLADVSGRGLWKRLGPAQSFAVKQLSLRTANFVYVAPFDLTLRLDPANAIQPGSGRIDTNSVQLDRLAALAPYLPLDAVQRRRLADLQPRGQLEKFTLAWSGNADQPLDYQIKGRFTRLGWQAQGNLPGAAGLSGNIDATRSNGTLALTSSGVMLALPRVLFEPDVALTTLTARMNWRATQAGYLIKLTEASFANPDLAGSAFGEYQLQAGRRGVIDLTGRLSRANVASAYHYLPLVVKDPTYQWVRSALLAGQGGAASIRLQGDLSRFPFRKAGDGVFEISTPISNGVLQYAAGWPRIEGIQAQLKFTGTRMEISSDAATIYGAALRRVSAVIPDLVDPDEILEVKGEAAGPLAELVRFANTSPLAAKLDNVTDNLRTTGNSRLGLDLKLPLRRAHHATLVGDIRFLGNTLIPAHGLPTLENVQGRLSFTDTGISAQSISARLLGGAATLSAVTQPGGVTRLLVDGRMTAAGLRPYLGTALAGHLSGMADWHARVDLHQMQAQADFESNLVGMASDLPPPFAKAAADSQPLRVKKSLRGADESLLAIHYGQVASALLLQKQKDGEPVIERGTLRFGGEAVLPEESGLWITGSLLLSDLDLWRNELTAAGNGAIGLPPLAGVNLSFRTLDLFGRRFQDININARNQAGTWRANVAGRGVNGDVTWQAADSRAGQPQDRLGAHFKTLAIPAALPVQGVKSSPSGSLPALDISVDNLQLGNRPLGRLSVSATPLDSGLNFESIRLTQPDSTLTMQGIWNPDRIPQTRAKIHLEVNDVGRFLARFDHPGLVKRGQATLDGEGEWNGTPADIAIPSLSGTFALKASSGQFAKVDPGIGKLLGVLSLQALPRRIGLDFRDVFSDGFAFDEISGTMRLSRGVVYSDDFRMQGPSAKVRMSGMVDINAETQQLRVAVSPKLSESVALAGTLIGGPFVGLGALAVQKLLKDPFGQAATFEYSVTGAWTDPVVKRVARIAGGGEP >NZ_AP021884.1|WP_147072299.1|2654382_2656353_-|acetate--CoA-ligase MANIESVLQETRVFPPSAAFQAQANVSGMASHQALTARAAADYEGFWADMARAGISWKKDFSKILDESNAPFYKWFYDGELNVSYNCLDRHLPEKADKTALIFEADDGAVRRVTYQALYNQVCAFANGLKSRGVQKGDRVIIYMPMGVEAVVAMQACARIGAIHSVVFGGFSAKSLHERIRDAGARLVVTADGSIRGGKMLPLKSAVDAAIALGDCECVEAVVVYRRSGDDTAWNAARDIWWHDLVNGMAQTCEPEWVNAEHPLFILYTSGSTGHPKGVQHSSGGYLLGAILSMQWVFDARPDTDVFWCTADVGWITGHSYVVYGPLALGMTEVIFEGVPTYPDAGRFWKMIQDHQVTTFYTAPTAIRSLIKLGSDLPRQYDLSSLRLLGTVGEPINPEAWMWYYEAVGQSRCPIADTWWQTETGSHMIAPLPGAVATKPGSCTLPLPGIMADVVDEHGGSVPLGQGGYLVIKRPFPSLLRSLWGDPERFRKTYFPAELGGKTYLAGDSAHRDADGYYWIMGRIDDVLNVSGHRLGTMEIESALAANPRVAEAAVVGKPHDIKGEAVVAFVVLKGARASGDEAKKIVAELRDWVGKEIGPIAKPDEIRFGDNLPKTRSGKIMRRLLRAIARGEEITQDVSTLENPAILEQLKEAVR >NZ_AP021884.1|WP_147072301.1|2656415_2659091_+|bifunctional-[glutamate--ammonia-ligase]-adenylyl-L-tyrosine-phosphorylase/[glutamate--ammonia-ligase]-adenylyltransferase MPAHHLIERAASHSRYLARLLAADAQFVDSLASGLAQPFGADAMQAQLQAAAPGDEAMLKTALRKLRQAVMARLIVRDLGGLADLSEVMGTCTDLAETTLRCALAHHSTWLAQKHGMPKNPDGSDMQLVVVGMGKLGGRELNVSSDIDLIYLYPEQGETTGAKPVSHHEFFVLLGKKLGLAISDLTADGFVFRVDMRLRPWGDAGPLAMSYAALEDYLVAHGREWERYAWIKGRALTGTRLAELDQIIRPFVFRKYLDFNAFAAMRELHVQIRREVIRRDRADNIKLGPGGIREIEFTAQVFQLIRGGQVAVLQTRSLLAVLPLLAARGLLPENAVAELQAAYVFLRNLEHRLQYLDDAQTQMLPTQPDDRTRIATSMGFTDYPAFLAALNAHRTQVSRHFDQVFAAPQADSGSHPLAGLWQGALEHADALATLAGLGYTAPAEVCNRLRQIRTSIRYTTLPASNRARFDTLMPALIEVAASCNPPDATLARILDLLETVARRDSYLALLVEYPATLQRVARLCAASPWAAQYLARNPMLLDELLDTRQLYATPDWPALGDELQALMHTHCGDTERQMDAMRQFRQRVTFHLLAQDLAGVLALETLSDHLSDLAALILSATLPLAWAGVRNRHRDTPRFAVIGYGKLGGREMGYASDLDLVFLYEDPAPAAAEHYARLAQRINTWLGSTTAAGVLYETDLRLRPDGTSGLLVSSVEAFSQYQHSHAWTWEHQALTRARYVAGDAAVGAAFERIRCDILTQPRDPARLREDVLAMRQKMHAGHPNHSDLFDLKHDAGGIVDVEFMVQYLVLAHAARHRELTRNSGNIALLRLAAELELIPASDAEAVRSAYRELRRLQHALRLHGIQTARIEPMQVAGHAAAVRRLWRTLFG >NZ_AP021884.1|WP_147072457.1|2659154_2660069_+|branched-chain-amino-acid-transaminase MADRDGFIWYDGKMVPWRDATTHVLTHTLHYGMGVFEGVRAYNTDQGTAIFRLQEHTDRLFRSAHILGMKMPFDKAAISAAQLAAVRDNQLESAYIRPMAFYGAEAMGISAKTLSTHVIVAAWTWGAYMGAEALERGIRVKTSSFARHHVNIAMCKAKANGNYMNSILAHQEAAQDGYQEALLLDVDGFVAEGSGENVFIVRNGKLITPDLTSALEGITRDTIVQLAGEIGLQVVEKRITRDEMYSADEAFFTGTAAEVTPIRELDNRTIGTGARGPITAQLQKMYFDCVTGKDPKHAGWLSYI >NZ_AP021884.1|WP_147072303.1|2660079_2660286_+|zinc-finger-domain-containing-protein MAQVQHENTQRIIEVTADDLPLHCPTPGMIAWDSHPRVFLPVEVKGEALCPYCGTMYILKGGAVAHGH >NZ_AP021884.1|WP_147072305.1|2660694_2661141_+|6-carboxytetrahydropterin-synthase-QueD MLITRRLEFDAGHRIPNHASQCKHLHGHRYAIEITLSGDIITAEGQSEQGMVMDFSDVKRIAREQLVDAWDHAFLAYRGDKPVCDFLATLTDHKTIILELVPTVENLAHIAFDILDPAYRDTYGNQLRLKQVRIYETPNNWADCRQPE >NZ_AP021884.1|WP_147072307.1|2661191_2661659_+|YbhB/YbcL-family-Raf-kinase-inhibitor-like-protein MGMTMTSTAFAHHGAIPEHYTCDATDTSPPLAWAGVPVGAKSLVLIVDDPDAPDPAAPQRTWVHWLLYNLPPTSSGLAEGVTALPAGTLEGINDWKRTGYGGPCPPIGRHRYFHKLYALDVVLPNLDRPSKAALEKAMQGHILAQTELIGLYQRH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_AP021884_4 | 2907296-2907401 | Orphan |
NA
Consensus repeat of NZ_AP021884_4
|
1 spacers
spacers of NZ_AP021884_4
>4.1|2907330|38|NZ_AP021884|CRISPRCasFinder CAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGC |
CRISPR arrays and Neighbor proteins around NZ_AP021884_4
The CRISPR arrays of NZ_AP021884_4 >merge|NZ_AP021884|4|2907296-2907401|CRISPRCasFinder GATTCCAACGACTGGGTGCGGCTGGGCAGTATGGCAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGCGATTCCAACGACTGGGTGCGGCTGGGCAATATGG >NZ_AP021884|4|4|2907296-2907401|CRISPRCasFinder GATTCCAACGACTGGGTGCGGCTGGGCAGTATGG CAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGC GATTCCAACGACTGGGTGCGGCTGGGCAATATGG
>NZ_AP021884.1|WP_147070033.1|2903228_2905838_+|alanine--tRNA-ligase MKSSEIRQRFLDFFARHGHTPVASSPLVPGNDPTLLFTNAGMVQFKDVFLGRETRPYARAVSSQRCVRAGGKHNDLENVGYTARHHTFFEMLGNFSFGDYFKRNAIQFAWEFLTQELGIAKDKLWITVYHTDDEAHTIWTAEMGVPDERVIRIGDKPGGGSDNFWQMGDTGPCGPCTEIFYDHGAEVAGGPPGSADEDGDRYIEIWNLVFMQFNRDEAGNLQPLPRPSVDTGMGLERISAVMQHVHSNYEIDLFQALIHAAARVTGSADLTDNSLKVIADHIRACAFLITDGIIPGNEGRGYVLRRIIRRAIRHGYQLGQKQPFFHLLVADLAMAMGAAYPELVAAQARVTAVLKQEEERFAETLEHGMDILEQALQSGANVLDGATAFKLYDTYGFPLDLTADVGRERGFTVDMAGFEAAMEAQRKRARAASKFTMQAGMRFDGPPTEFRGYDTLSLDSRILALYQDGSPVHSIAAGEAAVIVLDRTPFYAESGGQVGDSGELHGSGSVFVVDDTQKIQPDVFGHTGLLQSGSLKLGDTVSAQVDADARSRAACNHSATHLLHAALRQVLGTHVTQKGSLVDAARTRFDFAHSEAVSAAQLQQIEDLVNREIRRNVIVEARLMNYDAAIAHGAMALFGEKYGDQVRVIGMGEFSTELCGGTHVSRSGDIGLFKIISESGVAAGIRRIEAVTGPAALAMIQAQQRQILEAAALLKAPPQELQQKIAQIVDNVKNLEKELDRLKSRLAAAQGDDLVSQATAVGNAKVLAAMLEGADVKTLRETVDKLKDRLKSCAVVLGSCSDGRVTLVAGVSADLTSKVKAGELANFVASQVGGKGGGRPDMAQAGGTEPAQLPAALQSVAGWVAQRLE >NZ_AP021884.1|WP_147070267.1|2901373_2903095_+|thiosulfohydrolase-SoxB MNRREFLQILAVAAASGMAIDNKQALAGNAPSGFYDLPKFGNVSLLHMTDCHAQLLPIYFREPDVNIGVGAAIGQPPHLVGEYLLKYYGIRPGTREAYAFTYLDFAAAARTYGKVGGFAYLKTLVDKVRASRPGSLLLDGGDTWQGSATSLWTNGQDMVDAAKLLGVNVMTGHWEFTYGAERVKHVVDNDFKGHIDFVAQNIKTNDFGDPVFKPYVIKTMNGVQVAIIGQAFPYTPIANPRYMVPDWSFGINDDNMQKVVNEARAKGAQVVVVLSHNGMDVDLKMATRVTGIDAIFGGHTHDGVPQPTQVKNAKGTTLVTNAGSNGKFLGVMDFDVKNGKIAAWKYRLLPVFSNLLEPDAKMAKLIEDVRAPYASKLNEKLAVTEELLYRRGNFNGTFDQLILDALMAVKGADAAFSPGFRWGTTLLSGDVITMDHLMDQTAITYPSTTLTEMTGATIKSIMEDVCDNLFNADPYYQQGGDMVRVGGIQYAVAPNNKIGNRISNMTLKGKPVMASKKYKVAGWAPVGEGVSGTPIWDVVAEYLRDIKVVKPRKLNEPKIVGIGKNPGIAPGIA >NZ_AP021884.1|WP_147070035.1|2900312_2901191_+|sulfur-oxidation-c-type-cytochrome-SoxA MKTTFREPQAQSPKAHGKKILLALAGAGLLLGALNASATPEQDRQSLLKFYSSKYPDIKVANYIYGALAFDPDAMEQYNSIMDFPPFGSVIEHGKKMWETPFKNGKKYADCFPNGGKNVAGNYPYFDDKAGKVVTFEMAINACRTANGEEAFKYNDMQTMGTLTAYARTLSDGMPMNIKVQGAAATAAYEAGKSQFYSRRGQLNFSCASCHVANAGNHLRSELLSPAVGQATHWPVFRGGEQLVTLQERYVGCNKQVRAVPFAPGSEEYNNLEYFHSYISNGLPLKASVFRK >NZ_AP021884.1|WP_147070037.1|2899922_2900237_+|thiosulfate-oxidation-carrier-complex-protein-SoxZ MAEPMKMRASVSGDVADIKVLMNHPMETGLRKDAKTGQLIPAHFINEVHATVNGKPVLDAQWGGGVSKNPYLGFKVKGAKAGDKVEVSWKDNKGESNKVDGVVA >NZ_AP021884.1|WP_147070039.1|2899397_2899865_+|thiosulfate-oxidation-carrier-protein-SoxY MNALRRNILKSAGATGIVAMAAAAGLLKSGNVLAAWNSSAFAAKTVPEAIKDLGLSTPADSKAISIKAPDIAENGAVVPVEVTSSIAGTTGIAIFAEKNATPLITDFKLSNGAEGFISTRIKMGQTAMVRAVVTAGGKTYTAAKEVKVTIGGCGG >NZ_AP021884.1|WP_147070041.1|2898997_2899366_+|sulfur-oxidation-c-type-cytochrome-SoxX MRAGASLILTASMVGMFVMANYAVAADTPKQEETGKSIAFDKTKGNCLACHAMPTVPDAVAAGTIGPPLIAMSARYPDKAKLRAQIWDATVANPQSVMIPFGKHKVLTEQEIDKVTDFVYGL >NZ_AP021884.1|WP_147070043.1|2897364_2898786_+|M48-family-metalloprotease MKSASLFLALCLASQQLAASELPDLGDVSQGAFSPRDEARVGNEIMRDIYAEPAYYDDPELTDYLNNLGYRLVAASPENRLAFQFFVLRDHTLNAFALPGGFIGVHTGLIEATQSESELAGVLGHEIAHVTQHHLARMIESRNQGILPSLAALAVAILAARSNPQAASAAIATVQATSIQKQLNFSRANEREADRIGMQIMRGAGFDPRAMATFFERLQKNSRLYENNAPAYLLTHPLTSERIADMQNRAASMPVKQVADSLEFQLLRAKLLAGEGRPEEAVRRFTEAIRDTRYNSLAAERYGLVVALLRTRQFDRAEQELDRLNQSGASSPMIAMLGARLRQEAGDLNTALARYQAGRARFPGYRPLLYADANALLQAGKADAALALVTDHLALYPDDYRLYQLQSRAYAMQGKDFLRHHAQAEAYVRQGNLDAAIEQLKLGLKSRDGDFYQMSIAEARLKELVALNQPAKP >NZ_AP021884.1|WP_147070045.1|2896848_2897325_-|cyclic-pyranopterin-monophosphate-synthase-MoaC MNQLTHFDDRGRAQMVDVADKSDTRRVAVAAGRIVMQPATLKMILDGSARKGDVLGVARIAAIAASKRTADLIPLCHPLALTRVAVEFLAEEADSAIECRVTAETVGKTGVEMEALTALSVGLLTIYDMCKAVDRGMRMEGLRLLEKQGGKSGHWRAP >NZ_AP021884.1|WP_147070047.1|2894604_2896824_+|EAL-domain-containing-protein MTQIDTRLISTATWLAGALAGLIALAFPLVYFSLSYEHQAASMETEAEFEAARIARLINANPELWPFEQSRLQELLQDQTETELPESRRIVDVNGRLIAQSQGKSARPYLLRTADLRNSGSVAGRVEIIRSLRPLLLKTAMASLLGLLLGSLAMVIFRAYPLRILKRALNTLANEKERAEVTLHSIGDAVITTNASGHIEYLNPVAEQLTGWTNEAARGLPSWRVFNIINESTGAPLDSPAEKAIKENRIVPLANHAGLVKRNGKIIPIENSAAPICDSQGQIIGAVLVFHDVSHARAMATKLSHQASHDPLTGLINRHTFESRLQQALDNVRRENSHHTLCYMDLDQFKIVNDTCGHRAGDELLRQLAGELRTKVRNSDCLARLGGDEFGLLLEGCTVQQAEHVAATLLQTVKEFRFHWQEHTCAVGVSIGLVGINAGCGDLAKIMGAADSSCYAAKDRGRNCIYVYQPDDKEVAQRRGEMQWVARITRAIDEGRLRLYYQTIQPLAGTQGAHYEILLRMLDEEGRIVPPGTFIPAAERYGLMPAIDRWVIENTFATLGRLYRGDAKKRLHTCAINLSGTSWADESLAGFICGMTGRHGVPARSICFEITETAAISNLGKTIALIRDLKEAGFRFSLDDFGSGVSSFGYLKQLPVDYLKIDGGFVRNIIHDKIDHAMVAAINQIGHIMGIKTIAEFVENEEILERITAMGVDYAQGYAIARPQPLDHINLASAPVLQQ >NZ_AP021884.1|WP_147070049.1|2893538_2894183_-|methyltransferase-domain-containing-protein MQAAEYDAWYQTPRGRWVGETESDLLRRMLGPQSGESLLDVGCGTGFFTRRFARGSRSAVTGLDPNRDWLAFAERHAVSTENYVNGSALALPFDAGGFDLVMAVTALCFISDQRLALTEMLRVARRRIALGLLNRHSLLYWQKGRGGGQGAYRGAHWHTAAQVRELFAGQPVRNLRMAFSIFVPGGGWIAQQLESVLPTSLPLGSFFVVTADVV >NZ_AP021884.1|WP_147070029.1|2907798_2908467_-|alpha/beta-fold-hydrolase MTAFAPLEFVAGSQAVQASVIWLHGLGADGHDFAPVVQALDLPGVRFILPHAPTRPVTINGGHVMPAWYDIRSTGLDADEDAAGLAQSSRVVEDLVAHELARGVASARIIVAGFSQGGALALYAGLAPGRVLGGIMVLSAYLPLMAGFNEWCAAGTHTIPVFMAHGVQDRVVPLQLAERSRQKLVACGFDVEWQIYPMAHSVCEEEIDAIRGWLIRVLQLHV >NZ_AP021884.1|WP_147070027.1|2908463_2909963_-|hypothetical-protein MRRIPLVGKLLPAAGEALAVPVRDDPASYSAHEICESIEQLIETLLSARKKNLDWQRHSIDSLHRQDNFSAPFMTRLTQHYLALPPFVSSVSGRFLAAISGYWEEMSAIHLQCVTYLLGHPESRLGALLPLLIQRALYHHAMQMKWRWLRYQLIPSCFWARLHRLYAVAEKHEFARVPLPLPGMERADSCCETLYLRPQMLHSLRPDTLLPCEIEQVDEWIVRWSKSVLLEPMLLSGKHRYGVNLKGASPPRPLAMLNEPGSYRYWGPGLMLAALHAEHDEADAGAHAGWRQALWRRVVNDWSGIPPLRHHPRQMIGKQTELFLGFNEIHTRIDHHPSRRAHDLPYWRRCRVRDESAEGLGLALNTSDGVPVAINSLIGINSGRHFLVGVVCRIRRHESGWTEIGIRRLAANAVPVKLESVNVNLAGQVVDALYLSMAGAFGQRRCVLIPARISWQDGQWQLLCKGRRHLIRLRAPLKATEDYVLADFDGLAQSEAIAS >NZ_AP021884.1|WP_147070024.1|2909981_2910410_-|hypothetical-protein MKDFIALLVEQSRLQSVAINPALDDALTHLDHALAGLCAAVQVEYRGPYVGVETPLAHQMVVRRHEWKIHQPAWSMKICVAAPAANCRAEWPVQGVGRLRKALVVKALPAFFAGFAEAIKQAGKQDSSAGLRVLELSRRFNL >NZ_AP021884.1|WP_147070022.1|2910437_2912018_+|sigma-54-interacting-transcriptional-regulator MRAQIRANWRKYYHTTRRAVRARSATRTAGNLAQSGQNANNAKEGTNVSLQGISAKPSLLIVDDDPLITDTLNFVLSRDFEVFVADSRSQVKSLLTQLDTPPQLALVDLGLPPLPHKPDEGFHLISELLGYSPGIKILVLSGQNDETNARHARALGAIDFVGKPCEPAQIKSLLFNALLIQDVERSAETEAPAAENLIVGTSFNLDRLRQQITQYANAPFPVLIEGESGSGKELVAASLHKLSGRTKKPYLALNCAAISPTLVEPTLFGYCKGAFTGATSNRAGYFEDACDGTLFLDEIGELPLELQAKLLRVLENGEFQRVGETQSRFSNARVVTATNRDLRQEIKAGRFRADLYHRLSVFGIAVPPLRELGEDKVRLLEHFREFYAREARVKPFALDNRARQMWEDYHFPGNVRELRNIVIRLTTKCAGQNVTAEQLETELDTDTAFPSEIPLPNDGKALYDTARRHLQTLANFSLDQTMKQWEKSYVEAALNLTHGNLSQAAKILGINRTTLYSRMQTYTNEA >NZ_AP021884.1|WP_147070020.1|2912147_2913497_+|AAA-family-ATPase MYHEFFGLKEAPFRITPDTGFFFSGGERGAILQGLAYAIRQGEGIIKVTGEVGSGKTMLCWMLEQHLPDHIETVYLANPNVKPEDVLPSILAELELVRPADASRAGHLRTLNDYLLARHDAGKQVVMFVEEAQGMTLDTLEEIRLLSNLETEREKLLQIVLFGQPELDAKLADPRIRQLRERITTAITLAPLTPDAIRAYLAFRLTTAGYRGPDLFDRRAVRSIARASRGLTRRVNILADKSLLAAYTDNTRTIQPRHIRIALRDSAFNDDANKPQRWLLPVIAMGVMVAVLASFYWRSKPAAAPSRQTQTRPAAGLPGRASAAAPDPVAPLSADPFQQRLAATRTWLMQQPADTRTIQLSLLNSPSEFAAYLRGEGGGLAPDQLRIFRTQAQGHPSWTVIYGSYPTRQTANRALLALPEAVRKRHPYLRTVGGIRNETRQIQQVGEQS >NZ_AP021884.1|WP_147070265.1|2913576_2915352_+|secretin-N-terminal-domain-containing-protein MWLPMLAVPLLAGCVPAAMIQPSQGHIQQSSQPATRLADIPPLVKTIPYLPSPRAETQVPTYTIVVDNVPVKDLLFSLARDTKKNIDIGTGITGNVTLNAVNEPLPAILERIARQASIRYRMEGDTLSIMPDTPYLKTYKVNYVNLSRNTSSSIGVAAQIASTGSGAVGAAASGSAQGGNSSSTTVDSQSNNNFWEVLTENVRAILTSTRASTQRAEDKSARLDAERNARADRLEQAQAVARAGAAAPTLYREAFGNTSSSLLQDSKNEVIVNPVAGTVSVLGNERQQQVVQQYLDGVSQSSQRQVLIEATIVEVSLKDQYRAGIDWSRLANGSKGIFFNTMPAATTNLANSLLPFFNIGYRDRNLTATLNLLESFGNLRVLSSPKLMALNNQTALLKVVDNLVYFTVQAQQGTLSSTGTPLQPTTFTTTAKTVPVGLVMSLTPQISESGMVTLDVRPTISRKIGDVSDPNPGLPVSTPNKIPVIQVREMESVLQVGSGQTVILGGLMQDDSDRARDGIPVLSRPQGFGAIFGQHEHNVQKTELVIFLRPTVITNPSLDSDELKFYKRYLPRANAAPEQWHNGADAAGDPQ >NZ_AP021884.1|WP_147070018.1|2915348_2916524_+|tetratricopeptide-repeat-protein MSLLLKALKQAGDKSAAGARNPSATLADSLSLEPISGSAPDGTAYTSWDGAAPFKRSTARAAWYTPWLSGQRWLVPAVAVVAALFMLIYGVFVYWQTRTPAALVVTPTPHSAAPAAAPPAAAPAQLAAVPSQESGPPLPEINSAVPDAPAALPPPPVQADPTPQWGSGELIREAPPPRRARTQPGRRETRSALPFSMQTATTHINPQLEAAYQAYQAGHTREARNLYLQIPDGERNVDVQLGLAAIALRDNDTPAAARHYQRVLELDPRNSTANSALIGMMGDADPNASETRLKSLIASQPSSQLYFALGNLYAGQNRWPDAEQAYFEAYQKNAANADYAYNLAVSLEHISQSRAALNYYQKARDLMQPGNVQFDPLRLEARIDQLKARQE >NZ_AP021884.1|WP_147070016.1|2916529_2918233_+|Flp-pilus-assembly-complex-ATPase-component-TadA MEARKTLRLGEMLVQQGLITLDQLRIALKEQQHTNLPLGRLLVKLGFITEAVIRDQLAHTIGQTSLDLANVVADPEALKLISEDFARRHHLLPIAFDAQRQVLVVAITDMFNVVALDHLRALLGAGVEVDTVLSGEAQLLEAIDNFYGFELSVDGILREIETGEVDYQSLAMDTEEYTQPVVRLVGSLLVDAVKRGASDIHFEPEHAFLRIRYRIDGVLEQVRSLHKSYWPGIAVRLKVISGMNIAENRAPQDGRLSLTLHGRPIDFRVSSQPTIHGENIVLRVLDREKSIIPLANMDLPTDTHTALQRMMARPEGILIITGPTGSGKTTTLYSLLTHLNNETVNIMTLEDPVEYPVTLMRQSSVNETLKLDFANGIRSIMRQDPDIILVGEIRDRDTAEMAFRAAMTGHQVFTTLHTNSALGAFPRLLDIGIVPDIMAGNIIGVVAQRLVRVLCPHCRAAYTPDADEQKLLDWQATDRRPVYRAVGCPACNGKGYRGRMALMELLRMDSELDDLVARRATHREILNAALMRGYRSLAVDGISRVLEGKTSLAEVSRVVDLTQRILS >NZ_AP021884.1|WP_147070015.1|2918246_2919452_+|type-II-secretion-system-F-family-protein MPYFSYRAVDQIGRTNRGSLSAANEVDLELRLRRMGLDLITLRQMDSRASGFARGAASRRDLITFCFHLEQISRAGIPILDGVRDLRDSMDNPRFRDILTALLEDMEGGRLMSQALAAHPAVFDTVIVNLVRAGEQTGLMREVFENLGASLKRQDELAAQTRRLLIYPTLVLSMVGIIILLLLLFLVPQIADLIKNMGIALPIQTRVLLWLSETLRTWWPLFLILPVAIGSALVVTLRASERARFVADDVKLRLPVIGPILQKIALARFSNFFALMYRSGITILDALRAGEDIAANRVIADAIRRAGGRIGNGEGLTESFQSLSVFPPLVIRMLRVGETTGALDTALENVSYFYTREVSESIEKSLKILEPALTVVLGLVMAVIVGSVLLPMYDVIGTLKP >NZ_AP021884.1|WP_147070013.1|2919448_2920984_+|hypothetical-protein MMFAPQLLVYVCAWSITVACRRAGKIRLVGQFNADEGGRRAFAAVLQAFKNSPVSVMVDGVDEDYRLETLPHVLGNARREMLERRLRQISRNALFSAAWPQGREASGRRDDRYLFISLSNHDAVRPWLDLLHQHGVHLAELTVLPAISHVLLQRIQPTEPHVLLVSEHCGGLRLSYFEHGNLRFSRLTAPESLAEGHAPDLASEINKTDLYLNSQRLMPRDAQLAVYVLDPENAYAGLCREISAENKNLICQAVGSVALAKLVGVDEPLLHRTADVAYLAVLGRSRAAVNLAPAAYTRGYVQLMLRHKLYTGAFAVLATALAISGYLFSRQHDLEQQRLVTQDRIQQQASLYRAVQLALPRAPTSPQNLKRVVETARALYAAPQPMSDFARVSQALETVPDIAVLRLRWLDHDAADTTATHSAVSDNPGAAVRALYFDGEVSPFQGDYKTALASIEHFAATLRNDPGVAEVRVLALPINTDPTATLDESQHTGNSAPRARFRLKLLMRPAR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NC_007766 | Rhizobium etli CFN 42 plasmid p42f, complete sequence | 395837-395869 | 6 | 0.818 |
NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NZ_CP020911 | Rhizobium etli strain NXC12 plasmid pRetNXC12e, complete sequence | 526121-526153 | 6 | 0.818 |
NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NC_021911 | Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1f, complete sequence | 601591-601623 | 6 | 0.818 |
NZ_AP021884_3 | 3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2645871-2645904 | 34 | MN692973 | Marine virus AFVG_117M33, complete genome | 35754-35787 | 9 | 0.735 |
NZ_AP021884_3 | 3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2644999-2645034 | 36 | NC_008043 | Ruegeria sp. TM1040 megaplasmid, complete sequence | 694259-694294 | 10 | 0.722 |
NZ_AP021884_3 | 3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2645653-2645687 | 35 | NZ_CP007794 | Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence | 1356140-1356174 | 10 | 0.714 |
1. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches to NC_007766 (Rhizobium etli CFN 42 plasmid p42f, complete sequence) position: , mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer .*****.** **************** ****
2. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches to NZ_CP020911 (Rhizobium etli strain NXC12 plasmid pRetNXC12e, complete sequence) position: , mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer .*****.** **************** ****
3. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches to NC_021911 (Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1f, complete sequence) position: , mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer .*****.** **************** ****
4. spacer 3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches to MN692973 (Marine virus AFVG_117M33, complete genome) position: , mismatch: 9, identity: 0.735
aacaatttcttgctggataaaatcaagccgctta CRISPR spacer tacaatttcgttctggataaaatcaagtgttgca Protospacer ******** * ***************. . .*
5. spacer 3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches to NC_008043 (Ruegeria sp. TM1040 megaplasmid, complete sequence) position: , mismatch: 10, identity: 0.722
atggtgcgatcctgttgttgctggttgtgctgcggg CRISPR spacer tcgctccgatcctgttgtggctggtggtgctgatca Protospacer .* * ************ ****** ****** .
6. spacer 3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP007794 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence) position: , mismatch: 10, identity: 0.714
accggttcatcgccgtgccgcctgtccatcgccgc CRISPR spacer gtccagccgtcgccgtgccccctggccatcgccgg Protospacer ..* . .*.********** **** *********
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
622875 : 631682
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP021884|622875:631682|DBSCAN-SWA TTCAGGAATTGCCGCTGTTGTCGATGAAGCTCTTGAGACGGTCAGAGCGCGATGGGTGGCGCAGTTTGCGCAGCGCTTTGGCTTCGATCTGGCGGATACGCTCACGGGTTACGTCGAACTGTTTGCCGACTTCCTCCAGGGTGTGGTCGGTATTCATCTCGATGCCGAAACGCATGCGCAGCACTTTGGCTTCGCGTTGCGTCAGGCCATCGAGAATATCCTTGGTCACTTCCTGCAGGCTGCCGTAAACGGCGGCGTCTATCGGCGCCAGCGTGGCGGTGTCTTCTATGAAATCGCCCAGATGGGAATCCTCGTCGTCGCCGATAGGGGTTTCCATGGAAATAGGCTCTTTGGAGATTTTGAGTATCTTGCGGATTTTCTCCTCGGTCATTTCCATTTTTTCGGCCAGCAATTCCGGGTCGGGTTCCTTGCCGGTTTCCTGAAGAATCTGACGCGAGATACGGTTCATCTTGTTGATGGTCTCGATCATGTGCACCGGAATACGAATGGTGCGTGCCTGATCCGCGATGGAGCGGGTGATGGCCTGACGGATCCACCAGGTGGCGTAGGTCGAGAACTTGTAGCCGCGCCGGTATTCGAATTTGTCCACGGCTTTCATCAGGCCGATGTTGCCTTCCTGGATCAGGTCGAGGAATTGCAGGCCACGGTTGGTGTATTTTTTTGCAATGGAAATCACCAGGCGCAGGTTGGCCTCGATCATTTCACGTTTGGCGCGGCGCGCGCGCGCTTCGCCAGTGGACATCTGGCGATTGATTTCCTTCAAGTCCTTGATCGGGATGCCCACTTTTTCCTGCAGGGCAATCAGGCGTTGCTGACGTTCGACGATAGTGTGCTGGTAGCGTGTCAAATCCTCTGAGTAGGCCTTTTTCGAATTGATTTCCTTGGTAACCCAGTCCAGATTGCTTTCGTTGCCTGGAAACACCTTGATGAAGTGGGCGCGGGGCATACCTGATTTGTTCACCGCGAATTCCATGATGTCACGCTCGTGGCTGCGGACTTCTTCCACCAGATTGCGCAAGCCTTCGCATAGCGCCTCGACTTGCTTGGCAGAAAAGCGGATATTCATTAGCTCAGCGGAGAGCTCTTCCTGCAGTTGCAGATACTGCGGGCTGCCGAAGCCGTTTTTCTTCAACGTGGCCTGTATCCGCTTGAATACCTTGCGGATGACTTCAAAGTGCGCCATGGCGTCGATCTTGAGCTGAGCGAGATTGGCTGCAGCCAGCGCGCTACCGTCATCTTCCTCGGCATCTTCGTCGAGCTCTTCTTCGAGTTCGCTGATGTCAACCTCGGGCTCGCTGGCGATCGCTTCGAGTTCTTCTGCTACGACACCATCCACGAAATCGTCAATGCGAATTTCCTCACGCTCAACCTTGTCCACCAGTGTCAGGATTTCCTGAATCGTGGTGGGACAGGCGGAGATGGCCTGAATCATGTGTTTCAAGCCGTCTTCAATGCGTTTGGCAATTTCGATTTCGCCTTCACGCGTGAGCAGTTCCACCGAGCCCATTTCGCGCATGTACATGCGCACCGGGTCAGTGGTGCGGCCGAACTCGGAATCCACGGTAGATAACGCAGCCTCGGCCTCGGCCACCACGTCCTCATCGGCGACTGCCGGGGCGGCGTCGGACATCAACAGGGTCTCGGCATCAGGGGCTTCGTCATAGACCTGGATGCCCATGTCATTGATCATGCTGATGACGCCTTCGATCTGTTCGGCATCCAGCATGTCATCTGGCAAATGGTCGTTGATCTCGGCGTAGGTCAGGTAGCCCCGCTCCTTGCCGAGCACAATCAGATTCTTCAGGCGTGTGCGTCGGGCTTCGACATCTACCGTTTTCACCTCGCCTTTTTCGTGATCGTTTGCCATATGTCTTGGTCCAGTAAAATTTGGTGCATCAAAAAAACTGACAATTATACCTTTGTTCGAGCCTGATCGCTCATTTTTCCACCTGCTGTCTGGCATTAACCAATTGTCGCAACGCTGCTTTTTCGGTTTCACTCAGTGCGCTGACGGGCTTGTCGGTTACACGCGCACCCCGCTGTTTGTCGAGTTGCATGCGTGCCTGATAACAGGCGTCAGCAAATTCGGCGCTGATGTCGAGATCGGCTGCCCAGCCCATTATTTCCGCGCTGGCGCGCTGCAGGATGGACGCCAAAGCGTTATCGCGAAAATGGTCTATCACGCTCGCAGTGCCTAAATTGGGATGAACGCGCAGCAATTCAACCACAGCGCGTAGCGCGTCTGCATCCGGGTCAGTTGGGGCAATCAAACTGGCGTCAAGTTCCCGTGCCAGGCTGGGCATAAACAGAATCGCACGCAGCAGCCAGTGCCAAATGGAAGCGGGTGCCTGGCGTGGGGCACGGACAGGTGCACGCGCCGGGTTGAATTGCCGGGATTTGATCTGCCACAAGCTGTCGAGTTCGGACAGCTGCAGATTTGCCAGTTCGGCGCAACGTTTGCGCAGCAGGAGTGCCAGGGCGGGTGCGCGTATCTGGCTTAACAGGGGGTGCGCAGCCTGCAAAAAGGCACTGCGTCCTTCGCTGCTGGCCAGGTCATGCTGGGCTGCCAGCTCCTTGAACAGATAGGCAGATAGCGGTACCACCTCGCCGCCCAGCAGCGCTTCGAAAGCGCCTTTGCCAAATGCGCGAATATAGCTGTCCGGGTCGTGTTCCGGCGCCAGAAACAAAAACCCGACACGACTGCCATCGCTCAGTATGGCCAGGCTGTTTTCCAATGCGCGCCAGGCGGCGTGCCGTCCGGCGGCGTCGCCATCAAAGCAGAACACGAGCTCATCAGTGTGGCGCAGCAGTTTTTGCACGTGCGCCGCGGTGGTGGCGGTGCCCAGGGTGGCGACGGCGTACTCCACCCCATGCTGCGCCAGTGCCACCACGTCCATATAGCCTTCCACCACAATCACGCGTCCCGCATCGCGGATTGCGCGGCGTGCCTGGAACAGGCCATACAGCTCGTTACCTTTCTGGAATAGCGGCGTTTCCGGTGAATTCAGGTACTTGGGTTCAGCTGCGTCGAGCACGCGGCCGCCATAGCCGATGACATCCCCGCGTTGCCCGACAATCGGGAACATGATGCGGTCGCGAAAACGGTCATAACGCTGCCCGGCGTCGTTAACGATGACCAGCCCGGCTTCGGCCAGCGCCGGATCGGCATACTGGTCGAACACCGCTGCGAGATTCTGCCAGCCGGCTGGGGCATAGCCAAGGCCAAAGCGGGCGGCGATTTCGCCCGTCAAGCCGCGTTTCTTGAGGTAGTCAATTGCGTGCGGTGTTTGTTTGAGTTGCTGGCGATAGAACTGTGCGGCGCGCTGCATGATTTCGACCAGGCTGGCGGCCTGCCTGGCGCGTTCCGGGTTGGCGGCAGGCCCCTCCGGCACGCTTATGCCCATTTGGCCGGCCAGTTCCCTGATGGCGTCCACATAGCCCAGCCCGGCGTATTCCATCAAAAAACCAATGGCGCTGCCGTGGGCGCCACAGCCAAAGCAATGATAGAACTGCTTGGTGGGCGAGACGGTGAAAGAAGGGGATTTTTCGTTATGAAACGGGCAACACGCCTGATAGTTGGCGCCGGCTTTTTTCAGCGGCACACGGCGGTCTATCACCTCCACGATATCCACGCGGTTCAGCAGCGTCTGGATAAAATCCTGCGGGATCATCTTGTTCCCTCGGGGTGGTGACGTAACAACGCCGCTACGCGGCGGACAATCAGCCCGCCAGCTTGGCTTTGATGTGGATGGAAACTTGCGCCATGTCTGCGCGCCCGGCGAGTTGTGTTTTTAGAAGCGCCATCACCTTGCCCATATCCTTGATGCCGGCAGCGCCGGTATTCGTAATGGCCTGGATGATCAGGCTATCTATTTCTTCAGCGGATGCGGCTTGAGGCATGTAAGCTTGCAACACGCCGCTTTCGAATTTTTCGATGTCCGCCAGCTCCTGGCGTCCGGCAGCCTCAAACTGGGTAATCGAATCGCGGCGCTGCTTGAGCATTTTGTCGATCACGGCGATGATTTGTGCATCATCCAGTTCGATACGCTCATCCACCTCGCGTTGTTTGATCGCGGCGAGCAATAACCGAATTGCCCCCAGGCGTGCCGCATCCTTGGCGCGCATGGCGGTTTTCATGTCTTCGGTGATGCGTGCTTTGAGACTCATAAGCTTATTACCGGCTGGAACAGCGCCCGGTTTGATGATTAATACATCTTGGGTGGCAGGGTCTGGCTGCGGATGCGTTTGAAGTGACGCTTGACTGCGGCGGCCAGCTTGCGCTTGCGCTCGGCAGTGGGCTTTTCGTAAAACTCGCGTGCGCGCAGTTCGGTCAACAGACCGGTTTTCTCAACAGTGCGCTTGAAACGACGCATGGCAACTTCAAAAGGCTCGTTTTCCTTGACGCGAATGTTCGGCATGAAATCTTGTCTCCAGGGACGGAAAAAACCTCAATTATAACTGAAAAAGCGTTTTTTTCAAGGCTATGCATTGCCCTGTCCGTGTGGCGCGATTAAACTCCTGCCCATGCTTATTCTGGGAATCGAATCTTCCTGCGACGAAACCGGTATCGCCCTATACGACACCGGGCGCGGACTACTGGCGCACGCACTTCATTCCCAGGTTGCCATGCACGCCGAATATGGCGGTGTGGTGCCCGAGCTCGCCTCACGCGACCATATCCGGCGTGCGCTACCGCTGACCCGCCAGGTACTGGCACAGGCAGGGTGCACGTTGGCCGACATTGACGCGATTGCCTATACCGAAGGTCCCGGTCTGGCTGGCGCGCTGCTGGTAGGTGCCGGCATCGCCCATGCGCTGGGCGTGGCGCTGGGGGTGCCGGTGCTGGGGGTACATCACCTCGAGGGGCATTTGCTCTCGGCGCTGATTTCCGATACGCCGCCGCAATTTCCGTTTGTGGCGCTGCTGGTATCGGGCGGGCATACGCAATTGATGCAGGTCGACAGCGTGGGGCGTTACACCACGCTGGGCGATACCCTGGATGACGCTGCGGGCGAGGCGTTCGACAAGACCGCACAACTGCTTGGTTTGGGCTATCCGGGCGGGGCGGCGTTATCGACGCTGGCGCAGACCGGCGACCCGCAGCGCTTCAAGCTGCCGCGTCCGATGTTACATTCGGGCGACCTCAATTTCAGTTTCAGCGGTTTGAAAACCGCGGTGCTCACACTCACGCAAAAACATCCCGGTCCCGCTGACCGCGCCGACATCGCTGCTGCGTTTCAGCTTGCCATGGCCGAGGTGCTGACGGCCAAATCGCTGGCGGCGCTCAAACAAACCCGATCCAGGCGGCTGGTGGTAGCCGGTGGCGTGGGTGCCAACCGGCAGTTGCGCGAGGCCTTGAACGCAGGCGTTAGCAAACTGGGCGGTGCGGTATTTTTCCCGCGCCTGGAGTTTTGTACCGATAACGGCGCGATGATTGCCTTTGCCGGCGCGATGCGCCTGGTGCATGGCGGGCGTGCCGCAGGGGTGTTTACGGTACGGCCACGCTGGGACTTGCAGGAAATCCCGGCACCCCATAATCATCCGGGCACCGTCATGGCTTAAGATGCGGTGCCATGCCGTGCACCCAGCCAAGCAACAGCTTGCCGCCCGGCAGTTGCGCCAGCACCTCCGGGAACAAAACCAGGCCGAACAACAGCGCCAGCACGCAGATCAATAACAGGGTGGAAAACACGCTCTGCGGGCGTAACACCCCCCAGTAGCGCATGGCGAGGAACATGACATCCTTGCGATGCTCTGACATGCGCCGCCAGCGCAATGCCAGCCGCACCACCAAATACACCGCATAGCCGCCGGTGAGCGCGGCGAACACAATGCGGAACACGCCAAACAAATCAATGCTGCCTGCCACAAAGTGCTGGTACAGCCACGACACAAACCCCACCAGGATGGAAGCGCTTACCGCAATGAGCACATTGAAAAACAGTCGCCGCCGCTCGCGCGACAAATCCTGCTGGCGCTGGATGAAAAAGCTGTCCACTTCCAGCTTAAAGTATTCCACCTTGTCCTGTACCAGCGCAGAAATTTCGCGTTTGGCCACCGCCATGCGTTCATCCAGAACAGCGCCCAGACGATCAGCAGCATAGTCCACCAGCTCGCGCACGTCGTCCTTGGTGAACTGGCGCTGGTTGTGCAATTCGGCAGAAATCTTGTCGAGCTTGGCGTCGATTTCCTGGCTGGCGCCCACGACCACGTCGCGTAATTCCGCACCGACGCTGGCTGCGCCTTCCTTGACCACGCTGCCCAGTTTGTCGCCCGCCAACTCGATGCTGTCTTTCGAGACCTGGGCCAGGCTCTCGCGCGCGTAGTTGATTTCTTTTTCAAACCAGGCCATGTCTGTCCCGGTTTATTCGGAAGAGGTGTCCGCCGGTTTGGCGAAGCGCCCTTCTTCACCCGCCAGCAGGCGGCGGATATTGGTGCGGTGACGCCAAAAAATCAGCGCACTGATGATGCATAACGCGACCACAGGCAGGGCGCCACCGATTAAATAGGTACCGAGTACTGGCGCCAGTGTGGCCGCAGTGAGTGCGGCCAGTGACGAGATGCGGGTGAGGACAAACACGACAAGCCAGCTGGCCAGCGTTGCCAAACCCAGCCACGGCGAAATCGCGAGCAGTATACCCAGCGCGGTGGCCACGCCCTTGCCCCCCTTGAAGCCGAAAAACAGCGGATATAAATGTCCGAGGAACACGGCGACCGCAGCGCCGTAAGTTGCGGCAATTTCCACGCCGTATTGGCTGCCAAAATAACGCGCCAGATACACCGCCAGCCAGCCTTTGGCCATGTCGCCGACGAGGGTCAGCAGCGCCGCTGATTTGCGCCCGGTGCGCAGCATGTTGGTCGCACCCGGATTGCCCGAGCCATGCTTGCGCGGGTCAGGCAGGCCGAATAGGCGGCTGACGATAACCGCGAAGGACAGCGAGCCGATCAGGTAAGCCGATACGATGAAAAATGAAATGAACATCAGGGATTCCGCTTAAGATATAGGATTTTCGCGCTTTGACAACTACGCATGGATATTCTGTTTCTCAAGGATTTCAGAGTCGAGCTCATTATCGGTATTTACGAGTGGGAACGCAAAGTACCCCAGCCGGTATTGCTCGACCTGGAAATCGGCCTGCCCAATAGTCGTGCCGGTGAAACCGACAATGTGGCAGATACCATTGACTATGGCCAGGTTGCCGCGCGTATCAGGGCGGCCTGTGCCGCACTGCGCCCAGCCCTGGTAGAGGCGCTGGCAGAGCATGTTGCACAATTGATACGCAATGAATTTGGCGCGCCCTGGGTCAGGGTCACCGTGACCAAGCTCGCCATCGTGCGCGGCGTCAAGGCGCTGGGCATCACCATCGAGCGCGGTCAGCGCGGATGCGTAATGAGTCATATGCAGCCAGCGCGCTGAATTAGCCGCGCGGATGATGCCTGGCGTGCAGCTGTTTCAGGCGCTCGCGTGCGACGTGCGTGTAAATTTGCGTGGTCGAAATATCCGCATGGCCGAGGAGCATCTGTACCACGCGCAAATCCGCGCCGTGGTTGAGCAGATGCGTGGCGAAGGCATGACGCAAAACGTGGGGCGAGGGCAGGCGCGCCAGCCCGGCCTGCTGCGCGCGGCGCTTGATGAGATACCAGAATGCCTGGCGCGTCATGGCCGTGCCGCGCCGGGTGACAAACAGCGCATCGCTGATCGTGCCTGCCAGTATCTGCGGGCGTGCGCCAGTCAGATAGCGCGCCAGCCAGAGCAGTGCCTCTTCACCCAGCGGTACCATGCGCTCCTTGCCGCCTTTGCCCATCACGCTTAGCACGCCCATGTCCAGACTCACATTTGCCACTCTCAGTGTCACCAGTTCGGAGACGCGCAGGCCGCTGGCGTAGAGGATTTCCAGCATGGCCTTGTCGCGTAGCCCGAGTGGCTGTGATGTGTCGGGTGCGTTCAACAGCATATCCACGTCGGCCTCCGACAAGCTCTTGGGTAATGAGCGCGGCAGCTTGGGGGTATCGATTTTCAGTGTCGGGTCCAGCACGATACGGCCATCGCGCAGCGCCAGCCGATAGAAGCGTTTCAGTGCGGAGAGCAGGCGCGCAGTGCTGCGCGGGCTGGTTTTACGCGAAAAACGGTATTGCAGATAGGCCTCGATATCCGCCTGGCCAGCGTCCAGCAACAGCGTGCTGCGCAGTGCCTCCAGCCAGGCCGAGAACTGCGTCAGGTCGCGCCGGTAGCTTTGCAGCGTGTTGGGGGAGAGTCCATCCTCCAGCCACAGCAGGTCGCAAAAGCTGTCAAGCGCCTGCTGCGATACCGGATTCATGGTTGAGTAGCCAGTCCTTGTAAGCCAGCGGCGCCCCGCTTGCGGCGTGCATGAAGCCGCCGCGCCCGTTTGCTGCCACCACCCGGTGGCAGGGGATAATGATGGGCAGCGGATTGGCGCCACAGGCCTGACCCACCGCGCGCGGACTCGAATCCAGCCAACGCGCGAGCTGCCCGTAAGTGGTGGTATGGCCGGGCGGGATCGCAGTCAGCGCGCGCCATACCCTGACTTGATGCGCTGTGCCAAGGATTGCCAGCGGCAGGTCAAAGCTGCTGCCGGGGTTATCAAAATAGTGATTCAAGGCAGCGGCGACGCGGCGTGACAGCGGTGAGTCGGGGGCTTGCAAAGGGTAATCCGCAGGCAAAAAATCGATGCCGTGAAGTTTTTCATTGGCCACACTCATGCCGACACAGCCGAATGGCGTGGGCAGCACGGCCTGATAAGGTGATTGAATCGTGCGGCTTTTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_AP021884|622875:631682|631184_631682_-|WP_147070766.1|DBSCAN-SWA MKSRTIQSPYQAVLPTPFGCVGMSVANEKLHGIDFLPADYPLQAPDSPLSRRVAAALNHYFDNPGSSFDLPLAILGTAHQVRVWRALTAIPPGHTTTYGQLARWLDSSPRAVGQACGANPLPIIIPCHRVVAANGRGGFMHAASGAPLAYKDWLLNHESGIAAGA >NZ_AP021884|622875:631682|630313_631213_-|WP_147070764.1|DBSCAN-SWA MNPVSQQALDSFCDLLWLEDGLSPNTLQSYRRDLTQFSAWLEALRSTLLLDAGQADIEAYLQYRFSRKTSPRSTARLLSALKRFYRLALRDGRIVLDPTLKIDTPKLPRSLPKSLSEADVDMLLNAPDTSQPLGLRDKAMLEILYASGLRVSELVTLRVANVSLDMGVLSVMGKGGKERMVPLGEEALLWLARYLTGARPQILAGTISDALFVTRRGTAMTRQAFWYLIKRRAQQAGLARLPSPHVLRHAFATHLLNHGADLRVVQMLLGHADISTTQIYTHVARERLKQLHARHHPRG >NZ_AP021884|622875:631682|629925_630312_+|WP_147070762.1|DBSCAN-SWA MDILFLKDFRVELIIGIYEWERKVPQPVLLDLEIGLPNSRAGETDNVADTIDYGQVAARIRAACAALRPALVEALAEHVAQLIRNEFGAPWVRVTVTKLAIVRGVKALGITIERGQRGCVMSHMQPAR >NZ_AP021884|622875:631682|626615_627062_-|WP_147070754.1|DBSCAN-SWA MSLKARITEDMKTAMRAKDAARLGAIRLLLAAIKQREVDERIELDDAQIIAVIDKMLKQRRDSITQFEAAGRQELADIEKFESGVLQAYMPQAASAEEIDSLIIQAITNTGAAGIKDMGKVMALLKTQLAGRADMAQVSIHIKAKLAG >NZ_AP021884|622875:631682|628446_629247_-|WP_147070758.1|DBSCAN-SWA MAWFEKEINYARESLAQVSKDSIELAGDKLGSVVKEGAASVGAELRDVVVGASQEIDAKLDKISAELHNQRQFTKDDVRELVDYAADRLGAVLDERMAVAKREISALVQDKVEYFKLEVDSFFIQRQQDLSRERRRLFFNVLIAVSASILVGFVSWLYQHFVAGSIDLFGVFRIVFAALTGGYAVYLVVRLALRWRRMSEHRKDVMFLAMRYWGVLRPQSVFSTLLLICVLALLFGLVLFPEVLAQLPGGKLLLGWVHGMAPHLKP >NZ_AP021884|622875:631682|622875_624762_-|WP_147070750.1|DBSCAN-SWA MANDHEKGEVKTVDVEARRTRLKNLIVLGKERGYLTYAEINDHLPDDMLDAEQIEGVISMINDMGIQVYDEAPDAETLLMSDAAPAVADEDVVAEAEAALSTVDSEFGRTTDPVRMYMREMGSVELLTREGEIEIAKRIEDGLKHMIQAISACPTTIQEILTLVDKVEREEIRIDDFVDGVVAEELEAIASEPEVDISELEEELDEDAEEDDGSALAAANLAQLKIDAMAHFEVIRKVFKRIQATLKKNGFGSPQYLQLQEELSAELMNIRFSAKQVEALCEGLRNLVEEVRSHERDIMEFAVNKSGMPRAHFIKVFPGNESNLDWVTKEINSKKAYSEDLTRYQHTIVERQQRLIALQEKVGIPIKDLKEINRQMSTGEARARRAKREMIEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKMNRISRQILQETGKEPDPELLAEKMEMTEEKIRKILKISKEPISMETPIGDDEDSHLGDFIEDTATLAPIDAAVYGSLQEVTKDILDGLTQREAKVLRMRFGIEMNTDHTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSDRLKSFIDNSGNS >NZ_AP021884|622875:631682|624832_626566_-|WP_147070752.1|DBSCAN-SWA MIPQDFIQTLLNRVDIVEVIDRRVPLKKAGANYQACCPFHNEKSPSFTVSPTKQFYHCFGCGAHGSAIGFLMEYAGLGYVDAIRELAGQMGISVPEGPAANPERARQAASLVEIMQRAAQFYRQQLKQTPHAIDYLKKRGLTGEIAARFGLGYAPAGWQNLAAVFDQYADPALAEAGLVIVNDAGQRYDRFRDRIMFPIVGQRGDVIGYGGRVLDAAEPKYLNSPETPLFQKGNELYGLFQARRAIRDAGRVIVVEGYMDVVALAQHGVEYAVATLGTATTAAHVQKLLRHTDELVFCFDGDAAGRHAAWRALENSLAILSDGSRVGFLFLAPEHDPDSYIRAFGKGAFEALLGGEVVPLSAYLFKELAAQHDLASSEGRSAFLQAAHPLLSQIRAPALALLLRKRCAELANLQLSELDSLWQIKSRQFNPARAPVRAPRQAPASIWHWLLRAILFMPSLARELDASLIAPTDPDADALRAVVELLRVHPNLGTASVIDHFRDNALASILQRASAEIMGWAADLDISAEFADACYQARMQLDKQRGARVTDKPVSALSETEKAALRQLVNARQQVEK >NZ_AP021884|622875:631682|627419_628457_+|WP_147070756.1|tRNA|DBSCAN-SWA MLILGIESSCDETGIALYDTGRGLLAHALHSQVAMHAEYGGVVPELASRDHIRRALPLTRQVLAQAGCTLADIDAIAYTEGPGLAGALLVGAGIAHALGVALGVPVLGVHHLEGHLLSALISDTPPQFPFVALLVSGGHTQLMQVDSVGRYTTLGDTLDDAAGEAFDKTAQLLGLGYPGGAALSTLAQTGDPQRFKLPRPMLHSGDLNFSFSGLKTAVLTLTQKHPGPADRADIAAAFQLAMAEVLTAKSLAALKQTRSRRLVVAGGVGANRQLREALNAGVSKLGGAVFFPRLEFCTDNGAMIAFAGAMRLVHGGRAAGVFTVRPRWDLQEIPAPHNHPGTVMA >NZ_AP021884|622875:631682|627100_627313_-|WP_124706067.1|DBSCAN-SWA MPNIRVKENEPFEVAMRRFKRTVEKTGLLTELRAREFYEKPTAERKRKLAAAVKRHFKRIRSQTLPPKMY >NZ_AP021884|622875:631682|629259_629880_-|WP_147070760.1|DBSCAN-SWA MMFISFFIVSAYLIGSLSFAVIVSRLFGLPDPRKHGSGNPGATNMLRTGRKSAALLTLVGDMAKGWLAVYLARYFGSQYGVEIAATYGAAVAVFLGHLYPLFFGFKGGKGVATALGILLAISPWLGLATLASWLVVFVLTRISSLAALTAATLAPVLGTYLIGGALPVVALCIISALIFWRHRTNIRRLLAGEEGRFAKPADTSSE |
10 | Vibrio_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
751882 : 760457
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP021884|751882:760457|DBSCAN-SWA ATTACGCGCCGACTTCTGCCGTAACGGCCATCGGTGCGACACCAATCGCCGCGCGCACTTTATTTTCAATCTCGTGGGCGACCTCGGGGTGCTCGCGCAGGTATTCGCGCGCGTTGTCTTTGCCCTGGCCGATTTTTTCGCCGTTATAGGCATACCAGGCGCCGGATTTTTCCACCAGTTTGTGCTCCACGCCGAGCTCGATGATTTCCCCCTCGCGCGAGATGCCTTCGCCGTAAAGGATATCGAATTCAGCCTGCTTGAATGGCGGCGCGACCTTGTTCTTGACGACCTTGACGCGAGTCTCGGAGCCGATCACTTCGTCGCCTTTCTTGATTGCGCCGGTGCGGCGGATGTCGAGGCGCACCGAGGCGTAGAATTTGAGTGCATTGCCGCCGGTGGTGGTCTCCGGGTTGCCGAACATGACGCCGATTTTCATGCGGATCTGGTTGATGAAGATCACCAGGGTATTGGTGCGCTTGATGTTGCCGGTGAGCTTGCGCAACGCCTGGCTCATCAGGCGGGCTTGCAGGCCCATGTGCGAGTCGCCCATTTCGCCTTCGATTTCGGCCTTGGGAGTCAACGCCGCCACCGAGTCTATCACCACCACGTCCACCGAGCCGGAGCGCACCAGCATGTCGGCAATTTCCAGCGCCTGCTCACCGGTGTCGGGCTGCGAGATGAGCAGGTCGGAGACATTCACCCCGAGTTTTTGTGCGTATTGCGGGTCGAGCGCGTGCTCGGCATCAATGAACGCTGCGGTGCCGCCCAGTTTCTGCATTTCGGCGATGACCTGCAGGGTGAGCGTGGTTTTGCCGGAGGATTCCGGACCGAAGATTTCAACCACGCGGCCGCGCGGCAAACCGCCGACGCCCAGCGCAATATCTAAGCCCAGGGAGCCGGTGGAGACCACCTGAATATCGCGCACGACTTCGCCATCGCCGAGGCGCATGATGGAACCTTTGCCGAAGCTTTTTTCGATTTGTGCCAGGGCGGCGGCGAGGGCTTTGCTTCTGTTTTCGTCCATGTGTTTTTCCTCAAATTAGTGCGGGATTATGGCATAAACCGTTAGAACCCGTTAGCCACGGGCGAATGGTTTTTGTCTAGTCCAGCAACTCAATCACCCCGCGCAACGCGGCAGCCACGGCGCGCGCGCGAATTTCATCGCGGTCGCCGCATAACCTGCAGGTGGTGGCGAGGCGGGTACCGTCCTGCATCGCCCAGGCTATGCACACGGTGCCGACGGGTTTTTGCGGGGTGGCGCCGCCGGGCCCGGCAATGCCGGAGATGGCCAGGGCGATCTGCGCGCGGCTGTGGGCGAGTGCGCCCTGCGCCATTTCCAGTACGGTGGGTTCGGATACCGCGCCCGATGCTTGCAGCGTGGCGTTTTTCACCCCCAGCATGTCGTGTTTGGCGGCGTTGCTGTAGGTGATGAAGCCGCGCTCATACCAGGCCGAGCTGCCCGGCACGGCGGTGATCAGCATGCCCGCCCAGCCGCCGGTGCAGGACTCGGCGCTGGCGAGCATGATGCCGCGCCGGCTGAGGGCCTGGCCGGTTTGTTCGGCCAGTTGGTAGAGTGCTGCGTCGGTCGGTCTCATGGCAGAATTTTTTGCGCGAGGAACAGTGCCAGCAAAGTGTAGCCAGCCGCCAGCAAGTCGTCGAGCATGACGCCGAAGCCATTTTTCAGGCGCGCATCGAATTGGCGGATGGGAAACGGCTTCCAGATATCGAACAGCCGGAACAGGCCAAAGGCAGCGGCGACCCACAGCGGCGTTTGCGGGGTGGCAGCGAGCACGATCCAGAACGCGGCAATCTCATCCCAGACGATGCCGCCGTGATCCGCCACGCCCAAATCGCGTCCGGTTTTGCCGCAAATCCAGATGCCGGCGACAACCGCGATGCCGATGATGAGGTACAGCTGGGTCGGCGTGGCGAACCAGGCCAGCAGGTAGTACAGCGGCAGCGCGGCCAGGGTGCCAAACGTGCCCGGCGCCCTGGGTGCGAGTCCGCTGCCCAGACCAAAGGCGAGAAAATAGGCCGGGTGGCGGGTGATGAAACGCCAGTCAGGCGGAAAAGTGGTCATAGCCGGTGTGGCGCATATCCAGAACTTGTCCTTGTGCATCGCGCACTATCAGGCCTGGCTCGGCGCGAATGCTGCCGATGGCAGTGAGCCGCACGCCGAGGCGGGCGGCGATTTCGCCGAGTGCCTTACGGTGTGCGACGGGTGCAGTGAAACACAGCTCGTAGTCGTCACCGCCGCTCAGTACGCAGGCATCAAATTCCGGATGTGCGGCATAGTCATGGACGATTTCACCCAATGGCAAATGCGTATATTCAACGATGGCACCTACACCGGAGCGCGCCAGAATATGCCCCAAGTCAGCCAGCAGGCCATCCGACACATCGATTGCGCTGCGCGCCAGCCCGCGCAAGGCCAGGCCCAGTTCGACACGCGGCGTGGGGGTATATAGGCGCGCTGCCAGGGTGATCAGATCGGCGTCAGTCAGATTGACCCGGCCGTGCAGGGCGGCAAGTGCCAGTGCTGCGTCACCCAGCGTGCCGGATACCCAGATTTCATCGCCCGCCTGAGCGCCGTCGCGACGCAGGGCCTGGTTGGGCGGCACCTCGCCCAGGATAGTGAGGGTGAGGCTCAATGCGCCGCGCGTGGTGTCGCCACCCACCAGGCTCACGCCAAATTGATCGGCACAGCGATACAGTCCCGTTGCGAACGCCGCCAGCCAATCATCGTCTACTTCCGGCAGTGTCAGCGCCAGCGTCGCCCAGCGCGGCGCGGCACCCATCGCGGCGAGATCGGAGAGATTGACCGCGAGGCTTTTCCAGCCGAGCTTTTCCGGGTCGGCATCGGCGAAAAAATGCACATCGGCGACCAGGGTGTCGGTGGAAACGGCGAGCTGCATCCCGGTTGCGGGTTGCAGCAGGGCGCAATCGTCGCCCACTCCCAGCACCGCGCCGGGTGTGGCGCGGGAGAAATGACGCTGAATCAGGCCGAATTCGGAAGTCATGGATGTTGCATTGCCATCCTGGCGGTTAGCTTGCGGTTTTCTGGGTCATCAACCCTGGCGGCGCGGCGCGTTCACTTCCACTGTGCGCACCTCGGCAGCCAGCTTGTCCATCACGCCGTTGACGTATTTGTGGCCATCGGTACCGCCGAATATCTTGGCGAGTTCGACCGCTTCGTTGATGACGACGCGATACGGCACTTCCAGATGATGCAGCAGCTCCTGCGTGCCCAGCAGCAGAATGGCATGCTCCACCGGGCTCAGTTCCGCAGGCTTGCGATCCAGATAGGCAGCGAGCCGCAGGTCCAGCGCCGGTGCTTCATCGACCACGCCATTCAGTAGTGCCAGGAACATTTTTTCATCAATGTTGCGATACACGGGATCGTCGCGCAGCTGTTTGACGATATCGGCAGTCGGCTGATGGTTGAGCAGCCACTGATACACGCCCTGCACTGCGAATTCGCGGGCCTTGCGACGATTGCCGCTCATAGCTGTTTGAGCAGGTTGGCCATTTCGATCGCGCATTCGCCCGCTTCGGCGCCTTTTACCGACATGCGCGAGGTGGCCTGGTGATCGGTATCGGTGGTCAATACGCCATTGGCAATAGGCACGCCGGTATCGAGCTGGATACGCGCGATACCGTTGGCCATTTCGTTGGCAACCACCTCGAAATGATAGGTATCGCCGCGCACCACGGCGCCCAGCGCCACCAGCGCGTCGAATTTTCCGCTCATGGCCATTTTGCGCAGCGCCAGCGGAATTTCCAGCGCACCCGGCACGGTGGCGAGGAGCAGGTTGCCGGTTTTCACGCCGCGTTTGCCAAGTGCGGTGGTGCAAGCCGCCAGCAAACCCTCGCAAATGTCCATGTTGAAGCGGCTCATGACGATGCCGATACGCAATGCGCTGCCGTCGAGACTGGATTCAAGTTCGGGAATATCGTCGTAGCTTGCCATGATTTATTTCTCCTGATGAGACTGGATTGCATGTATTGCCTGTGCAAGGCGGGGGCGGGTTCAGGTGTTCTCGTCGTAGCCTGTCACTTCCAGATCGAATCCCGCCAGCGACGGCATTTTGCGCTGAGTAGCCAGCAGGCGCATTTTGCCCACGCCGACGTCTTTCAAAATCTGCGCGCCAATGCCATGGTTGCGCGCGTCCCATTTTTGCGGAAGCCTGACACCGGCTTCAGGCATAGCGCGGGTGAGCAGTTCTGCTGCGCTCTCCGGACGGTGCAGCAGGACGACAACGCCCTTGCCCACCGCAGCGATTTTTGCCAGGGCCTGGTTGACACTGTAGGCATGGGTACGGCTGCCGACTTCGAGCATGTCCATCACCGACACTGGCTCGTGTACCCGCACCAGCGTTTCGCTGGCGGCGCTGATTTCGCCCTTGACCAGGGCCAGATGAGCTGCCCCGGAGATTTTTTCACGGTAGGCGATGAGCTGGAATTCGCCGTAAACCGTCTCGATACAGCGGCTGCCTGCACGCTCCACCAGGCTTTCATTGTGGCTGCGGTAGTGGATCAGGTCGACGATTGCGCCAATTTTCAGGCCGTGAATTTTGGCGTATTCCAGCAAATCCGGCAGACGCGCCATGGTACCGTCATCCTTGAGGATTTCGCAAATCACTGCGGCAGGTTCCAGGCCGGCCAGCCCGGCCAGATCGCAGCCTGCCTCGGTGTGGCCGGCACGAATCAGCACGCCGCCGGGTTGGGCGCGCAGCGGAAAAATGTGGCCCGGCTGGATGATGTCGGCGGCCTTGGCGTGTTTGGCGACGGCGGCCTGAATGGTAAGCGCCCGGTCGGCGGCGGAAATGCCGGTGGTAACCCCTGTGGCGGCTTCGATGGAGACAGTGAAAGCAGTGCTATAGGGCGTCTGGTTATCGGCCACCATCTGGCGCAAGCCCAGTTGCTTGCAGCGCTCGTCGGTCAGCGTCAGGCAAATCAGGCCGCGTCCGTGCTTGGCCATGAAGTTGATCGCTTCGGGGGTGGCGAATTCGGCCGCCATCACCAGATCGCCCTCGTTTTCGCGGTCTTCCTCGTCCACCAGTACGACCATTTTTCCGGCTTTCAGGTCGGCGATGATGTCTTCGATAGGGCTCAGGCTCATGTTTGATCTCGATAATTAAGGATGCGTTCGGCGTAGCGCGCCATCATGTCCACTTCCAGATTGACCCGGCTGGCAGGTTGGAGCGTATGCAGATTGGTATGTTCCAGCGTATGCGGAATAAGGTTGATGCTGAAACGGTCGCCGTCTACGCGATTAACGGTGAGGCTCACACCGTTGACGGTGATGGAACCTTTGCTGACGACAAAGCGGGCCAGATCGCCGGGAGCTCGGATGACCAGTTCGAAGCAATCCCCGGCCGGCGCGAAATGCAGCACCTCACCCACCCCGTCCACATGGCCGGAAACCAGGTGTCCGCCGAGACGGTCGGACAGGCGCAGCGCCTTTTCCAGATTGACCAGGCCGTGTTCAGGAAAGCCCGTCGTACAGCGGAAAGTCTCTGCCGATACGTCTACCGAGAAGCTCGCCGCACCCAGCGCCACGACGGTCAGGCACACACCGTTGCAGGCGATGGAGTCACCTGGCGCCACGTCGCTCAAATCCAGATGGGCGGCGTCGATCACCAGGCGTGCATCCGCGTTTCTGGGTTCCACTGCCGCCACCTTGCCTACTGCCTGAATAATGCCTGTAAACATGTTAGTTCTTTTCAAATTGTGCAGTGATGCGTATATCGGCACCGACCTGACGTATGTCGCGCAGCACCAGTTTGCGGCGCTCTTGCATCTGCGCCGGTTCGGCCAGGGAAAACAGGCCACGCGCCGTGTCACCCAGCAATACCGGGGCGACATACATCACCCATTCATCCACAAATCCAGCCGCGATCAGCGCGCCGTTGAGTGTGGCACCGGCTTCGGTCATCACTTCATTTATCCCGCGCTGCGCCAGGAGTGACAATAGCGCCCCCAGATCAACCTGCCCGGCATCACCCGGCAGACAGCGGATTTCTGCCCCGGCGGCTTCCAGTCCGGCGCTGCGCCGGGCGTCCGGTTCGGCACAGGCAATCAGGGTCGGGGCACCGCCCAGTATATTCGCCGAGGGCGGGGTTTGCAGGCGCGAATCGACGATGACCTTGAGCGGCTGGCGCGTGGTTTCCACTGCGCGCACATTCAGTTCCGGATTATCCGCCAGCACCGTACCGATACCGGTCAGAATGGCGCATGAACGCGCACGCAGCCGGTGCACGTCGCGGCGCGCAGGCTCTCCGGTGATCCATTTGCTGGCACCGCCGGATAAGGCCGTTTTGCCATCCAGCGAACTGGCGGTCTTGATGCGCAGCCACGGATGCCCCATGACCATGCGCTTGATAAAGCCCGCGTTGAGTTCACGCGCCTGCGCTTCGAGCAGGCCGCATTCAGTCGCAATCCCGGCCTTTTGCAGCAGGGCCAGACCATTGCCCGCCACCTGGGGGTTGGGATCCTGCATGGCGGCCACGACGCGTGCCACGCCTGCGGCTATCAACGCCTCGGCGCAGGGCGGGGTGCGCCCATGGTGGCTGCACGGCTCAAGCGTGACGTACACCGTGGCACCGCGCGCTGCGTCGCCGGCCTCGCGCAGCGCGTGGATTTCGGCATGCGGCTGTCCGGCCTGCTGGTGCCAGCCGCTGCCGACCATCGCTCCATTCCTGACAATCACACAGCCCACGCGCGGATTCGGGCTGGTAGTGTACAGGCCGTGCTCCGCAAGCTGCAGCGCGCGCGCCATATGGATGTAATCTGTCTGCGAAAACACAGGTTTATTTGTCGAAGTCCTTGAGCACGTCGCGGAAGTCGCCCACATCCTGGAAGCTCTTGTACACTGAGGCAAAGCGGATGTAAGCGATCTTGTCCAGGCGCTTCAACTCGTTCATCACCATCTCGCCAATCTGGCGCGCGGGCAATTCACGTTCGCCCAGCGACAGCACTTGTTTGACGATGCGCCCAATCGCCGCATCCACATATTCGGTCGGTACCGGGCGCTTGTGCAGCGCGCGGCGAAAGCCCTCGTGCAGTTTTTCCTGGCTGAATTCCTGGCGCACGCCGTTGCTTTTAATCACCTGCGGCAGGCGCAGTTCTATGGTTTCGTAGGTGGTAAAGCGCTTGTCGCAAGACGTGCAGCGGCGCCGGCGACGAATCGAGTCGCCGGCTTCGGAAAGGCGCGAATCAACGACCTGGGAGTCGAACGCGCTGCAGAACGGACATTTCATGAAGACGGTTTGTAGCGGGTAAGGTAATAGTCAGCAGCAGGCAGCCTGGTTGGGCGGCACGCTACCCGTTCATCCGTACACCGGATATTTCTTGCACAAGGCTTGCGCGGCAGTGGCCGCGCGCCCAATGACCGCCTCGTCATTGGGCGCGTCGAGCACATCGGCAATCAGGTGCGCCAGTTGCTCGGCTTCCAGTTCCTTGAAACCGCGTGTGGTCATTGCCGGCGTGCCGATGCGGATGCCGGAGGTGACGAAGGGTTTTTGCGGATCGTTGGGGATGGCGTTTTTGTTGACCGTGATGTGCGCCCGTCCGAGCGCGGCTTCTGCCTCCTTGCCGGTAATGCTTTTGGCCTGCAAGTCCACCAGAAACAGATGCGAATCGGTGCGGCCGGAGACAATGCGCAGGCCGCGTTCCTGCAGCACCTTCGCCATCACGCGGGCGTTATCGATCACCTGCTCCTGGTACAACTTGAAGTCCTTGCCCATTGCCTCCTGAAACGCCACTGCCTTGGCGGCGATGACGTGCATCAGCGGACCGCCCTGCAAGCCCGGGAAGATGGCGGAATTGATCGCCTTTTCGTGTTCGGCTTTCATCAGGATAATGCCGCCTCTGGGACCGCGCAGCGTCTTGTGCGTGGTGGAGGTTACCACATCCGCATGCGGTACCGGATTGGGATACACCCCCGCCGCGATCAGTCCGGCGTAGTGCGCCATATCCACCATGAAAATCGCGCCGACTTCCCTGGCTATTTTGGCAAAGCGCTCGAAGTCGATGTGCAGCGAATACGCCGAGGCGCCAGCGATGATGAGTCTGGGCTTATGCTCACGGGCAAGCGCTTCCATACGCGGGTAATCGATTTCTTCTTTTTTATTCAGGCCGTAGGCCACGGCGTTGAACCACTTGCCCGACATGTTGAGCGCCATGCCGTGGGTGAGGTGTCCGCCTTCAGCCAGGCTCATGCCCATGATGGTATCGCCCGGCTTGAGGAAGGCCAGGAATACCGCCTGGTTGGCCTGCGAGCCAGAATGCGGCTGCACGTTGGCGGCTTCCGCACCGAATAATTTCCTGATACGGTCAATTGCCAGTTGTTCGGCGATATCCACATATTCGCAGCCGCCGTAGTAGCGCTTGCCGGGATAGCCTTCCGCGTATTTGTTGGTCAGCACCGAACCCTGGGCTTCCATTACCGCCGGACTGGCATAATTTTCCGAAGCGATCAGCTCGATGTGATCTTCCTGGCGGCCGCGTTCTGCCTCCATGGCTTTCCAGAGAGCGGGATCGGTTTGGGCGAGAGTGTGCTGAGGGTTAAACAT
Protein sequences of DBSCAN-SWA_2 >NZ_AP021884|751882:760457|757597_758737_-|WP_147074502.1|DBSCAN-SWA MWATSATCSRTSTNKPVFSQTDYIHMARALQLAEHGLYTTSPNPRVGCVIVRNGAMVGSGWHQQAGQPHAEIHALREAGDAARGATVYVTLEPCSHHGRTPPCAEALIAAGVARVVAAMQDPNPQVAGNGLALLQKAGIATECGLLEAQARELNAGFIKRMVMGHPWLRIKTASSLDGKTALSGGASKWITGEPARRDVHRLRARSCAILTGIGTVLADNPELNVRAVETTRQPLKVIVDSRLQTPPSANILGGAPTLIACAEPDARRSAGLEAAGAEIRCLPGDAGQVDLGALLSLLAQRGINEVMTEAGATLNGALIAAGFVDEWVMYVAPVLLGDTARGLFSLAEPAQMQERRKLVLRDIRQVGADIRITAQFEKN >NZ_AP021884|751882:760457|756999_757596_-|WP_147074503.1|DBSCAN-SWA MFTGIIQAVGKVAAVEPRNADARLVIDAAHLDLSDVAPGDSIACNGVCLTVVALGAASFSVDVSAETFRCTTGFPEHGLVNLEKALRLSDRLGGHLVSGHVDGVGEVLHFAPAGDCFELVIRAPGDLARFVVSKGSITVNGVSLTVNRVDGDRFSINLIPHTLEHTNLHTLQPASRVNLEVDMMARYAERILNYRDQT >NZ_AP021884|751882:760457|758693_759143_-|WP_147074501.1|DBSCAN-SWA MKCPFCSAFDSQVVDSRLSEAGDSIRRRRRCTSCDKRFTTYETIELRLPQVIKSNGVRQEFSQEKLHEGFRRALHKRPVPTEYVDAAIGRIVKQVLSLGERELPARQIGEMVMNELKRLDKIAYIRFASVYKSFQDVGDFRDVLKDFDK >NZ_AP021884|751882:760457|753472_753961_-|WP_147074508.1|DBSCAN-SWA MTTFPPDWRFITRHPAYFLAFGLGSGLAPRAPGTFGTLAALPLYYLLAWFATPTQLYLIIGIAVVAGIWICGKTGRDLGVADHGGIVWDEIAAFWIVLAATPQTPLWVAAAFGLFRLFDIWKPFPIRQFDARLKNGFGVMLDDLLAAGYTLLALFLAQKILP >NZ_AP021884|751882:760457|752981_753476_-|WP_147074509.1|DBSCAN-SWA MRPTDAALYQLAEQTGQALSRRGIMLASAESCTGGWAGMLITAVPGSSAWYERGFITYSNAAKHDMLGVKNATLQASGAVSEPTVLEMAQGALAHSRAQIALAISGIAGPGGATPQKPVGTVCIAWAMQDGTRLATTCRLCGDRDEIRARAVAAALRGVIELLD >NZ_AP021884|751882:760457|755383_755851_-|WP_147074505.1|DBSCAN-SWA MASYDDIPELESSLDGSALRIGIVMSRFNMDICEGLLAACTTALGKRGVKTGNLLLATVPGALEIPLALRKMAMSGKFDALVALGAVVRGDTYHFEVVANEMANGIARIQLDTGVPIANGVLTTDTDHQATSRMSVKGAEAGECAIEMANLLKQL >NZ_AP021884|751882:760457|753941_754901_-|WP_147074507.1|DBSCAN-SWA MTSEFGLIQRHFSRATPGAVLGVGDDCALLQPATGMQLAVSTDTLVADVHFFADADPEKLGWKSLAVNLSDLAAMGAAPRWATLALTLPEVDDDWLAAFATGLYRCADQFGVSLVGGDTTRGALSLTLTILGEVPPNQALRRDGAQAGDEIWVSGTLGDAALALAALHGRVNLTDADLITLAARLYTPTPRVELGLALRGLARSAIDVSDGLLADLGHILARSGVGAIVEYTHLPLGEIVHDYAAHPEFDACVLSGGDDYELCFTAPVAHRKALGEIAARLGVRLTAIGSIRAEPGLIVRDAQGQVLDMRHTGYDHFSA >NZ_AP021884|751882:760457|754949_755387_-|WP_147074506.1|DBSCAN-SWA MSGNRRKAREFAVQGVYQWLLNHQPTADIVKQLRDDPVYRNIDEKMFLALLNGVVDEAPALDLRLAAYLDRKPAELSPVEHAILLLGTQELLHHLEVPYRVVINEAVELAKIFGGTDGHKYVNGVMDKLAAEVRTVEVNAPRRQG >NZ_AP021884|751882:760457|759212_760457_-|WP_147074500.1|DBSCAN-SWA MFNPQHTLAQTDPALWKAMEAERGRQEDHIELIASENYASPAVMEAQGSVLTNKYAEGYPGKRYYGGCEYVDIAEQLAIDRIRKLFGAEAANVQPHSGSQANQAVFLAFLKPGDTIMGMSLAEGGHLTHGMALNMSGKWFNAVAYGLNKKEEIDYPRMEALAREHKPRLIIAGASAYSLHIDFERFAKIAREVGAIFMVDMAHYAGLIAAGVYPNPVPHADVVTSTTHKTLRGPRGGIILMKAEHEKAINSAIFPGLQGGPLMHVIAAKAVAFQEAMGKDFKLYQEQVIDNARVMAKVLQERGLRIVSGRTDSHLFLVDLQAKSITGKEAEAALGRAHITVNKNAIPNDPQKPFVTSGIRIGTPAMTTRGFKELEAEQLAHLIADVLDAPNDEAVIGRAATAAQALCKKYPVYG >NZ_AP021884|751882:760457|751882_752905_-|WP_147074510.1|DBSCAN-SWA MDENRSKALAAALAQIEKSFGKGSIMRLGDGEVVRDIQVVSTGSLGLDIALGVGGLPRGRVVEIFGPESSGKTTLTLQVIAEMQKLGGTAAFIDAEHALDPQYAQKLGVNVSDLLISQPDTGEQALEIADMLVRSGSVDVVVIDSVAALTPKAEIEGEMGDSHMGLQARLMSQALRKLTGNIKRTNTLVIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRTGAIKKGDEVIGSETRVKVVKNKVAPPFKQAEFDILYGEGISREGEIIELGVEHKLVEKSGAWYAYNGEKIGQGKDNAREYLREHPEVAHEIENKVRAAIGVAPMAVTAEVGA >NZ_AP021884|751882:760457|755911_757003_-|WP_147074504.1|DBSCAN-SWA MSLSPIEDIIADLKAGKMVVLVDEEDRENEGDLVMAAEFATPEAINFMAKHGRGLICLTLTDERCKQLGLRQMVADNQTPYSTAFTVSIEAATGVTTGISAADRALTIQAAVAKHAKAADIIQPGHIFPLRAQPGGVLIRAGHTEAGCDLAGLAGLEPAAVICEILKDDGTMARLPDLLEYAKIHGLKIGAIVDLIHYRSHNESLVERAGSRCIETVYGEFQLIAYREKISGAAHLALVKGEISAASETLVRVHEPVSVMDMLEVGSRTHAYSVNQALAKIAAVGKGVVVLLHRPESAAELLTRAMPEAGVRLPQKWDARNHGIGAQILKDVGVGKMRLLATQRKMPSLAGFDLEVTGYDENT |
11 | Staphylococcus_phage(42.86%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
882982 : 892137
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_AP021884|882982:892137|DBSCAN-SWA CCTAGGCGGGCATCAGCACGGTCAGCCCCCCCATGTAGGGACGCAACACTTCGGGAAGCGCCACCGATCCATCCGCCTGCTGATGATTTTCCAGAATCGCGACCAGGGTGCGCCCTACCGCCAGCCCGGAGCCGTTCACGCTGTGCAGCAGTTCCGGCTTGCCTTTTTCACCTCTGAAGCGCGCTTGCATGCGGCGTGTCTGAAACGCCTCGAAATTGCTGCACGAGGAAATTTCGCGATAAGTATTTTGCGCCGGCAGCCACACTTCCAGATCGTAGGTCTTGGCGGCAGAAAAACCCATGTCGCCGCCGCACAGCGCCATTTTCCGGTAGGGCAGCCCGAGTGCTTGCAGGATGGCCTCGGCATGGCCGGTGAGTGCTTCCAGCGCGGTGTAGGATTGCTCCGGTTCGACCAGTTGCACCAGCTCCACCTTGTCGAACTGATGCTGGCGGATCATGCCGCGGGTGTCGCGGCCGTAGGAACCGGCCTCGGAGCGGAAGCAGGGGGTGTGGGCGACGAATTTCAGCGGCAGTTGCTCGCGCGCCACGATCGCGTCGCGCACCATGTTGGTCAGCGGCACTTCGGCGGTGGGGATGAGGTAGAGTTTTTCCGCATCCGCACGCGGCACGTGAAACAGATCTTCCTCGAACTTGGGCAACTGCCCGGTACCGCGCATGGAGTCGGCGTTGACCAGATACGGCACATACACTTCGGTGTAGCCATGCACAGCCGTGTGGGTGTCCAGCATGAACTGCGCCAGCGCCCGGTGCAGCCGCGCCAGCCCGCCGCGCAGCAGTGAAAAGCGCGCGCCGGCCAGTTTGCTGGCGGTCTCGAAATCCAGCCCCAGCGCGGTACCTACGTCCACGTGATCTTTCACCGCAAAATCAAACACGCGCGGTGTGCCGACACGTGCGATCTCTACGTTGTCGGCGTCGGATTTACCCGTCGGCACCGATGCATGCGGCAGGTTGGGAATGGTCATCAACAGCGCATTGAGCCGGGCTTGCAGGGCTTCCAGCGCGGATTCCGCGGCTTTCAGTTCGGCGCCGAGATTCGCCACTTCCGCCATGATGGTGGAGACATCCTCGCCCTTGGCCTTGGCCATGCCTATCTGTCTGGAGCTGGCGTTGCGCCTGGCCTGCAGCTCCTGGGTGCGGGTTTGCAGTTGTTTGCGCTCGGCTTCCAGGCGCTGGAATTCGGCAGTGTCCAGGGTGTAGCCGCGCATGGCAAGGCGTTGCGCCACGTCGTCGAGGTCGTTGCGGAGGTGTTGAATGTCTAACATTATTTTTGCCCTGTTTTCTTGTTGGCTTGTTCATCCAGCTTGCGCAAATACGCCAGCCGTTCGGCGATCTTGCCTTCCAGCCCGCGCGGGGTTGGTGCGTAAAAGTGCGCGTTGACACCTTCCGGTAAATAGTCCTCCCCGGCGGCGTAGGCGTCCGGTTCGTCGTGCGCGTAGCGGTAGGCGTGGCCGTAGCCCAGTTGTTTCATCAGTTTGGTGGGGGCGTTGCGCAGGTGCACCGGCACCTCGCGCGATTTGTCCGCCGCCACAAAGCTGCGGGCGTTATTGTACGCCACGTACACGGCGTTGCTCTTGGGCGCGCAGGCGAGATAAATCACCGCCTGCGCCAGCGCCAGTTCGCCTTCCGGACTGCCCAGGCGCTGGTAGGTTTCCACCGCGTCCAGGGTCAGGCGCAGCGCGCGTGGGTCGGCCAGACCGATGTCCTCGCTGGCCATGCGGATCAGGCGGCGGCCGACGTACAGCGGATCGGCACCGCCGTCGAGCATGCGCACCATCCAGTACAGCGCGGCGTCGGGATGGGAACCGCGCACCGATTTGTGCAGCGCGGATATCTGGTCGTAGAAGTTGTCGCCGCCCTTGTCAAAACGGCGTGCGCCGCGCGCCAGCGTGGTCTGGATGAAATCCTCGTCAATCTCATGACGGGCGGCATCGAGTGCGGCATTGGCGGCTTGTTCCAGCAGGTTCAACAGGCGCCGCGCGTCGCCGTCGGCGTAGCCGGTGAGCTGGGCGCGCGCCGCCCCGGTGATGGCAATGTCCGGATAGGTGCTGATCCGCGCGCGCTCCAGCAGGGCGGCGAGGTCGGTTTCCACGATGGGCTTTAACACATACACCTGGGCGCGCGAGAGCAGCGCGCTATTGACCTCGAACGAGGGATTCTCGGTGGTGGCGCCGATGAAAGTAATCAACCCGGCTTCAACGAACGGCAGGAAAGCGTCCTGCTGGGATTTGTTGAAGCGGTGCACTTCGTCCACAAACAGCAGGGTGCGGCGCCCCTGGCCTTGCATCATTTCGGCGCGCGCCACCGCCTCGCGGATTTCCTTGACCCCCGAGAGCACGGCGGACAGCGCGATGAACTCCATGTCAAAACCGTGGCTCATCAGCCGTGCCAGTGTGGTCTTGCCTACGCCCGGCGGCCCCCACAGGATCATCGAGTGTGGCTTGCCGGATTCGAATGCGACGCGCAGCGGCTTGCCGGGGCCGAGCAAATGCGTCTGTCCGATCACTTCGTCCAGATTGCGCGGCCGCAGCCGTTCGGCCAGCGGCGCGCTGTCCAGCGGATGATCGAACAGGTCGGCGTGGCTCACTTGCCGTCGGAGATGACGTCCACGCCCTGGGGCGGGGTGAAATGGAAATCGCTGGCGGGAAGGGCCGGGTTACGTTCCAGCCCGGCGAATTTCAGCACCGTGGTCTGGCCGAAGTTGTCCTTGATCTCCATCGCCACCAGCGTGTTTTTGCTGAATCCCATGCGCACGTTCTCAAACGCGCTTTCCTTGTCGCGCGGGCGCGCGTCCAGCCATTCCAGACCGTCGCGGCTGCCGGCGTCCGTGATCGTGTAAAACCTGCCGATGTCCTTGCTGCCCGCCAGCAAGGCCGCCGGGCTGCTGCCCAGCGCCTGGCCCAGTTTCTTGATGGTGACCTGTTGCAGATCGGCGTCGTACAGCCAGATTCTTTTGCCGTCGCCGACGATTATCTGTTCATAGGGCTTCTCATACACCCAGCGGAACTTGCCGGGACGCGCGAAAGCCATGGTGCCGGACGACTGCTGGCGCGCGTGTCCGTTTTTGTCCAGCACGGTCTGGGTGAACGTGGCGCGCGCGGTCTGGGTATCCGCGACGAACGCCTTGAGCGCGTCGATGCTGGATGCTGCGGCGCTGGCAGAGAAGATCAAGAGTGCGGTAAAGAGACTGAGTTTTTTCATTGGGACTTTAAGTCGAATGGACAGGATTTTTAATGCTTAACAAGAGAGTTGGTCTGGTTTTAGTGCAGAAAATTCAACACCCCCCAAAATTCATATTTATTCTGAATGATCCGGTCCTTATCCTGTTAATCCTGTCTAATCACATTGTTCATTCCCGGTTAGGTGCAATCACTTCGCGGTTGCCGTTGCTCTGCATCGGCGTCACCAGACCCGCCTGTTCCATTGCCTCGATCAGCCGCGCGGCACGGTTGTAGCCGATGCGCAGGTGACGCTGCACGGCGGAGATGGAGGGGCGCCGGGTTTTCAGCACGATGGCGACGGCTTCGTCATAGAGCGGGTCGCTTTCGGCGTCACTGCCGCCGACCGCGCTTTCGCCGCTGCCTTCATTGTCTTCCGGGGTGTCGAGGATGCCGTCAATGTAATCGGGCTCGCCCAGCTGTTTCAGGTATTCCACGACTTTATGCACTTCTTCGTCGGCCACAAAAGCGCCGTGCACGCGCTGCGGATAGCCGGTGCCGGGCGGCAGGTAGAGCATGTCGCCCTGGCCGAGCAGGGCTTCTGCGCCCATCTGGTCGAGGATGGTGCGCGAGTCGATTTTGCTCGATACCTGGAACGCGACGCGGGTGGGGATGTTGGCCTTGATCAGGCCGGTAATCACGTCCACCGACGGGCGCTGCGTGGCCAGGATCAGATGCACCCCGGCGGCGCGGGCCTTCTGCGCCAGCCGCGCAATGAGCTGTTCGACGGCCTTGCCTACCACCATCATCATGTCAGCCAGCTCGTCGATCACCACCACAATCATGGGCTGCTCTTCCAGCGGCTCGGGATTATCCGGGGTCAGGCTGAACGGGTGGGTGAGCGGGGTGGCGGCCCTTTTCGCGTCGCGCACTTTCTGGTTGTAACCAGCCAGGTTGCGCACGCCGAGTGCGGACATCAGCTTGTAGCGGCGCTCCATCTCGGCCACGCACCAGTTGAGCGCGGCGGCGGCCTGACGCATGTCGGTGACTACCGGGGCGAGCAGATGCGGAATGCCCTCGTATACCGACAGCTCGAGCATTTTCGGGTCGACCAGAATCAGTCGCACACGGCTGGCGTCCGCTTTGTACAACAGGGAAAGGATCATCGCGTTGATGGCAACTGATTTGCCCGAGCCGGTGGTGCCCGCCACCAGTACGTGCGGCATTTTGGCCAGATCGGCCACCACCGGGTTGCCGGCTATGTCCTTGCCCATCGCCATTGCCAGCGGGCTGGCCATGGCGTGGTACACCTTGGCGGAAAGGATTTCCGATAACCGCACGATCTCGCGCTTGGGGTTGGGGATTTCCAGCGCCATGGTGGTCTTGCCGGGGATGGTTTCCACCACGCGGATGCTCACCACCGACAGCGCGCGCGCCAGGTCTTTGGCCAGGTTGACGATCTGGCTGCCCTTGACCCCGGAGGCCGGCTCGATCTCGTAGCGGGTGATGACCGGGCCGGGCAGAGCGGCGACCACGCGCGCCTGCACGTTGAATTCGGCCAGCTTGCGCTCGATCAGGCGCGAGGTGAATCCCAGGGTTTCCGCCGACAGGCTCTCCACATGGGCAGTCGCAGCATCCAGCAAATGCAGTGGCGGCAGCGGGGAATCCGGCATTTCGGCAAACAGCGGCACCTGCTTTTCCACCTGCACGCGTTCCGATTTGATAATGGTGGGCGGCGGGGTTTCGATGACCACCGGAGCGCGGTCCATATTGCGGCGCTTTTCTTCCAGTACAATCTCGTCGCGCCGCAGCGTGGCGGCTTGACCCATGCGCCGTTCGCGCCGCGCGGACCAGGTCTCGATTGCCCATAACCAGCTGCGTTCGAGGTATTCCCCGGTGGATTCCACTAGCGCCAGCCAGGAGATGCCGGTAAACAGGCTCAAGCCCGCGGCAATCAGCACCAGCAAGGCCAGCGTGCCGCCGGTAAAGCCCAGCAAATGGGTGACTGCTTCTCCCACAACCGCTCCCAGAATGCCTCCCGGGTGCTCCGGTAGTGCTATGTGCAAGCTATACAGACGCTGCGATTCCAGCCCGGCGCTGGACAGCAGCAGCAACAAAAACCCGGCACTGCGGATATATAACGGGCGGCGGTCGGGCGCCTCGCTAATCTCCAGTTTGCGATATCCCCACCAGGTGGCATACAGCGCCAGCATCACCAGCCACCAGGTGGAGAGCCCGAACAGTGCAAGCAGAATGTCGGCCAGATACGCCCCCAGCGTGCCGCCCATATTGTGCACACTGCCTTGTCCGCTGTGCGACCAACCCGGATCAGACTGGGTGTACGTGAACAGGATGACCGCCAGGAACGCCGCCAGCGCCACGCTCAGCAGCCAGCCGACCTCGCGCAGCAGGCCGGTGAGCCTGGGCGGGAGCGGTTTGCGGACGGATTTGGGCGTATTGCGCCGGGGAATGGACATGGTCAGCAATTTAATCGGAAGGCAAAATTATAACTTGAACCTTGTCCAGTCCGTGACCATCTTCCATACTGTTCCAGTTTTCCAGTTTTCAATCCCGCTACGGAGTACTACATGACCACCCCTCAACATCACCGTCTCATCATCCTCGGCTCCGGCCCCGCAGGCTATTCCGCCGCCGTTTACGCTGCGCGCGCCAACCTGAATCCGGTCGTCATTACCGGCATGGCGCAAGGCGGTCAGCTGATGACCACCACCGATGTGGACAACTGGCCTGCCGACGCCGACGGCGTGCAGGGGCCGGAGTTGATGGCGCGTTTCGAGAAGCACGCGCGCCGCTTCAACACCGAAATTATTTTCGACCACATCCACACTGCCAAACTGACCGACAAGCCCATCGCGCTGGTCGGCGACCAGGGTAGCTACACCTGCGATGCGCTGATTATCGCCACCGGCGCGTCGGCCATGTATCTGGGGCTGGAATCCGAGCAGGCGTTCATGGGCAAGGGCGTATCCGGCTGCGCCACCTGCGATGGATTTTTCTACCGCAATCAGGACGTGGCGGTGATCGGCGGTGGCAACACTGCCGTGGAAGAAGCGCTGTATCTGTCCAACATTGCGCGTCATGTCACCGTGGTGCATCGCCGCGACAAGTTCAAGTCGGAAAAAATTCTCGCCGATCATCTGATGGAGAAGGTCAAGGAAGGCAAGATCAGCGTGGAGTGGAACAGCGAACTGGACGAGGTGCTGGGCGACAAGACGGGTGTGACCGGCATGCGCATCAAGTCCACCGTGGATGGCAGCACCAGGGATATCGCCCTGACCGGCGTGTTCATCGCCATCGGCCACAAGCCCAACACCGATATTTTCACTGGCCAGATCGCGATGGAAGGCGGCTATATCGTCACCCAGGGGGGCAACAAGGGCAATGCCACCGCCACCAGTGTTCCCGGCGTGTTTGCCGCGGGTGACGTGCAGGATCACATCTACCGCCAGGCGGTGACCAGCGCCGGTACCGGCTGCATGGCCGCGCTGGACGCCGACCGCTATCTGGAAAGCCTCGGCAAGTAATCTCCCCATGGCCGGGCGTGTGCCTCCAGCGGACGAAGCCGCACTGTTCAAAGCGGCGGTGCAGGACGCCCAGCCGCTACCCGACCACGGCAAGGTGGAACCGCCCTTGCCGCGCGTTTCCCCTATCCCGCGCCAGCGTATTCGCGATGAGCGTCAGGTCTTGGCCGACAGCCTGTCTGACCACATCGTGTGGGAGGATACCATGGAAACCGGCGAGGAGCTGGTGTTCCTGCGCACTGGCTTGCGCCGCGACACGCTCAAAAAACTGCGGCGCGGGCACTGGGTGCTGCAGGCCGAACTGGATTTGCATGGCCTGGTGAGCGTGGAAGCGCGCCAGGCGCTGAGCGCGTTTATCGCCGGCTGCGGCAAGCGCGGCCTGCGTTGCGTGCGCATCATCCACGGCAAAGGGCTGCGTTCCAAAAACCGCGAGCCGGTGTTGCGCACCAAGGTGAAAAACTGGCTGATGCAAAAAGACGAAGTGCTGGCGTTTTGCCAGGCGCGTGCGGTGGACGGCGGCAGCGGCGCGGTGGTAGTGTTACTCAAGTCTTCATGAAAACTTTTTGGAGGAATGCCATGACCGCAATTACCGAATTCGAACTTCCTTCCACCGGCAACCGAACCTTCAAACTCACCGACATGCGCGGCAAGAAGCTGGTGGTGTACTTCTATCCCAAGGACGACACGCCGGGCTGTACCGTGGAGGGCTCCGACTTCCGCGATTTGTATGCCGGGTTTCAGGCGCACAATTGCGAGATCGTGGGTATTTCGCGCGACGATATGAAATCCCACGAGAAATTCAAGACCAAGCTCAGCCTGCCGTTCGAGCTGTTGTCCGACGCAGACGAAAAAGTGTGCGAACTGTTCGGCGTGATAAAGCTGAAGAACATGTACGGCAAGGAAGTCCGTGGCATTGATCGCAGCACTTTTGTGTTCGACAGCGACGGCAAGCTGGTCAAGGAATGGCGCGGCGTGAAATCCGCCGGCCACGCGCAGGAAGTGCTGGATACCATTAAAACGCTCCAGGAGAAAATCTGAATGCCCCGCAAAGCCGCTGCGCCCACCAAGCTGTTTGTTCTTGATACCAACGTGCTGATGCACGACCCCACCAGCTTGTTCCGTTTTGAGGAGCACGACATATTTCTGCCGATGGGCACGCTGGAGGAACTCGATCACAACAAGAAAGGTATGACCGAAGTGGCGCGCAATGCGCGTCAGGCCAGCCGCTTCCTGGACGAAATCGTATCGGGTTGCGAAGATGCAATCGAAGCCGGGATTCCGCTCAGCAGCCATAGCCGCAAGGCAGCGACCGGACGGCTGTTTCTGCAGACCAGAATGGCGCGCATTGAAACCCCGCTCAGCCTGCCCAACAGCAAGGTGGACAACCAGATTCTCGGCGTGATTCTGAGCCTGCGCGAAGAGCAGCCCAGGCGCCCGATAATCCTGGTGTCCAAAGACATCAACATGCGCATCAAGGCGCGTGCACTGGGCTTACCCGCCGAGGACTACTTCAACGACAAGGTACTCGAAGACACCGACCTGCTCTATGCCGGGGTGCGTGAATTGCCGGAGGATTTCTGGGACAGGCACGGCAAGGGTATCGAGTCCTGGCAGGCTGATGGCCACACCTGGTATCGCGTCAAAGGCCCGCTGGTGACCCGCCTGCTGGTCAACGAATTCGTCTATCAGGAGAGCGCCAGCCCGCTCTACGCCATCGTCAAAACCATCAAGGGCAATGTCGCCGAGCTGCAGACCATCAAGGACTACAGCCACCAGAAAAACAATGTGTGGGGCATCACCGCGCGCAACCGCGAACAGAATTTCGCGCTCAATGTGCTGATGGATCCGGAGGTGGATTTTGTCACTTTATTAGGCCAGGCCGGTACCGGCAAAACCCTGCTCACCCTCGCGGCGGGGCTGATGCAGACGCTGGAGCACAAGCGTTATTCGGAAATCATCATGACCCGCGTGACCGTGCCGGTAGGCGAAGACATCGGTTTCCTGCCCGGCACCGAGGAAGAAAAAATGACGCCGTGGATGGGCGCGCTGGAAGACAATCTCGACGTGCTCAACAAGACCGACGACAGCGCCGGCGACTGGGGACGCGCCGCCACGCAGGACCTGATCCGCAGTCGCATCAAGGTCAAATCGCTCAACTTCATGCGCGGGCGTACCTTCCTCAACAAATACCTGATCATCGACGAGGCGCAGAACCTCACCCCCAAACAGATGAAAACCCTCATCACCCGCGCCGGTCCCGGCACCAAGGTGGTGTGCCTGGGCAACATCTCGCAGATTGATACGCCTTACCTCACCGAGGGCAGCTCCGGCCTGACCTACGTGGTGGACCGCTTCAAGGGCTGGCCCCACGGCGGCCATATCACCCTGGCGCGGGGCGAGCGTTCGCGCCTGGCCGACTGGGCGGCGGAAATGCTATGA
Protein sequences of DBSCAN-SWA_3 >NZ_AP021884|882982:892137|884262_885555_-|WP_147071312.1|DBSCAN-SWA MDSAPLAERLRPRNLDEVIGQTHLLGPGKPLRVAFESGKPHSMILWGPPGVGKTTLARLMSHGFDMEFIALSAVLSGVKEIREAVARAEMMQGQGRRTLLFVDEVHRFNKSQQDAFLPFVEAGLITFIGATTENPSFEVNSALLSRAQVYVLKPIVETDLAALLERARISTYPDIAITGAARAQLTGYADGDARRLLNLLEQAANAALDAARHEIDEDFIQTTLARGARRFDKGGDNFYDQISALHKSVRGSHPDAALYWMVRMLDGGADPLYVGRRLIRMASEDIGLADPRALRLTLDAVETYQRLGSPEGELALAQAVIYLACAPKSNAVYVAYNNARSFVAADKSREVPVHLRNAPTKLMKQLGYGHAYRYAHDEPDAYAAGEDYLPEGVNAHFYAPTPRGLEGKIAERLAYLRKLDEQANKKTGQK >NZ_AP021884|882982:892137|885581_886199_-|WP_147071125.1|DBSCAN-SWA MKKLSLFTALLIFSASAAASSIDALKAFVADTQTARATFTQTVLDKNGHARQQSSGTMAFARPGKFRWVYEKPYEQIIVGDGKRIWLYDADLQQVTIKKLGQALGSSPAALLAGSKDIGRFYTITDAGSRDGLEWLDARPRDKESAFENVRMGFSKNTLVAMEIKDNFGQTTVLKFAGLERNPALPASDFHFTPPQGVDVISDGK >NZ_AP021884|882982:892137|882982_884263_-|WP_147071127.1|tRNA|DBSCAN-SWA MLDIQHLRNDLDDVAQRLAMRGYTLDTAEFQRLEAERKQLQTRTQELQARRNASSRQIGMAKAKGEDVSTIMAEVANLGAELKAAESALEALQARLNALLMTIPNLPHASVPTGKSDADNVEIARVGTPRVFDFAVKDHVDVGTALGLDFETASKLAGARFSLLRGGLARLHRALAQFMLDTHTAVHGYTEVYVPYLVNADSMRGTGQLPKFEEDLFHVPRADAEKLYLIPTAEVPLTNMVRDAIVAREQLPLKFVAHTPCFRSEAGSYGRDTRGMIRQHQFDKVELVQLVEPEQSYTALEALTGHAEAILQALGLPYRKMALCGGDMGFSAAKTYDLEVWLPAQNTYREISSCSNFEAFQTRRMQARFRGEKGKPELLHSVNGSGLAVGRTLVAILENHQQADGSVALPEVLRPYMGGLTVLMPA >NZ_AP021884|882982:892137|890274_890736_+|WP_147071117.1|DBSCAN-SWA MTAITEFELPSTGNRTFKLTDMRGKKLVVYFYPKDDTPGCTVEGSDFRDLYAGFQAHNCEIVGISRDDMKSHEKFKTKLSLPFELLSDADEKVCELFGVIKLKNMYGKEVRGIDRSTFVFDSDGKLVKEWRGVKSAGHAQEVLDTIKTLQEKI >NZ_AP021884|882982:892137|888744_889701_+|WP_147071121.1|DBSCAN-SWA MTTPQHHRLIILGSGPAGYSAAVYAARANLNPVVITGMAQGGQLMTTTDVDNWPADADGVQGPELMARFEKHARRFNTEIIFDHIHTAKLTDKPIALVGDQGSYTCDALIIATGASAMYLGLESEQAFMGKGVSGCATCDGFFYRNQDVAVIGGGNTAVEEALYLSNIARHVTVVHRRDKFKSEKILADHLMEKVKEGKISVEWNSELDEVLGDKTGVTGMRIKSTVDGSTRDIALTGVFIAIGHKPNTDIFTGQIAMEGGYIVTQGGNKGNATATSVPGVFAAGDVQDHIYRQAVTSAGTGCMAALDADRYLESLGK >NZ_AP021884|882982:892137|890736_892137_+|WP_147071115.1|DBSCAN-SWA MPRKAAAPTKLFVLDTNVLMHDPTSLFRFEEHDIFLPMGTLEELDHNKKGMTEVARNARQASRFLDEIVSGCEDAIEAGIPLSSHSRKAATGRLFLQTRMARIETPLSLPNSKVDNQILGVILSLREEQPRRPIILVSKDINMRIKARALGLPAEDYFNDKVLEDTDLLYAGVRELPEDFWDRHGKGIESWQADGHTWYRVKGPLVTRLLVNEFVYQESASPLYAIVKTIKGNVAELQTIKDYSHQKNNVWGITARNREQNFALNVLMDPEVDFVTLLGQAGTGKTLLTLAAGLMQTLEHKRYSEIIMTRVTVPVGEDIGFLPGTEEEKMTPWMGALEDNLDVLNKTDDSAGDWGRAATQDLIRSRIKVKSLNFMRGRTFLNKYLIIDEAQNLTPKQMKTLITRAGPGTKVVCLGNISQIDTPYLTEGSSGLTYVVDRFKGWPHGGHITLARGERSRLADWAAEML >NZ_AP021884|882982:892137|889708_890254_+|WP_147071119.1|DBSCAN-SWA MAGRVPPADEAALFKAAVQDAQPLPDHGKVEPPLPRVSPIPRQRIRDERQVLADSLSDHIVWEDTMETGEELVFLRTGLRRDTLKKLRRGHWVLQAELDLHGLVSVEARQALSAFIAGCGKRGLRCVRIIHGKGLRSKNREPVLRTKVKNWLMQKDEVLAFCQARAVDGGSGAVVVLLKSS >NZ_AP021884|882982:892137|886347_888633_-|WP_147071123.1|DBSCAN-SWA MSIPRRNTPKSVRKPLPPRLTGLLREVGWLLSVALAAFLAVILFTYTQSDPGWSHSGQGSVHNMGGTLGAYLADILLALFGLSTWWLVMLALYATWWGYRKLEISEAPDRRPLYIRSAGFLLLLLSSAGLESQRLYSLHIALPEHPGGILGAVVGEAVTHLLGFTGGTLALLVLIAAGLSLFTGISWLALVESTGEYLERSWLWAIETWSARRERRMGQAATLRRDEIVLEEKRRNMDRAPVVIETPPPTIIKSERVQVEKQVPLFAEMPDSPLPPLHLLDAATAHVESLSAETLGFTSRLIERKLAEFNVQARVVAALPGPVITRYEIEPASGVKGSQIVNLAKDLARALSVVSIRVVETIPGKTTMALEIPNPKREIVRLSEILSAKVYHAMASPLAMAMGKDIAGNPVVADLAKMPHVLVAGTTGSGKSVAINAMILSLLYKADASRVRLILVDPKMLELSVYEGIPHLLAPVVTDMRQAAAALNWCVAEMERRYKLMSALGVRNLAGYNQKVRDAKRAATPLTHPFSLTPDNPEPLEEQPMIVVVIDELADMMMVVGKAVEQLIARLAQKARAAGVHLILATQRPSVDVITGLIKANIPTRVAFQVSSKIDSRTILDQMGAEALLGQGDMLYLPPGTGYPQRVHGAFVADEEVHKVVEYLKQLGEPDYIDGILDTPEDNEGSGESAVGGSDAESDPLYDEAVAIVLKTRRPSISAVQRHLRIGYNRAARLIEAMEQAGLVTPMQSNGNREVIAPNRE |
8 | uncultured_Mediterranean_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1605656 : 1613255
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_AP021884|1605656:1613255|DBSCAN-SWA CATGCAAAATCTCGACACTATTGTTGCCGCAGCGCTAGCCGAATTCGCCGCAGTCAACCAGGCCGTTGAACTGGAGCAGGCAAAAGCCCGCTATCTCGGCAAAGCCGGTTTGCTCACCGGGCAATTGAAACAACTGGGCAAGCTTCCCGCCGCAGAACGCCCGGCAGCGGGCAACGTGATCAATCAGGCCAAGGAACGGATTCAGCAGGCGCTGGAAGCGCGCCGCGCAGCCTTGTCCCGGGCTGAGCTGGATAACAGGCTGGCGGCGGAAACCCTGGATGTGACGCTGCCCGGACGCGGCCTGGGCACAGGCGGCCTGCACCCGGTGACGCGCACGCTGGCACGCATCCAGGCGCTGTTCGCCTCGATCGGTTTCGAGGTGGCGGAAGGCCCGGAGATTGAAACCGATTTCTACAATTTCACCGCACTGAATATTCCGGAAAACCACCCGGCGCGCGCCATGCACGACACTTTCTACGTGGATGACAAACACCTGCTGCGCACCCACACGTCGCCGGTGCAGATACATTATTTGCAGAACAATCAGCCGCCGCTCAAGATCATCGCGCCAGGCCGGGTATATCGCTGCGATTCCGACGTGACCCACACACCCATGTTTCATCAAGTCGAGGGATTGTGGGTGGACGAAGAGGTGAGTTTCGCGGCATTGAAAGGCGTGCTGGCGGATTTCATGCAGCGTTTTTTTGAACGCGATGACCTGAAGGTGCGCTTCCGCCCATCGTTTTTCCCGTTCACCGAACCGTCGGCGGAAATGGATATCGCTTGCGTGATGTGCGGTGGCGGCGGTTGCCGCGTATGCAGCCATACCGGCTGGCTGGAAGTGCTGGGCTGCGGCATGGTGCATCCCAATGTGCTGGGACATGTGCATGTGGATAGCGAAAAATACCTCGGTTTCGCGTTTGGCATGGGGGTGGAACGGCTGGCCATGCTGCGCTACGGTGTGGATGACCTGCGCCTGTTTTTCGCTAATGATTTGCGTTTCCTGAAACAGTTCAACTGAACCATCAAGATGAAATTCTCCGAGCTCTGGTTGCGTACCCTTGTTAATCCTGCGCTGGACAGCGCGGCGCTGTCCCATCTTCTTACCATGGCCGGACTGGAGGTCGAAGCGCTGGACCCGGTCGCGGCGGATTTTTCCGGCGTGGTGGTGGGGCAGGTGCTGTCCGTAGCGCCGCATCCGGATGCCGATCGCCTGCGCGTGTGCCTGGTAGATGCCGGCACTGGCAGCCCGTTGCAGATCGTATGTGGCGCACCCAATGTAAGTGAAGGCGCGCGCGTGCCTTGCGCCCTGGCAGGCGCCCGCTTGCCGGGCTTTGAAATCAAGAAAGCCAAGTTGCGGGGTGTGGAATCGCAGGGTATGTTGTGCTCCGCGCGCGAGCTGGGACTGGCAGAACAAGCCGATGGCCTGCTGTTGTTGCCGAACGACGCACCGGTGGGTAGCAATATCCGCGATTATCTGCATCTGGATGACAGGCTTTATACGCTCAAACTTACCCCCAATCGCAGCGATTGCCTGAGCGTGGCCGGCGTGGCGCGTGAAGTGGCCGCGCTTACCGGCAGTCCATTGAACTTGCCCCGGATTGAACCCGCAGCGGTCACCGGCAGGCTCACCCGCATGGTGCAGGTGACTGCAGGACAAGCCTGCCCGCGCTATTGCGGGCGCGTCATCAGCCAGCTCAATCGCGCGGCTCAAACACCGGGCTGGATGATTGAACGCCTTTCCCGCAGTGGCCTGCGCAGTATCAGTCCGGTAGTGGACATTACCAACTATGTATTGCTGGAGTTGGGACAACCCTTGCATGCCTTTGATCTGGACAAGCTTGCTGGCGATATCCAGGTGCGCATGGCCACGCCGGGTGAAACGCTGACGCTGCTGAATGATCAGCGTGCGACGCTGGAAGCGGACATGCTGGTGATCGCCGATGACAACGGCGCGCAGGCGCTCGCTGGCATCATGGGGGGGGCGGCCACCGCAGTGGATGAAAATACCTCGGAAATTTTTCTTGAGGCAGCGTATTTCAGCCCCGGCGCGATTGCCGGACGGGCGCGCCGGCTGGGCTTGTCCACCGATTCATCGCACCGTTTTGAGCGCGGGGTGGACTACGCAGCCACGCGCGATGCGCTGGAACGCGCCACGGCATTGATACTTGAAATTTGCGCTGGCGCGGCAAGTGCAATCACCGAAATAACAGGCGATCTGCCACAACGTGCGCCTGTCATGCTGCGCACCGCGCGTGCCAGCAAAGTGTTGGGCGTGGCGCTGAGTGACGCGCAGGTGGAAGTGTTGCTGGGCCGCCTGTGCTTTGACTTTCAGCGCGATGGCGCGGCCTATCAGGTGACGCCGCCCAGCTACCGCTTTGACCTGAATATCGAGGAAGACCTGATCGAAGAACTGGCGCGGCTCCATGGTTATGACAACATTGTTGCGCAGGCCCCGGTCGCCCGCCTGACCATGTTGCCGCAGCCGGAGCAACAGCGTGGGGTGGATGCGTTGCGCACCCTGCTCACCGCGCGTGATTATCAGGAAGTCATTACCTACAGCTTTGTAGATGCCGCATGGGAAGCGGATTTCGCACCCGGCGCTCAGCCCGTCGTGCTGAAAAATCCCATCGCCAGTCAGATGGGCGTGATGCGCTCCACCTTGTTGGGCGGCCTGATGGATGTGCTGCGCAACAATCTGAACCGGCGCCAGGAGCGTGTGCGTATTTTTGAGAGCGGACGCTGTTATCTGCCGGCGGCCGAGGGCTTCGATCAGCCGCAACGCCTGGCTGGACTGGCTTACGGCAGCGCTATGCCGGAGCAGTGGGGGAGTGCGGCGCGCAACGTGGACTTTTTTGACGTCAAAGCCGATCTCGAAGCGCTGTGCTGGCCACAGCCTGCACGCTTTGAAAAATCCGCTCATCCTGCGCTGCATCCAGGCCAGTGCGCTGAAATGTGGTTGAATGGTGTCCATGCCGGCTGGCTGGGTACATTACACCCACGGCTGACGCAGCAATATGATTTGGCGACAGCGCCGGTTGTGTTTGAACTCGCCCTGCCGGCATTGTTAACGCGGAAGCTGCCCAGGCATGGCGAGATTTCGCGTTTCCAGAGCGTGCGCCGTGATCTGGCCGTGATAGTCGATGAATCGGCGCCGGTACAGGCTTTGATTGATGCGATGTACGCAGCACGCATAGAGGGTGTTGCCGAGATTACATTGTTTGACGTGTATCGCGGCAAAGGCATTGATTCTGATAAAAAAAGTCTTGCATTCCGGGTGCTGTTGCAAGATACTCAAAAGACCTTTACCGACACTGAAGTGGATACCGCCATGGCGTACTTCACCGATCTGTTAAAACAACAATTCAACGCGCAATTACGTTCCTGAGGTAGTCATGACCCTGACCAAGGCAGAACTGGCAGACATGCTGTTCGAAAAAGTTGGCCTGAATAAACGCGAAGCCAAAGACATGGTGGAGTCGTTTTTCGAAGAAATACGCATTGCACTGGAAGCGGGCGATACCGTGAAGCTTTCCGGCTTTGGCAATTTTCAGCTGCGTGACAAACCGCAGCGTCCTGGCCGCAATCCCAAAACCGGCGAAGAAATGCCAATCACGGCACGCCGCGTGGTGACCTTTCACGCCAGCCAGAAACTCAAATCGCAGGTAGAAGACGCGCATGGCGGAACATCAGCCAACTAGTCAACTGCCGCCGATTCCTGCCAAGCGCTACTTCACTATCGGCGAGGTCAGCGAACTGTGCGGGGTGAAGCCGCACGTACTGCGCTACTGGGAACAGGAATTCGGCCAGCTCAAACCAGTCAAGCGACGTGGTAACCGTCGTTACTATCAGCATCATGAAGTGCTGCTGATTCGCCGCATCCGGGAACTGCTTTATGAGCAGGGATTCACGATCAATGGCGCACGCCATCGTCTGGATGTGCTGGCCACATCCGACGCCGCCGAGGCCGCACCCACGGTGACTGAATCGGTAACGGATTATGCAGCACTGCGTCGCGAAATGATGGAAATTGTCGAGTTGCTGCGCCTGTGATTTTTTAGCTCCAGTCTTTGTCCAGGCGCTACAGACTCTGCTATAATCGCGGCCTTCGGGGCGTAGCGCAGCCTGGTAGCGTACTTGCATGGGGTGCAAGTGGTCGGAGGTTCAAATCCTCTCGCCCCGACCAGAACAATGCCCAGGTGTCATGCCTTTCTAGAAAGCAATTCACGGGAATTTCAGAGTAGCCGTCCACTATGCGCATTTTGCTGAGCAACGACGACGGTTACTTTGCACCCGGTCTCGCCATCCTGGCGGATACACTCTCACACATCGCAGATATCACGGTGGTTGCCCCCGAGCGCGACCGCAGCGGCGCCAGCAATTCCCTCACACTCGATCGTCCGCTGATGCTGCGCCAGGCGCACTCCGGGTTTTATTACGTCAATGGTACGCCCACGGACTGCGTTCACCTCGCGGTTACGGGTATGCTCGATCACCTGCCGGACATGGTTATCTCCGGTATCAATCACGGTGCCAACATGGGCGACGACACTATTTATTCGGGCACCATAGCGGCAGCCACCGAAGGGTTTTTACTGGGGGTGCCGTCGCTGGCGATATCCCTGGCGAGCCATGCAGCGGGCAACTATGCCACAGCCGCGCGTGTTGCCAGCGAGCTCGCGCAACGTGTCATGGCACGGCCTTTTGCGGCACCGCTACTGCTCAACGTGAACGTGCCGGATATTCCCTATCAGGACTTGCAAGGCACCGCAATCACCCGCCTCGGACGCCGCCACAAAGCCGAACCGGTGGTCAAATCCACCAATCCGCGCGGTCAGACGGTGTATTGGGTGGGGGCTGCGGGTGCGGCGCAGGATGCAGGCGAAGGCACGGATTTCCATGCGGTGGCGAATGGGCGTGTGTCGGTGACGCCGTTGCAGATGGATCTCACCCAGTTCAGTCAACTGGCGCCGCTGCGGGCGTGGTTGCAGGCATGAGCGTGACGCGTCACAGCGGTATCGGCATGACTTCCGAGCGTACCCGCGCGCGCATGGTCGAGCGCCTGCGTGCGCAGGGGATCAAGGACAACAACGTACTCACCGCGATGGGCATGGTGCCCCGGCATATTTTTGTGGATGAGGCACTGTCCATCCGGGCTTATGAGGACAGCGCGCTGCCGATAGGTTTCGGCCAGACCATTTCCAGCCCCTATAGCGTGGCGCGCATGATCGAGGTGCTGCGTGGCGGCGCCGACCTGCAGTGCGTGCTGGAAGTCGGCACAGGTTGCGGCTACCAGGCCGCTGTACTGGCCAAGCTGGCACGCGAAGTGTACTCGGTTGAGCGCATTGCCACGCTGCTCGGGCGCGCGCGTCGTACCATACGCGAACTACGCATCGGCAATATCAAACTTAAACATGGCGATGGTAGCATTGGGTTAAAGGATGTGGCACCTTTCGATGGCATCATCCTTGCCGCCGCCATACCCACTCCCCCCCAGGCGTTGCTGGAACAACTGGCGCAGGGCGGCCGCATGGTATTGCCGCGAGGTATTGGTGAAACGCAGCAAATGGTGCTGATCGAGCGCACCGCAGAAGGTTTTCAGGAGACGGTGCTGGAAATGGTACATTTTGTTCCACTGTTGCCCGGAGTGCGCTGACGTGATGGTATTCGACAAGATTGCATGCCCGGTTTATCTGGTTGCTGTGATCCTTGCCCTGGGAGGATGCGCCACGCAGAATTCTGCGCCGGTTGTGGATGGCACCCAGCCTGGCACAAGCAATATTGTCAAGCCGGCAATCAAGTCCGCCACTACCCGCGCCGGCGCAGCAAAGCTGCATGACTGGCGACCGGACAGCCATACCGTGCAAAAAGGCGACACGCTCTACAGCATCGCGCTTGAATATGGCCTGGACTACCGTGATCTGGCGAGCTGGAATGCACTATCTGACAATAACCTGATCCGTGTCGGGCAGGTGTTGAAACTGAGTGCGCCGCAGCCAGGCAGCGGCATTGCGCAAGTCACGACATCTGAATCAGCAGTTCAGACCATCCCCCTCAAGATCGAACCGTTACCGCAGGCCCAGATAGCGACCGGCGCGGTGTTGATAACCCAGCCCAAGGCGGTCAAATTGCCCTACTCTGCCGCCGCATTGGCGCAGCTTGAACAAGGCGGGACGCCGCAGCCGGCCGCGCGGCCTCCCGCAACACCGGAGGCAGCGTCCGGGGTGGCGCCCGAGCCCGCATCCTCTGCAGAACAATCCCGACCGGCCGCGACTGCCAAGGAAACCGATGATACGGGTATTGATTGGATCTGGCCTACGCAAGGCCGGGTCATTGCCGGATTTGACGAAGCCAAAAACAGCAAGGGGCTGGATATAGCAGGCAAAGCCGGACAAGCCATATTCGCAGCCGCGCCGGGCAAGGTGGTGTATAGCGGCGCTGGTTTGCGCGGCTATGGCAAGCTGGTTATTATCAAGCACAACGCCATTTATTTGAGTGCCTACGCACACAATCAGCGGGTGCTGGTGAAAGAAGGTCAGACGGTTGCGCGCGGGCAGAAAATCGCCGAAATGGGTGACAGTGATGCCGATCAGGTCGCGCTGCATTTCGAAATCAGGAAAATGGGCCAACCGGTGGACCCGATGAAATATCTTCCCGGAGCACAAAAATAGCGATGCATGACGACAACGAGCAGGAAGACTACGCCGGTATCCCGGACGACAAATCCCAGACCGAAGTGCCCGAGGTCGAACTGGGATTTACCGAGGATGCGCATACCGATGTGACGCAGATGTACCTCAACGAAATTGGCCACAACGCATTGCTCAGCCCCACTGAGGAACGCCGCCTGGCCGAACTCACCCGTGCGGGCGATTTTGACGCCCGGCAAAAAATGATCGAACACAATCTGCGGCTGGTGGTGAATATCGCCAAGCATTACGCCAATCGCGGGCTGGCACTGCTGGACCTGATCGAGGAGGGCAACCTGGGACTGATTCATGCGCTGGAAAAGTTCGAGCCCGAACGCGGATTTCGTTTTTCCACTTATGCCACATGGTGGATACGCCAGAATATCGAGCGCGCCATCATGAACCAGTCGCGCACCATCCGCCTGCCGGTGCACGTCATCAAGGAACTCAACGTCATTTTGCGCGCCCGCCGTCATCTGGAAAATCACGGCGCCTCCGACCCGAGTGACGAGGACATCGCCCATCTGGTCGGGCTGCCGGTAGAAGATGTACGACGCATGTTGCGCCTGAATGACCGGGTGGCATCGCTCGACGCACCGCTCGATATTGATCCCAGTCTGTCCATCGGGGAGGCCATTGCAGATGGCAACAGCGCGTTGCCTGAAGACATGCTTGAGCACGCCGAGACTGAAGCCTTTGTGCGCCTGTGGCTGAGTGACCTCAACGACAAGCAGCGCTGGGTAATTGAGCGGCGTTTCGGGCTGGGCGGGCAGGATGTGCACACCCTGGAACAGCTGGCCGAAAGCCTCGACGTCACCCGTGAACGCGTGCGCCAGATCCAGATGGAAGCCCTGCACCATTTGCGGCGCATGCTGAAACGCACCGGCGTCAACAAGGACGCTCTGTTGTGA
Protein sequences of DBSCAN-SWA_4 >NZ_AP021884|1605656:1613255|1609908_1610652_+|WP_147073437.1|DBSCAN-SWA MRILLSNDDGYFAPGLAILADTLSHIADITVVAPERDRSGASNSLTLDRPLMLRQAHSGFYYVNGTPTDCVHLAVTGMLDHLPDMVISGINHGANMGDDTIYSGTIAAATEGFLLGVPSLAISLASHAAGNYATAARVASELAQRVMARPFAAPLLLNVNVPDIPYQDLQGTAITRLGRRHKAEPVVKSTNPRGQTVYWVGAAGAAQDAGEGTDFHAVANGRVSVTPLQMDLTQFSQLAPLRAWLQA >NZ_AP021884|1605656:1613255|1612328_1613255_+|WP_147073433.1|DBSCAN-SWA MHDDNEQEDYAGIPDDKSQTEVPEVELGFTEDAHTDVTQMYLNEIGHNALLSPTEERRLAELTRAGDFDARQKMIEHNLRLVVNIAKHYANRGLALLDLIEEGNLGLIHALEKFEPERGFRFSTYATWWIRQNIERAIMNQSRTIRLPVHVIKELNVILRARRHLENHGASDPSDEDIAHLVGLPVEDVRRMLRLNDRVASLDAPLDIDPSLSIGEAIADGNSALPEDMLEHAETEAFVRLWLSDLNDKQRWVIERRFGLGGQDVHTLEQLAESLDVTRERVRQIQMEALHHLRRMLKRTGVNKDALL >NZ_AP021884|1605656:1613255|1606685_1609043_+|WP_147073443.1|tRNA|DBSCAN-SWA MKFSELWLRTLVNPALDSAALSHLLTMAGLEVEALDPVAADFSGVVVGQVLSVAPHPDADRLRVCLVDAGTGSPLQIVCGAPNVSEGARVPCALAGARLPGFEIKKAKLRGVESQGMLCSARELGLAEQADGLLLLPNDAPVGSNIRDYLHLDDRLYTLKLTPNRSDCLSVAGVAREVAALTGSPLNLPRIEPAAVTGRLTRMVQVTAGQACPRYCGRVISQLNRAAQTPGWMIERLSRSGLRSISPVVDITNYVLLELGQPLHAFDLDKLAGDIQVRMATPGETLTLLNDQRATLEADMLVIADDNGAQALAGIMGGAATAVDENTSEIFLEAAYFSPGAIAGRARRLGLSTDSSHRFERGVDYAATRDALERATALILEICAGAASAITEITGDLPQRAPVMLRTARASKVLGVALSDAQVEVLLGRLCFDFQRDGAAYQVTPPSYRFDLNIEEDLIEELARLHGYDNIVAQAPVARLTMLPQPEQQRGVDALRTLLTARDYQEVITYSFVDAAWEADFAPGAQPVVLKNPIASQMGVMRSTLLGGLMDVLRNNLNRRQERVRIFESGRCYLPAAEGFDQPQRLAGLAYGSAMPEQWGSAARNVDFFDVKADLEALCWPQPARFEKSAHPALHPGQCAEMWLNGVHAGWLGTLHPRLTQQYDLATAPVVFELALPALLTRKLPRHGEISRFQSVRRDLAVIVDESAPVQALIDAMYAARIEGVAEITLFDVYRGKGIDSDKKSLAFRVLLQDTQKTFTDTEVDTAMAYFTDLLKQQFNAQLRS >NZ_AP021884|1605656:1613255|1609333_1609708_+|WP_147073439.1|DBSCAN-SWA MAEHQPTSQLPPIPAKRYFTIGEVSELCGVKPHVLRYWEQEFGQLKPVKRRGNRRYYQHHEVLLIRRIRELLYEQGFTINGARHRLDVLATSDAAEAAPTVTESVTDYAALRREMMEIVELLRL >NZ_AP021884|1605656:1613255|1609050_1609356_+|WP_147073441.1|DBSCAN-SWA MTLTKAELADMLFEKVGLNKREAKDMVESFFEEIRIALEAGDTVKLSGFGNFQLRDKPQRPGRNPKTGEEMPITARRVVTFHASQKLKSQVEDAHGGTSAN >NZ_AP021884|1605656:1613255|1610648_1611311_+|WP_147073435.1|DBSCAN-SWA MSVTRHSGIGMTSERTRARMVERLRAQGIKDNNVLTAMGMVPRHIFVDEALSIRAYEDSALPIGFGQTISSPYSVARMIEVLRGGADLQCVLEVGTGCGYQAAVLAKLAREVYSVERIATLLGRARRTIRELRIGNIKLKHGDGSIGLKDVAPFDGIILAAAIPTPPQALLEQLAQGGRMVLPRGIGETQQMVLIERTAEGFQETVLEMVHFVPLLPGVR >NZ_AP021884|1605656:1613255|1605656_1606676_+|WP_147073445.1|tRNA|DBSCAN-SWA MQNLDTIVAAALAEFAAVNQAVELEQAKARYLGKAGLLTGQLKQLGKLPAAERPAAGNVINQAKERIQQALEARRAALSRAELDNRLAAETLDVTLPGRGLGTGGLHPVTRTLARIQALFASIGFEVAEGPEIETDFYNFTALNIPENHPARAMHDTFYVDDKHLLRTHTSPVQIHYLQNNQPPLKIIAPGRVYRCDSDVTHTPMFHQVEGLWVDEEVSFAALKGVLADFMQRFFERDDLKVRFRPSFFPFTEPSAEMDIACVMCGGGGCRVCSHTGWLEVLGCGMVHPNVLGHVHVDSEKYLGFAFGMGVERLAMLRYGVDDLRLFFANDLRFLKQFN >NZ_AP021884|1605656:1613255|1611315_1612326_+|WP_147073482.1|DBSCAN-SWA MVFDKIACPVYLVAVILALGGCATQNSAPVVDGTQPGTSNIVKPAIKSATTRAGAAKLHDWRPDSHTVQKGDTLYSIALEYGLDYRDLASWNALSDNNLIRVGQVLKLSAPQPGSGIAQVTTSESAVQTIPLKIEPLPQAQIATGAVLITQPKAVKLPYSAAALAQLEQGGTPQPAARPPATPEAASGVAPEPASSAEQSRPAATAKETDDTGIDWIWPTQGRVIAGFDEAKNSKGLDIAGKAGQAIFAAAPGKVVYSGAGLRGYGKLVIIKHNAIYLSAYAHNQRVLVKEGQTVARGQKIAEMGDSDADQVALHFEIRKMGQPVDPMKYLPGAQK |
8 | uncultured_Mediterranean_phage(33.33%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1977054 : 2023954
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_AP021884|1977054:2023954|DBSCAN-SWA AATGCAAACCCAAGTTCCATCAATCGAATCCGGTCGGAATCCCCGCCGGATGAATCCCGGCGGTGCAACCTGCATCGCCCTCGACGAAAACGAGCTCGCCATCCGCTGGGGGCTCTCCGTCAAGACGCTGCGCCGCTGGCGTCAAGAGCAGCTCGGCCCCATCTACTGCAAGCTCGGTCGCCGGGTCACCTACCTCCTGCACGAAATCGAAGCCTTCGAGCGCCGCGTCTCGCGCTACTCGAGCTTCACTCGTGCGTACCAGTGAGGAGGACGGCCATGAGCGATCTGACCATCTTCCCCGTCGACATCGCTGAGATGTCTGTGAGCCAACTGGCCGCGCTGCCGCCCGAGCAGAAGTGCGAGGTCGACAAGAACCTTGATGCTGCCATCGACTGGCTAAAGAAGGCTCGCACCAAGTTCGATGCGGCGCTGGAACAGTGCTACGGCGAGCAGGCCCGTGTCGCACTGCGTGAATCAGGCCGTGACTTTGGTACCGCCCACATCAGCGACGGCCCGCTGCACATCAAGTTCGAGCTGCCCAAAAAGGTCAGCTGGAACCAGAAACAGTTGGGCGAAATCGCCGAGCGCATCGTGGCCTCAGGCGAGAAGGTCGAGGGCTACCTCGACGTCAAGCTCTCAGTGTCCGAGTCCCGGTACATCAACTGGCCGCCTGCATTGCAGCAGCAATTCGCGGCCGCCCGCACGGTCGATTCCGGCAAGCCGTCCTTCACCCTGAGCACCGATGGGGGTGAGGCATGAAGCGGCTACCCATCGTGTCCGCCGTCGAGCGGATGGCCGAGCGCAAGGGCGTGAAGCTGCTGATGCTGGGCAAGTCCGGCATCGGCAAGACGTCCCGGCTCAAAGACCTCGACCCCGCCACCACACTGTTCCTTGACATCGAGGCAGGCGACTTGGCGGTCGCCGACTGGCCGGGCGACACCATCCGCCCGGCGTCCTGGCCCGAGAGCCGCGACTTCTTCGTGTTCCTTGCGGGCCCGGACAAGTCGCTGCCGCCGGAGAGCGCGTTCTCGCAGGCGCACTACGACCACGTCATCGAGAAGTTTGGCGATGCGACGCAGCTCGGTCGCTACCAGACCTTCTTCCTTGACTCGATCACGCAACTGTCTCGCCAGTGCTTTGCGTGGTGCAAGACGCAGCCCGGGGCGGTCAGTGATCGTTCCGGCAAGCCCGATCTGCGCGCGGCCTACGGGCTGCTCGGCCAGGAAATGATCGGCGCGTTGACCCACCTGCAGCACGCCCGTGGCAAGAACGTTGTGTTCGTGGCGATCCTCGATGAGCGACTGGATGACTTCAATCGCAAGGTGTTCGTCCCGCAGATCGAAGGCAGCAAGACCAGCCTGGAGCTGCCCGGCATCGTCGATGAGGTCGTGACGCTGGCCGAGATCAAGGCCGAGGACGGCAGTTCCTACCGCGCCTTCATCACGCACACCGTCAATCCCTACGGCTTCCCGGCCAAAGACCGCAGCGGTCGTCTCGACCTGCTGGAGCCGCCGCATCTCGGCGCGCTGATCGCCAAGTGCGCGGGCGCTGTGCCCGCGCTAGCCAGCGCCGCCAACCCCGCACACATCGAATCTCAGGAGTAATCGCAATGACCGCATGGAATGACTTCAACGACGCCGACTCTCAGCAATCCGGCTTCGATCTGATCCCCAAGGGCACCGTCGTGCCGGTGCGAATGACCATCAAGCCGGGTGGCTATGACGACCCCGAGCAAGGCTGGGGTGGCGGCTACGCCACCGAATCGTTCGAGACCGGTTCCATCTATCTGGCCGCTGAGTTTGTGGTCACCGCTGGCGATCATGCCAAGCGCAAGATGTGGAGCAACGTCGGCCTGCTCTCCAAGAAAGGCCCGACCTGGGGCCAGATGGGGCGCAGCTTCATCCGGGCCGCGCTCAACAGCGCCCGCAACGTCCACCCGCAGGACAACAGCCCACAGGCCGCCGCCGCGCGCCGCATCAATGGCTTCGCCGAACTGGACGGTCTGGAGTTCTTGGCGCGCGTCGACATCGAGAAGGACGCGAAGGGTCAAGACCGCAACGTGGTCAAGCTGGCAGTCGAGCCCGACCACCCCGACTACGCCAAGTTGAAAGGTGTGCCGCCGAAGGGCAGTCCGGGCGGTGGCAACTCCGGCGCTCCGGCGCAGGCGGCCCCGGCCTATTCCGCGCCCACCCCGCAACGCGCACCAGTGACGGGCAAACCGTCCTGGGCTCAGTGAGGAGACGGCTATGAATGCATCCGTCCTCACTGCCAGTCACTACGGCGTCGTGCGCTTCGGCGATCTGCAATGCGAGGCCGTCGTCCTCAAGGGCGGCGAGCGTGGCTACGTTCGTCGCCAACTGGCCAAGCTGCTGGGTTTCCACGAGACGCACAAGGGTGGCCGATTTGCCCGGTTTCTTGCCGACTTCGCTCCTAAGTCCTTGTCGGCATTGGAGAAAACTCGTGAGCCGATTCTGTTGCCGTCAGGTCGGCAGGCGCAGTTCTTCCCGGCCGGGATCATTGCCGACGTCGCGTCGGCGGTGGTCAGCGCGGCCATCAACGGCACGCTGCACAAGGCCCGCCAGGGCATCGTGCCCAATTGCATGAAGATCATGCGCGCGCTGGCCACCACCGGCGAGGTCGCGCTGATCGACGAGGCGACGGGCTACCAGTACCACCGCGCGCCTGACGCGCTGCAGGAACTGATCTCCAAGCTGCTGCGCCAGTCGTGCTCTTCGTGGGAGCGCCGCTTCCACCCGGACTACTACCGCGCCCTCTACCGGCTGTTCGGCTGGAAGTACCAGGGCCACGACCAGAACCCGCCCCACGTTGTCGGTCAGATCACGCAGCGCTGGGTCTACGGCCCGGTGCTGCCCGTCACGCTGATCGACGAGATTCGCGCCCGCAAGGGCATCTCGCAGAAGCACCACCAGTGGCTGTCCGATCAGGGCCTCGCCCGTCTGGAAACGCAGATTCACGCGGTCACCGCCATTGCGCGCAGCTCGACCTGCTACCGCGACTTCGACCGCCGCTGTGAAGCGGCCTTCGCTGGCGGCGCGCTGCAGCTGGCGCTGCTGGCCGAAGACTTTGAGGAGGGGGCGTGAAATGCTGGGTCTGCAAACGACAGGCCCGGGGATTCGGTCACACCGACAACCGACACGGTATCGGCGATCCCCGGCGCTACCCCATCGACTGGGTGTTCTGCTCGCAGCGCTGCCAATCCGCGTTCCACGCTATGTACGGCAACTGGTCGCGCGCCAAGGATGGTCGCAGCGACATCAAGGGGGTCGCCATGATCGATCCCTCTGATATCGAGCTGGCCGCGATGCGCAAGTGCCTCAAGTCCTTCGGCGAGGCGGCAAGCGAGATCGGCTTCACCAAACCACTGGGCAACTACTCCGAAGCCGAGGCGCTGCAGGTGATCGACGCCATCGTCACTTGCTACACCGAGGCGATGGTTGAGCACCACGAGGCGAGCAAGTACCCGCCCGTACGCGGCATGACGCCAACGCCCGACCCCATGACACCGAGTGCAGCCAATCCGTTCGCGGATCTGGACGACGACCTGCCTTGGGAAGAACCGAAGGGGAAGAAGCCATGATGGACTTCAACTCCACTTCGAGCATCTCGGGCCAGATCACTGCGCTGGTCGACGCCGGGATGCAGCGGGCGCGAGCCCAGCAGTCCGAGCGCCAGTACCTTGGTGCCTCGCGGTTGGGCGCTGCCTGCGAGCGTGCGCTGCAGTTTGAGTACGCCAAGGCTCCCGTCGATCACGGCCGGGACACCCCGGGCCGGATGCTGCGCATCTTCGAGCGCGGCCACGTCATGGAGGACTGCATGGTCGCGTGGCTGCGCGACGCCGGTTTCGAATTGCGTACCCGCAGGGCCGATGGCGAGCAGTTTGGCTTCTCCGTGGCTGATGGCCGTCTGCAGGGCCACATCGACGGCGTCATCGTCGATGGCCCGGAGGGCTTTGCCTACCCGGCGCTCTGGGAAAACAAGTGCCTCGGCATGAAGTCCTGGCGCGAGCTGGAGAAGAACCGGCTCGCCGTGGCCAAGCCCGTCTACGCCGCGCAAGTGGCGATCTACCAAGCCTATCTCGAACTGCACGAGCACCCGGCGATCTTCACGGCGCTCAACGCCGACACGATGGAGATCTACACCGAGGCCGTGCCCTTTGACGCAGCCCTGGCCCAGCGAATGTCGGATCGGGCGGTGAAGGTCATCACGGCGACTGAAAGCGCAGATCTCCTGCCGCGTGCCTTCAATGACCCGACCCACTTCGAGTGCCGGATGTGCGCGTGGCAAGACCGCTGCTGGAGAACACAAGCATGACCGACAACAACACCCCGACCACCGGCATCGAGCCGATGATCGATGCCAAGCAGGCGGCCGCCGCGTTGCGCCTGCCGTACTACTGGTTCGCCGACCACGCGATGCGCACCAAGTACCGGATTCCGCACTACCTGATGGGCGGTCTGGTGCGCTACCGGCTGTCCGAACTCTCTGCGTGGGCCACGCGTACCACCGCCGTTCAGGGCCGTGATTCCCAAGATGCGGACGCACCTGTCGAGGGAGCCGAATGATCGACTTCAACGACACCACCCAACCTGCGGAGCACAACAGGGAATCTGAACGAGACGAGATTCGCGCCGACTTGCTTGCGCGTCTGGAGTCGGTGCTGACCACGATGTTTCCGGCTGGCAAGAAGCGCCGTGGCAAGTTCCTGATCGGCGACATCCTCGGCAGTCCAGGTGACAGCCTCGAGGTGGTGCTGGAAGGTGAGAAGGCCGGTCTGTGGACGGATCGTGCCACCGGCGATGGCGGCGACATCTTCGCCCTGATCGCGGCCTATCTCGGTGCGAACGTCCACACCGATTTCCCTCGCGTGCTGGATGAAGCTGCCGATCTGCTCGGGCGGTCGCGGTCGGTGCCAGTGCGCAAGGCGAAGAAGGAAGCGCCTGTAGACGACCTCGGCCCGGCCACGGCGAAGTGGGACTACTTCGATGCCGGTGGCAAGCTGATCGCCGTCGTCTACCGCTATGACCCACCGGGAGGCAAGAAGGAATTCCGACCGTGGGACGCGAAGCGCCGCAAGATGGCCCCGCCTGAGCCGCGCCCGCTGTTCAACCAGCCGGGCATCGGTGCGGCCAGCCACGTCGTCCTGGTCGAGGGCGAGAAGTGCGCGCAGGCCTTGATCGCCAGCGGCGTGGTGGCCACCACCGCCATGCACGGTGCCAATGCCCCGGTCGACAAGACCGACTGGTCGCCACTGGCTGGCAAGACGGTGCTGATCTGGCCCGACCGCGATGCGCCAGGGTGGGACTACGCCGACCGCGCGTCGCAGGCGATCTTGCAGGCAGGCGCGACCTCGGTCGCCATCCTCATGCCACCCGACGACAAGCCGGAGGGGTGGGACGCTGCAGATGCCATTCCCGAAGGTTTCGATGTCGGTGGCTTTCTGGCCGTCGGCGAGCGGATGCCGGTGATGCGCTCGGTGGAGGAAGCGCCTTCGCCAGACTTGCTGACGGGCATTGATTGGACGACCGAGGATGGCCTGTCCAGCGCTTTCACCCGCCGCTATGGCGAAGACTGGCGCTACTGTGCCCTGTGGGGCAAGTGGCTGGTCTGGACGGGTGTGCGCTGGAATCCCGATCAGGTGCTCTACGTGTCGCATCTTTCCAGGGGCATCTGCCGTAACGCCTCGCTGAAAGCGGACACGCCGAGGCTCAAGGGCAAGCTGGCCAGTTCGGCCACGATTTCGTCGGTTGAAAAGATCGCGCGCTCTGACCCGAAGCACGCATCCACCGCCGAGGAATGGGACGCCGATGTCTGGGCGTTGAACACCCCCGGTGGCGTGGTCGATCTGCGCACCGGCCGGATGCGCCCGCACCGGCGGGACGACCGAATGACCAAGGTGACCACGGCTACCCCGCAGGGCAATCCGGACAGTGCCTGCCCAACGTGGCGAGGGTTCCTGACAGACGTCACCGGCGGCGATGCCGATCTGATGGCCTACCTGCAACTGATGGTTGGCTACTGCCTGACGGGCGTCACCAGCGAGCACGCGCTGTTCTTCCTGTACGGCACGGGCGCGAACGGCAAGTCGGTGTTCGTCAACGTGCTAACCACCATCCTGGGCGACTACGCGGCCAACGCCCCGATGGACACTTTCATGGAGGCGCGCAATGACCGACACCCCACCGATCTCGCCGGGTTGCGCGGTGCACGATTCGTGTCATCCATCGAAACGGAGCAAGGGCGGCGCTGGAACGAGTCCAAGGTCAAGGCCATCACCGGTGGCGACAAGGTGTCCGCGCGCTTCATGCGCCAGGACTTCTTCGAGTACCTGCCGCAGTTCAAGTTGGTGATCGCGGGCAATCACAAGCCGTCGATCCGCAACGTCGACGAGGCGATGAAGCGTCGACTGCACCTGATCCCGTTCACGGTGACGATCCCGCCCGAGCGCCGCGACGGCAGGCTGACCGAGAAGCTGCTCAAGGAACGCGATGGGATTTTGGCGTGGGCCGTCGAGGGCTGCAGCCGCTGGCAAAGCCAGGGCTTGAAGCCGCCCGCCAGCGTGGTGTCGGCGACCGAGGAGTATTTCGAGGCCGAGGACGCGCTCGGGCAGTGGATCGAAGAACGCTGTCTGCTGGCCAAGTCGCACCGCGAAGGTGTCTCCGAACTGTTCGCCGATTGGCGTGAATGGGCCGAGCGCGCTGGCGAGTACGTGGGCTCGGTCAAACGCTTCTCGGAGCTGATGGCGACTCGCAAGTTCGACAAGTGTCGGCTGACCGGAGGGGCTCGCGCCATCGCGGGCATCGCCCTCAGGCCCAAGCCGTACAGCCACGCCTACCCCTACCGCGATGACTGATCAATCCGGTCGAGTGACGGATTTGACGGGTTTCCTGATTGACGCGCTACACGTGCGCGCACGTAAAGGGCGTTGTCCTGACAAACCGTCGCATCCGTCACTCGCCCACCCAACACGGAGTAAAGACGATGAAAACGACGATCCTCGCCCTCGATCTGGGCACACACACCGGGTGGGCTCTGCAGCACCTGGACGGCACCATCACCAGCGGCACGGAGCACTTCAAGCCGCAGCGATTTGAAGGCGGCGGGATGCGTTTCCTTCGATTCAAGCGCTGGCTCAACGAACTGCTGTCGGTCAGCAATCACATCAACGCGGTGTTCTTCGAGGAAGTTCGGAGGCACGCTGGCGTTGACGCAGCGCACGCCTACGGCGGATTCATGGGGCACCTGACCGCGTGGTGTGAACATCACAACATCCCCTACCAGGGCGTTCCGGTCGGCACGATCAAGAAGCACGCGACCGGCAAGGGCAATGCGAGCAAGGACGAAATGATCACGTCCGTCCGCGAGCGTGGTCACACCCCAGTCGACGACAACGAAGCCGACGCGCTGGCTCTGCTGCACTGGGCAGTCGAGACGCAGGAGGTGTGACGTGAAGGTTTCGACACCCCAATACCGCTGCCCCCTTGGTCGGCTGCAACCCCAGACCACCGATCTGGACGCCATCAAGGAACGTGGCTGGCGTGACCAGCACATCCTGGTGGTCAACGCGTCCGACGACCGTCTGGACTTCATCGAGCGCGAGATCGTGCGACGCATTGGTGAACGCCTGTACGGGCTGGGAGGGACGCGTCATGGCTGAGTGGACAACCGACGACGTGGCAGCACGCTTCGAGGAGGCCGCCACCACCGGACGACGCTTGCCCCCTGTACGTGTGCAGGGCTACTTCAACTGCTGGCCTGCCTTCGTCCGCAAGGAGTGGGAAGCCTTTGCTGCTGACGAGAAGGTGTATCGCCCCTTCCCACCAAGCCCCGAGGCCATCGACCGGATGCTGGAGACGATGCGCTGGGTGCAGTGGCTCGAGGTCGAGCAGCGACACCTCGTGTGGATGCGGGCCAAGCGCTACGGCTGGAGGGACATCACCATTCGATTTGCCTGCGACCGCACCACGGCGTGGCGGCGTTGGCAGAGGGCAATGGAGATCGTGGCCACGAACCTCAACAGCGAAGGCGTGCGGTTGCCTTCCAAAAACGTGGGCAATTTAGGGTAATGCTTGCCGCGCTTGTCCCTGCTTTGCCTTGATTGTCCGTTTCGAGGCCCGGCAGCCCTGCAACAAAACAGCCCGGTCGGGGGTAGTATTTCGGCTATCTTCTGGACAGCGGTGACGGTTGAGGCGATGGGCCCAGGCAAAAGGGGTCCTTCCTTCCCGAATCGCAATGCGGGGGGCGCGAGCGCGGCATTCGCCTAGCGTCCGACTGCAAACCAAGGTTTGCAGGGTTTGCAGTTTGCACCCGCACCAGTCCGCACCCATCACGAGCCCGCCCACGGTTTTCCGTCGGCGGGTTTTCTTTTTGAGGAAACGATTCTGAACACGCTTAACGTTGAGTACCGCAAGGTCGAGGCGCTGATCCCCTACGCCCGCAATCCACGCACTCACACCGACGAGCAGGTGGCCAAGATCGCCGCCAGCATCGTCGAGTACGGCTGGACGAATCCGGTGCTGGTGGACGGCGACAACGGGATCATTGCGGGCCACGGTCGTTTGGCCGCCGCGCGCAAGCTCGGGCTGGATCAGGTACCGGTCATCGAACTGGCGCACCTCTCACCCACCCAGAAGCGTGCCTACGTCATCTCCGATAACCGGCTGGCGCTCGACGCCGGTTGGAACGAGGAGATGCTGGCGCTGGAAATGGCCGAGCTGTCCGAGGCCGGGTACGACCTTGCACTGACCGGTTTCGAGGATGCTGAGATCGAGGCCTTGCTCGCTGACGAAGTCGCCTCCGATGCCGCCGACCAAGAGCCCGATGCCGACGAGCCGGACGATGGCGACGATGTGCCGGATAGCCCAGTGGTGCCGGTGTCCCGCACCGGCGATTTCTGGGCCATCGGTACCCACCGTCTGATCTGTGGCGACGCCACCGACCCGACCGTGGTCGCCACTCTAATGCAGGGTGATGCGGCCCGGCTGTGCTTTACATCACCGCCTTACGGCAACCAGCGCGACTACACCTCCGGCGGCATCACCGATTGGGATGGCCTGATGCGCGGTGTGTTCGCCAAGGTGCCAATGGACGACGACGGGCAGGTGCTGGTCAACCTCGGGCTGATCCACCGCGACAACGAAGTCATCCCGTATTGGGATGCGTGGCTGGGCTGGATGCGCACGCAGGGTTGGCGGCGCTTTGCTTGGTACGTCTGGGATCAGGGGCCGGGGATGCCCGGAGACTGGGCTGGTCGTTTTGCGCCGAGTTTCGAGTTCGTCTTTCACTTCAACCGCTCTAGTCGCAAGCCCAACAAGATCGTGCCCTGCAAGCACGCGGGCCAGGAATCGCACTTGCGCGCCGACGGGTCGTCCACGGCCATGCGCGGCAAGGACGGCGAAGTCGGTGGCTGGACGCACAAGGGCCAGCCGACGCAGGACACCCGGATTCCCGACTCGGTGATCCGCGTGATGCGGCACAAGGGCAAGATTGGTCAGGACATCGACCACCCGGCTGTGTTCCCGGTGGCGTTGCCCGAGTTCGTGATCGAGGCCTATACGGACGCAGGCGACATCGTGTTCGAACCTTTTGGCGGCAGCGGTACCACGATGCTGGCCGCGCAGCGCAAGGGTCGTGTGTGCCGCTGCGTGGAGATCGCGCCGGAGTACGTGGACGTCGCCATCAAGCGCTTCCAGCAGAACCACCCCGGCGTGCCCGTCACGCTGCTGGCCACAGGCCAGTCCTTCGACGATGTGGTCAATGAACGTCAGGCCACCACGGAGGTAGAGCAATGACCGCCTCCTGGTTTGCCGACAAGATCGAAAAGTGGCCGACTGCCAAGCTGCTGCCCTATGCCCGCAACGCGCGTACTCACTCGGACGATCAGGTGGCGCAGATCGCCGCGTCGATTGCCGAGTTCGGATTCACCAATCCGATCCTGGCGGGCAGCGATGGCGTGATCGTCGCCGGTCACGGACGGCTTGCTGCTGCGCAGAAGCTTGGGCTGGCGGTGGTGCCGGTGGTGGTGCTCGATCATCTGAGCCCGACACAGCGCCGGGCCCTGGTGATCGCAGACAACCGCATCGCCGAGAACGCGGGCTGGGACGATGCGATGCTGCGCATCGAGATCGCATCACTGCAGGACGACGACTTCGACGTGTCGCTGACCGGCTTCGATGCAGATGCGCTGGCCGAATTGATGGCGGGCGACGAGCCGGATGGCGAAGGCGAAACCGATGACGATGCCGTGCCCGAGTTGTCGGAGACGCCGATCTCTCGTCCGGGTGATGTCTGGTCGCTTGGCGGCCACCGGCTGCTGTGCGGGGACTCCACCGTGACTGAGAGCTACGACAGGCTTCTCGATGGCGAGCAGGTCGACATGGTGTTCACCGACCCGCCGTACAACGTGAATTACGCCAACAGCGCCAAGGACAAGATGCGTGGCAAGGACCGCGCGATCCTGAACGACAACCTCGGCGACGGCTTCTACGACTTCCTGTTGGCGGCGCTGACGCCGACCATCGCGCATTGCCGGGGCGGGATCTACGTGGCGATGTCGTCCAGCGAACTGGATGTACTGCAGGCCGCATTCCGCGCCGCCGGTGGCAAGTGGTCGACGTTCATCATCTGGGCCAAGAACACCTTCACGCTGGGCCGTGCCGATTACCAGCGCCAGTACGAGCCGATCCTGTACGGATGGCCAGAGGGCGCGCAGCGTCACTGGTGCGGCGACCGCGACCAGGGCGACGTCTGGAACATCAAGAAGCCGCAGAAGAACGACCTGCATCCGACGATGAAGCCGGTGGAGTTGGTCGAGCGCGCGATCCGCAATTCGAGCCGACCGGGCAACGTGGTGCTCGACCCGTTCGGGGGCTCCGGCACGACGCTGATTGCCGCCGAAAAGTCAGGACGGCTGGCACGGCTGATCGAACTCGACCCTAAGTACGCGGACGTGATCGTGCGCCGCTGGCAGGAATGGACTGGCAAGCAAGCCACCCGTGAGTCGGATGGCGCGCTGTTCGATGATCAGGCGGCGATCGACTCTTCCGCGATCTCGCAATGAATCACGAACCCCGTCAGGTAAGGCAGGCCGCGCGGGATGCCGTACTGCTTGCTGGTCTGGCGGCCAATCGTCCAGCCCATCCACTGTTGGGTGGCGGCGTTGATCGCGTCCGCCAGGGTCTGGCCCCGGTACAGCCCGTTTTGCAAATCGTCCGCAAAGTGGCGGCCGTGGCGACTGTCGAGGAAGACGCGGACTGATTCGAGGGGCTGACCGGTAGCGTCGGAGATGGCGGTCATCGCCAGGGGCCACGCGGTGCTGGCGTGTTCGTTCATCGTGCCCCAAAAGCCCCAGGCATCGTTCTGGGTGGCGGGCATTTGCTGGTTGGTGTTCATCTCTGGCTCCTTGGGGTTGATCGTTGCGACACCCGTAGTAACGCGCTGTTCGATTGAGAAGCCAAGCTGTTCTTGGCCTCTTTCTCAATCAATTTCGATTACCCGAGACGGGCCACGTACCGGGCGTAGTCGCCGCCCTCTGGATTCACGTAAAGGTAGGGGCGACCCGGTGCGGTGACCTCGACGCAAAGATAGCCGTCGCCGGTGCCGCCACCTTTGCCGCGCAGCCAGTCGCGCGATACCAACAGGCTGCGGGCAAAGGCATCGAACTCGTCGACGGTCAGTTCCTTGGTCTCGGTGACATAGACCTTGGTCTGACCCTGGCCGCCAACTTCGTCCAAGTCGGCAGGCTTACGGGCAAACGGCAATCGGACGCTCAACTCCTCGACCTGGAAGGTGGTGTCGCCAAACTGCAGGGTGCGCGGGGTGCGTTCGATGGTGATGGTCATGGTGCTCATGAATGTTCTCCTGGGTGTTGGCGTTGCGATCAGGCTTCTGCGGCGATCCGGTAGACCCGCTCGCTGCCCTGGGCCTTGTCCGAGACGATGGTCAGCCCGAGCTTCTTCTTGAAGGCACCGGCAAAGGTGCCGCGCACCGTGTGCGCCTGCCAGCCGGTGGTCTCGCAGATCTGCTGCACCGTTGCCCCTTCGGGGCGCTGCAGCATCTGGATCACCGTGGCCTGCTTGCTGTTCTCGCGGGTGCGAGGTTTGGCGGCCGCCTTTTCTTGCGCCCACGCGGCCTCGGCTGCCGTCACGGCTGCGTCGAGTTCAGGGTCTGCGGCCACTGGCGCAGGCGTTGGCCGGGCGCGCCCCATCGCGTCGTAGCCCTCGGCGGCGACGAACCAGTGGGTGCCGTCGGAGGTGATCAGCGCGCGGTTGAACAGGCCGTCGAGCACCTTCTTGCGTGCGCCGCCTTTGATGTTGTCGGGGAACCAGTCGATCTTGCCGTCGGTGTGTTCGAGGGCGTAAGCCAGGATCGCGTGCTGGGCCGGGGTCAGTTGGGTGGTGGTCATTTGCTTCTCCTTGTGCAAGGGGTTGATGGGGTGACGTGATGAACGCGCTGTTCGGGAGTGAAGCCAAGCGTTTTCTGCTTGGCTTCGAAGGTTCTTGATCAGCTGTTGGCCTTGTCCGACTTCGTCGCCTTGCGGCCTTGTTCGACGCCTGCGTTGAACGCGGCCTCCAGGGCGTCGCGCAGGCACCAGACCGCCACGTCGTGGAAGTCGAGGCTGTCTGACTTGCGGGTTTCCAGGGTTTCGATGCCCAGCTTGTTTTGTGCGATCTGGGTCAGGAGTTGTTCGAACTTGCTCATTGCTGCTTCCTTTGATGGTGTTGATGACGTCCGTATGAACGCGCTGTTCCAGAGAGAAGCCAAGCTGATTTCGAGTGAAGGTCGAAAAAATGATTGAAGGGGTAACCGGTTCTCAAAATGGGCATTTCGATTCGCGCTTACGCCCGTCACCGTGGTGTGACCGACACCGCTGTTCACAAGGCAATTCGCGCAGGTCGGATCACGCCGGAGGCTGACGGCACCATTGATGCCGACCGTGCTGATCGCGAGTGGGCTCGCAACTCCGATGTGCCGAAGACCGGTACGCGGGCCAAGGCCGCAAAGGTCGCCGTGCCGGAAGGCGGTACGGGTGTTGGCGGTGATGGGCCCGCCGCATTACCCGCTGGCGGCGCGTCCTTACTTCAGGCGCGCACGGTCAACGAGGTCGTCAAGGCGCAGACGAACAAGGTGCGTCTGGCCCGACTGAAGGGCGAGTTGGTGGATCGGCCGCAGGCCATCGCCCACGTCTTCAAGTTGGCGCGCTCCGAGCGTGATGCGTGGCTGAACTGGCCCGCGCGCATCTCTGCGCAGATGGCGGCCAAGCTCAATATCGATCCGCACACGATGCACGTCGCCCTGGAGGCGGCGATACGTGAGCACCTGCAGGAACTGGGCGAACTCCGGCCCCGGGTGGACTGATGCTGAATGTTGAATACGAAGGCGCTGCCGAAATCGAGCGCGCGTGGCGTGAAGGGCTGACACCTGATCCTCTGCTCTCGGTCTCTGAATGGTCGGATCGCCACAGGATGCTATCGAGCAAGGCGTCCGCTGAGCCTGGGCGCTGGCGCACCAGCCGCACGCCGTACCTGAAGGCCATCATGGACTGCCTGTCGCCGACCTCGCCGGTCGAGCGCGTGGTGTTCATGAAAGCCGCACAGCTCGGTGCGACTGAAATGGGCTCGAACTGGATTGGCTATGTGATTCACCACGCACCGGGGCCGATGATGGCGGTGTGGCCAACGGTGGATATGGCTAAGCGCAATTCCAAGCAGCGGATCGATCCGTTGATCGAGGAGTCGGCGGCACTGAGCGAATTGATCTCCCCAGCACGGTCACGCGACTCGGGCAACACCATTCTGGCCAAGGAGTTCCGGGGCGGCGTGCTGGTGATGACCGGGGCGAACAGCGCGGTGGGCTTGCGCTCGATGCCGGTGCGCTACCTGTTCCTCGATGAGGTTGACGGGTATCCGCTGGACGTCGAGGGTGAAGGTGATGCGATCTCGCTGGCCGAGGCGCGCACGCGAACCTTTGCCCGGCGCAAGATCTTCATCGTGTCGACGCCGACGATCTCGGGGGCGAGCGCCATCGAACGCGAGTACGAGGCCAGTGACCAACGTCGCTACTTCTTGCCTTGTCCGCACTGCTCGCATCGCCAATGGCTGCGCTTCGAGCAGTTGCGATGGGAAAAGGGGCAACCGGACACGGCGTCCTACATCTGCGAGTCCTGCGATAAGTCGATTGCCGAGCACCACAAGACCTGGATGCTGGAGCACGGTGAGTGGCGCGCGATGATCAGCGACGGCACGGGCAAGACAGCGGGGTTTCACCTGTCGTCGCTTTACAGCCCGGTTGGCTGGCGCGGTTGGCGCGACATTGCTGCCGCGTGGGAAAGCTCTGTGAACAAGGAATCGGGGTCGGCGGCCGCCATCAAGACCTTCAAAAACACCGAACTGGGTGAAACCTGGGTTGAGGAAGGCGAAGCGCCAGATTGGCAACGGCTGGTCGAACGCCGCGAGGACTACCGGGTTGGCACGGTGCCGCCGGGTGGGTTGCTCCTGGTGGGCGCTGCCGACGTGCAGAAGGATCGCATCGAGGCGTCCATCTGGGCCTTCGGGCGCGGCAAGGAGTCCTGGTTGGTCGAACACCGCGTGCTGATGGGCGACACCGCCCGAGACGCCGTGTGGAAGCGACTCGCCGAGTTGCTCGCCGAAAACTGGACGCACGCCTCGGGCGCGGCGATGCCGCTGGCCCGTTTCGCTCTGGACACCGGCTTTGCGACGCAGGAGGCCTACGCCTTCGTGCGGGCCTGCCGTGACCCGCGCGTGATGCCGGTCAAGGGGGTGCCGCGCGGCGCGGCCCTGATCGGCACGCCGACGGCCATCGATGTTTCGCAGGGCGGCAAGAAGCTGCGCCGGGGCATCAAGGTGTTCACGGTGGCGGTTGGCATCGCCAAGCTGGAGTTCTACAACAACCTGCGCAAGGGCGCGGACGTCAGCGAGGACGGCGTGACCACCGTCTACCCGACGGGGTTCGTTCACTTGCCAAAGATTGACGCGGAGTTCATTCAGCAGCTCTGCGCCGAACAGTTGATTACCCGTCGCGACCGCAACGGCTTCCCGGTGCGCGAATGGCAAAAGATGCGCGAGCGCAATGAAGCGCTCGATTGCTACGTGTACGCCCGCGCGGCCGCATCGGCGGCGGGCCTGGATCGCTTCGAGGAACGCCACTGGCGCGAACTGGAACGCCAACTCGGGATGGAACGGCCACCGGATGAGCCACCCCCGATTCAAGCATTCGACCCAAACGAGGCCACCCAACGCGGTGGCCTCTCTGTTTCTGCAAACCCACCACGGCGGCGCGTCATCAAGAGCCGCTGGTTGTCCTGATTTTCAGAGGAGTTTTCATGAGTCTTGCCACCCGTATCGAGAGCCTGGTCATCCGGGTTGCCCAGGAGTTCAACGACGTCCGCGCGACGGCAGGCAGTCTGGCCAGCCTGTCCACCAACGACAAGTCGAGTCTGGTCGCCGCCATCAACGAGCTCAAGGCAGCGGTTCTGTCCGCGATGGCCATCGATGACAACCAGATCGCCACCACCAGCACCTACTCGTCGAACAAGATCGTGTCGCTGCTGGACGCGCTCAAGACCGACATCCTGGGCGGAGCCGATGCTGCCTACGACACCCTGGTGGAAATCCAGCAGGCGCTGCAGAGCGGTACCAGCGGCCTGGACGCGATTCTGGCTGCGGTCAATCTCCGTGTCCGCTTCGATGCGGCGCAGACCCTGACCGTGGCCGAGCAACTGCAAGCACGTACCAACATTGGTGCGGTCGCTGTCAGTGATGTCGGCAACACCGACACCGATTTCGTCGTGATCTTTGACGGCGCGCTGGCCTGATGAGCCTCGCTTCCAGCATCGCCGCTTTGGCGGCGCGCATCGGCTTCGAGGTCAAAACCAAGATCGACGCCACGCATCCCGGCATTGCCCGGGTGTGGGTCAGCTTCGGCTACGTGGGCGGTCAGGTCGTGATCGCCAGCGCGCACAACGTCGCCAGCGTGGTGCGCACGGCGGCGGGCCGGTACCGCGTGCATTTCGCTGTGGCGATGCCGGATGCGAATTACTGCTGGACGGCGCTCGCGCGCAGCAGCACCAACACCGGTCAGCAGCGCTTGGCCCTGGTACGTGCCAGCTCCGACCTGAAGACCGCGCAGTACGTCGACGTCTCGTGTGCGACGGCCGCGTCGTCGTTTGACGACTCCTCTGAAATCAACCTCGTGGTGTACCGCTGATGGCCTACACAGAAGCCCAACTCCAGGCATTGGAGACCGCGCTCGCCAAGGGCGAACACCGCGTCAGCTTCGGCGACAAGACCGTCGAGTACCGCTCGGTCGATGAACTGAAAGCTGCGATCCGCGAAGTCAAGCGCGGCATCCTGGAGCAGGCAGCCGCCACCGGACTATGGCCGGGTGCGCCGCGCCAGATCCGGGTCACGACCTCGAAGGGGTTCTGATGGCCTGGTATTCGAAGATCCGAAGCCTGTTCGGCCAGCAACCCGTCCACGAAGCGGCTGGCCGTGGTCGCCGCTCGTTGGCTTGGATGCCCGGCAACCCGGGCGCGGTCGCCGCGATGCTGGCGACCAACACCGAACTGCGCATCAAGAGCCGCGACCTCGTGCGCCGCAACGCGTGGGCGCAAGCCGGTATCGAGGCCTTCGTGTCCAACGCGGTCGGCACTGGCATCAAGCCGCAGAGTCTTGCTGCAGACGAGCGCTTCAAGACCGACGTGCAGGCGCTGTGGCGTGACTGGACAGAAGAAGCCGACGCCGCAGGACAGACCGATTTCTACGGCCTGCAGGCATTGGCCTGTCGCGCGATGCTCGAAGGCGGTGAATGCCTGATCCGGCTGCGCCCGCGCCGCCCGGAGGACGGACTGGTCGTTCCTCTGCAGCTTCAGTTGCTGGAGCCCGAGCATCTGCCGATCAGCCTCAACCTCGATCTGCCTTCGGGCAACGTGGTGCGCTCTGGCATCGAATTCGACAGCCTCGGGCGGCGCGTCGCTTACCACCTGTACCGCTCGCACCCCGAAGACGGTCGGCTGGCTCCGATGTCGGGCCAGGGCGGGATGGACACGGTGCGCATCGATGCGAAGGAAATCATCCACCTGTTCCGCGTCCTGCGTCCCGGCCAGATCCGGGGCGAGCCGTGGTTGTCGCGGGCCCTGGTCAAGCTCAACGAACTTGACCAGTACGACGACGCAGAACTGGTGCGCAAGAAGACCGCCGCGATGTTCGCCGGGTTCGTGACACGGCAGAACCCGGAGGACAACCTGATGGGTGAAGGTGCGGCCGATGGCGATGGCATTGCGCTCGCCGGGCTGGAACCGGGCACTTTGCAGATTCTGGAGCCCGGCGAGGACATCAAGTTCTCCGACCCGGCCGACGTCGGTGGCTCGTATGGCGAGTTCCTGCGCACGCAGTTCCGCGCGGTCGCCGCTGCCATCGGTGTCACCTACGAGCAGTTGACCGGCGACCTCACAGGCGTGAACTACTCGTCCATCCGCGCCGGGATGCTGGAGTTTCGGCGTCGCTGCGAGATGGTGCAGCACGGGGTGCTTGTGCATCAGATGTGCCGTCCGGTTTGGGCCGCGTGGATGAAGCAGGCAGTGCTCGCCGGTGCCATCGATGCTCCCGGCTTCGCGCGTGGCGGCCCAGCCCGTCGCCGCCGGTACCTGCAGGTGAAGTGGATTCCACAGGGCTGGCAGTGGGTCGATCCTGAGAAGGAGTTCAAGGCCATGCTGCTGGCCATCAGGGCGGGACTGATGAGCCGCTCGGAAGCCATTTCCGCCTTTGGCTACGACGCCGAGGACGTTGACCGCGAGATCGCCGCCGACAACCAGCGCGCCGACGACCTGGGGTTGATCTTCGACTCCGACCCGCGCCGCACCTCCAAGGACGGCGGAAGCGCCGAGCCGAACAAGAACGCTGCCGACACCACGCAAACCGGCAGCTCATCGTCTGCCTGAAGGATTTCCATGACCCTGTTGCCCCATTTGGCGGCGCGCCTCTACGGTGTGCCGCTGGCGATCCATCGCCCAAAACTTGACGTGATCCTGGCCGTGCTCGGCCCCCGGATCGGCTTGGCTGATTTGGCTGCACCCTCGGGCTTCACGCCGCCCGCACGTCCCGCATCCACCCAGACGACGAAGGTCGCGGTCATCCCCATCCACGGCACGCTGGTGCGCCGCACAGTGGGCCTGGAAGCCGAATCCGGCTTGACCAGCTACGCAGGGCTGACCGCGCAGTTGGACGCCGCGCTGGCCAGCCCGGATGTCGCTGCCATCCTGCTCGATGTCGACTCACCGGGTGGCGAGTCGGGCGGCGTGTTCGATCTGGCCGACCGCATCCGTGCGGCTGCTAAGACGAAGCCGGTCTGGGCTGTAGCCAATGACATGGCGTTCTCGGCAGCTTACGCCCTGGCGTCTGCGGCCAGCAAGGTGTTCGTGTCGCGCACCGGCGGCGTCGGCTCGATTGGCGTCATTGCGATGCACGTCGACCAGTCCGAGAAGGATGCGCAGGACGGCGTTCGGTACACGGCGGTCTTTGCGGGCGACCGCAAGAACGATCTGAACCCACACGAGCCGATTTCCAGCGAAGCCCACGCCTTTCTCAAGGGTGAGGTGAATCGCGTCTACGGCCTGTTCGTCGAGACGGTGGCCCGCAACCGTGGCATCGAGGCATCTGCCGTGCGCGACACCGAGGCGGGGCTGTTCTTCGGGCAGGCCGCCGTGGCTATCGGGTTGGCCGATGCCATCGGCACCTTCGACGACGCCCTTGCGCAGCTTTGCGAATCCGTTTCCCCACTCCCGAAGTTGGCGGCAAGCCACTCCGGTCTTTTTAGCAACCCCCAGATGGAGTCATCAATGAATGATCGAACCGACCCCGCTGCTCCTGATCGGCTTGCTGCTGATCCTGCTGGCAGTCCTTCTCAACCGGCGGCCGCCACCGCCATGACCGTGGCTGACGCGATTGAGGTCGCCCAGACCTGCACCCTGGCCGGGCGCACCGACCTGATCGCGGGCTTCCTCGAAGCGAAGGCACCACCCGCCAAGGTACGCAGCCAGTTGCTGGCCACCCAGGCCGAAGCCAGTCCCGAAATCGTCAGCCGCATCGACCCGCAGTCGGCCATGTCGGCGAGTAGCACTGGCCATCCTGCCTCTTCCCACAACCCTCTGATCCAGGCCGTCAAAAGTCGCCTGGGCACAAAGTAACCCAAAAAGGAGCATCCCGTGCCCGCAATGCAAGAACCAATCAACCTCGGCGACCTCCTGAAGTACGAGGCGCCCAATCTCTATTCGCGCGACCGCGTGACCGTGGCAGCTGGCCAGACCTTGCCGCTGGGTACGGTGCTCGGGCAGATCACGGCGACGGGCAAGGTCAAGCAGATCGACCCGTCGGCCACCGATGGCAGCCAGTACTCCGCTGGTGTGCTGATGCAGGACGCCGATGCTGCTCTCGCCGACCGCAACGACGGGCTGATGGTGGCGCGTCACGCCATCGTGTCAGACCACGCACTGCATTGGCCCACCGGCATCACGACTGCGGAGCAGCAAGCAGCGATCCAACAACTCAAAGCACTGGGCGTCCTGGTGCGTATCGGCGCCTAACGCCAAGGAGACTCAATATGCAAAACCCATTCATCAGTCCGGCATTTTCGATGGCATCAATGACTGCAGCCATCAACTTGATCCCCAACCGCTACGGACGCCTGGAGGAGTTGAATCTGTTTCCGCCCAAGCCGGTTCGAACGCGCCAGGTGATTGTTGAAGAACGCGCCGGTGTCCTGAACCTCCTTCCGACCCAGCCGCCAGGCTCTCCGGGAACAGTGAATGTGCGTGGCAAGCGAACCGTCCGGTCCTTCGTCGTTCCGCACATTCCGCACGACGACGTTGTGTTGCCCGAAGAGGTTCAAGGTCTACGTGCTTTTGGCAGCGAAACCGAAATGGAGTCGATTGCCGGAGTGCTGGCCCAACACTTAGAGACGATGCGCAACAAGCACGCCATCACCCTAGAGCACTTGCGTATGGGGGCGTTGAAAGGCGAGATTCTCGACGCCGACGGCAGCCGTATCTACAACCTGTTTGACGAGTTTGGCATCGATCAACAGAGTGTGGACTTCGAAATCAGCAGCCCGACTACTGGCACTGATGTCAAGGGCAAGTGCACTGATGTGTTGGGCATCATCGAAGAAGCCCTTCTCGGCGAGTTCATGACGGGAGTCCACTGCTTGTGTTCTCCAGAGTTTTTCAAGGCATTGACCGGCCACAAGGATGTCAAGACTGCCTTCACGAACTGGCAGCAAGGCGCCGTCCTTATCAATGATGTTCGCCGTGGCTTCACTTTTGGCGGCATCACTTTCGAGGAGTACCGAGGTAAGGCGACTGATGTCAACAAGACGGTTCGTCGCTTCATCGCTGCTGGCGAAGCACATGCGTTCCCTCTTGGCACTATCGACACCTTCGGAACTTACTTTGCACCGGCCGACTTCAACGAGACTGTCAACACGATGGGCCAGCCGCTTTATGCGAAGCAGGAGCCGCGCAAATTCGACAGGGGCACAGATCTGCACACGCAGGCCAACCCGCTACCGATGTGCCATCGTCCCGGGGTTCTGGTCAGGCTCGTCATGGGTGGTGGCGTATGAGTTTGGTCGCCCAGATCTATGAGTCGGCCGCGAACGCTGGGCTGCTGAAGGAATGCCTTTGGTATCCGTCGAACGGTGCGCCATCGCAACTACATCAGATCGGCTTTGCCGCGCCCGATGAATCACTGCTCGATGGCCTGGCCCTGAGCACCGACTACGAGATGACCTACCCGGTCACGGCATTCGGGGGTCTTGCAGTCCGCGAGGTTGTCGAAATCGGTGGCACGTCCTTCCAGGTGCGAGACATCCGATCGTTAAGCGACGGCTCCGAGATCCGCGCCAAGCTCACCCGGCTGTAAACCCATGGCAGATAACTCGATCCGCGAGCGGATTCTGCTGGCGGTGATGGCGGCTGCCCGTCCGGCGGTCGAAGGTCTCGGGGCCACTTTGCACCGGTCGCCCACGGTGGCCATCAGCCGCGAACTTTGCCCGGCGCTCGCGGTGTTTCCCGAGTCGGAGTCCATCACTGAGCGCGCCAACGACCGCGTCACACGCGAACTGACCGTTCGCGTTGTGGCTCTGGCTCGGGCCGTTCCACCCGCGTCCCCCGAAACCGAGGCCGACCGTCTGCTCACCGCTGCCCACGCTGCCTTGTTCGGGGACGGCACGTTCGGTGGGTTGGCGCTGGGCATCCGTGAACAAGAGAGCGAGTGGGAGGTCGAGGACGCCGACGCGGTGGCCGTGGCCCTCCCGGCGCGCTATCGGCTGACGTACCGGACGCTGGCCAATGACCTTTCAACTCTTGGATGACACCTATGACCCAACTTGTCCTGACGCGCCCGCACACCCACGCGGGCAAGACCTATGGCGTCGGTGACCGGATCGAGATCGACGCGACATCAGCCGACTGGCTGATCGCGCACGACATCGCCACGCCGGAGCCGACCGCCCCAACTGCTGAACCCGTCCCCGAACCCAAACCCCTCCAACGCAAGGAACCCAAGCAATGAGCACCTATGCCAGTTTTCAAGGCCGCGTCTTCCTCGGCAAGCGCGACACCGACGGCCTTCCCATCGAAGTGCGCTCGCCCGGCAACGTCGCAGAGCTGAAGCTCTCCCTCAAGACCGACGTCCTGGAGCATTACGAGAGCCAGACCGGCCAGCGCTCGCTGGATCACCGGATGGTCAAGCAGAAGTCCGCCACCGTGAACCTCACCATCGAGGAATTCACCAAGGAGAATCTCGCGCTGGCCCTGTACGGCAACCACGTCGTCGGCACGCCGGGCACGGTCACCGCCGAGCCAGTGGGCGGTGCCACGCCGATTGCGGGCGACCGCTACTTCCTTGCCCACCCGAAGGTATCGTCCTTGGTCGTGACGGATTCGGCTGGCACGCCCGCGACCCTGGCCTTGGGCACGAACTACACGGCTGATCCCGACTTCGGTGCCCTCCAGTTTCTGGATACCACCGGCTTCACTGCGCCGTTCAAGGCCAGTTACGCCTACGGTGTGGCCACCGAGATCGGCATCTTCACGCAGGCGCTGCCGGAACGCTTCCTGCGGCTCGAAGGCATCAACACGGCCCAGGGCAATGCCAAGGTGCTGGTCGAGCTCTACCGCGTGGCATTCGATCCGCTGAAGGAAATCTCCTTCATCTCGGACGAGTACAACAAATTCGAGCTGGAGGGATCGCTGCTGGCCGACACCACCAAGCCCTTCGACGCGGTGCTGGGCCAGTTCGGCCGCATCGTGCAACTGTGATGGGTGCCGCCATGAGTGATCTGGACACCCTGATTCCGCAGGCGGTCGAACTGGTGATCGACGGTGAGCCGCTGGCCATCAAACCGCTGAAGGTCGGGCAGATGCCCGGTTTTCTGCGAGCGATGTCGCCGGTGATGCAGCAGCTCACTGCCTCCAACATCGACTGGCTGGCGTTGTTCGGCGAGCGCGGCGACGACCTGCTGTCGGCCATCGCCATTGCCGTCGGCAAGCCTCGGGCGTGGGTCGATGAGCTGGCTGCCGACGAGGCCATCCTGCTGGCGGCCAAGGTGATCGAGGTGAACGCCGATTTTTTTACCCAGACGGTGATTCCGAAGCTCGACGGGCTGTTCGGCCAAGTGAAGCTGCCGCCCATCGTGAAAGCGGCGGCTGGTTCGATGCCGTCCAGCACCTGATCGAGCACGGTCACCGCTTGCCCGACATCCTCGACTACACGTTGGCGCAGGTGCGCGGCTTCGTCGTAGCGACGGCGCGCACCGATGCGGCCCGCGATGCACGGCTGCTGTCCGTGATTGCCATCGGCACGCGCAGCGATGCCCGCCAGCTCGACCAAACCCTCGACCGACTTACTGACAAGGCCACCGACCGTGCCTGATGACCATGCGCATTTCCGTCCAGATCGATAGCGCCGCAGCCCAGGCGCAATTGCGCCGCTGGGGCGGCGAATTCCGCGACAAGGTCAAGAAGGCGGTGTCGCGGGCGATTGCCAGCGAGGCGGTCGAACTCAAGCAGGACGTGCGCAGCCACGTCGCCAGCCAGATGGCCGTGGTCAAGAAGTCCTTCCTCAAGGGCTTCACCGCCAAGGTGCTGGACAAAGACCTGAACCGACTGCCCGCGCTGTACGTGGGTTCGCGCATTCCGTGGTCGGCGATGCACGAGACCGGCGGCCAGATTGCCGGGCGGATGCTGATTCCACTGAACGGTCGGGTGGGCCGCAAGCGCTTCAAGGCGCAGGTGGCCGAGCTGATGCGCGGCGGCAATGCCTATTTCATCAAGAACGCGAAGGGAAACATCGTCCTGATGGCCGAGAACATCAAAGAGCACGACCGGCCACTGGCGGGCTTCAAGCGCCGCTACCGCAAGGCAGAGGGCATCAAGCGCCTCAAGCGCGGCGCGGACATCCCGATTGCCGTCCTAGTGCCCAAGGTCGTACTCAAGAAGCGCCTCGATGTCGAGCGGCTGGTCGCGAGTCGCATCCCGCGTCTGGCGGCGGCCGTCGAGAATCAGATCAGTACGGTGGATTGATTCATGGCCAAGCGAATTTCCATCCTCGTCGCGCTCGAAGGGGCCGACGAGGGGCTCAAACGCGCCATCACGTCGGCCGAGCGCAGTCTCGGTGAGCTGTCGACCACCGCCAAGACCGCCGGAGCCAAGGCTGCCGCCGGAATGGCCGAGGTCAAGGCCGGGATGTCGGCCTTCGGCGATCAGGTGGCGACGGCCAAGACGCAATTGCTGGCCTTCCTATCGATCAGCTGGGCGGCAGGAAAGGTGCAAGAGATCGTCCAGATCGCCGACGCATGGAACATGATGTCGGCGCGCTTGAAGTTGGCGACGGCGGGACAGCGTGAATTCACGACCGCGCAAGCGGCCCTGTTCGACATCGCCCAGCGCATCGGTGTGCCGATTCAGGAAACGGCCACGCTGTACGGCAAGCTCCAGCAGGCAATTCGGATGCTGGGTGGCGAGCAGAAGGACGCGCTCACGATCGCCGAGAGCATTTCGCAGGCACTGCGCCTGTCGGGCGCTTCGGCCACCGAGGCGCAGTCCTCTTTGCTGCAATTCGGGCAGGCGCTCGCCTCTGGTGTGCTGCGAGGCGAGGAATTCAACTCCGTCGTCGAAAACAGCCCCCGTCTGGCGCAGGCACTGGCCGATGGCCTGAATGTGCCCATCGGGCGACTGCGCAAGCTGGCCGAAGAAGGCCGCCTGACCGCTGACGTGGTGGTCAACGCGCTGATGAGCCAGAAGGACAAGCTGGCCAGCGAGTACGCCCAACTGCCGCAGACGGTGAGCCAGGCCTTCGAGCGCCTGCGCAATGCCTTCGGGCAGTGGATCAACCGGGTCGATGAATCGACGGGTTTGACCAAGAAGCTGGCCGAGGCTCTGACCATTCTCGCCAACAACCTCGACACGGTCATGCAGTGGTTGAAGCGCATCGCCGAAGTCGGTCTGGCGGTGCTGATCTACCGCCTGATCCCGGCGCTCATCACCGCGTGGCAGACCGCCGGTGCGGCGGCCGTCACGGCCGCCAGTGCCACCGCTGCGGCGTGGACGACGGCCAACCTGTCGGTGTCGGCCGCCGTGGCCAGCGTCGGCTTGCTCAAGACGGCATTCGCCGTGCTGGGTGCCTTCCTGGTCGGCTGGGAGATCGGCACGTGGCTGTCGGAGAAGTTCGAGATCGTCCGCAAGGCGGGCATCTTCATGGTCGAGATGCTGGTCAAGGCGGTCGAGCAGTTGCGCTACCGCTGGGAGGCATTCGCCGCCATCTTCACCTCGGACACGATTGCCGAGGCGACCCAGCGCCACGAGGCCCGTCTCGCGGAGATGAACCAGATCTTCGCGCAGATGTACGCCGACGCGACCAAGGGGGCGGATGCTGCCAAGGGCGCGATGAATACCGCCGCGACGGCTGCGGAGGAAATCGCCAAGCGGCTCGAAGCCGTGCGTCAGGGCACGCAGGAGGCAGTCGGGCGCGGTATCGAGGCTGTCCACAGCGCCCTGGAGAAGCTGAAATCCCGCCTCGGTGAGGTTGAGCAGGCTGTCGGCAAGGCCAATCAGACAGTCAACGACGCCACCGCCAAAATGGCCGAGGCCTATAAGGGCCTGACGTCCATCGTTGAGGCCAACCTGCTGCGCCAGATCGAAGCGGTCAAGGCGCGCTATCAGCAGGAACAGTCGGCGCTGGAGACATCCAAGCAGTCCGAAGCGGCGCTGATCACCAAGTCGACACAGTTGCTGACGGAAGCCCTCACGCAGCAGACCACGTTGCGGCGGCAGTCCACGACAGACACGCTGAAGCTCATTGACGATGAGTCCAAGGCGCGGATCGAGTCGGCCCGCCGCCAGGGTCAGACGGAAGAAGAGCGCCGCGCCAACGTCCAGCGGGTCGAAAACGACATCCTGGCCACCAAGCGCCAGACGATGACGCAGGCGCTGGCCGAGTACCGGCAGCACATCGATGCGCTCAACGCAGAGGCCAACCGGCATCTGACTGAGATCAAGCGCATCGAGGAGGAGAAGCGCCAGCTCTCGATGACGACCGAGGAACGTGTCCGCGACATCCGTCGGCAGGGCATGACCGATTTCGAGGCGACGGAAGATCGCAAGCGCCAGATCGCCGAGTACCAGGGGAAGGCACGTGAGGCGCTGGCCAACGGCGAGTTCGAGCAGGCTCGGCAACTCGCCCAGAAAGCGATGGACTTGGCCGCACAGGTGGCCAGCTCGCAAACCAGTGAAGCCAAGCGCGGCGAAGATGCCCGCAAGCAGTCCGAGCAGGCGGTTTCGCAGGTCACCCAGCTCGAATCGCAGTCACGCGATGCCTATCGCAAGCAGGAATACGCGCAAGCCGAAGCCCTGATGCGCCAAGCGGACGCATTGCGCGCCGAACTGGCCCAGAAGACCAAGGATGCCGACGCACAGATCGCACAGGGCAAGGATGGCGTCAATCAAGCCATCCAGCGCATCCGCGAGTCCGAGGAGATTCTCAACAAGACCCTGGATGCCGAAGCCAAGGCGCACCAGACCGCCGCGCAGTCGGCATTGACCGCGCGCGACCAGATCCAGCAGACCCTCACCCAGACCGAAACCCAGATCGACCAAATCACAGCCAAGCTAAAAGACGGTCTGAAGGTCACGCTGGATGCCGACACGACCCGCTTCGACAAAGCCATCGCTGATCTCGACAAGGCCCTGGCAGAAAAAGAGTACCTGCTCAAGATTCAGGCCGACTTGCAGGAGGCCGAGAAGAAGCTGCAGCAGTACGAACAACTGCTGAAAGAGGGCAAGACACTCCCGGTCGATGCCGACGTGTCCAAGGCCAAGGAGGCGCTGGACAAACTCAAGACCTACGCCGACCAGAACTCGCAGTTCGAACTGAAGGTGGCGACCGAGAAGGCGCAGGCCGCGATCACCAAAGTCGAAGGGATGATCAAGGCGCTGGACCGCATCCAGACCGAGTCCCGGCATCAGGTCAGCACCAATGCCGACGCAGCCCGCTCGGAAATCATGAGTCTCAACTGGGCCAACACCTCGAGCACGCACACGATCTATGTGCGCAAGGTAGAGGCAAACGCGACTGGCGGTTTGGTGGGCGGTGGCGTGCGCCGCTACGCCGATGGCGGCGCTGTGGCCCCGGCCTTTCCTCGGATGAGTGGTGGCTCGGTTCCGGGCTCGGGCCACCACGACACCGTGCCACGCACCCTGGATGCCGGTGCCTTCGTGATTCGCAAGGCGGCGGTGCAGAAGTACGGCGGCGGCGCGCTCTCGCGTCTGGCCAATGGCGTGGCACGGTTTGCCACTGGCGGCGCGGTGATGCTGGGTGGCGGCAAGCGCCCATCCGGCAACGATGCTGATGGCACGCCCAGCACACCAAAGAAGAACCGGGAGGCAGTCGAGGCGATGAAGATGATCGACCTCGGCCTGCAGGGGATGAACGAGTACACCAATTGGCTCCAGTGGAACTACGGTGCCTCGGTCAGTCTGGATATGCGTAGCAAGACGATGGATAGCTACGGCAAGCAGGCCCAACAGGATCGGCGCGCGCTGGAGGACTTCATCAGCCGCAAGACGCTCACCGGCAACGAGCGCCAGAACCTGGAGCGCATCAAGCAGACGTGGCGGCAGGCAATGGCCCAGCCGCTGCTTTGGGGCAAAGACCTAGAGCGCGAGCTGATCGACTATATGGAGCAGAACCAGGGCGAGTTCTACCGTCGCGGTGGCATGGCCAAGTCCGACACCGTCCCGGCGATGCTCACGCCGGGCGAGTTCGTCGTGAACAAGGATGCCGTTTCCCGCTACGGCGCTGGCTTCTTCGAAGCGATCAACAACCTGTCTGCCCCGGCACAAGCTCTGGCCGGTCGCGCGCTTGCGGGCGTTCAGGGCTTCGCCACCGGCGGTCTGGTGCAGCCAAGTGGCTCGCGGTTGGCCCGACCGGTGTTGGCGGCCGATGCCGGGCCCAGCCGCACGGTACGCGTGGAACTGTCCTCGGGGCAGCAGAAGGTCAATGCCACCGTCGACGCACGAGACGAGTCTCGTCTGCTGCAACTTCTGGACGCTGCCCGCGCCCGCACTGCCTGAAGGATTCCCGATGCAACTGACGAACCTCGATGCCGGGGTGGCTTTGCCATTGCCTGACGATTTGCTGTGGAGTGATGAGCACGCGTGGTCGCCCGCCGTGGCGACCACGTCTTACCTCATCACCGGAGCCTTGCTTATCCAGTCTGCCACCCGGCAAGCCGGTCGCCCCATCACGCTGGTGGGCGCACCCGATATGGCCTGGGTGACGCGGGCCACGGTCGAGCAACTGCAGGCCTGGGCCGCGCTTCCAGTGGGCAGCGCCACAGGTCGCTTCGGCTTGACCTTCTCCGATGGCCGCTCGTTCACCGTGGCATTCCGCCACGCAGAAACGGCCATCGAAGCCGAGCCCGTGCTGGGCATCCCGGCCCGTGCCGCTACCGACTTCTATCGCCTGACCCTTCGATTCCTGGAGATTTGAAATGCCGATCCAATCCGGCGACGTGAAACTGCTGAAGTCCGCCGTGATGGCGGATGTGCCCGAGGGCGGTGGCGCGCCCACGGGCAACACCATTGCCGATGGCGTCTCGAACGCCATCTTTCCTGACATCTCCGAGCTGGATCGCGCCGGGGGTCGGGTCAACCTGCGCAAGTCCTTCGTGTCGGTGCAGACCGACGACACCGACACCTACTTCGGTGCCAACGTGATCGTGGCAGAGCCGCCGCAGGATGCGCGCGTCAGCGTCACGCTGTTCAGCACCGAGAAGACCTTCGACACCCGCGAGCAGGCGCAAGTCCGCATCGAGGCCTACCTCAACAAGGGCCCGGAGTGGGCTGGCTACCTGTTCGAGAACCACATCGCCGGTCAGCGGGTGATTCAGCTTTTCCAGCGCACCACCGACACCGTTCCCAATGTCGGCCAGACCTTGGTCTTGATCGAGAACGAGGGCCTGGGCACCCAGAAGGAGCAGTACATCCGGGCCACCTCGGTGTCCGTCGTCGAGCGCACGTTCACTTACGACGGCGACAAGGACTACAAGGCCAGCATCGTCACGGTCGACATCAGCGACGCACTGCGCTACGACTTCACCGGCTCGCCTGCAAGCCGCACGTTCACCCGGGCCGCGAACAGCACCAAGACGCGCGACACGGTCGTGGCGGACGCCGGAACCTACGTCGGCGTAGTACCGCTGACGCAGGCCGCCGCCGTCGGCGACTTCACGATCAAGGGCACCTCGATCTACACGCAGCTGGTGCCAAGCGCGCAAACCGAGACGCCCATTTCCTTCGTTCCTCCCTACGCGGCCGCCGGACTGCCGGTGCCGGGGGCCGTCGCGGTGAGCTACACAGCCAGCCACGCGTGGACGACCAGCATCAAATTCAATCTCCCGGGCGGTTGCTTGCCGGGGTCACTGACCATCGGCACGGACGGCATCACGATATTTGACGACGCGGGCCTGCTCAAGACCGCCAGCGGGACGGTCGGAACCATCGACTACGCCAACGGCATCCTGACCCTGAACTCGGGGACGATGTCGAACGCGAAGGCCATCACCTACACGCCCGCCGCGCAGATTCTGCGTGCTCCGCAAAGCTCGGAGATCCCGGTCACGCCCGAGTCGCGCAGCCAGTCCTACGTGGGCACGGTCAACCCGGTGCCGCAGCCCGGAACGCTGTCCATCAGCTACATGGCCCAAGGGCGCTGGTATGTGCTTTCCGACAGTGGCAACGGCTCGCTCAAGGGCCTGGACGCCAGCTACGGCGCGGGCACCTTCAACAGGAATACCGGAGCCTTCGTGGTCACGCTGGGTGCGTTGCCCGACGTGGGCAGTTCGCTCGTGCTGACCTGGAACGTGCCGACGCAGGAGACGCAGCAGCCATCCACCACCCTGAAGGCCACCCAGAGCCTGGCATTGAACCCACCTGCAGGGACGGCGGTGCAACCCGGGTCGCTCACCGTGTCCTGGGAGTACGGCGGCACCAAGACCTCAACGGCGGCCACGTCGGGCGTGCTGTCGGGTGCCGCCACGGGCAGTCTGAGCGTGGCGCAGAACCGCGTGGACTTCGCGCCCAATGTGCTGCCAGCGGTGGGCACGCAACTCACCGTGAGCTACGTCGCGGGCCCGAAGCAGGAGGACTCGTTTGCTCACCCCTCCCGCAATGGTGCGGGGACGCTGCCAGTCACCGCGACCTTGGGGGCCATCGAACCGGGCTCGCTCGAAGTCGAGTGGAACACGTTCACCGACGAGGCGGTTCTCGGTGCGTACACCTTCGCTCAATTGCAGGAGATGGGTATCGCCGTCTCGATCTGGCGCGACCCCACCCAGATCGCCCGAGATGACGGGAACGGCGGTGTAGTGCTGAACGGGATCTCGATTGGCACCGTCAACTACGCAACCGGTCAGGTGACCTTCAATCCGGATGTCTCGATCCGTATCCCACGCCCGGTCTACACGGCAGTCGCCATCAACGGCACCGGTCGGTGGCGATTGAACTACGGCGGCATCGCCTACGTCGATGCGCCATCGCTGTACCCCAACGACGAATCCGGCTACGTCAAGCTGCGCTACAACAGCGCGGGCTCGACCAGCAACCAGACCGAGACGTTCCAGTTCCTACCGGCCTTCAAGCTGGTACCGGGGGTGAATGCCCAGGTGGTGACAGGCACGGTGCTTCTCTCCATCAGTGGCGCGCAGCCTTGGGGCGACAACGGCCAGGGCACCCTGCGCGAGTTCACCACCAGTGGCTGGGTCACGCGCGGCACGATTAACTACCTATCCGGGGACGTGGCGCTGACGTCCTGGACGGCGGGCACGAACAACGCGATCACACGAGCCAGTTGCGTGACCACGGTCGGCGAGAACATCTCCAGCGAGTTCGTGTTCCGAACTGGCGCGGCACCGCTTCGTCCTGGGTCGCTGTCGATCCAGTACGCCCGCGCGGTTGGTGGCACGCAAAACGTGACGGCCGGGATTGACGGCAAGATCGAGGCAACCGGCATCAGCGGCAGCGTCGACTACGAGACCGGTCTGGTGCGCGTTCGCTTCGGAACGATGGTCACGGCGGCCGGGAACGAGAGCCAGCCTTGGTACGCCGCCGACCGGGTGGGCACGGACGGCAAGATCTTCCGACCCGAGCCGGTGGCCGCATCCAGCGTGCGTTACAGCGCGGTCGCCTACAGCTATCTGCCGCTGGATGCTGATTTGCTTGGCATCGATCCGGTGCGCCTGCCCAGTGATGGGCGGGTGCCGATCTTCCGCCCCGGCGGCTTCGCCGTGGTGGGCCACACCGGCAAGATCACCTCCTCGGTCAGCAACGGCCAGACCATCAACTGCGCTCGGGTGCGCCTGTCGCGCGTGCGCGTCGTCGGCCACGACGGAGCGGTGATCCACACCGGGTACTCCACCGATCTGGAAGCGGGCACCGTCACCTTCATCAACGTGTCGGGCTACAGCCAGCCCGTGACCATCGAGCACCGGATCGAGGACATGGCCGTGGTGCGGGATGTGCAGATCAGCGGCGAGATCAGTTTCACGCGCGCCCTGACGCACGAATATCCGCTGGGGAGTCACGTCTCCAGCGCCCTGGTGGCCGGTGACCTGTTTGCCCGCGTGAATCTGGTGTTCGACCAGTCAACGTGGAACGGCGCGTGGTCAGATGCCTTGTCAGGCAGTTCCGCAACAGCAACGTTCAACAACACGCAGTACCCGATCCGCGTGACGAACCGGGGGGCACTGACCGAGCGTTGGATCGTGCGCCTGACCAACAGCACCTCGTTCGAAGTCATCGGCGAGAACGTCGGCGTGATCGCCACGGGCAACACCAGTGCGGATTGCGCGCCCAACAACCCGGCGACCGGCGTGCCGTACTTCCATCTGCCCGCACTTGGCTGGGGCAATGGCTGGGCCACCGGCAACGTGCTGCGCTTCAACACCATCGGCGCGCAGTTCCCGGTCTGGGTGGTGCGCACCGTTCAGCAGGGGCCGGAGTCCGTGCCCGACGACAACTTCACGTTGCTGATTCGCGGCGACGTGGACACCCCCTGATTTCGTAGACAGGAACCCATGAAATGATCGACCTGACCGTCAAATACTTCAACAGCGGCATGACCGGCGCGCCACAGATCTCCAACAACTGGGGCGATCTGGTGACGATGCTCGATGCCTGCCTCGTCAATGGCTTCGCGCTGAAAGCCATCGACACCTTGACCTTCGCCGATGGCATCGCCACAGCCACCATTTCCACCGGCCACGCCTATCGGCCTTTTCAGGTGGTCGAGATCGCTGGAGCCGAGCAGCCTGAGTACAACGGTTCATTCCGTGTGCTGTCGACGACCACGACCGCCTTCACCTATGCGGTGACCGGAGCGCCGGTGTCGCCCGCGACGACGACCACTAACCTGAGCGCCAAAGTGGCTCCACTTGGGTGGGAGAAGCCGTTCGCGGGGACGAGCAAGGCCGCCTATCGCAGCAAAAACCCACAGTCGCCGCAGAACATCCTGCTGATCGACAACAGCCTCAAGACGCCCAACTACACGACGGGGTGGGCGAAGTGGGCCAACGTCGGGATCGTGGAAGACCTGTCCGACATCGACACCATCGTTGGCGCACAGGCTCCGTATGACCCGAACAACCCGACGCAGAACTGGAAACAGGTCACCGCTAGCCAGTGGGGTTGGTACAAATGGTTCCACGCACGTGGCCCCCAGTACGAAAGCAACGGCGACAGCGGCGGCGGAGGTCGCAACTGGGTGTTGATCGGTGACGACCGTCTGTTCTTCCTGTTCTGCACCAATGCAGCGGGCTACGGCTGGTATGGCCGCAACAGCTATTGCTTCGGCGATCTGATCAGCTTCAAACCCGGTGACAACTACGCGACGGTGCTGGCCGCCGACGACAACTACTCGGGGATGAGCAACTACTGGAGCTATCCAGGGCAGTTCAGCGGCTACGGGCTGGTTTCGTCCCTGGACTTCACCGGCAAGGTGCTGCTGCGCAATCACACCCAACTCGGCAATCCCGTCCGGTTCGGACTCACGTCGCTGAACACCAACAACGGCCAGCAGATCTGCGGTCGGGGCCCGATGCCGTTCCCGAATGGAGCCGACTACAGTTTGTGGCTGTTGCCCACCTACGTGCGGCAGGAGGACGGCCATATGCGCGGCATCCTGCCCGGGATGCTGTGGATGCCCCAGGACCGGCCCTATAGCGATCAGACCATCGTGGACAACGTGGTGGGTCAGGCGGGTAAGCGCTTCTTGCTGGTCAGGACGCAGTACAGCTCGGAAACCGAAGGCGCACAGATCGCGTTCGACATCACTGGCCCGTGGAGGTAAGCCATGAGCTACCCGCTGAGCGAGTCTTTCGCCACGGCTCCTGCGACCGGCTACACCGCCGTCCTCGGCGGAATGGCCGCGACACACAACAACGTGCAGCAGTCCATCGATATCTCGGCCCCCAATAGCCAGTCCATCCTGCGCTTCAACGAAACCGCCCACGGTGACTTCTGGTTCGAGGCGGATGTTGAGTTTCTGACCGACCCGAGCGCCCGCAAGCACATCGGCCTGTGGATGACCACCGGCAACGGTTCCGAGGGCTACCGGTTCGCACATATTGACGGTGCCTGGAGCGTGACACGCTGGAACAGCGGCTTTGGCGACGGCGCGGCAGTGACGGGCGGTGTCAACGATGGAGCGAAGCCGGTCGCGGGCGTTATAGACGTGGCCCCGACCTTCAACGTCGGCCAGCGGATGCCCCTGCGCTGTGAGGTCATCGTCGGAGCCTTTGACGCCAACGGCGTTCCGTGGGCGCGCCTGATCCAGTTCAAGGCCGGTGGGGTGCTGATGTTCCAGGTCGGGGATGCTGCCTACAGGGGCAAGCTGATCCCGGGCGTGTTCCTGTATGGGGCCACGGCGCGCGTCCACGCGATTGCGGGTGACACGCCGTCAGGTCTGCCCGCGTTTCCGGCAACCGTGGGCGTGAACGCCGCCGATGACCTGCTGCCGCTCGCGGGTGGGTCGACTTCGGTGCCCCCTGATCCGGCCGCCAACATCGCCGTCAACGCCGACTGCGACCTGATGCGCTTGAACAGCCCCAACTCTGAGCTGTGGAACCGGGGTGGTGGCTACGACTGGCACTTTCACGCGATTCCGAATGGCCGCAAGAACATCCACTTCAGCGGCCACGGCTTCATCGCCGGAACCGTCAAGGAGAAGGGCCAGCCCGACCAGCCCCTGGTGCGGCGGGTGCAACTGGTCAGCGAGAACACCCGCGTCCTGGTGGCCGAGACCTGGAGCGACACCACGGGCGCGTACCGGTTCGAGCTTATCGACCCGGCCCAGAGATACACCGTGGTCAGCTACGACCACAAGCAGATGTACCGCGCCGTGATCGCGGACAACCTTCATCCGGAGATGATGCCGTGACCGTTGCCATCACTGTCGAACACAACGAGGCGCGACTGGCGGGCACCCTGGCATTCCTGGATGCCGGTAGCAATCCGGCGCGTCTGCGCATCTACGGCGGGACACGACCCGCCAACCCGGCCACGACGCCGACCAGCGCGATGCTGGTCGAGATCAGGCTGACCAAACCCGCAGGCACGATTGCAGGTGGACTCTTGACGCTGACGCAGCAAGAAGACGGGCTGATCACGGCGACCGGCATCGCCACCTGGGCGCGGCTGGTCAACGGCAACGAAGTCACGGCCCTGGATCTGGACTGCAGCGGTACCGACGGCAGCGGTGACGTGAAGCTGGCCAGCACCAACCTCTATCTGGGCGGCGATGCCCGGATGGTGTCTGCGATCCTGGGGTAGTCCGTGCCTGCCGTTCTCAACGAGGTGACCCTGGTCGCCGCGTTGCCTGCGCCCACCGCCAGCGTGGCGGTCGGGCCTCCGCTGGTCGATCTGCTGTTTGACCAACCGGCTGCCACCGACGCCAACTTGGTGTTCGGGGCCAACTACATCGCGCCGCGCGACGACGTCGTGGTGCTGGCCAGCCTGCCGTTGCCGGTCGTGGCGATCAAGTTCATCCCGCCAGCGCGGGCCGCACTGCTGGCCGAGCTACCTGCATTGACGGTGACCACGCTGTTGCTGCGCCCGAGCGTCCCCTTGGACGTGACCGGTGCAAGTCTTCCTGGTGTCGTGTTCTCCGGCGAGGTCAGGTACTACTCGCGCACGCAGCGACCGACAGTCGGCCAGACCGCACACGCTTGGCAGGTGGCAGCGCAGACGGAAGATGGTTCGACACAGGGCCAGCAGGACGCTGCCGCTACACCCGCAGGCTGGGACACGTTCTGGCGACGCACCTTGGGTGTTCCTCAAGGCATCGAGCACAGGTTGCCGCCGGTGCTGGCGGCAGCGCCCGAGCAACGAGGCGCTCGCCACCAGGATGCGACCCGGCTGCAGGATTCGACGTGGTTTGCGCACCAGGACGCCACGCGTTTTGCGGCGACCCGACAAGGTCTGTTCCAGAACGCAGGCCCGTTGCGGGACACCACGCGATTTCGGCATCAGGACGGCGACCGCACCAAACGCGCGGGGCGGGTGAGCTTTTGGCAAATCGCGCGTCTGCTCACCGAGCGCCAGGGGAGTGATTTTCAGATTGCCAGCCCGTCACTCAAGGGCTGGAGTGTCCGGTATCAGGACGCCGTGCCGCCACCGCTGGGGATCAGCGTCTGGGTGGTTCCACAACCGCCAGCGCCGATACCTTGCTACACGCCGAGCGCGCATCTGCTGTTCGCCGCTTTGGCCCCAGCGGACAGCCACTTGCTGTTCGTCTGTGAAAACCACATCAACCCACCGCCTCCCGATGGGGAGCCGGTGGTCGTTCCTGTTCGGAGGGTCTATTTCGTGATCAACAACGTGACCCTGTACCGCGTGTCCGATGGCGCGCCGGTGCCGGTGTTCAACCTTTCGCTGTCGCTCGATGCATCGTCCTGGGCGTGGGGCTTCGATGCGGTGCTCCCTGCGAAAGCCGAGGCGCTGGTCGCGGGCAGCGCTTCCGGGCCCGTCGAACTCGTGGCCAGCGTCAACGGCACCCCGTTTCGCGTGCTGGCCGAGAGCATCAGCCGCGAGCGCATCTTTGGTGACGCCAGCATCCGCATCTCCGGACGGGGGCGCAACGCCGTTCTGGCCGCGCCCTACGCGCCGGTGATGACGTTCTCGAATACCGAAGGCCGCACTGCTCGGCAGTTGATGGACGATGTGCTCACGGTCAATGGCATCCCGCTGGGCTGGGCGGTCGATTGGGGCCTGACGGACTGGAACGTCCCCGCCGGTGCGTTCGCGCAGCAGGGGTCGTGGATCGACGCACTGACCGCCATTGCCGGTGCTGCAGGTGGCTACTTGATTCCGCATCCCTCGGCCCAGAGCATCCGCGTGCGTCACCGCTACCCGGTCGCGCCTTGGGAATGGAGCACGGTCACGCCCGACTTCGTGTTGCCCGTCGATGCTGTCGCCCGCGAGTCGCTGCGCTGGTTGGAAAAGCCTGCGTACAACCGCGTGTTCGTTTCCGGGCAGGACGTAGGCGTGCTCGGGCAGGTGACCCGGGCCGGGACTGCCGGAGAAGTGCTGGCACCGATGGTCGTCGACCCGCTGATCACCGAGGCGGCCGCCGCGCGGCAGCGTGGCGTAGCCGTGCTCGCCGACACGGGTCACCAGCTCGAGGTCAGCCTGCGCCTGCCGGTGCTCGCCGAGACCGGGATCATCGAGCCCGGTGCGTTCGTGGAGTACCAGGACGGCAGCGTCACGCGATTGGGCATTGTCCGCGCGACCCAAGTGGAAGCCGGGTTGCCCGAGGTCTGGCAGACGCTGGGAGTGCAGGCCTATGCGTAACCTCTACGAGCAGTTTCGCCAACTGATCCCCGACCCGCCGCTGCAGGCGGGCACAGTGAGCGACGTCGGCTCTGGCGTGGTCACGGTCGCATTGCCCGGTGGCGGCCGAATCAAGGCGAGGGGCTCTGCGGCCCTTGGCCAGAAGGTGTTCGTGCGCGACGACGCCATCGAAGGCATTGCGCCCAGCCTGACGCTGGAAATCATCGAGATCTGAAACCCAACTGATTCAACCCTGAGACCCGCCCTGATGCTCACGCATCGGGCGGGTTTCGTATTTCTGGAGAAAGCAAATGACCGAACCTGAACAACAACCGGCGCTCGTCGAGAACATGCTCCTGCTGCGCAAGGAGGATTTCGACGATCTGCTCGACCGTGCCGCCGAACGTGGTGCTGAACGCTGCCTCGCCCATCTTGGACTGGAGAACGGCCACGCTGCCCGCGACATCCGGGAGCTGCGCGATCTGCTGGAAGCTTGGCGCGACGCCCGCCGCACGGCTTGGCAGACGACCATCAAGGTGGCCACCACAGGCATCCTGGCCGCGTTGCTGGTCGGTGCCGCCATCAAGCTCAAGCTGATGGGAGGCCCCCAATGATCGAAACCCTCCTCGGTGGCCTCCTGGGCGGGGTCTTCCGTCTTGCGCCCGAAATCCTCAAGTGGCTGGACCGCAAGGGCGAACGCGGCCACGAGCTGGCCATGCAGGACAAGGCGCTGGAGTTCGAGAAAATTCGCGGCGCGCAGCGGATGGCCGAGATCGGTGCGAGCGCCGAAGCCGCCTGGAACGTCGGTGCCGTCGATGCGCTGCGTGAGGCCGTCCGCACCCAAGGTGAGAAGACCGGTGTGCGCTGGGCCGATGCGTTGTCTATCAGCGTGCGACCGGTAATCACCTACTGGTTCATGGCGCTGTACTGTGCGGCCAAGACGGCTGCTTTCGCGGCCGCCGTCACCGCTGGCTCTGGCTGGGGCACGGCCATCCTGCACGCATGGACGGAAGCCGATCAGGCGCTGTGGGCCGGGGTTCTGAACTTCTGGTTCCTCGGGCGCGTGTTCGACCGGGTGCGCTCGTGACCGGGGTGCCGAAAACGGCCATCGAGCTGGCCAAGCGCTTTGAGGGGTTCCACCGGGTGCCGAGGATCGATCCGGGCCGCGCGCATCCGTACATCTGCCCAGCGGGCTACTGGACGATTGGCTACGGCCATCTGTGCGAGTCGACGCACCCGCCGATCACGGAGTCCGAGGCCGAGGTCTATCTTACGCACGACCTGCAAACGGCGCTCGCCGCAACGCTGCGCTACTGCCCGGTGCTCGCAACCGAACCCGAAGGGCGACTTTCGGCCATTGTGGATTTCACCTTCAACCTTGGCGCGGGGCGGCTGCAGACATCGACGCTTCGGCGACGGATCAACCAGCGGGATTGGGCGGTATCAGCTCAGGAGCTCTGCCGATGGATCTATGGTGGCGGAAAGGTCTTGCCAGGTTTGGTGGCAAGGCGAAAGGTCGAGGCGGCGCTGATGTCTGGCCTTCACTACTGAAGCGCTGGCTCACACAAGCGCAGCAATTGCGATCGAATCTCGCCGGGCGATGCTGTCAGATCCACCGTCGCAAATCGGATGCCATGCCCCTGGATGACAACCGTCTCATCAACGGCGTCGCCGACGGAAGGGTGAAGCAACAGCCCACTGGCGTCATCCGCAAGCGGATCGCCGCGCCCGACCTGGGATCGCAGGTAGGCGTAGATCTGGTACACGTACCCGCTGCGCAGCGTCTCCTCTCGATACCATCCGCTCGTCACGATCGACGTGAACTTGGTGTCGATGACGATTCGCCGGCCAGAAGGCGCATGGTCGAGTACCACGTCGGTTCGCATCGTCGGCAAGATCTTGTCGATTCCCGATGTCTTCTGTTCGATTTGCCAGCCCAGCGTACCGCCACAGCGAACCCGCCATCCCTGCGGTTGCAGTACCACGTCATAGAAGCCGCCCACGGCCTTCTCGAACAGTCGCCGGACCCAGGTGACCTCGCGTTCCGGCAGAGTCAACACGTTCGTCCCTGCCACCTCAGTTGGCAAGGCAAGATCAAAAGCCAGCTTCGCCGCAGCCACCATGAATCTGTCATCAGCGTCGTTGCGGCCAAAGCGATCTGTGCTCATCTGCGCGCGAGTGGGTACATCGCCGGAAACACCCATGGCCTTCATGCCGCTGGCAAGTGAGCGGCAGCGATGAGACACGTCCTTCCTTTGCACAATTCGGGCGATGGTCTCTAGTGCCGCGCGCACAAAGCGATTGCGTGGCGTGTCGACGGTCAGCTCATCAAATCGACAGGCCACTAAGCCACGATCCAGCAATCGATGCCGCTCCGTATTCAGGACATCAATTCGCCCGCGCACGCGATTTAGGACAGCATCCCGAGATCGGTAGCCCAGGTTGAGGCGGCGGCGCTGCCTGACTTCGACTGCGTGGGCCAGAATCTCGGCGACCAGATCGGGGAGATCGTCAGGGTTGTCCTCCAGGCCAACCTTGCCGATGCCGCGAGTGCGGAACAATTCCGACGCGTACAGCATCAGCAGCCACAGATTGCGCACCGGAATACGTCCGATGTAGCCATTTGCGTCGACCGACGCGCTCTCGACCTGCTCCGCGACCACACCCATTACCAGCCCTGAAGCAGCCGCGCACACGCCTTCTGCGCTTCGTCAGGGGCGTCAAACCAATACTCATCGAGCAGTGGCCCGATCTCAGTCTCGACCACCTGCTGGAACCACTTTTTCGTGTCCCCGGCCTCCAGCCTATGGGCGGGCGTCACGTAGCTATGGCCGATCCGGAACTGCTTTCCAAGGCGAGCGTCAGCCGCTATCTGGTCATTTAGCTCTGCAATCCGGTGCTCAATATCCGCGACCAGAGCAGGATCGACAGCACACTCCTTGACAGCCCAATCCCGCCATGCGGTGCCAAGCCTTGGCTCGAGCCCCACGAAGGCGAAACGCCGACGCAACGCGAGATCGACCAGGGCCAACGACCTGTCGGCGATATTCATCGTGCCGACGACGTAGAGGTTTTCCGGAATGTGGACGGGGCGACGTTTGCCATCCGCGTCCGGGTAGCACAGTTCCAGTGCCTCATTGGGCGTGCGCTTTCCAGCTTCAAGCAGCGTCAGGAGTTCGCCGAAGATCTGCGCCGGGTTGCCACGGTTGATCTCCTCGATCACCACGACGAACTTCGACGATGGGTCCTTCGACGCTGCCTTGATCGCTTCCATGAATACGCCGTCGGCTAGCGACAATTTCCCCTCGCCGGTCGGCCGCCATCCTCTGACGAAATCCTCGTAGGACAGGTTGGGGTGAAACTGCACCGCACGGACTTTGCTTTCGTCCTTTTGACCCATCAGCGCAAACGCCAGGCGCTTCGCAAGCCACGTCTTGCCGGTGCCTGGTGGCCCCTGAAGGATCAGGTTCTTCTTGGTGCGAAGGCGGTCCAGAAGCCGATCGATCTCGTTGCGTTCAAGGAAGCATCCGTCCTTGAGAATGTCGTCGACCGAGTACGGCACGATCGGCACGGCAACATGAACGTCCTCTGGTGCAGTTGCCTCGGTAGCATCGTCACCATCATCGACGTCACCAGCGTCGTCTTCGCCGACCGGAGACTTCTCATCGGTGGGGTCCTTGTACAGCCATGCTTCCAGCGAAAGCTCAGGGTAGGAATGGACCGGGTAAGCGGTCTCCTGGAAGCGCGGCTCCAATACATCCATCACGGCAAGGTAGTCTGCCGAATTGCAGCGCCTCTTCGGGCCGTGCATGCCAATCGGCACACCGAGCTTCTTGCTGACATAGAGCTGAGAGTTGTGATCAAGGCTCAGGAACGCCCAAGGCCTGATCCAATACAGGCCGAACGTCAGATTCCATGCAACGCCTCGCCGACCGTTCGCGCTGTCGAAAGCCTTGGCGAACTCCTCGCGGGCAAGGTCATCGTCGGTATCTGCGTACGCGATACCCGCTGCGAAGACCCCCCAAAGCGCGTCAATGTGGTCGGTGGCGCGATTGATCTCGAATGGAAAGTACCAGGACTTCAAGTTGTTCAGCAGCGGGATGCCTTCGAACGTCTCTGGAACCGGCTCGTCGACGCCCAGGAATTTTGCCAACTCGGTCGCGATGATCTTGCGGTTGGAGTCCTTGATGCCCCGATTGAACAGACCCATCGTCGTGAACGGGCAGATGTCCTTTACGAAGCCCGTAGTGCCGTCTGCATACTTATCCTCTGCCAGATGGCCGAGTCCGTCGACGCGAACGGAGATCTCCCGGATTCCCTCCACAAGGGCTGCCCTGTTGGCGCGATAGGTGAGCAGCTTGTCCGCGATTGCTTCATAGAACTTTGTCCAGCCGAAGCGATGTTTGTCTGCTGCCACCGTCCCGAATCGCTCCCGCCAGTAGGGAGCGTTTCGGAAACGCTCAACGTCCTGCGGCTTGTTGTGGAAGGCGAAGCTGATCAAACCATCGGTCATCCATTCACCAGGCAAAACGCGCCACACCGTTCCCCGGTGGGTATAGAAATACCACTCACGCACCGGTTCGATTTTTGTCCAGTCAACCTTAACTCGCTGACCATCATTCAGGTTCTCAGTGATCGTACCAATTGCCTTGATCGCCATCACCGAAACCGCTTGTCCACGGCTGTCAAAGGGTAGCCCATGCTTGCGCGTGTAGGACGACTTGATGGCGATCTGATCTCCTGGTCGCATCGATCGCACCACGTCGAGATGCTTGTCCTCGTAGCCGTTCTCCCAAATCCCCTCGGACAGAAAGCGTGGCAACTGATCGTCCGTGCCCCCGTAGCTTGCGCCAACAAACCAGCTCGCCTGTGCGCTCGAGTTCTCTGTTTTTATGTTCATGGATGTCCTCAGATAGTGCTGCGCAAAGCGACGAACTGGTTACTCGGCCCCTCCGACCGTAGGTCGAGGTATGGGGAGTTCGAGACGACGTAGGCCAAAGGTCTCGTTAAACGGAATGGAAACAGGACAGGCAGTGATTGCAGGCTATGAACTGAAACAGGCTTGCCGACGTAGCGCACAGCCGCCTCGATCAGCCATACCGTGAGTTCGTCGCTGTCGATAGGCGTCGCTGCAAGACGGATGACGCGCTTGCCTTTCTCGACACGCTCTACCGCGCCCCAGCTTGCTTGTGTCTGGATGACCATATTGGTCATGCGTCGCGTGCCTTCGCGTTCGCCGTAGATCTCGCTCATACGGCGATGCACTTCGGCAGCTGCGCAGTCGTCTTGGATGGCCGACAAGCGACCCACCAACTCAGACACTTTCCCGAAAAACGGATAGCAGGCTATGGCCACGCCCCAGCACAGTACCGGAATTGGTGTGTCGGGCTGGCTTTTGTAAAGAGCCGCCGCCCGGTCGGCGAAGTCCACCAGCTCGGCGCGTGGCTCCAACCACAGCCGGTTCAAGACGGTTCGTGTTTTTTTCTTGGCTTCCACGCCAAGTTCGGCTGCGTCGAGCAGTGCATTCAGATCATCTAGCCCAGCTGTACCCGCACGAACCCGCAGTGCCGCAGCCGCCCAATCAAGCTGAATGAACCGATCGAACCCAATCTGAGGGGCTGATGTATTCATTCGTTTCCAATCACATATTTAACTTTCACAAACGGCACGATCAGCTCTTCTACCGAGATGCCGCCGTGGACAACCACTTGCTCGCCATCGGGAACAAAAGCCGTTCGGCCACCCGCGAATAAGGGCATATAGCCCGCCGGTAGTCCAGCGATGTCCAAGTGCACTGAGTTGGCATTTGCCGCTGCGGATTTAGCCAACAACGATTCACTGCGATACACGCGCACCCGTTCGCCACGGGCTTCTGGAACATCCCCTTCAGATGGACGTCCCACGCCAACAGCTTCAACGTTGCCGTGATCCGCTGTCAGGTAAATATGAAAGCCTCTGTCGAGCAGCATCGCAAACAAGCGATCGACGAAGCCTGTTTTCAACCAGTTCGCGATCCACAAGGCCACATCTTGTTTGGAGCGTTCCTTGTGCAGTCGGTCATCCACCTCGTCGACCACCAACCCGACCACCTTGGGCCGACGATCGTCCAGCGCTGCTTGCAAGGCATCCAGCTGCTCAATCTGCCGCAACGAGCGCTGGTAGTAGATTTCGCCCGGCTTGACGCCTTGCTCCTGCCAGTAGGCCTTCCACAGGTACTCTTCTTTGTTGGTATGGCCGATCGACTCTTCGAATTCCCGCGGTTTACGGCCAGAGAACAGGGCCTGCCGCGACACCGAGGTCACGGTGGGGAGCCAAGCGAAGGAAGTACCTTCATCAAACGCAAAGCGCTTCGTCGCCTCGACTAACCGCTCCCGAATCTGTACCCACTGGTCCAAGGCTAAGCCATCAAAAACCAGGAGTGCGATCTTGTCGGCCCCGAGCGCTTTCCGGCGAGAGCTCAGATAGTCGGGAATCCGATGCACCATCACCGGCCCGTTGTGGAACGAGAGCGTGCTCAGATCCGCATAGTGCTTGGCAGCCACCCACGCATGTAGCTGCGCATCCGACTGTTCTTGCAGTGTTTTCACGAGAGCCTGCACCTCTGCCAAACCGTCTGTGCCATAGGCATTGCCCAGATCGTGCACGCGGGCGAGAGTTTCGCCGTACTGCTTGGCAAACTCGCTCCAGGCCTTGTGCGTAGCCGTCGCGCTTGGAATCGCGTCAATCAGCTTGGCAATTCCCTTTTTCACAAAAGCGTTGCGCGCTTGCGGGTCTTGCACAATCCCGGCCTTGATCCAGCCTGGCGCGTCAGCTGGCACAACCTCAACGACCAGCGGGTGCAAGGAGCCATTCAAGAACATCGAATCAACGATGGATTGCACATCCGAGTGCGCAAACGGAATATCGACTTTGGCCACGTAATCGGGGGGTGGTGGCTCGCCAATGCGTGAGCCTTCCATGCCCAAGTTCGCCAGGTAGCGATGCCAAGCCTCCTGCACCACGCGCAGCAACGCACTCTTCGACGACAGCCACGCGGCCACCGGAAGGCCTGCAAGCAGTCCCTTGCTCTGGATGATGCTGGCTGCGTGCTCGGCAAAGACCAGGGGCAGCGCACGGTTAGCAAAGTGCATCCGCAGCACATCACGCCAGAAATCACTTTCCGTGCGAATGCTGGAGCGCGGTGCCAGGTGGTAGACCCGCTCCAGTATGAAATCTTTGGACTCGTTCTCGCCGCGAATACCCTGCAGCTCGGTGTCGTGCGCCTCCAGCAAGGCGGCAAAGTGCTCGGGTTCAAGCTGCTTGACCACGTTGTAAGCCAAACGCGGAAACAACTGTGCCAAGCCCAGACTTACGACACGACCATAGTGCCCAAGGTCCCAAGGCAGCTCATTGGGATCTGCACCGCGCCAATGCACAACCACAGCGGGCGTGGGGCCGGGCTCGCCTCGGTCCCAGGCGGCGCGATAGCGCTCCTCAAACTCTGTACGGAACACAAAGGGGTCTTCGTACAACAGCACCTCGAAGCCACGGCTGCGCAGTTCGGCCAGCAAACGCTCGTCCAGCAAGACGTCGTCCGGATCGCAGGCCACCCACAGCCGGTCGAGATCAGCTGTGAAGCGACTGAGAATTCGTTCAATCCACTGGCTCATGTGCTTTGTGTCTCCCTCACGTTCTCGCCAGGCTGAACCTCGGCGGAGACGCGTACCATCATTACCGCGTTCAGATCCGGCACACTGGCCGCAGCCTCGGCCAAGGCGGCCAGTCTGGCGTCGTGTTCTTGTTGCAGACGCTTGCGACGATGCTCGCGGACAGCAGGCAGCCCGATGCGGCCGATAGCCTGTTGCCTTGCTTCAAAGGCGTAGTCAGCACGCTCCCGCTCCTCTTGCAGTCGGGTCCGATGCGCCTCCAGCATCTCGGTAAAGATCCGCTCGCCTTGGGCTTTTGCCGCCGTGAGTGCTGCGTCAAACCATTTCACAGCCTCTTCGGTGCCGCTGACCCCGTGTACGACCACTGTTTCGGTCAGCAGCAAATCCCACACGCGTTTGGCCGTCGGCACAAAAGCGCGGCCCTCTTCGTTGGTAAAAACGGGCAGGTAACGCTTGCGGTTCAAGCCTTCCGCCGCAAGGCTGATCTCCCAGAGGGACCACACTCCGCTCACAGAATCAGGCAGGCCCGTCACCCGTATCACTGGCAAGGGTTGGCCTGCCACAAAGCGGGGCAACTCGCTGATCACTGCGCGCGCGCGCGGGTCTTCCAGCGTCACCCACTCAATGTCATGGTTCTCGTCAGCGGTACGCGCGTCAAAGCAAACCTGCGCTGCTTCGCTGCCATCCGCCCAAGTCACACGCCACGCCTTGCCCACCTTGGTTGCTGCACCGCCGCGTGCGGCCAAGCCAGAGGTGATGGCTCGCTCCAGCCAGAACTGTGCCGGGTGGTCGCGCCATTTGCGGGCATCGTCCGCTTCCAGCTCGTGCGCGTCCGAGAGCAATTCGCTGCTCTTTTTTGACTCGGCCAAGGTCTCGCGCAACTGCGAAACCACCGCATCGCATTCCTGCTCTATCGAAGCCGGGTTCTGCAAACCATGCACGAACAGCTCTTCGAACAAAGGCTCTGCTTCCGCCGAGTCCATCACGTCGGACGCCTTGTCCACGCCGAACTGTTGGGCGATCACTTCCAGCTTTTCTTCCAGCACCTGGCGAACCCGGTGCTCCACGGTGTCTTCCAGCACAAAGTTGATGGCGCGCACCACGTGCCGCTGGCCAATACGGTCGACGCGGCCGATGCGCTGTTCAATGCGCATCGGGTTCCAAGGCATGTCGAAATTGACGATGACGTGGCAGAACTGCAGGTTCAAACCTTCGCCGCCAGCGTCTGTCGAGATCAGTACGCGCACGTCTTGAGAGAACGCCTTTTGCGCCTTGCTGCGTGCATCCAGATCCATGCCGCCGTTAAGGGTGGCCACCGAAAAGCCACGGCTTTCCAGGTAATTGGCCAGCATGGCTTGGGTCGGCACAAACTCGGTGAAGAGCAGCACCTTCAGGGCAGGGTCGTTTTCTTCCTGCTGCAGTTTGTAGATCAGCTCCAGCAAGGCTTCTGCCTTGGCATCGGTGCCCGAGGCTTCGGTCTCGCGGGCCAGAGCGAGCAGTATTTCCACTTCGGACTTTTCCAGCTCCCAGCCGGTGGCCTGCATGGCCAAATCCACCTGCGACTGGCCGTCCAGGTCAGCCCAGTCTTCCTCGCTGGTGTTCTCAAACAACGAGGGTTGTGGCTGGGGCTCCTCCAGCAATGCCAGCCGCTTTTCCAGCGTCGTGCGAATAGCGGCAGTGCTGGACGTCACTAAACGCTGCATCAGAATCATCAAAAACCCGATATGACGCTGCTTGGCGGCCATTGCCTGGTTGTAGCCATGGCGCACGTAGTCGGTCACGGCCTCATACAAGCGCCGCTGCGCGTTGTGGCGCGCCTGCCAGGCCACGGCCTGCAGCCGGGTGACTCGCGGCTTGAAGAGTGGCTGACCATCGGCGTTGATCGACAGCCGCTTTTCTGTACGAATCACAAACGGCCGCACGCGATCGCGGTTCACGCTGCTTTCGTCAGGGAAGGCGTCGCGGTCCAGCAACTGCATCAAGCGCAGGAACTGGTCGGTCTTTCCCTGGTGCGGCGTGGCTGACAGCAACAGGAGGTAGGGCGATGCTTCTGCCAATGCTGCACCCAGCTTGTAGCGCGCCACCTGTTCGGTGCTGCCGCCCATACGGTGGGCTTCATCGATGATGACCAAATCCCAAGAGGCCGAGATCAGGTCTTCAAAGCGTTCGCGGTTGTAGTTGTTGAGCTGCTCCAGACTCCAGCCGCGCCGACTCTCCATCGGTTTGACCGAATCCAGTGAGCAGATCACCTGGTCATGCATACGCCACAGGTTGTCTTCATCCCCCTGGTTGCCACTGCGCCATTGGCGAAATGCAGCCAACTCAGAGGGCTCGATGAACTGCAGATGCTCACCGAAATGCAAACGCATTTCCGCCTGCCACTGGCGCACCAACCCCTTAGGCGCGACCACCAGCACTCTTTTCACCCGGCCGCGTAGCTTCAATTCCCGCAACACCAGCCCGGCTTCGATGGTCTTGCCCAAGCCCACCTCATCTGCCAGCAGGTAACGAATACGGTCGCGGCTGATGGCGCGATTCAGTGCGTACAACTGATGCGGCAGTGGCACCACGCTGGACTGGATGGGCGCGAGCAACAAGTTGTCTTCCAGCGCATCCAGCAGCTTGGCCGCCGCCGTGGTGTGCAGGATTTCCTCCACCGTAGGGCGAACGCTGTCCAACGGCGCAAGATCTGAGGCACGTGCCCGCACCACTGCGTCTTTGGCTGGCAGCCAGACGCGGTAAGCACTCTCACCCCATACCTCTTGCCGATCGATGACGCGACAAGACGCAGCCTGTCGCGTCAGCCAGCACCAATCGCCAACGTTGAAGCCGCCGCCCGCCAC
Protein sequences of DBSCAN-SWA_5 >NZ_AP021884|1977054:2023954|1992411_1992903_+|WP_147073283.1|DBSCAN-SWA MSLATRIESLVIRVAQEFNDVRATAGSLASLSTNDKSSLVAAINELKAAVLSAMAIDDNQIATTSTYSSNKIVSLLDALKTDILGGADAAYDTLVEIQQALQSGTSGLDAILAAVNLRVRFDAAQTLTVAEQLQARTNIGAVAVSDVGNTDTDFVVIFDGALA >NZ_AP021884|1977054:2023954|2015009_2016134_-|WP_147073241.1|DBSCAN-SWA MGVVAEQVESASVDANGYIGRIPVRNLWLLMLYASELFRTRGIGKVGLEDNPDDLPDLVAEILAHAVEVRQRRRLNLGYRSRDAVLNRVRGRIDVLNTERHRLLDRGLVACRFDELTVDTPRNRFVRAALETIARIVQRKDVSHRCRSLASGMKAMGVSGDVPTRAQMSTDRFGRNDADDRFMVAAAKLAFDLALPTEVAGTNVLTLPEREVTWVRRLFEKAVGGFYDVVLQPQGWRVRCGGTLGWQIEQKTSGIDKILPTMRTDVVLDHAPSGRRIVIDTKFTSIVTSGWYREETLRSGYVYQIYAYLRSQVGRGDPLADDASGLLLHPSVGDAVDETVVIQGHGIRFATVDLTASPGEIRSQLLRLCEPALQ >NZ_AP021884|1977054:2023954|1984720_1985137_+|WP_147073297.1|DBSCAN-SWA MAEWTTDDVAARFEEAATTGRRLPPVRVQGYFNCWPAFVRKEWEAFAADEKVYRPFPPSPEAIDRMLETMRWVQWLEVEQRHLVWMRAKRYGWRDITIRFACDRTTAWRRWQRAMEIVATNLNSEGVRLPSKNVGNLG >NZ_AP021884|1977054:2023954|1996685_1997705_+|WP_058719286.1|capsid|DBSCAN-SWA MQNPFISPAFSMASMTAAINLIPNRYGRLEELNLFPPKPVRTRQVIVEERAGVLNLLPTQPPGSPGTVNVRGKRTVRSFVVPHIPHDDVVLPEEVQGLRAFGSETEMESIAGVLAQHLETMRNKHAITLEHLRMGALKGEILDADGSRIYNLFDEFGIDQQSVDFEISSPTTGTDVKGKCTDVLGIIEEALLGEFMTGVHCLCSPEFFKALTGHKDVKTAFTNWQQGAVLINDVRRGFTFGGITFEEYRGKATDVNKTVRRFIAAGEAHAFPLGTIDTFGTYFAPADFNETVNTMGQPLYAKQEPRKFDRGTDLHTQANPLPMCHRPGVLVRLVMGGGV >NZ_AP021884|1977054:2023954|2013776_2014079_+|WP_147073246.1|DBSCAN-SWA MTEPEQQPALVENMLLLRKEDFDDLLDRAAERGAERCLAHLGLENGHAARDIRELRDLLEAWRDARRTAWQTTIKVATTGILAALLVGAAIKLKLMGGPQ >NZ_AP021884|1977054:2023954|1984052_1984517_+|WP_147073301.1|DBSCAN-SWA MKTTILALDLGTHTGWALQHLDGTITSGTEHFKPQRFEGGGMRFLRFKRWLNELLSVSNHINAVFFEEVRRHAGVDAAHAYGGFMGHLTAWCEHHNIPYQGVPVGTIKKHATGKGNASKDEMITSVRERGHTPVDDNEADALALLHWAVETQEV >NZ_AP021884|1977054:2023954|1986858_1988130_+|WP_147073295.1|DBSCAN-SWA MTASWFADKIEKWPTAKLLPYARNARTHSDDQVAQIAASIAEFGFTNPILAGSDGVIVAGHGRLAAAQKLGLAVVPVVVLDHLSPTQRRALVIADNRIAENAGWDDAMLRIEIASLQDDDFDVSLTGFDADALAELMAGDEPDGEGETDDDAVPELSETPISRPGDVWSLGGHRLLCGDSTVTESYDRLLDGEQVDMVFTDPPYNVNYANSAKDKMRGKDRAILNDNLGDGFYDFLLAALTPTIAHCRGGIYVAMSSSELDVLQAAFRAAGGKWSTFIIWAKNTFTLGRADYQRQYEPILYGWPEGAQRHWCGDRDQGDVWNIKKPQKNDLHPTMKPVELVERAIRNSSRPGNVVLDPFGGSGTTLIAAEKSGRLARLIELDPKYADVIVRRWQEWTGKQATRESDGALFDDQAAIDSSAISQ >NZ_AP021884|1977054:2023954|1996290_1996668_+|WP_147073275.1|head|DBSCAN-SWA MPAMQEPINLGDLLKYEAPNLYSRDRVTVAAGQTLPLGTVLGQITATGKVKQIDPSATDGSQYSAGVLMQDADAALADRNDGLMVARHAIVSDHALHWPTGITTAEQQAAIQQLKALGVLVRIGA >NZ_AP021884|1977054:2023954|2018373_2019096_-|WP_147073237.1|DBSCAN-SWA MNTSAPQIGFDRFIQLDWAAAALRVRAGTAGLDDLNALLDAAELGVEAKKKTRTVLNRLWLEPRAELVDFADRAAALYKSQPDTPIPVLCWGVAIACYPFFGKVSELVGRLSAIQDDCAAAEVHRRMSEIYGEREGTRRMTNMVIQTQASWGAVERVEKGKRVIRLAATPIDSDELTVWLIEAAVRYVGKPVSVHSLQSLPVLFPFRLTRPLAYVVSNSPYLDLRSEGPSNQFVALRSTI >NZ_AP021884|1977054:2023954|2005133_2008697_+|WP_147073258.1|DBSCAN-SWA MPIQSGDVKLLKSAVMADVPEGGGAPTGNTIADGVSNAIFPDISELDRAGGRVNLRKSFVSVQTDDTDTYFGANVIVAEPPQDARVSVTLFSTEKTFDTREQAQVRIEAYLNKGPEWAGYLFENHIAGQRVIQLFQRTTDTVPNVGQTLVLIENEGLGTQKEQYIRATSVSVVERTFTYDGDKDYKASIVTVDISDALRYDFTGSPASRTFTRAANSTKTRDTVVADAGTYVGVVPLTQAAAVGDFTIKGTSIYTQLVPSAQTETPISFVPPYAAAGLPVPGAVAVSYTASHAWTTSIKFNLPGGCLPGSLTIGTDGITIFDDAGLLKTASGTVGTIDYANGILTLNSGTMSNAKAITYTPAAQILRAPQSSEIPVTPESRSQSYVGTVNPVPQPGTLSISYMAQGRWYVLSDSGNGSLKGLDASYGAGTFNRNTGAFVVTLGALPDVGSSLVLTWNVPTQETQQPSTTLKATQSLALNPPAGTAVQPGSLTVSWEYGGTKTSTAATSGVLSGAATGSLSVAQNRVDFAPNVLPAVGTQLTVSYVAGPKQEDSFAHPSRNGAGTLPVTATLGAIEPGSLEVEWNTFTDEAVLGAYTFAQLQEMGIAVSIWRDPTQIARDDGNGGVVLNGISIGTVNYATGQVTFNPDVSIRIPRPVYTAVAINGTGRWRLNYGGIAYVDAPSLYPNDESGYVKLRYNSAGSTSNQTETFQFLPAFKLVPGVNAQVVTGTVLLSISGAQPWGDNGQGTLREFTTSGWVTRGTINYLSGDVALTSWTAGTNNAITRASCVTTVGENISSEFVFRTGAAPLRPGSLSIQYARAVGGTQNVTAGIDGKIEATGISGSVDYETGLVRVRFGTMVTAAGNESQPWYAADRVGTDGKIFRPEPVAASSVRYSAVAYSYLPLDADLLGIDPVRLPSDGRVPIFRPGGFAVVGHTGKITSSVSNGQTINCARVRLSRVRVVGHDGAVIHTGYSTDLEAGTVTFINVSGYSQPVTIEHRIEDMAVVRDVQISGEISFTRALTHEYPLGSHVSSALVAGDLFARVNLVFDQSTWNGAWSDALSGSSATATFNNTQYPIRVTNRGALTERWIVRLTNSTSFEVIGENVGVIATGNTSADCAPNNPATGVPYFHLPALGWGNGWATGNVLRFNTIGAQFPVWVVRTVQQGPESVPDDNFTLLIRGDVDTP >NZ_AP021884|1977054:2023954|2014075_2014552_+|WP_147073244.1|DBSCAN-SWA MIETLLGGLLGGVFRLAPEILKWLDRKGERGHELAMQDKALEFEKIRGAQRMAEIGASAEAAWNVGAVDALREAVRTQGEKTGVRWADALSISVRPVITYWFMALYCAAKTAAFAAAVTAGSGWGTAILHAWTEADQALWAGVLNFWFLGRVFDRVRS >NZ_AP021884|1977054:2023954|1993294_1993516_+|WP_146463160.1|DBSCAN-SWA MAYTEAQLQALETALAKGEHRVSFGDKTVEYRSVDELKAAIREVKRGILEQAAATGLWPGAPRQIRVTTSKGF >NZ_AP021884|1977054:2023954|1998460_1998655_+|WP_147073270.1|DBSCAN-SWA MTQLVLTRPHTHAGKTYGVGDRIEIDATSADWLIAHDIATPEPTAPTAEPVPEPKPLQRKEPKQ >NZ_AP021884|1977054:2023954|1999415_1999817_+|WP_147073266.1|DBSCAN-SWA MSDLDTLIPQAVELVIDGEPLAIKPLKVGQMPGFLRAMSPVMQQLTASNIDWLALFGERGDDLLSAIAIAVGKPRAWVDELAADEAILLAAKVIEVNADFFTQTVIPKLDGLFGQVKLPPIVKAAAGSMPSST >NZ_AP021884|1977054:2023954|1980647_1981385_+|WP_147073307.1|DBSCAN-SWA MMDFNSTSSISGQITALVDAGMQRARAQQSERQYLGASRLGAACERALQFEYAKAPVDHGRDTPGRMLRIFERGHVMEDCMVAWLRDAGFELRTRRADGEQFGFSVADGRLQGHIDGVIVDGPEGFAYPALWENKCLGMKSWRELEKNRLAVAKPVYAAQVAIYQAYLELHEHPAIFTALNADTMEIYTEAVPFDAALAQRMSDRAVKVITATESADLLPRAFNDPTHFECRMCAWQDRCWRTQA >NZ_AP021884|1977054:2023954|2019092_2021114_-|WP_147073235.1|DBSCAN-SWA MSQWIERILSRFTADLDRLWVACDPDDVLLDERLLAELRSRGFEVLLYEDPFVFRTEFEERYRAAWDRGEPGPTPAVVVHWRGADPNELPWDLGHYGRVVSLGLAQLFPRLAYNVVKQLEPEHFAALLEAHDTELQGIRGENESKDFILERVYHLAPRSSIRTESDFWRDVLRMHFANRALPLVFAEHAASIIQSKGLLAGLPVAAWLSSKSALLRVVQEAWHRYLANLGMEGSRIGEPPPPDYVAKVDIPFAHSDVQSIVDSMFLNGSLHPLVVEVVPADAPGWIKAGIVQDPQARNAFVKKGIAKLIDAIPSATATHKAWSEFAKQYGETLARVHDLGNAYGTDGLAEVQALVKTLQEQSDAQLHAWVAAKHYADLSTLSFHNGPVMVHRIPDYLSSRRKALGADKIALLVFDGLALDQWVQIRERLVEATKRFAFDEGTSFAWLPTVTSVSRQALFSGRKPREFEESIGHTNKEEYLWKAYWQEQGVKPGEIYYQRSLRQIEQLDALQAALDDRRPKVVGLVVDEVDDRLHKERSKQDVALWIANWLKTGFVDRLFAMLLDRGFHIYLTADHGNVEAVGVGRPSEGDVPEARGERVRVYRSESLLAKSAAANANSVHLDIAGLPAGYMPLFAGGRTAFVPDGEQVVVHGGISVEELIVPFVKVKYVIGNE >NZ_AP021884|1977054:2023954|1998651_1999404_+|WP_147073268.1|DBSCAN-SWA MSTYASFQGRVFLGKRDTDGLPIEVRSPGNVAELKLSLKTDVLEHYESQTGQRSLDHRMVKQKSATVNLTIEEFTKENLALALYGNHVVGTPGTVTAEPVGGATPIAGDRYFLAHPKVSSLVVTDSAGTPATLALGTNYTADPDFGALQFLDTTGFTAPFKASYAYGVATEIGIFTQALPERFLRLEGINTAQGNAKVLVELYRVAFDPLKEISFISDEYNKFELEGSLLADTTKPFDAVLGQFGRIVQL >NZ_AP021884|1977054:2023954|1980150_1980651_+|WP_147073309.1|DBSCAN-SWA MKCWVCKRQARGFGHTDNRHGIGDPRRYPIDWVFCSQRCQSAFHAMYGNWSRAKDGRSDIKGVAMIDPSDIELAAMRKCLKSFGEAASEIGFTKPLGNYSEAEALQVIDAIVTCYTEAMVEHHEASKYPPVRGMTPTPDPMTPSAANPFADLDDDLPWEEPKGKKP >NZ_AP021884|1977054:2023954|1984518_1984728_+|WP_147073299.1|DBSCAN-SWA MKVSTPQYRCPLGRLQPQTTDLDAIKERGWRDQHILVVNASDDRLDFIEREIVRRIGERLYGLGGTRHG >NZ_AP021884|1977054:2023954|2013478_2013700_+|WP_147073248.1|DBSCAN-SWA MRNLYEQFRQLIPDPPLQAGTVSDVGSGVVTVALPGGGRIKARGSAALGQKVFVRDDAIEGIAPSLTLEIIEI >NZ_AP021884|1977054:2023954|1988093_1988462_-|WP_147073293.1|DBSCAN-SWA MNTNQQMPATQNDAWGFWGTMNEHASTAWPLAMTAISDATGQPLESVRVFLDSRHGRHFADDLQNGLYRGQTLADAINAATQQWMGWTIGRQTSKQYGIPRGLPYLTGFVIHCEIAEESIAA >NZ_AP021884|1977054:2023954|1992902_1993295_+|WP_147073281.1|DBSCAN-SWA MSLASSIAALAARIGFEVKTKIDATHPGIARVWVSFGYVGGQVVIASAHNVASVVRTAAGRYRVHFAVAMPDANYCWTALARSSTNTGQQRLALVRASSDLKTAQYVDVSCATAASSFDDSSEINLVVYR >NZ_AP021884|1977054:2023954|2011071_2011467_+|WP_147073252.1|DBSCAN-SWA MTVAITVEHNEARLAGTLAFLDAGSNPARLRIYGGTRPANPATTPTSAMLVEIRLTKPAGTIAGGLLTLTQQEDGLITATGIATWARLVNGNEVTALDLDCSGTDGSGDVKLASTNLYLGGDARMVSAILG >NZ_AP021884|1977054:2023954|1981632_1983924_+|WP_147073303.1|DBSCAN-SWA MIDFNDTTQPAEHNRESERDEIRADLLARLESVLTTMFPAGKKRRGKFLIGDILGSPGDSLEVVLEGEKAGLWTDRATGDGGDIFALIAAYLGANVHTDFPRVLDEAADLLGRSRSVPVRKAKKEAPVDDLGPATAKWDYFDAGGKLIAVVYRYDPPGGKKEFRPWDAKRRKMAPPEPRPLFNQPGIGAASHVVLVEGEKCAQALIASGVVATTAMHGANAPVDKTDWSPLAGKTVLIWPDRDAPGWDYADRASQAILQAGATSVAILMPPDDKPEGWDAADAIPEGFDVGGFLAVGERMPVMRSVEEAPSPDLLTGIDWTTEDGLSSAFTRRYGEDWRYCALWGKWLVWTGVRWNPDQVLYVSHLSRGICRNASLKADTPRLKGKLASSATISSVEKIARSDPKHASTAEEWDADVWALNTPGGVVDLRTGRMRPHRRDDRMTKVTTATPQGNPDSACPTWRGFLTDVTGGDADLMAYLQLMVGYCLTGVTSEHALFFLYGTGANGKSVFVNVLTTILGDYAANAPMDTFMEARNDRHPTDLAGLRGARFVSSIETEQGRRWNESKVKAITGGDKVSARFMRQDFFEYLPQFKLVIAGNHKPSIRNVDEAMKRRLHLIPFTVTIPPERRDGRLTEKLLKERDGILAWAVEGCSRWQSQGLKPPASVVSATEEYFEAEDALGQWIEERCLLAKSHREGVSELFADWREWAERAGEYVGSVKRFSELMATRKFDKCRLTGGARAIAGIALRPKPYSHAYPYRDD >NZ_AP021884|1977054:2023954|1989572_1989770_-|WP_147073287.1|DBSCAN-SWA MSKFEQLLTQIAQNKLGIETLETRKSDSLDFHDVAVWCLRDALEAAFNAGVEQGRKATKSDKANS >NZ_AP021884|1977054:2023954|2000022_2000667_+|WP_147073263.1|DBSCAN-SWA MRISVQIDSAAAQAQLRRWGGEFRDKVKKAVSRAIASEAVELKQDVRSHVASQMAVVKKSFLKGFTAKVLDKDLNRLPALYVGSRIPWSAMHETGGQIAGRMLIPLNGRVGRKRFKAQVAELMRGGNAYFIKNAKGNIVLMAENIKEHDRPLAGFKRRYRKAEGIKRLKRGADIPIAVLVPKVVLKKRLDVERLVASRIPRLAAAVENQISTVD >NZ_AP021884|1977054:2023954|1995036_1996272_+|WP_147073277.1|DBSCAN-SWA MTLLPHLAARLYGVPLAIHRPKLDVILAVLGPRIGLADLAAPSGFTPPARPASTQTTKVAVIPIHGTLVRRTVGLEAESGLTSYAGLTAQLDAALASPDVAAILLDVDSPGGESGGVFDLADRIRAAAKTKPVWAVANDMAFSAAYALASAASKVFVSRTGGVGSIGVIAMHVDQSEKDAQDGVRYTAVFAGDRKNDLNPHEPISSEAHAFLKGEVNRVYGLFVETVARNRGIEASAVRDTEAGLFFGQAAVAIGLADAIGTFDDALAQLCESVSPLPKLAASHSGLFSNPQMESSMNDRTDPAAPDRLAADPAGSPSQPAAATAMTVADAIEVAQTCTLAGRTDLIAGFLEAKAPPAKVRSQLLATQAEASPEIVSRIDPQSAMSASSTGHPASSHNPLIQAVKSRLGTK >NZ_AP021884|1977054:2023954|1977808_1978657_+|WP_024973178.1|DBSCAN-SWA MKRLPIVSAVERMAERKGVKLLMLGKSGIGKTSRLKDLDPATTLFLDIEAGDLAVADWPGDTIRPASWPESRDFFVFLAGPDKSLPPESAFSQAHYDHVIEKFGDATQLGRYQTFFLDSITQLSRQCFAWCKTQPGAVSDRSGKPDLRAAYGLLGQEMIGALTHLQHARGKNVVFVAILDERLDDFNRKVFVPQIEGSKTSLELPGIVDEVVTLAEIKAEDGSSYRAFITHTVNPYGFPAKDRSGRLDLLEPPHLGALIAKCAGAVPALASAANPAHIESQE >NZ_AP021884|1977054:2023954|1999813_2000017_+|WP_147073264.1|DBSCAN-SWA MIEHGHRLPDILDYTLAQVRGFVVATARTDAARDARLLSVIAIGTRSDARQLDQTLDRLTDKATDRA >NZ_AP021884|1977054:2023954|1978662_1979289_+|WP_024973179.1|DBSCAN-SWA MTAWNDFNDADSQQSGFDLIPKGTVVPVRMTIKPGGYDDPEQGWGGGYATESFETGSIYLAAEFVVTAGDHAKRKMWSNVGLLSKKGPTWGQMGRSFIRAALNSARNVHPQDNSPQAAAARRINGFAELDGLEFLARVDIEKDAKGQDRNVVKLAVEPDHPDYAKLKGVPPKGSPGGGNSGAPAQAAPAYSAPTPQRAPVTGKPSWAQ >NZ_AP021884|1977054:2023954|1997701_1998004_+|WP_147073273.1|DBSCAN-SWA MSLVAQIYESAANAGLLKECLWYPSNGAPSQLHQIGFAAPDESLLDGLALSTDYEMTYPVTAFGGLAVREVVEIGGTSFQVRDIRSLSDGSEIRAKLTRL >NZ_AP021884|1977054:2023954|1977054_1977318_+|WP_024973176.1|DBSCAN-SWA MQTQVPSIESGRNPRRMNPGGATCIALDENELAIRWGLSVKTLRRWRQEQLGPIYCKLGRRVTYLLHEIEAFERRVSRYSSFTRAYQ >NZ_AP021884|1977054:2023954|1989887_1990427_+|WP_147073285.1|DBSCAN-SWA MGISIRAYARHRGVTDTAVHKAIRAGRITPEADGTIDADRADREWARNSDVPKTGTRAKAAKVAVPEGGTGVGGDGPAALPAGGASLLQARTVNEVVKAQTNKVRLARLKGELVDRPQAIAHVFKLARSERDAWLNWPARISAQMAAKLNIDPHTMHVALEAAIREHLQELGELRPRVD >NZ_AP021884|1977054:2023954|1988560_1988920_-|WP_147073291.1|DBSCAN-SWA MSTMTITIERTPRTLQFGDTTFQVEELSVRLPFARKPADLDEVGGQGQTKVYVTETKELTVDEFDAFARSLLVSRDWLRGKGGGTGDGYLCVEVTAPGRPYLYVNPEGGDYARYVARLG >NZ_AP021884|1977054:2023954|1990429_1992394_+|WP_147073339.1|terminase|DBSCAN-SWA MNVEYEGAAEIERAWREGLTPDPLLSVSEWSDRHRMLSSKASAEPGRWRTSRTPYLKAIMDCLSPTSPVERVVFMKAAQLGATEMGSNWIGYVIHHAPGPMMAVWPTVDMAKRNSKQRIDPLIEESAALSELISPARSRDSGNTILAKEFRGGVLVMTGANSAVGLRSMPVRYLFLDEVDGYPLDVEGEGDAISLAEARTRTFARRKIFIVSTPTISGASAIEREYEASDQRRYFLPCPHCSHRQWLRFEQLRWEKGQPDTASYICESCDKSIAEHHKTWMLEHGEWRAMISDGTGKTAGFHLSSLYSPVGWRGWRDIAAAWESSVNKESGSAAAIKTFKNTELGETWVEEGEAPDWQRLVERREDYRVGTVPPGGLLLVGAADVQKDRIEASIWAFGRGKESWLVEHRVLMGDTARDAVWKRLAELLAENWTHASGAAMPLARFALDTGFATQEAYAFVRACRDPRVMPVKGVPRGAALIGTPTAIDVSQGGKKLRRGIKVFTVAVGIAKLEFYNNLRKGADVSEDGVTTVYPTGFVHLPKIDAEFIQQLCAEQLITRRDRNGFPVREWQKMRERNEALDCYVYARAAASAAGLDRFEERHWRELERQLGMERPPDEPPPIQAFDPNEATQRGGLSVSANPPRRRVIKSRWLS >NZ_AP021884|1977054:2023954|2016133_2018365_-|WP_147073239.1|DBSCAN-SWA MNIKTENSSAQASWFVGASYGGTDDQLPRFLSEGIWENGYEDKHLDVVRSMRPGDQIAIKSSYTRKHGLPFDSRGQAVSVMAIKAIGTITENLNDGQRVKVDWTKIEPVREWYFYTHRGTVWRVLPGEWMTDGLISFAFHNKPQDVERFRNAPYWRERFGTVAADKHRFGWTKFYEAIADKLLTYRANRAALVEGIREISVRVDGLGHLAEDKYADGTTGFVKDICPFTTMGLFNRGIKDSNRKIIATELAKFLGVDEPVPETFEGIPLLNNLKSWYFPFEINRATDHIDALWGVFAAGIAYADTDDDLAREEFAKAFDSANGRRGVAWNLTFGLYWIRPWAFLSLDHNSQLYVSKKLGVPIGMHGPKRRCNSADYLAVMDVLEPRFQETAYPVHSYPELSLEAWLYKDPTDEKSPVGEDDAGDVDDGDDATEATAPEDVHVAVPIVPYSVDDILKDGCFLERNEIDRLLDRLRTKKNLILQGPPGTGKTWLAKRLAFALMGQKDESKVRAVQFHPNLSYEDFVRGWRPTGEGKLSLADGVFMEAIKAASKDPSSKFVVVIEEINRGNPAQIFGELLTLLEAGKRTPNEALELCYPDADGKRRPVHIPENLYVVGTMNIADRSLALVDLALRRRFAFVGLEPRLGTAWRDWAVKECAVDPALVADIEHRIAELNDQIAADARLGKQFRIGHSYVTPAHRLEAGDTKKWFQQVVETEIGPLLDEYWFDAPDEAQKACARLLQGW >NZ_AP021884|1977054:2023954|2004724_2005132_+|WP_147073260.1|DBSCAN-SWA MQLTNLDAGVALPLPDDLLWSDEHAWSPAVATTSYLITGALLIQSATRQAGRPITLVGAPDMAWVTRATVEQLQAWAALPVGSATGRFGLTFSDGRSFTVAFRHAETAIEAEPVLGIPARAATDFYRLTLRFLEI >NZ_AP021884|1977054:2023954|2009989_2011075_+|WP_147073254.1|DBSCAN-SWA MSYPLSESFATAPATGYTAVLGGMAATHNNVQQSIDISAPNSQSILRFNETAHGDFWFEADVEFLTDPSARKHIGLWMTTGNGSEGYRFAHIDGAWSVTRWNSGFGDGAAVTGGVNDGAKPVAGVIDVAPTFNVGQRMPLRCEVIVGAFDANGVPWARLIQFKAGGVLMFQVGDAAYRGKLIPGVFLYGATARVHAIAGDTPSGLPAFPATVGVNAADDLLPLAGGSTSVPPDPAANIAVNADCDLMRLNSPNSELWNRGGGYDWHFHAIPNGRKNIHFSGHGFIAGTVKEKGQPDQPLVRRVQLVSENTRVLVAETWSDTTGAYRFELIDPAQRYTVVSYDHKQMYRAVIADNLHPEMMP >NZ_AP021884|1977054:2023954|1985452_1986862_+|WP_147073341.1|DBSCAN-SWA MNTLNVEYRKVEALIPYARNPRTHTDEQVAKIAASIVEYGWTNPVLVDGDNGIIAGHGRLAAARKLGLDQVPVIELAHLSPTQKRAYVISDNRLALDAGWNEEMLALEMAELSEAGYDLALTGFEDAEIEALLADEVASDAADQEPDADEPDDGDDVPDSPVVPVSRTGDFWAIGTHRLICGDATDPTVVATLMQGDAARLCFTSPPYGNQRDYTSGGITDWDGLMRGVFAKVPMDDDGQVLVNLGLIHRDNEVIPYWDAWLGWMRTQGWRRFAWYVWDQGPGMPGDWAGRFAPSFEFVFHFNRSSRKPNKIVPCKHAGQESHLRADGSSTAMRGKDGEVGGWTHKGQPTQDTRIPDSVIRVMRHKGKIGQDIDHPAVFPVALPEFVIEAYTDAGDIVFEPFGGSGTTMLAAQRKGRVCRCVEIAPEYVDVAIKRFQQNHPGVPVTLLATGQSFDDVVNERQATTEVEQ >NZ_AP021884|1977054:2023954|1979299_1980154_+|WP_147073311.1|DBSCAN-SWA MNASVLTASHYGVVRFGDLQCEAVVLKGGERGYVRRQLAKLLGFHETHKGGRFARFLADFAPKSLSALEKTREPILLPSGRQAQFFPAGIIADVASAVVSAAINGTLHKARQGIVPNCMKIMRALATTGEVALIDEATGYQYHRAPDALQELISKLLRQSCSSWERRFHPDYYRALYRLFGWKYQGHDQNPPHVVGQITQRWVYGPVLPVTLIDEIRARKGISQKHHQWLSDQGLARLETQIHAVTAIARSSTCYRDFDRRCEAAFAGGALQLALLAEDFEEGA >NZ_AP021884|1977054:2023954|1993515_1995027_+|WP_147073279.1|portal|DBSCAN-SWA MAWYSKIRSLFGQQPVHEAAGRGRRSLAWMPGNPGAVAAMLATNTELRIKSRDLVRRNAWAQAGIEAFVSNAVGTGIKPQSLAADERFKTDVQALWRDWTEEADAAGQTDFYGLQALACRAMLEGGECLIRLRPRRPEDGLVVPLQLQLLEPEHLPISLNLDLPSGNVVRSGIEFDSLGRRVAYHLYRSHPEDGRLAPMSGQGGMDTVRIDAKEIIHLFRVLRPGQIRGEPWLSRALVKLNELDQYDDAELVRKKTAAMFAGFVTRQNPEDNLMGEGAADGDGIALAGLEPGTLQILEPGEDIKFSDPADVGGSYGEFLRTQFRAVAAAIGVTYEQLTGDLTGVNYSSIRAGMLEFRRRCEMVQHGVLVHQMCRPVWAAWMKQAVLAGAIDAPGFARGGPARRRRYLQVKWIPQGWQWVDPEKEFKAMLLAIRAGLMSRSEAISAFGYDAEDVDREIAADNQRADDLGLIFDSDPRRTSKDGGSAEPNKNAADTTQTGSSSSA >NZ_AP021884|1977054:2023954|1977329_1977812_+|WP_024973177.1|DBSCAN-SWA MSDLTIFPVDIAEMSVSQLAALPPEQKCEVDKNLDAAIDWLKKARTKFDAALEQCYGEQARVALRESGRDFGTAHISDGPLHIKFELPKKVSWNQKQLGEIAERIVASGEKVEGYLDVKLSVSESRYINWPPALQQQFAAARTVDSGKPSFTLSTDGGEA >NZ_AP021884|1977054:2023954|1981381_1981636_+|WP_147073305.1|DBSCAN-SWA MTDNNTPTTGIEPMIDAKQAAAALRLPYYWFADHAMRTKYRIPHYLMGGLVRYRLSELSAWATRTTAVQGRDSQDADAPVEGAE >NZ_AP021884|1977054:2023954|2021110_2023954_-|WP_147073232.1|DBSCAN-SWA MAGGGFNVGDWCWLTRQAASCRVIDRQEVWGESAYRVWLPAKDAVVRARASDLAPLDSVRPTVEEILHTTAAAKLLDALEDNLLLAPIQSSVVPLPHQLYALNRAISRDRIRYLLADEVGLGKTIEAGLVLRELKLRGRVKRVLVVAPKGLVRQWQAEMRLHFGEHLQFIEPSELAAFRQWRSGNQGDEDNLWRMHDQVICSLDSVKPMESRRGWSLEQLNNYNRERFEDLISASWDLVIIDEAHRMGGSTEQVARYKLGAALAEASPYLLLLSATPHQGKTDQFLRLMQLLDRDAFPDESSVNRDRVRPFVIRTEKRLSINADGQPLFKPRVTRLQAVAWQARHNAQRRLYEAVTDYVRHGYNQAMAAKQRHIGFLMILMQRLVTSSTAAIRTTLEKRLALLEEPQPQPSLFENTSEEDWADLDGQSQVDLAMQATGWELEKSEVEILLALARETEASGTDAKAEALLELIYKLQQEENDPALKVLLFTEFVPTQAMLANYLESRGFSVATLNGGMDLDARSKAQKAFSQDVRVLISTDAGGEGLNLQFCHVIVNFDMPWNPMRIEQRIGRVDRIGQRHVVRAINFVLEDTVEHRVRQVLEEKLEVIAQQFGVDKASDVMDSAEAEPLFEELFVHGLQNPASIEQECDAVVSQLRETLAESKKSSELLSDAHELEADDARKWRDHPAQFWLERAITSGLAARGGAATKVGKAWRVTWADGSEAAQVCFDARTADENHDIEWVTLEDPRARAVISELPRFVAGQPLPVIRVTGLPDSVSGVWSLWEISLAAEGLNRKRYLPVFTNEEGRAFVPTAKRVWDLLLTETVVVHGVSGTEEAVKWFDAALTAAKAQGERIFTEMLEAHRTRLQEERERADYAFEARQQAIGRIGLPAVREHRRKRLQQEHDARLAALAEAAASVPDLNAVMMVRVSAEVQPGENVRETQST >NZ_AP021884|1977054:2023954|2008720_2009986_+|WP_147073256.1|DBSCAN-SWA MIDLTVKYFNSGMTGAPQISNNWGDLVTMLDACLVNGFALKAIDTLTFADGIATATISTGHAYRPFQVVEIAGAEQPEYNGSFRVLSTTTTAFTYAVTGAPVSPATTTTNLSAKVAPLGWEKPFAGTSKAAYRSKNPQSPQNILLIDNSLKTPNYTTGWAKWANVGIVEDLSDIDTIVGAQAPYDPNNPTQNWKQVTASQWGWYKWFHARGPQYESNGDSGGGGRNWVLIGDDRLFFLFCTNAAGYGWYGRNSYCFGDLISFKPGDNYATVLAADDNYSGMSNYWSYPGQFSGYGLVSSLDFTGKVLLRNHTQLGNPVRFGLTSLNTNNGQQICGRGPMPFPNGADYSLWLLPTYVRQEDGHMRGILPGMLWMPQDRPYSDQTIVDNVVGQAGKRFLLVRTQYSSETEGAQIAFDITGPWR >NZ_AP021884|1977054:2023954|2011470_2013486_+|WP_147073250.1|DBSCAN-SWA MPAVLNEVTLVAALPAPTASVAVGPPLVDLLFDQPAATDANLVFGANYIAPRDDVVVLASLPLPVVAIKFIPPARAALLAELPALTVTTLLLRPSVPLDVTGASLPGVVFSGEVRYYSRTQRPTVGQTAHAWQVAAQTEDGSTQGQQDAAATPAGWDTFWRRTLGVPQGIEHRLPPVLAAAPEQRGARHQDATRLQDSTWFAHQDATRFAATRQGLFQNAGPLRDTTRFRHQDGDRTKRAGRVSFWQIARLLTERQGSDFQIASPSLKGWSVRYQDAVPPPLGISVWVVPQPPAPIPCYTPSAHLLFAALAPADSHLLFVCENHINPPPPDGEPVVVPVRRVYFVINNVTLYRVSDGAPVPVFNLSLSLDASSWAWGFDAVLPAKAEALVAGSASGPVELVASVNGTPFRVLAESISRERIFGDASIRISGRGRNAVLAAPYAPVMTFSNTEGRTARQLMDDVLTVNGIPLGWAVDWGLTDWNVPAGAFAQQGSWIDALTAIAGAAGGYLIPHPSAQSIRVRHRYPVAPWEWSTVTPDFVLPVDAVARESLRWLEKPAYNRVFVSGQDVGVLGQVTRAGTAGEVLAPMVVDPLITEAAAARQRGVAVLADTGHQLEVSLRLPVLAETGIIEPGAFVEYQDGSVTRLGIVRATQVEAGLPEVWQTLGVQAYA >NZ_AP021884|1977054:2023954|2014548_2015016_+|WP_170227448.1|DBSCAN-SWA MTGVPKTAIELAKRFEGFHRVPRIDPGRAHPYICPAGYWTIGYGHLCESTHPPITESEAEVYLTHDLQTALAATLRYCPVLATEPEGRLSAIVDFTFNLGAGRLQTSTLRRRINQRDWAVSAQELCRWIYGGGKVLPGLVARRKVEAALMSGLHY >NZ_AP021884|1977054:2023954|1988949_1989474_-|WP_147073289.1|DBSCAN-SWA MTTTQLTPAQHAILAYALEHTDGKIDWFPDNIKGGARKKVLDGLFNRALITSDGTHWFVAAEGYDAMGRARPTPAPVAADPELDAAVTAAEAAWAQEKAAAKPRTRENSKQATVIQMLQRPEGATVQQICETTGWQAHTVRGTFAGAFKKKLGLTIVSDKAQGSERVYRIAAEA >NZ_AP021884|1977054:2023954|1998008_1998455_+|WP_147073272.1|DBSCAN-SWA MADNSIRERILLAVMAAARPAVEGLGATLHRSPTVAISRELCPALAVFPESESITERANDRVTRELTVRVVALARAVPPASPETEADRLLTAAHAALFGDGTFGGLALGIREQESEWEVEDADAVAVALPARYRLTYRTLANDLSTLG >NZ_AP021884|1977054:2023954|2000670_2004714_+|WP_147073261.1|DBSCAN-SWA MAKRISILVALEGADEGLKRAITSAERSLGELSTTAKTAGAKAAAGMAEVKAGMSAFGDQVATAKTQLLAFLSISWAAGKVQEIVQIADAWNMMSARLKLATAGQREFTTAQAALFDIAQRIGVPIQETATLYGKLQQAIRMLGGEQKDALTIAESISQALRLSGASATEAQSSLLQFGQALASGVLRGEEFNSVVENSPRLAQALADGLNVPIGRLRKLAEEGRLTADVVVNALMSQKDKLASEYAQLPQTVSQAFERLRNAFGQWINRVDESTGLTKKLAEALTILANNLDTVMQWLKRIAEVGLAVLIYRLIPALITAWQTAGAAAVTAASATAAAWTTANLSVSAAVASVGLLKTAFAVLGAFLVGWEIGTWLSEKFEIVRKAGIFMVEMLVKAVEQLRYRWEAFAAIFTSDTIAEATQRHEARLAEMNQIFAQMYADATKGADAAKGAMNTAATAAEEIAKRLEAVRQGTQEAVGRGIEAVHSALEKLKSRLGEVEQAVGKANQTVNDATAKMAEAYKGLTSIVEANLLRQIEAVKARYQQEQSALETSKQSEAALITKSTQLLTEALTQQTTLRRQSTTDTLKLIDDESKARIESARRQGQTEEERRANVQRVENDILATKRQTMTQALAEYRQHIDALNAEANRHLTEIKRIEEEKRQLSMTTEERVRDIRRQGMTDFEATEDRKRQIAEYQGKAREALANGEFEQARQLAQKAMDLAAQVASSQTSEAKRGEDARKQSEQAVSQVTQLESQSRDAYRKQEYAQAEALMRQADALRAELAQKTKDADAQIAQGKDGVNQAIQRIRESEEILNKTLDAEAKAHQTAAQSALTARDQIQQTLTQTETQIDQITAKLKDGLKVTLDADTTRFDKAIADLDKALAEKEYLLKIQADLQEAEKKLQQYEQLLKEGKTLPVDADVSKAKEALDKLKTYADQNSQFELKVATEKAQAAITKVEGMIKALDRIQTESRHQVSTNADAARSEIMSLNWANTSSTHTIYVRKVEANATGGLVGGGVRRYADGGAVAPAFPRMSGGSVPGSGHHDTVPRTLDAGAFVIRKAAVQKYGGGALSRLANGVARFATGGAVMLGGGKRPSGNDADGTPSTPKKNREAVEAMKMIDLGLQGMNEYTNWLQWNYGASVSLDMRSKTMDSYGKQAQQDRRALEDFISRKTLTGNERQNLERIKQTWRQAMAQPLLWGKDLERELIDYMEQNQGEFYRRGGMAKSDTVPAMLTPGEFVVNKDAVSRYGAGFFEAINNLSAPAQALAGRALAGVQGFATGGLVQPSGSRLARPVLAADAGPSRTVRVELSSGQQKVNATVDARDESRLLQLLDAARARTA |
50 | Acidithiobacillus_phage(45.45%) | head,portal,terminase,capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2044160 : 2053121
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_AP021884|2044160:2053121|DBSCAN-SWA ATCAGTGTTTCAGCCCCAGGGTGTGGCTGAGGAATGGCACGCCGCCATACCGTCCCATCAGGTAGCCGCGTTCCAGTGCCTCGTTCTTGCGCGGGCTGAATTCGGATAAGCTGACTAACGCCGAGGTCTGATCACGGTCTTTTTCTACGAACCACACTTGATCGCGTCGGAACAAATCTGGTGCGTCCAGCAGCGACGTGTCGTGCGTCGTGAATATCAGTTGTGCGCCGCCGGTGTTGATCTCTGGGCGGTGGAACAGTCGCACGAGTTCGCGCACCAGCAAGGTGTGCAAGCTGGTGTCGAGCTCGTCGATGACCAGCGTTAGCCCTTTGCGGAGGATGTCGAGCACTGGTCCTGCAAGAAACAGCAAATTGCGCGTGCCGTTGGATTCGTCCATCAATTCAAACACGGCTTTGCCCTGTTCGGTGACGTGATGGAAGCGCAGCTTGTGTTCTTCCATTTCTTCCGAGCGCACTTCAGTTTTACCGGCCACCAAGTCAAAGTGAACGGCCTGCCCAGGGACTTTGCGTGTTTCCACATCAATGTCGGCGATGCTGATGTCGGCAGCAGAGAGGAAGTTGCAAATCTCCTTGCGGCCGTCATCCTGCTTGAGCATCTGAATGGACACCTGTGGACTGAGTTGGGCTTGCTCGTTGAAGATCACCAGGCGATTCACAAACCAGTCGAACACCGGTCGCAAGGCTTCGCTGTTGAGTTGTACGGCCATCGACAGGAAGAGGGCGTTCGGTCGGGTGGCACCTTCCCACAGGTTCTTGGGTCCTTTCAGGCCGGGACCGAAGTCGTAGACATCCTTGCCGGTCTCAGTGTCAAAGCGGCGTGTGAACCAACGCTGGGGCTTGAACGCCTTGTAAACCAGCAGGTGCTCACTGACGATCCGCTGAGCTGTCATGGAAAAGCCATACTGGTAGCGCACACCATCGAGCAGGAACGTGACTTCGAACTCGCTGGGTTGACTGGCGGAATCGACATCGAGTCGGAAGGGCTGAACTGCGAAGGTCTGGCCCGGCTGGATCGCCGTCGCGGACTCGGTCACCACGCCGCGCATGTACTGCAGGGCTTTAATGAGATTGGACTTGCCGCTCGCGTTGGCACCATAGACCACAGCACTGCGCACCAGGGTGGGTGCGGCACTGATGCCGGTGGCTTGGGTGTGGGTGTCCTGCAGGGTCTTGTCCTTGGACGCAACAAGACTAAGCACCTGCTCGTCGCGCAGACTGCGGAAGTTCTTGACGCGGAACTCGACCAGCATTGCATTCACCTCATTTTTATAAAATAGAGTCTTATTATGACTCAAAAGTGCAAAAAACAAATATATTTTTGAATTCTGGGGTGGTTTGAGTGATCTGGGTTGGCATCGCAGGCCTGCCGGGTGTCAAGTCCCCGTTTGAACTTCGCTGGAGCCTTGACGTGGCACCCAGCTGCGGTTCGGCCAAATCGTCGAATCCGCGCCCTAAAGACCCGTGCGTTACTATCTGGTTCTGAAAGCGGACGCGGGTTCGGTTCCGTGTTCCACCACCAATAATGAAACCCCAACCGTTCTCGGTTGGGGTTTTTTCTTGCCTGATCGCGCCGGTTTCCGCGTGTTGTTGGGGCTTCCTGCGGAAGCCTGCGGACTGCACCGGTCAGCCTTCCAGCCCGTTCCGGGCCACATTCCACTCTCTCCTGGCCATTCCTCGCTCCGACCTCGCTCCCTGGAACTGGCCCGAAGTCCGCAAAGGCCGCAATTCAGACCTATACAGATCAAAGAGTTACGCGCGGACTAATCAAGTGGTTGGATTGCGGCATTGGCTACCGGAGGGGACACACGCTTGCCCCAGCGAGCTATTGCACTTGCAGTTTGTCAGAAACTCGCATATATTTTCGTACATGAACACGACGACCAAGACTGCCGAGCTGATTCGCGAGCGCATCGAGGCGATGCCGATCGGGGAGCCTTTCACCCCGACAGCATTCCTGGAGTGCGGCACGCGTGCGTCCGTCGATCAGACCCTCTCCCGCCTCGTCAAGGCAGGGTTGATCGAGCGCGTGACGCGCGGTGTCTTTGTGCGTCCCGAGGTCAGCCGTTTCGTCGGCAAGGTTAGCCCCTCGCCGCTGAAGGTGGCCGAGACCGTCGCCAAGACCACGGGTGCCGTCGTCCAGGTTCACGGTGCCGAAGCGGCGCGTCGGCTCGAACTAACCACGCAGGTTCCGACCCAGTCGGTATTCGTGACATCCGGCCCGTCGAAGCGCATCCGCGTGGGGAAGATGGAGATCCGTCTGCAGCACGTCTGTCAGCGCAAGTTGGCCCTGGCAGGTCGACCCGCCGGGCTCGCGCTTGCGGCGATGTGGTATCTCGGCAAGAAGGAAGTGACGCCGGCCCTCGTCGAGAAGATTCGGCGCAAGCTGGGATCGAGCGAGTTCGAGGTGCTGAAGTCAGCCACCAGCTCGATGCCTGCGTGGATGAGCGACGCCATCTTCCGAAACGAGCGGATGGCCGCTCATGCCTGAGTCCTTCCTGCACCTGAAGCCTCAAGAGCAGTCCCAGATCTATCGGGCACTGGCTCCGCAGCTTGCCCGCACGCCCGTCGTACTGGAAAAAGATGTCTGGGTCTGCTGGGTGCTGCAGACCCTGTTCACCATGCCCGACCGACTGCCGATGGCCTTCAAGGGCGGCACATCACTCTCCAAGGTGTTCGGCGCCATTGCGCGCTTCTCCGAGGACGTGGACATCACGCTCGACTACCGTGGCTTAGACGGCTCCTTCGACCCGTTTGCCGAAGGCGTCTCACGCAATCGGCTGAAGAAATTCAGCGAGGATCTCAAGTCCTTCGTGCGCGGCCATGCCCACGGTGTCGTGGCGCCGCACTTTCAGAAGATGCTGGCGGACGAGTTCGATGCCGATGCATTCCAGCTTGAAGTCAGCGATGACGGCGAGCAGATGCGGGTGCACTACCCGAGCGTGCTGGAGGCACCAGGAGACTATGTGGGCAACAGTGTCCTGATCGAGTTCGGTGGCCGTAACATCACCGAGCCGAATGAGGAGCGTGAGGTGCGACCCGACATCGCGGAACATGTCGCTGAACTCGATTTCCCTCGCTCGACGGTCAGTGTGCTGTCTCCGACACGTACCTTCTGGGAAAAGGCGACGCTGATACACGTCGAGTGTCAGCGCGACGAGTTCCGCACAGGCGCCGAACGTCTGTCACGCCACTGGTACGACCTGGCCATGCTGGCCGATCTTGCCCATGGGCAAGCCGCTGTGGCCGATCGCGCTCTGCTCGCGGATGTTGTCAAGCACAAGAAGGTCTTCTACAACGCGAGCTACGCCAACTACGACGCATGCCTGTCCGGGCAGCTCAGACTAATTCCGGAAGATGCTGCACTGGCCGCGCTGCGCGATGACTTCCAGCGCATGATCGGTGCCGGCATGTTCATCGGCGAGCCTCCCGCCTTCGATGCCATCGTCGATCGCCTGCGCGCGCTGGAAACAACAATCAATCAGTGACCTCCCGCTGGCGTTGAGTCGGCAATCCTCGCCCGCTGCTCATCCCACAAGGCTGGCGGATCAACCGCCAGATCAAACAGCGTGATGTGGTTCGGCAGTGCGTCGTCCAGGATGGCGGCCACGATGTCGGGGGCTAGCGTGGTCAGGTTGACCATACGGCTGACGTAGCTGTTGTCGATGCCTTCCCGTGTGGCGATCTCCTTCAAGGACTTCGCTTCTCCTGATTCCAGCATCGCCAACCAGCGGTGGCCCCTGGCCAACGCCAGCTGGATGGAGGTCGGCGCCATGTCCCACGGTCTGACCGGCGCGGTTTCTCCGTTCGGCAAGGTGACCAGCTTGCGGCCGCTACGGCGCTTGATCTGGATCGGTACAGACAGGGTCAGCCTGCCGTCGCTGGTCTGCAGGATGTCCGGCTCGCCGGTTTTCTGGATGCGGATGTCGCTCATGCCAGGGCCTCCTCAGTTTGCTCGACCGGCTCGGGACGCAGTTCCAGCACCAGGCGTTCGATGCCGTTGGTGCGCAGGCGCACTTCGAGGTCGTTGGGTGACACGATGACTTTCTCGACCAGCAATTTCACGATCCGGGTCTGCTCCGCCGGGAATAGCTGATCCCAAATCGCATCAAGCCGGGTCATGGCCACGGTGATCTTGGCCTCGTCCAGCGTCGGGTCGAGCTTGATCGCCTGTGGCAGCATGTTGCCGAGCAGATTCGGGGCATGCAAAATCGCGCGTAGTTGATCGAGTACCGCCGACTCCAGTTCTGCGGCGGGCAGTCGCGGCAGCCCCGAGGCACCCGCGTGTTCCTTGGCGTCGCGCTGGGGCACGTAGTAACGGTAGCGCCGGCCATTCTTCTTGGTGGTGTGCCACGGCGACAGTGCGCGGCCATCGTTGCCGAACACGATGCCCTTGAGCAGATAGGGAACCTTGGCCCGCGTCTTGTTGCCCCGCACCCGGCCATTCGTCTCCAGGATCGCGTGGACGCTGTCCCACAGTTCGCGGCTGACGATCGGCGGGTGTTCGGCCTGGTACCACTGGTCCTTGTGCCGCAACTCGCCAAGGTAGGTCCGGTTGCTCAGGAGCTTGTAAATGTGGCCCTTGTCGATCGGCCTGCCATCGCGGGTCTTGCCGTCTTGTGTGGTCCACGCCTTCGACGTCACGCCATCCAGTTTCAGCTCCTTGACCAGTGCGGTGCTGGAACCGAGTTCAACGAAGCGCTGGAAGATGTGCCGGATCAGCTTGGACTCACGCTCGTTGGGCACCAACCGCCGGTTCTCGACGTCGTAGCCCAGCGGCGGCACGCCACCCATCCACATACCCTTGCGCTTGCTGGCCGCGATCTTGTCTCTGATGCGCTCACCGGTGACCTCGCGCTCAAACTGCGCAAAGGACAGCAGGATGTTCAACATCAACCTGCCCATCGAGGTCGTCGTGTTGAACTGCTGGGTGACCGACACGAACGACACGCCATAGCGCTCGAACACTTCGACCATCTTGGAGAAGTCCGCCAGGCTGCGCGTCAGGCGGTCGATCTTGTAGATGACGACCACGTCGATCTTGCCGGCTTCGATGTCCGCCATCATTCGCTGGAGCGCCGGGCGTTCCATGTTGCCGCCGGAAAAAGCTGGATCGTCGTAATCGTCGGCGACCGGTATCCAGCCTTCGGCGCGCTGGCTGGCGATGTAGGCATGGCCGGCGTCGCGCTGGGCATCGATGGAGTTGTATTCCTGGTCCAGCCCTTCATCGGTGGATTTGCGCGTGTAGACCGCACAGCGCATGCGGCGCTTCAAGACTTCGCTCATCGTCCACCTCTCTTCTTGGTGGACGGCTTGGCCTTGGCGTTGGACGGCGGCTTGAGCCCGAAGAACAGCGGCCCCGACCAGCGCATGCCGGTGATTTCGCGGGCGATCATCGATAGGCTCGGGTACATGCGTCCCTGGAAGTCATACTGGCCGTCGGCGGTTGCGATCACGCGGTATTCGACGCCTTTGTATTCCCGGACCAGCACCGTGCCTGCCGCCGGACGGTAATCGCGGTCACGCTTTTTCACCTTGCCTGTTTCCACCAGAGATGCGATGCGACGCTGGTTGCGATCCAGCAGGTTGGCGTCGGCCTTGCGGAATTCCAGCTCCTGCAGCCGGTAGGCAATCCGGCGTTCGAGGAACTGGCGGTTGTGGGTGGGAGTGTCGCCACCGACCAGCTTCTGCCAGAGGGCCCGGATCTCTGCCATCGGCATCTCGGGCAGCCTGGCGATCTGCGCCGCCACCGATGGCGGCGTGGAAAATGATGGTGTTTGCGTGCTCATTTCGACTCCGTAGTTGTCTTGTTGACGGGGTCTGTATGAACGCGCTGGTTGCCAGAGAAGCCAAGCTCAAACTCGCTCGCTTCTGCCCTGGTTGCGGACTGTTCTGTGCCGGTGATACGCAAGCGTGCCAGGCCGTTGGCCAGCAACGACGCGATCTCGTGACGACGCTGTTTAGCACTGGGCGTCTATCCTGACGTGACGCTATCTGTTGCGAGAAACAAGCGATATGAGGCACGCAAGCTGCTTTCCAATGATGTAGACCCGGCTATGCTCAAACAGGTGACTAAGCGCGCATCGCGCGTGTCTGCTGAAAACAGTTTTGAAGCAATAGCGAGAGAATGGTATGCAAAATTCTCGGGCGAATGGGTGCCTAGCCATGGCGAAAAAATCATCCGCAGATTAGAACGCGACCTGTTTCCCTGGATCGGTAAACGCCCTATTGCCGAGATCACCGCACCTGAACTGTTAGCCGTCTTACGCCGCATTGAAAACCGAGGCGCGCTAGATACGGCGCACCGTGCGCATCAAAACTGCGGGCAAGTGTTCCGCTATGCAATCGCCACTGGGCGCGCTGAACGCGATCCTAGCCCCGACTTGCGCGGCGCATTGCCGCCAGCTGGGTATTCTGGATCATCGTGACCGCTGATTCCGGGCTATCGTGACCGGTCATTCCGGCGCATCGTGACCGGCGATTCCGGTCTATCGTGACCGATTTTGCAGGGTTTCCGGAATCAGTGGTCACGATAGCGGAATCATCGGTCACGATAGCGGAATGGTGTCGTACCGCATGGAAATGGTGTTACGCATAGAGCAACCGAACGAGTACGCTTCCAGCCTTTTGTCTGGAGACAGCGTGCCCGTATCAAGGATCACCATGCGTAAAATTAAAGACGTATTGCGTTTGAAACTGGACGCCAGGCTGTCGCACCAGCAGATCGCCGCTGCGCTGGGCATATCGAAGGGAGTCGTCACCAAGTATGTCGGTCTGGCCGCCGCCGCAGGCCTGGATTGGGCTGCCGTGCAAGACATTGACGAAACCACGTTGGGGCGGCGCCTGCTGGTTACCCCCGAGCGACCGCGCGATCATGTTCAGCCGGACTACGGCCGTTTGCATCAAGAGCTGCGGCGCAAAGGCATGACATTGATGTTGCTCTGGGAAGAGTACCGAGCCGACCACGCCGACCGGCAGACCTATGCTTACTCGCAGTTCTGCGACAACTACCGGCGCTTCGCCAGGCAACTCAAGCGCTCCATGCGCCAGGTTCACCGTGCCGGCGAGAAGCTGTTCATTGATTTCGCCGGCCCCACCATCGCGCTGACCGACGGCAGTCGCGCGCACATCTTCGTCGCGGCACTGGGCGCTTCCAGCTATACCTTTGCCTGCGCCACGCCGCGCGAGACCATGACCGACTGGCTGAAATCGACAGCGCGCGCGTTAAGCTTCATCGGCGGCATGCCCCAGATGATCGTGCCCGACAACCCGAAGGCGCTGATTGCGGACGCCAACCGTTACGAGCCGCGCAGCAACGATACCGTGCTCGATTTCGCGCGCCACTATGGGACGTCGGTGTTGCCAGCACGACCCTACCACCCGCAGGACAAAGCCAAAGCAGAATCGGCGGTACAGATCGTCGAACGCTGGATCATGGCGCGCCTGCGCCACCAGCAATTTGCCAGCGTAGATGATGTCAATCAGGCCATCGCACCGCTGCTTGCCAGGCTCAACGAGAAGCCATTCCAGAAGCTGCCCGGCAGTCGCGCCAGTGCATTTGCCGAAATCGGCGCACCCGCCTTGGCTCCGTTGCCGCTGCAAGCTTATGAGATGGCACACTTCAAGACGGTCAAGGTTCACATCGACTATCACGTAGAAGTCGAACGACACCGCTACAGCGTGCCGCATTCATTGGTCGGACAAGTACTTGAAGCACGGATCACAGTGGCAGTGGTCGAGATCCTGCATCGCGGTAACCGCGTGGCCAGCCATGCCCGCAGCAGTCTGGCCGGTGGCTTTACCACCACCGCCGCGCACATGCCGGCGGCGCATCGCGCCCAGATGGAATGGTCGCCACAACGGCTGATCCACTGGGGCCAAAGCATTGGCCCTGCCGCCGCCGAAGTGGTGACACGGCTACTGAACAAGTACAAGCATCCCGAACATGGCTACCGCGCCTGCCTTGGGCTGCTGTCGCTGGTCAAGCGTTATGGCAAACCCAGACTGGAGGCGGCCTGTACGCTGGCTTTGCAGATCGGCGTCTGCCAGTACCGCCATGTGCGCGACATCCTGAAGAATAACCGCGACGCAGCCGCGCCGCTCAGCACTGAAGAATGGGTCAGCCCCAACCATGTCCACGTGCGCGGTCCTGGCTACTACCAATAAGGAAAGACAACATGATGATGCATACCACGCTGACGCAATTGCGCAGCCTGAAACTGGATGGCCTGGCGACGGGGCTGGAAGAACAACTGGCACAGCCCGGTATGGCTGCACTCAGCTTCGAAGAACGCGTAGCACTGTTGGTGGACCGGGAAGTCCATGCCCGTAATGACCGCAAACTGGCGCGCCTGCTCAAGAACGCTCGCCTGAAATACGGGCAGGCGGCCATCGAGGATATCGACAGCCGCGCAGGACGCGGTATCGACCGGCGCGAGGTGATGAGCCTGGCTTTGGGCGACTGGGTCAACGCCGGCCACAGCATCCTGATTACAGGACCGACCGGCGCCGGTAAATCCTGGCTGGCCTGCGCATTGGCACAATACGTCTGCCGCCGTGGTTACTCAGCCATCTATCAGCGCGTACCCCGCATGCAGGAAGAACTGCGCATCCGGCACGGCAGCGGCACCTTCGGCAAATGGCTGCTGCAACTGGCCAAGACCGACGTATTGGTTCTCGATGACTGGGGCATGGGCGCTATCGACAGCATGACCCGTTCCGACTTGCTGGAGATCATCGACGACCGTGCCGCCAACAAGGCCACCATCATCACCAGTCAGTTGCCGGTGGAGCACTGGCACGCCTGGATAGGCGATGCCACCATCGCCGACGCCATCCTCGACCGCATCATGCAGCGCAACCACCGCTTCACGCTGACCGGCGAGTCGCTGCGAACAGAACAATCAAAAACAAGCAAAAAGGAGGAAAAAACCACCCCATCGTGA
Protein sequences of DBSCAN-SWA_6 >NZ_AP021884|2044160:2053121|2048106_2049462_-|WP_147074829.1|DBSCAN-SWA MSEVLKRRMRCAVYTRKSTDEGLDQEYNSIDAQRDAGHAYIASQRAEGWIPVADDYDDPAFSGGNMERPALQRMMADIEAGKIDVVVIYKIDRLTRSLADFSKMVEVFERYGVSFVSVTQQFNTTTSMGRLMLNILLSFAQFEREVTGERIRDKIAASKRKGMWMGGVPPLGYDVENRRLVPNERESKLIRHIFQRFVELGSSTALVKELKLDGVTSKAWTTQDGKTRDGRPIDKGHIYKLLSNRTYLGELRHKDQWYQAEHPPIVSRELWDSVHAILETNGRVRGNKTRAKVPYLLKGIVFGNDGRALSPWHTTKKNGRRYRYYVPQRDAKEHAGASGLPRLPAAELESAVLDQLRAILHAPNLLGNMLPQAIKLDPTLDEAKITVAMTRLDAIWDQLFPAEQTRIVKLLVEKVIVSPNDLEVRLRTNGIERLVLELRPEPVEQTEEALA >NZ_AP021884|2044160:2053121|2044160_2045429_-|WP_147073207.1|DBSCAN-SWA MLVEFRVKNFRSLRDEQVLSLVASKDKTLQDTHTQATGISAAPTLVRSAVVYGANASGKSNLIKALQYMRGVVTESATAIQPGQTFAVQPFRLDVDSASQPSEFEVTFLLDGVRYQYGFSMTAQRIVSEHLLVYKAFKPQRWFTRRFDTETGKDVYDFGPGLKGPKNLWEGATRPNALFLSMAVQLNSEALRPVFDWFVNRLVIFNEQAQLSPQVSIQMLKQDDGRKEICNFLSAADISIADIDVETRKVPGQAVHFDLVAGKTEVRSEEMEEHKLRFHHVTEQGKAVFELMDESNGTRNLLFLAGPVLDILRKGLTLVIDELDTSLHTLLVRELVRLFHRPEINTGGAQLIFTTHDTSLLDAPDLFRRDQVWFVEKDRDQTSALVSLSEFSPRKNEALERGYLMGRYGGVPFLSHTLGLKH >NZ_AP021884|2044160:2053121|2049458_2049965_-|WP_147074830.1|DBSCAN-SWA MSTQTPSFSTPPSVAAQIARLPEMPMAEIRALWQKLVGGDTPTHNRQFLERRIAYRLQELEFRKADANLLDRNQRRIASLVETGKVKKRDRDYRPAAGTVLVREYKGVEYRVIATADGQYDFQGRMYPSLSMIAREITGMRWSGPLFFGLKPPSNAKAKPSTKKRGGR >NZ_AP021884|2044160:2053121|2052353_2053121_+|WP_147074833.1|DBSCAN-SWA MMMHTTLTQLRSLKLDGLATGLEEQLAQPGMAALSFEERVALLVDREVHARNDRKLARLLKNARLKYGQAAIEDIDSRAGRGIDRREVMSLALGDWVNAGHSILITGPTGAGKSWLACALAQYVCRRGYSAIYQRVPRMQEELRIRHGSGTFGKWLLQLAKTDVLVLDDWGMGAIDSMTRSDLLEIIDDRAANKATIITSQLPVEHWHAWIGDATIADAILDRIMQRNHRFTLTGESLRTEQSKTSKKEEKTTPS >NZ_AP021884|2044160:2053121|2046046_2046667_+|WP_147074826.1|DBSCAN-SWA MNTTTKTAELIRERIEAMPIGEPFTPTAFLECGTRASVDQTLSRLVKAGLIERVTRGVFVRPEVSRFVGKVSPSPLKVAETVAKTTGAVVQVHGAEAARRLELTTQVPTQSVFVTSGPSKRIRVGKMEIRLQHVCQRKLALAGRPAGLALAAMWYLGKKEVTPALVEKIRRKLGSSEFEVLKSATSSMPAWMSDAIFRNERMAAHA >NZ_AP021884|2044160:2053121|2047657_2048110_-|WP_147074828.1|DBSCAN-SWA MSDIRIQKTGEPDILQTSDGRLTLSVPIQIKRRSGRKLVTLPNGETAPVRPWDMAPTSIQLALARGHRWLAMLESGEAKSLKEIATREGIDNSYVSRMVNLTTLAPDIVAAILDDALPNHITLFDLAVDPPALWDEQRARIADSTPAGGH >NZ_AP021884|2044160:2053121|2050818_2052342_+|WP_147074832.1|transposase|DBSCAN-SWA MPVSRITMRKIKDVLRLKLDARLSHQQIAAALGISKGVVTKYVGLAAAAGLDWAAVQDIDETTLGRRLLVTPERPRDHVQPDYGRLHQELRRKGMTLMLLWEEYRADHADRQTYAYSQFCDNYRRFARQLKRSMRQVHRAGEKLFIDFAGPTIALTDGSRAHIFVAALGASSYTFACATPRETMTDWLKSTARALSFIGGMPQMIVPDNPKALIADANRYEPRSNDTVLDFARHYGTSVLPARPYHPQDKAKAESAVQIVERWIMARLRHQQFASVDDVNQAIAPLLARLNEKPFQKLPGSRASAFAEIGAPALAPLPLQAYEMAHFKTVKVHIDYHVEVERHRYSVPHSLVGQVLEARITVAVVEILHRGNRVASHARSSLAGGFTTTAAHMPAAHRAQMEWSPQRLIHWGQSIGPAAAEVVTRLLNKYKHPEHGYRACLGLLSLVKRYGKPRLEAACTLALQIGVCQYRHVRDILKNNRDAAAPLSTEEWVSPNHVHVRGPGYYQ >NZ_AP021884|2044160:2053121|2050160_2050604_+|WP_147074831.1|DBSCAN-SWA MTLSVARNKRYEARKLLSNDVDPAMLKQVTKRASRVSAENSFEAIAREWYAKFSGEWVPSHGEKIIRRLERDLFPWIGKRPIAEITAPELLAVLRRIENRGALDTAHRAHQNCGQVFRYAIATGRAERDPSPDLRGALPPAGYSGSS >NZ_AP021884|2044160:2053121|2046659_2047664_+|WP_147074827.1|DBSCAN-SWA MPESFLHLKPQEQSQIYRALAPQLARTPVVLEKDVWVCWVLQTLFTMPDRLPMAFKGGTSLSKVFGAIARFSEDVDITLDYRGLDGSFDPFAEGVSRNRLKKFSEDLKSFVRGHAHGVVAPHFQKMLADEFDADAFQLEVSDDGEQMRVHYPSVLEAPGDYVGNSVLIEFGGRNITEPNEEREVRPDIAEHVAELDFPRSTVSVLSPTRTFWEKATLIHVECQRDEFRTGAERLSRHWYDLAMLADLAHGQAAVADRALLADVVKHKKVFYNASYANYDACLSGQLRLIPEDAALAALRDDFQRMIGAGMFIGEPPAFDAIVDRLRALETTINQ |
9 | Acidithiobacillus_phage(66.67%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2294456 : 2305315
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_AP021884|2294456:2305315|DBSCAN-SWA TCTAACTGCGCTGGTAAATGTCCTCAAAGCGCACAATATCATCCTCCCCCAGATACGCGCCCGACTGCACCTCGATCATGTGCAATGCAATCTTGCCCGCATTTTCCAGCCGGTGCGTACTGCCCAATGGAATGTAGGTGGACTGATTCTCGGTGAGCAGCAGCACTTCGTCATTACGCGTAACGCGTGCCGTGCCGCTCACCACTATCCAGTGCTCGGCACGGTGGTGATGCATTTGCAACGACAGTTTCTCGCCCGGTTTGACCATGATGCGCTTGACCTGAAAACGCTCCCCCGCATCAATACCTTCGTACCAACCCCACGGGCGAAACACACGGGTATGGTTGAGATGCTCGGTACGACGGTGTTGTTTCAGGTGCTCGACCACTTTTTTGACGTCCTGCACGCGGTCTTTGTGCGCCACCATCACGGCATCGCTGGTTTCCACGATCACCAGGTCGGATACGCCAATTACCGCGACCATGCGGCTCTCGGCACGGATCAGATTGTTGCTGGCACCGTCGTTGTAGATGTCACCGCGCATGACATTGCCAGCACGATCCTTGGCACCGATTTCCCACAGTGCCGACCACGAGCCGATATCGCTCCAGCCAATGTCGGCGGGCACCACCACGGCGTTACGGGTGCGTTCCATGACGGCGTAATCGATGGATTCGGAGGGGCAAGCCGTAAATGCCTGGGTATCCAGACGGACAAAGTCCAGATCACGCTGGCTGCTGTCCAGTGCCGCCTGGCTGGCGGCGAGAATATCCGGGCGGTAGCCGCGCAATTCATTGACAAACGCCGCAGCCTTGAACAGAAACATGCCGCTGTTCCAGAAATAATCGCCCGATTGCAGATAGCCCTGGGCGATTTCACGGCTGGGTTTTTCCACAAAACGCCCTACTGCAAACACACCGTCCAAATGGCTGTCCGCCGGGCCGCGCTGGATATAGCCATAGCCGGTTTCAGGCGCCTGTGGCACGATGCCGAAGGTCACCAGCTGACTAGCCTGGGCGGCATCCACGGCCTGGGCCACGGCGCGCTCAAATGCGTCCACATCTGCTATCAAGTGATCGGCCGGCAACAGCAGCATCAGGGCCTCGGCATCCCGCGCCATGAGTGCCAGCGCTGCGACTGCCGCTGCCGGTGCGGTATTGCGCCCGATGGGCTCGAGAAAAATAGTCTCGGGCGTGACCTCAATCGCGCGCATCTGCTCGGCCACCATGAAGCGGTGCTCATGGTTGCATACCAGCGTGGGCGGCGTGATGTCGGCGATACCGGAAAGGCGCAGCACGGTTTCCTGCAGCATGGTGCGCTCCGACACCAGCGGCAACAACTGCTTGGGCAACGCGGCGCGGGACAAGGGCCACAGGCGGGTACCGGACCCCCCGGAGAGAATGACGGGATGAATGCGCATGTTGGTTGTTTCCTCAATCAATTCAGTGAATTCATCAGCCTGCCCAAGCTGGGGAAAGGCATCCATGCGATACAGTCCGGCGCGCTGCATCGAGAGCAAGACATCTACGCTACGCGAACCAATGAAAACCGGCTGGCCAGTCAGCCAAACACATATCATGTCTCCTTTTCGGGCTTGTCCAAACACAGGCCCAACGCCGCATCCCAATCCGGCAATAGCAAACCAAAGCTGCGCAGCAGCTTGTCCCCGGCCAGCACGGAATTAGCCGGGCGCCTGGCCGGGGTGGGGTAATCAGTGCTTGGGATCGGGATCAGCTCAGGACGTTTGGCGCCCGCCTGGGTGGTTTGGTCGAGGATGGCCTGGGCGAATTGATACCAGCTCGCCCGGCCCCGGCTGCTGAGGTGGTAAGTGCCGTGCAGGGCTTCTTTACCCTGGCCAAACGGCTGTTGCGCCAGTATCTGCGCCGTGGCCTCGGCGATCATGCGTGACCAGGTGGGTGCGCCGTATTGATCGGCAACAATCCGCAACTGCTCGCGCTCCTTGAACAGGCGCTGCATGGTGAGCAGGAAATTGCCAGCGCGCAGGCCATACACCCAGCTGGTGCGCAGAATGAGGTGGGGAATGGCGGCAGCACGGATGGCGTTCTCACCTTCGAGTTTGGTCTGCCCATAGACGCCTAGCGGGTGGGTGGTGTCGTCCTCAGTGTAGGCACCGGGCTTGTTGCCATCGAACACATAATCGGTAGAGTAGTGAATCATCGCCGCGCCGAGTTTTGCCACCTCTTCGGCCATGATGGCGGGAGCAATGGCATTCACCGCACGTGCCAGTTCCGGCTCGGATTCCGCCTTGTCCACGGCGGTATGGGCGGCCGGGTTGACGATCAGGTTCGGGCGCAAGCTACGTATGAGGCTGCGGATGGCACCCGGGTCGGTCAGGTCAAGCTGGCTGCGGGTGGGCGCGCTCACCTTGCCCAAAGTCGCCAATGTGCGGCGCAACTCCCAGCCGACCTGGCCATTGACACCGGTGAGCAGGATATTCACGCAAACACCTCGCACTGCGCCAGCGGCAGGCCGGCGGCGTCCTTGGCGGCAAGTGCGGGTTCGCCGTGCAGTGGCCAGGTGATGCCCAGCGCCGGATCATTCCACAGCAGGCTGCGCTCGAACTGCGGCGCCCAGTAGTCGGTGGTTTTGTAGAGAAACTCTGCGCTGTCGGAGATCACCAGGAAACCATGGGCGAAGCCTTTGGGTATCCAGGCCATGCGTTTGTTTTCGGCGGACAGTTCCATCCCCACCCATTTGCCAAACGTGGGTGAGGATTTGCGCAAGTCCACCGCCACGTCATACACGCTACCACTAATGACGCGTACCAGCTTTCCCTGAGTATTTTGAATCTGGTAATGCAAGCCACGTAATACGCCCTTGGCTGAGCGCGAGTGATTGTCCTGCACGAAGTCGTCGGGGATACCGGCCCCGGTCATGGCGCGACGGTTGTAGCTCTCGTAGAAAAAGCCGCGCGCGTCGCCGAATACTTTGGGTTCGAGCACAAGCACGTCGGGGATTTCAGTGGGGATGATGTTCATGGTTAAAACAACCGTTCGTTGAGCATGGCCAATAAGTATTGGCCATAGGCATTTTTCTGCAAAGGCGCTGCCAGCCGTCCGACCTGGGCGGCATCAATATAGCCCTGGCGGTAGGCGATTTCTTCCGGACAGGAAATTTTGAGACCCTGGCGCTTTTCAATGGTCTGGATGAATAGCGATGCCTCCAGCAGGGATTCATGGGTGCCGGTGTCCAGCCAGGCATGGCCGCGCCCCATGACTTCCACCTGCAATTGCCCCATTTCGAGGTAATGCCGGTTAATGTCGGTGATTTCCAGTTCGCCGCGCGGAGATGGCTTGAGCTTGCGGGCGATATCGATGACCTGGTTATCGTAAAAATACAGGCCAGTGACGGCGTAGCGCGATTTGGGCTGGGCGGGTTTCTCTTCCAGGCTGATGGCGTTGCCTTGAGCGTCGAATTCAACCACGCCATAGCGTTCCGGGTCATGCACCGGGTAGGCAAACACCGATGCGCCAAACTGGCGGGCAGCGGCGGCGCGCAGGCCGCCGGAAAATTCATGACCGTAAAAGATGTTGTCGCCCAGTATGAGAGCGCTCGACGCATCGCCAATGAAATCCGCGCCAATGACAAAGGCTTGCGCCAGACCGTCGGGCGAGGGCTGAACAGCATAGCTGAGGCGAATCCCCCACTGGCTGCCATCACCCAGCAACTGCTCGAAACGCGGGGTATCCTGCGGGGTGGAAATGATGAGGATGTCCCGGATTCCTGCCAGCATCAGCGTGGTAAGCGGGTAGTAGATCATGGGCTTGTCGTACACCGGCAGCAATTGCTTGGAGACTGCCTGGGTCACGGGATATAGCCGGGTACCCGAGCCACCGGCCAGAATAATGCCCCTGCGCGCGCTCATGCCTGGCTCCCATACTGTTTTTCTACCCAGTGGCGGTATTCACCGGTGGCGATGTTGGCTACCCAGTCCGGGTTGGCCAGGTACCAGGCAACGGTTTTGCGAATCCCGGTCTCAAACGTCTCTTGTGGGCGCCAGCCCAGTTCGCGCTCGATTTTGTGTGCGTCAATGGCGTAGCGACGATCGTGGCCAGCGCGGTCCTTGACATGAGTGATGAGTTTTTCGTGAGGGGTGACGGGTGAACCCGGGTGCAGCGCGTCAAGCATGGCGCAGATGGTCCTGACCACATCGATATTGGTTTTTTCGTTGCAACCGCCAATGTTGTATACCTCGCCTGCCTTGCCCGCAGCCAGCACGGCGCGGATGGCGCTGCAATGGTCGCCGACATAGAGCCAGTCACGTACGTTAAGTCCGTCGCCGTAGATCGGCAGCGGCTTGCCGGCCACGGCGTTCATCATTACCAGCGGGATGAGCTTTTCCGGGAACTGGTAAGGGCCGTAGTTGTTGGAGCAGTTGGTAGTGAGTACCGGCAGGCCATAGGTGTGGTGATAGGCGCGCACCAGGTGGTCGGAGGCAGCCTTGGAGGCTGAGTACGGGCTGTTGGGTGCGTAGGCAGTGGTCTCGGTAAACGCGGCATCATCCGGGCCGAGGGAACCGTACACCTCATCGGTGGAAACGTGCAGGAAGCGGAATCCGGCCTTCTCTATCCCTTCCATTGCATTCCAGTAAGCGCGGATTTCTTCCAGAAAATGAAAAGTGCCGACGACATTGGTCTGGATAAAGTCTTCGGGGCCGTGAATGGAACGATCGACGTGGCTTTCGGCAGCGAAGTTGATGACGGCGCGCGGGCGATGTTCAGCCAAAAGACCTGCGACCAGGGCGCGGTCGCCAATATCGCCTTGTACGAAGAGATGGCGGGCGTCACCCTCAATACTGGCGAGGTTGTGCAGATTGCCTGCGTAGGTGAGTTTGTCGAGGTTGATGACCGGCTCGTCACTGTCCGCCAGCCAGTCCAATACGAAATTGGCACCGATAAAACCGGCGCCGCCTGTGATTAGAATCATGGTAATTCCCGGTTAGCCTTTATTTTATTTCCGGTAACCCGGCGCTGCTGTTCTCGGACTTGCTCTGTGCCATTTTTCCCAGGCTTGCAAACTCGCCCACGTATTCAATTTTTGCTGCATCCCGTAGTTTTTTGATGTCAGCGGTAGCGGCATCGCGCTGTCGGGCAGCAGCCAGATAACGCTCTATTAACGTATGTGCCTTGTCCAGGGTAATCGGTGCAGATTGTATGGAGGCTATCTGCAATACCGTTATCCCGTTTGCAGACGGTAACGCCAGTAACTGTCCATTCTGCATGGTCTGCATGCGCGGCACAATATTCATCGGCAATTGTTCCGCCGCTTTGACCTCGCTGCCGGTGCGGAACGGTATATTCTGGCTGCGCAGCCAGTCCACAAATTCGCCCAGATCATGGCTGGTTTCAAGACGTGAGTTGAGTACAGCGATCCTGCCCCTGGGGGCAGCGATGGCCAGTTCCTGCAGATTATAGATACGACGCTGACTGAACAACTCGGGATGCCGGTTGTAGTAGGCGCTGATGTCGGCTTCGCCAGGTTTGGCGACAGCCTGCCCTGCACGCTGCAGATAGGCCTGGGACAGCACCTGATTGCGCGCGGCCTCCAGCAATTGCTGCACAGCAGGATCCTGATCCAGTTTTTGCGCAGTGGCTTTTTGTACCAGAAGTTGCTGATCCACCAGTGCCTGCACTACCCGATTTTTAGCTTGCGGCGTCAAATCCTGCGCTGCCAGATTCAAGCGCGACAAAGCCAGGTCCAGTTGTGCGCTAGTGATAGCGGTGCCGTTCACGCTGGCGATGGCTGAAGACGGCGTTTCCTGCTTGCTGCAGCCGGCGAGCACTGTTCCCAGCATCAGCGCCAGCGCCATTTTATGCATAATTGGATGCGATTTCATTCGCGTCATCTCCCTGGATTGGTATGTTTTGTGGCGTGATTCACACCCGGCGTCCATGATGCGCAATTATCGGTTGTCTCACAAGCGCCAGCATATAGACCGATCGAATTACAAAATGGTAATGATCATAAAAAATGGCTGGCTCCCTCCGCGGAAGCCAGCCACTACGCTACCTGATGCAGGCAATATCTAGACAGCCACGTTTTCCTCGTCGAATGCCAGTTTGATTGTGCCGGATTCGTCCACGTCCACGACCACATGCCCACCATTGGCCAGACGGCCAAACAGCAATTCATCTGCCAGTGCCCGCCGTATCTCGTCCTGGATCAGCCTGGCCATGGGGCGCGCGCCCATCAACGGGTCAAAACCGCGTTTGCCGAGATGCGCCTTGAGCGCGTCGGTAAACGTCGCCTCGACCTTTTTCTCGTGCAACTGGTCTTCCAGCTGCATCAGGAATTTATCCACCACGCGCAGGATGACCTCCTGGGACAAGGGTGCAAACGAGATCATCGCATCCAGCCGGTTACGGAACTCCGGCGTAAAGGCACGCTTGATGTCTGCCATTTCGTCGCCGGTTTGTTTTTCCTGGGTAAAGCCAATCCCCGACTTGTTGAGCGACTCCGCACCCGCGTTGGTGGTCATCACAATCACCACGTTACGGAAATCCGCCTTGCGCCCATTGTTGTCGGTAAGCGTGCCGTGATCCATCACCTGCAACAGTACGTTGAATACGTCCGGATGCGCTTTTTCGATTTCATCCAGCAACAGCACCGCGTAGGGATGTTTGGTAATCGCCTCGGTCAACAGGCCGCCCTGGTCAAACCCGACATAGCCCGGTGGGGCGCCTATCAACCGCGACACGGCATGGCGTTCCATGTATTCGGACATATCGAAGCGAATCAGCTCAATGCCCATGATATACGCGAGCTGCCGCGCCACTTCGGTCTTGCCCACCCCGGTGGGGCCAGAAAACAGGAAAGAGCCGATAGGCTTCTGCGGATTACCCAAGCCACTGCGTGCCATCTTGATCGCGGCTGCCAGCGCATTGATCGCCTTATCCTGGCCAAACACCACGGTCTTGAGGTCGCGATCGAGGTTTTTCAGCGCATCGCGATCATCACTATTCACATTCTTGGACGGAATCCGCGCAATCTTGGCGATGATCTCTTCGATCTCGCGCTTGCTGATGACTTTCTTCTGCCGTGATTTGGGCAGGATACGCTGCGCCGCGCCGGCTTCGTCGATCACGTCGATTGCCTTGTCCGGCAGGTGGCGGTCATTGATATAGCGTGCCGACAGCTCGGCTGCCGTGGTGAGCGCCGACGCGGTGTATTTGATGCCGTGATGCGCTTCAAAACGCGATTTCAAGCCACGCAGAATCTCGACTGTCTCCTCGATCGAAGGCTCATTCACATCAATCTTCTGAAACCGCCGCGATAACGCATGGTCTTTTTCGAAAATGCCACGATACTCGTTGTAAGTGGTTGCACCGATGCATTTGAGTTGCCCGGATGAAAGCGCCGGTTTAAGCAGGTTGGAAGCATCCAGGGTGCCGCCCGAAGCCGCACCTGCACCAATCAGCGTGTGAATTTCGTCTATGAACAAAATTGCCTGCGGGTTTTCGTATAGCTGCTTCAACACGGCCTTGAGGCGCTGCTCAAAATCACCACGGTATTTGGTGCCTGCCAACAGGGCTCCCATGTCCAGCGAATACACCGTGCTGTCCGATAGAATATCCGGTACCACACCCTCAACGATACGCCGCGCCAGACCTTCAGCGATGGCCGTTTTGCCGACTCCGGCTTCACCCACCAGCAGCGGGTTGTTCTTGCGCCGCCGGCACAATGTCTGGATGACACGCTCCAGTTCCAGCGCACGGCCAATAAGCGGGTCTATTTTTCCCGCCAGCGCCTGCACATTCAGGTTCTGCGTATAGGTTTCCAGCGCCGTTGCGGGTGCAGCTTCCTCACCCGCCTCAGGGGTCGCCTCCGGGCGCGCAGTGCTGCCCTGCGGGACTTTGCTCACCCCATGGGAAATGAAATTCACCACGTCCAGACGCGATACACCCTGCTGATTCAGGAAATACACCGCGTGGGAATCCTTCTCGCCAAAAATGGCCACCAGCACGTTCGCGCCGGTCACTTCTTTTTTGCCTGACGACTGCACATGCAAAATGGCGCGTTGAATCACGCGCTGAAATCCGAGCGTAGGCTGGGTATCGACTTCCTCGCTTCCTGCAACGGTAGGGGTGTGTTCGGTGATGAAATCAGCCAGTCCACGACGCAGTTCGTCGGTATTGGTGCCGCACGCGCGCAACACCTCGGCGGCGGACGGATTATCCAGCATCGCCAGCAACAGGTGCTCGACCGTAATAAACTCATGGCGCTTTTGTCTCGCCTCCATGAACGCCATATGTAAACTAACTTCCAATTCCTGCGCAATCATCTAATTTTCCTCCATCACGCATTGCAGCGGATGCTGGTGCTGGCGGGCGAACCCGACCACTTGCTCTACCTTGGTTGCCGCCACATCGCGGGGGAATACGCCACACACTCCCATGCCGTCCCTATGTACTTTGAGCATGATTTGCGTAGCCTGTTCACGGCTCTTGTAAAAAAAGTTCTGAATCACAAGAACCACAAAATCCATGGGCGTGTAGTCGTCATTCAACAACATTACCTTGTACAAAGGCGGCGGCTTGAGTTTTGTTTCGCTTGCTTCCAGAACGGTGTCATCGCGGTGCTTGGTTGCCATGGCGCTGGATAGTTTCCGGATAGTCAGGAACCATTTTGACGACTGGCGCGAAATTTTCAAGTGCTATCCAGTAAAAAAAAATTTGCCTGGCACTTGCCAAATCAACTAGGCAGGCGTAAAAAGCAATCTGGAGTTTGGCGTCAAGGTTCCTGCAAGTCTGGTAATGCCGTTGTGCGATCTTGATTTTCAAACAGCCCCTGGCCGTTTTGGCCTGTTTTTATCAAGGAAGTAGCAATGGCAACTGGCACTGTAAAGTGGTTCAACGATTCTAAAGGCTTTGGGTTTATTACCCCGGACGACGGTAGTGAAGATCTTTTCGCTCACTTCTCCGCCATCAACATGGGTGGTTTCAAAACCCTGAAGGAAGGTCAAAAAGTCCAATTCGAGGTCTCCCAGGGCCCGAAAGGCAAACAGGCTTCGAACATTCAGCCTGCATAAATCGGCTCACCGATTACTTGAAAACGCGGAACCTGGTTCCGCGTTTTTTTCGCCTCGGGTTTTGAGTTCTGGCTTTTCAAAAGCGCTGTGCTACATTGAAATATCTGTAACACATCCATTTAAGGAGAGCAGAATATGCACATTCAACACCAGCCTGATGGTTCCCTGGTCCTGGACATGAGCCAGAAACAGGCGCGAGAACTCGCAAAAACCGTCATCCAGCACGCCGAAGATGCGCATACCGCACTGCTGGATTTTGCCTACCTGCTGAACGAAGCGCATTACGATGCGGAGAACCAGTTCCGGCAACCACCTCATGCCTGGGAACCGGGTGCGCATCAGCCTGGTACAGAATAGGGGGCTACCATGAACATTTCTGCACTCGACAAACAGACTGCCCAGATCAGTGTGTTGCCGACCGAGGCCGCGCATTTGCTGGAGGGCCTCGAAGCCATGCGCGACGAACTCGGTGAAATCGCCGACGAGTTAATCAGTCTGCTGCGCGGCAGTGGCATTGAACCACCACCCAAACCCGATCATGTTCGCACTGAATACGCCGGGCCTGAGTAAACTTACATGCGCGCGATCATCGCCTCGCCAAATGCTGAACAGGACACCTGCGTCGCACCGTCCATAAGGCGAGCAAAATCGTAGGTTACCGTTTTAGCAGCAATTGCACGCTGCATACTCGCGGTGATGATATCAGCGGCTTCCAGCCAGCCAAGGTGGCGCAGCATCATCTCCGCGGAAAGAATAATCGAGCCCGGGTTGACGTAATCCTGGCCCGCATATTTTGGTGCAGTACCATGCGTGGCCTCAAACATGGCGACTGAATCGGACAGATTGGCGCCCGGCGCAATACCAATGCCGCCCACCTCAGCCGCGAGCGCGTCCGAGATGTAATCGCCGTTCAGATTAAGCGTCGCAATCACGTCGTACTCATCCGGGCGCAACAGTATCTGTTGCAAGAATGCATCGGCAATCACATCCTTGATGACAATGCCGTTGGGCAGCCTGCACCACGGGCCGCCATCCATCTCCACCGCGCCAAATTCACGCCTGGCCAGTTCATAGCCCCATTTTTTGAAGCCTCCCTCGGTGAACTTCATGATATTGCCCTTGTGTACCAAGGTAACGGACTCGCGGCCATTGTCGATGGCATACTGAATCGCCTTGCGGATCAGGCGCTCGCTGCCCTGCACGGAAACCGGTTTGATACCAATGGCGGAAGTTTCCGGGAAGCGAATTTTCTTCACCCCCATTTCGCCTTGCAGGAAGGCGATGATCTTTTTCACCTCATCCGAGCCAGCTTGCCACTCCACCCCGGCGTAAATATCCTCGGTATTTTCGCGGAAGATCACCATATCCACTTTTTCCGGCGCTTTCACCGGACTGGGCACGCCATCGAAGTAACGCACCGGGCGCAGGCAGACATACAAATCCAGCAACTGGCGCAACGCCACATTCAGGGAGCGCATGCCACCGGAAGTCGGCGTGGTCAACGGCCCCTTGATGGAGACGACGTATTCGCGCACGGCGGCTACCGTTTCATCGGGTAACCAGTTGTCGCCACCATAGACCTTGACGGCCTTTTCGCCCGCATACACTTCCATCCAGGCGATACTGCGCCTGCCACCATATGCCTTGGCCACGGCCGCATCCACCACGCGACGCATCACCGGGGTGATATCCACACCGGTACCATCACCTTCGATGAAGGGAATAACCGGCTGATCGGGGACATTGAGCGAAGCGTCAGTGTTGATCGTGATTTTTTCGCCGTGAGTCGGCAGCTGTATATGCTGGTACAT
Protein sequences of DBSCAN-SWA_7 >NZ_AP021884|2294456:2305315|2294456_2295875_-|WP_147074479.1|DBSCAN-SWA MRIHPVILSGGSGTRLWPLSRAALPKQLLPLVSERTMLQETVLRLSGIADITPPTLVCNHEHRFMVAEQMRAIEVTPETIFLEPIGRNTAPAAAVAALALMARDAEALMLLLPADHLIADVDAFERAVAQAVDAAQASQLVTFGIVPQAPETGYGYIQRGPADSHLDGVFAVGRFVEKPSREIAQGYLQSGDYFWNSGMFLFKAAAFVNELRGYRPDILAASQAALDSSQRDLDFVRLDTQAFTACPSESIDYAVMERTRNAVVVPADIGWSDIGSWSALWEIGAKDRAGNVMRGDIYNDGASNNLIRAESRMVAVIGVSDLVIVETSDAVMVAHKDRVQDVKKVVEHLKQHRRTEHLNHTRVFRPWGWYEGIDAGERFQVKRIMVKPGEKLSLQMHHHRAEHWIVVSGTARVTRNDEVLLLTENQSTYIPLGSTHRLENAGKIALHMIEVQSGAYLGEDDIVRFEDIYQRS >NZ_AP021884|2294456:2305315|2302757_2303066_-|WP_124705779.1|protease|DBSCAN-SWA MATKHRDDTVLEASETKLKPPPLYKVMLLNDDYTPMDFVVLVIQNFFYKSREQATQIMLKVHRDGMGVCGVFPRDVAATKVEQVVGFARQHQHPLQCVMEEN >NZ_AP021884|2294456:2305315|2296911_2297454_-|WP_147074454.1|DBSCAN-SWA MNIIPTEIPDVLVLEPKVFGDARGFFYESYNRRAMTGAGIPDDFVQDNHSRSAKGVLRGLHYQIQNTQGKLVRVISGSVYDVAVDLRKSSPTFGKWVGMELSAENKRMAWIPKGFAHGFLVISDSAEFLYKTTDYWAPQFERSLLWNDPALGITWPLHGEPALAAKDAAGLPLAQCEVFA >NZ_AP021884|2294456:2305315|2303300_2303504_+|WP_124705778.1|DBSCAN-SWA MATGTVKWFNDSKGFGFITPDDGSEDLFAHFSAINMGGFKTLKEGQKVQFEVSQGPKGKQASNIQPA >NZ_AP021884|2294456:2305315|2303639_2303861_+|WP_147074449.1|DBSCAN-SWA MHIQHQPDGSLVLDMSQKQARELAKTVIQHAEDAHTALLDFAYLLNEAHYDAENQFRQPPHAWEPGAHQPGTE >NZ_AP021884|2294456:2305315|2300501_2302757_-|WP_147074450.1|protease|DBSCAN-SWA MIAQELEVSLHMAFMEARQKRHEFITVEHLLLAMLDNPSAAEVLRACGTNTDELRRGLADFITEHTPTVAGSEEVDTQPTLGFQRVIQRAILHVQSSGKKEVTGANVLVAIFGEKDSHAVYFLNQQGVSRLDVVNFISHGVSKVPQGSTARPEATPEAGEEAAPATALETYTQNLNVQALAGKIDPLIGRALELERVIQTLCRRRKNNPLLVGEAGVGKTAIAEGLARRIVEGVVPDILSDSTVYSLDMGALLAGTKYRGDFEQRLKAVLKQLYENPQAILFIDEIHTLIGAGAASGGTLDASNLLKPALSSGQLKCIGATTYNEYRGIFEKDHALSRRFQKIDVNEPSIEETVEILRGLKSRFEAHHGIKYTASALTTAAELSARYINDRHLPDKAIDVIDEAGAAQRILPKSRQKKVISKREIEEIIAKIARIPSKNVNSDDRDALKNLDRDLKTVVFGQDKAINALAAAIKMARSGLGNPQKPIGSFLFSGPTGVGKTEVARQLAYIMGIELIRFDMSEYMERHAVSRLIGAPPGYVGFDQGGLLTEAITKHPYAVLLLDEIEKAHPDVFNVLLQVMDHGTLTDNNGRKADFRNVVIVMTTNAGAESLNKSGIGFTQEKQTGDEMADIKRAFTPEFRNRLDAMISFAPLSQEVILRVVDKFLMQLEDQLHEKKVEATFTDALKAHLGKRGFDPLMGARPMARLIQDEIRRALADELLFGRLANGGHVVVDVDESGTIKLAFDEENVAV >NZ_AP021884|2294456:2305315|2304076_2305315_-|WP_147074447.1|DBSCAN-SWA MYQHIQLPTHGEKITINTDASLNVPDQPVIPFIEGDGTGVDITPVMRRVVDAAVAKAYGGRRSIAWMEVYAGEKAVKVYGGDNWLPDETVAAVREYVVSIKGPLTTPTSGGMRSLNVALRQLLDLYVCLRPVRYFDGVPSPVKAPEKVDMVIFRENTEDIYAGVEWQAGSDEVKKIIAFLQGEMGVKKIRFPETSAIGIKPVSVQGSERLIRKAIQYAIDNGRESVTLVHKGNIMKFTEGGFKKWGYELARREFGAVEMDGGPWCRLPNGIVIKDVIADAFLQQILLRPDEYDVIATLNLNGDYISDALAAEVGGIGIAPGANLSDSVAMFEATHGTAPKYAGQDYVNPGSIILSAEMMLRHLGWLEAADIITASMQRAIAAKTVTYDFARLMDGATQVSCSAFGEAMIARM >NZ_AP021884|2294456:2305315|2297456_2298341_-|WP_147074453.1|DBSCAN-SWA MSARRGIILAGGSGTRLYPVTQAVSKQLLPVYDKPMIYYPLTTLMLAGIRDILIISTPQDTPRFEQLLGDGSQWGIRLSYAVQPSPDGLAQAFVIGADFIGDASSALILGDNIFYGHEFSGGLRAAAARQFGASVFAYPVHDPERYGVVEFDAQGNAISLEEKPAQPKSRYAVTGLYFYDNQVIDIARKLKPSPRGELEITDINRHYLEMGQLQVEVMGRGHAWLDTGTHESLLEASLFIQTIEKRQGLKISCPEEIAYRQGYIDAAQVGRLAAPLQKNAYGQYLLAMLNERLF >NZ_AP021884|2294456:2305315|2299421_2300312_-|WP_161984264.1|DBSCAN-SWA MKSHPIMHKMALALMLGTVLAGCSKQETPSSAIASVNGTAITSAQLDLALSRLNLAAQDLTPQAKNRVVQALVDQQLLVQKATAQKLDQDPAVQQLLEAARNQVLSQAYLQRAGQAVAKPGEADISAYYNRHPELFSQRRIYNLQELAIAAPRGRIAVLNSRLETSHDLGEFVDWLRSQNIPFRTGSEVKAAEQLPMNIVPRMQTMQNGQLLALPSANGITVLQIASIQSAPITLDKAHTLIERYLAAARQRDAATADIKKLRDAAKIEYVGEFASLGKMAQSKSENSSAGLPEIK >NZ_AP021884|2294456:2305315|2296030_2296915_-|WP_147074455.1|DBSCAN-SWA MNILLTGVNGQVGWELRRTLATLGKVSAPTRSQLDLTDPGAIRSLIRSLRPNLIVNPAAHTAVDKAESEPELARAVNAIAPAIMAEEVAKLGAAMIHYSTDYVFDGNKPGAYTEDDTTHPLGVYGQTKLEGENAIRAAAIPHLILRTSWVYGLRAGNFLLTMQRLFKEREQLRIVADQYGAPTWSRMIAEATAQILAQQPFGQGKEALHGTYHLSSRGRASWYQFAQAILDQTTQAGAKRPELIPIPSTDYPTPARRPANSVLAGDKLLRSFGLLLPDWDAALGLCLDKPEKET >NZ_AP021884|2294456:2305315|2303870_2304074_+|WP_147074448.1|DBSCAN-SWA MNISALDKQTAQISVLPTEAAHLLEGLEAMRDELGEIADELISLLRGSGIEPPPKPDHVRTEYAGPE >NZ_AP021884|2294456:2305315|2298337_2299402_-|WP_147074452.1|DBSCAN-SWA MILITGGAGFIGANFVLDWLADSDEPVINLDKLTYAGNLHNLASIEGDARHLFVQGDIGDRALVAGLLAEHRPRAVINFAAESHVDRSIHGPEDFIQTNVVGTFHFLEEIRAYWNAMEGIEKAGFRFLHVSTDEVYGSLGPDDAAFTETTAYAPNSPYSASKAASDHLVRAYHHTYGLPVLTTNCSNNYGPYQFPEKLIPLVMMNAVAGKPLPIYGDGLNVRDWLYVGDHCSAIRAVLAAGKAGEVYNIGGCNEKTNIDVVRTICAMLDALHPGSPVTPHEKLITHVKDRAGHDRRYAIDAHKIERELGWRPQETFETGIRKTVAWYLANPDWVANIATGEYRHWVEKQYGSQA |
12 | Escherichia_phage(33.33%) | protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|