Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_AP021884 | Sulfuriferula plumbiphila strain Gro7 | 4 crisprs | csa3,cas3,cas5,cas6e,cas2,DEDDh,DinG,WYL,cas8c,cas7,cas4,cas1 | 0 | 4 | 7 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021884_1 | 1148298-1148383 | Unclear |
I-E
Consensus repeat of NZ_AP021884_1
|
1 spacer
spacers of NZ_AP021884_1
>1.1|1148323|36|NZ_AP021884|CRISPRCasFinder ACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAG |
cas2,cas6e,cas5 |
CRISPR arrays and Neighbor proteins around NZ_AP021884_1
The CRISPR arrays of NZ_AP021884_1 >merge|NZ_AP021884|1|1148298-1148383|CRISPRCasFinder GTGTTCCCCGCACCCGCGGGGATGAACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAGGTGTTCCCCGCACCCGCGGGATGAG >NZ_AP021884|1|1|1148298-1148383|CRISPRCasFinder GTGTTCCCCGCACCCGCGGGGATGA ACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAG GTGTTCCCCGCACCCGCGGGATGAG
>NZ_AP021884.1|WP_147070477.1|1147906_1148203_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MLVIVLENVPPRLRGRLAIWLLEIRAGVYVGNYSDKVRDHIWHQVEVGIGEGNAVMAWRTSSEAGFDFVTLGKNRRIPVELDGAKLVSFLPQTDTDAL >NZ_AP021884.1|WP_147070479.1|1146692_1146992_+|type-II-toxin-antitoxin-system-RelE/ParE-family-toxin MRYQVRFASAAADDLQRLFDFLAEQDLAAAERARAVISQAIEVLQIFPFSCRKASPENPFLRELVISFGSYGYVALFEVEDAESVTVLAVRHQREDDYH >NZ_AP021884.1|WP_147070480.1|1146414_1146696_+|prevent-host-death-protein MKNATLPPLRVESELRAAAESVLQEGETLSGFVLEAVRLNIARREAQREFITRGLVAREEAKLSGHYVSSDEMLKRLDASLAKARAKQAVGNR >NZ_AP021884.1|WP_147070482.1|1145722_1146346_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MFLSRVEIPWDAARNPYNLHRQLWHLFPGEDRESRSSDDETRQGFLFRIEENATGRPARLLVQSRRAPTRANGLLLVGTREITPCPSAGQRLAFVLTANPVKTIVDAQRDAKPGKQSEKCRVPFIKEEEQRQWLLRKLGEAGEVEAVSVLPHAPVYFHKGSRAGKLVTATFEGVLRVRDPDRLAALLANGIGPAKAFGCGLLLVRRI >NZ_AP021884.1|WP_147070484.1|1145195_1145732_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MFSREWPLLAESDIKPKTGKLGWTNSPAFSSCTRSWTARRAPITRTDLKARLECSSASLTSAHRRGAYLFDAAFTVAVGSKPGASVTLTQLAAALRQPLYTPSLGRRSCPLARPLLEGELEAEDALAALAKTAPVDGLVYSETQQSDQPLRLRDVPLHGHKRQFGTRLVYLHKDPTCS >NZ_AP021884.1|WP_147070486.1|1142295_1145139_-|aconitate-hydratase-AcnA 
MSTAHNLFNTLSEFTLGNGTPGRFYSLSALEAVGIGKISRLPVSIRIVLEAVLRNCDGRKITEQHIRELANWQPNGPRTEEIPFVVARILLQDFTGVPLLADLAAMRSAAAQAGKNPKVIEPLVPVDMVVDHSVQVDVFNQPDALQKNMELEFIRNRERYQFLKWGMQAFDTFKVVPPGIGIVHQVNLEYLARGVMEKDGVHYPDTLVGTDSHTTMINGLGIVAWGVGGIEAEAGMLGQPVYFLTPDVVGVHLKGQIREGVTATDVVLTVTEMLRKAKVVGKFVEFFGAGAAALSLPDRATIANMAPEYGATMGFFPVDEASCAYYAATGRSAEQVDTIRNYFMAQGLFGIPQAGDCDYSQELEIDLGSVVPSVAGPRRPQDRIELGHVKQAFAGLFAKPVAEGGYGKAAATLAQRVALAPAPAGTDIAGGGVQNSDTLPAGGTDPAVVIEREMVDNRPTPDHLASNAVYTAAQSGTLGHGDVVIAAITSCTNTSNPGVMLAAGLLAKKALEKGLTVPAHVKTSLGPGSRVVTEYLKAAGLLDALGEMGFKLVGYGCTTCIGNSGPLPAAIESAITGNDLIAASVLSGNRNFEARVHQNVKANFLMSPPLVVAYAIAGSMNTDLASEPLGTGRDGAPVYLKDIWPSLDEVAAVMATATNPDTYRKLYADFSADNPLWAAVPAPAGAVYDWDGASTYIRQPPFFDGAAGDSGVIRGARALAVFGDSVTTDHISPAGSIKPASPAGKFLLEHGVDRADFNSYGARRGNHEVMMRGTFANVRIRNLMLPGSEGGVTRHQPDGAEMAIYDAAMQYQAAGTPLMIFAGEEYGTGSSRDWAAKGTRLLGVKAVVAKSFERIHRANLVGMGVLPCQFRDGMGADSLKLDGSETFDLLGLEHGITPQQDITLVIHRADGSADAVAVKLRIDTPIEVDYYQSGGILPFVLAQLLAD >NZ_AP021884.1|WP_147070488.1|1141914_1142286_-|DUF202-domain-containing-protein MSDLNDPRVFFAAERTLLAWNRTCLTLMAFGFVVERFGLFLHMLAPQTPQHLERGISFWVGLGFILLGSLMAVLAVIQYRRVLRTLKPVEIPEGYWVNMAALSTLLLAVLGIVLSAYLTMGLK >NZ_AP021884.1|WP_147070490.1|1139808_1141854_-|methionine--tRNA-ligase MTRKILVTSALPYANGAIHLGHLVEYIQTDIWVRFQKMHGHECYYVCADDTHGTPIMLRAEKEGITPEQLIARVHGEHLRDFTGFHVGFDSYHSTNSGENRELSGTVYLKLREAGLIEQKTIEQYYDPVREMFLPDRFIKGQCPKCGAQDQYGDGCEVCGATYTPTDLINPVSAISGSTPVRRESEHYFFRLGACEAFLREWTRSGALQQEAANKLDEWFAAGLQNWDISRDAPYFGFEIPDAPGKYFYVWLDAPIGYMASFKKLAAEKNLDFDAWWQNDSGAELYHFIGKDILYFHALFWPAMLKNAGYRTPSGVFAHGFLTVNGAKMSKSRGTFITAESYLASGMDPEWLRYYYAAKINGSMEDLDLNLADFIARVNSDLVGKYVNIASRTAGFIARRFDGKLAARLPTSELLAEVQHAATLIGECYETREYGKALREIMRLTDLANQYVNDNKPWELAKQEGSEALLHEVCSVSVNLFRLLTLYLKPVLPRLATEVETFLNIAALAWVDAGTLLTSHSINAYSHLMTRVEQKQVDALVAANQQSLAASADAHSPARHAEAQNHVIAPIADTITADDFARIDLRVAKIVNAEHVEGADKLIRLTLDIGEGKTRNVFAGIKSAYDPEKLIGRMTVMVANLAPRKMKFGVSEGMVLAASGETAGLYILSPDDGAVPGMRVK >NZ_AP021884.1|WP_147070492.1|1138856_1139744_+|LysR-family-transcriptional-regulator 
MDIEQARTFLHVVAIGNFLGAAEKLHVTQSTVSARIQNLERKLGAKLFSRGKQGAALTAAGQRFVRHAQTLVRTADIAKQDVGLPDGYSGGLTVSGRIALWEGFLSRWVAWMRQAAPAISLRLEIGFEQDIMHGLVQNTLDIGLMYTPEARPGLGLERLFDETLVLVTTDRMRPWPDPGYVHVDWGTEFFHQFSLNFPDHPPPALSANVGWLGIQQLLTSGGSAYFPLRMVRTLLAKKRLHRVPGTPHFSVTAHMVYPLSRNDDFLQQALAGLRLLGREERRGQISMDTDNSPNP >NZ_AP021884.1|WP_147070494.1|1137963_1138608_-|maleylacetoacetate-isomerase MQLFSFFSSSTAFRVRIALALKGADYEYQAVNLRAGEQHQQAFLDRNPSGNVPALVDGDFNLGQSLAILDYLDSRYPEPRLIPADTIQRARVLELVNVIACDIHPVNNLRVQLYLKNILGVTEAQKNAWYRHWVAQGLDVVERLLARQEDTPYCFGTHPTLADCCLAPQVWSAARAGCDIAAYPRIDRIYRHCMAQPAFIQAAPEQQADAPQGG >NZ_AP021884.1|WP_147070475.1|1148555_1149455_+|enoyl-CoA-hydratase/isomerase-family-protein MNAVVEFQQFSNASLEQVRIRFDEEYGVMWSFMRPEPRPCFTRTTLQDLLQHHTYLESMKGRVVSNGNFQQTNYLILASDLQGVFNLGGDLAAFGEAIRAQTRKELLSYAKLCIDNVWTFYNLQAPITTISMVQGQAMGGGFEAALSAHVMIAEKSALMGLPEVLFNLFPGMGALSFLSRKIGMRAAEAMVRSGRVYTATELHEMGVVDVLAEDGQGEKTLYDWIRKNHRSLNSFQAIQRARQRVNPLTVEELYEITEIWVDAALRLSERDLRIMERLVRAQNRKVTEPEPVVAEQASA >NZ_AP021884.1|WP_147070472.1|1149459_1151595_-|response-regulator MLDKMKIWFTDRVAKCDRVELEQSLIRLGIGLAILVYLLYRYLTHTTLSHNDIVAFSILSVFLFLTLVLIGSILYSSKPSVVRRLAGAWVDQGGTTLFMAFTGEVGVMVVGVYLWVIFGNGFRFGRKYLIHAQVLSIVGFAITTQVNPYWDEHEAISYSVMLMLLALPIYVSALIRRMNEARQKAEEANAAKTRFVANMSHEIRTPLSGIIGISTLLKATPLNSEQQDLLGTLNSSSRLLVSLLNNVLDFAKIEDGKLAIEHTDFSVNSLLEETVKIFRSQAEAKSIRLDTHIAAAAGTLRGDPHRLQQVLANLVGNAVKFTERGSVTLSLSILGENEHHRNMRFEVADTGVGIPTSAQGKIFESFTQADISTTRRFGGSGLGLTITRHLVEAMGGRLSFESAEGLGSRFWFDLPLEKAVQAQPGSAEIVPLPATRDAGLENTLRILVCEDDATNQKILLRLLELAGHHVSLSANGEELLDQLEQSSFDLVIADLNMAGLSGTDALKLYRFTRADDTRTRFILFTADATLSARQAAKEAGFDAFLSKPVDASTLFGTIANLLGMPSASAEHWLNTVMGGSRSSPPASAETRAVLDAATLRELEILGAGDALFVQRLLRNYLRDSGELLDRIEHAVQQKQYGALRDHCHALKGNSLSIGARGVFGRAETIDRAGPGELRFRGSAMVGLLRTDYAAARAAIEDYLSRRQTAAR >NZ_AP021884.1|WP_147070471.1|1151614_1152046_-|response-regulator MSVRDIRSAPTYRQTVLIIDDQPMVLAIHTAVLKSLSMDLRIVSMTDPKAALEWLRQKPADLIVTDYRMHQMDGIHFVNAVRDSSIEPMRPIIVVTALKDEKIHQQLLAAGVSACLIKPARAAQLSKIARTLLEQSRRQYTTQ 
>NZ_AP021884.1|WP_147070469.1|1152139_1153207_-|response-regulator MTNFNLPDTSAVLILDDQATSRTILAQVVRSIGSGIRVQEETTPSAALAWAAAHPADLVLADYLMPDMNGVEFIGRLRQLPGYQHVPVVMVTIKQDMETRYAALDAGMTDFLTKPVDMRECLSRCRNLLTLRQQQLALEDKSRVLEDMVGQATEEIRCREKDTLMRLARAGEYRDTDTARHLLRMSRYSRVLADAIGLPEDEAELIELAAPLHDIGKIGIPDSILRKNGPLSDEELAIMRQHPKIGHDILEDSPSKYLRLGGEIALAHHERYDGSGYPFGTTGQDIPLSARIVAIADVFDALTSVRPYKSAWSIKSAMQYLLKESGRHFDPALVKAMLTLEASVEKIQEEHAEPG >NZ_AP021884.1|WP_147070467.1|1153469_1154669_+|malate-dehydrogenase MPTLKQQALDYHQFPKPGKLSVESSKPCATQHELSLAYSPGVAEPVRAIGADPELAYRYTNKGNLVAVITDGTAILGLGNLGPLAAKPVMEGKGVLFKRFANIDVFDIEVNAPSVQAFIDTVVNIAPTFGGINLEDIAAPHCFEIEKALSERLDIPVFHDDQHGTAVIICAGLINALHVQGKKLADARIVCLGAGAAGNASLRLLLAMGADKSRLLVVDKVGVLHTGMIDLPPHHAFFAADTDARTLADAMQGADAFIGVSAANLVTPAMIKSMADKPVVFALANPDPEIAPHDVHAARDDAIIATGRSDYPNQVNNILGFPFIFRGALDARAKRITQKMLIAAVHALMDLAREPVPADVLAIYNLTELAFGRDYILPKPFDARLIERIPPAVMKAAKE >NZ_AP021884.1|WP_147070465.1|1154672_1155050_+|succinate-dehydrogenase,-cytochrome-b556-subunit MRHPSRPVYLNIFKIHLPLPGWMSILQRMSGAVLFLVTPLLLYLLQTSFDADGYARLREWLHIPVVKALSTLLLWGYLLHLLGGLRFLLLDIHVGTALATARKLSAATLLASALLTLVIAGIGLW >NZ_AP021884.1|WP_147070463.1|1155043_1155373_+|succinate-dehydrogenase,-hydrophobic-membrane-anchor-protein MVGGALSAWLVQRVSALLLAAYALFFPVWVALHWPLDFAVWRGLFAPLPMRIVTLLFVVALALHAWVGMRDIFMDYVQPLGLRLALHVGALLWLATCVVWAGAVLWSLP >NZ_AP021884.1|WP_147070461.1|1155369_1157133_+|succinate-dehydrogenase-flavoprotein-subunit 
MMPVKRKFDAVIVGGGGAGLRAALQLSGSGLQVAVVSKVFPTRSHTVSAQGGITAALGNVTPDNWHWHMYDTVKGSDYLGDQDAIEFMCRHAAEAVIELEHMGLPFSRLDNGRIYQRAFGGQSMNYGGEQATRTCAAADRTGHALLHTLHQQNLKAHTHFFDEYFALDLLRDADGYVLGVTALCIETGAPLVIEARATLLATGGAGRIFRYSTNAHINTGDGLGMVLRAGLALQDMEFWQFHPTGLPGSGSLITEGVRGEGGYLVNNQGERFMERYAPHAKDLAGRDVVARALALEIHAGRGCGPHGDTIHLKLDHLGAALIKDKLPGIRELALRFAGVDPIDAPIPVVPTAHYMMGGIPTDLHGQVVMPARFGPEEPVPGLYAVGECACVSVHGANRLGGNSLLDLVVFGRAAGNHIIETLRDNPFPRLLPESAAEAALARLARWNKTGAGESVAELRLALQTLMQKHCGVFRTETLMGEGIAALDILQARLDNARLADHSQVFNTARIEALELENLFAVARATLVSAHARTESRGAHAREDYPERDDGHWLKHTLYTRENDQIDTKPVRLKPLTVEPFLPKERIY >NZ_AP021884.1|WP_147070460.1|1157132_1157831_+|succinate-dehydrogenase-iron-sulfur-subunit MRFSIYRYDPEHDTKPHMQAYDVDIEPAGNMLLDALLRIKDTLDSTLTLRRSCREGVCGSDGMNINGSNGLACITPLADLRQPVEVRPLPGLPVIRDLVVDMTPFNQQYRSVEPWLNNADPAPEIERLQSPEQRAQLDGLVECIQCGCCSSACPSFWWNPDKFVGPAGLLAAYRFIADSRDQGANQRLDNLQDPYRLFRCHGIMNCVSVCPKGLNPTAAIGKIKTLLVKRGA >NZ_AP021884.1|WP_161984192.1|1157869_1158085_+|succinate-dehydrogenase-assembly-factor-2 MLELDILLLDFLEQQYPVLPSSQQIAFGALLELGDSELWDMIQTGQSAAQPEQAKIIEWLRTGKQKNESTD |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021884_2 | 1831685-1831793 | Orphan |
NA
Consensus repeat of NZ_AP021884_2
|
1 spacer
spacers of NZ_AP021884_2
>2.1|1831723|33|NZ_AP021884|CRISPRCasFinder TTGCGTTGGATACCTCATCCTCATCATTGCGCT |
CRISPR arrays and Neighbor proteins around NZ_AP021884_2
The CRISPR arrays of NZ_AP021884_2 >merge|NZ_AP021884|2|1831685-1831793|CRISPRCasFinder GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGACTTGCGTTGGATACCTCATCCTCATCATTGCGCTGGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC >NZ_AP021884|2|2|1831685-1831793|CRISPRCasFinder GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC TTGCGTTGGATACCTCATCCTCATCATTGCGCT GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC
>NZ_AP021884.1|WP_147074724.1|1830036_1830279_-|HypC/HybG/HupF-family-hydrogenase-formation-chaperone MCLALPARIVEMRKQDIGIVDLGGVRKEVSLALVDDLQVDDYVIVHVGYALSKLDPEEAERTLRIFAEMESMPGNVGVGA >NZ_AP021884.1|WP_147074723.1|1828900_1830040_-|hydrogenase-formation-protein-HypD MKYVDEFRDGELANGLASTIARAADTGRNYSFMEFCGGHTHAISRYGVTDLLPANIQMIHGPGCPVCVLPIGRIDLAIGLALDQGVILCTYGDTLRVPASDGLSLMKAKARGGDVRMIYSTADVLAIARDNPDRDVVFLAIGFETTTPPTALLIEQAKNEGIGNLSVLCNHVLTPSAITHILESPEVREYGTLPLDGFIGPSHVSTVIGTQPYEHFAREYRKPVVISGFEPLDVMQGILMLVRQVNEGRAEVENEFFRAVTRGGNRKAQTLVAKIFELRRTFEWRGLGEVPYSALQIRSEYAAFDAEQRYGLRYAPVADNKACECGAILRGVKKPTDCKIFGTVCTPETPMGSCMVSSEGACAAHYTYGRFKDVEIVAA >NZ_AP021884.1|WP_147074722.1|1827848_1828904_-|hydrogenase-expression/formation-protein-HypE MSTVKPGYTRPLDVRNGRIDLSHGGGGRAMAQLIEELFAAAFDNEYLAQGNDGAVLAMPSAGGRLVMATDAHVVSPLFFPGGDIGCLSVHGTVNDVAVMGARPLWLAASFVLEEGFPLSDLKRIVESMANAAKSAGVSVVTGDTKVVERGKGDGVFITTTGVGVLPKGLDLSGNKATPGDVILLSGTIGDHGMAIMSKRENLAFDAPIESDTAALHGLVADMLASGSGIRVLRDPTRGGLATTLNEIAKQSGVGMQLDESSIPVRPVVDAACEFLGLDPLYIANEGKLVAICAPEDAGGLLAVMRAHPLGRESAIIGTVHADPHHFVQMKTRFGGRRNVDWLSGEQLPRIC >NZ_AP021884.1|WP_147074721.1|1826161_1827847_-|hydrogenase-maturation-protein MRILFLTHSFNSLTQRLYVALTELGHEVSVEFDIADSVTEEAVALYRPDIILAPFLKRAIPASVWRHHTCLVVHPGIVGDRGPSALDWAVQNAETEWGVTVLQANAVMDGGDIWANEIFPMRLAKKSSLYRNEVTEAATRAVTTAIERYAQRDFVPCPLEKWSNVAGQERPVMWQEDRRINWLRDDTQTILRRIHAADGFPGVRDSLFDHACFLFDAHAAPDYSGAPGTILGWQGTSLVRATVDGAIRIGHVRRPESAHPFKLPALVAFAAEQASIPVLCEADGESIRYEEQDGVGYLYFDFYNGAMSTAQCRELLAAYRQACSRPTRVIVLMGGDDFWSNGIHLNLIEASEHPAEESWENIQAMDDLAEAIITATSHITVSALANNAGAGGAFLALAADHVWARPSVLLNLHYKNMGNLYGSEFWTYTLPRRVGLEKTSRIVENRLPMSARQAARLGIVDACFGTDAAMFRREVKQRATAISRSPDYDVLRKTKTEARDRDESEKPLLRYRESELSEMHRNFFGFDPSYHYARRYFVHKTLPAWTPRHLCKHRGMVQGNH >NZ_AP021884.1|WP_147074720.1|1824193_1825636_-|sigma-54-interacting-transcriptional-regulator 
MSLPTVLIVDDEIRSLEALRRTLEEDFTVFTASNVDAALEILRQEFIQIIVCDQRMPVQSGVTFLKHVRADWPDVVRIMLSGYTDTEDIIAGINEAGIFQYLLKPWQPEQLMLVLRSAADVYRLQLENQRLSLELRDSPALLAERVANKRQHVREKFSLDRVARAPDSPLNATCEMIDRIAPYDISVLITGESGTGKELLAHALHYRSGRAAQAFVTQNCGALPDALLEAELFGYKRGAFTGAYSDRVGLFQQADGGTIFLDEIGETTPSFQVKLLRVLQEGEIRPLGSPRSVQVNVRVIAATNRDLEEEVRAGRLRQDLYYRIANLTMHLPPLRERPMDIPLIAEGLLQRAMRQLNRKVRGFTPETLDCFKAYRWPGNVRELQNEILRILALTDSEWLEARLLSPKVLRAAMEESEEQQLDLLAGLDGSLKDRMEQLEARLIRETLIRHRWNKTHAAQELGLSRVGLRSKLVRYGMDKT >NZ_AP021884.1|WP_147074751.1|1823172_1824168_-|HupU-protein MNLIWLQSGGCGGCTMSLLSADVRDLFGMLKDAGINIVWHPGLSEQTGSEAIEVLEACASGDLPLDILCVEGSLLRGPNGTGRFHVLSGTGKPMIEWVRQLAEKAQYTIAVGTCATYGGVTAGGCNPTDACGMQYDGASRGGLLGVDYLSQSGLPVINIAGCPTHPGWVLETLLALAMDSFTQADLDELGRPRFYADHLVHHGCARNEYYEFKASAEKPSDQGCMMENMGCKGTQAHADCNIRPWNGGGSCTDGGYACIGCTEPGFEEPGHPFTQTPKIAGIPIGLPTDMPKAWFVALATLSKSATPKRVKENATSDHLVIVPGIRKTGVK >NZ_AP021884.1|WP_147074719.1|1821718_1823176_-|nickel-dependent-hydrogenase-large-subunit MSRLVVGPFNRVEGDLEVTLDISGGRVDRAYVDSTLYRGFEQILRGKDPMDALVFVPRICGICSVTQSVAAANALRNAMGISIPRNGQLATNLILANENLTDHFTHFYLFFMPDFARDGYSGRPWHGMAEQRFKAVTGSAAGDALPARAAFLHMMGVLAGKWPHTLTLQPGGSSRAVSSTEKIRLYAMLREFRAYLEKIMFGDKLENIVKLDSMRALEAWRDARPPDASDFRLFLEVARDLELHRIGRATDIFLSYGSYEMAGEYLFSPGVWDASKGTLSAIDPSDIVEDLSHSRMTGERDARHPYQGETQPAPDKPDAYTWCKAPRWRGQVLECGALARQVVTGHPLIRDMVEKTGGNVTTRVVARLLEISRVIPAMESWIKSLSPGEPFCVQGRMPDNAKGVGMVEAARGALGHWLVVKEGKIANYQIVAPTTWNFSPRDRDGIPGALEQALVGVPVGEHERVPLAVQHVVRSFDPCMVCTVH >NZ_AP021884.1|WP_147074718.1|1821238_1821706_-|nickel-responsive-transcriptional-regulator-NikR MERFTISLDEDLAQEFDRLILARGYSNRSEAVRDMLRAELEKSRQVRYEGTHCIAALSYVYNHHERELAERLTALQHDHHDLTVSTLHAHLDHDNCIECVVLRGKTAEVRDFAGKLIAERGVRHGNLSVITVSQEQHKHRHGLFARSHIHYKPHN >NZ_AP021884.1|WP_147074717.1|1819887_1821237_-|PAS-domain-containing-protein 
MFSKTGLLSLPDMPIEGVGEQFWMEVIRKMDEVYSDLLKYQTALEEQNNKLEESQQFIFGVLAAMSDILVVCDQTGTIEDVNQSLIELTGKTSAEWRGHPLVELFADDISRKQAELKFNGLQGQAIHDCEMQIRMANGSSMPVSVNCTARFNKKGKSVGMVITGRPVGELRRAYHALQEAHEALKRTQQQLVHSEKMASLGRLVAGVAHELNNPISFVLGNVHVLERYAGRLKEYLDAVHAGRSGIELAELREKLKIDRILGDIRPLIEGTIEGAERTRDIVDGLKRFSAIDREEECEFNLVEIIQRAVHWVTNITSESFQVEMDLPHFIPVLGSAAQIQQVIMNLVQNAVDATAEVKSPRLRIQAKIEKDKAVVEFRDNGSGILPENFPKIFDPFFTTKPVGKGTGLGLAISYGIVERHNGALFAANDAHDGGTIFVLNLPLYQSAKN >NZ_AP021884.1|WP_147074716.1|1818851_1819808_+|HTH-type-transcriptional-regulator-CysB MKIQQLRYLHEVARQGLNVSLAAEKLHTSQPGVSKQIQLLEEELGVDILVRHGKRVTGITEPGQKILAITERILREAENLKRVGADFTNETHGSLSIATTHTQARYALPSVIKTFSERYPGVQLRLHQGNPAQIVEMVLSGEADIAIATEAIALHDELVTLPCYQWNRCVIVQPDHPLLGEPTLTLERIADYSIITYDFAFAGRSQINKAFMERNLSPNVVLTAIDADVIKTYVGIGLGIGIMASMAFDPGRDQNLRAIDASHLFEPSTTRIGIRQGTYLRGYTFEFIQMFAPHLNHEAVNMAISAACRSAHQEAPKI >NZ_AP021884.1|WP_147074726.1|1832692_1833727_-|hydrogenase-nickel-incorporation-protein-HypB MCTTCGCSAGETRIEGQAMDGHSHVHADGTVHDHRHEAPAADGKMQYHAHHDENAHGHRHADGTWHSHDHGHEGEHVHEHGEDVIDYGQGPAHAHAPGLTQSQMVRIEQDILGKNNAYAGRNRNYFDEHGIFALNLVSSPGSGKTTLLVRTIETLKSRIQVAVVEGDQQTSQDAERIRSTGVRALQINTGKGCHLDANMVGHALERLHPEDDSVLMIENVGNLVCPAAYDLGEAHKVVILSVTEGEDKPLKYPDMFRAASLMLLNKTDLLPYVPFNVQLAIEYAKQVNPGLHIIQTSSTNGDGYEAWLGWIETGLARQRKKRAQTVAVLQKRIQELEAHLAARG >NZ_AP021884.1|WP_147074727.1|1833787_1834129_-|hydrogenase-maturation-nickel-metallochaperone-HypA MHEMSLAEGVLQILEDTATHHGFQQIKRVRLEIGELACVEVESLRFCLDVVVRGSVAENTMLDIVQTPGGGWCMNCSDTVPISALFSACPRCGSYQVQPTHGTEMRVLELEGV >NZ_AP021884.1|WP_147074728.1|1834121_1835225_-|nickel-dependent-hydrogenase-large-subunit MSLAGKLTFSVGWDGYRVTSVEVRSSRPQAACLLEGKTVEEAMRLVPLLFGICGKAQTVAARSAAQAAQNLCGDKQLMLRQRRLVALEAAQEHLWRLLVDWPNRLGLPAKQGLMMEWVKRISISRGDDDVLALGEAMLTMIEQDVLDESLDCWAATLERAERTPMRGLAGASLEMLRGLEPLHSGHPVFGHFLPRQAACLWGNELQPYLDGHFAVRPLWRNAPAEAGALALHHQIPLLAELLRTGHAASARYLARLVDWVSCVRLLRGEASSTELRLDACKLGKNAGLACVDTARGLLLHYIEVALGQIVRYVIVAPTEWNFHPAGPFVQTLRSLRADDAASLYQRINILILAFDPCVEYEVNLHHA 
>NZ_AP021884.1|WP_161984236.1|1835224_1835764_-|[NiFe]-hydrogenase-assembly-chaperone-HybE MKLMNPRPLENPSRMIESVFDGIARHRMAGLPILNPSLHVEAVGFRLWEGLWLGILITPWTINLMLLPADNPDYAALGLGETRRWRFPSGQYDFMGGEEPGLGSYQACSLFSPVFEFASQEDAVATARAALEQLLLEDLEAAVKREKAQWDQARFSDAPLAEQALSRRGFLRGAFLRDP >NZ_AP021884.1|WP_147074730.1|1835770_1835986_-|rubredoxin MDTFEGSYLGHDDRIDESVRLECGICWLVYDPEVGDPYWHIPPGTPFSRLPEHWTCPNCDAPRHKFMVLKD >NZ_AP021884.1|WP_161984237.1|1836001_1836784_-|hydrogenase-expression/formation-protein MNMPKGMAVFNPPSVPDDVAPELRDQAANLIRQLLAQMRAYRFGATSYPKIDLLKYDPRVVPLINDILGQGEVSIIAHQPTALRAQETVFASVWRVCYPGADGVLERDYLEVCPIPAVVAEIALAPTLKQISPPPPPAGAMNSPALLHEILDVVSTYQAGNPAHIINLTLLPLTPDDLAYLVQALGPGSVSILSRGYGNCRITSSGLANVWWVQYFNSSDQLILNTIEVVEVPEVALAAEEDFSDSIERVEEWLGTMLAA >NZ_AP021884.1|WP_147074732.1|1836869_1837361_-|hydrogenase MSEGVLDYALVAQEKVATEQNALGMLITRLCEQHQFVLVDEGNLEALTQASGDMVLLLTEDVVRSPETWDVAIVLPEILKLFGGRLKAAIADTENSKKLQARFGTTRFPAMVFLRDGEYVDVIQRMLDWDEFVAEVTGVLEKPIGRAPTIGIPVRNEVASSCH >NZ_AP021884.1|WP_147074733.1|1837357_1837666_-|HypC/HybG/HupF-family-hydrogenase-formation-chaperone MCLGIPMQVIEAEESYAVCRGRDGNLARIDTMLVGSVQSGQWLMTFLGGAREILNEQQAEQVNSALNALAAVSRGASDVDVHFADLVGREPQLPDFLRKGGQ >NZ_AP021884.1|WP_147074734.1|1837670_1838294_-|HyaD/HybD-family-hydrogenase-maturation-endopeptidase MVEGSQFDTLILGIGNVLWADEGFGVRCVEAMNATYAFPDNVRVMDGGTQGLYLLPYVEAARRLVIFDAVDYGLEPGTLKLVENAEVPKFMGAKKMSLHQTGFQEVLACADLVDHLPEEMVLIGVQPEELEDYGGSLRPRIKARIPEVLEIAVERLVGWGIPVVARGTGETMRTESGILDIQRYEMERPTEEQACRLGDIRFLATGV >NZ_AP021884.1|WP_147074735.1|1838453_1838801_-|HigA-family-addiction-module-antidote-protein MVKTFLPSGLGFGAGALDPRFFSFSLRAPIAPGRFLESRFLHPLGLSQDRLARELGISRRRVNELIRGKRAITPDTAIRLGLFFGTGPVLWLTLQQAWDIHQEWRNFRRRSKAHG |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021884_3 | 2644962-2646084 | TypeI |
I-C
Consensus repeat of NZ_AP021884_3
|
15 spacers
spacers of NZ_AP021884_3
>3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG >3.2|2645072|37|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG >3.3|2645146|37|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG >3.4|2645220|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG >3.5|2645293|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG >3.6|2645366|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG >3.7|2645438|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT >3.8|2645510|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG >3.9|2645581|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG >3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC >3.11|2645725|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC >3.12|2645798|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA >3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA >3.14|2645942|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG >3.15|2646013|35|NZ_AP021884|CRISPRCasFinder,CRT GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC |
cas2,cas1,cas4,cas7,cas8c,cas5 |
CRISPR arrays and Neighbor proteins around NZ_AP021884_3
The CRISPR arrays of NZ_AP021884_3 >merge|NZ_AP021884|3|2644962-2646084|PILER-CR,CRISPRCasFinder,CRT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACTCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACCGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACATACCGGTAGCGTCGGCAATACCCTGACCGCAGCGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACACGACGCAGGTTATAGAGCGTTGCACGGCAAAATTGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACCTAACCTGCCTTACACAGCCAGCCGCTACGATGAGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGCGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACTCGGTCATGGGTGCCAGTTACACTATCCCGATGGACGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGAGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAACAATTTCTTGCTGGATAAAATCAAGCCGCTTAGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAGCATGATCCAAATGGATGCCTTGCGGTAGCTTGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGCCATGGGTAGCACCGATTAGCACCTTGCCAAAGCGCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC >NZ_AP021884|3|1|2644962-2646012|PILER-CR GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC 
ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC >NZ_AP021884|3|3|2644962-2646084|CRISPRCasFinder GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC >NZ_AP021884|3|1|2644962-2646084|CRT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG 
GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC
>NZ_AP021884.1|WP_147072286.1|2644492_2644783_+|CRISPR-associated-endonuclease-Cas2 MLIIVTYDVSTETAAGRKRLRRVAKACEKMGQRVQKSVFECTVNEMQFEQLERTLLAEIDETQDNLRFYRITEPVEVRVKQHGCFRSVDFEGPLIA >NZ_AP021884.1|WP_147072284.1|2643449_2644484_+|type-I-C-CRISPR-associated-endonuclease-Cas1 MHTIQNTLYVMTPHAYAHLENATLRIDVEREKKLQVPLHHLGGVVCFGNVMVSPALMHRLADEGKSLVLLDDSGRFKARLEGPVSGNILLRQAHHSKASEPAFALGVARAVVAGKLKNSRTNLQRGAREAADPDEAATLTRSADNLAASLRAAAVANTMDELRGVEGEAARGYFAALNLIVKPLARPSFALNGRSRRPPLDRFNALLSFLYAMLMNDCRSAVEAAGLDAQLGFLHAVRPGRAALALDLQEEFRSILADRLALTLINRGQINAADFDEREGGAVMLGDKGRRTVVTAWQERKQEEITHPLTENKIPIGLLPFIQARFIARTIRGEMEGYLPYQAK >NZ_AP021884.1|WP_147072282.1|2643084_2643402_-|ribbon-helix-helix-protein,-CopG-family MATLTLRLPDNLDRQLTALAAQTHQNRSELARTALEKFLRELEQEQLLAEMVEAARFLATNPEARAESIAIAEEFLPLDNEALDIAEGRKPGDPWPEELGEKWWK >NZ_AP021884.1|WP_147072279.1|2642722_2643097_-|type-II-toxin-antitoxin-system-PemK/MazF-family-toxin MVEIMRRGEIWLARLNPNTGAEAGKVRPVLILLNDALLATGMSPVLCIPLTSKLYKNLAGLRIAIAPRGLLLKPCYAMPEQARALDRNRFGEGSLATLTNAEMAQVEKLFIAACGMAQYLIPQH >NZ_AP021884.1|WP_147072277.1|2642112_2642739_+|CRISPR-associated-protein-Cas4 MANSADEIVALSALQHWIYCPRQCGLIHLEQAFEDNVHTARGQAVHHLVDTPGYEIKSGVRVERALPVWCDRLNLIGKADLVEFHPDDSVYPVEFKHGAKRQKLHDDIQLAAQAICLEEMLNRPVPKGAIFHATSHRRREVSITPELKQLVEETANAIRAMLASGKLPPPVNDARCRECSLKEICQPEALAERGRLERLREELFSAAG >NZ_AP021884.1|WP_147072275.1|2641869_2642112_+|type-II-toxin-antitoxin-system-HicA-family-toxin MRVPRDLSGADLVKRLERMGYCVTRQTGSHMRLTSTVRGEHHITIPNHDPLRLGTLASILASVAAHHGLTRDELIQRLFD >NZ_AP021884.1|WP_124705901.1|2641666_2641873_+|2-oxoisovalerate-dehydrogenase MSEIHFIVEEAPEGGYVARAVGVDIVTEADDLPSLHAQVRDAVHCHFDEGKLPGLIRLHITREEVLTA >NZ_AP021884.1|WP_147072272.1|2640483_2641584_+|type-I-C-CRISPR-associated-protein-Cas7/Csd2 
MSIHNRYDFVLLFDVKDGNPNGDPDAGNLPRLDTETGQGLITDVSIKRKIRNFVGITKCKEDGTYETGFDIYIKEKAVLGRAHFAAFEKLGISLGQDATELIPDDLAEQFEALTLPEGMEIDTDEEGRSILNLSGATLDKKEAQKWLKDINPAKPLKNFISKVLKNVTARKPKQEESEKGRVQMCQDFYDIRTFGAVLSLKTAPNCGQVRGPVQITFARSIDPIVTLEHSITRCAVATEAEAEKQGGDNRTMGRKFTVPYGLYRTHGFVSAHLAGQTKFDESDLELLWEALKNMFEHDHSAARGEMATRGLYVFKHESHLGNEAAHKLFDRIKVNKTKDVPRGFEDYEVSVDETEMPSGVALLQKC >NZ_AP021884.1|WP_147072270.1|2638681_2640466_+|type-I-C-CRISPR-associated-protein-Cas8c/Csd1 MILQSLHEYYGRKRDSLPGDGIERKELPFLFVLKPDGAFLHIEDTRQGEGKRKRGNAFLVPQGVKKSVNVAANLLWGNVEYVIGQPDSKKLEEQRKKGKEKHYRERLGDMCSAFRTEIEQLPSEVKSTPEVAAVLAFLSSGNFTHVLADPLWPQVSATGANVSFKLTGAESPVCSASGILASVGQSTEDKGETRICLITGNSDVVERLHPPIKGVWGAQTSGANIVSFNLSAFNSFAREQGSNAPVGKRAAFAYTTALNHLLVSKQRIQIGDASTVFWAAEDNKMESLLSQFFDEPPQDNPDQGTNAVKELLEATLAGTPAIYDDGTRFYVLGLAPNAARIAVRFWHVATVGDLAGHIRQHFEDLEIVRPQYVERPFLSLKALLLAVSPLGDLDKLPPKLAGDFMKAILDGTSYPQTLLQAALRRIHAEQAKKDEKTGKHRDHVPYARAALIKAWLNRQTRNANPDQERKITMSLDESNINSGYRLGRLFAVLEKVQAEANPGLNTTIRDSYFGSASSTPSAVFPTLMRRNQHHMTKLRKEKPGLYVTRDKLIQTICNDGIDGQLGFRPILSLADQGRFVIGYYQQRQDLFTKS >NZ_AP021884.1|WP_147072268.1|2637986_2638685_+|type-I-C-CRISPR-associated-protein-Cas5 MPKTLCLKVWGDFACFTRPEMKVERVSYDVITPSAARAVFEAILWKPEIRWTVTKIEVLKPIKWISVRRNEIGKVASADNGQGDRGLYIEEHRQQRAGLFLRDVAYRLHAQFEVVDGSKHVHHYPELRGRFPAEPEESQPEHPAKYLSMFQRRAKKGQCFWQPYLGCREFSAHFELVDDAAAASLAEPPISDSPSLGWMLHDIDFADAMRPGFFRAEMKSGIIDLEDVEVRR >NZ_AP021884.1|WP_147072288.1|2647299_2647656_-|DUF2934-domain-containing-protein MAESKAKSKASGKPVSAVAETKPKAKTAQPAAGKAAAQSAVAAKPKVAKPKVAAPGANEPAAKRSVKLSNPAVSAEQRYRMIAEAAYYIAERRNFAPGDAAADWAQAEVQIVALLNKK >NZ_AP021884.1|WP_147072290.1|2647849_2649325_-|metalloprotease-TldD 
MTTSTLTGVLIPNPEMLFQTAHETLLVPNQLEASQLDGVFGRLMDHHVDYADLYFQYTRSEGWSLEEGQVKSGSFNIEQGVGVRAVSGEKTAFAYSDDISQPALLAAAEATRAIARSGAVRKPHAVARGGGHALYQPLDPLTTLKDAEKVALLEKLERYARAIDSRVTQVMASLASEYDVVLIARSDGHQAADVRPLVRLSLQVITEQDGRREQGSAGGGGRFGYDYFSDAMLKKYAEQAVHQALTNLAARPAPAGSMTVVLGAGWPGILLHEAIGHGLEGDFNRKGSSAFAGRVGERVAATGVTVVDDGTLMNRRGSLNVDDEGNLTQCTTLIENGVLKGYMQDTLNARLMGVPITGNARRESFAHIPMPRMTNTYMLNGDKDPEEIIASVKHGLYAVNFGGGQVDITSGKFVFSAAEAYMIEDGKITYPVKGATLIGNGPDVLTRVSMIGNDMALDPGVGTCGKEGQSVPVGVGQPTLRIDGLTVGGTA >NZ_AP021884.1|WP_147072292.1|2649382_2650324_-|carbon-nitrogen-hydrolase-family-protein MEKIVSDKTSKSPSSFAKPKSRTTVAKPARAPAPGVIRMAAIQMASGPNVSANLAEAERLVALAVAGGAKLVVLPEFFAIMGNKDTDKVAAREEEGKGPIQKFLASAAKKHKIWLVGGSVPLACDNPKKVRNSCLVYDDKGKLVARYDKIHLFGLDLGVEHYQEEKTIEPGDQIVVLDSPFGRIGLSVCYDLRFPELYRAMPNVDIILVPSAFTATTGKAHFETLVRARAIENLAYVIAPAQGGYHLSGRETHGDTMIVDPWGVVLDRLPRGSGVVMAGINPAYQASLRKSLPALKHRTLDCSHIQIKDKAIK >NZ_AP021884.1|WP_147072295.1|2650366_2654170_-|TIGR02099-family-protein MIAFSRRWIRRSVDYVVLPLALVVVVLVLLLRLWILPDIDRWRDDIAASISHSAGQRVTLGEINANWQGLHPHLRIRDIRVFGADGRPVLFLADVRATLSWTSLLHGELRLAVLTMDDVALTIRRDMQGIHVAGILLNQSDSSGGFGDWLLAQRHIQVNHATLAWNDERRGAPYLVARDVNLTLQNRGHRHRFRLTAIPPEQLAQPLDIRGDFSGRSLDDLASWHGQVYARVDRTDLGQWRQWLTLPYAISQGYGGLRMWLDVASRQVIAATVDASLRQVSVRFAADLPVLRLADVSGRGLWKRLGPAQSFAVKQLSLRTANFVYVAPFDLTLRLDPANAIQPGSGRIDTNSVQLDRLAALAPYLPLDAVQRRRLADLQPRGQLEKFTLAWSGNADQPLDYQIKGRFTRLGWQAQGNLPGAAGLSGNIDATRSNGTLALTSSGVMLALPRVLFEPDVALTTLTARMNWRATQAGYLIKLTEASFANPDLAGSAFGEYQLQAGRRGVIDLTGRLSRANVASAYHYLPLVVKDPTYQWVRSALLAGQGGAASIRLQGDLSRFPFRKAGDGVFEISTPISNGVLQYAAGWPRIEGIQAQLKFTGTRMEISSDAATIYGAALRRVSAVIPDLVDPDEILEVKGEAAGPLAELVRFANTSPLAAKLDNVTDNLRTTGNSRLGLDLKLPLRRAHHATLVGDIRFLGNTLIPAHGLPTLENVQGRLSFTDTGISAQSISARLLGGAATLSAVTQPGGVTRLLVDGRMTAAGLRPYLGTALAGHLSGMADWHARVDLHQMQAQADFESNLVGMASDLPPPFAKAAADSQPLRVKKSLRGADESLLAIHYGQVASALLLQKQKDGEPVIERGTLRFGGEAVLPEESGLWITGSLLLSDLDLWRNELTAAGNGAIGLPPLAGVNLSFRTLDLFGRRFQDININARNQAGTWRANVAGRGVNGDVTWQAADSRAGQPQDRLGAHFKTLAIPAALPVQGVKSSPSGSLPALDISVDNLQLGNRPLGRLSVSATPLDSGLNFESI
RLTQPDSTLTMQGIWNPDRIPQTRAKIHLEVNDVGRFLARFDHPGLVKRGQATLDGEGEWNGTPADIAIPSLSGTFALKASSGQFAKVDPGIGKLLGVLSLQALPRRIGLDFRDVFSDGFAFDEISGTMRLSRGVVYSDDFRMQGPSAKVRMSGMVDINAETQQLRVAVSPKLSESVALAGTLIGGPFVGLGALAVQKLLKDPFGQAATFEYSVTGAWTDPVVKRVARIAGGGEP >NZ_AP021884.1|WP_147072299.1|2654382_2656353_-|acetate--CoA-ligase MANIESVLQETRVFPPSAAFQAQANVSGMASHQALTARAAADYEGFWADMARAGISWKKDFSKILDESNAPFYKWFYDGELNVSYNCLDRHLPEKADKTALIFEADDGAVRRVTYQALYNQVCAFANGLKSRGVQKGDRVIIYMPMGVEAVVAMQACARIGAIHSVVFGGFSAKSLHERIRDAGARLVVTADGSIRGGKMLPLKSAVDAAIALGDCECVEAVVVYRRSGDDTAWNAARDIWWHDLVNGMAQTCEPEWVNAEHPLFILYTSGSTGHPKGVQHSSGGYLLGAILSMQWVFDARPDTDVFWCTADVGWITGHSYVVYGPLALGMTEVIFEGVPTYPDAGRFWKMIQDHQVTTFYTAPTAIRSLIKLGSDLPRQYDLSSLRLLGTVGEPINPEAWMWYYEAVGQSRCPIADTWWQTETGSHMIAPLPGAVATKPGSCTLPLPGIMADVVDEHGGSVPLGQGGYLVIKRPFPSLLRSLWGDPERFRKTYFPAELGGKTYLAGDSAHRDADGYYWIMGRIDDVLNVSGHRLGTMEIESALAANPRVAEAAVVGKPHDIKGEAVVAFVVLKGARASGDEAKKIVAELRDWVGKEIGPIAKPDEIRFGDNLPKTRSGKIMRRLLRAIARGEEITQDVSTLENPAILEQLKEAVR >NZ_AP021884.1|WP_147072301.1|2656415_2659091_+|bifunctional-[glutamate--ammonia-ligase]-adenylyl-L-tyrosine-phosphorylase/[glutamate--ammonia-ligase]-adenylyltransferase 
MPAHHLIERAASHSRYLARLLAADAQFVDSLASGLAQPFGADAMQAQLQAAAPGDEAMLKTALRKLRQAVMARLIVRDLGGLADLSEVMGTCTDLAETTLRCALAHHSTWLAQKHGMPKNPDGSDMQLVVVGMGKLGGRELNVSSDIDLIYLYPEQGETTGAKPVSHHEFFVLLGKKLGLAISDLTADGFVFRVDMRLRPWGDAGPLAMSYAALEDYLVAHGREWERYAWIKGRALTGTRLAELDQIIRPFVFRKYLDFNAFAAMRELHVQIRREVIRRDRADNIKLGPGGIREIEFTAQVFQLIRGGQVAVLQTRSLLAVLPLLAARGLLPENAVAELQAAYVFLRNLEHRLQYLDDAQTQMLPTQPDDRTRIATSMGFTDYPAFLAALNAHRTQVSRHFDQVFAAPQADSGSHPLAGLWQGALEHADALATLAGLGYTAPAEVCNRLRQIRTSIRYTTLPASNRARFDTLMPALIEVAASCNPPDATLARILDLLETVARRDSYLALLVEYPATLQRVARLCAASPWAAQYLARNPMLLDELLDTRQLYATPDWPALGDELQALMHTHCGDTERQMDAMRQFRQRVTFHLLAQDLAGVLALETLSDHLSDLAALILSATLPLAWAGVRNRHRDTPRFAVIGYGKLGGREMGYASDLDLVFLYEDPAPAAAEHYARLAQRINTWLGSTTAAGVLYETDLRLRPDGTSGLLVSSVEAFSQYQHSHAWTWEHQALTRARYVAGDAAVGAAFERIRCDILTQPRDPARLREDVLAMRQKMHAGHPNHSDLFDLKHDAGGIVDVEFMVQYLVLAHAARHRELTRNSGNIALLRLAAELELIPASDAEAVRSAYRELRRLQHALRLHGIQTARIEPMQVAGHAAAVRRLWRTLFG >NZ_AP021884.1|WP_147072457.1|2659154_2660069_+|branched-chain-amino-acid-transaminase MADRDGFIWYDGKMVPWRDATTHVLTHTLHYGMGVFEGVRAYNTDQGTAIFRLQEHTDRLFRSAHILGMKMPFDKAAISAAQLAAVRDNQLESAYIRPMAFYGAEAMGISAKTLSTHVIVAAWTWGAYMGAEALERGIRVKTSSFARHHVNIAMCKAKANGNYMNSILAHQEAAQDGYQEALLLDVDGFVAEGSGENVFIVRNGKLITPDLTSALEGITRDTIVQLAGEIGLQVVEKRITRDEMYSADEAFFTGTAAEVTPIRELDNRTIGTGARGPITAQLQKMYFDCVTGKDPKHAGWLSYI >NZ_AP021884.1|WP_147072303.1|2660079_2660286_+|zinc-finger-domain-containing-protein MAQVQHENTQRIIEVTADDLPLHCPTPGMIAWDSHPRVFLPVEVKGEALCPYCGTMYILKGGAVAHGH >NZ_AP021884.1|WP_147072305.1|2660694_2661141_+|6-carboxytetrahydropterin-synthase-QueD MLITRRLEFDAGHRIPNHASQCKHLHGHRYAIEITLSGDIITAEGQSEQGMVMDFSDVKRIAREQLVDAWDHAFLAYRGDKPVCDFLATLTDHKTIILELVPTVENLAHIAFDILDPAYRDTYGNQLRLKQVRIYETPNNWADCRQPE >NZ_AP021884.1|WP_147072307.1|2661191_2661659_+|YbhB/YbcL-family-Raf-kinase-inhibitor-like-protein MGMTMTSTAFAHHGAIPEHYTCDATDTSPPLAWAGVPVGAKSLVLIVDDPDAPDPAAPQRTWVHWLLYNLPPTSSGLAEGVTALPAGTLEGINDWKRTGYGGPCPPIGRHRYFHKLYALDVVLPNLDRPSKAALEKAMQGHILAQTELIGLYQRH |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021884_4 | 2907296-2907401 | Orphan |
NA
Consensus repeat of NZ_AP021884_4
|
1 spacers
spacers of NZ_AP021884_4
>4.1|2907330|38|NZ_AP021884|CRISPRCasFinder CAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGC |
CRISPR arrays and Neighbor proteins around NZ_AP021884_4
The CRISPR arrays of NZ_AP021884_4 >merge|NZ_AP021884|4|2907296-2907401|CRISPRCasFinder GATTCCAACGACTGGGTGCGGCTGGGCAGTATGGCAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGCGATTCCAACGACTGGGTGCGGCTGGGCAATATGG >NZ_AP021884|4|4|2907296-2907401|CRISPRCasFinder GATTCCAACGACTGGGTGCGGCTGGGCAGTATGG CAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGC GATTCCAACGACTGGGTGCGGCTGGGCAATATGG
>NZ_AP021884.1|WP_147070033.1|2903228_2905838_+|alanine--tRNA-ligase MKSSEIRQRFLDFFARHGHTPVASSPLVPGNDPTLLFTNAGMVQFKDVFLGRETRPYARAVSSQRCVRAGGKHNDLENVGYTARHHTFFEMLGNFSFGDYFKRNAIQFAWEFLTQELGIAKDKLWITVYHTDDEAHTIWTAEMGVPDERVIRIGDKPGGGSDNFWQMGDTGPCGPCTEIFYDHGAEVAGGPPGSADEDGDRYIEIWNLVFMQFNRDEAGNLQPLPRPSVDTGMGLERISAVMQHVHSNYEIDLFQALIHAAARVTGSADLTDNSLKVIADHIRACAFLITDGIIPGNEGRGYVLRRIIRRAIRHGYQLGQKQPFFHLLVADLAMAMGAAYPELVAAQARVTAVLKQEEERFAETLEHGMDILEQALQSGANVLDGATAFKLYDTYGFPLDLTADVGRERGFTVDMAGFEAAMEAQRKRARAASKFTMQAGMRFDGPPTEFRGYDTLSLDSRILALYQDGSPVHSIAAGEAAVIVLDRTPFYAESGGQVGDSGELHGSGSVFVVDDTQKIQPDVFGHTGLLQSGSLKLGDTVSAQVDADARSRAACNHSATHLLHAALRQVLGTHVTQKGSLVDAARTRFDFAHSEAVSAAQLQQIEDLVNREIRRNVIVEARLMNYDAAIAHGAMALFGEKYGDQVRVIGMGEFSTELCGGTHVSRSGDIGLFKIISESGVAAGIRRIEAVTGPAALAMIQAQQRQILEAAALLKAPPQELQQKIAQIVDNVKNLEKELDRLKSRLAAAQGDDLVSQATAVGNAKVLAAMLEGADVKTLRETVDKLKDRLKSCAVVLGSCSDGRVTLVAGVSADLTSKVKAGELANFVASQVGGKGGGRPDMAQAGGTEPAQLPAALQSVAGWVAQRLE >NZ_AP021884.1|WP_147070267.1|2901373_2903095_+|thiosulfohydrolase-SoxB MNRREFLQILAVAAASGMAIDNKQALAGNAPSGFYDLPKFGNVSLLHMTDCHAQLLPIYFREPDVNIGVGAAIGQPPHLVGEYLLKYYGIRPGTREAYAFTYLDFAAAARTYGKVGGFAYLKTLVDKVRASRPGSLLLDGGDTWQGSATSLWTNGQDMVDAAKLLGVNVMTGHWEFTYGAERVKHVVDNDFKGHIDFVAQNIKTNDFGDPVFKPYVIKTMNGVQVAIIGQAFPYTPIANPRYMVPDWSFGINDDNMQKVVNEARAKGAQVVVVLSHNGMDVDLKMATRVTGIDAIFGGHTHDGVPQPTQVKNAKGTTLVTNAGSNGKFLGVMDFDVKNGKIAAWKYRLLPVFSNLLEPDAKMAKLIEDVRAPYASKLNEKLAVTEELLYRRGNFNGTFDQLILDALMAVKGADAAFSPGFRWGTTLLSGDVITMDHLMDQTAITYPSTTLTEMTGATIKSIMEDVCDNLFNADPYYQQGGDMVRVGGIQYAVAPNNKIGNRISNMTLKGKPVMASKKYKVAGWAPVGEGVSGTPIWDVVAEYLRDIKVVKPRKLNEPKIVGIGKNPGIAPGIA >NZ_AP021884.1|WP_147070035.1|2900312_2901191_+|sulfur-oxidation-c-type-cytochrome-SoxA MKTTFREPQAQSPKAHGKKILLALAGAGLLLGALNASATPEQDRQSLLKFYSSKYPDIKVANYIYGALAFDPDAMEQYNSIMDFPPFGSVIEHGKKMWETPFKNGKKYADCFPNGGKNVAGNYPYFDDKAGKVVTFEMAINACRTANGEEAFKYNDMQTMGTLTAYARTLSDGMPMNIKVQGAAATAAYEAGKSQFYSRRGQLNFSCASCHVANAGNHLRSELLSPAVGQATHWPVFRGGEQLVTLQERYVGCNKQVRAVPFAPGSEEYNNLEYFHSYISNGLPLKASVFRK 
>NZ_AP021884.1|WP_147070037.1|2899922_2900237_+|thiosulfate-oxidation-carrier-complex-protein-SoxZ MAEPMKMRASVSGDVADIKVLMNHPMETGLRKDAKTGQLIPAHFINEVHATVNGKPVLDAQWGGGVSKNPYLGFKVKGAKAGDKVEVSWKDNKGESNKVDGVVA >NZ_AP021884.1|WP_147070039.1|2899397_2899865_+|thiosulfate-oxidation-carrier-protein-SoxY MNALRRNILKSAGATGIVAMAAAAGLLKSGNVLAAWNSSAFAAKTVPEAIKDLGLSTPADSKAISIKAPDIAENGAVVPVEVTSSIAGTTGIAIFAEKNATPLITDFKLSNGAEGFISTRIKMGQTAMVRAVVTAGGKTYTAAKEVKVTIGGCGG >NZ_AP021884.1|WP_147070041.1|2898997_2899366_+|sulfur-oxidation-c-type-cytochrome-SoxX MRAGASLILTASMVGMFVMANYAVAADTPKQEETGKSIAFDKTKGNCLACHAMPTVPDAVAAGTIGPPLIAMSARYPDKAKLRAQIWDATVANPQSVMIPFGKHKVLTEQEIDKVTDFVYGL >NZ_AP021884.1|WP_147070043.1|2897364_2898786_+|M48-family-metalloprotease MKSASLFLALCLASQQLAASELPDLGDVSQGAFSPRDEARVGNEIMRDIYAEPAYYDDPELTDYLNNLGYRLVAASPENRLAFQFFVLRDHTLNAFALPGGFIGVHTGLIEATQSESELAGVLGHEIAHVTQHHLARMIESRNQGILPSLAALAVAILAARSNPQAASAAIATVQATSIQKQLNFSRANEREADRIGMQIMRGAGFDPRAMATFFERLQKNSRLYENNAPAYLLTHPLTSERIADMQNRAASMPVKQVADSLEFQLLRAKLLAGEGRPEEAVRRFTEAIRDTRYNSLAAERYGLVVALLRTRQFDRAEQELDRLNQSGASSPMIAMLGARLRQEAGDLNTALARYQAGRARFPGYRPLLYADANALLQAGKADAALALVTDHLALYPDDYRLYQLQSRAYAMQGKDFLRHHAQAEAYVRQGNLDAAIEQLKLGLKSRDGDFYQMSIAEARLKELVALNQPAKP >NZ_AP021884.1|WP_147070045.1|2896848_2897325_-|cyclic-pyranopterin-monophosphate-synthase-MoaC MNQLTHFDDRGRAQMVDVADKSDTRRVAVAAGRIVMQPATLKMILDGSARKGDVLGVARIAAIAASKRTADLIPLCHPLALTRVAVEFLAEEADSAIECRVTAETVGKTGVEMEALTALSVGLLTIYDMCKAVDRGMRMEGLRLLEKQGGKSGHWRAP >NZ_AP021884.1|WP_147070047.1|2894604_2896824_+|EAL-domain-containing-protein 
MTQIDTRLISTATWLAGALAGLIALAFPLVYFSLSYEHQAASMETEAEFEAARIARLINANPELWPFEQSRLQELLQDQTETELPESRRIVDVNGRLIAQSQGKSARPYLLRTADLRNSGSVAGRVEIIRSLRPLLLKTAMASLLGLLLGSLAMVIFRAYPLRILKRALNTLANEKERAEVTLHSIGDAVITTNASGHIEYLNPVAEQLTGWTNEAARGLPSWRVFNIINESTGAPLDSPAEKAIKENRIVPLANHAGLVKRNGKIIPIENSAAPICDSQGQIIGAVLVFHDVSHARAMATKLSHQASHDPLTGLINRHTFESRLQQALDNVRRENSHHTLCYMDLDQFKIVNDTCGHRAGDELLRQLAGELRTKVRNSDCLARLGGDEFGLLLEGCTVQQAEHVAATLLQTVKEFRFHWQEHTCAVGVSIGLVGINAGCGDLAKIMGAADSSCYAAKDRGRNCIYVYQPDDKEVAQRRGEMQWVARITRAIDEGRLRLYYQTIQPLAGTQGAHYEILLRMLDEEGRIVPPGTFIPAAERYGLMPAIDRWVIENTFATLGRLYRGDAKKRLHTCAINLSGTSWADESLAGFICGMTGRHGVPARSICFEITETAAISNLGKTIALIRDLKEAGFRFSLDDFGSGVSSFGYLKQLPVDYLKIDGGFVRNIIHDKIDHAMVAAINQIGHIMGIKTIAEFVENEEILERITAMGVDYAQGYAIARPQPLDHINLASAPVLQQ >NZ_AP021884.1|WP_147070049.1|2893538_2894183_-|methyltransferase-domain-containing-protein MQAAEYDAWYQTPRGRWVGETESDLLRRMLGPQSGESLLDVGCGTGFFTRRFARGSRSAVTGLDPNRDWLAFAERHAVSTENYVNGSALALPFDAGGFDLVMAVTALCFISDQRLALTEMLRVARRRIALGLLNRHSLLYWQKGRGGGQGAYRGAHWHTAAQVRELFAGQPVRNLRMAFSIFVPGGGWIAQQLESVLPTSLPLGSFFVVTADVV >NZ_AP021884.1|WP_147070029.1|2907798_2908467_-|alpha/beta-fold-hydrolase MTAFAPLEFVAGSQAVQASVIWLHGLGADGHDFAPVVQALDLPGVRFILPHAPTRPVTINGGHVMPAWYDIRSTGLDADEDAAGLAQSSRVVEDLVAHELARGVASARIIVAGFSQGGALALYAGLAPGRVLGGIMVLSAYLPLMAGFNEWCAAGTHTIPVFMAHGVQDRVVPLQLAERSRQKLVACGFDVEWQIYPMAHSVCEEEIDAIRGWLIRVLQLHV >NZ_AP021884.1|WP_147070027.1|2908463_2909963_-|hypothetical-protein MRRIPLVGKLLPAAGEALAVPVRDDPASYSAHEICESIEQLIETLLSARKKNLDWQRHSIDSLHRQDNFSAPFMTRLTQHYLALPPFVSSVSGRFLAAISGYWEEMSAIHLQCVTYLLGHPESRLGALLPLLIQRALYHHAMQMKWRWLRYQLIPSCFWARLHRLYAVAEKHEFARVPLPLPGMERADSCCETLYLRPQMLHSLRPDTLLPCEIEQVDEWIVRWSKSVLLEPMLLSGKHRYGVNLKGASPPRPLAMLNEPGSYRYWGPGLMLAALHAEHDEADAGAHAGWRQALWRRVVNDWSGIPPLRHHPRQMIGKQTELFLGFNEIHTRIDHHPSRRAHDLPYWRRCRVRDESAEGLGLALNTSDGVPVAINSLIGINSGRHFLVGVVCRIRRHESGWTEIGIRRLAANAVPVKLESVNVNLAGQVVDALYLSMAGAFGQRRCVLIPARISWQDGQWQLLCKGRRHLIRLRAPLKATEDYVLADFDGLAQSEAIAS >NZ_AP021884.1|WP_147070024.1|2909981_2910410_-|hypothetical-protein 
MKDFIALLVEQSRLQSVAINPALDDALTHLDHALAGLCAAVQVEYRGPYVGVETPLAHQMVVRRHEWKIHQPAWSMKICVAAPAANCRAEWPVQGVGRLRKALVVKALPAFFAGFAEAIKQAGKQDSSAGLRVLELSRRFNL >NZ_AP021884.1|WP_147070022.1|2910437_2912018_+|sigma-54-interacting-transcriptional-regulator MRAQIRANWRKYYHTTRRAVRARSATRTAGNLAQSGQNANNAKEGTNVSLQGISAKPSLLIVDDDPLITDTLNFVLSRDFEVFVADSRSQVKSLLTQLDTPPQLALVDLGLPPLPHKPDEGFHLISELLGYSPGIKILVLSGQNDETNARHARALGAIDFVGKPCEPAQIKSLLFNALLIQDVERSAETEAPAAENLIVGTSFNLDRLRQQITQYANAPFPVLIEGESGSGKELVAASLHKLSGRTKKPYLALNCAAISPTLVEPTLFGYCKGAFTGATSNRAGYFEDACDGTLFLDEIGELPLELQAKLLRVLENGEFQRVGETQSRFSNARVVTATNRDLRQEIKAGRFRADLYHRLSVFGIAVPPLRELGEDKVRLLEHFREFYAREARVKPFALDNRARQMWEDYHFPGNVRELRNIVIRLTTKCAGQNVTAEQLETELDTDTAFPSEIPLPNDGKALYDTARRHLQTLANFSLDQTMKQWEKSYVEAALNLTHGNLSQAAKILGINRTTLYSRMQTYTNEA >NZ_AP021884.1|WP_147070020.1|2912147_2913497_+|AAA-family-ATPase MYHEFFGLKEAPFRITPDTGFFFSGGERGAILQGLAYAIRQGEGIIKVTGEVGSGKTMLCWMLEQHLPDHIETVYLANPNVKPEDVLPSILAELELVRPADASRAGHLRTLNDYLLARHDAGKQVVMFVEEAQGMTLDTLEEIRLLSNLETEREKLLQIVLFGQPELDAKLADPRIRQLRERITTAITLAPLTPDAIRAYLAFRLTTAGYRGPDLFDRRAVRSIARASRGLTRRVNILADKSLLAAYTDNTRTIQPRHIRIALRDSAFNDDANKPQRWLLPVIAMGVMVAVLASFYWRSKPAAAPSRQTQTRPAAGLPGRASAAAPDPVAPLSADPFQQRLAATRTWLMQQPADTRTIQLSLLNSPSEFAAYLRGEGGGLAPDQLRIFRTQAQGHPSWTVIYGSYPTRQTANRALLALPEAVRKRHPYLRTVGGIRNETRQIQQVGEQS >NZ_AP021884.1|WP_147070265.1|2913576_2915352_+|secretin-N-terminal-domain-containing-protein MWLPMLAVPLLAGCVPAAMIQPSQGHIQQSSQPATRLADIPPLVKTIPYLPSPRAETQVPTYTIVVDNVPVKDLLFSLARDTKKNIDIGTGITGNVTLNAVNEPLPAILERIARQASIRYRMEGDTLSIMPDTPYLKTYKVNYVNLSRNTSSSIGVAAQIASTGSGAVGAAASGSAQGGNSSSTTVDSQSNNNFWEVLTENVRAILTSTRASTQRAEDKSARLDAERNARADRLEQAQAVARAGAAAPTLYREAFGNTSSSLLQDSKNEVIVNPVAGTVSVLGNERQQQVVQQYLDGVSQSSQRQVLIEATIVEVSLKDQYRAGIDWSRLANGSKGIFFNTMPAATTNLANSLLPFFNIGYRDRNLTATLNLLESFGNLRVLSSPKLMALNNQTALLKVVDNLVYFTVQAQQGTLSSTGTPLQPTTFTTTAKTVPVGLVMSLTPQISESGMVTLDVRPTISRKIGDVSDPNPGLPVSTPNKIPVIQVREMESVLQVGSGQTVILGGLMQDDSDRARDGIPVLSRPQGFGAIFGQHEHNVQKTELVIFLRPTVITNPSLDSDELKFYKRYLPRANAAPEQWHNGADAAGDPQ 
>NZ_AP021884.1|WP_147070018.1|2915348_2916524_+|tetratricopeptide-repeat-protein MSLLLKALKQAGDKSAAGARNPSATLADSLSLEPISGSAPDGTAYTSWDGAAPFKRSTARAAWYTPWLSGQRWLVPAVAVVAALFMLIYGVFVYWQTRTPAALVVTPTPHSAAPAAAPPAAAPAQLAAVPSQESGPPLPEINSAVPDAPAALPPPPVQADPTPQWGSGELIREAPPPRRARTQPGRRETRSALPFSMQTATTHINPQLEAAYQAYQAGHTREARNLYLQIPDGERNVDVQLGLAAIALRDNDTPAAARHYQRVLELDPRNSTANSALIGMMGDADPNASETRLKSLIASQPSSQLYFALGNLYAGQNRWPDAEQAYFEAYQKNAANADYAYNLAVSLEHISQSRAALNYYQKARDLMQPGNVQFDPLRLEARIDQLKARQE >NZ_AP021884.1|WP_147070016.1|2916529_2918233_+|Flp-pilus-assembly-complex-ATPase-component-TadA MEARKTLRLGEMLVQQGLITLDQLRIALKEQQHTNLPLGRLLVKLGFITEAVIRDQLAHTIGQTSLDLANVVADPEALKLISEDFARRHHLLPIAFDAQRQVLVVAITDMFNVVALDHLRALLGAGVEVDTVLSGEAQLLEAIDNFYGFELSVDGILREIETGEVDYQSLAMDTEEYTQPVVRLVGSLLVDAVKRGASDIHFEPEHAFLRIRYRIDGVLEQVRSLHKSYWPGIAVRLKVISGMNIAENRAPQDGRLSLTLHGRPIDFRVSSQPTIHGENIVLRVLDREKSIIPLANMDLPTDTHTALQRMMARPEGILIITGPTGSGKTTTLYSLLTHLNNETVNIMTLEDPVEYPVTLMRQSSVNETLKLDFANGIRSIMRQDPDIILVGEIRDRDTAEMAFRAAMTGHQVFTTLHTNSALGAFPRLLDIGIVPDIMAGNIIGVVAQRLVRVLCPHCRAAYTPDADEQKLLDWQATDRRPVYRAVGCPACNGKGYRGRMALMELLRMDSELDDLVARRATHREILNAALMRGYRSLAVDGISRVLEGKTSLAEVSRVVDLTQRILS >NZ_AP021884.1|WP_147070015.1|2918246_2919452_+|type-II-secretion-system-F-family-protein MPYFSYRAVDQIGRTNRGSLSAANEVDLELRLRRMGLDLITLRQMDSRASGFARGAASRRDLITFCFHLEQISRAGIPILDGVRDLRDSMDNPRFRDILTALLEDMEGGRLMSQALAAHPAVFDTVIVNLVRAGEQTGLMREVFENLGASLKRQDELAAQTRRLLIYPTLVLSMVGIIILLLLLFLVPQIADLIKNMGIALPIQTRVLLWLSETLRTWWPLFLILPVAIGSALVVTLRASERARFVADDVKLRLPVIGPILQKIALARFSNFFALMYRSGITILDALRAGEDIAANRVIADAIRRAGGRIGNGEGLTESFQSLSVFPPLVIRMLRVGETTGALDTALENVSYFYTREVSESIEKSLKILEPALTVVLGLVMAVIVGSVLLPMYDVIGTLKP >NZ_AP021884.1|WP_147070013.1|2919448_2920984_+|hypothetical-protein 
MMFAPQLLVYVCAWSITVACRRAGKIRLVGQFNADEGGRRAFAAVLQAFKNSPVSVMVDGVDEDYRLETLPHVLGNARREMLERRLRQISRNALFSAAWPQGREASGRRDDRYLFISLSNHDAVRPWLDLLHQHGVHLAELTVLPAISHVLLQRIQPTEPHVLLVSEHCGGLRLSYFEHGNLRFSRLTAPESLAEGHAPDLASEINKTDLYLNSQRLMPRDAQLAVYVLDPENAYAGLCREISAENKNLICQAVGSVALAKLVGVDEPLLHRTADVAYLAVLGRSRAAVNLAPAAYTRGYVQLMLRHKLYTGAFAVLATALAISGYLFSRQHDLEQQRLVTQDRIQQQASLYRAVQLALPRAPTSPQNLKRVVETARALYAAPQPMSDFARVSQALETVPDIAVLRLRWLDHDAADTTATHSAVSDNPGAAVRALYFDGEVSPFQGDYKTALASIEHFAATLRNDPGVAEVRVLALPINTDPTATLDESQHTGNSAPRARFRLKLLMRPAR |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NC_007766 | Rhizobium etli CFN 42 plasmid p42f, complete sequence | 395837-395869 | 6 | 0.818 |
NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NZ_CP020911 | Rhizobium etli strain NXC12 plasmid pRetNXC12e, complete sequence | 526121-526153 | 6 | 0.818 |
NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NC_021911 | Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1f, complete sequence | 601591-601623 | 6 | 0.818 |
NZ_AP021884_3 | 3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2645871-2645904 | 34 | MN692973 | Marine virus AFVG_117M33, complete genome | 35754-35787 | 9 | 0.735 |
NZ_AP021884_3 | 3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2644999-2645034 | 36 | NC_008043 | Ruegeria sp. TM1040 megaplasmid, complete sequence | 694259-694294 | 10 | 0.722 |
NZ_AP021884_3 | 3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2645653-2645687 | 35 | NZ_CP007794 | Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence | 1356140-1356174 | 10 | 0.714 |
1. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches NC_007766 (Rhizobium etli CFN 42 plasmid p42f, complete sequence) position: 395837-395869, mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer .*****.** **************** ****
2. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches NZ_CP020911 (Rhizobium etli strain NXC12 plasmid pRetNXC12e, complete sequence) position: 526121-526153, mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer .*****.** **************** ****
3. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches NC_021911 (Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1f, complete sequence) position: 601591-601623, mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer .*****.** **************** ****
4. spacer 3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches MN692973 (Marine virus AFVG_117M33, complete genome) position: 35754-35787, mismatch: 9, identity: 0.735
aacaatttcttgctggataaaatcaagccgctta CRISPR spacer tacaatttcgttctggataaaatcaagtgttgca Protospacer ******** * ***************. . .*
5. spacer 3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches NC_008043 (Ruegeria sp. TM1040 megaplasmid, complete sequence) position: 694259-694294, mismatch: 10, identity: 0.722
atggtgcgatcctgttgttgctggttgtgctgcggg CRISPR spacer tcgctccgatcctgttgtggctggtggtgctgatca Protospacer .* * ************ ****** ****** .
6. spacer 3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches NZ_CP007794 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence) position: 1356140-1356174, mismatch: 10, identity: 0.714
accggttcatcgccgtgccgcctgtccatcgccgc CRISPR spacer gtccagccgtcgccgtgccccctggccatcgccgg Protospacer ..* . .*.********** **** *********
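The Spacer_Info, Spacer_region, Mismatch and Identity columns of the hit table above appear to be related by simple arithmetic. The sketch below is an illustration only, not the database's own code. It assumes the Spacer_Info field is laid out as `spacer_id|start|length|contig|tools`, that the spacer region is `start` to `start + length - 1`, and that identity is `(length - mismatch) / length` rounded to three decimals. These assumptions are inferred from the six rows listed here, and the names `SpacerInfo`, `parse_spacer_info`, `spacer_region` and `spacer_identity` are hypothetical.

```python
from typing import NamedTuple, Tuple


class SpacerInfo(NamedTuple):
    """Fields of a Spacer_Info string (layout assumed from the table rows)."""
    spacer_id: str
    start: int
    length: int
    contig: str
    tools: Tuple[str, ...]


def parse_spacer_info(field: str) -> SpacerInfo:
    """Split e.g. '2.1|1831723|33|NZ_AP021884|CRISPRCasFinder' into its parts."""
    spacer_id, start, length, contig, tools = field.split("|")
    return SpacerInfo(spacer_id, int(start), int(length), contig,
                      tuple(tools.split(",")))


def spacer_region(info: SpacerInfo) -> Tuple[int, int]:
    """Inclusive start and end coordinates of the spacer on the contig."""
    return info.start, info.start + info.length - 1


def spacer_identity(length: int, mismatches: int) -> float:
    """Fraction of spacer positions that match, rounded to three decimals."""
    if length <= 0 or not 0 <= mismatches <= length:
        raise ValueError("mismatches must lie between 0 and a positive length")
    return round((length - mismatches) / length, 3)


if __name__ == "__main__":
    info = parse_spacer_info("3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT")
    print(spacer_region(info))                # coordinates of spacer 3.13
    print(spacer_identity(info.length, 9))    # identity for the MN692973 hit
```

Under these assumptions the functions reproduce the Spacer_region and Identity values of all six hits in the table, which suggests that gap positions in the alignments are counted as mismatches.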
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
622875 : 631682
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP021884|622875:631682|DBSCAN-SWA TTCAGGAATTGCCGCTGTTGTCGATGAAGCTCTTGAGACGGTCAGAGCGCGATGGGTGGCGCAGTTTGCGCAGCGCTTTGGCTTCGATCTGGCGGATACGCTCACGGGTTACGTCGAACTGTTTGCCGACTTCCTCCAGGGTGTGGTCGGTATTCATCTCGATGCCGAAACGCATGCGCAGCACTTTGGCTTCGCGTTGCGTCAGGCCATCGAGAATATCCTTGGTCACTTCCTGCAGGCTGCCGTAAACGGCGGCGTCTATCGGCGCCAGCGTGGCGGTGTCTTCTATGAAATCGCCCAGATGGGAATCCTCGTCGTCGCCGATAGGGGTTTCCATGGAAATAGGCTCTTTGGAGATTTTGAGTATCTTGCGGATTTTCTCCTCGGTCATTTCCATTTTTTCGGCCAGCAATTCCGGGTCGGGTTCCTTGCCGGTTTCCTGAAGAATCTGACGCGAGATACGGTTCATCTTGTTGATGGTCTCGATCATGTGCACCGGAATACGAATGGTGCGTGCCTGATCCGCGATGGAGCGGGTGATGGCCTGACGGATCCACCAGGTGGCGTAGGTCGAGAACTTGTAGCCGCGCCGGTATTCGAATTTGTCCACGGCTTTCATCAGGCCGATGTTGCCTTCCTGGATCAGGTCGAGGAATTGCAGGCCACGGTTGGTGTATTTTTTTGCAATGGAAATCACCAGGCGCAGGTTGGCCTCGATCATTTCACGTTTGGCGCGGCGCGCGCGCGCTTCGCCAGTGGACATCTGGCGATTGATTTCCTTCAAGTCCTTGATCGGGATGCCCACTTTTTCCTGCAGGGCAATCAGGCGTTGCTGACGTTCGACGATAGTGTGCTGGTAGCGTGTCAAATCCTCTGAGTAGGCCTTTTTCGAATTGATTTCCTTGGTAACCCAGTCCAGATTGCTTTCGTTGCCTGGAAACACCTTGATGAAGTGGGCGCGGGGCATACCTGATTTGTTCACCGCGAATTCCATGATGTCACGCTCGTGGCTGCGGACTTCTTCCACCAGATTGCGCAAGCCTTCGCATAGCGCCTCGACTTGCTTGGCAGAAAAGCGGATATTCATTAGCTCAGCGGAGAGCTCTTCCTGCAGTTGCAGATACTGCGGGCTGCCGAAGCCGTTTTTCTTCAACGTGGCCTGTATCCGCTTGAATACCTTGCGGATGACTTCAAAGTGCGCCATGGCGTCGATCTTGAGCTGAGCGAGATTGGCTGCAGCCAGCGCGCTACCGTCATCTTCCTCGGCATCTTCGTCGAGCTCTTCTTCGAGTTCGCTGATGTCAACCTCGGGCTCGCTGGCGATCGCTTCGAGTTCTTCTGCTACGACACCATCCACGAAATCGTCAATGCGAATTTCCTCACGCTCAACCTTGTCCACCAGTGTCAGGATTTCCTGAATCGTGGTGGGACAGGCGGAGATGGCCTGAATCATGTGTTTCAAGCCGTCTTCAATGCGTTTGGCAATTTCGATTTCGCCTTCACGCGTGAGCAGTTCCACCGAGCCCATTTCGCGCATGTACATGCGCACCGGGTCAGTGGTGCGGCCGAACTCGGAATCCACGGTAGATAACGCAGCCTCGGCCTCGGCCACCACGTCCTCATCGGCGACTGCCGGGGCGGCGTCGGACATCAACAGGGTCTCGGCATCAGGGGCTTCGTCATAGACCTGGATGCCCATGTCATTGATCATGCTGATGACGCCTTCGATCTGTTCGGCATCCAGCATGTCATCTGGCAAATGGTCGTTGATCTCGGCGTAGGTCAGGTAGCCCCGCTCCTTGCCGAGCACAATCAGATTCTTCAGGCGTGTGCGTCGGGCTTCGACATCTACCGTTTTCACCTCGCCTTTTTCGTGATCGTTTGCCATATGTCTTGGTCCAGTAAAATTTGGTGCATCAAAAAAA
CTGACAATTATACCTTTGTTCGAGCCTGATCGCTCATTTTTCCACCTGCTGTCTGGCATTAACCAATTGTCGCAACGCTGCTTTTTCGGTTTCACTCAGTGCGCTGACGGGCTTGTCGGTTACACGCGCACCCCGCTGTTTGTCGAGTTGCATGCGTGCCTGATAACAGGCGTCAGCAAATTCGGCGCTGATGTCGAGATCGGCTGCCCAGCCCATTATTTCCGCGCTGGCGCGCTGCAGGATGGACGCCAAAGCGTTATCGCGAAAATGGTCTATCACGCTCGCAGTGCCTAAATTGGGATGAACGCGCAGCAATTCAACCACAGCGCGTAGCGCGTCTGCATCCGGGTCAGTTGGGGCAATCAAACTGGCGTCAAGTTCCCGTGCCAGGCTGGGCATAAACAGAATCGCACGCAGCAGCCAGTGCCAAATGGAAGCGGGTGCCTGGCGTGGGGCACGGACAGGTGCACGCGCCGGGTTGAATTGCCGGGATTTGATCTGCCACAAGCTGTCGAGTTCGGACAGCTGCAGATTTGCCAGTTCGGCGCAACGTTTGCGCAGCAGGAGTGCCAGGGCGGGTGCGCGTATCTGGCTTAACAGGGGGTGCGCAGCCTGCAAAAAGGCACTGCGTCCTTCGCTGCTGGCCAGGTCATGCTGGGCTGCCAGCTCCTTGAACAGATAGGCAGATAGCGGTACCACCTCGCCGCCCAGCAGCGCTTCGAAAGCGCCTTTGCCAAATGCGCGAATATAGCTGTCCGGGTCGTGTTCCGGCGCCAGAAACAAAAACCCGACACGACTGCCATCGCTCAGTATGGCCAGGCTGTTTTCCAATGCGCGCCAGGCGGCGTGCCGTCCGGCGGCGTCGCCATCAAAGCAGAACACGAGCTCATCAGTGTGGCGCAGCAGTTTTTGCACGTGCGCCGCGGTGGTGGCGGTGCCCAGGGTGGCGACGGCGTACTCCACCCCATGCTGCGCCAGTGCCACCACGTCCATATAGCCTTCCACCACAATCACGCGTCCCGCATCGCGGATTGCGCGGCGTGCCTGGAACAGGCCATACAGCTCGTTACCTTTCTGGAATAGCGGCGTTTCCGGTGAATTCAGGTACTTGGGTTCAGCTGCGTCGAGCACGCGGCCGCCATAGCCGATGACATCCCCGCGTTGCCCGACAATCGGGAACATGATGCGGTCGCGAAAACGGTCATAACGCTGCCCGGCGTCGTTAACGATGACCAGCCCGGCTTCGGCCAGCGCCGGATCGGCATACTGGTCGAACACCGCTGCGAGATTCTGCCAGCCGGCTGGGGCATAGCCAAGGCCAAAGCGGGCGGCGATTTCGCCCGTCAAGCCGCGTTTCTTGAGGTAGTCAATTGCGTGCGGTGTTTGTTTGAGTTGCTGGCGATAGAACTGTGCGGCGCGCTGCATGATTTCGACCAGGCTGGCGGCCTGCCTGGCGCGTTCCGGGTTGGCGGCAGGCCCCTCCGGCACGCTTATGCCCATTTGGCCGGCCAGTTCCCTGATGGCGTCCACATAGCCCAGCCCGGCGTATTCCATCAAAAAACCAATGGCGCTGCCGTGGGCGCCACAGCCAAAGCAATGATAGAACTGCTTGGTGGGCGAGACGGTGAAAGAAGGGGATTTTTCGTTATGAAACGGGCAACACGCCTGATAGTTGGCGCCGGCTTTTTTCAGCGGCACACGGCGGTCTATCACCTCCACGATATCCACGCGGTTCAGCAGCGTCTGGATAAAATCCTGCGGGATCATCTTGTTCCCTCGGGGTGGTGACGTAACAACGCCGCTACGCGGCGGACAATCAGCCCGCCAGCTTGGCTTTGATGTGGATGGAAACTTGCGCCATGTCTGCGCGCCCGGCGAGTTGTGTTTTTAGAAGCGCCATCACCTTGCCCATATCCTTGATGCCGGCAGCGCCGGTATTCGTAATGGCCTGGATGATCAGGCTATCTATTTCTTCAGCGGATGCGGCTTGAGGCATGTAA
GCTTGCAACACGCCGCTTTCGAATTTTTCGATGTCCGCCAGCTCCTGGCGTCCGGCAGCCTCAAACTGGGTAATCGAATCGCGGCGCTGCTTGAGCATTTTGTCGATCACGGCGATGATTTGTGCATCATCCAGTTCGATACGCTCATCCACCTCGCGTTGTTTGATCGCGGCGAGCAATAACCGAATTGCCCCCAGGCGTGCCGCATCCTTGGCGCGCATGGCGGTTTTCATGTCTTCGGTGATGCGTGCTTTGAGACTCATAAGCTTATTACCGGCTGGAACAGCGCCCGGTTTGATGATTAATACATCTTGGGTGGCAGGGTCTGGCTGCGGATGCGTTTGAAGTGACGCTTGACTGCGGCGGCCAGCTTGCGCTTGCGCTCGGCAGTGGGCTTTTCGTAAAACTCGCGTGCGCGCAGTTCGGTCAACAGACCGGTTTTCTCAACAGTGCGCTTGAAACGACGCATGGCAACTTCAAAAGGCTCGTTTTCCTTGACGCGAATGTTCGGCATGAAATCTTGTCTCCAGGGACGGAAAAAACCTCAATTATAACTGAAAAAGCGTTTTTTTCAAGGCTATGCATTGCCCTGTCCGTGTGGCGCGATTAAACTCCTGCCCATGCTTATTCTGGGAATCGAATCTTCCTGCGACGAAACCGGTATCGCCCTATACGACACCGGGCGCGGACTACTGGCGCACGCACTTCATTCCCAGGTTGCCATGCACGCCGAATATGGCGGTGTGGTGCCCGAGCTCGCCTCACGCGACCATATCCGGCGTGCGCTACCGCTGACCCGCCAGGTACTGGCACAGGCAGGGTGCACGTTGGCCGACATTGACGCGATTGCCTATACCGAAGGTCCCGGTCTGGCTGGCGCGCTGCTGGTAGGTGCCGGCATCGCCCATGCGCTGGGCGTGGCGCTGGGGGTGCCGGTGCTGGGGGTACATCACCTCGAGGGGCATTTGCTCTCGGCGCTGATTTCCGATACGCCGCCGCAATTTCCGTTTGTGGCGCTGCTGGTATCGGGCGGGCATACGCAATTGATGCAGGTCGACAGCGTGGGGCGTTACACCACGCTGGGCGATACCCTGGATGACGCTGCGGGCGAGGCGTTCGACAAGACCGCACAACTGCTTGGTTTGGGCTATCCGGGCGGGGCGGCGTTATCGACGCTGGCGCAGACCGGCGACCCGCAGCGCTTCAAGCTGCCGCGTCCGATGTTACATTCGGGCGACCTCAATTTCAGTTTCAGCGGTTTGAAAACCGCGGTGCTCACACTCACGCAAAAACATCCCGGTCCCGCTGACCGCGCCGACATCGCTGCTGCGTTTCAGCTTGCCATGGCCGAGGTGCTGACGGCCAAATCGCTGGCGGCGCTCAAACAAACCCGATCCAGGCGGCTGGTGGTAGCCGGTGGCGTGGGTGCCAACCGGCAGTTGCGCGAGGCCTTGAACGCAGGCGTTAGCAAACTGGGCGGTGCGGTATTTTTCCCGCGCCTGGAGTTTTGTACCGATAACGGCGCGATGATTGCCTTTGCCGGCGCGATGCGCCTGGTGCATGGCGGGCGTGCCGCAGGGGTGTTTACGGTACGGCCACGCTGGGACTTGCAGGAAATCCCGGCACCCCATAATCATCCGGGCACCGTCATGGCTTAAGATGCGGTGCCATGCCGTGCACCCAGCCAAGCAACAGCTTGCCGCCCGGCAGTTGCGCCAGCACCTCCGGGAACAAAACCAGGCCGAACAACAGCGCCAGCACGCAGATCAATAACAGGGTGGAAAACACGCTCTGCGGGCGTAACACCCCCCAGTAGCGCATGGCGAGGAACATGACATCCTTGCGATGCTCTGACATGCGCCGCCAGCGCAATGCCAGCCGCACCACCAAATACACCGCATAGCCGCCGGTGAGCGCGGCGAACACAATGCGGAACACGCCAAACAAATCAATGCTGCCTGCCACAAAGTGCTGGTACAGCCACGACACAAACCCCACCA
GGATGGAAGCGCTTACCGCAATGAGCACATTGAAAAACAGTCGCCGCCGCTCGCGCGACAAATCCTGCTGGCGCTGGATGAAAAAGCTGTCCACTTCCAGCTTAAAGTATTCCACCTTGTCCTGTACCAGCGCAGAAATTTCGCGTTTGGCCACCGCCATGCGTTCATCCAGAACAGCGCCCAGACGATCAGCAGCATAGTCCACCAGCTCGCGCACGTCGTCCTTGGTGAACTGGCGCTGGTTGTGCAATTCGGCAGAAATCTTGTCGAGCTTGGCGTCGATTTCCTGGCTGGCGCCCACGACCACGTCGCGTAATTCCGCACCGACGCTGGCTGCGCCTTCCTTGACCACGCTGCCCAGTTTGTCGCCCGCCAACTCGATGCTGTCTTTCGAGACCTGGGCCAGGCTCTCGCGCGCGTAGTTGATTTCTTTTTCAAACCAGGCCATGTCTGTCCCGGTTTATTCGGAAGAGGTGTCCGCCGGTTTGGCGAAGCGCCCTTCTTCACCCGCCAGCAGGCGGCGGATATTGGTGCGGTGACGCCAAAAAATCAGCGCACTGATGATGCATAACGCGACCACAGGCAGGGCGCCACCGATTAAATAGGTACCGAGTACTGGCGCCAGTGTGGCCGCAGTGAGTGCGGCCAGTGACGAGATGCGGGTGAGGACAAACACGACAAGCCAGCTGGCCAGCGTTGCCAAACCCAGCCACGGCGAAATCGCGAGCAGTATACCCAGCGCGGTGGCCACGCCCTTGCCCCCCTTGAAGCCGAAAAACAGCGGATATAAATGTCCGAGGAACACGGCGACCGCAGCGCCGTAAGTTGCGGCAATTTCCACGCCGTATTGGCTGCCAAAATAACGCGCCAGATACACCGCCAGCCAGCCTTTGGCCATGTCGCCGACGAGGGTCAGCAGCGCCGCTGATTTGCGCCCGGTGCGCAGCATGTTGGTCGCACCCGGATTGCCCGAGCCATGCTTGCGCGGGTCAGGCAGGCCGAATAGGCGGCTGACGATAACCGCGAAGGACAGCGAGCCGATCAGGTAAGCCGATACGATGAAAAATGAAATGAACATCAGGGATTCCGCTTAAGATATAGGATTTTCGCGCTTTGACAACTACGCATGGATATTCTGTTTCTCAAGGATTTCAGAGTCGAGCTCATTATCGGTATTTACGAGTGGGAACGCAAAGTACCCCAGCCGGTATTGCTCGACCTGGAAATCGGCCTGCCCAATAGTCGTGCCGGTGAAACCGACAATGTGGCAGATACCATTGACTATGGCCAGGTTGCCGCGCGTATCAGGGCGGCCTGTGCCGCACTGCGCCCAGCCCTGGTAGAGGCGCTGGCAGAGCATGTTGCACAATTGATACGCAATGAATTTGGCGCGCCCTGGGTCAGGGTCACCGTGACCAAGCTCGCCATCGTGCGCGGCGTCAAGGCGCTGGGCATCACCATCGAGCGCGGTCAGCGCGGATGCGTAATGAGTCATATGCAGCCAGCGCGCTGAATTAGCCGCGCGGATGATGCCTGGCGTGCAGCTGTTTCAGGCGCTCGCGTGCGACGTGCGTGTAAATTTGCGTGGTCGAAATATCCGCATGGCCGAGGAGCATCTGTACCACGCGCAAATCCGCGCCGTGGTTGAGCAGATGCGTGGCGAAGGCATGACGCAAAACGTGGGGCGAGGGCAGGCGCGCCAGCCCGGCCTGCTGCGCGCGGCGCTTGATGAGATACCAGAATGCCTGGCGCGTCATGGCCGTGCCGCGCCGGGTGACAAACAGCGCATCGCTGATCGTGCCTGCCAGTATCTGCGGGCGTGCGCCAGTCAGATAGCGCGCCAGCCAGAGCAGTGCCTCTTCACCCAGCGGTACCATGCGCTCCTTGCCGCCTTTGCCCATCACGCTTAGCACGCCCATGTCCAGACTCACATTTGCCACTCTCAGTGTCACCAGTTCGGAGACGCGCAGGCCGCTGGCGTAGAGGATTTCCAGCATGGC
CTTGTCGCGTAGCCCGAGTGGCTGTGATGTGTCGGGTGCGTTCAACAGCATATCCACGTCGGCCTCCGACAAGCTCTTGGGTAATGAGCGCGGCAGCTTGGGGGTATCGATTTTCAGTGTCGGGTCCAGCACGATACGGCCATCGCGCAGCGCCAGCCGATAGAAGCGTTTCAGTGCGGAGAGCAGGCGCGCAGTGCTGCGCGGGCTGGTTTTACGCGAAAAACGGTATTGCAGATAGGCCTCGATATCCGCCTGGCCAGCGTCCAGCAACAGCGTGCTGCGCAGTGCCTCCAGCCAGGCCGAGAACTGCGTCAGGTCGCGCCGGTAGCTTTGCAGCGTGTTGGGGGAGAGTCCATCCTCCAGCCACAGCAGGTCGCAAAAGCTGTCAAGCGCCTGCTGCGATACCGGATTCATGGTTGAGTAGCCAGTCCTTGTAAGCCAGCGGCGCCCCGCTTGCGGCGTGCATGAAGCCGCCGCGCCCGTTTGCTGCCACCACCCGGTGGCAGGGGATAATGATGGGCAGCGGATTGGCGCCACAGGCCTGACCCACCGCGCGCGGACTCGAATCCAGCCAACGCGCGAGCTGCCCGTAAGTGGTGGTATGGCCGGGCGGGATCGCAGTCAGCGCGCGCCATACCCTGACTTGATGCGCTGTGCCAAGGATTGCCAGCGGCAGGTCAAAGCTGCTGCCGGGGTTATCAAAATAGTGATTCAAGGCAGCGGCGACGCGGCGTGACAGCGGTGAGTCGGGGGCTTGCAAAGGGTAATCCGCAGGCAAAAAATCGATGCCGTGAAGTTTTTCATTGGCCACACTCATGCCGACACAGCCGAATGGCGTGGGCAGCACGGCCTGATAAGGTGATTGAATCGTGCGGCTTTTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_AP021884|622875:631682|630313_631213_-|WP_147070764.1|DBSCAN-SWA MNPVSQQALDSFCDLLWLEDGLSPNTLQSYRRDLTQFSAWLEALRSTLLLDAGQADIEAYLQYRFSRKTSPRSTARLLSALKRFYRLALRDGRIVLDPTLKIDTPKLPRSLPKSLSEADVDMLLNAPDTSQPLGLRDKAMLEILYASGLRVSELVTLRVANVSLDMGVLSVMGKGGKERMVPLGEEALLWLARYLTGARPQILAGTISDALFVTRRGTAMTRQAFWYLIKRRAQQAGLARLPSPHVLRHAFATHLLNHGADLRVVQMLLGHADISTTQIYTHVARERLKQLHARHHPRG >NZ_AP021884|622875:631682|629259_629880_-|WP_147070760.1|DBSCAN-SWA MMFISFFIVSAYLIGSLSFAVIVSRLFGLPDPRKHGSGNPGATNMLRTGRKSAALLTLVGDMAKGWLAVYLARYFGSQYGVEIAATYGAAVAVFLGHLYPLFFGFKGGKGVATALGILLAISPWLGLATLASWLVVFVLTRISSLAALTAATLAPVLGTYLIGGALPVVALCIISALIFWRHRTNIRRLLAGEEGRFAKPADTSSE >NZ_AP021884|622875:631682|624832_626566_-|WP_147070752.1|DBSCAN-SWA MIPQDFIQTLLNRVDIVEVIDRRVPLKKAGANYQACCPFHNEKSPSFTVSPTKQFYHCFGCGAHGSAIGFLMEYAGLGYVDAIRELAGQMGISVPEGPAANPERARQAASLVEIMQRAAQFYRQQLKQTPHAIDYLKKRGLTGEIAARFGLGYAPAGWQNLAAVFDQYADPALAEAGLVIVNDAGQRYDRFRDRIMFPIVGQRGDVIGYGGRVLDAAEPKYLNSPETPLFQKGNELYGLFQARRAIRDAGRVIVVEGYMDVVALAQHGVEYAVATLGTATTAAHVQKLLRHTDELVFCFDGDAAGRHAAWRALENSLAILSDGSRVGFLFLAPEHDPDSYIRAFGKGAFEALLGGEVVPLSAYLFKELAAQHDLASSEGRSAFLQAAHPLLSQIRAPALALLLRKRCAELANLQLSELDSLWQIKSRQFNPARAPVRAPRQAPASIWHWLLRAILFMPSLARELDASLIAPTDPDADALRAVVELLRVHPNLGTASVIDHFRDNALASILQRASAEIMGWAADLDISAEFADACYQARMQLDKQRGARVTDKPVSALSETEKAALRQLVNARQQVEK >NZ_AP021884|622875:631682|626615_627062_-|WP_147070754.1|DBSCAN-SWA MSLKARITEDMKTAMRAKDAARLGAIRLLLAAIKQREVDERIELDDAQIIAVIDKMLKQRRDSITQFEAAGRQELADIEKFESGVLQAYMPQAASAEEIDSLIIQAITNTGAAGIKDMGKVMALLKTQLAGRADMAQVSIHIKAKLAG >NZ_AP021884|622875:631682|622875_624762_-|WP_147070750.1|DBSCAN-SWA 
MANDHEKGEVKTVDVEARRTRLKNLIVLGKERGYLTYAEINDHLPDDMLDAEQIEGVISMINDMGIQVYDEAPDAETLLMSDAAPAVADEDVVAEAEAALSTVDSEFGRTTDPVRMYMREMGSVELLTREGEIEIAKRIEDGLKHMIQAISACPTTIQEILTLVDKVEREEIRIDDFVDGVVAEELEAIASEPEVDISELEEELDEDAEEDDGSALAAANLAQLKIDAMAHFEVIRKVFKRIQATLKKNGFGSPQYLQLQEELSAELMNIRFSAKQVEALCEGLRNLVEEVRSHERDIMEFAVNKSGMPRAHFIKVFPGNESNLDWVTKEINSKKAYSEDLTRYQHTIVERQQRLIALQEKVGIPIKDLKEINRQMSTGEARARRAKREMIEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKMNRISRQILQETGKEPDPELLAEKMEMTEEKIRKILKISKEPISMETPIGDDEDSHLGDFIEDTATLAPIDAAVYGSLQEVTKDILDGLTQREAKVLRMRFGIEMNTDHTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSDRLKSFIDNSGNS >NZ_AP021884|622875:631682|629925_630312_+|WP_147070762.1|DBSCAN-SWA MDILFLKDFRVELIIGIYEWERKVPQPVLLDLEIGLPNSRAGETDNVADTIDYGQVAARIRAACAALRPALVEALAEHVAQLIRNEFGAPWVRVTVTKLAIVRGVKALGITIERGQRGCVMSHMQPAR >NZ_AP021884|622875:631682|627419_628457_+|WP_147070756.1|tRNA|DBSCAN-SWA MLILGIESSCDETGIALYDTGRGLLAHALHSQVAMHAEYGGVVPELASRDHIRRALPLTRQVLAQAGCTLADIDAIAYTEGPGLAGALLVGAGIAHALGVALGVPVLGVHHLEGHLLSALISDTPPQFPFVALLVSGGHTQLMQVDSVGRYTTLGDTLDDAAGEAFDKTAQLLGLGYPGGAALSTLAQTGDPQRFKLPRPMLHSGDLNFSFSGLKTAVLTLTQKHPGPADRADIAAAFQLAMAEVLTAKSLAALKQTRSRRLVVAGGVGANRQLREALNAGVSKLGGAVFFPRLEFCTDNGAMIAFAGAMRLVHGGRAAGVFTVRPRWDLQEIPAPHNHPGTVMA >NZ_AP021884|622875:631682|627100_627313_-|WP_124706067.1|DBSCAN-SWA MPNIRVKENEPFEVAMRRFKRTVEKTGLLTELRAREFYEKPTAERKRKLAAAVKRHFKRIRSQTLPPKMY >NZ_AP021884|622875:631682|628446_629247_-|WP_147070758.1|DBSCAN-SWA MAWFEKEINYARESLAQVSKDSIELAGDKLGSVVKEGAASVGAELRDVVVGASQEIDAKLDKISAELHNQRQFTKDDVRELVDYAADRLGAVLDERMAVAKREISALVQDKVEYFKLEVDSFFIQRQQDLSRERRRLFFNVLIAVSASILVGFVSWLYQHFVAGSIDLFGVFRIVFAALTGGYAVYLVVRLALRWRRMSEHRKDVMFLAMRYWGVLRPQSVFSTLLLICVLALLFGLVLFPEVLAQLPGGKLLLGWVHGMAPHLKP >NZ_AP021884|622875:631682|631184_631682_-|WP_147070766.1|DBSCAN-SWA MKSRTIQSPYQAVLPTPFGCVGMSVANEKLHGIDFLPADYPLQAPDSPLSRRVAAALNHYFDNPGSSFDLPLAILGTAHQVRVWRALTAIPPGHTTTYGQLARWLDSSPRAVGQACGANPLPIIIPCHRVVAANGRGGFMHAASGAPLAYKDWLLNHESGIAAGA |
10 | Vibrio_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
751882 : 760457
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP021884|751882:760457|DBSCAN-SWA ATTACGCGCCGACTTCTGCCGTAACGGCCATCGGTGCGACACCAATCGCCGCGCGCACTTTATTTTCAATCTCGTGGGCGACCTCGGGGTGCTCGCGCAGGTATTCGCGCGCGTTGTCTTTGCCCTGGCCGATTTTTTCGCCGTTATAGGCATACCAGGCGCCGGATTTTTCCACCAGTTTGTGCTCCACGCCGAGCTCGATGATTTCCCCCTCGCGCGAGATGCCTTCGCCGTAAAGGATATCGAATTCAGCCTGCTTGAATGGCGGCGCGACCTTGTTCTTGACGACCTTGACGCGAGTCTCGGAGCCGATCACTTCGTCGCCTTTCTTGATTGCGCCGGTGCGGCGGATGTCGAGGCGCACCGAGGCGTAGAATTTGAGTGCATTGCCGCCGGTGGTGGTCTCCGGGTTGCCGAACATGACGCCGATTTTCATGCGGATCTGGTTGATGAAGATCACCAGGGTATTGGTGCGCTTGATGTTGCCGGTGAGCTTGCGCAACGCCTGGCTCATCAGGCGGGCTTGCAGGCCCATGTGCGAGTCGCCCATTTCGCCTTCGATTTCGGCCTTGGGAGTCAACGCCGCCACCGAGTCTATCACCACCACGTCCACCGAGCCGGAGCGCACCAGCATGTCGGCAATTTCCAGCGCCTGCTCACCGGTGTCGGGCTGCGAGATGAGCAGGTCGGAGACATTCACCCCGAGTTTTTGTGCGTATTGCGGGTCGAGCGCGTGCTCGGCATCAATGAACGCTGCGGTGCCGCCCAGTTTCTGCATTTCGGCGATGACCTGCAGGGTGAGCGTGGTTTTGCCGGAGGATTCCGGACCGAAGATTTCAACCACGCGGCCGCGCGGCAAACCGCCGACGCCCAGCGCAATATCTAAGCCCAGGGAGCCGGTGGAGACCACCTGAATATCGCGCACGACTTCGCCATCGCCGAGGCGCATGATGGAACCTTTGCCGAAGCTTTTTTCGATTTGTGCCAGGGCGGCGGCGAGGGCTTTGCTTCTGTTTTCGTCCATGTGTTTTTCCTCAAATTAGTGCGGGATTATGGCATAAACCGTTAGAACCCGTTAGCCACGGGCGAATGGTTTTTGTCTAGTCCAGCAACTCAATCACCCCGCGCAACGCGGCAGCCACGGCGCGCGCGCGAATTTCATCGCGGTCGCCGCATAACCTGCAGGTGGTGGCGAGGCGGGTACCGTCCTGCATCGCCCAGGCTATGCACACGGTGCCGACGGGTTTTTGCGGGGTGGCGCCGCCGGGCCCGGCAATGCCGGAGATGGCCAGGGCGATCTGCGCGCGGCTGTGGGCGAGTGCGCCCTGCGCCATTTCCAGTACGGTGGGTTCGGATACCGCGCCCGATGCTTGCAGCGTGGCGTTTTTCACCCCCAGCATGTCGTGTTTGGCGGCGTTGCTGTAGGTGATGAAGCCGCGCTCATACCAGGCCGAGCTGCCCGGCACGGCGGTGATCAGCATGCCCGCCCAGCCGCCGGTGCAGGACTCGGCGCTGGCGAGCATGATGCCGCGCCGGCTGAGGGCCTGGCCGGTTTGTTCGGCCAGTTGGTAGAGTGCTGCGTCGGTCGGTCTCATGGCAGAATTTTTTGCGCGAGGAACAGTGCCAGCAAAGTGTAGCCAGCCGCCAGCAAGTCGTCGAGCATGACGCCGAAGCCATTTTTCAGGCGCGCATCGAATTGGCGGATGGGAAACGGCTTCCAGATATCGAACAGCCGGAACAGGCCAAAGGCAGCGGCGACCCACAGCGGCGTTTGCGGGGTGGCAGCGAGCACGATCCAGAACGCGGCAATCTCATCCCAGACGATGCCGCCGTGATCCGCCACGCCCAAATCGCGTCCGGTTTTGCCGCAAATCCAGATGCCGGCGACAACCGCGATGCCGATGATGAGGTACAGCTGGGTCGGC
GTGGCGAACCAGGCCAGCAGGTAGTACAGCGGCAGCGCGGCCAGGGTGCCAAACGTGCCCGGCGCCCTGGGTGCGAGTCCGCTGCCCAGACCAAAGGCGAGAAAATAGGCCGGGTGGCGGGTGATGAAACGCCAGTCAGGCGGAAAAGTGGTCATAGCCGGTGTGGCGCATATCCAGAACTTGTCCTTGTGCATCGCGCACTATCAGGCCTGGCTCGGCGCGAATGCTGCCGATGGCAGTGAGCCGCACGCCGAGGCGGGCGGCGATTTCGCCGAGTGCCTTACGGTGTGCGACGGGTGCAGTGAAACACAGCTCGTAGTCGTCACCGCCGCTCAGTACGCAGGCATCAAATTCCGGATGTGCGGCATAGTCATGGACGATTTCACCCAATGGCAAATGCGTATATTCAACGATGGCACCTACACCGGAGCGCGCCAGAATATGCCCCAAGTCAGCCAGCAGGCCATCCGACACATCGATTGCGCTGCGCGCCAGCCCGCGCAAGGCCAGGCCCAGTTCGACACGCGGCGTGGGGGTATATAGGCGCGCTGCCAGGGTGATCAGATCGGCGTCAGTCAGATTGACCCGGCCGTGCAGGGCGGCAAGTGCCAGTGCTGCGTCACCCAGCGTGCCGGATACCCAGATTTCATCGCCCGCCTGAGCGCCGTCGCGACGCAGGGCCTGGTTGGGCGGCACCTCGCCCAGGATAGTGAGGGTGAGGCTCAATGCGCCGCGCGTGGTGTCGCCACCCACCAGGCTCACGCCAAATTGATCGGCACAGCGATACAGTCCCGTTGCGAACGCCGCCAGCCAATCATCGTCTACTTCCGGCAGTGTCAGCGCCAGCGTCGCCCAGCGCGGCGCGGCACCCATCGCGGCGAGATCGGAGAGATTGACCGCGAGGCTTTTCCAGCCGAGCTTTTCCGGGTCGGCATCGGCGAAAAAATGCACATCGGCGACCAGGGTGTCGGTGGAAACGGCGAGCTGCATCCCGGTTGCGGGTTGCAGCAGGGCGCAATCGTCGCCCACTCCCAGCACCGCGCCGGGTGTGGCGCGGGAGAAATGACGCTGAATCAGGCCGAATTCGGAAGTCATGGATGTTGCATTGCCATCCTGGCGGTTAGCTTGCGGTTTTCTGGGTCATCAACCCTGGCGGCGCGGCGCGTTCACTTCCACTGTGCGCACCTCGGCAGCCAGCTTGTCCATCACGCCGTTGACGTATTTGTGGCCATCGGTACCGCCGAATATCTTGGCGAGTTCGACCGCTTCGTTGATGACGACGCGATACGGCACTTCCAGATGATGCAGCAGCTCCTGCGTGCCCAGCAGCAGAATGGCATGCTCCACCGGGCTCAGTTCCGCAGGCTTGCGATCCAGATAGGCAGCGAGCCGCAGGTCCAGCGCCGGTGCTTCATCGACCACGCCATTCAGTAGTGCCAGGAACATTTTTTCATCAATGTTGCGATACACGGGATCGTCGCGCAGCTGTTTGACGATATCGGCAGTCGGCTGATGGTTGAGCAGCCACTGATACACGCCCTGCACTGCGAATTCGCGGGCCTTGCGACGATTGCCGCTCATAGCTGTTTGAGCAGGTTGGCCATTTCGATCGCGCATTCGCCCGCTTCGGCGCCTTTTACCGACATGCGCGAGGTGGCCTGGTGATCGGTATCGGTGGTCAATACGCCATTGGCAATAGGCACGCCGGTATCGAGCTGGATACGCGCGATACCGTTGGCCATTTCGTTGGCAACCACCTCGAAATGATAGGTATCGCCGCGCACCACGGCGCCCAGCGCCACCAGCGCGTCGAATTTTCCGCTCATGGCCATTTTGCGCAGCGCCAGCGGAATTTCCAGCGCACCCGGCACGGTGGCGAGGAGCAGGTTGCCGGTTTTCACGCCGCGTTTGCCAAGTGCGGTGGTGCAAGCCGCCAGCAAACCCTCGCAAATGTCCATGTTGAAGCGGCTCATGACGATGCCGATACGCAATGCGCTGCC
GTCGAGACTGGATTCAAGTTCGGGAATATCGTCGTAGCTTGCCATGATTTATTTCTCCTGATGAGACTGGATTGCATGTATTGCCTGTGCAAGGCGGGGGCGGGTTCAGGTGTTCTCGTCGTAGCCTGTCACTTCCAGATCGAATCCCGCCAGCGACGGCATTTTGCGCTGAGTAGCCAGCAGGCGCATTTTGCCCACGCCGACGTCTTTCAAAATCTGCGCGCCAATGCCATGGTTGCGCGCGTCCCATTTTTGCGGAAGCCTGACACCGGCTTCAGGCATAGCGCGGGTGAGCAGTTCTGCTGCGCTCTCCGGACGGTGCAGCAGGACGACAACGCCCTTGCCCACCGCAGCGATTTTTGCCAGGGCCTGGTTGACACTGTAGGCATGGGTACGGCTGCCGACTTCGAGCATGTCCATCACCGACACTGGCTCGTGTACCCGCACCAGCGTTTCGCTGGCGGCGCTGATTTCGCCCTTGACCAGGGCCAGATGAGCTGCCCCGGAGATTTTTTCACGGTAGGCGATGAGCTGGAATTCGCCGTAAACCGTCTCGATACAGCGGCTGCCTGCACGCTCCACCAGGCTTTCATTGTGGCTGCGGTAGTGGATCAGGTCGACGATTGCGCCAATTTTCAGGCCGTGAATTTTGGCGTATTCCAGCAAATCCGGCAGACGCGCCATGGTACCGTCATCCTTGAGGATTTCGCAAATCACTGCGGCAGGTTCCAGGCCGGCCAGCCCGGCCAGATCGCAGCCTGCCTCGGTGTGGCCGGCACGAATCAGCACGCCGCCGGGTTGGGCGCGCAGCGGAAAAATGTGGCCCGGCTGGATGATGTCGGCGGCCTTGGCGTGTTTGGCGACGGCGGCCTGAATGGTAAGCGCCCGGTCGGCGGCGGAAATGCCGGTGGTAACCCCTGTGGCGGCTTCGATGGAGACAGTGAAAGCAGTGCTATAGGGCGTCTGGTTATCGGCCACCATCTGGCGCAAGCCCAGTTGCTTGCAGCGCTCGTCGGTCAGCGTCAGGCAAATCAGGCCGCGTCCGTGCTTGGCCATGAAGTTGATCGCTTCGGGGGTGGCGAATTCGGCCGCCATCACCAGATCGCCCTCGTTTTCGCGGTCTTCCTCGTCCACCAGTACGACCATTTTTCCGGCTTTCAGGTCGGCGATGATGTCTTCGATAGGGCTCAGGCTCATGTTTGATCTCGATAATTAAGGATGCGTTCGGCGTAGCGCGCCATCATGTCCACTTCCAGATTGACCCGGCTGGCAGGTTGGAGCGTATGCAGATTGGTATGTTCCAGCGTATGCGGAATAAGGTTGATGCTGAAACGGTCGCCGTCTACGCGATTAACGGTGAGGCTCACACCGTTGACGGTGATGGAACCTTTGCTGACGACAAAGCGGGCCAGATCGCCGGGAGCTCGGATGACCAGTTCGAAGCAATCCCCGGCCGGCGCGAAATGCAGCACCTCACCCACCCCGTCCACATGGCCGGAAACCAGGTGTCCGCCGAGACGGTCGGACAGGCGCAGCGCCTTTTCCAGATTGACCAGGCCGTGTTCAGGAAAGCCCGTCGTACAGCGGAAAGTCTCTGCCGATACGTCTACCGAGAAGCTCGCCGCACCCAGCGCCACGACGGTCAGGCACACACCGTTGCAGGCGATGGAGTCACCTGGCGCCACGTCGCTCAAATCCAGATGGGCGGCGTCGATCACCAGGCGTGCATCCGCGTTTCTGGGTTCCACTGCCGCCACCTTGCCTACTGCCTGAATAATGCCTGTAAACATGTTAGTTCTTTTCAAATTGTGCAGTGATGCGTATATCGGCACCGACCTGACGTATGTCGCGCAGCACCAGTTTGCGGCGCTCTTGCATCTGCGCCGGTTCGGCCAGGGAAAACAGGCCACGCGCCGTGTCACCCAGCAATACCGGGGCGACATACATCACCCATTCATCCACAAATCCAGCCGCGATCAGCGCGCCGTTGAGTGTGGCAC
CGGCTTCGGTCATCACTTCATTTATCCCGCGCTGCGCCAGGAGTGACAATAGCGCCCCCAGATCAACCTGCCCGGCATCACCCGGCAGACAGCGGATTTCTGCCCCGGCGGCTTCCAGTCCGGCGCTGCGCCGGGCGTCCGGTTCGGCACAGGCAATCAGGGTCGGGGCACCGCCCAGTATATTCGCCGAGGGCGGGGTTTGCAGGCGCGAATCGACGATGACCTTGAGCGGCTGGCGCGTGGTTTCCACTGCGCGCACATTCAGTTCCGGATTATCCGCCAGCACCGTACCGATACCGGTCAGAATGGCGCATGAACGCGCACGCAGCCGGTGCACGTCGCGGCGCGCAGGCTCTCCGGTGATCCATTTGCTGGCACCGCCGGATAAGGCCGTTTTGCCATCCAGCGAACTGGCGGTCTTGATGCGCAGCCACGGATGCCCCATGACCATGCGCTTGATAAAGCCCGCGTTGAGTTCACGCGCCTGCGCTTCGAGCAGGCCGCATTCAGTCGCAATCCCGGCCTTTTGCAGCAGGGCCAGACCATTGCCCGCCACCTGGGGGTTGGGATCCTGCATGGCGGCCACGACGCGTGCCACGCCTGCGGCTATCAACGCCTCGGCGCAGGGCGGGGTGCGCCCATGGTGGCTGCACGGCTCAAGCGTGACGTACACCGTGGCACCGCGCGCTGCGTCGCCGGCCTCGCGCAGCGCGTGGATTTCGGCATGCGGCTGTCCGGCCTGCTGGTGCCAGCCGCTGCCGACCATCGCTCCATTCCTGACAATCACACAGCCCACGCGCGGATTCGGGCTGGTAGTGTACAGGCCGTGCTCCGCAAGCTGCAGCGCGCGCGCCATATGGATGTAATCTGTCTGCGAAAACACAGGTTTATTTGTCGAAGTCCTTGAGCACGTCGCGGAAGTCGCCCACATCCTGGAAGCTCTTGTACACTGAGGCAAAGCGGATGTAAGCGATCTTGTCCAGGCGCTTCAACTCGTTCATCACCATCTCGCCAATCTGGCGCGCGGGCAATTCACGTTCGCCCAGCGACAGCACTTGTTTGACGATGCGCCCAATCGCCGCATCCACATATTCGGTCGGTACCGGGCGCTTGTGCAGCGCGCGGCGAAAGCCCTCGTGCAGTTTTTCCTGGCTGAATTCCTGGCGCACGCCGTTGCTTTTAATCACCTGCGGCAGGCGCAGTTCTATGGTTTCGTAGGTGGTAAAGCGCTTGTCGCAAGACGTGCAGCGGCGCCGGCGACGAATCGAGTCGCCGGCTTCGGAAAGGCGCGAATCAACGACCTGGGAGTCGAACGCGCTGCAGAACGGACATTTCATGAAGACGGTTTGTAGCGGGTAAGGTAATAGTCAGCAGCAGGCAGCCTGGTTGGGCGGCACGCTACCCGTTCATCCGTACACCGGATATTTCTTGCACAAGGCTTGCGCGGCAGTGGCCGCGCGCCCAATGACCGCCTCGTCATTGGGCGCGTCGAGCACATCGGCAATCAGGTGCGCCAGTTGCTCGGCTTCCAGTTCCTTGAAACCGCGTGTGGTCATTGCCGGCGTGCCGATGCGGATGCCGGAGGTGACGAAGGGTTTTTGCGGATCGTTGGGGATGGCGTTTTTGTTGACCGTGATGTGCGCCCGTCCGAGCGCGGCTTCTGCCTCCTTGCCGGTAATGCTTTTGGCCTGCAAGTCCACCAGAAACAGATGCGAATCGGTGCGGCCGGAGACAATGCGCAGGCCGCGTTCCTGCAGCACCTTCGCCATCACGCGGGCGTTATCGATCACCTGCTCCTGGTACAACTTGAAGTCCTTGCCCATTGCCTCCTGAAACGCCACTGCCTTGGCGGCGATGACGTGCATCAGCGGACCGCCCTGCAAGCCCGGGAAGATGGCGGAATTGATCGCCTTTTCGTGTTCGGCTTTCATCAGGATAATGCCGCCTCTGGGACCGCGCAGCGTCTTGTGCGTGGTGGAGGTTACCACATCCGCATGCGG
TACCGGATTGGGATACACCCCCGCCGCGATCAGTCCGGCGTAGTGCGCCATATCCACCATGAAAATCGCGCCGACTTCCCTGGCTATTTTGGCAAAGCGCTCGAAGTCGATGTGCAGCGAATACGCCGAGGCGCCAGCGATGATGAGTCTGGGCTTATGCTCACGGGCAAGCGCTTCCATACGCGGGTAATCGATTTCTTCTTTTTTATTCAGGCCGTAGGCCACGGCGTTGAACCACTTGCCCGACATGTTGAGCGCCATGCCGTGGGTGAGGTGTCCGCCTTCAGCCAGGCTCATGCCCATGATGGTATCGCCCGGCTTGAGGAAGGCCAGGAATACCGCCTGGTTGGCCTGCGAGCCAGAATGCGGCTGCACGTTGGCGGCTTCCGCACCGAATAATTTCCTGATACGGTCAATTGCCAGTTGTTCGGCGATATCCACATATTCGCAGCCGCCGTAGTAGCGCTTGCCGGGATAGCCTTCCGCGTATTTGTTGGTCAGCACCGAACCCTGGGCTTCCATTACCGCCGGACTGGCATAATTTTCCGAAGCGATCAGCTCGATGTGATCTTCCTGGCGGCCGCGTTCTGCCTCCATGGCTTTCCAGAGAGCGGGATCGGTTTGGGCGAGAGTGTGCTGAGGGTTAAACAT
Protein sequences of DBSCAN-SWA_2 >NZ_AP021884|751882:760457|753941_754901_-|WP_147074507.1|DBSCAN-SWA MTSEFGLIQRHFSRATPGAVLGVGDDCALLQPATGMQLAVSTDTLVADVHFFADADPEKLGWKSLAVNLSDLAAMGAAPRWATLALTLPEVDDDWLAAFATGLYRCADQFGVSLVGGDTTRGALSLTLTILGEVPPNQALRRDGAQAGDEIWVSGTLGDAALALAALHGRVNLTDADLITLAARLYTPTPRVELGLALRGLARSAIDVSDGLLADLGHILARSGVGAIVEYTHLPLGEIVHDYAAHPEFDACVLSGGDDYELCFTAPVAHRKALGEIAARLGVRLTAIGSIRAEPGLIVRDAQGQVLDMRHTGYDHFSA >NZ_AP021884|751882:760457|757597_758737_-|WP_147074502.1|DBSCAN-SWA MWATSATCSRTSTNKPVFSQTDYIHMARALQLAEHGLYTTSPNPRVGCVIVRNGAMVGSGWHQQAGQPHAEIHALREAGDAARGATVYVTLEPCSHHGRTPPCAEALIAAGVARVVAAMQDPNPQVAGNGLALLQKAGIATECGLLEAQARELNAGFIKRMVMGHPWLRIKTASSLDGKTALSGGASKWITGEPARRDVHRLRARSCAILTGIGTVLADNPELNVRAVETTRQPLKVIVDSRLQTPPSANILGGAPTLIACAEPDARRSAGLEAAGAEIRCLPGDAGQVDLGALLSLLAQRGINEVMTEAGATLNGALIAAGFVDEWVMYVAPVLLGDTARGLFSLAEPAQMQERRKLVLRDIRQVGADIRITAQFEKN >NZ_AP021884|751882:760457|754949_755387_-|WP_147074506.1|DBSCAN-SWA MSGNRRKAREFAVQGVYQWLLNHQPTADIVKQLRDDPVYRNIDEKMFLALLNGVVDEAPALDLRLAAYLDRKPAELSPVEHAILLLGTQELLHHLEVPYRVVINEAVELAKIFGGTDGHKYVNGVMDKLAAEVRTVEVNAPRRQG >NZ_AP021884|751882:760457|755383_755851_-|WP_147074505.1|DBSCAN-SWA MASYDDIPELESSLDGSALRIGIVMSRFNMDICEGLLAACTTALGKRGVKTGNLLLATVPGALEIPLALRKMAMSGKFDALVALGAVVRGDTYHFEVVANEMANGIARIQLDTGVPIANGVLTTDTDHQATSRMSVKGAEAGECAIEMANLLKQL >NZ_AP021884|751882:760457|753472_753961_-|WP_147074508.1|DBSCAN-SWA MTTFPPDWRFITRHPAYFLAFGLGSGLAPRAPGTFGTLAALPLYYLLAWFATPTQLYLIIGIAVVAGIWICGKTGRDLGVADHGGIVWDEIAAFWIVLAATPQTPLWVAAAFGLFRLFDIWKPFPIRQFDARLKNGFGVMLDDLLAAGYTLLALFLAQKILP >NZ_AP021884|751882:760457|756999_757596_-|WP_147074503.1|DBSCAN-SWA MFTGIIQAVGKVAAVEPRNADARLVIDAAHLDLSDVAPGDSIACNGVCLTVVALGAASFSVDVSAETFRCTTGFPEHGLVNLEKALRLSDRLGGHLVSGHVDGVGEVLHFAPAGDCFELVIRAPGDLARFVVSKGSITVNGVSLTVNRVDGDRFSINLIPHTLEHTNLHTLQPASRVNLEVDMMARYAERILNYRDQT >NZ_AP021884|751882:760457|752981_753476_-|WP_147074509.1|DBSCAN-SWA 
MRPTDAALYQLAEQTGQALSRRGIMLASAESCTGGWAGMLITAVPGSSAWYERGFITYSNAAKHDMLGVKNATLQASGAVSEPTVLEMAQGALAHSRAQIALAISGIAGPGGATPQKPVGTVCIAWAMQDGTRLATTCRLCGDRDEIRARAVAAALRGVIELLD >NZ_AP021884|751882:760457|751882_752905_-|WP_147074510.1|DBSCAN-SWA MDENRSKALAAALAQIEKSFGKGSIMRLGDGEVVRDIQVVSTGSLGLDIALGVGGLPRGRVVEIFGPESSGKTTLTLQVIAEMQKLGGTAAFIDAEHALDPQYAQKLGVNVSDLLISQPDTGEQALEIADMLVRSGSVDVVVIDSVAALTPKAEIEGEMGDSHMGLQARLMSQALRKLTGNIKRTNTLVIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRTGAIKKGDEVIGSETRVKVVKNKVAPPFKQAEFDILYGEGISREGEIIELGVEHKLVEKSGAWYAYNGEKIGQGKDNAREYLREHPEVAHEIENKVRAAIGVAPMAVTAEVGA >NZ_AP021884|751882:760457|758693_759143_-|WP_147074501.1|DBSCAN-SWA MKCPFCSAFDSQVVDSRLSEAGDSIRRRRRCTSCDKRFTTYETIELRLPQVIKSNGVRQEFSQEKLHEGFRRALHKRPVPTEYVDAAIGRIVKQVLSLGERELPARQIGEMVMNELKRLDKIAYIRFASVYKSFQDVGDFRDVLKDFDK >NZ_AP021884|751882:760457|755911_757003_-|WP_147074504.1|DBSCAN-SWA MSLSPIEDIIADLKAGKMVVLVDEEDRENEGDLVMAAEFATPEAINFMAKHGRGLICLTLTDERCKQLGLRQMVADNQTPYSTAFTVSIEAATGVTTGISAADRALTIQAAVAKHAKAADIIQPGHIFPLRAQPGGVLIRAGHTEAGCDLAGLAGLEPAAVICEILKDDGTMARLPDLLEYAKIHGLKIGAIVDLIHYRSHNESLVERAGSRCIETVYGEFQLIAYREKISGAAHLALVKGEISAASETLVRVHEPVSVMDMLEVGSRTHAYSVNQALAKIAAVGKGVVVLLHRPESAAELLTRAMPEAGVRLPQKWDARNHGIGAQILKDVGVGKMRLLATQRKMPSLAGFDLEVTGYDENT >NZ_AP021884|751882:760457|759212_760457_-|WP_147074500.1|DBSCAN-SWA MFNPQHTLAQTDPALWKAMEAERGRQEDHIELIASENYASPAVMEAQGSVLTNKYAEGYPGKRYYGGCEYVDIAEQLAIDRIRKLFGAEAANVQPHSGSQANQAVFLAFLKPGDTIMGMSLAEGGHLTHGMALNMSGKWFNAVAYGLNKKEEIDYPRMEALAREHKPRLIIAGASAYSLHIDFERFAKIAREVGAIFMVDMAHYAGLIAAGVYPNPVPHADVVTSTTHKTLRGPRGGIILMKAEHEKAINSAIFPGLQGGPLMHVIAAKAVAFQEAMGKDFKLYQEQVIDNARVMAKVLQERGLRIVSGRTDSHLFLVDLQAKSITGKEAEAALGRAHITVNKNAIPNDPQKPFVTSGIRIGTPAMTTRGFKELEAEQLAHLIADVLDAPNDEAVIGRAATAAQALCKKYPVYG |
11 | Staphylococcus_phage(42.86%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
882982 : 892137
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_AP021884|882982:892137|DBSCAN-SWA CCTAGGCGGGCATCAGCACGGTCAGCCCCCCCATGTAGGGACGCAACACTTCGGGAAGCGCCACCGATCCATCCGCCTGCTGATGATTTTCCAGAATCGCGACCAGGGTGCGCCCTACCGCCAGCCCGGAGCCGTTCACGCTGTGCAGCAGTTCCGGCTTGCCTTTTTCACCTCTGAAGCGCGCTTGCATGCGGCGTGTCTGAAACGCCTCGAAATTGCTGCACGAGGAAATTTCGCGATAAGTATTTTGCGCCGGCAGCCACACTTCCAGATCGTAGGTCTTGGCGGCAGAAAAACCCATGTCGCCGCCGCACAGCGCCATTTTCCGGTAGGGCAGCCCGAGTGCTTGCAGGATGGCCTCGGCATGGCCGGTGAGTGCTTCCAGCGCGGTGTAGGATTGCTCCGGTTCGACCAGTTGCACCAGCTCCACCTTGTCGAACTGATGCTGGCGGATCATGCCGCGGGTGTCGCGGCCGTAGGAACCGGCCTCGGAGCGGAAGCAGGGGGTGTGGGCGACGAATTTCAGCGGCAGTTGCTCGCGCGCCACGATCGCGTCGCGCACCATGTTGGTCAGCGGCACTTCGGCGGTGGGGATGAGGTAGAGTTTTTCCGCATCCGCACGCGGCACGTGAAACAGATCTTCCTCGAACTTGGGCAACTGCCCGGTACCGCGCATGGAGTCGGCGTTGACCAGATACGGCACATACACTTCGGTGTAGCCATGCACAGCCGTGTGGGTGTCCAGCATGAACTGCGCCAGCGCCCGGTGCAGCCGCGCCAGCCCGCCGCGCAGCAGTGAAAAGCGCGCGCCGGCCAGTTTGCTGGCGGTCTCGAAATCCAGCCCCAGCGCGGTACCTACGTCCACGTGATCTTTCACCGCAAAATCAAACACGCGCGGTGTGCCGACACGTGCGATCTCTACGTTGTCGGCGTCGGATTTACCCGTCGGCACCGATGCATGCGGCAGGTTGGGAATGGTCATCAACAGCGCATTGAGCCGGGCTTGCAGGGCTTCCAGCGCGGATTCCGCGGCTTTCAGTTCGGCGCCGAGATTCGCCACTTCCGCCATGATGGTGGAGACATCCTCGCCCTTGGCCTTGGCCATGCCTATCTGTCTGGAGCTGGCGTTGCGCCTGGCCTGCAGCTCCTGGGTGCGGGTTTGCAGTTGTTTGCGCTCGGCTTCCAGGCGCTGGAATTCGGCAGTGTCCAGGGTGTAGCCGCGCATGGCAAGGCGTTGCGCCACGTCGTCGAGGTCGTTGCGGAGGTGTTGAATGTCTAACATTATTTTTGCCCTGTTTTCTTGTTGGCTTGTTCATCCAGCTTGCGCAAATACGCCAGCCGTTCGGCGATCTTGCCTTCCAGCCCGCGCGGGGTTGGTGCGTAAAAGTGCGCGTTGACACCTTCCGGTAAATAGTCCTCCCCGGCGGCGTAGGCGTCCGGTTCGTCGTGCGCGTAGCGGTAGGCGTGGCCGTAGCCCAGTTGTTTCATCAGTTTGGTGGGGGCGTTGCGCAGGTGCACCGGCACCTCGCGCGATTTGTCCGCCGCCACAAAGCTGCGGGCGTTATTGTACGCCACGTACACGGCGTTGCTCTTGGGCGCGCAGGCGAGATAAATCACCGCCTGCGCCAGCGCCAGTTCGCCTTCCGGACTGCCCAGGCGCTGGTAGGTTTCCACCGCGTCCAGGGTCAGGCGCAGCGCGCGTGGGTCGGCCAGACCGATGTCCTCGCTGGCCATGCGGATCAGGCGGCGGCCGACGTACAGCGGATCGGCACCGCCGTCGAGCATGCGCACCATCCAGTACAGCGCGGCGTCGGGATGGGAACCGCGCACCGATTTGTGCAGCGCGGATATCTGGTCGTAGAAGTTGTCGCCGCCCTTGTCAAAACGGCGTGCGCCGCGCGCCAGCGTGGTCTGGATGAAATCCT
CGTCAATCTCATGACGGGCGGCATCGAGTGCGGCATTGGCGGCTTGTTCCAGCAGGTTCAACAGGCGCCGCGCGTCGCCGTCGGCGTAGCCGGTGAGCTGGGCGCGCGCCGCCCCGGTGATGGCAATGTCCGGATAGGTGCTGATCCGCGCGCGCTCCAGCAGGGCGGCGAGGTCGGTTTCCACGATGGGCTTTAACACATACACCTGGGCGCGCGAGAGCAGCGCGCTATTGACCTCGAACGAGGGATTCTCGGTGGTGGCGCCGATGAAAGTAATCAACCCGGCTTCAACGAACGGCAGGAAAGCGTCCTGCTGGGATTTGTTGAAGCGGTGCACTTCGTCCACAAACAGCAGGGTGCGGCGCCCCTGGCCTTGCATCATTTCGGCGCGCGCCACCGCCTCGCGGATTTCCTTGACCCCCGAGAGCACGGCGGACAGCGCGATGAACTCCATGTCAAAACCGTGGCTCATCAGCCGTGCCAGTGTGGTCTTGCCTACGCCCGGCGGCCCCCACAGGATCATCGAGTGTGGCTTGCCGGATTCGAATGCGACGCGCAGCGGCTTGCCGGGGCCGAGCAAATGCGTCTGTCCGATCACTTCGTCCAGATTGCGCGGCCGCAGCCGTTCGGCCAGCGGCGCGCTGTCCAGCGGATGATCGAACAGGTCGGCGTGGCTCACTTGCCGTCGGAGATGACGTCCACGCCCTGGGGCGGGGTGAAATGGAAATCGCTGGCGGGAAGGGCCGGGTTACGTTCCAGCCCGGCGAATTTCAGCACCGTGGTCTGGCCGAAGTTGTCCTTGATCTCCATCGCCACCAGCGTGTTTTTGCTGAATCCCATGCGCACGTTCTCAAACGCGCTTTCCTTGTCGCGCGGGCGCGCGTCCAGCCATTCCAGACCGTCGCGGCTGCCGGCGTCCGTGATCGTGTAAAACCTGCCGATGTCCTTGCTGCCCGCCAGCAAGGCCGCCGGGCTGCTGCCCAGCGCCTGGCCCAGTTTCTTGATGGTGACCTGTTGCAGATCGGCGTCGTACAGCCAGATTCTTTTGCCGTCGCCGACGATTATCTGTTCATAGGGCTTCTCATACACCCAGCGGAACTTGCCGGGACGCGCGAAAGCCATGGTGCCGGACGACTGCTGGCGCGCGTGTCCGTTTTTGTCCAGCACGGTCTGGGTGAACGTGGCGCGCGCGGTCTGGGTATCCGCGACGAACGCCTTGAGCGCGTCGATGCTGGATGCTGCGGCGCTGGCAGAGAAGATCAAGAGTGCGGTAAAGAGACTGAGTTTTTTCATTGGGACTTTAAGTCGAATGGACAGGATTTTTAATGCTTAACAAGAGAGTTGGTCTGGTTTTAGTGCAGAAAATTCAACACCCCCCAAAATTCATATTTATTCTGAATGATCCGGTCCTTATCCTGTTAATCCTGTCTAATCACATTGTTCATTCCCGGTTAGGTGCAATCACTTCGCGGTTGCCGTTGCTCTGCATCGGCGTCACCAGACCCGCCTGTTCCATTGCCTCGATCAGCCGCGCGGCACGGTTGTAGCCGATGCGCAGGTGACGCTGCACGGCGGAGATGGAGGGGCGCCGGGTTTTCAGCACGATGGCGACGGCTTCGTCATAGAGCGGGTCGCTTTCGGCGTCACTGCCGCCGACCGCGCTTTCGCCGCTGCCTTCATTGTCTTCCGGGGTGTCGAGGATGCCGTCAATGTAATCGGGCTCGCCCAGCTGTTTCAGGTATTCCACGACTTTATGCACTTCTTCGTCGGCCACAAAAGCGCCGTGCACGCGCTGCGGATAGCCGGTGCCGGGCGGCAGGTAGAGCATGTCGCCCTGGCCGAGCAGGGCTTCTGCGCCCATCTGGTCGAGGATGGTGCGCGAGTCGATTTTGCTCGATACCTGGAACGCGACGCGGGTGGGGATGTTGGCCTTGATCAGGCCGGTAATCACGTCCACCGACGGGCGCTGCGTGGCCAGGATCAGATGCACCCCGGCGGCG
CGGGCCTTCTGCGCCAGCCGCGCAATGAGCTGTTCGACGGCCTTGCCTACCACCATCATCATGTCAGCCAGCTCGTCGATCACCACCACAATCATGGGCTGCTCTTCCAGCGGCTCGGGATTATCCGGGGTCAGGCTGAACGGGTGGGTGAGCGGGGTGGCGGCCCTTTTCGCGTCGCGCACTTTCTGGTTGTAACCAGCCAGGTTGCGCACGCCGAGTGCGGACATCAGCTTGTAGCGGCGCTCCATCTCGGCCACGCACCAGTTGAGCGCGGCGGCGGCCTGACGCATGTCGGTGACTACCGGGGCGAGCAGATGCGGAATGCCCTCGTATACCGACAGCTCGAGCATTTTCGGGTCGACCAGAATCAGTCGCACACGGCTGGCGTCCGCTTTGTACAACAGGGAAAGGATCATCGCGTTGATGGCAACTGATTTGCCCGAGCCGGTGGTGCCCGCCACCAGTACGTGCGGCATTTTGGCCAGATCGGCCACCACCGGGTTGCCGGCTATGTCCTTGCCCATCGCCATTGCCAGCGGGCTGGCCATGGCGTGGTACACCTTGGCGGAAAGGATTTCCGATAACCGCACGATCTCGCGCTTGGGGTTGGGGATTTCCAGCGCCATGGTGGTCTTGCCGGGGATGGTTTCCACCACGCGGATGCTCACCACCGACAGCGCGCGCGCCAGGTCTTTGGCCAGGTTGACGATCTGGCTGCCCTTGACCCCGGAGGCCGGCTCGATCTCGTAGCGGGTGATGACCGGGCCGGGCAGAGCGGCGACCACGCGCGCCTGCACGTTGAATTCGGCCAGCTTGCGCTCGATCAGGCGCGAGGTGAATCCCAGGGTTTCCGCCGACAGGCTCTCCACATGGGCAGTCGCAGCATCCAGCAAATGCAGTGGCGGCAGCGGGGAATCCGGCATTTCGGCAAACAGCGGCACCTGCTTTTCCACCTGCACGCGTTCCGATTTGATAATGGTGGGCGGCGGGGTTTCGATGACCACCGGAGCGCGGTCCATATTGCGGCGCTTTTCTTCCAGTACAATCTCGTCGCGCCGCAGCGTGGCGGCTTGACCCATGCGCCGTTCGCGCCGCGCGGACCAGGTCTCGATTGCCCATAACCAGCTGCGTTCGAGGTATTCCCCGGTGGATTCCACTAGCGCCAGCCAGGAGATGCCGGTAAACAGGCTCAAGCCCGCGGCAATCAGCACCAGCAAGGCCAGCGTGCCGCCGGTAAAGCCCAGCAAATGGGTGACTGCTTCTCCCACAACCGCTCCCAGAATGCCTCCCGGGTGCTCCGGTAGTGCTATGTGCAAGCTATACAGACGCTGCGATTCCAGCCCGGCGCTGGACAGCAGCAGCAACAAAAACCCGGCACTGCGGATATATAACGGGCGGCGGTCGGGCGCCTCGCTAATCTCCAGTTTGCGATATCCCCACCAGGTGGCATACAGCGCCAGCATCACCAGCCACCAGGTGGAGAGCCCGAACAGTGCAAGCAGAATGTCGGCCAGATACGCCCCCAGCGTGCCGCCCATATTGTGCACACTGCCTTGTCCGCTGTGCGACCAACCCGGATCAGACTGGGTGTACGTGAACAGGATGACCGCCAGGAACGCCGCCAGCGCCACGCTCAGCAGCCAGCCGACCTCGCGCAGCAGGCCGGTGAGCCTGGGCGGGAGCGGTTTGCGGACGGATTTGGGCGTATTGCGCCGGGGAATGGACATGGTCAGCAATTTAATCGGAAGGCAAAATTATAACTTGAACCTTGTCCAGTCCGTGACCATCTTCCATACTGTTCCAGTTTTCCAGTTTTCAATCCCGCTACGGAGTACTACATGACCACCCCTCAACATCACCGTCTCATCATCCTCGGCTCCGGCCCCGCAGGCTATTCCGCCGCCGTTTACGCTGCGCGCGCCAACCTGAATCCGGTCGTCATTACCGGCATGGCGCAAGGCGGTCAGCTGATGACCACCACCGATGTGGACAACTGGCCT
GCCGACGCCGACGGCGTGCAGGGGCCGGAGTTGATGGCGCGTTTCGAGAAGCACGCGCGCCGCTTCAACACCGAAATTATTTTCGACCACATCCACACTGCCAAACTGACCGACAAGCCCATCGCGCTGGTCGGCGACCAGGGTAGCTACACCTGCGATGCGCTGATTATCGCCACCGGCGCGTCGGCCATGTATCTGGGGCTGGAATCCGAGCAGGCGTTCATGGGCAAGGGCGTATCCGGCTGCGCCACCTGCGATGGATTTTTCTACCGCAATCAGGACGTGGCGGTGATCGGCGGTGGCAACACTGCCGTGGAAGAAGCGCTGTATCTGTCCAACATTGCGCGTCATGTCACCGTGGTGCATCGCCGCGACAAGTTCAAGTCGGAAAAAATTCTCGCCGATCATCTGATGGAGAAGGTCAAGGAAGGCAAGATCAGCGTGGAGTGGAACAGCGAACTGGACGAGGTGCTGGGCGACAAGACGGGTGTGACCGGCATGCGCATCAAGTCCACCGTGGATGGCAGCACCAGGGATATCGCCCTGACCGGCGTGTTCATCGCCATCGGCCACAAGCCCAACACCGATATTTTCACTGGCCAGATCGCGATGGAAGGCGGCTATATCGTCACCCAGGGGGGCAACAAGGGCAATGCCACCGCCACCAGTGTTCCCGGCGTGTTTGCCGCGGGTGACGTGCAGGATCACATCTACCGCCAGGCGGTGACCAGCGCCGGTACCGGCTGCATGGCCGCGCTGGACGCCGACCGCTATCTGGAAAGCCTCGGCAAGTAATCTCCCCATGGCCGGGCGTGTGCCTCCAGCGGACGAAGCCGCACTGTTCAAAGCGGCGGTGCAGGACGCCCAGCCGCTACCCGACCACGGCAAGGTGGAACCGCCCTTGCCGCGCGTTTCCCCTATCCCGCGCCAGCGTATTCGCGATGAGCGTCAGGTCTTGGCCGACAGCCTGTCTGACCACATCGTGTGGGAGGATACCATGGAAACCGGCGAGGAGCTGGTGTTCCTGCGCACTGGCTTGCGCCGCGACACGCTCAAAAAACTGCGGCGCGGGCACTGGGTGCTGCAGGCCGAACTGGATTTGCATGGCCTGGTGAGCGTGGAAGCGCGCCAGGCGCTGAGCGCGTTTATCGCCGGCTGCGGCAAGCGCGGCCTGCGTTGCGTGCGCATCATCCACGGCAAAGGGCTGCGTTCCAAAAACCGCGAGCCGGTGTTGCGCACCAAGGTGAAAAACTGGCTGATGCAAAAAGACGAAGTGCTGGCGTTTTGCCAGGCGCGTGCGGTGGACGGCGGCAGCGGCGCGGTGGTAGTGTTACTCAAGTCTTCATGAAAACTTTTTGGAGGAATGCCATGACCGCAATTACCGAATTCGAACTTCCTTCCACCGGCAACCGAACCTTCAAACTCACCGACATGCGCGGCAAGAAGCTGGTGGTGTACTTCTATCCCAAGGACGACACGCCGGGCTGTACCGTGGAGGGCTCCGACTTCCGCGATTTGTATGCCGGGTTTCAGGCGCACAATTGCGAGATCGTGGGTATTTCGCGCGACGATATGAAATCCCACGAGAAATTCAAGACCAAGCTCAGCCTGCCGTTCGAGCTGTTGTCCGACGCAGACGAAAAAGTGTGCGAACTGTTCGGCGTGATAAAGCTGAAGAACATGTACGGCAAGGAAGTCCGTGGCATTGATCGCAGCACTTTTGTGTTCGACAGCGACGGCAAGCTGGTCAAGGAATGGCGCGGCGTGAAATCCGCCGGCCACGCGCAGGAAGTGCTGGATACCATTAAAACGCTCCAGGAGAAAATCTGAATGCCCCGCAAAGCCGCTGCGCCCACCAAGCTGTTTGTTCTTGATACCAACGTGCTGATGCACGACCCCACCAGCTTGTTCCGTTTTGAGGAGCACGACATATTTCTGCCGATGGGCACGCTGGAGGAACTCGATCACAACAAGAAAGGTATGACCGAAGTGGCGCGCAA
TGCGCGTCAGGCCAGCCGCTTCCTGGACGAAATCGTATCGGGTTGCGAAGATGCAATCGAAGCCGGGATTCCGCTCAGCAGCCATAGCCGCAAGGCAGCGACCGGACGGCTGTTTCTGCAGACCAGAATGGCGCGCATTGAAACCCCGCTCAGCCTGCCCAACAGCAAGGTGGACAACCAGATTCTCGGCGTGATTCTGAGCCTGCGCGAAGAGCAGCCCAGGCGCCCGATAATCCTGGTGTCCAAAGACATCAACATGCGCATCAAGGCGCGTGCACTGGGCTTACCCGCCGAGGACTACTTCAACGACAAGGTACTCGAAGACACCGACCTGCTCTATGCCGGGGTGCGTGAATTGCCGGAGGATTTCTGGGACAGGCACGGCAAGGGTATCGAGTCCTGGCAGGCTGATGGCCACACCTGGTATCGCGTCAAAGGCCCGCTGGTGACCCGCCTGCTGGTCAACGAATTCGTCTATCAGGAGAGCGCCAGCCCGCTCTACGCCATCGTCAAAACCATCAAGGGCAATGTCGCCGAGCTGCAGACCATCAAGGACTACAGCCACCAGAAAAACAATGTGTGGGGCATCACCGCGCGCAACCGCGAACAGAATTTCGCGCTCAATGTGCTGATGGATCCGGAGGTGGATTTTGTCACTTTATTAGGCCAGGCCGGTACCGGCAAAACCCTGCTCACCCTCGCGGCGGGGCTGATGCAGACGCTGGAGCACAAGCGTTATTCGGAAATCATCATGACCCGCGTGACCGTGCCGGTAGGCGAAGACATCGGTTTCCTGCCCGGCACCGAGGAAGAAAAAATGACGCCGTGGATGGGCGCGCTGGAAGACAATCTCGACGTGCTCAACAAGACCGACGACAGCGCCGGCGACTGGGGACGCGCCGCCACGCAGGACCTGATCCGCAGTCGCATCAAGGTCAAATCGCTCAACTTCATGCGCGGGCGTACCTTCCTCAACAAATACCTGATCATCGACGAGGCGCAGAACCTCACCCCCAAACAGATGAAAACCCTCATCACCCGCGCCGGTCCCGGCACCAAGGTGGTGTGCCTGGGCAACATCTCGCAGATTGATACGCCTTACCTCACCGAGGGCAGCTCCGGCCTGACCTACGTGGTGGACCGCTTCAAGGGCTGGCCCCACGGCGGCCATATCACCCTGGCGCGGGGCGAGCGTTCGCGCCTGGCCGACTGGGCGGCGGAAATGCTATGA
Protein sequences of DBSCAN-SWA_3 >NZ_AP021884|882982:892137|882982_884263_-|WP_147071127.1|tRNA|DBSCAN-SWA MLDIQHLRNDLDDVAQRLAMRGYTLDTAEFQRLEAERKQLQTRTQELQARRNASSRQIGMAKAKGEDVSTIMAEVANLGAELKAAESALEALQARLNALLMTIPNLPHASVPTGKSDADNVEIARVGTPRVFDFAVKDHVDVGTALGLDFETASKLAGARFSLLRGGLARLHRALAQFMLDTHTAVHGYTEVYVPYLVNADSMRGTGQLPKFEEDLFHVPRADAEKLYLIPTAEVPLTNMVRDAIVAREQLPLKFVAHTPCFRSEAGSYGRDTRGMIRQHQFDKVELVQLVEPEQSYTALEALTGHAEAILQALGLPYRKMALCGGDMGFSAAKTYDLEVWLPAQNTYREISSCSNFEAFQTRRMQARFRGEKGKPELLHSVNGSGLAVGRTLVAILENHQQADGSVALPEVLRPYMGGLTVLMPA >NZ_AP021884|882982:892137|884262_885555_-|WP_147071312.1|DBSCAN-SWA MDSAPLAERLRPRNLDEVIGQTHLLGPGKPLRVAFESGKPHSMILWGPPGVGKTTLARLMSHGFDMEFIALSAVLSGVKEIREAVARAEMMQGQGRRTLLFVDEVHRFNKSQQDAFLPFVEAGLITFIGATTENPSFEVNSALLSRAQVYVLKPIVETDLAALLERARISTYPDIAITGAARAQLTGYADGDARRLLNLLEQAANAALDAARHEIDEDFIQTTLARGARRFDKGGDNFYDQISALHKSVRGSHPDAALYWMVRMLDGGADPLYVGRRLIRMASEDIGLADPRALRLTLDAVETYQRLGSPEGELALAQAVIYLACAPKSNAVYVAYNNARSFVAADKSREVPVHLRNAPTKLMKQLGYGHAYRYAHDEPDAYAAGEDYLPEGVNAHFYAPTPRGLEGKIAERLAYLRKLDEQANKKTGQK >NZ_AP021884|882982:892137|885581_886199_-|WP_147071125.1|DBSCAN-SWA MKKLSLFTALLIFSASAAASSIDALKAFVADTQTARATFTQTVLDKNGHARQQSSGTMAFARPGKFRWVYEKPYEQIIVGDGKRIWLYDADLQQVTIKKLGQALGSSPAALLAGSKDIGRFYTITDAGSRDGLEWLDARPRDKESAFENVRMGFSKNTLVAMEIKDNFGQTTVLKFAGLERNPALPASDFHFTPPQGVDVISDGK >NZ_AP021884|882982:892137|890274_890736_+|WP_147071117.1|DBSCAN-SWA MTAITEFELPSTGNRTFKLTDMRGKKLVVYFYPKDDTPGCTVEGSDFRDLYAGFQAHNCEIVGISRDDMKSHEKFKTKLSLPFELLSDADEKVCELFGVIKLKNMYGKEVRGIDRSTFVFDSDGKLVKEWRGVKSAGHAQEVLDTIKTLQEKI >NZ_AP021884|882982:892137|886347_888633_-|WP_147071123.1|DBSCAN-SWA 
MSIPRRNTPKSVRKPLPPRLTGLLREVGWLLSVALAAFLAVILFTYTQSDPGWSHSGQGSVHNMGGTLGAYLADILLALFGLSTWWLVMLALYATWWGYRKLEISEAPDRRPLYIRSAGFLLLLLSSAGLESQRLYSLHIALPEHPGGILGAVVGEAVTHLLGFTGGTLALLVLIAAGLSLFTGISWLALVESTGEYLERSWLWAIETWSARRERRMGQAATLRRDEIVLEEKRRNMDRAPVVIETPPPTIIKSERVQVEKQVPLFAEMPDSPLPPLHLLDAATAHVESLSAETLGFTSRLIERKLAEFNVQARVVAALPGPVITRYEIEPASGVKGSQIVNLAKDLARALSVVSIRVVETIPGKTTMALEIPNPKREIVRLSEILSAKVYHAMASPLAMAMGKDIAGNPVVADLAKMPHVLVAGTTGSGKSVAINAMILSLLYKADASRVRLILVDPKMLELSVYEGIPHLLAPVVTDMRQAAAALNWCVAEMERRYKLMSALGVRNLAGYNQKVRDAKRAATPLTHPFSLTPDNPEPLEEQPMIVVVIDELADMMMVVGKAVEQLIARLAQKARAAGVHLILATQRPSVDVITGLIKANIPTRVAFQVSSKIDSRTILDQMGAEALLGQGDMLYLPPGTGYPQRVHGAFVADEEVHKVVEYLKQLGEPDYIDGILDTPEDNEGSGESAVGGSDAESDPLYDEAVAIVLKTRRPSISAVQRHLRIGYNRAARLIEAMEQAGLVTPMQSNGNREVIAPNRE >NZ_AP021884|882982:892137|890736_892137_+|WP_147071115.1|DBSCAN-SWA MPRKAAAPTKLFVLDTNVLMHDPTSLFRFEEHDIFLPMGTLEELDHNKKGMTEVARNARQASRFLDEIVSGCEDAIEAGIPLSSHSRKAATGRLFLQTRMARIETPLSLPNSKVDNQILGVILSLREEQPRRPIILVSKDINMRIKARALGLPAEDYFNDKVLEDTDLLYAGVRELPEDFWDRHGKGIESWQADGHTWYRVKGPLVTRLLVNEFVYQESASPLYAIVKTIKGNVAELQTIKDYSHQKNNVWGITARNREQNFALNVLMDPEVDFVTLLGQAGTGKTLLTLAAGLMQTLEHKRYSEIIMTRVTVPVGEDIGFLPGTEEEKMTPWMGALEDNLDVLNKTDDSAGDWGRAATQDLIRSRIKVKSLNFMRGRTFLNKYLIIDEAQNLTPKQMKTLITRAGPGTKVVCLGNISQIDTPYLTEGSSGLTYVVDRFKGWPHGGHITLARGERSRLADWAAEML >NZ_AP021884|882982:892137|889708_890254_+|WP_147071119.1|DBSCAN-SWA MAGRVPPADEAALFKAAVQDAQPLPDHGKVEPPLPRVSPIPRQRIRDERQVLADSLSDHIVWEDTMETGEELVFLRTGLRRDTLKKLRRGHWVLQAELDLHGLVSVEARQALSAFIAGCGKRGLRCVRIIHGKGLRSKNREPVLRTKVKNWLMQKDEVLAFCQARAVDGGSGAVVVLLKSS >NZ_AP021884|882982:892137|888744_889701_+|WP_147071121.1|DBSCAN-SWA MTTPQHHRLIILGSGPAGYSAAVYAARANLNPVVITGMAQGGQLMTTTDVDNWPADADGVQGPELMARFEKHARRFNTEIIFDHIHTAKLTDKPIALVGDQGSYTCDALIIATGASAMYLGLESEQAFMGKGVSGCATCDGFFYRNQDVAVIGGGNTAVEEALYLSNIARHVTVVHRRDKFKSEKILADHLMEKVKEGKISVEWNSELDEVLGDKTGVTGMRIKSTVDGSTRDIALTGVFIAIGHKPNTDIFTGQIAMEGGYIVTQGGNKGNATATSVPGVFAAGDVQDHIYRQAVTSAGTGCMAALDADRYLESLGK |
8 | uncultured_Mediterranean_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1605656 : 1613255
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_AP021884|1605656:1613255|DBSCAN-SWA CATGCAAAATCTCGACACTATTGTTGCCGCAGCGCTAGCCGAATTCGCCGCAGTCAACCAGGCCGTTGAACTGGAGCAGGCAAAAGCCCGCTATCTCGGCAAAGCCGGTTTGCTCACCGGGCAATTGAAACAACTGGGCAAGCTTCCCGCCGCAGAACGCCCGGCAGCGGGCAACGTGATCAATCAGGCCAAGGAACGGATTCAGCAGGCGCTGGAAGCGCGCCGCGCAGCCTTGTCCCGGGCTGAGCTGGATAACAGGCTGGCGGCGGAAACCCTGGATGTGACGCTGCCCGGACGCGGCCTGGGCACAGGCGGCCTGCACCCGGTGACGCGCACGCTGGCACGCATCCAGGCGCTGTTCGCCTCGATCGGTTTCGAGGTGGCGGAAGGCCCGGAGATTGAAACCGATTTCTACAATTTCACCGCACTGAATATTCCGGAAAACCACCCGGCGCGCGCCATGCACGACACTTTCTACGTGGATGACAAACACCTGCTGCGCACCCACACGTCGCCGGTGCAGATACATTATTTGCAGAACAATCAGCCGCCGCTCAAGATCATCGCGCCAGGCCGGGTATATCGCTGCGATTCCGACGTGACCCACACACCCATGTTTCATCAAGTCGAGGGATTGTGGGTGGACGAAGAGGTGAGTTTCGCGGCATTGAAAGGCGTGCTGGCGGATTTCATGCAGCGTTTTTTTGAACGCGATGACCTGAAGGTGCGCTTCCGCCCATCGTTTTTCCCGTTCACCGAACCGTCGGCGGAAATGGATATCGCTTGCGTGATGTGCGGTGGCGGCGGTTGCCGCGTATGCAGCCATACCGGCTGGCTGGAAGTGCTGGGCTGCGGCATGGTGCATCCCAATGTGCTGGGACATGTGCATGTGGATAGCGAAAAATACCTCGGTTTCGCGTTTGGCATGGGGGTGGAACGGCTGGCCATGCTGCGCTACGGTGTGGATGACCTGCGCCTGTTTTTCGCTAATGATTTGCGTTTCCTGAAACAGTTCAACTGAACCATCAAGATGAAATTCTCCGAGCTCTGGTTGCGTACCCTTGTTAATCCTGCGCTGGACAGCGCGGCGCTGTCCCATCTTCTTACCATGGCCGGACTGGAGGTCGAAGCGCTGGACCCGGTCGCGGCGGATTTTTCCGGCGTGGTGGTGGGGCAGGTGCTGTCCGTAGCGCCGCATCCGGATGCCGATCGCCTGCGCGTGTGCCTGGTAGATGCCGGCACTGGCAGCCCGTTGCAGATCGTATGTGGCGCACCCAATGTAAGTGAAGGCGCGCGCGTGCCTTGCGCCCTGGCAGGCGCCCGCTTGCCGGGCTTTGAAATCAAGAAAGCCAAGTTGCGGGGTGTGGAATCGCAGGGTATGTTGTGCTCCGCGCGCGAGCTGGGACTGGCAGAACAAGCCGATGGCCTGCTGTTGTTGCCGAACGACGCACCGGTGGGTAGCAATATCCGCGATTATCTGCATCTGGATGACAGGCTTTATACGCTCAAACTTACCCCCAATCGCAGCGATTGCCTGAGCGTGGCCGGCGTGGCGCGTGAAGTGGCCGCGCTTACCGGCAGTCCATTGAACTTGCCCCGGATTGAACCCGCAGCGGTCACCGGCAGGCTCACCCGCATGGTGCAGGTGACTGCAGGACAAGCCTGCCCGCGCTATTGCGGGCGCGTCATCAGCCAGCTCAATCGCGCGGCTCAAACACCGGGCTGGATGATTGAACGCCTTTCCCGCAGTGGCCTGCGCAGTATCAGTCCGGTAGTGGACATTACCAACTATGTATTGCTGGAGTTGGGACAACCCTTGCATGCCTTTGATCTGGACAAGCTTGCTGGCGATATCCAGGTGCGCATGGCCACGCCGGGTGAAACGCTGACGCTGCTGAATGATCAGCGTGCGACGCTGGAAGC
GGACATGCTGGTGATCGCCGATGACAACGGCGCGCAGGCGCTCGCTGGCATCATGGGGGGGGCGGCCACCGCAGTGGATGAAAATACCTCGGAAATTTTTCTTGAGGCAGCGTATTTCAGCCCCGGCGCGATTGCCGGACGGGCGCGCCGGCTGGGCTTGTCCACCGATTCATCGCACCGTTTTGAGCGCGGGGTGGACTACGCAGCCACGCGCGATGCGCTGGAACGCGCCACGGCATTGATACTTGAAATTTGCGCTGGCGCGGCAAGTGCAATCACCGAAATAACAGGCGATCTGCCACAACGTGCGCCTGTCATGCTGCGCACCGCGCGTGCCAGCAAAGTGTTGGGCGTGGCGCTGAGTGACGCGCAGGTGGAAGTGTTGCTGGGCCGCCTGTGCTTTGACTTTCAGCGCGATGGCGCGGCCTATCAGGTGACGCCGCCCAGCTACCGCTTTGACCTGAATATCGAGGAAGACCTGATCGAAGAACTGGCGCGGCTCCATGGTTATGACAACATTGTTGCGCAGGCCCCGGTCGCCCGCCTGACCATGTTGCCGCAGCCGGAGCAACAGCGTGGGGTGGATGCGTTGCGCACCCTGCTCACCGCGCGTGATTATCAGGAAGTCATTACCTACAGCTTTGTAGATGCCGCATGGGAAGCGGATTTCGCACCCGGCGCTCAGCCCGTCGTGCTGAAAAATCCCATCGCCAGTCAGATGGGCGTGATGCGCTCCACCTTGTTGGGCGGCCTGATGGATGTGCTGCGCAACAATCTGAACCGGCGCCAGGAGCGTGTGCGTATTTTTGAGAGCGGACGCTGTTATCTGCCGGCGGCCGAGGGCTTCGATCAGCCGCAACGCCTGGCTGGACTGGCTTACGGCAGCGCTATGCCGGAGCAGTGGGGGAGTGCGGCGCGCAACGTGGACTTTTTTGACGTCAAAGCCGATCTCGAAGCGCTGTGCTGGCCACAGCCTGCACGCTTTGAAAAATCCGCTCATCCTGCGCTGCATCCAGGCCAGTGCGCTGAAATGTGGTTGAATGGTGTCCATGCCGGCTGGCTGGGTACATTACACCCACGGCTGACGCAGCAATATGATTTGGCGACAGCGCCGGTTGTGTTTGAACTCGCCCTGCCGGCATTGTTAACGCGGAAGCTGCCCAGGCATGGCGAGATTTCGCGTTTCCAGAGCGTGCGCCGTGATCTGGCCGTGATAGTCGATGAATCGGCGCCGGTACAGGCTTTGATTGATGCGATGTACGCAGCACGCATAGAGGGTGTTGCCGAGATTACATTGTTTGACGTGTATCGCGGCAAAGGCATTGATTCTGATAAAAAAAGTCTTGCATTCCGGGTGCTGTTGCAAGATACTCAAAAGACCTTTACCGACACTGAAGTGGATACCGCCATGGCGTACTTCACCGATCTGTTAAAACAACAATTCAACGCGCAATTACGTTCCTGAGGTAGTCATGACCCTGACCAAGGCAGAACTGGCAGACATGCTGTTCGAAAAAGTTGGCCTGAATAAACGCGAAGCCAAAGACATGGTGGAGTCGTTTTTCGAAGAAATACGCATTGCACTGGAAGCGGGCGATACCGTGAAGCTTTCCGGCTTTGGCAATTTTCAGCTGCGTGACAAACCGCAGCGTCCTGGCCGCAATCCCAAAACCGGCGAAGAAATGCCAATCACGGCACGCCGCGTGGTGACCTTTCACGCCAGCCAGAAACTCAAATCGCAGGTAGAAGACGCGCATGGCGGAACATCAGCCAACTAGTCAACTGCCGCCGATTCCTGCCAAGCGCTACTTCACTATCGGCGAGGTCAGCGAACTGTGCGGGGTGAAGCCGCACGTACTGCGCTACTGGGAACAGGAATTCGGCCAGCTCAAACCAGTCAAGCGACGTGGTAACCGTCGTTACTATCAGCATCATGAAGTGCTGCTGATTCGCCGCATCCGGGAACTGCTTTATGAGCAGGGATTCACGATCAATGGCGC
ACGCCATCGTCTGGATGTGCTGGCCACATCCGACGCCGCCGAGGCCGCACCCACGGTGACTGAATCGGTAACGGATTATGCAGCACTGCGTCGCGAAATGATGGAAATTGTCGAGTTGCTGCGCCTGTGATTTTTTAGCTCCAGTCTTTGTCCAGGCGCTACAGACTCTGCTATAATCGCGGCCTTCGGGGCGTAGCGCAGCCTGGTAGCGTACTTGCATGGGGTGCAAGTGGTCGGAGGTTCAAATCCTCTCGCCCCGACCAGAACAATGCCCAGGTGTCATGCCTTTCTAGAAAGCAATTCACGGGAATTTCAGAGTAGCCGTCCACTATGCGCATTTTGCTGAGCAACGACGACGGTTACTTTGCACCCGGTCTCGCCATCCTGGCGGATACACTCTCACACATCGCAGATATCACGGTGGTTGCCCCCGAGCGCGACCGCAGCGGCGCCAGCAATTCCCTCACACTCGATCGTCCGCTGATGCTGCGCCAGGCGCACTCCGGGTTTTATTACGTCAATGGTACGCCCACGGACTGCGTTCACCTCGCGGTTACGGGTATGCTCGATCACCTGCCGGACATGGTTATCTCCGGTATCAATCACGGTGCCAACATGGGCGACGACACTATTTATTCGGGCACCATAGCGGCAGCCACCGAAGGGTTTTTACTGGGGGTGCCGTCGCTGGCGATATCCCTGGCGAGCCATGCAGCGGGCAACTATGCCACAGCCGCGCGTGTTGCCAGCGAGCTCGCGCAACGTGTCATGGCACGGCCTTTTGCGGCACCGCTACTGCTCAACGTGAACGTGCCGGATATTCCCTATCAGGACTTGCAAGGCACCGCAATCACCCGCCTCGGACGCCGCCACAAAGCCGAACCGGTGGTCAAATCCACCAATCCGCGCGGTCAGACGGTGTATTGGGTGGGGGCTGCGGGTGCGGCGCAGGATGCAGGCGAAGGCACGGATTTCCATGCGGTGGCGAATGGGCGTGTGTCGGTGACGCCGTTGCAGATGGATCTCACCCAGTTCAGTCAACTGGCGCCGCTGCGGGCGTGGTTGCAGGCATGAGCGTGACGCGTCACAGCGGTATCGGCATGACTTCCGAGCGTACCCGCGCGCGCATGGTCGAGCGCCTGCGTGCGCAGGGGATCAAGGACAACAACGTACTCACCGCGATGGGCATGGTGCCCCGGCATATTTTTGTGGATGAGGCACTGTCCATCCGGGCTTATGAGGACAGCGCGCTGCCGATAGGTTTCGGCCAGACCATTTCCAGCCCCTATAGCGTGGCGCGCATGATCGAGGTGCTGCGTGGCGGCGCCGACCTGCAGTGCGTGCTGGAAGTCGGCACAGGTTGCGGCTACCAGGCCGCTGTACTGGCCAAGCTGGCACGCGAAGTGTACTCGGTTGAGCGCATTGCCACGCTGCTCGGGCGCGCGCGTCGTACCATACGCGAACTACGCATCGGCAATATCAAACTTAAACATGGCGATGGTAGCATTGGGTTAAAGGATGTGGCACCTTTCGATGGCATCATCCTTGCCGCCGCCATACCCACTCCCCCCCAGGCGTTGCTGGAACAACTGGCGCAGGGCGGCCGCATGGTATTGCCGCGAGGTATTGGTGAAACGCAGCAAATGGTGCTGATCGAGCGCACCGCAGAAGGTTTTCAGGAGACGGTGCTGGAAATGGTACATTTTGTTCCACTGTTGCCCGGAGTGCGCTGACGTGATGGTATTCGACAAGATTGCATGCCCGGTTTATCTGGTTGCTGTGATCCTTGCCCTGGGAGGATGCGCCACGCAGAATTCTGCGCCGGTTGTGGATGGCACCCAGCCTGGCACAAGCAATATTGTCAAGCCGGCAATCAAGTCCGCCACTACCCGCGCCGGCGCAGCAAAGCTGCATGACTGGCGACCGGACAGCCATACCGTGCAAAAAGGCGACACGCTCTACAGCATCGCGCTTGAATATGGCCTGGACTACCGTGATCT
GGCGAGCTGGAATGCACTATCTGACAATAACCTGATCCGTGTCGGGCAGGTGTTGAAACTGAGTGCGCCGCAGCCAGGCAGCGGCATTGCGCAAGTCACGACATCTGAATCAGCAGTTCAGACCATCCCCCTCAAGATCGAACCGTTACCGCAGGCCCAGATAGCGACCGGCGCGGTGTTGATAACCCAGCCCAAGGCGGTCAAATTGCCCTACTCTGCCGCCGCATTGGCGCAGCTTGAACAAGGCGGGACGCCGCAGCCGGCCGCGCGGCCTCCCGCAACACCGGAGGCAGCGTCCGGGGTGGCGCCCGAGCCCGCATCCTCTGCAGAACAATCCCGACCGGCCGCGACTGCCAAGGAAACCGATGATACGGGTATTGATTGGATCTGGCCTACGCAAGGCCGGGTCATTGCCGGATTTGACGAAGCCAAAAACAGCAAGGGGCTGGATATAGCAGGCAAAGCCGGACAAGCCATATTCGCAGCCGCGCCGGGCAAGGTGGTGTATAGCGGCGCTGGTTTGCGCGGCTATGGCAAGCTGGTTATTATCAAGCACAACGCCATTTATTTGAGTGCCTACGCACACAATCAGCGGGTGCTGGTGAAAGAAGGTCAGACGGTTGCGCGCGGGCAGAAAATCGCCGAAATGGGTGACAGTGATGCCGATCAGGTCGCGCTGCATTTCGAAATCAGGAAAATGGGCCAACCGGTGGACCCGATGAAATATCTTCCCGGAGCACAAAAATAGCGATGCATGACGACAACGAGCAGGAAGACTACGCCGGTATCCCGGACGACAAATCCCAGACCGAAGTGCCCGAGGTCGAACTGGGATTTACCGAGGATGCGCATACCGATGTGACGCAGATGTACCTCAACGAAATTGGCCACAACGCATTGCTCAGCCCCACTGAGGAACGCCGCCTGGCCGAACTCACCCGTGCGGGCGATTTTGACGCCCGGCAAAAAATGATCGAACACAATCTGCGGCTGGTGGTGAATATCGCCAAGCATTACGCCAATCGCGGGCTGGCACTGCTGGACCTGATCGAGGAGGGCAACCTGGGACTGATTCATGCGCTGGAAAAGTTCGAGCCCGAACGCGGATTTCGTTTTTCCACTTATGCCACATGGTGGATACGCCAGAATATCGAGCGCGCCATCATGAACCAGTCGCGCACCATCCGCCTGCCGGTGCACGTCATCAAGGAACTCAACGTCATTTTGCGCGCCCGCCGTCATCTGGAAAATCACGGCGCCTCCGACCCGAGTGACGAGGACATCGCCCATCTGGTCGGGCTGCCGGTAGAAGATGTACGACGCATGTTGCGCCTGAATGACCGGGTGGCATCGCTCGACGCACCGCTCGATATTGATCCCAGTCTGTCCATCGGGGAGGCCATTGCAGATGGCAACAGCGCGTTGCCTGAAGACATGCTTGAGCACGCCGAGACTGAAGCCTTTGTGCGCCTGTGGCTGAGTGACCTCAACGACAAGCAGCGCTGGGTAATTGAGCGGCGTTTCGGGCTGGGCGGGCAGGATGTGCACACCCTGGAACAGCTGGCCGAAAGCCTCGACGTCACCCGTGAACGCGTGCGCCAGATCCAGATGGAAGCCCTGCACCATTTGCGGCGCATGCTGAAACGCACCGGCGTCAACAAGGACGCTCTGTTGTGA
Protein sequences of DBSCAN-SWA_4 >NZ_AP021884|1605656:1613255|1612328_1613255_+|WP_147073433.1|DBSCAN-SWA MHDDNEQEDYAGIPDDKSQTEVPEVELGFTEDAHTDVTQMYLNEIGHNALLSPTEERRLAELTRAGDFDARQKMIEHNLRLVVNIAKHYANRGLALLDLIEEGNLGLIHALEKFEPERGFRFSTYATWWIRQNIERAIMNQSRTIRLPVHVIKELNVILRARRHLENHGASDPSDEDIAHLVGLPVEDVRRMLRLNDRVASLDAPLDIDPSLSIGEAIADGNSALPEDMLEHAETEAFVRLWLSDLNDKQRWVIERRFGLGGQDVHTLEQLAESLDVTRERVRQIQMEALHHLRRMLKRTGVNKDALL >NZ_AP021884|1605656:1613255|1610648_1611311_+|WP_147073435.1|DBSCAN-SWA MSVTRHSGIGMTSERTRARMVERLRAQGIKDNNVLTAMGMVPRHIFVDEALSIRAYEDSALPIGFGQTISSPYSVARMIEVLRGGADLQCVLEVGTGCGYQAAVLAKLAREVYSVERIATLLGRARRTIRELRIGNIKLKHGDGSIGLKDVAPFDGIILAAAIPTPPQALLEQLAQGGRMVLPRGIGETQQMVLIERTAEGFQETVLEMVHFVPLLPGVR >NZ_AP021884|1605656:1613255|1609333_1609708_+|WP_147073439.1|DBSCAN-SWA MAEHQPTSQLPPIPAKRYFTIGEVSELCGVKPHVLRYWEQEFGQLKPVKRRGNRRYYQHHEVLLIRRIRELLYEQGFTINGARHRLDVLATSDAAEAAPTVTESVTDYAALRREMMEIVELLRL >NZ_AP021884|1605656:1613255|1606685_1609043_+|WP_147073443.1|tRNA|DBSCAN-SWA MKFSELWLRTLVNPALDSAALSHLLTMAGLEVEALDPVAADFSGVVVGQVLSVAPHPDADRLRVCLVDAGTGSPLQIVCGAPNVSEGARVPCALAGARLPGFEIKKAKLRGVESQGMLCSARELGLAEQADGLLLLPNDAPVGSNIRDYLHLDDRLYTLKLTPNRSDCLSVAGVAREVAALTGSPLNLPRIEPAAVTGRLTRMVQVTAGQACPRYCGRVISQLNRAAQTPGWMIERLSRSGLRSISPVVDITNYVLLELGQPLHAFDLDKLAGDIQVRMATPGETLTLLNDQRATLEADMLVIADDNGAQALAGIMGGAATAVDENTSEIFLEAAYFSPGAIAGRARRLGLSTDSSHRFERGVDYAATRDALERATALILEICAGAASAITEITGDLPQRAPVMLRTARASKVLGVALSDAQVEVLLGRLCFDFQRDGAAYQVTPPSYRFDLNIEEDLIEELARLHGYDNIVAQAPVARLTMLPQPEQQRGVDALRTLLTARDYQEVITYSFVDAAWEADFAPGAQPVVLKNPIASQMGVMRSTLLGGLMDVLRNNLNRRQERVRIFESGRCYLPAAEGFDQPQRLAGLAYGSAMPEQWGSAARNVDFFDVKADLEALCWPQPARFEKSAHPALHPGQCAEMWLNGVHAGWLGTLHPRLTQQYDLATAPVVFELALPALLTRKLPRHGEISRFQSVRRDLAVIVDESAPVQALIDAMYAARIEGVAEITLFDVYRGKGIDSDKKSLAFRVLLQDTQKTFTDTEVDTAMAYFTDLLKQQFNAQLRS >NZ_AP021884|1605656:1613255|1605656_1606676_+|WP_147073445.1|tRNA|DBSCAN-SWA 
MQNLDTIVAAALAEFAAVNQAVELEQAKARYLGKAGLLTGQLKQLGKLPAAERPAAGNVINQAKERIQQALEARRAALSRAELDNRLAAETLDVTLPGRGLGTGGLHPVTRTLARIQALFASIGFEVAEGPEIETDFYNFTALNIPENHPARAMHDTFYVDDKHLLRTHTSPVQIHYLQNNQPPLKIIAPGRVYRCDSDVTHTPMFHQVEGLWVDEEVSFAALKGVLADFMQRFFERDDLKVRFRPSFFPFTEPSAEMDIACVMCGGGGCRVCSHTGWLEVLGCGMVHPNVLGHVHVDSEKYLGFAFGMGVERLAMLRYGVDDLRLFFANDLRFLKQFN >NZ_AP021884|1605656:1613255|1609050_1609356_+|WP_147073441.1|DBSCAN-SWA MTLTKAELADMLFEKVGLNKREAKDMVESFFEEIRIALEAGDTVKLSGFGNFQLRDKPQRPGRNPKTGEEMPITARRVVTFHASQKLKSQVEDAHGGTSAN >NZ_AP021884|1605656:1613255|1609908_1610652_+|WP_147073437.1|DBSCAN-SWA MRILLSNDDGYFAPGLAILADTLSHIADITVVAPERDRSGASNSLTLDRPLMLRQAHSGFYYVNGTPTDCVHLAVTGMLDHLPDMVISGINHGANMGDDTIYSGTIAAATEGFLLGVPSLAISLASHAAGNYATAARVASELAQRVMARPFAAPLLLNVNVPDIPYQDLQGTAITRLGRRHKAEPVVKSTNPRGQTVYWVGAAGAAQDAGEGTDFHAVANGRVSVTPLQMDLTQFSQLAPLRAWLQA >NZ_AP021884|1605656:1613255|1611315_1612326_+|WP_147073482.1|DBSCAN-SWA MVFDKIACPVYLVAVILALGGCATQNSAPVVDGTQPGTSNIVKPAIKSATTRAGAAKLHDWRPDSHTVQKGDTLYSIALEYGLDYRDLASWNALSDNNLIRVGQVLKLSAPQPGSGIAQVTTSESAVQTIPLKIEPLPQAQIATGAVLITQPKAVKLPYSAAALAQLEQGGTPQPAARPPATPEAASGVAPEPASSAEQSRPAATAKETDDTGIDWIWPTQGRVIAGFDEAKNSKGLDIAGKAGQAIFAAAPGKVVYSGAGLRGYGKLVIIKHNAIYLSAYAHNQRVLVKEGQTVARGQKIAEMGDSDADQVALHFEIRKMGQPVDPMKYLPGAQK |
8 | uncultured_Mediterranean_phage(33.33%) | tRNA | NA |
Homologous phage analysis in the prophage region
Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
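The keyword rule described above can be sketched in a few lines of Python. This is only an illustration: the page does not say how the tool matches keywords, so the case-insensitive substring test and the names `PHAGE_KEYWORDS`, `matched_keywords`, and `is_phage_related` are assumptions, not the tool's actual implementation. Only the keyword list is taken from the text.

```python
# Keywords quoted in the page text; the matching rule below is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation):
    """Return the keywords found in a protein annotation string.

    Hypothetical helper: uses a case-insensitive substring test, so
    'lysin' would also match 'lysine'. A stricter word-boundary match
    may be closer to what a real pipeline needs.
    """
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


def is_phage_related(annotation):
    """True if the annotation contains at least one phage-related keyword."""
    return bool(matched_keywords(annotation))
```

For example, `matched_keywords("Phage tail fiber protein")` returns the `tail` and `fiber` keywords, while an annotation such as `"hypothetical protein"` matches nothing and would stay uncolored under this assumed rule.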
DBSCAN-SWA_5 |
1977054 : 2023954
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_AP021884|1977054:2023954|DBSCAN-SWA AATGCAAACCCAAGTTCCATCAATCGAATCCGGTCGGAATCCCCGCCGGATGAATCCCGGCGGTGCAACCTGCATCGCCCTCGACGAAAACGAGCTCGCCATCCGCTGGGGGCTCTCCGTCAAGACGCTGCGCCGCTGGCGTCAAGAGCAGCTCGGCCCCATCTACTGCAAGCTCGGTCGCCGGGTCACCTACCTCCTGCACGAAATCGAAGCCTTCGAGCGCCGCGTCTCGCGCTACTCGAGCTTCACTCGTGCGTACCAGTGAGGAGGACGGCCATGAGCGATCTGACCATCTTCCCCGTCGACATCGCTGAGATGTCTGTGAGCCAACTGGCCGCGCTGCCGCCCGAGCAGAAGTGCGAGGTCGACAAGAACCTTGATGCTGCCATCGACTGGCTAAAGAAGGCTCGCACCAAGTTCGATGCGGCGCTGGAACAGTGCTACGGCGAGCAGGCCCGTGTCGCACTGCGTGAATCAGGCCGTGACTTTGGTACCGCCCACATCAGCGACGGCCCGCTGCACATCAAGTTCGAGCTGCCCAAAAAGGTCAGCTGGAACCAGAAACAGTTGGGCGAAATCGCCGAGCGCATCGTGGCCTCAGGCGAGAAGGTCGAGGGCTACCTCGACGTCAAGCTCTCAGTGTCCGAGTCCCGGTACATCAACTGGCCGCCTGCATTGCAGCAGCAATTCGCGGCCGCCCGCACGGTCGATTCCGGCAAGCCGTCCTTCACCCTGAGCACCGATGGGGGTGAGGCATGAAGCGGCTACCCATCGTGTCCGCCGTCGAGCGGATGGCCGAGCGCAAGGGCGTGAAGCTGCTGATGCTGGGCAAGTCCGGCATCGGCAAGACGTCCCGGCTCAAAGACCTCGACCCCGCCACCACACTGTTCCTTGACATCGAGGCAGGCGACTTGGCGGTCGCCGACTGGCCGGGCGACACCATCCGCCCGGCGTCCTGGCCCGAGAGCCGCGACTTCTTCGTGTTCCTTGCGGGCCCGGACAAGTCGCTGCCGCCGGAGAGCGCGTTCTCGCAGGCGCACTACGACCACGTCATCGAGAAGTTTGGCGATGCGACGCAGCTCGGTCGCTACCAGACCTTCTTCCTTGACTCGATCACGCAACTGTCTCGCCAGTGCTTTGCGTGGTGCAAGACGCAGCCCGGGGCGGTCAGTGATCGTTCCGGCAAGCCCGATCTGCGCGCGGCCTACGGGCTGCTCGGCCAGGAAATGATCGGCGCGTTGACCCACCTGCAGCACGCCCGTGGCAAGAACGTTGTGTTCGTGGCGATCCTCGATGAGCGACTGGATGACTTCAATCGCAAGGTGTTCGTCCCGCAGATCGAAGGCAGCAAGACCAGCCTGGAGCTGCCCGGCATCGTCGATGAGGTCGTGACGCTGGCCGAGATCAAGGCCGAGGACGGCAGTTCCTACCGCGCCTTCATCACGCACACCGTCAATCCCTACGGCTTCCCGGCCAAAGACCGCAGCGGTCGTCTCGACCTGCTGGAGCCGCCGCATCTCGGCGCGCTGATCGCCAAGTGCGCGGGCGCTGTGCCCGCGCTAGCCAGCGCCGCCAACCCCGCACACATCGAATCTCAGGAGTAATCGCAATGACCGCATGGAATGACTTCAACGACGCCGACTCTCAGCAATCCGGCTTCGATCTGATCCCCAAGGGCACCGTCGTGCCGGTGCGAATGACCATCAAGCCGGGTGGCTATGACGACCCCGAGCAAGGCTGGGGTGGCGGCTACGCCACCGAATCGTTCGAGACCGGTTCCATCTATCTGGCCGCTGAGTTTGTGGTCACCGCTGGCGATCATGCCAAGCGCAAGATGTGGAGCAACGTCGGCCTGCTCTCCAAGAAAGGCCCGACCTGGGGCCAGATGGGGCGCAGCTTCATCCGGGCCGCGCTCAACAGCGC
CCGCAACGTCCACCCGCAGGACAACAGCCCACAGGCCGCCGCCGCGCGCCGCATCAATGGCTTCGCCGAACTGGACGGTCTGGAGTTCTTGGCGCGCGTCGACATCGAGAAGGACGCGAAGGGTCAAGACCGCAACGTGGTCAAGCTGGCAGTCGAGCCCGACCACCCCGACTACGCCAAGTTGAAAGGTGTGCCGCCGAAGGGCAGTCCGGGCGGTGGCAACTCCGGCGCTCCGGCGCAGGCGGCCCCGGCCTATTCCGCGCCCACCCCGCAACGCGCACCAGTGACGGGCAAACCGTCCTGGGCTCAGTGAGGAGACGGCTATGAATGCATCCGTCCTCACTGCCAGTCACTACGGCGTCGTGCGCTTCGGCGATCTGCAATGCGAGGCCGTCGTCCTCAAGGGCGGCGAGCGTGGCTACGTTCGTCGCCAACTGGCCAAGCTGCTGGGTTTCCACGAGACGCACAAGGGTGGCCGATTTGCCCGGTTTCTTGCCGACTTCGCTCCTAAGTCCTTGTCGGCATTGGAGAAAACTCGTGAGCCGATTCTGTTGCCGTCAGGTCGGCAGGCGCAGTTCTTCCCGGCCGGGATCATTGCCGACGTCGCGTCGGCGGTGGTCAGCGCGGCCATCAACGGCACGCTGCACAAGGCCCGCCAGGGCATCGTGCCCAATTGCATGAAGATCATGCGCGCGCTGGCCACCACCGGCGAGGTCGCGCTGATCGACGAGGCGACGGGCTACCAGTACCACCGCGCGCCTGACGCGCTGCAGGAACTGATCTCCAAGCTGCTGCGCCAGTCGTGCTCTTCGTGGGAGCGCCGCTTCCACCCGGACTACTACCGCGCCCTCTACCGGCTGTTCGGCTGGAAGTACCAGGGCCACGACCAGAACCCGCCCCACGTTGTCGGTCAGATCACGCAGCGCTGGGTCTACGGCCCGGTGCTGCCCGTCACGCTGATCGACGAGATTCGCGCCCGCAAGGGCATCTCGCAGAAGCACCACCAGTGGCTGTCCGATCAGGGCCTCGCCCGTCTGGAAACGCAGATTCACGCGGTCACCGCCATTGCGCGCAGCTCGACCTGCTACCGCGACTTCGACCGCCGCTGTGAAGCGGCCTTCGCTGGCGGCGCGCTGCAGCTGGCGCTGCTGGCCGAAGACTTTGAGGAGGGGGCGTGAAATGCTGGGTCTGCAAACGACAGGCCCGGGGATTCGGTCACACCGACAACCGACACGGTATCGGCGATCCCCGGCGCTACCCCATCGACTGGGTGTTCTGCTCGCAGCGCTGCCAATCCGCGTTCCACGCTATGTACGGCAACTGGTCGCGCGCCAAGGATGGTCGCAGCGACATCAAGGGGGTCGCCATGATCGATCCCTCTGATATCGAGCTGGCCGCGATGCGCAAGTGCCTCAAGTCCTTCGGCGAGGCGGCAAGCGAGATCGGCTTCACCAAACCACTGGGCAACTACTCCGAAGCCGAGGCGCTGCAGGTGATCGACGCCATCGTCACTTGCTACACCGAGGCGATGGTTGAGCACCACGAGGCGAGCAAGTACCCGCCCGTACGCGGCATGACGCCAACGCCCGACCCCATGACACCGAGTGCAGCCAATCCGTTCGCGGATCTGGACGACGACCTGCCTTGGGAAGAACCGAAGGGGAAGAAGCCATGATGGACTTCAACTCCACTTCGAGCATCTCGGGCCAGATCACTGCGCTGGTCGACGCCGGGATGCAGCGGGCGCGAGCCCAGCAGTCCGAGCGCCAGTACCTTGGTGCCTCGCGGTTGGGCGCTGCCTGCGAGCGTGCGCTGCAGTTTGAGTACGCCAAGGCTCCCGTCGATCACGGCCGGGACACCCCGGGCCGGATGCTGCGCATCTTCGAGCGCGGCCACGTCATGGAGGACTGCATGGTCGCGTGGCTGCGCGACGCCGGTTTCGAATTGCGTACCCGCAGGGCCGATGGCGAGCAGTTTGGCTTCTCCGTGGCTGATGGCCG
TCTGCAGGGCCACATCGACGGCGTCATCGTCGATGGCCCGGAGGGCTTTGCCTACCCGGCGCTCTGGGAAAACAAGTGCCTCGGCATGAAGTCCTGGCGCGAGCTGGAGAAGAACCGGCTCGCCGTGGCCAAGCCCGTCTACGCCGCGCAAGTGGCGATCTACCAAGCCTATCTCGAACTGCACGAGCACCCGGCGATCTTCACGGCGCTCAACGCCGACACGATGGAGATCTACACCGAGGCCGTGCCCTTTGACGCAGCCCTGGCCCAGCGAATGTCGGATCGGGCGGTGAAGGTCATCACGGCGACTGAAAGCGCAGATCTCCTGCCGCGTGCCTTCAATGACCCGACCCACTTCGAGTGCCGGATGTGCGCGTGGCAAGACCGCTGCTGGAGAACACAAGCATGACCGACAACAACACCCCGACCACCGGCATCGAGCCGATGATCGATGCCAAGCAGGCGGCCGCCGCGTTGCGCCTGCCGTACTACTGGTTCGCCGACCACGCGATGCGCACCAAGTACCGGATTCCGCACTACCTGATGGGCGGTCTGGTGCGCTACCGGCTGTCCGAACTCTCTGCGTGGGCCACGCGTACCACCGCCGTTCAGGGCCGTGATTCCCAAGATGCGGACGCACCTGTCGAGGGAGCCGAATGATCGACTTCAACGACACCACCCAACCTGCGGAGCACAACAGGGAATCTGAACGAGACGAGATTCGCGCCGACTTGCTTGCGCGTCTGGAGTCGGTGCTGACCACGATGTTTCCGGCTGGCAAGAAGCGCCGTGGCAAGTTCCTGATCGGCGACATCCTCGGCAGTCCAGGTGACAGCCTCGAGGTGGTGCTGGAAGGTGAGAAGGCCGGTCTGTGGACGGATCGTGCCACCGGCGATGGCGGCGACATCTTCGCCCTGATCGCGGCCTATCTCGGTGCGAACGTCCACACCGATTTCCCTCGCGTGCTGGATGAAGCTGCCGATCTGCTCGGGCGGTCGCGGTCGGTGCCAGTGCGCAAGGCGAAGAAGGAAGCGCCTGTAGACGACCTCGGCCCGGCCACGGCGAAGTGGGACTACTTCGATGCCGGTGGCAAGCTGATCGCCGTCGTCTACCGCTATGACCCACCGGGAGGCAAGAAGGAATTCCGACCGTGGGACGCGAAGCGCCGCAAGATGGCCCCGCCTGAGCCGCGCCCGCTGTTCAACCAGCCGGGCATCGGTGCGGCCAGCCACGTCGTCCTGGTCGAGGGCGAGAAGTGCGCGCAGGCCTTGATCGCCAGCGGCGTGGTGGCCACCACCGCCATGCACGGTGCCAATGCCCCGGTCGACAAGACCGACTGGTCGCCACTGGCTGGCAAGACGGTGCTGATCTGGCCCGACCGCGATGCGCCAGGGTGGGACTACGCCGACCGCGCGTCGCAGGCGATCTTGCAGGCAGGCGCGACCTCGGTCGCCATCCTCATGCCACCCGACGACAAGCCGGAGGGGTGGGACGCTGCAGATGCCATTCCCGAAGGTTTCGATGTCGGTGGCTTTCTGGCCGTCGGCGAGCGGATGCCGGTGATGCGCTCGGTGGAGGAAGCGCCTTCGCCAGACTTGCTGACGGGCATTGATTGGACGACCGAGGATGGCCTGTCCAGCGCTTTCACCCGCCGCTATGGCGAAGACTGGCGCTACTGTGCCCTGTGGGGCAAGTGGCTGGTCTGGACGGGTGTGCGCTGGAATCCCGATCAGGTGCTCTACGTGTCGCATCTTTCCAGGGGCATCTGCCGTAACGCCTCGCTGAAAGCGGACACGCCGAGGCTCAAGGGCAAGCTGGCCAGTTCGGCCACGATTTCGTCGGTTGAAAAGATCGCGCGCTCTGACCCGAAGCACGCATCCACCGCCGAGGAATGGGACGCCGATGTCTGGGCGTTGAACACCCCCGGTGGCGTGGTCGATCTGCGCACCGGCCGGATGCGCCCGCACCGGCGGGACGACCGAATGACCAAGGTGACCACG
GCTACCCCGCAGGGCAATCCGGACAGTGCCTGCCCAACGTGGCGAGGGTTCCTGACAGACGTCACCGGCGGCGATGCCGATCTGATGGCCTACCTGCAACTGATGGTTGGCTACTGCCTGACGGGCGTCACCAGCGAGCACGCGCTGTTCTTCCTGTACGGCACGGGCGCGAACGGCAAGTCGGTGTTCGTCAACGTGCTAACCACCATCCTGGGCGACTACGCGGCCAACGCCCCGATGGACACTTTCATGGAGGCGCGCAATGACCGACACCCCACCGATCTCGCCGGGTTGCGCGGTGCACGATTCGTGTCATCCATCGAAACGGAGCAAGGGCGGCGCTGGAACGAGTCCAAGGTCAAGGCCATCACCGGTGGCGACAAGGTGTCCGCGCGCTTCATGCGCCAGGACTTCTTCGAGTACCTGCCGCAGTTCAAGTTGGTGATCGCGGGCAATCACAAGCCGTCGATCCGCAACGTCGACGAGGCGATGAAGCGTCGACTGCACCTGATCCCGTTCACGGTGACGATCCCGCCCGAGCGCCGCGACGGCAGGCTGACCGAGAAGCTGCTCAAGGAACGCGATGGGATTTTGGCGTGGGCCGTCGAGGGCTGCAGCCGCTGGCAAAGCCAGGGCTTGAAGCCGCCCGCCAGCGTGGTGTCGGCGACCGAGGAGTATTTCGAGGCCGAGGACGCGCTCGGGCAGTGGATCGAAGAACGCTGTCTGCTGGCCAAGTCGCACCGCGAAGGTGTCTCCGAACTGTTCGCCGATTGGCGTGAATGGGCCGAGCGCGCTGGCGAGTACGTGGGCTCGGTCAAACGCTTCTCGGAGCTGATGGCGACTCGCAAGTTCGACAAGTGTCGGCTGACCGGAGGGGCTCGCGCCATCGCGGGCATCGCCCTCAGGCCCAAGCCGTACAGCCACGCCTACCCCTACCGCGATGACTGATCAATCCGGTCGAGTGACGGATTTGACGGGTTTCCTGATTGACGCGCTACACGTGCGCGCACGTAAAGGGCGTTGTCCTGACAAACCGTCGCATCCGTCACTCGCCCACCCAACACGGAGTAAAGACGATGAAAACGACGATCCTCGCCCTCGATCTGGGCACACACACCGGGTGGGCTCTGCAGCACCTGGACGGCACCATCACCAGCGGCACGGAGCACTTCAAGCCGCAGCGATTTGAAGGCGGCGGGATGCGTTTCCTTCGATTCAAGCGCTGGCTCAACGAACTGCTGTCGGTCAGCAATCACATCAACGCGGTGTTCTTCGAGGAAGTTCGGAGGCACGCTGGCGTTGACGCAGCGCACGCCTACGGCGGATTCATGGGGCACCTGACCGCGTGGTGTGAACATCACAACATCCCCTACCAGGGCGTTCCGGTCGGCACGATCAAGAAGCACGCGACCGGCAAGGGCAATGCGAGCAAGGACGAAATGATCACGTCCGTCCGCGAGCGTGGTCACACCCCAGTCGACGACAACGAAGCCGACGCGCTGGCTCTGCTGCACTGGGCAGTCGAGACGCAGGAGGTGTGACGTGAAGGTTTCGACACCCCAATACCGCTGCCCCCTTGGTCGGCTGCAACCCCAGACCACCGATCTGGACGCCATCAAGGAACGTGGCTGGCGTGACCAGCACATCCTGGTGGTCAACGCGTCCGACGACCGTCTGGACTTCATCGAGCGCGAGATCGTGCGACGCATTGGTGAACGCCTGTACGGGCTGGGAGGGACGCGTCATGGCTGAGTGGACAACCGACGACGTGGCAGCACGCTTCGAGGAGGCCGCCACCACCGGACGACGCTTGCCCCCTGTACGTGTGCAGGGCTACTTCAACTGCTGGCCTGCCTTCGTCCGCAAGGAGTGGGAAGCCTTTGCTGCTGACGAGAAGGTGTATCGCCCCTTCCCACCAAGCCCCGAGGCCATCGACCGGATGCTGGAGACGATGCGCTGGGTGCAGTGGCTCGAGGTCGAGCAGCGACACCTCGTGTGGA
TGCGGGCCAAGCGCTACGGCTGGAGGGACATCACCATTCGATTTGCCTGCGACCGCACCACGGCGTGGCGGCGTTGGCAGAGGGCAATGGAGATCGTGGCCACGAACCTCAACAGCGAAGGCGTGCGGTTGCCTTCCAAAAACGTGGGCAATTTAGGGTAATGCTTGCCGCGCTTGTCCCTGCTTTGCCTTGATTGTCCGTTTCGAGGCCCGGCAGCCCTGCAACAAAACAGCCCGGTCGGGGGTAGTATTTCGGCTATCTTCTGGACAGCGGTGACGGTTGAGGCGATGGGCCCAGGCAAAAGGGGTCCTTCCTTCCCGAATCGCAATGCGGGGGGCGCGAGCGCGGCATTCGCCTAGCGTCCGACTGCAAACCAAGGTTTGCAGGGTTTGCAGTTTGCACCCGCACCAGTCCGCACCCATCACGAGCCCGCCCACGGTTTTCCGTCGGCGGGTTTTCTTTTTGAGGAAACGATTCTGAACACGCTTAACGTTGAGTACCGCAAGGTCGAGGCGCTGATCCCCTACGCCCGCAATCCACGCACTCACACCGACGAGCAGGTGGCCAAGATCGCCGCCAGCATCGTCGAGTACGGCTGGACGAATCCGGTGCTGGTGGACGGCGACAACGGGATCATTGCGGGCCACGGTCGTTTGGCCGCCGCGCGCAAGCTCGGGCTGGATCAGGTACCGGTCATCGAACTGGCGCACCTCTCACCCACCCAGAAGCGTGCCTACGTCATCTCCGATAACCGGCTGGCGCTCGACGCCGGTTGGAACGAGGAGATGCTGGCGCTGGAAATGGCCGAGCTGTCCGAGGCCGGGTACGACCTTGCACTGACCGGTTTCGAGGATGCTGAGATCGAGGCCTTGCTCGCTGACGAAGTCGCCTCCGATGCCGCCGACCAAGAGCCCGATGCCGACGAGCCGGACGATGGCGACGATGTGCCGGATAGCCCAGTGGTGCCGGTGTCCCGCACCGGCGATTTCTGGGCCATCGGTACCCACCGTCTGATCTGTGGCGACGCCACCGACCCGACCGTGGTCGCCACTCTAATGCAGGGTGATGCGGCCCGGCTGTGCTTTACATCACCGCCTTACGGCAACCAGCGCGACTACACCTCCGGCGGCATCACCGATTGGGATGGCCTGATGCGCGGTGTGTTCGCCAAGGTGCCAATGGACGACGACGGGCAGGTGCTGGTCAACCTCGGGCTGATCCACCGCGACAACGAAGTCATCCCGTATTGGGATGCGTGGCTGGGCTGGATGCGCACGCAGGGTTGGCGGCGCTTTGCTTGGTACGTCTGGGATCAGGGGCCGGGGATGCCCGGAGACTGGGCTGGTCGTTTTGCGCCGAGTTTCGAGTTCGTCTTTCACTTCAACCGCTCTAGTCGCAAGCCCAACAAGATCGTGCCCTGCAAGCACGCGGGCCAGGAATCGCACTTGCGCGCCGACGGGTCGTCCACGGCCATGCGCGGCAAGGACGGCGAAGTCGGTGGCTGGACGCACAAGGGCCAGCCGACGCAGGACACCCGGATTCCCGACTCGGTGATCCGCGTGATGCGGCACAAGGGCAAGATTGGTCAGGACATCGACCACCCGGCTGTGTTCCCGGTGGCGTTGCCCGAGTTCGTGATCGAGGCCTATACGGACGCAGGCGACATCGTGTTCGAACCTTTTGGCGGCAGCGGTACCACGATGCTGGCCGCGCAGCGCAAGGGTCGTGTGTGCCGCTGCGTGGAGATCGCGCCGGAGTACGTGGACGTCGCCATCAAGCGCTTCCAGCAGAACCACCCCGGCGTGCCCGTCACGCTGCTGGCCACAGGCCAGTCCTTCGACGATGTGGTCAATGAACGTCAGGCCACCACGGAGGTAGAGCAATGACCGCCTCCTGGTTTGCCGACAAGATCGAAAAGTGGCCGACTGCCAAGCTGCTGCCCTATGCCCGCAACGCGCGTACTCACTCGGACGATCAGGTGGCGCAGATCGCCGCGTCGA
TTGCCGAGTTCGGATTCACCAATCCGATCCTGGCGGGCAGCGATGGCGTGATCGTCGCCGGTCACGGACGGCTTGCTGCTGCGCAGAAGCTTGGGCTGGCGGTGGTGCCGGTGGTGGTGCTCGATCATCTGAGCCCGACACAGCGCCGGGCCCTGGTGATCGCAGACAACCGCATCGCCGAGAACGCGGGCTGGGACGATGCGATGCTGCGCATCGAGATCGCATCACTGCAGGACGACGACTTCGACGTGTCGCTGACCGGCTTCGATGCAGATGCGCTGGCCGAATTGATGGCGGGCGACGAGCCGGATGGCGAAGGCGAAACCGATGACGATGCCGTGCCCGAGTTGTCGGAGACGCCGATCTCTCGTCCGGGTGATGTCTGGTCGCTTGGCGGCCACCGGCTGCTGTGCGGGGACTCCACCGTGACTGAGAGCTACGACAGGCTTCTCGATGGCGAGCAGGTCGACATGGTGTTCACCGACCCGCCGTACAACGTGAATTACGCCAACAGCGCCAAGGACAAGATGCGTGGCAAGGACCGCGCGATCCTGAACGACAACCTCGGCGACGGCTTCTACGACTTCCTGTTGGCGGCGCTGACGCCGACCATCGCGCATTGCCGGGGCGGGATCTACGTGGCGATGTCGTCCAGCGAACTGGATGTACTGCAGGCCGCATTCCGCGCCGCCGGTGGCAAGTGGTCGACGTTCATCATCTGGGCCAAGAACACCTTCACGCTGGGCCGTGCCGATTACCAGCGCCAGTACGAGCCGATCCTGTACGGATGGCCAGAGGGCGCGCAGCGTCACTGGTGCGGCGACCGCGACCAGGGCGACGTCTGGAACATCAAGAAGCCGCAGAAGAACGACCTGCATCCGACGATGAAGCCGGTGGAGTTGGTCGAGCGCGCGATCCGCAATTCGAGCCGACCGGGCAACGTGGTGCTCGACCCGTTCGGGGGCTCCGGCACGACGCTGATTGCCGCCGAAAAGTCAGGACGGCTGGCACGGCTGATCGAACTCGACCCTAAGTACGCGGACGTGATCGTGCGCCGCTGGCAGGAATGGACTGGCAAGCAAGCCACCCGTGAGTCGGATGGCGCGCTGTTCGATGATCAGGCGGCGATCGACTCTTCCGCGATCTCGCAATGAATCACGAACCCCGTCAGGTAAGGCAGGCCGCGCGGGATGCCGTACTGCTTGCTGGTCTGGCGGCCAATCGTCCAGCCCATCCACTGTTGGGTGGCGGCGTTGATCGCGTCCGCCAGGGTCTGGCCCCGGTACAGCCCGTTTTGCAAATCGTCCGCAAAGTGGCGGCCGTGGCGACTGTCGAGGAAGACGCGGACTGATTCGAGGGGCTGACCGGTAGCGTCGGAGATGGCGGTCATCGCCAGGGGCCACGCGGTGCTGGCGTGTTCGTTCATCGTGCCCCAAAAGCCCCAGGCATCGTTCTGGGTGGCGGGCATTTGCTGGTTGGTGTTCATCTCTGGCTCCTTGGGGTTGATCGTTGCGACACCCGTAGTAACGCGCTGTTCGATTGAGAAGCCAAGCTGTTCTTGGCCTCTTTCTCAATCAATTTCGATTACCCGAGACGGGCCACGTACCGGGCGTAGTCGCCGCCCTCTGGATTCACGTAAAGGTAGGGGCGACCCGGTGCGGTGACCTCGACGCAAAGATAGCCGTCGCCGGTGCCGCCACCTTTGCCGCGCAGCCAGTCGCGCGATACCAACAGGCTGCGGGCAAAGGCATCGAACTCGTCGACGGTCAGTTCCTTGGTCTCGGTGACATAGACCTTGGTCTGACCCTGGCCGCCAACTTCGTCCAAGTCGGCAGGCTTACGGGCAAACGGCAATCGGACGCTCAACTCCTCGACCTGGAAGGTGGTGTCGCCAAACTGCAGGGTGCGCGGGGTGCGTTCGATGGTGATGGTCATGGTGCTCATGAATGTTCTCCTGGGTGTTGGCGTTGCGATCAGGCTTCTGCGGCGATCCGGTAGAC
CCGCTCGCTGCCCTGGGCCTTGTCCGAGACGATGGTCAGCCCGAGCTTCTTCTTGAAGGCACCGGCAAAGGTGCCGCGCACCGTGTGCGCCTGCCAGCCGGTGGTCTCGCAGATCTGCTGCACCGTTGCCCCTTCGGGGCGCTGCAGCATCTGGATCACCGTGGCCTGCTTGCTGTTCTCGCGGGTGCGAGGTTTGGCGGCCGCCTTTTCTTGCGCCCACGCGGCCTCGGCTGCCGTCACGGCTGCGTCGAGTTCAGGGTCTGCGGCCACTGGCGCAGGCGTTGGCCGGGCGCGCCCCATCGCGTCGTAGCCCTCGGCGGCGACGAACCAGTGGGTGCCGTCGGAGGTGATCAGCGCGCGGTTGAACAGGCCGTCGAGCACCTTCTTGCGTGCGCCGCCTTTGATGTTGTCGGGGAACCAGTCGATCTTGCCGTCGGTGTGTTCGAGGGCGTAAGCCAGGATCGCGTGCTGGGCCGGGGTCAGTTGGGTGGTGGTCATTTGCTTCTCCTTGTGCAAGGGGTTGATGGGGTGACGTGATGAACGCGCTGTTCGGGAGTGAAGCCAAGCGTTTTCTGCTTGGCTTCGAAGGTTCTTGATCAGCTGTTGGCCTTGTCCGACTTCGTCGCCTTGCGGCCTTGTTCGACGCCTGCGTTGAACGCGGCCTCCAGGGCGTCGCGCAGGCACCAGACCGCCACGTCGTGGAAGTCGAGGCTGTCTGACTTGCGGGTTTCCAGGGTTTCGATGCCCAGCTTGTTTTGTGCGATCTGGGTCAGGAGTTGTTCGAACTTGCTCATTGCTGCTTCCTTTGATGGTGTTGATGACGTCCGTATGAACGCGCTGTTCCAGAGAGAAGCCAAGCTGATTTCGAGTGAAGGTCGAAAAAATGATTGAAGGGGTAACCGGTTCTCAAAATGGGCATTTCGATTCGCGCTTACGCCCGTCACCGTGGTGTGACCGACACCGCTGTTCACAAGGCAATTCGCGCAGGTCGGATCACGCCGGAGGCTGACGGCACCATTGATGCCGACCGTGCTGATCGCGAGTGGGCTCGCAACTCCGATGTGCCGAAGACCGGTACGCGGGCCAAGGCCGCAAAGGTCGCCGTGCCGGAAGGCGGTACGGGTGTTGGCGGTGATGGGCCCGCCGCATTACCCGCTGGCGGCGCGTCCTTACTTCAGGCGCGCACGGTCAACGAGGTCGTCAAGGCGCAGACGAACAAGGTGCGTCTGGCCCGACTGAAGGGCGAGTTGGTGGATCGGCCGCAGGCCATCGCCCACGTCTTCAAGTTGGCGCGCTCCGAGCGTGATGCGTGGCTGAACTGGCCCGCGCGCATCTCTGCGCAGATGGCGGCCAAGCTCAATATCGATCCGCACACGATGCACGTCGCCCTGGAGGCGGCGATACGTGAGCACCTGCAGGAACTGGGCGAACTCCGGCCCCGGGTGGACTGATGCTGAATGTTGAATACGAAGGCGCTGCCGAAATCGAGCGCGCGTGGCGTGAAGGGCTGACACCTGATCCTCTGCTCTCGGTCTCTGAATGGTCGGATCGCCACAGGATGCTATCGAGCAAGGCGTCCGCTGAGCCTGGGCGCTGGCGCACCAGCCGCACGCCGTACCTGAAGGCCATCATGGACTGCCTGTCGCCGACCTCGCCGGTCGAGCGCGTGGTGTTCATGAAAGCCGCACAGCTCGGTGCGACTGAAATGGGCTCGAACTGGATTGGCTATGTGATTCACCACGCACCGGGGCCGATGATGGCGGTGTGGCCAACGGTGGATATGGCTAAGCGCAATTCCAAGCAGCGGATCGATCCGTTGATCGAGGAGTCGGCGGCACTGAGCGAATTGATCTCCCCAGCACGGTCACGCGACTCGGGCAACACCATTCTGGCCAAGGAGTTCCGGGGCGGCGTGCTGGTGATGACCGGGGCGAACAGCGCGGTGGGCTTGCGCTCGATGCCGGTGCGCTACCTGTTCCTCGATGAGGTTGACGGGTATC
CGCTGGACGTCGAGGGTGAAGGTGATGCGATCTCGCTGGCCGAGGCGCGCACGCGAACCTTTGCCCGGCGCAAGATCTTCATCGTGTCGACGCCGACGATCTCGGGGGCGAGCGCCATCGAACGCGAGTACGAGGCCAGTGACCAACGTCGCTACTTCTTGCCTTGTCCGCACTGCTCGCATCGCCAATGGCTGCGCTTCGAGCAGTTGCGATGGGAAAAGGGGCAACCGGACACGGCGTCCTACATCTGCGAGTCCTGCGATAAGTCGATTGCCGAGCACCACAAGACCTGGATGCTGGAGCACGGTGAGTGGCGCGCGATGATCAGCGACGGCACGGGCAAGACAGCGGGGTTTCACCTGTCGTCGCTTTACAGCCCGGTTGGCTGGCGCGGTTGGCGCGACATTGCTGCCGCGTGGGAAAGCTCTGTGAACAAGGAATCGGGGTCGGCGGCCGCCATCAAGACCTTCAAAAACACCGAACTGGGTGAAACCTGGGTTGAGGAAGGCGAAGCGCCAGATTGGCAACGGCTGGTCGAACGCCGCGAGGACTACCGGGTTGGCACGGTGCCGCCGGGTGGGTTGCTCCTGGTGGGCGCTGCCGACGTGCAGAAGGATCGCATCGAGGCGTCCATCTGGGCCTTCGGGCGCGGCAAGGAGTCCTGGTTGGTCGAACACCGCGTGCTGATGGGCGACACCGCCCGAGACGCCGTGTGGAAGCGACTCGCCGAGTTGCTCGCCGAAAACTGGACGCACGCCTCGGGCGCGGCGATGCCGCTGGCCCGTTTCGCTCTGGACACCGGCTTTGCGACGCAGGAGGCCTACGCCTTCGTGCGGGCCTGCCGTGACCCGCGCGTGATGCCGGTCAAGGGGGTGCCGCGCGGCGCGGCCCTGATCGGCACGCCGACGGCCATCGATGTTTCGCAGGGCGGCAAGAAGCTGCGCCGGGGCATCAAGGTGTTCACGGTGGCGGTTGGCATCGCCAAGCTGGAGTTCTACAACAACCTGCGCAAGGGCGCGGACGTCAGCGAGGACGGCGTGACCACCGTCTACCCGACGGGGTTCGTTCACTTGCCAAAGATTGACGCGGAGTTCATTCAGCAGCTCTGCGCCGAACAGTTGATTACCCGTCGCGACCGCAACGGCTTCCCGGTGCGCGAATGGCAAAAGATGCGCGAGCGCAATGAAGCGCTCGATTGCTACGTGTACGCCCGCGCGGCCGCATCGGCGGCGGGCCTGGATCGCTTCGAGGAACGCCACTGGCGCGAACTGGAACGCCAACTCGGGATGGAACGGCCACCGGATGAGCCACCCCCGATTCAAGCATTCGACCCAAACGAGGCCACCCAACGCGGTGGCCTCTCTGTTTCTGCAAACCCACCACGGCGGCGCGTCATCAAGAGCCGCTGGTTGTCCTGATTTTCAGAGGAGTTTTCATGAGTCTTGCCACCCGTATCGAGAGCCTGGTCATCCGGGTTGCCCAGGAGTTCAACGACGTCCGCGCGACGGCAGGCAGTCTGGCCAGCCTGTCCACCAACGACAAGTCGAGTCTGGTCGCCGCCATCAACGAGCTCAAGGCAGCGGTTCTGTCCGCGATGGCCATCGATGACAACCAGATCGCCACCACCAGCACCTACTCGTCGAACAAGATCGTGTCGCTGCTGGACGCGCTCAAGACCGACATCCTGGGCGGAGCCGATGCTGCCTACGACACCCTGGTGGAAATCCAGCAGGCGCTGCAGAGCGGTACCAGCGGCCTGGACGCGATTCTGGCTGCGGTCAATCTCCGTGTCCGCTTCGATGCGGCGCAGACCCTGACCGTGGCCGAGCAACTGCAAGCACGTACCAACATTGGTGCGGTCGCTGTCAGTGATGTCGGCAACACCGACACCGATTTCGTCGTGATCTTTGACGGCGCGCTGGCCTGATGAGCCTCGCTTCCAGCATCGCCGCTTTGGCGGCGCGCATCGGCTTCGAGGTCAAAACCAAGATCGACGCCAC
GCATCCCGGCATTGCCCGGGTGTGGGTCAGCTTCGGCTACGTGGGCGGTCAGGTCGTGATCGCCAGCGCGCACAACGTCGCCAGCGTGGTGCGCACGGCGGCGGGCCGGTACCGCGTGCATTTCGCTGTGGCGATGCCGGATGCGAATTACTGCTGGACGGCGCTCGCGCGCAGCAGCACCAACACCGGTCAGCAGCGCTTGGCCCTGGTACGTGCCAGCTCCGACCTGAAGACCGCGCAGTACGTCGACGTCTCGTGTGCGACGGCCGCGTCGTCGTTTGACGACTCCTCTGAAATCAACCTCGTGGTGTACCGCTGATGGCCTACACAGAAGCCCAACTCCAGGCATTGGAGACCGCGCTCGCCAAGGGCGAACACCGCGTCAGCTTCGGCGACAAGACCGTCGAGTACCGCTCGGTCGATGAACTGAAAGCTGCGATCCGCGAAGTCAAGCGCGGCATCCTGGAGCAGGCAGCCGCCACCGGACTATGGCCGGGTGCGCCGCGCCAGATCCGGGTCACGACCTCGAAGGGGTTCTGATGGCCTGGTATTCGAAGATCCGAAGCCTGTTCGGCCAGCAACCCGTCCACGAAGCGGCTGGCCGTGGTCGCCGCTCGTTGGCTTGGATGCCCGGCAACCCGGGCGCGGTCGCCGCGATGCTGGCGACCAACACCGAACTGCGCATCAAGAGCCGCGACCTCGTGCGCCGCAACGCGTGGGCGCAAGCCGGTATCGAGGCCTTCGTGTCCAACGCGGTCGGCACTGGCATCAAGCCGCAGAGTCTTGCTGCAGACGAGCGCTTCAAGACCGACGTGCAGGCGCTGTGGCGTGACTGGACAGAAGAAGCCGACGCCGCAGGACAGACCGATTTCTACGGCCTGCAGGCATTGGCCTGTCGCGCGATGCTCGAAGGCGGTGAATGCCTGATCCGGCTGCGCCCGCGCCGCCCGGAGGACGGACTGGTCGTTCCTCTGCAGCTTCAGTTGCTGGAGCCCGAGCATCTGCCGATCAGCCTCAACCTCGATCTGCCTTCGGGCAACGTGGTGCGCTCTGGCATCGAATTCGACAGCCTCGGGCGGCGCGTCGCTTACCACCTGTACCGCTCGCACCCCGAAGACGGTCGGCTGGCTCCGATGTCGGGCCAGGGCGGGATGGACACGGTGCGCATCGATGCGAAGGAAATCATCCACCTGTTCCGCGTCCTGCGTCCCGGCCAGATCCGGGGCGAGCCGTGGTTGTCGCGGGCCCTGGTCAAGCTCAACGAACTTGACCAGTACGACGACGCAGAACTGGTGCGCAAGAAGACCGCCGCGATGTTCGCCGGGTTCGTGACACGGCAGAACCCGGAGGACAACCTGATGGGTGAAGGTGCGGCCGATGGCGATGGCATTGCGCTCGCCGGGCTGGAACCGGGCACTTTGCAGATTCTGGAGCCCGGCGAGGACATCAAGTTCTCCGACCCGGCCGACGTCGGTGGCTCGTATGGCGAGTTCCTGCGCACGCAGTTCCGCGCGGTCGCCGCTGCCATCGGTGTCACCTACGAGCAGTTGACCGGCGACCTCACAGGCGTGAACTACTCGTCCATCCGCGCCGGGATGCTGGAGTTTCGGCGTCGCTGCGAGATGGTGCAGCACGGGGTGCTTGTGCATCAGATGTGCCGTCCGGTTTGGGCCGCGTGGATGAAGCAGGCAGTGCTCGCCGGTGCCATCGATGCTCCCGGCTTCGCGCGTGGCGGCCCAGCCCGTCGCCGCCGGTACCTGCAGGTGAAGTGGATTCCACAGGGCTGGCAGTGGGTCGATCCTGAGAAGGAGTTCAAGGCCATGCTGCTGGCCATCAGGGCGGGACTGATGAGCCGCTCGGAAGCCATTTCCGCCTTTGGCTACGACGCCGAGGACGTTGACCGCGAGATCGCCGCCGACAACCAGCGCGCCGACGACCTGGGGTTGATCTTCGACTCCGACCCGCGCCGCACCTCCAAGGACGGCGGAAGCGCCGAGCCG
AACAAGAACGCTGCCGACACCACGCAAACCGGCAGCTCATCGTCTGCCTGAAGGATTTCCATGACCCTGTTGCCCCATTTGGCGGCGCGCCTCTACGGTGTGCCGCTGGCGATCCATCGCCCAAAACTTGACGTGATCCTGGCCGTGCTCGGCCCCCGGATCGGCTTGGCTGATTTGGCTGCACCCTCGGGCTTCACGCCGCCCGCACGTCCCGCATCCACCCAGACGACGAAGGTCGCGGTCATCCCCATCCACGGCACGCTGGTGCGCCGCACAGTGGGCCTGGAAGCCGAATCCGGCTTGACCAGCTACGCAGGGCTGACCGCGCAGTTGGACGCCGCGCTGGCCAGCCCGGATGTCGCTGCCATCCTGCTCGATGTCGACTCACCGGGTGGCGAGTCGGGCGGCGTGTTCGATCTGGCCGACCGCATCCGTGCGGCTGCTAAGACGAAGCCGGTCTGGGCTGTAGCCAATGACATGGCGTTCTCGGCAGCTTACGCCCTGGCGTCTGCGGCCAGCAAGGTGTTCGTGTCGCGCACCGGCGGCGTCGGCTCGATTGGCGTCATTGCGATGCACGTCGACCAGTCCGAGAAGGATGCGCAGGACGGCGTTCGGTACACGGCGGTCTTTGCGGGCGACCGCAAGAACGATCTGAACCCACACGAGCCGATTTCCAGCGAAGCCCACGCCTTTCTCAAGGGTGAGGTGAATCGCGTCTACGGCCTGTTCGTCGAGACGGTGGCCCGCAACCGTGGCATCGAGGCATCTGCCGTGCGCGACACCGAGGCGGGGCTGTTCTTCGGGCAGGCCGCCGTGGCTATCGGGTTGGCCGATGCCATCGGCACCTTCGACGACGCCCTTGCGCAGCTTTGCGAATCCGTTTCCCCACTCCCGAAGTTGGCGGCAAGCCACTCCGGTCTTTTTAGCAACCCCCAGATGGAGTCATCAATGAATGATCGAACCGACCCCGCTGCTCCTGATCGGCTTGCTGCTGATCCTGCTGGCAGTCCTTCTCAACCGGCGGCCGCCACCGCCATGACCGTGGCTGACGCGATTGAGGTCGCCCAGACCTGCACCCTGGCCGGGCGCACCGACCTGATCGCGGGCTTCCTCGAAGCGAAGGCACCACCCGCCAAGGTACGCAGCCAGTTGCTGGCCACCCAGGCCGAAGCCAGTCCCGAAATCGTCAGCCGCATCGACCCGCAGTCGGCCATGTCGGCGAGTAGCACTGGCCATCCTGCCTCTTCCCACAACCCTCTGATCCAGGCCGTCAAAAGTCGCCTGGGCACAAAGTAACCCAAAAAGGAGCATCCCGTGCCCGCAATGCAAGAACCAATCAACCTCGGCGACCTCCTGAAGTACGAGGCGCCCAATCTCTATTCGCGCGACCGCGTGACCGTGGCAGCTGGCCAGACCTTGCCGCTGGGTACGGTGCTCGGGCAGATCACGGCGACGGGCAAGGTCAAGCAGATCGACCCGTCGGCCACCGATGGCAGCCAGTACTCCGCTGGTGTGCTGATGCAGGACGCCGATGCTGCTCTCGCCGACCGCAACGACGGGCTGATGGTGGCGCGTCACGCCATCGTGTCAGACCACGCACTGCATTGGCCCACCGGCATCACGACTGCGGAGCAGCAAGCAGCGATCCAACAACTCAAAGCACTGGGCGTCCTGGTGCGTATCGGCGCCTAACGCCAAGGAGACTCAATATGCAAAACCCATTCATCAGTCCGGCATTTTCGATGGCATCAATGACTGCAGCCATCAACTTGATCCCCAACCGCTACGGACGCCTGGAGGAGTTGAATCTGTTTCCGCCCAAGCCGGTTCGAACGCGCCAGGTGATTGTTGAAGAACGCGCCGGTGTCCTGAACCTCCTTCCGACCCAGCCGCCAGGCTCTCCGGGAACAGTGAATGTGCGTGGCAAGCGAACCGTCCGGTCCTTCGTCGTTCCGCACATTCCGCACGACGACGTTGTGTTGCCCGAAGAGGTTCAAGGT
CTACGTGCTTTTGGCAGCGAAACCGAAATGGAGTCGATTGCCGGAGTGCTGGCCCAACACTTAGAGACGATGCGCAACAAGCACGCCATCACCCTAGAGCACTTGCGTATGGGGGCGTTGAAAGGCGAGATTCTCGACGCCGACGGCAGCCGTATCTACAACCTGTTTGACGAGTTTGGCATCGATCAACAGAGTGTGGACTTCGAAATCAGCAGCCCGACTACTGGCACTGATGTCAAGGGCAAGTGCACTGATGTGTTGGGCATCATCGAAGAAGCCCTTCTCGGCGAGTTCATGACGGGAGTCCACTGCTTGTGTTCTCCAGAGTTTTTCAAGGCATTGACCGGCCACAAGGATGTCAAGACTGCCTTCACGAACTGGCAGCAAGGCGCCGTCCTTATCAATGATGTTCGCCGTGGCTTCACTTTTGGCGGCATCACTTTCGAGGAGTACCGAGGTAAGGCGACTGATGTCAACAAGACGGTTCGTCGCTTCATCGCTGCTGGCGAAGCACATGCGTTCCCTCTTGGCACTATCGACACCTTCGGAACTTACTTTGCACCGGCCGACTTCAACGAGACTGTCAACACGATGGGCCAGCCGCTTTATGCGAAGCAGGAGCCGCGCAAATTCGACAGGGGCACAGATCTGCACACGCAGGCCAACCCGCTACCGATGTGCCATCGTCCCGGGGTTCTGGTCAGGCTCGTCATGGGTGGTGGCGTATGAGTTTGGTCGCCCAGATCTATGAGTCGGCCGCGAACGCTGGGCTGCTGAAGGAATGCCTTTGGTATCCGTCGAACGGTGCGCCATCGCAACTACATCAGATCGGCTTTGCCGCGCCCGATGAATCACTGCTCGATGGCCTGGCCCTGAGCACCGACTACGAGATGACCTACCCGGTCACGGCATTCGGGGGTCTTGCAGTCCGCGAGGTTGTCGAAATCGGTGGCACGTCCTTCCAGGTGCGAGACATCCGATCGTTAAGCGACGGCTCCGAGATCCGCGCCAAGCTCACCCGGCTGTAAACCCATGGCAGATAACTCGATCCGCGAGCGGATTCTGCTGGCGGTGATGGCGGCTGCCCGTCCGGCGGTCGAAGGTCTCGGGGCCACTTTGCACCGGTCGCCCACGGTGGCCATCAGCCGCGAACTTTGCCCGGCGCTCGCGGTGTTTCCCGAGTCGGAGTCCATCACTGAGCGCGCCAACGACCGCGTCACACGCGAACTGACCGTTCGCGTTGTGGCTCTGGCTCGGGCCGTTCCACCCGCGTCCCCCGAAACCGAGGCCGACCGTCTGCTCACCGCTGCCCACGCTGCCTTGTTCGGGGACGGCACGTTCGGTGGGTTGGCGCTGGGCATCCGTGAACAAGAGAGCGAGTGGGAGGTCGAGGACGCCGACGCGGTGGCCGTGGCCCTCCCGGCGCGCTATCGGCTGACGTACCGGACGCTGGCCAATGACCTTTCAACTCTTGGATGACACCTATGACCCAACTTGTCCTGACGCGCCCGCACACCCACGCGGGCAAGACCTATGGCGTCGGTGACCGGATCGAGATCGACGCGACATCAGCCGACTGGCTGATCGCGCACGACATCGCCACGCCGGAGCCGACCGCCCCAACTGCTGAACCCGTCCCCGAACCCAAACCCCTCCAACGCAAGGAACCCAAGCAATGAGCACCTATGCCAGTTTTCAAGGCCGCGTCTTCCTCGGCAAGCGCGACACCGACGGCCTTCCCATCGAAGTGCGCTCGCCCGGCAACGTCGCAGAGCTGAAGCTCTCCCTCAAGACCGACGTCCTGGAGCATTACGAGAGCCAGACCGGCCAGCGCTCGCTGGATCACCGGATGGTCAAGCAGAAGTCCGCCACCGTGAACCTCACCATCGAGGAATTCACCAAGGAGAATCTCGCGCTGGCCCTGTACGGCAACCACGTCGTCGGCACGCCGGGCACGGTCACCGCCGAGCCAGTGGGCGGTGCCACGCCGATTGCGGGCG
ACCGCTACTTCCTTGCCCACCCGAAGGTATCGTCCTTGGTCGTGACGGATTCGGCTGGCACGCCCGCGACCCTGGCCTTGGGCACGAACTACACGGCTGATCCCGACTTCGGTGCCCTCCAGTTTCTGGATACCACCGGCTTCACTGCGCCGTTCAAGGCCAGTTACGCCTACGGTGTGGCCACCGAGATCGGCATCTTCACGCAGGCGCTGCCGGAACGCTTCCTGCGGCTCGAAGGCATCAACACGGCCCAGGGCAATGCCAAGGTGCTGGTCGAGCTCTACCGCGTGGCATTCGATCCGCTGAAGGAAATCTCCTTCATCTCGGACGAGTACAACAAATTCGAGCTGGAGGGATCGCTGCTGGCCGACACCACCAAGCCCTTCGACGCGGTGCTGGGCCAGTTCGGCCGCATCGTGCAACTGTGATGGGTGCCGCCATGAGTGATCTGGACACCCTGATTCCGCAGGCGGTCGAACTGGTGATCGACGGTGAGCCGCTGGCCATCAAACCGCTGAAGGTCGGGCAGATGCCCGGTTTTCTGCGAGCGATGTCGCCGGTGATGCAGCAGCTCACTGCCTCCAACATCGACTGGCTGGCGTTGTTCGGCGAGCGCGGCGACGACCTGCTGTCGGCCATCGCCATTGCCGTCGGCAAGCCTCGGGCGTGGGTCGATGAGCTGGCTGCCGACGAGGCCATCCTGCTGGCGGCCAAGGTGATCGAGGTGAACGCCGATTTTTTTACCCAGACGGTGATTCCGAAGCTCGACGGGCTGTTCGGCCAAGTGAAGCTGCCGCCCATCGTGAAAGCGGCGGCTGGTTCGATGCCGTCCAGCACCTGATCGAGCACGGTCACCGCTTGCCCGACATCCTCGACTACACGTTGGCGCAGGTGCGCGGCTTCGTCGTAGCGACGGCGCGCACCGATGCGGCCCGCGATGCACGGCTGCTGTCCGTGATTGCCATCGGCACGCGCAGCGATGCCCGCCAGCTCGACCAAACCCTCGACCGACTTACTGACAAGGCCACCGACCGTGCCTGATGACCATGCGCATTTCCGTCCAGATCGATAGCGCCGCAGCCCAGGCGCAATTGCGCCGCTGGGGCGGCGAATTCCGCGACAAGGTCAAGAAGGCGGTGTCGCGGGCGATTGCCAGCGAGGCGGTCGAACTCAAGCAGGACGTGCGCAGCCACGTCGCCAGCCAGATGGCCGTGGTCAAGAAGTCCTTCCTCAAGGGCTTCACCGCCAAGGTGCTGGACAAAGACCTGAACCGACTGCCCGCGCTGTACGTGGGTTCGCGCATTCCGTGGTCGGCGATGCACGAGACCGGCGGCCAGATTGCCGGGCGGATGCTGATTCCACTGAACGGTCGGGTGGGCCGCAAGCGCTTCAAGGCGCAGGTGGCCGAGCTGATGCGCGGCGGCAATGCCTATTTCATCAAGAACGCGAAGGGAAACATCGTCCTGATGGCCGAGAACATCAAAGAGCACGACCGGCCACTGGCGGGCTTCAAGCGCCGCTACCGCAAGGCAGAGGGCATCAAGCGCCTCAAGCGCGGCGCGGACATCCCGATTGCCGTCCTAGTGCCCAAGGTCGTACTCAAGAAGCGCCTCGATGTCGAGCGGCTGGTCGCGAGTCGCATCCCGCGTCTGGCGGCGGCCGTCGAGAATCAGATCAGTACGGTGGATTGATTCATGGCCAAGCGAATTTCCATCCTCGTCGCGCTCGAAGGGGCCGACGAGGGGCTCAAACGCGCCATCACGTCGGCCGAGCGCAGTCTCGGTGAGCTGTCGACCACCGCCAAGACCGCCGGAGCCAAGGCTGCCGCCGGAATGGCCGAGGTCAAGGCCGGGATGTCGGCCTTCGGCGATCAGGTGGCGACGGCCAAGACGCAATTGCTGGCCTTCCTATCGATCAGCTGGGCGGCAGGAAAGGTGCAAGAGATCGTCCAGATCGCCGACGCATGGAACATGATGTCGGCGCGCTTGAAGTTGGCGACG
GCGGGACAGCGTGAATTCACGACCGCGCAAGCGGCCCTGTTCGACATCGCCCAGCGCATCGGTGTGCCGATTCAGGAAACGGCCACGCTGTACGGCAAGCTCCAGCAGGCAATTCGGATGCTGGGTGGCGAGCAGAAGGACGCGCTCACGATCGCCGAGAGCATTTCGCAGGCACTGCGCCTGTCGGGCGCTTCGGCCACCGAGGCGCAGTCCTCTTTGCTGCAATTCGGGCAGGCGCTCGCCTCTGGTGTGCTGCGAGGCGAGGAATTCAACTCCGTCGTCGAAAACAGCCCCCGTCTGGCGCAGGCACTGGCCGATGGCCTGAATGTGCCCATCGGGCGACTGCGCAAGCTGGCCGAAGAAGGCCGCCTGACCGCTGACGTGGTGGTCAACGCGCTGATGAGCCAGAAGGACAAGCTGGCCAGCGAGTACGCCCAACTGCCGCAGACGGTGAGCCAGGCCTTCGAGCGCCTGCGCAATGCCTTCGGGCAGTGGATCAACCGGGTCGATGAATCGACGGGTTTGACCAAGAAGCTGGCCGAGGCTCTGACCATTCTCGCCAACAACCTCGACACGGTCATGCAGTGGTTGAAGCGCATCGCCGAAGTCGGTCTGGCGGTGCTGATCTACCGCCTGATCCCGGCGCTCATCACCGCGTGGCAGACCGCCGGTGCGGCGGCCGTCACGGCCGCCAGTGCCACCGCTGCGGCGTGGACGACGGCCAACCTGTCGGTGTCGGCCGCCGTGGCCAGCGTCGGCTTGCTCAAGACGGCATTCGCCGTGCTGGGTGCCTTCCTGGTCGGCTGGGAGATCGGCACGTGGCTGTCGGAGAAGTTCGAGATCGTCCGCAAGGCGGGCATCTTCATGGTCGAGATGCTGGTCAAGGCGGTCGAGCAGTTGCGCTACCGCTGGGAGGCATTCGCCGCCATCTTCACCTCGGACACGATTGCCGAGGCGACCCAGCGCCACGAGGCCCGTCTCGCGGAGATGAACCAGATCTTCGCGCAGATGTACGCCGACGCGACCAAGGGGGCGGATGCTGCCAAGGGCGCGATGAATACCGCCGCGACGGCTGCGGAGGAAATCGCCAAGCGGCTCGAAGCCGTGCGTCAGGGCACGCAGGAGGCAGTCGGGCGCGGTATCGAGGCTGTCCACAGCGCCCTGGAGAAGCTGAAATCCCGCCTCGGTGAGGTTGAGCAGGCTGTCGGCAAGGCCAATCAGACAGTCAACGACGCCACCGCCAAAATGGCCGAGGCCTATAAGGGCCTGACGTCCATCGTTGAGGCCAACCTGCTGCGCCAGATCGAAGCGGTCAAGGCGCGCTATCAGCAGGAACAGTCGGCGCTGGAGACATCCAAGCAGTCCGAAGCGGCGCTGATCACCAAGTCGACACAGTTGCTGACGGAAGCCCTCACGCAGCAGACCACGTTGCGGCGGCAGTCCACGACAGACACGCTGAAGCTCATTGACGATGAGTCCAAGGCGCGGATCGAGTCGGCCCGCCGCCAGGGTCAGACGGAAGAAGAGCGCCGCGCCAACGTCCAGCGGGTCGAAAACGACATCCTGGCCACCAAGCGCCAGACGATGACGCAGGCGCTGGCCGAGTACCGGCAGCACATCGATGCGCTCAACGCAGAGGCCAACCGGCATCTGACTGAGATCAAGCGCATCGAGGAGGAGAAGCGCCAGCTCTCGATGACGACCGAGGAACGTGTCCGCGACATCCGTCGGCAGGGCATGACCGATTTCGAGGCGACGGAAGATCGCAAGCGCCAGATCGCCGAGTACCAGGGGAAGGCACGTGAGGCGCTGGCCAACGGCGAGTTCGAGCAGGCTCGGCAACTCGCCCAGAAAGCGATGGACTTGGCCGCACAGGTGGCCAGCTCGCAAACCAGTGAAGCCAAGCGCGGCGAAGATGCCCGCAAGCAGTCCGAGCAGGCGGTTTCGCAGGTCACCCAGCTCGAATCGCAGTCACGCGATGCCTATCGCAAGCAGGAATA
CGCGCAAGCCGAAGCCCTGATGCGCCAAGCGGACGCATTGCGCGCCGAACTGGCCCAGAAGACCAAGGATGCCGACGCACAGATCGCACAGGGCAAGGATGGCGTCAATCAAGCCATCCAGCGCATCCGCGAGTCCGAGGAGATTCTCAACAAGACCCTGGATGCCGAAGCCAAGGCGCACCAGACCGCCGCGCAGTCGGCATTGACCGCGCGCGACCAGATCCAGCAGACCCTCACCCAGACCGAAACCCAGATCGACCAAATCACAGCCAAGCTAAAAGACGGTCTGAAGGTCACGCTGGATGCCGACACGACCCGCTTCGACAAAGCCATCGCTGATCTCGACAAGGCCCTGGCAGAAAAAGAGTACCTGCTCAAGATTCAGGCCGACTTGCAGGAGGCCGAGAAGAAGCTGCAGCAGTACGAACAACTGCTGAAAGAGGGCAAGACACTCCCGGTCGATGCCGACGTGTCCAAGGCCAAGGAGGCGCTGGACAAACTCAAGACCTACGCCGACCAGAACTCGCAGTTCGAACTGAAGGTGGCGACCGAGAAGGCGCAGGCCGCGATCACCAAAGTCGAAGGGATGATCAAGGCGCTGGACCGCATCCAGACCGAGTCCCGGCATCAGGTCAGCACCAATGCCGACGCAGCCCGCTCGGAAATCATGAGTCTCAACTGGGCCAACACCTCGAGCACGCACACGATCTATGTGCGCAAGGTAGAGGCAAACGCGACTGGCGGTTTGGTGGGCGGTGGCGTGCGCCGCTACGCCGATGGCGGCGCTGTGGCCCCGGCCTTTCCTCGGATGAGTGGTGGCTCGGTTCCGGGCTCGGGCCACCACGACACCGTGCCACGCACCCTGGATGCCGGTGCCTTCGTGATTCGCAAGGCGGCGGTGCAGAAGTACGGCGGCGGCGCGCTCTCGCGTCTGGCCAATGGCGTGGCACGGTTTGCCACTGGCGGCGCGGTGATGCTGGGTGGCGGCAAGCGCCCATCCGGCAACGATGCTGATGGCACGCCCAGCACACCAAAGAAGAACCGGGAGGCAGTCGAGGCGATGAAGATGATCGACCTCGGCCTGCAGGGGATGAACGAGTACACCAATTGGCTCCAGTGGAACTACGGTGCCTCGGTCAGTCTGGATATGCGTAGCAAGACGATGGATAGCTACGGCAAGCAGGCCCAACAGGATCGGCGCGCGCTGGAGGACTTCATCAGCCGCAAGACGCTCACCGGCAACGAGCGCCAGAACCTGGAGCGCATCAAGCAGACGTGGCGGCAGGCAATGGCCCAGCCGCTGCTTTGGGGCAAAGACCTAGAGCGCGAGCTGATCGACTATATGGAGCAGAACCAGGGCGAGTTCTACCGTCGCGGTGGCATGGCCAAGTCCGACACCGTCCCGGCGATGCTCACGCCGGGCGAGTTCGTCGTGAACAAGGATGCCGTTTCCCGCTACGGCGCTGGCTTCTTCGAAGCGATCAACAACCTGTCTGCCCCGGCACAAGCTCTGGCCGGTCGCGCGCTTGCGGGCGTTCAGGGCTTCGCCACCGGCGGTCTGGTGCAGCCAAGTGGCTCGCGGTTGGCCCGACCGGTGTTGGCGGCCGATGCCGGGCCCAGCCGCACGGTACGCGTGGAACTGTCCTCGGGGCAGCAGAAGGTCAATGCCACCGTCGACGCACGAGACGAGTCTCGTCTGCTGCAACTTCTGGACGCTGCCCGCGCCCGCACTGCCTGAAGGATTCCCGATGCAACTGACGAACCTCGATGCCGGGGTGGCTTTGCCATTGCCTGACGATTTGCTGTGGAGTGATGAGCACGCGTGGTCGCCCGCCGTGGCGACCACGTCTTACCTCATCACCGGAGCCTTGCTTATCCAGTCTGCCACCCGGCAAGCCGGTCGCCCCATCACGCTGGTGGGCGCACCCGATATGGCCTGGGTGACGCGGGCCACGGTCGAGCAACTGCAGGCCTGGGCCGCGCTTCCAGTGGGCAGCGCC
ACAGGTCGCTTCGGCTTGACCTTCTCCGATGGCCGCTCGTTCACCGTGGCATTCCGCCACGCAGAAACGGCCATCGAAGCCGAGCCCGTGCTGGGCATCCCGGCCCGTGCCGCTACCGACTTCTATCGCCTGACCCTTCGATTCCTGGAGATTTGAAATGCCGATCCAATCCGGCGACGTGAAACTGCTGAAGTCCGCCGTGATGGCGGATGTGCCCGAGGGCGGTGGCGCGCCCACGGGCAACACCATTGCCGATGGCGTCTCGAACGCCATCTTTCCTGACATCTCCGAGCTGGATCGCGCCGGGGGTCGGGTCAACCTGCGCAAGTCCTTCGTGTCGGTGCAGACCGACGACACCGACACCTACTTCGGTGCCAACGTGATCGTGGCAGAGCCGCCGCAGGATGCGCGCGTCAGCGTCACGCTGTTCAGCACCGAGAAGACCTTCGACACCCGCGAGCAGGCGCAAGTCCGCATCGAGGCCTACCTCAACAAGGGCCCGGAGTGGGCTGGCTACCTGTTCGAGAACCACATCGCCGGTCAGCGGGTGATTCAGCTTTTCCAGCGCACCACCGACACCGTTCCCAATGTCGGCCAGACCTTGGTCTTGATCGAGAACGAGGGCCTGGGCACCCAGAAGGAGCAGTACATCCGGGCCACCTCGGTGTCCGTCGTCGAGCGCACGTTCACTTACGACGGCGACAAGGACTACAAGGCCAGCATCGTCACGGTCGACATCAGCGACGCACTGCGCTACGACTTCACCGGCTCGCCTGCAAGCCGCACGTTCACCCGGGCCGCGAACAGCACCAAGACGCGCGACACGGTCGTGGCGGACGCCGGAACCTACGTCGGCGTAGTACCGCTGACGCAGGCCGCCGCCGTCGGCGACTTCACGATCAAGGGCACCTCGATCTACACGCAGCTGGTGCCAAGCGCGCAAACCGAGACGCCCATTTCCTTCGTTCCTCCCTACGCGGCCGCCGGACTGCCGGTGCCGGGGGCCGTCGCGGTGAGCTACACAGCCAGCCACGCGTGGACGACCAGCATCAAATTCAATCTCCCGGGCGGTTGCTTGCCGGGGTCACTGACCATCGGCACGGACGGCATCACGATATTTGACGACGCGGGCCTGCTCAAGACCGCCAGCGGGACGGTCGGAACCATCGACTACGCCAACGGCATCCTGACCCTGAACTCGGGGACGATGTCGAACGCGAAGGCCATCACCTACACGCCCGCCGCGCAGATTCTGCGTGCTCCGCAAAGCTCGGAGATCCCGGTCACGCCCGAGTCGCGCAGCCAGTCCTACGTGGGCACGGTCAACCCGGTGCCGCAGCCCGGAACGCTGTCCATCAGCTACATGGCCCAAGGGCGCTGGTATGTGCTTTCCGACAGTGGCAACGGCTCGCTCAAGGGCCTGGACGCCAGCTACGGCGCGGGCACCTTCAACAGGAATACCGGAGCCTTCGTGGTCACGCTGGGTGCGTTGCCCGACGTGGGCAGTTCGCTCGTGCTGACCTGGAACGTGCCGACGCAGGAGACGCAGCAGCCATCCACCACCCTGAAGGCCACCCAGAGCCTGGCATTGAACCCACCTGCAGGGACGGCGGTGCAACCCGGGTCGCTCACCGTGTCCTGGGAGTACGGCGGCACCAAGACCTCAACGGCGGCCACGTCGGGCGTGCTGTCGGGTGCCGCCACGGGCAGTCTGAGCGTGGCGCAGAACCGCGTGGACTTCGCGCCCAATGTGCTGCCAGCGGTGGGCACGCAACTCACCGTGAGCTACGTCGCGGGCCCGAAGCAGGAGGACTCGTTTGCTCACCCCTCCCGCAATGGTGCGGGGACGCTGCCAGTCACCGCGACCTTGGGGGCCATCGAACCGGGCTCGCTCGAAGTCGAGTGGAACACGTTCACCGACGAGGCGGTTCTCGGTGCGTACACCTTCGCTCAATTGCAGGAGATGGGTATCGCCGTCTCGATCTGGCGCGACCCCACCC
AGATCGCCCGAGATGACGGGAACGGCGGTGTAGTGCTGAACGGGATCTCGATTGGCACCGTCAACTACGCAACCGGTCAGGTGACCTTCAATCCGGATGTCTCGATCCGTATCCCACGCCCGGTCTACACGGCAGTCGCCATCAACGGCACCGGTCGGTGGCGATTGAACTACGGCGGCATCGCCTACGTCGATGCGCCATCGCTGTACCCCAACGACGAATCCGGCTACGTCAAGCTGCGCTACAACAGCGCGGGCTCGACCAGCAACCAGACCGAGACGTTCCAGTTCCTACCGGCCTTCAAGCTGGTACCGGGGGTGAATGCCCAGGTGGTGACAGGCACGGTGCTTCTCTCCATCAGTGGCGCGCAGCCTTGGGGCGACAACGGCCAGGGCACCCTGCGCGAGTTCACCACCAGTGGCTGGGTCACGCGCGGCACGATTAACTACCTATCCGGGGACGTGGCGCTGACGTCCTGGACGGCGGGCACGAACAACGCGATCACACGAGCCAGTTGCGTGACCACGGTCGGCGAGAACATCTCCAGCGAGTTCGTGTTCCGAACTGGCGCGGCACCGCTTCGTCCTGGGTCGCTGTCGATCCAGTACGCCCGCGCGGTTGGTGGCACGCAAAACGTGACGGCCGGGATTGACGGCAAGATCGAGGCAACCGGCATCAGCGGCAGCGTCGACTACGAGACCGGTCTGGTGCGCGTTCGCTTCGGAACGATGGTCACGGCGGCCGGGAACGAGAGCCAGCCTTGGTACGCCGCCGACCGGGTGGGCACGGACGGCAAGATCTTCCGACCCGAGCCGGTGGCCGCATCCAGCGTGCGTTACAGCGCGGTCGCCTACAGCTATCTGCCGCTGGATGCTGATTTGCTTGGCATCGATCCGGTGCGCCTGCCCAGTGATGGGCGGGTGCCGATCTTCCGCCCCGGCGGCTTCGCCGTGGTGGGCCACACCGGCAAGATCACCTCCTCGGTCAGCAACGGCCAGACCATCAACTGCGCTCGGGTGCGCCTGTCGCGCGTGCGCGTCGTCGGCCACGACGGAGCGGTGATCCACACCGGGTACTCCACCGATCTGGAAGCGGGCACCGTCACCTTCATCAACGTGTCGGGCTACAGCCAGCCCGTGACCATCGAGCACCGGATCGAGGACATGGCCGTGGTGCGGGATGTGCAGATCAGCGGCGAGATCAGTTTCACGCGCGCCCTGACGCACGAATATCCGCTGGGGAGTCACGTCTCCAGCGCCCTGGTGGCCGGTGACCTGTTTGCCCGCGTGAATCTGGTGTTCGACCAGTCAACGTGGAACGGCGCGTGGTCAGATGCCTTGTCAGGCAGTTCCGCAACAGCAACGTTCAACAACACGCAGTACCCGATCCGCGTGACGAACCGGGGGGCACTGACCGAGCGTTGGATCGTGCGCCTGACCAACAGCACCTCGTTCGAAGTCATCGGCGAGAACGTCGGCGTGATCGCCACGGGCAACACCAGTGCGGATTGCGCGCCCAACAACCCGGCGACCGGCGTGCCGTACTTCCATCTGCCCGCACTTGGCTGGGGCAATGGCTGGGCCACCGGCAACGTGCTGCGCTTCAACACCATCGGCGCGCAGTTCCCGGTCTGGGTGGTGCGCACCGTTCAGCAGGGGCCGGAGTCCGTGCCCGACGACAACTTCACGTTGCTGATTCGCGGCGACGTGGACACCCCCTGATTTCGTAGACAGGAACCCATGAAATGATCGACCTGACCGTCAAATACTTCAACAGCGGCATGACCGGCGCGCCACAGATCTCCAACAACTGGGGCGATCTGGTGACGATGCTCGATGCCTGCCTCGTCAATGGCTTCGCGCTGAAAGCCATCGACACCTTGACCTTCGCCGATGGCATCGCCACAGCCACCATTTCCACCGGCCACGCCTATCGGCCTTTTCAGGTGGTCGAGATCGCTGGAGCCGAGCAGCCTGAGTACAACGGTTCATTCCGTGTGC
TGTCGACGACCACGACCGCCTTCACCTATGCGGTGACCGGAGCGCCGGTGTCGCCCGCGACGACGACCACTAACCTGAGCGCCAAAGTGGCTCCACTTGGGTGGGAGAAGCCGTTCGCGGGGACGAGCAAGGCCGCCTATCGCAGCAAAAACCCACAGTCGCCGCAGAACATCCTGCTGATCGACAACAGCCTCAAGACGCCCAACTACACGACGGGGTGGGCGAAGTGGGCCAACGTCGGGATCGTGGAAGACCTGTCCGACATCGACACCATCGTTGGCGCACAGGCTCCGTATGACCCGAACAACCCGACGCAGAACTGGAAACAGGTCACCGCTAGCCAGTGGGGTTGGTACAAATGGTTCCACGCACGTGGCCCCCAGTACGAAAGCAACGGCGACAGCGGCGGCGGAGGTCGCAACTGGGTGTTGATCGGTGACGACCGTCTGTTCTTCCTGTTCTGCACCAATGCAGCGGGCTACGGCTGGTATGGCCGCAACAGCTATTGCTTCGGCGATCTGATCAGCTTCAAACCCGGTGACAACTACGCGACGGTGCTGGCCGCCGACGACAACTACTCGGGGATGAGCAACTACTGGAGCTATCCAGGGCAGTTCAGCGGCTACGGGCTGGTTTCGTCCCTGGACTTCACCGGCAAGGTGCTGCTGCGCAATCACACCCAACTCGGCAATCCCGTCCGGTTCGGACTCACGTCGCTGAACACCAACAACGGCCAGCAGATCTGCGGTCGGGGCCCGATGCCGTTCCCGAATGGAGCCGACTACAGTTTGTGGCTGTTGCCCACCTACGTGCGGCAGGAGGACGGCCATATGCGCGGCATCCTGCCCGGGATGCTGTGGATGCCCCAGGACCGGCCCTATAGCGATCAGACCATCGTGGACAACGTGGTGGGTCAGGCGGGTAAGCGCTTCTTGCTGGTCAGGACGCAGTACAGCTCGGAAACCGAAGGCGCACAGATCGCGTTCGACATCACTGGCCCGTGGAGGTAAGCCATGAGCTACCCGCTGAGCGAGTCTTTCGCCACGGCTCCTGCGACCGGCTACACCGCCGTCCTCGGCGGAATGGCCGCGACACACAACAACGTGCAGCAGTCCATCGATATCTCGGCCCCCAATAGCCAGTCCATCCTGCGCTTCAACGAAACCGCCCACGGTGACTTCTGGTTCGAGGCGGATGTTGAGTTTCTGACCGACCCGAGCGCCCGCAAGCACATCGGCCTGTGGATGACCACCGGCAACGGTTCCGAGGGCTACCGGTTCGCACATATTGACGGTGCCTGGAGCGTGACACGCTGGAACAGCGGCTTTGGCGACGGCGCGGCAGTGACGGGCGGTGTCAACGATGGAGCGAAGCCGGTCGCGGGCGTTATAGACGTGGCCCCGACCTTCAACGTCGGCCAGCGGATGCCCCTGCGCTGTGAGGTCATCGTCGGAGCCTTTGACGCCAACGGCGTTCCGTGGGCGCGCCTGATCCAGTTCAAGGCCGGTGGGGTGCTGATGTTCCAGGTCGGGGATGCTGCCTACAGGGGCAAGCTGATCCCGGGCGTGTTCCTGTATGGGGCCACGGCGCGCGTCCACGCGATTGCGGGTGACACGCCGTCAGGTCTGCCCGCGTTTCCGGCAACCGTGGGCGTGAACGCCGCCGATGACCTGCTGCCGCTCGCGGGTGGGTCGACTTCGGTGCCCCCTGATCCGGCCGCCAACATCGCCGTCAACGCCGACTGCGACCTGATGCGCTTGAACAGCCCCAACTCTGAGCTGTGGAACCGGGGTGGTGGCTACGACTGGCACTTTCACGCGATTCCGAATGGCCGCAAGAACATCCACTTCAGCGGCCACGGCTTCATCGCCGGAACCGTCAAGGAGAAGGGCCAGCCCGACCAGCCCCTGGTGCGGCGGGTGCAACTGGTCAGCGAGAACACCCGCGTCCTGGTGGCCGAGACCTGGAGCGACACCACGGGCGCGTACCGGTTCGAGCTT
ATCGACCCGGCCCAGAGATACACCGTGGTCAGCTACGACCACAAGCAGATGTACCGCGCCGTGATCGCGGACAACCTTCATCCGGAGATGATGCCGTGACCGTTGCCATCACTGTCGAACACAACGAGGCGCGACTGGCGGGCACCCTGGCATTCCTGGATGCCGGTAGCAATCCGGCGCGTCTGCGCATCTACGGCGGGACACGACCCGCCAACCCGGCCACGACGCCGACCAGCGCGATGCTGGTCGAGATCAGGCTGACCAAACCCGCAGGCACGATTGCAGGTGGACTCTTGACGCTGACGCAGCAAGAAGACGGGCTGATCACGGCGACCGGCATCGCCACCTGGGCGCGGCTGGTCAACGGCAACGAAGTCACGGCCCTGGATCTGGACTGCAGCGGTACCGACGGCAGCGGTGACGTGAAGCTGGCCAGCACCAACCTCTATCTGGGCGGCGATGCCCGGATGGTGTCTGCGATCCTGGGGTAGTCCGTGCCTGCCGTTCTCAACGAGGTGACCCTGGTCGCCGCGTTGCCTGCGCCCACCGCCAGCGTGGCGGTCGGGCCTCCGCTGGTCGATCTGCTGTTTGACCAACCGGCTGCCACCGACGCCAACTTGGTGTTCGGGGCCAACTACATCGCGCCGCGCGACGACGTCGTGGTGCTGGCCAGCCTGCCGTTGCCGGTCGTGGCGATCAAGTTCATCCCGCCAGCGCGGGCCGCACTGCTGGCCGAGCTACCTGCATTGACGGTGACCACGCTGTTGCTGCGCCCGAGCGTCCCCTTGGACGTGACCGGTGCAAGTCTTCCTGGTGTCGTGTTCTCCGGCGAGGTCAGGTACTACTCGCGCACGCAGCGACCGACAGTCGGCCAGACCGCACACGCTTGGCAGGTGGCAGCGCAGACGGAAGATGGTTCGACACAGGGCCAGCAGGACGCTGCCGCTACACCCGCAGGCTGGGACACGTTCTGGCGACGCACCTTGGGTGTTCCTCAAGGCATCGAGCACAGGTTGCCGCCGGTGCTGGCGGCAGCGCCCGAGCAACGAGGCGCTCGCCACCAGGATGCGACCCGGCTGCAGGATTCGACGTGGTTTGCGCACCAGGACGCCACGCGTTTTGCGGCGACCCGACAAGGTCTGTTCCAGAACGCAGGCCCGTTGCGGGACACCACGCGATTTCGGCATCAGGACGGCGACCGCACCAAACGCGCGGGGCGGGTGAGCTTTTGGCAAATCGCGCGTCTGCTCACCGAGCGCCAGGGGAGTGATTTTCAGATTGCCAGCCCGTCACTCAAGGGCTGGAGTGTCCGGTATCAGGACGCCGTGCCGCCACCGCTGGGGATCAGCGTCTGGGTGGTTCCACAACCGCCAGCGCCGATACCTTGCTACACGCCGAGCGCGCATCTGCTGTTCGCCGCTTTGGCCCCAGCGGACAGCCACTTGCTGTTCGTCTGTGAAAACCACATCAACCCACCGCCTCCCGATGGGGAGCCGGTGGTCGTTCCTGTTCGGAGGGTCTATTTCGTGATCAACAACGTGACCCTGTACCGCGTGTCCGATGGCGCGCCGGTGCCGGTGTTCAACCTTTCGCTGTCGCTCGATGCATCGTCCTGGGCGTGGGGCTTCGATGCGGTGCTCCCTGCGAAAGCCGAGGCGCTGGTCGCGGGCAGCGCTTCCGGGCCCGTCGAACTCGTGGCCAGCGTCAACGGCACCCCGTTTCGCGTGCTGGCCGAGAGCATCAGCCGCGAGCGCATCTTTGGTGACGCCAGCATCCGCATCTCCGGACGGGGGCGCAACGCCGTTCTGGCCGCGCCCTACGCGCCGGTGATGACGTTCTCGAATACCGAAGGCCGCACTGCTCGGCAGTTGATGGACGATGTGCTCACGGTCAATGGCATCCCGCTGGGCTGGGCGGTCGATTGGGGCCTGACGGACTGGAACGTCCCCGCCGGTGCGTTCGCGCAGCAGGGGTCGTGGATCGACGCACTGACCGCCATT
GCCGGTGCTGCAGGTGGCTACTTGATTCCGCATCCCTCGGCCCAGAGCATCCGCGTGCGTCACCGCTACCCGGTCGCGCCTTGGGAATGGAGCACGGTCACGCCCGACTTCGTGTTGCCCGTCGATGCTGTCGCCCGCGAGTCGCTGCGCTGGTTGGAAAAGCCTGCGTACAACCGCGTGTTCGTTTCCGGGCAGGACGTAGGCGTGCTCGGGCAGGTGACCCGGGCCGGGACTGCCGGAGAAGTGCTGGCACCGATGGTCGTCGACCCGCTGATCACCGAGGCGGCCGCCGCGCGGCAGCGTGGCGTAGCCGTGCTCGCCGACACGGGTCACCAGCTCGAGGTCAGCCTGCGCCTGCCGGTGCTCGCCGAGACCGGGATCATCGAGCCCGGTGCGTTCGTGGAGTACCAGGACGGCAGCGTCACGCGATTGGGCATTGTCCGCGCGACCCAAGTGGAAGCCGGGTTGCCCGAGGTCTGGCAGACGCTGGGAGTGCAGGCCTATGCGTAACCTCTACGAGCAGTTTCGCCAACTGATCCCCGACCCGCCGCTGCAGGCGGGCACAGTGAGCGACGTCGGCTCTGGCGTGGTCACGGTCGCATTGCCCGGTGGCGGCCGAATCAAGGCGAGGGGCTCTGCGGCCCTTGGCCAGAAGGTGTTCGTGCGCGACGACGCCATCGAAGGCATTGCGCCCAGCCTGACGCTGGAAATCATCGAGATCTGAAACCCAACTGATTCAACCCTGAGACCCGCCCTGATGCTCACGCATCGGGCGGGTTTCGTATTTCTGGAGAAAGCAAATGACCGAACCTGAACAACAACCGGCGCTCGTCGAGAACATGCTCCTGCTGCGCAAGGAGGATTTCGACGATCTGCTCGACCGTGCCGCCGAACGTGGTGCTGAACGCTGCCTCGCCCATCTTGGACTGGAGAACGGCCACGCTGCCCGCGACATCCGGGAGCTGCGCGATCTGCTGGAAGCTTGGCGCGACGCCCGCCGCACGGCTTGGCAGACGACCATCAAGGTGGCCACCACAGGCATCCTGGCCGCGTTGCTGGTCGGTGCCGCCATCAAGCTCAAGCTGATGGGAGGCCCCCAATGATCGAAACCCTCCTCGGTGGCCTCCTGGGCGGGGTCTTCCGTCTTGCGCCCGAAATCCTCAAGTGGCTGGACCGCAAGGGCGAACGCGGCCACGAGCTGGCCATGCAGGACAAGGCGCTGGAGTTCGAGAAAATTCGCGGCGCGCAGCGGATGGCCGAGATCGGTGCGAGCGCCGAAGCCGCCTGGAACGTCGGTGCCGTCGATGCGCTGCGTGAGGCCGTCCGCACCCAAGGTGAGAAGACCGGTGTGCGCTGGGCCGATGCGTTGTCTATCAGCGTGCGACCGGTAATCACCTACTGGTTCATGGCGCTGTACTGTGCGGCCAAGACGGCTGCTTTCGCGGCCGCCGTCACCGCTGGCTCTGGCTGGGGCACGGCCATCCTGCACGCATGGACGGAAGCCGATCAGGCGCTGTGGGCCGGGGTTCTGAACTTCTGGTTCCTCGGGCGCGTGTTCGACCGGGTGCGCTCGTGACCGGGGTGCCGAAAACGGCCATCGAGCTGGCCAAGCGCTTTGAGGGGTTCCACCGGGTGCCGAGGATCGATCCGGGCCGCGCGCATCCGTACATCTGCCCAGCGGGCTACTGGACGATTGGCTACGGCCATCTGTGCGAGTCGACGCACCCGCCGATCACGGAGTCCGAGGCCGAGGTCTATCTTACGCACGACCTGCAAACGGCGCTCGCCGCAACGCTGCGCTACTGCCCGGTGCTCGCAACCGAACCCGAAGGGCGACTTTCGGCCATTGTGGATTTCACCTTCAACCTTGGCGCGGGGCGGCTGCAGACATCGACGCTTCGGCGACGGATCAACCAGCGGGATTGGGCGGTATCAGCTCAGGAGCTCTGCCGATGGATCTATGGTGGCGGAAAGGTCTTGCCAGGTTTGGTGGCAAGGCG
AAAGGTCGAGGCGGCGCTGATGTCTGGCCTTCACTACTGAAGCGCTGGCTCACACAAGCGCAGCAATTGCGATCGAATCTCGCCGGGCGATGCTGTCAGATCCACCGTCGCAAATCGGATGCCATGCCCCTGGATGACAACCGTCTCATCAACGGCGTCGCCGACGGAAGGGTGAAGCAACAGCCCACTGGCGTCATCCGCAAGCGGATCGCCGCGCCCGACCTGGGATCGCAGGTAGGCGTAGATCTGGTACACGTACCCGCTGCGCAGCGTCTCCTCTCGATACCATCCGCTCGTCACGATCGACGTGAACTTGGTGTCGATGACGATTCGCCGGCCAGAAGGCGCATGGTCGAGTACCACGTCGGTTCGCATCGTCGGCAAGATCTTGTCGATTCCCGATGTCTTCTGTTCGATTTGCCAGCCCAGCGTACCGCCACAGCGAACCCGCCATCCCTGCGGTTGCAGTACCACGTCATAGAAGCCGCCCACGGCCTTCTCGAACAGTCGCCGGACCCAGGTGACCTCGCGTTCCGGCAGAGTCAACACGTTCGTCCCTGCCACCTCAGTTGGCAAGGCAAGATCAAAAGCCAGCTTCGCCGCAGCCACCATGAATCTGTCATCAGCGTCGTTGCGGCCAAAGCGATCTGTGCTCATCTGCGCGCGAGTGGGTACATCGCCGGAAACACCCATGGCCTTCATGCCGCTGGCAAGTGAGCGGCAGCGATGAGACACGTCCTTCCTTTGCACAATTCGGGCGATGGTCTCTAGTGCCGCGCGCACAAAGCGATTGCGTGGCGTGTCGACGGTCAGCTCATCAAATCGACAGGCCACTAAGCCACGATCCAGCAATCGATGCCGCTCCGTATTCAGGACATCAATTCGCCCGCGCACGCGATTTAGGACAGCATCCCGAGATCGGTAGCCCAGGTTGAGGCGGCGGCGCTGCCTGACTTCGACTGCGTGGGCCAGAATCTCGGCGACCAGATCGGGGAGATCGTCAGGGTTGTCCTCCAGGCCAACCTTGCCGATGCCGCGAGTGCGGAACAATTCCGACGCGTACAGCATCAGCAGCCACAGATTGCGCACCGGAATACGTCCGATGTAGCCATTTGCGTCGACCGACGCGCTCTCGACCTGCTCCGCGACCACACCCATTACCAGCCCTGAAGCAGCCGCGCACACGCCTTCTGCGCTTCGTCAGGGGCGTCAAACCAATACTCATCGAGCAGTGGCCCGATCTCAGTCTCGACCACCTGCTGGAACCACTTTTTCGTGTCCCCGGCCTCCAGCCTATGGGCGGGCGTCACGTAGCTATGGCCGATCCGGAACTGCTTTCCAAGGCGAGCGTCAGCCGCTATCTGGTCATTTAGCTCTGCAATCCGGTGCTCAATATCCGCGACCAGAGCAGGATCGACAGCACACTCCTTGACAGCCCAATCCCGCCATGCGGTGCCAAGCCTTGGCTCGAGCCCCACGAAGGCGAAACGCCGACGCAACGCGAGATCGACCAGGGCCAACGACCTGTCGGCGATATTCATCGTGCCGACGACGTAGAGGTTTTCCGGAATGTGGACGGGGCGACGTTTGCCATCCGCGTCCGGGTAGCACAGTTCCAGTGCCTCATTGGGCGTGCGCTTTCCAGCTTCAAGCAGCGTCAGGAGTTCGCCGAAGATCTGCGCCGGGTTGCCACGGTTGATCTCCTCGATCACCACGACGAACTTCGACGATGGGTCCTTCGACGCTGCCTTGATCGCTTCCATGAATACGCCGTCGGCTAGCGACAATTTCCCCTCGCCGGTCGGCCGCCATCCTCTGACGAAATCCTCGTAGGACAGGTTGGGGTGAAACTGCACCGCACGGACTTTGCTTTCGTCCTTTTGACCCATCAGCGCAAACGCCAGGCGCTTCGCAAGCCACGTCTTGCCGGTGCCTGGTGGCCCCTGAAGGATCAGGTTCTTCTTGGTGCGAAGGCGGTCCAGAAGCCGATCGATCTCGTT
GCGTTCAAGGAAGCATCCGTCCTTGAGAATGTCGTCGACCGAGTACGGCACGATCGGCACGGCAACATGAACGTCCTCTGGTGCAGTTGCCTCGGTAGCATCGTCACCATCATCGACGTCACCAGCGTCGTCTTCGCCGACCGGAGACTTCTCATCGGTGGGGTCCTTGTACAGCCATGCTTCCAGCGAAAGCTCAGGGTAGGAATGGACCGGGTAAGCGGTCTCCTGGAAGCGCGGCTCCAATACATCCATCACGGCAAGGTAGTCTGCCGAATTGCAGCGCCTCTTCGGGCCGTGCATGCCAATCGGCACACCGAGCTTCTTGCTGACATAGAGCTGAGAGTTGTGATCAAGGCTCAGGAACGCCCAAGGCCTGATCCAATACAGGCCGAACGTCAGATTCCATGCAACGCCTCGCCGACCGTTCGCGCTGTCGAAAGCCTTGGCGAACTCCTCGCGGGCAAGGTCATCGTCGGTATCTGCGTACGCGATACCCGCTGCGAAGACCCCCCAAAGCGCGTCAATGTGGTCGGTGGCGCGATTGATCTCGAATGGAAAGTACCAGGACTTCAAGTTGTTCAGCAGCGGGATGCCTTCGAACGTCTCTGGAACCGGCTCGTCGACGCCCAGGAATTTTGCCAACTCGGTCGCGATGATCTTGCGGTTGGAGTCCTTGATGCCCCGATTGAACAGACCCATCGTCGTGAACGGGCAGATGTCCTTTACGAAGCCCGTAGTGCCGTCTGCATACTTATCCTCTGCCAGATGGCCGAGTCCGTCGACGCGAACGGAGATCTCCCGGATTCCCTCCACAAGGGCTGCCCTGTTGGCGCGATAGGTGAGCAGCTTGTCCGCGATTGCTTCATAGAACTTTGTCCAGCCGAAGCGATGTTTGTCTGCTGCCACCGTCCCGAATCGCTCCCGCCAGTAGGGAGCGTTTCGGAAACGCTCAACGTCCTGCGGCTTGTTGTGGAAGGCGAAGCTGATCAAACCATCGGTCATCCATTCACCAGGCAAAACGCGCCACACCGTTCCCCGGTGGGTATAGAAATACCACTCACGCACCGGTTCGATTTTTGTCCAGTCAACCTTAACTCGCTGACCATCATTCAGGTTCTCAGTGATCGTACCAATTGCCTTGATCGCCATCACCGAAACCGCTTGTCCACGGCTGTCAAAGGGTAGCCCATGCTTGCGCGTGTAGGACGACTTGATGGCGATCTGATCTCCTGGTCGCATCGATCGCACCACGTCGAGATGCTTGTCCTCGTAGCCGTTCTCCCAAATCCCCTCGGACAGAAAGCGTGGCAACTGATCGTCCGTGCCCCCGTAGCTTGCGCCAACAAACCAGCTCGCCTGTGCGCTCGAGTTCTCTGTTTTTATGTTCATGGATGTCCTCAGATAGTGCTGCGCAAAGCGACGAACTGGTTACTCGGCCCCTCCGACCGTAGGTCGAGGTATGGGGAGTTCGAGACGACGTAGGCCAAAGGTCTCGTTAAACGGAATGGAAACAGGACAGGCAGTGATTGCAGGCTATGAACTGAAACAGGCTTGCCGACGTAGCGCACAGCCGCCTCGATCAGCCATACCGTGAGTTCGTCGCTGTCGATAGGCGTCGCTGCAAGACGGATGACGCGCTTGCCTTTCTCGACACGCTCTACCGCGCCCCAGCTTGCTTGTGTCTGGATGACCATATTGGTCATGCGTCGCGTGCCTTCGCGTTCGCCGTAGATCTCGCTCATACGGCGATGCACTTCGGCAGCTGCGCAGTCGTCTTGGATGGCCGACAAGCGACCCACCAACTCAGACACTTTCCCGAAAAACGGATAGCAGGCTATGGCCACGCCCCAGCACAGTACCGGAATTGGTGTGTCGGGCTGGCTTTTGTAAAGAGCCGCCGCCCGGTCGGCGAAGTCCACCAGCTCGGCGCGTGGCTCCAACCACAGCCGGTTCAAGACGGTTCGTGTTTTTTTCTTGGCTTCCACGCCAAGTTCGGCTGC
GTCGAGCAGTGCATTCAGATCATCTAGCCCAGCTGTACCCGCACGAACCCGCAGTGCCGCAGCCGCCCAATCAAGCTGAATGAACCGATCGAACCCAATCTGAGGGGCTGATGTATTCATTCGTTTCCAATCACATATTTAACTTTCACAAACGGCACGATCAGCTCTTCTACCGAGATGCCGCCGTGGACAACCACTTGCTCGCCATCGGGAACAAAAGCCGTTCGGCCACCCGCGAATAAGGGCATATAGCCCGCCGGTAGTCCAGCGATGTCCAAGTGCACTGAGTTGGCATTTGCCGCTGCGGATTTAGCCAACAACGATTCACTGCGATACACGCGCACCCGTTCGCCACGGGCTTCTGGAACATCCCCTTCAGATGGACGTCCCACGCCAACAGCTTCAACGTTGCCGTGATCCGCTGTCAGGTAAATATGAAAGCCTCTGTCGAGCAGCATCGCAAACAAGCGATCGACGAAGCCTGTTTTCAACCAGTTCGCGATCCACAAGGCCACATCTTGTTTGGAGCGTTCCTTGTGCAGTCGGTCATCCACCTCGTCGACCACCAACCCGACCACCTTGGGCCGACGATCGTCCAGCGCTGCTTGCAAGGCATCCAGCTGCTCAATCTGCCGCAACGAGCGCTGGTAGTAGATTTCGCCCGGCTTGACGCCTTGCTCCTGCCAGTAGGCCTTCCACAGGTACTCTTCTTTGTTGGTATGGCCGATCGACTCTTCGAATTCCCGCGGTTTACGGCCAGAGAACAGGGCCTGCCGCGACACCGAGGTCACGGTGGGGAGCCAAGCGAAGGAAGTACCTTCATCAAACGCAAAGCGCTTCGTCGCCTCGACTAACCGCTCCCGAATCTGTACCCACTGGTCCAAGGCTAAGCCATCAAAAACCAGGAGTGCGATCTTGTCGGCCCCGAGCGCTTTCCGGCGAGAGCTCAGATAGTCGGGAATCCGATGCACCATCACCGGCCCGTTGTGGAACGAGAGCGTGCTCAGATCCGCATAGTGCTTGGCAGCCACCCACGCATGTAGCTGCGCATCCGACTGTTCTTGCAGTGTTTTCACGAGAGCCTGCACCTCTGCCAAACCGTCTGTGCCATAGGCATTGCCCAGATCGTGCACGCGGGCGAGAGTTTCGCCGTACTGCTTGGCAAACTCGCTCCAGGCCTTGTGCGTAGCCGTCGCGCTTGGAATCGCGTCAATCAGCTTGGCAATTCCCTTTTTCACAAAAGCGTTGCGCGCTTGCGGGTCTTGCACAATCCCGGCCTTGATCCAGCCTGGCGCGTCAGCTGGCACAACCTCAACGACCAGCGGGTGCAAGGAGCCATTCAAGAACATCGAATCAACGATGGATTGCACATCCGAGTGCGCAAACGGAATATCGACTTTGGCCACGTAATCGGGGGGTGGTGGCTCGCCAATGCGTGAGCCTTCCATGCCCAAGTTCGCCAGGTAGCGATGCCAAGCCTCCTGCACCACGCGCAGCAACGCACTCTTCGACGACAGCCACGCGGCCACCGGAAGGCCTGCAAGCAGTCCCTTGCTCTGGATGATGCTGGCTGCGTGCTCGGCAAAGACCAGGGGCAGCGCACGGTTAGCAAAGTGCATCCGCAGCACATCACGCCAGAAATCACTTTCCGTGCGAATGCTGGAGCGCGGTGCCAGGTGGTAGACCCGCTCCAGTATGAAATCTTTGGACTCGTTCTCGCCGCGAATACCCTGCAGCTCGGTGTCGTGCGCCTCCAGCAAGGCGGCAAAGTGCTCGGGTTCAAGCTGCTTGACCACGTTGTAAGCCAAACGCGGAAACAACTGTGCCAAGCCCAGACTTACGACACGACCATAGTGCCCAAGGTCCCAAGGCAGCTCATTGGGATCTGCACCGCGCCAATGCACAACCACAGCGGGCGTGGGGCCGGGCTCGCCTCGGTCCCAGGCGGCGCGATAGCGCTCCTCAAACTCTGTACGGAACACAAAGGGGTCTTCGTACAA
CAGCACCTCGAAGCCACGGCTGCGCAGTTCGGCCAGCAAACGCTCGTCCAGCAAGACGTCGTCCGGATCGCAGGCCACCCACAGCCGGTCGAGATCAGCTGTGAAGCGACTGAGAATTCGTTCAATCCACTGGCTCATGTGCTTTGTGTCTCCCTCACGTTCTCGCCAGGCTGAACCTCGGCGGAGACGCGTACCATCATTACCGCGTTCAGATCCGGCACACTGGCCGCAGCCTCGGCCAAGGCGGCCAGTCTGGCGTCGTGTTCTTGTTGCAGACGCTTGCGACGATGCTCGCGGACAGCAGGCAGCCCGATGCGGCCGATAGCCTGTTGCCTTGCTTCAAAGGCGTAGTCAGCACGCTCCCGCTCCTCTTGCAGTCGGGTCCGATGCGCCTCCAGCATCTCGGTAAAGATCCGCTCGCCTTGGGCTTTTGCCGCCGTGAGTGCTGCGTCAAACCATTTCACAGCCTCTTCGGTGCCGCTGACCCCGTGTACGACCACTGTTTCGGTCAGCAGCAAATCCCACACGCGTTTGGCCGTCGGCACAAAAGCGCGGCCCTCTTCGTTGGTAAAAACGGGCAGGTAACGCTTGCGGTTCAAGCCTTCCGCCGCAAGGCTGATCTCCCAGAGGGACCACACTCCGCTCACAGAATCAGGCAGGCCCGTCACCCGTATCACTGGCAAGGGTTGGCCTGCCACAAAGCGGGGCAACTCGCTGATCACTGCGCGCGCGCGCGGGTCTTCCAGCGTCACCCACTCAATGTCATGGTTCTCGTCAGCGGTACGCGCGTCAAAGCAAACCTGCGCTGCTTCGCTGCCATCCGCCCAAGTCACACGCCACGCCTTGCCCACCTTGGTTGCTGCACCGCCGCGTGCGGCCAAGCCAGAGGTGATGGCTCGCTCCAGCCAGAACTGTGCCGGGTGGTCGCGCCATTTGCGGGCATCGTCCGCTTCCAGCTCGTGCGCGTCCGAGAGCAATTCGCTGCTCTTTTTTGACTCGGCCAAGGTCTCGCGCAACTGCGAAACCACCGCATCGCATTCCTGCTCTATCGAAGCCGGGTTCTGCAAACCATGCACGAACAGCTCTTCGAACAAAGGCTCTGCTTCCGCCGAGTCCATCACGTCGGACGCCTTGTCCACGCCGAACTGTTGGGCGATCACTTCCAGCTTTTCTTCCAGCACCTGGCGAACCCGGTGCTCCACGGTGTCTTCCAGCACAAAGTTGATGGCGCGCACCACGTGCCGCTGGCCAATACGGTCGACGCGGCCGATGCGCTGTTCAATGCGCATCGGGTTCCAAGGCATGTCGAAATTGACGATGACGTGGCAGAACTGCAGGTTCAAACCTTCGCCGCCAGCGTCTGTCGAGATCAGTACGCGCACGTCTTGAGAGAACGCCTTTTGCGCCTTGCTGCGTGCATCCAGATCCATGCCGCCGTTAAGGGTGGCCACCGAAAAGCCACGGCTTTCCAGGTAATTGGCCAGCATGGCTTGGGTCGGCACAAACTCGGTGAAGAGCAGCACCTTCAGGGCAGGGTCGTTTTCTTCCTGCTGCAGTTTGTAGATCAGCTCCAGCAAGGCTTCTGCCTTGGCATCGGTGCCCGAGGCTTCGGTCTCGCGGGCCAGAGCGAGCAGTATTTCCACTTCGGACTTTTCCAGCTCCCAGCCGGTGGCCTGCATGGCCAAATCCACCTGCGACTGGCCGTCCAGGTCAGCCCAGTCTTCCTCGCTGGTGTTCTCAAACAACGAGGGTTGTGGCTGGGGCTCCTCCAGCAATGCCAGCCGCTTTTCCAGCGTCGTGCGAATAGCGGCAGTGCTGGACGTCACTAAACGCTGCATCAGAATCATCAAAAACCCGATATGACGCTGCTTGGCGGCCATTGCCTGGTTGTAGCCATGGCGCACGTAGTCGGTCACGGCCTCATACAAGCGCCGCTGCGCGTTGTGGCGCGCCTGCCAGGCCACGGCCTGCAGCCGGGTGACTCGCGGCTTGAAGAGTGG
CTGACCATCGGCGTTGATCGACAGCCGCTTTTCTGTACGAATCACAAACGGCCGCACGCGATCGCGGTTCACGCTGCTTTCGTCAGGGAAGGCGTCGCGGTCCAGCAACTGCATCAAGCGCAGGAACTGGTCGGTCTTTCCCTGGTGCGGCGTGGCTGACAGCAACAGGAGGTAGGGCGATGCTTCTGCCAATGCTGCACCCAGCTTGTAGCGCGCCACCTGTTCGGTGCTGCCGCCCATACGGTGGGCTTCATCGATGATGACCAAATCCCAAGAGGCCGAGATCAGGTCTTCAAAGCGTTCGCGGTTGTAGTTGTTGAGCTGCTCCAGACTCCAGCCGCGCCGACTCTCCATCGGTTTGACCGAATCCAGTGAGCAGATCACCTGGTCATGCATACGCCACAGGTTGTCTTCATCCCCCTGGTTGCCACTGCGCCATTGGCGAAATGCAGCCAACTCAGAGGGCTCGATGAACTGCAGATGCTCACCGAAATGCAAACGCATTTCCGCCTGCCACTGGCGCACCAACCCCTTAGGCGCGACCACCAGCACTCTTTTCACCCGGCCGCGTAGCTTCAATTCCCGCAACACCAGCCCGGCTTCGATGGTCTTGCCCAAGCCCACCTCATCTGCCAGCAGGTAACGAATACGGTCGCGGCTGATGGCGCGATTCAGTGCGTACAACTGATGCGGCAGTGGCACCACGCTGGACTGGATGGGCGCGAGCAACAAGTTGTCTTCCAGCGCATCCAGCAGCTTGGCCGCCGCCGTGGTGTGCAGGATTTCCTCCACCGTAGGGCGAACGCTGTCCAACGGCGCAAGATCTGAGGCACGTGCCCGCACCACTGCGTCTTTGGCTGGCAGCCAGACGCGGTAAGCACTCTCACCCCATACCTCTTGCCGATCGATGACGCGACAAGACGCAGCCTGTCGCGTCAGCCAGCACCAATCGCCAACGTTGAAGCCGCCGCCCGCCAC
Protein sequences of DBSCAN-SWA_5 >NZ_AP021884|1977054:2023954|1977329_1977812_+|WP_024973177.1|DBSCAN-SWA MSDLTIFPVDIAEMSVSQLAALPPEQKCEVDKNLDAAIDWLKKARTKFDAALEQCYGEQARVALRESGRDFGTAHISDGPLHIKFELPKKVSWNQKQLGEIAERIVASGEKVEGYLDVKLSVSESRYINWPPALQQQFAAARTVDSGKPSFTLSTDGGEA >NZ_AP021884|1977054:2023954|1988949_1989474_-|WP_147073289.1|DBSCAN-SWA MTTTQLTPAQHAILAYALEHTDGKIDWFPDNIKGGARKKVLDGLFNRALITSDGTHWFVAAEGYDAMGRARPTPAPVAADPELDAAVTAAEAAWAQEKAAAKPRTRENSKQATVIQMLQRPEGATVQQICETTGWQAHTVRGTFAGAFKKKLGLTIVSDKAQGSERVYRIAAEA >NZ_AP021884|1977054:2023954|2013478_2013700_+|WP_147073248.1|DBSCAN-SWA MRNLYEQFRQLIPDPPLQAGTVSDVGSGVVTVALPGGGRIKARGSAALGQKVFVRDDAIEGIAPSLTLEIIEI >NZ_AP021884|1977054:2023954|1985452_1986862_+|WP_147073341.1|DBSCAN-SWA MNTLNVEYRKVEALIPYARNPRTHTDEQVAKIAASIVEYGWTNPVLVDGDNGIIAGHGRLAAARKLGLDQVPVIELAHLSPTQKRAYVISDNRLALDAGWNEEMLALEMAELSEAGYDLALTGFEDAEIEALLADEVASDAADQEPDADEPDDGDDVPDSPVVPVSRTGDFWAIGTHRLICGDATDPTVVATLMQGDAARLCFTSPPYGNQRDYTSGGITDWDGLMRGVFAKVPMDDDGQVLVNLGLIHRDNEVIPYWDAWLGWMRTQGWRRFAWYVWDQGPGMPGDWAGRFAPSFEFVFHFNRSSRKPNKIVPCKHAGQESHLRADGSSTAMRGKDGEVGGWTHKGQPTQDTRIPDSVIRVMRHKGKIGQDIDHPAVFPVALPEFVIEAYTDAGDIVFEPFGGSGTTMLAAQRKGRVCRCVEIAPEYVDVAIKRFQQNHPGVPVTLLATGQSFDDVVNERQATTEVEQ >NZ_AP021884|1977054:2023954|2014548_2015016_+|WP_170227448.1|DBSCAN-SWA MTGVPKTAIELAKRFEGFHRVPRIDPGRAHPYICPAGYWTIGYGHLCESTHPPITESEAEVYLTHDLQTALAATLRYCPVLATEPEGRLSAIVDFTFNLGAGRLQTSTLRRRINQRDWAVSAQELCRWIYGGGKVLPGLVARRKVEAALMSGLHY >NZ_AP021884|1977054:2023954|2016133_2018365_-|WP_147073239.1|DBSCAN-SWA 
MNIKTENSSAQASWFVGASYGGTDDQLPRFLSEGIWENGYEDKHLDVVRSMRPGDQIAIKSSYTRKHGLPFDSRGQAVSVMAIKAIGTITENLNDGQRVKVDWTKIEPVREWYFYTHRGTVWRVLPGEWMTDGLISFAFHNKPQDVERFRNAPYWRERFGTVAADKHRFGWTKFYEAIADKLLTYRANRAALVEGIREISVRVDGLGHLAEDKYADGTTGFVKDICPFTTMGLFNRGIKDSNRKIIATELAKFLGVDEPVPETFEGIPLLNNLKSWYFPFEINRATDHIDALWGVFAAGIAYADTDDDLAREEFAKAFDSANGRRGVAWNLTFGLYWIRPWAFLSLDHNSQLYVSKKLGVPIGMHGPKRRCNSADYLAVMDVLEPRFQETAYPVHSYPELSLEAWLYKDPTDEKSPVGEDDAGDVDDGDDATEATAPEDVHVAVPIVPYSVDDILKDGCFLERNEIDRLLDRLRTKKNLILQGPPGTGKTWLAKRLAFALMGQKDESKVRAVQFHPNLSYEDFVRGWRPTGEGKLSLADGVFMEAIKAASKDPSSKFVVVIEEINRGNPAQIFGELLTLLEAGKRTPNEALELCYPDADGKRRPVHIPENLYVVGTMNIADRSLALVDLALRRRFAFVGLEPRLGTAWRDWAVKECAVDPALVADIEHRIAELNDQIAADARLGKQFRIGHSYVTPAHRLEAGDTKKWFQQVVETEIGPLLDEYWFDAPDEAQKACARLLQGW >NZ_AP021884|1977054:2023954|1984052_1984517_+|WP_147073301.1|DBSCAN-SWA MKTTILALDLGTHTGWALQHLDGTITSGTEHFKPQRFEGGGMRFLRFKRWLNELLSVSNHINAVFFEEVRRHAGVDAAHAYGGFMGHLTAWCEHHNIPYQGVPVGTIKKHATGKGNASKDEMITSVRERGHTPVDDNEADALALLHWAVETQEV >NZ_AP021884|1977054:2023954|2021110_2023954_-|WP_147073232.1|DBSCAN-SWA MAGGGFNVGDWCWLTRQAASCRVIDRQEVWGESAYRVWLPAKDAVVRARASDLAPLDSVRPTVEEILHTTAAAKLLDALEDNLLLAPIQSSVVPLPHQLYALNRAISRDRIRYLLADEVGLGKTIEAGLVLRELKLRGRVKRVLVVAPKGLVRQWQAEMRLHFGEHLQFIEPSELAAFRQWRSGNQGDEDNLWRMHDQVICSLDSVKPMESRRGWSLEQLNNYNRERFEDLISASWDLVIIDEAHRMGGSTEQVARYKLGAALAEASPYLLLLSATPHQGKTDQFLRLMQLLDRDAFPDESSVNRDRVRPFVIRTEKRLSINADGQPLFKPRVTRLQAVAWQARHNAQRRLYEAVTDYVRHGYNQAMAAKQRHIGFLMILMQRLVTSSTAAIRTTLEKRLALLEEPQPQPSLFENTSEEDWADLDGQSQVDLAMQATGWELEKSEVEILLALARETEASGTDAKAEALLELIYKLQQEENDPALKVLLFTEFVPTQAMLANYLESRGFSVATLNGGMDLDARSKAQKAFSQDVRVLISTDAGGEGLNLQFCHVIVNFDMPWNPMRIEQRIGRVDRIGQRHVVRAINFVLEDTVEHRVRQVLEEKLEVIAQQFGVDKASDVMDSAEAEPLFEELFVHGLQNPASIEQECDAVVSQLRETLAESKKSSELLSDAHELEADDARKWRDHPAQFWLERAITSGLAARGGAATKVGKAWRVTWADGSEAAQVCFDARTADENHDIEWVTLEDPRARAVISELPRFVAGQPLPVIRVTGLPDSVSGVWSLWEISLAAEGLNRKRYLPVFTNEEGRAFVPTAKRVWDLLLTETVVVHGVSGTEEAVKWFDAALTAAKAQGERIFTEMLEAHRTRLQEERERADYAFEARQQAIGRIGLPAVREHRRKRLQQEHDARLAALAEAAASVPDLNAVMMVRVSAEVQPGENVRETQST 
>NZ_AP021884|1977054:2023954|1988560_1988920_-|WP_147073291.1|DBSCAN-SWA MSTMTITIERTPRTLQFGDTTFQVEELSVRLPFARKPADLDEVGGQGQTKVYVTETKELTVDEFDAFARSLLVSRDWLRGKGGGTGDGYLCVEVTAPGRPYLYVNPEGGDYARYVARLG >NZ_AP021884|1977054:2023954|1981381_1981636_+|WP_147073305.1|DBSCAN-SWA MTDNNTPTTGIEPMIDAKQAAAALRLPYYWFADHAMRTKYRIPHYLMGGLVRYRLSELSAWATRTTAVQGRDSQDADAPVEGAE >NZ_AP021884|1977054:2023954|1980647_1981385_+|WP_147073307.1|DBSCAN-SWA MMDFNSTSSISGQITALVDAGMQRARAQQSERQYLGASRLGAACERALQFEYAKAPVDHGRDTPGRMLRIFERGHVMEDCMVAWLRDAGFELRTRRADGEQFGFSVADGRLQGHIDGVIVDGPEGFAYPALWENKCLGMKSWRELEKNRLAVAKPVYAAQVAIYQAYLELHEHPAIFTALNADTMEIYTEAVPFDAALAQRMSDRAVKVITATESADLLPRAFNDPTHFECRMCAWQDRCWRTQA >NZ_AP021884|1977054:2023954|2005133_2008697_+|WP_147073258.1|DBSCAN-SWA MPIQSGDVKLLKSAVMADVPEGGGAPTGNTIADGVSNAIFPDISELDRAGGRVNLRKSFVSVQTDDTDTYFGANVIVAEPPQDARVSVTLFSTEKTFDTREQAQVRIEAYLNKGPEWAGYLFENHIAGQRVIQLFQRTTDTVPNVGQTLVLIENEGLGTQKEQYIRATSVSVVERTFTYDGDKDYKASIVTVDISDALRYDFTGSPASRTFTRAANSTKTRDTVVADAGTYVGVVPLTQAAAVGDFTIKGTSIYTQLVPSAQTETPISFVPPYAAAGLPVPGAVAVSYTASHAWTTSIKFNLPGGCLPGSLTIGTDGITIFDDAGLLKTASGTVGTIDYANGILTLNSGTMSNAKAITYTPAAQILRAPQSSEIPVTPESRSQSYVGTVNPVPQPGTLSISYMAQGRWYVLSDSGNGSLKGLDASYGAGTFNRNTGAFVVTLGALPDVGSSLVLTWNVPTQETQQPSTTLKATQSLALNPPAGTAVQPGSLTVSWEYGGTKTSTAATSGVLSGAATGSLSVAQNRVDFAPNVLPAVGTQLTVSYVAGPKQEDSFAHPSRNGAGTLPVTATLGAIEPGSLEVEWNTFTDEAVLGAYTFAQLQEMGIAVSIWRDPTQIARDDGNGGVVLNGISIGTVNYATGQVTFNPDVSIRIPRPVYTAVAINGTGRWRLNYGGIAYVDAPSLYPNDESGYVKLRYNSAGSTSNQTETFQFLPAFKLVPGVNAQVVTGTVLLSISGAQPWGDNGQGTLREFTTSGWVTRGTINYLSGDVALTSWTAGTNNAITRASCVTTVGENISSEFVFRTGAAPLRPGSLSIQYARAVGGTQNVTAGIDGKIEATGISGSVDYETGLVRVRFGTMVTAAGNESQPWYAADRVGTDGKIFRPEPVAASSVRYSAVAYSYLPLDADLLGIDPVRLPSDGRVPIFRPGGFAVVGHTGKITSSVSNGQTINCARVRLSRVRVVGHDGAVIHTGYSTDLEAGTVTFINVSGYSQPVTIEHRIEDMAVVRDVQISGEISFTRALTHEYPLGSHVSSALVAGDLFARVNLVFDQSTWNGAWSDALSGSSATATFNNTQYPIRVTNRGALTERWIVRLTNSTSFEVIGENVGVIATGNTSADCAPNNPATGVPYFHLPALGWGNGWATGNVLRFNTIGAQFPVWVVRTVQQGPESVPDDNFTLLIRGDVDTP 
>NZ_AP021884|1977054:2023954|1996685_1997705_+|WP_058719286.1|capsid|DBSCAN-SWA MQNPFISPAFSMASMTAAINLIPNRYGRLEELNLFPPKPVRTRQVIVEERAGVLNLLPTQPPGSPGTVNVRGKRTVRSFVVPHIPHDDVVLPEEVQGLRAFGSETEMESIAGVLAQHLETMRNKHAITLEHLRMGALKGEILDADGSRIYNLFDEFGIDQQSVDFEISSPTTGTDVKGKCTDVLGIIEEALLGEFMTGVHCLCSPEFFKALTGHKDVKTAFTNWQQGAVLINDVRRGFTFGGITFEEYRGKATDVNKTVRRFIAAGEAHAFPLGTIDTFGTYFAPADFNETVNTMGQPLYAKQEPRKFDRGTDLHTQANPLPMCHRPGVLVRLVMGGGV >NZ_AP021884|1977054:2023954|1990429_1992394_+|WP_147073339.1|terminase|DBSCAN-SWA MNVEYEGAAEIERAWREGLTPDPLLSVSEWSDRHRMLSSKASAEPGRWRTSRTPYLKAIMDCLSPTSPVERVVFMKAAQLGATEMGSNWIGYVIHHAPGPMMAVWPTVDMAKRNSKQRIDPLIEESAALSELISPARSRDSGNTILAKEFRGGVLVMTGANSAVGLRSMPVRYLFLDEVDGYPLDVEGEGDAISLAEARTRTFARRKIFIVSTPTISGASAIEREYEASDQRRYFLPCPHCSHRQWLRFEQLRWEKGQPDTASYICESCDKSIAEHHKTWMLEHGEWRAMISDGTGKTAGFHLSSLYSPVGWRGWRDIAAAWESSVNKESGSAAAIKTFKNTELGETWVEEGEAPDWQRLVERREDYRVGTVPPGGLLLVGAADVQKDRIEASIWAFGRGKESWLVEHRVLMGDTARDAVWKRLAELLAENWTHASGAAMPLARFALDTGFATQEAYAFVRACRDPRVMPVKGVPRGAALIGTPTAIDVSQGGKKLRRGIKVFTVAVGIAKLEFYNNLRKGADVSEDGVTTVYPTGFVHLPKIDAEFIQQLCAEQLITRRDRNGFPVREWQKMRERNEALDCYVYARAAASAAGLDRFEERHWRELERQLGMERPPDEPPPIQAFDPNEATQRGGLSVSANPPRRRVIKSRWLS >NZ_AP021884|1977054:2023954|1978662_1979289_+|WP_024973179.1|DBSCAN-SWA MTAWNDFNDADSQQSGFDLIPKGTVVPVRMTIKPGGYDDPEQGWGGGYATESFETGSIYLAAEFVVTAGDHAKRKMWSNVGLLSKKGPTWGQMGRSFIRAALNSARNVHPQDNSPQAAAARRINGFAELDGLEFLARVDIEKDAKGQDRNVVKLAVEPDHPDYAKLKGVPPKGSPGGGNSGAPAQAAPAYSAPTPQRAPVTGKPSWAQ >NZ_AP021884|1977054:2023954|2013776_2014079_+|WP_147073246.1|DBSCAN-SWA MTEPEQQPALVENMLLLRKEDFDDLLDRAAERGAERCLAHLGLENGHAARDIRELRDLLEAWRDARRTAWQTTIKVATTGILAALLVGAAIKLKLMGGPQ >NZ_AP021884|1977054:2023954|1992902_1993295_+|WP_147073281.1|DBSCAN-SWA MSLASSIAALAARIGFEVKTKIDATHPGIARVWVSFGYVGGQVVIASAHNVASVVRTAAGRYRVHFAVAMPDANYCWTALARSSTNTGQQRLALVRASSDLKTAQYVDVSCATAASSFDDSSEINLVVYR >NZ_AP021884|1977054:2023954|1993294_1993516_+|WP_146463160.1|DBSCAN-SWA MAYTEAQLQALETALAKGEHRVSFGDKTVEYRSVDELKAAIREVKRGILEQAAATGLWPGAPRQIRVTTSKGF 
>NZ_AP021884|1977054:2023954|2000670_2004714_+|WP_147073261.1|DBSCAN-SWA MAKRISILVALEGADEGLKRAITSAERSLGELSTTAKTAGAKAAAGMAEVKAGMSAFGDQVATAKTQLLAFLSISWAAGKVQEIVQIADAWNMMSARLKLATAGQREFTTAQAALFDIAQRIGVPIQETATLYGKLQQAIRMLGGEQKDALTIAESISQALRLSGASATEAQSSLLQFGQALASGVLRGEEFNSVVENSPRLAQALADGLNVPIGRLRKLAEEGRLTADVVVNALMSQKDKLASEYAQLPQTVSQAFERLRNAFGQWINRVDESTGLTKKLAEALTILANNLDTVMQWLKRIAEVGLAVLIYRLIPALITAWQTAGAAAVTAASATAAAWTTANLSVSAAVASVGLLKTAFAVLGAFLVGWEIGTWLSEKFEIVRKAGIFMVEMLVKAVEQLRYRWEAFAAIFTSDTIAEATQRHEARLAEMNQIFAQMYADATKGADAAKGAMNTAATAAEEIAKRLEAVRQGTQEAVGRGIEAVHSALEKLKSRLGEVEQAVGKANQTVNDATAKMAEAYKGLTSIVEANLLRQIEAVKARYQQEQSALETSKQSEAALITKSTQLLTEALTQQTTLRRQSTTDTLKLIDDESKARIESARRQGQTEEERRANVQRVENDILATKRQTMTQALAEYRQHIDALNAEANRHLTEIKRIEEEKRQLSMTTEERVRDIRRQGMTDFEATEDRKRQIAEYQGKAREALANGEFEQARQLAQKAMDLAAQVASSQTSEAKRGEDARKQSEQAVSQVTQLESQSRDAYRKQEYAQAEALMRQADALRAELAQKTKDADAQIAQGKDGVNQAIQRIRESEEILNKTLDAEAKAHQTAAQSALTARDQIQQTLTQTETQIDQITAKLKDGLKVTLDADTTRFDKAIADLDKALAEKEYLLKIQADLQEAEKKLQQYEQLLKEGKTLPVDADVSKAKEALDKLKTYADQNSQFELKVATEKAQAAITKVEGMIKALDRIQTESRHQVSTNADAARSEIMSLNWANTSSTHTIYVRKVEANATGGLVGGGVRRYADGGAVAPAFPRMSGGSVPGSGHHDTVPRTLDAGAFVIRKAAVQKYGGGALSRLANGVARFATGGAVMLGGGKRPSGNDADGTPSTPKKNREAVEAMKMIDLGLQGMNEYTNWLQWNYGASVSLDMRSKTMDSYGKQAQQDRRALEDFISRKTLTGNERQNLERIKQTWRQAMAQPLLWGKDLERELIDYMEQNQGEFYRRGGMAKSDTVPAMLTPGEFVVNKDAVSRYGAGFFEAINNLSAPAQALAGRALAGVQGFATGGLVQPSGSRLARPVLAADAGPSRTVRVELSSGQQKVNATVDARDESRLLQLLDAARARTA >NZ_AP021884|1977054:2023954|2015009_2016134_-|WP_147073241.1|DBSCAN-SWA MGVVAEQVESASVDANGYIGRIPVRNLWLLMLYASELFRTRGIGKVGLEDNPDDLPDLVAEILAHAVEVRQRRRLNLGYRSRDAVLNRVRGRIDVLNTERHRLLDRGLVACRFDELTVDTPRNRFVRAALETIARIVQRKDVSHRCRSLASGMKAMGVSGDVPTRAQMSTDRFGRNDADDRFMVAAAKLAFDLALPTEVAGTNVLTLPEREVTWVRRLFEKAVGGFYDVVLQPQGWRVRCGGTLGWQIEQKTSGIDKILPTMRTDVVLDHAPSGRRIVIDTKFTSIVTSGWYREETLRSGYVYQIYAYLRSQVGRGDPLADDASGLLLHPSVGDAVDETVVIQGHGIRFATVDLTASPGEIRSQLLRLCEPALQ >NZ_AP021884|1977054:2023954|1997701_1998004_+|WP_147073273.1|DBSCAN-SWA 
MSLVAQIYESAANAGLLKECLWYPSNGAPSQLHQIGFAAPDESLLDGLALSTDYEMTYPVTAFGGLAVREVVEIGGTSFQVRDIRSLSDGSEIRAKLTRL >NZ_AP021884|1977054:2023954|1998651_1999404_+|WP_147073268.1|DBSCAN-SWA MSTYASFQGRVFLGKRDTDGLPIEVRSPGNVAELKLSLKTDVLEHYESQTGQRSLDHRMVKQKSATVNLTIEEFTKENLALALYGNHVVGTPGTVTAEPVGGATPIAGDRYFLAHPKVSSLVVTDSAGTPATLALGTNYTADPDFGALQFLDTTGFTAPFKASYAYGVATEIGIFTQALPERFLRLEGINTAQGNAKVLVELYRVAFDPLKEISFISDEYNKFELEGSLLADTTKPFDAVLGQFGRIVQL >NZ_AP021884|1977054:2023954|2009989_2011075_+|WP_147073254.1|DBSCAN-SWA MSYPLSESFATAPATGYTAVLGGMAATHNNVQQSIDISAPNSQSILRFNETAHGDFWFEADVEFLTDPSARKHIGLWMTTGNGSEGYRFAHIDGAWSVTRWNSGFGDGAAVTGGVNDGAKPVAGVIDVAPTFNVGQRMPLRCEVIVGAFDANGVPWARLIQFKAGGVLMFQVGDAAYRGKLIPGVFLYGATARVHAIAGDTPSGLPAFPATVGVNAADDLLPLAGGSTSVPPDPAANIAVNADCDLMRLNSPNSELWNRGGGYDWHFHAIPNGRKNIHFSGHGFIAGTVKEKGQPDQPLVRRVQLVSENTRVLVAETWSDTTGAYRFELIDPAQRYTVVSYDHKQMYRAVIADNLHPEMMP >NZ_AP021884|1977054:2023954|1999813_2000017_+|WP_147073264.1|DBSCAN-SWA MIEHGHRLPDILDYTLAQVRGFVVATARTDAARDARLLSVIAIGTRSDARQLDQTLDRLTDKATDRA >NZ_AP021884|1977054:2023954|1998008_1998455_+|WP_147073272.1|DBSCAN-SWA MADNSIRERILLAVMAAARPAVEGLGATLHRSPTVAISRELCPALAVFPESESITERANDRVTRELTVRVVALARAVPPASPETEADRLLTAAHAALFGDGTFGGLALGIREQESEWEVEDADAVAVALPARYRLTYRTLANDLSTLG >NZ_AP021884|1977054:2023954|1996290_1996668_+|WP_147073275.1|head|DBSCAN-SWA MPAMQEPINLGDLLKYEAPNLYSRDRVTVAAGQTLPLGTVLGQITATGKVKQIDPSATDGSQYSAGVLMQDADAALADRNDGLMVARHAIVSDHALHWPTGITTAEQQAAIQQLKALGVLVRIGA >NZ_AP021884|1977054:2023954|1984720_1985137_+|WP_147073297.1|DBSCAN-SWA MAEWTTDDVAARFEEAATTGRRLPPVRVQGYFNCWPAFVRKEWEAFAADEKVYRPFPPSPEAIDRMLETMRWVQWLEVEQRHLVWMRAKRYGWRDITIRFACDRTTAWRRWQRAMEIVATNLNSEGVRLPSKNVGNLG >NZ_AP021884|1977054:2023954|1977054_1977318_+|WP_024973176.1|DBSCAN-SWA MQTQVPSIESGRNPRRMNPGGATCIALDENELAIRWGLSVKTLRRWRQEQLGPIYCKLGRRVTYLLHEIEAFERRVSRYSSFTRAYQ >NZ_AP021884|1977054:2023954|1992411_1992903_+|WP_147073283.1|DBSCAN-SWA 
MSLATRIESLVIRVAQEFNDVRATAGSLASLSTNDKSSLVAAINELKAAVLSAMAIDDNQIATTSTYSSNKIVSLLDALKTDILGGADAAYDTLVEIQQALQSGTSGLDAILAAVNLRVRFDAAQTLTVAEQLQARTNIGAVAVSDVGNTDTDFVVIFDGALA >NZ_AP021884|1977054:2023954|1984518_1984728_+|WP_147073299.1|DBSCAN-SWA MKVSTPQYRCPLGRLQPQTTDLDAIKERGWRDQHILVVNASDDRLDFIEREIVRRIGERLYGLGGTRHG >NZ_AP021884|1977054:2023954|2004724_2005132_+|WP_147073260.1|DBSCAN-SWA MQLTNLDAGVALPLPDDLLWSDEHAWSPAVATTSYLITGALLIQSATRQAGRPITLVGAPDMAWVTRATVEQLQAWAALPVGSATGRFGLTFSDGRSFTVAFRHAETAIEAEPVLGIPARAATDFYRLTLRFLEI >NZ_AP021884|1977054:2023954|1988093_1988462_-|WP_147073293.1|DBSCAN-SWA MNTNQQMPATQNDAWGFWGTMNEHASTAWPLAMTAISDATGQPLESVRVFLDSRHGRHFADDLQNGLYRGQTLADAINAATQQWMGWTIGRQTSKQYGIPRGLPYLTGFVIHCEIAEESIAA >NZ_AP021884|1977054:2023954|1998460_1998655_+|WP_147073270.1|DBSCAN-SWA MTQLVLTRPHTHAGKTYGVGDRIEIDATSADWLIAHDIATPEPTAPTAEPVPEPKPLQRKEPKQ >NZ_AP021884|1977054:2023954|1979299_1980154_+|WP_147073311.1|DBSCAN-SWA MNASVLTASHYGVVRFGDLQCEAVVLKGGERGYVRRQLAKLLGFHETHKGGRFARFLADFAPKSLSALEKTREPILLPSGRQAQFFPAGIIADVASAVVSAAINGTLHKARQGIVPNCMKIMRALATTGEVALIDEATGYQYHRAPDALQELISKLLRQSCSSWERRFHPDYYRALYRLFGWKYQGHDQNPPHVVGQITQRWVYGPVLPVTLIDEIRARKGISQKHHQWLSDQGLARLETQIHAVTAIARSSTCYRDFDRRCEAAFAGGALQLALLAEDFEEGA >NZ_AP021884|1977054:2023954|2011470_2013486_+|WP_147073250.1|DBSCAN-SWA MPAVLNEVTLVAALPAPTASVAVGPPLVDLLFDQPAATDANLVFGANYIAPRDDVVVLASLPLPVVAIKFIPPARAALLAELPALTVTTLLLRPSVPLDVTGASLPGVVFSGEVRYYSRTQRPTVGQTAHAWQVAAQTEDGSTQGQQDAAATPAGWDTFWRRTLGVPQGIEHRLPPVLAAAPEQRGARHQDATRLQDSTWFAHQDATRFAATRQGLFQNAGPLRDTTRFRHQDGDRTKRAGRVSFWQIARLLTERQGSDFQIASPSLKGWSVRYQDAVPPPLGISVWVVPQPPAPIPCYTPSAHLLFAALAPADSHLLFVCENHINPPPPDGEPVVVPVRRVYFVINNVTLYRVSDGAPVPVFNLSLSLDASSWAWGFDAVLPAKAEALVAGSASGPVELVASVNGTPFRVLAESISRERIFGDASIRISGRGRNAVLAAPYAPVMTFSNTEGRTARQLMDDVLTVNGIPLGWAVDWGLTDWNVPAGAFAQQGSWIDALTAIAGAAGGYLIPHPSAQSIRVRHRYPVAPWEWSTVTPDFVLPVDAVARESLRWLEKPAYNRVFVSGQDVGVLGQVTRAGTAGEVLAPMVVDPLITEAAAARQRGVAVLADTGHQLEVSLRLPVLAETGIIEPGAFVEYQDGSVTRLGIVRATQVEAGLPEVWQTLGVQAYA 
>NZ_AP021884|1977054:2023954|2008720_2009986_+|WP_147073256.1|DBSCAN-SWA MIDLTVKYFNSGMTGAPQISNNWGDLVTMLDACLVNGFALKAIDTLTFADGIATATISTGHAYRPFQVVEIAGAEQPEYNGSFRVLSTTTTAFTYAVTGAPVSPATTTTNLSAKVAPLGWEKPFAGTSKAAYRSKNPQSPQNILLIDNSLKTPNYTTGWAKWANVGIVEDLSDIDTIVGAQAPYDPNNPTQNWKQVTASQWGWYKWFHARGPQYESNGDSGGGGRNWVLIGDDRLFFLFCTNAAGYGWYGRNSYCFGDLISFKPGDNYATVLAADDNYSGMSNYWSYPGQFSGYGLVSSLDFTGKVLLRNHTQLGNPVRFGLTSLNTNNGQQICGRGPMPFPNGADYSLWLLPTYVRQEDGHMRGILPGMLWMPQDRPYSDQTIVDNVVGQAGKRFLLVRTQYSSETEGAQIAFDITGPWR >NZ_AP021884|1977054:2023954|2018373_2019096_-|WP_147073237.1|DBSCAN-SWA MNTSAPQIGFDRFIQLDWAAAALRVRAGTAGLDDLNALLDAAELGVEAKKKTRTVLNRLWLEPRAELVDFADRAAALYKSQPDTPIPVLCWGVAIACYPFFGKVSELVGRLSAIQDDCAAAEVHRRMSEIYGEREGTRRMTNMVIQTQASWGAVERVEKGKRVIRLAATPIDSDELTVWLIEAAVRYVGKPVSVHSLQSLPVLFPFRLTRPLAYVVSNSPYLDLRSEGPSNQFVALRSTI >NZ_AP021884|1977054:2023954|1986858_1988130_+|WP_147073295.1|DBSCAN-SWA MTASWFADKIEKWPTAKLLPYARNARTHSDDQVAQIAASIAEFGFTNPILAGSDGVIVAGHGRLAAAQKLGLAVVPVVVLDHLSPTQRRALVIADNRIAENAGWDDAMLRIEIASLQDDDFDVSLTGFDADALAELMAGDEPDGEGETDDDAVPELSETPISRPGDVWSLGGHRLLCGDSTVTESYDRLLDGEQVDMVFTDPPYNVNYANSAKDKMRGKDRAILNDNLGDGFYDFLLAALTPTIAHCRGGIYVAMSSSELDVLQAAFRAAGGKWSTFIIWAKNTFTLGRADYQRQYEPILYGWPEGAQRHWCGDRDQGDVWNIKKPQKNDLHPTMKPVELVERAIRNSSRPGNVVLDPFGGSGTTLIAAEKSGRLARLIELDPKYADVIVRRWQEWTGKQATRESDGALFDDQAAIDSSAISQ >NZ_AP021884|1977054:2023954|1999415_1999817_+|WP_147073266.1|DBSCAN-SWA MSDLDTLIPQAVELVIDGEPLAIKPLKVGQMPGFLRAMSPVMQQLTASNIDWLALFGERGDDLLSAIAIAVGKPRAWVDELAADEAILLAAKVIEVNADFFTQTVIPKLDGLFGQVKLPPIVKAAAGSMPSST >NZ_AP021884|1977054:2023954|1981632_1983924_+|WP_147073303.1|DBSCAN-SWA 
MIDFNDTTQPAEHNRESERDEIRADLLARLESVLTTMFPAGKKRRGKFLIGDILGSPGDSLEVVLEGEKAGLWTDRATGDGGDIFALIAAYLGANVHTDFPRVLDEAADLLGRSRSVPVRKAKKEAPVDDLGPATAKWDYFDAGGKLIAVVYRYDPPGGKKEFRPWDAKRRKMAPPEPRPLFNQPGIGAASHVVLVEGEKCAQALIASGVVATTAMHGANAPVDKTDWSPLAGKTVLIWPDRDAPGWDYADRASQAILQAGATSVAILMPPDDKPEGWDAADAIPEGFDVGGFLAVGERMPVMRSVEEAPSPDLLTGIDWTTEDGLSSAFTRRYGEDWRYCALWGKWLVWTGVRWNPDQVLYVSHLSRGICRNASLKADTPRLKGKLASSATISSVEKIARSDPKHASTAEEWDADVWALNTPGGVVDLRTGRMRPHRRDDRMTKVTTATPQGNPDSACPTWRGFLTDVTGGDADLMAYLQLMVGYCLTGVTSEHALFFLYGTGANGKSVFVNVLTTILGDYAANAPMDTFMEARNDRHPTDLAGLRGARFVSSIETEQGRRWNESKVKAITGGDKVSARFMRQDFFEYLPQFKLVIAGNHKPSIRNVDEAMKRRLHLIPFTVTIPPERRDGRLTEKLLKERDGILAWAVEGCSRWQSQGLKPPASVVSATEEYFEAEDALGQWIEERCLLAKSHREGVSELFADWREWAERAGEYVGSVKRFSELMATRKFDKCRLTGGARAIAGIALRPKPYSHAYPYRDD >NZ_AP021884|1977054:2023954|1980150_1980651_+|WP_147073309.1|DBSCAN-SWA MKCWVCKRQARGFGHTDNRHGIGDPRRYPIDWVFCSQRCQSAFHAMYGNWSRAKDGRSDIKGVAMIDPSDIELAAMRKCLKSFGEAASEIGFTKPLGNYSEAEALQVIDAIVTCYTEAMVEHHEASKYPPVRGMTPTPDPMTPSAANPFADLDDDLPWEEPKGKKP >NZ_AP021884|1977054:2023954|1989572_1989770_-|WP_147073287.1|DBSCAN-SWA MSKFEQLLTQIAQNKLGIETLETRKSDSLDFHDVAVWCLRDALEAAFNAGVEQGRKATKSDKANS >NZ_AP021884|1977054:2023954|1989887_1990427_+|WP_147073285.1|DBSCAN-SWA MGISIRAYARHRGVTDTAVHKAIRAGRITPEADGTIDADRADREWARNSDVPKTGTRAKAAKVAVPEGGTGVGGDGPAALPAGGASLLQARTVNEVVKAQTNKVRLARLKGELVDRPQAIAHVFKLARSERDAWLNWPARISAQMAAKLNIDPHTMHVALEAAIREHLQELGELRPRVD >NZ_AP021884|1977054:2023954|1993515_1995027_+|WP_147073279.1|portal|DBSCAN-SWA MAWYSKIRSLFGQQPVHEAAGRGRRSLAWMPGNPGAVAAMLATNTELRIKSRDLVRRNAWAQAGIEAFVSNAVGTGIKPQSLAADERFKTDVQALWRDWTEEADAAGQTDFYGLQALACRAMLEGGECLIRLRPRRPEDGLVVPLQLQLLEPEHLPISLNLDLPSGNVVRSGIEFDSLGRRVAYHLYRSHPEDGRLAPMSGQGGMDTVRIDAKEIIHLFRVLRPGQIRGEPWLSRALVKLNELDQYDDAELVRKKTAAMFAGFVTRQNPEDNLMGEGAADGDGIALAGLEPGTLQILEPGEDIKFSDPADVGGSYGEFLRTQFRAVAAAIGVTYEQLTGDLTGVNYSSIRAGMLEFRRRCEMVQHGVLVHQMCRPVWAAWMKQAVLAGAIDAPGFARGGPARRRRYLQVKWIPQGWQWVDPEKEFKAMLLAIRAGLMSRSEAISAFGYDAEDVDREIAADNQRADDLGLIFDSDPRRTSKDGGSAEPNKNAADTTQTGSSSSA 
>NZ_AP021884|1977054:2023954|1995036_1996272_+|WP_147073277.1|DBSCAN-SWA MTLLPHLAARLYGVPLAIHRPKLDVILAVLGPRIGLADLAAPSGFTPPARPASTQTTKVAVIPIHGTLVRRTVGLEAESGLTSYAGLTAQLDAALASPDVAAILLDVDSPGGESGGVFDLADRIRAAAKTKPVWAVANDMAFSAAYALASAASKVFVSRTGGVGSIGVIAMHVDQSEKDAQDGVRYTAVFAGDRKNDLNPHEPISSEAHAFLKGEVNRVYGLFVETVARNRGIEASAVRDTEAGLFFGQAAVAIGLADAIGTFDDALAQLCESVSPLPKLAASHSGLFSNPQMESSMNDRTDPAAPDRLAADPAGSPSQPAAATAMTVADAIEVAQTCTLAGRTDLIAGFLEAKAPPAKVRSQLLATQAEASPEIVSRIDPQSAMSASSTGHPASSHNPLIQAVKSRLGTK >NZ_AP021884|1977054:2023954|2014075_2014552_+|WP_147073244.1|DBSCAN-SWA MIETLLGGLLGGVFRLAPEILKWLDRKGERGHELAMQDKALEFEKIRGAQRMAEIGASAEAAWNVGAVDALREAVRTQGEKTGVRWADALSISVRPVITYWFMALYCAAKTAAFAAAVTAGSGWGTAILHAWTEADQALWAGVLNFWFLGRVFDRVRS >NZ_AP021884|1977054:2023954|2000022_2000667_+|WP_147073263.1|DBSCAN-SWA MRISVQIDSAAAQAQLRRWGGEFRDKVKKAVSRAIASEAVELKQDVRSHVASQMAVVKKSFLKGFTAKVLDKDLNRLPALYVGSRIPWSAMHETGGQIAGRMLIPLNGRVGRKRFKAQVAELMRGGNAYFIKNAKGNIVLMAENIKEHDRPLAGFKRRYRKAEGIKRLKRGADIPIAVLVPKVVLKKRLDVERLVASRIPRLAAAVENQISTVD >NZ_AP021884|1977054:2023954|2019092_2021114_-|WP_147073235.1|DBSCAN-SWA MSQWIERILSRFTADLDRLWVACDPDDVLLDERLLAELRSRGFEVLLYEDPFVFRTEFEERYRAAWDRGEPGPTPAVVVHWRGADPNELPWDLGHYGRVVSLGLAQLFPRLAYNVVKQLEPEHFAALLEAHDTELQGIRGENESKDFILERVYHLAPRSSIRTESDFWRDVLRMHFANRALPLVFAEHAASIIQSKGLLAGLPVAAWLSSKSALLRVVQEAWHRYLANLGMEGSRIGEPPPPDYVAKVDIPFAHSDVQSIVDSMFLNGSLHPLVVEVVPADAPGWIKAGIVQDPQARNAFVKKGIAKLIDAIPSATATHKAWSEFAKQYGETLARVHDLGNAYGTDGLAEVQALVKTLQEQSDAQLHAWVAAKHYADLSTLSFHNGPVMVHRIPDYLSSRRKALGADKIALLVFDGLALDQWVQIRERLVEATKRFAFDEGTSFAWLPTVTSVSRQALFSGRKPREFEESIGHTNKEEYLWKAYWQEQGVKPGEIYYQRSLRQIEQLDALQAALDDRRPKVVGLVVDEVDDRLHKERSKQDVALWIANWLKTGFVDRLFAMLLDRGFHIYLTADHGNVEAVGVGRPSEGDVPEARGERVRVYRSESLLAKSAAANANSVHLDIAGLPAGYMPLFAGGRTAFVPDGEQVVVHGGISVEELIVPFVKVKYVIGNE >NZ_AP021884|1977054:2023954|2011071_2011467_+|WP_147073252.1|DBSCAN-SWA MTVAITVEHNEARLAGTLAFLDAGSNPARLRIYGGTRPANPATTPTSAMLVEIRLTKPAGTIAGGLLTLTQQEDGLITATGIATWARLVNGNEVTALDLDCSGTDGSGDVKLASTNLYLGGDARMVSAILG 
>NZ_AP021884|1977054:2023954|1977808_1978657_+|WP_024973178.1|DBSCAN-SWA MKRLPIVSAVERMAERKGVKLLMLGKSGIGKTSRLKDLDPATTLFLDIEAGDLAVADWPGDTIRPASWPESRDFFVFLAGPDKSLPPESAFSQAHYDHVIEKFGDATQLGRYQTFFLDSITQLSRQCFAWCKTQPGAVSDRSGKPDLRAAYGLLGQEMIGALTHLQHARGKNVVFVAILDERLDDFNRKVFVPQIEGSKTSLELPGIVDEVVTLAEIKAEDGSSYRAFITHTVNPYGFPAKDRSGRLDLLEPPHLGALIAKCAGAVPALASAANPAHIESQE |
50 | Acidithiobacillus_phage(45.45%) | head,portal,terminase,capsid | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2044160 : 2053121
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_AP021884|2044160:2053121|DBSCAN-SWA ATCAGTGTTTCAGCCCCAGGGTGTGGCTGAGGAATGGCACGCCGCCATACCGTCCCATCAGGTAGCCGCGTTCCAGTGCCTCGTTCTTGCGCGGGCTGAATTCGGATAAGCTGACTAACGCCGAGGTCTGATCACGGTCTTTTTCTACGAACCACACTTGATCGCGTCGGAACAAATCTGGTGCGTCCAGCAGCGACGTGTCGTGCGTCGTGAATATCAGTTGTGCGCCGCCGGTGTTGATCTCTGGGCGGTGGAACAGTCGCACGAGTTCGCGCACCAGCAAGGTGTGCAAGCTGGTGTCGAGCTCGTCGATGACCAGCGTTAGCCCTTTGCGGAGGATGTCGAGCACTGGTCCTGCAAGAAACAGCAAATTGCGCGTGCCGTTGGATTCGTCCATCAATTCAAACACGGCTTTGCCCTGTTCGGTGACGTGATGGAAGCGCAGCTTGTGTTCTTCCATTTCTTCCGAGCGCACTTCAGTTTTACCGGCCACCAAGTCAAAGTGAACGGCCTGCCCAGGGACTTTGCGTGTTTCCACATCAATGTCGGCGATGCTGATGTCGGCAGCAGAGAGGAAGTTGCAAATCTCCTTGCGGCCGTCATCCTGCTTGAGCATCTGAATGGACACCTGTGGACTGAGTTGGGCTTGCTCGTTGAAGATCACCAGGCGATTCACAAACCAGTCGAACACCGGTCGCAAGGCTTCGCTGTTGAGTTGTACGGCCATCGACAGGAAGAGGGCGTTCGGTCGGGTGGCACCTTCCCACAGGTTCTTGGGTCCTTTCAGGCCGGGACCGAAGTCGTAGACATCCTTGCCGGTCTCAGTGTCAAAGCGGCGTGTGAACCAACGCTGGGGCTTGAACGCCTTGTAAACCAGCAGGTGCTCACTGACGATCCGCTGAGCTGTCATGGAAAAGCCATACTGGTAGCGCACACCATCGAGCAGGAACGTGACTTCGAACTCGCTGGGTTGACTGGCGGAATCGACATCGAGTCGGAAGGGCTGAACTGCGAAGGTCTGGCCCGGCTGGATCGCCGTCGCGGACTCGGTCACCACGCCGCGCATGTACTGCAGGGCTTTAATGAGATTGGACTTGCCGCTCGCGTTGGCACCATAGACCACAGCACTGCGCACCAGGGTGGGTGCGGCACTGATGCCGGTGGCTTGGGTGTGGGTGTCCTGCAGGGTCTTGTCCTTGGACGCAACAAGACTAAGCACCTGCTCGTCGCGCAGACTGCGGAAGTTCTTGACGCGGAACTCGACCAGCATTGCATTCACCTCATTTTTATAAAATAGAGTCTTATTATGACTCAAAAGTGCAAAAAACAAATATATTTTTGAATTCTGGGGTGGTTTGAGTGATCTGGGTTGGCATCGCAGGCCTGCCGGGTGTCAAGTCCCCGTTTGAACTTCGCTGGAGCCTTGACGTGGCACCCAGCTGCGGTTCGGCCAAATCGTCGAATCCGCGCCCTAAAGACCCGTGCGTTACTATCTGGTTCTGAAAGCGGACGCGGGTTCGGTTCCGTGTTCCACCACCAATAATGAAACCCCAACCGTTCTCGGTTGGGGTTTTTTCTTGCCTGATCGCGCCGGTTTCCGCGTGTTGTTGGGGCTTCCTGCGGAAGCCTGCGGACTGCACCGGTCAGCCTTCCAGCCCGTTCCGGGCCACATTCCACTCTCTCCTGGCCATTCCTCGCTCCGACCTCGCTCCCTGGAACTGGCCCGAAGTCCGCAAAGGCCGCAATTCAGACCTATACAGATCAAAGAGTTACGCGCGGACTAATCAAGTGGTTGGATTGCGGCATTGGCTACCGGAGGGGACACACGCTTGCCCCAGCGAGCTATTGCACTTGCAGTTTGTCAGAAACTCGCATATATTTTCGTACATGAACACGACGACCAAGACTGCCGAGCTGATTCGC
GAGCGCATCGAGGCGATGCCGATCGGGGAGCCTTTCACCCCGACAGCATTCCTGGAGTGCGGCACGCGTGCGTCCGTCGATCAGACCCTCTCCCGCCTCGTCAAGGCAGGGTTGATCGAGCGCGTGACGCGCGGTGTCTTTGTGCGTCCCGAGGTCAGCCGTTTCGTCGGCAAGGTTAGCCCCTCGCCGCTGAAGGTGGCCGAGACCGTCGCCAAGACCACGGGTGCCGTCGTCCAGGTTCACGGTGCCGAAGCGGCGCGTCGGCTCGAACTAACCACGCAGGTTCCGACCCAGTCGGTATTCGTGACATCCGGCCCGTCGAAGCGCATCCGCGTGGGGAAGATGGAGATCCGTCTGCAGCACGTCTGTCAGCGCAAGTTGGCCCTGGCAGGTCGACCCGCCGGGCTCGCGCTTGCGGCGATGTGGTATCTCGGCAAGAAGGAAGTGACGCCGGCCCTCGTCGAGAAGATTCGGCGCAAGCTGGGATCGAGCGAGTTCGAGGTGCTGAAGTCAGCCACCAGCTCGATGCCTGCGTGGATGAGCGACGCCATCTTCCGAAACGAGCGGATGGCCGCTCATGCCTGAGTCCTTCCTGCACCTGAAGCCTCAAGAGCAGTCCCAGATCTATCGGGCACTGGCTCCGCAGCTTGCCCGCACGCCCGTCGTACTGGAAAAAGATGTCTGGGTCTGCTGGGTGCTGCAGACCCTGTTCACCATGCCCGACCGACTGCCGATGGCCTTCAAGGGCGGCACATCACTCTCCAAGGTGTTCGGCGCCATTGCGCGCTTCTCCGAGGACGTGGACATCACGCTCGACTACCGTGGCTTAGACGGCTCCTTCGACCCGTTTGCCGAAGGCGTCTCACGCAATCGGCTGAAGAAATTCAGCGAGGATCTCAAGTCCTTCGTGCGCGGCCATGCCCACGGTGTCGTGGCGCCGCACTTTCAGAAGATGCTGGCGGACGAGTTCGATGCCGATGCATTCCAGCTTGAAGTCAGCGATGACGGCGAGCAGATGCGGGTGCACTACCCGAGCGTGCTGGAGGCACCAGGAGACTATGTGGGCAACAGTGTCCTGATCGAGTTCGGTGGCCGTAACATCACCGAGCCGAATGAGGAGCGTGAGGTGCGACCCGACATCGCGGAACATGTCGCTGAACTCGATTTCCCTCGCTCGACGGTCAGTGTGCTGTCTCCGACACGTACCTTCTGGGAAAAGGCGACGCTGATACACGTCGAGTGTCAGCGCGACGAGTTCCGCACAGGCGCCGAACGTCTGTCACGCCACTGGTACGACCTGGCCATGCTGGCCGATCTTGCCCATGGGCAAGCCGCTGTGGCCGATCGCGCTCTGCTCGCGGATGTTGTCAAGCACAAGAAGGTCTTCTACAACGCGAGCTACGCCAACTACGACGCATGCCTGTCCGGGCAGCTCAGACTAATTCCGGAAGATGCTGCACTGGCCGCGCTGCGCGATGACTTCCAGCGCATGATCGGTGCCGGCATGTTCATCGGCGAGCCTCCCGCCTTCGATGCCATCGTCGATCGCCTGCGCGCGCTGGAAACAACAATCAATCAGTGACCTCCCGCTGGCGTTGAGTCGGCAATCCTCGCCCGCTGCTCATCCCACAAGGCTGGCGGATCAACCGCCAGATCAAACAGCGTGATGTGGTTCGGCAGTGCGTCGTCCAGGATGGCGGCCACGATGTCGGGGGCTAGCGTGGTCAGGTTGACCATACGGCTGACGTAGCTGTTGTCGATGCCTTCCCGTGTGGCGATCTCCTTCAAGGACTTCGCTTCTCCTGATTCCAGCATCGCCAACCAGCGGTGGCCCCTGGCCAACGCCAGCTGGATGGAGGTCGGCGCCATGTCCCACGGTCTGACCGGCGCGGTTTCTCCGTTCGGCAAGGTGACCAGCTTGCGGCCGCTACGGCGCTTGATCTGGATCGGTACAGACAGGGTCAGCCTGCCGTCGCTGGTCTGCAGGATGTCCGGCTCGC
CGGTTTTCTGGATGCGGATGTCGCTCATGCCAGGGCCTCCTCAGTTTGCTCGACCGGCTCGGGACGCAGTTCCAGCACCAGGCGTTCGATGCCGTTGGTGCGCAGGCGCACTTCGAGGTCGTTGGGTGACACGATGACTTTCTCGACCAGCAATTTCACGATCCGGGTCTGCTCCGCCGGGAATAGCTGATCCCAAATCGCATCAAGCCGGGTCATGGCCACGGTGATCTTGGCCTCGTCCAGCGTCGGGTCGAGCTTGATCGCCTGTGGCAGCATGTTGCCGAGCAGATTCGGGGCATGCAAAATCGCGCGTAGTTGATCGAGTACCGCCGACTCCAGTTCTGCGGCGGGCAGTCGCGGCAGCCCCGAGGCACCCGCGTGTTCCTTGGCGTCGCGCTGGGGCACGTAGTAACGGTAGCGCCGGCCATTCTTCTTGGTGGTGTGCCACGGCGACAGTGCGCGGCCATCGTTGCCGAACACGATGCCCTTGAGCAGATAGGGAACCTTGGCCCGCGTCTTGTTGCCCCGCACCCGGCCATTCGTCTCCAGGATCGCGTGGACGCTGTCCCACAGTTCGCGGCTGACGATCGGCGGGTGTTCGGCCTGGTACCACTGGTCCTTGTGCCGCAACTCGCCAAGGTAGGTCCGGTTGCTCAGGAGCTTGTAAATGTGGCCCTTGTCGATCGGCCTGCCATCGCGGGTCTTGCCGTCTTGTGTGGTCCACGCCTTCGACGTCACGCCATCCAGTTTCAGCTCCTTGACCAGTGCGGTGCTGGAACCGAGTTCAACGAAGCGCTGGAAGATGTGCCGGATCAGCTTGGACTCACGCTCGTTGGGCACCAACCGCCGGTTCTCGACGTCGTAGCCCAGCGGCGGCACGCCACCCATCCACATACCCTTGCGCTTGCTGGCCGCGATCTTGTCTCTGATGCGCTCACCGGTGACCTCGCGCTCAAACTGCGCAAAGGACAGCAGGATGTTCAACATCAACCTGCCCATCGAGGTCGTCGTGTTGAACTGCTGGGTGACCGACACGAACGACACGCCATAGCGCTCGAACACTTCGACCATCTTGGAGAAGTCCGCCAGGCTGCGCGTCAGGCGGTCGATCTTGTAGATGACGACCACGTCGATCTTGCCGGCTTCGATGTCCGCCATCATTCGCTGGAGCGCCGGGCGTTCCATGTTGCCGCCGGAAAAAGCTGGATCGTCGTAATCGTCGGCGACCGGTATCCAGCCTTCGGCGCGCTGGCTGGCGATGTAGGCATGGCCGGCGTCGCGCTGGGCATCGATGGAGTTGTATTCCTGGTCCAGCCCTTCATCGGTGGATTTGCGCGTGTAGACCGCACAGCGCATGCGGCGCTTCAAGACTTCGCTCATCGTCCACCTCTCTTCTTGGTGGACGGCTTGGCCTTGGCGTTGGACGGCGGCTTGAGCCCGAAGAACAGCGGCCCCGACCAGCGCATGCCGGTGATTTCGCGGGCGATCATCGATAGGCTCGGGTACATGCGTCCCTGGAAGTCATACTGGCCGTCGGCGGTTGCGATCACGCGGTATTCGACGCCTTTGTATTCCCGGACCAGCACCGTGCCTGCCGCCGGACGGTAATCGCGGTCACGCTTTTTCACCTTGCCTGTTTCCACCAGAGATGCGATGCGACGCTGGTTGCGATCCAGCAGGTTGGCGTCGGCCTTGCGGAATTCCAGCTCCTGCAGCCGGTAGGCAATCCGGCGTTCGAGGAACTGGCGGTTGTGGGTGGGAGTGTCGCCACCGACCAGCTTCTGCCAGAGGGCCCGGATCTCTGCCATCGGCATCTCGGGCAGCCTGGCGATCTGCGCCGCCACCGATGGCGGCGTGGAAAATGATGGTGTTTGCGTGCTCATTTCGACTCCGTAGTTGTCTTGTTGACGGGGTCTGTATGAACGCGCTGGTTGCCAGAGAAGCCAAGCTCAAACTCGCTCGCTTCTGCCCTGGTTGCGGACTGTTCTGTGCCGGTGATA
CGCAAGCGTGCCAGGCCGTTGGCCAGCAACGACGCGATCTCGTGACGACGCTGTTTAGCACTGGGCGTCTATCCTGACGTGACGCTATCTGTTGCGAGAAACAAGCGATATGAGGCACGCAAGCTGCTTTCCAATGATGTAGACCCGGCTATGCTCAAACAGGTGACTAAGCGCGCATCGCGCGTGTCTGCTGAAAACAGTTTTGAAGCAATAGCGAGAGAATGGTATGCAAAATTCTCGGGCGAATGGGTGCCTAGCCATGGCGAAAAAATCATCCGCAGATTAGAACGCGACCTGTTTCCCTGGATCGGTAAACGCCCTATTGCCGAGATCACCGCACCTGAACTGTTAGCCGTCTTACGCCGCATTGAAAACCGAGGCGCGCTAGATACGGCGCACCGTGCGCATCAAAACTGCGGGCAAGTGTTCCGCTATGCAATCGCCACTGGGCGCGCTGAACGCGATCCTAGCCCCGACTTGCGCGGCGCATTGCCGCCAGCTGGGTATTCTGGATCATCGTGACCGCTGATTCCGGGCTATCGTGACCGGTCATTCCGGCGCATCGTGACCGGCGATTCCGGTCTATCGTGACCGATTTTGCAGGGTTTCCGGAATCAGTGGTCACGATAGCGGAATCATCGGTCACGATAGCGGAATGGTGTCGTACCGCATGGAAATGGTGTTACGCATAGAGCAACCGAACGAGTACGCTTCCAGCCTTTTGTCTGGAGACAGCGTGCCCGTATCAAGGATCACCATGCGTAAAATTAAAGACGTATTGCGTTTGAAACTGGACGCCAGGCTGTCGCACCAGCAGATCGCCGCTGCGCTGGGCATATCGAAGGGAGTCGTCACCAAGTATGTCGGTCTGGCCGCCGCCGCAGGCCTGGATTGGGCTGCCGTGCAAGACATTGACGAAACCACGTTGGGGCGGCGCCTGCTGGTTACCCCCGAGCGACCGCGCGATCATGTTCAGCCGGACTACGGCCGTTTGCATCAAGAGCTGCGGCGCAAAGGCATGACATTGATGTTGCTCTGGGAAGAGTACCGAGCCGACCACGCCGACCGGCAGACCTATGCTTACTCGCAGTTCTGCGACAACTACCGGCGCTTCGCCAGGCAACTCAAGCGCTCCATGCGCCAGGTTCACCGTGCCGGCGAGAAGCTGTTCATTGATTTCGCCGGCCCCACCATCGCGCTGACCGACGGCAGTCGCGCGCACATCTTCGTCGCGGCACTGGGCGCTTCCAGCTATACCTTTGCCTGCGCCACGCCGCGCGAGACCATGACCGACTGGCTGAAATCGACAGCGCGCGCGTTAAGCTTCATCGGCGGCATGCCCCAGATGATCGTGCCCGACAACCCGAAGGCGCTGATTGCGGACGCCAACCGTTACGAGCCGCGCAGCAACGATACCGTGCTCGATTTCGCGCGCCACTATGGGACGTCGGTGTTGCCAGCACGACCCTACCACCCGCAGGACAAAGCCAAAGCAGAATCGGCGGTACAGATCGTCGAACGCTGGATCATGGCGCGCCTGCGCCACCAGCAATTTGCCAGCGTAGATGATGTCAATCAGGCCATCGCACCGCTGCTTGCCAGGCTCAACGAGAAGCCATTCCAGAAGCTGCCCGGCAGTCGCGCCAGTGCATTTGCCGAAATCGGCGCACCCGCCTTGGCTCCGTTGCCGCTGCAAGCTTATGAGATGGCACACTTCAAGACGGTCAAGGTTCACATCGACTATCACGTAGAAGTCGAACGACACCGCTACAGCGTGCCGCATTCATTGGTCGGACAAGTACTTGAAGCACGGATCACAGTGGCAGTGGTCGAGATCCTGCATCGCGGTAACCGCGTGGCCAGCCATGCCCGCAGCAGTCTGGCCGGTGGCTTTACCACCACCGCCGCGCACATGCCGGCGGCGCATCGCGCCCAGATGGAATGGTCGCCACAACGGCTGATCCACTGGGGCCAAAGCATTGGCCCTGCCGCCGCCGAAG
TGGTGACACGGCTACTGAACAAGTACAAGCATCCCGAACATGGCTACCGCGCCTGCCTTGGGCTGCTGTCGCTGGTCAAGCGTTATGGCAAACCCAGACTGGAGGCGGCCTGTACGCTGGCTTTGCAGATCGGCGTCTGCCAGTACCGCCATGTGCGCGACATCCTGAAGAATAACCGCGACGCAGCCGCGCCGCTCAGCACTGAAGAATGGGTCAGCCCCAACCATGTCCACGTGCGCGGTCCTGGCTACTACCAATAAGGAAAGACAACATGATGATGCATACCACGCTGACGCAATTGCGCAGCCTGAAACTGGATGGCCTGGCGACGGGGCTGGAAGAACAACTGGCACAGCCCGGTATGGCTGCACTCAGCTTCGAAGAACGCGTAGCACTGTTGGTGGACCGGGAAGTCCATGCCCGTAATGACCGCAAACTGGCGCGCCTGCTCAAGAACGCTCGCCTGAAATACGGGCAGGCGGCCATCGAGGATATCGACAGCCGCGCAGGACGCGGTATCGACCGGCGCGAGGTGATGAGCCTGGCTTTGGGCGACTGGGTCAACGCCGGCCACAGCATCCTGATTACAGGACCGACCGGCGCCGGTAAATCCTGGCTGGCCTGCGCATTGGCACAATACGTCTGCCGCCGTGGTTACTCAGCCATCTATCAGCGCGTACCCCGCATGCAGGAAGAACTGCGCATCCGGCACGGCAGCGGCACCTTCGGCAAATGGCTGCTGCAACTGGCCAAGACCGACGTATTGGTTCTCGATGACTGGGGCATGGGCGCTATCGACAGCATGACCCGTTCCGACTTGCTGGAGATCATCGACGACCGTGCCGCCAACAAGGCCACCATCATCACCAGTCAGTTGCCGGTGGAGCACTGGCACGCCTGGATAGGCGATGCCACCATCGCCGACGCCATCCTCGACCGCATCATGCAGCGCAACCACCGCTTCACGCTGACCGGCGAGTCGCTGCGAACAGAACAATCAAAAACAAGCAAAAAGGAGGAAAAAACCACCCCATCGTGA
Protein sequences of DBSCAN-SWA_6 >NZ_AP021884|2044160:2053121|2044160_2045429_-|WP_147073207.1|DBSCAN-SWA MLVEFRVKNFRSLRDEQVLSLVASKDKTLQDTHTQATGISAAPTLVRSAVVYGANASGKSNLIKALQYMRGVVTESATAIQPGQTFAVQPFRLDVDSASQPSEFEVTFLLDGVRYQYGFSMTAQRIVSEHLLVYKAFKPQRWFTRRFDTETGKDVYDFGPGLKGPKNLWEGATRPNALFLSMAVQLNSEALRPVFDWFVNRLVIFNEQAQLSPQVSIQMLKQDDGRKEICNFLSAADISIADIDVETRKVPGQAVHFDLVAGKTEVRSEEMEEHKLRFHHVTEQGKAVFELMDESNGTRNLLFLAGPVLDILRKGLTLVIDELDTSLHTLLVRELVRLFHRPEINTGGAQLIFTTHDTSLLDAPDLFRRDQVWFVEKDRDQTSALVSLSEFSPRKNEALERGYLMGRYGGVPFLSHTLGLKH >NZ_AP021884|2044160:2053121|2052353_2053121_+|WP_147074833.1|DBSCAN-SWA MMMHTTLTQLRSLKLDGLATGLEEQLAQPGMAALSFEERVALLVDREVHARNDRKLARLLKNARLKYGQAAIEDIDSRAGRGIDRREVMSLALGDWVNAGHSILITGPTGAGKSWLACALAQYVCRRGYSAIYQRVPRMQEELRIRHGSGTFGKWLLQLAKTDVLVLDDWGMGAIDSMTRSDLLEIIDDRAANKATIITSQLPVEHWHAWIGDATIADAILDRIMQRNHRFTLTGESLRTEQSKTSKKEEKTTPS >NZ_AP021884|2044160:2053121|2050818_2052342_+|WP_147074832.1|transposase|DBSCAN-SWA MPVSRITMRKIKDVLRLKLDARLSHQQIAAALGISKGVVTKYVGLAAAAGLDWAAVQDIDETTLGRRLLVTPERPRDHVQPDYGRLHQELRRKGMTLMLLWEEYRADHADRQTYAYSQFCDNYRRFARQLKRSMRQVHRAGEKLFIDFAGPTIALTDGSRAHIFVAALGASSYTFACATPRETMTDWLKSTARALSFIGGMPQMIVPDNPKALIADANRYEPRSNDTVLDFARHYGTSVLPARPYHPQDKAKAESAVQIVERWIMARLRHQQFASVDDVNQAIAPLLARLNEKPFQKLPGSRASAFAEIGAPALAPLPLQAYEMAHFKTVKVHIDYHVEVERHRYSVPHSLVGQVLEARITVAVVEILHRGNRVASHARSSLAGGFTTTAAHMPAAHRAQMEWSPQRLIHWGQSIGPAAAEVVTRLLNKYKHPEHGYRACLGLLSLVKRYGKPRLEAACTLALQIGVCQYRHVRDILKNNRDAAAPLSTEEWVSPNHVHVRGPGYYQ >NZ_AP021884|2044160:2053121|2048106_2049462_-|WP_147074829.1|DBSCAN-SWA MSEVLKRRMRCAVYTRKSTDEGLDQEYNSIDAQRDAGHAYIASQRAEGWIPVADDYDDPAFSGGNMERPALQRMMADIEAGKIDVVVIYKIDRLTRSLADFSKMVEVFERYGVSFVSVTQQFNTTTSMGRLMLNILLSFAQFEREVTGERIRDKIAASKRKGMWMGGVPPLGYDVENRRLVPNERESKLIRHIFQRFVELGSSTALVKELKLDGVTSKAWTTQDGKTRDGRPIDKGHIYKLLSNRTYLGELRHKDQWYQAEHPPIVSRELWDSVHAILETNGRVRGNKTRAKVPYLLKGIVFGNDGRALSPWHTTKKNGRRYRYYVPQRDAKEHAGASGLPRLPAAELESAVLDQLRAILHAPNLLGNMLPQAIKLDPTLDEAKITVAMTRLDAIWDQLFPAEQTRIVKLLVEKVIVSPNDLEVRLRTNGIERLVLELRPEPVEQTEEALA 
>NZ_AP021884|2044160:2053121|2050160_2050604_+|WP_147074831.1|DBSCAN-SWA MTLSVARNKRYEARKLLSNDVDPAMLKQVTKRASRVSAENSFEAIAREWYAKFSGEWVPSHGEKIIRRLERDLFPWIGKRPIAEITAPELLAVLRRIENRGALDTAHRAHQNCGQVFRYAIATGRAERDPSPDLRGALPPAGYSGSS >NZ_AP021884|2044160:2053121|2046659_2047664_+|WP_147074827.1|DBSCAN-SWA MPESFLHLKPQEQSQIYRALAPQLARTPVVLEKDVWVCWVLQTLFTMPDRLPMAFKGGTSLSKVFGAIARFSEDVDITLDYRGLDGSFDPFAEGVSRNRLKKFSEDLKSFVRGHAHGVVAPHFQKMLADEFDADAFQLEVSDDGEQMRVHYPSVLEAPGDYVGNSVLIEFGGRNITEPNEEREVRPDIAEHVAELDFPRSTVSVLSPTRTFWEKATLIHVECQRDEFRTGAERLSRHWYDLAMLADLAHGQAAVADRALLADVVKHKKVFYNASYANYDACLSGQLRLIPEDAALAALRDDFQRMIGAGMFIGEPPAFDAIVDRLRALETTINQ >NZ_AP021884|2044160:2053121|2046046_2046667_+|WP_147074826.1|DBSCAN-SWA MNTTTKTAELIRERIEAMPIGEPFTPTAFLECGTRASVDQTLSRLVKAGLIERVTRGVFVRPEVSRFVGKVSPSPLKVAETVAKTTGAVVQVHGAEAARRLELTTQVPTQSVFVTSGPSKRIRVGKMEIRLQHVCQRKLALAGRPAGLALAAMWYLGKKEVTPALVEKIRRKLGSSEFEVLKSATSSMPAWMSDAIFRNERMAAHA >NZ_AP021884|2044160:2053121|2047657_2048110_-|WP_147074828.1|DBSCAN-SWA MSDIRIQKTGEPDILQTSDGRLTLSVPIQIKRRSGRKLVTLPNGETAPVRPWDMAPTSIQLALARGHRWLAMLESGEAKSLKEIATREGIDNSYVSRMVNLTTLAPDIVAAILDDALPNHITLFDLAVDPPALWDEQRARIADSTPAGGH >NZ_AP021884|2044160:2053121|2049458_2049965_-|WP_147074830.1|DBSCAN-SWA MSTQTPSFSTPPSVAAQIARLPEMPMAEIRALWQKLVGGDTPTHNRQFLERRIAYRLQELEFRKADANLLDRNQRRIASLVETGKVKKRDRDYRPAAGTVLVREYKGVEYRVIATADGQYDFQGRMYPSLSMIAREITGMRWSGPLFFGLKPPSNAKAKPSTKKRGGR |
9 | Acidithiobacillus_phage(66.67%) | transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2294456 : 2305315
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_AP021884|2294456:2305315|DBSCAN-SWA TCTAACTGCGCTGGTAAATGTCCTCAAAGCGCACAATATCATCCTCCCCCAGATACGCGCCCGACTGCACCTCGATCATGTGCAATGCAATCTTGCCCGCATTTTCCAGCCGGTGCGTACTGCCCAATGGAATGTAGGTGGACTGATTCTCGGTGAGCAGCAGCACTTCGTCATTACGCGTAACGCGTGCCGTGCCGCTCACCACTATCCAGTGCTCGGCACGGTGGTGATGCATTTGCAACGACAGTTTCTCGCCCGGTTTGACCATGATGCGCTTGACCTGAAAACGCTCCCCCGCATCAATACCTTCGTACCAACCCCACGGGCGAAACACACGGGTATGGTTGAGATGCTCGGTACGACGGTGTTGTTTCAGGTGCTCGACCACTTTTTTGACGTCCTGCACGCGGTCTTTGTGCGCCACCATCACGGCATCGCTGGTTTCCACGATCACCAGGTCGGATACGCCAATTACCGCGACCATGCGGCTCTCGGCACGGATCAGATTGTTGCTGGCACCGTCGTTGTAGATGTCACCGCGCATGACATTGCCAGCACGATCCTTGGCACCGATTTCCCACAGTGCCGACCACGAGCCGATATCGCTCCAGCCAATGTCGGCGGGCACCACCACGGCGTTACGGGTGCGTTCCATGACGGCGTAATCGATGGATTCGGAGGGGCAAGCCGTAAATGCCTGGGTATCCAGACGGACAAAGTCCAGATCACGCTGGCTGCTGTCCAGTGCCGCCTGGCTGGCGGCGAGAATATCCGGGCGGTAGCCGCGCAATTCATTGACAAACGCCGCAGCCTTGAACAGAAACATGCCGCTGTTCCAGAAATAATCGCCCGATTGCAGATAGCCCTGGGCGATTTCACGGCTGGGTTTTTCCACAAAACGCCCTACTGCAAACACACCGTCCAAATGGCTGTCCGCCGGGCCGCGCTGGATATAGCCATAGCCGGTTTCAGGCGCCTGTGGCACGATGCCGAAGGTCACCAGCTGACTAGCCTGGGCGGCATCCACGGCCTGGGCCACGGCGCGCTCAAATGCGTCCACATCTGCTATCAAGTGATCGGCCGGCAACAGCAGCATCAGGGCCTCGGCATCCCGCGCCATGAGTGCCAGCGCTGCGACTGCCGCTGCCGGTGCGGTATTGCGCCCGATGGGCTCGAGAAAAATAGTCTCGGGCGTGACCTCAATCGCGCGCATCTGCTCGGCCACCATGAAGCGGTGCTCATGGTTGCATACCAGCGTGGGCGGCGTGATGTCGGCGATACCGGAAAGGCGCAGCACGGTTTCCTGCAGCATGGTGCGCTCCGACACCAGCGGCAACAACTGCTTGGGCAACGCGGCGCGGGACAAGGGCCACAGGCGGGTACCGGACCCCCCGGAGAGAATGACGGGATGAATGCGCATGTTGGTTGTTTCCTCAATCAATTCAGTGAATTCATCAGCCTGCCCAAGCTGGGGAAAGGCATCCATGCGATACAGTCCGGCGCGCTGCATCGAGAGCAAGACATCTACGCTACGCGAACCAATGAAAACCGGCTGGCCAGTCAGCCAAACACATATCATGTCTCCTTTTCGGGCTTGTCCAAACACAGGCCCAACGCCGCATCCCAATCCGGCAATAGCAAACCAAAGCTGCGCAGCAGCTTGTCCCCGGCCAGCACGGAATTAGCCGGGCGCCTGGCCGGGGTGGGGTAATCAGTGCTTGGGATCGGGATCAGCTCAGGACGTTTGGCGCCCGCCTGGGTGGTTTGGTCGAGGATGGCCTGGGCGAATTGATACCAGCTCGCCCGGCCCCGGCTGCTGAGGTGGTAAGTGCCGTGCAGGGCTTCTTTACCCTGGCCAAACGGCTGTTGCGCCAGTATCTGCGCCGTGGCCTCGGCGATCATGCGTGACCAGGTGGGTGCGCC
GTATTGATCGGCAACAATCCGCAACTGCTCGCGCTCCTTGAACAGGCGCTGCATGGTGAGCAGGAAATTGCCAGCGCGCAGGCCATACACCCAGCTGGTGCGCAGAATGAGGTGGGGAATGGCGGCAGCACGGATGGCGTTCTCACCTTCGAGTTTGGTCTGCCCATAGACGCCTAGCGGGTGGGTGGTGTCGTCCTCAGTGTAGGCACCGGGCTTGTTGCCATCGAACACATAATCGGTAGAGTAGTGAATCATCGCCGCGCCGAGTTTTGCCACCTCTTCGGCCATGATGGCGGGAGCAATGGCATTCACCGCACGTGCCAGTTCCGGCTCGGATTCCGCCTTGTCCACGGCGGTATGGGCGGCCGGGTTGACGATCAGGTTCGGGCGCAAGCTACGTATGAGGCTGCGGATGGCACCCGGGTCGGTCAGGTCAAGCTGGCTGCGGGTGGGCGCGCTCACCTTGCCCAAAGTCGCCAATGTGCGGCGCAACTCCCAGCCGACCTGGCCATTGACACCGGTGAGCAGGATATTCACGCAAACACCTCGCACTGCGCCAGCGGCAGGCCGGCGGCGTCCTTGGCGGCAAGTGCGGGTTCGCCGTGCAGTGGCCAGGTGATGCCCAGCGCCGGATCATTCCACAGCAGGCTGCGCTCGAACTGCGGCGCCCAGTAGTCGGTGGTTTTGTAGAGAAACTCTGCGCTGTCGGAGATCACCAGGAAACCATGGGCGAAGCCTTTGGGTATCCAGGCCATGCGTTTGTTTTCGGCGGACAGTTCCATCCCCACCCATTTGCCAAACGTGGGTGAGGATTTGCGCAAGTCCACCGCCACGTCATACACGCTACCACTAATGACGCGTACCAGCTTTCCCTGAGTATTTTGAATCTGGTAATGCAAGCCACGTAATACGCCCTTGGCTGAGCGCGAGTGATTGTCCTGCACGAAGTCGTCGGGGATACCGGCCCCGGTCATGGCGCGACGGTTGTAGCTCTCGTAGAAAAAGCCGCGCGCGTCGCCGAATACTTTGGGTTCGAGCACAAGCACGTCGGGGATTTCAGTGGGGATGATGTTCATGGTTAAAACAACCGTTCGTTGAGCATGGCCAATAAGTATTGGCCATAGGCATTTTTCTGCAAAGGCGCTGCCAGCCGTCCGACCTGGGCGGCATCAATATAGCCCTGGCGGTAGGCGATTTCTTCCGGACAGGAAATTTTGAGACCCTGGCGCTTTTCAATGGTCTGGATGAATAGCGATGCCTCCAGCAGGGATTCATGGGTGCCGGTGTCCAGCCAGGCATGGCCGCGCCCCATGACTTCCACCTGCAATTGCCCCATTTCGAGGTAATGCCGGTTAATGTCGGTGATTTCCAGTTCGCCGCGCGGAGATGGCTTGAGCTTGCGGGCGATATCGATGACCTGGTTATCGTAAAAATACAGGCCAGTGACGGCGTAGCGCGATTTGGGCTGGGCGGGTTTCTCTTCCAGGCTGATGGCGTTGCCTTGAGCGTCGAATTCAACCACGCCATAGCGTTCCGGGTCATGCACCGGGTAGGCAAACACCGATGCGCCAAACTGGCGGGCAGCGGCGGCGCGCAGGCCGCCGGAAAATTCATGACCGTAAAAGATGTTGTCGCCCAGTATGAGAGCGCTCGACGCATCGCCAATGAAATCCGCGCCAATGACAAAGGCTTGCGCCAGACCGTCGGGCGAGGGCTGAACAGCATAGCTGAGGCGAATCCCCCACTGGCTGCCATCACCCAGCAACTGCTCGAAACGCGGGGTATCCTGCGGGGTGGAAATGATGAGGATGTCCCGGATTCCTGCCAGCATCAGCGTGGTAAGCGGGTAGTAGATCATGGGCTTGTCGTACACCGGCAGCAATTGCTTGGAGACTGCCTGGGTCACGGGATATAGCCGGGTACCCGAGCCACCGGCCAGAATAATGCCCCTGCGCGCGCTCATGCCTGGCTCCCATACTGTTTTTCTACCCAGTGGCGGT
ATTCACCGGTGGCGATGTTGGCTACCCAGTCCGGGTTGGCCAGGTACCAGGCAACGGTTTTGCGAATCCCGGTCTCAAACGTCTCTTGTGGGCGCCAGCCCAGTTCGCGCTCGATTTTGTGTGCGTCAATGGCGTAGCGACGATCGTGGCCAGCGCGGTCCTTGACATGAGTGATGAGTTTTTCGTGAGGGGTGACGGGTGAACCCGGGTGCAGCGCGTCAAGCATGGCGCAGATGGTCCTGACCACATCGATATTGGTTTTTTCGTTGCAACCGCCAATGTTGTATACCTCGCCTGCCTTGCCCGCAGCCAGCACGGCGCGGATGGCGCTGCAATGGTCGCCGACATAGAGCCAGTCACGTACGTTAAGTCCGTCGCCGTAGATCGGCAGCGGCTTGCCGGCCACGGCGTTCATCATTACCAGCGGGATGAGCTTTTCCGGGAACTGGTAAGGGCCGTAGTTGTTGGAGCAGTTGGTAGTGAGTACCGGCAGGCCATAGGTGTGGTGATAGGCGCGCACCAGGTGGTCGGAGGCAGCCTTGGAGGCTGAGTACGGGCTGTTGGGTGCGTAGGCAGTGGTCTCGGTAAACGCGGCATCATCCGGGCCGAGGGAACCGTACACCTCATCGGTGGAAACGTGCAGGAAGCGGAATCCGGCCTTCTCTATCCCTTCCATTGCATTCCAGTAAGCGCGGATTTCTTCCAGAAAATGAAAAGTGCCGACGACATTGGTCTGGATAAAGTCTTCGGGGCCGTGAATGGAACGATCGACGTGGCTTTCGGCAGCGAAGTTGATGACGGCGCGCGGGCGATGTTCAGCCAAAAGACCTGCGACCAGGGCGCGGTCGCCAATATCGCCTTGTACGAAGAGATGGCGGGCGTCACCCTCAATACTGGCGAGGTTGTGCAGATTGCCTGCGTAGGTGAGTTTGTCGAGGTTGATGACCGGCTCGTCACTGTCCGCCAGCCAGTCCAATACGAAATTGGCACCGATAAAACCGGCGCCGCCTGTGATTAGAATCATGGTAATTCCCGGTTAGCCTTTATTTTATTTCCGGTAACCCGGCGCTGCTGTTCTCGGACTTGCTCTGTGCCATTTTTCCCAGGCTTGCAAACTCGCCCACGTATTCAATTTTTGCTGCATCCCGTAGTTTTTTGATGTCAGCGGTAGCGGCATCGCGCTGTCGGGCAGCAGCCAGATAACGCTCTATTAACGTATGTGCCTTGTCCAGGGTAATCGGTGCAGATTGTATGGAGGCTATCTGCAATACCGTTATCCCGTTTGCAGACGGTAACGCCAGTAACTGTCCATTCTGCATGGTCTGCATGCGCGGCACAATATTCATCGGCAATTGTTCCGCCGCTTTGACCTCGCTGCCGGTGCGGAACGGTATATTCTGGCTGCGCAGCCAGTCCACAAATTCGCCCAGATCATGGCTGGTTTCAAGACGTGAGTTGAGTACAGCGATCCTGCCCCTGGGGGCAGCGATGGCCAGTTCCTGCAGATTATAGATACGACGCTGACTGAACAACTCGGGATGCCGGTTGTAGTAGGCGCTGATGTCGGCTTCGCCAGGTTTGGCGACAGCCTGCCCTGCACGCTGCAGATAGGCCTGGGACAGCACCTGATTGCGCGCGGCCTCCAGCAATTGCTGCACAGCAGGATCCTGATCCAGTTTTTGCGCAGTGGCTTTTTGTACCAGAAGTTGCTGATCCACCAGTGCCTGCACTACCCGATTTTTAGCTTGCGGCGTCAAATCCTGCGCTGCCAGATTCAAGCGCGACAAAGCCAGGTCCAGTTGTGCGCTAGTGATAGCGGTGCCGTTCACGCTGGCGATGGCTGAAGACGGCGTTTCCTGCTTGCTGCAGCCGGCGAGCACTGTTCCCAGCATCAGCGCCAGCGCCATTTTATGCATAATTGGATGCGATTTCATTCGCGTCATCTCCCTGGATTGGTATGTTTTGTGGCGTGATTCACACCCGGCGTCCATGATGCGCAA
TTATCGGTTGTCTCACAAGCGCCAGCATATAGACCGATCGAATTACAAAATGGTAATGATCATAAAAAATGGCTGGCTCCCTCCGCGGAAGCCAGCCACTACGCTACCTGATGCAGGCAATATCTAGACAGCCACGTTTTCCTCGTCGAATGCCAGTTTGATTGTGCCGGATTCGTCCACGTCCACGACCACATGCCCACCATTGGCCAGACGGCCAAACAGCAATTCATCTGCCAGTGCCCGCCGTATCTCGTCCTGGATCAGCCTGGCCATGGGGCGCGCGCCCATCAACGGGTCAAAACCGCGTTTGCCGAGATGCGCCTTGAGCGCGTCGGTAAACGTCGCCTCGACCTTTTTCTCGTGCAACTGGTCTTCCAGCTGCATCAGGAATTTATCCACCACGCGCAGGATGACCTCCTGGGACAAGGGTGCAAACGAGATCATCGCATCCAGCCGGTTACGGAACTCCGGCGTAAAGGCACGCTTGATGTCTGCCATTTCGTCGCCGGTTTGTTTTTCCTGGGTAAAGCCAATCCCCGACTTGTTGAGCGACTCCGCACCCGCGTTGGTGGTCATCACAATCACCACGTTACGGAAATCCGCCTTGCGCCCATTGTTGTCGGTAAGCGTGCCGTGATCCATCACCTGCAACAGTACGTTGAATACGTCCGGATGCGCTTTTTCGATTTCATCCAGCAACAGCACCGCGTAGGGATGTTTGGTAATCGCCTCGGTCAACAGGCCGCCCTGGTCAAACCCGACATAGCCCGGTGGGGCGCCTATCAACCGCGACACGGCATGGCGTTCCATGTATTCGGACATATCGAAGCGAATCAGCTCAATGCCCATGATATACGCGAGCTGCCGCGCCACTTCGGTCTTGCCCACCCCGGTGGGGCCAGAAAACAGGAAAGAGCCGATAGGCTTCTGCGGATTACCCAAGCCACTGCGTGCCATCTTGATCGCGGCTGCCAGCGCATTGATCGCCTTATCCTGGCCAAACACCACGGTCTTGAGGTCGCGATCGAGGTTTTTCAGCGCATCGCGATCATCACTATTCACATTCTTGGACGGAATCCGCGCAATCTTGGCGATGATCTCTTCGATCTCGCGCTTGCTGATGACTTTCTTCTGCCGTGATTTGGGCAGGATACGCTGCGCCGCGCCGGCTTCGTCGATCACGTCGATTGCCTTGTCCGGCAGGTGGCGGTCATTGATATAGCGTGCCGACAGCTCGGCTGCCGTGGTGAGCGCCGACGCGGTGTATTTGATGCCGTGATGCGCTTCAAAACGCGATTTCAAGCCACGCAGAATCTCGACTGTCTCCTCGATCGAAGGCTCATTCACATCAATCTTCTGAAACCGCCGCGATAACGCATGGTCTTTTTCGAAAATGCCACGATACTCGTTGTAAGTGGTTGCACCGATGCATTTGAGTTGCCCGGATGAAAGCGCCGGTTTAAGCAGGTTGGAAGCATCCAGGGTGCCGCCCGAAGCCGCACCTGCACCAATCAGCGTGTGAATTTCGTCTATGAACAAAATTGCCTGCGGGTTTTCGTATAGCTGCTTCAACACGGCCTTGAGGCGCTGCTCAAAATCACCACGGTATTTGGTGCCTGCCAACAGGGCTCCCATGTCCAGCGAATACACCGTGCTGTCCGATAGAATATCCGGTACCACACCCTCAACGATACGCCGCGCCAGACCTTCAGCGATGGCCGTTTTGCCGACTCCGGCTTCACCCACCAGCAGCGGGTTGTTCTTGCGCCGCCGGCACAATGTCTGGATGACACGCTCCAGTTCCAGCGCACGGCCAATAAGCGGGTCTATTTTTCCCGCCAGCGCCTGCACATTCAGGTTCTGCGTATAGGTTTCCAGCGCCGTTGCGGGTGCAGCTTCCTCACCCGCCTCAGGGGTCGCCTCCGGGCGCGCAGTGCTGCCCTGCGGGACTTTGCTCACCCCATGGGAAATGAAATTCACCACGTCCAGACGCGATACAC
CCTGCTGATTCAGGAAATACACCGCGTGGGAATCCTTCTCGCCAAAAATGGCCACCAGCACGTTCGCGCCGGTCACTTCTTTTTTGCCTGACGACTGCACATGCAAAATGGCGCGTTGAATCACGCGCTGAAATCCGAGCGTAGGCTGGGTATCGACTTCCTCGCTTCCTGCAACGGTAGGGGTGTGTTCGGTGATGAAATCAGCCAGTCCACGACGCAGTTCGTCGGTATTGGTGCCGCACGCGCGCAACACCTCGGCGGCGGACGGATTATCCAGCATCGCCAGCAACAGGTGCTCGACCGTAATAAACTCATGGCGCTTTTGTCTCGCCTCCATGAACGCCATATGTAAACTAACTTCCAATTCCTGCGCAATCATCTAATTTTCCTCCATCACGCATTGCAGCGGATGCTGGTGCTGGCGGGCGAACCCGACCACTTGCTCTACCTTGGTTGCCGCCACATCGCGGGGGAATACGCCACACACTCCCATGCCGTCCCTATGTACTTTGAGCATGATTTGCGTAGCCTGTTCACGGCTCTTGTAAAAAAAGTTCTGAATCACAAGAACCACAAAATCCATGGGCGTGTAGTCGTCATTCAACAACATTACCTTGTACAAAGGCGGCGGCTTGAGTTTTGTTTCGCTTGCTTCCAGAACGGTGTCATCGCGGTGCTTGGTTGCCATGGCGCTGGATAGTTTCCGGATAGTCAGGAACCATTTTGACGACTGGCGCGAAATTTTCAAGTGCTATCCAGTAAAAAAAAATTTGCCTGGCACTTGCCAAATCAACTAGGCAGGCGTAAAAAGCAATCTGGAGTTTGGCGTCAAGGTTCCTGCAAGTCTGGTAATGCCGTTGTGCGATCTTGATTTTCAAACAGCCCCTGGCCGTTTTGGCCTGTTTTTATCAAGGAAGTAGCAATGGCAACTGGCACTGTAAAGTGGTTCAACGATTCTAAAGGCTTTGGGTTTATTACCCCGGACGACGGTAGTGAAGATCTTTTCGCTCACTTCTCCGCCATCAACATGGGTGGTTTCAAAACCCTGAAGGAAGGTCAAAAAGTCCAATTCGAGGTCTCCCAGGGCCCGAAAGGCAAACAGGCTTCGAACATTCAGCCTGCATAAATCGGCTCACCGATTACTTGAAAACGCGGAACCTGGTTCCGCGTTTTTTTCGCCTCGGGTTTTGAGTTCTGGCTTTTCAAAAGCGCTGTGCTACATTGAAATATCTGTAACACATCCATTTAAGGAGAGCAGAATATGCACATTCAACACCAGCCTGATGGTTCCCTGGTCCTGGACATGAGCCAGAAACAGGCGCGAGAACTCGCAAAAACCGTCATCCAGCACGCCGAAGATGCGCATACCGCACTGCTGGATTTTGCCTACCTGCTGAACGAAGCGCATTACGATGCGGAGAACCAGTTCCGGCAACCACCTCATGCCTGGGAACCGGGTGCGCATCAGCCTGGTACAGAATAGGGGGCTACCATGAACATTTCTGCACTCGACAAACAGACTGCCCAGATCAGTGTGTTGCCGACCGAGGCCGCGCATTTGCTGGAGGGCCTCGAAGCCATGCGCGACGAACTCGGTGAAATCGCCGACGAGTTAATCAGTCTGCTGCGCGGCAGTGGCATTGAACCACCACCCAAACCCGATCATGTTCGCACTGAATACGCCGGGCCTGAGTAAACTTACATGCGCGCGATCATCGCCTCGCCAAATGCTGAACAGGACACCTGCGTCGCACCGTCCATAAGGCGAGCAAAATCGTAGGTTACCGTTTTAGCAGCAATTGCACGCTGCATACTCGCGGTGATGATATCAGCGGCTTCCAGCCAGCCAAGGTGGCGCAGCATCATCTCCGCGGAAAGAATAATCGAGCCCGGGTTGACGTAATCCTGGCCCGCATATTTTGGTGCAGTACCATGCGTGGCCTCAAACATGGCGACTGAATCGGACAGATTGGCGCCCGGCGCAATACCAATGCCGCCCA
CCTCAGCCGCGAGCGCGTCCGAGATGTAATCGCCGTTCAGATTAAGCGTCGCAATCACGTCGTACTCATCCGGGCGCAACAGTATCTGTTGCAAGAATGCATCGGCAATCACATCCTTGATGACAATGCCGTTGGGCAGCCTGCACCACGGGCCGCCATCCATCTCCACCGCGCCAAATTCACGCCTGGCCAGTTCATAGCCCCATTTTTTGAAGCCTCCCTCGGTGAACTTCATGATATTGCCCTTGTGTACCAAGGTAACGGACTCGCGGCCATTGTCGATGGCATACTGAATCGCCTTGCGGATCAGGCGCTCGCTGCCCTGCACGGAAACCGGTTTGATACCAATGGCGGAAGTTTCCGGGAAGCGAATTTTCTTCACCCCCATTTCGCCTTGCAGGAAGGCGATGATCTTTTTCACCTCATCCGAGCCAGCTTGCCACTCCACCCCGGCGTAAATATCCTCGGTATTTTCGCGGAAGATCACCATATCCACTTTTTCCGGCGCTTTCACCGGACTGGGCACGCCATCGAAGTAACGCACCGGGCGCAGGCAGACATACAAATCCAGCAACTGGCGCAACGCCACATTCAGGGAGCGCATGCCACCGGAAGTCGGCGTGGTCAACGGCCCCTTGATGGAGACGACGTATTCGCGCACGGCGGCTACCGTTTCATCGGGTAACCAGTTGTCGCCACCATAGACCTTGACGGCCTTTTCGCCCGCATACACTTCCATCCAGGCGATACTGCGCCTGCCACCATATGCCTTGGCCACGGCCGCATCCACCACGCGACGCATCACCGGGGTGATATCCACACCGGTACCATCACCTTCGATGAAGGGAATAACCGGCTGATCGGGGACATTGAGCGAAGCGTCAGTGTTGATCGTGATTTTTTCGCCGTGAGTCGGCAGCTGTATATGCTGGTACAT
Protein sequences of DBSCAN-SWA_7 >NZ_AP021884|2294456:2305315|2296030_2296915_-|WP_147074455.1|DBSCAN-SWA MNILLTGVNGQVGWELRRTLATLGKVSAPTRSQLDLTDPGAIRSLIRSLRPNLIVNPAAHTAVDKAESEPELARAVNAIAPAIMAEEVAKLGAAMIHYSTDYVFDGNKPGAYTEDDTTHPLGVYGQTKLEGENAIRAAAIPHLILRTSWVYGLRAGNFLLTMQRLFKEREQLRIVADQYGAPTWSRMIAEATAQILAQQPFGQGKEALHGTYHLSSRGRASWYQFAQAILDQTTQAGAKRPELIPIPSTDYPTPARRPANSVLAGDKLLRSFGLLLPDWDAALGLCLDKPEKET >NZ_AP021884|2294456:2305315|2300501_2302757_-|WP_147074450.1|protease|DBSCAN-SWA MIAQELEVSLHMAFMEARQKRHEFITVEHLLLAMLDNPSAAEVLRACGTNTDELRRGLADFITEHTPTVAGSEEVDTQPTLGFQRVIQRAILHVQSSGKKEVTGANVLVAIFGEKDSHAVYFLNQQGVSRLDVVNFISHGVSKVPQGSTARPEATPEAGEEAAPATALETYTQNLNVQALAGKIDPLIGRALELERVIQTLCRRRKNNPLLVGEAGVGKTAIAEGLARRIVEGVVPDILSDSTVYSLDMGALLAGTKYRGDFEQRLKAVLKQLYENPQAILFIDEIHTLIGAGAASGGTLDASNLLKPALSSGQLKCIGATTYNEYRGIFEKDHALSRRFQKIDVNEPSIEETVEILRGLKSRFEAHHGIKYTASALTTAAELSARYINDRHLPDKAIDVIDEAGAAQRILPKSRQKKVISKREIEEIIAKIARIPSKNVNSDDRDALKNLDRDLKTVVFGQDKAINALAAAIKMARSGLGNPQKPIGSFLFSGPTGVGKTEVARQLAYIMGIELIRFDMSEYMERHAVSRLIGAPPGYVGFDQGGLLTEAITKHPYAVLLLDEIEKAHPDVFNVLLQVMDHGTLTDNNGRKADFRNVVIVMTTNAGAESLNKSGIGFTQEKQTGDEMADIKRAFTPEFRNRLDAMISFAPLSQEVILRVVDKFLMQLEDQLHEKKVEATFTDALKAHLGKRGFDPLMGARPMARLIQDEIRRALADELLFGRLANGGHVVVDVDESGTIKLAFDEENVAV >NZ_AP021884|2294456:2305315|2299421_2300312_-|WP_161984264.1|DBSCAN-SWA MKSHPIMHKMALALMLGTVLAGCSKQETPSSAIASVNGTAITSAQLDLALSRLNLAAQDLTPQAKNRVVQALVDQQLLVQKATAQKLDQDPAVQQLLEAARNQVLSQAYLQRAGQAVAKPGEADISAYYNRHPELFSQRRIYNLQELAIAAPRGRIAVLNSRLETSHDLGEFVDWLRSQNIPFRTGSEVKAAEQLPMNIVPRMQTMQNGQLLALPSANGITVLQIASIQSAPITLDKAHTLIERYLAAARQRDAATADIKKLRDAAKIEYVGEFASLGKMAQSKSENSSAGLPEIK >NZ_AP021884|2294456:2305315|2303870_2304074_+|WP_147074448.1|DBSCAN-SWA MNISALDKQTAQISVLPTEAAHLLEGLEAMRDELGEIADELISLLRGSGIEPPPKPDHVRTEYAGPE >NZ_AP021884|2294456:2305315|2303639_2303861_+|WP_147074449.1|DBSCAN-SWA MHIQHQPDGSLVLDMSQKQARELAKTVIQHAEDAHTALLDFAYLLNEAHYDAENQFRQPPHAWEPGAHQPGTE >NZ_AP021884|2294456:2305315|2297456_2298341_-|WP_147074453.1|DBSCAN-SWA 
MSARRGIILAGGSGTRLYPVTQAVSKQLLPVYDKPMIYYPLTTLMLAGIRDILIISTPQDTPRFEQLLGDGSQWGIRLSYAVQPSPDGLAQAFVIGADFIGDASSALILGDNIFYGHEFSGGLRAAAARQFGASVFAYPVHDPERYGVVEFDAQGNAISLEEKPAQPKSRYAVTGLYFYDNQVIDIARKLKPSPRGELEITDINRHYLEMGQLQVEVMGRGHAWLDTGTHESLLEASLFIQTIEKRQGLKISCPEEIAYRQGYIDAAQVGRLAAPLQKNAYGQYLLAMLNERLF >NZ_AP021884|2294456:2305315|2296911_2297454_-|WP_147074454.1|DBSCAN-SWA MNIIPTEIPDVLVLEPKVFGDARGFFYESYNRRAMTGAGIPDDFVQDNHSRSAKGVLRGLHYQIQNTQGKLVRVISGSVYDVAVDLRKSSPTFGKWVGMELSAENKRMAWIPKGFAHGFLVISDSAEFLYKTTDYWAPQFERSLLWNDPALGITWPLHGEPALAAKDAAGLPLAQCEVFA >NZ_AP021884|2294456:2305315|2294456_2295875_-|WP_147074479.1|DBSCAN-SWA MRIHPVILSGGSGTRLWPLSRAALPKQLLPLVSERTMLQETVLRLSGIADITPPTLVCNHEHRFMVAEQMRAIEVTPETIFLEPIGRNTAPAAAVAALALMARDAEALMLLLPADHLIADVDAFERAVAQAVDAAQASQLVTFGIVPQAPETGYGYIQRGPADSHLDGVFAVGRFVEKPSREIAQGYLQSGDYFWNSGMFLFKAAAFVNELRGYRPDILAASQAALDSSQRDLDFVRLDTQAFTACPSESIDYAVMERTRNAVVVPADIGWSDIGSWSALWEIGAKDRAGNVMRGDIYNDGASNNLIRAESRMVAVIGVSDLVIVETSDAVMVAHKDRVQDVKKVVEHLKQHRRTEHLNHTRVFRPWGWYEGIDAGERFQVKRIMVKPGEKLSLQMHHHRAEHWIVVSGTARVTRNDEVLLLTENQSTYIPLGSTHRLENAGKIALHMIEVQSGAYLGEDDIVRFEDIYQRS >NZ_AP021884|2294456:2305315|2298337_2299402_-|WP_147074452.1|DBSCAN-SWA MILITGGAGFIGANFVLDWLADSDEPVINLDKLTYAGNLHNLASIEGDARHLFVQGDIGDRALVAGLLAEHRPRAVINFAAESHVDRSIHGPEDFIQTNVVGTFHFLEEIRAYWNAMEGIEKAGFRFLHVSTDEVYGSLGPDDAAFTETTAYAPNSPYSASKAASDHLVRAYHHTYGLPVLTTNCSNNYGPYQFPEKLIPLVMMNAVAGKPLPIYGDGLNVRDWLYVGDHCSAIRAVLAAGKAGEVYNIGGCNEKTNIDVVRTICAMLDALHPGSPVTPHEKLITHVKDRAGHDRRYAIDAHKIERELGWRPQETFETGIRKTVAWYLANPDWVANIATGEYRHWVEKQYGSQA >NZ_AP021884|2294456:2305315|2304076_2305315_-|WP_147074447.1|DBSCAN-SWA 
MYQHIQLPTHGEKITINTDASLNVPDQPVIPFIEGDGTGVDITPVMRRVVDAAVAKAYGGRRSIAWMEVYAGEKAVKVYGGDNWLPDETVAAVREYVVSIKGPLTTPTSGGMRSLNVALRQLLDLYVCLRPVRYFDGVPSPVKAPEKVDMVIFRENTEDIYAGVEWQAGSDEVKKIIAFLQGEMGVKKIRFPETSAIGIKPVSVQGSERLIRKAIQYAIDNGRESVTLVHKGNIMKFTEGGFKKWGYELARREFGAVEMDGGPWCRLPNGIVIKDVIADAFLQQILLRPDEYDVIATLNLNGDYISDALAAEVGGIGIAPGANLSDSVAMFEATHGTAPKYAGQDYVNPGSIILSAEMMLRHLGWLEAADIITASMQRAIAAKTVTYDFARLMDGATQVSCSAFGEAMIARM >NZ_AP021884|2294456:2305315|2303300_2303504_+|WP_124705778.1|DBSCAN-SWA MATGTVKWFNDSKGFGFITPDDGSEDLFAHFSAINMGGFKTLKEGQKVQFEVSQGPKGKQASNIQPA >NZ_AP021884|2294456:2305315|2302757_2303066_-|WP_124705779.1|protease|DBSCAN-SWA MATKHRDDTVLEASETKLKPPPLYKVMLLNDDYTPMDFVVLVIQNFFYKSREQATQIMLKVHRDGMGVCGVFPRDVAATKVEQVVGFARQHQHPLQCVMEEN |
12 | Escherichia_phage(33.33%) | protease | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
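The keyword flagging described in the caption above can be sketched in a few lines of Python. This is a minimal illustration, not the DBSCAN-SWA implementation: the field layout (contig, region, gene coordinates with strand, protein accession, optional keyword, tool name) is an assumption inferred from the protein headers shown on this page, and `parse_header` and `is_phage_like` are hypothetical helper names.

```python
import re

# Keyword list quoted from the caption on this page.
PHAGE_KEYWORDS = ("capsid", "head", "integrase", "plate", "tail", "fiber",
                  "coat", "transposase", "portal", "terminase", "protease",
                  "lysin", "tRNA")


def parse_header(header):
    """Split one protein header into named fields.

    Assumed layout (inferred from this page, not from a specification):
    contig|region_start:region_end|gene_start_gene_end_strand|accession[|keyword]|tool
    """
    fields = header.lstrip(">").strip().split("|")
    if len(fields) not in (5, 6):
        raise ValueError("unexpected number of fields: %r" % header)
    contig, region, gene, accession = fields[:4]
    # Six fields means a keyword sits between the accession and the tool name.
    keyword = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    match = re.fullmatch(r"(\d+)_(\d+)_([+-])", gene)
    if match is None:
        raise ValueError("unexpected gene coordinates: %r" % gene)
    return {
        "contig": contig,
        "region_start": region_start,
        "region_end": region_end,
        "gene_start": int(match.group(1)),
        "gene_end": int(match.group(2)),
        "strand": match.group(3),
        "accession": accession,
        "keyword": keyword,
        "tool": fields[-1],
    }


def is_phage_like(record):
    """Return True when the record carries one of the phage-related keywords."""
    keyword = record["keyword"]
    if keyword is None:
        return False
    return keyword.lower() in {k.lower() for k in PHAGE_KEYWORDS}


if __name__ == "__main__":
    example = (">NZ_AP021884|2044160:2053121|2050818_2052342_+"
               "|WP_147074832.1|transposase|DBSCAN-SWA")
    record = parse_header(example)
    print(record["accession"], record["keyword"], is_phage_like(record))
```

Applied to the headers listed for DBSCAN-SWA_6, only the record tagged `transposase` would be flagged, which agrees with the keyword column of that region's summary row.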
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|