Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_AP021884 | Sulfuriferula plumbiphila strain Gro7 | 4 | csa3,cas3,cas5,cas6e,cas2,DEDDh,DinG,WYL,cas8c,cas7,cas4,cas1 | 0 | 4 | 7 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021884_1 | 1148298-1148383 | Unclear |
I-E
Consensus repeat of NZ_AP021884_1
|
1 spacer
spacers of NZ_AP021884_1
>1.1|1148323|36|NZ_AP021884|CRISPRCasFinder ACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAG |
cas2,cas6e,cas5 |
CRISPR arrays and Neighbor proteins around NZ_AP021884_1
The CRISPR arrays of NZ_AP021884_1 >merge|NZ_AP021884|1|1148298-1148383|CRISPRCasFinder GTGTTCCCCGCACCCGCGGGGATGAACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAGGTGTTCCCCGCACCCGCGGGATGAG >NZ_AP021884|1|1|1148298-1148383|CRISPRCasFinder GTGTTCCCCGCACCCGCGGGGATGA ACCGCAGCCGCAGCCAATCGCCACGCAGCCTGTCAG GTGTTCCCCGCACCCGCGGGATGAG
>NZ_AP021884.1|WP_147070477.1|1147906_1148203_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MLVIVLENVPPRLRGRLAIWLLEIRAGVYVGNYSDKVRDHIWHQVEVGIGEGNAVMAWRTSSEAGFDFVTLGKNRRIPVELDGAKLVSFLPQTDTDAL >NZ_AP021884.1|WP_147070479.1|1146692_1146992_+|type-II-toxin-antitoxin-system-RelE/ParE-family-toxin MRYQVRFASAAADDLQRLFDFLAEQDLAAAERARAVISQAIEVLQIFPFSCRKASPENPFLRELVISFGSYGYVALFEVEDAESVTVLAVRHQREDDYH >NZ_AP021884.1|WP_147070480.1|1146414_1146696_+|prevent-host-death-protein MKNATLPPLRVESELRAAAESVLQEGETLSGFVLEAVRLNIARREAQREFITRGLVAREEAKLSGHYVSSDEMLKRLDASLAKARAKQAVGNR >NZ_AP021884.1|WP_147070482.1|1145722_1146346_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MFLSRVEIPWDAARNPYNLHRQLWHLFPGEDRESRSSDDETRQGFLFRIEENATGRPARLLVQSRRAPTRANGLLLVGTREITPCPSAGQRLAFVLTANPVKTIVDAQRDAKPGKQSEKCRVPFIKEEEQRQWLLRKLGEAGEVEAVSVLPHAPVYFHKGSRAGKLVTATFEGVLRVRDPDRLAALLANGIGPAKAFGCGLLLVRRI >NZ_AP021884.1|WP_147070484.1|1145195_1145732_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MFSREWPLLAESDIKPKTGKLGWTNSPAFSSCTRSWTARRAPITRTDLKARLECSSASLTSAHRRGAYLFDAAFTVAVGSKPGASVTLTQLAAALRQPLYTPSLGRRSCPLARPLLEGELEAEDALAALAKTAPVDGLVYSETQQSDQPLRLRDVPLHGHKRQFGTRLVYLHKDPTCS >NZ_AP021884.1|WP_147070486.1|1142295_1145139_-|aconitate-hydratase-AcnA 
MSTAHNLFNTLSEFTLGNGTPGRFYSLSALEAVGIGKISRLPVSIRIVLEAVLRNCDGRKITEQHIRELANWQPNGPRTEEIPFVVARILLQDFTGVPLLADLAAMRSAAAQAGKNPKVIEPLVPVDMVVDHSVQVDVFNQPDALQKNMELEFIRNRERYQFLKWGMQAFDTFKVVPPGIGIVHQVNLEYLARGVMEKDGVHYPDTLVGTDSHTTMINGLGIVAWGVGGIEAEAGMLGQPVYFLTPDVVGVHLKGQIREGVTATDVVLTVTEMLRKAKVVGKFVEFFGAGAAALSLPDRATIANMAPEYGATMGFFPVDEASCAYYAATGRSAEQVDTIRNYFMAQGLFGIPQAGDCDYSQELEIDLGSVVPSVAGPRRPQDRIELGHVKQAFAGLFAKPVAEGGYGKAAATLAQRVALAPAPAGTDIAGGGVQNSDTLPAGGTDPAVVIEREMVDNRPTPDHLASNAVYTAAQSGTLGHGDVVIAAITSCTNTSNPGVMLAAGLLAKKALEKGLTVPAHVKTSLGPGSRVVTEYLKAAGLLDALGEMGFKLVGYGCTTCIGNSGPLPAAIESAITGNDLIAASVLSGNRNFEARVHQNVKANFLMSPPLVVAYAIAGSMNTDLASEPLGTGRDGAPVYLKDIWPSLDEVAAVMATATNPDTYRKLYADFSADNPLWAAVPAPAGAVYDWDGASTYIRQPPFFDGAAGDSGVIRGARALAVFGDSVTTDHISPAGSIKPASPAGKFLLEHGVDRADFNSYGARRGNHEVMMRGTFANVRIRNLMLPGSEGGVTRHQPDGAEMAIYDAAMQYQAAGTPLMIFAGEEYGTGSSRDWAAKGTRLLGVKAVVAKSFERIHRANLVGMGVLPCQFRDGMGADSLKLDGSETFDLLGLEHGITPQQDITLVIHRADGSADAVAVKLRIDTPIEVDYYQSGGILPFVLAQLLAD >NZ_AP021884.1|WP_147070488.1|1141914_1142286_-|DUF202-domain-containing-protein MSDLNDPRVFFAAERTLLAWNRTCLTLMAFGFVVERFGLFLHMLAPQTPQHLERGISFWVGLGFILLGSLMAVLAVIQYRRVLRTLKPVEIPEGYWVNMAALSTLLLAVLGIVLSAYLTMGLK >NZ_AP021884.1|WP_147070490.1|1139808_1141854_-|methionine--tRNA-ligase MTRKILVTSALPYANGAIHLGHLVEYIQTDIWVRFQKMHGHECYYVCADDTHGTPIMLRAEKEGITPEQLIARVHGEHLRDFTGFHVGFDSYHSTNSGENRELSGTVYLKLREAGLIEQKTIEQYYDPVREMFLPDRFIKGQCPKCGAQDQYGDGCEVCGATYTPTDLINPVSAISGSTPVRRESEHYFFRLGACEAFLREWTRSGALQQEAANKLDEWFAAGLQNWDISRDAPYFGFEIPDAPGKYFYVWLDAPIGYMASFKKLAAEKNLDFDAWWQNDSGAELYHFIGKDILYFHALFWPAMLKNAGYRTPSGVFAHGFLTVNGAKMSKSRGTFITAESYLASGMDPEWLRYYYAAKINGSMEDLDLNLADFIARVNSDLVGKYVNIASRTAGFIARRFDGKLAARLPTSELLAEVQHAATLIGECYETREYGKALREIMRLTDLANQYVNDNKPWELAKQEGSEALLHEVCSVSVNLFRLLTLYLKPVLPRLATEVETFLNIAALAWVDAGTLLTSHSINAYSHLMTRVEQKQVDALVAANQQSLAASADAHSPARHAEAQNHVIAPIADTITADDFARIDLRVAKIVNAEHVEGADKLIRLTLDIGEGKTRNVFAGIKSAYDPEKLIGRMTVMVANLAPRKMKFGVSEGMVLAASGETAGLYILSPDDGAVPGMRVK >NZ_AP021884.1|WP_147070492.1|1138856_1139744_+|LysR-family-transcriptional-regulator 
MDIEQARTFLHVVAIGNFLGAAEKLHVTQSTVSARIQNLERKLGAKLFSRGKQGAALTAAGQRFVRHAQTLVRTADIAKQDVGLPDGYSGGLTVSGRIALWEGFLSRWVAWMRQAAPAISLRLEIGFEQDIMHGLVQNTLDIGLMYTPEARPGLGLERLFDETLVLVTTDRMRPWPDPGYVHVDWGTEFFHQFSLNFPDHPPPALSANVGWLGIQQLLTSGGSAYFPLRMVRTLLAKKRLHRVPGTPHFSVTAHMVYPLSRNDDFLQQALAGLRLLGREERRGQISMDTDNSPNP >NZ_AP021884.1|WP_147070494.1|1137963_1138608_-|maleylacetoacetate-isomerase MQLFSFFSSSTAFRVRIALALKGADYEYQAVNLRAGEQHQQAFLDRNPSGNVPALVDGDFNLGQSLAILDYLDSRYPEPRLIPADTIQRARVLELVNVIACDIHPVNNLRVQLYLKNILGVTEAQKNAWYRHWVAQGLDVVERLLARQEDTPYCFGTHPTLADCCLAPQVWSAARAGCDIAAYPRIDRIYRHCMAQPAFIQAAPEQQADAPQGG >NZ_AP021884.1|WP_147070475.1|1148555_1149455_+|enoyl-CoA-hydratase/isomerase-family-protein MNAVVEFQQFSNASLEQVRIRFDEEYGVMWSFMRPEPRPCFTRTTLQDLLQHHTYLESMKGRVVSNGNFQQTNYLILASDLQGVFNLGGDLAAFGEAIRAQTRKELLSYAKLCIDNVWTFYNLQAPITTISMVQGQAMGGGFEAALSAHVMIAEKSALMGLPEVLFNLFPGMGALSFLSRKIGMRAAEAMVRSGRVYTATELHEMGVVDVLAEDGQGEKTLYDWIRKNHRSLNSFQAIQRARQRVNPLTVEELYEITEIWVDAALRLSERDLRIMERLVRAQNRKVTEPEPVVAEQASA >NZ_AP021884.1|WP_147070472.1|1149459_1151595_-|response-regulator MLDKMKIWFTDRVAKCDRVELEQSLIRLGIGLAILVYLLYRYLTHTTLSHNDIVAFSILSVFLFLTLVLIGSILYSSKPSVVRRLAGAWVDQGGTTLFMAFTGEVGVMVVGVYLWVIFGNGFRFGRKYLIHAQVLSIVGFAITTQVNPYWDEHEAISYSVMLMLLALPIYVSALIRRMNEARQKAEEANAAKTRFVANMSHEIRTPLSGIIGISTLLKATPLNSEQQDLLGTLNSSSRLLVSLLNNVLDFAKIEDGKLAIEHTDFSVNSLLEETVKIFRSQAEAKSIRLDTHIAAAAGTLRGDPHRLQQVLANLVGNAVKFTERGSVTLSLSILGENEHHRNMRFEVADTGVGIPTSAQGKIFESFTQADISTTRRFGGSGLGLTITRHLVEAMGGRLSFESAEGLGSRFWFDLPLEKAVQAQPGSAEIVPLPATRDAGLENTLRILVCEDDATNQKILLRLLELAGHHVSLSANGEELLDQLEQSSFDLVIADLNMAGLSGTDALKLYRFTRADDTRTRFILFTADATLSARQAAKEAGFDAFLSKPVDASTLFGTIANLLGMPSASAEHWLNTVMGGSRSSPPASAETRAVLDAATLRELEILGAGDALFVQRLLRNYLRDSGELLDRIEHAVQQKQYGALRDHCHALKGNSLSIGARGVFGRAETIDRAGPGELRFRGSAMVGLLRTDYAAARAAIEDYLSRRQTAAR >NZ_AP021884.1|WP_147070471.1|1151614_1152046_-|response-regulator MSVRDIRSAPTYRQTVLIIDDQPMVLAIHTAVLKSLSMDLRIVSMTDPKAALEWLRQKPADLIVTDYRMHQMDGIHFVNAVRDSSIEPMRPIIVVTALKDEKIHQQLLAAGVSACLIKPARAAQLSKIARTLLEQSRRQYTTQ 
>NZ_AP021884.1|WP_147070469.1|1152139_1153207_-|response-regulator MTNFNLPDTSAVLILDDQATSRTILAQVVRSIGSGIRVQEETTPSAALAWAAAHPADLVLADYLMPDMNGVEFIGRLRQLPGYQHVPVVMVTIKQDMETRYAALDAGMTDFLTKPVDMRECLSRCRNLLTLRQQQLALEDKSRVLEDMVGQATEEIRCREKDTLMRLARAGEYRDTDTARHLLRMSRYSRVLADAIGLPEDEAELIELAAPLHDIGKIGIPDSILRKNGPLSDEELAIMRQHPKIGHDILEDSPSKYLRLGGEIALAHHERYDGSGYPFGTTGQDIPLSARIVAIADVFDALTSVRPYKSAWSIKSAMQYLLKESGRHFDPALVKAMLTLEASVEKIQEEHAEPG >NZ_AP021884.1|WP_147070467.1|1153469_1154669_+|malate-dehydrogenase MPTLKQQALDYHQFPKPGKLSVESSKPCATQHELSLAYSPGVAEPVRAIGADPELAYRYTNKGNLVAVITDGTAILGLGNLGPLAAKPVMEGKGVLFKRFANIDVFDIEVNAPSVQAFIDTVVNIAPTFGGINLEDIAAPHCFEIEKALSERLDIPVFHDDQHGTAVIICAGLINALHVQGKKLADARIVCLGAGAAGNASLRLLLAMGADKSRLLVVDKVGVLHTGMIDLPPHHAFFAADTDARTLADAMQGADAFIGVSAANLVTPAMIKSMADKPVVFALANPDPEIAPHDVHAARDDAIIATGRSDYPNQVNNILGFPFIFRGALDARAKRITQKMLIAAVHALMDLAREPVPADVLAIYNLTELAFGRDYILPKPFDARLIERIPPAVMKAAKE >NZ_AP021884.1|WP_147070465.1|1154672_1155050_+|succinate-dehydrogenase,-cytochrome-b556-subunit MRHPSRPVYLNIFKIHLPLPGWMSILQRMSGAVLFLVTPLLLYLLQTSFDADGYARLREWLHIPVVKALSTLLLWGYLLHLLGGLRFLLLDIHVGTALATARKLSAATLLASALLTLVIAGIGLW >NZ_AP021884.1|WP_147070463.1|1155043_1155373_+|succinate-dehydrogenase,-hydrophobic-membrane-anchor-protein MVGGALSAWLVQRVSALLLAAYALFFPVWVALHWPLDFAVWRGLFAPLPMRIVTLLFVVALALHAWVGMRDIFMDYVQPLGLRLALHVGALLWLATCVVWAGAVLWSLP >NZ_AP021884.1|WP_147070461.1|1155369_1157133_+|succinate-dehydrogenase-flavoprotein-subunit 
MMPVKRKFDAVIVGGGGAGLRAALQLSGSGLQVAVVSKVFPTRSHTVSAQGGITAALGNVTPDNWHWHMYDTVKGSDYLGDQDAIEFMCRHAAEAVIELEHMGLPFSRLDNGRIYQRAFGGQSMNYGGEQATRTCAAADRTGHALLHTLHQQNLKAHTHFFDEYFALDLLRDADGYVLGVTALCIETGAPLVIEARATLLATGGAGRIFRYSTNAHINTGDGLGMVLRAGLALQDMEFWQFHPTGLPGSGSLITEGVRGEGGYLVNNQGERFMERYAPHAKDLAGRDVVARALALEIHAGRGCGPHGDTIHLKLDHLGAALIKDKLPGIRELALRFAGVDPIDAPIPVVPTAHYMMGGIPTDLHGQVVMPARFGPEEPVPGLYAVGECACVSVHGANRLGGNSLLDLVVFGRAAGNHIIETLRDNPFPRLLPESAAEAALARLARWNKTGAGESVAELRLALQTLMQKHCGVFRTETLMGEGIAALDILQARLDNARLADHSQVFNTARIEALELENLFAVARATLVSAHARTESRGAHAREDYPERDDGHWLKHTLYTRENDQIDTKPVRLKPLTVEPFLPKERIY >NZ_AP021884.1|WP_147070460.1|1157132_1157831_+|succinate-dehydrogenase-iron-sulfur-subunit MRFSIYRYDPEHDTKPHMQAYDVDIEPAGNMLLDALLRIKDTLDSTLTLRRSCREGVCGSDGMNINGSNGLACITPLADLRQPVEVRPLPGLPVIRDLVVDMTPFNQQYRSVEPWLNNADPAPEIERLQSPEQRAQLDGLVECIQCGCCSSACPSFWWNPDKFVGPAGLLAAYRFIADSRDQGANQRLDNLQDPYRLFRCHGIMNCVSVCPKGLNPTAAIGKIKTLLVKRGA >NZ_AP021884.1|WP_161984192.1|1157869_1158085_+|succinate-dehydrogenase-assembly-factor-2 MLELDILLLDFLEQQYPVLPSSQQIAFGALLELGDSELWDMIQTGQSAAQPEQAKIIEWLRTGKQKNESTD |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021884_2 | 1831685-1831793 | Orphan |
NA
Consensus repeat of NZ_AP021884_2
|
1 spacer
spacers of NZ_AP021884_2
>2.1|1831723|33|NZ_AP021884|CRISPRCasFinder TTGCGTTGGATACCTCATCCTCATCATTGCGCT |
CRISPR arrays and Neighbor proteins around NZ_AP021884_2
The CRISPR arrays of NZ_AP021884_2 >merge|NZ_AP021884|2|1831685-1831793|CRISPRCasFinder GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGACTTGCGTTGGATACCTCATCCTCATCATTGCGCTGGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC >NZ_AP021884|2|2|1831685-1831793|CRISPRCasFinder GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC TTGCGTTGGATACCTCATCCTCATCATTGCGCT GGCAGCATCACGCCTATCCAGGCCAGTCCCGGTGCGAC
>NZ_AP021884.1|WP_147074724.1|1830036_1830279_-|HypC/HybG/HupF-family-hydrogenase-formation-chaperone MCLALPARIVEMRKQDIGIVDLGGVRKEVSLALVDDLQVDDYVIVHVGYALSKLDPEEAERTLRIFAEMESMPGNVGVGA >NZ_AP021884.1|WP_147074723.1|1828900_1830040_-|hydrogenase-formation-protein-HypD MKYVDEFRDGELANGLASTIARAADTGRNYSFMEFCGGHTHAISRYGVTDLLPANIQMIHGPGCPVCVLPIGRIDLAIGLALDQGVILCTYGDTLRVPASDGLSLMKAKARGGDVRMIYSTADVLAIARDNPDRDVVFLAIGFETTTPPTALLIEQAKNEGIGNLSVLCNHVLTPSAITHILESPEVREYGTLPLDGFIGPSHVSTVIGTQPYEHFAREYRKPVVISGFEPLDVMQGILMLVRQVNEGRAEVENEFFRAVTRGGNRKAQTLVAKIFELRRTFEWRGLGEVPYSALQIRSEYAAFDAEQRYGLRYAPVADNKACECGAILRGVKKPTDCKIFGTVCTPETPMGSCMVSSEGACAAHYTYGRFKDVEIVAA >NZ_AP021884.1|WP_147074722.1|1827848_1828904_-|hydrogenase-expression/formation-protein-HypE MSTVKPGYTRPLDVRNGRIDLSHGGGGRAMAQLIEELFAAAFDNEYLAQGNDGAVLAMPSAGGRLVMATDAHVVSPLFFPGGDIGCLSVHGTVNDVAVMGARPLWLAASFVLEEGFPLSDLKRIVESMANAAKSAGVSVVTGDTKVVERGKGDGVFITTTGVGVLPKGLDLSGNKATPGDVILLSGTIGDHGMAIMSKRENLAFDAPIESDTAALHGLVADMLASGSGIRVLRDPTRGGLATTLNEIAKQSGVGMQLDESSIPVRPVVDAACEFLGLDPLYIANEGKLVAICAPEDAGGLLAVMRAHPLGRESAIIGTVHADPHHFVQMKTRFGGRRNVDWLSGEQLPRIC >NZ_AP021884.1|WP_147074721.1|1826161_1827847_-|hydrogenase-maturation-protein MRILFLTHSFNSLTQRLYVALTELGHEVSVEFDIADSVTEEAVALYRPDIILAPFLKRAIPASVWRHHTCLVVHPGIVGDRGPSALDWAVQNAETEWGVTVLQANAVMDGGDIWANEIFPMRLAKKSSLYRNEVTEAATRAVTTAIERYAQRDFVPCPLEKWSNVAGQERPVMWQEDRRINWLRDDTQTILRRIHAADGFPGVRDSLFDHACFLFDAHAAPDYSGAPGTILGWQGTSLVRATVDGAIRIGHVRRPESAHPFKLPALVAFAAEQASIPVLCEADGESIRYEEQDGVGYLYFDFYNGAMSTAQCRELLAAYRQACSRPTRVIVLMGGDDFWSNGIHLNLIEASEHPAEESWENIQAMDDLAEAIITATSHITVSALANNAGAGGAFLALAADHVWARPSVLLNLHYKNMGNLYGSEFWTYTLPRRVGLEKTSRIVENRLPMSARQAARLGIVDACFGTDAAMFRREVKQRATAISRSPDYDVLRKTKTEARDRDESEKPLLRYRESELSEMHRNFFGFDPSYHYARRYFVHKTLPAWTPRHLCKHRGMVQGNH >NZ_AP021884.1|WP_147074720.1|1824193_1825636_-|sigma-54-interacting-transcriptional-regulator 
MSLPTVLIVDDEIRSLEALRRTLEEDFTVFTASNVDAALEILRQEFIQIIVCDQRMPVQSGVTFLKHVRADWPDVVRIMLSGYTDTEDIIAGINEAGIFQYLLKPWQPEQLMLVLRSAADVYRLQLENQRLSLELRDSPALLAERVANKRQHVREKFSLDRVARAPDSPLNATCEMIDRIAPYDISVLITGESGTGKELLAHALHYRSGRAAQAFVTQNCGALPDALLEAELFGYKRGAFTGAYSDRVGLFQQADGGTIFLDEIGETTPSFQVKLLRVLQEGEIRPLGSPRSVQVNVRVIAATNRDLEEEVRAGRLRQDLYYRIANLTMHLPPLRERPMDIPLIAEGLLQRAMRQLNRKVRGFTPETLDCFKAYRWPGNVRELQNEILRILALTDSEWLEARLLSPKVLRAAMEESEEQQLDLLAGLDGSLKDRMEQLEARLIRETLIRHRWNKTHAAQELGLSRVGLRSKLVRYGMDKT >NZ_AP021884.1|WP_147074751.1|1823172_1824168_-|HupU-protein MNLIWLQSGGCGGCTMSLLSADVRDLFGMLKDAGINIVWHPGLSEQTGSEAIEVLEACASGDLPLDILCVEGSLLRGPNGTGRFHVLSGTGKPMIEWVRQLAEKAQYTIAVGTCATYGGVTAGGCNPTDACGMQYDGASRGGLLGVDYLSQSGLPVINIAGCPTHPGWVLETLLALAMDSFTQADLDELGRPRFYADHLVHHGCARNEYYEFKASAEKPSDQGCMMENMGCKGTQAHADCNIRPWNGGGSCTDGGYACIGCTEPGFEEPGHPFTQTPKIAGIPIGLPTDMPKAWFVALATLSKSATPKRVKENATSDHLVIVPGIRKTGVK >NZ_AP021884.1|WP_147074719.1|1821718_1823176_-|nickel-dependent-hydrogenase-large-subunit MSRLVVGPFNRVEGDLEVTLDISGGRVDRAYVDSTLYRGFEQILRGKDPMDALVFVPRICGICSVTQSVAAANALRNAMGISIPRNGQLATNLILANENLTDHFTHFYLFFMPDFARDGYSGRPWHGMAEQRFKAVTGSAAGDALPARAAFLHMMGVLAGKWPHTLTLQPGGSSRAVSSTEKIRLYAMLREFRAYLEKIMFGDKLENIVKLDSMRALEAWRDARPPDASDFRLFLEVARDLELHRIGRATDIFLSYGSYEMAGEYLFSPGVWDASKGTLSAIDPSDIVEDLSHSRMTGERDARHPYQGETQPAPDKPDAYTWCKAPRWRGQVLECGALARQVVTGHPLIRDMVEKTGGNVTTRVVARLLEISRVIPAMESWIKSLSPGEPFCVQGRMPDNAKGVGMVEAARGALGHWLVVKEGKIANYQIVAPTTWNFSPRDRDGIPGALEQALVGVPVGEHERVPLAVQHVVRSFDPCMVCTVH >NZ_AP021884.1|WP_147074718.1|1821238_1821706_-|nickel-responsive-transcriptional-regulator-NikR MERFTISLDEDLAQEFDRLILARGYSNRSEAVRDMLRAELEKSRQVRYEGTHCIAALSYVYNHHERELAERLTALQHDHHDLTVSTLHAHLDHDNCIECVVLRGKTAEVRDFAGKLIAERGVRHGNLSVITVSQEQHKHRHGLFARSHIHYKPHN >NZ_AP021884.1|WP_147074717.1|1819887_1821237_-|PAS-domain-containing-protein 
MFSKTGLLSLPDMPIEGVGEQFWMEVIRKMDEVYSDLLKYQTALEEQNNKLEESQQFIFGVLAAMSDILVVCDQTGTIEDVNQSLIELTGKTSAEWRGHPLVELFADDISRKQAELKFNGLQGQAIHDCEMQIRMANGSSMPVSVNCTARFNKKGKSVGMVITGRPVGELRRAYHALQEAHEALKRTQQQLVHSEKMASLGRLVAGVAHELNNPISFVLGNVHVLERYAGRLKEYLDAVHAGRSGIELAELREKLKIDRILGDIRPLIEGTIEGAERTRDIVDGLKRFSAIDREEECEFNLVEIIQRAVHWVTNITSESFQVEMDLPHFIPVLGSAAQIQQVIMNLVQNAVDATAEVKSPRLRIQAKIEKDKAVVEFRDNGSGILPENFPKIFDPFFTTKPVGKGTGLGLAISYGIVERHNGALFAANDAHDGGTIFVLNLPLYQSAKN >NZ_AP021884.1|WP_147074716.1|1818851_1819808_+|HTH-type-transcriptional-regulator-CysB MKIQQLRYLHEVARQGLNVSLAAEKLHTSQPGVSKQIQLLEEELGVDILVRHGKRVTGITEPGQKILAITERILREAENLKRVGADFTNETHGSLSIATTHTQARYALPSVIKTFSERYPGVQLRLHQGNPAQIVEMVLSGEADIAIATEAIALHDELVTLPCYQWNRCVIVQPDHPLLGEPTLTLERIADYSIITYDFAFAGRSQINKAFMERNLSPNVVLTAIDADVIKTYVGIGLGIGIMASMAFDPGRDQNLRAIDASHLFEPSTTRIGIRQGTYLRGYTFEFIQMFAPHLNHEAVNMAISAACRSAHQEAPKI >NZ_AP021884.1|WP_147074726.1|1832692_1833727_-|hydrogenase-nickel-incorporation-protein-HypB MCTTCGCSAGETRIEGQAMDGHSHVHADGTVHDHRHEAPAADGKMQYHAHHDENAHGHRHADGTWHSHDHGHEGEHVHEHGEDVIDYGQGPAHAHAPGLTQSQMVRIEQDILGKNNAYAGRNRNYFDEHGIFALNLVSSPGSGKTTLLVRTIETLKSRIQVAVVEGDQQTSQDAERIRSTGVRALQINTGKGCHLDANMVGHALERLHPEDDSVLMIENVGNLVCPAAYDLGEAHKVVILSVTEGEDKPLKYPDMFRAASLMLLNKTDLLPYVPFNVQLAIEYAKQVNPGLHIIQTSSTNGDGYEAWLGWIETGLARQRKKRAQTVAVLQKRIQELEAHLAARG >NZ_AP021884.1|WP_147074727.1|1833787_1834129_-|hydrogenase-maturation-nickel-metallochaperone-HypA MHEMSLAEGVLQILEDTATHHGFQQIKRVRLEIGELACVEVESLRFCLDVVVRGSVAENTMLDIVQTPGGGWCMNCSDTVPISALFSACPRCGSYQVQPTHGTEMRVLELEGV >NZ_AP021884.1|WP_147074728.1|1834121_1835225_-|nickel-dependent-hydrogenase-large-subunit MSLAGKLTFSVGWDGYRVTSVEVRSSRPQAACLLEGKTVEEAMRLVPLLFGICGKAQTVAARSAAQAAQNLCGDKQLMLRQRRLVALEAAQEHLWRLLVDWPNRLGLPAKQGLMMEWVKRISISRGDDDVLALGEAMLTMIEQDVLDESLDCWAATLERAERTPMRGLAGASLEMLRGLEPLHSGHPVFGHFLPRQAACLWGNELQPYLDGHFAVRPLWRNAPAEAGALALHHQIPLLAELLRTGHAASARYLARLVDWVSCVRLLRGEASSTELRLDACKLGKNAGLACVDTARGLLLHYIEVALGQIVRYVIVAPTEWNFHPAGPFVQTLRSLRADDAASLYQRINILILAFDPCVEYEVNLHHA 
>NZ_AP021884.1|WP_161984236.1|1835224_1835764_-|[NiFe]-hydrogenase-assembly-chaperone-HybE MKLMNPRPLENPSRMIESVFDGIARHRMAGLPILNPSLHVEAVGFRLWEGLWLGILITPWTINLMLLPADNPDYAALGLGETRRWRFPSGQYDFMGGEEPGLGSYQACSLFSPVFEFASQEDAVATARAALEQLLLEDLEAAVKREKAQWDQARFSDAPLAEQALSRRGFLRGAFLRDP >NZ_AP021884.1|WP_147074730.1|1835770_1835986_-|rubredoxin MDTFEGSYLGHDDRIDESVRLECGICWLVYDPEVGDPYWHIPPGTPFSRLPEHWTCPNCDAPRHKFMVLKD >NZ_AP021884.1|WP_161984237.1|1836001_1836784_-|hydrogenase-expression/formation-protein MNMPKGMAVFNPPSVPDDVAPELRDQAANLIRQLLAQMRAYRFGATSYPKIDLLKYDPRVVPLINDILGQGEVSIIAHQPTALRAQETVFASVWRVCYPGADGVLERDYLEVCPIPAVVAEIALAPTLKQISPPPPPAGAMNSPALLHEILDVVSTYQAGNPAHIINLTLLPLTPDDLAYLVQALGPGSVSILSRGYGNCRITSSGLANVWWVQYFNSSDQLILNTIEVVEVPEVALAAEEDFSDSIERVEEWLGTMLAA >NZ_AP021884.1|WP_147074732.1|1836869_1837361_-|hydrogenase MSEGVLDYALVAQEKVATEQNALGMLITRLCEQHQFVLVDEGNLEALTQASGDMVLLLTEDVVRSPETWDVAIVLPEILKLFGGRLKAAIADTENSKKLQARFGTTRFPAMVFLRDGEYVDVIQRMLDWDEFVAEVTGVLEKPIGRAPTIGIPVRNEVASSCH >NZ_AP021884.1|WP_147074733.1|1837357_1837666_-|HypC/HybG/HupF-family-hydrogenase-formation-chaperone MCLGIPMQVIEAEESYAVCRGRDGNLARIDTMLVGSVQSGQWLMTFLGGAREILNEQQAEQVNSALNALAAVSRGASDVDVHFADLVGREPQLPDFLRKGGQ >NZ_AP021884.1|WP_147074734.1|1837670_1838294_-|HyaD/HybD-family-hydrogenase-maturation-endopeptidase MVEGSQFDTLILGIGNVLWADEGFGVRCVEAMNATYAFPDNVRVMDGGTQGLYLLPYVEAARRLVIFDAVDYGLEPGTLKLVENAEVPKFMGAKKMSLHQTGFQEVLACADLVDHLPEEMVLIGVQPEELEDYGGSLRPRIKARIPEVLEIAVERLVGWGIPVVARGTGETMRTESGILDIQRYEMERPTEEQACRLGDIRFLATGV >NZ_AP021884.1|WP_147074735.1|1838453_1838801_-|HigA-family-addiction-module-antidote-protein MVKTFLPSGLGFGAGALDPRFFSFSLRAPIAPGRFLESRFLHPLGLSQDRLARELGISRRRVNELIRGKRAITPDTAIRLGLFFGTGPVLWLTLQQAWDIHQEWRNFRRRSKAHG |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021884_3 | 2644962-2646084 | TypeI |
I-C
Consensus repeat of NZ_AP021884_3
|
15 spacers
spacers of NZ_AP021884_3
>3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG >3.2|2645072|37|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG >3.3|2645146|37|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG >3.4|2645220|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG >3.5|2645293|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG >3.6|2645366|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG >3.7|2645438|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT >3.8|2645510|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG >3.9|2645581|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG >3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC >3.11|2645725|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC >3.12|2645798|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA >3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA >3.14|2645942|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG >3.15|2646013|35|NZ_AP021884|CRISPRCasFinder,CRT GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC |
cas2,cas1,cas4,cas7,cas8c,cas5 |
CRISPR arrays and Neighbor proteins around NZ_AP021884_3
The CRISPR arrays of NZ_AP021884_3 >merge|NZ_AP021884|3|2644962-2646084|PILER-CR,CRISPRCasFinder,CRT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACTCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACCGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACATACCGGTAGCGTCGGCAATACCCTGACCGCAGCGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACACGACGCAGGTTATAGAGCGTTGCACGGCAAAATTGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACCTAACCTGCCTTACACAGCCAGCCGCTACGATGAGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGCGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACTCGGTCATGGGTGCCAGTTACACTATCCCGATGGACGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGAGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAACAATTTCTTGCTGGATAAAATCAAGCCGCTTAGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACAGCATGATCCAAATGGATGCCTTGCGGTAGCTTGGCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAACGCCATGGGTAGCACCGATTAGCACCTTGCCAAAGCGCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC >NZ_AP021884|3|1|2644962-2646012|PILER-CR GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC 
ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC >NZ_AP021884|3|3|2644962-2646084|CRISPRCasFinder GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC >NZ_AP021884|3|1|2644962-2646084|CRT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATGGTGCGATCCTGTTGTTGCTGGTTGTGCTGCGGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCCATGGTGGGATTCATGATCCAGTGGGCGATTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GTTTTCTGGTGGCAAGGATTCGGCTGTCATGCTGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CGGCCGCTGTGGTCGCGCCCGACATCCTCGCCGCGG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACTGGCGGAGGAGATCGAAATGCAAAAAGCCCGGG 
GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ATACCGGTAGCGTCGGCAATACCCTGACCGCAGCG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACGACGCAGGTTATAGAGCGTTGCACGGCAAAATT GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GGCCCATGGCAGCTTAAGGTCGGGTATCCCGCTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC CTAACCTGCCTTACACAGCCAGCCGCTACGATGAG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC ACCGGTTCATCGCCGTGCCGCCTGTCCATCGCCGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC TCGGTCATGGGTGCCAGTTACACTATCCCGATGGAC GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCATAATGGGACGTTCCGTCAATCTGCGAAGCGCGA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AACAATTTCTTGCTGGATAAAATCAAGCCGCTTA GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC AGCATGATCCAAATGGATGCCTTGCGGTAGCTTG GCATCGCCCGGCCTCGCGGCCGGGCGCGGATTGAAAC GCCATGGGTAGCACCGATTAGCACCTTGCCAAAGC GCATCGCCCGGCCTCGCGGCCGGGCGCGGTCAGGTAC
>NZ_AP021884.1|WP_147072286.1|2644492_2644783_+|CRISPR-associated-endonuclease-Cas2 MLIIVTYDVSTETAAGRKRLRRVAKACEKMGQRVQKSVFECTVNEMQFEQLERTLLAEIDETQDNLRFYRITEPVEVRVKQHGCFRSVDFEGPLIA >NZ_AP021884.1|WP_147072284.1|2643449_2644484_+|type-I-C-CRISPR-associated-endonuclease-Cas1 MHTIQNTLYVMTPHAYAHLENATLRIDVEREKKLQVPLHHLGGVVCFGNVMVSPALMHRLADEGKSLVLLDDSGRFKARLEGPVSGNILLRQAHHSKASEPAFALGVARAVVAGKLKNSRTNLQRGAREAADPDEAATLTRSADNLAASLRAAAVANTMDELRGVEGEAARGYFAALNLIVKPLARPSFALNGRSRRPPLDRFNALLSFLYAMLMNDCRSAVEAAGLDAQLGFLHAVRPGRAALALDLQEEFRSILADRLALTLINRGQINAADFDEREGGAVMLGDKGRRTVVTAWQERKQEEITHPLTENKIPIGLLPFIQARFIARTIRGEMEGYLPYQAK >NZ_AP021884.1|WP_147072282.1|2643084_2643402_-|ribbon-helix-helix-protein,-CopG-family MATLTLRLPDNLDRQLTALAAQTHQNRSELARTALEKFLRELEQEQLLAEMVEAARFLATNPEARAESIAIAEEFLPLDNEALDIAEGRKPGDPWPEELGEKWWK >NZ_AP021884.1|WP_147072279.1|2642722_2643097_-|type-II-toxin-antitoxin-system-PemK/MazF-family-toxin MVEIMRRGEIWLARLNPNTGAEAGKVRPVLILLNDALLATGMSPVLCIPLTSKLYKNLAGLRIAIAPRGLLLKPCYAMPEQARALDRNRFGEGSLATLTNAEMAQVEKLFIAACGMAQYLIPQH >NZ_AP021884.1|WP_147072277.1|2642112_2642739_+|CRISPR-associated-protein-Cas4 MANSADEIVALSALQHWIYCPRQCGLIHLEQAFEDNVHTARGQAVHHLVDTPGYEIKSGVRVERALPVWCDRLNLIGKADLVEFHPDDSVYPVEFKHGAKRQKLHDDIQLAAQAICLEEMLNRPVPKGAIFHATSHRRREVSITPELKQLVEETANAIRAMLASGKLPPPVNDARCRECSLKEICQPEALAERGRLERLREELFSAAG >NZ_AP021884.1|WP_147072275.1|2641869_2642112_+|type-II-toxin-antitoxin-system-HicA-family-toxin MRVPRDLSGADLVKRLERMGYCVTRQTGSHMRLTSTVRGEHHITIPNHDPLRLGTLASILASVAAHHGLTRDELIQRLFD >NZ_AP021884.1|WP_124705901.1|2641666_2641873_+|2-oxoisovalerate-dehydrogenase MSEIHFIVEEAPEGGYVARAVGVDIVTEADDLPSLHAQVRDAVHCHFDEGKLPGLIRLHITREEVLTA >NZ_AP021884.1|WP_147072272.1|2640483_2641584_+|type-I-C-CRISPR-associated-protein-Cas7/Csd2 
MSIHNRYDFVLLFDVKDGNPNGDPDAGNLPRLDTETGQGLITDVSIKRKIRNFVGITKCKEDGTYETGFDIYIKEKAVLGRAHFAAFEKLGISLGQDATELIPDDLAEQFEALTLPEGMEIDTDEEGRSILNLSGATLDKKEAQKWLKDINPAKPLKNFISKVLKNVTARKPKQEESEKGRVQMCQDFYDIRTFGAVLSLKTAPNCGQVRGPVQITFARSIDPIVTLEHSITRCAVATEAEAEKQGGDNRTMGRKFTVPYGLYRTHGFVSAHLAGQTKFDESDLELLWEALKNMFEHDHSAARGEMATRGLYVFKHESHLGNEAAHKLFDRIKVNKTKDVPRGFEDYEVSVDETEMPSGVALLQKC >NZ_AP021884.1|WP_147072270.1|2638681_2640466_+|type-I-C-CRISPR-associated-protein-Cas8c/Csd1 MILQSLHEYYGRKRDSLPGDGIERKELPFLFVLKPDGAFLHIEDTRQGEGKRKRGNAFLVPQGVKKSVNVAANLLWGNVEYVIGQPDSKKLEEQRKKGKEKHYRERLGDMCSAFRTEIEQLPSEVKSTPEVAAVLAFLSSGNFTHVLADPLWPQVSATGANVSFKLTGAESPVCSASGILASVGQSTEDKGETRICLITGNSDVVERLHPPIKGVWGAQTSGANIVSFNLSAFNSFAREQGSNAPVGKRAAFAYTTALNHLLVSKQRIQIGDASTVFWAAEDNKMESLLSQFFDEPPQDNPDQGTNAVKELLEATLAGTPAIYDDGTRFYVLGLAPNAARIAVRFWHVATVGDLAGHIRQHFEDLEIVRPQYVERPFLSLKALLLAVSPLGDLDKLPPKLAGDFMKAILDGTSYPQTLLQAALRRIHAEQAKKDEKTGKHRDHVPYARAALIKAWLNRQTRNANPDQERKITMSLDESNINSGYRLGRLFAVLEKVQAEANPGLNTTIRDSYFGSASSTPSAVFPTLMRRNQHHMTKLRKEKPGLYVTRDKLIQTICNDGIDGQLGFRPILSLADQGRFVIGYYQQRQDLFTKS >NZ_AP021884.1|WP_147072268.1|2637986_2638685_+|type-I-C-CRISPR-associated-protein-Cas5 MPKTLCLKVWGDFACFTRPEMKVERVSYDVITPSAARAVFEAILWKPEIRWTVTKIEVLKPIKWISVRRNEIGKVASADNGQGDRGLYIEEHRQQRAGLFLRDVAYRLHAQFEVVDGSKHVHHYPELRGRFPAEPEESQPEHPAKYLSMFQRRAKKGQCFWQPYLGCREFSAHFELVDDAAAASLAEPPISDSPSLGWMLHDIDFADAMRPGFFRAEMKSGIIDLEDVEVRR >NZ_AP021884.1|WP_147072288.1|2647299_2647656_-|DUF2934-domain-containing-protein MAESKAKSKASGKPVSAVAETKPKAKTAQPAAGKAAAQSAVAAKPKVAKPKVAAPGANEPAAKRSVKLSNPAVSAEQRYRMIAEAAYYIAERRNFAPGDAAADWAQAEVQIVALLNKK >NZ_AP021884.1|WP_147072290.1|2647849_2649325_-|metalloprotease-TldD 
MTTSTLTGVLIPNPEMLFQTAHETLLVPNQLEASQLDGVFGRLMDHHVDYADLYFQYTRSEGWSLEEGQVKSGSFNIEQGVGVRAVSGEKTAFAYSDDISQPALLAAAEATRAIARSGAVRKPHAVARGGGHALYQPLDPLTTLKDAEKVALLEKLERYARAIDSRVTQVMASLASEYDVVLIARSDGHQAADVRPLVRLSLQVITEQDGRREQGSAGGGGRFGYDYFSDAMLKKYAEQAVHQALTNLAARPAPAGSMTVVLGAGWPGILLHEAIGHGLEGDFNRKGSSAFAGRVGERVAATGVTVVDDGTLMNRRGSLNVDDEGNLTQCTTLIENGVLKGYMQDTLNARLMGVPITGNARRESFAHIPMPRMTNTYMLNGDKDPEEIIASVKHGLYAVNFGGGQVDITSGKFVFSAAEAYMIEDGKITYPVKGATLIGNGPDVLTRVSMIGNDMALDPGVGTCGKEGQSVPVGVGQPTLRIDGLTVGGTA >NZ_AP021884.1|WP_147072292.1|2649382_2650324_-|carbon-nitrogen-hydrolase-family-protein MEKIVSDKTSKSPSSFAKPKSRTTVAKPARAPAPGVIRMAAIQMASGPNVSANLAEAERLVALAVAGGAKLVVLPEFFAIMGNKDTDKVAAREEEGKGPIQKFLASAAKKHKIWLVGGSVPLACDNPKKVRNSCLVYDDKGKLVARYDKIHLFGLDLGVEHYQEEKTIEPGDQIVVLDSPFGRIGLSVCYDLRFPELYRAMPNVDIILVPSAFTATTGKAHFETLVRARAIENLAYVIAPAQGGYHLSGRETHGDTMIVDPWGVVLDRLPRGSGVVMAGINPAYQASLRKSLPALKHRTLDCSHIQIKDKAIK >NZ_AP021884.1|WP_147072295.1|2650366_2654170_-|TIGR02099-family-protein MIAFSRRWIRRSVDYVVLPLALVVVVLVLLLRLWILPDIDRWRDDIAASISHSAGQRVTLGEINANWQGLHPHLRIRDIRVFGADGRPVLFLADVRATLSWTSLLHGELRLAVLTMDDVALTIRRDMQGIHVAGILLNQSDSSGGFGDWLLAQRHIQVNHATLAWNDERRGAPYLVARDVNLTLQNRGHRHRFRLTAIPPEQLAQPLDIRGDFSGRSLDDLASWHGQVYARVDRTDLGQWRQWLTLPYAISQGYGGLRMWLDVASRQVIAATVDASLRQVSVRFAADLPVLRLADVSGRGLWKRLGPAQSFAVKQLSLRTANFVYVAPFDLTLRLDPANAIQPGSGRIDTNSVQLDRLAALAPYLPLDAVQRRRLADLQPRGQLEKFTLAWSGNADQPLDYQIKGRFTRLGWQAQGNLPGAAGLSGNIDATRSNGTLALTSSGVMLALPRVLFEPDVALTTLTARMNWRATQAGYLIKLTEASFANPDLAGSAFGEYQLQAGRRGVIDLTGRLSRANVASAYHYLPLVVKDPTYQWVRSALLAGQGGAASIRLQGDLSRFPFRKAGDGVFEISTPISNGVLQYAAGWPRIEGIQAQLKFTGTRMEISSDAATIYGAALRRVSAVIPDLVDPDEILEVKGEAAGPLAELVRFANTSPLAAKLDNVTDNLRTTGNSRLGLDLKLPLRRAHHATLVGDIRFLGNTLIPAHGLPTLENVQGRLSFTDTGISAQSISARLLGGAATLSAVTQPGGVTRLLVDGRMTAAGLRPYLGTALAGHLSGMADWHARVDLHQMQAQADFESNLVGMASDLPPPFAKAAADSQPLRVKKSLRGADESLLAIHYGQVASALLLQKQKDGEPVIERGTLRFGGEAVLPEESGLWITGSLLLSDLDLWRNELTAAGNGAIGLPPLAGVNLSFRTLDLFGRRFQDININARNQAGTWRANVAGRGVNGDVTWQAADSRAGQPQDRLGAHFKTLAIPAALPVQGVKSSPSGSLPALDISVDNLQLGNRPLGRLSVSATPLDSGLNFESI
RLTQPDSTLTMQGIWNPDRIPQTRAKIHLEVNDVGRFLARFDHPGLVKRGQATLDGEGEWNGTPADIAIPSLSGTFALKASSGQFAKVDPGIGKLLGVLSLQALPRRIGLDFRDVFSDGFAFDEISGTMRLSRGVVYSDDFRMQGPSAKVRMSGMVDINAETQQLRVAVSPKLSESVALAGTLIGGPFVGLGALAVQKLLKDPFGQAATFEYSVTGAWTDPVVKRVARIAGGGEP >NZ_AP021884.1|WP_147072299.1|2654382_2656353_-|acetate--CoA-ligase MANIESVLQETRVFPPSAAFQAQANVSGMASHQALTARAAADYEGFWADMARAGISWKKDFSKILDESNAPFYKWFYDGELNVSYNCLDRHLPEKADKTALIFEADDGAVRRVTYQALYNQVCAFANGLKSRGVQKGDRVIIYMPMGVEAVVAMQACARIGAIHSVVFGGFSAKSLHERIRDAGARLVVTADGSIRGGKMLPLKSAVDAAIALGDCECVEAVVVYRRSGDDTAWNAARDIWWHDLVNGMAQTCEPEWVNAEHPLFILYTSGSTGHPKGVQHSSGGYLLGAILSMQWVFDARPDTDVFWCTADVGWITGHSYVVYGPLALGMTEVIFEGVPTYPDAGRFWKMIQDHQVTTFYTAPTAIRSLIKLGSDLPRQYDLSSLRLLGTVGEPINPEAWMWYYEAVGQSRCPIADTWWQTETGSHMIAPLPGAVATKPGSCTLPLPGIMADVVDEHGGSVPLGQGGYLVIKRPFPSLLRSLWGDPERFRKTYFPAELGGKTYLAGDSAHRDADGYYWIMGRIDDVLNVSGHRLGTMEIESALAANPRVAEAAVVGKPHDIKGEAVVAFVVLKGARASGDEAKKIVAELRDWVGKEIGPIAKPDEIRFGDNLPKTRSGKIMRRLLRAIARGEEITQDVSTLENPAILEQLKEAVR >NZ_AP021884.1|WP_147072301.1|2656415_2659091_+|bifunctional-[glutamate--ammonia-ligase]-adenylyl-L-tyrosine-phosphorylase/[glutamate--ammonia-ligase]-adenylyltransferase 
MPAHHLIERAASHSRYLARLLAADAQFVDSLASGLAQPFGADAMQAQLQAAAPGDEAMLKTALRKLRQAVMARLIVRDLGGLADLSEVMGTCTDLAETTLRCALAHHSTWLAQKHGMPKNPDGSDMQLVVVGMGKLGGRELNVSSDIDLIYLYPEQGETTGAKPVSHHEFFVLLGKKLGLAISDLTADGFVFRVDMRLRPWGDAGPLAMSYAALEDYLVAHGREWERYAWIKGRALTGTRLAELDQIIRPFVFRKYLDFNAFAAMRELHVQIRREVIRRDRADNIKLGPGGIREIEFTAQVFQLIRGGQVAVLQTRSLLAVLPLLAARGLLPENAVAELQAAYVFLRNLEHRLQYLDDAQTQMLPTQPDDRTRIATSMGFTDYPAFLAALNAHRTQVSRHFDQVFAAPQADSGSHPLAGLWQGALEHADALATLAGLGYTAPAEVCNRLRQIRTSIRYTTLPASNRARFDTLMPALIEVAASCNPPDATLARILDLLETVARRDSYLALLVEYPATLQRVARLCAASPWAAQYLARNPMLLDELLDTRQLYATPDWPALGDELQALMHTHCGDTERQMDAMRQFRQRVTFHLLAQDLAGVLALETLSDHLSDLAALILSATLPLAWAGVRNRHRDTPRFAVIGYGKLGGREMGYASDLDLVFLYEDPAPAAAEHYARLAQRINTWLGSTTAAGVLYETDLRLRPDGTSGLLVSSVEAFSQYQHSHAWTWEHQALTRARYVAGDAAVGAAFERIRCDILTQPRDPARLREDVLAMRQKMHAGHPNHSDLFDLKHDAGGIVDVEFMVQYLVLAHAARHRELTRNSGNIALLRLAAELELIPASDAEAVRSAYRELRRLQHALRLHGIQTARIEPMQVAGHAAAVRRLWRTLFG >NZ_AP021884.1|WP_147072457.1|2659154_2660069_+|branched-chain-amino-acid-transaminase MADRDGFIWYDGKMVPWRDATTHVLTHTLHYGMGVFEGVRAYNTDQGTAIFRLQEHTDRLFRSAHILGMKMPFDKAAISAAQLAAVRDNQLESAYIRPMAFYGAEAMGISAKTLSTHVIVAAWTWGAYMGAEALERGIRVKTSSFARHHVNIAMCKAKANGNYMNSILAHQEAAQDGYQEALLLDVDGFVAEGSGENVFIVRNGKLITPDLTSALEGITRDTIVQLAGEIGLQVVEKRITRDEMYSADEAFFTGTAAEVTPIRELDNRTIGTGARGPITAQLQKMYFDCVTGKDPKHAGWLSYI >NZ_AP021884.1|WP_147072303.1|2660079_2660286_+|zinc-finger-domain-containing-protein MAQVQHENTQRIIEVTADDLPLHCPTPGMIAWDSHPRVFLPVEVKGEALCPYCGTMYILKGGAVAHGH >NZ_AP021884.1|WP_147072305.1|2660694_2661141_+|6-carboxytetrahydropterin-synthase-QueD MLITRRLEFDAGHRIPNHASQCKHLHGHRYAIEITLSGDIITAEGQSEQGMVMDFSDVKRIAREQLVDAWDHAFLAYRGDKPVCDFLATLTDHKTIILELVPTVENLAHIAFDILDPAYRDTYGNQLRLKQVRIYETPNNWADCRQPE >NZ_AP021884.1|WP_147072307.1|2661191_2661659_+|YbhB/YbcL-family-Raf-kinase-inhibitor-like-protein MGMTMTSTAFAHHGAIPEHYTCDATDTSPPLAWAGVPVGAKSLVLIVDDPDAPDPAAPQRTWVHWLLYNLPPTSSGLAEGVTALPAGTLEGINDWKRTGYGGPCPPIGRHRYFHKLYALDVVLPNLDRPSKAALEKAMQGHILAQTELIGLYQRH |
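Each neighbor-protein record above carries a FASTA header with four pipe-separated fields: contig accession, protein accession, a `start_end_strand` location, and a hyphenated product name. The following is a minimal Python sketch of how such a header might be unpacked. The `NeighborProtein` type and its field names are illustrative assumptions, not part of the database's schema, and the source does not state whether coordinates are 0- or 1-based.

```python
from typing import NamedTuple


class NeighborProtein(NamedTuple):
    """Hypothetical container for one neighbor-protein header."""
    contig: str
    protein_id: str
    start: int
    end: int
    strand: str
    product: str


def parse_neighbor_header(header: str) -> NeighborProtein:
    """Parse e.g. 'NZ_AP021884.1|WP_147072303.1|2660079_2660286_+|zinc-finger-...'."""
    # Split on at most three separators so the product name stays in one piece.
    contig, protein_id, location, product = header.lstrip(">").split("|", 3)
    # The location field has the shape '<start>_<end>_<strand>'.
    start, end, strand = location.split("_")
    return NeighborProtein(contig, protein_id, int(start), int(end), strand, product)


record = parse_neighbor_header(
    ">NZ_AP021884.1|WP_147072303.1|2660079_2660286_+|zinc-finger-domain-containing-protein"
)
print(record.protein_id, record.start, record.end, record.strand)
```

The prophage protein records further down use a different field order, so this sketch applies only to the neighbor-protein headers.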
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_AP021884_4 | 2907296-2907401 | Orphan |
NA
Consensus repeat of NZ_AP021884_4
|
1 spacers
spacers of NZ_AP021884_4
>4.1|2907330|38|NZ_AP021884|CRISPRCasFinder CAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGC |
CRISPR arrays and Neighbor proteins around NZ_AP021884_4
The CRISPR arrays of NZ_AP021884_4 >merge|NZ_AP021884|4|2907296-2907401|CRISPRCasFinder GATTCCAACGACTGGGTGCGGCTGGGCAGTATGGCAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGCGATTCCAACGACTGGGTGCGGCTGGGCAATATGG >NZ_AP021884|4|4|2907296-2907401|CRISPRCasFinder GATTCCAACGACTGGGTGCGGCTGGGCAGTATGG CAGCCACCTTTGGAAAATATCCTGTCTGGTCATGCTGC GATTCCAACGACTGGGTGCGGCTGGGCAATATGG
>NZ_AP021884.1|WP_147070033.1|2903228_2905838_+|alanine--tRNA-ligase MKSSEIRQRFLDFFARHGHTPVASSPLVPGNDPTLLFTNAGMVQFKDVFLGRETRPYARAVSSQRCVRAGGKHNDLENVGYTARHHTFFEMLGNFSFGDYFKRNAIQFAWEFLTQELGIAKDKLWITVYHTDDEAHTIWTAEMGVPDERVIRIGDKPGGGSDNFWQMGDTGPCGPCTEIFYDHGAEVAGGPPGSADEDGDRYIEIWNLVFMQFNRDEAGNLQPLPRPSVDTGMGLERISAVMQHVHSNYEIDLFQALIHAAARVTGSADLTDNSLKVIADHIRACAFLITDGIIPGNEGRGYVLRRIIRRAIRHGYQLGQKQPFFHLLVADLAMAMGAAYPELVAAQARVTAVLKQEEERFAETLEHGMDILEQALQSGANVLDGATAFKLYDTYGFPLDLTADVGRERGFTVDMAGFEAAMEAQRKRARAASKFTMQAGMRFDGPPTEFRGYDTLSLDSRILALYQDGSPVHSIAAGEAAVIVLDRTPFYAESGGQVGDSGELHGSGSVFVVDDTQKIQPDVFGHTGLLQSGSLKLGDTVSAQVDADARSRAACNHSATHLLHAALRQVLGTHVTQKGSLVDAARTRFDFAHSEAVSAAQLQQIEDLVNREIRRNVIVEARLMNYDAAIAHGAMALFGEKYGDQVRVIGMGEFSTELCGGTHVSRSGDIGLFKIISESGVAAGIRRIEAVTGPAALAMIQAQQRQILEAAALLKAPPQELQQKIAQIVDNVKNLEKELDRLKSRLAAAQGDDLVSQATAVGNAKVLAAMLEGADVKTLRETVDKLKDRLKSCAVVLGSCSDGRVTLVAGVSADLTSKVKAGELANFVASQVGGKGGGRPDMAQAGGTEPAQLPAALQSVAGWVAQRLE >NZ_AP021884.1|WP_147070267.1|2901373_2903095_+|thiosulfohydrolase-SoxB MNRREFLQILAVAAASGMAIDNKQALAGNAPSGFYDLPKFGNVSLLHMTDCHAQLLPIYFREPDVNIGVGAAIGQPPHLVGEYLLKYYGIRPGTREAYAFTYLDFAAAARTYGKVGGFAYLKTLVDKVRASRPGSLLLDGGDTWQGSATSLWTNGQDMVDAAKLLGVNVMTGHWEFTYGAERVKHVVDNDFKGHIDFVAQNIKTNDFGDPVFKPYVIKTMNGVQVAIIGQAFPYTPIANPRYMVPDWSFGINDDNMQKVVNEARAKGAQVVVVLSHNGMDVDLKMATRVTGIDAIFGGHTHDGVPQPTQVKNAKGTTLVTNAGSNGKFLGVMDFDVKNGKIAAWKYRLLPVFSNLLEPDAKMAKLIEDVRAPYASKLNEKLAVTEELLYRRGNFNGTFDQLILDALMAVKGADAAFSPGFRWGTTLLSGDVITMDHLMDQTAITYPSTTLTEMTGATIKSIMEDVCDNLFNADPYYQQGGDMVRVGGIQYAVAPNNKIGNRISNMTLKGKPVMASKKYKVAGWAPVGEGVSGTPIWDVVAEYLRDIKVVKPRKLNEPKIVGIGKNPGIAPGIA >NZ_AP021884.1|WP_147070035.1|2900312_2901191_+|sulfur-oxidation-c-type-cytochrome-SoxA MKTTFREPQAQSPKAHGKKILLALAGAGLLLGALNASATPEQDRQSLLKFYSSKYPDIKVANYIYGALAFDPDAMEQYNSIMDFPPFGSVIEHGKKMWETPFKNGKKYADCFPNGGKNVAGNYPYFDDKAGKVVTFEMAINACRTANGEEAFKYNDMQTMGTLTAYARTLSDGMPMNIKVQGAAATAAYEAGKSQFYSRRGQLNFSCASCHVANAGNHLRSELLSPAVGQATHWPVFRGGEQLVTLQERYVGCNKQVRAVPFAPGSEEYNNLEYFHSYISNGLPLKASVFRK 
>NZ_AP021884.1|WP_147070037.1|2899922_2900237_+|thiosulfate-oxidation-carrier-complex-protein-SoxZ MAEPMKMRASVSGDVADIKVLMNHPMETGLRKDAKTGQLIPAHFINEVHATVNGKPVLDAQWGGGVSKNPYLGFKVKGAKAGDKVEVSWKDNKGESNKVDGVVA >NZ_AP021884.1|WP_147070039.1|2899397_2899865_+|thiosulfate-oxidation-carrier-protein-SoxY MNALRRNILKSAGATGIVAMAAAAGLLKSGNVLAAWNSSAFAAKTVPEAIKDLGLSTPADSKAISIKAPDIAENGAVVPVEVTSSIAGTTGIAIFAEKNATPLITDFKLSNGAEGFISTRIKMGQTAMVRAVVTAGGKTYTAAKEVKVTIGGCGG >NZ_AP021884.1|WP_147070041.1|2898997_2899366_+|sulfur-oxidation-c-type-cytochrome-SoxX MRAGASLILTASMVGMFVMANYAVAADTPKQEETGKSIAFDKTKGNCLACHAMPTVPDAVAAGTIGPPLIAMSARYPDKAKLRAQIWDATVANPQSVMIPFGKHKVLTEQEIDKVTDFVYGL >NZ_AP021884.1|WP_147070043.1|2897364_2898786_+|M48-family-metalloprotease MKSASLFLALCLASQQLAASELPDLGDVSQGAFSPRDEARVGNEIMRDIYAEPAYYDDPELTDYLNNLGYRLVAASPENRLAFQFFVLRDHTLNAFALPGGFIGVHTGLIEATQSESELAGVLGHEIAHVTQHHLARMIESRNQGILPSLAALAVAILAARSNPQAASAAIATVQATSIQKQLNFSRANEREADRIGMQIMRGAGFDPRAMATFFERLQKNSRLYENNAPAYLLTHPLTSERIADMQNRAASMPVKQVADSLEFQLLRAKLLAGEGRPEEAVRRFTEAIRDTRYNSLAAERYGLVVALLRTRQFDRAEQELDRLNQSGASSPMIAMLGARLRQEAGDLNTALARYQAGRARFPGYRPLLYADANALLQAGKADAALALVTDHLALYPDDYRLYQLQSRAYAMQGKDFLRHHAQAEAYVRQGNLDAAIEQLKLGLKSRDGDFYQMSIAEARLKELVALNQPAKP >NZ_AP021884.1|WP_147070045.1|2896848_2897325_-|cyclic-pyranopterin-monophosphate-synthase-MoaC MNQLTHFDDRGRAQMVDVADKSDTRRVAVAAGRIVMQPATLKMILDGSARKGDVLGVARIAAIAASKRTADLIPLCHPLALTRVAVEFLAEEADSAIECRVTAETVGKTGVEMEALTALSVGLLTIYDMCKAVDRGMRMEGLRLLEKQGGKSGHWRAP >NZ_AP021884.1|WP_147070047.1|2894604_2896824_+|EAL-domain-containing-protein 
MTQIDTRLISTATWLAGALAGLIALAFPLVYFSLSYEHQAASMETEAEFEAARIARLINANPELWPFEQSRLQELLQDQTETELPESRRIVDVNGRLIAQSQGKSARPYLLRTADLRNSGSVAGRVEIIRSLRPLLLKTAMASLLGLLLGSLAMVIFRAYPLRILKRALNTLANEKERAEVTLHSIGDAVITTNASGHIEYLNPVAEQLTGWTNEAARGLPSWRVFNIINESTGAPLDSPAEKAIKENRIVPLANHAGLVKRNGKIIPIENSAAPICDSQGQIIGAVLVFHDVSHARAMATKLSHQASHDPLTGLINRHTFESRLQQALDNVRRENSHHTLCYMDLDQFKIVNDTCGHRAGDELLRQLAGELRTKVRNSDCLARLGGDEFGLLLEGCTVQQAEHVAATLLQTVKEFRFHWQEHTCAVGVSIGLVGINAGCGDLAKIMGAADSSCYAAKDRGRNCIYVYQPDDKEVAQRRGEMQWVARITRAIDEGRLRLYYQTIQPLAGTQGAHYEILLRMLDEEGRIVPPGTFIPAAERYGLMPAIDRWVIENTFATLGRLYRGDAKKRLHTCAINLSGTSWADESLAGFICGMTGRHGVPARSICFEITETAAISNLGKTIALIRDLKEAGFRFSLDDFGSGVSSFGYLKQLPVDYLKIDGGFVRNIIHDKIDHAMVAAINQIGHIMGIKTIAEFVENEEILERITAMGVDYAQGYAIARPQPLDHINLASAPVLQQ >NZ_AP021884.1|WP_147070049.1|2893538_2894183_-|methyltransferase-domain-containing-protein MQAAEYDAWYQTPRGRWVGETESDLLRRMLGPQSGESLLDVGCGTGFFTRRFARGSRSAVTGLDPNRDWLAFAERHAVSTENYVNGSALALPFDAGGFDLVMAVTALCFISDQRLALTEMLRVARRRIALGLLNRHSLLYWQKGRGGGQGAYRGAHWHTAAQVRELFAGQPVRNLRMAFSIFVPGGGWIAQQLESVLPTSLPLGSFFVVTADVV >NZ_AP021884.1|WP_147070029.1|2907798_2908467_-|alpha/beta-fold-hydrolase MTAFAPLEFVAGSQAVQASVIWLHGLGADGHDFAPVVQALDLPGVRFILPHAPTRPVTINGGHVMPAWYDIRSTGLDADEDAAGLAQSSRVVEDLVAHELARGVASARIIVAGFSQGGALALYAGLAPGRVLGGIMVLSAYLPLMAGFNEWCAAGTHTIPVFMAHGVQDRVVPLQLAERSRQKLVACGFDVEWQIYPMAHSVCEEEIDAIRGWLIRVLQLHV >NZ_AP021884.1|WP_147070027.1|2908463_2909963_-|hypothetical-protein MRRIPLVGKLLPAAGEALAVPVRDDPASYSAHEICESIEQLIETLLSARKKNLDWQRHSIDSLHRQDNFSAPFMTRLTQHYLALPPFVSSVSGRFLAAISGYWEEMSAIHLQCVTYLLGHPESRLGALLPLLIQRALYHHAMQMKWRWLRYQLIPSCFWARLHRLYAVAEKHEFARVPLPLPGMERADSCCETLYLRPQMLHSLRPDTLLPCEIEQVDEWIVRWSKSVLLEPMLLSGKHRYGVNLKGASPPRPLAMLNEPGSYRYWGPGLMLAALHAEHDEADAGAHAGWRQALWRRVVNDWSGIPPLRHHPRQMIGKQTELFLGFNEIHTRIDHHPSRRAHDLPYWRRCRVRDESAEGLGLALNTSDGVPVAINSLIGINSGRHFLVGVVCRIRRHESGWTEIGIRRLAANAVPVKLESVNVNLAGQVVDALYLSMAGAFGQRRCVLIPARISWQDGQWQLLCKGRRHLIRLRAPLKATEDYVLADFDGLAQSEAIAS >NZ_AP021884.1|WP_147070024.1|2909981_2910410_-|hypothetical-protein 
MKDFIALLVEQSRLQSVAINPALDDALTHLDHALAGLCAAVQVEYRGPYVGVETPLAHQMVVRRHEWKIHQPAWSMKICVAAPAANCRAEWPVQGVGRLRKALVVKALPAFFAGFAEAIKQAGKQDSSAGLRVLELSRRFNL >NZ_AP021884.1|WP_147070022.1|2910437_2912018_+|sigma-54-interacting-transcriptional-regulator MRAQIRANWRKYYHTTRRAVRARSATRTAGNLAQSGQNANNAKEGTNVSLQGISAKPSLLIVDDDPLITDTLNFVLSRDFEVFVADSRSQVKSLLTQLDTPPQLALVDLGLPPLPHKPDEGFHLISELLGYSPGIKILVLSGQNDETNARHARALGAIDFVGKPCEPAQIKSLLFNALLIQDVERSAETEAPAAENLIVGTSFNLDRLRQQITQYANAPFPVLIEGESGSGKELVAASLHKLSGRTKKPYLALNCAAISPTLVEPTLFGYCKGAFTGATSNRAGYFEDACDGTLFLDEIGELPLELQAKLLRVLENGEFQRVGETQSRFSNARVVTATNRDLRQEIKAGRFRADLYHRLSVFGIAVPPLRELGEDKVRLLEHFREFYAREARVKPFALDNRARQMWEDYHFPGNVRELRNIVIRLTTKCAGQNVTAEQLETELDTDTAFPSEIPLPNDGKALYDTARRHLQTLANFSLDQTMKQWEKSYVEAALNLTHGNLSQAAKILGINRTTLYSRMQTYTNEA >NZ_AP021884.1|WP_147070020.1|2912147_2913497_+|AAA-family-ATPase MYHEFFGLKEAPFRITPDTGFFFSGGERGAILQGLAYAIRQGEGIIKVTGEVGSGKTMLCWMLEQHLPDHIETVYLANPNVKPEDVLPSILAELELVRPADASRAGHLRTLNDYLLARHDAGKQVVMFVEEAQGMTLDTLEEIRLLSNLETEREKLLQIVLFGQPELDAKLADPRIRQLRERITTAITLAPLTPDAIRAYLAFRLTTAGYRGPDLFDRRAVRSIARASRGLTRRVNILADKSLLAAYTDNTRTIQPRHIRIALRDSAFNDDANKPQRWLLPVIAMGVMVAVLASFYWRSKPAAAPSRQTQTRPAAGLPGRASAAAPDPVAPLSADPFQQRLAATRTWLMQQPADTRTIQLSLLNSPSEFAAYLRGEGGGLAPDQLRIFRTQAQGHPSWTVIYGSYPTRQTANRALLALPEAVRKRHPYLRTVGGIRNETRQIQQVGEQS >NZ_AP021884.1|WP_147070265.1|2913576_2915352_+|secretin-N-terminal-domain-containing-protein MWLPMLAVPLLAGCVPAAMIQPSQGHIQQSSQPATRLADIPPLVKTIPYLPSPRAETQVPTYTIVVDNVPVKDLLFSLARDTKKNIDIGTGITGNVTLNAVNEPLPAILERIARQASIRYRMEGDTLSIMPDTPYLKTYKVNYVNLSRNTSSSIGVAAQIASTGSGAVGAAASGSAQGGNSSSTTVDSQSNNNFWEVLTENVRAILTSTRASTQRAEDKSARLDAERNARADRLEQAQAVARAGAAAPTLYREAFGNTSSSLLQDSKNEVIVNPVAGTVSVLGNERQQQVVQQYLDGVSQSSQRQVLIEATIVEVSLKDQYRAGIDWSRLANGSKGIFFNTMPAATTNLANSLLPFFNIGYRDRNLTATLNLLESFGNLRVLSSPKLMALNNQTALLKVVDNLVYFTVQAQQGTLSSTGTPLQPTTFTTTAKTVPVGLVMSLTPQISESGMVTLDVRPTISRKIGDVSDPNPGLPVSTPNKIPVIQVREMESVLQVGSGQTVILGGLMQDDSDRARDGIPVLSRPQGFGAIFGQHEHNVQKTELVIFLRPTVITNPSLDSDELKFYKRYLPRANAAPEQWHNGADAAGDPQ 
>NZ_AP021884.1|WP_147070018.1|2915348_2916524_+|tetratricopeptide-repeat-protein MSLLLKALKQAGDKSAAGARNPSATLADSLSLEPISGSAPDGTAYTSWDGAAPFKRSTARAAWYTPWLSGQRWLVPAVAVVAALFMLIYGVFVYWQTRTPAALVVTPTPHSAAPAAAPPAAAPAQLAAVPSQESGPPLPEINSAVPDAPAALPPPPVQADPTPQWGSGELIREAPPPRRARTQPGRRETRSALPFSMQTATTHINPQLEAAYQAYQAGHTREARNLYLQIPDGERNVDVQLGLAAIALRDNDTPAAARHYQRVLELDPRNSTANSALIGMMGDADPNASETRLKSLIASQPSSQLYFALGNLYAGQNRWPDAEQAYFEAYQKNAANADYAYNLAVSLEHISQSRAALNYYQKARDLMQPGNVQFDPLRLEARIDQLKARQE >NZ_AP021884.1|WP_147070016.1|2916529_2918233_+|Flp-pilus-assembly-complex-ATPase-component-TadA MEARKTLRLGEMLVQQGLITLDQLRIALKEQQHTNLPLGRLLVKLGFITEAVIRDQLAHTIGQTSLDLANVVADPEALKLISEDFARRHHLLPIAFDAQRQVLVVAITDMFNVVALDHLRALLGAGVEVDTVLSGEAQLLEAIDNFYGFELSVDGILREIETGEVDYQSLAMDTEEYTQPVVRLVGSLLVDAVKRGASDIHFEPEHAFLRIRYRIDGVLEQVRSLHKSYWPGIAVRLKVISGMNIAENRAPQDGRLSLTLHGRPIDFRVSSQPTIHGENIVLRVLDREKSIIPLANMDLPTDTHTALQRMMARPEGILIITGPTGSGKTTTLYSLLTHLNNETVNIMTLEDPVEYPVTLMRQSSVNETLKLDFANGIRSIMRQDPDIILVGEIRDRDTAEMAFRAAMTGHQVFTTLHTNSALGAFPRLLDIGIVPDIMAGNIIGVVAQRLVRVLCPHCRAAYTPDADEQKLLDWQATDRRPVYRAVGCPACNGKGYRGRMALMELLRMDSELDDLVARRATHREILNAALMRGYRSLAVDGISRVLEGKTSLAEVSRVVDLTQRILS >NZ_AP021884.1|WP_147070015.1|2918246_2919452_+|type-II-secretion-system-F-family-protein MPYFSYRAVDQIGRTNRGSLSAANEVDLELRLRRMGLDLITLRQMDSRASGFARGAASRRDLITFCFHLEQISRAGIPILDGVRDLRDSMDNPRFRDILTALLEDMEGGRLMSQALAAHPAVFDTVIVNLVRAGEQTGLMREVFENLGASLKRQDELAAQTRRLLIYPTLVLSMVGIIILLLLLFLVPQIADLIKNMGIALPIQTRVLLWLSETLRTWWPLFLILPVAIGSALVVTLRASERARFVADDVKLRLPVIGPILQKIALARFSNFFALMYRSGITILDALRAGEDIAANRVIADAIRRAGGRIGNGEGLTESFQSLSVFPPLVIRMLRVGETTGALDTALENVSYFYTREVSESIEKSLKILEPALTVVLGLVMAVIVGSVLLPMYDVIGTLKP >NZ_AP021884.1|WP_147070013.1|2919448_2920984_+|hypothetical-protein 
MMFAPQLLVYVCAWSITVACRRAGKIRLVGQFNADEGGRRAFAAVLQAFKNSPVSVMVDGVDEDYRLETLPHVLGNARREMLERRLRQISRNALFSAAWPQGREASGRRDDRYLFISLSNHDAVRPWLDLLHQHGVHLAELTVLPAISHVLLQRIQPTEPHVLLVSEHCGGLRLSYFEHGNLRFSRLTAPESLAEGHAPDLASEINKTDLYLNSQRLMPRDAQLAVYVLDPENAYAGLCREISAENKNLICQAVGSVALAKLVGVDEPLLHRTADVAYLAVLGRSRAAVNLAPAAYTRGYVQLMLRHKLYTGAFAVLATALAISGYLFSRQHDLEQQRLVTQDRIQQQASLYRAVQLALPRAPTSPQNLKRVVETARALYAAPQPMSDFARVSQALETVPDIAVLRLRWLDHDAADTTATHSAVSDNPGAAVRALYFDGEVSPFQGDYKTALASIEHFAATLRNDPGVAEVRVLALPINTDPTATLDESQHTGNSAPRARFRLKLLMRPAR |
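The NZ_AP021884_4 record above lists the merged array (2907296-2907401) alongside its repeat/spacer/repeat breakdown, and the spacer header gives the spacer's start (2907330) and length (38). As a hedged illustration, the Python sketch below recovers the breakdown from those coordinates alone. It assumes 1-based inclusive genome coordinates, which is consistent with the 106-base array length; `split_array` is a hypothetical helper, not part of CRISPRCasFinder.

```python
def split_array(merged: str, array_start: int, spacers: list[tuple[int, int]]):
    """Split a merged CRISPR array into alternating repeat and spacer segments.

    `spacers` holds (genome_start, length) pairs, sorted by start position.
    """
    parts = []
    cursor = 0
    for start, length in spacers:
        offset = start - array_start  # 0-based index of the spacer in `merged`
        parts.append(("repeat", merged[cursor:offset]))
        parts.append(("spacer", merged[offset:offset + length]))
        cursor = offset + length
    parts.append(("repeat", merged[cursor:]))  # trailing repeat
    return parts


# Merged array of NZ_AP021884_4 as listed above (wrapped for readability).
merged = (
    "GATTCCAACGACTGGGTGCGGCTGGGCAGTATGGCAGCCACCTTTGGAAAATA"
    "TCCTGTCTGGTCATGCTGCGATTCCAACGACTGGGTGCGGCTGGGCAATATGG"
)
parts = split_array(merged, 2907296, [(2907330, 38)])
for kind, seq in parts:
    print(kind, len(seq), seq)
```

The two flanking repeats differ at a single position (`CAGTATGG` versus `CAATATGG`), which the split makes easy to see.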
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NC_007766 | Rhizobium etli CFN 42 plasmid p42f, complete sequence | 395837-395869 | 6 | 0.818 |
NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NZ_CP020911 | Rhizobium etli strain NXC12 plasmid pRetNXC12e, complete sequence | 526121-526153 | 6 | 0.818 |
NZ_AP021884_2 | 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder | 1831723-1831755 | 33 | NC_021911 | Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1f, complete sequence | 601591-601623 | 6 | 0.818 |
NZ_AP021884_3 | 3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2645871-2645904 | 34 | MN692973 | Marine virus AFVG_117M33, complete genome | 35754-35787 | 9 | 0.735 |
NZ_AP021884_3 | 3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2644999-2645034 | 36 | NC_008043 | Ruegeria sp. TM1040 megaplasmid, complete sequence | 694259-694294 | 10 | 0.722 |
NZ_AP021884_3 | 3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT | 2645653-2645687 | 35 | NZ_CP007794 | Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence | 1356140-1356174 | 10 | 0.714 |
1. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches to NC_007766 (Rhizobium etli CFN 42 plasmid p42f, complete sequence) position: 395837-395869, mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer
ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer
.*****.** ****************   ****
2. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches to NZ_CP020911 (Rhizobium etli strain NXC12 plasmid pRetNXC12e, complete sequence) position: 526121-526153, mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer
ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer
.*****.** ****************   ****
3. spacer 2.1|1831723|33|NZ_AP021884|CRISPRCasFinder matches to NC_021911 (Rhizobium etli bv. mimosae str. Mim1 plasmid pRetMIM1f, complete sequence) position: 601591-601623, mismatch: 6, identity: 0.818
ttgcgttggatacctcatcctcatcattgcgct--- CRISPR spacer
ctgcgtcggctacctcatcctcatca---cgcttgc Protospacer
.*****.** ****************   ****
4. spacer 3.13|2645871|34|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches to MN692973 (Marine virus AFVG_117M33, complete genome) position: 35754-35787, mismatch: 9, identity: 0.735
aacaatttcttgctggataaaatcaagccgctta CRISPR spacer
tacaatttcgttctggataaaatcaagtgttgca Protospacer
 ******** * ***************.  . .*
5. spacer 3.1|2644999|36|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches to NC_008043 (Ruegeria sp. TM1040 megaplasmid, complete sequence) position: 694259-694294, mismatch: 10, identity: 0.722
atggtgcgatcctgttgttgctggttgtgctgcggg CRISPR spacer
tcgctccgatcctgttgtggctggtggtgctgatca Protospacer
 .* * ************ ****** ******   .
6. spacer 3.10|2645653|35|NZ_AP021884|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP007794 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p1, complete sequence) position: 1356140-1356174, mismatch: 10, identity: 0.714
accggttcatcgccgtgccgcctgtccatcgccgc CRISPR spacer
gtccagccgtcgccgtgccccctggccatcgccgg Protospacer
..* . .*.********** **** *********
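The mismatch and identity values reported for the six hits above can be reproduced from the aligned strings themselves. The Python sketch below assumes that only columns containing a spacer base are scored (so gaps opened in the spacer are ignored, while gaps in the protospacer count as mismatches) and that identity is the matching fraction of the spacer length rounded to three decimals. This is an inference from the listed numbers, not a documented formula, and `spacer_identity` is a hypothetical helper.

```python
def spacer_identity(spacer_aln: str, proto_aln: str) -> tuple[int, float]:
    """Return (mismatches, identity) for one gapped spacer/protospacer alignment."""
    if len(spacer_aln) != len(proto_aln):
        raise ValueError("aligned strings must have equal length")
    # Keep only the columns where the spacer contributes a base.
    columns = [(s, p) for s, p in zip(spacer_aln, proto_aln) if s != "-"]
    mismatches = sum(1 for s, p in columns if s != p)
    identity = round((len(columns) - mismatches) / len(columns), 3)
    return mismatches, identity


# Hit 1: spacer 2.1 against NC_007766.
print(spacer_identity("ttgcgttggatacctcatcctcatcattgcgct---",
                      "ctgcgtcggctacctcatcctcatca---cgcttgc"))
# Hit 4: spacer 3.13 against MN692973.
print(spacer_identity("aacaatttcttgctggataaaatcaagccgctta",
                      "tacaatttcgttctggataaaatcaagtgttgca"))
```

Under these assumptions the sketch returns 6 mismatches with identity 0.818 for hit 1 and 9 mismatches with identity 0.735 for hit 4, matching the table.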
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
622875 : 631682
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_AP021884|622875:631682|DBSCAN-SWA TTCAGGAATTGCCGCTGTTGTCGATGAAGCTCTTGAGACGGTCAGAGCGCGATGGGTGGCGCAGTTTGCGCAGCGCTTTGGCTTCGATCTGGCGGATACGCTCACGGGTTACGTCGAACTGTTTGCCGACTTCCTCCAGGGTGTGGTCGGTATTCATCTCGATGCCGAAACGCATGCGCAGCACTTTGGCTTCGCGTTGCGTCAGGCCATCGAGAATATCCTTGGTCACTTCCTGCAGGCTGCCGTAAACGGCGGCGTCTATCGGCGCCAGCGTGGCGGTGTCTTCTATGAAATCGCCCAGATGGGAATCCTCGTCGTCGCCGATAGGGGTTTCCATGGAAATAGGCTCTTTGGAGATTTTGAGTATCTTGCGGATTTTCTCCTCGGTCATTTCCATTTTTTCGGCCAGCAATTCCGGGTCGGGTTCCTTGCCGGTTTCCTGAAGAATCTGACGCGAGATACGGTTCATCTTGTTGATGGTCTCGATCATGTGCACCGGAATACGAATGGTGCGTGCCTGATCCGCGATGGAGCGGGTGATGGCCTGACGGATCCACCAGGTGGCGTAGGTCGAGAACTTGTAGCCGCGCCGGTATTCGAATTTGTCCACGGCTTTCATCAGGCCGATGTTGCCTTCCTGGATCAGGTCGAGGAATTGCAGGCCACGGTTGGTGTATTTTTTTGCAATGGAAATCACCAGGCGCAGGTTGGCCTCGATCATTTCACGTTTGGCGCGGCGCGCGCGCGCTTCGCCAGTGGACATCTGGCGATTGATTTCCTTCAAGTCCTTGATCGGGATGCCCACTTTTTCCTGCAGGGCAATCAGGCGTTGCTGACGTTCGACGATAGTGTGCTGGTAGCGTGTCAAATCCTCTGAGTAGGCCTTTTTCGAATTGATTTCCTTGGTAACCCAGTCCAGATTGCTTTCGTTGCCTGGAAACACCTTGATGAAGTGGGCGCGGGGCATACCTGATTTGTTCACCGCGAATTCCATGATGTCACGCTCGTGGCTGCGGACTTCTTCCACCAGATTGCGCAAGCCTTCGCATAGCGCCTCGACTTGCTTGGCAGAAAAGCGGATATTCATTAGCTCAGCGGAGAGCTCTTCCTGCAGTTGCAGATACTGCGGGCTGCCGAAGCCGTTTTTCTTCAACGTGGCCTGTATCCGCTTGAATACCTTGCGGATGACTTCAAAGTGCGCCATGGCGTCGATCTTGAGCTGAGCGAGATTGGCTGCAGCCAGCGCGCTACCGTCATCTTCCTCGGCATCTTCGTCGAGCTCTTCTTCGAGTTCGCTGATGTCAACCTCGGGCTCGCTGGCGATCGCTTCGAGTTCTTCTGCTACGACACCATCCACGAAATCGTCAATGCGAATTTCCTCACGCTCAACCTTGTCCACCAGTGTCAGGATTTCCTGAATCGTGGTGGGACAGGCGGAGATGGCCTGAATCATGTGTTTCAAGCCGTCTTCAATGCGTTTGGCAATTTCGATTTCGCCTTCACGCGTGAGCAGTTCCACCGAGCCCATTTCGCGCATGTACATGCGCACCGGGTCAGTGGTGCGGCCGAACTCGGAATCCACGGTAGATAACGCAGCCTCGGCCTCGGCCACCACGTCCTCATCGGCGACTGCCGGGGCGGCGTCGGACATCAACAGGGTCTCGGCATCAGGGGCTTCGTCATAGACCTGGATGCCCATGTCATTGATCATGCTGATGACGCCTTCGATCTGTTCGGCATCCAGCATGTCATCTGGCAAATGGTCGTTGATCTCGGCGTAGGTCAGGTAGCCCCGCTCCTTGCCGAGCACAATCAGATTCTTCAGGCGTGTGCGTCGGGCTTCGACATCTACCGTTTTCACCTCGCCTTTTTCGTGATCGTTTGCCATATGTCTTGGTCCAGTAAAATTTGGTGCATCAAAAAAA
CTGACAATTATACCTTTGTTCGAGCCTGATCGCTCATTTTTCCACCTGCTGTCTGGCATTAACCAATTGTCGCAACGCTGCTTTTTCGGTTTCACTCAGTGCGCTGACGGGCTTGTCGGTTACACGCGCACCCCGCTGTTTGTCGAGTTGCATGCGTGCCTGATAACAGGCGTCAGCAAATTCGGCGCTGATGTCGAGATCGGCTGCCCAGCCCATTATTTCCGCGCTGGCGCGCTGCAGGATGGACGCCAAAGCGTTATCGCGAAAATGGTCTATCACGCTCGCAGTGCCTAAATTGGGATGAACGCGCAGCAATTCAACCACAGCGCGTAGCGCGTCTGCATCCGGGTCAGTTGGGGCAATCAAACTGGCGTCAAGTTCCCGTGCCAGGCTGGGCATAAACAGAATCGCACGCAGCAGCCAGTGCCAAATGGAAGCGGGTGCCTGGCGTGGGGCACGGACAGGTGCACGCGCCGGGTTGAATTGCCGGGATTTGATCTGCCACAAGCTGTCGAGTTCGGACAGCTGCAGATTTGCCAGTTCGGCGCAACGTTTGCGCAGCAGGAGTGCCAGGGCGGGTGCGCGTATCTGGCTTAACAGGGGGTGCGCAGCCTGCAAAAAGGCACTGCGTCCTTCGCTGCTGGCCAGGTCATGCTGGGCTGCCAGCTCCTTGAACAGATAGGCAGATAGCGGTACCACCTCGCCGCCCAGCAGCGCTTCGAAAGCGCCTTTGCCAAATGCGCGAATATAGCTGTCCGGGTCGTGTTCCGGCGCCAGAAACAAAAACCCGACACGACTGCCATCGCTCAGTATGGCCAGGCTGTTTTCCAATGCGCGCCAGGCGGCGTGCCGTCCGGCGGCGTCGCCATCAAAGCAGAACACGAGCTCATCAGTGTGGCGCAGCAGTTTTTGCACGTGCGCCGCGGTGGTGGCGGTGCCCAGGGTGGCGACGGCGTACTCCACCCCATGCTGCGCCAGTGCCACCACGTCCATATAGCCTTCCACCACAATCACGCGTCCCGCATCGCGGATTGCGCGGCGTGCCTGGAACAGGCCATACAGCTCGTTACCTTTCTGGAATAGCGGCGTTTCCGGTGAATTCAGGTACTTGGGTTCAGCTGCGTCGAGCACGCGGCCGCCATAGCCGATGACATCCCCGCGTTGCCCGACAATCGGGAACATGATGCGGTCGCGAAAACGGTCATAACGCTGCCCGGCGTCGTTAACGATGACCAGCCCGGCTTCGGCCAGCGCCGGATCGGCATACTGGTCGAACACCGCTGCGAGATTCTGCCAGCCGGCTGGGGCATAGCCAAGGCCAAAGCGGGCGGCGATTTCGCCCGTCAAGCCGCGTTTCTTGAGGTAGTCAATTGCGTGCGGTGTTTGTTTGAGTTGCTGGCGATAGAACTGTGCGGCGCGCTGCATGATTTCGACCAGGCTGGCGGCCTGCCTGGCGCGTTCCGGGTTGGCGGCAGGCCCCTCCGGCACGCTTATGCCCATTTGGCCGGCCAGTTCCCTGATGGCGTCCACATAGCCCAGCCCGGCGTATTCCATCAAAAAACCAATGGCGCTGCCGTGGGCGCCACAGCCAAAGCAATGATAGAACTGCTTGGTGGGCGAGACGGTGAAAGAAGGGGATTTTTCGTTATGAAACGGGCAACACGCCTGATAGTTGGCGCCGGCTTTTTTCAGCGGCACACGGCGGTCTATCACCTCCACGATATCCACGCGGTTCAGCAGCGTCTGGATAAAATCCTGCGGGATCATCTTGTTCCCTCGGGGTGGTGACGTAACAACGCCGCTACGCGGCGGACAATCAGCCCGCCAGCTTGGCTTTGATGTGGATGGAAACTTGCGCCATGTCTGCGCGCCCGGCGAGTTGTGTTTTTAGAAGCGCCATCACCTTGCCCATATCCTTGATGCCGGCAGCGCCGGTATTCGTAATGGCCTGGATGATCAGGCTATCTATTTCTTCAGCGGATGCGGCTTGAGGCATGTAA
GCTTGCAACACGCCGCTTTCGAATTTTTCGATGTCCGCCAGCTCCTGGCGTCCGGCAGCCTCAAACTGGGTAATCGAATCGCGGCGCTGCTTGAGCATTTTGTCGATCACGGCGATGATTTGTGCATCATCCAGTTCGATACGCTCATCCACCTCGCGTTGTTTGATCGCGGCGAGCAATAACCGAATTGCCCCCAGGCGTGCCGCATCCTTGGCGCGCATGGCGGTTTTCATGTCTTCGGTGATGCGTGCTTTGAGACTCATAAGCTTATTACCGGCTGGAACAGCGCCCGGTTTGATGATTAATACATCTTGGGTGGCAGGGTCTGGCTGCGGATGCGTTTGAAGTGACGCTTGACTGCGGCGGCCAGCTTGCGCTTGCGCTCGGCAGTGGGCTTTTCGTAAAACTCGCGTGCGCGCAGTTCGGTCAACAGACCGGTTTTCTCAACAGTGCGCTTGAAACGACGCATGGCAACTTCAAAAGGCTCGTTTTCCTTGACGCGAATGTTCGGCATGAAATCTTGTCTCCAGGGACGGAAAAAACCTCAATTATAACTGAAAAAGCGTTTTTTTCAAGGCTATGCATTGCCCTGTCCGTGTGGCGCGATTAAACTCCTGCCCATGCTTATTCTGGGAATCGAATCTTCCTGCGACGAAACCGGTATCGCCCTATACGACACCGGGCGCGGACTACTGGCGCACGCACTTCATTCCCAGGTTGCCATGCACGCCGAATATGGCGGTGTGGTGCCCGAGCTCGCCTCACGCGACCATATCCGGCGTGCGCTACCGCTGACCCGCCAGGTACTGGCACAGGCAGGGTGCACGTTGGCCGACATTGACGCGATTGCCTATACCGAAGGTCCCGGTCTGGCTGGCGCGCTGCTGGTAGGTGCCGGCATCGCCCATGCGCTGGGCGTGGCGCTGGGGGTGCCGGTGCTGGGGGTACATCACCTCGAGGGGCATTTGCTCTCGGCGCTGATTTCCGATACGCCGCCGCAATTTCCGTTTGTGGCGCTGCTGGTATCGGGCGGGCATACGCAATTGATGCAGGTCGACAGCGTGGGGCGTTACACCACGCTGGGCGATACCCTGGATGACGCTGCGGGCGAGGCGTTCGACAAGACCGCACAACTGCTTGGTTTGGGCTATCCGGGCGGGGCGGCGTTATCGACGCTGGCGCAGACCGGCGACCCGCAGCGCTTCAAGCTGCCGCGTCCGATGTTACATTCGGGCGACCTCAATTTCAGTTTCAGCGGTTTGAAAACCGCGGTGCTCACACTCACGCAAAAACATCCCGGTCCCGCTGACCGCGCCGACATCGCTGCTGCGTTTCAGCTTGCCATGGCCGAGGTGCTGACGGCCAAATCGCTGGCGGCGCTCAAACAAACCCGATCCAGGCGGCTGGTGGTAGCCGGTGGCGTGGGTGCCAACCGGCAGTTGCGCGAGGCCTTGAACGCAGGCGTTAGCAAACTGGGCGGTGCGGTATTTTTCCCGCGCCTGGAGTTTTGTACCGATAACGGCGCGATGATTGCCTTTGCCGGCGCGATGCGCCTGGTGCATGGCGGGCGTGCCGCAGGGGTGTTTACGGTACGGCCACGCTGGGACTTGCAGGAAATCCCGGCACCCCATAATCATCCGGGCACCGTCATGGCTTAAGATGCGGTGCCATGCCGTGCACCCAGCCAAGCAACAGCTTGCCGCCCGGCAGTTGCGCCAGCACCTCCGGGAACAAAACCAGGCCGAACAACAGCGCCAGCACGCAGATCAATAACAGGGTGGAAAACACGCTCTGCGGGCGTAACACCCCCCAGTAGCGCATGGCGAGGAACATGACATCCTTGCGATGCTCTGACATGCGCCGCCAGCGCAATGCCAGCCGCACCACCAAATACACCGCATAGCCGCCGGTGAGCGCGGCGAACACAATGCGGAACACGCCAAACAAATCAATGCTGCCTGCCACAAAGTGCTGGTACAGCCACGACACAAACCCCACCA
GGATGGAAGCGCTTACCGCAATGAGCACATTGAAAAACAGTCGCCGCCGCTCGCGCGACAAATCCTGCTGGCGCTGGATGAAAAAGCTGTCCACTTCCAGCTTAAAGTATTCCACCTTGTCCTGTACCAGCGCAGAAATTTCGCGTTTGGCCACCGCCATGCGTTCATCCAGAACAGCGCCCAGACGATCAGCAGCATAGTCCACCAGCTCGCGCACGTCGTCCTTGGTGAACTGGCGCTGGTTGTGCAATTCGGCAGAAATCTTGTCGAGCTTGGCGTCGATTTCCTGGCTGGCGCCCACGACCACGTCGCGTAATTCCGCACCGACGCTGGCTGCGCCTTCCTTGACCACGCTGCCCAGTTTGTCGCCCGCCAACTCGATGCTGTCTTTCGAGACCTGGGCCAGGCTCTCGCGCGCGTAGTTGATTTCTTTTTCAAACCAGGCCATGTCTGTCCCGGTTTATTCGGAAGAGGTGTCCGCCGGTTTGGCGAAGCGCCCTTCTTCACCCGCCAGCAGGCGGCGGATATTGGTGCGGTGACGCCAAAAAATCAGCGCACTGATGATGCATAACGCGACCACAGGCAGGGCGCCACCGATTAAATAGGTACCGAGTACTGGCGCCAGTGTGGCCGCAGTGAGTGCGGCCAGTGACGAGATGCGGGTGAGGACAAACACGACAAGCCAGCTGGCCAGCGTTGCCAAACCCAGCCACGGCGAAATCGCGAGCAGTATACCCAGCGCGGTGGCCACGCCCTTGCCCCCCTTGAAGCCGAAAAACAGCGGATATAAATGTCCGAGGAACACGGCGACCGCAGCGCCGTAAGTTGCGGCAATTTCCACGCCGTATTGGCTGCCAAAATAACGCGCCAGATACACCGCCAGCCAGCCTTTGGCCATGTCGCCGACGAGGGTCAGCAGCGCCGCTGATTTGCGCCCGGTGCGCAGCATGTTGGTCGCACCCGGATTGCCCGAGCCATGCTTGCGCGGGTCAGGCAGGCCGAATAGGCGGCTGACGATAACCGCGAAGGACAGCGAGCCGATCAGGTAAGCCGATACGATGAAAAATGAAATGAACATCAGGGATTCCGCTTAAGATATAGGATTTTCGCGCTTTGACAACTACGCATGGATATTCTGTTTCTCAAGGATTTCAGAGTCGAGCTCATTATCGGTATTTACGAGTGGGAACGCAAAGTACCCCAGCCGGTATTGCTCGACCTGGAAATCGGCCTGCCCAATAGTCGTGCCGGTGAAACCGACAATGTGGCAGATACCATTGACTATGGCCAGGTTGCCGCGCGTATCAGGGCGGCCTGTGCCGCACTGCGCCCAGCCCTGGTAGAGGCGCTGGCAGAGCATGTTGCACAATTGATACGCAATGAATTTGGCGCGCCCTGGGTCAGGGTCACCGTGACCAAGCTCGCCATCGTGCGCGGCGTCAAGGCGCTGGGCATCACCATCGAGCGCGGTCAGCGCGGATGCGTAATGAGTCATATGCAGCCAGCGCGCTGAATTAGCCGCGCGGATGATGCCTGGCGTGCAGCTGTTTCAGGCGCTCGCGTGCGACGTGCGTGTAAATTTGCGTGGTCGAAATATCCGCATGGCCGAGGAGCATCTGTACCACGCGCAAATCCGCGCCGTGGTTGAGCAGATGCGTGGCGAAGGCATGACGCAAAACGTGGGGCGAGGGCAGGCGCGCCAGCCCGGCCTGCTGCGCGCGGCGCTTGATGAGATACCAGAATGCCTGGCGCGTCATGGCCGTGCCGCGCCGGGTGACAAACAGCGCATCGCTGATCGTGCCTGCCAGTATCTGCGGGCGTGCGCCAGTCAGATAGCGCGCCAGCCAGAGCAGTGCCTCTTCACCCAGCGGTACCATGCGCTCCTTGCCGCCTTTGCCCATCACGCTTAGCACGCCCATGTCCAGACTCACATTTGCCACTCTCAGTGTCACCAGTTCGGAGACGCGCAGGCCGCTGGCGTAGAGGATTTCCAGCATGGC
CTTGTCGCGTAGCCCGAGTGGCTGTGATGTGTCGGGTGCGTTCAACAGCATATCCACGTCGGCCTCCGACAAGCTCTTGGGTAATGAGCGCGGCAGCTTGGGGGTATCGATTTTCAGTGTCGGGTCCAGCACGATACGGCCATCGCGCAGCGCCAGCCGATAGAAGCGTTTCAGTGCGGAGAGCAGGCGCGCAGTGCTGCGCGGGCTGGTTTTACGCGAAAAACGGTATTGCAGATAGGCCTCGATATCCGCCTGGCCAGCGTCCAGCAACAGCGTGCTGCGCAGTGCCTCCAGCCAGGCCGAGAACTGCGTCAGGTCGCGCCGGTAGCTTTGCAGCGTGTTGGGGGAGAGTCCATCCTCCAGCCACAGCAGGTCGCAAAAGCTGTCAAGCGCCTGCTGCGATACCGGATTCATGGTTGAGTAGCCAGTCCTTGTAAGCCAGCGGCGCCCCGCTTGCGGCGTGCATGAAGCCGCCGCGCCCGTTTGCTGCCACCACCCGGTGGCAGGGGATAATGATGGGCAGCGGATTGGCGCCACAGGCCTGACCCACCGCGCGCGGACTCGAATCCAGCCAACGCGCGAGCTGCCCGTAAGTGGTGGTATGGCCGGGCGGGATCGCAGTCAGCGCGCGCCATACCCTGACTTGATGCGCTGTGCCAAGGATTGCCAGCGGCAGGTCAAAGCTGCTGCCGGGGTTATCAAAATAGTGATTCAAGGCAGCGGCGACGCGGCGTGACAGCGGTGAGTCGGGGGCTTGCAAAGGGTAATCCGCAGGCAAAAAATCGATGCCGTGAAGTTTTTCATTGGCCACACTCATGCCGACACAGCCGAATGGCGTGGGCAGCACGGCCTGATAAGGTGATTGAATCGTGCGGCTTTTCAT
Protein sequences of DBSCAN-SWA_1 >NZ_AP021884|622875:631682|629259_629880_-|WP_147070760.1|DBSCAN-SWA MMFISFFIVSAYLIGSLSFAVIVSRLFGLPDPRKHGSGNPGATNMLRTGRKSAALLTLVGDMAKGWLAVYLARYFGSQYGVEIAATYGAAVAVFLGHLYPLFFGFKGGKGVATALGILLAISPWLGLATLASWLVVFVLTRISSLAALTAATLAPVLGTYLIGGALPVVALCIISALIFWRHRTNIRRLLAGEEGRFAKPADTSSE >NZ_AP021884|622875:631682|628446_629247_-|WP_147070758.1|DBSCAN-SWA MAWFEKEINYARESLAQVSKDSIELAGDKLGSVVKEGAASVGAELRDVVVGASQEIDAKLDKISAELHNQRQFTKDDVRELVDYAADRLGAVLDERMAVAKREISALVQDKVEYFKLEVDSFFIQRQQDLSRERRRLFFNVLIAVSASILVGFVSWLYQHFVAGSIDLFGVFRIVFAALTGGYAVYLVVRLALRWRRMSEHRKDVMFLAMRYWGVLRPQSVFSTLLLICVLALLFGLVLFPEVLAQLPGGKLLLGWVHGMAPHLKP >NZ_AP021884|622875:631682|631184_631682_-|WP_147070766.1|DBSCAN-SWA MKSRTIQSPYQAVLPTPFGCVGMSVANEKLHGIDFLPADYPLQAPDSPLSRRVAAALNHYFDNPGSSFDLPLAILGTAHQVRVWRALTAIPPGHTTTYGQLARWLDSSPRAVGQACGANPLPIIIPCHRVVAANGRGGFMHAASGAPLAYKDWLLNHESGIAAGA >NZ_AP021884|622875:631682|630313_631213_-|WP_147070764.1|DBSCAN-SWA MNPVSQQALDSFCDLLWLEDGLSPNTLQSYRRDLTQFSAWLEALRSTLLLDAGQADIEAYLQYRFSRKTSPRSTARLLSALKRFYRLALRDGRIVLDPTLKIDTPKLPRSLPKSLSEADVDMLLNAPDTSQPLGLRDKAMLEILYASGLRVSELVTLRVANVSLDMGVLSVMGKGGKERMVPLGEEALLWLARYLTGARPQILAGTISDALFVTRRGTAMTRQAFWYLIKRRAQQAGLARLPSPHVLRHAFATHLLNHGADLRVVQMLLGHADISTTQIYTHVARERLKQLHARHHPRG >NZ_AP021884|622875:631682|624832_626566_-|WP_147070752.1|DBSCAN-SWA MIPQDFIQTLLNRVDIVEVIDRRVPLKKAGANYQACCPFHNEKSPSFTVSPTKQFYHCFGCGAHGSAIGFLMEYAGLGYVDAIRELAGQMGISVPEGPAANPERARQAASLVEIMQRAAQFYRQQLKQTPHAIDYLKKRGLTGEIAARFGLGYAPAGWQNLAAVFDQYADPALAEAGLVIVNDAGQRYDRFRDRIMFPIVGQRGDVIGYGGRVLDAAEPKYLNSPETPLFQKGNELYGLFQARRAIRDAGRVIVVEGYMDVVALAQHGVEYAVATLGTATTAAHVQKLLRHTDELVFCFDGDAAGRHAAWRALENSLAILSDGSRVGFLFLAPEHDPDSYIRAFGKGAFEALLGGEVVPLSAYLFKELAAQHDLASSEGRSAFLQAAHPLLSQIRAPALALLLRKRCAELANLQLSELDSLWQIKSRQFNPARAPVRAPRQAPASIWHWLLRAILFMPSLARELDASLIAPTDPDADALRAVVELLRVHPNLGTASVIDHFRDNALASILQRASAEIMGWAADLDISAEFADACYQARMQLDKQRGARVTDKPVSALSETEKAALRQLVNARQQVEK >NZ_AP021884|622875:631682|626615_627062_-|WP_147070754.1|DBSCAN-SWA 
MSLKARITEDMKTAMRAKDAARLGAIRLLLAAIKQREVDERIELDDAQIIAVIDKMLKQRRDSITQFEAAGRQELADIEKFESGVLQAYMPQAASAEEIDSLIIQAITNTGAAGIKDMGKVMALLKTQLAGRADMAQVSIHIKAKLAG >NZ_AP021884|622875:631682|627419_628457_+|WP_147070756.1|tRNA|DBSCAN-SWA MLILGIESSCDETGIALYDTGRGLLAHALHSQVAMHAEYGGVVPELASRDHIRRALPLTRQVLAQAGCTLADIDAIAYTEGPGLAGALLVGAGIAHALGVALGVPVLGVHHLEGHLLSALISDTPPQFPFVALLVSGGHTQLMQVDSVGRYTTLGDTLDDAAGEAFDKTAQLLGLGYPGGAALSTLAQTGDPQRFKLPRPMLHSGDLNFSFSGLKTAVLTLTQKHPGPADRADIAAAFQLAMAEVLTAKSLAALKQTRSRRLVVAGGVGANRQLREALNAGVSKLGGAVFFPRLEFCTDNGAMIAFAGAMRLVHGGRAAGVFTVRPRWDLQEIPAPHNHPGTVMA >NZ_AP021884|622875:631682|629925_630312_+|WP_147070762.1|DBSCAN-SWA MDILFLKDFRVELIIGIYEWERKVPQPVLLDLEIGLPNSRAGETDNVADTIDYGQVAARIRAACAALRPALVEALAEHVAQLIRNEFGAPWVRVTVTKLAIVRGVKALGITIERGQRGCVMSHMQPAR >NZ_AP021884|622875:631682|627100_627313_-|WP_124706067.1|DBSCAN-SWA MPNIRVKENEPFEVAMRRFKRTVEKTGLLTELRAREFYEKPTAERKRKLAAAVKRHFKRIRSQTLPPKMY >NZ_AP021884|622875:631682|622875_624762_-|WP_147070750.1|DBSCAN-SWA MANDHEKGEVKTVDVEARRTRLKNLIVLGKERGYLTYAEINDHLPDDMLDAEQIEGVISMINDMGIQVYDEAPDAETLLMSDAAPAVADEDVVAEAEAALSTVDSEFGRTTDPVRMYMREMGSVELLTREGEIEIAKRIEDGLKHMIQAISACPTTIQEILTLVDKVEREEIRIDDFVDGVVAEELEAIASEPEVDISELEEELDEDAEEDDGSALAAANLAQLKIDAMAHFEVIRKVFKRIQATLKKNGFGSPQYLQLQEELSAELMNIRFSAKQVEALCEGLRNLVEEVRSHERDIMEFAVNKSGMPRAHFIKVFPGNESNLDWVTKEINSKKAYSEDLTRYQHTIVERQQRLIALQEKVGIPIKDLKEINRQMSTGEARARRAKREMIEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKMNRISRQILQETGKEPDPELLAEKMEMTEEKIRKILKISKEPISMETPIGDDEDSHLGDFIEDTATLAPIDAAVYGSLQEVTKDILDGLTQREAKVLRMRFGIEMNTDHTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSDRLKSFIDNSGNS |
10 | Vibrio_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
751882 : 760457
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_AP021884|751882:760457|DBSCAN-SWA ATTACGCGCCGACTTCTGCCGTAACGGCCATCGGTGCGACACCAATCGCCGCGCGCACTTTATTTTCAATCTCGTGGGCGACCTCGGGGTGCTCGCGCAGGTATTCGCGCGCGTTGTCTTTGCCCTGGCCGATTTTTTCGCCGTTATAGGCATACCAGGCGCCGGATTTTTCCACCAGTTTGTGCTCCACGCCGAGCTCGATGATTTCCCCCTCGCGCGAGATGCCTTCGCCGTAAAGGATATCGAATTCAGCCTGCTTGAATGGCGGCGCGACCTTGTTCTTGACGACCTTGACGCGAGTCTCGGAGCCGATCACTTCGTCGCCTTTCTTGATTGCGCCGGTGCGGCGGATGTCGAGGCGCACCGAGGCGTAGAATTTGAGTGCATTGCCGCCGGTGGTGGTCTCCGGGTTGCCGAACATGACGCCGATTTTCATGCGGATCTGGTTGATGAAGATCACCAGGGTATTGGTGCGCTTGATGTTGCCGGTGAGCTTGCGCAACGCCTGGCTCATCAGGCGGGCTTGCAGGCCCATGTGCGAGTCGCCCATTTCGCCTTCGATTTCGGCCTTGGGAGTCAACGCCGCCACCGAGTCTATCACCACCACGTCCACCGAGCCGGAGCGCACCAGCATGTCGGCAATTTCCAGCGCCTGCTCACCGGTGTCGGGCTGCGAGATGAGCAGGTCGGAGACATTCACCCCGAGTTTTTGTGCGTATTGCGGGTCGAGCGCGTGCTCGGCATCAATGAACGCTGCGGTGCCGCCCAGTTTCTGCATTTCGGCGATGACCTGCAGGGTGAGCGTGGTTTTGCCGGAGGATTCCGGACCGAAGATTTCAACCACGCGGCCGCGCGGCAAACCGCCGACGCCCAGCGCAATATCTAAGCCCAGGGAGCCGGTGGAGACCACCTGAATATCGCGCACGACTTCGCCATCGCCGAGGCGCATGATGGAACCTTTGCCGAAGCTTTTTTCGATTTGTGCCAGGGCGGCGGCGAGGGCTTTGCTTCTGTTTTCGTCCATGTGTTTTTCCTCAAATTAGTGCGGGATTATGGCATAAACCGTTAGAACCCGTTAGCCACGGGCGAATGGTTTTTGTCTAGTCCAGCAACTCAATCACCCCGCGCAACGCGGCAGCCACGGCGCGCGCGCGAATTTCATCGCGGTCGCCGCATAACCTGCAGGTGGTGGCGAGGCGGGTACCGTCCTGCATCGCCCAGGCTATGCACACGGTGCCGACGGGTTTTTGCGGGGTGGCGCCGCCGGGCCCGGCAATGCCGGAGATGGCCAGGGCGATCTGCGCGCGGCTGTGGGCGAGTGCGCCCTGCGCCATTTCCAGTACGGTGGGTTCGGATACCGCGCCCGATGCTTGCAGCGTGGCGTTTTTCACCCCCAGCATGTCGTGTTTGGCGGCGTTGCTGTAGGTGATGAAGCCGCGCTCATACCAGGCCGAGCTGCCCGGCACGGCGGTGATCAGCATGCCCGCCCAGCCGCCGGTGCAGGACTCGGCGCTGGCGAGCATGATGCCGCGCCGGCTGAGGGCCTGGCCGGTTTGTTCGGCCAGTTGGTAGAGTGCTGCGTCGGTCGGTCTCATGGCAGAATTTTTTGCGCGAGGAACAGTGCCAGCAAAGTGTAGCCAGCCGCCAGCAAGTCGTCGAGCATGACGCCGAAGCCATTTTTCAGGCGCGCATCGAATTGGCGGATGGGAAACGGCTTCCAGATATCGAACAGCCGGAACAGGCCAAAGGCAGCGGCGACCCACAGCGGCGTTTGCGGGGTGGCAGCGAGCACGATCCAGAACGCGGCAATCTCATCCCAGACGATGCCGCCGTGATCCGCCACGCCCAAATCGCGTCCGGTTTTGCCGCAAATCCAGATGCCGGCGACAACCGCGATGCCGATGATGAGGTACAGCTGGGTCGGC
GTGGCGAACCAGGCCAGCAGGTAGTACAGCGGCAGCGCGGCCAGGGTGCCAAACGTGCCCGGCGCCCTGGGTGCGAGTCCGCTGCCCAGACCAAAGGCGAGAAAATAGGCCGGGTGGCGGGTGATGAAACGCCAGTCAGGCGGAAAAGTGGTCATAGCCGGTGTGGCGCATATCCAGAACTTGTCCTTGTGCATCGCGCACTATCAGGCCTGGCTCGGCGCGAATGCTGCCGATGGCAGTGAGCCGCACGCCGAGGCGGGCGGCGATTTCGCCGAGTGCCTTACGGTGTGCGACGGGTGCAGTGAAACACAGCTCGTAGTCGTCACCGCCGCTCAGTACGCAGGCATCAAATTCCGGATGTGCGGCATAGTCATGGACGATTTCACCCAATGGCAAATGCGTATATTCAACGATGGCACCTACACCGGAGCGCGCCAGAATATGCCCCAAGTCAGCCAGCAGGCCATCCGACACATCGATTGCGCTGCGCGCCAGCCCGCGCAAGGCCAGGCCCAGTTCGACACGCGGCGTGGGGGTATATAGGCGCGCTGCCAGGGTGATCAGATCGGCGTCAGTCAGATTGACCCGGCCGTGCAGGGCGGCAAGTGCCAGTGCTGCGTCACCCAGCGTGCCGGATACCCAGATTTCATCGCCCGCCTGAGCGCCGTCGCGACGCAGGGCCTGGTTGGGCGGCACCTCGCCCAGGATAGTGAGGGTGAGGCTCAATGCGCCGCGCGTGGTGTCGCCACCCACCAGGCTCACGCCAAATTGATCGGCACAGCGATACAGTCCCGTTGCGAACGCCGCCAGCCAATCATCGTCTACTTCCGGCAGTGTCAGCGCCAGCGTCGCCCAGCGCGGCGCGGCACCCATCGCGGCGAGATCGGAGAGATTGACCGCGAGGCTTTTCCAGCCGAGCTTTTCCGGGTCGGCATCGGCGAAAAAATGCACATCGGCGACCAGGGTGTCGGTGGAAACGGCGAGCTGCATCCCGGTTGCGGGTTGCAGCAGGGCGCAATCGTCGCCCACTCCCAGCACCGCGCCGGGTGTGGCGCGGGAGAAATGACGCTGAATCAGGCCGAATTCGGAAGTCATGGATGTTGCATTGCCATCCTGGCGGTTAGCTTGCGGTTTTCTGGGTCATCAACCCTGGCGGCGCGGCGCGTTCACTTCCACTGTGCGCACCTCGGCAGCCAGCTTGTCCATCACGCCGTTGACGTATTTGTGGCCATCGGTACCGCCGAATATCTTGGCGAGTTCGACCGCTTCGTTGATGACGACGCGATACGGCACTTCCAGATGATGCAGCAGCTCCTGCGTGCCCAGCAGCAGAATGGCATGCTCCACCGGGCTCAGTTCCGCAGGCTTGCGATCCAGATAGGCAGCGAGCCGCAGGTCCAGCGCCGGTGCTTCATCGACCACGCCATTCAGTAGTGCCAGGAACATTTTTTCATCAATGTTGCGATACACGGGATCGTCGCGCAGCTGTTTGACGATATCGGCAGTCGGCTGATGGTTGAGCAGCCACTGATACACGCCCTGCACTGCGAATTCGCGGGCCTTGCGACGATTGCCGCTCATAGCTGTTTGAGCAGGTTGGCCATTTCGATCGCGCATTCGCCCGCTTCGGCGCCTTTTACCGACATGCGCGAGGTGGCCTGGTGATCGGTATCGGTGGTCAATACGCCATTGGCAATAGGCACGCCGGTATCGAGCTGGATACGCGCGATACCGTTGGCCATTTCGTTGGCAACCACCTCGAAATGATAGGTATCGCCGCGCACCACGGCGCCCAGCGCCACCAGCGCGTCGAATTTTCCGCTCATGGCCATTTTGCGCAGCGCCAGCGGAATTTCCAGCGCACCCGGCACGGTGGCGAGGAGCAGGTTGCCGGTTTTCACGCCGCGTTTGCCAAGTGCGGTGGTGCAAGCCGCCAGCAAACCCTCGCAAATGTCCATGTTGAAGCGGCTCATGACGATGCCGATACGCAATGCGCTGCC
GTCGAGACTGGATTCAAGTTCGGGAATATCGTCGTAGCTTGCCATGATTTATTTCTCCTGATGAGACTGGATTGCATGTATTGCCTGTGCAAGGCGGGGGCGGGTTCAGGTGTTCTCGTCGTAGCCTGTCACTTCCAGATCGAATCCCGCCAGCGACGGCATTTTGCGCTGAGTAGCCAGCAGGCGCATTTTGCCCACGCCGACGTCTTTCAAAATCTGCGCGCCAATGCCATGGTTGCGCGCGTCCCATTTTTGCGGAAGCCTGACACCGGCTTCAGGCATAGCGCGGGTGAGCAGTTCTGCTGCGCTCTCCGGACGGTGCAGCAGGACGACAACGCCCTTGCCCACCGCAGCGATTTTTGCCAGGGCCTGGTTGACACTGTAGGCATGGGTACGGCTGCCGACTTCGAGCATGTCCATCACCGACACTGGCTCGTGTACCCGCACCAGCGTTTCGCTGGCGGCGCTGATTTCGCCCTTGACCAGGGCCAGATGAGCTGCCCCGGAGATTTTTTCACGGTAGGCGATGAGCTGGAATTCGCCGTAAACCGTCTCGATACAGCGGCTGCCTGCACGCTCCACCAGGCTTTCATTGTGGCTGCGGTAGTGGATCAGGTCGACGATTGCGCCAATTTTCAGGCCGTGAATTTTGGCGTATTCCAGCAAATCCGGCAGACGCGCCATGGTACCGTCATCCTTGAGGATTTCGCAAATCACTGCGGCAGGTTCCAGGCCGGCCAGCCCGGCCAGATCGCAGCCTGCCTCGGTGTGGCCGGCACGAATCAGCACGCCGCCGGGTTGGGCGCGCAGCGGAAAAATGTGGCCCGGCTGGATGATGTCGGCGGCCTTGGCGTGTTTGGCGACGGCGGCCTGAATGGTAAGCGCCCGGTCGGCGGCGGAAATGCCGGTGGTAACCCCTGTGGCGGCTTCGATGGAGACAGTGAAAGCAGTGCTATAGGGCGTCTGGTTATCGGCCACCATCTGGCGCAAGCCCAGTTGCTTGCAGCGCTCGTCGGTCAGCGTCAGGCAAATCAGGCCGCGTCCGTGCTTGGCCATGAAGTTGATCGCTTCGGGGGTGGCGAATTCGGCCGCCATCACCAGATCGCCCTCGTTTTCGCGGTCTTCCTCGTCCACCAGTACGACCATTTTTCCGGCTTTCAGGTCGGCGATGATGTCTTCGATAGGGCTCAGGCTCATGTTTGATCTCGATAATTAAGGATGCGTTCGGCGTAGCGCGCCATCATGTCCACTTCCAGATTGACCCGGCTGGCAGGTTGGAGCGTATGCAGATTGGTATGTTCCAGCGTATGCGGAATAAGGTTGATGCTGAAACGGTCGCCGTCTACGCGATTAACGGTGAGGCTCACACCGTTGACGGTGATGGAACCTTTGCTGACGACAAAGCGGGCCAGATCGCCGGGAGCTCGGATGACCAGTTCGAAGCAATCCCCGGCCGGCGCGAAATGCAGCACCTCACCCACCCCGTCCACATGGCCGGAAACCAGGTGTCCGCCGAGACGGTCGGACAGGCGCAGCGCCTTTTCCAGATTGACCAGGCCGTGTTCAGGAAAGCCCGTCGTACAGCGGAAAGTCTCTGCCGATACGTCTACCGAGAAGCTCGCCGCACCCAGCGCCACGACGGTCAGGCACACACCGTTGCAGGCGATGGAGTCACCTGGCGCCACGTCGCTCAAATCCAGATGGGCGGCGTCGATCACCAGGCGTGCATCCGCGTTTCTGGGTTCCACTGCCGCCACCTTGCCTACTGCCTGAATAATGCCTGTAAACATGTTAGTTCTTTTCAAATTGTGCAGTGATGCGTATATCGGCACCGACCTGACGTATGTCGCGCAGCACCAGTTTGCGGCGCTCTTGCATCTGCGCCGGTTCGGCCAGGGAAAACAGGCCACGCGCCGTGTCACCCAGCAATACCGGGGCGACATACATCACCCATTCATCCACAAATCCAGCCGCGATCAGCGCGCCGTTGAGTGTGGCAC
CGGCTTCGGTCATCACTTCATTTATCCCGCGCTGCGCCAGGAGTGACAATAGCGCCCCCAGATCAACCTGCCCGGCATCACCCGGCAGACAGCGGATTTCTGCCCCGGCGGCTTCCAGTCCGGCGCTGCGCCGGGCGTCCGGTTCGGCACAGGCAATCAGGGTCGGGGCACCGCCCAGTATATTCGCCGAGGGCGGGGTTTGCAGGCGCGAATCGACGATGACCTTGAGCGGCTGGCGCGTGGTTTCCACTGCGCGCACATTCAGTTCCGGATTATCCGCCAGCACCGTACCGATACCGGTCAGAATGGCGCATGAACGCGCACGCAGCCGGTGCACGTCGCGGCGCGCAGGCTCTCCGGTGATCCATTTGCTGGCACCGCCGGATAAGGCCGTTTTGCCATCCAGCGAACTGGCGGTCTTGATGCGCAGCCACGGATGCCCCATGACCATGCGCTTGATAAAGCCCGCGTTGAGTTCACGCGCCTGCGCTTCGAGCAGGCCGCATTCAGTCGCAATCCCGGCCTTTTGCAGCAGGGCCAGACCATTGCCCGCCACCTGGGGGTTGGGATCCTGCATGGCGGCCACGACGCGTGCCACGCCTGCGGCTATCAACGCCTCGGCGCAGGGCGGGGTGCGCCCATGGTGGCTGCACGGCTCAAGCGTGACGTACACCGTGGCACCGCGCGCTGCGTCGCCGGCCTCGCGCAGCGCGTGGATTTCGGCATGCGGCTGTCCGGCCTGCTGGTGCCAGCCGCTGCCGACCATCGCTCCATTCCTGACAATCACACAGCCCACGCGCGGATTCGGGCTGGTAGTGTACAGGCCGTGCTCCGCAAGCTGCAGCGCGCGCGCCATATGGATGTAATCTGTCTGCGAAAACACAGGTTTATTTGTCGAAGTCCTTGAGCACGTCGCGGAAGTCGCCCACATCCTGGAAGCTCTTGTACACTGAGGCAAAGCGGATGTAAGCGATCTTGTCCAGGCGCTTCAACTCGTTCATCACCATCTCGCCAATCTGGCGCGCGGGCAATTCACGTTCGCCCAGCGACAGCACTTGTTTGACGATGCGCCCAATCGCCGCATCCACATATTCGGTCGGTACCGGGCGCTTGTGCAGCGCGCGGCGAAAGCCCTCGTGCAGTTTTTCCTGGCTGAATTCCTGGCGCACGCCGTTGCTTTTAATCACCTGCGGCAGGCGCAGTTCTATGGTTTCGTAGGTGGTAAAGCGCTTGTCGCAAGACGTGCAGCGGCGCCGGCGACGAATCGAGTCGCCGGCTTCGGAAAGGCGCGAATCAACGACCTGGGAGTCGAACGCGCTGCAGAACGGACATTTCATGAAGACGGTTTGTAGCGGGTAAGGTAATAGTCAGCAGCAGGCAGCCTGGTTGGGCGGCACGCTACCCGTTCATCCGTACACCGGATATTTCTTGCACAAGGCTTGCGCGGCAGTGGCCGCGCGCCCAATGACCGCCTCGTCATTGGGCGCGTCGAGCACATCGGCAATCAGGTGCGCCAGTTGCTCGGCTTCCAGTTCCTTGAAACCGCGTGTGGTCATTGCCGGCGTGCCGATGCGGATGCCGGAGGTGACGAAGGGTTTTTGCGGATCGTTGGGGATGGCGTTTTTGTTGACCGTGATGTGCGCCCGTCCGAGCGCGGCTTCTGCCTCCTTGCCGGTAATGCTTTTGGCCTGCAAGTCCACCAGAAACAGATGCGAATCGGTGCGGCCGGAGACAATGCGCAGGCCGCGTTCCTGCAGCACCTTCGCCATCACGCGGGCGTTATCGATCACCTGCTCCTGGTACAACTTGAAGTCCTTGCCCATTGCCTCCTGAAACGCCACTGCCTTGGCGGCGATGACGTGCATCAGCGGACCGCCCTGCAAGCCCGGGAAGATGGCGGAATTGATCGCCTTTTCGTGTTCGGCTTTCATCAGGATAATGCCGCCTCTGGGACCGCGCAGCGTCTTGTGCGTGGTGGAGGTTACCACATCCGCATGCGG
TACCGGATTGGGATACACCCCCGCCGCGATCAGTCCGGCGTAGTGCGCCATATCCACCATGAAAATCGCGCCGACTTCCCTGGCTATTTTGGCAAAGCGCTCGAAGTCGATGTGCAGCGAATACGCCGAGGCGCCAGCGATGATGAGTCTGGGCTTATGCTCACGGGCAAGCGCTTCCATACGCGGGTAATCGATTTCTTCTTTTTTATTCAGGCCGTAGGCCACGGCGTTGAACCACTTGCCCGACATGTTGAGCGCCATGCCGTGGGTGAGGTGTCCGCCTTCAGCCAGGCTCATGCCCATGATGGTATCGCCCGGCTTGAGGAAGGCCAGGAATACCGCCTGGTTGGCCTGCGAGCCAGAATGCGGCTGCACGTTGGCGGCTTCCGCACCGAATAATTTCCTGATACGGTCAATTGCCAGTTGTTCGGCGATATCCACATATTCGCAGCCGCCGTAGTAGCGCTTGCCGGGATAGCCTTCCGCGTATTTGTTGGTCAGCACCGAACCCTGGGCTTCCATTACCGCCGGACTGGCATAATTTTCCGAAGCGATCAGCTCGATGTGATCTTCCTGGCGGCCGCGTTCTGCCTCCATGGCTTTCCAGAGAGCGGGATCGGTTTGGGCGAGAGTGTGCTGAGGGTTAAACAT
Protein sequences of DBSCAN-SWA_2 >NZ_AP021884|751882:760457|755911_757003_-|WP_147074504.1|DBSCAN-SWA MSLSPIEDIIADLKAGKMVVLVDEEDRENEGDLVMAAEFATPEAINFMAKHGRGLICLTLTDERCKQLGLRQMVADNQTPYSTAFTVSIEAATGVTTGISAADRALTIQAAVAKHAKAADIIQPGHIFPLRAQPGGVLIRAGHTEAGCDLAGLAGLEPAAVICEILKDDGTMARLPDLLEYAKIHGLKIGAIVDLIHYRSHNESLVERAGSRCIETVYGEFQLIAYREKISGAAHLALVKGEISAASETLVRVHEPVSVMDMLEVGSRTHAYSVNQALAKIAAVGKGVVVLLHRPESAAELLTRAMPEAGVRLPQKWDARNHGIGAQILKDVGVGKMRLLATQRKMPSLAGFDLEVTGYDENT >NZ_AP021884|751882:760457|753472_753961_-|WP_147074508.1|DBSCAN-SWA MTTFPPDWRFITRHPAYFLAFGLGSGLAPRAPGTFGTLAALPLYYLLAWFATPTQLYLIIGIAVVAGIWICGKTGRDLGVADHGGIVWDEIAAFWIVLAATPQTPLWVAAAFGLFRLFDIWKPFPIRQFDARLKNGFGVMLDDLLAAGYTLLALFLAQKILP >NZ_AP021884|751882:760457|753941_754901_-|WP_147074507.1|DBSCAN-SWA MTSEFGLIQRHFSRATPGAVLGVGDDCALLQPATGMQLAVSTDTLVADVHFFADADPEKLGWKSLAVNLSDLAAMGAAPRWATLALTLPEVDDDWLAAFATGLYRCADQFGVSLVGGDTTRGALSLTLTILGEVPPNQALRRDGAQAGDEIWVSGTLGDAALALAALHGRVNLTDADLITLAARLYTPTPRVELGLALRGLARSAIDVSDGLLADLGHILARSGVGAIVEYTHLPLGEIVHDYAAHPEFDACVLSGGDDYELCFTAPVAHRKALGEIAARLGVRLTAIGSIRAEPGLIVRDAQGQVLDMRHTGYDHFSA >NZ_AP021884|751882:760457|754949_755387_-|WP_147074506.1|DBSCAN-SWA MSGNRRKAREFAVQGVYQWLLNHQPTADIVKQLRDDPVYRNIDEKMFLALLNGVVDEAPALDLRLAAYLDRKPAELSPVEHAILLLGTQELLHHLEVPYRVVINEAVELAKIFGGTDGHKYVNGVMDKLAAEVRTVEVNAPRRQG >NZ_AP021884|751882:760457|755383_755851_-|WP_147074505.1|DBSCAN-SWA MASYDDIPELESSLDGSALRIGIVMSRFNMDICEGLLAACTTALGKRGVKTGNLLLATVPGALEIPLALRKMAMSGKFDALVALGAVVRGDTYHFEVVANEMANGIARIQLDTGVPIANGVLTTDTDHQATSRMSVKGAEAGECAIEMANLLKQL >NZ_AP021884|751882:760457|756999_757596_-|WP_147074503.1|DBSCAN-SWA MFTGIIQAVGKVAAVEPRNADARLVIDAAHLDLSDVAPGDSIACNGVCLTVVALGAASFSVDVSAETFRCTTGFPEHGLVNLEKALRLSDRLGGHLVSGHVDGVGEVLHFAPAGDCFELVIRAPGDLARFVVSKGSITVNGVSLTVNRVDGDRFSINLIPHTLEHTNLHTLQPASRVNLEVDMMARYAERILNYRDQT >NZ_AP021884|751882:760457|759212_760457_-|WP_147074500.1|DBSCAN-SWA 
MFNPQHTLAQTDPALWKAMEAERGRQEDHIELIASENYASPAVMEAQGSVLTNKYAEGYPGKRYYGGCEYVDIAEQLAIDRIRKLFGAEAANVQPHSGSQANQAVFLAFLKPGDTIMGMSLAEGGHLTHGMALNMSGKWFNAVAYGLNKKEEIDYPRMEALAREHKPRLIIAGASAYSLHIDFERFAKIAREVGAIFMVDMAHYAGLIAAGVYPNPVPHADVVTSTTHKTLRGPRGGIILMKAEHEKAINSAIFPGLQGGPLMHVIAAKAVAFQEAMGKDFKLYQEQVIDNARVMAKVLQERGLRIVSGRTDSHLFLVDLQAKSITGKEAEAALGRAHITVNKNAIPNDPQKPFVTSGIRIGTPAMTTRGFKELEAEQLAHLIADVLDAPNDEAVIGRAATAAQALCKKYPVYG >NZ_AP021884|751882:760457|751882_752905_-|WP_147074510.1|DBSCAN-SWA MDENRSKALAAALAQIEKSFGKGSIMRLGDGEVVRDIQVVSTGSLGLDIALGVGGLPRGRVVEIFGPESSGKTTLTLQVIAEMQKLGGTAAFIDAEHALDPQYAQKLGVNVSDLLISQPDTGEQALEIADMLVRSGSVDVVVIDSVAALTPKAEIEGEMGDSHMGLQARLMSQALRKLTGNIKRTNTLVIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRTGAIKKGDEVIGSETRVKVVKNKVAPPFKQAEFDILYGEGISREGEIIELGVEHKLVEKSGAWYAYNGEKIGQGKDNAREYLREHPEVAHEIENKVRAAIGVAPMAVTAEVGA >NZ_AP021884|751882:760457|752981_753476_-|WP_147074509.1|DBSCAN-SWA MRPTDAALYQLAEQTGQALSRRGIMLASAESCTGGWAGMLITAVPGSSAWYERGFITYSNAAKHDMLGVKNATLQASGAVSEPTVLEMAQGALAHSRAQIALAISGIAGPGGATPQKPVGTVCIAWAMQDGTRLATTCRLCGDRDEIRARAVAAALRGVIELLD >NZ_AP021884|751882:760457|758693_759143_-|WP_147074501.1|DBSCAN-SWA MKCPFCSAFDSQVVDSRLSEAGDSIRRRRRCTSCDKRFTTYETIELRLPQVIKSNGVRQEFSQEKLHEGFRRALHKRPVPTEYVDAAIGRIVKQVLSLGERELPARQIGEMVMNELKRLDKIAYIRFASVYKSFQDVGDFRDVLKDFDK >NZ_AP021884|751882:760457|757597_758737_-|WP_147074502.1|DBSCAN-SWA MWATSATCSRTSTNKPVFSQTDYIHMARALQLAEHGLYTTSPNPRVGCVIVRNGAMVGSGWHQQAGQPHAEIHALREAGDAARGATVYVTLEPCSHHGRTPPCAEALIAAGVARVVAAMQDPNPQVAGNGLALLQKAGIATECGLLEAQARELNAGFIKRMVMGHPWLRIKTASSLDGKTALSGGASKWITGEPARRDVHRLRARSCAILTGIGTVLADNPELNVRAVETTRQPLKVIVDSRLQTPPSANILGGAPTLIACAEPDARRSAGLEAAGAEIRCLPGDAGQVDLGALLSLLAQRGINEVMTEAGATLNGALIAAGFVDEWVMYVAPVLLGDTARGLFSLAEPAQMQERRKLVLRDIRQVGADIRITAQFEKN |
11 | Staphylococcus_phage(42.86%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those associated with specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
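The keyword-based highlighting described for the homologous phage analysis can be illustrated with a minimal sketch. This is not the DBSCAN-SWA implementation: the function name `matched_keywords`, the case-insensitive substring matching, and the example annotation strings are all assumptions made for illustration. Only the keyword list itself comes from the record above.

```python
# Keyword list taken from the record; stored lowercase for matching.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def matched_keywords(annotation: str) -> list[str]:
    """Return the phage-related keywords found in a protein annotation.

    Assumption: matching is a case-insensitive substring test. The
    actual rule used by the tool is not stated in the record.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


def is_highlighted(annotation: str) -> bool:
    """A protein would be colored if at least one keyword matches."""
    return bool(matched_keywords(annotation))


if __name__ == "__main__":
    # Hypothetical annotation strings, not taken from the record.
    for note in ("Phage tail fiber protein", "tRNA", "DNA polymerase"):
        print(note, "->", matched_keywords(note))
```

Substring matching is the simplest choice but can over-match (for example, "head" inside an unrelated word), so a word-boundary test may be preferable for real annotations.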
DBSCAN-SWA_3 |
882982 : 892137
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_AP021884|882982:892137|DBSCAN-SWA CCTAGGCGGGCATCAGCACGGTCAGCCCCCCCATGTAGGGACGCAACACTTCGGGAAGCGCCACCGATCCATCCGCCTGCTGATGATTTTCCAGAATCGCGACCAGGGTGCGCCCTACCGCCAGCCCGGAGCCGTTCACGCTGTGCAGCAGTTCCGGCTTGCCTTTTTCACCTCTGAAGCGCGCTTGCATGCGGCGTGTCTGAAACGCCTCGAAATTGCTGCACGAGGAAATTTCGCGATAAGTATTTTGCGCCGGCAGCCACACTTCCAGATCGTAGGTCTTGGCGGCAGAAAAACCCATGTCGCCGCCGCACAGCGCCATTTTCCGGTAGGGCAGCCCGAGTGCTTGCAGGATGGCCTCGGCATGGCCGGTGAGTGCTTCCAGCGCGGTGTAGGATTGCTCCGGTTCGACCAGTTGCACCAGCTCCACCTTGTCGAACTGATGCTGGCGGATCATGCCGCGGGTGTCGCGGCCGTAGGAACCGGCCTCGGAGCGGAAGCAGGGGGTGTGGGCGACGAATTTCAGCGGCAGTTGCTCGCGCGCCACGATCGCGTCGCGCACCATGTTGGTCAGCGGCACTTCGGCGGTGGGGATGAGGTAGAGTTTTTCCGCATCCGCACGCGGCACGTGAAACAGATCTTCCTCGAACTTGGGCAACTGCCCGGTACCGCGCATGGAGTCGGCGTTGACCAGATACGGCACATACACTTCGGTGTAGCCATGCACAGCCGTGTGGGTGTCCAGCATGAACTGCGCCAGCGCCCGGTGCAGCCGCGCCAGCCCGCCGCGCAGCAGTGAAAAGCGCGCGCCGGCCAGTTTGCTGGCGGTCTCGAAATCCAGCCCCAGCGCGGTACCTACGTCCACGTGATCTTTCACCGCAAAATCAAACACGCGCGGTGTGCCGACACGTGCGATCTCTACGTTGTCGGCGTCGGATTTACCCGTCGGCACCGATGCATGCGGCAGGTTGGGAATGGTCATCAACAGCGCATTGAGCCGGGCTTGCAGGGCTTCCAGCGCGGATTCCGCGGCTTTCAGTTCGGCGCCGAGATTCGCCACTTCCGCCATGATGGTGGAGACATCCTCGCCCTTGGCCTTGGCCATGCCTATCTGTCTGGAGCTGGCGTTGCGCCTGGCCTGCAGCTCCTGGGTGCGGGTTTGCAGTTGTTTGCGCTCGGCTTCCAGGCGCTGGAATTCGGCAGTGTCCAGGGTGTAGCCGCGCATGGCAAGGCGTTGCGCCACGTCGTCGAGGTCGTTGCGGAGGTGTTGAATGTCTAACATTATTTTTGCCCTGTTTTCTTGTTGGCTTGTTCATCCAGCTTGCGCAAATACGCCAGCCGTTCGGCGATCTTGCCTTCCAGCCCGCGCGGGGTTGGTGCGTAAAAGTGCGCGTTGACACCTTCCGGTAAATAGTCCTCCCCGGCGGCGTAGGCGTCCGGTTCGTCGTGCGCGTAGCGGTAGGCGTGGCCGTAGCCCAGTTGTTTCATCAGTTTGGTGGGGGCGTTGCGCAGGTGCACCGGCACCTCGCGCGATTTGTCCGCCGCCACAAAGCTGCGGGCGTTATTGTACGCCACGTACACGGCGTTGCTCTTGGGCGCGCAGGCGAGATAAATCACCGCCTGCGCCAGCGCCAGTTCGCCTTCCGGACTGCCCAGGCGCTGGTAGGTTTCCACCGCGTCCAGGGTCAGGCGCAGCGCGCGTGGGTCGGCCAGACCGATGTCCTCGCTGGCCATGCGGATCAGGCGGCGGCCGACGTACAGCGGATCGGCACCGCCGTCGAGCATGCGCACCATCCAGTACAGCGCGGCGTCGGGATGGGAACCGCGCACCGATTTGTGCAGCGCGGATATCTGGTCGTAGAAGTTGTCGCCGCCCTTGTCAAAACGGCGTGCGCCGCGCGCCAGCGTGGTCTGGATGAAATCCT
CGTCAATCTCATGACGGGCGGCATCGAGTGCGGCATTGGCGGCTTGTTCCAGCAGGTTCAACAGGCGCCGCGCGTCGCCGTCGGCGTAGCCGGTGAGCTGGGCGCGCGCCGCCCCGGTGATGGCAATGTCCGGATAGGTGCTGATCCGCGCGCGCTCCAGCAGGGCGGCGAGGTCGGTTTCCACGATGGGCTTTAACACATACACCTGGGCGCGCGAGAGCAGCGCGCTATTGACCTCGAACGAGGGATTCTCGGTGGTGGCGCCGATGAAAGTAATCAACCCGGCTTCAACGAACGGCAGGAAAGCGTCCTGCTGGGATTTGTTGAAGCGGTGCACTTCGTCCACAAACAGCAGGGTGCGGCGCCCCTGGCCTTGCATCATTTCGGCGCGCGCCACCGCCTCGCGGATTTCCTTGACCCCCGAGAGCACGGCGGACAGCGCGATGAACTCCATGTCAAAACCGTGGCTCATCAGCCGTGCCAGTGTGGTCTTGCCTACGCCCGGCGGCCCCCACAGGATCATCGAGTGTGGCTTGCCGGATTCGAATGCGACGCGCAGCGGCTTGCCGGGGCCGAGCAAATGCGTCTGTCCGATCACTTCGTCCAGATTGCGCGGCCGCAGCCGTTCGGCCAGCGGCGCGCTGTCCAGCGGATGATCGAACAGGTCGGCGTGGCTCACTTGCCGTCGGAGATGACGTCCACGCCCTGGGGCGGGGTGAAATGGAAATCGCTGGCGGGAAGGGCCGGGTTACGTTCCAGCCCGGCGAATTTCAGCACCGTGGTCTGGCCGAAGTTGTCCTTGATCTCCATCGCCACCAGCGTGTTTTTGCTGAATCCCATGCGCACGTTCTCAAACGCGCTTTCCTTGTCGCGCGGGCGCGCGTCCAGCCATTCCAGACCGTCGCGGCTGCCGGCGTCCGTGATCGTGTAAAACCTGCCGATGTCCTTGCTGCCCGCCAGCAAGGCCGCCGGGCTGCTGCCCAGCGCCTGGCCCAGTTTCTTGATGGTGACCTGTTGCAGATCGGCGTCGTACAGCCAGATTCTTTTGCCGTCGCCGACGATTATCTGTTCATAGGGCTTCTCATACACCCAGCGGAACTTGCCGGGACGCGCGAAAGCCATGGTGCCGGACGACTGCTGGCGCGCGTGTCCGTTTTTGTCCAGCACGGTCTGGGTGAACGTGGCGCGCGCGGTCTGGGTATCCGCGACGAACGCCTTGAGCGCGTCGATGCTGGATGCTGCGGCGCTGGCAGAGAAGATCAAGAGTGCGGTAAAGAGACTGAGTTTTTTCATTGGGACTTTAAGTCGAATGGACAGGATTTTTAATGCTTAACAAGAGAGTTGGTCTGGTTTTAGTGCAGAAAATTCAACACCCCCCAAAATTCATATTTATTCTGAATGATCCGGTCCTTATCCTGTTAATCCTGTCTAATCACATTGTTCATTCCCGGTTAGGTGCAATCACTTCGCGGTTGCCGTTGCTCTGCATCGGCGTCACCAGACCCGCCTGTTCCATTGCCTCGATCAGCCGCGCGGCACGGTTGTAGCCGATGCGCAGGTGACGCTGCACGGCGGAGATGGAGGGGCGCCGGGTTTTCAGCACGATGGCGACGGCTTCGTCATAGAGCGGGTCGCTTTCGGCGTCACTGCCGCCGACCGCGCTTTCGCCGCTGCCTTCATTGTCTTCCGGGGTGTCGAGGATGCCGTCAATGTAATCGGGCTCGCCCAGCTGTTTCAGGTATTCCACGACTTTATGCACTTCTTCGTCGGCCACAAAAGCGCCGTGCACGCGCTGCGGATAGCCGGTGCCGGGCGGCAGGTAGAGCATGTCGCCCTGGCCGAGCAGGGCTTCTGCGCCCATCTGGTCGAGGATGGTGCGCGAGTCGATTTTGCTCGATACCTGGAACGCGACGCGGGTGGGGATGTTGGCCTTGATCAGGCCGGTAATCACGTCCACCGACGGGCGCTGCGTGGCCAGGATCAGATGCACCCCGGCGGCG
CGGGCCTTCTGCGCCAGCCGCGCAATGAGCTGTTCGACGGCCTTGCCTACCACCATCATCATGTCAGCCAGCTCGTCGATCACCACCACAATCATGGGCTGCTCTTCCAGCGGCTCGGGATTATCCGGGGTCAGGCTGAACGGGTGGGTGAGCGGGGTGGCGGCCCTTTTCGCGTCGCGCACTTTCTGGTTGTAACCAGCCAGGTTGCGCACGCCGAGTGCGGACATCAGCTTGTAGCGGCGCTCCATCTCGGCCACGCACCAGTTGAGCGCGGCGGCGGCCTGACGCATGTCGGTGACTACCGGGGCGAGCAGATGCGGAATGCCCTCGTATACCGACAGCTCGAGCATTTTCGGGTCGACCAGAATCAGTCGCACACGGCTGGCGTCCGCTTTGTACAACAGGGAAAGGATCATCGCGTTGATGGCAACTGATTTGCCCGAGCCGGTGGTGCCCGCCACCAGTACGTGCGGCATTTTGGCCAGATCGGCCACCACCGGGTTGCCGGCTATGTCCTTGCCCATCGCCATTGCCAGCGGGCTGGCCATGGCGTGGTACACCTTGGCGGAAAGGATTTCCGATAACCGCACGATCTCGCGCTTGGGGTTGGGGATTTCCAGCGCCATGGTGGTCTTGCCGGGGATGGTTTCCACCACGCGGATGCTCACCACCGACAGCGCGCGCGCCAGGTCTTTGGCCAGGTTGACGATCTGGCTGCCCTTGACCCCGGAGGCCGGCTCGATCTCGTAGCGGGTGATGACCGGGCCGGGCAGAGCGGCGACCACGCGCGCCTGCACGTTGAATTCGGCCAGCTTGCGCTCGATCAGGCGCGAGGTGAATCCCAGGGTTTCCGCCGACAGGCTCTCCACATGGGCAGTCGCAGCATCCAGCAAATGCAGTGGCGGCAGCGGGGAATCCGGCATTTCGGCAAACAGCGGCACCTGCTTTTCCACCTGCACGCGTTCCGATTTGATAATGGTGGGCGGCGGGGTTTCGATGACCACCGGAGCGCGGTCCATATTGCGGCGCTTTTCTTCCAGTACAATCTCGTCGCGCCGCAGCGTGGCGGCTTGACCCATGCGCCGTTCGCGCCGCGCGGACCAGGTCTCGATTGCCCATAACCAGCTGCGTTCGAGGTATTCCCCGGTGGATTCCACTAGCGCCAGCCAGGAGATGCCGGTAAACAGGCTCAAGCCCGCGGCAATCAGCACCAGCAAGGCCAGCGTGCCGCCGGTAAAGCCCAGCAAATGGGTGACTGCTTCTCCCACAACCGCTCCCAGAATGCCTCCCGGGTGCTCCGGTAGTGCTATGTGCAAGCTATACAGACGCTGCGATTCCAGCCCGGCGCTGGACAGCAGCAGCAACAAAAACCCGGCACTGCGGATATATAACGGGCGGCGGTCGGGCGCCTCGCTAATCTCCAGTTTGCGATATCCCCACCAGGTGGCATACAGCGCCAGCATCACCAGCCACCAGGTGGAGAGCCCGAACAGTGCAAGCAGAATGTCGGCCAGATACGCCCCCAGCGTGCCGCCCATATTGTGCACACTGCCTTGTCCGCTGTGCGACCAACCCGGATCAGACTGGGTGTACGTGAACAGGATGACCGCCAGGAACGCCGCCAGCGCCACGCTCAGCAGCCAGCCGACCTCGCGCAGCAGGCCGGTGAGCCTGGGCGGGAGCGGTTTGCGGACGGATTTGGGCGTATTGCGCCGGGGAATGGACATGGTCAGCAATTTAATCGGAAGGCAAAATTATAACTTGAACCTTGTCCAGTCCGTGACCATCTTCCATACTGTTCCAGTTTTCCAGTTTTCAATCCCGCTACGGAGTACTACATGACCACCCCTCAACATCACCGTCTCATCATCCTCGGCTCCGGCCCCGCAGGCTATTCCGCCGCCGTTTACGCTGCGCGCGCCAACCTGAATCCGGTCGTCATTACCGGCATGGCGCAAGGCGGTCAGCTGATGACCACCACCGATGTGGACAACTGGCCT
GCCGACGCCGACGGCGTGCAGGGGCCGGAGTTGATGGCGCGTTTCGAGAAGCACGCGCGCCGCTTCAACACCGAAATTATTTTCGACCACATCCACACTGCCAAACTGACCGACAAGCCCATCGCGCTGGTCGGCGACCAGGGTAGCTACACCTGCGATGCGCTGATTATCGCCACCGGCGCGTCGGCCATGTATCTGGGGCTGGAATCCGAGCAGGCGTTCATGGGCAAGGGCGTATCCGGCTGCGCCACCTGCGATGGATTTTTCTACCGCAATCAGGACGTGGCGGTGATCGGCGGTGGCAACACTGCCGTGGAAGAAGCGCTGTATCTGTCCAACATTGCGCGTCATGTCACCGTGGTGCATCGCCGCGACAAGTTCAAGTCGGAAAAAATTCTCGCCGATCATCTGATGGAGAAGGTCAAGGAAGGCAAGATCAGCGTGGAGTGGAACAGCGAACTGGACGAGGTGCTGGGCGACAAGACGGGTGTGACCGGCATGCGCATCAAGTCCACCGTGGATGGCAGCACCAGGGATATCGCCCTGACCGGCGTGTTCATCGCCATCGGCCACAAGCCCAACACCGATATTTTCACTGGCCAGATCGCGATGGAAGGCGGCTATATCGTCACCCAGGGGGGCAACAAGGGCAATGCCACCGCCACCAGTGTTCCCGGCGTGTTTGCCGCGGGTGACGTGCAGGATCACATCTACCGCCAGGCGGTGACCAGCGCCGGTACCGGCTGCATGGCCGCGCTGGACGCCGACCGCTATCTGGAAAGCCTCGGCAAGTAATCTCCCCATGGCCGGGCGTGTGCCTCCAGCGGACGAAGCCGCACTGTTCAAAGCGGCGGTGCAGGACGCCCAGCCGCTACCCGACCACGGCAAGGTGGAACCGCCCTTGCCGCGCGTTTCCCCTATCCCGCGCCAGCGTATTCGCGATGAGCGTCAGGTCTTGGCCGACAGCCTGTCTGACCACATCGTGTGGGAGGATACCATGGAAACCGGCGAGGAGCTGGTGTTCCTGCGCACTGGCTTGCGCCGCGACACGCTCAAAAAACTGCGGCGCGGGCACTGGGTGCTGCAGGCCGAACTGGATTTGCATGGCCTGGTGAGCGTGGAAGCGCGCCAGGCGCTGAGCGCGTTTATCGCCGGCTGCGGCAAGCGCGGCCTGCGTTGCGTGCGCATCATCCACGGCAAAGGGCTGCGTTCCAAAAACCGCGAGCCGGTGTTGCGCACCAAGGTGAAAAACTGGCTGATGCAAAAAGACGAAGTGCTGGCGTTTTGCCAGGCGCGTGCGGTGGACGGCGGCAGCGGCGCGGTGGTAGTGTTACTCAAGTCTTCATGAAAACTTTTTGGAGGAATGCCATGACCGCAATTACCGAATTCGAACTTCCTTCCACCGGCAACCGAACCTTCAAACTCACCGACATGCGCGGCAAGAAGCTGGTGGTGTACTTCTATCCCAAGGACGACACGCCGGGCTGTACCGTGGAGGGCTCCGACTTCCGCGATTTGTATGCCGGGTTTCAGGCGCACAATTGCGAGATCGTGGGTATTTCGCGCGACGATATGAAATCCCACGAGAAATTCAAGACCAAGCTCAGCCTGCCGTTCGAGCTGTTGTCCGACGCAGACGAAAAAGTGTGCGAACTGTTCGGCGTGATAAAGCTGAAGAACATGTACGGCAAGGAAGTCCGTGGCATTGATCGCAGCACTTTTGTGTTCGACAGCGACGGCAAGCTGGTCAAGGAATGGCGCGGCGTGAAATCCGCCGGCCACGCGCAGGAAGTGCTGGATACCATTAAAACGCTCCAGGAGAAAATCTGAATGCCCCGCAAAGCCGCTGCGCCCACCAAGCTGTTTGTTCTTGATACCAACGTGCTGATGCACGACCCCACCAGCTTGTTCCGTTTTGAGGAGCACGACATATTTCTGCCGATGGGCACGCTGGAGGAACTCGATCACAACAAGAAAGGTATGACCGAAGTGGCGCGCAA
TGCGCGTCAGGCCAGCCGCTTCCTGGACGAAATCGTATCGGGTTGCGAAGATGCAATCGAAGCCGGGATTCCGCTCAGCAGCCATAGCCGCAAGGCAGCGACCGGACGGCTGTTTCTGCAGACCAGAATGGCGCGCATTGAAACCCCGCTCAGCCTGCCCAACAGCAAGGTGGACAACCAGATTCTCGGCGTGATTCTGAGCCTGCGCGAAGAGCAGCCCAGGCGCCCGATAATCCTGGTGTCCAAAGACATCAACATGCGCATCAAGGCGCGTGCACTGGGCTTACCCGCCGAGGACTACTTCAACGACAAGGTACTCGAAGACACCGACCTGCTCTATGCCGGGGTGCGTGAATTGCCGGAGGATTTCTGGGACAGGCACGGCAAGGGTATCGAGTCCTGGCAGGCTGATGGCCACACCTGGTATCGCGTCAAAGGCCCGCTGGTGACCCGCCTGCTGGTCAACGAATTCGTCTATCAGGAGAGCGCCAGCCCGCTCTACGCCATCGTCAAAACCATCAAGGGCAATGTCGCCGAGCTGCAGACCATCAAGGACTACAGCCACCAGAAAAACAATGTGTGGGGCATCACCGCGCGCAACCGCGAACAGAATTTCGCGCTCAATGTGCTGATGGATCCGGAGGTGGATTTTGTCACTTTATTAGGCCAGGCCGGTACCGGCAAAACCCTGCTCACCCTCGCGGCGGGGCTGATGCAGACGCTGGAGCACAAGCGTTATTCGGAAATCATCATGACCCGCGTGACCGTGCCGGTAGGCGAAGACATCGGTTTCCTGCCCGGCACCGAGGAAGAAAAAATGACGCCGTGGATGGGCGCGCTGGAAGACAATCTCGACGTGCTCAACAAGACCGACGACAGCGCCGGCGACTGGGGACGCGCCGCCACGCAGGACCTGATCCGCAGTCGCATCAAGGTCAAATCGCTCAACTTCATGCGCGGGCGTACCTTCCTCAACAAATACCTGATCATCGACGAGGCGCAGAACCTCACCCCCAAACAGATGAAAACCCTCATCACCCGCGCCGGTCCCGGCACCAAGGTGGTGTGCCTGGGCAACATCTCGCAGATTGATACGCCTTACCTCACCGAGGGCAGCTCCGGCCTGACCTACGTGGTGGACCGCTTCAAGGGCTGGCCCCACGGCGGCCATATCACCCTGGCGCGGGGCGAGCGTTCGCGCCTGGCCGACTGGGCGGCGGAAATGCTATGA
Protein sequences of DBSCAN-SWA_3 >NZ_AP021884|882982:892137|888744_889701_+|WP_147071121.1|DBSCAN-SWA MTTPQHHRLIILGSGPAGYSAAVYAARANLNPVVITGMAQGGQLMTTTDVDNWPADADGVQGPELMARFEKHARRFNTEIIFDHIHTAKLTDKPIALVGDQGSYTCDALIIATGASAMYLGLESEQAFMGKGVSGCATCDGFFYRNQDVAVIGGGNTAVEEALYLSNIARHVTVVHRRDKFKSEKILADHLMEKVKEGKISVEWNSELDEVLGDKTGVTGMRIKSTVDGSTRDIALTGVFIAIGHKPNTDIFTGQIAMEGGYIVTQGGNKGNATATSVPGVFAAGDVQDHIYRQAVTSAGTGCMAALDADRYLESLGK >NZ_AP021884|882982:892137|884262_885555_-|WP_147071312.1|DBSCAN-SWA MDSAPLAERLRPRNLDEVIGQTHLLGPGKPLRVAFESGKPHSMILWGPPGVGKTTLARLMSHGFDMEFIALSAVLSGVKEIREAVARAEMMQGQGRRTLLFVDEVHRFNKSQQDAFLPFVEAGLITFIGATTENPSFEVNSALLSRAQVYVLKPIVETDLAALLERARISTYPDIAITGAARAQLTGYADGDARRLLNLLEQAANAALDAARHEIDEDFIQTTLARGARRFDKGGDNFYDQISALHKSVRGSHPDAALYWMVRMLDGGADPLYVGRRLIRMASEDIGLADPRALRLTLDAVETYQRLGSPEGELALAQAVIYLACAPKSNAVYVAYNNARSFVAADKSREVPVHLRNAPTKLMKQLGYGHAYRYAHDEPDAYAAGEDYLPEGVNAHFYAPTPRGLEGKIAERLAYLRKLDEQANKKTGQK >NZ_AP021884|882982:892137|890274_890736_+|WP_147071117.1|DBSCAN-SWA MTAITEFELPSTGNRTFKLTDMRGKKLVVYFYPKDDTPGCTVEGSDFRDLYAGFQAHNCEIVGISRDDMKSHEKFKTKLSLPFELLSDADEKVCELFGVIKLKNMYGKEVRGIDRSTFVFDSDGKLVKEWRGVKSAGHAQEVLDTIKTLQEKI >NZ_AP021884|882982:892137|886347_888633_-|WP_147071123.1|DBSCAN-SWA MSIPRRNTPKSVRKPLPPRLTGLLREVGWLLSVALAAFLAVILFTYTQSDPGWSHSGQGSVHNMGGTLGAYLADILLALFGLSTWWLVMLALYATWWGYRKLEISEAPDRRPLYIRSAGFLLLLLSSAGLESQRLYSLHIALPEHPGGILGAVVGEAVTHLLGFTGGTLALLVLIAAGLSLFTGISWLALVESTGEYLERSWLWAIETWSARRERRMGQAATLRRDEIVLEEKRRNMDRAPVVIETPPPTIIKSERVQVEKQVPLFAEMPDSPLPPLHLLDAATAHVESLSAETLGFTSRLIERKLAEFNVQARVVAALPGPVITRYEIEPASGVKGSQIVNLAKDLARALSVVSIRVVETIPGKTTMALEIPNPKREIVRLSEILSAKVYHAMASPLAMAMGKDIAGNPVVADLAKMPHVLVAGTTGSGKSVAINAMILSLLYKADASRVRLILVDPKMLELSVYEGIPHLLAPVVTDMRQAAAALNWCVAEMERRYKLMSALGVRNLAGYNQKVRDAKRAATPLTHPFSLTPDNPEPLEEQPMIVVVIDELADMMMVVGKAVEQLIARLAQKARAAGVHLILATQRPSVDVITGLIKANIPTRVAFQVSSKIDSRTILDQMGAEALLGQGDMLYLPPGTGYPQRVHGAFVADEEVHKVVEYLKQLGEPDYIDGILDTPEDNEGSGESAVGGSDAESDPLYDEAVAIVLKTRRPSISAVQRHLRIGYNRAARLIEAMEQAGLVTPMQSNGNREVIAPNRE 
>NZ_AP021884|882982:892137|890736_892137_+|WP_147071115.1|DBSCAN-SWA MPRKAAAPTKLFVLDTNVLMHDPTSLFRFEEHDIFLPMGTLEELDHNKKGMTEVARNARQASRFLDEIVSGCEDAIEAGIPLSSHSRKAATGRLFLQTRMARIETPLSLPNSKVDNQILGVILSLREEQPRRPIILVSKDINMRIKARALGLPAEDYFNDKVLEDTDLLYAGVRELPEDFWDRHGKGIESWQADGHTWYRVKGPLVTRLLVNEFVYQESASPLYAIVKTIKGNVAELQTIKDYSHQKNNVWGITARNREQNFALNVLMDPEVDFVTLLGQAGTGKTLLTLAAGLMQTLEHKRYSEIIMTRVTVPVGEDIGFLPGTEEEKMTPWMGALEDNLDVLNKTDDSAGDWGRAATQDLIRSRIKVKSLNFMRGRTFLNKYLIIDEAQNLTPKQMKTLITRAGPGTKVVCLGNISQIDTPYLTEGSSGLTYVVDRFKGWPHGGHITLARGERSRLADWAAEML >NZ_AP021884|882982:892137|882982_884263_-|WP_147071127.1|tRNA|DBSCAN-SWA MLDIQHLRNDLDDVAQRLAMRGYTLDTAEFQRLEAERKQLQTRTQELQARRNASSRQIGMAKAKGEDVSTIMAEVANLGAELKAAESALEALQARLNALLMTIPNLPHASVPTGKSDADNVEIARVGTPRVFDFAVKDHVDVGTALGLDFETASKLAGARFSLLRGGLARLHRALAQFMLDTHTAVHGYTEVYVPYLVNADSMRGTGQLPKFEEDLFHVPRADAEKLYLIPTAEVPLTNMVRDAIVAREQLPLKFVAHTPCFRSEAGSYGRDTRGMIRQHQFDKVELVQLVEPEQSYTALEALTGHAEAILQALGLPYRKMALCGGDMGFSAAKTYDLEVWLPAQNTYREISSCSNFEAFQTRRMQARFRGEKGKPELLHSVNGSGLAVGRTLVAILENHQQADGSVALPEVLRPYMGGLTVLMPA >NZ_AP021884|882982:892137|889708_890254_+|WP_147071119.1|DBSCAN-SWA MAGRVPPADEAALFKAAVQDAQPLPDHGKVEPPLPRVSPIPRQRIRDERQVLADSLSDHIVWEDTMETGEELVFLRTGLRRDTLKKLRRGHWVLQAELDLHGLVSVEARQALSAFIAGCGKRGLRCVRIIHGKGLRSKNREPVLRTKVKNWLMQKDEVLAFCQARAVDGGSGAVVVLLKSS >NZ_AP021884|882982:892137|885581_886199_-|WP_147071125.1|DBSCAN-SWA MKKLSLFTALLIFSASAAASSIDALKAFVADTQTARATFTQTVLDKNGHARQQSSGTMAFARPGKFRWVYEKPYEQIIVGDGKRIWLYDADLQQVTIKKLGQALGSSPAALLAGSKDIGRFYTITDAGSRDGLEWLDARPRDKESAFENVRMGFSKNTLVAMEIKDNFGQTTVLKFAGLERNPALPASDFHFTPPQGVDVISDGK |
8 | uncultured_Mediterranean_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those associated with specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1605656 : 1613255
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_AP021884|1605656:1613255|DBSCAN-SWA CATGCAAAATCTCGACACTATTGTTGCCGCAGCGCTAGCCGAATTCGCCGCAGTCAACCAGGCCGTTGAACTGGAGCAGGCAAAAGCCCGCTATCTCGGCAAAGCCGGTTTGCTCACCGGGCAATTGAAACAACTGGGCAAGCTTCCCGCCGCAGAACGCCCGGCAGCGGGCAACGTGATCAATCAGGCCAAGGAACGGATTCAGCAGGCGCTGGAAGCGCGCCGCGCAGCCTTGTCCCGGGCTGAGCTGGATAACAGGCTGGCGGCGGAAACCCTGGATGTGACGCTGCCCGGACGCGGCCTGGGCACAGGCGGCCTGCACCCGGTGACGCGCACGCTGGCACGCATCCAGGCGCTGTTCGCCTCGATCGGTTTCGAGGTGGCGGAAGGCCCGGAGATTGAAACCGATTTCTACAATTTCACCGCACTGAATATTCCGGAAAACCACCCGGCGCGCGCCATGCACGACACTTTCTACGTGGATGACAAACACCTGCTGCGCACCCACACGTCGCCGGTGCAGATACATTATTTGCAGAACAATCAGCCGCCGCTCAAGATCATCGCGCCAGGCCGGGTATATCGCTGCGATTCCGACGTGACCCACACACCCATGTTTCATCAAGTCGAGGGATTGTGGGTGGACGAAGAGGTGAGTTTCGCGGCATTGAAAGGCGTGCTGGCGGATTTCATGCAGCGTTTTTTTGAACGCGATGACCTGAAGGTGCGCTTCCGCCCATCGTTTTTCCCGTTCACCGAACCGTCGGCGGAAATGGATATCGCTTGCGTGATGTGCGGTGGCGGCGGTTGCCGCGTATGCAGCCATACCGGCTGGCTGGAAGTGCTGGGCTGCGGCATGGTGCATCCCAATGTGCTGGGACATGTGCATGTGGATAGCGAAAAATACCTCGGTTTCGCGTTTGGCATGGGGGTGGAACGGCTGGCCATGCTGCGCTACGGTGTGGATGACCTGCGCCTGTTTTTCGCTAATGATTTGCGTTTCCTGAAACAGTTCAACTGAACCATCAAGATGAAATTCTCCGAGCTCTGGTTGCGTACCCTTGTTAATCCTGCGCTGGACAGCGCGGCGCTGTCCCATCTTCTTACCATGGCCGGACTGGAGGTCGAAGCGCTGGACCCGGTCGCGGCGGATTTTTCCGGCGTGGTGGTGGGGCAGGTGCTGTCCGTAGCGCCGCATCCGGATGCCGATCGCCTGCGCGTGTGCCTGGTAGATGCCGGCACTGGCAGCCCGTTGCAGATCGTATGTGGCGCACCCAATGTAAGTGAAGGCGCGCGCGTGCCTTGCGCCCTGGCAGGCGCCCGCTTGCCGGGCTTTGAAATCAAGAAAGCCAAGTTGCGGGGTGTGGAATCGCAGGGTATGTTGTGCTCCGCGCGCGAGCTGGGACTGGCAGAACAAGCCGATGGCCTGCTGTTGTTGCCGAACGACGCACCGGTGGGTAGCAATATCCGCGATTATCTGCATCTGGATGACAGGCTTTATACGCTCAAACTTACCCCCAATCGCAGCGATTGCCTGAGCGTGGCCGGCGTGGCGCGTGAAGTGGCCGCGCTTACCGGCAGTCCATTGAACTTGCCCCGGATTGAACCCGCAGCGGTCACCGGCAGGCTCACCCGCATGGTGCAGGTGACTGCAGGACAAGCCTGCCCGCGCTATTGCGGGCGCGTCATCAGCCAGCTCAATCGCGCGGCTCAAACACCGGGCTGGATGATTGAACGCCTTTCCCGCAGTGGCCTGCGCAGTATCAGTCCGGTAGTGGACATTACCAACTATGTATTGCTGGAGTTGGGACAACCCTTGCATGCCTTTGATCTGGACAAGCTTGCTGGCGATATCCAGGTGCGCATGGCCACGCCGGGTGAAACGCTGACGCTGCTGAATGATCAGCGTGCGACGCTGGAAGC
GGACATGCTGGTGATCGCCGATGACAACGGCGCGCAGGCGCTCGCTGGCATCATGGGGGGGGCGGCCACCGCAGTGGATGAAAATACCTCGGAAATTTTTCTTGAGGCAGCGTATTTCAGCCCCGGCGCGATTGCCGGACGGGCGCGCCGGCTGGGCTTGTCCACCGATTCATCGCACCGTTTTGAGCGCGGGGTGGACTACGCAGCCACGCGCGATGCGCTGGAACGCGCCACGGCATTGATACTTGAAATTTGCGCTGGCGCGGCAAGTGCAATCACCGAAATAACAGGCGATCTGCCACAACGTGCGCCTGTCATGCTGCGCACCGCGCGTGCCAGCAAAGTGTTGGGCGTGGCGCTGAGTGACGCGCAGGTGGAAGTGTTGCTGGGCCGCCTGTGCTTTGACTTTCAGCGCGATGGCGCGGCCTATCAGGTGACGCCGCCCAGCTACCGCTTTGACCTGAATATCGAGGAAGACCTGATCGAAGAACTGGCGCGGCTCCATGGTTATGACAACATTGTTGCGCAGGCCCCGGTCGCCCGCCTGACCATGTTGCCGCAGCCGGAGCAACAGCGTGGGGTGGATGCGTTGCGCACCCTGCTCACCGCGCGTGATTATCAGGAAGTCATTACCTACAGCTTTGTAGATGCCGCATGGGAAGCGGATTTCGCACCCGGCGCTCAGCCCGTCGTGCTGAAAAATCCCATCGCCAGTCAGATGGGCGTGATGCGCTCCACCTTGTTGGGCGGCCTGATGGATGTGCTGCGCAACAATCTGAACCGGCGCCAGGAGCGTGTGCGTATTTTTGAGAGCGGACGCTGTTATCTGCCGGCGGCCGAGGGCTTCGATCAGCCGCAACGCCTGGCTGGACTGGCTTACGGCAGCGCTATGCCGGAGCAGTGGGGGAGTGCGGCGCGCAACGTGGACTTTTTTGACGTCAAAGCCGATCTCGAAGCGCTGTGCTGGCCACAGCCTGCACGCTTTGAAAAATCCGCTCATCCTGCGCTGCATCCAGGCCAGTGCGCTGAAATGTGGTTGAATGGTGTCCATGCCGGCTGGCTGGGTACATTACACCCACGGCTGACGCAGCAATATGATTTGGCGACAGCGCCGGTTGTGTTTGAACTCGCCCTGCCGGCATTGTTAACGCGGAAGCTGCCCAGGCATGGCGAGATTTCGCGTTTCCAGAGCGTGCGCCGTGATCTGGCCGTGATAGTCGATGAATCGGCGCCGGTACAGGCTTTGATTGATGCGATGTACGCAGCACGCATAGAGGGTGTTGCCGAGATTACATTGTTTGACGTGTATCGCGGCAAAGGCATTGATTCTGATAAAAAAAGTCTTGCATTCCGGGTGCTGTTGCAAGATACTCAAAAGACCTTTACCGACACTGAAGTGGATACCGCCATGGCGTACTTCACCGATCTGTTAAAACAACAATTCAACGCGCAATTACGTTCCTGAGGTAGTCATGACCCTGACCAAGGCAGAACTGGCAGACATGCTGTTCGAAAAAGTTGGCCTGAATAAACGCGAAGCCAAAGACATGGTGGAGTCGTTTTTCGAAGAAATACGCATTGCACTGGAAGCGGGCGATACCGTGAAGCTTTCCGGCTTTGGCAATTTTCAGCTGCGTGACAAACCGCAGCGTCCTGGCCGCAATCCCAAAACCGGCGAAGAAATGCCAATCACGGCACGCCGCGTGGTGACCTTTCACGCCAGCCAGAAACTCAAATCGCAGGTAGAAGACGCGCATGGCGGAACATCAGCCAACTAGTCAACTGCCGCCGATTCCTGCCAAGCGCTACTTCACTATCGGCGAGGTCAGCGAACTGTGCGGGGTGAAGCCGCACGTACTGCGCTACTGGGAACAGGAATTCGGCCAGCTCAAACCAGTCAAGCGACGTGGTAACCGTCGTTACTATCAGCATCATGAAGTGCTGCTGATTCGCCGCATCCGGGAACTGCTTTATGAGCAGGGATTCACGATCAATGGCGC
ACGCCATCGTCTGGATGTGCTGGCCACATCCGACGCCGCCGAGGCCGCACCCACGGTGACTGAATCGGTAACGGATTATGCAGCACTGCGTCGCGAAATGATGGAAATTGTCGAGTTGCTGCGCCTGTGATTTTTTAGCTCCAGTCTTTGTCCAGGCGCTACAGACTCTGCTATAATCGCGGCCTTCGGGGCGTAGCGCAGCCTGGTAGCGTACTTGCATGGGGTGCAAGTGGTCGGAGGTTCAAATCCTCTCGCCCCGACCAGAACAATGCCCAGGTGTCATGCCTTTCTAGAAAGCAATTCACGGGAATTTCAGAGTAGCCGTCCACTATGCGCATTTTGCTGAGCAACGACGACGGTTACTTTGCACCCGGTCTCGCCATCCTGGCGGATACACTCTCACACATCGCAGATATCACGGTGGTTGCCCCCGAGCGCGACCGCAGCGGCGCCAGCAATTCCCTCACACTCGATCGTCCGCTGATGCTGCGCCAGGCGCACTCCGGGTTTTATTACGTCAATGGTACGCCCACGGACTGCGTTCACCTCGCGGTTACGGGTATGCTCGATCACCTGCCGGACATGGTTATCTCCGGTATCAATCACGGTGCCAACATGGGCGACGACACTATTTATTCGGGCACCATAGCGGCAGCCACCGAAGGGTTTTTACTGGGGGTGCCGTCGCTGGCGATATCCCTGGCGAGCCATGCAGCGGGCAACTATGCCACAGCCGCGCGTGTTGCCAGCGAGCTCGCGCAACGTGTCATGGCACGGCCTTTTGCGGCACCGCTACTGCTCAACGTGAACGTGCCGGATATTCCCTATCAGGACTTGCAAGGCACCGCAATCACCCGCCTCGGACGCCGCCACAAAGCCGAACCGGTGGTCAAATCCACCAATCCGCGCGGTCAGACGGTGTATTGGGTGGGGGCTGCGGGTGCGGCGCAGGATGCAGGCGAAGGCACGGATTTCCATGCGGTGGCGAATGGGCGTGTGTCGGTGACGCCGTTGCAGATGGATCTCACCCAGTTCAGTCAACTGGCGCCGCTGCGGGCGTGGTTGCAGGCATGAGCGTGACGCGTCACAGCGGTATCGGCATGACTTCCGAGCGTACCCGCGCGCGCATGGTCGAGCGCCTGCGTGCGCAGGGGATCAAGGACAACAACGTACTCACCGCGATGGGCATGGTGCCCCGGCATATTTTTGTGGATGAGGCACTGTCCATCCGGGCTTATGAGGACAGCGCGCTGCCGATAGGTTTCGGCCAGACCATTTCCAGCCCCTATAGCGTGGCGCGCATGATCGAGGTGCTGCGTGGCGGCGCCGACCTGCAGTGCGTGCTGGAAGTCGGCACAGGTTGCGGCTACCAGGCCGCTGTACTGGCCAAGCTGGCACGCGAAGTGTACTCGGTTGAGCGCATTGCCACGCTGCTCGGGCGCGCGCGTCGTACCATACGCGAACTACGCATCGGCAATATCAAACTTAAACATGGCGATGGTAGCATTGGGTTAAAGGATGTGGCACCTTTCGATGGCATCATCCTTGCCGCCGCCATACCCACTCCCCCCCAGGCGTTGCTGGAACAACTGGCGCAGGGCGGCCGCATGGTATTGCCGCGAGGTATTGGTGAAACGCAGCAAATGGTGCTGATCGAGCGCACCGCAGAAGGTTTTCAGGAGACGGTGCTGGAAATGGTACATTTTGTTCCACTGTTGCCCGGAGTGCGCTGACGTGATGGTATTCGACAAGATTGCATGCCCGGTTTATCTGGTTGCTGTGATCCTTGCCCTGGGAGGATGCGCCACGCAGAATTCTGCGCCGGTTGTGGATGGCACCCAGCCTGGCACAAGCAATATTGTCAAGCCGGCAATCAAGTCCGCCACTACCCGCGCCGGCGCAGCAAAGCTGCATGACTGGCGACCGGACAGCCATACCGTGCAAAAAGGCGACACGCTCTACAGCATCGCGCTTGAATATGGCCTGGACTACCGTGATCT
GGCGAGCTGGAATGCACTATCTGACAATAACCTGATCCGTGTCGGGCAGGTGTTGAAACTGAGTGCGCCGCAGCCAGGCAGCGGCATTGCGCAAGTCACGACATCTGAATCAGCAGTTCAGACCATCCCCCTCAAGATCGAACCGTTACCGCAGGCCCAGATAGCGACCGGCGCGGTGTTGATAACCCAGCCCAAGGCGGTCAAATTGCCCTACTCTGCCGCCGCATTGGCGCAGCTTGAACAAGGCGGGACGCCGCAGCCGGCCGCGCGGCCTCCCGCAACACCGGAGGCAGCGTCCGGGGTGGCGCCCGAGCCCGCATCCTCTGCAGAACAATCCCGACCGGCCGCGACTGCCAAGGAAACCGATGATACGGGTATTGATTGGATCTGGCCTACGCAAGGCCGGGTCATTGCCGGATTTGACGAAGCCAAAAACAGCAAGGGGCTGGATATAGCAGGCAAAGCCGGACAAGCCATATTCGCAGCCGCGCCGGGCAAGGTGGTGTATAGCGGCGCTGGTTTGCGCGGCTATGGCAAGCTGGTTATTATCAAGCACAACGCCATTTATTTGAGTGCCTACGCACACAATCAGCGGGTGCTGGTGAAAGAAGGTCAGACGGTTGCGCGCGGGCAGAAAATCGCCGAAATGGGTGACAGTGATGCCGATCAGGTCGCGCTGCATTTCGAAATCAGGAAAATGGGCCAACCGGTGGACCCGATGAAATATCTTCCCGGAGCACAAAAATAGCGATGCATGACGACAACGAGCAGGAAGACTACGCCGGTATCCCGGACGACAAATCCCAGACCGAAGTGCCCGAGGTCGAACTGGGATTTACCGAGGATGCGCATACCGATGTGACGCAGATGTACCTCAACGAAATTGGCCACAACGCATTGCTCAGCCCCACTGAGGAACGCCGCCTGGCCGAACTCACCCGTGCGGGCGATTTTGACGCCCGGCAAAAAATGATCGAACACAATCTGCGGCTGGTGGTGAATATCGCCAAGCATTACGCCAATCGCGGGCTGGCACTGCTGGACCTGATCGAGGAGGGCAACCTGGGACTGATTCATGCGCTGGAAAAGTTCGAGCCCGAACGCGGATTTCGTTTTTCCACTTATGCCACATGGTGGATACGCCAGAATATCGAGCGCGCCATCATGAACCAGTCGCGCACCATCCGCCTGCCGGTGCACGTCATCAAGGAACTCAACGTCATTTTGCGCGCCCGCCGTCATCTGGAAAATCACGGCGCCTCCGACCCGAGTGACGAGGACATCGCCCATCTGGTCGGGCTGCCGGTAGAAGATGTACGACGCATGTTGCGCCTGAATGACCGGGTGGCATCGCTCGACGCACCGCTCGATATTGATCCCAGTCTGTCCATCGGGGAGGCCATTGCAGATGGCAACAGCGCGTTGCCTGAAGACATGCTTGAGCACGCCGAGACTGAAGCCTTTGTGCGCCTGTGGCTGAGTGACCTCAACGACAAGCAGCGCTGGGTAATTGAGCGGCGTTTCGGGCTGGGCGGGCAGGATGTGCACACCCTGGAACAGCTGGCCGAAAGCCTCGACGTCACCCGTGAACGCGTGCGCCAGATCCAGATGGAAGCCCTGCACCATTTGCGGCGCATGCTGAAACGCACCGGCGTCAACAAGGACGCTCTGTTGTGA
Protein sequences of DBSCAN-SWA_4 >NZ_AP021884|1605656:1613255|1611315_1612326_+|WP_147073482.1|DBSCAN-SWA MVFDKIACPVYLVAVILALGGCATQNSAPVVDGTQPGTSNIVKPAIKSATTRAGAAKLHDWRPDSHTVQKGDTLYSIALEYGLDYRDLASWNALSDNNLIRVGQVLKLSAPQPGSGIAQVTTSESAVQTIPLKIEPLPQAQIATGAVLITQPKAVKLPYSAAALAQLEQGGTPQPAARPPATPEAASGVAPEPASSAEQSRPAATAKETDDTGIDWIWPTQGRVIAGFDEAKNSKGLDIAGKAGQAIFAAAPGKVVYSGAGLRGYGKLVIIKHNAIYLSAYAHNQRVLVKEGQTVARGQKIAEMGDSDADQVALHFEIRKMGQPVDPMKYLPGAQK >NZ_AP021884|1605656:1613255|1606685_1609043_+|WP_147073443.1|tRNA|DBSCAN-SWA MKFSELWLRTLVNPALDSAALSHLLTMAGLEVEALDPVAADFSGVVVGQVLSVAPHPDADRLRVCLVDAGTGSPLQIVCGAPNVSEGARVPCALAGARLPGFEIKKAKLRGVESQGMLCSARELGLAEQADGLLLLPNDAPVGSNIRDYLHLDDRLYTLKLTPNRSDCLSVAGVAREVAALTGSPLNLPRIEPAAVTGRLTRMVQVTAGQACPRYCGRVISQLNRAAQTPGWMIERLSRSGLRSISPVVDITNYVLLELGQPLHAFDLDKLAGDIQVRMATPGETLTLLNDQRATLEADMLVIADDNGAQALAGIMGGAATAVDENTSEIFLEAAYFSPGAIAGRARRLGLSTDSSHRFERGVDYAATRDALERATALILEICAGAASAITEITGDLPQRAPVMLRTARASKVLGVALSDAQVEVLLGRLCFDFQRDGAAYQVTPPSYRFDLNIEEDLIEELARLHGYDNIVAQAPVARLTMLPQPEQQRGVDALRTLLTARDYQEVITYSFVDAAWEADFAPGAQPVVLKNPIASQMGVMRSTLLGGLMDVLRNNLNRRQERVRIFESGRCYLPAAEGFDQPQRLAGLAYGSAMPEQWGSAARNVDFFDVKADLEALCWPQPARFEKSAHPALHPGQCAEMWLNGVHAGWLGTLHPRLTQQYDLATAPVVFELALPALLTRKLPRHGEISRFQSVRRDLAVIVDESAPVQALIDAMYAARIEGVAEITLFDVYRGKGIDSDKKSLAFRVLLQDTQKTFTDTEVDTAMAYFTDLLKQQFNAQLRS >NZ_AP021884|1605656:1613255|1609333_1609708_+|WP_147073439.1|DBSCAN-SWA MAEHQPTSQLPPIPAKRYFTIGEVSELCGVKPHVLRYWEQEFGQLKPVKRRGNRRYYQHHEVLLIRRIRELLYEQGFTINGARHRLDVLATSDAAEAAPTVTESVTDYAALRREMMEIVELLRL >NZ_AP021884|1605656:1613255|1610648_1611311_+|WP_147073435.1|DBSCAN-SWA MSVTRHSGIGMTSERTRARMVERLRAQGIKDNNVLTAMGMVPRHIFVDEALSIRAYEDSALPIGFGQTISSPYSVARMIEVLRGGADLQCVLEVGTGCGYQAAVLAKLAREVYSVERIATLLGRARRTIRELRIGNIKLKHGDGSIGLKDVAPFDGIILAAAIPTPPQALLEQLAQGGRMVLPRGIGETQQMVLIERTAEGFQETVLEMVHFVPLLPGVR >NZ_AP021884|1605656:1613255|1605656_1606676_+|WP_147073445.1|tRNA|DBSCAN-SWA 
MQNLDTIVAAALAEFAAVNQAVELEQAKARYLGKAGLLTGQLKQLGKLPAAERPAAGNVINQAKERIQQALEARRAALSRAELDNRLAAETLDVTLPGRGLGTGGLHPVTRTLARIQALFASIGFEVAEGPEIETDFYNFTALNIPENHPARAMHDTFYVDDKHLLRTHTSPVQIHYLQNNQPPLKIIAPGRVYRCDSDVTHTPMFHQVEGLWVDEEVSFAALKGVLADFMQRFFERDDLKVRFRPSFFPFTEPSAEMDIACVMCGGGGCRVCSHTGWLEVLGCGMVHPNVLGHVHVDSEKYLGFAFGMGVERLAMLRYGVDDLRLFFANDLRFLKQFN >NZ_AP021884|1605656:1613255|1609050_1609356_+|WP_147073441.1|DBSCAN-SWA MTLTKAELADMLFEKVGLNKREAKDMVESFFEEIRIALEAGDTVKLSGFGNFQLRDKPQRPGRNPKTGEEMPITARRVVTFHASQKLKSQVEDAHGGTSAN >NZ_AP021884|1605656:1613255|1609908_1610652_+|WP_147073437.1|DBSCAN-SWA MRILLSNDDGYFAPGLAILADTLSHIADITVVAPERDRSGASNSLTLDRPLMLRQAHSGFYYVNGTPTDCVHLAVTGMLDHLPDMVISGINHGANMGDDTIYSGTIAAATEGFLLGVPSLAISLASHAAGNYATAARVASELAQRVMARPFAAPLLLNVNVPDIPYQDLQGTAITRLGRRHKAEPVVKSTNPRGQTVYWVGAAGAAQDAGEGTDFHAVANGRVSVTPLQMDLTQFSQLAPLRAWLQA >NZ_AP021884|1605656:1613255|1612328_1613255_+|WP_147073433.1|DBSCAN-SWA MHDDNEQEDYAGIPDDKSQTEVPEVELGFTEDAHTDVTQMYLNEIGHNALLSPTEERRLAELTRAGDFDARQKMIEHNLRLVVNIAKHYANRGLALLDLIEEGNLGLIHALEKFEPERGFRFSTYATWWIRQNIERAIMNQSRTIRLPVHVIKELNVILRARRHLENHGASDPSDEDIAHLVGLPVEDVRRMLRLNDRVASLDAPLDIDPSLSIGEAIADGNSALPEDMLEHAETEAFVRLWLSDLNDKQRWVIERRFGLGGQDVHTLEQLAESLDVTRERVRQIQMEALHHLRRMLKRTGVNKDALL |
8 | uncultured_Mediterranean_phage(33.33%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1977054 : 2023954
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_AP021884|1977054:2023954|DBSCAN-SWA AATGCAAACCCAAGTTCCATCAATCGAATCCGGTCGGAATCCCCGCCGGATGAATCCCGGCGGTGCAACCTGCATCGCCCTCGACGAAAACGAGCTCGCCATCCGCTGGGGGCTCTCCGTCAAGACGCTGCGCCGCTGGCGTCAAGAGCAGCTCGGCCCCATCTACTGCAAGCTCGGTCGCCGGGTCACCTACCTCCTGCACGAAATCGAAGCCTTCGAGCGCCGCGTCTCGCGCTACTCGAGCTTCACTCGTGCGTACCAGTGAGGAGGACGGCCATGAGCGATCTGACCATCTTCCCCGTCGACATCGCTGAGATGTCTGTGAGCCAACTGGCCGCGCTGCCGCCCGAGCAGAAGTGCGAGGTCGACAAGAACCTTGATGCTGCCATCGACTGGCTAAAGAAGGCTCGCACCAAGTTCGATGCGGCGCTGGAACAGTGCTACGGCGAGCAGGCCCGTGTCGCACTGCGTGAATCAGGCCGTGACTTTGGTACCGCCCACATCAGCGACGGCCCGCTGCACATCAAGTTCGAGCTGCCCAAAAAGGTCAGCTGGAACCAGAAACAGTTGGGCGAAATCGCCGAGCGCATCGTGGCCTCAGGCGAGAAGGTCGAGGGCTACCTCGACGTCAAGCTCTCAGTGTCCGAGTCCCGGTACATCAACTGGCCGCCTGCATTGCAGCAGCAATTCGCGGCCGCCCGCACGGTCGATTCCGGCAAGCCGTCCTTCACCCTGAGCACCGATGGGGGTGAGGCATGAAGCGGCTACCCATCGTGTCCGCCGTCGAGCGGATGGCCGAGCGCAAGGGCGTGAAGCTGCTGATGCTGGGCAAGTCCGGCATCGGCAAGACGTCCCGGCTCAAAGACCTCGACCCCGCCACCACACTGTTCCTTGACATCGAGGCAGGCGACTTGGCGGTCGCCGACTGGCCGGGCGACACCATCCGCCCGGCGTCCTGGCCCGAGAGCCGCGACTTCTTCGTGTTCCTTGCGGGCCCGGACAAGTCGCTGCCGCCGGAGAGCGCGTTCTCGCAGGCGCACTACGACCACGTCATCGAGAAGTTTGGCGATGCGACGCAGCTCGGTCGCTACCAGACCTTCTTCCTTGACTCGATCACGCAACTGTCTCGCCAGTGCTTTGCGTGGTGCAAGACGCAGCCCGGGGCGGTCAGTGATCGTTCCGGCAAGCCCGATCTGCGCGCGGCCTACGGGCTGCTCGGCCAGGAAATGATCGGCGCGTTGACCCACCTGCAGCACGCCCGTGGCAAGAACGTTGTGTTCGTGGCGATCCTCGATGAGCGACTGGATGACTTCAATCGCAAGGTGTTCGTCCCGCAGATCGAAGGCAGCAAGACCAGCCTGGAGCTGCCCGGCATCGTCGATGAGGTCGTGACGCTGGCCGAGATCAAGGCCGAGGACGGCAGTTCCTACCGCGCCTTCATCACGCACACCGTCAATCCCTACGGCTTCCCGGCCAAAGACCGCAGCGGTCGTCTCGACCTGCTGGAGCCGCCGCATCTCGGCGCGCTGATCGCCAAGTGCGCGGGCGCTGTGCCCGCGCTAGCCAGCGCCGCCAACCCCGCACACATCGAATCTCAGGAGTAATCGCAATGACCGCATGGAATGACTTCAACGACGCCGACTCTCAGCAATCCGGCTTCGATCTGATCCCCAAGGGCACCGTCGTGCCGGTGCGAATGACCATCAAGCCGGGTGGCTATGACGACCCCGAGCAAGGCTGGGGTGGCGGCTACGCCACCGAATCGTTCGAGACCGGTTCCATCTATCTGGCCGCTGAGTTTGTGGTCACCGCTGGCGATCATGCCAAGCGCAAGATGTGGAGCAACGTCGGCCTGCTCTCCAAGAAAGGCCCGACCTGGGGCCAGATGGGGCGCAGCTTCATCCGGGCCGCGCTCAACAGCGC
CCGCAACGTCCACCCGCAGGACAACAGCCCACAGGCCGCCGCCGCGCGCCGCATCAATGGCTTCGCCGAACTGGACGGTCTGGAGTTCTTGGCGCGCGTCGACATCGAGAAGGACGCGAAGGGTCAAGACCGCAACGTGGTCAAGCTGGCAGTCGAGCCCGACCACCCCGACTACGCCAAGTTGAAAGGTGTGCCGCCGAAGGGCAGTCCGGGCGGTGGCAACTCCGGCGCTCCGGCGCAGGCGGCCCCGGCCTATTCCGCGCCCACCCCGCAACGCGCACCAGTGACGGGCAAACCGTCCTGGGCTCAGTGAGGAGACGGCTATGAATGCATCCGTCCTCACTGCCAGTCACTACGGCGTCGTGCGCTTCGGCGATCTGCAATGCGAGGCCGTCGTCCTCAAGGGCGGCGAGCGTGGCTACGTTCGTCGCCAACTGGCCAAGCTGCTGGGTTTCCACGAGACGCACAAGGGTGGCCGATTTGCCCGGTTTCTTGCCGACTTCGCTCCTAAGTCCTTGTCGGCATTGGAGAAAACTCGTGAGCCGATTCTGTTGCCGTCAGGTCGGCAGGCGCAGTTCTTCCCGGCCGGGATCATTGCCGACGTCGCGTCGGCGGTGGTCAGCGCGGCCATCAACGGCACGCTGCACAAGGCCCGCCAGGGCATCGTGCCCAATTGCATGAAGATCATGCGCGCGCTGGCCACCACCGGCGAGGTCGCGCTGATCGACGAGGCGACGGGCTACCAGTACCACCGCGCGCCTGACGCGCTGCAGGAACTGATCTCCAAGCTGCTGCGCCAGTCGTGCTCTTCGTGGGAGCGCCGCTTCCACCCGGACTACTACCGCGCCCTCTACCGGCTGTTCGGCTGGAAGTACCAGGGCCACGACCAGAACCCGCCCCACGTTGTCGGTCAGATCACGCAGCGCTGGGTCTACGGCCCGGTGCTGCCCGTCACGCTGATCGACGAGATTCGCGCCCGCAAGGGCATCTCGCAGAAGCACCACCAGTGGCTGTCCGATCAGGGCCTCGCCCGTCTGGAAACGCAGATTCACGCGGTCACCGCCATTGCGCGCAGCTCGACCTGCTACCGCGACTTCGACCGCCGCTGTGAAGCGGCCTTCGCTGGCGGCGCGCTGCAGCTGGCGCTGCTGGCCGAAGACTTTGAGGAGGGGGCGTGAAATGCTGGGTCTGCAAACGACAGGCCCGGGGATTCGGTCACACCGACAACCGACACGGTATCGGCGATCCCCGGCGCTACCCCATCGACTGGGTGTTCTGCTCGCAGCGCTGCCAATCCGCGTTCCACGCTATGTACGGCAACTGGTCGCGCGCCAAGGATGGTCGCAGCGACATCAAGGGGGTCGCCATGATCGATCCCTCTGATATCGAGCTGGCCGCGATGCGCAAGTGCCTCAAGTCCTTCGGCGAGGCGGCAAGCGAGATCGGCTTCACCAAACCACTGGGCAACTACTCCGAAGCCGAGGCGCTGCAGGTGATCGACGCCATCGTCACTTGCTACACCGAGGCGATGGTTGAGCACCACGAGGCGAGCAAGTACCCGCCCGTACGCGGCATGACGCCAACGCCCGACCCCATGACACCGAGTGCAGCCAATCCGTTCGCGGATCTGGACGACGACCTGCCTTGGGAAGAACCGAAGGGGAAGAAGCCATGATGGACTTCAACTCCACTTCGAGCATCTCGGGCCAGATCACTGCGCTGGTCGACGCCGGGATGCAGCGGGCGCGAGCCCAGCAGTCCGAGCGCCAGTACCTTGGTGCCTCGCGGTTGGGCGCTGCCTGCGAGCGTGCGCTGCAGTTTGAGTACGCCAAGGCTCCCGTCGATCACGGCCGGGACACCCCGGGCCGGATGCTGCGCATCTTCGAGCGCGGCCACGTCATGGAGGACTGCATGGTCGCGTGGCTGCGCGACGCCGGTTTCGAATTGCGTACCCGCAGGGCCGATGGCGAGCAGTTTGGCTTCTCCGTGGCTGATGGCCG
TCTGCAGGGCCACATCGACGGCGTCATCGTCGATGGCCCGGAGGGCTTTGCCTACCCGGCGCTCTGGGAAAACAAGTGCCTCGGCATGAAGTCCTGGCGCGAGCTGGAGAAGAACCGGCTCGCCGTGGCCAAGCCCGTCTACGCCGCGCAAGTGGCGATCTACCAAGCCTATCTCGAACTGCACGAGCACCCGGCGATCTTCACGGCGCTCAACGCCGACACGATGGAGATCTACACCGAGGCCGTGCCCTTTGACGCAGCCCTGGCCCAGCGAATGTCGGATCGGGCGGTGAAGGTCATCACGGCGACTGAAAGCGCAGATCTCCTGCCGCGTGCCTTCAATGACCCGACCCACTTCGAGTGCCGGATGTGCGCGTGGCAAGACCGCTGCTGGAGAACACAAGCATGACCGACAACAACACCCCGACCACCGGCATCGAGCCGATGATCGATGCCAAGCAGGCGGCCGCCGCGTTGCGCCTGCCGTACTACTGGTTCGCCGACCACGCGATGCGCACCAAGTACCGGATTCCGCACTACCTGATGGGCGGTCTGGTGCGCTACCGGCTGTCCGAACTCTCTGCGTGGGCCACGCGTACCACCGCCGTTCAGGGCCGTGATTCCCAAGATGCGGACGCACCTGTCGAGGGAGCCGAATGATCGACTTCAACGACACCACCCAACCTGCGGAGCACAACAGGGAATCTGAACGAGACGAGATTCGCGCCGACTTGCTTGCGCGTCTGGAGTCGGTGCTGACCACGATGTTTCCGGCTGGCAAGAAGCGCCGTGGCAAGTTCCTGATCGGCGACATCCTCGGCAGTCCAGGTGACAGCCTCGAGGTGGTGCTGGAAGGTGAGAAGGCCGGTCTGTGGACGGATCGTGCCACCGGCGATGGCGGCGACATCTTCGCCCTGATCGCGGCCTATCTCGGTGCGAACGTCCACACCGATTTCCCTCGCGTGCTGGATGAAGCTGCCGATCTGCTCGGGCGGTCGCGGTCGGTGCCAGTGCGCAAGGCGAAGAAGGAAGCGCCTGTAGACGACCTCGGCCCGGCCACGGCGAAGTGGGACTACTTCGATGCCGGTGGCAAGCTGATCGCCGTCGTCTACCGCTATGACCCACCGGGAGGCAAGAAGGAATTCCGACCGTGGGACGCGAAGCGCCGCAAGATGGCCCCGCCTGAGCCGCGCCCGCTGTTCAACCAGCCGGGCATCGGTGCGGCCAGCCACGTCGTCCTGGTCGAGGGCGAGAAGTGCGCGCAGGCCTTGATCGCCAGCGGCGTGGTGGCCACCACCGCCATGCACGGTGCCAATGCCCCGGTCGACAAGACCGACTGGTCGCCACTGGCTGGCAAGACGGTGCTGATCTGGCCCGACCGCGATGCGCCAGGGTGGGACTACGCCGACCGCGCGTCGCAGGCGATCTTGCAGGCAGGCGCGACCTCGGTCGCCATCCTCATGCCACCCGACGACAAGCCGGAGGGGTGGGACGCTGCAGATGCCATTCCCGAAGGTTTCGATGTCGGTGGCTTTCTGGCCGTCGGCGAGCGGATGCCGGTGATGCGCTCGGTGGAGGAAGCGCCTTCGCCAGACTTGCTGACGGGCATTGATTGGACGACCGAGGATGGCCTGTCCAGCGCTTTCACCCGCCGCTATGGCGAAGACTGGCGCTACTGTGCCCTGTGGGGCAAGTGGCTGGTCTGGACGGGTGTGCGCTGGAATCCCGATCAGGTGCTCTACGTGTCGCATCTTTCCAGGGGCATCTGCCGTAACGCCTCGCTGAAAGCGGACACGCCGAGGCTCAAGGGCAAGCTGGCCAGTTCGGCCACGATTTCGTCGGTTGAAAAGATCGCGCGCTCTGACCCGAAGCACGCATCCACCGCCGAGGAATGGGACGCCGATGTCTGGGCGTTGAACACCCCCGGTGGCGTGGTCGATCTGCGCACCGGCCGGATGCGCCCGCACCGGCGGGACGACCGAATGACCAAGGTGACCACG
GCTACCCCGCAGGGCAATCCGGACAGTGCCTGCCCAACGTGGCGAGGGTTCCTGACAGACGTCACCGGCGGCGATGCCGATCTGATGGCCTACCTGCAACTGATGGTTGGCTACTGCCTGACGGGCGTCACCAGCGAGCACGCGCTGTTCTTCCTGTACGGCACGGGCGCGAACGGCAAGTCGGTGTTCGTCAACGTGCTAACCACCATCCTGGGCGACTACGCGGCCAACGCCCCGATGGACACTTTCATGGAGGCGCGCAATGACCGACACCCCACCGATCTCGCCGGGTTGCGCGGTGCACGATTCGTGTCATCCATCGAAACGGAGCAAGGGCGGCGCTGGAACGAGTCCAAGGTCAAGGCCATCACCGGTGGCGACAAGGTGTCCGCGCGCTTCATGCGCCAGGACTTCTTCGAGTACCTGCCGCAGTTCAAGTTGGTGATCGCGGGCAATCACAAGCCGTCGATCCGCAACGTCGACGAGGCGATGAAGCGTCGACTGCACCTGATCCCGTTCACGGTGACGATCCCGCCCGAGCGCCGCGACGGCAGGCTGACCGAGAAGCTGCTCAAGGAACGCGATGGGATTTTGGCGTGGGCCGTCGAGGGCTGCAGCCGCTGGCAAAGCCAGGGCTTGAAGCCGCCCGCCAGCGTGGTGTCGGCGACCGAGGAGTATTTCGAGGCCGAGGACGCGCTCGGGCAGTGGATCGAAGAACGCTGTCTGCTGGCCAAGTCGCACCGCGAAGGTGTCTCCGAACTGTTCGCCGATTGGCGTGAATGGGCCGAGCGCGCTGGCGAGTACGTGGGCTCGGTCAAACGCTTCTCGGAGCTGATGGCGACTCGCAAGTTCGACAAGTGTCGGCTGACCGGAGGGGCTCGCGCCATCGCGGGCATCGCCCTCAGGCCCAAGCCGTACAGCCACGCCTACCCCTACCGCGATGACTGATCAATCCGGTCGAGTGACGGATTTGACGGGTTTCCTGATTGACGCGCTACACGTGCGCGCACGTAAAGGGCGTTGTCCTGACAAACCGTCGCATCCGTCACTCGCCCACCCAACACGGAGTAAAGACGATGAAAACGACGATCCTCGCCCTCGATCTGGGCACACACACCGGGTGGGCTCTGCAGCACCTGGACGGCACCATCACCAGCGGCACGGAGCACTTCAAGCCGCAGCGATTTGAAGGCGGCGGGATGCGTTTCCTTCGATTCAAGCGCTGGCTCAACGAACTGCTGTCGGTCAGCAATCACATCAACGCGGTGTTCTTCGAGGAAGTTCGGAGGCACGCTGGCGTTGACGCAGCGCACGCCTACGGCGGATTCATGGGGCACCTGACCGCGTGGTGTGAACATCACAACATCCCCTACCAGGGCGTTCCGGTCGGCACGATCAAGAAGCACGCGACCGGCAAGGGCAATGCGAGCAAGGACGAAATGATCACGTCCGTCCGCGAGCGTGGTCACACCCCAGTCGACGACAACGAAGCCGACGCGCTGGCTCTGCTGCACTGGGCAGTCGAGACGCAGGAGGTGTGACGTGAAGGTTTCGACACCCCAATACCGCTGCCCCCTTGGTCGGCTGCAACCCCAGACCACCGATCTGGACGCCATCAAGGAACGTGGCTGGCGTGACCAGCACATCCTGGTGGTCAACGCGTCCGACGACCGTCTGGACTTCATCGAGCGCGAGATCGTGCGACGCATTGGTGAACGCCTGTACGGGCTGGGAGGGACGCGTCATGGCTGAGTGGACAACCGACGACGTGGCAGCACGCTTCGAGGAGGCCGCCACCACCGGACGACGCTTGCCCCCTGTACGTGTGCAGGGCTACTTCAACTGCTGGCCTGCCTTCGTCCGCAAGGAGTGGGAAGCCTTTGCTGCTGACGAGAAGGTGTATCGCCCCTTCCCACCAAGCCCCGAGGCCATCGACCGGATGCTGGAGACGATGCGCTGGGTGCAGTGGCTCGAGGTCGAGCAGCGACACCTCGTGTGGA
TGCGGGCCAAGCGCTACGGCTGGAGGGACATCACCATTCGATTTGCCTGCGACCGCACCACGGCGTGGCGGCGTTGGCAGAGGGCAATGGAGATCGTGGCCACGAACCTCAACAGCGAAGGCGTGCGGTTGCCTTCCAAAAACGTGGGCAATTTAGGGTAATGCTTGCCGCGCTTGTCCCTGCTTTGCCTTGATTGTCCGTTTCGAGGCCCGGCAGCCCTGCAACAAAACAGCCCGGTCGGGGGTAGTATTTCGGCTATCTTCTGGACAGCGGTGACGGTTGAGGCGATGGGCCCAGGCAAAAGGGGTCCTTCCTTCCCGAATCGCAATGCGGGGGGCGCGAGCGCGGCATTCGCCTAGCGTCCGACTGCAAACCAAGGTTTGCAGGGTTTGCAGTTTGCACCCGCACCAGTCCGCACCCATCACGAGCCCGCCCACGGTTTTCCGTCGGCGGGTTTTCTTTTTGAGGAAACGATTCTGAACACGCTTAACGTTGAGTACCGCAAGGTCGAGGCGCTGATCCCCTACGCCCGCAATCCACGCACTCACACCGACGAGCAGGTGGCCAAGATCGCCGCCAGCATCGTCGAGTACGGCTGGACGAATCCGGTGCTGGTGGACGGCGACAACGGGATCATTGCGGGCCACGGTCGTTTGGCCGCCGCGCGCAAGCTCGGGCTGGATCAGGTACCGGTCATCGAACTGGCGCACCTCTCACCCACCCAGAAGCGTGCCTACGTCATCTCCGATAACCGGCTGGCGCTCGACGCCGGTTGGAACGAGGAGATGCTGGCGCTGGAAATGGCCGAGCTGTCCGAGGCCGGGTACGACCTTGCACTGACCGGTTTCGAGGATGCTGAGATCGAGGCCTTGCTCGCTGACGAAGTCGCCTCCGATGCCGCCGACCAAGAGCCCGATGCCGACGAGCCGGACGATGGCGACGATGTGCCGGATAGCCCAGTGGTGCCGGTGTCCCGCACCGGCGATTTCTGGGCCATCGGTACCCACCGTCTGATCTGTGGCGACGCCACCGACCCGACCGTGGTCGCCACTCTAATGCAGGGTGATGCGGCCCGGCTGTGCTTTACATCACCGCCTTACGGCAACCAGCGCGACTACACCTCCGGCGGCATCACCGATTGGGATGGCCTGATGCGCGGTGTGTTCGCCAAGGTGCCAATGGACGACGACGGGCAGGTGCTGGTCAACCTCGGGCTGATCCACCGCGACAACGAAGTCATCCCGTATTGGGATGCGTGGCTGGGCTGGATGCGCACGCAGGGTTGGCGGCGCTTTGCTTGGTACGTCTGGGATCAGGGGCCGGGGATGCCCGGAGACTGGGCTGGTCGTTTTGCGCCGAGTTTCGAGTTCGTCTTTCACTTCAACCGCTCTAGTCGCAAGCCCAACAAGATCGTGCCCTGCAAGCACGCGGGCCAGGAATCGCACTTGCGCGCCGACGGGTCGTCCACGGCCATGCGCGGCAAGGACGGCGAAGTCGGTGGCTGGACGCACAAGGGCCAGCCGACGCAGGACACCCGGATTCCCGACTCGGTGATCCGCGTGATGCGGCACAAGGGCAAGATTGGTCAGGACATCGACCACCCGGCTGTGTTCCCGGTGGCGTTGCCCGAGTTCGTGATCGAGGCCTATACGGACGCAGGCGACATCGTGTTCGAACCTTTTGGCGGCAGCGGTACCACGATGCTGGCCGCGCAGCGCAAGGGTCGTGTGTGCCGCTGCGTGGAGATCGCGCCGGAGTACGTGGACGTCGCCATCAAGCGCTTCCAGCAGAACCACCCCGGCGTGCCCGTCACGCTGCTGGCCACAGGCCAGTCCTTCGACGATGTGGTCAATGAACGTCAGGCCACCACGGAGGTAGAGCAATGACCGCCTCCTGGTTTGCCGACAAGATCGAAAAGTGGCCGACTGCCAAGCTGCTGCCCTATGCCCGCAACGCGCGTACTCACTCGGACGATCAGGTGGCGCAGATCGCCGCGTCGA
TTGCCGAGTTCGGATTCACCAATCCGATCCTGGCGGGCAGCGATGGCGTGATCGTCGCCGGTCACGGACGGCTTGCTGCTGCGCAGAAGCTTGGGCTGGCGGTGGTGCCGGTGGTGGTGCTCGATCATCTGAGCCCGACACAGCGCCGGGCCCTGGTGATCGCAGACAACCGCATCGCCGAGAACGCGGGCTGGGACGATGCGATGCTGCGCATCGAGATCGCATCACTGCAGGACGACGACTTCGACGTGTCGCTGACCGGCTTCGATGCAGATGCGCTGGCCGAATTGATGGCGGGCGACGAGCCGGATGGCGAAGGCGAAACCGATGACGATGCCGTGCCCGAGTTGTCGGAGACGCCGATCTCTCGTCCGGGTGATGTCTGGTCGCTTGGCGGCCACCGGCTGCTGTGCGGGGACTCCACCGTGACTGAGAGCTACGACAGGCTTCTCGATGGCGAGCAGGTCGACATGGTGTTCACCGACCCGCCGTACAACGTGAATTACGCCAACAGCGCCAAGGACAAGATGCGTGGCAAGGACCGCGCGATCCTGAACGACAACCTCGGCGACGGCTTCTACGACTTCCTGTTGGCGGCGCTGACGCCGACCATCGCGCATTGCCGGGGCGGGATCTACGTGGCGATGTCGTCCAGCGAACTGGATGTACTGCAGGCCGCATTCCGCGCCGCCGGTGGCAAGTGGTCGACGTTCATCATCTGGGCCAAGAACACCTTCACGCTGGGCCGTGCCGATTACCAGCGCCAGTACGAGCCGATCCTGTACGGATGGCCAGAGGGCGCGCAGCGTCACTGGTGCGGCGACCGCGACCAGGGCGACGTCTGGAACATCAAGAAGCCGCAGAAGAACGACCTGCATCCGACGATGAAGCCGGTGGAGTTGGTCGAGCGCGCGATCCGCAATTCGAGCCGACCGGGCAACGTGGTGCTCGACCCGTTCGGGGGCTCCGGCACGACGCTGATTGCCGCCGAAAAGTCAGGACGGCTGGCACGGCTGATCGAACTCGACCCTAAGTACGCGGACGTGATCGTGCGCCGCTGGCAGGAATGGACTGGCAAGCAAGCCACCCGTGAGTCGGATGGCGCGCTGTTCGATGATCAGGCGGCGATCGACTCTTCCGCGATCTCGCAATGAATCACGAACCCCGTCAGGTAAGGCAGGCCGCGCGGGATGCCGTACTGCTTGCTGGTCTGGCGGCCAATCGTCCAGCCCATCCACTGTTGGGTGGCGGCGTTGATCGCGTCCGCCAGGGTCTGGCCCCGGTACAGCCCGTTTTGCAAATCGTCCGCAAAGTGGCGGCCGTGGCGACTGTCGAGGAAGACGCGGACTGATTCGAGGGGCTGACCGGTAGCGTCGGAGATGGCGGTCATCGCCAGGGGCCACGCGGTGCTGGCGTGTTCGTTCATCGTGCCCCAAAAGCCCCAGGCATCGTTCTGGGTGGCGGGCATTTGCTGGTTGGTGTTCATCTCTGGCTCCTTGGGGTTGATCGTTGCGACACCCGTAGTAACGCGCTGTTCGATTGAGAAGCCAAGCTGTTCTTGGCCTCTTTCTCAATCAATTTCGATTACCCGAGACGGGCCACGTACCGGGCGTAGTCGCCGCCCTCTGGATTCACGTAAAGGTAGGGGCGACCCGGTGCGGTGACCTCGACGCAAAGATAGCCGTCGCCGGTGCCGCCACCTTTGCCGCGCAGCCAGTCGCGCGATACCAACAGGCTGCGGGCAAAGGCATCGAACTCGTCGACGGTCAGTTCCTTGGTCTCGGTGACATAGACCTTGGTCTGACCCTGGCCGCCAACTTCGTCCAAGTCGGCAGGCTTACGGGCAAACGGCAATCGGACGCTCAACTCCTCGACCTGGAAGGTGGTGTCGCCAAACTGCAGGGTGCGCGGGGTGCGTTCGATGGTGATGGTCATGGTGCTCATGAATGTTCTCCTGGGTGTTGGCGTTGCGATCAGGCTTCTGCGGCGATCCGGTAGAC
CCGCTCGCTGCCCTGGGCCTTGTCCGAGACGATGGTCAGCCCGAGCTTCTTCTTGAAGGCACCGGCAAAGGTGCCGCGCACCGTGTGCGCCTGCCAGCCGGTGGTCTCGCAGATCTGCTGCACCGTTGCCCCTTCGGGGCGCTGCAGCATCTGGATCACCGTGGCCTGCTTGCTGTTCTCGCGGGTGCGAGGTTTGGCGGCCGCCTTTTCTTGCGCCCACGCGGCCTCGGCTGCCGTCACGGCTGCGTCGAGTTCAGGGTCTGCGGCCACTGGCGCAGGCGTTGGCCGGGCGCGCCCCATCGCGTCGTAGCCCTCGGCGGCGACGAACCAGTGGGTGCCGTCGGAGGTGATCAGCGCGCGGTTGAACAGGCCGTCGAGCACCTTCTTGCGTGCGCCGCCTTTGATGTTGTCGGGGAACCAGTCGATCTTGCCGTCGGTGTGTTCGAGGGCGTAAGCCAGGATCGCGTGCTGGGCCGGGGTCAGTTGGGTGGTGGTCATTTGCTTCTCCTTGTGCAAGGGGTTGATGGGGTGACGTGATGAACGCGCTGTTCGGGAGTGAAGCCAAGCGTTTTCTGCTTGGCTTCGAAGGTTCTTGATCAGCTGTTGGCCTTGTCCGACTTCGTCGCCTTGCGGCCTTGTTCGACGCCTGCGTTGAACGCGGCCTCCAGGGCGTCGCGCAGGCACCAGACCGCCACGTCGTGGAAGTCGAGGCTGTCTGACTTGCGGGTTTCCAGGGTTTCGATGCCCAGCTTGTTTTGTGCGATCTGGGTCAGGAGTTGTTCGAACTTGCTCATTGCTGCTTCCTTTGATGGTGTTGATGACGTCCGTATGAACGCGCTGTTCCAGAGAGAAGCCAAGCTGATTTCGAGTGAAGGTCGAAAAAATGATTGAAGGGGTAACCGGTTCTCAAAATGGGCATTTCGATTCGCGCTTACGCCCGTCACCGTGGTGTGACCGACACCGCTGTTCACAAGGCAATTCGCGCAGGTCGGATCACGCCGGAGGCTGACGGCACCATTGATGCCGACCGTGCTGATCGCGAGTGGGCTCGCAACTCCGATGTGCCGAAGACCGGTACGCGGGCCAAGGCCGCAAAGGTCGCCGTGCCGGAAGGCGGTACGGGTGTTGGCGGTGATGGGCCCGCCGCATTACCCGCTGGCGGCGCGTCCTTACTTCAGGCGCGCACGGTCAACGAGGTCGTCAAGGCGCAGACGAACAAGGTGCGTCTGGCCCGACTGAAGGGCGAGTTGGTGGATCGGCCGCAGGCCATCGCCCACGTCTTCAAGTTGGCGCGCTCCGAGCGTGATGCGTGGCTGAACTGGCCCGCGCGCATCTCTGCGCAGATGGCGGCCAAGCTCAATATCGATCCGCACACGATGCACGTCGCCCTGGAGGCGGCGATACGTGAGCACCTGCAGGAACTGGGCGAACTCCGGCCCCGGGTGGACTGATGCTGAATGTTGAATACGAAGGCGCTGCCGAAATCGAGCGCGCGTGGCGTGAAGGGCTGACACCTGATCCTCTGCTCTCGGTCTCTGAATGGTCGGATCGCCACAGGATGCTATCGAGCAAGGCGTCCGCTGAGCCTGGGCGCTGGCGCACCAGCCGCACGCCGTACCTGAAGGCCATCATGGACTGCCTGTCGCCGACCTCGCCGGTCGAGCGCGTGGTGTTCATGAAAGCCGCACAGCTCGGTGCGACTGAAATGGGCTCGAACTGGATTGGCTATGTGATTCACCACGCACCGGGGCCGATGATGGCGGTGTGGCCAACGGTGGATATGGCTAAGCGCAATTCCAAGCAGCGGATCGATCCGTTGATCGAGGAGTCGGCGGCACTGAGCGAATTGATCTCCCCAGCACGGTCACGCGACTCGGGCAACACCATTCTGGCCAAGGAGTTCCGGGGCGGCGTGCTGGTGATGACCGGGGCGAACAGCGCGGTGGGCTTGCGCTCGATGCCGGTGCGCTACCTGTTCCTCGATGAGGTTGACGGGTATC
CGCTGGACGTCGAGGGTGAAGGTGATGCGATCTCGCTGGCCGAGGCGCGCACGCGAACCTTTGCCCGGCGCAAGATCTTCATCGTGTCGACGCCGACGATCTCGGGGGCGAGCGCCATCGAACGCGAGTACGAGGCCAGTGACCAACGTCGCTACTTCTTGCCTTGTCCGCACTGCTCGCATCGCCAATGGCTGCGCTTCGAGCAGTTGCGATGGGAAAAGGGGCAACCGGACACGGCGTCCTACATCTGCGAGTCCTGCGATAAGTCGATTGCCGAGCACCACAAGACCTGGATGCTGGAGCACGGTGAGTGGCGCGCGATGATCAGCGACGGCACGGGCAAGACAGCGGGGTTTCACCTGTCGTCGCTTTACAGCCCGGTTGGCTGGCGCGGTTGGCGCGACATTGCTGCCGCGTGGGAAAGCTCTGTGAACAAGGAATCGGGGTCGGCGGCCGCCATCAAGACCTTCAAAAACACCGAACTGGGTGAAACCTGGGTTGAGGAAGGCGAAGCGCCAGATTGGCAACGGCTGGTCGAACGCCGCGAGGACTACCGGGTTGGCACGGTGCCGCCGGGTGGGTTGCTCCTGGTGGGCGCTGCCGACGTGCAGAAGGATCGCATCGAGGCGTCCATCTGGGCCTTCGGGCGCGGCAAGGAGTCCTGGTTGGTCGAACACCGCGTGCTGATGGGCGACACCGCCCGAGACGCCGTGTGGAAGCGACTCGCCGAGTTGCTCGCCGAAAACTGGACGCACGCCTCGGGCGCGGCGATGCCGCTGGCCCGTTTCGCTCTGGACACCGGCTTTGCGACGCAGGAGGCCTACGCCTTCGTGCGGGCCTGCCGTGACCCGCGCGTGATGCCGGTCAAGGGGGTGCCGCGCGGCGCGGCCCTGATCGGCACGCCGACGGCCATCGATGTTTCGCAGGGCGGCAAGAAGCTGCGCCGGGGCATCAAGGTGTTCACGGTGGCGGTTGGCATCGCCAAGCTGGAGTTCTACAACAACCTGCGCAAGGGCGCGGACGTCAGCGAGGACGGCGTGACCACCGTCTACCCGACGGGGTTCGTTCACTTGCCAAAGATTGACGCGGAGTTCATTCAGCAGCTCTGCGCCGAACAGTTGATTACCCGTCGCGACCGCAACGGCTTCCCGGTGCGCGAATGGCAAAAGATGCGCGAGCGCAATGAAGCGCTCGATTGCTACGTGTACGCCCGCGCGGCCGCATCGGCGGCGGGCCTGGATCGCTTCGAGGAACGCCACTGGCGCGAACTGGAACGCCAACTCGGGATGGAACGGCCACCGGATGAGCCACCCCCGATTCAAGCATTCGACCCAAACGAGGCCACCCAACGCGGTGGCCTCTCTGTTTCTGCAAACCCACCACGGCGGCGCGTCATCAAGAGCCGCTGGTTGTCCTGATTTTCAGAGGAGTTTTCATGAGTCTTGCCACCCGTATCGAGAGCCTGGTCATCCGGGTTGCCCAGGAGTTCAACGACGTCCGCGCGACGGCAGGCAGTCTGGCCAGCCTGTCCACCAACGACAAGTCGAGTCTGGTCGCCGCCATCAACGAGCTCAAGGCAGCGGTTCTGTCCGCGATGGCCATCGATGACAACCAGATCGCCACCACCAGCACCTACTCGTCGAACAAGATCGTGTCGCTGCTGGACGCGCTCAAGACCGACATCCTGGGCGGAGCCGATGCTGCCTACGACACCCTGGTGGAAATCCAGCAGGCGCTGCAGAGCGGTACCAGCGGCCTGGACGCGATTCTGGCTGCGGTCAATCTCCGTGTCCGCTTCGATGCGGCGCAGACCCTGACCGTGGCCGAGCAACTGCAAGCACGTACCAACATTGGTGCGGTCGCTGTCAGTGATGTCGGCAACACCGACACCGATTTCGTCGTGATCTTTGACGGCGCGCTGGCCTGATGAGCCTCGCTTCCAGCATCGCCGCTTTGGCGGCGCGCATCGGCTTCGAGGTCAAAACCAAGATCGACGCCAC
GCATCCCGGCATTGCCCGGGTGTGGGTCAGCTTCGGCTACGTGGGCGGTCAGGTCGTGATCGCCAGCGCGCACAACGTCGCCAGCGTGGTGCGCACGGCGGCGGGCCGGTACCGCGTGCATTTCGCTGTGGCGATGCCGGATGCGAATTACTGCTGGACGGCGCTCGCGCGCAGCAGCACCAACACCGGTCAGCAGCGCTTGGCCCTGGTACGTGCCAGCTCCGACCTGAAGACCGCGCAGTACGTCGACGTCTCGTGTGCGACGGCCGCGTCGTCGTTTGACGACTCCTCTGAAATCAACCTCGTGGTGTACCGCTGATGGCCTACACAGAAGCCCAACTCCAGGCATTGGAGACCGCGCTCGCCAAGGGCGAACACCGCGTCAGCTTCGGCGACAAGACCGTCGAGTACCGCTCGGTCGATGAACTGAAAGCTGCGATCCGCGAAGTCAAGCGCGGCATCCTGGAGCAGGCAGCCGCCACCGGACTATGGCCGGGTGCGCCGCGCCAGATCCGGGTCACGACCTCGAAGGGGTTCTGATGGCCTGGTATTCGAAGATCCGAAGCCTGTTCGGCCAGCAACCCGTCCACGAAGCGGCTGGCCGTGGTCGCCGCTCGTTGGCTTGGATGCCCGGCAACCCGGGCGCGGTCGCCGCGATGCTGGCGACCAACACCGAACTGCGCATCAAGAGCCGCGACCTCGTGCGCCGCAACGCGTGGGCGCAAGCCGGTATCGAGGCCTTCGTGTCCAACGCGGTCGGCACTGGCATCAAGCCGCAGAGTCTTGCTGCAGACGAGCGCTTCAAGACCGACGTGCAGGCGCTGTGGCGTGACTGGACAGAAGAAGCCGACGCCGCAGGACAGACCGATTTCTACGGCCTGCAGGCATTGGCCTGTCGCGCGATGCTCGAAGGCGGTGAATGCCTGATCCGGCTGCGCCCGCGCCGCCCGGAGGACGGACTGGTCGTTCCTCTGCAGCTTCAGTTGCTGGAGCCCGAGCATCTGCCGATCAGCCTCAACCTCGATCTGCCTTCGGGCAACGTGGTGCGCTCTGGCATCGAATTCGACAGCCTCGGGCGGCGCGTCGCTTACCACCTGTACCGCTCGCACCCCGAAGACGGTCGGCTGGCTCCGATGTCGGGCCAGGGCGGGATGGACACGGTGCGCATCGATGCGAAGGAAATCATCCACCTGTTCCGCGTCCTGCGTCCCGGCCAGATCCGGGGCGAGCCGTGGTTGTCGCGGGCCCTGGTCAAGCTCAACGAACTTGACCAGTACGACGACGCAGAACTGGTGCGCAAGAAGACCGCCGCGATGTTCGCCGGGTTCGTGACACGGCAGAACCCGGAGGACAACCTGATGGGTGAAGGTGCGGCCGATGGCGATGGCATTGCGCTCGCCGGGCTGGAACCGGGCACTTTGCAGATTCTGGAGCCCGGCGAGGACATCAAGTTCTCCGACCCGGCCGACGTCGGTGGCTCGTATGGCGAGTTCCTGCGCACGCAGTTCCGCGCGGTCGCCGCTGCCATCGGTGTCACCTACGAGCAGTTGACCGGCGACCTCACAGGCGTGAACTACTCGTCCATCCGCGCCGGGATGCTGGAGTTTCGGCGTCGCTGCGAGATGGTGCAGCACGGGGTGCTTGTGCATCAGATGTGCCGTCCGGTTTGGGCCGCGTGGATGAAGCAGGCAGTGCTCGCCGGTGCCATCGATGCTCCCGGCTTCGCGCGTGGCGGCCCAGCCCGTCGCCGCCGGTACCTGCAGGTGAAGTGGATTCCACAGGGCTGGCAGTGGGTCGATCCTGAGAAGGAGTTCAAGGCCATGCTGCTGGCCATCAGGGCGGGACTGATGAGCCGCTCGGAAGCCATTTCCGCCTTTGGCTACGACGCCGAGGACGTTGACCGCGAGATCGCCGCCGACAACCAGCGCGCCGACGACCTGGGGTTGATCTTCGACTCCGACCCGCGCCGCACCTCCAAGGACGGCGGAAGCGCCGAGCCG
AACAAGAACGCTGCCGACACCACGCAAACCGGCAGCTCATCGTCTGCCTGAAGGATTTCCATGACCCTGTTGCCCCATTTGGCGGCGCGCCTCTACGGTGTGCCGCTGGCGATCCATCGCCCAAAACTTGACGTGATCCTGGCCGTGCTCGGCCCCCGGATCGGCTTGGCTGATTTGGCTGCACCCTCGGGCTTCACGCCGCCCGCACGTCCCGCATCCACCCAGACGACGAAGGTCGCGGTCATCCCCATCCACGGCACGCTGGTGCGCCGCACAGTGGGCCTGGAAGCCGAATCCGGCTTGACCAGCTACGCAGGGCTGACCGCGCAGTTGGACGCCGCGCTGGCCAGCCCGGATGTCGCTGCCATCCTGCTCGATGTCGACTCACCGGGTGGCGAGTCGGGCGGCGTGTTCGATCTGGCCGACCGCATCCGTGCGGCTGCTAAGACGAAGCCGGTCTGGGCTGTAGCCAATGACATGGCGTTCTCGGCAGCTTACGCCCTGGCGTCTGCGGCCAGCAAGGTGTTCGTGTCGCGCACCGGCGGCGTCGGCTCGATTGGCGTCATTGCGATGCACGTCGACCAGTCCGAGAAGGATGCGCAGGACGGCGTTCGGTACACGGCGGTCTTTGCGGGCGACCGCAAGAACGATCTGAACCCACACGAGCCGATTTCCAGCGAAGCCCACGCCTTTCTCAAGGGTGAGGTGAATCGCGTCTACGGCCTGTTCGTCGAGACGGTGGCCCGCAACCGTGGCATCGAGGCATCTGCCGTGCGCGACACCGAGGCGGGGCTGTTCTTCGGGCAGGCCGCCGTGGCTATCGGGTTGGCCGATGCCATCGGCACCTTCGACGACGCCCTTGCGCAGCTTTGCGAATCCGTTTCCCCACTCCCGAAGTTGGCGGCAAGCCACTCCGGTCTTTTTAGCAACCCCCAGATGGAGTCATCAATGAATGATCGAACCGACCCCGCTGCTCCTGATCGGCTTGCTGCTGATCCTGCTGGCAGTCCTTCTCAACCGGCGGCCGCCACCGCCATGACCGTGGCTGACGCGATTGAGGTCGCCCAGACCTGCACCCTGGCCGGGCGCACCGACCTGATCGCGGGCTTCCTCGAAGCGAAGGCACCACCCGCCAAGGTACGCAGCCAGTTGCTGGCCACCCAGGCCGAAGCCAGTCCCGAAATCGTCAGCCGCATCGACCCGCAGTCGGCCATGTCGGCGAGTAGCACTGGCCATCCTGCCTCTTCCCACAACCCTCTGATCCAGGCCGTCAAAAGTCGCCTGGGCACAAAGTAACCCAAAAAGGAGCATCCCGTGCCCGCAATGCAAGAACCAATCAACCTCGGCGACCTCCTGAAGTACGAGGCGCCCAATCTCTATTCGCGCGACCGCGTGACCGTGGCAGCTGGCCAGACCTTGCCGCTGGGTACGGTGCTCGGGCAGATCACGGCGACGGGCAAGGTCAAGCAGATCGACCCGTCGGCCACCGATGGCAGCCAGTACTCCGCTGGTGTGCTGATGCAGGACGCCGATGCTGCTCTCGCCGACCGCAACGACGGGCTGATGGTGGCGCGTCACGCCATCGTGTCAGACCACGCACTGCATTGGCCCACCGGCATCACGACTGCGGAGCAGCAAGCAGCGATCCAACAACTCAAAGCACTGGGCGTCCTGGTGCGTATCGGCGCCTAACGCCAAGGAGACTCAATATGCAAAACCCATTCATCAGTCCGGCATTTTCGATGGCATCAATGACTGCAGCCATCAACTTGATCCCCAACCGCTACGGACGCCTGGAGGAGTTGAATCTGTTTCCGCCCAAGCCGGTTCGAACGCGCCAGGTGATTGTTGAAGAACGCGCCGGTGTCCTGAACCTCCTTCCGACCCAGCCGCCAGGCTCTCCGGGAACAGTGAATGTGCGTGGCAAGCGAACCGTCCGGTCCTTCGTCGTTCCGCACATTCCGCACGACGACGTTGTGTTGCCCGAAGAGGTTCAAGGT
CTACGTGCTTTTGGCAGCGAAACCGAAATGGAGTCGATTGCCGGAGTGCTGGCCCAACACTTAGAGACGATGCGCAACAAGCACGCCATCACCCTAGAGCACTTGCGTATGGGGGCGTTGAAAGGCGAGATTCTCGACGCCGACGGCAGCCGTATCTACAACCTGTTTGACGAGTTTGGCATCGATCAACAGAGTGTGGACTTCGAAATCAGCAGCCCGACTACTGGCACTGATGTCAAGGGCAAGTGCACTGATGTGTTGGGCATCATCGAAGAAGCCCTTCTCGGCGAGTTCATGACGGGAGTCCACTGCTTGTGTTCTCCAGAGTTTTTCAAGGCATTGACCGGCCACAAGGATGTCAAGACTGCCTTCACGAACTGGCAGCAAGGCGCCGTCCTTATCAATGATGTTCGCCGTGGCTTCACTTTTGGCGGCATCACTTTCGAGGAGTACCGAGGTAAGGCGACTGATGTCAACAAGACGGTTCGTCGCTTCATCGCTGCTGGCGAAGCACATGCGTTCCCTCTTGGCACTATCGACACCTTCGGAACTTACTTTGCACCGGCCGACTTCAACGAGACTGTCAACACGATGGGCCAGCCGCTTTATGCGAAGCAGGAGCCGCGCAAATTCGACAGGGGCACAGATCTGCACACGCAGGCCAACCCGCTACCGATGTGCCATCGTCCCGGGGTTCTGGTCAGGCTCGTCATGGGTGGTGGCGTATGAGTTTGGTCGCCCAGATCTATGAGTCGGCCGCGAACGCTGGGCTGCTGAAGGAATGCCTTTGGTATCCGTCGAACGGTGCGCCATCGCAACTACATCAGATCGGCTTTGCCGCGCCCGATGAATCACTGCTCGATGGCCTGGCCCTGAGCACCGACTACGAGATGACCTACCCGGTCACGGCATTCGGGGGTCTTGCAGTCCGCGAGGTTGTCGAAATCGGTGGCACGTCCTTCCAGGTGCGAGACATCCGATCGTTAAGCGACGGCTCCGAGATCCGCGCCAAGCTCACCCGGCTGTAAACCCATGGCAGATAACTCGATCCGCGAGCGGATTCTGCTGGCGGTGATGGCGGCTGCCCGTCCGGCGGTCGAAGGTCTCGGGGCCACTTTGCACCGGTCGCCCACGGTGGCCATCAGCCGCGAACTTTGCCCGGCGCTCGCGGTGTTTCCCGAGTCGGAGTCCATCACTGAGCGCGCCAACGACCGCGTCACACGCGAACTGACCGTTCGCGTTGTGGCTCTGGCTCGGGCCGTTCCACCCGCGTCCCCCGAAACCGAGGCCGACCGTCTGCTCACCGCTGCCCACGCTGCCTTGTTCGGGGACGGCACGTTCGGTGGGTTGGCGCTGGGCATCCGTGAACAAGAGAGCGAGTGGGAGGTCGAGGACGCCGACGCGGTGGCCGTGGCCCTCCCGGCGCGCTATCGGCTGACGTACCGGACGCTGGCCAATGACCTTTCAACTCTTGGATGACACCTATGACCCAACTTGTCCTGACGCGCCCGCACACCCACGCGGGCAAGACCTATGGCGTCGGTGACCGGATCGAGATCGACGCGACATCAGCCGACTGGCTGATCGCGCACGACATCGCCACGCCGGAGCCGACCGCCCCAACTGCTGAACCCGTCCCCGAACCCAAACCCCTCCAACGCAAGGAACCCAAGCAATGAGCACCTATGCCAGTTTTCAAGGCCGCGTCTTCCTCGGCAAGCGCGACACCGACGGCCTTCCCATCGAAGTGCGCTCGCCCGGCAACGTCGCAGAGCTGAAGCTCTCCCTCAAGACCGACGTCCTGGAGCATTACGAGAGCCAGACCGGCCAGCGCTCGCTGGATCACCGGATGGTCAAGCAGAAGTCCGCCACCGTGAACCTCACCATCGAGGAATTCACCAAGGAGAATCTCGCGCTGGCCCTGTACGGCAACCACGTCGTCGGCACGCCGGGCACGGTCACCGCCGAGCCAGTGGGCGGTGCCACGCCGATTGCGGGCG
ACCGCTACTTCCTTGCCCACCCGAAGGTATCGTCCTTGGTCGTGACGGATTCGGCTGGCACGCCCGCGACCCTGGCCTTGGGCACGAACTACACGGCTGATCCCGACTTCGGTGCCCTCCAGTTTCTGGATACCACCGGCTTCACTGCGCCGTTCAAGGCCAGTTACGCCTACGGTGTGGCCACCGAGATCGGCATCTTCACGCAGGCGCTGCCGGAACGCTTCCTGCGGCTCGAAGGCATCAACACGGCCCAGGGCAATGCCAAGGTGCTGGTCGAGCTCTACCGCGTGGCATTCGATCCGCTGAAGGAAATCTCCTTCATCTCGGACGAGTACAACAAATTCGAGCTGGAGGGATCGCTGCTGGCCGACACCACCAAGCCCTTCGACGCGGTGCTGGGCCAGTTCGGCCGCATCGTGCAACTGTGATGGGTGCCGCCATGAGTGATCTGGACACCCTGATTCCGCAGGCGGTCGAACTGGTGATCGACGGTGAGCCGCTGGCCATCAAACCGCTGAAGGTCGGGCAGATGCCCGGTTTTCTGCGAGCGATGTCGCCGGTGATGCAGCAGCTCACTGCCTCCAACATCGACTGGCTGGCGTTGTTCGGCGAGCGCGGCGACGACCTGCTGTCGGCCATCGCCATTGCCGTCGGCAAGCCTCGGGCGTGGGTCGATGAGCTGGCTGCCGACGAGGCCATCCTGCTGGCGGCCAAGGTGATCGAGGTGAACGCCGATTTTTTTACCCAGACGGTGATTCCGAAGCTCGACGGGCTGTTCGGCCAAGTGAAGCTGCCGCCCATCGTGAAAGCGGCGGCTGGTTCGATGCCGTCCAGCACCTGATCGAGCACGGTCACCGCTTGCCCGACATCCTCGACTACACGTTGGCGCAGGTGCGCGGCTTCGTCGTAGCGACGGCGCGCACCGATGCGGCCCGCGATGCACGGCTGCTGTCCGTGATTGCCATCGGCACGCGCAGCGATGCCCGCCAGCTCGACCAAACCCTCGACCGACTTACTGACAAGGCCACCGACCGTGCCTGATGACCATGCGCATTTCCGTCCAGATCGATAGCGCCGCAGCCCAGGCGCAATTGCGCCGCTGGGGCGGCGAATTCCGCGACAAGGTCAAGAAGGCGGTGTCGCGGGCGATTGCCAGCGAGGCGGTCGAACTCAAGCAGGACGTGCGCAGCCACGTCGCCAGCCAGATGGCCGTGGTCAAGAAGTCCTTCCTCAAGGGCTTCACCGCCAAGGTGCTGGACAAAGACCTGAACCGACTGCCCGCGCTGTACGTGGGTTCGCGCATTCCGTGGTCGGCGATGCACGAGACCGGCGGCCAGATTGCCGGGCGGATGCTGATTCCACTGAACGGTCGGGTGGGCCGCAAGCGCTTCAAGGCGCAGGTGGCCGAGCTGATGCGCGGCGGCAATGCCTATTTCATCAAGAACGCGAAGGGAAACATCGTCCTGATGGCCGAGAACATCAAAGAGCACGACCGGCCACTGGCGGGCTTCAAGCGCCGCTACCGCAAGGCAGAGGGCATCAAGCGCCTCAAGCGCGGCGCGGACATCCCGATTGCCGTCCTAGTGCCCAAGGTCGTACTCAAGAAGCGCCTCGATGTCGAGCGGCTGGTCGCGAGTCGCATCCCGCGTCTGGCGGCGGCCGTCGAGAATCAGATCAGTACGGTGGATTGATTCATGGCCAAGCGAATTTCCATCCTCGTCGCGCTCGAAGGGGCCGACGAGGGGCTCAAACGCGCCATCACGTCGGCCGAGCGCAGTCTCGGTGAGCTGTCGACCACCGCCAAGACCGCCGGAGCCAAGGCTGCCGCCGGAATGGCCGAGGTCAAGGCCGGGATGTCGGCCTTCGGCGATCAGGTGGCGACGGCCAAGACGCAATTGCTGGCCTTCCTATCGATCAGCTGGGCGGCAGGAAAGGTGCAAGAGATCGTCCAGATCGCCGACGCATGGAACATGATGTCGGCGCGCTTGAAGTTGGCGACG
GCGGGACAGCGTGAATTCACGACCGCGCAAGCGGCCCTGTTCGACATCGCCCAGCGCATCGGTGTGCCGATTCAGGAAACGGCCACGCTGTACGGCAAGCTCCAGCAGGCAATTCGGATGCTGGGTGGCGAGCAGAAGGACGCGCTCACGATCGCCGAGAGCATTTCGCAGGCACTGCGCCTGTCGGGCGCTTCGGCCACCGAGGCGCAGTCCTCTTTGCTGCAATTCGGGCAGGCGCTCGCCTCTGGTGTGCTGCGAGGCGAGGAATTCAACTCCGTCGTCGAAAACAGCCCCCGTCTGGCGCAGGCACTGGCCGATGGCCTGAATGTGCCCATCGGGCGACTGCGCAAGCTGGCCGAAGAAGGCCGCCTGACCGCTGACGTGGTGGTCAACGCGCTGATGAGCCAGAAGGACAAGCTGGCCAGCGAGTACGCCCAACTGCCGCAGACGGTGAGCCAGGCCTTCGAGCGCCTGCGCAATGCCTTCGGGCAGTGGATCAACCGGGTCGATGAATCGACGGGTTTGACCAAGAAGCTGGCCGAGGCTCTGACCATTCTCGCCAACAACCTCGACACGGTCATGCAGTGGTTGAAGCGCATCGCCGAAGTCGGTCTGGCGGTGCTGATCTACCGCCTGATCCCGGCGCTCATCACCGCGTGGCAGACCGCCGGTGCGGCGGCCGTCACGGCCGCCAGTGCCACCGCTGCGGCGTGGACGACGGCCAACCTGTCGGTGTCGGCCGCCGTGGCCAGCGTCGGCTTGCTCAAGACGGCATTCGCCGTGCTGGGTGCCTTCCTGGTCGGCTGGGAGATCGGCACGTGGCTGTCGGAGAAGTTCGAGATCGTCCGCAAGGCGGGCATCTTCATGGTCGAGATGCTGGTCAAGGCGGTCGAGCAGTTGCGCTACCGCTGGGAGGCATTCGCCGCCATCTTCACCTCGGACACGATTGCCGAGGCGACCCAGCGCCACGAGGCCCGTCTCGCGGAGATGAACCAGATCTTCGCGCAGATGTACGCCGACGCGACCAAGGGGGCGGATGCTGCCAAGGGCGCGATGAATACCGCCGCGACGGCTGCGGAGGAAATCGCCAAGCGGCTCGAAGCCGTGCGTCAGGGCACGCAGGAGGCAGTCGGGCGCGGTATCGAGGCTGTCCACAGCGCCCTGGAGAAGCTGAAATCCCGCCTCGGTGAGGTTGAGCAGGCTGTCGGCAAGGCCAATCAGACAGTCAACGACGCCACCGCCAAAATGGCCGAGGCCTATAAGGGCCTGACGTCCATCGTTGAGGCCAACCTGCTGCGCCAGATCGAAGCGGTCAAGGCGCGCTATCAGCAGGAACAGTCGGCGCTGGAGACATCCAAGCAGTCCGAAGCGGCGCTGATCACCAAGTCGACACAGTTGCTGACGGAAGCCCTCACGCAGCAGACCACGTTGCGGCGGCAGTCCACGACAGACACGCTGAAGCTCATTGACGATGAGTCCAAGGCGCGGATCGAGTCGGCCCGCCGCCAGGGTCAGACGGAAGAAGAGCGCCGCGCCAACGTCCAGCGGGTCGAAAACGACATCCTGGCCACCAAGCGCCAGACGATGACGCAGGCGCTGGCCGAGTACCGGCAGCACATCGATGCGCTCAACGCAGAGGCCAACCGGCATCTGACTGAGATCAAGCGCATCGAGGAGGAGAAGCGCCAGCTCTCGATGACGACCGAGGAACGTGTCCGCGACATCCGTCGGCAGGGCATGACCGATTTCGAGGCGACGGAAGATCGCAAGCGCCAGATCGCCGAGTACCAGGGGAAGGCACGTGAGGCGCTGGCCAACGGCGAGTTCGAGCAGGCTCGGCAACTCGCCCAGAAAGCGATGGACTTGGCCGCACAGGTGGCCAGCTCGCAAACCAGTGAAGCCAAGCGCGGCGAAGATGCCCGCAAGCAGTCCGAGCAGGCGGTTTCGCAGGTCACCCAGCTCGAATCGCAGTCACGCGATGCCTATCGCAAGCAGGAATA
CGCGCAAGCCGAAGCCCTGATGCGCCAAGCGGACGCATTGCGCGCCGAACTGGCCCAGAAGACCAAGGATGCCGACGCACAGATCGCACAGGGCAAGGATGGCGTCAATCAAGCCATCCAGCGCATCCGCGAGTCCGAGGAGATTCTCAACAAGACCCTGGATGCCGAAGCCAAGGCGCACCAGACCGCCGCGCAGTCGGCATTGACCGCGCGCGACCAGATCCAGCAGACCCTCACCCAGACCGAAACCCAGATCGACCAAATCACAGCCAAGCTAAAAGACGGTCTGAAGGTCACGCTGGATGCCGACACGACCCGCTTCGACAAAGCCATCGCTGATCTCGACAAGGCCCTGGCAGAAAAAGAGTACCTGCTCAAGATTCAGGCCGACTTGCAGGAGGCCGAGAAGAAGCTGCAGCAGTACGAACAACTGCTGAAAGAGGGCAAGACACTCCCGGTCGATGCCGACGTGTCCAAGGCCAAGGAGGCGCTGGACAAACTCAAGACCTACGCCGACCAGAACTCGCAGTTCGAACTGAAGGTGGCGACCGAGAAGGCGCAGGCCGCGATCACCAAAGTCGAAGGGATGATCAAGGCGCTGGACCGCATCCAGACCGAGTCCCGGCATCAGGTCAGCACCAATGCCGACGCAGCCCGCTCGGAAATCATGAGTCTCAACTGGGCCAACACCTCGAGCACGCACACGATCTATGTGCGCAAGGTAGAGGCAAACGCGACTGGCGGTTTGGTGGGCGGTGGCGTGCGCCGCTACGCCGATGGCGGCGCTGTGGCCCCGGCCTTTCCTCGGATGAGTGGTGGCTCGGTTCCGGGCTCGGGCCACCACGACACCGTGCCACGCACCCTGGATGCCGGTGCCTTCGTGATTCGCAAGGCGGCGGTGCAGAAGTACGGCGGCGGCGCGCTCTCGCGTCTGGCCAATGGCGTGGCACGGTTTGCCACTGGCGGCGCGGTGATGCTGGGTGGCGGCAAGCGCCCATCCGGCAACGATGCTGATGGCACGCCCAGCACACCAAAGAAGAACCGGGAGGCAGTCGAGGCGATGAAGATGATCGACCTCGGCCTGCAGGGGATGAACGAGTACACCAATTGGCTCCAGTGGAACTACGGTGCCTCGGTCAGTCTGGATATGCGTAGCAAGACGATGGATAGCTACGGCAAGCAGGCCCAACAGGATCGGCGCGCGCTGGAGGACTTCATCAGCCGCAAGACGCTCACCGGCAACGAGCGCCAGAACCTGGAGCGCATCAAGCAGACGTGGCGGCAGGCAATGGCCCAGCCGCTGCTTTGGGGCAAAGACCTAGAGCGCGAGCTGATCGACTATATGGAGCAGAACCAGGGCGAGTTCTACCGTCGCGGTGGCATGGCCAAGTCCGACACCGTCCCGGCGATGCTCACGCCGGGCGAGTTCGTCGTGAACAAGGATGCCGTTTCCCGCTACGGCGCTGGCTTCTTCGAAGCGATCAACAACCTGTCTGCCCCGGCACAAGCTCTGGCCGGTCGCGCGCTTGCGGGCGTTCAGGGCTTCGCCACCGGCGGTCTGGTGCAGCCAAGTGGCTCGCGGTTGGCCCGACCGGTGTTGGCGGCCGATGCCGGGCCCAGCCGCACGGTACGCGTGGAACTGTCCTCGGGGCAGCAGAAGGTCAATGCCACCGTCGACGCACGAGACGAGTCTCGTCTGCTGCAACTTCTGGACGCTGCCCGCGCCCGCACTGCCTGAAGGATTCCCGATGCAACTGACGAACCTCGATGCCGGGGTGGCTTTGCCATTGCCTGACGATTTGCTGTGGAGTGATGAGCACGCGTGGTCGCCCGCCGTGGCGACCACGTCTTACCTCATCACCGGAGCCTTGCTTATCCAGTCTGCCACCCGGCAAGCCGGTCGCCCCATCACGCTGGTGGGCGCACCCGATATGGCCTGGGTGACGCGGGCCACGGTCGAGCAACTGCAGGCCTGGGCCGCGCTTCCAGTGGGCAGCGCC
ACAGGTCGCTTCGGCTTGACCTTCTCCGATGGCCGCTCGTTCACCGTGGCATTCCGCCACGCAGAAACGGCCATCGAAGCCGAGCCCGTGCTGGGCATCCCGGCCCGTGCCGCTACCGACTTCTATCGCCTGACCCTTCGATTCCTGGAGATTTGAAATGCCGATCCAATCCGGCGACGTGAAACTGCTGAAGTCCGCCGTGATGGCGGATGTGCCCGAGGGCGGTGGCGCGCCCACGGGCAACACCATTGCCGATGGCGTCTCGAACGCCATCTTTCCTGACATCTCCGAGCTGGATCGCGCCGGGGGTCGGGTCAACCTGCGCAAGTCCTTCGTGTCGGTGCAGACCGACGACACCGACACCTACTTCGGTGCCAACGTGATCGTGGCAGAGCCGCCGCAGGATGCGCGCGTCAGCGTCACGCTGTTCAGCACCGAGAAGACCTTCGACACCCGCGAGCAGGCGCAAGTCCGCATCGAGGCCTACCTCAACAAGGGCCCGGAGTGGGCTGGCTACCTGTTCGAGAACCACATCGCCGGTCAGCGGGTGATTCAGCTTTTCCAGCGCACCACCGACACCGTTCCCAATGTCGGCCAGACCTTGGTCTTGATCGAGAACGAGGGCCTGGGCACCCAGAAGGAGCAGTACATCCGGGCCACCTCGGTGTCCGTCGTCGAGCGCACGTTCACTTACGACGGCGACAAGGACTACAAGGCCAGCATCGTCACGGTCGACATCAGCGACGCACTGCGCTACGACTTCACCGGCTCGCCTGCAAGCCGCACGTTCACCCGGGCCGCGAACAGCACCAAGACGCGCGACACGGTCGTGGCGGACGCCGGAACCTACGTCGGCGTAGTACCGCTGACGCAGGCCGCCGCCGTCGGCGACTTCACGATCAAGGGCACCTCGATCTACACGCAGCTGGTGCCAAGCGCGCAAACCGAGACGCCCATTTCCTTCGTTCCTCCCTACGCGGCCGCCGGACTGCCGGTGCCGGGGGCCGTCGCGGTGAGCTACACAGCCAGCCACGCGTGGACGACCAGCATCAAATTCAATCTCCCGGGCGGTTGCTTGCCGGGGTCACTGACCATCGGCACGGACGGCATCACGATATTTGACGACGCGGGCCTGCTCAAGACCGCCAGCGGGACGGTCGGAACCATCGACTACGCCAACGGCATCCTGACCCTGAACTCGGGGACGATGTCGAACGCGAAGGCCATCACCTACACGCCCGCCGCGCAGATTCTGCGTGCTCCGCAAAGCTCGGAGATCCCGGTCACGCCCGAGTCGCGCAGCCAGTCCTACGTGGGCACGGTCAACCCGGTGCCGCAGCCCGGAACGCTGTCCATCAGCTACATGGCCCAAGGGCGCTGGTATGTGCTTTCCGACAGTGGCAACGGCTCGCTCAAGGGCCTGGACGCCAGCTACGGCGCGGGCACCTTCAACAGGAATACCGGAGCCTTCGTGGTCACGCTGGGTGCGTTGCCCGACGTGGGCAGTTCGCTCGTGCTGACCTGGAACGTGCCGACGCAGGAGACGCAGCAGCCATCCACCACCCTGAAGGCCACCCAGAGCCTGGCATTGAACCCACCTGCAGGGACGGCGGTGCAACCCGGGTCGCTCACCGTGTCCTGGGAGTACGGCGGCACCAAGACCTCAACGGCGGCCACGTCGGGCGTGCTGTCGGGTGCCGCCACGGGCAGTCTGAGCGTGGCGCAGAACCGCGTGGACTTCGCGCCCAATGTGCTGCCAGCGGTGGGCACGCAACTCACCGTGAGCTACGTCGCGGGCCCGAAGCAGGAGGACTCGTTTGCTCACCCCTCCCGCAATGGTGCGGGGACGCTGCCAGTCACCGCGACCTTGGGGGCCATCGAACCGGGCTCGCTCGAAGTCGAGTGGAACACGTTCACCGACGAGGCGGTTCTCGGTGCGTACACCTTCGCTCAATTGCAGGAGATGGGTATCGCCGTCTCGATCTGGCGCGACCCCACCC
AGATCGCCCGAGATGACGGGAACGGCGGTGTAGTGCTGAACGGGATCTCGATTGGCACCGTCAACTACGCAACCGGTCAGGTGACCTTCAATCCGGATGTCTCGATCCGTATCCCACGCCCGGTCTACACGGCAGTCGCCATCAACGGCACCGGTCGGTGGCGATTGAACTACGGCGGCATCGCCTACGTCGATGCGCCATCGCTGTACCCCAACGACGAATCCGGCTACGTCAAGCTGCGCTACAACAGCGCGGGCTCGACCAGCAACCAGACCGAGACGTTCCAGTTCCTACCGGCCTTCAAGCTGGTACCGGGGGTGAATGCCCAGGTGGTGACAGGCACGGTGCTTCTCTCCATCAGTGGCGCGCAGCCTTGGGGCGACAACGGCCAGGGCACCCTGCGCGAGTTCACCACCAGTGGCTGGGTCACGCGCGGCACGATTAACTACCTATCCGGGGACGTGGCGCTGACGTCCTGGACGGCGGGCACGAACAACGCGATCACACGAGCCAGTTGCGTGACCACGGTCGGCGAGAACATCTCCAGCGAGTTCGTGTTCCGAACTGGCGCGGCACCGCTTCGTCCTGGGTCGCTGTCGATCCAGTACGCCCGCGCGGTTGGTGGCACGCAAAACGTGACGGCCGGGATTGACGGCAAGATCGAGGCAACCGGCATCAGCGGCAGCGTCGACTACGAGACCGGTCTGGTGCGCGTTCGCTTCGGAACGATGGTCACGGCGGCCGGGAACGAGAGCCAGCCTTGGTACGCCGCCGACCGGGTGGGCACGGACGGCAAGATCTTCCGACCCGAGCCGGTGGCCGCATCCAGCGTGCGTTACAGCGCGGTCGCCTACAGCTATCTGCCGCTGGATGCTGATTTGCTTGGCATCGATCCGGTGCGCCTGCCCAGTGATGGGCGGGTGCCGATCTTCCGCCCCGGCGGCTTCGCCGTGGTGGGCCACACCGGCAAGATCACCTCCTCGGTCAGCAACGGCCAGACCATCAACTGCGCTCGGGTGCGCCTGTCGCGCGTGCGCGTCGTCGGCCACGACGGAGCGGTGATCCACACCGGGTACTCCACCGATCTGGAAGCGGGCACCGTCACCTTCATCAACGTGTCGGGCTACAGCCAGCCCGTGACCATCGAGCACCGGATCGAGGACATGGCCGTGGTGCGGGATGTGCAGATCAGCGGCGAGATCAGTTTCACGCGCGCCCTGACGCACGAATATCCGCTGGGGAGTCACGTCTCCAGCGCCCTGGTGGCCGGTGACCTGTTTGCCCGCGTGAATCTGGTGTTCGACCAGTCAACGTGGAACGGCGCGTGGTCAGATGCCTTGTCAGGCAGTTCCGCAACAGCAACGTTCAACAACACGCAGTACCCGATCCGCGTGACGAACCGGGGGGCACTGACCGAGCGTTGGATCGTGCGCCTGACCAACAGCACCTCGTTCGAAGTCATCGGCGAGAACGTCGGCGTGATCGCCACGGGCAACACCAGTGCGGATTGCGCGCCCAACAACCCGGCGACCGGCGTGCCGTACTTCCATCTGCCCGCACTTGGCTGGGGCAATGGCTGGGCCACCGGCAACGTGCTGCGCTTCAACACCATCGGCGCGCAGTTCCCGGTCTGGGTGGTGCGCACCGTTCAGCAGGGGCCGGAGTCCGTGCCCGACGACAACTTCACGTTGCTGATTCGCGGCGACGTGGACACCCCCTGATTTCGTAGACAGGAACCCATGAAATGATCGACCTGACCGTCAAATACTTCAACAGCGGCATGACCGGCGCGCCACAGATCTCCAACAACTGGGGCGATCTGGTGACGATGCTCGATGCCTGCCTCGTCAATGGCTTCGCGCTGAAAGCCATCGACACCTTGACCTTCGCCGATGGCATCGCCACAGCCACCATTTCCACCGGCCACGCCTATCGGCCTTTTCAGGTGGTCGAGATCGCTGGAGCCGAGCAGCCTGAGTACAACGGTTCATTCCGTGTGC
TGTCGACGACCACGACCGCCTTCACCTATGCGGTGACCGGAGCGCCGGTGTCGCCCGCGACGACGACCACTAACCTGAGCGCCAAAGTGGCTCCACTTGGGTGGGAGAAGCCGTTCGCGGGGACGAGCAAGGCCGCCTATCGCAGCAAAAACCCACAGTCGCCGCAGAACATCCTGCTGATCGACAACAGCCTCAAGACGCCCAACTACACGACGGGGTGGGCGAAGTGGGCCAACGTCGGGATCGTGGAAGACCTGTCCGACATCGACACCATCGTTGGCGCACAGGCTCCGTATGACCCGAACAACCCGACGCAGAACTGGAAACAGGTCACCGCTAGCCAGTGGGGTTGGTACAAATGGTTCCACGCACGTGGCCCCCAGTACGAAAGCAACGGCGACAGCGGCGGCGGAGGTCGCAACTGGGTGTTGATCGGTGACGACCGTCTGTTCTTCCTGTTCTGCACCAATGCAGCGGGCTACGGCTGGTATGGCCGCAACAGCTATTGCTTCGGCGATCTGATCAGCTTCAAACCCGGTGACAACTACGCGACGGTGCTGGCCGCCGACGACAACTACTCGGGGATGAGCAACTACTGGAGCTATCCAGGGCAGTTCAGCGGCTACGGGCTGGTTTCGTCCCTGGACTTCACCGGCAAGGTGCTGCTGCGCAATCACACCCAACTCGGCAATCCCGTCCGGTTCGGACTCACGTCGCTGAACACCAACAACGGCCAGCAGATCTGCGGTCGGGGCCCGATGCCGTTCCCGAATGGAGCCGACTACAGTTTGTGGCTGTTGCCCACCTACGTGCGGCAGGAGGACGGCCATATGCGCGGCATCCTGCCCGGGATGCTGTGGATGCCCCAGGACCGGCCCTATAGCGATCAGACCATCGTGGACAACGTGGTGGGTCAGGCGGGTAAGCGCTTCTTGCTGGTCAGGACGCAGTACAGCTCGGAAACCGAAGGCGCACAGATCGCGTTCGACATCACTGGCCCGTGGAGGTAAGCCATGAGCTACCCGCTGAGCGAGTCTTTCGCCACGGCTCCTGCGACCGGCTACACCGCCGTCCTCGGCGGAATGGCCGCGACACACAACAACGTGCAGCAGTCCATCGATATCTCGGCCCCCAATAGCCAGTCCATCCTGCGCTTCAACGAAACCGCCCACGGTGACTTCTGGTTCGAGGCGGATGTTGAGTTTCTGACCGACCCGAGCGCCCGCAAGCACATCGGCCTGTGGATGACCACCGGCAACGGTTCCGAGGGCTACCGGTTCGCACATATTGACGGTGCCTGGAGCGTGACACGCTGGAACAGCGGCTTTGGCGACGGCGCGGCAGTGACGGGCGGTGTCAACGATGGAGCGAAGCCGGTCGCGGGCGTTATAGACGTGGCCCCGACCTTCAACGTCGGCCAGCGGATGCCCCTGCGCTGTGAGGTCATCGTCGGAGCCTTTGACGCCAACGGCGTTCCGTGGGCGCGCCTGATCCAGTTCAAGGCCGGTGGGGTGCTGATGTTCCAGGTCGGGGATGCTGCCTACAGGGGCAAGCTGATCCCGGGCGTGTTCCTGTATGGGGCCACGGCGCGCGTCCACGCGATTGCGGGTGACACGCCGTCAGGTCTGCCCGCGTTTCCGGCAACCGTGGGCGTGAACGCCGCCGATGACCTGCTGCCGCTCGCGGGTGGGTCGACTTCGGTGCCCCCTGATCCGGCCGCCAACATCGCCGTCAACGCCGACTGCGACCTGATGCGCTTGAACAGCCCCAACTCTGAGCTGTGGAACCGGGGTGGTGGCTACGACTGGCACTTTCACGCGATTCCGAATGGCCGCAAGAACATCCACTTCAGCGGCCACGGCTTCATCGCCGGAACCGTCAAGGAGAAGGGCCAGCCCGACCAGCCCCTGGTGCGGCGGGTGCAACTGGTCAGCGAGAACACCCGCGTCCTGGTGGCCGAGACCTGGAGCGACACCACGGGCGCGTACCGGTTCGAGCTT
ATCGACCCGGCCCAGAGATACACCGTGGTCAGCTACGACCACAAGCAGATGTACCGCGCCGTGATCGCGGACAACCTTCATCCGGAGATGATGCCGTGACCGTTGCCATCACTGTCGAACACAACGAGGCGCGACTGGCGGGCACCCTGGCATTCCTGGATGCCGGTAGCAATCCGGCGCGTCTGCGCATCTACGGCGGGACACGACCCGCCAACCCGGCCACGACGCCGACCAGCGCGATGCTGGTCGAGATCAGGCTGACCAAACCCGCAGGCACGATTGCAGGTGGACTCTTGACGCTGACGCAGCAAGAAGACGGGCTGATCACGGCGACCGGCATCGCCACCTGGGCGCGGCTGGTCAACGGCAACGAAGTCACGGCCCTGGATCTGGACTGCAGCGGTACCGACGGCAGCGGTGACGTGAAGCTGGCCAGCACCAACCTCTATCTGGGCGGCGATGCCCGGATGGTGTCTGCGATCCTGGGGTAGTCCGTGCCTGCCGTTCTCAACGAGGTGACCCTGGTCGCCGCGTTGCCTGCGCCCACCGCCAGCGTGGCGGTCGGGCCTCCGCTGGTCGATCTGCTGTTTGACCAACCGGCTGCCACCGACGCCAACTTGGTGTTCGGGGCCAACTACATCGCGCCGCGCGACGACGTCGTGGTGCTGGCCAGCCTGCCGTTGCCGGTCGTGGCGATCAAGTTCATCCCGCCAGCGCGGGCCGCACTGCTGGCCGAGCTACCTGCATTGACGGTGACCACGCTGTTGCTGCGCCCGAGCGTCCCCTTGGACGTGACCGGTGCAAGTCTTCCTGGTGTCGTGTTCTCCGGCGAGGTCAGGTACTACTCGCGCACGCAGCGACCGACAGTCGGCCAGACCGCACACGCTTGGCAGGTGGCAGCGCAGACGGAAGATGGTTCGACACAGGGCCAGCAGGACGCTGCCGCTACACCCGCAGGCTGGGACACGTTCTGGCGACGCACCTTGGGTGTTCCTCAAGGCATCGAGCACAGGTTGCCGCCGGTGCTGGCGGCAGCGCCCGAGCAACGAGGCGCTCGCCACCAGGATGCGACCCGGCTGCAGGATTCGACGTGGTTTGCGCACCAGGACGCCACGCGTTTTGCGGCGACCCGACAAGGTCTGTTCCAGAACGCAGGCCCGTTGCGGGACACCACGCGATTTCGGCATCAGGACGGCGACCGCACCAAACGCGCGGGGCGGGTGAGCTTTTGGCAAATCGCGCGTCTGCTCACCGAGCGCCAGGGGAGTGATTTTCAGATTGCCAGCCCGTCACTCAAGGGCTGGAGTGTCCGGTATCAGGACGCCGTGCCGCCACCGCTGGGGATCAGCGTCTGGGTGGTTCCACAACCGCCAGCGCCGATACCTTGCTACACGCCGAGCGCGCATCTGCTGTTCGCCGCTTTGGCCCCAGCGGACAGCCACTTGCTGTTCGTCTGTGAAAACCACATCAACCCACCGCCTCCCGATGGGGAGCCGGTGGTCGTTCCTGTTCGGAGGGTCTATTTCGTGATCAACAACGTGACCCTGTACCGCGTGTCCGATGGCGCGCCGGTGCCGGTGTTCAACCTTTCGCTGTCGCTCGATGCATCGTCCTGGGCGTGGGGCTTCGATGCGGTGCTCCCTGCGAAAGCCGAGGCGCTGGTCGCGGGCAGCGCTTCCGGGCCCGTCGAACTCGTGGCCAGCGTCAACGGCACCCCGTTTCGCGTGCTGGCCGAGAGCATCAGCCGCGAGCGCATCTTTGGTGACGCCAGCATCCGCATCTCCGGACGGGGGCGCAACGCCGTTCTGGCCGCGCCCTACGCGCCGGTGATGACGTTCTCGAATACCGAAGGCCGCACTGCTCGGCAGTTGATGGACGATGTGCTCACGGTCAATGGCATCCCGCTGGGCTGGGCGGTCGATTGGGGCCTGACGGACTGGAACGTCCCCGCCGGTGCGTTCGCGCAGCAGGGGTCGTGGATCGACGCACTGACCGCCATT
GCCGGTGCTGCAGGTGGCTACTTGATTCCGCATCCCTCGGCCCAGAGCATCCGCGTGCGTCACCGCTACCCGGTCGCGCCTTGGGAATGGAGCACGGTCACGCCCGACTTCGTGTTGCCCGTCGATGCTGTCGCCCGCGAGTCGCTGCGCTGGTTGGAAAAGCCTGCGTACAACCGCGTGTTCGTTTCCGGGCAGGACGTAGGCGTGCTCGGGCAGGTGACCCGGGCCGGGACTGCCGGAGAAGTGCTGGCACCGATGGTCGTCGACCCGCTGATCACCGAGGCGGCCGCCGCGCGGCAGCGTGGCGTAGCCGTGCTCGCCGACACGGGTCACCAGCTCGAGGTCAGCCTGCGCCTGCCGGTGCTCGCCGAGACCGGGATCATCGAGCCCGGTGCGTTCGTGGAGTACCAGGACGGCAGCGTCACGCGATTGGGCATTGTCCGCGCGACCCAAGTGGAAGCCGGGTTGCCCGAGGTCTGGCAGACGCTGGGAGTGCAGGCCTATGCGTAACCTCTACGAGCAGTTTCGCCAACTGATCCCCGACCCGCCGCTGCAGGCGGGCACAGTGAGCGACGTCGGCTCTGGCGTGGTCACGGTCGCATTGCCCGGTGGCGGCCGAATCAAGGCGAGGGGCTCTGCGGCCCTTGGCCAGAAGGTGTTCGTGCGCGACGACGCCATCGAAGGCATTGCGCCCAGCCTGACGCTGGAAATCATCGAGATCTGAAACCCAACTGATTCAACCCTGAGACCCGCCCTGATGCTCACGCATCGGGCGGGTTTCGTATTTCTGGAGAAAGCAAATGACCGAACCTGAACAACAACCGGCGCTCGTCGAGAACATGCTCCTGCTGCGCAAGGAGGATTTCGACGATCTGCTCGACCGTGCCGCCGAACGTGGTGCTGAACGCTGCCTCGCCCATCTTGGACTGGAGAACGGCCACGCTGCCCGCGACATCCGGGAGCTGCGCGATCTGCTGGAAGCTTGGCGCGACGCCCGCCGCACGGCTTGGCAGACGACCATCAAGGTGGCCACCACAGGCATCCTGGCCGCGTTGCTGGTCGGTGCCGCCATCAAGCTCAAGCTGATGGGAGGCCCCCAATGATCGAAACCCTCCTCGGTGGCCTCCTGGGCGGGGTCTTCCGTCTTGCGCCCGAAATCCTCAAGTGGCTGGACCGCAAGGGCGAACGCGGCCACGAGCTGGCCATGCAGGACAAGGCGCTGGAGTTCGAGAAAATTCGCGGCGCGCAGCGGATGGCCGAGATCGGTGCGAGCGCCGAAGCCGCCTGGAACGTCGGTGCCGTCGATGCGCTGCGTGAGGCCGTCCGCACCCAAGGTGAGAAGACCGGTGTGCGCTGGGCCGATGCGTTGTCTATCAGCGTGCGACCGGTAATCACCTACTGGTTCATGGCGCTGTACTGTGCGGCCAAGACGGCTGCTTTCGCGGCCGCCGTCACCGCTGGCTCTGGCTGGGGCACGGCCATCCTGCACGCATGGACGGAAGCCGATCAGGCGCTGTGGGCCGGGGTTCTGAACTTCTGGTTCCTCGGGCGCGTGTTCGACCGGGTGCGCTCGTGACCGGGGTGCCGAAAACGGCCATCGAGCTGGCCAAGCGCTTTGAGGGGTTCCACCGGGTGCCGAGGATCGATCCGGGCCGCGCGCATCCGTACATCTGCCCAGCGGGCTACTGGACGATTGGCTACGGCCATCTGTGCGAGTCGACGCACCCGCCGATCACGGAGTCCGAGGCCGAGGTCTATCTTACGCACGACCTGCAAACGGCGCTCGCCGCAACGCTGCGCTACTGCCCGGTGCTCGCAACCGAACCCGAAGGGCGACTTTCGGCCATTGTGGATTTCACCTTCAACCTTGGCGCGGGGCGGCTGCAGACATCGACGCTTCGGCGACGGATCAACCAGCGGGATTGGGCGGTATCAGCTCAGGAGCTCTGCCGATGGATCTATGGTGGCGGAAAGGTCTTGCCAGGTTTGGTGGCAAGGCG
AAAGGTCGAGGCGGCGCTGATGTCTGGCCTTCACTACTGAAGCGCTGGCTCACACAAGCGCAGCAATTGCGATCGAATCTCGCCGGGCGATGCTGTCAGATCCACCGTCGCAAATCGGATGCCATGCCCCTGGATGACAACCGTCTCATCAACGGCGTCGCCGACGGAAGGGTGAAGCAACAGCCCACTGGCGTCATCCGCAAGCGGATCGCCGCGCCCGACCTGGGATCGCAGGTAGGCGTAGATCTGGTACACGTACCCGCTGCGCAGCGTCTCCTCTCGATACCATCCGCTCGTCACGATCGACGTGAACTTGGTGTCGATGACGATTCGCCGGCCAGAAGGCGCATGGTCGAGTACCACGTCGGTTCGCATCGTCGGCAAGATCTTGTCGATTCCCGATGTCTTCTGTTCGATTTGCCAGCCCAGCGTACCGCCACAGCGAACCCGCCATCCCTGCGGTTGCAGTACCACGTCATAGAAGCCGCCCACGGCCTTCTCGAACAGTCGCCGGACCCAGGTGACCTCGCGTTCCGGCAGAGTCAACACGTTCGTCCCTGCCACCTCAGTTGGCAAGGCAAGATCAAAAGCCAGCTTCGCCGCAGCCACCATGAATCTGTCATCAGCGTCGTTGCGGCCAAAGCGATCTGTGCTCATCTGCGCGCGAGTGGGTACATCGCCGGAAACACCCATGGCCTTCATGCCGCTGGCAAGTGAGCGGCAGCGATGAGACACGTCCTTCCTTTGCACAATTCGGGCGATGGTCTCTAGTGCCGCGCGCACAAAGCGATTGCGTGGCGTGTCGACGGTCAGCTCATCAAATCGACAGGCCACTAAGCCACGATCCAGCAATCGATGCCGCTCCGTATTCAGGACATCAATTCGCCCGCGCACGCGATTTAGGACAGCATCCCGAGATCGGTAGCCCAGGTTGAGGCGGCGGCGCTGCCTGACTTCGACTGCGTGGGCCAGAATCTCGGCGACCAGATCGGGGAGATCGTCAGGGTTGTCCTCCAGGCCAACCTTGCCGATGCCGCGAGTGCGGAACAATTCCGACGCGTACAGCATCAGCAGCCACAGATTGCGCACCGGAATACGTCCGATGTAGCCATTTGCGTCGACCGACGCGCTCTCGACCTGCTCCGCGACCACACCCATTACCAGCCCTGAAGCAGCCGCGCACACGCCTTCTGCGCTTCGTCAGGGGCGTCAAACCAATACTCATCGAGCAGTGGCCCGATCTCAGTCTCGACCACCTGCTGGAACCACTTTTTCGTGTCCCCGGCCTCCAGCCTATGGGCGGGCGTCACGTAGCTATGGCCGATCCGGAACTGCTTTCCAAGGCGAGCGTCAGCCGCTATCTGGTCATTTAGCTCTGCAATCCGGTGCTCAATATCCGCGACCAGAGCAGGATCGACAGCACACTCCTTGACAGCCCAATCCCGCCATGCGGTGCCAAGCCTTGGCTCGAGCCCCACGAAGGCGAAACGCCGACGCAACGCGAGATCGACCAGGGCCAACGACCTGTCGGCGATATTCATCGTGCCGACGACGTAGAGGTTTTCCGGAATGTGGACGGGGCGACGTTTGCCATCCGCGTCCGGGTAGCACAGTTCCAGTGCCTCATTGGGCGTGCGCTTTCCAGCTTCAAGCAGCGTCAGGAGTTCGCCGAAGATCTGCGCCGGGTTGCCACGGTTGATCTCCTCGATCACCACGACGAACTTCGACGATGGGTCCTTCGACGCTGCCTTGATCGCTTCCATGAATACGCCGTCGGCTAGCGACAATTTCCCCTCGCCGGTCGGCCGCCATCCTCTGACGAAATCCTCGTAGGACAGGTTGGGGTGAAACTGCACCGCACGGACTTTGCTTTCGTCCTTTTGACCCATCAGCGCAAACGCCAGGCGCTTCGCAAGCCACGTCTTGCCGGTGCCTGGTGGCCCCTGAAGGATCAGGTTCTTCTTGGTGCGAAGGCGGTCCAGAAGCCGATCGATCTCGTT
GCGTTCAAGGAAGCATCCGTCCTTGAGAATGTCGTCGACCGAGTACGGCACGATCGGCACGGCAACATGAACGTCCTCTGGTGCAGTTGCCTCGGTAGCATCGTCACCATCATCGACGTCACCAGCGTCGTCTTCGCCGACCGGAGACTTCTCATCGGTGGGGTCCTTGTACAGCCATGCTTCCAGCGAAAGCTCAGGGTAGGAATGGACCGGGTAAGCGGTCTCCTGGAAGCGCGGCTCCAATACATCCATCACGGCAAGGTAGTCTGCCGAATTGCAGCGCCTCTTCGGGCCGTGCATGCCAATCGGCACACCGAGCTTCTTGCTGACATAGAGCTGAGAGTTGTGATCAAGGCTCAGGAACGCCCAAGGCCTGATCCAATACAGGCCGAACGTCAGATTCCATGCAACGCCTCGCCGACCGTTCGCGCTGTCGAAAGCCTTGGCGAACTCCTCGCGGGCAAGGTCATCGTCGGTATCTGCGTACGCGATACCCGCTGCGAAGACCCCCCAAAGCGCGTCAATGTGGTCGGTGGCGCGATTGATCTCGAATGGAAAGTACCAGGACTTCAAGTTGTTCAGCAGCGGGATGCCTTCGAACGTCTCTGGAACCGGCTCGTCGACGCCCAGGAATTTTGCCAACTCGGTCGCGATGATCTTGCGGTTGGAGTCCTTGATGCCCCGATTGAACAGACCCATCGTCGTGAACGGGCAGATGTCCTTTACGAAGCCCGTAGTGCCGTCTGCATACTTATCCTCTGCCAGATGGCCGAGTCCGTCGACGCGAACGGAGATCTCCCGGATTCCCTCCACAAGGGCTGCCCTGTTGGCGCGATAGGTGAGCAGCTTGTCCGCGATTGCTTCATAGAACTTTGTCCAGCCGAAGCGATGTTTGTCTGCTGCCACCGTCCCGAATCGCTCCCGCCAGTAGGGAGCGTTTCGGAAACGCTCAACGTCCTGCGGCTTGTTGTGGAAGGCGAAGCTGATCAAACCATCGGTCATCCATTCACCAGGCAAAACGCGCCACACCGTTCCCCGGTGGGTATAGAAATACCACTCACGCACCGGTTCGATTTTTGTCCAGTCAACCTTAACTCGCTGACCATCATTCAGGTTCTCAGTGATCGTACCAATTGCCTTGATCGCCATCACCGAAACCGCTTGTCCACGGCTGTCAAAGGGTAGCCCATGCTTGCGCGTGTAGGACGACTTGATGGCGATCTGATCTCCTGGTCGCATCGATCGCACCACGTCGAGATGCTTGTCCTCGTAGCCGTTCTCCCAAATCCCCTCGGACAGAAAGCGTGGCAACTGATCGTCCGTGCCCCCGTAGCTTGCGCCAACAAACCAGCTCGCCTGTGCGCTCGAGTTCTCTGTTTTTATGTTCATGGATGTCCTCAGATAGTGCTGCGCAAAGCGACGAACTGGTTACTCGGCCCCTCCGACCGTAGGTCGAGGTATGGGGAGTTCGAGACGACGTAGGCCAAAGGTCTCGTTAAACGGAATGGAAACAGGACAGGCAGTGATTGCAGGCTATGAACTGAAACAGGCTTGCCGACGTAGCGCACAGCCGCCTCGATCAGCCATACCGTGAGTTCGTCGCTGTCGATAGGCGTCGCTGCAAGACGGATGACGCGCTTGCCTTTCTCGACACGCTCTACCGCGCCCCAGCTTGCTTGTGTCTGGATGACCATATTGGTCATGCGTCGCGTGCCTTCGCGTTCGCCGTAGATCTCGCTCATACGGCGATGCACTTCGGCAGCTGCGCAGTCGTCTTGGATGGCCGACAAGCGACCCACCAACTCAGACACTTTCCCGAAAAACGGATAGCAGGCTATGGCCACGCCCCAGCACAGTACCGGAATTGGTGTGTCGGGCTGGCTTTTGTAAAGAGCCGCCGCCCGGTCGGCGAAGTCCACCAGCTCGGCGCGTGGCTCCAACCACAGCCGGTTCAAGACGGTTCGTGTTTTTTTCTTGGCTTCCACGCCAAGTTCGGCTGC
GTCGAGCAGTGCATTCAGATCATCTAGCCCAGCTGTACCCGCACGAACCCGCAGTGCCGCAGCCGCCCAATCAAGCTGAATGAACCGATCGAACCCAATCTGAGGGGCTGATGTATTCATTCGTTTCCAATCACATATTTAACTTTCACAAACGGCACGATCAGCTCTTCTACCGAGATGCCGCCGTGGACAACCACTTGCTCGCCATCGGGAACAAAAGCCGTTCGGCCACCCGCGAATAAGGGCATATAGCCCGCCGGTAGTCCAGCGATGTCCAAGTGCACTGAGTTGGCATTTGCCGCTGCGGATTTAGCCAACAACGATTCACTGCGATACACGCGCACCCGTTCGCCACGGGCTTCTGGAACATCCCCTTCAGATGGACGTCCCACGCCAACAGCTTCAACGTTGCCGTGATCCGCTGTCAGGTAAATATGAAAGCCTCTGTCGAGCAGCATCGCAAACAAGCGATCGACGAAGCCTGTTTTCAACCAGTTCGCGATCCACAAGGCCACATCTTGTTTGGAGCGTTCCTTGTGCAGTCGGTCATCCACCTCGTCGACCACCAACCCGACCACCTTGGGCCGACGATCGTCCAGCGCTGCTTGCAAGGCATCCAGCTGCTCAATCTGCCGCAACGAGCGCTGGTAGTAGATTTCGCCCGGCTTGACGCCTTGCTCCTGCCAGTAGGCCTTCCACAGGTACTCTTCTTTGTTGGTATGGCCGATCGACTCTTCGAATTCCCGCGGTTTACGGCCAGAGAACAGGGCCTGCCGCGACACCGAGGTCACGGTGGGGAGCCAAGCGAAGGAAGTACCTTCATCAAACGCAAAGCGCTTCGTCGCCTCGACTAACCGCTCCCGAATCTGTACCCACTGGTCCAAGGCTAAGCCATCAAAAACCAGGAGTGCGATCTTGTCGGCCCCGAGCGCTTTCCGGCGAGAGCTCAGATAGTCGGGAATCCGATGCACCATCACCGGCCCGTTGTGGAACGAGAGCGTGCTCAGATCCGCATAGTGCTTGGCAGCCACCCACGCATGTAGCTGCGCATCCGACTGTTCTTGCAGTGTTTTCACGAGAGCCTGCACCTCTGCCAAACCGTCTGTGCCATAGGCATTGCCCAGATCGTGCACGCGGGCGAGAGTTTCGCCGTACTGCTTGGCAAACTCGCTCCAGGCCTTGTGCGTAGCCGTCGCGCTTGGAATCGCGTCAATCAGCTTGGCAATTCCCTTTTTCACAAAAGCGTTGCGCGCTTGCGGGTCTTGCACAATCCCGGCCTTGATCCAGCCTGGCGCGTCAGCTGGCACAACCTCAACGACCAGCGGGTGCAAGGAGCCATTCAAGAACATCGAATCAACGATGGATTGCACATCCGAGTGCGCAAACGGAATATCGACTTTGGCCACGTAATCGGGGGGTGGTGGCTCGCCAATGCGTGAGCCTTCCATGCCCAAGTTCGCCAGGTAGCGATGCCAAGCCTCCTGCACCACGCGCAGCAACGCACTCTTCGACGACAGCCACGCGGCCACCGGAAGGCCTGCAAGCAGTCCCTTGCTCTGGATGATGCTGGCTGCGTGCTCGGCAAAGACCAGGGGCAGCGCACGGTTAGCAAAGTGCATCCGCAGCACATCACGCCAGAAATCACTTTCCGTGCGAATGCTGGAGCGCGGTGCCAGGTGGTAGACCCGCTCCAGTATGAAATCTTTGGACTCGTTCTCGCCGCGAATACCCTGCAGCTCGGTGTCGTGCGCCTCCAGCAAGGCGGCAAAGTGCTCGGGTTCAAGCTGCTTGACCACGTTGTAAGCCAAACGCGGAAACAACTGTGCCAAGCCCAGACTTACGACACGACCATAGTGCCCAAGGTCCCAAGGCAGCTCATTGGGATCTGCACCGCGCCAATGCACAACCACAGCGGGCGTGGGGCCGGGCTCGCCTCGGTCCCAGGCGGCGCGATAGCGCTCCTCAAACTCTGTACGGAACACAAAGGGGTCTTCGTACAA
CAGCACCTCGAAGCCACGGCTGCGCAGTTCGGCCAGCAAACGCTCGTCCAGCAAGACGTCGTCCGGATCGCAGGCCACCCACAGCCGGTCGAGATCAGCTGTGAAGCGACTGAGAATTCGTTCAATCCACTGGCTCATGTGCTTTGTGTCTCCCTCACGTTCTCGCCAGGCTGAACCTCGGCGGAGACGCGTACCATCATTACCGCGTTCAGATCCGGCACACTGGCCGCAGCCTCGGCCAAGGCGGCCAGTCTGGCGTCGTGTTCTTGTTGCAGACGCTTGCGACGATGCTCGCGGACAGCAGGCAGCCCGATGCGGCCGATAGCCTGTTGCCTTGCTTCAAAGGCGTAGTCAGCACGCTCCCGCTCCTCTTGCAGTCGGGTCCGATGCGCCTCCAGCATCTCGGTAAAGATCCGCTCGCCTTGGGCTTTTGCCGCCGTGAGTGCTGCGTCAAACCATTTCACAGCCTCTTCGGTGCCGCTGACCCCGTGTACGACCACTGTTTCGGTCAGCAGCAAATCCCACACGCGTTTGGCCGTCGGCACAAAAGCGCGGCCCTCTTCGTTGGTAAAAACGGGCAGGTAACGCTTGCGGTTCAAGCCTTCCGCCGCAAGGCTGATCTCCCAGAGGGACCACACTCCGCTCACAGAATCAGGCAGGCCCGTCACCCGTATCACTGGCAAGGGTTGGCCTGCCACAAAGCGGGGCAACTCGCTGATCACTGCGCGCGCGCGCGGGTCTTCCAGCGTCACCCACTCAATGTCATGGTTCTCGTCAGCGGTACGCGCGTCAAAGCAAACCTGCGCTGCTTCGCTGCCATCCGCCCAAGTCACACGCCACGCCTTGCCCACCTTGGTTGCTGCACCGCCGCGTGCGGCCAAGCCAGAGGTGATGGCTCGCTCCAGCCAGAACTGTGCCGGGTGGTCGCGCCATTTGCGGGCATCGTCCGCTTCCAGCTCGTGCGCGTCCGAGAGCAATTCGCTGCTCTTTTTTGACTCGGCCAAGGTCTCGCGCAACTGCGAAACCACCGCATCGCATTCCTGCTCTATCGAAGCCGGGTTCTGCAAACCATGCACGAACAGCTCTTCGAACAAAGGCTCTGCTTCCGCCGAGTCCATCACGTCGGACGCCTTGTCCACGCCGAACTGTTGGGCGATCACTTCCAGCTTTTCTTCCAGCACCTGGCGAACCCGGTGCTCCACGGTGTCTTCCAGCACAAAGTTGATGGCGCGCACCACGTGCCGCTGGCCAATACGGTCGACGCGGCCGATGCGCTGTTCAATGCGCATCGGGTTCCAAGGCATGTCGAAATTGACGATGACGTGGCAGAACTGCAGGTTCAAACCTTCGCCGCCAGCGTCTGTCGAGATCAGTACGCGCACGTCTTGAGAGAACGCCTTTTGCGCCTTGCTGCGTGCATCCAGATCCATGCCGCCGTTAAGGGTGGCCACCGAAAAGCCACGGCTTTCCAGGTAATTGGCCAGCATGGCTTGGGTCGGCACAAACTCGGTGAAGAGCAGCACCTTCAGGGCAGGGTCGTTTTCTTCCTGCTGCAGTTTGTAGATCAGCTCCAGCAAGGCTTCTGCCTTGGCATCGGTGCCCGAGGCTTCGGTCTCGCGGGCCAGAGCGAGCAGTATTTCCACTTCGGACTTTTCCAGCTCCCAGCCGGTGGCCTGCATGGCCAAATCCACCTGCGACTGGCCGTCCAGGTCAGCCCAGTCTTCCTCGCTGGTGTTCTCAAACAACGAGGGTTGTGGCTGGGGCTCCTCCAGCAATGCCAGCCGCTTTTCCAGCGTCGTGCGAATAGCGGCAGTGCTGGACGTCACTAAACGCTGCATCAGAATCATCAAAAACCCGATATGACGCTGCTTGGCGGCCATTGCCTGGTTGTAGCCATGGCGCACGTAGTCGGTCACGGCCTCATACAAGCGCCGCTGCGCGTTGTGGCGCGCCTGCCAGGCCACGGCCTGCAGCCGGGTGACTCGCGGCTTGAAGAGTGG
CTGACCATCGGCGTTGATCGACAGCCGCTTTTCTGTACGAATCACAAACGGCCGCACGCGATCGCGGTTCACGCTGCTTTCGTCAGGGAAGGCGTCGCGGTCCAGCAACTGCATCAAGCGCAGGAACTGGTCGGTCTTTCCCTGGTGCGGCGTGGCTGACAGCAACAGGAGGTAGGGCGATGCTTCTGCCAATGCTGCACCCAGCTTGTAGCGCGCCACCTGTTCGGTGCTGCCGCCCATACGGTGGGCTTCATCGATGATGACCAAATCCCAAGAGGCCGAGATCAGGTCTTCAAAGCGTTCGCGGTTGTAGTTGTTGAGCTGCTCCAGACTCCAGCCGCGCCGACTCTCCATCGGTTTGACCGAATCCAGTGAGCAGATCACCTGGTCATGCATACGCCACAGGTTGTCTTCATCCCCCTGGTTGCCACTGCGCCATTGGCGAAATGCAGCCAACTCAGAGGGCTCGATGAACTGCAGATGCTCACCGAAATGCAAACGCATTTCCGCCTGCCACTGGCGCACCAACCCCTTAGGCGCGACCACCAGCACTCTTTTCACCCGGCCGCGTAGCTTCAATTCCCGCAACACCAGCCCGGCTTCGATGGTCTTGCCCAAGCCCACCTCATCTGCCAGCAGGTAACGAATACGGTCGCGGCTGATGGCGCGATTCAGTGCGTACAACTGATGCGGCAGTGGCACCACGCTGGACTGGATGGGCGCGAGCAACAAGTTGTCTTCCAGCGCATCCAGCAGCTTGGCCGCCGCCGTGGTGTGCAGGATTTCCTCCACCGTAGGGCGAACGCTGTCCAACGGCGCAAGATCTGAGGCACGTGCCCGCACCACTGCGTCTTTGGCTGGCAGCCAGACGCGGTAAGCACTCTCACCCCATACCTCTTGCCGATCGATGACGCGACAAGACGCAGCCTGTCGCGTCAGCCAGCACCAATCGCCAACGTTGAAGCCGCCGCCCGCCAC
Protein sequences of DBSCAN-SWA_5 >NZ_AP021884|1977054:2023954|2000670_2004714_+|WP_147073261.1|DBSCAN-SWA MAKRISILVALEGADEGLKRAITSAERSLGELSTTAKTAGAKAAAGMAEVKAGMSAFGDQVATAKTQLLAFLSISWAAGKVQEIVQIADAWNMMSARLKLATAGQREFTTAQAALFDIAQRIGVPIQETATLYGKLQQAIRMLGGEQKDALTIAESISQALRLSGASATEAQSSLLQFGQALASGVLRGEEFNSVVENSPRLAQALADGLNVPIGRLRKLAEEGRLTADVVVNALMSQKDKLASEYAQLPQTVSQAFERLRNAFGQWINRVDESTGLTKKLAEALTILANNLDTVMQWLKRIAEVGLAVLIYRLIPALITAWQTAGAAAVTAASATAAAWTTANLSVSAAVASVGLLKTAFAVLGAFLVGWEIGTWLSEKFEIVRKAGIFMVEMLVKAVEQLRYRWEAFAAIFTSDTIAEATQRHEARLAEMNQIFAQMYADATKGADAAKGAMNTAATAAEEIAKRLEAVRQGTQEAVGRGIEAVHSALEKLKSRLGEVEQAVGKANQTVNDATAKMAEAYKGLTSIVEANLLRQIEAVKARYQQEQSALETSKQSEAALITKSTQLLTEALTQQTTLRRQSTTDTLKLIDDESKARIESARRQGQTEEERRANVQRVENDILATKRQTMTQALAEYRQHIDALNAEANRHLTEIKRIEEEKRQLSMTTEERVRDIRRQGMTDFEATEDRKRQIAEYQGKAREALANGEFEQARQLAQKAMDLAAQVASSQTSEAKRGEDARKQSEQAVSQVTQLESQSRDAYRKQEYAQAEALMRQADALRAELAQKTKDADAQIAQGKDGVNQAIQRIRESEEILNKTLDAEAKAHQTAAQSALTARDQIQQTLTQTETQIDQITAKLKDGLKVTLDADTTRFDKAIADLDKALAEKEYLLKIQADLQEAEKKLQQYEQLLKEGKTLPVDADVSKAKEALDKLKTYADQNSQFELKVATEKAQAAITKVEGMIKALDRIQTESRHQVSTNADAARSEIMSLNWANTSSTHTIYVRKVEANATGGLVGGGVRRYADGGAVAPAFPRMSGGSVPGSGHHDTVPRTLDAGAFVIRKAAVQKYGGGALSRLANGVARFATGGAVMLGGGKRPSGNDADGTPSTPKKNREAVEAMKMIDLGLQGMNEYTNWLQWNYGASVSLDMRSKTMDSYGKQAQQDRRALEDFISRKTLTGNERQNLERIKQTWRQAMAQPLLWGKDLERELIDYMEQNQGEFYRRGGMAKSDTVPAMLTPGEFVVNKDAVSRYGAGFFEAINNLSAPAQALAGRALAGVQGFATGGLVQPSGSRLARPVLAADAGPSRTVRVELSSGQQKVNATVDARDESRLLQLLDAARARTA >NZ_AP021884|1977054:2023954|1977054_1977318_+|WP_024973176.1|DBSCAN-SWA MQTQVPSIESGRNPRRMNPGGATCIALDENELAIRWGLSVKTLRRWRQEQLGPIYCKLGRRVTYLLHEIEAFERRVSRYSSFTRAYQ >NZ_AP021884|1977054:2023954|2013478_2013700_+|WP_147073248.1|DBSCAN-SWA MRNLYEQFRQLIPDPPLQAGTVSDVGSGVVTVALPGGGRIKARGSAALGQKVFVRDDAIEGIAPSLTLEIIEI >NZ_AP021884|1977054:2023954|1985452_1986862_+|WP_147073341.1|DBSCAN-SWA 
MNTLNVEYRKVEALIPYARNPRTHTDEQVAKIAASIVEYGWTNPVLVDGDNGIIAGHGRLAAARKLGLDQVPVIELAHLSPTQKRAYVISDNRLALDAGWNEEMLALEMAELSEAGYDLALTGFEDAEIEALLADEVASDAADQEPDADEPDDGDDVPDSPVVPVSRTGDFWAIGTHRLICGDATDPTVVATLMQGDAARLCFTSPPYGNQRDYTSGGITDWDGLMRGVFAKVPMDDDGQVLVNLGLIHRDNEVIPYWDAWLGWMRTQGWRRFAWYVWDQGPGMPGDWAGRFAPSFEFVFHFNRSSRKPNKIVPCKHAGQESHLRADGSSTAMRGKDGEVGGWTHKGQPTQDTRIPDSVIRVMRHKGKIGQDIDHPAVFPVALPEFVIEAYTDAGDIVFEPFGGSGTTMLAAQRKGRVCRCVEIAPEYVDVAIKRFQQNHPGVPVTLLATGQSFDDVVNERQATTEVEQ >NZ_AP021884|1977054:2023954|1992902_1993295_+|WP_147073281.1|DBSCAN-SWA MSLASSIAALAARIGFEVKTKIDATHPGIARVWVSFGYVGGQVVIASAHNVASVVRTAAGRYRVHFAVAMPDANYCWTALARSSTNTGQQRLALVRASSDLKTAQYVDVSCATAASSFDDSSEINLVVYR >NZ_AP021884|1977054:2023954|1996290_1996668_+|WP_147073275.1|head|DBSCAN-SWA MPAMQEPINLGDLLKYEAPNLYSRDRVTVAAGQTLPLGTVLGQITATGKVKQIDPSATDGSQYSAGVLMQDADAALADRNDGLMVARHAIVSDHALHWPTGITTAEQQAAIQQLKALGVLVRIGA >NZ_AP021884|1977054:2023954|1979299_1980154_+|WP_147073311.1|DBSCAN-SWA MNASVLTASHYGVVRFGDLQCEAVVLKGGERGYVRRQLAKLLGFHETHKGGRFARFLADFAPKSLSALEKTREPILLPSGRQAQFFPAGIIADVASAVVSAAINGTLHKARQGIVPNCMKIMRALATTGEVALIDEATGYQYHRAPDALQELISKLLRQSCSSWERRFHPDYYRALYRLFGWKYQGHDQNPPHVVGQITQRWVYGPVLPVTLIDEIRARKGISQKHHQWLSDQGLARLETQIHAVTAIARSSTCYRDFDRRCEAAFAGGALQLALLAEDFEEGA >NZ_AP021884|1977054:2023954|1989887_1990427_+|WP_147073285.1|DBSCAN-SWA MGISIRAYARHRGVTDTAVHKAIRAGRITPEADGTIDADRADREWARNSDVPKTGTRAKAAKVAVPEGGTGVGGDGPAALPAGGASLLQARTVNEVVKAQTNKVRLARLKGELVDRPQAIAHVFKLARSERDAWLNWPARISAQMAAKLNIDPHTMHVALEAAIREHLQELGELRPRVD >NZ_AP021884|1977054:2023954|1984518_1984728_+|WP_147073299.1|DBSCAN-SWA MKVSTPQYRCPLGRLQPQTTDLDAIKERGWRDQHILVVNASDDRLDFIEREIVRRIGERLYGLGGTRHG >NZ_AP021884|1977054:2023954|2015009_2016134_-|WP_147073241.1|DBSCAN-SWA 
MGVVAEQVESASVDANGYIGRIPVRNLWLLMLYASELFRTRGIGKVGLEDNPDDLPDLVAEILAHAVEVRQRRRLNLGYRSRDAVLNRVRGRIDVLNTERHRLLDRGLVACRFDELTVDTPRNRFVRAALETIARIVQRKDVSHRCRSLASGMKAMGVSGDVPTRAQMSTDRFGRNDADDRFMVAAAKLAFDLALPTEVAGTNVLTLPEREVTWVRRLFEKAVGGFYDVVLQPQGWRVRCGGTLGWQIEQKTSGIDKILPTMRTDVVLDHAPSGRRIVIDTKFTSIVTSGWYREETLRSGYVYQIYAYLRSQVGRGDPLADDASGLLLHPSVGDAVDETVVIQGHGIRFATVDLTASPGEIRSQLLRLCEPALQ >NZ_AP021884|1977054:2023954|2011071_2011467_+|WP_147073252.1|DBSCAN-SWA MTVAITVEHNEARLAGTLAFLDAGSNPARLRIYGGTRPANPATTPTSAMLVEIRLTKPAGTIAGGLLTLTQQEDGLITATGIATWARLVNGNEVTALDLDCSGTDGSGDVKLASTNLYLGGDARMVSAILG >NZ_AP021884|1977054:2023954|1981632_1983924_+|WP_147073303.1|DBSCAN-SWA MIDFNDTTQPAEHNRESERDEIRADLLARLESVLTTMFPAGKKRRGKFLIGDILGSPGDSLEVVLEGEKAGLWTDRATGDGGDIFALIAAYLGANVHTDFPRVLDEAADLLGRSRSVPVRKAKKEAPVDDLGPATAKWDYFDAGGKLIAVVYRYDPPGGKKEFRPWDAKRRKMAPPEPRPLFNQPGIGAASHVVLVEGEKCAQALIASGVVATTAMHGANAPVDKTDWSPLAGKTVLIWPDRDAPGWDYADRASQAILQAGATSVAILMPPDDKPEGWDAADAIPEGFDVGGFLAVGERMPVMRSVEEAPSPDLLTGIDWTTEDGLSSAFTRRYGEDWRYCALWGKWLVWTGVRWNPDQVLYVSHLSRGICRNASLKADTPRLKGKLASSATISSVEKIARSDPKHASTAEEWDADVWALNTPGGVVDLRTGRMRPHRRDDRMTKVTTATPQGNPDSACPTWRGFLTDVTGGDADLMAYLQLMVGYCLTGVTSEHALFFLYGTGANGKSVFVNVLTTILGDYAANAPMDTFMEARNDRHPTDLAGLRGARFVSSIETEQGRRWNESKVKAITGGDKVSARFMRQDFFEYLPQFKLVIAGNHKPSIRNVDEAMKRRLHLIPFTVTIPPERRDGRLTEKLLKERDGILAWAVEGCSRWQSQGLKPPASVVSATEEYFEAEDALGQWIEERCLLAKSHREGVSELFADWREWAERAGEYVGSVKRFSELMATRKFDKCRLTGGARAIAGIALRPKPYSHAYPYRDD >NZ_AP021884|1977054:2023954|1999813_2000017_+|WP_147073264.1|DBSCAN-SWA MIEHGHRLPDILDYTLAQVRGFVVATARTDAARDARLLSVIAIGTRSDARQLDQTLDRLTDKATDRA >NZ_AP021884|1977054:2023954|2014548_2015016_+|WP_170227448.1|DBSCAN-SWA MTGVPKTAIELAKRFEGFHRVPRIDPGRAHPYICPAGYWTIGYGHLCESTHPPITESEAEVYLTHDLQTALAATLRYCPVLATEPEGRLSAIVDFTFNLGAGRLQTSTLRRRINQRDWAVSAQELCRWIYGGGKVLPGLVARRKVEAALMSGLHY >NZ_AP021884|1977054:2023954|2011470_2013486_+|WP_147073250.1|DBSCAN-SWA 
MPAVLNEVTLVAALPAPTASVAVGPPLVDLLFDQPAATDANLVFGANYIAPRDDVVVLASLPLPVVAIKFIPPARAALLAELPALTVTTLLLRPSVPLDVTGASLPGVVFSGEVRYYSRTQRPTVGQTAHAWQVAAQTEDGSTQGQQDAAATPAGWDTFWRRTLGVPQGIEHRLPPVLAAAPEQRGARHQDATRLQDSTWFAHQDATRFAATRQGLFQNAGPLRDTTRFRHQDGDRTKRAGRVSFWQIARLLTERQGSDFQIASPSLKGWSVRYQDAVPPPLGISVWVVPQPPAPIPCYTPSAHLLFAALAPADSHLLFVCENHINPPPPDGEPVVVPVRRVYFVINNVTLYRVSDGAPVPVFNLSLSLDASSWAWGFDAVLPAKAEALVAGSASGPVELVASVNGTPFRVLAESISRERIFGDASIRISGRGRNAVLAAPYAPVMTFSNTEGRTARQLMDDVLTVNGIPLGWAVDWGLTDWNVPAGAFAQQGSWIDALTAIAGAAGGYLIPHPSAQSIRVRHRYPVAPWEWSTVTPDFVLPVDAVARESLRWLEKPAYNRVFVSGQDVGVLGQVTRAGTAGEVLAPMVVDPLITEAAAARQRGVAVLADTGHQLEVSLRLPVLAETGIIEPGAFVEYQDGSVTRLGIVRATQVEAGLPEVWQTLGVQAYA >NZ_AP021884|1977054:2023954|1977329_1977812_+|WP_024973177.1|DBSCAN-SWA MSDLTIFPVDIAEMSVSQLAALPPEQKCEVDKNLDAAIDWLKKARTKFDAALEQCYGEQARVALRESGRDFGTAHISDGPLHIKFELPKKVSWNQKQLGEIAERIVASGEKVEGYLDVKLSVSESRYINWPPALQQQFAAARTVDSGKPSFTLSTDGGEA >NZ_AP021884|1977054:2023954|1984052_1984517_+|WP_147073301.1|DBSCAN-SWA MKTTILALDLGTHTGWALQHLDGTITSGTEHFKPQRFEGGGMRFLRFKRWLNELLSVSNHINAVFFEEVRRHAGVDAAHAYGGFMGHLTAWCEHHNIPYQGVPVGTIKKHATGKGNASKDEMITSVRERGHTPVDDNEADALALLHWAVETQEV >NZ_AP021884|1977054:2023954|2014075_2014552_+|WP_147073244.1|DBSCAN-SWA MIETLLGGLLGGVFRLAPEILKWLDRKGERGHELAMQDKALEFEKIRGAQRMAEIGASAEAAWNVGAVDALREAVRTQGEKTGVRWADALSISVRPVITYWFMALYCAAKTAAFAAAVTAGSGWGTAILHAWTEADQALWAGVLNFWFLGRVFDRVRS >NZ_AP021884|1977054:2023954|2004724_2005132_+|WP_147073260.1|DBSCAN-SWA MQLTNLDAGVALPLPDDLLWSDEHAWSPAVATTSYLITGALLIQSATRQAGRPITLVGAPDMAWVTRATVEQLQAWAALPVGSATGRFGLTFSDGRSFTVAFRHAETAIEAEPVLGIPARAATDFYRLTLRFLEI >NZ_AP021884|1977054:2023954|1990429_1992394_+|WP_147073339.1|terminase|DBSCAN-SWA 
MNVEYEGAAEIERAWREGLTPDPLLSVSEWSDRHRMLSSKASAEPGRWRTSRTPYLKAIMDCLSPTSPVERVVFMKAAQLGATEMGSNWIGYVIHHAPGPMMAVWPTVDMAKRNSKQRIDPLIEESAALSELISPARSRDSGNTILAKEFRGGVLVMTGANSAVGLRSMPVRYLFLDEVDGYPLDVEGEGDAISLAEARTRTFARRKIFIVSTPTISGASAIEREYEASDQRRYFLPCPHCSHRQWLRFEQLRWEKGQPDTASYICESCDKSIAEHHKTWMLEHGEWRAMISDGTGKTAGFHLSSLYSPVGWRGWRDIAAAWESSVNKESGSAAAIKTFKNTELGETWVEEGEAPDWQRLVERREDYRVGTVPPGGLLLVGAADVQKDRIEASIWAFGRGKESWLVEHRVLMGDTARDAVWKRLAELLAENWTHASGAAMPLARFALDTGFATQEAYAFVRACRDPRVMPVKGVPRGAALIGTPTAIDVSQGGKKLRRGIKVFTVAVGIAKLEFYNNLRKGADVSEDGVTTVYPTGFVHLPKIDAEFIQQLCAEQLITRRDRNGFPVREWQKMRERNEALDCYVYARAAASAAGLDRFEERHWRELERQLGMERPPDEPPPIQAFDPNEATQRGGLSVSANPPRRRVIKSRWLS >NZ_AP021884|1977054:2023954|2019092_2021114_-|WP_147073235.1|DBSCAN-SWA MSQWIERILSRFTADLDRLWVACDPDDVLLDERLLAELRSRGFEVLLYEDPFVFRTEFEERYRAAWDRGEPGPTPAVVVHWRGADPNELPWDLGHYGRVVSLGLAQLFPRLAYNVVKQLEPEHFAALLEAHDTELQGIRGENESKDFILERVYHLAPRSSIRTESDFWRDVLRMHFANRALPLVFAEHAASIIQSKGLLAGLPVAAWLSSKSALLRVVQEAWHRYLANLGMEGSRIGEPPPPDYVAKVDIPFAHSDVQSIVDSMFLNGSLHPLVVEVVPADAPGWIKAGIVQDPQARNAFVKKGIAKLIDAIPSATATHKAWSEFAKQYGETLARVHDLGNAYGTDGLAEVQALVKTLQEQSDAQLHAWVAAKHYADLSTLSFHNGPVMVHRIPDYLSSRRKALGADKIALLVFDGLALDQWVQIRERLVEATKRFAFDEGTSFAWLPTVTSVSRQALFSGRKPREFEESIGHTNKEEYLWKAYWQEQGVKPGEIYYQRSLRQIEQLDALQAALDDRRPKVVGLVVDEVDDRLHKERSKQDVALWIANWLKTGFVDRLFAMLLDRGFHIYLTADHGNVEAVGVGRPSEGDVPEARGERVRVYRSESLLAKSAAANANSVHLDIAGLPAGYMPLFAGGRTAFVPDGEQVVVHGGISVEELIVPFVKVKYVIGNE >NZ_AP021884|1977054:2023954|1984720_1985137_+|WP_147073297.1|DBSCAN-SWA MAEWTTDDVAARFEEAATTGRRLPPVRVQGYFNCWPAFVRKEWEAFAADEKVYRPFPPSPEAIDRMLETMRWVQWLEVEQRHLVWMRAKRYGWRDITIRFACDRTTAWRRWQRAMEIVATNLNSEGVRLPSKNVGNLG >NZ_AP021884|1977054:2023954|2018373_2019096_-|WP_147073237.1|DBSCAN-SWA MNTSAPQIGFDRFIQLDWAAAALRVRAGTAGLDDLNALLDAAELGVEAKKKTRTVLNRLWLEPRAELVDFADRAAALYKSQPDTPIPVLCWGVAIACYPFFGKVSELVGRLSAIQDDCAAAEVHRRMSEIYGEREGTRRMTNMVIQTQASWGAVERVEKGKRVIRLAATPIDSDELTVWLIEAAVRYVGKPVSVHSLQSLPVLFPFRLTRPLAYVVSNSPYLDLRSEGPSNQFVALRSTI 
>NZ_AP021884|1977054:2023954|1992411_1992903_+|WP_147073283.1|DBSCAN-SWA MSLATRIESLVIRVAQEFNDVRATAGSLASLSTNDKSSLVAAINELKAAVLSAMAIDDNQIATTSTYSSNKIVSLLDALKTDILGGADAAYDTLVEIQQALQSGTSGLDAILAAVNLRVRFDAAQTLTVAEQLQARTNIGAVAVSDVGNTDTDFVVIFDGALA >NZ_AP021884|1977054:2023954|2008720_2009986_+|WP_147073256.1|DBSCAN-SWA MIDLTVKYFNSGMTGAPQISNNWGDLVTMLDACLVNGFALKAIDTLTFADGIATATISTGHAYRPFQVVEIAGAEQPEYNGSFRVLSTTTTAFTYAVTGAPVSPATTTTNLSAKVAPLGWEKPFAGTSKAAYRSKNPQSPQNILLIDNSLKTPNYTTGWAKWANVGIVEDLSDIDTIVGAQAPYDPNNPTQNWKQVTASQWGWYKWFHARGPQYESNGDSGGGGRNWVLIGDDRLFFLFCTNAAGYGWYGRNSYCFGDLISFKPGDNYATVLAADDNYSGMSNYWSYPGQFSGYGLVSSLDFTGKVLLRNHTQLGNPVRFGLTSLNTNNGQQICGRGPMPFPNGADYSLWLLPTYVRQEDGHMRGILPGMLWMPQDRPYSDQTIVDNVVGQAGKRFLLVRTQYSSETEGAQIAFDITGPWR >NZ_AP021884|1977054:2023954|1978662_1979289_+|WP_024973179.1|DBSCAN-SWA MTAWNDFNDADSQQSGFDLIPKGTVVPVRMTIKPGGYDDPEQGWGGGYATESFETGSIYLAAEFVVTAGDHAKRKMWSNVGLLSKKGPTWGQMGRSFIRAALNSARNVHPQDNSPQAAAARRINGFAELDGLEFLARVDIEKDAKGQDRNVVKLAVEPDHPDYAKLKGVPPKGSPGGGNSGAPAQAAPAYSAPTPQRAPVTGKPSWAQ >NZ_AP021884|1977054:2023954|1999415_1999817_+|WP_147073266.1|DBSCAN-SWA MSDLDTLIPQAVELVIDGEPLAIKPLKVGQMPGFLRAMSPVMQQLTASNIDWLALFGERGDDLLSAIAIAVGKPRAWVDELAADEAILLAAKVIEVNADFFTQTVIPKLDGLFGQVKLPPIVKAAAGSMPSST >NZ_AP021884|1977054:2023954|2016133_2018365_-|WP_147073239.1|DBSCAN-SWA 
MNIKTENSSAQASWFVGASYGGTDDQLPRFLSEGIWENGYEDKHLDVVRSMRPGDQIAIKSSYTRKHGLPFDSRGQAVSVMAIKAIGTITENLNDGQRVKVDWTKIEPVREWYFYTHRGTVWRVLPGEWMTDGLISFAFHNKPQDVERFRNAPYWRERFGTVAADKHRFGWTKFYEAIADKLLTYRANRAALVEGIREISVRVDGLGHLAEDKYADGTTGFVKDICPFTTMGLFNRGIKDSNRKIIATELAKFLGVDEPVPETFEGIPLLNNLKSWYFPFEINRATDHIDALWGVFAAGIAYADTDDDLAREEFAKAFDSANGRRGVAWNLTFGLYWIRPWAFLSLDHNSQLYVSKKLGVPIGMHGPKRRCNSADYLAVMDVLEPRFQETAYPVHSYPELSLEAWLYKDPTDEKSPVGEDDAGDVDDGDDATEATAPEDVHVAVPIVPYSVDDILKDGCFLERNEIDRLLDRLRTKKNLILQGPPGTGKTWLAKRLAFALMGQKDESKVRAVQFHPNLSYEDFVRGWRPTGEGKLSLADGVFMEAIKAASKDPSSKFVVVIEEINRGNPAQIFGELLTLLEAGKRTPNEALELCYPDADGKRRPVHIPENLYVVGTMNIADRSLALVDLALRRRFAFVGLEPRLGTAWRDWAVKECAVDPALVADIEHRIAELNDQIAADARLGKQFRIGHSYVTPAHRLEAGDTKKWFQQVVETEIGPLLDEYWFDAPDEAQKACARLLQGW >NZ_AP021884|1977054:2023954|1997701_1998004_+|WP_147073273.1|DBSCAN-SWA MSLVAQIYESAANAGLLKECLWYPSNGAPSQLHQIGFAAPDESLLDGLALSTDYEMTYPVTAFGGLAVREVVEIGGTSFQVRDIRSLSDGSEIRAKLTRL >NZ_AP021884|1977054:2023954|2021110_2023954_-|WP_147073232.1|DBSCAN-SWA MAGGGFNVGDWCWLTRQAASCRVIDRQEVWGESAYRVWLPAKDAVVRARASDLAPLDSVRPTVEEILHTTAAAKLLDALEDNLLLAPIQSSVVPLPHQLYALNRAISRDRIRYLLADEVGLGKTIEAGLVLRELKLRGRVKRVLVVAPKGLVRQWQAEMRLHFGEHLQFIEPSELAAFRQWRSGNQGDEDNLWRMHDQVICSLDSVKPMESRRGWSLEQLNNYNRERFEDLISASWDLVIIDEAHRMGGSTEQVARYKLGAALAEASPYLLLLSATPHQGKTDQFLRLMQLLDRDAFPDESSVNRDRVRPFVIRTEKRLSINADGQPLFKPRVTRLQAVAWQARHNAQRRLYEAVTDYVRHGYNQAMAAKQRHIGFLMILMQRLVTSSTAAIRTTLEKRLALLEEPQPQPSLFENTSEEDWADLDGQSQVDLAMQATGWELEKSEVEILLALARETEASGTDAKAEALLELIYKLQQEENDPALKVLLFTEFVPTQAMLANYLESRGFSVATLNGGMDLDARSKAQKAFSQDVRVLISTDAGGEGLNLQFCHVIVNFDMPWNPMRIEQRIGRVDRIGQRHVVRAINFVLEDTVEHRVRQVLEEKLEVIAQQFGVDKASDVMDSAEAEPLFEELFVHGLQNPASIEQECDAVVSQLRETLAESKKSSELLSDAHELEADDARKWRDHPAQFWLERAITSGLAARGGAATKVGKAWRVTWADGSEAAQVCFDARTADENHDIEWVTLEDPRARAVISELPRFVAGQPLPVIRVTGLPDSVSGVWSLWEISLAAEGLNRKRYLPVFTNEEGRAFVPTAKRVWDLLLTETVVVHGVSGTEEAVKWFDAALTAAKAQGERIFTEMLEAHRTRLQEERERADYAFEARQQAIGRIGLPAVREHRRKRLQQEHDARLAALAEAAASVPDLNAVMMVRVSAEVQPGENVRETQST 
>NZ_AP021884|1977054:2023954|2013776_2014079_+|WP_147073246.1|DBSCAN-SWA MTEPEQQPALVENMLLLRKEDFDDLLDRAAERGAERCLAHLGLENGHAARDIRELRDLLEAWRDARRTAWQTTIKVATTGILAALLVGAAIKLKLMGGPQ >NZ_AP021884|1977054:2023954|1988949_1989474_-|WP_147073289.1|DBSCAN-SWA MTTTQLTPAQHAILAYALEHTDGKIDWFPDNIKGGARKKVLDGLFNRALITSDGTHWFVAAEGYDAMGRARPTPAPVAADPELDAAVTAAEAAWAQEKAAAKPRTRENSKQATVIQMLQRPEGATVQQICETTGWQAHTVRGTFAGAFKKKLGLTIVSDKAQGSERVYRIAAEA >NZ_AP021884|1977054:2023954|1977808_1978657_+|WP_024973178.1|DBSCAN-SWA MKRLPIVSAVERMAERKGVKLLMLGKSGIGKTSRLKDLDPATTLFLDIEAGDLAVADWPGDTIRPASWPESRDFFVFLAGPDKSLPPESAFSQAHYDHVIEKFGDATQLGRYQTFFLDSITQLSRQCFAWCKTQPGAVSDRSGKPDLRAAYGLLGQEMIGALTHLQHARGKNVVFVAILDERLDDFNRKVFVPQIEGSKTSLELPGIVDEVVTLAEIKAEDGSSYRAFITHTVNPYGFPAKDRSGRLDLLEPPHLGALIAKCAGAVPALASAANPAHIESQE >NZ_AP021884|1977054:2023954|1998651_1999404_+|WP_147073268.1|DBSCAN-SWA MSTYASFQGRVFLGKRDTDGLPIEVRSPGNVAELKLSLKTDVLEHYESQTGQRSLDHRMVKQKSATVNLTIEEFTKENLALALYGNHVVGTPGTVTAEPVGGATPIAGDRYFLAHPKVSSLVVTDSAGTPATLALGTNYTADPDFGALQFLDTTGFTAPFKASYAYGVATEIGIFTQALPERFLRLEGINTAQGNAKVLVELYRVAFDPLKEISFISDEYNKFELEGSLLADTTKPFDAVLGQFGRIVQL >NZ_AP021884|1977054:2023954|1988093_1988462_-|WP_147073293.1|DBSCAN-SWA MNTNQQMPATQNDAWGFWGTMNEHASTAWPLAMTAISDATGQPLESVRVFLDSRHGRHFADDLQNGLYRGQTLADAINAATQQWMGWTIGRQTSKQYGIPRGLPYLTGFVIHCEIAEESIAA >NZ_AP021884|1977054:2023954|1988560_1988920_-|WP_147073291.1|DBSCAN-SWA MSTMTITIERTPRTLQFGDTTFQVEELSVRLPFARKPADLDEVGGQGQTKVYVTETKELTVDEFDAFARSLLVSRDWLRGKGGGTGDGYLCVEVTAPGRPYLYVNPEGGDYARYVARLG >NZ_AP021884|1977054:2023954|1981381_1981636_+|WP_147073305.1|DBSCAN-SWA MTDNNTPTTGIEPMIDAKQAAAALRLPYYWFADHAMRTKYRIPHYLMGGLVRYRLSELSAWATRTTAVQGRDSQDADAPVEGAE >NZ_AP021884|1977054:2023954|2005133_2008697_+|WP_147073258.1|DBSCAN-SWA 
MPIQSGDVKLLKSAVMADVPEGGGAPTGNTIADGVSNAIFPDISELDRAGGRVNLRKSFVSVQTDDTDTYFGANVIVAEPPQDARVSVTLFSTEKTFDTREQAQVRIEAYLNKGPEWAGYLFENHIAGQRVIQLFQRTTDTVPNVGQTLVLIENEGLGTQKEQYIRATSVSVVERTFTYDGDKDYKASIVTVDISDALRYDFTGSPASRTFTRAANSTKTRDTVVADAGTYVGVVPLTQAAAVGDFTIKGTSIYTQLVPSAQTETPISFVPPYAAAGLPVPGAVAVSYTASHAWTTSIKFNLPGGCLPGSLTIGTDGITIFDDAGLLKTASGTVGTIDYANGILTLNSGTMSNAKAITYTPAAQILRAPQSSEIPVTPESRSQSYVGTVNPVPQPGTLSISYMAQGRWYVLSDSGNGSLKGLDASYGAGTFNRNTGAFVVTLGALPDVGSSLVLTWNVPTQETQQPSTTLKATQSLALNPPAGTAVQPGSLTVSWEYGGTKTSTAATSGVLSGAATGSLSVAQNRVDFAPNVLPAVGTQLTVSYVAGPKQEDSFAHPSRNGAGTLPVTATLGAIEPGSLEVEWNTFTDEAVLGAYTFAQLQEMGIAVSIWRDPTQIARDDGNGGVVLNGISIGTVNYATGQVTFNPDVSIRIPRPVYTAVAINGTGRWRLNYGGIAYVDAPSLYPNDESGYVKLRYNSAGSTSNQTETFQFLPAFKLVPGVNAQVVTGTVLLSISGAQPWGDNGQGTLREFTTSGWVTRGTINYLSGDVALTSWTAGTNNAITRASCVTTVGENISSEFVFRTGAAPLRPGSLSIQYARAVGGTQNVTAGIDGKIEATGISGSVDYETGLVRVRFGTMVTAAGNESQPWYAADRVGTDGKIFRPEPVAASSVRYSAVAYSYLPLDADLLGIDPVRLPSDGRVPIFRPGGFAVVGHTGKITSSVSNGQTINCARVRLSRVRVVGHDGAVIHTGYSTDLEAGTVTFINVSGYSQPVTIEHRIEDMAVVRDVQISGEISFTRALTHEYPLGSHVSSALVAGDLFARVNLVFDQSTWNGAWSDALSGSSATATFNNTQYPIRVTNRGALTERWIVRLTNSTSFEVIGENVGVIATGNTSADCAPNNPATGVPYFHLPALGWGNGWATGNVLRFNTIGAQFPVWVVRTVQQGPESVPDDNFTLLIRGDVDTP >NZ_AP021884|1977054:2023954|1980150_1980651_+|WP_147073309.1|DBSCAN-SWA MKCWVCKRQARGFGHTDNRHGIGDPRRYPIDWVFCSQRCQSAFHAMYGNWSRAKDGRSDIKGVAMIDPSDIELAAMRKCLKSFGEAASEIGFTKPLGNYSEAEALQVIDAIVTCYTEAMVEHHEASKYPPVRGMTPTPDPMTPSAANPFADLDDDLPWEEPKGKKP >NZ_AP021884|1977054:2023954|1998008_1998455_+|WP_147073272.1|DBSCAN-SWA MADNSIRERILLAVMAAARPAVEGLGATLHRSPTVAISRELCPALAVFPESESITERANDRVTRELTVRVVALARAVPPASPETEADRLLTAAHAALFGDGTFGGLALGIREQESEWEVEDADAVAVALPARYRLTYRTLANDLSTLG >NZ_AP021884|1977054:2023954|1995036_1996272_+|WP_147073277.1|DBSCAN-SWA 
MTLLPHLAARLYGVPLAIHRPKLDVILAVLGPRIGLADLAAPSGFTPPARPASTQTTKVAVIPIHGTLVRRTVGLEAESGLTSYAGLTAQLDAALASPDVAAILLDVDSPGGESGGVFDLADRIRAAAKTKPVWAVANDMAFSAAYALASAASKVFVSRTGGVGSIGVIAMHVDQSEKDAQDGVRYTAVFAGDRKNDLNPHEPISSEAHAFLKGEVNRVYGLFVETVARNRGIEASAVRDTEAGLFFGQAAVAIGLADAIGTFDDALAQLCESVSPLPKLAASHSGLFSNPQMESSMNDRTDPAAPDRLAADPAGSPSQPAAATAMTVADAIEVAQTCTLAGRTDLIAGFLEAKAPPAKVRSQLLATQAEASPEIVSRIDPQSAMSASSTGHPASSHNPLIQAVKSRLGTK >NZ_AP021884|1977054:2023954|1989572_1989770_-|WP_147073287.1|DBSCAN-SWA MSKFEQLLTQIAQNKLGIETLETRKSDSLDFHDVAVWCLRDALEAAFNAGVEQGRKATKSDKANS >NZ_AP021884|1977054:2023954|2000022_2000667_+|WP_147073263.1|DBSCAN-SWA MRISVQIDSAAAQAQLRRWGGEFRDKVKKAVSRAIASEAVELKQDVRSHVASQMAVVKKSFLKGFTAKVLDKDLNRLPALYVGSRIPWSAMHETGGQIAGRMLIPLNGRVGRKRFKAQVAELMRGGNAYFIKNAKGNIVLMAENIKEHDRPLAGFKRRYRKAEGIKRLKRGADIPIAVLVPKVVLKKRLDVERLVASRIPRLAAAVENQISTVD >NZ_AP021884|1977054:2023954|1980647_1981385_+|WP_147073307.1|DBSCAN-SWA MMDFNSTSSISGQITALVDAGMQRARAQQSERQYLGASRLGAACERALQFEYAKAPVDHGRDTPGRMLRIFERGHVMEDCMVAWLRDAGFELRTRRADGEQFGFSVADGRLQGHIDGVIVDGPEGFAYPALWENKCLGMKSWRELEKNRLAVAKPVYAAQVAIYQAYLELHEHPAIFTALNADTMEIYTEAVPFDAALAQRMSDRAVKVITATESADLLPRAFNDPTHFECRMCAWQDRCWRTQA >NZ_AP021884|1977054:2023954|1996685_1997705_+|WP_058719286.1|capsid|DBSCAN-SWA MQNPFISPAFSMASMTAAINLIPNRYGRLEELNLFPPKPVRTRQVIVEERAGVLNLLPTQPPGSPGTVNVRGKRTVRSFVVPHIPHDDVVLPEEVQGLRAFGSETEMESIAGVLAQHLETMRNKHAITLEHLRMGALKGEILDADGSRIYNLFDEFGIDQQSVDFEISSPTTGTDVKGKCTDVLGIIEEALLGEFMTGVHCLCSPEFFKALTGHKDVKTAFTNWQQGAVLINDVRRGFTFGGITFEEYRGKATDVNKTVRRFIAAGEAHAFPLGTIDTFGTYFAPADFNETVNTMGQPLYAKQEPRKFDRGTDLHTQANPLPMCHRPGVLVRLVMGGGV >NZ_AP021884|1977054:2023954|1986858_1988130_+|WP_147073295.1|DBSCAN-SWA 
MTASWFADKIEKWPTAKLLPYARNARTHSDDQVAQIAASIAEFGFTNPILAGSDGVIVAGHGRLAAAQKLGLAVVPVVVLDHLSPTQRRALVIADNRIAENAGWDDAMLRIEIASLQDDDFDVSLTGFDADALAELMAGDEPDGEGETDDDAVPELSETPISRPGDVWSLGGHRLLCGDSTVTESYDRLLDGEQVDMVFTDPPYNVNYANSAKDKMRGKDRAILNDNLGDGFYDFLLAALTPTIAHCRGGIYVAMSSSELDVLQAAFRAAGGKWSTFIIWAKNTFTLGRADYQRQYEPILYGWPEGAQRHWCGDRDQGDVWNIKKPQKNDLHPTMKPVELVERAIRNSSRPGNVVLDPFGGSGTTLIAAEKSGRLARLIELDPKYADVIVRRWQEWTGKQATRESDGALFDDQAAIDSSAISQ >NZ_AP021884|1977054:2023954|1993515_1995027_+|WP_147073279.1|portal|DBSCAN-SWA MAWYSKIRSLFGQQPVHEAAGRGRRSLAWMPGNPGAVAAMLATNTELRIKSRDLVRRNAWAQAGIEAFVSNAVGTGIKPQSLAADERFKTDVQALWRDWTEEADAAGQTDFYGLQALACRAMLEGGECLIRLRPRRPEDGLVVPLQLQLLEPEHLPISLNLDLPSGNVVRSGIEFDSLGRRVAYHLYRSHPEDGRLAPMSGQGGMDTVRIDAKEIIHLFRVLRPGQIRGEPWLSRALVKLNELDQYDDAELVRKKTAAMFAGFVTRQNPEDNLMGEGAADGDGIALAGLEPGTLQILEPGEDIKFSDPADVGGSYGEFLRTQFRAVAAAIGVTYEQLTGDLTGVNYSSIRAGMLEFRRRCEMVQHGVLVHQMCRPVWAAWMKQAVLAGAIDAPGFARGGPARRRRYLQVKWIPQGWQWVDPEKEFKAMLLAIRAGLMSRSEAISAFGYDAEDVDREIAADNQRADDLGLIFDSDPRRTSKDGGSAEPNKNAADTTQTGSSSSA >NZ_AP021884|1977054:2023954|1993294_1993516_+|WP_146463160.1|DBSCAN-SWA MAYTEAQLQALETALAKGEHRVSFGDKTVEYRSVDELKAAIREVKRGILEQAAATGLWPGAPRQIRVTTSKGF >NZ_AP021884|1977054:2023954|2009989_2011075_+|WP_147073254.1|DBSCAN-SWA MSYPLSESFATAPATGYTAVLGGMAATHNNVQQSIDISAPNSQSILRFNETAHGDFWFEADVEFLTDPSARKHIGLWMTTGNGSEGYRFAHIDGAWSVTRWNSGFGDGAAVTGGVNDGAKPVAGVIDVAPTFNVGQRMPLRCEVIVGAFDANGVPWARLIQFKAGGVLMFQVGDAAYRGKLIPGVFLYGATARVHAIAGDTPSGLPAFPATVGVNAADDLLPLAGGSTSVPPDPAANIAVNADCDLMRLNSPNSELWNRGGGYDWHFHAIPNGRKNIHFSGHGFIAGTVKEKGQPDQPLVRRVQLVSENTRVLVAETWSDTTGAYRFELIDPAQRYTVVSYDHKQMYRAVIADNLHPEMMP >NZ_AP021884|1977054:2023954|1998460_1998655_+|WP_147073270.1|DBSCAN-SWA MTQLVLTRPHTHAGKTYGVGDRIEIDATSADWLIAHDIATPEPTAPTAEPVPEPKPLQRKEPKQ |
50 | Acidithiobacillus_phage(45.45%) | head,portal,terminase,capsid | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those annotated with a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_6 |
2044160 : 2053121
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_AP021884|2044160:2053121|DBSCAN-SWA ATCAGTGTTTCAGCCCCAGGGTGTGGCTGAGGAATGGCACGCCGCCATACCGTCCCATCAGGTAGCCGCGTTCCAGTGCCTCGTTCTTGCGCGGGCTGAATTCGGATAAGCTGACTAACGCCGAGGTCTGATCACGGTCTTTTTCTACGAACCACACTTGATCGCGTCGGAACAAATCTGGTGCGTCCAGCAGCGACGTGTCGTGCGTCGTGAATATCAGTTGTGCGCCGCCGGTGTTGATCTCTGGGCGGTGGAACAGTCGCACGAGTTCGCGCACCAGCAAGGTGTGCAAGCTGGTGTCGAGCTCGTCGATGACCAGCGTTAGCCCTTTGCGGAGGATGTCGAGCACTGGTCCTGCAAGAAACAGCAAATTGCGCGTGCCGTTGGATTCGTCCATCAATTCAAACACGGCTTTGCCCTGTTCGGTGACGTGATGGAAGCGCAGCTTGTGTTCTTCCATTTCTTCCGAGCGCACTTCAGTTTTACCGGCCACCAAGTCAAAGTGAACGGCCTGCCCAGGGACTTTGCGTGTTTCCACATCAATGTCGGCGATGCTGATGTCGGCAGCAGAGAGGAAGTTGCAAATCTCCTTGCGGCCGTCATCCTGCTTGAGCATCTGAATGGACACCTGTGGACTGAGTTGGGCTTGCTCGTTGAAGATCACCAGGCGATTCACAAACCAGTCGAACACCGGTCGCAAGGCTTCGCTGTTGAGTTGTACGGCCATCGACAGGAAGAGGGCGTTCGGTCGGGTGGCACCTTCCCACAGGTTCTTGGGTCCTTTCAGGCCGGGACCGAAGTCGTAGACATCCTTGCCGGTCTCAGTGTCAAAGCGGCGTGTGAACCAACGCTGGGGCTTGAACGCCTTGTAAACCAGCAGGTGCTCACTGACGATCCGCTGAGCTGTCATGGAAAAGCCATACTGGTAGCGCACACCATCGAGCAGGAACGTGACTTCGAACTCGCTGGGTTGACTGGCGGAATCGACATCGAGTCGGAAGGGCTGAACTGCGAAGGTCTGGCCCGGCTGGATCGCCGTCGCGGACTCGGTCACCACGCCGCGCATGTACTGCAGGGCTTTAATGAGATTGGACTTGCCGCTCGCGTTGGCACCATAGACCACAGCACTGCGCACCAGGGTGGGTGCGGCACTGATGCCGGTGGCTTGGGTGTGGGTGTCCTGCAGGGTCTTGTCCTTGGACGCAACAAGACTAAGCACCTGCTCGTCGCGCAGACTGCGGAAGTTCTTGACGCGGAACTCGACCAGCATTGCATTCACCTCATTTTTATAAAATAGAGTCTTATTATGACTCAAAAGTGCAAAAAACAAATATATTTTTGAATTCTGGGGTGGTTTGAGTGATCTGGGTTGGCATCGCAGGCCTGCCGGGTGTCAAGTCCCCGTTTGAACTTCGCTGGAGCCTTGACGTGGCACCCAGCTGCGGTTCGGCCAAATCGTCGAATCCGCGCCCTAAAGACCCGTGCGTTACTATCTGGTTCTGAAAGCGGACGCGGGTTCGGTTCCGTGTTCCACCACCAATAATGAAACCCCAACCGTTCTCGGTTGGGGTTTTTTCTTGCCTGATCGCGCCGGTTTCCGCGTGTTGTTGGGGCTTCCTGCGGAAGCCTGCGGACTGCACCGGTCAGCCTTCCAGCCCGTTCCGGGCCACATTCCACTCTCTCCTGGCCATTCCTCGCTCCGACCTCGCTCCCTGGAACTGGCCCGAAGTCCGCAAAGGCCGCAATTCAGACCTATACAGATCAAAGAGTTACGCGCGGACTAATCAAGTGGTTGGATTGCGGCATTGGCTACCGGAGGGGACACACGCTTGCCCCAGCGAGCTATTGCACTTGCAGTTTGTCAGAAACTCGCATATATTTTCGTACATGAACACGACGACCAAGACTGCCGAGCTGATTCGC
GAGCGCATCGAGGCGATGCCGATCGGGGAGCCTTTCACCCCGACAGCATTCCTGGAGTGCGGCACGCGTGCGTCCGTCGATCAGACCCTCTCCCGCCTCGTCAAGGCAGGGTTGATCGAGCGCGTGACGCGCGGTGTCTTTGTGCGTCCCGAGGTCAGCCGTTTCGTCGGCAAGGTTAGCCCCTCGCCGCTGAAGGTGGCCGAGACCGTCGCCAAGACCACGGGTGCCGTCGTCCAGGTTCACGGTGCCGAAGCGGCGCGTCGGCTCGAACTAACCACGCAGGTTCCGACCCAGTCGGTATTCGTGACATCCGGCCCGTCGAAGCGCATCCGCGTGGGGAAGATGGAGATCCGTCTGCAGCACGTCTGTCAGCGCAAGTTGGCCCTGGCAGGTCGACCCGCCGGGCTCGCGCTTGCGGCGATGTGGTATCTCGGCAAGAAGGAAGTGACGCCGGCCCTCGTCGAGAAGATTCGGCGCAAGCTGGGATCGAGCGAGTTCGAGGTGCTGAAGTCAGCCACCAGCTCGATGCCTGCGTGGATGAGCGACGCCATCTTCCGAAACGAGCGGATGGCCGCTCATGCCTGAGTCCTTCCTGCACCTGAAGCCTCAAGAGCAGTCCCAGATCTATCGGGCACTGGCTCCGCAGCTTGCCCGCACGCCCGTCGTACTGGAAAAAGATGTCTGGGTCTGCTGGGTGCTGCAGACCCTGTTCACCATGCCCGACCGACTGCCGATGGCCTTCAAGGGCGGCACATCACTCTCCAAGGTGTTCGGCGCCATTGCGCGCTTCTCCGAGGACGTGGACATCACGCTCGACTACCGTGGCTTAGACGGCTCCTTCGACCCGTTTGCCGAAGGCGTCTCACGCAATCGGCTGAAGAAATTCAGCGAGGATCTCAAGTCCTTCGTGCGCGGCCATGCCCACGGTGTCGTGGCGCCGCACTTTCAGAAGATGCTGGCGGACGAGTTCGATGCCGATGCATTCCAGCTTGAAGTCAGCGATGACGGCGAGCAGATGCGGGTGCACTACCCGAGCGTGCTGGAGGCACCAGGAGACTATGTGGGCAACAGTGTCCTGATCGAGTTCGGTGGCCGTAACATCACCGAGCCGAATGAGGAGCGTGAGGTGCGACCCGACATCGCGGAACATGTCGCTGAACTCGATTTCCCTCGCTCGACGGTCAGTGTGCTGTCTCCGACACGTACCTTCTGGGAAAAGGCGACGCTGATACACGTCGAGTGTCAGCGCGACGAGTTCCGCACAGGCGCCGAACGTCTGTCACGCCACTGGTACGACCTGGCCATGCTGGCCGATCTTGCCCATGGGCAAGCCGCTGTGGCCGATCGCGCTCTGCTCGCGGATGTTGTCAAGCACAAGAAGGTCTTCTACAACGCGAGCTACGCCAACTACGACGCATGCCTGTCCGGGCAGCTCAGACTAATTCCGGAAGATGCTGCACTGGCCGCGCTGCGCGATGACTTCCAGCGCATGATCGGTGCCGGCATGTTCATCGGCGAGCCTCCCGCCTTCGATGCCATCGTCGATCGCCTGCGCGCGCTGGAAACAACAATCAATCAGTGACCTCCCGCTGGCGTTGAGTCGGCAATCCTCGCCCGCTGCTCATCCCACAAGGCTGGCGGATCAACCGCCAGATCAAACAGCGTGATGTGGTTCGGCAGTGCGTCGTCCAGGATGGCGGCCACGATGTCGGGGGCTAGCGTGGTCAGGTTGACCATACGGCTGACGTAGCTGTTGTCGATGCCTTCCCGTGTGGCGATCTCCTTCAAGGACTTCGCTTCTCCTGATTCCAGCATCGCCAACCAGCGGTGGCCCCTGGCCAACGCCAGCTGGATGGAGGTCGGCGCCATGTCCCACGGTCTGACCGGCGCGGTTTCTCCGTTCGGCAAGGTGACCAGCTTGCGGCCGCTACGGCGCTTGATCTGGATCGGTACAGACAGGGTCAGCCTGCCGTCGCTGGTCTGCAGGATGTCCGGCTCGC
CGGTTTTCTGGATGCGGATGTCGCTCATGCCAGGGCCTCCTCAGTTTGCTCGACCGGCTCGGGACGCAGTTCCAGCACCAGGCGTTCGATGCCGTTGGTGCGCAGGCGCACTTCGAGGTCGTTGGGTGACACGATGACTTTCTCGACCAGCAATTTCACGATCCGGGTCTGCTCCGCCGGGAATAGCTGATCCCAAATCGCATCAAGCCGGGTCATGGCCACGGTGATCTTGGCCTCGTCCAGCGTCGGGTCGAGCTTGATCGCCTGTGGCAGCATGTTGCCGAGCAGATTCGGGGCATGCAAAATCGCGCGTAGTTGATCGAGTACCGCCGACTCCAGTTCTGCGGCGGGCAGTCGCGGCAGCCCCGAGGCACCCGCGTGTTCCTTGGCGTCGCGCTGGGGCACGTAGTAACGGTAGCGCCGGCCATTCTTCTTGGTGGTGTGCCACGGCGACAGTGCGCGGCCATCGTTGCCGAACACGATGCCCTTGAGCAGATAGGGAACCTTGGCCCGCGTCTTGTTGCCCCGCACCCGGCCATTCGTCTCCAGGATCGCGTGGACGCTGTCCCACAGTTCGCGGCTGACGATCGGCGGGTGTTCGGCCTGGTACCACTGGTCCTTGTGCCGCAACTCGCCAAGGTAGGTCCGGTTGCTCAGGAGCTTGTAAATGTGGCCCTTGTCGATCGGCCTGCCATCGCGGGTCTTGCCGTCTTGTGTGGTCCACGCCTTCGACGTCACGCCATCCAGTTTCAGCTCCTTGACCAGTGCGGTGCTGGAACCGAGTTCAACGAAGCGCTGGAAGATGTGCCGGATCAGCTTGGACTCACGCTCGTTGGGCACCAACCGCCGGTTCTCGACGTCGTAGCCCAGCGGCGGCACGCCACCCATCCACATACCCTTGCGCTTGCTGGCCGCGATCTTGTCTCTGATGCGCTCACCGGTGACCTCGCGCTCAAACTGCGCAAAGGACAGCAGGATGTTCAACATCAACCTGCCCATCGAGGTCGTCGTGTTGAACTGCTGGGTGACCGACACGAACGACACGCCATAGCGCTCGAACACTTCGACCATCTTGGAGAAGTCCGCCAGGCTGCGCGTCAGGCGGTCGATCTTGTAGATGACGACCACGTCGATCTTGCCGGCTTCGATGTCCGCCATCATTCGCTGGAGCGCCGGGCGTTCCATGTTGCCGCCGGAAAAAGCTGGATCGTCGTAATCGTCGGCGACCGGTATCCAGCCTTCGGCGCGCTGGCTGGCGATGTAGGCATGGCCGGCGTCGCGCTGGGCATCGATGGAGTTGTATTCCTGGTCCAGCCCTTCATCGGTGGATTTGCGCGTGTAGACCGCACAGCGCATGCGGCGCTTCAAGACTTCGCTCATCGTCCACCTCTCTTCTTGGTGGACGGCTTGGCCTTGGCGTTGGACGGCGGCTTGAGCCCGAAGAACAGCGGCCCCGACCAGCGCATGCCGGTGATTTCGCGGGCGATCATCGATAGGCTCGGGTACATGCGTCCCTGGAAGTCATACTGGCCGTCGGCGGTTGCGATCACGCGGTATTCGACGCCTTTGTATTCCCGGACCAGCACCGTGCCTGCCGCCGGACGGTAATCGCGGTCACGCTTTTTCACCTTGCCTGTTTCCACCAGAGATGCGATGCGACGCTGGTTGCGATCCAGCAGGTTGGCGTCGGCCTTGCGGAATTCCAGCTCCTGCAGCCGGTAGGCAATCCGGCGTTCGAGGAACTGGCGGTTGTGGGTGGGAGTGTCGCCACCGACCAGCTTCTGCCAGAGGGCCCGGATCTCTGCCATCGGCATCTCGGGCAGCCTGGCGATCTGCGCCGCCACCGATGGCGGCGTGGAAAATGATGGTGTTTGCGTGCTCATTTCGACTCCGTAGTTGTCTTGTTGACGGGGTCTGTATGAACGCGCTGGTTGCCAGAGAAGCCAAGCTCAAACTCGCTCGCTTCTGCCCTGGTTGCGGACTGTTCTGTGCCGGTGATA
CGCAAGCGTGCCAGGCCGTTGGCCAGCAACGACGCGATCTCGTGACGACGCTGTTTAGCACTGGGCGTCTATCCTGACGTGACGCTATCTGTTGCGAGAAACAAGCGATATGAGGCACGCAAGCTGCTTTCCAATGATGTAGACCCGGCTATGCTCAAACAGGTGACTAAGCGCGCATCGCGCGTGTCTGCTGAAAACAGTTTTGAAGCAATAGCGAGAGAATGGTATGCAAAATTCTCGGGCGAATGGGTGCCTAGCCATGGCGAAAAAATCATCCGCAGATTAGAACGCGACCTGTTTCCCTGGATCGGTAAACGCCCTATTGCCGAGATCACCGCACCTGAACTGTTAGCCGTCTTACGCCGCATTGAAAACCGAGGCGCGCTAGATACGGCGCACCGTGCGCATCAAAACTGCGGGCAAGTGTTCCGCTATGCAATCGCCACTGGGCGCGCTGAACGCGATCCTAGCCCCGACTTGCGCGGCGCATTGCCGCCAGCTGGGTATTCTGGATCATCGTGACCGCTGATTCCGGGCTATCGTGACCGGTCATTCCGGCGCATCGTGACCGGCGATTCCGGTCTATCGTGACCGATTTTGCAGGGTTTCCGGAATCAGTGGTCACGATAGCGGAATCATCGGTCACGATAGCGGAATGGTGTCGTACCGCATGGAAATGGTGTTACGCATAGAGCAACCGAACGAGTACGCTTCCAGCCTTTTGTCTGGAGACAGCGTGCCCGTATCAAGGATCACCATGCGTAAAATTAAAGACGTATTGCGTTTGAAACTGGACGCCAGGCTGTCGCACCAGCAGATCGCCGCTGCGCTGGGCATATCGAAGGGAGTCGTCACCAAGTATGTCGGTCTGGCCGCCGCCGCAGGCCTGGATTGGGCTGCCGTGCAAGACATTGACGAAACCACGTTGGGGCGGCGCCTGCTGGTTACCCCCGAGCGACCGCGCGATCATGTTCAGCCGGACTACGGCCGTTTGCATCAAGAGCTGCGGCGCAAAGGCATGACATTGATGTTGCTCTGGGAAGAGTACCGAGCCGACCACGCCGACCGGCAGACCTATGCTTACTCGCAGTTCTGCGACAACTACCGGCGCTTCGCCAGGCAACTCAAGCGCTCCATGCGCCAGGTTCACCGTGCCGGCGAGAAGCTGTTCATTGATTTCGCCGGCCCCACCATCGCGCTGACCGACGGCAGTCGCGCGCACATCTTCGTCGCGGCACTGGGCGCTTCCAGCTATACCTTTGCCTGCGCCACGCCGCGCGAGACCATGACCGACTGGCTGAAATCGACAGCGCGCGCGTTAAGCTTCATCGGCGGCATGCCCCAGATGATCGTGCCCGACAACCCGAAGGCGCTGATTGCGGACGCCAACCGTTACGAGCCGCGCAGCAACGATACCGTGCTCGATTTCGCGCGCCACTATGGGACGTCGGTGTTGCCAGCACGACCCTACCACCCGCAGGACAAAGCCAAAGCAGAATCGGCGGTACAGATCGTCGAACGCTGGATCATGGCGCGCCTGCGCCACCAGCAATTTGCCAGCGTAGATGATGTCAATCAGGCCATCGCACCGCTGCTTGCCAGGCTCAACGAGAAGCCATTCCAGAAGCTGCCCGGCAGTCGCGCCAGTGCATTTGCCGAAATCGGCGCACCCGCCTTGGCTCCGTTGCCGCTGCAAGCTTATGAGATGGCACACTTCAAGACGGTCAAGGTTCACATCGACTATCACGTAGAAGTCGAACGACACCGCTACAGCGTGCCGCATTCATTGGTCGGACAAGTACTTGAAGCACGGATCACAGTGGCAGTGGTCGAGATCCTGCATCGCGGTAACCGCGTGGCCAGCCATGCCCGCAGCAGTCTGGCCGGTGGCTTTACCACCACCGCCGCGCACATGCCGGCGGCGCATCGCGCCCAGATGGAATGGTCGCCACAACGGCTGATCCACTGGGGCCAAAGCATTGGCCCTGCCGCCGCCGAAG
TGGTGACACGGCTACTGAACAAGTACAAGCATCCCGAACATGGCTACCGCGCCTGCCTTGGGCTGCTGTCGCTGGTCAAGCGTTATGGCAAACCCAGACTGGAGGCGGCCTGTACGCTGGCTTTGCAGATCGGCGTCTGCCAGTACCGCCATGTGCGCGACATCCTGAAGAATAACCGCGACGCAGCCGCGCCGCTCAGCACTGAAGAATGGGTCAGCCCCAACCATGTCCACGTGCGCGGTCCTGGCTACTACCAATAAGGAAAGACAACATGATGATGCATACCACGCTGACGCAATTGCGCAGCCTGAAACTGGATGGCCTGGCGACGGGGCTGGAAGAACAACTGGCACAGCCCGGTATGGCTGCACTCAGCTTCGAAGAACGCGTAGCACTGTTGGTGGACCGGGAAGTCCATGCCCGTAATGACCGCAAACTGGCGCGCCTGCTCAAGAACGCTCGCCTGAAATACGGGCAGGCGGCCATCGAGGATATCGACAGCCGCGCAGGACGCGGTATCGACCGGCGCGAGGTGATGAGCCTGGCTTTGGGCGACTGGGTCAACGCCGGCCACAGCATCCTGATTACAGGACCGACCGGCGCCGGTAAATCCTGGCTGGCCTGCGCATTGGCACAATACGTCTGCCGCCGTGGTTACTCAGCCATCTATCAGCGCGTACCCCGCATGCAGGAAGAACTGCGCATCCGGCACGGCAGCGGCACCTTCGGCAAATGGCTGCTGCAACTGGCCAAGACCGACGTATTGGTTCTCGATGACTGGGGCATGGGCGCTATCGACAGCATGACCCGTTCCGACTTGCTGGAGATCATCGACGACCGTGCCGCCAACAAGGCCACCATCATCACCAGTCAGTTGCCGGTGGAGCACTGGCACGCCTGGATAGGCGATGCCACCATCGCCGACGCCATCCTCGACCGCATCATGCAGCGCAACCACCGCTTCACGCTGACCGGCGAGTCGCTGCGAACAGAACAATCAAAAACAAGCAAAAAGGAGGAAAAAACCACCCCATCGTGA
Protein sequences of DBSCAN-SWA_6 >NZ_AP021884|2044160:2053121|2049458_2049965_-|WP_147074830.1|DBSCAN-SWA MSTQTPSFSTPPSVAAQIARLPEMPMAEIRALWQKLVGGDTPTHNRQFLERRIAYRLQELEFRKADANLLDRNQRRIASLVETGKVKKRDRDYRPAAGTVLVREYKGVEYRVIATADGQYDFQGRMYPSLSMIAREITGMRWSGPLFFGLKPPSNAKAKPSTKKRGGR >NZ_AP021884|2044160:2053121|2047657_2048110_-|WP_147074828.1|DBSCAN-SWA MSDIRIQKTGEPDILQTSDGRLTLSVPIQIKRRSGRKLVTLPNGETAPVRPWDMAPTSIQLALARGHRWLAMLESGEAKSLKEIATREGIDNSYVSRMVNLTTLAPDIVAAILDDALPNHITLFDLAVDPPALWDEQRARIADSTPAGGH >NZ_AP021884|2044160:2053121|2046659_2047664_+|WP_147074827.1|DBSCAN-SWA MPESFLHLKPQEQSQIYRALAPQLARTPVVLEKDVWVCWVLQTLFTMPDRLPMAFKGGTSLSKVFGAIARFSEDVDITLDYRGLDGSFDPFAEGVSRNRLKKFSEDLKSFVRGHAHGVVAPHFQKMLADEFDADAFQLEVSDDGEQMRVHYPSVLEAPGDYVGNSVLIEFGGRNITEPNEEREVRPDIAEHVAELDFPRSTVSVLSPTRTFWEKATLIHVECQRDEFRTGAERLSRHWYDLAMLADLAHGQAAVADRALLADVVKHKKVFYNASYANYDACLSGQLRLIPEDAALAALRDDFQRMIGAGMFIGEPPAFDAIVDRLRALETTINQ >NZ_AP021884|2044160:2053121|2050818_2052342_+|WP_147074832.1|transposase|DBSCAN-SWA MPVSRITMRKIKDVLRLKLDARLSHQQIAAALGISKGVVTKYVGLAAAAGLDWAAVQDIDETTLGRRLLVTPERPRDHVQPDYGRLHQELRRKGMTLMLLWEEYRADHADRQTYAYSQFCDNYRRFARQLKRSMRQVHRAGEKLFIDFAGPTIALTDGSRAHIFVAALGASSYTFACATPRETMTDWLKSTARALSFIGGMPQMIVPDNPKALIADANRYEPRSNDTVLDFARHYGTSVLPARPYHPQDKAKAESAVQIVERWIMARLRHQQFASVDDVNQAIAPLLARLNEKPFQKLPGSRASAFAEIGAPALAPLPLQAYEMAHFKTVKVHIDYHVEVERHRYSVPHSLVGQVLEARITVAVVEILHRGNRVASHARSSLAGGFTTTAAHMPAAHRAQMEWSPQRLIHWGQSIGPAAAEVVTRLLNKYKHPEHGYRACLGLLSLVKRYGKPRLEAACTLALQIGVCQYRHVRDILKNNRDAAAPLSTEEWVSPNHVHVRGPGYYQ >NZ_AP021884|2044160:2053121|2048106_2049462_-|WP_147074829.1|DBSCAN-SWA 
MSEVLKRRMRCAVYTRKSTDEGLDQEYNSIDAQRDAGHAYIASQRAEGWIPVADDYDDPAFSGGNMERPALQRMMADIEAGKIDVVVIYKIDRLTRSLADFSKMVEVFERYGVSFVSVTQQFNTTTSMGRLMLNILLSFAQFEREVTGERIRDKIAASKRKGMWMGGVPPLGYDVENRRLVPNERESKLIRHIFQRFVELGSSTALVKELKLDGVTSKAWTTQDGKTRDGRPIDKGHIYKLLSNRTYLGELRHKDQWYQAEHPPIVSRELWDSVHAILETNGRVRGNKTRAKVPYLLKGIVFGNDGRALSPWHTTKKNGRRYRYYVPQRDAKEHAGASGLPRLPAAELESAVLDQLRAILHAPNLLGNMLPQAIKLDPTLDEAKITVAMTRLDAIWDQLFPAEQTRIVKLLVEKVIVSPNDLEVRLRTNGIERLVLELRPEPVEQTEEALA >NZ_AP021884|2044160:2053121|2050160_2050604_+|WP_147074831.1|DBSCAN-SWA MTLSVARNKRYEARKLLSNDVDPAMLKQVTKRASRVSAENSFEAIAREWYAKFSGEWVPSHGEKIIRRLERDLFPWIGKRPIAEITAPELLAVLRRIENRGALDTAHRAHQNCGQVFRYAIATGRAERDPSPDLRGALPPAGYSGSS >NZ_AP021884|2044160:2053121|2046046_2046667_+|WP_147074826.1|DBSCAN-SWA MNTTTKTAELIRERIEAMPIGEPFTPTAFLECGTRASVDQTLSRLVKAGLIERVTRGVFVRPEVSRFVGKVSPSPLKVAETVAKTTGAVVQVHGAEAARRLELTTQVPTQSVFVTSGPSKRIRVGKMEIRLQHVCQRKLALAGRPAGLALAAMWYLGKKEVTPALVEKIRRKLGSSEFEVLKSATSSMPAWMSDAIFRNERMAAHA >NZ_AP021884|2044160:2053121|2044160_2045429_-|WP_147073207.1|DBSCAN-SWA MLVEFRVKNFRSLRDEQVLSLVASKDKTLQDTHTQATGISAAPTLVRSAVVYGANASGKSNLIKALQYMRGVVTESATAIQPGQTFAVQPFRLDVDSASQPSEFEVTFLLDGVRYQYGFSMTAQRIVSEHLLVYKAFKPQRWFTRRFDTETGKDVYDFGPGLKGPKNLWEGATRPNALFLSMAVQLNSEALRPVFDWFVNRLVIFNEQAQLSPQVSIQMLKQDDGRKEICNFLSAADISIADIDVETRKVPGQAVHFDLVAGKTEVRSEEMEEHKLRFHHVTEQGKAVFELMDESNGTRNLLFLAGPVLDILRKGLTLVIDELDTSLHTLLVRELVRLFHRPEINTGGAQLIFTTHDTSLLDAPDLFRRDQVWFVEKDRDQTSALVSLSEFSPRKNEALERGYLMGRYGGVPFLSHTLGLKH >NZ_AP021884|2044160:2053121|2052353_2053121_+|WP_147074833.1|DBSCAN-SWA MMMHTTLTQLRSLKLDGLATGLEEQLAQPGMAALSFEERVALLVDREVHARNDRKLARLLKNARLKYGQAAIEDIDSRAGRGIDRREVMSLALGDWVNAGHSILITGPTGAGKSWLACALAQYVCRRGYSAIYQRVPRMQEELRIRHGSGTFGKWLLQLAKTDVLVLDDWGMGAIDSMTRSDLLEIIDDRAANKATIITSQLPVEHWHAWIGDATIADAILDRIMQRNHRFTLTGESLRTEQSKTSKKEEKTTPS |
9 | Acidithiobacillus_phage(66.67%) | transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those annotated with a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_7 |
2294456 : 2305315
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_AP021884|2294456:2305315|DBSCAN-SWA TCTAACTGCGCTGGTAAATGTCCTCAAAGCGCACAATATCATCCTCCCCCAGATACGCGCCCGACTGCACCTCGATCATGTGCAATGCAATCTTGCCCGCATTTTCCAGCCGGTGCGTACTGCCCAATGGAATGTAGGTGGACTGATTCTCGGTGAGCAGCAGCACTTCGTCATTACGCGTAACGCGTGCCGTGCCGCTCACCACTATCCAGTGCTCGGCACGGTGGTGATGCATTTGCAACGACAGTTTCTCGCCCGGTTTGACCATGATGCGCTTGACCTGAAAACGCTCCCCCGCATCAATACCTTCGTACCAACCCCACGGGCGAAACACACGGGTATGGTTGAGATGCTCGGTACGACGGTGTTGTTTCAGGTGCTCGACCACTTTTTTGACGTCCTGCACGCGGTCTTTGTGCGCCACCATCACGGCATCGCTGGTTTCCACGATCACCAGGTCGGATACGCCAATTACCGCGACCATGCGGCTCTCGGCACGGATCAGATTGTTGCTGGCACCGTCGTTGTAGATGTCACCGCGCATGACATTGCCAGCACGATCCTTGGCACCGATTTCCCACAGTGCCGACCACGAGCCGATATCGCTCCAGCCAATGTCGGCGGGCACCACCACGGCGTTACGGGTGCGTTCCATGACGGCGTAATCGATGGATTCGGAGGGGCAAGCCGTAAATGCCTGGGTATCCAGACGGACAAAGTCCAGATCACGCTGGCTGCTGTCCAGTGCCGCCTGGCTGGCGGCGAGAATATCCGGGCGGTAGCCGCGCAATTCATTGACAAACGCCGCAGCCTTGAACAGAAACATGCCGCTGTTCCAGAAATAATCGCCCGATTGCAGATAGCCCTGGGCGATTTCACGGCTGGGTTTTTCCACAAAACGCCCTACTGCAAACACACCGTCCAAATGGCTGTCCGCCGGGCCGCGCTGGATATAGCCATAGCCGGTTTCAGGCGCCTGTGGCACGATGCCGAAGGTCACCAGCTGACTAGCCTGGGCGGCATCCACGGCCTGGGCCACGGCGCGCTCAAATGCGTCCACATCTGCTATCAAGTGATCGGCCGGCAACAGCAGCATCAGGGCCTCGGCATCCCGCGCCATGAGTGCCAGCGCTGCGACTGCCGCTGCCGGTGCGGTATTGCGCCCGATGGGCTCGAGAAAAATAGTCTCGGGCGTGACCTCAATCGCGCGCATCTGCTCGGCCACCATGAAGCGGTGCTCATGGTTGCATACCAGCGTGGGCGGCGTGATGTCGGCGATACCGGAAAGGCGCAGCACGGTTTCCTGCAGCATGGTGCGCTCCGACACCAGCGGCAACAACTGCTTGGGCAACGCGGCGCGGGACAAGGGCCACAGGCGGGTACCGGACCCCCCGGAGAGAATGACGGGATGAATGCGCATGTTGGTTGTTTCCTCAATCAATTCAGTGAATTCATCAGCCTGCCCAAGCTGGGGAAAGGCATCCATGCGATACAGTCCGGCGCGCTGCATCGAGAGCAAGACATCTACGCTACGCGAACCAATGAAAACCGGCTGGCCAGTCAGCCAAACACATATCATGTCTCCTTTTCGGGCTTGTCCAAACACAGGCCCAACGCCGCATCCCAATCCGGCAATAGCAAACCAAAGCTGCGCAGCAGCTTGTCCCCGGCCAGCACGGAATTAGCCGGGCGCCTGGCCGGGGTGGGGTAATCAGTGCTTGGGATCGGGATCAGCTCAGGACGTTTGGCGCCCGCCTGGGTGGTTTGGTCGAGGATGGCCTGGGCGAATTGATACCAGCTCGCCCGGCCCCGGCTGCTGAGGTGGTAAGTGCCGTGCAGGGCTTCTTTACCCTGGCCAAACGGCTGTTGCGCCAGTATCTGCGCCGTGGCCTCGGCGATCATGCGTGACCAGGTGGGTGCGCC
GTATTGATCGGCAACAATCCGCAACTGCTCGCGCTCCTTGAACAGGCGCTGCATGGTGAGCAGGAAATTGCCAGCGCGCAGGCCATACACCCAGCTGGTGCGCAGAATGAGGTGGGGAATGGCGGCAGCACGGATGGCGTTCTCACCTTCGAGTTTGGTCTGCCCATAGACGCCTAGCGGGTGGGTGGTGTCGTCCTCAGTGTAGGCACCGGGCTTGTTGCCATCGAACACATAATCGGTAGAGTAGTGAATCATCGCCGCGCCGAGTTTTGCCACCTCTTCGGCCATGATGGCGGGAGCAATGGCATTCACCGCACGTGCCAGTTCCGGCTCGGATTCCGCCTTGTCCACGGCGGTATGGGCGGCCGGGTTGACGATCAGGTTCGGGCGCAAGCTACGTATGAGGCTGCGGATGGCACCCGGGTCGGTCAGGTCAAGCTGGCTGCGGGTGGGCGCGCTCACCTTGCCCAAAGTCGCCAATGTGCGGCGCAACTCCCAGCCGACCTGGCCATTGACACCGGTGAGCAGGATATTCACGCAAACACCTCGCACTGCGCCAGCGGCAGGCCGGCGGCGTCCTTGGCGGCAAGTGCGGGTTCGCCGTGCAGTGGCCAGGTGATGCCCAGCGCCGGATCATTCCACAGCAGGCTGCGCTCGAACTGCGGCGCCCAGTAGTCGGTGGTTTTGTAGAGAAACTCTGCGCTGTCGGAGATCACCAGGAAACCATGGGCGAAGCCTTTGGGTATCCAGGCCATGCGTTTGTTTTCGGCGGACAGTTCCATCCCCACCCATTTGCCAAACGTGGGTGAGGATTTGCGCAAGTCCACCGCCACGTCATACACGCTACCACTAATGACGCGTACCAGCTTTCCCTGAGTATTTTGAATCTGGTAATGCAAGCCACGTAATACGCCCTTGGCTGAGCGCGAGTGATTGTCCTGCACGAAGTCGTCGGGGATACCGGCCCCGGTCATGGCGCGACGGTTGTAGCTCTCGTAGAAAAAGCCGCGCGCGTCGCCGAATACTTTGGGTTCGAGCACAAGCACGTCGGGGATTTCAGTGGGGATGATGTTCATGGTTAAAACAACCGTTCGTTGAGCATGGCCAATAAGTATTGGCCATAGGCATTTTTCTGCAAAGGCGCTGCCAGCCGTCCGACCTGGGCGGCATCAATATAGCCCTGGCGGTAGGCGATTTCTTCCGGACAGGAAATTTTGAGACCCTGGCGCTTTTCAATGGTCTGGATGAATAGCGATGCCTCCAGCAGGGATTCATGGGTGCCGGTGTCCAGCCAGGCATGGCCGCGCCCCATGACTTCCACCTGCAATTGCCCCATTTCGAGGTAATGCCGGTTAATGTCGGTGATTTCCAGTTCGCCGCGCGGAGATGGCTTGAGCTTGCGGGCGATATCGATGACCTGGTTATCGTAAAAATACAGGCCAGTGACGGCGTAGCGCGATTTGGGCTGGGCGGGTTTCTCTTCCAGGCTGATGGCGTTGCCTTGAGCGTCGAATTCAACCACGCCATAGCGTTCCGGGTCATGCACCGGGTAGGCAAACACCGATGCGCCAAACTGGCGGGCAGCGGCGGCGCGCAGGCCGCCGGAAAATTCATGACCGTAAAAGATGTTGTCGCCCAGTATGAGAGCGCTCGACGCATCGCCAATGAAATCCGCGCCAATGACAAAGGCTTGCGCCAGACCGTCGGGCGAGGGCTGAACAGCATAGCTGAGGCGAATCCCCCACTGGCTGCCATCACCCAGCAACTGCTCGAAACGCGGGGTATCCTGCGGGGTGGAAATGATGAGGATGTCCCGGATTCCTGCCAGCATCAGCGTGGTAAGCGGGTAGTAGATCATGGGCTTGTCGTACACCGGCAGCAATTGCTTGGAGACTGCCTGGGTCACGGGATATAGCCGGGTACCCGAGCCACCGGCCAGAATAATGCCCCTGCGCGCGCTCATGCCTGGCTCCCATACTGTTTTTCTACCCAGTGGCGGT
ATTCACCGGTGGCGATGTTGGCTACCCAGTCCGGGTTGGCCAGGTACCAGGCAACGGTTTTGCGAATCCCGGTCTCAAACGTCTCTTGTGGGCGCCAGCCCAGTTCGCGCTCGATTTTGTGTGCGTCAATGGCGTAGCGACGATCGTGGCCAGCGCGGTCCTTGACATGAGTGATGAGTTTTTCGTGAGGGGTGACGGGTGAACCCGGGTGCAGCGCGTCAAGCATGGCGCAGATGGTCCTGACCACATCGATATTGGTTTTTTCGTTGCAACCGCCAATGTTGTATACCTCGCCTGCCTTGCCCGCAGCCAGCACGGCGCGGATGGCGCTGCAATGGTCGCCGACATAGAGCCAGTCACGTACGTTAAGTCCGTCGCCGTAGATCGGCAGCGGCTTGCCGGCCACGGCGTTCATCATTACCAGCGGGATGAGCTTTTCCGGGAACTGGTAAGGGCCGTAGTTGTTGGAGCAGTTGGTAGTGAGTACCGGCAGGCCATAGGTGTGGTGATAGGCGCGCACCAGGTGGTCGGAGGCAGCCTTGGAGGCTGAGTACGGGCTGTTGGGTGCGTAGGCAGTGGTCTCGGTAAACGCGGCATCATCCGGGCCGAGGGAACCGTACACCTCATCGGTGGAAACGTGCAGGAAGCGGAATCCGGCCTTCTCTATCCCTTCCATTGCATTCCAGTAAGCGCGGATTTCTTCCAGAAAATGAAAAGTGCCGACGACATTGGTCTGGATAAAGTCTTCGGGGCCGTGAATGGAACGATCGACGTGGCTTTCGGCAGCGAAGTTGATGACGGCGCGCGGGCGATGTTCAGCCAAAAGACCTGCGACCAGGGCGCGGTCGCCAATATCGCCTTGTACGAAGAGATGGCGGGCGTCACCCTCAATACTGGCGAGGTTGTGCAGATTGCCTGCGTAGGTGAGTTTGTCGAGGTTGATGACCGGCTCGTCACTGTCCGCCAGCCAGTCCAATACGAAATTGGCACCGATAAAACCGGCGCCGCCTGTGATTAGAATCATGGTAATTCCCGGTTAGCCTTTATTTTATTTCCGGTAACCCGGCGCTGCTGTTCTCGGACTTGCTCTGTGCCATTTTTCCCAGGCTTGCAAACTCGCCCACGTATTCAATTTTTGCTGCATCCCGTAGTTTTTTGATGTCAGCGGTAGCGGCATCGCGCTGTCGGGCAGCAGCCAGATAACGCTCTATTAACGTATGTGCCTTGTCCAGGGTAATCGGTGCAGATTGTATGGAGGCTATCTGCAATACCGTTATCCCGTTTGCAGACGGTAACGCCAGTAACTGTCCATTCTGCATGGTCTGCATGCGCGGCACAATATTCATCGGCAATTGTTCCGCCGCTTTGACCTCGCTGCCGGTGCGGAACGGTATATTCTGGCTGCGCAGCCAGTCCACAAATTCGCCCAGATCATGGCTGGTTTCAAGACGTGAGTTGAGTACAGCGATCCTGCCCCTGGGGGCAGCGATGGCCAGTTCCTGCAGATTATAGATACGACGCTGACTGAACAACTCGGGATGCCGGTTGTAGTAGGCGCTGATGTCGGCTTCGCCAGGTTTGGCGACAGCCTGCCCTGCACGCTGCAGATAGGCCTGGGACAGCACCTGATTGCGCGCGGCCTCCAGCAATTGCTGCACAGCAGGATCCTGATCCAGTTTTTGCGCAGTGGCTTTTTGTACCAGAAGTTGCTGATCCACCAGTGCCTGCACTACCCGATTTTTAGCTTGCGGCGTCAAATCCTGCGCTGCCAGATTCAAGCGCGACAAAGCCAGGTCCAGTTGTGCGCTAGTGATAGCGGTGCCGTTCACGCTGGCGATGGCTGAAGACGGCGTTTCCTGCTTGCTGCAGCCGGCGAGCACTGTTCCCAGCATCAGCGCCAGCGCCATTTTATGCATAATTGGATGCGATTTCATTCGCGTCATCTCCCTGGATTGGTATGTTTTGTGGCGTGATTCACACCCGGCGTCCATGATGCGCAA
TTATCGGTTGTCTCACAAGCGCCAGCATATAGACCGATCGAATTACAAAATGGTAATGATCATAAAAAATGGCTGGCTCCCTCCGCGGAAGCCAGCCACTACGCTACCTGATGCAGGCAATATCTAGACAGCCACGTTTTCCTCGTCGAATGCCAGTTTGATTGTGCCGGATTCGTCCACGTCCACGACCACATGCCCACCATTGGCCAGACGGCCAAACAGCAATTCATCTGCCAGTGCCCGCCGTATCTCGTCCTGGATCAGCCTGGCCATGGGGCGCGCGCCCATCAACGGGTCAAAACCGCGTTTGCCGAGATGCGCCTTGAGCGCGTCGGTAAACGTCGCCTCGACCTTTTTCTCGTGCAACTGGTCTTCCAGCTGCATCAGGAATTTATCCACCACGCGCAGGATGACCTCCTGGGACAAGGGTGCAAACGAGATCATCGCATCCAGCCGGTTACGGAACTCCGGCGTAAAGGCACGCTTGATGTCTGCCATTTCGTCGCCGGTTTGTTTTTCCTGGGTAAAGCCAATCCCCGACTTGTTGAGCGACTCCGCACCCGCGTTGGTGGTCATCACAATCACCACGTTACGGAAATCCGCCTTGCGCCCATTGTTGTCGGTAAGCGTGCCGTGATCCATCACCTGCAACAGTACGTTGAATACGTCCGGATGCGCTTTTTCGATTTCATCCAGCAACAGCACCGCGTAGGGATGTTTGGTAATCGCCTCGGTCAACAGGCCGCCCTGGTCAAACCCGACATAGCCCGGTGGGGCGCCTATCAACCGCGACACGGCATGGCGTTCCATGTATTCGGACATATCGAAGCGAATCAGCTCAATGCCCATGATATACGCGAGCTGCCGCGCCACTTCGGTCTTGCCCACCCCGGTGGGGCCAGAAAACAGGAAAGAGCCGATAGGCTTCTGCGGATTACCCAAGCCACTGCGTGCCATCTTGATCGCGGCTGCCAGCGCATTGATCGCCTTATCCTGGCCAAACACCACGGTCTTGAGGTCGCGATCGAGGTTTTTCAGCGCATCGCGATCATCACTATTCACATTCTTGGACGGAATCCGCGCAATCTTGGCGATGATCTCTTCGATCTCGCGCTTGCTGATGACTTTCTTCTGCCGTGATTTGGGCAGGATACGCTGCGCCGCGCCGGCTTCGTCGATCACGTCGATTGCCTTGTCCGGCAGGTGGCGGTCATTGATATAGCGTGCCGACAGCTCGGCTGCCGTGGTGAGCGCCGACGCGGTGTATTTGATGCCGTGATGCGCTTCAAAACGCGATTTCAAGCCACGCAGAATCTCGACTGTCTCCTCGATCGAAGGCTCATTCACATCAATCTTCTGAAACCGCCGCGATAACGCATGGTCTTTTTCGAAAATGCCACGATACTCGTTGTAAGTGGTTGCACCGATGCATTTGAGTTGCCCGGATGAAAGCGCCGGTTTAAGCAGGTTGGAAGCATCCAGGGTGCCGCCCGAAGCCGCACCTGCACCAATCAGCGTGTGAATTTCGTCTATGAACAAAATTGCCTGCGGGTTTTCGTATAGCTGCTTCAACACGGCCTTGAGGCGCTGCTCAAAATCACCACGGTATTTGGTGCCTGCCAACAGGGCTCCCATGTCCAGCGAATACACCGTGCTGTCCGATAGAATATCCGGTACCACACCCTCAACGATACGCCGCGCCAGACCTTCAGCGATGGCCGTTTTGCCGACTCCGGCTTCACCCACCAGCAGCGGGTTGTTCTTGCGCCGCCGGCACAATGTCTGGATGACACGCTCCAGTTCCAGCGCACGGCCAATAAGCGGGTCTATTTTTCCCGCCAGCGCCTGCACATTCAGGTTCTGCGTATAGGTTTCCAGCGCCGTTGCGGGTGCAGCTTCCTCACCCGCCTCAGGGGTCGCCTCCGGGCGCGCAGTGCTGCCCTGCGGGACTTTGCTCACCCCATGGGAAATGAAATTCACCACGTCCAGACGCGATACAC
CCTGCTGATTCAGGAAATACACCGCGTGGGAATCCTTCTCGCCAAAAATGGCCACCAGCACGTTCGCGCCGGTCACTTCTTTTTTGCCTGACGACTGCACATGCAAAATGGCGCGTTGAATCACGCGCTGAAATCCGAGCGTAGGCTGGGTATCGACTTCCTCGCTTCCTGCAACGGTAGGGGTGTGTTCGGTGATGAAATCAGCCAGTCCACGACGCAGTTCGTCGGTATTGGTGCCGCACGCGCGCAACACCTCGGCGGCGGACGGATTATCCAGCATCGCCAGCAACAGGTGCTCGACCGTAATAAACTCATGGCGCTTTTGTCTCGCCTCCATGAACGCCATATGTAAACTAACTTCCAATTCCTGCGCAATCATCTAATTTTCCTCCATCACGCATTGCAGCGGATGCTGGTGCTGGCGGGCGAACCCGACCACTTGCTCTACCTTGGTTGCCGCCACATCGCGGGGGAATACGCCACACACTCCCATGCCGTCCCTATGTACTTTGAGCATGATTTGCGTAGCCTGTTCACGGCTCTTGTAAAAAAAGTTCTGAATCACAAGAACCACAAAATCCATGGGCGTGTAGTCGTCATTCAACAACATTACCTTGTACAAAGGCGGCGGCTTGAGTTTTGTTTCGCTTGCTTCCAGAACGGTGTCATCGCGGTGCTTGGTTGCCATGGCGCTGGATAGTTTCCGGATAGTCAGGAACCATTTTGACGACTGGCGCGAAATTTTCAAGTGCTATCCAGTAAAAAAAAATTTGCCTGGCACTTGCCAAATCAACTAGGCAGGCGTAAAAAGCAATCTGGAGTTTGGCGTCAAGGTTCCTGCAAGTCTGGTAATGCCGTTGTGCGATCTTGATTTTCAAACAGCCCCTGGCCGTTTTGGCCTGTTTTTATCAAGGAAGTAGCAATGGCAACTGGCACTGTAAAGTGGTTCAACGATTCTAAAGGCTTTGGGTTTATTACCCCGGACGACGGTAGTGAAGATCTTTTCGCTCACTTCTCCGCCATCAACATGGGTGGTTTCAAAACCCTGAAGGAAGGTCAAAAAGTCCAATTCGAGGTCTCCCAGGGCCCGAAAGGCAAACAGGCTTCGAACATTCAGCCTGCATAAATCGGCTCACCGATTACTTGAAAACGCGGAACCTGGTTCCGCGTTTTTTTCGCCTCGGGTTTTGAGTTCTGGCTTTTCAAAAGCGCTGTGCTACATTGAAATATCTGTAACACATCCATTTAAGGAGAGCAGAATATGCACATTCAACACCAGCCTGATGGTTCCCTGGTCCTGGACATGAGCCAGAAACAGGCGCGAGAACTCGCAAAAACCGTCATCCAGCACGCCGAAGATGCGCATACCGCACTGCTGGATTTTGCCTACCTGCTGAACGAAGCGCATTACGATGCGGAGAACCAGTTCCGGCAACCACCTCATGCCTGGGAACCGGGTGCGCATCAGCCTGGTACAGAATAGGGGGCTACCATGAACATTTCTGCACTCGACAAACAGACTGCCCAGATCAGTGTGTTGCCGACCGAGGCCGCGCATTTGCTGGAGGGCCTCGAAGCCATGCGCGACGAACTCGGTGAAATCGCCGACGAGTTAATCAGTCTGCTGCGCGGCAGTGGCATTGAACCACCACCCAAACCCGATCATGTTCGCACTGAATACGCCGGGCCTGAGTAAACTTACATGCGCGCGATCATCGCCTCGCCAAATGCTGAACAGGACACCTGCGTCGCACCGTCCATAAGGCGAGCAAAATCGTAGGTTACCGTTTTAGCAGCAATTGCACGCTGCATACTCGCGGTGATGATATCAGCGGCTTCCAGCCAGCCAAGGTGGCGCAGCATCATCTCCGCGGAAAGAATAATCGAGCCCGGGTTGACGTAATCCTGGCCCGCATATTTTGGTGCAGTACCATGCGTGGCCTCAAACATGGCGACTGAATCGGACAGATTGGCGCCCGGCGCAATACCAATGCCGCCCA
CCTCAGCCGCGAGCGCGTCCGAGATGTAATCGCCGTTCAGATTAAGCGTCGCAATCACGTCGTACTCATCCGGGCGCAACAGTATCTGTTGCAAGAATGCATCGGCAATCACATCCTTGATGACAATGCCGTTGGGCAGCCTGCACCACGGGCCGCCATCCATCTCCACCGCGCCAAATTCACGCCTGGCCAGTTCATAGCCCCATTTTTTGAAGCCTCCCTCGGTGAACTTCATGATATTGCCCTTGTGTACCAAGGTAACGGACTCGCGGCCATTGTCGATGGCATACTGAATCGCCTTGCGGATCAGGCGCTCGCTGCCCTGCACGGAAACCGGTTTGATACCAATGGCGGAAGTTTCCGGGAAGCGAATTTTCTTCACCCCCATTTCGCCTTGCAGGAAGGCGATGATCTTTTTCACCTCATCCGAGCCAGCTTGCCACTCCACCCCGGCGTAAATATCCTCGGTATTTTCGCGGAAGATCACCATATCCACTTTTTCCGGCGCTTTCACCGGACTGGGCACGCCATCGAAGTAACGCACCGGGCGCAGGCAGACATACAAATCCAGCAACTGGCGCAACGCCACATTCAGGGAGCGCATGCCACCGGAAGTCGGCGTGGTCAACGGCCCCTTGATGGAGACGACGTATTCGCGCACGGCGGCTACCGTTTCATCGGGTAACCAGTTGTCGCCACCATAGACCTTGACGGCCTTTTCGCCCGCATACACTTCCATCCAGGCGATACTGCGCCTGCCACCATATGCCTTGGCCACGGCCGCATCCACCACGCGACGCATCACCGGGGTGATATCCACACCGGTACCATCACCTTCGATGAAGGGAATAACCGGCTGATCGGGGACATTGAGCGAAGCGTCAGTGTTGATCGTGATTTTTTCGCCGTGAGTCGGCAGCTGTATATGCTGGTACAT
Protein sequences of DBSCAN-SWA_7 >NZ_AP021884|2294456:2305315|2303639_2303861_+|WP_147074449.1|DBSCAN-SWA MHIQHQPDGSLVLDMSQKQARELAKTVIQHAEDAHTALLDFAYLLNEAHYDAENQFRQPPHAWEPGAHQPGTE >NZ_AP021884|2294456:2305315|2299421_2300312_-|WP_161984264.1|DBSCAN-SWA MKSHPIMHKMALALMLGTVLAGCSKQETPSSAIASVNGTAITSAQLDLALSRLNLAAQDLTPQAKNRVVQALVDQQLLVQKATAQKLDQDPAVQQLLEAARNQVLSQAYLQRAGQAVAKPGEADISAYYNRHPELFSQRRIYNLQELAIAAPRGRIAVLNSRLETSHDLGEFVDWLRSQNIPFRTGSEVKAAEQLPMNIVPRMQTMQNGQLLALPSANGITVLQIASIQSAPITLDKAHTLIERYLAAARQRDAATADIKKLRDAAKIEYVGEFASLGKMAQSKSENSSAGLPEIK >NZ_AP021884|2294456:2305315|2297456_2298341_-|WP_147074453.1|DBSCAN-SWA MSARRGIILAGGSGTRLYPVTQAVSKQLLPVYDKPMIYYPLTTLMLAGIRDILIISTPQDTPRFEQLLGDGSQWGIRLSYAVQPSPDGLAQAFVIGADFIGDASSALILGDNIFYGHEFSGGLRAAAARQFGASVFAYPVHDPERYGVVEFDAQGNAISLEEKPAQPKSRYAVTGLYFYDNQVIDIARKLKPSPRGELEITDINRHYLEMGQLQVEVMGRGHAWLDTGTHESLLEASLFIQTIEKRQGLKISCPEEIAYRQGYIDAAQVGRLAAPLQKNAYGQYLLAMLNERLF >NZ_AP021884|2294456:2305315|2303870_2304074_+|WP_147074448.1|DBSCAN-SWA MNISALDKQTAQISVLPTEAAHLLEGLEAMRDELGEIADELISLLRGSGIEPPPKPDHVRTEYAGPE >NZ_AP021884|2294456:2305315|2296911_2297454_-|WP_147074454.1|DBSCAN-SWA MNIIPTEIPDVLVLEPKVFGDARGFFYESYNRRAMTGAGIPDDFVQDNHSRSAKGVLRGLHYQIQNTQGKLVRVISGSVYDVAVDLRKSSPTFGKWVGMELSAENKRMAWIPKGFAHGFLVISDSAEFLYKTTDYWAPQFERSLLWNDPALGITWPLHGEPALAAKDAAGLPLAQCEVFA >NZ_AP021884|2294456:2305315|2303300_2303504_+|WP_124705778.1|DBSCAN-SWA MATGTVKWFNDSKGFGFITPDDGSEDLFAHFSAINMGGFKTLKEGQKVQFEVSQGPKGKQASNIQPA >NZ_AP021884|2294456:2305315|2304076_2305315_-|WP_147074447.1|DBSCAN-SWA MYQHIQLPTHGEKITINTDASLNVPDQPVIPFIEGDGTGVDITPVMRRVVDAAVAKAYGGRRSIAWMEVYAGEKAVKVYGGDNWLPDETVAAVREYVVSIKGPLTTPTSGGMRSLNVALRQLLDLYVCLRPVRYFDGVPSPVKAPEKVDMVIFRENTEDIYAGVEWQAGSDEVKKIIAFLQGEMGVKKIRFPETSAIGIKPVSVQGSERLIRKAIQYAIDNGRESVTLVHKGNIMKFTEGGFKKWGYELARREFGAVEMDGGPWCRLPNGIVIKDVIADAFLQQILLRPDEYDVIATLNLNGDYISDALAAEVGGIGIAPGANLSDSVAMFEATHGTAPKYAGQDYVNPGSIILSAEMMLRHLGWLEAADIITASMQRAIAAKTVTYDFARLMDGATQVSCSAFGEAMIARM 
>NZ_AP021884|2294456:2305315|2298337_2299402_-|WP_147074452.1|DBSCAN-SWA MILITGGAGFIGANFVLDWLADSDEPVINLDKLTYAGNLHNLASIEGDARHLFVQGDIGDRALVAGLLAEHRPRAVINFAAESHVDRSIHGPEDFIQTNVVGTFHFLEEIRAYWNAMEGIEKAGFRFLHVSTDEVYGSLGPDDAAFTETTAYAPNSPYSASKAASDHLVRAYHHTYGLPVLTTNCSNNYGPYQFPEKLIPLVMMNAVAGKPLPIYGDGLNVRDWLYVGDHCSAIRAVLAAGKAGEVYNIGGCNEKTNIDVVRTICAMLDALHPGSPVTPHEKLITHVKDRAGHDRRYAIDAHKIERELGWRPQETFETGIRKTVAWYLANPDWVANIATGEYRHWVEKQYGSQA >NZ_AP021884|2294456:2305315|2294456_2295875_-|WP_147074479.1|DBSCAN-SWA MRIHPVILSGGSGTRLWPLSRAALPKQLLPLVSERTMLQETVLRLSGIADITPPTLVCNHEHRFMVAEQMRAIEVTPETIFLEPIGRNTAPAAAVAALALMARDAEALMLLLPADHLIADVDAFERAVAQAVDAAQASQLVTFGIVPQAPETGYGYIQRGPADSHLDGVFAVGRFVEKPSREIAQGYLQSGDYFWNSGMFLFKAAAFVNELRGYRPDILAASQAALDSSQRDLDFVRLDTQAFTACPSESIDYAVMERTRNAVVVPADIGWSDIGSWSALWEIGAKDRAGNVMRGDIYNDGASNNLIRAESRMVAVIGVSDLVIVETSDAVMVAHKDRVQDVKKVVEHLKQHRRTEHLNHTRVFRPWGWYEGIDAGERFQVKRIMVKPGEKLSLQMHHHRAEHWIVVSGTARVTRNDEVLLLTENQSTYIPLGSTHRLENAGKIALHMIEVQSGAYLGEDDIVRFEDIYQRS >NZ_AP021884|2294456:2305315|2300501_2302757_-|WP_147074450.1|protease|DBSCAN-SWA MIAQELEVSLHMAFMEARQKRHEFITVEHLLLAMLDNPSAAEVLRACGTNTDELRRGLADFITEHTPTVAGSEEVDTQPTLGFQRVIQRAILHVQSSGKKEVTGANVLVAIFGEKDSHAVYFLNQQGVSRLDVVNFISHGVSKVPQGSTARPEATPEAGEEAAPATALETYTQNLNVQALAGKIDPLIGRALELERVIQTLCRRRKNNPLLVGEAGVGKTAIAEGLARRIVEGVVPDILSDSTVYSLDMGALLAGTKYRGDFEQRLKAVLKQLYENPQAILFIDEIHTLIGAGAASGGTLDASNLLKPALSSGQLKCIGATTYNEYRGIFEKDHALSRRFQKIDVNEPSIEETVEILRGLKSRFEAHHGIKYTASALTTAAELSARYINDRHLPDKAIDVIDEAGAAQRILPKSRQKKVISKREIEEIIAKIARIPSKNVNSDDRDALKNLDRDLKTVVFGQDKAINALAAAIKMARSGLGNPQKPIGSFLFSGPTGVGKTEVARQLAYIMGIELIRFDMSEYMERHAVSRLIGAPPGYVGFDQGGLLTEAITKHPYAVLLLDEIEKAHPDVFNVLLQVMDHGTLTDNNGRKADFRNVVIVMTTNAGAESLNKSGIGFTQEKQTGDEMADIKRAFTPEFRNRLDAMISFAPLSQEVILRVVDKFLMQLEDQLHEKKVEATFTDALKAHLGKRGFDPLMGARPMARLIQDEIRRALADELLFGRLANGGHVVVDVDESGTIKLAFDEENVAV >NZ_AP021884|2294456:2305315|2296030_2296915_-|WP_147074455.1|DBSCAN-SWA 
MNILLTGVNGQVGWELRRTLATLGKVSAPTRSQLDLTDPGAIRSLIRSLRPNLIVNPAAHTAVDKAESEPELARAVNAIAPAIMAEEVAKLGAAMIHYSTDYVFDGNKPGAYTEDDTTHPLGVYGQTKLEGENAIRAAAIPHLILRTSWVYGLRAGNFLLTMQRLFKEREQLRIVADQYGAPTWSRMIAEATAQILAQQPFGQGKEALHGTYHLSSRGRASWYQFAQAILDQTTQAGAKRPELIPIPSTDYPTPARRPANSVLAGDKLLRSFGLLLPDWDAALGLCLDKPEKET >NZ_AP021884|2294456:2305315|2302757_2303066_-|WP_124705779.1|protease|DBSCAN-SWA MATKHRDDTVLEASETKLKPPPLYKVMLLNDDYTPMDFVVLVIQNFFYKSREQATQIMLKVHRDGMGVCGVFPRDVAATKVEQVVGFARQHQHPLQCVMEEN |
12 | Escherichia_phage(33.33%) | protease | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those annotated with a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
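The keyword flags described in the legend also appear in the protein FASTA headers listed for each prophage region: flagged entries carry an extra field (for example `transposase`, `protease`, `portal` or `capsid`) just before the trailing `DBSCAN-SWA`. The following is a minimal Python sketch for recovering those flags. It assumes the five-field (unflagged) and six-field (flagged) header layout inferred from the records on this page. The names `ProphageProtein`, `parse_header` and `flagged_keywords` are hypothetical helpers for illustration, not part of DBSCAN-SWA.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass
class ProphageProtein:
    contig: str
    region_start: int   # prophage region coordinates, e.g. 2044160:2053121
    region_end: int
    gene_start: int     # gene coordinates, e.g. 2050818_2052342_+
    gene_end: int
    strand: str
    accession: str
    keyword: Optional[str]  # phage-related keyword, if the entry is flagged


def parse_header(header: str) -> ProphageProtein:
    """Parse one FASTA header of the form seen on this page.

    Assumed layout (inferred, not documented here):
    >contig|start:end|gstart_gend_strand|accession[|keyword]|DBSCAN-SWA
    """
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected header layout: {header!r}")
    contig, region, gene, accession = fields[:4]
    keyword = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProphageProtein(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand, accession, keyword,
    )


def flagged_keywords(headers: Iterable[str]) -> List[str]:
    """Return the sorted set of keywords found in a list of headers."""
    found = {parse_header(h).keyword for h in headers}
    return sorted(k for k in found if k is not None)


# Two headers copied from the DBSCAN-SWA_6 protein listing.
example_headers = [
    ">NZ_AP021884|2044160:2053121|2050818_2052342_+|WP_147074832.1|transposase|DBSCAN-SWA",
    ">NZ_AP021884|2044160:2053121|2049458_2049965_-|WP_147074830.1|DBSCAN-SWA",
]
example_keywords = flagged_keywords(example_headers)
```

Applied to all headers of a region, `flagged_keywords` should reproduce the keyword column of that region's summary row (for example `transposase` for DBSCAN-SWA_6), provided the header layout assumed above holds.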
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|